Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5, CDS, polypeptide, utr3
GCCGTGGAGTATGGATTATAAAATGTGTCATAAATATGGGTTAAACCACGCTTTTTCTCATCTTCGACTTTATCAAAATATTTTGACTTCGGAAGGAATAAAATATTATTATTTTTTTAATCATATTCCAGCTGTCGTTCAACTACGTGGCCACGTCGGGACCTATTTTGGTAATTTCCTCCAAATTCGCGAACTCCAAAATATCAAACCCGCCCTTTATTTCCCACTCCTTGAAATTTGTAAAAAGAATTAAAAAAAAAAGAAAAAAGAAAAAAGGAAAAGAAACCCTAGAAGCTGAAGCGGAAACGCCCCCTCCCTGTTTCTAAAATTTTCTCTTCGAACCTAGAAAGAATCTGACCGTCGATCTCTTTATCCAACGGCTCCCGTCAGTCGGCGACCGCTTTTTCCCCTTCTTTCCACCCCCACTTTCAGTGGTTTTTATCATAAACCCTAGCCGTCTTCTCTCTCTCTTCTGCTCTGTCAGTTTGCATGAAACTCTCTCACTCGCAGCGATTCTCGATCATTCTTTCGGATCCAATCTCTGAAGTTTCTTTTTGGGTTCTCCATCGTTCTCTTTTTCTTACATCTTTTTTCGGGTTTTTCATGGCCTCCGTAAATGGATCTTTGAGGATTTTGTTTTTCCTTTTTTCTTGTGTTTTCGTCTAGATTATTGCGGCGAAAGTGTGCGATTCGCTGATGGTTTTTGTCTCTCTTACTGCTGCATGTCCCTGTTTGTTTCTGTTCTTTGATTTTAAGTTTTCCCCCTCGAATTCTTTGGCGCCAAGTTTCCGTTTCGAGTTCGGCTTAGATTGCGTTTGAAGATTCCGGGTCGTCGTGTAGACAGACACTGGTTTCTCACTTCCATCATTCCTGATTATGGATGCGTGTTGAGTTCTGGCGGTTCTACTCCAAGGCCGCCGGGTCCGATACCGGTCCGAGTTAGGTGCTGAGTCAATTCGGAGAGCGACTCGGTGAGTCGGGGACAGAAGGTCGAGTGTCGAGTGGTGCCAATTCTTCATAACCGTACGGCAATGCCGTCGGTGGGGATGAGAAGGACGAGAGTTTTCGGTGTGGTAAAGGGTGTAGATGGAGCTAGAGTTTTGAGGTCTGGAAGGCGTCTATGGCCCGAATCGGGTGAGGTGAAGCTTAAGAAATCCAAAGATGCCAGTGACTGGTACCCTGTTATTGAGAACAGAGGAAATGGGGGTGGAAGTGGCCAGGTTAGGCTCCATGGTAAGTGGACTCAAGTTCGTAATGTCAACCCGAAGCGGGTTGTCGTCGTTGACATTCATGAGGAGGATGATACTTGCGTAGCTGATGTGCCTAAACCTGTAAAAGTTGTTGCTAGGAATGGTGGTGATGGTGAGTCTGGGGTTGTGGATAGGATGTTTGGGAAAGTGTACAGAAGAAAGAGGAAGAGGGGTCTTTCAGAGAATCGCGTTATTTTCGATGAAACGGAGGGAGAGAATGGTTTGGCAGGAGATAGAATGTTTGGACTCCGTTTCATTAGAAGACAGAGAACGAGGAAGACTAACGATGGAAATTGGGAGCCTACTGCAGGTGGTCGTTTTAGAGTGCACTTTTCTAGGCAAAGTACTTTGCAGCTGACCCGGGATCGGGTTCTAACTATTTTTTCTGGGAGTAGTCATGATGGTGGCTGTTTCTCAGATTTTATGCTCTCGATTCTTAGACACCTGAAGAGTCCACTGCTGAGGGTGGCTAAGCTTTCTGAATTTTTGTTATCCGATCCAATCTGTGGAGTTTTTGCTTCAAAGGGAATTCGGTTCTTGCAGGTATTACCTTCTTACTGCCGTTTGTGTTCTGATGCAATTGTTGGGCTTTTTTTTCTTTTTTCTTTTGATGAAATGTTAGGCTTCAATGTAATTGGGCAGCAATGAAGTTTTATATCATAAGGTTATCTTCTGGTGATTAAGTTATCTTGTTTCTTGATCCGAACAAATTCCGTCCTTTTTTGTTCCTGGGACCTTTTTTTTTTTGTC
TGTCTAAGCATTTCTCTGTTTTTATCTTACCCTCTGTCTTTCAGGGTTATCCTCCTACTGGAAGCTCTGGCATTTGTGTGATTTTTGGGGCCACGCAGTCGACGCCAATGTTCCATTTAGATTTTTCCGCCATTCCTCTCTATTTTATGTACTTGCATTCAAGGATGTTTCTTAGAGCAACTTGGATTCAGGCTCGTCTTGTATATAATAACAAGCAGTTAGATGTAGATATGAGTAGTGATAGTGAAGAAGACAGCATTGAAGAGCAACATGTTCCCAGTCCTCCTGGAAGAAGTTCTTTGGAGTGCAAAACCGTGACATTTGCAGTAGATCACCCTAAGAGCCGACCCATTTTACATCCATCTGTTAGAGCTTCAAGGTTAGGTGGTCGGAACATGCAATACAGAAACAGTTTCAGTTCTCGTGGTGTGCGTAAAAGGAGGAGTTCGCTGAGGATGAGGAGGCCTAGAAACTCTTCTCTTGCTGCTATGCAAAAAGCATCAGAATCAGTTCATGATACCAAACGCAGTGTATCCTTTTCTTCTGCAGCATCTTATAACAAGAACAAGTACTTAGCCCTGAGAGATTCTGCTGGGCGCATCAGAGAAGGGAGTTCTACTGCATTGAGATCAGCAATGGTTGTTGACCCATCATGCTGCAATGCAAATATATTAATTGTGGAATCTGATAAATGTTTGAGAGAAGAGGGAGCCAATATTGTGTTAGAGTTTTCTGCATCGTGTGAATGGCTTCTAGTGGTCAAGAAAGATGGTTCAGCTAGATACACCCACAAAGCAGAAAAAGTTATGAAGCCCTCTTCTTGCAATCGTTTTACGCATGCAATAATTTGGTCCACGGATAGTGGTTGGAAGCTAGAGTTTCCAAATCGAAGGGATTGGTTTATTTTCAAGGATCTCTACAAGAAATGTTCTGATCGCAATATCCCATGTTCTACTGCTAAAGCTATTCCTGTGCCCAGAGTGTGTGAAGTTCCAGGTTATGTGGATAGTAGTTGTGCTTCTTTTCAAAGGCCAGATATGTACATCTCTATACATGATGATGAGGTATTTAGAGCCTTGGCAAAGACAACCGCAAACTATGACATGGACTCTGACGACGAGGAATGGTTAAGCAAGTCTAACGACGAGCTTATTGTGACAGACAAGCATCAAGAATGTATTTCAGTGGATAACTTTGAGTTGATGATCGATGCCTTTGAGAAGGGACTTTATTGTAATCCTGATGCCTTCCCTGATGAGAAAGCTCCTGCTGAAATATGCACGCATCTTGGTAACCGTCAAATGGTTGAGTCTGTATTTACTTATTGGATGAAGAAACGAAGACAGAAGAAATCATCCTTGGTTAGGGTTTTCCAGGTACTATTTTTTGTTCTTCCTTAAAAACTTTATTTGGGTGTGAGTCAACTCTTAAGACTTGCATGTTTTTTGCATTCCAAGAGGATACTTACCTTTTTTATATTCAAACTATTTTTTAGTGTTTACTGAAGAAGCATAATATTTAGCTCGACCTAAGTTAAGTTGTTGAGGGTGGCAGTTCATGTGAACCATTTGGATGATCTTTTTCTTCCAGGCTCATCAGGCTAAGAGGAAACCTCTGGTGGTCCCTAAACCTATCATGCGAAGAAGGAGATCATTCAAAAGAAAGCCTAGCCAAATTGGGAGAGCTACACACTCAAATCTTTTGGAAGGTATAACACTTTGGTAGCAAGTTTTTCTTTAAAAGAAAATGATAGATGGTTGTATGTACCTATGAAATAGCTCAATAAGTGACATTTGATACATGATAGGTTATAGATTTGAATCTCCACTAGTTGTGATGTGATGTAAAATTCTGAGTTTTGTGCTGGTTGGAGTGTACAATCTCTAACGATTCTAAGTCGGGGTTGTGCAGCCATAGTTTTGAGGCGAGATGCTGAGGAGGACCAGAATGCCATGGCAAAATACCAAGAAGCGAAGGCAACGGCGGAGAAAGCCTTTGAA
TGTGCTGTGGGGAAGCGGCAGAGGGCACAGTTGCTGTTGGAGAATGCAGATTTGGCAGCTTACAAAGCCATGATGGCACTGAGGATTGCCGAAGCAATTCAAGCATCAGAAATATTGCCGGATGCTGAAGCTGAATCTGAAGCTGCTGCTGCTGCATCTTTTTTTCTCGAGTGATTTTGTTTTCGGGAAGTGGCGGGATTCATTCACGGCTGTTTTCTTTTGAGAGTCCCCAATTGTTTTAATTATTATTGTATAGTATTTCATTAATGTGTACATTCTATTTTGTATGATTAGAATCTCTCACACCCATAATGTACAATGAGACGTCCTAATTCTTCCTAGGTTCAAAAATATACGACTTAAGTTAGTATATCAATATGCTTCCCTACTCCCTGTATTACCTTTTTTCATCCAAATCAAATCATAAATGATAAAATCAAGCATAATAACAAGAATTTGTGAGTAAAAAGAACTTCATTATTCCATGTCTGCTGTTAGCTTTGTAGAAAGAGCGTTTTACAACTCCAAAATTTCCTTTTTGCTCTTTATGGTACTTAACTTCCTTACATTGAATTCTACATCATCCAAATCCTCTGAAATTCCCAAGTATCCATAAAGCATTCTCTCTTCAGTCTTGTCTCCCTCCATTTCTGCTCCCCCATTTCTGCACTTTGCCACCTCGAAATTCACTCTTCCATGTCTCCTTTCAGAAAAGAATTTATGGAGTCTTTCTGCATCTGTCAAACCAGAAAACATGGCCAGGAACTTCAACACCATCACACTTTGATCTGCAGGACTCCCCAAAGTTATTTTCACTCTCCCCTTCAGCAAATTCTTACCTGTGAACAACAAAACGATTCGTACACCGCATGCAGGTTACTATAGCTAGAAACATTAGACATGCCATGAGATATTAGTGAACATTCGTCAGATCCTATTAAACATGATAATGTGAGAATAAAACGTCACTTACTTCTTAAGAAAGTCTCGAGTGCTTCGATCGTTACAACTCTCCACCTATCGGGGTTGCTGTGCGACAGAGAGATGTTGCGGATGATAATAACTGGAGGCCAGATGATAAGATCTTCCTTCTGAACCACAGCTTCTTCCTTGGGCAACACCTCGGGAACCCATGTTACAGTGTCTTGGGGCATGGCACTATTCCACCCCATCAAAACGCATATGGCTTTGGCAAGACCTAAATGCTGAGCCCTCAACCCAGTCCTGTGGGACATGTAAGCATGCTTTACCAGGCGTTTAGTATCCAAAAATTCCTTTGAGGAGCTGCAATGGCAGGGATAGGCAATATAAAGCTAGAATTAGCGATTGAAAGTTAATAAAAACAACATTTCTCCTCGCGAACAGCACGTCTACAAAAGGTGCTAGAATTAAAACAAGACAATAAGCAAACAGCAACTGTGAATACCAATTCAAAGAACTATAAAAATCACCATCATAAAGAAAAAAAAATCATAATCACAAATTTAGCAAATTAACATTTAGAAGTCCCAGGAATCTCAAACTTCAAGAAGTAAACCCCGTGACTTCATATTCATAATAACAGAAATCCTCGACCTTTTGACATTTACCCATGGAGCCGATACTGGAGATCACAAGTTGTTCTTGCAACACTTGGTAGTCAACAAACTTCTGAGAAAATAAATACCAAGGGATGTGATCGATCTTAAGCTTTTGACCTATTTAGCTACATCCATGGACGCTTAACCACCATGTCTCTATAAAATATTGAGGCTTCAGTTTCTGGCATTATAATCATAGCTTTAGATTCACAACACTATCATGAAAGCTGGTAAAACTTTATTAACCTCGTTAGCTTTTCGAAGACCCCGCAGGGGATTTGTGAGTTCTTTTCATTTTTTGGACAAAAAACTGCTTCCATTGCAAAAATGAAAAGTACACAAGTGGGGTCAAAACAGAAGCCCTCTAGCCAAAAGTAACAAAATACTCTAATATAGGAAACAGAGGACGGACCACAGCAA
GAAGTTCTCCTAATTATCCTCCCACCATTCAATACCATCAAATCTCTTCTTCAATGGTTCATCTTCATTTTGTACCGTCCATACACTTTAAAGTCCATACACTACAGGGAAATCTATTCATTAGATGACGTCTTATACTGGGTAACTTCTAAACCAATCGTTCAAATGGTCAATTCTACTTTTAAAGTTCCATCCTTGTTGACTAAAAGAAAAGTTGAACCATTGGAAAATAACTACCGAGTAATCCAATCATGCTGTTAATTAAGGATGGTCACTTTACAGCACAATTTTACCAATCTTCACTGACATACGTGATACTACTATATCAAATTAAATGCATGTCAATGAATGACCATAATATGTAAAGAAGCTACTGAAAAGCAGCAATGGAGATCATATGCGTCGCTGAAGGTCCAAGCCAATATATCTTCTCCTAAATTCTGCAACATGAGCTAGAAAATTAATAACATTTCTGAACGCGAAGTTCTGGCACAGTGATAAATATCAAAGAAAACCCAAATCTTCACGTGAAACTAAAGCTTGCTAAGGTGGAAAACTGGCCATAAATCTACTCGTGAGATTGGTTATATCATGAGACAAAAAGCAAACCAAACCCTCTAGACCGGATCTCTCTACAATGAGCTATACTGCTCTGATAATTCTAAAATGGTTGAGCATGGACAATGGGTAGTCCACCCACCAAGGTACCTTCCAGACCCCCAGTGATATCCTCTGATTATCTTTATAATGTCTGTCCAAAGAAATTCTCTACTATCCTGAAGTCCTCCACATTGACATGCACTTCATAGAATATGTTCACTAGTATAAAAGTGCACGAAGAAACTTGTAAAGCTTAATCAGAACACCCATATCTTTCTCCAGTAGTTTGCCGTTATTAACATAATTGTGCATTTCACAGGATACATTCACTACAATTGGAAAAGTGACCAACGTAAATACAATCTCAAAGCTTACTCCGAACAACTCCCAGTTTCTGAACTCCATTAAACTAGGTATATGGCCAGTCATCATTCTCAAACGACATCATTGAAGCCACTATCATGACTCCAGAGATTCACACCTTGCTGCAGTGAAAGGGAACCCTCTAAAGCTCCAAGATTATGTACCTCCCTCCATCCCCCCCTTCCCAACCCAAAAAGATAATAAATAAAATAATATGATGGTCCATACAAAATAATGGAAGCACGGCGACTATACAATGGGAATCTTATAATCCAAAGAAACCTACGGAAGAAAACCTAGAAGGTTTCTTATAATCTTACTTGAATACGTTGCACAATAGCAATTAGGAAACCCCTGAGTCAATTTATGAAACGAAGTCTCCACGATTAGGAAATAGGGAAGAAGACAGACCTTATGCCACATACGATGCAATATAAACTACCAGCATTTCCTTGCTCCTTGTACTTTTTCCTGACAGTAGGCTTCATATTCAGCTTTTTGGAGCACTTCAGAAATGCCTCATGAACCATTTGTTTGAACTCTTCAGAATCCTCAGGTGGTTCAGACTCTGTAGGTTTTATGAAATCATCTGACAGGCCATCGTCGTTTACATGAACATAATCGTTATCTTCATCAGTATTTTTCCAAATTTTAAGCGGTCTATGAAGACTATTTCTGAGATCAACATTTTGTGACTTGATCCAGCTTGAAGGACCACGTCTTCTGTTGCCCTTCATATATTTAGGACCATTTTTGAATTTCTGTGCATGGTCTAAATACCTTTCACATGAAAATGAATCATCAGAAGCATGGAAATTATGACGATCGTATTTCTTATTCGACTTCCTGTATTTGCCATGGTCAAATCCTGCTTTCCTGGATGTATATCTGTGACTGATATCATCATCCATCCACGCTTCACCACTATCATACATATCCACAACCTTGTCGGGTGTGTATTCACATGGCATACTTATTCTATGATCAAGTAAATTCATATCTTCCTCGAAATAATTTCTTTTTGGCATCTGATCCCT
CAACTTATAAGCGGAAACACCCTCTGTCATTCTCTGCACTCTTTCAGAATTTCTATAAGTTTCTCCATCACACTTAGACATGGAAGATCTATGCAACCTCTCTTTCAAATAATGTGAACCTACTTCTCTTTCAGAGCCATAATCTGATCTTTCATGCAATTTTTTACTATACTCAGAAGGGTTGATGTCCGAAACTTCATAATCTTGAGTTACTCTCCTTCCCACTTCAACATGGATGGATCCAGTACTTGCATATTCACAGTCATGGTTTATTGTTTTTGAGGTTAGTCCTATATCAGGGTAATCTAAAACTCTCTGTTCTCTAAGACTATAACGGTTAAGTGCTATTCCAGTACTATGATTTGCAAACAAGTTCTCAGGATTTTGTAATTTCAGCGAACTGGAGTTTGCAACTTTTGGTCTTGAAAAGTCTCCAAGATCAGACACAATTGCTCTACCATAATCAATATAATCATGTGTTTGATTAATTTTCTGCATCACTCGAGATGGATCTTCATTTGAATCATTAACTGTTCCCTCAGGTTTAGAATAAAAATAGCTCATATGCTCCCTCATAGGACTATTAGTACTTTGTTGATGAGGAGTAAAATTCCTTTCTCCACCAACAAGATCAATGGCTGAATCTACAAGGGTCCTTTTCCCATATGACTTAAAGTCTAAACGGTCCATAAGGTCTCCATGGCTCCTTGTTGGAAAATCATCCGAAACCTGAAAGTACTGGCCATCCTGATAAGATTCCAAAGACTTGCTTGTAAATGGCCCTGACGAAGATCTTTCATATTCTTTGGAATAAAATCCCGAGCTTGCCAAATATCCAATATTCCTTGAACTAAACCTACGGCTTTCTTCAACTTCATGGGACTCTGTGACTTGTGATTTATCTGAAACTACATGACTTCGAAATTTAAGTCTTTCATCGTCAAGACTTCGCACATCCAAACTCTGAGAAGATGGTAAGAAGTCTTTGTAAATTGACGTGGAACCCATATTCAAAGATGAACTATATGATCCCATTGCCGTGCTCTCTTCCATAGCCAAAAGTTTTTGATCCGTCATCCTTTGTCCACTGCCAACCGACCATCTCCCTTCAATAGTTTCTTTATCTTTCCTAATTCTCAAATCATCATGCCTATACCGAAGGTCATGCTCATAATCTAAAACTTCATCACCAGCATGAAATTTCCTTGGCTCAGGTACCACACGCGTCTGCTGCAAATCCGAAAGCTGCCTGTGATCATTCTGATGGTACAGTTCCTCAAAGTTAGGTTTTTTCCTCGCCTGGCCATAAGAATGTGACCTTGAGTCGACGTTATTGTTCCTTCCAGTTCTGAGATGCCAATCCTCGTCCCTCCTTTCAATGGTGTCGACCCTCTGACCCAAACCCACTTCGCGTCGGGAAGCACCAACTCTGTGAGGGCTCAAACTCCTCCTTAGACGCGGAGACCGATCCAGTGCCTCTCGACGGGTCTTCCCATATCGATCATGATCCAGATGAAGCCGATCCTGAGCGTGTAGTTTCATACTTTCCGACTCTCTAACATAGTAATCATCACGCCTTCGACATTGCATCTTTAACTAATTAGTTGAAATTATTACTAACAAAGTCTGGAATAATACCCTAAAATCCTCGAATCGATTCGCACTAAATCGTCCCAATAAACAGGCAGAAACTTTCAATCCCCATGACCTAAAAGTAAAACACCCAGTCGGAATATCGAACCAGAATATGGTATGGATTGCAATAAAAATCCAACGAAAATCCGAGACAAGGATTATCAAACTGGGAAATTTTTCAAACCTTCTGGGTCTCTGATCCTCCTTCGCTTAAGCCTGCCGGATGACCGGATTCTTGCATCCGTTACCGAAATTCACAAATCACGAGCAGAGAGTTTGTTCGCAACGGAGTTTTGTTTAAGTTCCGGCTTCTGAGAGAAAGTATAGAGCCGAAATCCGCACTGCAGACGACAAGAGCGAGAGAA
ACCGATTATTTATAGCGAATGCGAATGTTCGTCAAATTGTTTTTGGG
mRNA sequence
GCCGTGGAGTATGGATTATAAAATGTGTCATAAATATGGGTTAAACCACGCTTTTTCTCATCTTCGACTTTATCAAAATATTTTGACTTCGGAAGGAATAAAATATTATTATTTTTTTAATCATATTCCAGCTGTCGTTCAACTACGTGGCCACGTCGGGACCTATTTTGGTAATTTCCTCCAAATTCGCGAACTCCAAAATATCAAACCCGCCCTTTATTTCCCACTCCTTGAAATTTGTAAAAAGAATTAAAAAAAAAAGAAAAAAGAAAAAAGGAAAAGAAACCCTAGAAGCTGAAGCGGAAACGCCCCCTCCCTGTTTCTAAAATTTTCTCTTCGAACCTAGAAAGAATCTGACCGTCGATCTCTTTATCCAACGGCTCCCGTCAGTCGGCGACCGCTTTTTCCCCTTCTTTCCACCCCCACTTTCAGTGGTTTTTATCATAAACCCTAGCCGTCTTCTCTCTCTCTTCTGCTCTGTCAGTTTGCATGAAACTCTCTCACTCGCAGCGATTCTCGATCATTCTTTCGGATCCAATCTCTGAAGTTTCTTTTTGGGTTCTCCATCGTTCTCTTTTTCTTACATCTTTTTTCGGGTTTTTCATGGCCTCCGTAAATGGATCTTTGAGGATTTTGTTTTTCCTTTTTTCTTGTGTTTTCGTCTAGATTATTGCGGCGAAAGTGTGCGATTCGCTGATGGTTTTTGTCTCTCTTACTGCTGCATGTCCCTGTTTGTTTCTGTTCTTTGATTTTAAGTTTTCCCCCTCGAATTCTTTGGCGCCAAGTTTCCGTTTCGAGTTCGGCTTAGATTGCGTTTGAAGATTCCGGGTCGTCGTGTAGACAGACACTGGTTTCTCACTTCCATCATTCCTGATTATGGATGCGTGTTGAGTTCTGGCGGTTCTACTCCAAGGCCGCCGGGTCCGATACCGGTCCGAGTTAGGTGCTGAGTCAATTCGGAGAGCGACTCGGTGAGTCGGGGACAGAAGGTCGAGTGTCGAGTGGTGCCAATTCTTCATAACCGTACGGCAATGCCGTCGGTGGGGATGAGAAGGACGAGAGTTTTCGGTGTGGTAAAGGGTGTAGATGGAGCTAGAGTTTTGAGGTCTGGAAGGCGTCTATGGCCCGAATCGGGTGAGGTGAAGCTTAAGAAATCCAAAGATGCCAGTGACTGGTACCCTGTTATTGAGAACAGAGGAAATGGGGGTGGAAGTGGCCAGGTTAGGCTCCATGGTAAGTGGACTCAAGTTCGTAATGTCAACCCGAAGCGGGTTGTCGTCGTTGACATTCATGAGGAGGATGATACTTGCGTAGCTGATGTGCCTAAACCTGTAAAAGTTGTTGCTAGGAATGGTGGTGATGGTGAGTCTGGGGTTGTGGATAGGATGTTTGGGAAAGTGTACAGAAGAAAGAGGAAGAGGGGTCTTTCAGAGAATCGCGTTATTTTCGATGAAACGGAGGGAGAGAATGGTTTGGCAGGAGATAGAATGTTTGGACTCCGTTTCATTAGAAGACAGAGAACGAGGAAGACTAACGATGGAAATTGGGAGCCTACTGCAGGTGGTCGTTTTAGAGTGCACTTTTCTAGGCAAAGTACTTTGCAGCTGACCCGGGATCGGGTTCTAACTATTTTTTCTGGGAGTAGTCATGATGGTGGCTGTTTCTCAGATTTTATGCTCTCGATTCTTAGACACCTGAAGAGTCCACTGCTGAGGGTGGCTAAGCTTTCTGAATTTTTGTTATCCGATCCAATCTGTGGAGTTTTTGCTTCAAAGGGAATTCGGTTCTTGCAGGGTTATCCTCCTACTGGAAGCTCTGGCATTTGTGTGATTTTTGGGGCCACGCAGTCGACGCCAATGTTCCATTTAGATTTTTCCGCCATTCCTCTCTATTTTATGTACTTGCATTCAAGGATGTTTCTTAGAGCAACTTGGATTCAGGCTCGTCTTGTATATAATAACAAGCAGTTAGATGTAGATATGAGTAGTGATAGTGAAGAA
GACAGCATTGAAGAGCAACATGTTCCCAGTCCTCCTGGAAGAAGTTCTTTGGAGTGCAAAACCGTGACATTTGCAGTAGATCACCCTAAGAGCCGACCCATTTTACATCCATCTGTTAGAGCTTCAAGGTTAGGTGGTCGGAACATGCAATACAGAAACAGTTTCAGTTCTCGTGGTGTGCGTAAAAGGAGGAGTTCGCTGAGGATGAGGAGGCCTAGAAACTCTTCTCTTGCTGCTATGCAAAAAGCATCAGAATCAGTTCATGATACCAAACGCAGTGTATCCTTTTCTTCTGCAGCATCTTATAACAAGAACAAGTACTTAGCCCTGAGAGATTCTGCTGGGCGCATCAGAGAAGGGAGTTCTACTGCATTGAGATCAGCAATGGTTGTTGACCCATCATGCTGCAATGCAAATATATTAATTGTGGAATCTGATAAATGTTTGAGAGAAGAGGGAGCCAATATTGTGTTAGAGTTTTCTGCATCGTGTGAATGGCTTCTAGTGGTCAAGAAAGATGGTTCAGCTAGATACACCCACAAAGCAGAAAAAGTTATGAAGCCCTCTTCTTGCAATCGTTTTACGCATGCAATAATTTGGTCCACGGATAGTGGTTGGAAGCTAGAGTTTCCAAATCGAAGGGATTGGTTTATTTTCAAGGATCTCTACAAGAAATGTTCTGATCGCAATATCCCATGTTCTACTGCTAAAGCTATTCCTGTGCCCAGAGTGTGTGAAGTTCCAGGTTATGTGGATAGTAGTTGTGCTTCTTTTCAAAGGCCAGATATGTACATCTCTATACATGATGATGAGGTATTTAGAGCCTTGGCAAAGACAACCGCAAACTATGACATGGACTCTGACGACGAGGAATGGTTAAGCAAGTCTAACGACGAGCTTATTGTGACAGACAAGCATCAAGAATGTATTTCAGTGGATAACTTTGAGTTGATGATCGATGCCTTTGAGAAGGGACTTTATTGTAATCCTGATGCCTTCCCTGATGAGAAAGCTCCTGCTGAAATATGCACGCATCTTGGTAACCGTCAAATGGTTGAGTCTGTATTTACTTATTGGATGAAGAAACGAAGACAGAAGAAATCATCCTTGGTTAGGGTTTTCCAGGCTCATCAGGCTAAGAGGAAACCTCTGGTGGTCCCTAAACCTATCATGCGAAGAAGGAGATCATTCAAAAGAAAGCCTAGCCAAATTGGGAGAGCTACACACTCAAATCTTTTGGAAGCCATAGTTTTGAGGCGAGATGCTGAGGAGGACCAGAATGCCATGGCAAAATACCAAGAAGCGAAGGCAACGGCGGAGAAAGCCTTTGAATGTGCTGTGGGGAAGCGGCAGAGGGCACAGTTGCTGTTGGAGAATGCAGATTTGGCAGCTTACAAAGCCATGATGGCACTGAGGATTGCCGAAGCAATTCAAGCATCAGAAATATTGCCGGATGCTGAAGCTGAATCTGAAGCTGCTGCTGCTGCATCTTTTTTTCTCGAGTGATTTTGTTTTCGGGAAGTGGCGGGATTCATTCACGGCTGTTTTCTTTTGAGAGTCCCCAATTGTTTTAATTATTATTGTATAGTATTTCATTAATGTGTACATTCTATTTTGTATGATTAGAATCTCTCACACCCATAATGTACAATGAGACGTCCTAATTCTTCCTAGGTTCAAAAATATACGACTTAAGTTAGTATATCAATATGCTTCCCTACTCCCTGTATTACCTTTTTTCATCCAAATCAAATCATAAATGATAAAATCAAGCATAATAACAAGAATTTGTGAGTAAAAAGAACTTCATTATTCCATGTCTGCTGTTAGCTTTGTAGAAAGAGCGTTTTACAACTCCAAAATTTCCTTTTTGCTCTTTATGGTACTTAACTTCCTTACATTGAATTCTACATCATCCAAATCCTCTGAAATTCCCAAGTATCCATAAAGCATTCTCTCTTCAGTCTTGTCTCCCTCCATTTCTGCTCCCCCATTTCTGC
ACTTTGCCACCTCGAAATTCACTCTTCCATGTCTCCTTTCAGAAAAGAATTTATGGAGTCTTTCTGCATCTGTCAAACCAGAAAACATGGCCAGGAACTTCAACACCATCACACTTTGATCTGCAGGACTCCCCAAAGTTATTTTCACTCTCCCCTTCAGCAAATTCTTACCTGTGAACAACAAAACGATTCGTACACCGCATGCAGGTTACTATAGCTAGAAACATTAGACATGCCATGAGATATTAGTGAACATTCGTCAGATCCTATTAAACATGATAATGTGAGAATAAAACGTCACTTACTTCTTAAGAAAGTCTCGAGTGCTTCGATCGTTACAACTCTCCACCTATCGGGGTTGCTGTGCGACAGAGAGATGTTGCGGATGATAATAACTGGAGGCCAGATGATAAGATCTTCCTTCTGAACCACAGCTTCTTCCTTGGGCAACACCTCGGGAACCCATGTTACAGTGTCTTGGGGCATGGCACTATTCCACCCCATCAAAACGCATATGGCTTTGGCAAGACCTAAATGCTGAGCCCTCAACCCAGTCCTGTGGGACATGTAAGCATGCTTTACCAGGCGTTTAGTATCCAAAAATTCCTTTGAGGAGCTGCAATGGCAGGGATAGGCAATATAAAGCTAGAATTAGCGATTGAAAGTTAATAAAAACAACATTTCTCCTCGCGAACAGCACGTCTACAAAAGGTGCTAGAATTAAAACAAGACAATAAGCAAACAGCAACTGTGAATACCAATTCAAAGAACTATAAAAATCACCATCATAAAGAAAAAAAAATCATAATCACAAATTTAGCAAATTAACATTTAGAAGTCCCAGGAATCTCAAACTTCAAGAAGTAAACCCCGTGACTTCATATTCATAATAACAGAAATCCTCGACCTTTTGACATTTACCCATGGAGCCGATACTGGAGATCACAAGTTGTTCTTGCAACACTTGGTAGTCAACAAACTTCTGAGAAAATAAATACCAAGGGATGTGATCGATCTTAAGCTTTTGACCTATTTAGCTACATCCATGGACGCTTAACCACCATGTCTCTATAAAATATTGAGGCTTCAGTTTCTGGCATTATAATCATAGCTTTAGATTCACAACACTATCATGAAAGCTGGTAAAACTTTATTAACCTCGTTAGCTTTTCGAAGACCCCGCAGGGGATTTGTGAGTTCTTTTCATTTTTTGGACAAAAAACTGCTTCCATTGCAAAAATGAAAAGTACACAAGTGGGGTCAAAACAGAAGCCCTCTAGCCAAAAGTAACAAAATACTCTAATATAGGAAACAGAGGACGGACCACAGCAAGAAGTTCTCCTAATTATCCTCCCACCATTCAATACCATCAAATCTCTTCTTCAATGGTTCATCTTCATTTTGTACCGTCCATACACTTTAAAGTCCATACACTACAGGGAAATCTATTCATTAGATGACGTCTTATACTGGGTAACTTCTAAACCAATCGTTCAAATGGTCAATTCTACTTTTAAAGTTCCATCCTTGTTGACTAAAAGAAAAGTTGAACCATTGGAAAATAACTACCGAGTAATCCAATCATGCTGTTAATTAAGGATGGTCACTTTACAGCACAATTTTACCAATCTTCACTGACATACGTGATACTACTATATCAAATTAAATGCATGTCAATGAATGACCATAATATGTAAAGAAGCTACTGAAAAGCAGCAATGGAGATCATATGCGTCGCTGAAGGTCCAAGCCAATATATCTTCTCCTAAATTCTGCAACATGAGCTAGAAAATTAATAACATTTCTGAACGCGAAGTTCTGGCACAGTGATAAATATCAAAGAAAACCCAAATCTTCACGTGAAACTAAAGCTTGCTAAGGTGGAAAACTGGCCATAAATCTACTCGTGAGATTGGTTATATCATGAGACAAAAAGCAAACCAAACCCTCTAGACCGGATCTCTCTACAATGAGCTATACTGCTCTGATAATTCTAAAAT
GGTTGAGCATGGACAATGGGTAGTCCACCCACCAAGGTACCTTCCAGACCCCCAGTGATATCCTCTGATTATCTTTATAATGTCTGTCCAAAGAAATTCTCTACTATCCTGAAGTCCTCCACATTGACATGCACTTCATAGAATATGTTCACTAGTATAAAAGTGCACGAAGAAACTTGTAAAGCTTAATCAGAACACCCATATCTTTCTCCAGTAGTTTGCCGTTATTAACATAATTGTGCATTTCACAGGATACATTCACTACAATTGGAAAAGTGACCAACGTAAATACAATCTCAAAGCTTACTCCGAACAACTCCCAGTTTCTGAACTCCATTAAACTAGGTATATGGCCAGTCATCATTCTCAAACGACATCATTGAAGCCACTATCATGACTCCAGAGATTCACACCTTGCTGCAGTGAAAGGGAACCCTCTAAAGCTCCAAGATTATGTACCTCCCTCCATCCCCCCCTTCCCAACCCAAAAAGATAATAAATAAAATAATATGATGGTCCATACAAAATAATGGAAGCACGGCGACTATACAATGGGAATCTTATAATCCAAAGAAACCTACGGAAGAAAACCTAGAAGGTTTCTTATAATCTTACTTGAATACGTTGCACAATAGCAATTAGGAAACCCCTGAGTCAATTTATGAAACGAAGTCTCCACGATTAGGAAATAGGGAAGAAGACAGACCTTATGCCACATACGATGCAATATAAACTACCAGCATTTCCTTGCTCCTTGTACTTTTTCCTGACAGTAGGCTTCATATTCAGCTTTTTGGAGCACTTCAGAAATGCCTCATGAACCATTTGTTTGAACTCTTCAGAATCCTCAGGTGGTTCAGACTCTGTAGGTTTTATGAAATCATCTGACAGGCCATCGTCGTTTACATGAACATAATCGTTATCTTCATCAGTATTTTTCCAAATTTTAAGCGGTCTATGAAGACTATTTCTGAGATCAACATTTTGTGACTTGATCCAGCTTGAAGGACCACGTCTTCTGTTGCCCTTCATATATTTAGGACCATTTTTGAATTTCTGTGCATGGTCTAAATACCTTTCACATGAAAATGAATCATCAGAAGCATGGAAATTATGACGATCGTATTTCTTATTCGACTTCCTGTATTTGCCATGGTCAAATCCTGCTTTCCTGGATGTATATCTGTGACTGATATCATCATCCATCCACGCTTCACCACTATCATACATATCCACAACCTTGTCGGGTGTGTATTCACATGGCATACTTATTCTATGATCAAGTAAATTCATATCTTCCTCGAAATAATTTCTTTTTGGCATCTGATCCCTCAACTTATAAGCGGAAACACCCTCTGTCATTCTCTGCACTCTTTCAGAATTTCTATAAGTTTCTCCATCACACTTAGACATGGAAGATCTATGCAACCTCTCTTTCAAATAATGTGAACCTACTTCTCTTTCAGAGCCATAATCTGATCTTTCATGCAATTTTTTACTATACTCAGAAGGGTTGATGTCCGAAACTTCATAATCTTGAGTTACTCTCCTTCCCACTTCAACATGGATGGATCCAGTACTTGCATATTCACAGTCATGGTTTATTGTTTTTGAGGTTAGTCCTATATCAGGGTAATCTAAAACTCTCTGTTCTCTAAGACTATAACGGTTAAGTGCTATTCCAGTACTATGATTTGCAAACAAGTTCTCAGGATTTTGTAATTTCAGCGAACTGGAGTTTGCAACTTTTGGTCTTGAAAAGTCTCCAAGATCAGACACAATTGCTCTACCATAATCAATATAATCATGTGTTTGATTAATTTTCTGCATCACTCGAGATGGATCTTCATTTGAATCATTAACTGTTCCCTCAGGTTTAGAATAAAAATAGCTCATATGCTCCCTCATAGGACTATTAGTACTTTGTTGATGAGGAGTAAAATTCCTTTCTCCACCAACAAGATCAATGGCTGAATCTACAAGGGTCCTTTTCCCATATG
ACTTAAAGTCTAAACGGTCCATAAGGTCTCCATGGCTCCTTGTTGGAAAATCATCCGAAACCTGAAAGTACTGGCCATCCTGATAAGATTCCAAAGACTTGCTTGTAAATGGCCCTGACGAAGATCTTTCATATTCTTTGGAATAAAATCCCGAGCTTGCCAAATATCCAATATTCCTTGAACTAAACCTACGGCTTTCTTCAACTTCATGGGACTCTGTGACTTGTGATTTATCTGAAACTACATGACTTCGAAATTTAAGTCTTTCATCGTCAAGACTTCGCACATCCAAACTCTGAGAAGATGGTAAGAAGTCTTTGTAAATTGACGTGGAACCCATATTCAAAGATGAACTATATGATCCCATTGCCGTGCTCTCTTCCATAGCCAAAAGTTTTTGATCCGTCATCCTTTGTCCACTGCCAACCGACCATCTCCCTTCAATAGTTTCTTTATCTTTCCTAATTCTCAAATCATCATGCCTATACCGAAGGTCATGCTCATAATCTAAAACTTCATCACCAGCATGAAATTTCCTTGGCTCAGGTACCACACGCGTCTGCTGCAAATCCGAAAGCTGCCTGTGATCATTCTGATGGTACAGTTCCTCAAAGTTAGGTTTTTTCCTCGCCTGGCCATAAGAATGTGACCTTGAGTCGACGTTATTGTTCCTTCCAGTTCTGAGATGCCAATCCTCGTCCCTCCTTTCAATGGTGTCGACCCTCTGACCCAAACCCACTTCGCGTCGGGAAGCACCAACTCTGTGAGGGCTCAAACTCCTCCTTAGACGCGGAGACCGATCCAGTGCCTCTCGACGGGTCTTCCCATATCGATCATGATCCAGATGAAGCCGATCCTGAGCGTGTAGTTTCATACTTTCCGACTCTCTAACATAGTAATCATCACGCCTTCGACATTGCATCTTTAACTAATTAGTTGAAATTATTACTAACAAAGTCTGGAATAATACCCTAAAATCCTCGAATCGATTCGCACTAAATCGTCCCAATAAACAGGCAGAAACTTTCAATCCCCATGACCTAAAAGTAAAACACCCAGTCGGAATATCGAACCAGAATATGGTATGGATTGCAATAAAAATCCAACGAAAATCCGAGACAAGGATTATCAAACTGGGAAATTTTTCAAACCTTCTGGGTCTCTGATCCTCCTTCGCTTAAGCCTGCCGGATGACCGGATTCTTGCATCCGTTACCGAAATTCACAAATCACGAGCAGAGAGTTTGTTCGCAACGGAGTTTTGTTTAAGTTCCGGCTTCTGAGAGAAAGTATAGAGCCGAAATCCGCACTGCAGACGACAAGAGCGAGAGAAACCGATTATTTATAGCGAATGCGAATGTTCGTCAAATTGTTTTTGGG
Coding sequence (CDS)
ATGCCGTCGGTGGGGATGAGAAGGACGAGAGTTTTCGGTGTGGTAAAGGGTGTAGATGGAGCTAGAGTTTTGAGGTCTGGAAGGCGTCTATGGCCCGAATCGGGTGAGGTGAAGCTTAAGAAATCCAAAGATGCCAGTGACTGGTACCCTGTTATTGAGAACAGAGGAAATGGGGGTGGAAGTGGCCAGGTTAGGCTCCATGGTAAGTGGACTCAAGTTCGTAATGTCAACCCGAAGCGGGTTGTCGTCGTTGACATTCATGAGGAGGATGATACTTGCGTAGCTGATGTGCCTAAACCTGTAAAAGTTGTTGCTAGGAATGGTGGTGATGGTGAGTCTGGGGTTGTGGATAGGATGTTTGGGAAAGTGTACAGAAGAAAGAGGAAGAGGGGTCTTTCAGAGAATCGCGTTATTTTCGATGAAACGGAGGGAGAGAATGGTTTGGCAGGAGATAGAATGTTTGGACTCCGTTTCATTAGAAGACAGAGAACGAGGAAGACTAACGATGGAAATTGGGAGCCTACTGCAGGTGGTCGTTTTAGAGTGCACTTTTCTAGGCAAAGTACTTTGCAGCTGACCCGGGATCGGGTTCTAACTATTTTTTCTGGGAGTAGTCATGATGGTGGCTGTTTCTCAGATTTTATGCTCTCGATTCTTAGACACCTGAAGAGTCCACTGCTGAGGGTGGCTAAGCTTTCTGAATTTTTGTTATCCGATCCAATCTGTGGAGTTTTTGCTTCAAAGGGAATTCGGTTCTTGCAGGGTTATCCTCCTACTGGAAGCTCTGGCATTTGTGTGATTTTTGGGGCCACGCAGTCGACGCCAATGTTCCATTTAGATTTTTCCGCCATTCCTCTCTATTTTATGTACTTGCATTCAAGGATGTTTCTTAGAGCAACTTGGATTCAGGCTCGTCTTGTATATAATAACAAGCAGTTAGATGTAGATATGAGTAGTGATAGTGAAGAAGACAGCATTGAAGAGCAACATGTTCCCAGTCCTCCTGGAAGAAGTTCTTTGGAGTGCAAAACCGTGACATTTGCAGTAGATCACCCTAAGAGCCGACCCATTTTACATCCATCTGTTAGAGCTTCAAGGTTAGGTGGTCGGAACATGCAATACAGAAACAGTTTCAGTTCTCGTGGTGTGCGTAAAAGGAGGAGTTCGCTGAGGATGAGGAGGCCTAGAAACTCTTCTCTTGCTGCTATGCAAAAAGCATCAGAATCAGTTCATGATACCAAACGCAGTGTATCCTTTTCTTCTGCAGCATCTTATAACAAGAACAAGTACTTAGCCCTGAGAGATTCTGCTGGGCGCATCAGAGAAGGGAGTTCTACTGCATTGAGATCAGCAATGGTTGTTGACCCATCATGCTGCAATGCAAATATATTAATTGTGGAATCTGATAAATGTTTGAGAGAAGAGGGAGCCAATATTGTGTTAGAGTTTTCTGCATCGTGTGAATGGCTTCTAGTGGTCAAGAAAGATGGTTCAGCTAGATACACCCACAAAGCAGAAAAAGTTATGAAGCCCTCTTCTTGCAATCGTTTTACGCATGCAATAATTTGGTCCACGGATAGTGGTTGGAAGCTAGAGTTTCCAAATCGAAGGGATTGGTTTATTTTCAAGGATCTCTACAAGAAATGTTCTGATCGCAATATCCCATGTTCTACTGCTAAAGCTATTCCTGTGCCCAGAGTGTGTGAAGTTCCAGGTTATGTGGATAGTAGTTGTGCTTCTTTTCAAAGGCCAGATATGTACATCTCTATACATGATGATGAGGTATTTAGAGCCTTGGCAAAGACAACCGCAAACTATGACATGGACTCTGACGACGAGGAATGGTTAAGCAAGTCTAACGACGAGCTTATTGTGACAGACAAGCATCAAGAATGTATTTCAGTGGATAACTTTGAGTTGATGATCGATGCCTTTGAGAAGGGACTTTATTGTAATCCTGATGCCTTCCCTGATGAGAAAGCTCCTGCTGAAATATGCAC
GCATCTTGGTAACCGTCAAATGGTTGAGTCTGTATTTACTTATTGGATGAAGAAACGAAGACAGAAGAAATCATCCTTGGTTAGGGTTTTCCAGGCTCATCAGGCTAAGAGGAAACCTCTGGTGGTCCCTAAACCTATCATGCGAAGAAGGAGATCATTCAAAAGAAAGCCTAGCCAAATTGGGAGAGCTACACACTCAAATCTTTTGGAAGCCATAGTTTTGAGGCGAGATGCTGAGGAGGACCAGAATGCCATGGCAAAATACCAAGAAGCGAAGGCAACGGCGGAGAAAGCCTTTGAATGTGCTGTGGGGAAGCGGCAGAGGGCACAGTTGCTGTTGGAGAATGCAGATTTGGCAGCTTACAAAGCCATGATGGCACTGAGGATTGCCGAAGCAATTCAAGCATCAGAAATATTGCCGGATGCTGAAGCTGAATCTGAAGCTGCTGCTGCTGCATCTTTTTTTCTCGAGTGA
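The utr5, CDS and utr3 regions named in the legend can be recovered from plain text by locating the coding sequence inside the mRNA sequence. The following is a minimal sketch, not the database's own method; the function name `split_mrna` is hypothetical, and it assumes the CDS occurs as one contiguous substring of the mRNA.

```python
def split_mrna(mrna: str, cds: str) -> tuple[str, str, str]:
    """Return (utr5, cds, utr3) by locating the CDS inside the mRNA.

    Hypothetical helper for illustration; assumes the CDS is a single
    contiguous substring of the mRNA.
    """
    # Drop whitespace so sequences wrapped over several lines still match.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy example (not the MC04g1434 sequences):
utr5, coding, utr3 = split_mrna("GGCCATGAAA\nTGATTT", "ATGAAATGA")
print(len(utr5), len(coding), len(utr3))
```

To apply it to this feature, pass the full mRNA and CDS text from the sections above; line breaks inside the sequences are ignored.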
Protein sequence
MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGGSGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMFGKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRFRVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLRATWIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSFKRKPSQIGRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE
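The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Drop whitespace so a sequence wrapped over several lines is handled.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGCCGTCGGTGGGGATGAGAAGGACGAGA"))  # MPSVGMRRTR
```

The last seven codons of the CDS (GCATCTTTTTTTCTCGAGTGA) translate to ASFFLE followed by the TGA stop codon, which agrees with the end of the protein sequence above.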
Homology
BLAST of MC04g1434 vs. NCBI nr
Match:
XP_022133811.1 (uncharacterized protein LOC111006281 [Momordica charantia])
HSP 1 Score: 1617 bits (4188), Expect = 0.0
Identity = 822/824 (99.76%), Positives = 823/824 (99.88%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG
Sbjct: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF
Sbjct: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
Query: 181 RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP 240
RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP
Sbjct: 181 RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP 240
Query: 241 ICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLRAT 300
ICGVFASKGIRFLQGYPPTGSSGICVIFGATQS PMFHLDFSAIPLYFMYLHSRMFLRAT
Sbjct: 241 ICGVFASKGIRFLQGYPPTGSSGICVIFGATQSMPMFHLDFSAIPLYFMYLHSRMFLRAT 300
Query: 301 WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP 360
WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP
Sbjct: 301 WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP 360
Query: 361 SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS 420
SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS
Sbjct: 361 SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS 420
Query: 421 SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV 480
SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV
Sbjct: 421 SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV 480
Query: 481 LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF 540
LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF
Sbjct: 481 LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF 540
Query: 541 IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA 600
IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA
Sbjct: 541 IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA 600
Query: 601 KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK 660
KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK
Sbjct: 601 KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK 660
Query: 661 APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF 720
APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF
Sbjct: 661 APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF 720
Query: 721 KRKPSQIGRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQRAQLLL 780
KRKPSQIGRATHSNLLEAIVLRRDAEEDQNA+AKYQEAKATAEKAFECAVGKRQRAQLLL
Sbjct: 721 KRKPSQIGRATHSNLLEAIVLRRDAEEDQNAVAKYQEAKATAEKAFECAVGKRQRAQLLL 780
Query: 781 ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE
Sbjct: 781 ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
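The Identity and Positives figures in each HSP header can be related to the alignment's middle line: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch under that reading of the report format; the helper names `percent` and `score_midline` are hypothetical.

```python
def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100 * count / total, 2)


def score_midline(midline: str, aligned_length: int) -> dict:
    """Count identities and positives from a BLAST alignment midline.

    aligned_length is passed explicitly because trailing spaces of the
    midline may have been stripped when the report was saved as text.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "identity_pct": percent(identities, aligned_length),
        "positive_pct": percent(positives, aligned_length),
    }


# Header values of the first hit: 822 and 823 out of 824 aligned positions.
print(percent(822, 824), percent(823, 824))  # 99.76 99.88

# Toy midline: six identities, one conservative substitution, one mismatch.
print(score_midline("MPS GM+R", 8))
```

For a full HSP, concatenate the middle lines of all alignment blocks before counting.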
BLAST of MC04g1434 vs. NCBI nr
Match:
XP_022992589.1 (uncharacterized protein LOC111488892 [Cucurbita maxima])
HSP 1 Score: 1172 bits (3032), Expect = 0.0
Identity = 613/826 (74.21%), Positives = 695/826 (84.14%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG+DGARVLRSGRRLWPESGEVKLKKSKDASDWYPVI++RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIDSRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNV PKRVVVV+I EE+D CV VP+P+KV R G GESG VDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVKPKRVVVVNIREEEDACVVKVPEPLKVFPRIGSGGESGDVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG +EN +FDE EG+N ++GDRMFGLRFIRRQR+RKT+ WEPTA GR
Sbjct: 121 GKVYSRKRKRGRAENGYVFDEMEGDNAISGDRMFGLRFIRRQRSRKTDIAQWEPTASGRS 180
Query: 181 -RVHFSRQS-TLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLS 240
++H R S + L DRVLTIF+GSS + GCFSDF+ S+LRHL SP L VAKL+ FLLS
Sbjct: 181 TKLHLHRPSISPPLPSDRVLTIFAGSSINNGCFSDFIQSVLRHLNSPDLNVAKLASFLLS 240
Query: 241 DPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLR 300
+ I GVFAS G+RFLQGYPPTGSSG+CVIFG+ Q P+FHLDFSA+P FMYLHS+MFLR
Sbjct: 241 NTINGVFASMGMRFLQGYPPTGSSGMCVIFGSRQCIPLFHLDFSAVPFPFMYLHSKMFLR 300
Query: 301 ATWIQARLVYNNKQLDVDMSSDSEEDS-IEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
TWIQARLVYNN+QLDVDMSSDSEEDS +EEQHV +PP R+SL+CK+V F VDH +R
Sbjct: 301 QTWIQARLVYNNEQLDVDMSSDSEEDSMVEEQHVSNPPVRNSLDCKSVAFGVDHTNTRSN 360
Query: 361 LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKR 420
H SVRASRLG R +QYRN FSSRG+RKRRSSLRMRRPR+ SLAAMQK + D KR
Sbjct: 361 SHSSVRASRLGSRALQYRNGFSSRGIRKRRSSLRMRRPRSHSLAAMQKTVGIFGIDDMKR 420
Query: 421 SVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREE 480
SVSF S AS N++K ALRDS+GR+ SSTAL SAM VD SCCNANILIVE+D+C+REE
Sbjct: 421 SVSFPSVASCNRHKNSALRDSSGRV---SSTALGSAMDVDSSCCNANILIVEADRCMREE 480
Query: 481 GANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPN 540
GANIVLEFSASCEWLL VKK+GS RYTHK E VMKP+ CNRFTHAI+WS D+GWKLEFPN
Sbjct: 481 GANIVLEFSASCEWLLAVKKNGSTRYTHKVETVMKPAYCNRFTHAILWSADNGWKLEFPN 540
Query: 541 RRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEV 600
RRDW IFKDLYK+CSDRNIPC TAKAIPVPRV EVP YVDSSC F+RPD YIS++DDEV
Sbjct: 541 RRDWLIFKDLYKECSDRNIPCFTAKAIPVPRVSEVPDYVDSSCTYFKRPDTYISVNDDEV 600
Query: 601 FRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDA 660
RA AK+TANYDMDS+DEEWLSK NDELI TDK +C+S D+FELMIDAFEK L+CNPDA
Sbjct: 601 CRARAKSTANYDMDSEDEEWLSKFNDELIATDKQHDCLSGDSFELMIDAFEKELFCNPDA 660
Query: 661 FPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMR 720
F DEKAP ++ LG+R VES+FTYW +KRRQ+KS L+RVFQAHQ+KRKP VVPKPIMR
Sbjct: 661 FSDEKAPTDMFMLLGSRPTVESLFTYWTRKRRQRKSCLIRVFQAHQSKRKPPVVPKPIMR 720
Query: 721 RRRSFKRKPSQIGRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQR 780
R+RS KR+ SQ GRAT S++L+AIV RRDA E+QNAM KY++AKA AE+ E AV KRQR
Sbjct: 721 RKRSIKRQTSQSGRATQSSILKAIVSRRDAVEEQNAMQKYEDAKAAAERCMESAVSKRQR 780
Query: 781 AQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASF 821
AQLLLENADLAAYKA++ALRIAEAIQASE LP A A + AAAAA F
Sbjct: 781 AQLLLENADLAAYKAVVALRIAEAIQASE-LPGAAAAAAAAAAACF 821
BLAST of MC04g1434 vs. NCBI nr
Match:
XP_023550905.1 (uncharacterized protein LOC111808903 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1169 bits (3025), Expect = 0.0
Identity = 621/831 (74.73%), Positives = 699/831 (84.12%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG+DGARVLRSGRRLWPESGEVKLKKSKDASDWYPVI++RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIDSRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNV PKRVVVV+I EE+D CV VP+P+KV+ R G DGESG VDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVKPKRVVVVNIREEEDACVVKVPEPLKVLPRIGSDGESGDVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG EN FDE EG+N ++GDRMFGLRFIRRQR+RKT+ +WEPTA GR
Sbjct: 121 GKVYSRKRKRGRPENGYGFDEMEGDNAISGDRMFGLRFIRRQRSRKTDITHWEPTASGRS 180
Query: 181 -RVHFSRQSTLQ-LTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLS 240
++HF R S L RDRVLTIF+GSS + GCFSDF+ S+LRHL SP L VAKLS FLLS
Sbjct: 181 TKLHFHRPSVSPPLPRDRVLTIFAGSSINNGCFSDFIQSVLRHLNSPELNVAKLSSFLLS 240
Query: 241 DPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLR 300
+ I GVFAS G+RFLQGYPPTGSSG+CVIFG+ Q PMFHLDFSA+P FMYLHS+MFLR
Sbjct: 241 NTINGVFASTGMRFLQGYPPTGSSGMCVIFGSRQCIPMFHLDFSAVPFPFMYLHSKMFLR 300
Query: 301 ATWIQARLVYNNKQLDVDMSSDSEEDS-IEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
T IQARLVYNN+QLDVDMSSDSEEDS +EEQHV +PP RSSL+CKTV F VDH +R
Sbjct: 301 QTRIQARLVYNNEQLDVDMSSDSEEDSMVEEQHVSNPPVRSSLDCKTVAFGVDHTNTRSN 360
Query: 361 LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKR 420
SVRASRLG R +QYRN FSSRG+RKRRSSLRMRRPR+ SLAAMQK + D KR
Sbjct: 361 SQLSVRASRLGSRALQYRNGFSSRGIRKRRSSLRMRRPRSHSLAAMQKTVGIFGIDDMKR 420
Query: 421 SVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREE 480
SVSF S AS N++K ALRDS+GR+ SSTAL SAM VD SCCNANILIVE+D+C+REE
Sbjct: 421 SVSFPSVASCNRHKNSALRDSSGRV---SSTALGSAMDVDSSCCNANILIVEADRCMREE 480
Query: 481 GANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPN 540
GANIVLEFSASCEWLL VKK+GS RYTHKAE VMKP+ CNRFTHAI+WS D+GWKLEFPN
Sbjct: 481 GANIVLEFSASCEWLLAVKKNGSTRYTHKAETVMKPAYCNRFTHAILWSADNGWKLEFPN 540
Query: 541 RRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEV 600
RRDW IFKDLYK+CSDRNIPC TAKAIPVPRV EVP YVDSSC F+RPD YIS++DDEV
Sbjct: 541 RRDWLIFKDLYKECSDRNIPCFTAKAIPVPRVSEVPDYVDSSCTYFKRPDTYISVNDDEV 600
Query: 601 FRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDA 660
RA AK+TANYDMDS+DEEWLSK ND+LI TDK EC+S D+FELMIDAFEK L+CNPDA
Sbjct: 601 CRARAKSTANYDMDSEDEEWLSKFNDKLIATDKQHECLSGDSFELMIDAFEKELFCNPDA 660
Query: 661 FPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMR 720
F DEKAP ++ LG+R VES+FTYW +KRRQ+KS L+RVFQAHQ+KRKP VVPKPIMR
Sbjct: 661 FSDEKAPTDMFMLLGSRSTVESLFTYWTRKRRQRKSCLIRVFQAHQSKRKPPVVPKPIMR 720
Query: 721 RRRSFKRKPSQIG--RATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKR 780
R+RS KR+PSQ G RAT S++L+AIV RRDA E+QNA+ KY+EAKA AE+ E AV KR
Sbjct: 721 RKRSIKRQPSQSGSGRATQSSILKAIVSRRDAVEEQNAVQKYEEAKAAAERCMESAVSKR 780
Query: 781 QRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
QRAQLLLENADLAAYKA++ALRIAEAIQASE LP+A A A AAA+ FLE
Sbjct: 781 QRAQLLLENADLAAYKAVVALRIAEAIQASE-LPEAAA---ATAAAACFLE 823
BLAST of MC04g1434 vs. NCBI nr
Match:
XP_022939906.1 (uncharacterized protein LOC111445630 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1160 bits (3002), Expect = 0.0
Identity = 617/831 (74.25%), Positives = 695/831 (83.63%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG+DGARVLRSGRRLWPESGEVKLKKSKDASDWYPVI++RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIDSRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNV PKRVVVV+I EE+D CVA VP+P+K++ R G DGESG VDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVKPKRVVVVNIREEEDACVAKVPEPLKILPRIGSDGESGDVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG EN FDE EG+N ++GDRMFGLRFIRRQR+RKT+ +WEPTA GR
Sbjct: 121 GKVYSRKRKRGRRENGYGFDEMEGDNAISGDRMFGLRFIRRQRSRKTDIAHWEPTASGRS 180
Query: 181 -RVHFSRQSTLQ-LTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLS 240
++HF R S L DRVLTIF+GSS + GCFSDF+ S+LRHL SP L VAKL+ FLLS
Sbjct: 181 TKLHFHRPSVSPPLPHDRVLTIFAGSSINNGCFSDFIQSVLRHLNSPELNVAKLASFLLS 240
Query: 241 DPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLR 300
+ I GVFAS G+ FLQGYPPTGSSG+CVIFG+ Q PMFHLDFSA+P FMYLHS MFLR
Sbjct: 241 NTINGVFASTGMHFLQGYPPTGSSGMCVIFGSRQCIPMFHLDFSAVPSPFMYLHSEMFLR 300
Query: 301 ATWIQARLVYNNKQLDVDMSSDSEEDS-IEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
TWIQARLVYNN+QLDVDMSSDSEEDS +EEQHV +PP SSL+CKTV F VDH R
Sbjct: 301 QTWIQARLVYNNEQLDVDMSSDSEEDSMVEEQHVSNPPV-SSLDCKTVAFGVDHTNPRSN 360
Query: 361 LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKR 420
SVRASRLG R +QYRN FSSRG+RKRRSSLRMRRPR+ SLAAMQK + D KR
Sbjct: 361 SQLSVRASRLGSRALQYRNGFSSRGIRKRRSSLRMRRPRSHSLAAMQKTVGIFGIDDMKR 420
Query: 421 SVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREE 480
SVSF S AS N++K ALRDS+G + SSTAL SAM VD SCCNANILIVE+D+C+REE
Sbjct: 421 SVSFPSVASCNRHKNSALRDSSGHV---SSTALGSAMDVDSSCCNANILIVEADRCMREE 480
Query: 481 GANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPN 540
GANIVLEFSASCEWLL VKK+GS RYTHKAE VMKP+ CNRFTHAI+WS D+GWKLEFPN
Sbjct: 481 GANIVLEFSASCEWLLAVKKNGSTRYTHKAETVMKPAYCNRFTHAILWSADNGWKLEFPN 540
Query: 541 RRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEV 600
RRDW IFKDLYK+CSDRNIPC TAKAIPVPRV EVP YVDSSC F+RPD YIS+++DEV
Sbjct: 541 RRDWLIFKDLYKECSDRNIPCFTAKAIPVPRVSEVPDYVDSSCTYFKRPDTYISVNNDEV 600
Query: 601 FRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDA 660
R AK+TANYDMDS+DEEWLSK NDELI TDK EC+S D+FELMIDAFEK L+CNPDA
Sbjct: 601 CRTRAKSTANYDMDSEDEEWLSKFNDELIATDKQHECLSGDSFELMIDAFEKELFCNPDA 660
Query: 661 FPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMR 720
F DEKAP ++ LG+R VES+FTYW +KRRQ+KS L+RVFQAHQ+KRKP VVPKPIMR
Sbjct: 661 FSDEKAPTDMFMLLGSRSTVESLFTYWTRKRRQRKSCLIRVFQAHQSKRKPPVVPKPIMR 720
Query: 721 RRRSFKRKPSQIG--RATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKR 780
R+RS KR+PSQ G RAT S++L+AIV RRDA E+QNA+ KY+EAKA AE+ E AV KR
Sbjct: 721 RKRSIKRQPSQSGSGRATQSSILKAIVSRRDAVEEQNAVQKYEEAKAAAERCMESAVSKR 780
Query: 781 QRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
QRAQLLLENADLAAYKA++ALRIAEAIQASE LP+A A AAAAA+ FLE
Sbjct: 781 QRAQLLLENADLAAYKAVVALRIAEAIQASE-LPEAAA---AAAAAACFLE 822
BLAST of MC04g1434 vs. NCBI nr
Match:
XP_023516479.1 (uncharacterized protein LOC111780334 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1143 bits (2956), Expect = 0.0
Identity = 612/835 (73.29%), Positives = 681/835 (81.56%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MP+VGMRRTRV G+ KGVDG RVLRSGRRL ES E KLKK+KD SDWYPV+ RGNGGG
Sbjct: 1 MPTVGMRRTRVIGL-KGVDGGRVLRSGRRLCIESVEAKLKKTKDVSDWYPVVNKRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEE--DDTCVADVPKPVKVVARNGGDGESGVVDR 120
SGQVR HGKW +RNV PK VVVV+I EE DD CVA+VPKPVKV+AR GDGE G VDR
Sbjct: 61 SGQVRFHGKWQGIRNVKPKSVVVVNIREEEEDDACVAEVPKPVKVLARINGDGEFGYVDR 120
Query: 121 MFGKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGG 180
MFG+VYRRKRKRGLSEN +FDE DRMFGLRFIRRQR+RK +WEPTAGG
Sbjct: 121 MFGEVYRRKRKRGLSENGDVFDEM--------DRMFGLRFIRRQRSRKNTVEHWEPTAGG 180
Query: 181 RF-RVHFSRQSTLQLTRDRVLTIFSGSSHDG-GCFSDFMLSILRHLKSPLLRVAKLSEFL 240
++HF +QS RDRVLT+F+GS HDG GCFSDFMLS+LRH KSP L +AK S FL
Sbjct: 181 HSAKLHFHKQSISPPPRDRVLTVFAGSDHDGVGCFSDFMLSVLRHFKSPELGMAKFSAFL 240
Query: 241 LSDPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMF 300
LS PI VFASKG+RFLQ YP GSSG+CVIFGA QS PMFHLDFSA+PL FM+LHS M
Sbjct: 241 LSSPIHDVFASKGMRFLQSYPSIGSSGMCVIFGAVQSIPMFHLDFSAVPLCFMHLHSLML 300
Query: 301 LRATWIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRP 360
R TWIQARLVYNN QLDVDMSSD+ EDS EE V SPPG SSLECK++ VDH KSR
Sbjct: 301 FRVTWIQARLVYNNNQLDVDMSSDNAEDSNEEYLVSSPPG-SSLECKSMVVGVDHTKSRS 360
Query: 361 ILHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESV--HDTK 420
I HPSVRASRLG R +QYRN FS RG+RKRRSS MRRPR+ SLAAMQKA S+ D K
Sbjct: 361 ISHPSVRASRLGSRTLQYRNGFSFRGIRKRRSSRGMRRPRSHSLAAMQKAIGSLGADDMK 420
Query: 421 RSVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLRE 480
RSVSF SAAS ++K LA RDSAGRIRE SSTALRSAM V SCCNANILIVESDKCLRE
Sbjct: 421 RSVSFPSAASCIRHKNLARRDSAGRIREESSTALRSAMDVSSSCCNANILIVESDKCLRE 480
Query: 481 EGANIVLEFSASCEWLLVVKKDGSARYTHKAEKV-MKPSSCNRFTHAIIWSTDSGWKLEF 540
EGA+IVLEFSASCEWLLVVKKDGS RYT KA+KV MKP+SCNRFTHAI+WS+D+GWKLEF
Sbjct: 481 EGASIVLEFSASCEWLLVVKKDGSTRYTFKADKVIMKPASCNRFTHAILWSSDNGWKLEF 540
Query: 541 PNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDD 600
PNRRDWF+FKDLYK+CSDRNIPCS AKAIPVP V EVPGYVDSS SF+RPD YIS++DD
Sbjct: 541 PNRRDWFVFKDLYKECSDRNIPCSAAKAIPVPIVSEVPGYVDSSGVSFRRPDTYISVNDD 600
Query: 601 EVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNP 660
EV RA+AK+TANYDMDSDDEEWLSK NDEL+ TD H EC+S DNFELM+DAFEKG +CNP
Sbjct: 601 EVCRAMAKSTANYDMDSDDEEWLSKFNDELVATDNHHECVSADNFELMVDAFEKGFFCNP 660
Query: 661 DAFPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPI 720
DAF +E+APA+ICTHLG++ +VES+F YW KKR+Q+KSSL+RVFQAHQAKRKP V+PK I
Sbjct: 661 DAFSNEEAPADICTHLGSQSIVESLFAYWTKKRKQRKSSLIRVFQAHQAKRKPPVIPKHI 720
Query: 721 MRRRRSFKRKPSQIG---RATHSNLLEAIVLRRDA-EEDQNAMAKYQEAKATAEKAFECA 780
MRRRRSFKR+PSQ G RAT S++LE RRDA E QN M KY+E KA A++ E A
Sbjct: 721 MRRRRSFKRQPSQSGCGGRATQSSILEDTFSRRDAMEHHQNGMQKYEEVKAAADRCVETA 780
Query: 781 VGKRQRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
V KRQRAQLLL+NADLA YKAM ALRIAEAIQASE+L A + AAA AS FLE
Sbjct: 781 VSKRQRAQLLLQNADLATYKAMTALRIAEAIQASELLEAEAAAAAAAATASCFLE 825
BLAST of MC04g1434 vs. ExPASy TrEMBL
Match:
A0A6J1BW77 (Enhancer of polycomb-like protein OS=Momordica charantia OX=3673 GN=LOC111006281 PE=3 SV=1)
HSP 1 Score: 1617 bits (4188), Expect = 0.0
Identity = 822/824 (99.76%), Positives = 823/824 (99.88%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG
Sbjct: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF
Sbjct: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
Query: 181 RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP 240
RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP
Sbjct: 181 RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSDP 240
Query: 241 ICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLRAT 300
ICGVFASKGIRFLQGYPPTGSSGICVIFGATQS PMFHLDFSAIPLYFMYLHSRMFLRAT
Sbjct: 241 ICGVFASKGIRFLQGYPPTGSSGICVIFGATQSMPMFHLDFSAIPLYFMYLHSRMFLRAT 300
Query: 301 WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP 360
WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP
Sbjct: 301 WIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILHP 360
Query: 361 SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS 420
SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS
Sbjct: 361 SVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESVHDTKRSVSFS 420
Query: 421 SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV 480
SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV
Sbjct: 421 SAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGANIV 480
Query: 481 LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF 540
LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF
Sbjct: 481 LEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRRDWF 540
Query: 541 IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA 600
IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA
Sbjct: 541 IFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFRALA 600
Query: 601 KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK 660
KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK
Sbjct: 601 KTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFPDEK 660
Query: 661 APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF 720
APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF
Sbjct: 661 APAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRRRSF 720
Query: 721 KRKPSQIGRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQRAQLLL 780
KRKPSQIGRATHSNLLEAIVLRRDAEEDQNA+AKYQEAKATAEKAFECAVGKRQRAQLLL
Sbjct: 721 KRKPSQIGRATHSNLLEAIVLRRDAEEDQNAVAKYQEAKATAEKAFECAVGKRQRAQLLL 780
Query: 781 ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE
Sbjct: 781 ENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
BLAST of MC04g1434 vs. ExPASy TrEMBL
Match:
A0A6J1JXY7 (Enhancer of polycomb-like protein OS=Cucurbita maxima OX=3661 GN=LOC111488892 PE=3 SV=1)
HSP 1 Score: 1172 bits (3032), Expect = 0.0
Identity = 613/826 (74.21%), Positives = 695/826 (84.14%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG+DGARVLRSGRRLWPESGEVKLKKSKDASDWYPVI++RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIDSRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNV PKRVVVV+I EE+D CV VP+P+KV R G GESG VDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVKPKRVVVVNIREEEDACVVKVPEPLKVFPRIGSGGESGDVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG +EN +FDE EG+N ++GDRMFGLRFIRRQR+RKT+ WEPTA GR
Sbjct: 121 GKVYSRKRKRGRAENGYVFDEMEGDNAISGDRMFGLRFIRRQRSRKTDIAQWEPTASGRS 180
Query: 181 -RVHFSRQS-TLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLS 240
++H R S + L DRVLTIF+GSS + GCFSDF+ S+LRHL SP L VAKL+ FLLS
Sbjct: 181 TKLHLHRPSISPPLPSDRVLTIFAGSSINNGCFSDFIQSVLRHLNSPDLNVAKLASFLLS 240
Query: 241 DPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLR 300
+ I GVFAS G+RFLQGYPPTGSSG+CVIFG+ Q P+FHLDFSA+P FMYLHS+MFLR
Sbjct: 241 NTINGVFASMGMRFLQGYPPTGSSGMCVIFGSRQCIPLFHLDFSAVPFPFMYLHSKMFLR 300
Query: 301 ATWIQARLVYNNKQLDVDMSSDSEEDS-IEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
TWIQARLVYNN+QLDVDMSSDSEEDS +EEQHV +PP R+SL+CK+V F VDH +R
Sbjct: 301 QTWIQARLVYNNEQLDVDMSSDSEEDSMVEEQHVSNPPVRNSLDCKSVAFGVDHTNTRSN 360
Query: 361 LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKR 420
H SVRASRLG R +QYRN FSSRG+RKRRSSLRMRRPR+ SLAAMQK + D KR
Sbjct: 361 SHSSVRASRLGSRALQYRNGFSSRGIRKRRSSLRMRRPRSHSLAAMQKTVGIFGIDDMKR 420
Query: 421 SVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREE 480
SVSF S AS N++K ALRDS+GR+ SSTAL SAM VD SCCNANILIVE+D+C+REE
Sbjct: 421 SVSFPSVASCNRHKNSALRDSSGRV---SSTALGSAMDVDSSCCNANILIVEADRCMREE 480
Query: 481 GANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPN 540
GANIVLEFSASCEWLL VKK+GS RYTHK E VMKP+ CNRFTHAI+WS D+GWKLEFPN
Sbjct: 481 GANIVLEFSASCEWLLAVKKNGSTRYTHKVETVMKPAYCNRFTHAILWSADNGWKLEFPN 540
Query: 541 RRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEV 600
RRDW IFKDLYK+CSDRNIPC TAKAIPVPRV EVP YVDSSC F+RPD YIS++DDEV
Sbjct: 541 RRDWLIFKDLYKECSDRNIPCFTAKAIPVPRVSEVPDYVDSSCTYFKRPDTYISVNDDEV 600
Query: 601 FRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDA 660
RA AK+TANYDMDS+DEEWLSK NDELI TDK +C+S D+FELMIDAFEK L+CNPDA
Sbjct: 601 CRARAKSTANYDMDSEDEEWLSKFNDELIATDKQHDCLSGDSFELMIDAFEKELFCNPDA 660
Query: 661 FPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMR 720
F DEKAP ++ LG+R VES+FTYW +KRRQ+KS L+RVFQAHQ+KRKP VVPKPIMR
Sbjct: 661 FSDEKAPTDMFMLLGSRPTVESLFTYWTRKRRQRKSCLIRVFQAHQSKRKPPVVPKPIMR 720
Query: 721 RRRSFKRKPSQIGRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQR 780
R+RS KR+ SQ GRAT S++L+AIV RRDA E+QNAM KY++AKA AE+ E AV KRQR
Sbjct: 721 RKRSIKRQTSQSGRATQSSILKAIVSRRDAVEEQNAMQKYEDAKAAAERCMESAVSKRQR 780
Query: 781 AQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASF 821
AQLLLENADLAAYKA++ALRIAEAIQASE LP A A + AAAAA F
Sbjct: 781 AQLLLENADLAAYKAVVALRIAEAIQASE-LPGAAAAAAAAAAACF 821
BLAST of MC04g1434 vs. ExPASy TrEMBL
Match:
A0A6J1FH41 (Enhancer of polycomb-like protein OS=Cucurbita moschata OX=3662 GN=LOC111445630 PE=3 SV=1)
HSP 1 Score: 1160 bits (3002), Expect = 0.0
Identity = 617/831 (74.25%), Positives = 695/831 (83.63%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG+DGARVLRSGRRLWPESGEVKLKKSKDASDWYPVI++RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGLDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIDSRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SGQVRLHGKWTQVRNV PKRVVVV+I EE+D CVA VP+P+K++ R G DGESG VDRMF
Sbjct: 61 SGQVRLHGKWTQVRNVKPKRVVVVNIREEEDACVAKVPEPLKILPRIGSDGESGDVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG EN FDE EG+N ++GDRMFGLRFIRRQR+RKT+ +WEPTA GR
Sbjct: 121 GKVYSRKRKRGRRENGYGFDEMEGDNAISGDRMFGLRFIRRQRSRKTDIAHWEPTASGRS 180
Query: 181 -RVHFSRQSTLQ-LTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLS 240
++HF R S L DRVLTIF+GSS + GCFSDF+ S+LRHL SP L VAKL+ FLLS
Sbjct: 181 TKLHFHRPSVSPPLPHDRVLTIFAGSSINNGCFSDFIQSVLRHLNSPELNVAKLASFLLS 240
Query: 241 DPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLR 300
+ I GVFAS G+ FLQGYPPTGSSG+CVIFG+ Q PMFHLDFSA+P FMYLHS MFLR
Sbjct: 241 NTINGVFASTGMHFLQGYPPTGSSGMCVIFGSRQCIPMFHLDFSAVPSPFMYLHSEMFLR 300
Query: 301 ATWIQARLVYNNKQLDVDMSSDSEEDS-IEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
TWIQARLVYNN+QLDVDMSSDSEEDS +EEQHV +PP SSL+CKTV F VDH R
Sbjct: 301 QTWIQARLVYNNEQLDVDMSSDSEEDSMVEEQHVSNPPV-SSLDCKTVAFGVDHTNPRSN 360
Query: 361 LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKR 420
SVRASRLG R +QYRN FSSRG+RKRRSSLRMRRPR+ SLAAMQK + D KR
Sbjct: 361 SQLSVRASRLGSRALQYRNGFSSRGIRKRRSSLRMRRPRSHSLAAMQKTVGIFGIDDMKR 420
Query: 421 SVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREE 480
SVSF S AS N++K ALRDS+G + SSTAL SAM VD SCCNANILIVE+D+C+REE
Sbjct: 421 SVSFPSVASCNRHKNSALRDSSGHV---SSTALGSAMDVDSSCCNANILIVEADRCMREE 480
Query: 481 GANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPN 540
GANIVLEFSASCEWLL VKK+GS RYTHKAE VMKP+ CNRFTHAI+WS D+GWKLEFPN
Sbjct: 481 GANIVLEFSASCEWLLAVKKNGSTRYTHKAETVMKPAYCNRFTHAILWSADNGWKLEFPN 540
Query: 541 RRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEV 600
RRDW IFKDLYK+CSDRNIPC TAKAIPVPRV EVP YVDSSC F+RPD YIS+++DEV
Sbjct: 541 RRDWLIFKDLYKECSDRNIPCFTAKAIPVPRVSEVPDYVDSSCTYFKRPDTYISVNNDEV 600
Query: 601 FRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDA 660
R AK+TANYDMDS+DEEWLSK NDELI TDK EC+S D+FELMIDAFEK L+CNPDA
Sbjct: 601 CRTRAKSTANYDMDSEDEEWLSKFNDELIATDKQHECLSGDSFELMIDAFEKELFCNPDA 660
Query: 661 FPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMR 720
F DEKAP ++ LG+R VES+FTYW +KRRQ+KS L+RVFQAHQ+KRKP VVPKPIMR
Sbjct: 661 FSDEKAPTDMFMLLGSRSTVESLFTYWTRKRRQRKSCLIRVFQAHQSKRKPPVVPKPIMR 720
Query: 721 RRRSFKRKPSQIG--RATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKR 780
R+RS KR+PSQ G RAT S++L+AIV RRDA E+QNA+ KY+EAKA AE+ E AV KR
Sbjct: 721 RKRSIKRQPSQSGSGRATQSSILKAIVSRRDAVEEQNAVQKYEEAKAAAERCMESAVSKR 780
Query: 781 QRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
QRAQLLLENADLAAYKA++ALRIAEAIQASE LP+A A AAAAA+ FLE
Sbjct: 781 QRAQLLLENADLAAYKAVVALRIAEAIQASE-LPEAAA---AAAAAACFLE 822
BLAST of MC04g1434 vs. ExPASy TrEMBL
Match:
A0A6J1JI48 (Enhancer of polycomb-like protein OS=Cucurbita maxima OX=3661 GN=LOC111487183 PE=3 SV=1)
HSP 1 Score: 1142 bits (2954), Expect = 0.0
Identity = 615/839 (73.30%), Positives = 687/839 (81.88%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MP+VGMRRTRV G+ KGVDG RVLRSGRRL ES E KLKK+KD SDW+PV++ RGNGGG
Sbjct: 1 MPTVGMRRTRVIGL-KGVDGGRVLRSGRRLCIESVEAKLKKTKDVSDWFPVVDKRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEE-DDTCVADVPKPVKVVARNGGDGESGVVDRM 120
SGQV HGKW +RNV P RVVVV+I EE DD CVA+VPKPVKV+AR GDGE G VDRM
Sbjct: 61 SGQVSFHGKWQGIRNVKPNRVVVVNIREEEDDACVAEVPKPVKVLARINGDGEFGYVDRM 120
Query: 121 FGKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGR 180
FG+VYRRKRKRGLSEN +FDE DRMFGLRFIRRQR+RK +WEPTAGG
Sbjct: 121 FGEVYRRKRKRGLSENGDVFDEM--------DRMFGLRFIRRQRSRKNTVEHWEPTAGGH 180
Query: 181 F-RVHFSRQST------LQLTRDRVLTIFSGSSHDG-GCFSDFMLSILRHLKSPLLRVAK 240
++HF +QS L RDRVLTIFSGS DG GCFSDFMLS+LR+LKSP L +AK
Sbjct: 181 SAKLHFHKQSISPRPRPLPPPRDRVLTIFSGSDLDGVGCFSDFMLSVLRYLKSPELGMAK 240
Query: 241 LSEFLLSDPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYL 300
S FLLS+PI VFASKG+RFLQ YPP GSSG+CVIFGA QS PMFHLDFSA+PL FM+L
Sbjct: 241 FSAFLLSNPIHDVFASKGMRFLQSYPPIGSSGMCVIFGAVQSIPMFHLDFSAVPLCFMHL 300
Query: 301 HSRMFLRATWIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDH 360
HS M R TWIQARLVYNN QLDVDMSSD+EEDS EE V SPPG SSLECK++ VDH
Sbjct: 301 HSLMLFRVTWIQARLVYNNNQLDVDMSSDNEEDSNEEYLVSSPPG-SSLECKSMVVGVDH 360
Query: 361 PKSRPILHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASESV- 420
KSR I HPSVRASRLG R +QYRN FS RG+RKRRSS MRRPR+ SLAAMQKA S+
Sbjct: 361 TKSRSISHPSVRASRLGSRTLQYRNGFSFRGIRKRRSSRGMRRPRSHSLAAMQKAIGSLG 420
Query: 421 -HDTKRSVSFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESD 480
D KRSVSF SAAS ++K LA RDSAGRIRE SSTALRSAM V SCCNANILIVESD
Sbjct: 421 ADDMKRSVSFPSAASCIRHKNLARRDSAGRIREESSTALRSAMDVSSSCCNANILIVESD 480
Query: 481 KCLREEGANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGW 540
KCLREEGA+IVLEFSASCEWLLVVKKDGS RYT KA+KVMKP+SCNRFTHAI+WS+D+GW
Sbjct: 481 KCLREEGASIVLEFSASCEWLLVVKKDGSTRYTFKADKVMKPASCNRFTHAILWSSDNGW 540
Query: 541 KLEFPNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYIS 600
KLEFPNRRDWFIFKDLYK+CSDRNIPCS AKAIPVP V EVPGYVDSS SF+RPDMYIS
Sbjct: 541 KLEFPNRRDWFIFKDLYKECSDRNIPCSAAKAIPVPIVSEVPGYVDSSGVSFRRPDMYIS 600
Query: 601 IHDDEVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGL 660
++DDEV RA+AK+TANYDMDSDDEEWLSK NDEL+ TD H EC+SVDNFELM+DAFEKG
Sbjct: 601 VNDDEVCRAMAKSTANYDMDSDDEEWLSKFNDELVATDNHHECVSVDNFELMVDAFEKGF 660
Query: 661 YCNPDAFPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVV 720
+ NPDAF +E+APA+ICTHLG++ +VES+F YW KKR+Q+KSSL+RVFQAHQAKRKP V+
Sbjct: 661 FNNPDAFSNEEAPADICTHLGSQSIVESLFAYWTKKRKQRKSSLIRVFQAHQAKRKPPVI 720
Query: 721 PKPIMRRRRSFKRKPSQIG---RATHSNLLEAIVLRRDA-EEDQNAMAKYQEAKATAEKA 780
PK IMRRRRSFKR+PSQ G RAT S++LE I RRDA E QN + KY+E KA A++
Sbjct: 721 PKHIMRRRRSFKRQPSQSGCGGRATQSSILEDIFSRRDAMEHHQNGVQKYEEVKAAADRC 780
Query: 781 FECAVGKRQRAQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
E AV KRQRAQLLL+NADLA YKAM ALRIAEAIQASE+L A + AAA AS +LE
Sbjct: 781 VETAVSKRQRAQLLLQNADLATYKAMTALRIAEAIQASELLEAEAAAAAAAATASCYLE 829
BLAST of MC04g1434 vs. ExPASy TrEMBL
Match:
A0A0A0K9C9 (Enhancer of polycomb-like protein OS=Cucumis sativus OX=3659 GN=Csa_6G045070 PE=3 SV=1)
HSP 1 Score: 1141 bits (2952), Expect = 0.0
Identity = 607/829 (73.22%), Positives = 682/829 (82.27%), Query Frame = 0
Query: 1 MPSVGMRRTRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDASDWYPVIENRGNGGG 60
MPS GMRRTRVFG+VKG DGARVLRSGRRLWPESGEVKLKKSKDASDWYP+I+ RGNGGG
Sbjct: 1 MPS-GMRRTRVFGLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGGG 60
Query: 61 SGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVDRMF 120
SG RLHGKWTQVRNV PKRVVVV+I E+DD CV VP+PVKV R G D +S VDRMF
Sbjct: 61 SGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRMF 120
Query: 121 GKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAGGRF 180
GKVY RKRKRG E+ +FDE E +N L+GDRMFGLRFIRRQR+RKT+ +WE TAGGR
Sbjct: 121 GKVYSRKRKRGRLEDGEVFDEMESDNVLSGDRMFGLRFIRRQRSRKTDVEHWESTAGGRT 180
Query: 181 -RVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLLSD 240
+HF RQ L RD LTIF+GSS DGGCFSDF+L++LRH KSP L VAK S FLLS+
Sbjct: 181 SNLHFHRQRILH-PRDCALTIFAGSSVDGGCFSDFILTVLRHFKSPGLSVAKFSAFLLSN 240
Query: 241 PICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFLRA 300
PI VFA KG+RFLQGYPPTG G+ IFG+ QS PMFHLDFSAIPL FM+L+S MFLR
Sbjct: 241 PINEVFALKGMRFLQGYPPTGCCGMFAIFGSRQSIPMFHLDFSAIPLPFMFLYSEMFLRV 300
Query: 301 TWIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPILH 360
T IQARLVYNN QLDVD+SSDSEEDS+EE HVPSP SSLE K + F D PK+R + H
Sbjct: 301 TRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPV--SSLERKPMAFLFDRPKTRSVSH 360
Query: 361 PSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQKASE--SVHDTKRSV 420
PSVRA+RLG R MQYRN FSSRG+RKRRSSLR+RRPR+ SLAAMQK+ +V D K V
Sbjct: 361 PSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLAAMQKSIGPLAVDDVKLGV 420
Query: 421 SFSSAASYNKNKYLALRDSAGRIREGSSTALRSAMVVDPSCCNANILIVESDKCLREEGA 480
SF S AS N++K A+RDSAGRIRE +STAL SAM VD SCC ANILIVE+DKCLREEGA
Sbjct: 421 SFPSGASCNRHKSSAVRDSAGRIRETNSTALGSAMDVDSSCCKANILIVEADKCLREEGA 480
Query: 481 NIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLEFPNRR 540
NIVLEFSASCEWLLVVKKDGS RYTHKAE+VMKPSSCNRFTHAI+WS D+GWKLEFPNRR
Sbjct: 481 NIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPSSCNRFTHAILWSIDNGWKLEFPNRR 540
Query: 541 DWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVDSSCASFQRPDMYISIHDDEVFR 600
DWFIFKDLYK+CSDRNIPC AKAIPVPRV EVP YVDSS ASFQRPD YIS++DDEV R
Sbjct: 541 DWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGASFQRPDTYISVNDDEVCR 600
Query: 601 ALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKGLYCNPDAFP 660
A+ K+TANYDMDS+DEEWL + ND LI TDKHQEC S DNFE M+DAFEKG YCNPDAF
Sbjct: 601 AMTKSTANYDMDSEDEEWLIEFNDGLIATDKHQECFSEDNFESMVDAFEKGFYCNPDAFS 660
Query: 661 DEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKPLVVPKPIMRRR 720
DEKAPA+ICT L + +VES++TYW KKR+Q+KSSL+RVFQA+Q+KRKP +VPKP+MRR+
Sbjct: 661 DEKAPADICTPLASPSIVESLYTYWTKKRKQRKSSLIRVFQAYQSKRKPPLVPKPMMRRK 720
Query: 721 RSFKRKPSQIG--RATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKAFECAVGKRQR 780
RS KR+PSQ G R ++LEAI+ RRDA EDQNAM KY+E+KA EK E AV KRQR
Sbjct: 721 RSLKRQPSQSGSGRTPQPSILEAILWRRDAVEDQNAMQKYEESKAAVEKCIENAVSKRQR 780
Query: 781 AQLLLENADLAAYKAMMALRIAEAIQASEILPDAEAESEAAAAASFFLE 824
AQLLLENADLA YKAM ALRIAEAI+ S+ P+A AA AA+ FLE
Sbjct: 781 AQLLLENADLAVYKAMSALRIAEAIETSDS-PEA-----AATAAACFLE 819
BLAST of MC04g1434 vs. TAIR 10
Match:
AT5G04670.1 (Enhancer of polycomb-like transcription factor protein)
HSP 1 Score: 479.6 bits (1233), Expect = 5.2e-135
Identity = 329/819 (40.17%), Positives = 470/819 (57.39%), Query Frame = 0
Query: 1 MPSVGMRR-TRVFGVVKGVDGARVLRSGRRLWPESGEVKLKKSKDA--SDWYPVIENRGN 60
MPSVGMRR TRVFGVVK DGARVLRSGRR+WP GE K++++ D D V++N+
Sbjct: 1 MPSVGMRRTTRVFGVVKAADGARVLRSGRRIWPNVGEPKVRRAHDVVDRDCDSVLKNQNK 60
Query: 61 GGGSGQVRLHGKWTQVRNVNPKRVVVVDIHEEDDTCVADVPKPVKVVARNGGDGESGVVD 120
G+ ++ + + +PK+V E + V D P + RN G G+ VD
Sbjct: 61 SKGN---KVSSGKSNSQPCSPKQV-----SSEKEDKVDDFPVTKRRKVRNEGVGDEKTVD 120
Query: 121 RMFGKVYRRKRKRGLSENRVIFDETEGENGLAGDRMFGLRFIRRQRTRKTNDGNWEPTAG 180
+MFG VY RKRKR L E + + + + L+F RR+R
Sbjct: 121 KMFGIVYSRKRKR-LCE--------PSSSDRSEEPLRSLKFYRRRR-------------- 180
Query: 181 GRFRVHFSRQSTLQLTRDRVLTIFSGSSHDGGCFSDFMLSILRHLKSPLLRVAKLSEFLL 240
++ S L LT D S D + F L+ +R+++ LR++ L+ F L
Sbjct: 181 ---KLSQRVSSVLTLTVD-------WSCEDCWFLTVFGLA-MRYIRREELRLSSLASFFL 240
Query: 241 SDPICGVFASKGIRFLQGYPPTGSSGICVIFGATQSTPMFHLDFSAIPLYFMYLHSRMFL 300
S PI VFA G+RFL P S G+C FGA P+F DF+ IP +FM +H +F+
Sbjct: 241 SQPINQVFADHGVRFLV-RSPLSSRGVCKFFGAMSCLPLFSADFAVIPRWFMDMHFTLFV 300
Query: 301 RATWIQARLVYNNKQLDVDMSSDSEEDSIEEQHVPSPPGRSSLECKTVTFAVDHPKSRPI 360
R + + K L + + E DS E +P P C P++ +
Sbjct: 301 RV--LPRSFFFVEKSLYLLNNPIEESDSESELALPEP-------CT--------PRNGVV 360
Query: 361 --LHPSVRASRLGGRNMQYRNSFSSRGVRKRRSSLRMRRPRNSSLAAMQ-KASESVHDTK 420
LHPSVRAS+L G N QYR + S +KRRSSLR RR RN S A + V D
Sbjct: 361 VGLHPSVRASKLTGGNAQYRGNLGSHSFQKRRSSLRRRRARNLSHNAHKLNNGTPVFDIS 420
Query: 421 RSVSFSSAASYNKNKYLALRDSAGRIREGSS--TALRSAMVVDPSCCNANILIVESDKCL 480
S +AA +K ++ ++ + G S ++ +D CC+ANIL++ SD+C
Sbjct: 421 GSRKNRTAAVSSKKLRSSVLSNSSPVSNGISIIPMTKTKEELDSICCSANILMIHSDRCT 480
Query: 481 REEGANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWKLE 540
REEG +++LE S+S EW LV+KKDG+ RY+H A++ M+P S NR THA +W WKLE
Sbjct: 481 REEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTMRPFSSNRITHATVWMGGDNWKLE 540
Query: 541 FPNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVPRVCEVPGYVD--SSCASFQRPDM-YIS 600
F +R+DW FKD+YK+C +RN+ + K IP+P V EV GY + + SF RP + YIS
Sbjct: 541 FCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVREVCGYAEYIDNFPSFSRPPVSYIS 600
Query: 601 IHDDEVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQEC-ISVDNFELMIDAFEKG 660
+++DEV RA+A++ A YDMDS+DEEWL + N +++ + Q + + FELMID FEK
Sbjct: 601 VNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNEEDDQYLQLQREAFELMIDGFEKY 660
Query: 661 LYCNP-DAFPDEKAPA-EICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQAHQAKRKP 720
+ +P D DEKA ++LG +++VE+V YW+KKR+Q+K+ L+R+FQ HQ K+
Sbjct: 661 HFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWLKKRKQRKAPLLRIFQGHQVKKTQ 720
Query: 721 LVVPKPIMRRRRSFKRKPSQI-GRATHSNLLEAIVLRRDAEEDQNAMAKYQEAKATAEKA 780
L + KP+ R+RRSFKR+ SQ+ G+A ++ V + EE+ + + + +EAK A+K
Sbjct: 721 L-LSKPVFRKRRSFKRQGSQLHGKAKQTSPWMVAVKAAEPEEEDDIL-RMEEAKVLADKT 757
Query: 781 FECAVGKRQRAQLLLENADLAAYKAMMALRIAEAIQASE 805
E A+ KR+RAQ+L ENADLA YKAM ALRIAEAI+ +E
Sbjct: 781 METAIAKRRRAQILAENADLAVYKAMRALRIAEAIKEAE 757
BLAST of MC04g1434 vs. TAIR 10
Match:
AT4G32620.1 (Enhancer of polycomb-like transcription factor protein)
HSP 1 Score: 150.6 bits (379), Expect = 5.5e-36
Identity = 91/290 (31.38%), Positives = 150/290 (51.72%), Query Frame = 0
Query: 413 TKRSVSFSSAASYNKNKYLALRDSAG-RIREGSSTALRSAMV-VDPSCCNANILIVESDK 472
T+ S S S S ++NK L+ RIR ++ + ++ S C+AN+L+ D+
Sbjct: 991 TQVSYSLPSGGSDSRNKGSLLKGMPNKRIRRSTADVTKGIQKDLESSLCDANVLVTLGDR 1050
Query: 473 CLREEGANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWK 532
RE GA I LE + EW L VK G+ +Y+H+A + ++P S NRFTHA++W W
Sbjct: 1051 GWREYGAQIFLEPFDNNEWRLAVKISGTTKYSHRAHQFLQPGSVNRFTHAMMWKGGKDWT 1110
Query: 533 LEFPNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVP--RVCEVPGYVDSSCASFQRPDMYI 592
LEFP+R WF+FK+++++C +RN + + IP+P R+ E + + + Y
Sbjct: 1111 LEFPDRGQWFLFKEMHEECYNRNTRAALVRNIPIPGIRMIERDNFDGTETEFIRSSSKYF 1170
Query: 593 SIHDDEVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKG 652
+ +V AL + YDMDSDDE+ L + + + I+ D FE +D FEK
Sbjct: 1171 RQTETDVEMALDPSRVMYDMDSDDEQCLLRIRECSSAENSGSCEITEDMFEKAMDMFEKA 1230
Query: 653 LYCNPDAFPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQ 699
+ E+ +G+ + +E+++ W KR++K L+R Q
Sbjct: 1231 SFVKQRDNFTLIEIQELTAGVGSLEAMETIYELWRTKRQRKGMPLIRHLQ 1280
BLAST of MC04g1434 vs. TAIR 10
Match:
AT4G32620.2 (Enhancer of polycomb-like transcription factor protein)
HSP 1 Score: 150.6 bits (379), Expect = 5.5e-36
Identity = 91/290 (31.38%), Positives = 150/290 (51.72%), Query Frame = 0
Query: 413 TKRSVSFSSAASYNKNKYLALRDSAG-RIREGSSTALRSAMV-VDPSCCNANILIVESDK 472
T+ S S S S ++NK L+ RIR ++ + ++ S C+AN+L+ D+
Sbjct: 991 TQVSYSLPSGGSDSRNKGSLLKGMPNKRIRRSTADVTKGIQKDLESSLCDANVLVTLGDR 1050
Query: 473 CLREEGANIVLEFSASCEWLLVVKKDGSARYTHKAEKVMKPSSCNRFTHAIIWSTDSGWK 532
RE GA I LE + EW L VK G+ +Y+H+A + ++P S NRFTHA++W W
Sbjct: 1051 GWREYGAQIFLEPFDNNEWRLAVKISGTTKYSHRAHQFLQPGSVNRFTHAMMWKGGKDWT 1110
Query: 533 LEFPNRRDWFIFKDLYKKCSDRNIPCSTAKAIPVP--RVCEVPGYVDSSCASFQRPDMYI 592
LEFP+R WF+FK+++++C +RN + + IP+P R+ E + + + Y
Sbjct: 1111 LEFPDRGQWFLFKEMHEECYNRNTRAALVRNIPIPGIRMIERDNFDGTETEFIRSSSKYF 1170
Query: 593 SIHDDEVFRALAKTTANYDMDSDDEEWLSKSNDELIVTDKHQECISVDNFELMIDAFEKG 652
+ +V AL + YDMDSDDE+ L + + + I+ D FE +D FEK
Sbjct: 1171 RQTETDVEMALDPSRVMYDMDSDDEQCLLRIRECSSAENSGSCEITEDMFEKAMDMFEKA 1230
Query: 653 LYCNPDAFPDEKAPAEICTHLGNRQMVESVFTYWMKKRRQKKSSLVRVFQ 699
+ E+ +G+ + +E+++ W KR++K L+R Q
Sbjct: 1231 SFVKQRDNFTLIEIQELTAGVGSLEAMETIYELWRTKRQRKGMPLIRHLQ 1280
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022133811.1 | 0.0 | 99.76 | uncharacterized protein LOC111006281 [Momordica charantia]
XP_022992589.1 | 0.0 | 74.21 | uncharacterized protein LOC111488892 [Cucurbita maxima]
XP_023550905.1 | 0.0 | 74.73 | uncharacterized protein LOC111808903 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022939906.1 | 0.0 | 74.25 | uncharacterized protein LOC111445630 isoform X1 [Cucurbita moschata]
XP_023516479.1 | 0.0 | 73.29 | uncharacterized protein LOC111780334 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity | Description
A0A6J1BW77 | 0.0 | 99.76 | Enhancer of polycomb-like protein OS=Momordica charantia OX=3673 GN=LOC111006281...
A0A6J1JXY7 | 0.0 | 74.21 | Enhancer of polycomb-like protein OS=Cucurbita maxima OX=3661 GN=LOC111488892 PE...
A0A6J1FH41 | 0.0 | 74.25 | Enhancer of polycomb-like protein OS=Cucurbita moschata OX=3662 GN=LOC111445630 ...
A0A6J1JI48 | 0.0 | 73.30 | Enhancer of polycomb-like protein OS=Cucurbita maxima OX=3661 GN=LOC111487183 PE...
A0A0A0K9C9 | 0.0 | 73.22 | Enhancer of polycomb-like protein OS=Cucumis sativus OX=3659 GN=Csa_6G045070 PE=...
Match Name | E-value | Identity | Description
AT5G04670.1 | 5.2e-135 | 40.17 | Enhancer of polycomb-like transcription factor protein
AT4G32620.1 | 5.5e-36 | 31.38 | Enhancer of polycomb-like transcription factor protein
AT4G32620.2 | 5.5e-36 | 31.38 | Enhancer of polycomb-like transcription factor protein
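The Identity column above repeats the percentage printed in each HSP header, which is the number of identical positions divided by the alignment length (for example 617/831 for XP_022939906.1). The Python sketch below shows one way to recompute both percentages from a header line. It assumes the line layout seen on this page; the function name `hsp_percentages` and the regular expression are illustrative and are not part of any BLAST tool.

```python
import re

# Assumed line layout, copied from the HSP headers on this page, e.g.
# "Identity = 617/831 (74.25%), Positives = 695/831 (83.63%), Query Frame = 0".
# The pattern also accepts the "Postives" spelling that some exports emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positive, pos_len = map(int, match.groups())
    # Both values are simple ratios over the alignment length, rounded
    # to two decimals as in the page output.
    return (
        round(100.0 * identical / ident_len, 2),
        round(100.0 * positive / pos_len, 2),
    )


if __name__ == "__main__":
    example = (
        "Identity = 617/831 (74.25%), Positives = 695/831 (83.63%), "
        "Query Frame = 0"
    )
    print(hsp_percentages(example))
```

Recomputing from the raw counts is a quick consistency check when these lines are copied into other reports, since the counts and the printed percentage can drift apart during manual editing.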