Sgr016823 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr016823
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Protein phosphatase methylesterase 1
Location: tig00153010: 1398432 .. 1419244 (+)
RNA-Seq Expression: Sgr016823
Synteny: Sgr016823
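
The Location field packs the contig name, start and end coordinates, and strand into a single string. As a minimal sketch, the span of the locus can be recovered as follows. The `parse_location` helper and its regular expression are illustrative and not part of the database, and the coordinates are assumed to be 1-based and inclusive, as is usual for GFF-style annotations.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

contig, start, end, strand = parse_location("tig00153010: 1398432 .. 1419244 (+)")

# Assumption: both endpoints are part of the feature (inclusive coordinates).
span = end - start + 1
print(contig, strand, span)
```

Under that assumption the locus spans 20,813 bp on the forward strand of tig00153010.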
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTAATTGAAGCTGTGAAGTATAAGATGCCTCGGTAATAGCCAAGAGTTTTGGTTTTATTAGTTGCGGTAGCAAATGGAAGTTGGGGGGAAAATAATGCATTGCTAATTGCCCAGGAGTCTTTACTCTGAGAAGGTTGGCTTAATGTAAAAAAAGGTTCTAGCTGGGGTTGACAAAATATTATGTGGTCTTTGGATAATTAGTTATTGTTTGAACTATTGGTTGATCAAATCTTATCATGGCATAAGGGTCATATTGGGTAACTAGGGTATTTTGGTATGTTGGTCTGACCTGGGCGTATTTGGAATCCGACCCAAGGTGAACATGTTGGTTTGACCTTAGCGTATTTGGAATCCAACCCAAGGTGACCATGTTTGAGGTAGTAACTGGATGAGTCACTGATTGTTTTTCAACTTTTACTGAGTTTGATATACTTACCAAAGCTTTATTGGGGGAGCTAGTCCTTGGGCTGTCACACATGTCAGCCCATTTTGACTTATTTGAATTATTTGTTCCTCTCATTTTCCTGTAAGAGTTTAGGGGAAAGAAAGTCAGGGATCCGATTATTATCTCCTTTGAGATACTCAATTTCAAAATCAAACATACTTAATAAAGATTGCCATCTAGCAAAAATTTGTTTAGATGCAATATTTTTTACATTCTTTTCTATCACATATTTTATCGATTTGCAACCTATCCTTAAAAGGAACTTTTGATTTAAAAGATCTTGTTCAAATTTTTGAACGCAAATGACTATGGCTAGAATTTCTTTTTGATTATGTTATAGCTAACTTGAGTTTTATTCCTTACTCCAGAGGTAAATCTCTTACTGTTTGTTTTTTATTACCACATCTCTGTAGCAATATCTCTCCATAGCCCATTTCTGAAGCATCAGTTTGAACAATCTTATAAGCTTCAAGGTTAGGAATGTTGAGACATGGTAATGCTTTCACATCTATATATATATTTTCTTGACTAATTCAATATGTTGATTTGCAACTAGGAGGATTTTTTCTTTAGTCCATCATAGAATGGCTTGTATTCAACTCTAAGATACTATAAAAATTAGAGATATGATTCAGGCTCCTAAGGAACCTTTGCAATTGTGCCCTATCTGTTATTTCATCTGGGAATTTATCGGCAAATTCAATAATCGGTCCTATTGGGTTAATTATTGACTAGTAGATTTCGAACCCTAGAAACCTGATCTTTGTTTGGAATAATTTTATCTTGGAGAGACAGGACTGAGCCACTTTCACTTATGACTTCGACAAACATTGCCAGTGTTCATCTATTGTTTTGAAAATATTAGTACATCATCAATATAAACGATTAAAAATTTAAGTTAAAAATGGCATTCATAATATTTTAGAATTCACAGGAGCATTCTTCAATCTAAATGACATTGCATTCCACTCATAATGTCCAAAAGGGGATACAAAAGATGTTTTGTAATTGTCTAATTTAGCTATGTATTTGCCAAAATTTAGAAAATTATTGCTTCGCTGAGCTTATTAATTAAGTCTCTTTTGTTCCGAATTGAATATCTAATTCATTCTAATATGTTATATATATAAAGGAGAATGTTGGAAAAATATTTTTGGACTAATATGTTCCTTCTTTAATTTCTAAATTTATAAAGTATGTCATTGACAATCAATTAAAAATAGTAAATTAATTAATTATCAATAACAAATAATAATAAATGTATAGAATCTAAAAGGTTGGAATACTAAAGATGCATTCGGTTGGTGATCTAAAAGAAAAAATTTGAATACAAAAAATTTAATTGAAAATAAAGTTGTATTTCATGTTTTCACTGTTTTCGAATATATATGTGGTAGCAGAATTAGCATATGAAAATACTAAGAAAATATATATACGTTGTTTTCAAGATTTTCACTTTTTGGGTCACAGTATCAAAATACAATCCAAATATTGTTTTTAGAAATAAACCTACCAAACAGAAATTCACTAAATTCATTTGATTGAGTGAATC
TGAAAACAAGAAATTAAATCTAGATAAGGTATCAAACATGCTCTAAAAGTTTGTCATTTCATTTTCCAAGAATTTAAAATTTTTTTTAATTATAATTAATAATGGAATTTAAAATGTTTTGTAACTTTTTCATCATTTGTTGCTATAAAAATAGACTTTGTTTCACTATTTTTTCACCAACACTTGAGCTCTATTTGTTCTTTGCTTTTAATCGTCTCTCAATTTATTGCTCTCAAAGCTTCTTCAATCTTGAAATTAGGTCAAGAAGTAATGTGGTACATTCAAGTTGTAAGGATGCATGTGTCGAATTTATATTCCAAATCCTAAAAGCGTGGACCATTATATAATTTCACTTGATGAAGTTGAATAATAGTATTTGTCTAGCAATTATAAGGTGAGGCGTGTTGGGATTTATGCCCTAAAGCCTCGTAATCTAAATAAAAATACCTGTTTAATTATATAAATTATTTATTTTGTGCAATATATTTTTATTATCCATTTTATGAAATCTCAGGTTACTAATGTAATTTTAAACAGTATGTAGTTGACATATGAGAGGATCATGTTCAATAAGTAACCTAAAGAGTCTACAGTATATGGATAAGGTTGGAAGCCCTATCCTGGTAACACTATTGATACGGCCCACTTTGTAATAGTTAAATGAGTTGATCCTAATCATTCATGTGGAGACATGCGAGTGAGGGTATCATATACAATGAGTTTGTATAAGACTGGACGGTGAAATATTTAATCTTTCTTTGTAAATCCGTTGAAAGAAAAGATAAATATTTCACATGATGATCATATGCAACTCGATCTCAATCCTGAGTAAATTATGAACTCCTGCCTGTGAAGGCATGTCCTTTGATTTACATGGATGAGAATGGCATGTATAGCTCATTCAATATGCCTACCATTTTGGGGACATGACCAAATAGAGAGCTGGGGACATAATCTCACAAGATGGAATTTACTCCTTCCCATATAGGGTAAGTAAATGAGTAGTTCTCTTAAGTGCTGATTTTGGGACTTGAATAATGGGCCATACTTTCTCACTGGCCTGAGAGGGACTTAGTTTATGGTTGGATCATAAACAGATTGTTCATTAGAGAATCAATGGTACTTAAGGTATAAGATGTAATTATAGGGGTAAAACGAACTTTTGACCTAACTGTAATTATGAACAACTCGTGAAGGGTCGACTTGCATGTAGTGATTATATCCATGAACATAACTTGTCCTAAAGTGCATAATAGTGCAACTTTGGGTTTATAGTAGAGTAACCTTGAGTTAATGAATGGAAATTAATTAATTAATGAGTTTAATTAATTAATTTTGTATCATTGGAGCTTATAATCTATAGGTCCATAAGGTCTCCATGCTAGCTCACAAAATCATTGATGTATCAATCATGTTGAAAAAAAAAATTGAAATGTTTAAATTTACATTTAACGTATTGGATATGTTAAATATTCAAGAAATTAATATTTAAATGTATTAGATACATTAAATAAAAGAGGAAAATTAAATATTTTATTAATATATTAGATATATAATAAAATATTAATTTGAGTATGACTCAAATAATTAACATTTGAATATATAAAATATATTAAAAGAGAAATTTAGAAATTTGAGTTGGACTCAAATTGTGCCCTCATGATCTTTATAAATAGGATCCTCGGGCTATACCTAACGTATCAAACTCTCATCAATTAGAAATAAATAAATTTCTCTCTAAATTTTTCCTTCCCTTTGGGGTGAACATCAAGAAGGAAGAAGAGGCTTCCAAGTTCAGGTTTCTTTCAAGGAGGCTCCCACCAGCCTTGATCTTATGATTGGAGTCATAGCAAGAAAGATCTAATGGAGATCCACATGGGTGTGAATACAAGAAGAAGTAATTCTTCAAAGGTATGATATTTCTTAATTTAATTCTAAATAAATTCAATGCCGGTCCAATAATTGATTTCTAAATTGTTTAGAAATCAACATCTTCCGCT
GCGGGATGATGTGTTCATTCCTAGCAAGGCGAACCTATTTGAAGAAATGTGTGAATAGTTAGGATTTGTTTTTTTTTTCTTCTTAGTATTGTTTTTTTATTGTTCTTTTATAATTAGTTTTATTATTTTTTTAGTTAAAATGTTTGTGCATGCTTTAAGACCCTGTTTACTTCAAGGGAATCAGAATGGAGGAATGAAAATGAAAATTTTCATTTCTTTTAAAATGCAGGAATAGAATTAATTGGTGAGACCCATCACTAATTTGGGGAACCACTCCCCCAAAAAAAGGGTAATGTCATTCCTCTCCTACAACCTCATCAATCTCATTCTTTATTTACAATATTTACTATCTCTCTTATTTATACTACAAATAATTTTTCTTTTAATAAAAAAAAATCATACTAATTTTTTCATTAGTACATGTGAATTTTTATAAAATTATGAATTTCTATTTAATTTGTTAATTATTTAATTTTAATTAGTCATTGTTTATTATTTAATTTGTTAAAATAGAATATTGTTATTATATAATGTATACAAATAAAATTGATCTATACTTTCCACTAATAAAAAACTATAGTAACCTTTTGTTAGTTTATGTGATTTTATTACAAAATATTACAAATCTCTATTTAATTTTACTTTGTTATTATTAATTTTTATCAATTAAATTGTTAAAATATAATTCATTATTGTCAAATATCATAACAATAAAATTTAATAATATTTATATCAATAATTGTAATTCTAATAAAAAACTATATTAATCTTTTTTTAGTTGATGTAATTAGATTTTAATTGATTATTATTTCTTAATATCAAATAAAATGTTAAAAATATAATTCATTATTATTAAATATCATAACAATAAAATTTAATAAAATTTATATTATTTTATTAAGGTTTAAATATCATTTTAGTCCTTGAACTTTCAAGTTTTTTCCATTTTGGTCCCTGAACTTTCAAAATGTCCATTTTGGTCCCTGAACTTTCAAGTTTGTTCTATTTTGGTCCCTGAACTTTCCAAAAATAACCATTTTGGTCCTTAATATATTATTTTGATTCCTAATATCTACTTGATTAATACTTACCTCCAACCTATATGACTTGAATACATGCTTACTTTAACATGCTAATACATATACCCAAGGCTATGTCATTAATTTGTTAAAAATAAAATAAAGGTAATGACCAAAATGGTTATTTTTGGAAAGTTCAGGGACCAAAATAGAACAAACTTGAAAGTTCAGGGACCAAAATGATATTTAAACCTTTTATTAAGTAACTATTATTTGTTAATATCAAATAAATGGTTAAAATAATAAGATATTACTATCAAATAAAATAGTAATAAAATAAATTAAAAATTATTATTGGTTATTGTTATTTTAATTAAAAATTTTATTAACATAGTATTTAGTTAAATGTATTTTTAGTTGGCTAGAAATATAAAGGGTATTCTAGTAATTATTATATCTATGATTCTTATTTCGATCTCAGACAATTTTTAGTAAACAACATCAAAATGTTTTTTATTCTGATTCCAGACAATTTTTAGTAAACAACATCAAATGAATAACATTTCTATTCCCATTGATCTCATTCCAACTGATTTTCATTCCGATTTCTTGTCATTTTGATTCCTGGTAAGTAAACGGGACCTAAATTTGTTAACTTACTTTTCTCTCGATTCAAAGTTTATAAGAAATTTAGCATGTTTATTAAATTATTATATTGTTTTTCTAATAAAAATAATTTAAAAATAGAATGAATTGCACAAACTTCTCCTAAACTATAGTGATTGTTATAATTTTACCAAAAAAATTTAATTGAAAGAAGATATAAAAATGTTGGGTAATATAAATGTGTGTAGTATATATATATATATATATATATATGTAAATGACGGGGAAAAATCACTGCAGCGTTTCCCAGAGCCCAATGGACGATTAACAGGCTGTGGGCTGGGCTTAGCCGGACGGTGGATGGCCCAACTGTGA
GCGTCGGGCGACCAGATTTCCGACCCGGTGCCTCCCCATCATTCTGCAGAGATGCTTCATAAGTTTTCCACTCAACAATCTGTGATACATTGTACTCGCTGTACGGTTTTTGTTAAAACTAAAGTGATTCGAATTGAACTTCGTGAGTGAACAATCGATTGATCTTGAAGCCAGCAATGGAATCATCGTCTTCTCACCTTGGTTCGCTTCCCGAGGAAGTTTCAGAAGGTCAAGCACAACGACAGGAGAATCAGAAATCGGTCTTCGCCTCCAAACCCGATCGCCCTCCGACTCTGTGAGTTCTGAATTTTCTTGCTTCACCTCAACTCTAGCACTACCTTCCCACTCCATGTGCGCATATTTCATCGCCTGTTGCCGCACCATGTTTTCAGAATATCTGCATGGACTAACCTTTCGTTTCTTCTGAATTGTTTGGAACTGAATCGATTACAGAGAGAGAGAGAGAGTGATCTGTGAACATATGGAAATGCCAACACTTCCTTGATCGAAGTTCCTTCTCTTAGATTTTGGTATTAATTGGTTAATGTTGCGTAAATTCGAACTTTTTTAACTCCTTGAAACCCAAAGCATTATTTCTTTTCCTTTCTTTCTATGGCGGGAAGATAGAATCTTGGTTCTCTTCGGAGTTCGGATAGTTGATCTTGGGATAGAAATCATTCCCGGTGGGATACGTGATCGTCGCGTCGTATTCACACCGTGGTAAATGCATGGAGCATCCTTTAACTGATTTGTAGAAAGCAGCTTCTGCTTATAATATATTGGAAACAAATAACAGACTGAAATACAGCAGCAGGTTTTTTGACAAAAAAAAAAAAATCTTTGGAATGAATTTGCTAGAAAGAAGCTGAAATGTTAATTGCACCAATAAGTGTATCCGTAGATGTCTTTACTTCCTATATCATGTTTTCCGGATCCTTTTAATTCAAGATCTTAGTAATTGGTGTGTGTCAAAATTTACTAGGTACAGAACTTTTTTCATCTTTGATTTGTTTGTCTTTTACTGTGGAGAAAAAGATTCATTCTATTCTAAGGTTTCATGTACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCATTTTATGTGAATGCATGAAGTTCGTGTTGAGCTGAGTATTTTTTGGTATAATTTTGTTTTATTCTGGCACTTGGATTTAGTTTTTGATTCGATGTCTTGTCTTATAGGAATTCTTCACAGAAGTACTTGCCCCTGGATTGGCTTGGTTATTTCGACCAAGAGGATGATATTTCCATTCCGAACTCAAACGATGTTAGTATTAATTTGCATATCTCTCTCACGCACTCAAGAATGCACACACTCAGACAAACACACACACACAAACACTTGCATGCAAGTATATATGCACAAGCGGAAATGAACTTTCCAGCAATTGTTCTTTATTGCCTAATATGCATTATCCTTTTAGGTCTTCCATGTATATATGGCGGGAAAAGAAGGACCAGTTGTATTTTGTCTGCATGGAGGAGGCTATTCTGGGTGTGTGATTATTTCTCTCAAACTCCTATTTTTGCTATTCATTTTAAATTTTCTGGGGCTTTAAGTACGTGGACACCCATATTGTTTAACAGGAAATGCTACTACAGTAAGTTTGTAAGTGCCACACACTTGACTTAATCTGACCGCTACTTTTAGAAGTAGCTCGGTACCTAGCAACTGAATATTGCAAAATGGAGATATGATCCACAGCTTGTTTCCTAGATCTTCCAACTGAAGCAACGACACCATTCTAAGAAAAAATATGTTCACTTTATGGATTGTTAGAATTACATTTTAATTTTATTTTAACCATTAAATGATGTTGATAGTTTTAAGTTAACTTGAGACTTAAGAACTATGTTCTTAGGGTATGTCTGAGAGTGCCTTAATCATTTAATTTAGCCATGACAAAAGTCACTATGTTTGGTTTACAAGCATTTCTCAAAAAGTTAAAAGCACTTAGAAATGCTTTTGAAAGAAGCAAG
TAATAAGTACTTTATCAAGTGCTTTTGGCCTAAACAATTTACCCTTAATCTCTCACAAAGCACACCCACTTTTAAGAATTACTTTCCAGTGCACTTTCTCTCGCTAATTCCTTTTTCTCACACTTTCTTTCACTAGCTTCTCTCAACGGTCTAAACACTTCATCTCATTAATCTCTTTCTTCACATACTTCCTCTCACTTCTGAAACATACTTTTTTTCACTAATTTCTATGTTCACGCATACTTTCTCTCACCAATTTGTTTATACACACATTTTCTTTTATCATTTTCTCTACACAAACTTTCTATCACATTATGTGGCTAATATCTTTTCTCACGAAACTATCATGGGTTGGCCTAGTGATCATAAGGGCCAATGAAAAAAGTAAGGGCTTAGAGGGAATGGGTTCAAGTATGGTGGCCACTTACCTAGGATTTAATATCCTATAAGTTACTTTGACAACCAAATGTAGTAGGGTCAGACGGTTATCCTGTGAGATTAGTTGAAGTGCGCGTAAGTTGGCTTGGACACTCACGGATATTAAAAAAAAAAAAAAAAAAAACCGTCCTCATAGGCACACGTTCTCTATGCTCTCTCCCTTATTTTGTTTAAAAAATAATTTTAGTCATTTCCATTTTAGTAACTTTTTAGAACTAAACATTAAAGTGTTTTATTGTTAAATAGTTTTTTATTAACGTATTTATAATGGAAATATTGTTTATAAGAGCATTTCCTACTTCTCAAACACTCTTTTTCTTACTAAACTCTTTCATCACACACTTTCTCTCACCAATTTGATATATACACTCTAGTAGGACAAAAAGATTCCCGCTACAATGGAGGCTTCAAAATATGCCTCCAAACTCATTCTCACTAGCTTCTCTCTAGATACATACTTTTCTTACTAGTTTCTCTCTATACACTTTCTCTCACTAATATCTCTCTACCGGTCAACCATACACTTTCTCTCACTTATTTCATTTTTTTAAATTCTAATTGTTTTCATTGTTGTAATTTTTCAATTCTAGAAATATTTGATAAGTAAATATTGATCATTCAAAAAGAAACTCTAATTAGATCAAGAACAAGTTCAAAATAGAACTGAAACAATAAAAATGTTTTTTTATTGTTAAACAATTTTTAAAAAAAATATATATGAGTGTTCGGACTAGCTTACGTGCACCCTTGCTAAATCTCACAGGACAACTGCATGACCTTACAAGATTTAGTTGCCAAGGAACCCATAGGATTTTAAAGTCTAGGTAGGTGGCCACCATGGTTTGAACCCATTTCTTTTAAACCCTTTACTTTTTTACTATTCCCAATGATCACTAGGCCAATTATTAAGCAGTGTTTATTAACATATTTATAATGGAGATATTTTTGATAAGATCATTTCTTTTAAAAGTGCTCCCAAAACACACACTTAATATCATTGTTGAATCATAGTATGGATGTTTGTTTTATTTTATTTAGTAGTCCTATGCCCACAAACTTAGGCCCTGTAGATATTGGATGGTGAATTGGTTGTTTGCTACAGCCAAAGTCAGCCTACTTTGTTAGGGGCATAAAAAACTTGAGGATGGAATATAGGCAAAAGAATTTAAATTGTAAGTAGTACTTAAAATTAATCACAACTCACTAGATACTGTATTTTCTGATGGTTTCTGGGCCATGCTATAGTTATGATGTTTCTCCCCCCTTCCATATTTTTTTTTCTGCATTAGTTTGGCATCAGTTAGCTTTGGTCGTGATTTTTATGTTTGGGTATTCTAAGATGCATCCTAATTTTGCCGAATCTGGTTTTCTGTGTTTCAATGCATCTCTTTTCCTTTATTCCAATTATATGCATGTTTTATGTACTGTACCAGTTAAGTATGTTGATGTTAATTTTTTTTTGCTCTTTATTTACAATATTCTGTTTCACATATCTCTATAGGAAGTTTTTTTATGATAACTTTTCAAATATCAATGAAAAGTTCGCATTTCTTGTAAAAAGT
AAAAACTGTTTTACATATCTGGTAATAATTTGCATATATCTGTAGTATAATTCCAAATTTCGTTACCTGGCTAATTGTCAAAAATTATTTCATTAGGCTTTCTTTTGCTCTATCAGCTGGTAAAATAAAGGAAAAAGCTAGAGTAGTTGCAATGGACTTGAGAGGACATGGAAAATCGTCTTCTGACAATGATCTTGACTTATCAATTGAGGTACTGGTATGCTAATTTGATGATTGTTTATCTATCTGGAGTTCTTAATTTTTCTACTGTGCTGGGAACTTGGTATCTATTTATGAAGCGTTGTATTAAATTTGCAGACTATGTGCAATGATGTTTTGGCAGTTATAAGGAGCATGTATGGTGATTCACCTCCTGCAATTGTGCTTGTTGGTCACAGGTAGTGTATCTTTTAGTTAGTTAGTTACTTTTGTCCCTTTAAAAGAAAGAAAAAATAAATAAGACACAAGAAACTTTTCATTGATGAAATGACAAGTATACAAAAGACCCAACAGGCTACAGCTAATAAAAAACTGCATCATCTTGAATTGTTCTAAAAGGACCATAGTTACCAAATAAATTTGAGACAGAGCACGGAATGAAGCATGGTATCTAGCTAGCACCGATATCATTTCCCAATCTCTACTTCTTTCATTGAAAATCCTTTGGTTCTTGCCAACCAAGTCTCCCAAAGGTTAACTTTTATAGCATTGGACCAGAAGACCTTCTTTCCTTTGAAAAGGTAAAAGGTGTTCTAAACAATCTGCTAATGCATTTTAGAGAGCCCATTTAATACCAAATTCCTTGAAGAAAAGAGTTCCACAAATTTCGGCTGAAGGGGCCATGATTAACAGATGGGCTTGGTTCTCTTCTCCGCTCATACACTAAACTCATGGGATGTTCACAAACTTCACATATAATATTTCCAAAAGCACCTAGTGCAAAACTTTCTCCTGAGGTTGTAAATTATGTCTAGCTGACCTACTGCCGATAGTGATTAAGGAAAGTGTAGGTTCATCACAATTTTACATTTCATTTTTAAACAGTCTAGATGGAAAAATAGGAAAAAAAAAAGAATGGGTATATTACTTGAAATTGAATCCTCCATGTTAAAAATCAATCTTCAAATAGACATCTCATGGATCCATGACACAGTTGTATGGCTCCATGAAGAATGCAGAGTTTGGGATGTTTATCTACCGTTCTTTGTACATTGTACGATAACAATACTCAATTCATGTAATTTATTTGTTGAAGAATTTGGCAGTTGCATATTTTAATGTGCCTACTAAGTTTAATGTGAATGCTACTTTGGTGGATAGGACATATGTGCATTCCTACAGTTCTCACATCTACTGTTTCTTTTGTTAAACTAAAAAAGTCGATATAATGTTTTAATTCTATTTATAATTGTGCTATTTTGCTCCCAATTTTAATAAAATTTATATGTTCATTACAATGATTTAAAAAAGTATGGGAGGTTCAGTTGCTGTACATGTTGCTGCAAAGAAAGCATTATCTAGCCTAGCTGGTCTAGTAGTTGTGGACGTTGTGGAGGTATGTTTTTACTGTAATGTTGGTATTTCCTGAATCACAATGAGTTATGAATGGAATTTTGAAGCAATTTTCAGCCTTCCTGCGATTTTTTGGATTTAGGGAACTGCCATGGCTTCATTGATTCATATGCAAAAAATCTTATCCAGCAGGATGCAACATTTTTCTAGCATTGAGAAAGCGGTACACTTATATCCTCGGTCTAGAACATTATTTATATGTTTCTCTCTGATGTTTCTTGTTTCTCTAAGGTGGCCCTTATTTCTTAGCTTTATTTTAGTTCTTTTGGAGATTTTGCTTCTTATTCGTTCTTTTGTACCTGACTGAATGAATTTACCTTTTCTCCTTTTAAAGAATGAATGAAGTCAATTTGTTCTATTTGAACGATAATATTAATTAATGAATTAATTGGAATTAACTACGCTTAAAATTCTCATGTATCAGATTT
ACTAATGAAGGGAGTTGTAAAAAATTGTAGCAGGTAACAATTAAAGAGTTGATGAGTTGGATTGTATTTGACATAACTTTTGTATTAGTAAAAAGTGCTTTCTTTCTTCTTTGTTTTTTTTTCCTTATAGAAAAGTGCTTACTTTGAAAACACTATTTTCGAACACACTTCTACTCTCTTTTTTCTAAGTCTTTATTAAGAGTGAACCATTCAGATACATTCACAAAGTCGTCTCACCTCTGTATAGAATTTTCTAGTGTCTTATGCAATATGATTGACCCTGAAACCCTAAAAAATTTATTTATATAATATATAGAGACACGTGCAAAAATATAATAAAAGGAGAATATTGGGCAATCGTTTAGACTATGTTATCCACTCTTTTAAAGTATGAGAACTATTAGATACAACTTTGAAACTTCAATGACCTATTAGAAATTTTTTAAAATACATAGACCTATTGGATACAATCTTAAAAGTTTAAGGACCAAACTTGTAATTTGACCTAAAAGTTAAGTGTATTTTAACTTTCTAGACAATAGAAGTTCTTAACTTTCAATCAAATTCATCAAAATTTATTTTGTTAAAATGTGTAATAAAACTAAAAATTTATTTGCATCACACATGCTAATAGATTAGTGTACATTAATATAAATCATAAATGTCTATCTTTCTTTTGTATCTCTATGTGTCTGAATCTTTACTGGATTTCTTTATCTTCATCTAGTTATCTGATATTATTTATTAATTTTTTATGCAACTAATTCTTTAATAGCGGCAGTAGGATATTTCATGTTAAATAAATGTCGTCTGATAACCAATGATAACCATTCTTTCTAATCTGATGCAGCCTTTAGACCTTAAGGTGAAATAACACATTTTGATCTGCCATATTCTTCAATTAAAATGACGTATTCTCTTTGTTAACTTATTGTAGTAAAATTTGGCCATGTTAAGGAATTGAAAAATGCATGAATGTGCTTCTCTTTGTTAGCTTATTGTAGTAAAGTTTGGCCATGTTAACAAATTGAAAAATGCATGAATCTGTAGATTGAATGGAGTGTAAAAGGAGGTTCTTTGAGGAATGTTGACTCTGCCCGTGTATCAATTCCCTCTACACTTGAGTACGATGATTCAAAGAAATGGTAGGTCACTTTTTGTCAATTATACAATTCGTTGAGTAGTATACTAAAAGCTTGTTCATTTTTACCTTAATGATTAGCTGACTTCATCAACTGCTCAGATCTTCGTTTTGCATCTATGAAATATCTGCCCGTGTATGTATTTATTTCTCATAAAAAAAATATCTGAATTTATGGTGCAGCTATACTTACCGAGCAAAACTGGAAGAAACGGAACGGTATTGGAGAAGCTGGTATGAAAATGAATATCCGTTGATTACCTATTTCGCTACCACATTCTAACTAGTCCTTATATTGTTATGTAACTTTCCCACCCCCTCCCTGTTTCAAATACAAATTATAGTATCTCAGGAAAGAGAGGAGACTAGGTATCCATCCAAAGTAGCCAAGGAGATTGCAGAAACTCTCCAATTGGCTAGAATAGGAAAGAAAAAAGGAAGAGAATTAAAACTTTTTGAGTCAGATGCCCGAGAAGATGCACAAAAGCTCTCCTATTCTCATATAAATGTATTAGAAATATTATATAAATAGGCAGGACTCTTCATTCTAATTTTATAGGCTAGGTTGAAAGCGATTTATTAACTATCTTGTTGTGCTTGGATCTGTTACTTATAAATTTACCAAATTGGAGAAGCGGTTGTTGTCAGTTGTTGAAAAAAGAGCTGCAAGTTTTGGGTGCAACATGAGGCCTATTGTGCGTGCCGCATTTACTAAGACTAGGTTTTGTATAAAATTCACTGGAGCCAGTGTTTCAAAATTATATGCTCATATCCCTTATAATGGATCATTTTTATAATGATTGTCACATCCCTGATATTCTTATTTCACTATGTGCATGAAGGATATGCTGTTAACTCC
ATTGGTAAAATTATGGATGGTTAGAATGGGGTAAAAATATCTATGGTGCCTTTATGTCAAGTAGTTTATTTTTTCAGGTAAATTGTGATGACGGAGAATAATTTAGAATAGTCTGGGTTTTATAGTCTAGTTGGTTATTCTGTTTGCCTTATTTTCTTTTTCCCGTTTCATCATTTTGTAAGGCTATATAACGCTAAGAGATTAAGTGTAAGACAATTAGATTTTGATATTGAAATTTATCGGTTCTTTATCTTCATCATGGGAATAAATATCTCAGGTTGAAAGCATGCACAGGCAATCGTTTTTCAATTTCATGATCAGTATCATCTGTCTTCTCAAAAAAAAATAATAATAAAAAAAAATTTCTTCTAAAAAATGATCAGTATCATCTGAAACAATAGTGAACTAGAATTTCTCCTATGAAAATCAGAGAGTTTTCTCCTTTTTGTTCAGGCATGGAGTTCTGCATAAGAGAACTTTCATTAACCCACCCCCACCCACCCGCACACACAAAAAAAGAAGGGAAAAAAAAGAGTGCATAAGTCCTAGTTAATAACAGATTAACTGAACTCAAGGCCACTCAAATGATGAATGATAAAGTACTTCTATTTCCGATGATTGTTGTTTTAATTCTATTTCTCTAATAACTTGCAATTTATTGGTCTATCATTTGACTTTCTTTTGGTTTTTTTCTCTTCTTCTTCTTCTTTTATGCGTGTGGGTTGGGTTTTGGTTGGGGGGGGGGGGGGAATCTTTGATTATTGACATGTTGTATTTAATAAATTTGTTAATAATCTGGCAGGTATGAGGGTCTTTCAGAAAAATTTCTTTCATGTCCTGTTCCAAAGCTTCTGCTTCTAGCAGGGACTGACAGATTGGACAGGTTAGTAAGCACACCCTGTAAACCTCAAGTTTTCCATTATTAGTACTCAACCATCGTGGCATTGACTTATATATTTTGGTTTGCTTCTGAAGTTGATTTCGTTTGGAATAATAACAAATTTTACACTTGAGCAGGTTCCTTTCTCCACTCTACGGGGAATCTATCTTTCCCAGTTTTATTCTGCTTTCCTTTCCATAACTCCAAAATGAATGAATAGTTTAATATGTAATATTTAAATTTTTATTGTCATCAATTAAAATCTATACATTCATGCATAAATTGTATATCTTGCACATGAAATAGGATATTTAAACTCTGTTTTTTGGGGGGATAAGAAACAAAGCATTCACTCCACGAGAGAAAAGAATGTACAAAGAAAAGAACGTGAAGGCACAAAAAAGGCATCTAACCGGCCATCAAAACAATTAGGGTGTAATTACAAAAAAGATTAGATCTATTACACCAAAAAGATTTGTGATAAATAATAACATCCCATATTTATTCTTTAGTCTTCTCTTCATCTTGAAAGATTCTTTCTGTTGCATTCTGAAAAAACATTAATCCACAAAACCTTCTCATGTTTCTTGAATCTGTCCTGAAAGACCTTGGACCACCAAATCCTCAGAATATCTTGGTAAGGTTGAAGGTTCTAAAAACTTTGACCATACTACCGTCACGAAGGGGCAAGTACTGATAATATGCTTAGCGTCCTCCTAACTGTTCTTACAGAAGGATTTCATAGAAAATCACCAACTGCTATCCACCTTCCAAAGTCTCTTATCTTTTGAGTTACAAAGCTTCCTAGCATCAAGAAGTCTCTTTAGGTTCTCCCACTCCTCCCAACTCTATCAAAGAACGGTCTGCTGAATAACAGGCTCCAATTGCTTTTTGACTATCCCAAACATCCTTAGTCTTCATCAATATCTGAGAGGAGAGGAGAGGAGAGTGAAAAAGATCTAGTGAAGCAATCAGCCAATATGCCAAAACTGCACCAGTCGTCCAACCATAAGAAAGTTTCCTACCATCCATGATTATAATGTCGGAGTGCTGGCTGAATGGCTCATCTAGCTTGATAATATTAAACCATCCATTATTTTAATGTTGGGTGTGCAGACTA
TTGGCCCCTATGGGTAGAAGAAATTTTTGAGATTCAAATGGGTTATTTAAACTATAAAAAAATAGAAATAAGTTTTGTTAGTTCTTTTTTTTGAGATCCAAATACGTATGGATGATGAGGTGTATCAACGTAAATATATATTTGGAAGAAATTTGAAGGGGGAATAAGCTGTAGAGGGAACTATCCGTTACCCGGGGAATCAGGGTGAGGATGAGCAGGCACTGGATGGAAGTTGCATGCTTGACCTTACTTGATCCATAATTTCAGTTTTAATGATTAGTTGCATAAACTATAATGGGTTGGCCTAGTGGTCATTGAGGCCAATGAAAAAAGTAAAGAACTTAGAGAGAATAGGTTCAAACCATGGTGGCCACCTACCTAGAATTTAATATCCTACAAGTTACCTTGACAACCGAATATAGTAGAGTCAAACGGTTGTCCCGTGAGATTTAGTTGAGGTGCGCATAAGCTAGCCCAAACGCTCACGGATATTTAGAAAAAAAAAGACCAGTTGCATAATGCAGTCTTACATGTGCAGATCCTCTGCAATAATGCACTTATTTTTTATAGTAGAAGAGATTAAGATCCAAGATGCATATTTCATGTACGAGCCCGTGTAGTTAAATGTATATTGCTTGTGCTGAATTCCGGTGGTGGTTCTTGGAGATCTCAGTAATTTAGACCTTCATCCTCAAAATATTTATTAATAACAATGTGAATGTCCATTTTATTTTGTGCTTTTTTGTTTTAATTCCTTTCACTTGAGATGCGTTCTGTAGAAGAAAATACTGAAATTTTTAAAGATGCCCTCAAAAACTGTTTCATGTATTATGACAGAACACTTACAATTGGTCAAATGCAAGGGAAGTTTCAGATGGTGGTAGTCAGACATACTGGGCATGCTATTCAGGTTCCTCTTTGTTACAACATAACAAATTTCTTTTACCATCTTGCTAGTATGTTTTGTGAGTTGTAATTATTGTTCAAACTGTGCAGGAAGATACCCCCGATGAGTTTTCGAATCTGATACTAAATTTTATTTCTCGTAACCGAATAGGTCCACATGGGGTCGAGGTTGGTATCAAATTCCAGTCATACTTTTAATGCAATGTTATTACGAGACATTAGATCTTTTTAAGTCCCTTATTTTTATCATTTTTCTTCATCTTGTACGCTGTACCTCTTGGACGTGGTGGAAATTCAGTGGGAAAAAGATTGGAAAAAAGGGTTAGGTGGAGTAAAAAATGAAGGCTTCATTCTGTATATAATGTTTTCTTGTTATGTTCTACTATCTCACAGGTTAACCTCATGACGTAAAATTCATGTTGCAGATACCGGGACTTCGCAAACCACAGCAGTAGCAACCTCCAAATGGGCACGGGAGAAAAAGATCATTTGCAGTTTCTCCTCCAGGTTTGATCTCTCAGAATAATTTCCTAAAGGGTTGTTTCCCAAGTAGGATTTACCGAAAATCGAAGAGATTTTTTTTTTGGCACAGCAGTATTGAATCTGAGGTAGATGACAAGTAAATGTGTTCCGAGCATCGTGGTCGCTTCGGAGTAAAGCTAGTTTTGAAATCTGCAAGTACATTTCTTAAAGCATCATCCATCCAACTCGTAAGAACTGATTTCGATTGAAGCGGTCAGGATTACTTTTCTCAGCTCGTGTGAACATTGATGGTACCAATAGTTCTTATGAGGTTGGATGATTCTGTTATGCCCAAAGGACTGTAAATGAGATCCTAAGCTCAATAACTGATATGGCTATTAATAAGTGTTGAAGCCGCGACCGGGGCGTTCTTGCATGAACTCTTTCAGGTTAACTTATCTTACTACTTTGGATGAATCTATAAACCCTAATATATTCTGCGATATTAACAATGGAGTATAAGCTTCTTTTCTTTTTCCTGAAAATAGTTTACATATCATACACAATTATATAATGCACTATATTTCATCGATACTATTATGTAATTATTTTTTAGTTTTAAAAATTACAAT
TTAATTTGAGTTTTACAACATAGTTTAAGGGCATTTTTCTATTAGCTAATCGTAGTTGATTGGACATTTGTATTTAATAATTTTATAATATTAACGATGACAAAAAAATATTTTACAAAATTCAGGAGTTAAATTATAAGTTTTAAAAATTAGAGACCAGAAACTTTGTATGCTAAACGTGTAATTCAGCCATTAATTTGTGTTTTTTTATTTCATGAATGTTTATTTATATATTTTTCAATAAATGAGGCAAATCTATTCTTTAGAAAATGATTACATATATGCGTTGAGGCATTTTTCACATTATAAGCAATTCAGCCATTAATTTGTATTTTCTATTTGTTATTAAAATTTAAATAAATTTTTACAAATTTCTTAAAAGTTAATATTGATATTTAAGATTGTATATTAAGTAAAATATACATAAGCTAGCTATGATTATAAAATATGTACTCTTATTTTATTATATAGATAAATACGTTTTTAATTCTTGAGGGTTTAGAAATAATTTTCTTTGCTTTAAATTTCAAAATCTTATAATTCAACTCTCAAGGTCTTGAAAATGCTCCAAAATATAATTTTATGTTAGTTACTTTAAAAGAAAGATGATGTGATGTTTTAAATTATTTTAATGCTAATATGACAAATTTTTACAATATTTTTTAGGGTGACTAATGGAAATAAACCCTTCATTTTATCTTGGAAGTCTTTGATTTCCACATAAGTTAGAAACCCATAATTGAGTTGACATTGATTTTACTAGATTTATAAAATAAGAAAAACCTACCTTTTCTTTGAAGATCCTCTCATCGATGAGACTTCTACAAAGTCTTCTTTGCCATATCTCAATCACTAGATCAAGGTGTGTGGACACTTTTTTCAAAGAAGTATCAGTAGAAAACTTAATCACATGCTAAAAGGAAAGGGATGAGAGGAAATTTTCTAGAGAGAATATTATTTTATTCTTTTAGTATTCTAGGATGCTTAAAAGCTAAAGGTCTTACTTATAGAGACATCAGTTGAGTGTTCTTTTAGTATTCTAGGATTAGTAGTTAAAGATTTTTGGAAATATGTGCAAAAGACTTGCCCTAGAAAAATCTCTTCAAAGAAATTTCAGAAAGCCCTTTTTTCCATTCCTAGAGTTTCCCACTTCTTTAGTTAACCCATGGGTGCCCGAAGGTCCAAATGGGGTTGGAGGTGAGGTGGGTGGGGAGGAAGATAATTTTGAAGAAAGAGAGGAGAGTGATCCATGGGGCAAAGGGATGCACAAGTGTTGGTTGTTTTTTGTGATCTTGGACTTCCTTGTTTTGTTTTTTCTTTTTTTAATTTGTTATTTATTTTAGGTTAAAAAAGACCTTTGGTCTCTGTATTTTCATTGAAGCAACGATTTAATCCCTGGACATATAACAATTTAGTCCACGTACTTTCAAATTTGTAATAATTTAGTCCTTAAACTTTAGTATGTAACAATTTAATCCTTATTATTTAAAACTTGTAACAATTTAGCCCCTATCATGTAAAATTTTGTTAAAATTTAATGGGGTTTCTTATATATGTAGATCGATAAACTGCGTTGAGACTAATTGTTTTTATAAACTACAAGGACTAAATCGTTATATAATAAAAATTGATACTTCACCTTGATGAGATTTTTCATGTTATAGACTAAATTGTTACAAATTTTAAACTACAGTGACTAAATTGTTACATCCTAAAGTCTAGGGACGAAATTGTTACAAATTTAAAAGTATAAGAACCAAATCAATACATATTAAAATTTAGGAACTAAATTGTTATTTCAATGAAAGTAAAGGGACCAAAAGTGTATTTTAACCTTTATTTTACTATTTTAGTCTCTTTATTTATCTCTGTCTCTTTTTTTTTTTTTTTTGTGAGGACACAACACATACTGGAGTTTGAGGGTTTAGTCTCCTTGTTTACCTCTTTCTCTGTCCCTCTTCCTTTTGGGTTGTTGGTTGTCTAGTATGATGAGTGCGTAA
AGTTTTTTTTTTGTAATTTTAGTAATTTCTTTCTCCTTATCGTTCTTCATTTTAGGAAGCTTGCTCTTTATATTTTAGTGGTTCTTTATTTGTTTTCATTTTGTTAGTATTAAGAGTGTTACTTTGTTGTTAGTGGTTCTATTTGTATCTACTCTCATTTCTATGTAGCCAATTGAAGCTTAGTTTTAACTTTTAGCAAAGTTACATCAAAATTTTGGATAGTAATCTTATGCGATTCCACCTAGTTCTTACTCTAGTTTTGTTACGACATAGCTATAGGGCAATACAAACCGTCTGTATTAATGGGAGCAATTGAGAGTAGTTTACTATTCTACTTAATCTCCTTACTTTTTGGTCAAGTCAATTTGCTTTCCTTAATTTCTTAAGGGCTTTTCTTTTCAGTCTAATTCATGTACATATCTTCTTTATTTCTCCATGGCTAATGCTACTCGAGGATTGGGAACGGAAGAAAGGAGGATTGAAGTGAAATCTAGAACATGCCTAGTTGGAAATCTATCCTTTACCAACTTGCTGAACTAGGTAACTTCAATCAAGCTTGACAAGGAAAACTTTCTCCCATGGCAGATTATAGTTCTCCCTATTCTAAAGAGCTACAAGTTGGAATGGCACTTGATTGGAAAGACTCCATGCCCTGAACTATCAATCATCATACCACCATTAGAGGACGAGCCAAAAGGGTTACTTCTTCCTAACCCTAAAGATGACATACAGCACGACAGTCAGCAAGTGTACCAAGGTCTTCTGATCATTATGGAGAATTTAATCCTCCTCAAGAAAGTCATTAGTCAGTAG

mRNA sequence

ATGGTAATTGAAGCTGTGAAGTATAAGATGCCTCGGCTGTGGGCTGGGCTTAGCCGGACGGTGGATGGCCCAACTGTGAGCGTCGGGCGACCAGATTTCCGACCCGTGAACAATCGATTGATCTTGAAGCCAGCAATGGAATCATCGTCTTCTCACCTTGGTTCGCTTCCCGAGGAAGTTTCAGAAGGTCAAGCACAACGACAGGAGAATCAGAAATCGGTCTTCGCCTCCAAACCCGATCGCCCTCCGACTCTGAATTCTTCACAGAAGTACTTGCCCCTGGATTGGCTTGGTTATTTCGACCAAGAGGATGATATTTCCATTCCGAACTCAAACGATGTCTTCCATGTATATATGGCGGGAAAAGAAGGACCAGTTGTATTTTGTCTGCATGGAGGAGGCTATTCTGGGCTTTCTTTTGCTCTATCAGCTGGTAAAATAAAGGAAAAAGCTAGAGTAGTTGCAATGGACTTGAGAGGACATGGAAAATCGTCTTCTGACAATGATCTTGACTTATCAATTGAGACTATGTGCAATGATGTTTTGGCAGTTATAAGGAGCATGTATGGTGATTCACCTCCTGCAATTGTGCTTGTTGGTCACAGTATGGGAGGTTCAGTTGCTGTACATGTTGCTGCAAAGAAAGCATTATCTAGCCTAGCTGGTCTAGTAGTTGTGGACGTTGTGGAGGGAACTGCCATGGCTTCATTGATTCATATGCAAAAAATCTTATCCAGCAGGATGCAACATTTTTCTAGCATTGAGAAAGCGATTGAATGGAGTGTAAAAGGAGGTTCTTTGAGGAATGTTGACTCTGCCCGTGTATCAATTCCCTCTACACTTGAGTACGATGATTCAAAGAAATGCTATACTTACCGAGCAAAACTGGAAGAAACGGAACGGTATTGGAGAAGCTGGTATGAGGGTCTTTCAGAAAAATTTCTTTCATGTCCTGTTCCAAAGCTTCTGCTTCTAGCAGGGACTGACAGATTGGACAGATACCGGGACTTCGCAAACCACAGCAGTAGCAACCTCCAAATGGGCACGGGAGAAAAAGATCATTTGCAGTTTCTCCTCCAGCATCGTGGTCGCTTCGGAGTAAAGCTAGTTTTGAAATCTGCAAGTACATTTCTTAAAGCATCATCCATCCAACTCTCTAATTCATGTACATATCTTCTTTATTTCTCCATGGCTAATGCTACTCGAGGATTGGGAACGGAAGAAAGGAGGATTGAAGTGAAATCTAGAACATGCCTAATTATAGTTCTCCCTATTCTAAAGAGCTACAAGTTGGAATGGCACTTGATTGGAAAGACTCCATGCCCTGAACTATCAATCATCATACCACCATTAGAGGACGAGCCAAAAGGGTTACTTCTTCCTAACCCTAAAGATGACATACAGCACGACAGTCAGCAAGTGTACCAAGGTCTTCTGATCATTATGGAGAATTTAATCCTCCTCAAGAAAGTCATTAGTCAGTAG

Coding sequence (CDS)

ATGGTAATTGAAGCTGTGAAGTATAAGATGCCTCGGCTGTGGGCTGGGCTTAGCCGGACGGTGGATGGCCCAACTGTGAGCGTCGGGCGACCAGATTTCCGACCCGTGAACAATCGATTGATCTTGAAGCCAGCAATGGAATCATCGTCTTCTCACCTTGGTTCGCTTCCCGAGGAAGTTTCAGAAGGTCAAGCACAACGACAGGAGAATCAGAAATCGGTCTTCGCCTCCAAACCCGATCGCCCTCCGACTCTGAATTCTTCACAGAAGTACTTGCCCCTGGATTGGCTTGGTTATTTCGACCAAGAGGATGATATTTCCATTCCGAACTCAAACGATGTCTTCCATGTATATATGGCGGGAAAAGAAGGACCAGTTGTATTTTGTCTGCATGGAGGAGGCTATTCTGGGCTTTCTTTTGCTCTATCAGCTGGTAAAATAAAGGAAAAAGCTAGAGTAGTTGCAATGGACTTGAGAGGACATGGAAAATCGTCTTCTGACAATGATCTTGACTTATCAATTGAGACTATGTGCAATGATGTTTTGGCAGTTATAAGGAGCATGTATGGTGATTCACCTCCTGCAATTGTGCTTGTTGGTCACAGTATGGGAGGTTCAGTTGCTGTACATGTTGCTGCAAAGAAAGCATTATCTAGCCTAGCTGGTCTAGTAGTTGTGGACGTTGTGGAGGGAACTGCCATGGCTTCATTGATTCATATGCAAAAAATCTTATCCAGCAGGATGCAACATTTTTCTAGCATTGAGAAAGCGATTGAATGGAGTGTAAAAGGAGGTTCTTTGAGGAATGTTGACTCTGCCCGTGTATCAATTCCCTCTACACTTGAGTACGATGATTCAAAGAAATGCTATACTTACCGAGCAAAACTGGAAGAAACGGAACGGTATTGGAGAAGCTGGTATGAGGGTCTTTCAGAAAAATTTCTTTCATGTCCTGTTCCAAAGCTTCTGCTTCTAGCAGGGACTGACAGATTGGACAGATACCGGGACTTCGCAAACCACAGCAGTAGCAACCTCCAAATGGGCACGGGAGAAAAAGATCATTTGCAGTTTCTCCTCCAGCATCGTGGTCGCTTCGGAGTAAAGCTAGTTTTGAAATCTGCAAGTACATTTCTTAAAGCATCATCCATCCAACTCTCTAATTCATGTACATATCTTCTTTATTTCTCCATGGCTAATGCTACTCGAGGATTGGGAACGGAAGAAAGGAGGATTGAAGTGAAATCTAGAACATGCCTAATTATAGTTCTCCCTATTCTAAAGAGCTACAAGTTGGAATGGCACTTGATTGGAAAGACTCCATGCCCTGAACTATCAATCATCATACCACCATTAGAGGACGAGCCAAAAGGGTTACTTCTTCCTAACCCTAAAGATGACATACAGCACGACAGTCAGCAAGTGTACCAAGGTCTTCTGATCATTATGGAGAATTTAATCCTCCTCAAGAAAGTCATTAGTCAGTAG

Protein sequence

MVIEAVKYKMPRLWAGLSRTVDGPTVSVGRPDFRPVNNRLILKPAMESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSSSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVVVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDDSKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDRYRDFANHSSSNLQMGTGEKDHLQFLLQHRGRFGVKLVLKSASTFLKASSIQLSNSCTYLLYFSMANATRGLGTEERRIEVKSRTCLIIVLPILKSYKLEWHLIGKTPCPELSIIIPPLEDEPKGLLLPNPKDDIQHDSQQVYQGLLIIMENLILLKKVISQ
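
The protein sequence should follow from the CDS by translation in frame 0. The sketch below checks this on the first 36 and last 21 nucleotides of the CDS shown above rather than on the full sequence. It assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTAATTGAAGCTGTGAAGTATAAGATGCCTCGG"  # first 12 codons of the CDS
cds_end = "AAGAAAGTCATTAGTCAGTAG"                   # last 6 codons plus the stop

print(translate(cds_start))  # MVIEAVKYKMPR
print(translate(cds_end))    # KKVISQ
```

Both fragments agree with the start (MVIEAVKYKMPR) and the end (KKVISQ) of the protein sequence listed above. The same function can be applied to the full CDS to check the whole translation.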
Homology
BLAST of Sgr016823 vs. NCBI nr
Match: KAG6592359.1 (Protein phosphatase methylesterase 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 559.3 bits (1440), Expect = 3.3e-155
Identity = 289/354 (81.64%), Positives = 304/354 (85.88%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QAQ QENQKSVFASKP RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQAQPQENQKSVFASKPSRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLD------------- 345
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLD             
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDRTLTIGQMQGKFQ 300

Query: 346 -------------------RYRDFANHSSSNLQMGTGEKDHLQFLLQHRGRFGV 368
                              RY+DF N SSSNLQ  TGEKDHLQFLLQ R   G+
Sbjct: 301 MVVVRHTGHAIQEDVRMGSRYQDFVNRSSSNLQTDTGEKDHLQFLLQLRNGVGM 354
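
In BLAST pairwise output, the middle line repeats the residue where query and subject are identical, shows `+` for a positively scoring substitution, and is blank otherwise. As a rough sketch, the identities in the first row of the alignment above (query positions 46 to 105) can be counted directly. The two strings are copied from that row, and the variable names are illustrative.

```python
query = "MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD"
sbjct = "MESSSSHLGSLPEDVSEDQAQPQENQKSVFASKPSRPPTLNSSQKYLPLDWLGYFDQEED"
query_start = 46  # query coordinate of the first column in this row

assert len(query) == len(sbjct)

# Identical columns: same residue on both lines (gap characters never count).
identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))

# Mismatching columns, reported in query coordinates.
mismatches = [
    (query_start + i, q, s)
    for i, (q, s) in enumerate(zip(query, sbjct))
    if q != s
]

print(identities, len(query))
print(mismatches)
```

This row contributes 55 identities over 60 columns. The five differences fall at query positions 59, 63, 67, 80 and 104, which line up with the `+` and blank columns of the middle line.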

BLAST of Sgr016823 vs. NCBI nr
Match: XP_022925364.1 (protein phosphatase methylesterase 1 isoform X1 [Cucurbita moschata])

HSP 1 Score: 551.6 bits (1420), Expect = 6.9e-153
Identity = 290/380 (76.32%), Positives = 304/380 (80.00%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QAQ QENQKSVFASKP RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQAQPQENQKSVFASKPSRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLD------------- 345
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLD             
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDRTLTIGQMQGKFQ 300

Query: 346 ---------------------------------------------RYRDFANHSSSNLQM 368
                                                        RYRDF N SSSNLQ 
Sbjct: 301 MVVVRHTGHAIQEDVPDEFSNLILNFISRNRIGPHGVEVSIKFQSRYRDFVNRSSSNLQT 360

BLAST of Sgr016823 vs. NCBI nr
Match: XP_023535503.1 (protein phosphatase methylesterase 1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 551.2 bits (1419), Expect = 9.0e-153
Identity = 290/380 (76.32%), Positives = 305/380 (80.26%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QAQ QENQKSVFASKP+RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQAQPQENQKSVFASKPNRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLD------------- 345
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLD             
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDRTLTIGQMQGKFQ 300

Query: 346 ---------------------------------------------RYRDFANHSSSNLQM 368
                                                        RYRDF N SSSNLQ 
Sbjct: 301 MVVVRHTGHAIQEDVPDEFSNLILNFISRNRIGPHGVEVSIKFQSRYRDFVNRSSSNLQT 360

BLAST of Sgr016823 vs. NCBI nr
Match: XP_038890779.1 (protein phosphatase methylesterase 1 isoform X2 [Benincasa hispida])

HSP 1 Score: 528.1 bits (1359), Expect = 8.2e-146
Identity = 263/286 (91.96%), Positives = 278/286 (97.20%), Query Frame = 0

Query: 48  SSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDIS 107
           SSSSHLGSLPE+VSE Q Q QENQKSVFASKPDRPPT+NSSQKYLPLDWLGYFDQEDDIS
Sbjct: 4   SSSSHLGSLPEDVSENQGQPQENQKSVFASKPDRPPTINSSQKYLPLDWLGYFDQEDDIS 63

Query: 108 IPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSSSD 167
           IP+SN+VFHVYMAGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSSS+
Sbjct: 64  IPDSNEVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSSSE 123

Query: 168 NDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVVVD 227
           NDLDLSI+TMCNDVLAVI++M+GDSPPAI+LVGHSMGGSVAVHVAAKKAL SLAGLVVVD
Sbjct: 124 NDLDLSIKTMCNDVLAVIKTMFGDSPPAIILVGHSMGGSVAVHVAAKKALPSLAGLVVVD 183

Query: 228 VVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDDSK 287
           VVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVKGGSLRN++SARVSIPSTL YDDSK
Sbjct: 184 VVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKGGSLRNINSARVSIPSTLTYDDSK 243

Query: 288 KCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           KCYTYRAKLEETE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLDR
Sbjct: 244 KCYTYRAKLEETEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 289

BLAST of Sgr016823 vs. NCBI nr
Match: XP_022973537.1 (protein phosphatase methylesterase 1 isoform X2 [Cucurbita maxima])

HSP 1 Score: 527.7 bits (1358), Expect = 1.1e-145
Identity = 264/288 (91.67%), Positives = 279/288 (96.88%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QA+ QENQKSVFASKP+RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQARPQENQKSVFASKPNRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLDR
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 288

BLAST of Sgr016823 vs. ExPASy Swiss-Prot
Match: Q8BVQ5 (Protein phosphatase methylesterase 1 OS=Mus musculus OX=10090 GN=Ppme1 PE=1 SV=5)

HSP 1 Score: 236.1 bits (601), Expect = 8.4e-61
Identity = 141/330 (42.73%), Positives = 186/330 (56.36%), Query Frame = 0

Query: 50  SSHLGSLPEEVS-EGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 109
           S HLG LP      G    Q   K      P R       + + P+ W  YF+  +D+ +
Sbjct: 7   SMHLGRLPSRPPLPGSGGSQSGAK--MRMGPGR------KRDFTPVPWSQYFESMEDVEV 66

Query: 110 PN--SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKI--KEKARVVAMDLRGHGKS 169
            N    D F VY +G EGPV+  LHGGG+S LS+A+    I  + + R+VA+DLRGHG++
Sbjct: 67  ENETGKDTFRVYKSGSEGPVLLLLHGGGHSALSWAVFTAAIISRVQCRIVALDLRGHGET 126

Query: 170 SSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLV 229
              N  DLS ETM  DV  V+ +MYGD PP ++L+GHSMGG++AVH AA   + SL GL 
Sbjct: 127 KVKNSEDLSAETMAKDVGNVVEAMYGDLPPPVMLIGHSMGGAIAVHTAAANLVPSLLGLC 186

Query: 230 VVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEY- 289
           ++DVVEGTAM +L  MQ  L  R + F S+E AIEWSVK G +RN++SARVS+   ++  
Sbjct: 187 MIDVVEGTAMDALNSMQNFLRGRPKTFKSLENAIEWSVKSGQIRNLESARVSMVGQVKQC 246

Query: 290 ------------------------------------DD--SKK--CYTYRAKLEETERYW 334
                                               DD  +KK   YT+R +L +TE+YW
Sbjct: 247 EGITSPEGSKSIVEGIIEEEEEDEEGSESVNKRKKEDDMETKKDHPYTWRIELAKTEKYW 306

BLAST of Sgr016823 vs. ExPASy Swiss-Prot
Match: Q58DN4 (Protein phosphatase methylesterase 1 OS=Bos taurus OX=9913 GN=PPME1 PE=2 SV=3)

HSP 1 Score: 235.0 bits (598), Expect = 1.9e-60
Identity = 141/330 (42.73%), Positives = 186/330 (56.36%), Query Frame = 0

Query: 50  SSHLGSLPEEVS-EGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 109
           S HLG LP      G    Q   K      P R       + + P+ W  YF+  +D+ +
Sbjct: 7   SMHLGRLPSRPPLPGSGGSQSGAK--MRMGPGR------KRDFSPVPWSQYFESMEDVEV 66

Query: 110 PN--SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKI--KEKARVVAMDLRGHGKS 169
            N    D F VY +G EGPV+  LHGGG+S LS+A+    I  + + R+VA+DLRGHG++
Sbjct: 67  ENETGKDTFRVYKSGSEGPVLLLLHGGGHSALSWAVFTAAIISRVQCRIVALDLRGHGET 126

Query: 170 SSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLV 229
              N  DLS ETM  DV  V+ +MYGD PP I+L+GHSMGG++AVH A+   + SL GL 
Sbjct: 127 KVRNSEDLSAETMAKDVGNVVEAMYGDLPPPIMLIGHSMGGAIAVHTASSNLVPSLLGLC 186

Query: 230 VVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEY- 289
           ++DVVEGTAM +L  MQ  L  R + F S+E AIEWSVK G +RN++SARVS+   ++  
Sbjct: 187 MIDVVEGTAMDALNSMQNFLRGRPKTFKSLENAIEWSVKSGQIRNLESARVSMVGQVKQC 246

Query: 290 ------------------------------------DD--SKK--CYTYRAKLEETERYW 334
                                               DD  +KK   YT+R +L +TE+YW
Sbjct: 247 EGITSPEGSKSIVEGIIEEEEEDEEGSESVNKRKKEDDMETKKDHPYTWRIELAKTEKYW 306

BLAST of Sgr016823 vs. ExPASy Swiss-Prot
Match: Q4FZT2 (Protein phosphatase methylesterase 1 OS=Rattus norvegicus OX=10116 GN=Ppme1 PE=1 SV=2)

HSP 1 Score: 234.2 bits (596), Expect = 3.2e-60
Identity = 138/330 (41.82%), Positives = 183/330 (55.45%), Query Frame = 0

Query: 50  SSHLGSLPEEVS-EGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 109
           S HLG LP      G    Q   K      P R       + + P+ W  YF+  +D+ +
Sbjct: 7   SMHLGRLPSRPPLPGSGGSQSGAK--MRMGPGR------KRDFTPVPWSQYFESMEDVEV 66

Query: 110 PN--SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKI--KEKARVVAMDLRGHGKS 169
            N    D F VY +G EGPV+  LHGGG+S LS+A+    I  + + R+VA+DLR HG++
Sbjct: 67  ENETGKDTFRVYKSGSEGPVLLLLHGGGHSALSWAVFTAAIISRVQCRIVALDLRSHGET 126

Query: 170 SSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLV 229
              N  DLS ETM  DV  V+ +MYGD PP ++L+GHSMGG++AVH AA   + SL GL 
Sbjct: 127 KVKNSEDLSAETMAKDVGNVVEAMYGDLPPPVMLIGHSMGGAIAVHTAAANLVPSLLGLC 186

Query: 230 VVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEY- 289
           ++DVVEGTAM +L  MQ  L  R + F S+E AIEWSVK G +RN++SARVS+   ++  
Sbjct: 187 MIDVVEGTAMDALNSMQNFLRGRPKTFKSLENAIEWSVKSGQIRNLESARVSMVGQVKQC 246

Query: 290 ------DDSKKC----------------------------------YTYRAKLEETERYW 334
                 + SK                                    YT+R +L +TE+YW
Sbjct: 247 EGITSPESSKSIVEGIIEEEEEDEEGSESVNKRKKEDDMETKKDHPYTWRIELAKTEKYW 306

BLAST of Sgr016823 vs. ExPASy Swiss-Prot
Match: Q9Y570 (Protein phosphatase methylesterase 1 OS=Homo sapiens OX=9606 GN=PPME1 PE=1 SV=3)

HSP 1 Score: 232.6 bits (592), Expect = 9.3e-60
Identity = 141/330 (42.73%), Positives = 184/330 (55.76%), Query Frame = 0

Query: 50  SSHLGSLPEEVS-EGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 109
           S HLG LP      G    Q   K      P R       + + P+ W  YF+  +D+ +
Sbjct: 7   SMHLGRLPSRPPLPGSGGSQSGAK--MRMGPGR------KRDFSPVPWSQYFESMEDVEV 66

Query: 110 PN--SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKI--KEKARVVAMDLRGHGKS 169
            N    D F VY +G EGPV+  LHGGG+S LS+A+    I  + + R+VA+DLR HG++
Sbjct: 67  ENETGKDTFRVYKSGSEGPVLLLLHGGGHSALSWAVFTAAIISRVQCRIVALDLRSHGET 126

Query: 170 SSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLV 229
              N  DLS ETM  DV  V+ +MYGD PP I+L+GHSMGG++AVH A+   + SL GL 
Sbjct: 127 KVKNPEDLSAETMAKDVGNVVEAMYGDLPPPIMLIGHSMGGAIAVHTASSNLVPSLLGLC 186

Query: 230 VVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARV--------- 289
           ++DVVEGTAM +L  MQ  L  R + F S+E AIEWSVK G +RN++SARV         
Sbjct: 187 MIDVVEGTAMDALNSMQNFLRGRPKTFKSLENAIEWSVKSGQIRNLESARVSMVGQVKQC 246

Query: 290 ----------------------------SIPSTLEYDD--SKK--CYTYRAKLEETERYW 334
                                       SI    + DD  +KK   YT+R +L +TE+YW
Sbjct: 247 EGITSPEGSKSIVEGIIEEEEEDEEGSESISKRKKEDDMETKKDHPYTWRIELAKTEKYW 306

BLAST of Sgr016823 vs. ExPASy Swiss-Prot
Match: Q5R4F9 (Protein phosphatase methylesterase 1 OS=Pongo abelii OX=9601 GN=PPME1 PE=2 SV=3)

HSP 1 Score: 229.9 bits (585), Expect = 6.0e-59
Identity = 140/330 (42.42%), Positives = 183/330 (55.45%), Query Frame = 0

Query: 50  SSHLGSLPEEVS-EGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 109
           S HLG LP      G    Q   K      P R       + + P+ W  YF+  +D+ +
Sbjct: 7   SMHLGRLPSRPPLPGSGGSQSGAK--MRMGPGR------KRDFSPVPWSQYFESMEDVEV 66

Query: 110 PN--SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKI--KEKARVVAMDLRGHGKS 169
            N    D F VY +G EGPV+  LHGGG+S LS+A+    I  + + R+VA+DLR HG++
Sbjct: 67  ENETGKDTFRVYKSGSEGPVLLLLHGGGHSALSWAVFTAAIISRVQCRIVALDLRSHGET 126

Query: 170 SSDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLV 229
              N  DLS ETM  DV  V+ +MYGD PP I+L+GHSMGG++AVH A+   + SL GL 
Sbjct: 127 KVKNPEDLSAETMAKDVGNVVEAMYGDLPPPIMLIGHSMGGAIAVHTASSNLVPSLLGLC 186

Query: 230 VVDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARV--------- 289
           ++ VVEGTAM +L  MQ  L  R + F S+E AIEWSVK G +RN++SARV         
Sbjct: 187 MIGVVEGTAMDALNSMQNFLRGRPKTFKSLENAIEWSVKSGQIRNLESARVSMVGQVKQC 246

Query: 290 ----------------------------SIPSTLEYDD--SKK--CYTYRAKLEETERYW 334
                                       SI    + DD  +KK   YT+R +L +TE+YW
Sbjct: 247 EGITSPEGSKSIVEGIIEEEEEDEEGSESISKRKKEDDMETKKDHPYTWRIELAKTEKYW 306

BLAST of Sgr016823 vs. ExPASy TrEMBL
Match: A0A6J1EBY6 (Protein phosphatase methylesterase 1 OS=Cucurbita moschata OX=3662 GN=LOC111432673 PE=3 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 3.3e-153
Identity = 290/380 (76.32%), Positives = 304/380 (80.00%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QAQ QENQKSVFASKP RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQAQPQENQKSVFASKPSRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLD------------- 345
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLD             
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDRTLTIGQMQGKFQ 300

Query: 346 ---------------------------------------------RYRDFANHSSSNLQM 368
                                                        RYRDF N SSSNLQ 
Sbjct: 301 MVVVRHTGHAIQEDVPDEFSNLILNFISRNRIGPHGVEVSIKFQSRYRDFVNRSSSNLQT 360

BLAST of Sgr016823 vs. ExPASy TrEMBL
Match: A0A6J1I7S5 (Protein phosphatase methylesterase 1 OS=Cucurbita maxima OX=3661 GN=LOC111472065 PE=3 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 5.2e-146
Identity = 264/288 (91.67%), Positives = 279/288 (96.88%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QA+ QENQKSVFASKP+RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQARPQENQKSVFASKPNRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLDR
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 288

BLAST of Sgr016823 vs. ExPASy TrEMBL
Match: A0A6J1I8X2 (Protein phosphatase methylesterase 1 OS=Cucurbita maxima OX=3661 GN=LOC111472065 PE=3 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 5.2e-146
Identity = 264/288 (91.67%), Positives = 279/288 (96.88%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE QA+ QENQKSVFASKP+RPPTLNSSQKYLPLDWLGYFDQE+D
Sbjct: 1   MESSSSHLGSLPEDVSEDQARPQENQKSVFASKPNRPPTLNSSQKYLPLDWLGYFDQEED 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVY AGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYTAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLS+ETMCNDVLAVI++M+GDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSVETMCNDVLAVIKTMFGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVK GSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKAGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           SKKCYTYRAKLE+TE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLDR
Sbjct: 241 SKKCYTYRAKLEQTEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 288

BLAST of Sgr016823 vs. ExPASy TrEMBL
Match: A0A0A0KSW0 (Protein phosphatase methylesterase 1 OS=Cucumis sativus OX=3659 GN=Csa_5G649860 PE=3 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 8.8e-146
Identity = 264/288 (91.67%), Positives = 276/288 (95.83%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSSHLGSLPE+VSE Q Q Q NQKSVFASKP RPPT N S+KYLPLDWLGYFDQEDD
Sbjct: 1   MESSSSHLGSLPEDVSEDQGQSQGNQKSVFASKPQRPPTQNVSEKYLPLDWLGYFDQEDD 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIPNSN+VFHVYMAGKEGPVVFCLHGGGYSGLSFALSAG IKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPNSNEVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGIIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLSIETMCNDVLAVI++MYGDSPPAIVLVGHSMGGSVAVHVAAK+AL SLAGLVV
Sbjct: 121 SENDLDLSIETMCNDVLAVIKTMYGDSPPAIVLVGHSMGGSVAVHVAAKRALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF S+EKAIEWSVKGGSLRNVDSARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSVEKAIEWSVKGGSLRNVDSARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           SKKCYTYRAKLEETE+YW+SWYEGLSEKFLSCPVPKLLLLAGTDRLD+
Sbjct: 241 SKKCYTYRAKLEETEQYWKSWYEGLSEKFLSCPVPKLLLLAGTDRLDK 288

BLAST of Sgr016823 vs. ExPASy TrEMBL
Match: A0A6J1CLM9 (Protein phosphatase methylesterase 1 OS=Momordica charantia OX=3673 GN=LOC111012093 PE=3 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 1.3e-141
Identity = 260/288 (90.28%), Positives = 273/288 (94.79%), Query Frame = 0

Query: 46  MESSSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDD 105
           MESSSS LGSL E+        QENQKSVFASKPDRPPT+NSSQKY+PLDWL YFD+EDD
Sbjct: 1   MESSSSPLGSLSED-------GQENQKSVFASKPDRPPTMNSSQKYMPLDWLDYFDEEDD 60

Query: 106 ISIPNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSS 165
           ISIP+SNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMD RGHGKSS
Sbjct: 61  ISIPDSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDFRGHGKSS 120

Query: 166 SDNDLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVV 225
           S+NDLDLSIETMCNDVLAVI++MYGDSPPAIVLVGHSMGGSVAVHVAAKKAL SLAGLVV
Sbjct: 121 SENDLDLSIETMCNDVLAVIKTMYGDSPPAIVLVGHSMGGSVAVHVAAKKALPSLAGLVV 180

Query: 226 VDVVEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDD 285
           VDVVEGTAMASLIHMQKILS+RMQHF SIEKAIEWSVKGGSLRNV+SARVSIPSTL YDD
Sbjct: 181 VDVVEGTAMASLIHMQKILSNRMQHFPSIEKAIEWSVKGGSLRNVESARVSIPSTLTYDD 240

Query: 286 SKKCYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           SKKCYTYRAKL++TE+YWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR
Sbjct: 241 SKKCYTYRAKLKQTEQYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 281

BLAST of Sgr016823 vs. TAIR 10
Match: AT4G10050.1 (esterase/lipase/thioesterase family protein)

HSP 1 Score: 437.2 bits (1123), Expect = 1.8e-122
Identity = 219/285 (76.84%), Positives = 250/285 (87.72%), Query Frame = 0

Query: 49  SSSHLGSLPEEVSEGQAQRQENQKSVFASKPDRPPTLNSSQKYLPLDWLGYFDQEDDISI 108
           SSS+L  L E+ SE   +  E   SVFA++P RPP+ NSS+KY P++W  YFD+EDDISI
Sbjct: 3   SSSNLNPLREDTSEEHLEPIEAPASVFAARPLRPPSTNSSEKYSPVEWKSYFDKEDDISI 62

Query: 109 PNSNDVFHVYMAGKEGPVVFCLHGGGYSGLSFALSAGKIKEKARVVAMDLRGHGKSSSDN 168
             S+DVFHVYMAG EGPVVFCLHGGGYSGLSF++ A KIKEKARVVAMDLRGHGKS S+N
Sbjct: 63  TGSDDVFHVYMAGNEGPVVFCLHGGGYSGLSFSIVASKIKEKARVVAMDLRGHGKSVSEN 122

Query: 169 DLDLSIETMCNDVLAVIRSMYGDSPPAIVLVGHSMGGSVAVHVAAKKALSSLAGLVVVDV 228
           +L+LS+ETM NDVLAVI+ +YGDSPPAIVLVGHSMGGSVAV VAA K L SLAGLVVVDV
Sbjct: 123 ELELSLETMSNDVLAVIKELYGDSPPAIVLVGHSMGGSVAVQVAANKTLPSLAGLVVVDV 182

Query: 229 VEGTAMASLIHMQKILSSRMQHFSSIEKAIEWSVKGGSLRNVDSARVSIPSTLEYDDSKK 288
           VEGTA++SLIHMQKILS+RMQHF SIEKAIE+SV+GGSLRN+DSARVSIP+TL+YDDSK 
Sbjct: 183 VEGTAISSLIHMQKILSNRMQHFPSIEKAIEYSVRGGSLRNIDSARVSIPTTLKYDDSKH 242

Query: 289 CYTYRAKLEETERYWRSWYEGLSEKFLSCPVPKLLLLAGTDRLDR 334
           CY YR +LEETE+YW+ WY+GLSEKFLS PVPKLLLLAGTDRLDR
Sbjct: 243 CYVYRTRLEETEQYWKGWYDGLSEKFLSSPVPKLLLLAGTDRLDR 287

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
KAG6592359.1    3.3e-155   81.64     Protein phosphatase methylesterase 1, partial [Cucurbita argyrosperma subsp. sor...
XP_022925364.1  6.9e-153   76.32     protein phosphatase methylesterase 1 isoform X1 [Cucurbita moschata]
XP_023535503.1  9.0e-153   76.32     protein phosphatase methylesterase 1 [Cucurbita pepo subsp. pepo]
XP_038890779.1  8.2e-146   91.96     protein phosphatase methylesterase 1 isoform X2 [Benincasa hispida]
XP_022973537.1  1.1e-145   91.67     protein phosphatase methylesterase 1 isoform X2 [Cucurbita maxima]

Match Name      E-value    Identity  Description
Q8BVQ5          8.4e-61    42.73     Protein phosphatase methylesterase 1 OS=Mus musculus OX=10090 GN=Ppme1 PE=1 SV=5
Q58DN4          1.9e-60    42.73     Protein phosphatase methylesterase 1 OS=Bos taurus OX=9913 GN=PPME1 PE=2 SV=3
Q4FZT2          3.2e-60    41.82     Protein phosphatase methylesterase 1 OS=Rattus norvegicus OX=10116 GN=Ppme1 PE=1...
Q9Y570          9.3e-60    42.73     Protein phosphatase methylesterase 1 OS=Homo sapiens OX=9606 GN=PPME1 PE=1 SV=3
Q5R4F9          6.0e-59    42.42     Protein phosphatase methylesterase 1 OS=Pongo abelii OX=9601 GN=PPME1 PE=2 SV=3

Match Name      E-value    Identity  Description
A0A6J1EBY6      3.3e-153   76.32     Protein phosphatase methylesterase 1 OS=Cucurbita moschata OX=3662 GN=LOC1114326...
A0A6J1I7S5      5.2e-146   91.67     Protein phosphatase methylesterase 1 OS=Cucurbita maxima OX=3661 GN=LOC111472065...
A0A6J1I8X2      5.2e-146   91.67     Protein phosphatase methylesterase 1 OS=Cucurbita maxima OX=3661 GN=LOC111472065...
A0A0A0KSW0      8.8e-146   91.67     Protein phosphatase methylesterase 1 OS=Cucumis sativus OX=3659 GN=Csa_5G649860 ...
A0A6J1CLM9      1.3e-141   90.28     Protein phosphatase methylesterase 1 OS=Momordica charantia OX=3673 GN=LOC111012...

Match Name      E-value    Identity  Description
AT4G10050.1     1.8e-122   76.84     esterase/lipase/thioesterase family protein
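
The percentages in each HSP header follow directly from the residue counts (for example, 264 identical positions over an alignment length of 288). As a minimal sketch, not part of the database's own tooling, the Python below parses one such header and recomputes the percentages. The function name `parse_hsp` and the regular expression are illustrative assumptions about the header layout shown on this page.

```python
import re

# Assumed layout of the two-line HSP header shown in the BLAST sections above.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w+\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Extract score, E-value and recomputed percentages from an HSP header."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        # Percent positives = conservative matches / alignment length.
        "positive_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Header taken from the Cucurbita maxima (XP_022973537.1) hit.
header = (
    "HSP 1 Score: 527.7 bits (1358), Expect = 1.1e-145\n"
    "Identity = 264/288 (91.67%), Positives = 279/288 (96.88%), Query Frame = 0"
)
print(parse_hsp(header))
```

For this hit the recomputed identity is 91.67, matching the value listed in the summary table.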
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                 Source       Source Term   Source Description                             Alignment
IPR000073  Alpha/beta hydrolase fold-1                     PRINTS       PR00111       ABHYDROLASE                                    coord: 150..165, score: 41.81; coord: 198..211, score: 47.04
IPR000073  Alpha/beta hydrolase fold-1                     PFAM         PF12697       Abhydrolase_6                                  coord: 127..339, e-value: 1.3E-19, score: 71.7
IPR029058  Alpha/Beta hydrolase fold                       GENE3D       3.40.50.1820  alpha/beta hydrolase                           coord: 89..349, e-value: 4.8E-77, score: 261.5
IPR029058  Alpha/Beta hydrolase fold                       SUPERFAMILY  53474         alpha/beta-Hydrolases                          coord: 114..331
None       No IPR available                                MOBIDB_LITE  mobidb-lite   disorder_prediction                            coord: 48..86
IPR016812  Protein phosphatase methylesterase, eukaryotic  PANTHER      PTHR14189     PROTEIN PHOSPHATASE METHYLESTERASE-1 RELATED   coord: 61..333
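
The `coord` values give the span of each InterPro match on the protein. The short Python sketch below turns a few of them into match lengths. It assumes the coordinates are 1-based, inclusive residue positions, which this page does not state explicitly, and the helper name `span_length` is hypothetical.

```python
def span_length(coord):
    """Length of a 'start..end' span, assuming inclusive coordinates."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1


# Coordinates copied from the InterPro table above.
matches = {
    "PF12697 (Abhydrolase_6)": "127..339",
    "PTHR14189 (PANTHER)": "61..333",
    "mobidb-lite (disorder)": "48..86",
}

for name, coord in matches.items():
    print(f"{name}: {span_length(coord)} residues")
```

Under that assumption the Pfam Abhydrolase_6 match covers 213 residues and the predicted disordered region covers 39.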

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr016823.1   Sgr016823.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006482      protein demethylation
molecular_function  GO:0051723      protein methylesterase activity