Sgr011659 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr011659
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionMethionine aminopeptidase
Locationtig00153024: 246486 .. 267518 (-)
RNA-Seq ExpressionSgr011659
SyntenySgr011659
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCTGCAAGAGACGTATAATCGGCCTTCCCCATTAGATTTTTCTCTTACTCATAAACTAGAGGAACAGCTTGACCGACTTCTTGCGGAGGAGGAAATATATTGGAAGCAATGGTCCCATGAAAACTGGCTGAAGTGGGGTGACCATAATATGAAATGGTTTCATAGAAAGGCGTCTATGCGAAAGAAGAAGAACAAATTTATTGGGATTTCGTATGTTCATGGCAATTGGTGAGCGGACAAGGCCAAGGTTAAGGGGATTTTTGTTGACTATTTTCAGTCTATCTTTAGTACGACAAGTCCATTGAGTGTTGTAGGAAGGAGGGGGAGGGGGCTATCAAGCAAATGTATCCAATTAAAGCCCCAGGTCCCAATGTATTTCTAACACTTTTCTATCAAAAGTATTAGGATATTATTAGCAACAAAATGGTTGCTAGCTGCTTGGAGATATTAAACGTCCGACATTCGATCAGGAACTGGAACTCAACTAATATTGCTCTTATCTAGAAGGTAAATTCACCAAAGCTAGTGGCAGATTATCGATTTATCAGTCTGTATAATGTCAGCTATAAAATAGTGACCAAGGTTCTAGTGAATAGACTAAAATCGGTTTTGGGGAGCATTGTGTTGGACTCTCAATCAGCATTCATTATAGGGAAAGCTGTCGTGGATAATATTATTCTTGACCATGAGTGTTTATATTATATTAAAAACAGGAAGGAGGGGAGGAAGGGTTTGGCAACACTGAAGTTGGACATGAGCAAAGCATATGATAGAGTGGAATGGATTTTCTTGGAGAAAATCATGTTGTATTTGGGCTTTGACGTCAAATGGGTGGATATTGTTATGGACTGTATACCAAAGTTAGCATTTCAGTTCTTATAAACGATGAGGCCAAAGGGTGTATTACCCCGAGCAGAGGATTACGTCAAGGTGACCTCCTATCACCCTACCTTTTTATTATCTGTGCTGAAGGTTTATCTTCTGCTTTGTAGTATGCTTACTCCCATAATCAGTTATCTGGGATTGCAATACATAGAAACAGTCCGAATGTGTCTCATTTAATTTTTGCGGATGATAGGTTGATTTTCTGTAAGGCTGAGGCTTCTGAATGTGTATTGGTTAAATTTTTTTTTGAGGAATTATGAAGTTGCGTCGGGCCAGTGCATTAACTTTTGTAAATCAGCTTTATTTATCTCCTCCAATGTTGGTTTGGATTTCAAATTGTATCTTCAGGATGTGCTAGACATCGATGTAGTCCCTAATCTTGGGAAGTATCTTGGTCTTCCTTCCTGTTTCTCTAAGAGTAAGGGGGAAGATTTTAGATATATTAAGGAAAGGGTTTGGAAGGTGATGCAAGGATGGAATGATCAATTGTTTTTAGTGGGTGGAAAAGAAGTTCTGATTAAAAGTGCTGCACAAGCTATTCCAACCTATGTGATGGGTTGTTTTAAGCTCTCGAAACAATTTTGTGCGGAGATTACTAAAATGATTGCAAGGTTTTAGTAGGGGTCAACGATTGCGAAGAGATGGATTCATTGGAAAAGTTGGGAAAAGATGTGTTATCCTAAGGAATTGAGGGGTTTTAATTTCAGGGATTTTAAAGGGTTCTTAAGAACCCAAATTTAATGGTTTTAAAGGTGCTTTATGGTAAATACTATTTGGGGTCTTCCATTTTTAGGCTACTTTAGGGTATTGTCCTTCTTATTTCTGGTGCGACTTTTTATGAGGCCGTGACTACTAAACAAAGGACTTCGATCTCAGTTCTTTACTGATCGTTGGCTTTCGAGAGAAAGTTCTTTCCAGACTATTACGCAAAATTTGTCCAGTAATGAGATGTTGGTGGTTGATTTTATTTCCCCCTCTATGTGCCGGGATGTGGGGGACATTGAGGTCTTATGTTTGTAATGAAGATCTCGAAGTTATTTTAAAAACTCCAATCAGTATCATGGATCAGCAGGACAGTTGGGTGTGGGATTACAACGAAGAATACAGTGTCAAGAGTGGATACAAATTGTTCATGTTGCTAAAACATAATGAAACTACTTTTTATGCTGCCGCTATGAATGAGTGGTGGAAGTGTGTCTGGAAGCTTCAAATTCCTTCCAAAATTAAAATATTCATCTGTCATGCTTTTCATGATTGTATCCCTACGTTATGGAATCTAAAGAATAGGGGAGTAAGCGTGGCGTCTGATTGCCCTTTATGCTTGAAAAGAGTGGAAACCACTAACCATACTCTGTTTACCTATAAACGGGCAAAATCCGTATGGAGAACTATTTTACCATCAGTTCCAGTTGAGGTGGATGTTAATTAGAACGTTCAGGACAAATTTTTCTTATTAAAGGAGTCTCTAACGACGAAGGAGTTGGAAATTTCTAGTGTGTGCTGTTGGGCTTTATGGAATGACTACAACTCGCTTGGTCGAGGTGGTCAAATACCAAGTATTGTAGTAAGAGGTGACTGGGTCTTGAATTATGTAGAGGAGTTCGGCAAAGGTAGAGGTTGTACTTTTTCTAATTCGACAAACAACTTTGGGAATGAGAAGGCGCGTAATGGGGGCTGGGTGGATTCGACCCAAGAGCAACTGGATCAAGATTAACGTCGATGTAACTTGCAAACGAAACCCACTGGGAACTGGATTAGGGGTAATATGTCGTGATGATAGGGGTTTACTGCTGGCAACGATGTTGTGTTATCTTCGAGTTACTTATGTATTTTTACAAGCTGAGATTCTTTCCATTCTTGACGCACTCCAGTTGGCTCAACATTTAAATTTCAAGAAGGTTTGTGTGAAATTAGATGCCTTTCAAGCAATACAAGTCATTAACAATAGGGATGCCACTTTACTGTCTGTTGGAACGTGGGTGGAGGACATCCAAAATCTCATTGAGAGTTTTGAGGCGGTTGAATTTTCTATCTGCGTAGAGAGGGAAATTTGGCTGCTCACATTCCTTGGCTTTTGAAGCAGTTTTTACTTCTTCAAGCACGCTTTATCGTTCAAAGTTTCCTTCATGGCTGTTAAATGTTATTTAGAAGAAGGTTAGGCATGTAGCTCACGTGGCAATACCATCTTGATTTCAATGATGTCTCTATCTTTCAAAAAAAAAAAGACTAAAAATATTGGATCCTAAATGGGCCTCATGGGCCCTACCTGATAAGCAGCCGCTCCTTGCATTGATTGGGCCCATCTAACTCGGCTCAGTACCCCCACAACGTTATCCTCAATTCCTCATGTTGTTGTTGGCTGCTGTTCGTTAGTTGTTGCAGAAAATGGCATTGGGCGCTTCGCTCGTTCGAGTTTCATTACTGAAGCTCTCTGCAGTCTCACATGGTGAAGCTTTCTCACCTTCGTCGAAGTCGGCGTCCTTTATGGGCGCGCCACTTAATTTTCTTCCCCCTTCTCGCTCTGGTATGCTAGTCCTTCTGTTCTCTAAATGTGCGAACTATCCTCAAACGTTGTAGCTTGTTTGCTTCCATGGAATATTTGGAAGCTTTATGACTTCTGATTGCGTTGATTATCATTTATTTCAGCAGCCAGTTACTTGCATTCGAACTAATTAAGTTCATTTTATCTGTTAGCCAGTGGGTCTAATTTTGTTTAACCTGACACATGTGCATCTTGTTTTTTTCGTTCTATGCAGGGAAACAAAAGCATTTTCATGAGAATCTTGTCGTCATATCGAAGAGAATATCGGGTCTGGAGGAAGCCATGAGAATTAGGAGGTACACCAGAGTCATGAGCTTTAAGGCTCTTTGAGCACGTATGGGACTGTTTAATAATTTGTTGATCTTATACATTTTTTTCTTATTGATACCAGTCTAATGCACTGCTTAAATACTTGTTGACGGTTTAGGCTGCTTTATGATGTTGCTTTATAGTAATACTCTTTACTATGTTATAGATTCTATTGTTTCTGTTCCAATGAATATAGTAATTAAAAATTTATTGTGTTGTCTCTCTCATAGACAATCTCGTATACACTCTTGCTATGAGATGGGATTTTGTGGCCTGTGGGTACTAGAATGAGATAGAGTCTACTTTGTATTGAAGAATTAAACATTAGAACTTTACTTCTAATGATGTATAAAAAGCTATGAACTTTTATCTTCATTTTCTCAATGAACGTGTCTCTTGCATGTTTACATATGCCCGTGTGAACCTGTTTGTGTTGAATATGTCTGATACCACAGCTCTACTTAATTTAGTGTATTCATGCTTTCTTGTCCTCTTTACTTAGACATGTTGATTTTATTAAATGAGGCAGTCTTAGTCTGCCTACCCACACTATATTTATTGGTGCAGTATATGGCTACCTAAGTTATTTATTGGTGCACTGTATGGCTACCTAAGATATTGGCACTCTCATTGCTTTTATTGTAATTTTCAAATTATTTGATTGTAGCTACCAATTTTATCATGTAGAGAGCGAGAGCTTGAAATTGTGAGAAAAGTTAGAAAGAGACAACCATTGAGGCGTGGAAAGGTATCTCCACGTCTTCCTGTGCCTGATCACATACAAAAGCCTCCTTATGTTGGTTCTTCTATACTGCCAGAAATTTCAAGTGAGTATCAAATGCATGATTCTGAAGGAATTGCTCAAATGAGGGCTGCATGTCAGCTTGCTGCTCGTGTGTTAGACTATGCAGGAACACTAGTGAGAGTAAGTTTTATGCTTGTACGTTCAATATCAGTTTTATAAGCTTTTGGCTTGTACATTCAATATCAGTTTTTTTATCATCTACTATATTGGTATTGGAACCTTGTAGTGTTTATTTGTGATTTACTTTTTGTTAGTAGATCATATTCAGATAATTTGGATGTGTTAAATGGCAATTTGAGTATCTCTTGACACGCATGATTATGATTTTTTTTATCAGCCCTCAGTAACAACAAATGAAATTGACAAAGCAGTGCATCAGATGATTATTGATGCTGGTGCTTATCCTTCACCTCTTGGCTACGGGGGATTCCCAAAGAGTGTTTGCACATCAGTTAATGAGTGCATGTGTCACGGAATACCGGACTCTCGTCAATTACAGGTTCCATTTTTGTATATCTAATAATTACATAAAATACATGTTTATTATGCTAATAATTACTCTTTTGTGCAGAGTGGTGATATAATAAACATTGACGTGACAGTCTACCTAAATGTACGTTCTTAATTGTTTCTTTCATTAGATTTAGCACCTTTTTATCCTGTAATCTACTTAAATGCTTGATTTAGTTTACTCCGCATTTGCATTGCAAACCAATATTCTGGAAGCTTTCTTCATTTCTGAATTCATATATGAGGCACGTGAGAGAAGAGTAAAAATTGTAATGATGAAGTTGTCTTCTTCAAAAAAGAAATCACATCTTTACTGTTTATTTTTCATTATACCAAAACCAAATAAAAACATAAATCTTAGTGGGAGTTCTCTTAAACCAAATCAAACAAAGAGATTCAAATTTTGGATGAAGGAATTTTTCACATAATGATAATAAGATTTGCAAACAAATGTTTCATTACCATAAACCATTGAAGGAATGATAATAACTTGAAGAATGGACATCAAAGAAGTAAACTCTTAAATTTTTTAAGCAATAGAATTTGTTCTAAATGAAAGATTCCAAATTGATTTTATAAGAAGGATAAAAGGACAGAGATAGCGTGCAATGATCCAATGGAACAGGTTGAAGGAAATGAAGTCATAGAAAGTAAATACTGAATATCTTCCTGTAATTGACTTTTGATCAATTAACAAGGGTTCCTTTTTAAAAATGAATGGGGTCTGTTTATTTAATAATTACTGAAGTGTTGACTGTTGAACCTGATCTAACATTGCATGGAGAATCTATTTCAGCTTGGATTCTCCACAACTCTGTAAAGTTATTGAATTCTGATAATTATTTTTTAAGGATTAAGGTGGGAGAAGGTTTATCAGTTGTAAGTTATTATTGAAGTCGCAAGAAGTTTATGAAATTATGGTCCAGAAACAGAAATGAAAACTATATATATATGCCGGCACAAAATTTAGGCAATTGTCCATCTAGGAATTTATAGATTTTATCTTTCATATCTGATTTTAAGTTTGTCTTGCAAGTTTAAAACCATTCTCAATACTATTAGTTTTTTCCCCCTTGGAGATGTCTTCACTAATTTTTCATCCTTAAAATTAAATTTATTTATCAAATGTTTGGGTCCCAGTTATAGAGAAATATTTGCAAGGAATGACTTGGTACTCACTGGGAGAAGATTAGACGAGGGGTTTCTAAGAAGAATGTTGAAGGAAAATGTGGAATCACGAATTTAATTTAATTTTTTATTACTATCTTTTTTAATGAAAAACAACTTAAATGGCTTCTTTTACATTGATGAAATTGTTTCTTTCTCTAAAAGAATATATATATATATAGTATCATATCTATACATATATATTAATAGATAGATAGATAGCAAAACTTGCCTTCAGCTCTGGGTGATAGTACCCTGATATGTTGGACATTGAATGATGTATGGCACAAGTTAGGGTTGAATGTCCCAATTGTCTGGTAATAAGTTTTCACGTTCTCAAAGAATTCATGTTTTCAAAGAACTTTCTAAAGAACAAAAATTTTTATGCACAATTCCATGCGAGACCTGTCCTCAATAATTCTTTGAATTTTGTAAGTTGACCTGCAGCACCTACGTGTTAGTTTTCTTCTGAGATGGCTAGTTCTTTTTCCTCAAATGTTAGGGATATCATGGAGACACATCGAAGACATATTTTTGCGGGGATGTAAGCGATGGAATGAAACGTCTTGTGAAGGTATTTTTTGTTATTTTCAGAATAAATAATTCTATAATTACTGGCTTAATACTGCAGGTTGTTTGAAGTTTGTAGTGCAATGACCTAATGCTTCTCTTCGAATAAATTGTTCATATCTCTTGTGTACTAATTTCAATTCTATACTAACCCTGGTATTCTATTTTGCGGTACTTCTTTGCTTTTTTCCCCTTTATTATTTATATTCTTTTGGTGTTTCAGGTTACAGAGGAATGTCTCGAGAGAGGTATAGCTGTATGCAAGGATGGTGCTAGTTTTAAGAAAATTGGAAAGAGAATCAGGTATTTGATGCAATCTCAATTACATGAAATAGATTCTCTCTCTCTCTCTTTCTCGAAACAAAATCTGTGATCGAGTCATTCATTTCAAACCAGCAATATATACAAAATACTTATATCAATAGTTAAGAAGCAGAGAATCAAGGAAGATTCCTCTCCCCATTAATCATCTCTTCTAAGGATTGTTGATAAAATGCCAGTAATTCAAACATCAACAACCAGAAGATGATAAAAAGAAAGTTAAACTACTGATAAATTCATAGAGGAAGTTACTCAATCCTGTAAAAGATGACCTTATAGATGCAGTATCAAAACTGAGACCAAAGGTGAAGGAAACTGAACAAAACACCTCAAGAAGCAAGAGCAGTAAATGGAATGCTAGTCTTCAATTTATTAGATCAACGACCAATGTTTGGATACTCTTTGGTGAACTGTCCAGATTTATCACAGCTATACTAGTTTCCATCATCACTAGCATACTTATCTCCACCAACCACCTTGAGATCCACTTTGGTAGCAGTCCCTGGCCATATGCCTTATGATTTTGGGAAAACTCTTTTCTCTTCAAATATGGAAAGGAAATAGGTTAGGATTTTGGGAGGACCCTTTGCTCTTGCCATCCCCCTTTCTTTAGAATTTCCCTCACAAAAGAGAAGACCATTTACAATGTTTGGGACCCATGTTCGCTAGATTGGAATCTACAATTAAGAAGGAAATTGAAAGATGAGGAGTTTGAGGAGTGGGTGGTACTTATTATCAGATTGGATGATTTCTCCCCAAACCAAGAGGAGGATAGGCTGCTTTGGGCAAAAAACTCCAATGGGAGTTACACGGTACAGTTAGCAATTTCAGCTCTTATTAGCTCTAGGCCAAGTTGGAGGGACAATTGACTGAGCCGATTTGGGAGAAAAAAACTCCTGAAAAGGTCAAGTTCTTTTTGTGGACCCTGGCCTACAAGAGCATCAATGAACAAGAGAAGCTACAAAAGAAGAATCCGCACCATATGCTCTGTCCTCAAGCCTCTTGGAAGTGAAGAGACCATAGACCATCTCTTTATTAGTTGTTTATTGTGGCAGATGTGTGGGTTCTCAGCTTTGTAATCTGTTTAATTTGAGCTGCACCTTCCACAAGGATATTAGAATTGTTCTTCCTAAAATTATTTCAGGCTGGTGGTCAAAAGCAAAGTCTAAGATCTTTTGGATGAACGCGTTGAGAGCAGTTTTTTAGGTAATTTGGCTAGAAAGAAACAACAGAATTTTTATTACACACAAAAGTTATAGAGAAAATTTGGGAGAATGTACAGCATATAGCTTCCCTTTGGTCTAAGTTTTATGAATGTTTTTATAATTTCCCTTTTGTGATCAATTTGGATTGGGGACCTTTTTTAACACCCAGGTTTTGATTAGTTGGAAGGAGATGCCTAATCTCCTTGCCTTTGGTTGTATCTCCTTTGATCCTTTGGAATGAAATCTGTGTGTCTCTTATCCAAAAAAGAAATAACAATAATAATAATAAAAATAATTCTGATGGATATTAAACCTTTGATCAGGGGAGGACAAATGGACATTTGTGAGAACACTACTGGTAACAAAGAAAGTGAAAAGCTTTCTGGTATGAAATTGATAGTGCAGCTATAAATGAACACATTATAATTATGGTCACCGTTGGATATGTGCTGCTATAATAGTTGAATTGTCCATTGAGAGTAAAACAGTGAAATGGGATAATCAATTAGCTAGATATATTCAGAGTGAACACTCTTGGGTGCTGAGGAATGTGCTGGAAGCACGTATTTTCTATGTTCTACCATTAAGACCTAAAAATCCCAATTTTCCAAATCTTAATGCATTTGAATGATATTGCGATGCTAGTTTGAAAAATGTGGTGGGGGTCTAAAATTTGTAATGCCACTAGATTTAAGGCTTTGAGCTTCAGTGCAAGGATGCAGATGAAGAATGAGAAAAAAGAATCTATAAGATGAACATGTGCCAAACTCTTTGATCCTGTATGATGTAGGGCTGCTACTTGGTTGTCAACTTGTCATGCTGCACGTGATATCGTAACGTTCTTCTCATGAAAGTTTTTATGTTAAAGTTCCTGGTCTTTTAGTCAATGTGTAAGGATGTCAATCTTGCAATCGTCCTGAAACGTGCATCTGTGCAGAATATGAAAATATCTTTTTATAGGGAAATTTTAGTAAAATGATGAAATATACAAAAGGAAAATTCCAAGCCAAAGGAGTTCTTTAGATTTTTTTGGGAATACTTCTTGTGTGGATGTATCTTCAGTTTTGTAATTTTAAAGCATAGGTTCACGTTTCGTTTCGATCTAAATGTTGAAGTTCACTGTTTTGGTTTATAGAGTTTCAATGTGGAGTAGTTTTCTTAATTTGGTTGATGATCAAGAGCATATATTTCTGGGATCATTTAGTCTGAGGCCGTTTCATCAAAGCAAAGGAGTTTTTGGTTTTGATACAGCAGCTGATCTCATGAGAGGAATAACTCGTTTAGATTTTTATCTTCTGCTGGTTTCTTGTTAGTGATGGCTGGTTTGATTTTGTTAGTTCTGTCTGGGTTTTTAGGGGCTGTATTGTTCTTTTCTCCAGAGAGGTGTTTGTTTTTAATCTTCTTACTTTGTTCATCATCTTGTATCCTCCTAATCGGAGTTACATACTTTGTAACTTTTTACTGCTTAAGTTCTGTTTCCATAAAAAAATATAAAATATAAAATATAAAAAAATCCAAGCCAAAAGAATAACAAAGCACCTTCCAATTTTCACATATCATACGGAGTGAATCATTACAAAAAGAATTGAGTGGGCACTCCAAAAAGACGCCACAAAAATAATAATGTCCCAAAAATCTCTTTTGAAAAAGTAATGTCCCAAAAATCCTAAAAAATTCTTGTACAGCTTGAAAAACTCTAGTTCCTTCGGAACCTTAGCTCTTAAAAGATAGGTCTCACCTCGTTAAGCTACAAATTTCTGCCTTGTCCCTAAACGGTGGCATCTAAGCATGAGCAAAGGGGAGTCCTTGAGGTTGAGGCTTCTAGGAAAAACCAGGTGTGTATGGAATATACTCAAAATCCTCTCCCACGCCTTGATAGCTAAGAGCCACCATGAAAAAATTGATAACCCATTTATCACTATTCTTGGTGCAATTTTTCTGTTCTATACGGAAGTGGACCTAGCATTTACTCTATCCTGAATAGCTTACTAACGTATTTCAATAGCACACATGTTTACATGTTTAAGCAAGCATACATATTGATCAGGAGGTCATAGCTGCTTTACACAATTCACTTAACTGTTAATTTATAAAGCTATAGCGTCCTACTAACTCTAGTTCGAAGTGGTTATCTGATAGGCCCCTGATTAAGAAAGTATATGGTAGAAAGAGGGGTAGAGCAGTTCAATGAGTAGGAAGGGGTGGTTGTGGGGCCCTCCTGTGGGACCCATGTTAGTTACCAGTGTATATAGTAAAGGTTAGAGAGGAGAGGGGTATCTTTTTGACTAATCCCGTAAGTGAGTGTGAGGAGAAAGACTTTAAAGGTAAGTGAGTGAGTGTAAGTAAGTGAGTGTGAGTGTGTGTCACCCAAGACAAAGACTTTAAAGGTGAAAGGAGTGTTGGCAGATACGGAGGTTATTGTGTTAACAGATAGCGGGGCGACACATAACTTCTTTTCATCTGCGTTAGTTAAGGAATTGAAGGTGCCAGTGAGTGAAACTTCAGGATACGGGATTGGTTTGGGCACGGGTGATGAAGTTGAATCAGTGGGGGTTTGTAGGATTGTGGTGTTAACCTTGTCAAAGTTGACTGTGATACAGGATTTTTTGCCAATTCTGCTAGGGAGTGCTGACGTGATTCTGTGGGTTCAGTGGCTATCAACCTTGGGAGCGATTACCTTGAATTAGGATGTGTTAACCATGGAATTCAAAGTAGGCAATTGCAAAGTATTATTAAAAGGGGATTCGAGTTTACAAAAGACTCAAGTGTCCCTCAAGGACATGATGCGTGTAATACGGAAAGAGAAACAAGGCATTTTACTCAAGCTGAATGCAACCGACATGTGCCAAATGGGAATAGAGAAAGCGGGGAGCATTGACATTCCAGAGGAATTGCTGGTAGTTTTACACCAGTACTCGTCCATTTTTCATCCTCACCAAGGTTTGCCTCCTGTCACGACAGGTATCATGCTATTGAGTTGATGCCTTCAACGGGGTCAATGAATGTACGTCCATATAGATACCCATAATTTCAGAAAGATGAAATTGAAAGGCTGGTGCGGGAGATGTTGCTGGCAGGAATTATACAGCCCAGCAAAAGTGCTTTCTCTAGTCCGGTTCTCCTTGTTCGAAAGAAGGATGGGAGCTGACATTTTTGCATAGACTATAGAGCATTTCGATTTGAAGTCCAAGTACCATTAGATCAGAGTGCGACCGAAAGATGTTCATAATACTGCCTTTCGAACTCATGAGGGGCATTATGAATTCTTGGTGATGCCTTTCGGTCTTCTCTTCTGAACGCACCATCTACTTTCCAGTCGCTTATAAACGACGTATTACGGCCTTTTCTTCGTAAGTTTGCTTTAGTATTTTTTTATGACATTTTGGTCTACAGTGTCTCTATTCAGGAGCATAAACATTTAGCAGCGGTTTTGGAGGCTTTGTCCAAGCATTAGCTGATTGCGAATGAAAAGAAATGCCAATTTGGACGGCCCCAGATTGAGTACCTTGGTCACATTGTCTAGGGTCGAGGGGTTGCCGTAGATCCTGCTAAGATCACAACCATGAAGGACTAGCCAGTACCTACTAATTTGAAAGATTTGCGAGGTTTCCTTGGCTTCACTGGATATTAATGTAAGTTCGTTGCTAATTATGGTGCAATTGCTTTTCCTTTAACACAGTTGCTAAAAAAAGATAGTTTTGTGTGGGGGGAGGAAGCGAATATTGCCTTTGAGTCATTGAAGTCCGCTCTGGTGACAGTGCCGGTCTTAGGACTCCCTGATCTCTTAAAAACATTTGTCATTGAATCTAATGCCTCTGGAGTTGGGTAGGGTGTTGTGTTGATGCAAGACCAACGTCCCCTTGCATACTTCAGTCATGCTTTGTCCTCTGCACATAGACATAAAGCAGTTTATGAATGAGAGCTTATGGCGATTGTTATGGCAGTCCAAAAGTGGAGACCTTATCTGCTTGGGCGGCATTTCATTGTGCACACTGATCAGCAGGCTCTAAAATTTCTCTTGGATCAACGGGTTATTCCAGGGGAGTATCAACGGTGGATAACGAAGCTCATGGGATATGATTTTGACGTTGAATATAAGCCTAGGTTGGAAAAGAAGGCGGCTGATGCCCTCTCGCGACGGCCTAAGACGGTGGATTTGCACAGTATTACAGTGGTGGGGTGTATCAATGCGGCTGTCATCATTGAGCAGATTCAAAGGGATGAAGAGTTGTCCAAAATTGCCATAGCCTTACGTGATGGGAAGGAGGCACCTGCAGATTACTCCTTGCGAAGTATCTTTTATATTACAAAGGAAGGTTGGTCATTCCTACAAATTCGCCCTCTATTCCCTTTGATTCTCCAAGAATTTCATTGTTCTCCGGTTGGAGGTCATAGTGGGTTCCTTAAAACGCTACAGCGGGTTGCTAAGGAGGTTTATTGGAAGGGGATGCGGAATAGGGTCCGGGTGTTCATAAGTGAGTGTGCCATATATCAACAAGCTAAATATTTAGCACTTGCTCCGGCGGGGTTGTTCCATTGCCGATTCCGGATCGTATATGGGAGGATGTTTCAATGGACTTCATCGAGGGGTTACCGTAGTTAGAAGGGTATGATTTGATATTGGTCGTGGTGGACCGGTTATCTAAGTATGCCCACTTCATTCCTCTTAAACATCCCTTCACAGTCGTGTTCATAGCCCAAGTTTTCGTGAGGGAAGTTTTACATCTTCATGGGGTGCCGCGTGTAACAATCGTGTTCTCACAAAGCTCACGAAACGATTGTGCGGCACTTGCTTCCGCTCTCAAGCAAGTAAGCCTAATTGAAATGACACAGCCAACGGCACAATGGGACTATCTCGCAACCTCCGGCCACGTTTCGGGAAAAACGGTTTTAGAAAACGTGGTTGGAGAAAAGGTTTTGGAAAGCATTGGAAAGCATTTTTGAAAGCATGTTGTAGGCTTGAAAGCAATAAGATAAACAACGGCTTAATAAACATACAACTCAAGGTAGGGAGGATGGTCACTAGTGGTACCCCCTATAGTACATTATAAAGCATTTTAACTATGGCAGGAGTAGACATGCAAAGTGGCGAGGAGTCACTTGCCACCAAAAGTACAAGGAAATGAAATGAACATTCAAAACTAAATACATGGTGTGACAGCATGAGGGTAGAAGACGTTATGGGCATGCGGCCACAGGCAATAGGCCCTAGACGAGCATGACCATGACATTCTCCCCCACTTAAAGGGTTGACGTCCCTGTCAACCGTTGTCGCTCGAACTCCTCGATCTGAGTTGCAGCTGACTTTAGGTCCTCGACACGCTCCCAACTAATTTCTTCATCTAGAAGCCCTTTCCACTTCACAAGAAATTCTTGTACATCGTGTCGCGGTCTGCCAACGCTTCGTGTTCTCTCTGCTAGAATCTCTTTGACTTCCTTCTCTATTGTATTCTTCATTGCGACTAGCGGTCTCGCTGTTTCATTTCGATCATTGTCGTCTAGATCTGGGTGAAAAGGTTTCAGGTTACTGACATGGATGATGGGGTGAATTCGCATCTATGATGGCAACTGTACTCTGTATGACGTCTTGCCTACTTTGTGAATTACTTCGATGGGTCCCTTATACTTCCTCACAAGCCTTTGGTCTTTGTTCCCTCTAAACCGAAGTTGTTCGGGTCTTAATTTGATAAGTACTTGGTCCCCTGCTTGGAATTTCAGAGGTCTACGCTTCTTGTCTGCCCACTTCTTCATTCGCTTGGACGCTTTTTCTAGATAAGCTCGGGCCACTTCTGATGTCTGTTTCCATTCTTTGGTGAAGTTAAAGGCCTGTGGACTCTTTCCCGCATAGGGGTGGTCGACAACATGGGGCATGAGCGGTTGTCTTCCACACACAATCTCAAAAGGAGTTTTCCAGTTGAAGAGCTGTGCTGGCAATTGAAGCTGAATTGGGCTACGTCAAGCATCTGAACCCAATTCTTTTGTTTGGCATCAATGAAGTGTCGTAGGTATTCTTCAAGCATACTATTGAAGCGCTCTGTTTGGCCATCTGTCTGTGGGTGATAACTAAAGGATATGTTCAAACTGGAGCCCAACAACCTGAACAACTCTGTCCAAAAGGTGCCGGTGAATCGCCCATCTCGATCGCTGATTATGCTCTCTGGGACCCCCCACAGTTTAAAATATTCTTGAAGAACAATTGCGCTGTCATCTCTGCGGAACACATCTTCGGGGTTGGGACAAACGTAGCGTACTTTGAGAATCTGTCGACAATCACGAGGATCGCCTCAAACTCTCCTACCTTCAGAAGGTGTGTGATGAAGTCGAGGGAAACACTCTCCCATGGCCTTGATGGCACTGGTAGAGGTTCTAGCAACCCTGCTATTTTAGCTCTTTCGACCTTGTCTTGCTGGCAGATAAGACAAGTCTTAGTGAACTGCATGACGCCATCTCGTAAATTTGGCCAGTAGTAACCTTTTTTCAACAAGGCATAAGTTCGCTGCCAACCCGCGTGGCCTGGCCACAAGGTGTCGTGACACTCTTGCATCTAAGTCTTTCTTAGGTTCCCAGATCGAGGAACATAAAGGCGATTGCCCTTTGTGAGGTGAAGACCATCTTCTACTCAGAACTGTCGCGTCTTTCCATCCTTGGCCAGTTGGACTATCGCTTGGGCTGTTGGATCATTTTCCAAGTGGCTCTTGATGGCTTCACGGAGAGTACCAGCTATCTTACTGGCCTGTAAGTGAGCTAACATTCACAGAGCTGCATGTTCACTCTTTCTGCTGAGGGCATCTGCCGCCTGATTAGTTCGTCCAGACTTATGTTCAAATTGGAAGTCAAACTTTGAGAGGTACTCTTGCCATCGAGCCTGTTTGTAGGTCAACTTGTGTTGCGTGAAGAAGTGACAAATGGAATTGTTGTCTATTTTGACAGTAAATTTGGCCCCTAGAAGGTACTGTCTCCAAGCGCGCACGCAATGCACAACAACTAACATTTCTTTCTCAGAAGCAACGTAACTCCTCTCGGCGTCGTTCAATTTTCGACTCTCATATGCAATGGGGTGTCCATTCTGGAGAAGAACGCCACCCAAGGCAAAGTCTGAAGCATCAGTTTCTACTTCGAAGGGTTTTGTGACATCGATGATCTCGAGTATGGGCCTTTCTATCATCGCTCTCTTCAACTCGTCAAAAGCCGCTTGACATTTTGGGTTCCAACTCCAAGTGCAGTCCTTTTTCAACAATTCAGTTAATGGCCCCGCCTTTTTCGAGAATCCTTCCCCAAATCTTCGATAGTAATTGGCTAGTCCCAAGAAGGATCGCAATTCCGTGACTGTAATCGGCACCTTCCAGTTTTGGATGGCCTTAATCTTTCCATCTTCCATATCGATCCTTCCACATTCTATTATATGACCTAAGATGGTGATGCGTCTTTGTGCAAAGGAACATTTTTTCCTTTTCACATATAGCTGGTTCTTCCTTAATTTCTTGAAGACGAGTTGTAGGTGAAGTTTATGTTCTGCCAAGGTCGAACTGTAGACCACAATGTCATCCAAGTAAACCACAACGAATTTGTCGAGGTACTCGTGAAACACTTGGTTCATAAGGGTACAAAATGTGGCTAGGGCATTGGTAAGGCCAAAAGGCATGACTAGGTATTCAAAGGCCCCATTCCGTGTGACGCAGGTTGTCTTAGGTTCGTCTCCCTCTGCAGTACGTACTTGGTAATATCCAGACCTCAAGTCTAGTTTTGAAAAATGTTTTGCTCTATATAATTGGTCGAATAGATCGGTAATGATGGGTAATGAATATTTATTGAGGACTGTGAGCTTGTTCAAGGCACGGTAGTCTATGCATAGGCGGATGCTACCGTCTTTCTTCTTCTGAAAAAGTACTGGGGCTCCATAAGGAGCCTTTGCCGGACGGATAAATCCTGCACTCAATAACTCATCAAGTTGTTTTCTAAGTTCTGCCAACTCGGGAGGAGCCATACGATAGGCGTTCTTTGCGGGCGGTTTCGTTCCCACAACTTCGATGGTGGATCTAAAGTCGTTCGAAAGATCGCGAACCATTTGAAGCATCAATTTATGGGAGCTGTCCAATTCGTCGACGCGCCCTCCATATGAACTACAGAGCTCGTCGAGCTATCCCCACGCTCGAAGCTATTAGTTCTCGTAGTTTTGTTTTCAAGGGTCTCGACCCTTAACATCAGTTCTTGGACGGGTAACGCATCTATTTGGGCGTTCACCGCGTCAATTCCATCTACTTTTCCTGGCATCTCATCGAGCCGAGATTCCAGGAATCGAACATTATCAGGGACTTCTTGCAGAGACAACATCTGTTCCTCCATCTCCACAAGTCTTTCAACATGGGTCTTGTTTAACTGTTTCGATGCCGACATGATTACTGTCTTTTGATCTGAGGAGCCAACTAGGCTTTGATACCAACTGTCACAATCGTGTTCTCACAAAGCTCACGAAACGATTGTGCGGCACTTACTTCCGCTTTCAAGCAAGTAAGCCTAATTGAAATAACATAGCCAACGGCACAATGGGACTATCTCACAACCTCCGGCCACGTTTCGGGAAAAACAGTTTTAGAAAACGTGGTTGGAGAAAAGGTTTTGGAAAGCATTTTTTTGAAAGCATGTTGTAGACTTGAAAGCAGTAAGATAAATAACGGCTTAATAAGCATACAACTCAAGGTAGGGAGCATGGTCACTAGTGGTACCCCCGTATAGTACATTATAAAGCATTTTAACTATGGTAGGAGTAGATATGCAAAGTGGCGAGGAGTCACTCGCCACCAAAAGTACAAGAAATGAAATGAACATGCAAAACTAAATACATGGTGTGAAAGCATGAGGGTAGAGGACGTTATGGGCATGCGGCCACAGGCAACAGGCCCTGGACGAGCATGACCATGACACCGCGCAGCATCGTGTTCGATAGGGACAGAGTGTTTACGAGCCTCTTTTGGGAAGAGTTGTTTTGCTGTCAGGGTACTCAACTTCAGCGCAGTACAACGTACCATCCTCAATCGGACGGGTATACGAAAGTTGTGAATAGGGGGTTGGAAACGTATTTGCGTTGTTTCACTATGGCTTCTCCTTCTAAATGGGCCAAGTGGCTGTCATGGGCCGAGTTTAGCTACAATACTTCGTATCATATAGCCACTAAAACAACCCCGTTCGAAGCTGTATATGGGCGAGCACCCCTTACTGTGTTGCCTTATACTTCCGACACGTCCCCTGTTTCCACCATGGATCAGCAGCTCAAGGAACGTGACCAGATGCTTCAACAATTGAAGATCCATTTGCACGCAACCCAGCAACAAATGGTTAATCAAGCTAATGTCAAAAAGAGGGAGGTGATGTTGAATGTGGGGGATTGGGTTTATTTTAAATTCAAACCTTACCACCAAGATTCGTTAGCTCCTTGCTCGAATGCGAAGTTGGCCCCACGGTTTGTAGGACCCTTTTGGGTGATCCAGCGGGTAGGTTCAGCGGCTTATAAGTCAGCGCTGCCGGATTCCCCCACTGTTCATCTGATTTTTCACGTGTCCCAGTTGCGTAAAGCTGTCTGGCACACTTTGCCTGTTATCCCTTTGTTGCGTAACCTTATGAATGAGGGGGTACTCCAAGCCCAGCCGGCAGAGTTATTGGGTGTTTGTTCTACTGGGGATTCAGCTGGTGATATTGAGGTGTTGGTCCATTGGGAGGGGGCGACCGCACAAGAAGCTACTTGGGAATCAACTACAGCAATTCAAGCCCAATTTCCTGATTTTCACCTTGAGGACAAGGTGTTTCTTTGGGGGTGGGGGTGGGGTAATGATAGGCCTCCGATTAAGAAAGTATATGGTAGAAAGAGGGGTAGAGCGGTTCAATAAGTAGGAAGGGGTGGTTGTGGGGCCCACCTGTGGGACCCATGTTAGTTACCAGTGTATATAGTAAAGGTTAGAGAGGCGAGGGGTATCTTTTTGGCTAATCCTGTAATTGAGTGTAAGGAGAGGTAGGGAGCTCTTGAATCCTCCCTTAGCTCCTTATAACTCTCTCTCAGATATCAATAATAGACTCTCGTTCCATCGTTATCATCCATATAGTTTTATATACTGCTGACAACTTCATAACTATGATGGATACGTGCCATTAGTTAATTTATACTGTGCCACATTGATGGATTGATATTTTCTACACTATTTCTAAAGATCTACTACGCACAATTACCAGTGAGCATGCTGAAAAATATGGCTATGGGGTGGTGGAGCGTTTTGTTGGGCATGGTGTGGGAACTGTATTTCATTCTGAGCCCCTAATATATCACCACCGTAAGTGGTCCACTCTCAATTTTTTTACGATCCTTTTGCTGTTATCTTATTACATGGTAGTCTGTTTTTGTAACTCATTGTTATTTGGTATGGCCAGGCAATGACGAACCTGGTCATATGGTCGAAGGTCAAACTTTTACAATTGGTGAGTGGTCGACCTTGTTGCTCTAAAATCAATATTTCCTTTCTGTGTTCATGCATTTTTAGAAGCTTATCTTTAGTATCTAATTGAAAATTTCTTTTGTTCGTTAGATGAAGGATAGTTTTGTTATACATAGACATTACTTTGAAGTATTAAGAACAAATAGAGATATTAATTTTCCATTGGAATGAAAATTAACAAAATGAATACATGTGAAGTAAGGTTTCACAACCAGCACAATTGTGCGTCCAAGTAATTTTAGGAAAAATTTGTCCAAGTGATGTGGCATATTATTGATTCAAGGTAGAGGAAAAATATCAAATAAAACTGAAATATAAATTTTATAACATCATTTGCTCGAAGAAAACCCTTTTTTGCCTTACAAGAAACAATATCATCATCATTAGAGGGTCGAATGTAGAAGAAAGGGAAGACAATTCATTCTTTCCTAAAAAATCCAAAGGAATTATAGAAAGGCTCTCTCATCGTTTTGAATATTAAAACAAATTTTATATTAAAAGAATTATAGTTACAAAAAGATTTAGCCTAAGAAACCATCCATATAACCTATCAAAGTTGCCTTTGCATGTTCTTAATCTACAAATACATCCATCCTGAAAAACACAACGATTTATCTCAAGCTAAAATTTCCAAAATGCAGCACCAGTTAAGTCTGCTAAACCTTTGGTTTTCCATACAAACCATGTCCCAAAAATATTGAAATGATAAAATCATCAAGATGATGTGGGCGGCACCATTAAAGATTGAAGGTGGTAGCAGTTCCAAACTATGCTGGAGTCCAAAAAGAGAAACTTACTTAGTTACTTACCATTGAAGGTGGTAGCAGTTCCAAACTATGCTGGAGTCCAAAAAGAGAAACTTAGTTACTTACCATTGAAGCTGGTAGCAGTTCCAAACTATGAGCATAAGTTTGGATTCTATATATCCAAAATATTGATTTTAAGATACTTCAGAGTTGAAAAAGAAAAAATTACTTGTTTAGAGGTCTGATGTCCAAACAAAACATGTGAAACATATAGCCCATTCTATAGCTTCCCTCTCAATATGATTACTCCAAACTCTCAGGTTCGAACTTGCTCATCAGTCTAAACCTCTTTTATAGTGTTCTGATGGCTGTCAGCCAATATATATATAATTTAGGAAAAAGAATGTGGAAGGCTGTGGAAGGCTATTCCAGCTCTCCAAGGATCCTTGCAAAAATTAATTCTACAGTCAAGTTCCTGCACTTTTCCAATATGAAACCATGGGACATCTCGAGCTGCTAAATCTACAAGAAGTAAACCCCCTGTAGGAATCAGTTCCATGCTCTCAATAATGCAGTGCAAAAAATTACTGTGCTCTCTAGAGAATCTCCAAAGCCATTTAGAAATCTTAGCTAGATTTTCTTCACCACATTGTGGATACGAAGCTCTCCAGTTCTTTGGGAGGGAGGCCTCCCCCAATTGACTAAATAGCACCCACCCTCCGAATGAAATTCTAAATTTAGTTACTCCTTACGTTTATTTTTGGGTAATGATACACATACGAATAGGAACTCGGTGCTTCTCAAAAATAATTTTTTTAGTTGGACTCCTTGGTATTCCCTCTGCGGCAGTTTCCTGTCTTAAATATTCCACTTTTGATGTATATATATCTTTTAAAGCTTGTGACATGGTCAGTTGTGGAAAGCGAAAACTGCTCCCTCTAAACCCAGCCTCTTTCATTTTTGATCCTCCCACCACACCAAAACCTTGATATCTCTGAAGTTATGCTCTATTTTCTTATCTAAGAAGTATGTACTGTCTTGTCTATGATTTGATAATACTAATTATGGCTTATGCGCATAAGATACTAATAGGAAGGAGAAGCCACCTGTGCACAGAAGGGGGGAGTATGTGAGGGTAAGAAAGGGATTTGTATGGGGAAGTTGAGAGTTGGCGCGACTAAAAGAAATGGGTGTGATGTAAGTCTGCGTGTGTGCATGATCTCATGTAAGATTTCGGTGGTTAGCAGGTGTGGTTATTAGTCTTATTCAGTCAGGAAGATTACCTATAAGTAAAACTGCAAATCAAGGGAACGATAGGCGATAGTGAATGTTAGGGAGAGACTCAAACCTCTCGAAAGTCTCTGGAATTTTGGCTTTTAATCTCTCTTGTGGCATTTAATTGAACATACCTCCAAACTGACTTCATTTTATATTCACAGTTTTAATTTCTTCACTGCAGAACCAATTCTTACAATGGAAGGCATTGA

mRNA sequence

ATGGCTCTGCAAGAGACGTATAATCGGCCTTCCCCATTAGATTTTTCTCTTACTCATAAACTAGAGGAACAGCTTGACCGACTTCTTGCGGAGGAGGAAATATATTGGAAGCAATGGTCCCATGAAAACTGGCTGAAGTGGGGTGACCATAATATGAAATGGTTTCATAGAAAGGCGTCTATGCGAAAGAAGAAGAACAAATTTATTGGGATTTCGTATGTTCATGGCAATTGTTCTTTACTGATCGTTGGCTTTCGAGAGAAAGTTCTTTCCAGACTATTACGCAAAATTTGTCCAGTAATGAGATGTTGGTGGTTGATTTTATTTCCCCCTCTATGTGCCGGGATGTGGGGGACATTGAGGTCTTATGTTTGTAATGAAGATCTCGAAGTTATTTTAAAAACTCCAATCAGTATCATGGATCAGCAGGACAGTTGGGTGTGGGATTACAACGAAGAATACAGTGTCAAGAGTGGATACAAATTGTTCATGTTGCTAAAACATAATGAAACTACTTTTTATGCTGCCGCTATGAATGAGTGGTGGAAGTGTGTCTGGAAGCTTCAAATTCCTTCCAAAATTAAAATATTCATCTGTCATGCTTTTCATGATTGTATCCCTACGTTATGGAATCTAAAGAATAGGGGAGTAAGCGTGGCGTCTGATTGCCCTTTATGCTTGAAAAGAGTGGAAACCACTAACCATACTCTGTTTACCTATAAACGGGCAAAATCCGTATGGAGAACTATTTTACCATCAGTTCCAGTTGAGAGGAGTTCGGCAAAGGTAGAGGTTGTACTTTTTCTAATTCGACAAACAACTTTGGGAATGAGAAGGCGCGTAATGGGGGCTGGGTGGATTCGACCCAAGAGCAACTGGATCAAGATTAACGTCGATGTAACTTGCAAACGAAACCCACTGGGAACTGGATTAGGGGTAATATGTCGTGATGATAGGGGTTTACTGCTGGCAACGATGTTGTGTTATCTTCGAGTTACTTATGTATTTTTACAAGCTGAGATTCTTTCCATTCTTGACGCACTCCAGTTGGCTCAACATTTAAATTTCAAGAAGGTTTGTGTGAAATTAGATGCCTTTCAAGCAATACAAGTCATTAACAATAGGGATGCCACTTTACTGTCTGTTGGAACGTGGGTGGAGGACATCCAAAATCTCATTGAGAGTTTTGAGGCGGTTGAATTTTCTATCTGCAAAATGGCATTGGGCGCTTCGCTCGTTCGAGTTTCATTACTGAAGCTCTCTGCAGTCTCACATGGTGAAGCTTTCTCACCTTCGTCGAAGTCGGCGTCCTTTATGGGCGCGCCACTTAATTTTCTTCCCCCTTCTCGCTCTGGGAAACAAAAGCATTTTCATGAGAATCTTGTCGTCATATCGAAGAGAATATCGGGTCTGGAGGAAGCCATGAGAATTAGGAGAGAGCGAGAGCTTGAAATTGTGAGAAAAGTTAGAAAGAGACAACCATTGAGGCGTGGAAAGGTATCTCCACGTCTTCCTGTGCCTGATCACATACAAAAGCCTCCTTATGTTGGTTCTTCTATACTGCCAGAAATTTCAAGTGAGTATCAAATGCATGATTCTGAAGGAATTGCTCAAATGAGGGCTGCATGTCAGCTTGCTGCTCGTGTGTTAGACTATGCAGGAACACTAGTGAGACCCTCAGTAACAACAAATGAAATTGACAAAGCAGTGCATCAGATGATTATTGATGCTGGTGCTTATCCTTCACCTCTTGGCTACGGGGGATTCCCAAAGAGTGTTTGCACATCAGTTAATGAGTGCATGTGTCACGGAATACCGGACTCTCGTCAATTACAGAGTGGTGATATAATAAACATTGACGTGACAGTCTACCTAAATGGATATCATGGAGACACATCGAAGACATATTTTTGCGGGGATGTAAGCGATGGAATGAAACGTCTTGTGAAGGTTACAGAGGAATGTCTCGAGAGAGGTATAGCTGTATGCAAGGATGGTGCTAGTTTTAAGAAAATTGGAAAGAGAATCAGTGAGCATGCTGAAAAATATGGCTATGGGGTGGTGGAGCGTTTTGTTGGGCATGGTGTGGGAACTGTATTTCATTCTGAGCCCCTAATATATCACCACCGCAATGACGAACCTGGTCATATGGTCGAAGGTCAAACTTTTACAATTGAACCAATTCTTACAATGGAAGGCATTGA

Coding sequence (CDS)

ATGGCTCTGCAAGAGACGTATAATCGGCCTTCCCCATTAGATTTTTCTCTTACTCATAAACTAGAGGAACAGCTTGACCGACTTCTTGCGGAGGAGGAAATATATTGGAAGCAATGGTCCCATGAAAACTGGCTGAAGTGGGGTGACCATAATATGAAATGGTTTCATAGAAAGGCGTCTATGCGAAAGAAGAAGAACAAATTTATTGGGATTTCGTATGTTCATGGCAATTGTTCTTTACTGATCGTTGGCTTTCGAGAGAAAGTTCTTTCCAGACTATTACGCAAAATTTGTCCAGTAATGAGATGTTGGTGGTTGATTTTATTTCCCCCTCTATGTGCCGGGATGTGGGGGACATTGAGGTCTTATGTTTGTAATGAAGATCTCGAAGTTATTTTAAAAACTCCAATCAGTATCATGGATCAGCAGGACAGTTGGGTGTGGGATTACAACGAAGAATACAGTGTCAAGAGTGGATACAAATTGTTCATGTTGCTAAAACATAATGAAACTACTTTTTATGCTGCCGCTATGAATGAGTGGTGGAAGTGTGTCTGGAAGCTTCAAATTCCTTCCAAAATTAAAATATTCATCTGTCATGCTTTTCATGATTGTATCCCTACGTTATGGAATCTAAAGAATAGGGGAGTAAGCGTGGCGTCTGATTGCCCTTTATGCTTGAAAAGAGTGGAAACCACTAACCATACTCTGTTTACCTATAAACGGGCAAAATCCGTATGGAGAACTATTTTACCATCAGTTCCAGTTGAGAGGAGTTCGGCAAAGGTAGAGGTTGTACTTTTTCTAATTCGACAAACAACTTTGGGAATGAGAAGGCGCGTAATGGGGGCTGGGTGGATTCGACCCAAGAGCAACTGGATCAAGATTAACGTCGATGTAACTTGCAAACGAAACCCACTGGGAACTGGATTAGGGGTAATATGTCGTGATGATAGGGGTTTACTGCTGGCAACGATGTTGTGTTATCTTCGAGTTACTTATGTATTTTTACAAGCTGAGATTCTTTCCATTCTTGACGCACTCCAGTTGGCTCAACATTTAAATTTCAAGAAGGTTTGTGTGAAATTAGATGCCTTTCAAGCAATACAAGTCATTAACAATAGGGATGCCACTTTACTGTCTGTTGGAACGTGGGTGGAGGACATCCAAAATCTCATTGAGAGTTTTGAGGCGGTTGAATTTTCTATCTGCAAAATGGCATTGGGCGCTTCGCTCGTTCGAGTTTCATTACTGAAGCTCTCTGCAGTCTCACATGGTGAAGCTTTCTCACCTTCGTCGAAGTCGGCGTCCTTTATGGGCGCGCCACTTAATTTTCTTCCCCCTTCTCGCTCTGGGAAACAAAAGCATTTTCATGAGAATCTTGTCGTCATATCGAAGAGAATATCGGGTCTGGAGGAAGCCATGAGAATTAGGAGAGAGCGAGAGCTTGAAATTGTGAGAAAAGTTAGAAAGAGACAACCATTGAGGCGTGGAAAGGTATCTCCACGTCTTCCTGTGCCTGATCACATACAAAAGCCTCCTTATGTTGGTTCTTCTATACTGCCAGAAATTTCAAGTGAGTATCAAATGCATGATTCTGAAGGAATTGCTCAAATGAGGGCTGCATGTCAGCTTGCTGCTCGTGTGTTAGACTATGCAGGAACACTAGTGAGACCCTCAGTAACAACAAATGAAATTGACAAAGCAGTGCATCAGATGATTATTGATGCTGGTGCTTATCCTTCACCTCTTGGCTACGGGGGATTCCCAAAGAGTGTTTGCACATCAGTTAATGAGTGCATGTGTCACGGAATACCGGACTCTCGTCAATTACAGAGTGGTGATATAATAAACATTGACGTGACAGTCTACCTAAATGGATATCATGGAGACACATCGAAGACATATTTTTGCGGGGATGTAAGCGATGGAATGAAACGTCTTGTGAAGGTTACAGAGGAATGTCTCGAGAGAGGTATAGCTGTATGCAAGGATGGTGCTAGTTTTAAGAAAATTGGAAAGAGAATCAGTGAGCATGCTGAAAAATATGGCTATGGGGTGGTGGAGCGTTTTGTTGGGCATGGTGTGGGAACTGTATTTCATTCTGAGCCCCTAATATATCACCACCGCAATGACGAACCTGGTCATATGGTCGAAGGTCAAACTTTTACAATTGAACCAATTCTTACAATGGAAGGCATTGA

Protein sequence

MALQETYNRPSPLDFSLTHKLEEQLDRLLAEEEIYWKQWSHENWLKWGDHNMKWFHRKASMRKKKNKFIGISYVHGNCSLLIVGFREKVLSRLLRKICPVMRCWWLILFPPLCAGMWGTLRSYVCNEDLEVILKTPISIMDQQDSWVWDYNEEYSVKSGYKLFMLLKHNETTFYAAAMNEWWKCVWKLQIPSKIKIFICHAFHDCIPTLWNLKNRGVSVASDCPLCLKRVETTNHTLFTYKRAKSVWRTILPSVPVERSSAKVEVVLFLIRQTTLGMRRRVMGAGWIRPKSNWIKINVDVTCKRNPLGTGLGVICRDDRGLLLATMLCYLRVTYVFLQAEILSILDALQLAQHLNFKKVCVKLDAFQAIQVINNRDATLLSVGTWVEDIQNLIESFEAVEFSICKMALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVISKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHHRNDEPGHMVEGQTFTIEPILTMEGIX
Homology
BLAST of Sgr011659 vs. NCBI nr
Match: XP_038874534.1 (methionine aminopeptidase 1B, chloroplastic [Benincasa hispida])

HSP 1 Score: 631.7 bits (1628), Expect = 7.8e-177
Identity = 314/329 (95.44%), Postives = 321/329 (97.57%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA+GASLVRVS LKLS+VSHG+ FSPSSK ASFMGAPLNFLP  RSGKQK FHENLVV+S
Sbjct: 1   MAIGASLVRVSSLKLSSVSHGDDFSPSSKLASFMGAPLNFLPSYRSGKQKKFHENLVVVS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS
Sbjct: 61  KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEID AVHQMIIDAGAYPSPLG
Sbjct: 121 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDNAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVG+VFHSEPL
Sbjct: 241 RRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGSVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. NCBI nr
Match: XP_022155198.1 (methionine aminopeptidase 1B, chloroplastic [Momordica charantia])

HSP 1 Score: 627.9 bits (1618), Expect = 1.1e-175
Identity = 312/329 (94.83%), Postives = 319/329 (96.96%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA GASLVRVS  KLS VSH EA SPSSKSASFMGAPLNFLP SRSGKQ HFHENLVV+S
Sbjct: 1   MAFGASLVRVSSPKLSLVSHAEALSPSSKSASFMGAPLNFLPSSRSGKQGHFHENLVVLS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KRISGLEEAMRIRREREL IVRKVRKRQPLRRGKVSPRLPVPDHIQKPPY+GSSILPEIS
Sbjct: 61  KRISGLEEAMRIRRERELAIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYIGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           +EYQMHDSEGIAQMRAACQLAARVLD+AGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 TEYQMHDSEGIAQMRAACQLAARVLDHAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLE+GIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL
Sbjct: 241 RRLVKVTEECLEKGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. NCBI nr
Match: XP_004147182.1 (methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucumis sativus] >KAE8651804.1 hypothetical protein Csa_006683 [Cucumis sativus])

HSP 1 Score: 612.5 bits (1578), Expect = 4.9e-171
Identity = 306/330 (92.73%), Postives = 318/330 (96.36%), Query Frame = 0

Query: 406 MALGASLVRVSLLK-LSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVI 465
           MA GASLVRVS LK LS+VSHG+ FS SSKS+SFMGAPLNFLP  RS KQK FHENLV++
Sbjct: 1   MATGASLVRVSSLKLLSSVSHGDDFSSSSKSSSFMGAPLNFLPSYRSRKQKPFHENLVIV 60

Query: 466 SKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 525
           SK+ISGLEEAMRIRREREL IV+KVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI
Sbjct: 61  SKKISGLEEAMRIRRERELGIVQKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 120

Query: 526 SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 585
           SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL
Sbjct: 121 SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 180

Query: 586 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG 645
           GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG
Sbjct: 181 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG 240

Query: 646 MKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEP 705
           M+ LVKVTEECL+RGIAVCKDGASFKKIGKRISEHAEKYGYGVV+RFVGHGVG+VFHSEP
Sbjct: 241 MRNLVKVTEECLDRGIAVCKDGASFKKIGKRISEHAEKYGYGVVDRFVGHGVGSVFHSEP 300

Query: 706 LIYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           LIYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 LIYHHRNEEPGHMVEGQTFTIEPILTMGGI 330

BLAST of Sgr011659 vs. NCBI nr
Match: XP_022974354.1 (methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 606.3 bits (1562), Expect = 3.5e-169
Identity = 300/329 (91.19%), Postives = 314/329 (95.44%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA+ ASLVRVS LKLS+VSHG  FSPSSKS+SFMG PLNFLP  RSGKQK FH+NLVV+S
Sbjct: 1   MAISASLVRVSSLKLSSVSHGHDFSPSSKSSSFMGVPLNFLPSFRSGKQKQFHDNLVVVS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KR SGLEEA+R  RE++LE VRKVRK  PLRRGKVSPRLPVP+HIQKPPYVGSSILPEIS
Sbjct: 61  KRTSGLEEALRNLREQKLETVRKVRKIPPLRRGKVSPRLPVPEHIQKPPYVGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQ+GDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQNGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYG+GVVERFVGHGVG+VFHSEPL
Sbjct: 241 RRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGFGVVERFVGHGVGSVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. NCBI nr
Match: XP_022922375.1 (methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucurbita moschata])

HSP 1 Score: 604.4 bits (1557), Expect = 1.3e-168
Identity = 298/329 (90.58%), Postives = 314/329 (95.44%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA+ ASLVRVS  KLS+VSHG+ FSPSSKS+SFMG PLNFLP  RSGKQK FH+NLVV+S
Sbjct: 1   MAISASLVRVSSFKLSSVSHGDDFSPSSKSSSFMGVPLNFLPSFRSGKQKQFHDNLVVVS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KR SGLEEA+R  RE++LE VRKVR+  PLRRGKVSPRLPVP+HIQKPPYVGSSILPEIS
Sbjct: 61  KRTSGLEEALRNLREQKLETVRKVRRIPPLRRGKVSPRLPVPEHIQKPPYVGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQ+GDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQNGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYG+GVVERFVGHGVG+VFHSEPL
Sbjct: 241 RRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGFGVVERFVGHGVGSVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. ExPASy Swiss-Prot
Match: Q9FV52 (Methionine aminopeptidase 1B, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MAP1B PE=2 SV=2)

HSP 1 Score: 479.2 bits (1232), Expect = 8.5e-134
Identity = 240/317 (75.71%), Postives = 272/317 (85.80%), Query Frame = 0

Query: 416 SLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFH-ENLVVISKRISGLEEA 475
           S L+L +  HGE  +P   S  F+GAP+     S SGK+  +      V +K++SGLEEA
Sbjct: 14  SSLQLCSSFHGEYLAP---SRCFLGAPVTSSSLSLSGKKNSYSPRQFHVSAKKVSGLEEA 73

Query: 476 MRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSE 535
           +RIR+ RELE   KVR+  PLRRG+VSPRL VPDHI +PPYV S +LP+ISSE+Q+   E
Sbjct: 74  IRIRKMRELETKSKVRRNPPLRRGRVSPRLLVPDHIPRPPYVESGVLPDISSEFQIPGPE 133

Query: 536 GIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVC 595
           GIA+MRAAC+LAARVL+YAGTLV+PSVTTNEIDKAVH MII+AGAYPSPLGYGGFPKSVC
Sbjct: 134 GIAKMRAACELAARVLNYAGTLVKPSVTTNEIDKAVHDMIIEAGAYPSPLGYGGFPKSVC 193

Query: 596 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEE 655
           TSVNECMCHGIPDSRQLQSGDIINIDVTVYL+GYHGDTS+T+FCG+V +G KRLVKVTEE
Sbjct: 194 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLDGYHGDTSRTFFCGEVDEGFKRLVKVTEE 253

Query: 656 CLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHHRNDEP 715
           CLERGIAVCKDGASFKKIGKRISEHAEK+GY VVERFVGHGVG VFHSEPLIYH+RNDEP
Sbjct: 254 CLERGIAVCKDGASFKKIGKRISEHAEKFGYNVVERFVGHGVGPVFHSEPLIYHYRNDEP 313

Query: 716 GHMVEGQTFTIEPILTM 732
           G MVEGQTFTIEPILT+
Sbjct: 314 GLMVEGQTFTIEPILTI 327

BLAST of Sgr011659 vs. ExPASy Swiss-Prot
Match: Q9FV51 (Methionine aminopeptidase 1C, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=MAP1C PE=2 SV=2)

HSP 1 Score: 381.7 bits (979), Expect = 1.8e-104
Identity = 202/312 (64.74%), Postives = 238/312 (76.28%), Query Frame = 0

Query: 422 AVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVISKRISGLEEAMRIRRER 481
           ++ +G+ F P    A   GAP NF+    SGK+K                  ++RI+R +
Sbjct: 10  SLCNGDQFKPLIYLA---GAPTNFISSPLSGKKK----------------SSSLRIKRIQ 69

Query: 482 ELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSEGIAQMRA 541
           +L+   + R   PL  G VSPRL VPDHI KP YV SS +PEISSE Q+ DS GI +M+ 
Sbjct: 70  QLQSTLEDRINPPLVCGTVSPRLSVPDHILKPLYVESSKVPEISSELQIPDSIGIVKMKK 129

Query: 542 ACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVCTSVNECM 601
           AC+LAARVLDYAGTLVRP VTT+EIDKAVHQM+I+ GAYPSPLGYGGFPKSVCTSVNECM
Sbjct: 130 ACELAARVLDYAGTLVRPFVTTDEIDKAVHQMVIEFGAYPSPLGYGGFPKSVCTSVNECM 189

Query: 602 CHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEECLERGIA 661
            HGIPDSR LQ+GDIINIDV VYL+GYHGDTSKT+ CGDV+  +K+LVKVTEECLE+GI+
Sbjct: 190 FHGIPDSRPLQNGDIINIDVAVYLDGYHGDTSKTFLCGDVNGSLKQLVKVTEECLEKGIS 249

Query: 662 VCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHHRN--DEPGHMVE 721
           VCKDGASFK+IGK ISEHA KYGY  +ERF+GHGVGTV HSEPLIY H N   E  +M+E
Sbjct: 250 VCKDGASFKQIGKIISEHAAKYGYN-MERFIGHGVGTVLHSEPLIYLHSNYDYELEYMIE 301

Query: 722 GQTFTIEPILTM 732
           GQTFT+EPILT+
Sbjct: 310 GQTFTLEPILTI 301

BLAST of Sgr011659 vs. ExPASy Swiss-Prot
Match: Q9FV50 (Methionine aminopeptidase 1D, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=MAP1D PE=1 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 7.3e-93
Identity = 170/268 (63.43%), Postives = 209/268 (77.99%), Query Frame = 0

Query: 464 ISKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPE 523
           +S+  SGL + +   R  E E++   RKR  LR G VSPR PVP HI KPPYV S   P 
Sbjct: 44  LSRTFSGLTDLL-FNRRNEDEVIDGKRKR--LRPGNVSPRRPVPGHITKPPYVDSLQAPG 103

Query: 524 ISSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSP 583
           ISS  ++HD +GI  MRA+  LAARV DYAGTLV+P VTT+EID+AVH MII+ GAYPSP
Sbjct: 104 ISSGLEVHDKKGIECMRASGILAARVRDYAGTLVKPGVTTDEIDEAVHNMIIENGAYPSP 163

Query: 584 LGYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSD 643
           LGYGGFPKSVCTSVNEC+CHGIPDSR L+ GDIINIDVTVYLNGYHGDTS T+FCG+V +
Sbjct: 164 LGYGGFPKSVCTSVNECICHGIPDSRPLEDGDIINIDVTVYLNGYHGDTSATFFCGNVDE 223

Query: 644 GMKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSE 703
             K+LV+VT+E L++ I++C  G  +KKIGK I + A+K+ YGVV +FVGHGVG+VFH++
Sbjct: 224 KAKKLVEVTKESLDKAISICGPGVEYKKIGKVIHDLADKHKYGVVRQFVGHGVGSVFHAD 283

Query: 704 PLIYHHRNDEPGHMVEGQTFTIEPILTM 732
           P++ H RN+E G MV  QTFTIEP+LT+
Sbjct: 284 PVVLHFRNNEAGRMVLNQTFTIEPMLTI 308

BLAST of Sgr011659 vs. ExPASy Swiss-Prot
Match: Q54VU7 (Methionine aminopeptidase 1D, mitochondrial OS=Dictyostelium discoideum OX=44689 GN=metap1d PE=3 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 2.4e-67
Identity = 130/268 (48.51%), Postives = 171/268 (63.81%), Query Frame = 0

Query: 464 ISKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYV-GSSILP 523
           ++K+ +   E M  +  R++           +R G VSP+  +P HI+KP YV G  ++ 
Sbjct: 91  LTKKTASPLEGMNRKERRKMTTKLYRNPDNLIRGGIVSPQPLIPAHIKKPKYVLGEPVID 150

Query: 524 -EISSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYP 583
            EI    ++H +E I  MR   ++A  VL+YAGTLVRP +TT+EIDK VHQ IID GAYP
Sbjct: 151 FEIDDPIEIHTAESIEHMRVVGKMAKEVLEYAGTLVRPGITTDEIDKLVHQNIIDRGAYP 210

Query: 584 SPLGYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDV 643
           SPLGY GFPKS+CTS+NE +CHGIPD R L+ GDI+ IDVT+Y NGYHGDT  T+  G++
Sbjct: 211 SPLGYKGFPKSICTSINEVLCHGIPDDRPLEFGDIVKIDVTLYYNGYHGDTCATFPVGEI 270

Query: 644 SDGMKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFH 703
               KRL++ TE+ L   I   KDGA F KIGK+I   A KY   V   F GHG+G +FH
Sbjct: 271 DSSSKRLIEATEKALYAAIGEVKDGALFNKIGKKIQLVANKYSLSVTPEFTGHGIGQLFH 330

Query: 704 SEPLIYHHRNDEPGHMVEGQTFTIEPIL 730
           + P ++   N+    M EG  FTIEP+L
Sbjct: 331 TAPFVFQCANEFDSVMKEGMIFTIEPVL 358

BLAST of Sgr011659 vs. ExPASy Swiss-Prot
Match: Q6UB28 (Methionine aminopeptidase 1D, mitochondrial OS=Homo sapiens OX=9606 GN=METAP1D PE=1 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 3.1e-67
Identity = 125/231 (54.11%), Postives = 153/231 (66.23%), Query Frame = 0

Query: 500 VSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRP 559
           VS   PVP HI+KP YV + I+P+     ++ + + I  +  ACQLA  VL  AG  ++ 
Sbjct: 58  VSSAHPVPKHIKKPDYVTTGIVPDWGDSIEVKNEDQIQGLHQACQLARHVLLLAGKSLKV 117

Query: 560 SVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINI 619
            +TT EID  VH+ II   AYPSPLGYGGFPKSVCTSVN  +CHGIPDSR LQ GDIINI
Sbjct: 118 DMTTEEIDALVHREIISHNAYPSPLGYGGFPKSVCTSVNNVLCHGIPDSRPLQDGDIINI 177

Query: 620 DVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEECLERGIAVCKDGASFKKIGKRISEH 679
           DVTVY NGYHGDTS+T+  G+V +  K+LV+V   C +  IA C+ GA F  IG  IS  
Sbjct: 178 DVTVYYNGYHGDTSETFLVGNVDECGKKLVEVARRCRDEAIAACRAGAPFSVIGNTISHI 237

Query: 680 AEKYGYGVVERFVGHGVGTVFHSEPLIYHHRNDEPGHMVEGQTFTIEPILT 731
             + G+ V   FVGHG+G+ FH  P I+HH ND    M EG  FTIEPI+T
Sbjct: 238 THQNGFQVCPHFVGHGIGSYFHGHPEIWHHANDSDLPMEEGMAFTIEPIIT 288

BLAST of Sgr011659 vs. ExPASy TrEMBL
Match: A0A6J1DMC0 (Methionine aminopeptidase OS=Momordica charantia OX=3673 GN=LOC111022340 PE=3 SV=1)

HSP 1 Score: 627.9 bits (1618), Expect = 5.5e-176
Identity = 312/329 (94.83%), Postives = 319/329 (96.96%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA GASLVRVS  KLS VSH EA SPSSKSASFMGAPLNFLP SRSGKQ HFHENLVV+S
Sbjct: 1   MAFGASLVRVSSPKLSLVSHAEALSPSSKSASFMGAPLNFLPSSRSGKQGHFHENLVVLS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KRISGLEEAMRIRREREL IVRKVRKRQPLRRGKVSPRLPVPDHIQKPPY+GSSILPEIS
Sbjct: 61  KRISGLEEAMRIRRERELAIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYIGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           +EYQMHDSEGIAQMRAACQLAARVLD+AGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 TEYQMHDSEGIAQMRAACQLAARVLDHAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLE+GIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL
Sbjct: 241 RRLVKVTEECLEKGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. ExPASy TrEMBL
Match: A0A6J1IHD3 (Methionine aminopeptidase OS=Cucurbita maxima OX=3661 GN=LOC111472984 PE=3 SV=1)

HSP 1 Score: 606.3 bits (1562), Expect = 1.7e-169
Identity = 300/329 (91.19%), Postives = 314/329 (95.44%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA+ ASLVRVS LKLS+VSHG  FSPSSKS+SFMG PLNFLP  RSGKQK FH+NLVV+S
Sbjct: 1   MAISASLVRVSSLKLSSVSHGHDFSPSSKSSSFMGVPLNFLPSFRSGKQKQFHDNLVVVS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KR SGLEEA+R  RE++LE VRKVRK  PLRRGKVSPRLPVP+HIQKPPYVGSSILPEIS
Sbjct: 61  KRTSGLEEALRNLREQKLETVRKVRKIPPLRRGKVSPRLPVPEHIQKPPYVGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQ+GDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQNGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYG+GVVERFVGHGVG+VFHSEPL
Sbjct: 241 RRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGFGVVERFVGHGVGSVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. ExPASy TrEMBL
Match: A0A6J1E8K3 (Methionine aminopeptidase OS=Cucurbita moschata OX=3662 GN=LOC111430387 PE=3 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 6.5e-169
Identity = 298/329 (90.58%), Postives = 314/329 (95.44%), Query Frame = 0

Query: 406 MALGASLVRVSLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVIS 465
           MA+ ASLVRVS  KLS+VSHG+ FSPSSKS+SFMG PLNFLP  RSGKQK FH+NLVV+S
Sbjct: 1   MAISASLVRVSSFKLSSVSHGDDFSPSSKSSSFMGVPLNFLPSFRSGKQKQFHDNLVVVS 60

Query: 466 KRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEIS 525
           KR SGLEEA+R  RE++LE VRKVR+  PLRRGKVSPRLPVP+HIQKPPYVGSSILPEIS
Sbjct: 61  KRTSGLEEALRNLREQKLETVRKVRRIPPLRRGKVSPRLPVPEHIQKPPYVGSSILPEIS 120

Query: 526 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 585
           SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG
Sbjct: 121 SEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLG 180

Query: 586 YGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 645
           YGGFPKSVCTSVNECMCHGIPDSRQLQ+GDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM
Sbjct: 181 YGGFPKSVCTSVNECMCHGIPDSRQLQNGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGM 240

Query: 646 KRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPL 705
           +RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYG+GVVERFVGHGVG+VFHSEPL
Sbjct: 241 RRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGFGVVERFVGHGVGSVFHSEPL 300

Query: 706 IYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           IYHHRN+EPGHMVEGQTFTIEPILTM GI
Sbjct: 301 IYHHRNEEPGHMVEGQTFTIEPILTMGGI 329

BLAST of Sgr011659 vs. ExPASy TrEMBL
Match: A0A1S3CD18 (Methionine aminopeptidase OS=Cucumis melo OX=3656 GN=LOC103499439 PE=3 SV=1)

HSP 1 Score: 603.6 bits (1555), Expect = 1.1e-168
Identity = 302/330 (91.52%), Postives = 316/330 (95.76%), Query Frame = 0

Query: 406 MALGASLVRVSLLK-LSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVI 465
           MA GASLVRVS LK LS+VSHG+ FS SSKS+SF+GAPLNFLP  RSGK+  FHENLV++
Sbjct: 1   MATGASLVRVSSLKLLSSVSHGDDFSSSSKSSSFLGAPLNFLPSYRSGKRTPFHENLVIV 60

Query: 466 SKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 525
           SKRISGLEEAMRIRREREL IV+KVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI
Sbjct: 61  SKRISGLEEAMRIRRERELGIVQKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 120

Query: 526 SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 585
           SSEYQMHDSEGIA+MRAACQLAARVL+YAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL
Sbjct: 121 SSEYQMHDSEGIAKMRAACQLAARVLEYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 180

Query: 586 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG 645
           GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTY CGDVSDG
Sbjct: 181 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYICGDVSDG 240

Query: 646 MKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEP 705
           M+RLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVV+RFVGHGVG+VFHSEP
Sbjct: 241 MRRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVDRFVGHGVGSVFHSEP 300

Query: 706 LIYHHRNDEPGHMVEGQTFTIEPILTMEGI 735
           LIYHHRN+EPG MVEG TFTIEPILTM GI
Sbjct: 301 LIYHHRNEEPGQMVEGLTFTIEPILTMGGI 330

BLAST of Sgr011659 vs. ExPASy TrEMBL
Match: A0A0A0LI62 (Methionine aminopeptidase OS=Cucumis sativus OX=3659 GN=Csa_2G172500 PE=3 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 2.1e-167
Identity = 299/322 (92.86%), Postives = 311/322 (96.58%), Query Frame = 0

Query: 406 MALGASLVRVSLLK-LSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVI 465
           MA GASLVRVS LK LS+VSHG+ FS SSKS+SFMGAPLNFLP  RS KQK FHENLV++
Sbjct: 1   MATGASLVRVSSLKLLSSVSHGDDFSSSSKSSSFMGAPLNFLPSYRSRKQKPFHENLVIV 60

Query: 466 SKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 525
           SK+ISGLEEAMRIRREREL IV+KVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI
Sbjct: 61  SKKISGLEEAMRIRRERELGIVQKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEI 120

Query: 526 SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 585
           SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL
Sbjct: 121 SSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPL 180

Query: 586 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG 645
           GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG
Sbjct: 181 GYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDG 240

Query: 646 MKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEP 705
           M+ LVKVTEECL+RGIAVCKDGASFKKIGKRISEHAEKYGYGVV+RFVGHGVG+VFHSEP
Sbjct: 241 MRNLVKVTEECLDRGIAVCKDGASFKKIGKRISEHAEKYGYGVVDRFVGHGVGSVFHSEP 300

Query: 706 LIYHHRNDEPGHMVEGQTFTIE 727
           LIYHHRN+EPGHMVEGQTFTIE
Sbjct: 301 LIYHHRNEEPGHMVEGQTFTIE 322

BLAST of Sgr011659 vs. TAIR 10
Match: AT1G13270.1 (methionine aminopeptidase 1B )

HSP 1 Score: 479.2 bits (1232), Expect = 6.0e-135
Identity = 240/317 (75.71%), Postives = 272/317 (85.80%), Query Frame = 0

Query: 416 SLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFH-ENLVVISKRISGLEEA 475
           S L+L +  HGE  +P   S  F+GAP+     S SGK+  +      V +K++SGLEEA
Sbjct: 14  SSLQLCSSFHGEYLAP---SRCFLGAPVTSSSLSLSGKKNSYSPRQFHVSAKKVSGLEEA 73

Query: 476 MRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSE 535
           +RIR+ RELE   KVR+  PLRRG+VSPRL VPDHI +PPYV S +LP+ISSE+Q+   E
Sbjct: 74  IRIRKMRELETKSKVRRNPPLRRGRVSPRLLVPDHIPRPPYVESGVLPDISSEFQIPGPE 133

Query: 536 GIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVC 595
           GIA+MRAAC+LAARVL+YAGTLV+PSVTTNEIDKAVH MII+AGAYPSPLGYGGFPKSVC
Sbjct: 134 GIAKMRAACELAARVLNYAGTLVKPSVTTNEIDKAVHDMIIEAGAYPSPLGYGGFPKSVC 193

Query: 596 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEE 655
           TSVNECMCHGIPDSRQLQSGDIINIDVTVYL+GYHGDTS+T+FCG+V +G KRLVKVTEE
Sbjct: 194 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLDGYHGDTSRTFFCGEVDEGFKRLVKVTEE 253

Query: 656 CLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHHRNDEP 715
           CLERGIAVCKDGASFKKIGKRISEHAEK+GY VVERFVGHGVG VFHSEPLIYH+RNDEP
Sbjct: 254 CLERGIAVCKDGASFKKIGKRISEHAEKFGYNVVERFVGHGVGPVFHSEPLIYHYRNDEP 313

Query: 716 GHMVEGQTFTIEPILTM 732
           G MVEGQTFTIEPILT+
Sbjct: 314 GLMVEGQTFTIEPILTI 327

BLAST of Sgr011659 vs. TAIR 10
Match: AT3G25740.1 (methionine aminopeptidase 1C )

HSP 1 Score: 381.7 bits (979), Expect = 1.3e-105
Identity = 202/312 (64.74%), Postives = 238/312 (76.28%), Query Frame = 0

Query: 422 AVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFHENLVVISKRISGLEEAMRIRRER 481
           ++ +G+ F P    A   GAP NF+    SGK+K                  ++RI+R +
Sbjct: 10  SLCNGDQFKPLIYLA---GAPTNFISSPLSGKKK----------------SSSLRIKRIQ 69

Query: 482 ELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSEGIAQMRA 541
           +L+   + R   PL  G VSPRL VPDHI KP YV SS +PEISSE Q+ DS GI +M+ 
Sbjct: 70  QLQSTLEDRINPPLVCGTVSPRLSVPDHILKPLYVESSKVPEISSELQIPDSIGIVKMKK 129

Query: 542 ACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVCTSVNECM 601
           AC+LAARVLDYAGTLVRP VTT+EIDKAVHQM+I+ GAYPSPLGYGGFPKSVCTSVNECM
Sbjct: 130 ACELAARVLDYAGTLVRPFVTTDEIDKAVHQMVIEFGAYPSPLGYGGFPKSVCTSVNECM 189

Query: 602 CHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEECLERGIA 661
            HGIPDSR LQ+GDIINIDV VYL+GYHGDTSKT+ CGDV+  +K+LVKVTEECLE+GI+
Sbjct: 190 FHGIPDSRPLQNGDIINIDVAVYLDGYHGDTSKTFLCGDVNGSLKQLVKVTEECLEKGIS 249

Query: 662 VCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHHRN--DEPGHMVE 721
           VCKDGASFK+IGK ISEHA KYGY  +ERF+GHGVGTV HSEPLIY H N   E  +M+E
Sbjct: 250 VCKDGASFKQIGKIISEHAAKYGYN-MERFIGHGVGTVLHSEPLIYLHSNYDYELEYMIE 301

Query: 722 GQTFTIEPILTM 732
           GQTFT+EPILT+
Sbjct: 310 GQTFTLEPILTI 301

BLAST of Sgr011659 vs. TAIR 10
Match: AT1G13270.2 (methionine aminopeptidase 1B )

HSP 1 Score: 375.9 bits (964), Expect = 7.2e-104
Identity = 191/262 (72.90%), Postives = 220/262 (83.97%), Query Frame = 0

Query: 416 SLLKLSAVSHGEAFSPSSKSASFMGAPLNFLPPSRSGKQKHFH-ENLVVISKRISGLEEA 475
           S L+L +  HGE  +P   S  F+GAP+     S SGK+  +      V +K++SGLEEA
Sbjct: 14  SSLQLCSSFHGEYLAP---SRCFLGAPVTSSSLSLSGKKNSYSPRQFHVSAKKVSGLEEA 73

Query: 476 MRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPEISSEYQMHDSE 535
           +RIR+ RELE   KVR+  PLRRG+VSPRL VPDHI +PPYV S +LP+ISSE+Q+   E
Sbjct: 74  IRIRKMRELETKSKVRRNPPLRRGRVSPRLLVPDHIPRPPYVESGVLPDISSEFQIPGPE 133

Query: 536 GIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVC 595
           GIA+MRAAC+LAARVL+YAGTLV+PSVTTNEIDKAVH MII+AGAYPSPLGYGGFPKSVC
Sbjct: 134 GIAKMRAACELAARVLNYAGTLVKPSVTTNEIDKAVHDMIIEAGAYPSPLGYGGFPKSVC 193

Query: 596 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEE 655
           TSVNECMCHGIPDSRQLQSGDIINIDVTVYL+GYHGDTS+T+FCG+V +G KRLVKVTEE
Sbjct: 194 TSVNECMCHGIPDSRQLQSGDIINIDVTVYLDGYHGDTSRTFFCGEVDEGFKRLVKVTEE 253

Query: 656 CLERGIAVCKDGASFKKIGKRI 677
           CLERGIAVCKDGASFKKIGKRI
Sbjct: 254 CLERGIAVCKDGASFKKIGKRI 272

BLAST of Sgr011659 vs. TAIR 10
Match: AT4G37040.1 (methionine aminopeptidase 1D )

HSP 1 Score: 343.2 bits (879), Expect = 5.2e-94
Identity = 170/268 (63.43%), Postives = 209/268 (77.99%), Query Frame = 0

Query: 464 ISKRISGLEEAMRIRRERELEIVRKVRKRQPLRRGKVSPRLPVPDHIQKPPYVGSSILPE 523
           +S+  SGL + +   R  E E++   RKR  LR G VSPR PVP HI KPPYV S   P 
Sbjct: 44  LSRTFSGLTDLL-FNRRNEDEVIDGKRKR--LRPGNVSPRRPVPGHITKPPYVDSLQAPG 103

Query: 524 ISSEYQMHDSEGIAQMRAACQLAARVLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSP 583
           ISS  ++HD +GI  MRA+  LAARV DYAGTLV+P VTT+EID+AVH MII+ GAYPSP
Sbjct: 104 ISSGLEVHDKKGIECMRASGILAARVRDYAGTLVKPGVTTDEIDEAVHNMIIENGAYPSP 163

Query: 584 LGYGGFPKSVCTSVNECMCHGIPDSRQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSD 643
           LGYGGFPKSVCTSVNEC+CHGIPDSR L+ GDIINIDVTVYLNGYHGDTS T+FCG+V +
Sbjct: 164 LGYGGFPKSVCTSVNECICHGIPDSRPLEDGDIINIDVTVYLNGYHGDTSATFFCGNVDE 223

Query: 644 GMKRLVKVTEECLERGIAVCKDGASFKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSE 703
             K+LV+VT+E L++ I++C  G  +KKIGK I + A+K+ YGVV +FVGHGVG+VFH++
Sbjct: 224 KAKKLVEVTKESLDKAISICGPGVEYKKIGKVIHDLADKHKYGVVRQFVGHGVGSVFHAD 283

Query: 704 PLIYHHRNDEPGHMVEGQTFTIEPILTM 732
           P++ H RN+E G MV  QTFTIEP+LT+
Sbjct: 284 PVVLHFRNNEAGRMVLNQTFTIEPMLTI 308

BLAST of Sgr011659 vs. TAIR 10
Match: AT2G45240.1 (methionine aminopeptidase 1A )

HSP 1 Score: 230.3 bits (586), Expect = 4.9e-60
Identity = 116/246 (47.15%), Postives = 160/246 (65.04%), Query Frame = 0

Query: 494 PLRRGKVSPRLPVPDHIQKPPY-VGSSILPEISSEYQ----MHDSEGIAQMRAACQLAAR 553
           PL++  +S +  VP  I+KP + +  +   E +S+ Q    +   E I +MR  C++A  
Sbjct: 100 PLKQYPISTKRVVPAEIEKPDWAIDGTPKVEPNSDLQHVVEIKTPEQIQRMRETCKIARE 159

Query: 554 VLDYAGTLVRPSVTTNEIDKAVHQMIIDAGAYPSPLGYGGFPKSVCTSVNECMCHGIPDS 613
           VLD A  ++ P VTT+EID+ VH+  I AG YPSPL Y  FPKS CTSVNE +CHGIPD+
Sbjct: 160 VLDAAARVIHPGVTTDEIDRVVHEATIAAGGYPSPLNYYFFPKSCCTSVNEVICHGIPDA 219

Query: 614 RQLQSGDIINIDVTVYLNGYHGDTSKTYFCGDVSDGMKRLVKVTEECLERGIAVCKDGAS 673
           R+L+ GDI+N+DVTV   G HGD ++TYF G+V +  ++LVK T ECLE+ IA+ K G  
Sbjct: 220 RKLEDGDIVNVDVTVCYKGCHGDLNETYFVGNVDEASRQLVKCTYECLEKAIAIVKPGVR 279

Query: 674 FKKIGKRISEHAEKYGYGVVERFVGHGVGTVFHSEPLIYHH-RNDEPGHMVEGQTFTIEP 733
           F++IG+ ++ HA   G  VV  + GHG+G +FH  P I H+ RN   G M  GQTFTIEP
Sbjct: 280 FREIGEIVNRHATMSGLSVVRSYCGHGIGDLFHCAPNIPHYARNKAVGVMKAGQTFTIEP 339

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038874534.17.8e-17795.44methionine aminopeptidase 1B, chloroplastic [Benincasa hispida][more]
XP_022155198.11.1e-17594.83methionine aminopeptidase 1B, chloroplastic [Momordica charantia][more]
XP_004147182.14.9e-17192.73methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucumis sativus] >KAE865... [more]
XP_022974354.13.5e-16991.19methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucurbita maxima][more]
XP_022922375.11.3e-16890.58methionine aminopeptidase 1B, chloroplastic isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q9FV528.5e-13475.71Methionine aminopeptidase 1B, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=M... [more]
Q9FV511.8e-10464.74Methionine aminopeptidase 1C, chloroplastic/mitochondrial OS=Arabidopsis thalian... [more]
Q9FV507.3e-9363.43Methionine aminopeptidase 1D, chloroplastic/mitochondrial OS=Arabidopsis thalian... [more]
Q54VU72.4e-6748.51Methionine aminopeptidase 1D, mitochondrial OS=Dictyostelium discoideum OX=44689... [more]
Q6UB283.1e-6754.11Methionine aminopeptidase 1D, mitochondrial OS=Homo sapiens OX=9606 GN=METAP1D P... [more]
Match NameE-valueIdentityDescription
A0A6J1DMC05.5e-17694.83Methionine aminopeptidase OS=Momordica charantia OX=3673 GN=LOC111022340 PE=3 SV... [more]
A0A6J1IHD31.7e-16991.19Methionine aminopeptidase OS=Cucurbita maxima OX=3661 GN=LOC111472984 PE=3 SV=1[more]
A0A6J1E8K36.5e-16990.58Methionine aminopeptidase OS=Cucurbita moschata OX=3662 GN=LOC111430387 PE=3 SV=... [more]
A0A1S3CD181.1e-16891.52Methionine aminopeptidase OS=Cucumis melo OX=3656 GN=LOC103499439 PE=3 SV=1[more]
A0A0A0LI622.1e-16792.86Methionine aminopeptidase OS=Cucumis sativus OX=3659 GN=Csa_2G172500 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G13270.16.0e-13575.71methionine aminopeptidase 1B [more]
AT3G25740.11.3e-10564.74methionine aminopeptidase 1C [more]
AT1G13270.27.2e-10472.90methionine aminopeptidase 1B [more]
AT4G37040.15.2e-9463.43methionine aminopeptidase 1D [more]
AT2G45240.14.9e-6047.15methionine aminopeptidase 1A [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001714Peptidase M24, methionine aminopeptidasePRINTSPR00599MAPEPTIDASEcoord: 615..631
score: 47.59
coord: 685..697
score: 41.96
coord: 715..727
score: 40.79
coord: 593..606
score: 52.38
IPR000994Peptidase M24PFAMPF00557Peptidase_M24coord: 538..728
e-value: 1.7E-48
score: 165.1
IPR036005Creatinase/aminopeptidase-likeGENE3D3.90.230.10Creatinase/methionine aminopeptidase superfamilycoord: 472..734
e-value: 1.9E-79
score: 268.7
IPR036005Creatinase/aminopeptidase-likeSUPERFAMILY55920Creatinase/aminopeptidasecoord: 488..730
IPR002467Peptidase M24A, methionine aminopeptidase, subfamily 1TIGRFAMTIGR00500TIGR00500coord: 531..731
e-value: 4.9E-66
score: 220.7
IPR002467Peptidase M24A, methionine aminopeptidase, subfamily 1PROSITEPS00680MAP_1coord: 691..709
IPR002467Peptidase M24A, methionine aminopeptidase, subfamily 1CDDcd01086MetAP1coord: 536..731
e-value: 3.29313E-105
score: 319.439
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 292..408
e-value: 4.5E-8
score: 35.2
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 154..247
e-value: 2.3E-16
score: 60.3
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 297..402
e-value: 1.2E-14
score: 54.2
NoneNo IPR availablePANTHERPTHR43330:SF19METHIONINE AMINOPEPTIDASEcoord: 413..734
NoneNo IPR availablePANTHERPTHR43330METHIONINE AMINOPEPTIDASEcoord: 413..734
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 296..402
e-value: 8.76369E-15
score: 69.2652
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 292..405

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr011659.1Sgr011659.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070084 protein initiator methionine removal
biological_process GO:0006508 proteolysis
molecular_function GO:0046872 metal ion binding
molecular_function GO:0070006 metalloaminopeptidase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity