MS009702 (gene) Bitter gourd (TR) v1

Overview
NameMS009702
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPeptidase_M3 domain-containing protein
Locationscaffold411: 764 .. 19683 (-)
RNA-Seq ExpressionMS009702
SyntenyMS009702
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGACGAGGAGATTGGTACTGCAAGCCCTTCTTATAGCCAACATGTTAATGGCGTCCCGCATTCTACTCGCCGCTTCCGTACATCCACTCCTTAAAACGACATATTCGCTCTCCATTTCCTCTCCTAACCATTTACAGAAATCTATTCCTTGTCCCCTCTGGTCATCTTCCTTCTCCTTCTGCCTTCACAACCTTCACAACTCCGTTACCTCTTCTTCCATCCACTCTTCTTCTCCCTGTTTTTCGCTTTCTTCCCCGTCAATGGCTGCTTCTGCCGTCGTGGATGAAATTTCTCAATCGAATCCTCTTCTCCAAGACTTCTATTTTCCGCCTTTCGATGTCGTCGAAGCTAAGCATGTTCGCCCGGGCATTCTTACGCTATTGAAGAAGCTTGTATGCCCCTTTCAATTTTCTTGTTTTTTTTTGTTAAAAAAAACTTTTCCCCCTATTTTGTTGTTTAATGGTCCTATCTGGGCTTTATTTGCTGAAGTTTTTGGTCTTTAATTAATCTTCCGAGCTTTTCTTTCTTTCTTTAAAAACTTTCCCTCTCTAATTGGGCCGCGCATGCGGGTCAATCGCATGTATGCGGCACTTTCAGGAGGGGGATCTGGAGGAATTAGAGCGAACAGTGGAGCCATCTTGGTCAAAGTTGGTTGAACCATTAGAGAAGATTATAGACCGTTTGAATGTGGTTTGGGGTATTGTCAACCATCTGAAGTCTGTGAAGGATACTACAGATCTTCGGACCGCCATCGAAGAAGTTCAGGTATATTCATCGTATTTAGAAGAGCGTGTACGCTTTTGTTCGACGGATCATATTTTGCATGTTTATTTTTGGATTAGCCAGAGATCCAAGAGAGAAACCCATGTAATGTACATGTACTCATATTAGAAACCAATATCCAGTAAAACCAGTTGATTAAAAATGTGGGAAGATATAATGGTTTAGCATAGTCTTGTGTCTTTTTAGCATGAAAAGAGTTCGCGCTACATATTGAGAATGATAGTAGAAAATGCTCAGTCCTCCTTTATGCTTATTGCGTGTAAAGTCAGGATGAATTGGGAGCAAACTTAGGAAAATTATTCACAATTCAGAAGTCCAAGTAAGGAAAAAAAATCAGCCAGCATTAGAAATTATTTAGTGACGAAGTGTTCCGCATGGCTTGTTGGGATCTCCTTTTACTCATTAAATTTATTTCTGGTCACGAGATTTGAGTTATCTTGCTCAATTCTTCTTCGAGACTAATGTTATCTATTTTGGCTGCAGCCAGAGAAAGTTAAATTTCAGCTTAGGTTGGGGCAAAGCAAACCTATTTACAATGCTTTTAAAGCTATTCGAGAATCCTCCGAGTGGAAGATGCTCAATGATGCTCGCAAACGCATAGTGGAATGTCTGTATTTCAATTTTCTTGCATCATATCAATTAAAATGATTACTTTGTTCTTAGTTTTATTGTTTACACTGCATCATGCTAATTTGCAACAGTTTATGTGGAATTGATCATAACCTTGTTAAATAACATAAGGGAAGGACGAAAAACATACTTTTCTTTAGTTAGGGAAAAGCTGAAGATTATTTGGAAAAGATACGAAATTTTCATTGATAAATGAAAAGGAACAAAAATTTTCAAGGATACAAACTCCCAAAGGAGTGAAAGCAAACGAACTTCTCATTGAAATAATGAAAATGAACTTAAAGGTTCCACGACACAAACTCTCAAAAGAGTGAAAGAAGACAAAATAAAACAAGAAACAAGAATTAAACATTACAACAAAAAGAATCCCAATTAGACCAATGAGAGAAGAAGAGTAATTCCCAAAATTAGTTGATAATGAACACCAATGAGAAGCTTTAAATTTAGCTATGCTAAACCGATCAAATGGGGATTTATGCTGTCCCTCGAAATTCTGAGATTCCTTTCAAACCATAATTTATAGAGCAAAGCCTTAATAGCATTAAACCACAGTAGTTGGCCTTGTTAAGGCAAAACCAAACCAAGGAACAAATGAGTGACATTGTCTGCAACAATACCATTGAATGCCCAAGATAAATTGAATTCCTTGAACAGATAGAACCAGCACTTCATTGAAAGAAAAGATGAACAACACTTTCCCCACTTGCTTGCCTAAGCATGCAGATTGAGGGTGATAGACAAGAAGCTGAAAGCTTCTTTTAGAGAACATATGCAGTGTTTAGACTTTAGCTTCCCATATACAGGATGATCCAAACAAAAAAGCTAACTTTCTTTGGACAATTTGTTTCCCAAATTGCCTGATAGAGGTCAGCTGGCATTAAATTTGGTGAACAAGGCTTACATAAAGAAGCCACTGTGAACAAACCCGAGGACTCTAAAGACCAGAATTTGCCATCATCCAAGCTGCTAGGGGTGAATGGATTCAATGTGGACAACAACTCAATAAATTTTGCAGCAATTGGTGTGTTCTTTGCAAGCGGGATTCCGAGGACCTTGATCATATCTTATGGCTTTGCCAGTTCTCCCAGAAAATTTGAGGCAGGTTCTTTGAGATTTTCAAGATTTCTTGGGTTCTCCATAGAGAGTGGGTGTCTATGGTGGAGGAAGTGCTTTTGTATTCTCCTTTTCGCGAGAAAGAGATGTTTTTATGGCATGGGGGTTTCTTGGGGATTTTGTGTGGAATTTGGTTGGAAAGAAAGAATAGAATCTTTAGAGGGATGGAGAGAGATTTGAAGTCTGTTTGAGACCTCATAAGGTTTAATGTCTCTTGAATTTATTTTGTAATTATCAGTTAGGTTCTATTCTTTTGGATTGGAAACCCTTTTTTTAAGCTGGCTTACTTTTTCTGGGCTGAAAAAAAATCTTTAAGTAGCCTTTTTGTTTGAATTTTCCATGACAAGGTAGAAGAATCCCAAAAATCAGCTATAGTACCATCGGGATGGCGATCCAAATATGATGCCAAAAGAGAATTCGAAGGCTAGAGCCTAACTTAAACTTGGAGTAGTGCTCCACCCTCTTCCTTTCGGCAATATTAATCCTAGGACTTCGTAAACAACCATTCTATTTTTCTCCTTACGAGATTAGTGAGACCGTACAAACTTTCCACCGCTTGATGCCATAGAGCTGAAGATTCAAGCCCGAAGCGCCACCCTCATTTAGCTAGAAGTGCGCTATTCTTTGTGGACAAACCGCAAACTCCCAAACCGCCATCTTCCAAATGTTTGGAAGTAATATCCCAATGCAGCAGATGAAGTAGGCTCCTGTTAGAACCATTATTCTAGAGAAAAATTTTCAATCTTTTCTCAATTTGTTTACAAACTCCTGAGGGCATTTGAAATAAAGAGAGATAATATATTGAAATGTTAGATAAAACTGATTTGCAAAGGGTTAAGCGATCCCTCTTGATAAATGAAATCGCCTCCAACAATCCAACTTCTTTTCGATTTTTAGCGGTAAGCCAAGGTACATGATAGGCAAAGAAACCACCTGACAATTGGCTGAAGATGCAACCGCTGAATTCTCCCTTGTGAGATATTAATACCGTTAATGGATGATTTCTCCCAATTAACGAAGTCCACCATATGATTTTGAAACCATAAAAGTTGAGAAGAAAAAAGGAATCAACGAATATTTGAAGAAAAAAGGAAAGAATTAATGCTTCCGCATTGCAAAGATGAAAGCTTCCCATTGGTGTTCTCTTGATGCCTATTTTTCTAGTTATTCTCCTAGTATGATATGTTGTAATTGGTAGAGCTTTATATCTCCTCTTTAGGTTTGATTTTGTCTTAGGGACTGTTATGTTTCTTGCTTTTTACTGGTCTCTTTTTTACTCATTTGAGAGTTTGTATCTTTGAACAATTTTGTTCCTTTTCATTTCTTCAATGAAAAGTTTGTATCTTGTTAAAAAAAAAAAGATATGAATGTTTTTCCACATCTCTCTTGCAATGGACACACACCTCTTAGAGAAATGGATAAATTAATAAATTTTGATGGAGTTTTGTTGTCACATAAATTTATAGTGGTCCATTATCTAAATCTCTGAAGCACATCCTTTTTGTTTGACTATTTATTTCTGGACCAATAATTTGAAGGATGTTGTTTTGAAGTTTATCTGTAGTGGACTGTTTCTTTTCCTCTTAGGCTTGCTGTCCATGATTATGATACTTTTAATTTTAGTCAAGTCAACAAATTCACATATTTTATTATTATTTTTAACACTTTCCTAGACTTTACTTGACGTTGAGCAGCTCAAATAAAAGAAGCCCTTCTTCATGGCGTTACTCTTGAAGATGATAAAAGGGAGCATTTCAACAAAATTCAGCAGGTGAGCCTTCACTCCAAGTTAAGTAATGGTGCAAAAGTTGGAAAATATCTTTCTCACGGTCACACCTTGGTTGGTAGTTGGTTTTTGTTTTTCTCCTTTTGGAGGTTCTATCCTTGAACTCTTTGTATCTTTTCATCTATCAATGAAAAGTTGTTTATTGTTAAAAAAAAAAAAAAGAGAACAAATAGAGAATTCTCCATCTAGGAAGCACGAACACTTTATTTTTTTACAAAGTGTCCGTGTCGATGGACACGTTCCGTACATGTGTTCGGCATGCAAAATAGCGTGTCATTTTTTTGGTTATTTTTTTATTTTATTTTATTTCGGACACGTGGTGGACACGCCATAGACAGCTGTGTACACGCCCAGTTTATGGTGTTTGCCGTCGTTGGTGGTTTTTCTATGTTAGCCTTCGTTCAAAGGGTTGTTTTTAAAATTAGAAAAAAAGGGGTAAAGGTAGAAGAATAGCGAAGGATGAAGAAAAAAGGATAATGAGAAGGAGTGACAAAAGAGAGGACGAAAATTTCGGCACAAAAGAAGTTGGAGAAGAAAGAGAGAGAAAGATGCAGAAGGGAAGAAATTTCCAGCAGTATGAAGTTGAAGAAGAAAGGGAAAAAATAAGAAGTAGAAGAGACAAAATTCGAAGGGGAAGAAGAAATAAAAACGTTCCGGCTGCTGCATGTAGCATTCGCTCACAATGTTTGTGATGTAGTTAAATGACTGACCTTTAGCCTTTCTCATTTTATTATCTACTTTTATAACTTTTATTTAAGCAGCTTTACTTTTTCTGCACAGTAACAATAGTTACTCGTGTAGCTCTTAAGTTTAAAACCTCTATTTATAGGCTTCAACTCCTTTAATAAAAGATCATCGATTCATTTCTTCAAAAACCCTTCTAAAGGCTACATTAGTTGGTATCTGAGCACCCTAGTTCTGGGTACTCAGAAAGAAACCAACAGTGGGAGATTCTTCCTTAGCCAACAAAGAAACGAAACAAATTTCCGTCCTTTCACCACGATCATCGACCATACGCTTGCAGTTGGTCGAAGGAGATGTGAAAATCATCCAAAAGGATGAGAGTGAGATTAAACAAATCCTGGAAATCATCCATGTTCAGCTGGAGAACTTGAGTCTTAACAAAATCCTACAAGATCTGATCAAGAACCAAGAGGAGTTGATCATTAAAGCAACTACCATCAAGAGTAGCAAGAGGAGTTGATCATCAAAGCAACTACCATCAAGAGTAGCACCCTGTGCAGCCAAGAAGGTACCAAGATCGTGTTCTTGAACCAAGAATGCTCCAAGAACCGCGAGCTGCCCGACAAGACTTCAACCAAAACCCCTTGTTTTAGTGTAGGAATGATTTTTTTTTGGATTCTTCTTATGATGAGGATGATATTCTAGAACTAGAAGAAGACACAAGGCTACATCGATTCAACCAACGCCAATATCATGAGTCTGTTCTATGAGTTTTAAGTTGTTCGGATTGTTGGAATGTTCTCAATTATACCTTCTGCAGATTTCAAAGCTTTGCAGTCTTAGTTCGGTTGTTTGGTTGATGAGATTTTCAAAGTTTTTCTCCAGCGTGTGGTTTGACATTTAGTGGTTTATATTTGGAGGCTTTTGAAGACAACATCGAGGCTTAGTATTTGGTATTTTCTCCTTAGTAGCATTTTGGCTGCGGTCTGTTATTGTGCCCTTTCTTCTACTTGGTTCTTTTTGGAGTTTGGTCTTCGCTCTTGGTGTTAAGGAAGGTTTTCCCTCATGTGGTTGTTGACCCTGAAATTAGTTTGATTTCCCAAGCTCGGCTGCAAGCAGTTTTTCTGAGTTCTGTCTTAGAATGCTTTTTTCTCTTTTGAGTTTGTATTGCTCTTGTTATCGAGTATAGATTTCTATTCCTTTTCATTAAATCAATGTAAAGTTCGTTTCTTGTAAAAAAAAAAAAACAAAATACTATCGAGTTAGATTCCCAATTCCTCGCATTAGTGATTTGTTGGACCAACTAGGAGGTGCTACAATATTTTCATTCCCTATTCCTCGCAAACATAGATCCTTTATGGACTATTTTATCATAGCCAACTTCAAAGCCTCACAATGGTGTGCCTTTTCCGATCTTTTCTCTAATTATTCTTTGAGTGTGATTTGTATGAATTGGGAGACCTTTATAACTCCCTCTTAGTGGTGGTTTTTTGATGTTGGTGATTATATTTTTATTTTTATGCTTTCTTTTTACTTCTTTTGTTCACTCCTTTTGGAGTGTTTGTATCCTTGAGCAATTTTGTTCATTTTTATCGATCGATGAAAGGTTTGTATCTTGTTTTTAAAAAAAAATTATCAGTGTAATTCAGATGTGATGAATGTAGCTACCTTAACTTTGTGTTAGCTCTTTCTAGATTCATAATTGTTTTGAAATACAGGAACTGGAAAGATTATCACAGAAATTTGAGGAAAATGTTTTGGATGCTACAAAGAAGTTTGAGAAGTTAGTTGTTGATAAGAATGAAATTGATGGACTGCCTTCTACTGCTCTAGGAATGGCAGCACAAACAGCAGTTTCCAAGGTATTTCACTGTCAGTGATGTGCTTTATTTTAATTATCTTTATATTTGGTTAAGCCATTGTGTTTTACAGCATCTAATTTGATATTTTTTTTGGTATTTTTTCAAGTAATTCTTTGACTTCTCTATTCACATCAGGGTCATGAAAAGGCTACTGCAGAAAATGGTCCATGGATAATTTCTTTGGATGCTCCATGTTATCTTTCTGTCATGCAACATGCTAAGAATCGATCCTTGCGGAAGGAAATTTACTACGCTTATGTAACTCGTGCCTCTAGTGGAGAAATGGACAACACGGCAATAATTGACCAAATTTTGAAGCTTAGGTTGGAAAAGGCTAAGATTCTCAATTTCAATAATTATGCCGAGGTATGTTCTATTTATATGTATAGTTTTTGTTCCCCAACTTCTTTCTGCTTACATGAGTTGTACTTCTCCAGGTTAGCATGGAGACCAAAATGGCTACGGTTGAGAAAGCTGAAGAGCTTCTAGAAAAGCTTCGTAGCGCTTCCTGGAATGCTGCAGTTCAAGGTGGGTCATTTTGTGTTTTTATATTCCTTTCCATGTTTTTCTAATGGACTGTATTATTTACTCAATTTTTTTTTTTCTGTTCAAGTGGTATCTTCAATTTAGCTTCAATGGTTAACTTCTGAAAATACTTTAAGATTTTAACTAATTGTCATATTATGTTGAAAAATCTGTGTGTTTGTTTGTTTCACAAATTTAAGTAGTTCTGACTTCTTAGATCATTTGTACTCCACTTCTATATGTCATTTAAATGCTTTTGATTGCTGAGGTCGCGTTAAATCCATAGGAATTCTCACTACACTGTAATTTTCTCTTTTTTCCACTTCAAGAGTAGCCTATTAAGATATTCAAATGTTTACATCCAAAGCTAGTCGTTTAACCTCACAGGGATGTACTTCCTTTGGGATGAATGGTCTAATAATACTCTTTCTCAAAACTGATAAAATACAGAAAGTACTTAAGTATGTATTCATTGTGTTAGTACATATTATTGGTGTAGAAGAATGTTCCTGCTGGACAAGAATATGAGGTCCTTTCATTTATGATTCTTATAGATATTGAAGATCTTAAGGATTTTGCAAAAAAACAAGGCGCGCCAGAAGGCAATGATTTGAACCATTGGGATATTAGCTTCTGGAGTGAGAGGCTTCGGGAGTCAAAATTTGATGTCAATGAGGTTGGATCTTTTCTGTCTCTCTTTGTATATGGTTGTGTACGCCGATGTCCTTTCATTTAATGTTTCAATCCTTCTGAAATTCATGTTGTACTTTTCAGAAATAATTTTGCTTACATGCAAATATTTCTAATTCTCTCTCTGTTAGGAAAGAAATTTTTTGATTTTTGTTTGTTTTTGTTTTTCTTTTGCTCCTTTGGAGGCTTGTATCATTTTCATCTATCAATGAAAACTGAAAAGTTGTTTGTTAAAAAAAAATATATATATATATATTGTAAAGAGAAACCAGAAAAAAGCTTGGGAAGACGTATAAATTTTTTGTTTTATTAGATGTTGGTACACAATGGACTCTCCATTTGGTTCTCTGATGGCTGGAGTATATGATGTATTTAAGTTCACAGCCTATAATAGTTATTTTCAATTCTTCACATTTTTGTGCTACTAGGCTTTAGATCTATTGGATTTGTCCATGCTTTATTTAGTTCTGTGGTTCCTCAATTCTTTTAAACAACTTATTTTCAGGAAGAACTACGGCCATTTTTTTCCTTGCCAAATGTTATGGATGGCCTTTTTAGTCTTGCAAAGACACTTTTTGAGATTGATATTCAACCAGCCGATGGTCTTGCTCCGGTATATATCCCATTTATTTTATTTTCTGAAACCTTATTTCCGCATCCACTAACTTGGCCGCTTCCCTGTAGGTTTGGAATAAAGATGTAAAATTTTACCGGGTAAATGACTCTTCTGGAAGTCCCATTGCATACCTCTATTTTGATCCATATACACGTCCATCGGAGAAAAGGGGAGGTGCTTGGATGGATGAGGTTGTTTCACGAAGCTGTGTGTTAGCACAAGATGGTGCCCCTGCAAGGTTGCCTATTGCGCATATGGTGTGCAATCAAACACCGCCAGTTGGAGAAAGACCTAGTTTAATGACATTTCGTGAGGTAAAGCTATTGACTTATTAATAATTGCACATGTTCTTGAGATATTGAGGCTAACATACATTCTAAACTTCGAAGTAATAGACAGCCTGATTTGGTTGTGGTTTAGAGGTACAAGCAAAAAAGGTACTGGGGGTGAGGACCCTGGCTATGTGATCCACAAAATATATTTAGCTATGTTGAAAATTTATAATACTGTCTGGTTAATAAAATTTCCAAAAATTCTTTTGTAGTCTTGTTCAGAACATGGGAAATCATTCAGAGTTTAATCGCATGCTAACGAATTAATATTGAATTGAGGGGTATAGCTGTAATAAATTATGATGTCGTTGTGATTGGGAGCATTAAAATAATACTATTTTATTTGGGATCGACATGGTAATCATGTCATGGAACCAACTCTTTTTCTGTTTTGTCTGGTTCTTCTTGAAGGATATATCTTCTTCAATTGATCTAAATTGCTATTCAAGCAACTTTATCATTTCTTCTGAAGTGCTTGACATCCTTTACAGGTAGAGACAGTATTCCATGAATTTGGTCATGCACTTCAGCATATGTTGACAAAACAAGATGAAGGTTTAGTTGCTGGTATTCGTGGAATAGAATGGGATGCTGTTGAATTGCCGTCTCAGTTCATGGAAAACTGGTGCTATCATAGGTAATTTGATTCCCAATTTTGTTGATTGTTGTTCTACGTATCTGCTATTGTTGCCTAATTTTTTCTTAATAAGAAACAAACAAAATATTATTGAAGGAACAAAAATTATAAAAGAGGGTGATAAGGAATGAATCCTCAAGCCAAGGGGATTACAAAACCAGCACCATTTTCCTCTCTCTCTTTTAATTCAAAGGCTCTAACTAACCAAGAGTTAGATGATTTATACTGTCTTTGGTCAAAATTGACACAAATATTCATTATGTATATGGGTCTGGCTTTTGCATTATATTGTTGAGGTTCTATCGATCTGCTCCGTAGAATAAAAAATGCACAAGATGGCTATTCTAGTTCTGTTTGTGGGTATCCAATGTAGTCTAGTACCCTTTCAGCCTTTCATTACTCTTATGGCAATTCATTGATCCATCTTTGTCAACCTTAAAACTCTTGCAGAGACACTTTGATGGGCATTGCAAAGCACTATGAAACAGGGGAAAGTCTCCCGGAAGAAGTGTACTTGAAACTTCTTGCAGCTAGGACATTTCGTGCAGGTTCCTTGAGTCTTCGTCAGGTTGGTAATGTTTTAGGTGCCATATCTTTATTTTCTGTTCCTTGCTTTTGTTTTAAATGTTTTATTTATAAAAGGCTTTGGAAGCAAATGGATTTCGTGGATCAACGGGTGTATTATAAAGACACCAAATTCTCTATCATCATTAATGGTCATGGTTGACCGAGAGGAAGAATTCAAGCTTCAAGGGGTTTATTAAGGCAAGGAGATCCCCTTTCTCCTTTCCTGTTCCTTATTGTGAGTGATGTGATTAGTAGTTTGATGGACCATATCTATTCTAAAGAGGTATTTGAGGGCTTCAAAGTGGGGTGAAACTCGGTGCATGTTTCTCATTTGCAATTTGCAGATGATACCCTCTTGTTTCATAAAGATAATGTTGATATGTTGGCTGTTCTGTTTGATGCAATCAAAGCTTTTGAATGACTTTCGGGGTCGAAGGTTAACTGGGAGAAGTCCTCTCTTACTGGGAATATGCATTTTCAAAAAATACAACAAATGGCCATCAGATTTAATTGTAAAGCAGAACCTTCCAATGTTCTATTTAGGCACCCCTTTGGGGGGTAATCCAAAACATGATGGTTTCTGGAGTCCCATTCAAGATAAGATCTGTAAGAAGCTCGATCGATGGAAGCGTTACAAACTTTCTAGGGGTGGAAGATTGACTTTATGACTACATCAATTCCCCCGACCAAAAAAACAACTCGATGGAGCCAAAGAAAAGTCGTCCGACGCATGATTGGAATAGATGTCAGCCACGTTAAAGACCGGATAGGAAGCCCAAGTATGAAATCCATGGATAGGTCTTCCCATATATTCCCAGGAATAGGAAGCGGTGTGTACAACCCTGTGCTTTGGGATGTGCACTTATAGGACTGGCATATAAATCATCTGTTAATATAGTTGGTCACATCCTTGGGGATTTGTGGCCAAAAGAAATGGGCAGTAACCAAAGCAAGGGTTTTGTCATGGCCTAAGTGGCCTGCCAACCTTCCCGCATGCAAATCCTTTATAGTAGCCTCCCTCTAGAGAAGTACGTGGGATGGATAGGACATTGCCCTTGAATAAGTATCCATTTATCATATGGAAATTGTCGGTTGCTCTGTTATTAGTACATCGATTCCATATGTCTTGAAAATCTTGATCATTGGCATAGGTATCAGGGAGGTGGTCAAAAGCAACAATCTCTCCTTTAAGGAGAGCAAGAAGGCTGCCTTTTATACTTAAGGCATCAACAGCCTTGTTCGTGATCTTGGATGTATTTTTAATAATGAAATCAAAGCGTTGAAGAAAAGAGAGCCATCGAGCATGCATTTGGCTTATAGACTTTTGAGTGTGCAGGAATTTTGAAGAGAAATGGTCAGTAAATAACACAAATTCCTTACTCAAAAGGTAATGTTCCCATTATTTCAAGGCTCGTACCAAGGAGTAAAGCTCTTGTGCATAGGTACTCCATTTTTGTCGAGAAGGACTTAATTTCTCGCTGAAATATTCGATTGGGTGGCTGTTTCGGGATAGAACAACACCAATTCCTAACCCAGATGCATCAACATCCACCTCAAATGGTTGGTCAAAATTGGGCAATGCAAGAACAGGGCTACTACTTAGCTTTTCTTTAATTATTTTAAAGCTCTCAGATTGGCTTTCCCCCACATGAACTGACCCTTTTTTAAACAACTAGTTAAAGGAGCAACAATAGTATTGAAATTCTTAATAAACTTACGATAAAAGGATGGAAGACCAAGAAAACTTTGAACCTCCCTTACTGTTTTTGGTTCCATCCCGTCAAAAATAGTTTGTATTTTTGTAGGGTTCATTTTAACCTCATTGTGTCCTATTATAAAGCCTAAAGGAAATTTTAGTGGTGACAAACAAACATTTCTTAAGGTTAATCACTAACTTATTATGAGCTAATGTAGAGAATAACAAACGTAGATGATTAAGATGGTCAGACATGCTTTGACTATAAATAAGAATTCCATCGAAATAAACTACAACATATTTGTTAATGAAGGGAAGTAAAACCTGGTTCATAAGCCGCATGAAGGTGCTTGGAGCATTGGATAGGCCAAATGGCACGACAAGCCATTCAAACAAGCCTTCATTGGTTTTAAAGGCAATCTTCCATACATCCTCTGGTCTTATGCGGATTTGATGATAGCCACTCTTAAGGTCTATTTTGGAAAAGATGGTGACACCACCAAGTTGATCCATCAAACTGTTAATGCGAGGAATGAGAAATCGATATTTGATAGTGATTTTATTTATAGCACGGGTGTCTATGCACATATGCCAAGTGCCGTCCTTTTTCAGTGTCAACTGCCCAAGGACTAAGGCTAGGTTGTATATGGCCTTTATTGAGTAATTATTGTATTTGGTCATGAAGAACGTTATATTCCGAAGGATTCATCCTATAATGGGACAAATTGGGTAGTGTGGCACTAGGGAAGAAGTCTATATTATGTTGTATATCCCGAAGGGGTGGTAATCTGGTAGGTGTCTCATCAATATCCTTGAACTCACGTAATAGGCTTTGTATTTCAGGATGCAGTGTGGAGCTTTGCTTCTCTGTCCCAAAATTTTTAATGACAACTGACAAGACCGAGGATAGGCAACTCTTTATGAAAGAGGAGTTGTTTGGCCAAGGATAGAGTAAACAGGGCACCTTTTTCTGTGTTCTTGGTGCTGTCCTTCACAGAAACAAGTTTGGATGGGGCAAGATGAAGAAACACTACCTTTTTGCCCATCCACATGAACTCATAGGTGTTATGCACTGCTTGGATGTCATATTGCCACGGACGTCCCAATAAGATGTGGCACACATCCATGTTGAGGACATCACAAACTATTTGATCCTTGTAACTTGTCCCAAAGGACAAAGGCACGGTACAAGTATGACTGATTTGAGCCTCCCCTCTTTTTTTATCCAAGTAACTTTATAAGGGGCGAGGTGAGGAGTTAAAGAAAGATTTAGAGCAGCCATGATTTTACTAGCCACGATGTTCTCGGTGCTGCAACTATCAATTATTAGGTTGCAAATCTTACCATTGATAGTACATCTTGTTTTGAAGAAAGCATGGAGTTGGTATGCTGGTTTAGTTTTTGGGGTTAATAGAATTTGTTGTAACATGCAAGTGATGGGTTCTCCCTCATCAGAGGCAGCATATTCAATTTCTTCCTCTTGGTCCTCACCATCACTGTAAACCGGCTGGTCCTCCTCATCCACGAAGGTAATTGCTTTTCGTTGAGGACATTCATTAGAAAGATGTTCAAATTGTCCACAACGAAAGCACTTACCTAAATTTGGACGATTATATGTAGTGGTAGGTTTCTTGCTTGCAAGTCTATCTTGGACAGCTTCTTGTATAGTATCTTTGTCACTCTGCTTGCCCTTAGCAGTAGCATTATTTGCATGGAAAGAATTTCTGTCCATAGCAGAACCTTTTTTTGAAAGTACTACCTTGGTCCCAAGTAGTCCAGCGTGAGTATTGTTTTTTGTACCAATTGTTAATTTGTTCTTCAATTGTTGTGGCAGTAGAAATGGCCTCATTAAGATATCCAATGGGTTGTAGAGAAATCTGCTCCTTGATATCAGAACGTAGGCCGCCAATATACCTAGCCACCAAGTAATGTTGATTCTCTGCAGGATTCGTCCGCGCCAAGCCTATGGAATTCTTCTGTATAATCAACAACCGTACAATTACCTTGGTGGCAGTTTTGATATTGATTATAGAGAATTTGCTCAAAGTTTACGGGAAGAAAACGCTCCTGCATTAGTTTCTTCATGTGTACCCAGCTTCGAATTGGTCTTTTCCCGTAGTGTTGGCAATTAATCTCAAGTTGATCCCACCATATGAAGCTCCACCTTTTAATTTCAAAGCTACAAGACACGCCTTTTTATGTTCTTGAGTACCCATGTAGCTGAAATTTTTTTCAACATTTCGAACCCATCTTCCCATTGAAGCTAGGAAGATCTATTTTCATCTTGTAATCGCTGGGTTTCTTGCTGCCTATTTGGACGAATCTGCATACCGTGTCTAATTGTCTTCAAACTCCAGCATATCTTCTTCCTCATCACTTGAGGAATCAATGAAGGCATCTTGTTGCCGTTGAAACAGAGGATTAACATTGCCTTCTTGATAATGTTGGCGCTGTTCTTGGTGGATTCTTGGACCCAAATTCACTTCTTGAACCCTTCTTGGAGGAGCTGCTAACCTATCTTGAATATGGATACTTTGTCGATCATCTCTACGAGGTTCTTGGTCAGTTTTTGTAGGGACTTGCTGAGCGATTCTTGCTGGATTTTGATTTAAACTCAAAGTCTCAACTGCTCATGTACGATCCCCAAGATTTGTTTAATCTCACCCACATCTCTTTGTATGATCTTGATATCCCTTTCGACCAACTACAAGTGTGTGGTCATAGAGCATGGTGAGAGAACAGATTGTTGATCCGCTTCTTTACCGCTGGCTAAGGAAGAATCGTTGATCCGCTTCTTTACCGCCGGCTAAGGAAGAATCTCCCAACGTTGCTGGTTTCTTTCCTGCCATAGAGTGCCCAGAAATTTTGGGTGCTCTGATACCAATTGATGGAATCTTGGACAAAAGTTTTAACAATGATTCAATCTATCATTAACAAATGAAGGAGAAACCTATTTATAGAGGTTTCTGACTTAAACAGTAACTTGCTGTGTAGTGAAATGAAAACTTAAAAGAAACATAACTCATAAGCCACGTTTACAAGCTTACAAATACTTAAAGCCAAAAACAACTTAGAAGGCTTCTAAGACTACATCAGCATGTATTTCTTATTAGCAGCCTGATTTTTTTTAAAATGACTTGTTTTTTTCCCTCTCTTTGTTCAGCGGCTATCCTTGCTTCAGTCTCGGTCTTTTGACTTTAGGACCCACTCAGTGATACCATCTGCCTGAGAGAATTTCTTGGTACCAAGTTCAGTGACAGCATGCTAGTATTCTCATCATCCTCATTTGGCTTTTTTATACCCACAATTTATAAATTTTTTGGTAAACCATCATTTCTTGTTGGGGAGCTACACTTATTGTTTTTCCTTTGTTTTCCAAAGTCTTCTTATCAATGATATGCCTCTTAAAAGAGATTGATAAAGATACATGACTGCAGGCACACATGGATGTGGTAGCAGAAAATGCGATTTTAATTCTCTTGCTGTCTAATGTAATTTGCAGCTTAGATTTGCAAGTATTGATTTGGAGCTTCATACTAAATATGTTCCTGGGGAGCCAGAGGCAATCTTTGTGGTTGACCAGAGAGTGTGCAAACAAACACAAGTGATACCTCCCTTAGCAGACGATAAATTCCTTTGTAGTTTCAGCCATATTTTTGCAGGTTCGTCTTCAATCATTTTATTTAATTTTTGATATACAAACTCCTCCATTTCCTTCGTTTCCCAGGTATCAAAGTAGTCACGAACCTTTGTTCTTGTTTTCCTTTAGGTGGATATGCCGCAGGATATTACAGTTATAAGGTATGCATTTTTCCACAAGCCATAATCTTTCATCTTTGACATGACTACGTTGGAAAGCAGCATTCTGGATTGGATATTTGTTCTGCAGAGTATTGATTAAGAACTGTGCAATAGCTATGATGGAGTCGAGTAACTTTAGCTATTTTGATTATTGTGCTTACTCACAAACAAAATTTTTCATTGATGAAATTAAAAATTGCACAAGAGTACCAACAACAGTAAAGAGAGCAAAAGCTACAGCAACCAAAGAAATAAAACCCTCTGAACTGAAAACTCAAGAGAAAAGTTTAACCAACTTAAACCAACTATCAACTTTAAACAAAAATAAAGATCAAAGACTCGATCTTGATGACTGCAACCACACTTTTCTGGAAAGAACCTTCTGATTTCATTTTGTTAACAACAAAACTCAAGACTGCAGAAACTTGATGAAGAAAGGAACCCTCATCTTGACCTTAACAATCAACCTCACCAGAATGCTCAATGAAAATCCTTTGAAGACCCCAAGCTTTAAAATCTAGACACTTGCCACAGATCTTGAAGCTACGGTTGAAAGTGCTTTACACAAGGAAAGCCAATTCTACGGAAGAAAGTCGATCTAATTATATAGAAAAATTTGTATCTAAAGTACCAGGTAACTGAGATTTTAAAGATAAATAAAAACACATGACAGACAAGGCTGTAGCCTTGGAACCATTGATCAATTTCTTTACTATATTTCTTCTCTTTTTTATATCCTCCATTTTTTTCTTCTTTTTCTCTTTTCCCTCTACTATTTCCAACTTATTTTTTCTTTTCTTCTTTCTTTTCTACCTTCCTTTTTACTTTTTACTTCTCCTTTTCCTTCTCTTGGTTGACTTTTCAGCTTATTCATCTCTCAGATGACCCTCTCACACAACCCAAAATCATACTCCCTTCCCTTTCCCTCCTACTGCGAACGAACATTCACAGCTTTCCTCAAACCACCCCTCAAAACTGCTATCCTTTCCATCAGAACTCCCCACTGCCCAGATTTTGTTTCCTGAAAGATCCATCCTCTTTAACAGCAACTTCACCTTTCCGGAAAGGACCTTCTGATTGCATTTTGTTAACAACAAAACTCAAGAATGCAGAAACTTGATGAAAAGAGAAACCCTCATCTTGAGCTTAACCCCACCATAATGCTGAATGAAAATCCTTTGAAGACCACAAGCTTTAAAATCTGGAAACCCGCCACAGATCTTGGAGCCACGGTTGAAAGTGCATTACACAAGAGAGCCAGTTCTATGGAAGAAAGTTGATCCAATCTTATAGAAAAATTTGTATCCAAAGTGCCAGTCAACAGAGATTTGAACGATAAATAAAAACAAATGATAGACTAGGCTGTAACATATCCCCTGTCCCCTCTGACCTTAAACATATCCTTTTCCAATGGTTTTTACTTACTTTTTTGTTTTCCATCAGAACTTCCAATCATGATTGTTAACGATTACACGTTGTCATGTGGATTATTATTAGAACCCATAACATACGAAACTTCCGAACGAAGCATTCCACAAATTTTTGGCTATATGAACGATGCAAAAACACGATTCATATTTTCCTCTGCCATCATGCATAAAACACACAAATTAGGGATTAAACACATGAAAAGACTTTTTTTTTTTTTTTTTTTTTNTAACAACATTCACTGTGTTTAAACTTCCATTTGATTTGTATGGAATAAAAATATTGGTTTTCCAATAATATATTGGCATCTTAAATTGTAAATAAGAATGTCTAAATTGAAGAGAAACCCAATACAAGTCCTCTTACGAATGTTGATTCGATGTTTAAATTTAGATAATTAAATCTATTTTCCAGTATTCAATATTTATAATGTCTTAGTGTGTCATGTTCTAATTTTTTAGAATCGGTGTCATATCTGTGACGCATGTCCACATCTTCGCTCCTTATATTATAATCTTGCTGGAAAGTATTTGAACACTTTGAGAGTTTTTTCCTATCTGATGTTTTCTCCCCTTCATTATTCTAAAAATTTATCTACTGCAGTGGGCTGAGGTTTTGTCTGCAGATGCATTCTCTGCATTTGAGGATGCTGGGTTGAATGATAGAAAGGTATGTTGGAATGAGGTTGGTCGTATTTATTAGCCTTGGTTTTTGAATTTGTTGTGTACTTTCTTAGAACAGAGAAACATCCTATGTACGTTGGGTTTGTCTTCAAGGGGACTGGTTAAATGCTTTACTTCATCGTTTACTGTAGTCTAAAAGTTGAATTTGCCATGTGATGCACGCCTGATTAAGCAAGCAAATCAAGGCAATCTCACTCGTGTGCGCTTGCCTACCTAGCTTTTTTGGGGCATAAAGCCCGGTACCTCGATTTTTTTTAATTATTATTATTTTTAATTTGCAACTCTTTATCTCTCATGCCATTGAAAACCAAGCCGCATTTTTTAAATTTCTATTTAATTTTACCTGTTAATGTCGAATAGTAATAAAAATATTGTTATTGTGCGCCTCACCTTATTAATACTTGTGCTTTTTTCCCCCTCCCCTCTACAGGCTGTAGAGGAAACCGGGCGTAGATTCCGAGAAACCATTCTTGCTCTTGGAGGTGGAAGAGCTCCATCAGAGGTTAGATGTCAAACATTGTTTTTATAGAAATAATCAACATTGAAGTTGTCCATTATTGTTATAAAGCAAATTCATCTTTATTCAATTTAGGGTGTGTTTGAGGGTGATTTTGACATGACAAAATCACTTTTGCCATTGCCAAAACCACTTCCCTTTTAATTGATCATGTTTGACAAATTAAAAAATTGATTTAGGAAAACTTTAAATCACTTAAAATAGATTTTTAAGGTATCACTTGGAGGGTGATTTTGAAAAATCAATTATACTTAAATCTATTTTTTCAAAATCACTCTTGACATCACTCCCAAACACACCATTCATATATTCTAACAAGTTCTTTTTAACTAAGCACTTTAAAGTTCTGACATTTAAATTAACCCCAAAATAACTTCTTTTTTTTACTAATATAACTTTTGAATAAATCATAGTCATTATTGAAGAAAAGAATCTTAAATATGTCGTCAATATTGACCGACTTAATAGTGTCTATCATGTAGGTTTTTGTTGAATTTCGAGGGCGGGAACCTTCACCAGAGCCACTGCTCCGGCACAGTGGCCTTCTGCCTGGCATAGCGGCAGCT

mRNA sequence

ATGAAGACGAGGAGATTGGTACTGCAAGCCCTTCTTATAGCCAACATGTTAATGGCGTCCCGCATTCTACTCGCCGCTTCCGTACATCCACTCCTTAAAACGACATATTCGCTCTCCATTTCCTCTCCTAACCATTTACAGAAATCTATTCCTTGTCCCCTCTGGTCATCTTCCTTCTCCTTCTGCCTTCACAACCTTCACAACTCCGTTACCTCTTCTTCCATCCACTCTTCTTCTCCCTGTTTTTCGCTTTCTTCCCCGTCAATGGCTGCTTCTGCCGTCGTGGATGAAATTTCTCAATCGAATCCTCTTCTCCAAGACTTCTATTTTCCGCCTTTCGATGTCGTCGAAGCTAAGCATGTTCGCCCGGGCATTCTTACGCTATTGAAGAAGCTTGAGGGGGATCTGGAGGAATTAGAGCGAACAGTGGAGCCATCTTGGTCAAAGTTGGTTGAACCATTAGAGAAGATTATAGACCGTTTGAATGTGGTTTGGGGTATTGTCAACCATCTGAAGTCTGTGAAGGATACTACAGATCTTCGGACCGCCATCGAAGAAGTTCAGCCAGAGAAAGTTAAATTTCAGCTTAGGTTGGGGCAAAGCAAACCTATTTACAATGCTTTTAAAGCTATTCGAGAATCCTCCGAGTGGAAGATGCTCAATGATGCTCGCAAACGCATAGTGGAATCTCAAATAAAAGAAGCCCTTCTTCATGGCGTTACTCTTGAAGATGATAAAAGGGAGCATTTCAACAAAATTCAGCAGGAACTGGAAAGATTATCACAGAAATTTGAGGAAAATGTTTTGGATGCTACAAAGAAGTTTGAGAAGTTAGTTGTTGATAAGAATGAAATTGATGGACTGCCTTCTACTGCTCTAGGAATGGCAGCACAAACAGCAGTTTCCAAGGGTCATGAAAAGGCTACTGCAGAAAATGGTCCATGGATAATTTCTTTGGATGCTCCATGTTATCTTTCTGTCATGCAACATGCTAAGAATCGATCCTTGCGGAAGGAAATTTACTACGCTTATGTAACTCGTGCCTCTAGTGGAGAAATGGACAACACGGCAATAATTGACCAAATTTTGAAGCTTAGGTTGGAAAAGGCTAAGATTCTCAATTTCAATAATTATGCCGAGGTTAGCATGGAGACCAAAATGGCTACGGTTGAGAAAGCTGAAGAGCTTCTAGAAAAGCTTCGTAGCGCTTCCTGGAATGCTGCAGTTCAAGATATTGAAGATCTTAAGGATTTTGCAAAAAAACAAGGCGCGCCAGAAGGCAATGATTTGAACCATTGGGATATTAGCTTCTGGAGTGAGAGGCTTCGGGAGTCAAAATTTGATGTCAATGAGGAAGAACTACGGCCATTTTTTTCCTTGCCAAATGTTATGGATGGCCTTTTTAGTCTTGCAAAGACACTTTTTGAGATTGATATTCAACCAGCCGATGGTCTTGCTCCGGTTTGGAATAAAGATGTAAAATTTTACCGGGTAAATGACTCTTCTGGAAGTCCCATTGCATACCTCTATTTTGATCCATATACACGTCCATCGGAGAAAAGGGGAGGTGCTTGGATGGATGAGGTTGTTTCACGAAGCTGTGTGTTAGCACAAGATGGTGCCCCTGCAAGGTTGCCTATTGCGCATATGGTGTGCAATCAAACACCGCCAGTTGGAGAAAGACCTAGTTTAATGACATTTCGTGAGGTAGAGACAGTATTCCATGAATTTGGTCATGCACTTCAGCATATGTTGACAAAACAAGATGAAGGTTTAGTTGCTGGTATTCGTGGAATAGAATGGGATGCTGTTGAATTGCCGTCTCAGTTCATGGAAAACTGGTGCTATCATAGAGACACTTTGATGGGCATTGCAAAGCACTATGAAACAGGGGAAAGTCTCCCGGAAGAAGTGTACTTGAAACTTCTTGCAGCTAGGACATTTCGTGCAGGTTCCTTGAGTCTTCGTCAGCTTAGATTTGCAAGTATTGATTTGGAGCTTCATACTAAATATGTTCCTGGGGAGCCAGAGGCAATCTTTGTGGTTGACCAGAGAGTGTGCAAACAAACACAAGTGATACCTCCCTTAGCAGACGATAAATTCCTTTGTAGTTTCAGCCATATTTTTGCAGGTGGATATGCCGCAGGATATTACAGTTATAAGTGGGCTGAGGTTTTGTCTGCAGATGCATTCTCTGCATTTGAGGATGCTGGGTTGAATGATAGAAAGGCTGTAGAGGAAACCGGGCGTAGATTCCGAGAAACCATTCTTGCTCTTGGAGGTGGAAGAGCTCCATCAGAGGTTTTTGTTGAATTTCGAGGGCGGGAACCTTCACCAGAGCCACTGCTCCGGCACAGTGGCCTTCTGCCTGGCATAGCGGCAGCT

Coding sequence (CDS)

ATGAAGACGAGGAGATTGGTACTGCAAGCCCTTCTTATAGCCAACATGTTAATGGCGTCCCGCATTCTACTCGCCGCTTCCGTACATCCACTCCTTAAAACGACATATTCGCTCTCCATTTCCTCTCCTAACCATTTACAGAAATCTATTCCTTGTCCCCTCTGGTCATCTTCCTTCTCCTTCTGCCTTCACAACCTTCACAACTCCGTTACCTCTTCTTCCATCCACTCTTCTTCTCCCTGTTTTTCGCTTTCTTCCCCGTCAATGGCTGCTTCTGCCGTCGTGGATGAAATTTCTCAATCGAATCCTCTTCTCCAAGACTTCTATTTTCCGCCTTTCGATGTCGTCGAAGCTAAGCATGTTCGCCCGGGCATTCTTACGCTATTGAAGAAGCTTGAGGGGGATCTGGAGGAATTAGAGCGAACAGTGGAGCCATCTTGGTCAAAGTTGGTTGAACCATTAGAGAAGATTATAGACCGTTTGAATGTGGTTTGGGGTATTGTCAACCATCTGAAGTCTGTGAAGGATACTACAGATCTTCGGACCGCCATCGAAGAAGTTCAGCCAGAGAAAGTTAAATTTCAGCTTAGGTTGGGGCAAAGCAAACCTATTTACAATGCTTTTAAAGCTATTCGAGAATCCTCCGAGTGGAAGATGCTCAATGATGCTCGCAAACGCATAGTGGAATCTCAAATAAAAGAAGCCCTTCTTCATGGCGTTACTCTTGAAGATGATAAAAGGGAGCATTTCAACAAAATTCAGCAGGAACTGGAAAGATTATCACAGAAATTTGAGGAAAATGTTTTGGATGCTACAAAGAAGTTTGAGAAGTTAGTTGTTGATAAGAATGAAATTGATGGACTGCCTTCTACTGCTCTAGGAATGGCAGCACAAACAGCAGTTTCCAAGGGTCATGAAAAGGCTACTGCAGAAAATGGTCCATGGATAATTTCTTTGGATGCTCCATGTTATCTTTCTGTCATGCAACATGCTAAGAATCGATCCTTGCGGAAGGAAATTTACTACGCTTATGTAACTCGTGCCTCTAGTGGAGAAATGGACAACACGGCAATAATTGACCAAATTTTGAAGCTTAGGTTGGAAAAGGCTAAGATTCTCAATTTCAATAATTATGCCGAGGTTAGCATGGAGACCAAAATGGCTACGGTTGAGAAAGCTGAAGAGCTTCTAGAAAAGCTTCGTAGCGCTTCCTGGAATGCTGCAGTTCAAGATATTGAAGATCTTAAGGATTTTGCAAAAAAACAAGGCGCGCCAGAAGGCAATGATTTGAACCATTGGGATATTAGCTTCTGGAGTGAGAGGCTTCGGGAGTCAAAATTTGATGTCAATGAGGAAGAACTACGGCCATTTTTTTCCTTGCCAAATGTTATGGATGGCCTTTTTAGTCTTGCAAAGACACTTTTTGAGATTGATATTCAACCAGCCGATGGTCTTGCTCCGGTTTGGAATAAAGATGTAAAATTTTACCGGGTAAATGACTCTTCTGGAAGTCCCATTGCATACCTCTATTTTGATCCATATACACGTCCATCGGAGAAAAGGGGAGGTGCTTGGATGGATGAGGTTGTTTCACGAAGCTGTGTGTTAGCACAAGATGGTGCCCCTGCAAGGTTGCCTATTGCGCATATGGTGTGCAATCAAACACCGCCAGTTGGAGAAAGACCTAGTTTAATGACATTTCGTGAGGTAGAGACAGTATTCCATGAATTTGGTCATGCACTTCAGCATATGTTGACAAAACAAGATGAAGGTTTAGTTGCTGGTATTCGTGGAATAGAATGGGATGCTGTTGAATTGCCGTCTCAGTTCATGGAAAACTGGTGCTATCATAGAGACACTTTGATGGGCATTGCAAAGCACTATGAAACAGGGGAAAGTCTCCCGGAAGAAGTGTACTTGAAACTTCTTGCAGCTAGGACATTTCGTGCAGGTTCCTTGAGTCTTCGTCAGCTTAGATTTGCAAGTATTGATTTGGAGCTTCATACTAAATATGTTCCTGGGGAGCCAGAGGCAATCTTTGTGGTTGACCAGAGAGTGTGCAAACAAACACAAGTGATACCTCCCTTAGCAGACGATAAATTCCTTTGTAGTTTCAGCCATATTTTTGCAGGTGGATATGCCGCAGGATATTACAGTTATAAGTGGGCTGAGGTTTTGTCTGCAGATGCATTCTCTGCATTTGAGGATGCTGGGTTGAATGATAGAAAGGCTGTAGAGGAAACCGGGCGTAGATTCCGAGAAACCATTCTTGCTCTTGGAGGTGGAAGAGCTCCATCAGAGGTTTTTGTTGAATTTCGAGGGCGGGAACCTTCACCAGAGCCACTGCTCCGGCACAGTGGCCTTCTGCCTGGCATAGCGGCAGCT

Protein sequence

MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFSFCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGIAAA
Homology
BLAST of MS009702 vs. NCBI nr
Match: XP_022141290.1 (probable cytosolic oligopeptidase A [Momordica charantia])

HSP 1 Score: 1565.1 bits (4051), Expect = 0.0e+00
Identity = 791/795 (99.50%), Postives = 792/795 (99.62%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MKT+ LVLQALLIANMLMASRILLAASVHPLLKTTYSLS SSPNHLQKSIPCPLWSSSFS
Sbjct: 1   MKTKGLVLQALLIANMLMASRILLAASVHPLLKTTYSLSTSSPNHLQKSIPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNL NSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH
Sbjct: 61  FCLHNLRNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL
Sbjct: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV
Sbjct: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA
Sbjct: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID
Sbjct: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ
Sbjct: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
           PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPGIAAA
Sbjct: 781 PLLRHSGLLPGIAAA 795

BLAST of MS009702 vs. NCBI nr
Match: XP_038895779.1 (LOW QUALITY PROTEIN: probable cytosolic oligopeptidase A [Benincasa hispida])

HSP 1 Score: 1440.6 bits (3728), Expect = 0.0e+00
Identity = 716/795 (90.06%), Postives = 751/795 (94.47%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           M+ + +VLQALLIANMLMA RI L AS+HPL++ T+SLSISSP HL KS+PCPLWSSSFS
Sbjct: 1   MEKKSIVLQALLIANMLMAFRITLTASIHPLVRRTHSLSISSPGHLPKSLPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHN  NSVTSSSIH SS C S  +PSM AS  +DEI QSNPLLQDFYFPPFD VEAKH
Sbjct: 61  FCLHNRRNSVTSSSIHYSSSCSSRFAPSMTASGSMDEIPQSNPLLQDFYFPPFDAVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VR GILTLLKKLE DLEELERTVEPSWSKLVEPLEKI+DRLNVVWGIVNHLKSVKDT DL
Sbjct: 121 VRSGILTLLKKLESDLEELERTVEPSWSKLVEPLEKIVDRLNVVWGIVNHLKSVKDTADL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           R AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES EW  LNDARKRIVESQIKEALLHGV
Sbjct: 181 RIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESPEWNTLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLE DKR+ FNKIQQELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTA
Sbjct: 241 TLEGDKRDDFNKIQQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHE ATAE+GPWII+LDAPCYLSVMQHAKNRSLRKE+YYAY+TRASSGEMDNT+IID
Sbjct: 301 VSKGHENATAEHGPWIITLDAPCYLSVMQHAKNRSLRKEVYYAYITRASSGEMDNTSIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWN AVQD+EDL+DFAK
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNPAVQDVEDLQDFAK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGAPE NDLNHWDI+FWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDIQ
Sbjct: 421 KQGAPEANDLNHWDITFWSERLRESKFDINEEELRPFFSLPKVMDGLFSLAKTLFDIDIQ 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
           PADGLAPVW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 PADGLAPVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           AP RLP+AHMVCNQTPPVGE+PSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APTRLPVAHMVCNQTPPVGEKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           AS+DLELHTKYVPGEPE I+ VDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASVDLELHTKYVPGEPELIYAVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLND KAV+ETG RFRETILALGGG+AP EVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDIKAVKETGHRFRETILALGGGKAPLEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLL G+  A
Sbjct: 781 PLLRHSGLLSGVGTA 795

BLAST of MS009702 vs. NCBI nr
Match: XP_022922757.1 (probable cytosolic oligopeptidase A [Cucurbita moschata])

HSP 1 Score: 1435.2 bits (3714), Expect = 0.0e+00
Identity = 716/795 (90.06%), Postives = 750/795 (94.34%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MK + L LQALLIA +LMASR  L AS+ P L  TYSLS+SSP+ L KS+PCPLWSSSFS
Sbjct: 1   MKIKSLALQALLIAGLLMASRFTLTASIQPFLIRTYSLSVSSPSRLPKSLPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNLHNSVTSSSI   S C S+SSPSMAASA +DEI QSNPLLQ+FYFPPFD VEAKH
Sbjct: 61  FCLHNLHNSVTSSSILHFSSCSSISSPSMAASAALDEIPQSNPLLQNFYFPPFDAVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLK LEGDLEELERTVEPSWSKLVEPLEKI DRLNVVWGIVNHLKSVKDT DL
Sbjct: 121 VRPGILTLLKTLEGDLEELERTVEPSWSKLVEPLEKITDRLNVVWGIVNHLKSVKDTADL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           R AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES EW  LNDARKRIVESQIKEALLHGV
Sbjct: 181 RIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESPEWNTLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKR++FNKIQQELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTA
Sbjct: 241 TLEDDKRDNFNKIQQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHE AT+ENGPWII+LDAPCYLSVMQHAKNRSLR+E+YYAY+TRASSGEMDNT+IID
Sbjct: 301 VSKGHENATSENGPWIITLDAPCYLSVMQHAKNRSLREEVYYAYITRASSGEMDNTSIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDLKDF+K
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDLEDLKDFSK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGA E NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDI+
Sbjct: 421 KQGAVEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPKVMDGLFSLAKTLFDIDIE 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
            ADGLA VW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 SADGLAAVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGE+PSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGEKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYHR+TLM IAKHYETGESLPEEVY KLL ARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHRETLMSIAKHYETGESLPEEVYFKLLTARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           AS+DLELHTKYVPGEPE I+ VDQRVCKQTQV+PPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASVDLELHTKYVPGEPELIYAVDQRVCKQTQVLPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLND KAV+ETG RFRETILALGGGRAP EVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDSKAVKETGHRFRETILALGGGRAPLEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPG AAA
Sbjct: 781 PLLRHSGLLPGSAAA 795

BLAST of MS009702 vs. NCBI nr
Match: XP_022984450.1 (probable cytosolic oligopeptidase A [Cucurbita maxima])

HSP 1 Score: 1430.2 bits (3701), Expect = 0.0e+00
Identity = 712/795 (89.56%), Postives = 749/795 (94.21%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MK + L LQALLIA +LMASR  L AS+ P L  TYSLS+ SP+ L KS+PCPLWSSSFS
Sbjct: 1   MKIKSLALQALLIAGLLMASRFTLTASIQPFLVRTYSLSVPSPSRLPKSLPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNLHNSVTSSSI   S C S+S+PSMAASA +DEI QSNPLLQ+FYFPPFD VEAKH
Sbjct: 61  FCLHNLHNSVTSSSIPHFSSCSSISAPSMAASAALDEIPQSNPLLQNFYFPPFDAVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLKKLEGDL+ELERTVEPSWSKLVEPLEKI DRLNVVWGIVNHLKSVKDT DL
Sbjct: 121 VRPGILTLLKKLEGDLKELERTVEPSWSKLVEPLEKITDRLNVVWGIVNHLKSVKDTADL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           R AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES EW  LNDARKRIVESQIKEALLHGV
Sbjct: 181 RIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESPEWNTLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKR+ FNKIQQELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTA
Sbjct: 241 TLEDDKRDKFNKIQQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHE AT+ENGPWII+LDAPCYLSVMQHAKNRSLR+E+YYAY+TRASSGEMDNT+IID
Sbjct: 301 VSKGHENATSENGPWIITLDAPCYLSVMQHAKNRSLREEVYYAYITRASSGEMDNTSIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDLKDF+K
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDLEDLKDFSK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGA E NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDI+
Sbjct: 421 KQGAVEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPKVMDGLFSLAKTLFDIDIE 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
            ADGLA VW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 SADGLAAVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGE+PSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGEKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYH++TLM IAKHYETGESLPEEVY KLL ARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHKETLMSIAKHYETGESLPEEVYFKLLTARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           AS+DLELHTKYVPGEPE I+ VDQRVCK+TQV+PPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASVDLELHTKYVPGEPELIYAVDQRVCKKTQVLPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLND KAV+ETG RFRETILALGGGRAP EVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDSKAVKETGHRFRETILALGGGRAPLEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPG AAA
Sbjct: 781 PLLRHSGLLPGSAAA 795

BLAST of MS009702 vs. NCBI nr
Match: XP_008451681.1 (PREDICTED: probable cytosolic oligopeptidase A [Cucumis melo])

HSP 1 Score: 1412.9 bits (3656), Expect = 0.0e+00
Identity = 709/798 (88.85%), Postives = 749/798 (93.86%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLL-KTTYSL--SISSPNHLQKSIPCPLWSS 60
           M+ + +V QALLIA+MLMASRI L AS+HPLL + T+SL  SISSP  L KS PCPLWSS
Sbjct: 1   MEKKSVVFQALLIASMLMASRITLTASIHPLLVRRTHSLSISISSPYQLPKSFPCPLWSS 60

Query: 61  SFSFCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVE 120
           SFSFCLHN   SVTSSS+H  S C S S+P+MAA   +D+ISQSNPLLQDFYFPPFD VE
Sbjct: 61  SFSFCLHNRRKSVTSSSVHYFSSCSSHSAPTMAAFGDIDQISQSNPLLQDFYFPPFDAVE 120

Query: 121 AKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDT 180
           A HVRPGIL LLKKLEGDLEELERTVEPSWSKLVEPLEKI+DRL VVWGIV+HLKSVKDT
Sbjct: 121 ANHVRPGILALLKKLEGDLEELERTVEPSWSKLVEPLEKIVDRLTVVWGIVSHLKSVKDT 180

Query: 181 TDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALL 240
            DLR AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS+W  L+DARKRIVESQIKEALL
Sbjct: 181 ADLRIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSKWNTLDDARKRIVESQIKEALL 240

Query: 241 HGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAA 300
            GVTLE DKR++FNKI+QELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAA
Sbjct: 241 RGVTLEGDKRDNFNKIEQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAA 300

Query: 301 QTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTA 360
           QTAVSKGHE ATAENGPWII+LDAPCYLSVMQHAKNRSLR+EIYYAY+TRASSGEMDNT+
Sbjct: 301 QTAVSKGHENATAENGPWIITLDAPCYLSVMQHAKNRSLREEIYYAYITRASSGEMDNTS 360

Query: 361 IIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKD 420
           IIDQILKLR EKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDL+D
Sbjct: 361 IIDQILKLRQEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDVEDLQD 420

Query: 421 FAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEI 480
           FAK+QGAPE NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+I
Sbjct: 421 FAKRQGAPEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPRVMDGLFSLAKTLFDI 480

Query: 481 DIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLA 540
           DIQPADGLAPVW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLA
Sbjct: 481 DIQPADGLAPVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLA 540

Query: 541 QDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIR 600
           QDGAPARLPIAHMVCNQTPPVG +PSLMTFREVETVFHEFGHALQHMLTKQ EGLVAGIR
Sbjct: 541 QDGAPARLPIAHMVCNQTPPVGGKPSLMTFREVETVFHEFGHALQHMLTKQGEGLVAGIR 600

Query: 601 GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ 660
           GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ
Sbjct: 601 GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ 660

Query: 661 LRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY 720
           LRFAS+DLELHTKYVPGEPE IF VDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY
Sbjct: 661 LRFASVDLELHTKYVPGEPELIFAVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY 720

Query: 721 YSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREP 780
           YSYKWAEVLSADAFSAFEDAGLND +AV+ETG RFRET+LALGGGRAP EVFVEFRGREP
Sbjct: 721 YSYKWAEVLSADAFSAFEDAGLNDIEAVKETGHRFRETVLALGGGRAPLEVFVEFRGREP 780

Query: 781 SPEPLLRHSGLLPGIAAA 796
           SPEPLLRHSGLL G+A A
Sbjct: 781 SPEPLLRHSGLLAGLATA 798

BLAST of MS009702 vs. ExPASy Swiss-Prot
Match: Q94AM1 (Organellar oligopeptidase A, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=OOP PE=1 SV=1)

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 590/760 (77.63%), Postives = 668/760 (87.89%), Query Frame = 0

Query: 43  PNHLQKSIPCPLWSSSFSFCLHNLHNSVTSSSIHSSS-------PCFSLSSPSMAASAVV 102
           P+  +KS PCP+WSSSFSFCL     S TS+S+ SSS       P  S ++ +   S V 
Sbjct: 33  PSTFRKSYPCPIWSSSFSFCLPP-PRSTTSTSLSSSSFRPFSSPPSMSSAAAAAVESVVS 92

Query: 103 DEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLE 162
           DE   SNPLLQDF FPPFD V+A HVRPGI  LL+ LE +LEELE++VEP+W KLVEPLE
Sbjct: 93  DETLSSNPLLQDFDFPPFDSVDASHVRPGIRALLQHLEAELEELEKSVEPTWPKLVEPLE 152

Query: 163 KIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS 222
           KI+DRL VVWG++NHLK+VKDT +LR AIE+VQPEKVKFQLRLGQSKPIYNAFKAIRES 
Sbjct: 153 KIVDRLTVVWGMINHLKAVKDTPELRAAIEDVQPEKVKFQLRLGQSKPIYNAFKAIRESP 212

Query: 223 EWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKF 282
           +W  L++AR+R+VE+QIKEA+L G+ L+D+KRE FNKI+QELE+LS KF ENVLDATKKF
Sbjct: 213 DWSSLSEARQRLVEAQIKEAVLIGIALDDEKREEFNKIEQELEKLSHKFSENVLDATKKF 272

Query: 283 EKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRS 342
           EKL+ DK EI+GLP +ALG+ AQ AVSKGHE ATAENGPWII+LDAP YL VMQHAKNR+
Sbjct: 273 EKLITDKKEIEGLPPSALGLFAQAAVSKGHENATAENGPWIITLDAPSYLPVMQHAKNRA 332

Query: 343 LRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEE 402
           LR+E+Y AY++RASSG++DNTAIIDQILKLRLEKAK+L +NNYAEVSM  KMATVEKA E
Sbjct: 333 LREEVYRAYLSRASSGDLDNTAIIDQILKLRLEKAKLLGYNNYAEVSMAMKMATVEKAAE 392

Query: 403 LLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELR 462
           LLEKLRSASW+AAVQD+EDLK FAK QGA E + + HWD +FWSERLRESK+D+NEEELR
Sbjct: 393 LLEKLRSASWDAAVQDMEDLKSFAKNQGAAESDSMTHWDTTFWSERLRESKYDINEEELR 452

Query: 463 PFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYT 522
           P+FSLP VMDGLFSLAKTLF IDI+PADGLAPVWN DV+FYRV DSSG+PIAY YFDPY+
Sbjct: 453 PYFSLPKVMDGLFSLAKTLFGIDIEPADGLAPVWNNDVRFYRVKDSSGNPIAYFYFDPYS 512

Query: 523 RPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFH 582
           RPSEKRGGAWMDEVVSRS V+AQ G+  RLP+AHMVCNQTPPVG++PSLMTFREVETVFH
Sbjct: 513 RPSEKRGGAWMDEVVSRSRVMAQKGSSVRLPVAHMVCNQTPPVGDKPSLMTFREVETVFH 572

Query: 583 EFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLP 642
           EFGHALQHMLTKQDEGLVAGIR IEWDAVELPSQFMENWCYHRDTLM IAKHYETGE+LP
Sbjct: 573 EFGHALQHMLTKQDEGLVAGIRNIEWDAVELPSQFMENWCYHRDTLMSIAKHYETGETLP 632

Query: 643 EEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPP 702
           EEVY KLLAARTFRAGS SLRQL+FAS+DLELHTKYVPG PE+I+ VDQRV  +TQVIPP
Sbjct: 633 EEVYKKLLAARTFRAGSFSLRQLKFASVDLELHTKYVPGGPESIYDVDQRVSVKTQVIPP 692

Query: 703 LADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRET 762
           L +D+FLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGL+D KAV+ETG+RFR T
Sbjct: 693 LPEDRFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLDDIKAVKETGQRFRNT 752

Query: 763 ILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGIAAA 796
           ILALGGG+AP +VFVEFRGREPSPEPLLRH+GLL   A+A
Sbjct: 753 ILALGGGKAPLKVFVEFRGREPSPEPLLRHNGLLAASASA 791

BLAST of MS009702 vs. ExPASy Swiss-Prot
Match: Q949P2 (Probable cytosolic oligopeptidase A OS=Arabidopsis thaliana OX=3702 GN=CYOP PE=1 SV=1)

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 549/698 (78.65%), Postives = 623/698 (89.26%), Query Frame = 0

Query: 96  DEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLE 155
           ++   SNPLLQ+F FPPFD V+A HVRPGI  LL++LE +LE+LE+ VEPSW KLVEPLE
Sbjct: 4   EDTLSSNPLLQNFDFPPFDSVDAHHVRPGIRALLQQLEAELEQLEKAVEPSWPKLVEPLE 63

Query: 156 KIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS 215
           KIIDRL+VVWG++NHLK+VKDT +LR AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES 
Sbjct: 64  KIIDRLSVVWGMINHLKAVKDTPELRAAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESP 123

Query: 216 EWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKF 275
           +W  L++AR+R+VE+QIKEA+L G+ LEDDKRE FNKI+QELE+LS KF ENVLDATKKF
Sbjct: 124 DWNSLSEARQRLVEAQIKEAVLSGIALEDDKREEFNKIEQELEKLSHKFSENVLDATKKF 183

Query: 276 EKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRS 335
           EKL+ DK EI+GLP +ALG+ AQ AVSKGHE ATA+ GPW+I+LDAP YL VMQHAKNR+
Sbjct: 184 EKLITDKKEIEGLPPSALGLFAQAAVSKGHETATADTGPWLITLDAPSYLPVMQHAKNRA 243

Query: 336 LRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEE 395
           LR+E+Y AY++RASSG++DNTAIIDQILKLRLEKAK+L + NYAEVSM TKMATVEKA+E
Sbjct: 244 LREEVYRAYLSRASSGDLDNTAIIDQILKLRLEKAKLLGYRNYAEVSMATKMATVEKADE 303

Query: 396 LLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELR 455
           LLEKLRSASW+ AVQDIEDLK FAK QGA E + L HWDI+FWSERLRESK+D+NEEELR
Sbjct: 304 LLEKLRSASWDPAVQDIEDLKSFAKNQGAAEADSLTHWDITFWSERLRESKYDINEEELR 363

Query: 456 PFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYT 515
           P+FSLP VMD LF LAKTLF ID+ PADG+APVWN DV+FY V DSSG+P AY YFDPY+
Sbjct: 364 PYFSLPKVMDALFGLAKTLFGIDVVPADGVAPVWNSDVRFYCVKDSSGNPTAYFYFDPYS 423

Query: 516 RPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFH 575
           RPSEKR GAWMDEV SRS V+AQ G+  RLP+A MVCNQTPPVG++PSLMTFREVETVFH
Sbjct: 424 RPSEKRDGAWMDEVFSRSRVMAQKGSSVRLPVAQMVCNQTPPVGDKPSLMTFREVETVFH 483

Query: 576 EFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLP 635
           EFGHALQHMLTK+DEGLVAGIR IEWDAVELPSQFMENWCYHRDTLM IAKHY+TGE+LP
Sbjct: 484 EFGHALQHMLTKEDEGLVAGIRNIEWDAVELPSQFMENWCYHRDTLMSIAKHYQTGETLP 543

Query: 636 EEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPP 695
           E VY KLLAARTFRAGSLSLRQL+FA++DLELHTKY+PG  E I+ VDQRV  +TQVIPP
Sbjct: 544 ENVYKKLLAARTFRAGSLSLRQLKFATVDLELHTKYMPGGAETIYEVDQRVSIKTQVIPP 603

Query: 696 LADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRET 755
           L +D+FLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGL+D KAV+ETG+RFR T
Sbjct: 604 LPEDRFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLDDIKAVKETGQRFRNT 663

Query: 756 ILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGIA 794
           ILALGGG+AP +VFVEFRGREPSPEPLLRH+GLL   A
Sbjct: 664 ILALGGGKAPLKVFVEFRGREPSPEPLLRHNGLLAASA 701

BLAST of MS009702 vs. ExPASy Swiss-Prot
Match: P44573 (Oligopeptidase A OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=prlC PE=3 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 6.6e-148
Identity = 285/698 (40.83%), Postives = 423/698 (60.60%), Query Frame = 0

Query: 98  ISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKI 157
           +S SNPLL     PPF  ++ +H+RP +  L++     +E++ +    +W   + PL + 
Sbjct: 1   MSMSNPLLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVLKQPHFTWENFILPLTET 60

Query: 158 IDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEW 217
            DRLN  W  V+HL SVK++T+LR A +   P   ++   +GQ K +YNA+ A++ S+E+
Sbjct: 61  NDRLNRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYLALKNSAEF 120

Query: 218 KMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEK 277
              + A+K+ +E+ +++  L G+ L ++K++ + +I   L  L+ +F  NVLDAT  +EK
Sbjct: 121 ADYSIAQKKAIENSLRDFELSGIGLSEEKQQRYGEIVARLSELNSQFSNNVLDATMGWEK 180

Query: 278 LVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLR 337
           L+ ++ E+ GLP +AL  A Q+A SKG +        +  +L+ P YL VM + +NR+LR
Sbjct: 181 LIENEAELAGLPESALQAAQQSAESKGLK-------GYRFTLEIPSYLPVMTYCENRALR 240

Query: 338 KEIYYAYVTRAS-----SGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMA-TVE 397
           +E+Y AY TRAS     +G+ DN+ ++++IL LR+E AK+L FN Y E+S+ TKMA   +
Sbjct: 241 EEMYRAYATRASEQGPNAGKWDNSKVMEEILTLRVELAKLLGFNTYTELSLATKMAENPQ 300

Query: 398 KAEELLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNE 457
           +  + L+ L   +     +++++LK + +K+      +L  WDI F+SE+ ++  + +N+
Sbjct: 301 QVLDFLDHLAERAKPQGEKELQELKGYCEKEFGV--TELAPWDIGFYSEKQKQHLYAIND 360

Query: 458 EELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYF 517
           EELRP+F    V+ GLF L K +F I      G+   W+KDV+F+ + D +       Y 
Sbjct: 361 EELRPYFPENRVISGLFELIKRIFNIRAVERKGV-DTWHKDVRFFDLIDENDQLRGSFYL 420

Query: 518 DPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVE 577
           D Y R   KRGGAWMD+ + R   L  DG+    P+A++ CN   P+G +P+L T  EV 
Sbjct: 421 DLYAR-EHKRGGAWMDDCIGRKRKL--DGS-IETPVAYLTCNFNAPIGNKPALFTHNEVT 480

Query: 578 TVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETG 637
           T+FHEFGH + HMLT+ D   VAGI G+ WDAVELPSQFMENWC+  + L  I+ HYETG
Sbjct: 481 TLFHEFGHGIHHMLTQIDVSDVAGINGVPWDAVELPSQFMENWCWEEEALAFISGHYETG 540

Query: 638 ESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQ 697
           E LP+E   +LL A+ F+A    LRQL F   D  LH  +   +   I    + V  Q  
Sbjct: 541 EPLPKEKLTQLLKAKNFQAAMFILRQLEFGIFDFRLHHTFDAEKTNQILDTLKSVKSQVA 600

Query: 698 VIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRR 757
           VI  +   +   SFSHIFAGGYAAGYYSY WAEVLSADA+S FE+ G+ +      TG+ 
Sbjct: 601 VIKGVDWARAPHSFSHIFAGGYAAGYYSYLWAEVLSADAYSRFEEEGIFN----PITGKS 660

Query: 758 FRETILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLL 790
           F + IL  GG   P E+F  FRGREP  + LLRH G++
Sbjct: 661 FLDEILTRGGSEEPMELFKRFRGREPQLDALLRHKGIM 680

BLAST of MS009702 vs. ExPASy Swiss-Prot
Match: P27237 (Oligopeptidase A OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=prlC PE=1 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 1.9e-139
Identity = 269/697 (38.59%), Postives = 415/697 (59.54%), Query Frame = 0

Query: 101 SNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEP-SWSKLVEPLEKIID 160
           +NPLL  F  PPF  ++ +HV P +   L      +E +     P SW  L +PL +  D
Sbjct: 2   TNPLLTSFSLPPFSAIKPEHVVPAVTKALADCRAAVEGVVAHGAPYSWENLCQPLAEADD 61

Query: 161 RLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKM 220
            L  ++  ++HL SVK++ +LR A E+  P   ++   +GQ + +YNA++ +R+   +  
Sbjct: 62  VLGRIFSPISHLNSVKNSPELREAYEQTLPLLSEYSTWVGQHEGLYNAYRDLRDGDHYAT 121

Query: 221 LNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLV 280
           LN A+K+ V++ +++  L G+ L  +K++ + +I   L  L  ++  NVLDAT  + KL+
Sbjct: 122 LNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIATRLSELGNQYSNNVLDATMGWTKLI 181

Query: 281 VDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKE 340
            D+ E+ G+P +AL  A   A +K       E   ++++LD P YL VM +  N++LR+E
Sbjct: 182 TDEAELAGMPESALAAAKAQAEAK-------EQEGYLLTLDIPSYLPVMTYCDNQALREE 241

Query: 341 IYYAYVTRAS-----SGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAE 400
           +Y AY TRAS     +G+ DN+ ++++IL LR E A++L F NYA  S+ TKMA  E  +
Sbjct: 242 MYRAYSTRASDQGPNAGKWDNSPVMEEILALRHELAQLLGFENYAHESLATKMA--ENPQ 301

Query: 401 ELLEKLRSASWNAAVQ---DIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNE 460
           ++L+ L   +  A  Q   ++  L+ FAK +   E  +L  WDI+++SE+ ++  + +++
Sbjct: 302 QVLDFLTDLAKRARPQGEKELAQLRAFAKAEFGVE--ELQPWDIAYYSEKQKQHLYSISD 361

Query: 461 EELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYF 520
           E+LRP+F     ++GLF + K ++ I  +    +  VW+ +V+F+ + D +       Y 
Sbjct: 362 EQLRPYFPENKAVNGLFEVVKRIYGITAKERTDV-DVWHPEVRFFELYDENNELRGSFYL 421

Query: 521 DPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVE 580
           D Y R   KRGGAWMD+ V +   + +     + P+A++ CN   PV  +P+L T  EV 
Sbjct: 422 DLYAR-EHKRGGAWMDDCVGQ---MRKADGTLQKPVAYLTCNFNRPVNGKPALFTHDEVI 481

Query: 581 TVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETG 640
           T+FHEFGH L HMLT+ +   V+GI G+ WDAVELPSQFMENWC+  + L  I+ HYETG
Sbjct: 482 TLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGHYETG 541

Query: 641 ESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQ 700
           E LP+E+  K+LAA+ ++A    LRQL F   D  LH ++ P +   I      + KQ  
Sbjct: 542 EPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFNPQQGAKILETLFEIKKQVA 601

Query: 701 VIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRR 760
           V+P     +F  +FSHIFAGGYAAGYYSY WA+VL+ADA+S FE+ G+ +R    ETG+ 
Sbjct: 602 VVPSPTWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAYSRFEEEGIFNR----ETGQS 661

Query: 761 FRETILALGGGRAPSEVFVEFRGREPSPEPLLRHSGL 789
           F + IL  GG   P E+F  FRGREP  + +L H G+
Sbjct: 662 FLDNILTRGGSEEPMELFKRFRGREPQLDAMLEHYGI 678

BLAST of MS009702 vs. ExPASy Swiss-Prot
Match: P27298 (Oligopeptidase A OS=Escherichia coli (strain K12) OX=83333 GN=prlC PE=3 SV=3)

HSP 1 Score: 494.2 bits (1271), Expect = 2.8e-138
Identity = 270/697 (38.74%), Postives = 418/697 (59.97%), Query Frame = 0

Query: 101 SNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEP-SWSKLVEPLEKIID 160
           +NPLL  F  PPF  +  +HV P +   L     ++E +     P +W  L +PL ++ D
Sbjct: 2   TNPLLTPFELPPFSKILPEHVVPAVTKALNDCRENVERVVAQGAPYTWENLCQPLAEVDD 61

Query: 161 RLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKM 220
            L  ++  V+HL SVK++ +LR A E+  P   ++   +GQ + +Y A++ +R+   +  
Sbjct: 62  VLGRIFSPVSHLNSVKNSPELREAYEQTLPLLSEYSTWVGQHEGLYKAYRDLRDGDHYAT 121

Query: 221 LNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLV 280
           LN A+K+ V++ +++  L G+ L  +K++ + +I   L  L  ++  NVLDAT  + KLV
Sbjct: 122 LNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIATRLSELGNQYSNNVLDATMGWTKLV 181

Query: 281 VDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKE 340
            D+ E+ G+P +AL  A   A +K       E   ++++LD P YL VM +  N++LR+E
Sbjct: 182 TDEAELAGMPESALAAAKAQAEAK-------ELEGYLLTLDIPSYLPVMTYCDNQALREE 241

Query: 341 IYYAYVTRAS-----SGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAE 400
           +Y AY TRAS     +G+ DN+ ++++IL LR E A++L F NYA  S+ TKMA  E  +
Sbjct: 242 MYRAYSTRASDQGPNAGKWDNSKVMEEILALRHELAQLLGFENYAFKSLATKMA--ENPQ 301

Query: 401 ELLEKLRSASWNAAVQ---DIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNE 460
           ++L+ L   +  A  Q   ++  L+ FAK +   +  +L  WDI+++SE+ ++  + +++
Sbjct: 302 QVLDFLTDLAKRARPQGEKELAQLRAFAKAEFGVD--ELQPWDIAYYSEKQKQHLYSISD 361

Query: 461 EELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYF 520
           E+LRP+F     ++GLF + K ++ I  +    +  VW+ DV+F+ + D +       Y 
Sbjct: 362 EQLRPYFPENKAVNGLFEVVKRIYGITAKERKDV-DVWHPDVRFFELYDENNELRGSFYL 421

Query: 521 DPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVE 580
           D Y R   KRGGAWMD+ V +  +   DG+  + P+A++ CN   PV  +P+L T  EV 
Sbjct: 422 DLYAR-ENKRGGAWMDDCVGQ--MRKADGS-LQKPVAYLTCNFNRPVNGKPALFTHDEVI 481

Query: 581 TVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETG 640
           T+FHEFGH L HMLT+ +   V+GI G+ WDAVELPSQFMENWC+  + L  I+ HYETG
Sbjct: 482 TLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGHYETG 541

Query: 641 ESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQ 700
           E LP+E+  K+LAA+ ++A    LRQL F   D  LH ++ P +   I      + K   
Sbjct: 542 EPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFRPDQGAKILETLAEIKKLVA 601

Query: 701 VIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRR 760
           V+P  +  +F  +FSHIFAGGYAAGYYSY WA+VL+ADAFS FE+ G+ +R    ETG+ 
Sbjct: 602 VVPSPSWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFEEEGIFNR----ETGQS 661

Query: 761 FRETILALGGGRAPSEVFVEFRGREPSPEPLLRHSGL 789
           F + IL+ GG   P ++F  FRGREP  + +L H G+
Sbjct: 662 FLDNILSRGGSEEPMDLFKRFRGREPQLDAMLEHYGI 678

BLAST of MS009702 vs. ExPASy TrEMBL
Match: A0A6J1CHM9 (probable cytosolic oligopeptidase A OS=Momordica charantia OX=3673 GN=LOC111011725 PE=3 SV=1)

HSP 1 Score: 1565.1 bits (4051), Expect = 0.0e+00
Identity = 791/795 (99.50%), Postives = 792/795 (99.62%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MKT+ LVLQALLIANMLMASRILLAASVHPLLKTTYSLS SSPNHLQKSIPCPLWSSSFS
Sbjct: 1   MKTKGLVLQALLIANMLMASRILLAASVHPLLKTTYSLSTSSPNHLQKSIPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNL NSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH
Sbjct: 61  FCLHNLRNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL
Sbjct: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV
Sbjct: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA
Sbjct: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID
Sbjct: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ
Sbjct: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
           PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPGIAAA
Sbjct: 781 PLLRHSGLLPGIAAA 795

BLAST of MS009702 vs. ExPASy TrEMBL
Match: A0A6J1E479 (probable cytosolic oligopeptidase A OS=Cucurbita moschata OX=3662 GN=LOC111430657 PE=3 SV=1)

HSP 1 Score: 1435.2 bits (3714), Expect = 0.0e+00
Identity = 716/795 (90.06%), Postives = 750/795 (94.34%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MK + L LQALLIA +LMASR  L AS+ P L  TYSLS+SSP+ L KS+PCPLWSSSFS
Sbjct: 1   MKIKSLALQALLIAGLLMASRFTLTASIQPFLIRTYSLSVSSPSRLPKSLPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNLHNSVTSSSI   S C S+SSPSMAASA +DEI QSNPLLQ+FYFPPFD VEAKH
Sbjct: 61  FCLHNLHNSVTSSSILHFSSCSSISSPSMAASAALDEIPQSNPLLQNFYFPPFDAVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLK LEGDLEELERTVEPSWSKLVEPLEKI DRLNVVWGIVNHLKSVKDT DL
Sbjct: 121 VRPGILTLLKTLEGDLEELERTVEPSWSKLVEPLEKITDRLNVVWGIVNHLKSVKDTADL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           R AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES EW  LNDARKRIVESQIKEALLHGV
Sbjct: 181 RIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESPEWNTLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKR++FNKIQQELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTA
Sbjct: 241 TLEDDKRDNFNKIQQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHE AT+ENGPWII+LDAPCYLSVMQHAKNRSLR+E+YYAY+TRASSGEMDNT+IID
Sbjct: 301 VSKGHENATSENGPWIITLDAPCYLSVMQHAKNRSLREEVYYAYITRASSGEMDNTSIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDLKDF+K
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDLEDLKDFSK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGA E NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDI+
Sbjct: 421 KQGAVEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPKVMDGLFSLAKTLFDIDIE 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
            ADGLA VW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 SADGLAAVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGE+PSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGEKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYHR+TLM IAKHYETGESLPEEVY KLL ARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHRETLMSIAKHYETGESLPEEVYFKLLTARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           AS+DLELHTKYVPGEPE I+ VDQRVCKQTQV+PPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASVDLELHTKYVPGEPELIYAVDQRVCKQTQVLPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLND KAV+ETG RFRETILALGGGRAP EVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDSKAVKETGHRFRETILALGGGRAPLEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPG AAA
Sbjct: 781 PLLRHSGLLPGSAAA 795

BLAST of MS009702 vs. ExPASy TrEMBL
Match: A0A6J1J8N1 (probable cytosolic oligopeptidase A OS=Cucurbita maxima OX=3661 GN=LOC111482754 PE=3 SV=1)

HSP 1 Score: 1430.2 bits (3701), Expect = 0.0e+00
Identity = 712/795 (89.56%), Postives = 749/795 (94.21%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLLKTTYSLSISSPNHLQKSIPCPLWSSSFS 60
           MK + L LQALLIA +LMASR  L AS+ P L  TYSLS+ SP+ L KS+PCPLWSSSFS
Sbjct: 1   MKIKSLALQALLIAGLLMASRFTLTASIQPFLVRTYSLSVPSPSRLPKSLPCPLWSSSFS 60

Query: 61  FCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKH 120
           FCLHNLHNSVTSSSI   S C S+S+PSMAASA +DEI QSNPLLQ+FYFPPFD VEAKH
Sbjct: 61  FCLHNLHNSVTSSSIPHFSSCSSISAPSMAASAALDEIPQSNPLLQNFYFPPFDAVEAKH 120

Query: 121 VRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDL 180
           VRPGILTLLKKLEGDL+ELERTVEPSWSKLVEPLEKI DRLNVVWGIVNHLKSVKDT DL
Sbjct: 121 VRPGILTLLKKLEGDLKELERTVEPSWSKLVEPLEKITDRLNVVWGIVNHLKSVKDTADL 180

Query: 181 RTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGV 240
           R AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES EW  LNDARKRIVESQIKEALLHGV
Sbjct: 181 RIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESPEWNTLNDARKRIVESQIKEALLHGV 240

Query: 241 TLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTA 300
           TLEDDKR+ FNKIQQELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTA
Sbjct: 241 TLEDDKRDKFNKIQQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTA 300

Query: 301 VSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIID 360
           VSKGHE AT+ENGPWII+LDAPCYLSVMQHAKNRSLR+E+YYAY+TRASSGEMDNT+IID
Sbjct: 301 VSKGHENATSENGPWIITLDAPCYLSVMQHAKNRSLREEVYYAYITRASSGEMDNTSIID 360

Query: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAK 420
           QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDLKDF+K
Sbjct: 361 QILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDLEDLKDFSK 420

Query: 421 KQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQ 480
           KQGA E NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDI+
Sbjct: 421 KQGAVEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPKVMDGLFSLAKTLFDIDIE 480

Query: 481 PADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540
            ADGLA VW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG
Sbjct: 481 SADGLAAVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDG 540

Query: 541 APARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600
           APARLPIAHMVCNQTPPVGE+PSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE
Sbjct: 541 APARLPIAHMVCNQTPPVGEKPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIE 600

Query: 601 WDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRF 660
           WDAVELPSQFMENWCYH++TLM IAKHYETGESLPEEVY KLL ARTFRAGSLSLRQLRF
Sbjct: 601 WDAVELPSQFMENWCYHKETLMSIAKHYETGESLPEEVYFKLLTARTFRAGSLSLRQLRF 660

Query: 661 ASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSY 720
           AS+DLELHTKYVPGEPE I+ VDQRVCK+TQV+PPLADDKFLCSFSHIFAGGYAAGYYSY
Sbjct: 661 ASVDLELHTKYVPGEPELIYAVDQRVCKKTQVLPPLADDKFLCSFSHIFAGGYAAGYYSY 720

Query: 721 KWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPE 780
           KWAEVLSADAFSAFEDAGLND KAV+ETG RFRETILALGGGRAP EVFVEFRGREPSPE
Sbjct: 721 KWAEVLSADAFSAFEDAGLNDSKAVKETGHRFRETILALGGGRAPLEVFVEFRGREPSPE 780

Query: 781 PLLRHSGLLPGIAAA 796
           PLLRHSGLLPG AAA
Sbjct: 781 PLLRHSGLLPGSAAA 795

BLAST of MS009702 vs. ExPASy TrEMBL
Match: A0A1S3BS19 (probable cytosolic oligopeptidase A OS=Cucumis melo OX=3656 GN=LOC103492908 PE=3 SV=1)

HSP 1 Score: 1412.9 bits (3656), Expect = 0.0e+00
Identity = 709/798 (88.85%), Postives = 749/798 (93.86%), Query Frame = 0

Query: 1   MKTRRLVLQALLIANMLMASRILLAASVHPLL-KTTYSL--SISSPNHLQKSIPCPLWSS 60
           M+ + +V QALLIA+MLMASRI L AS+HPLL + T+SL  SISSP  L KS PCPLWSS
Sbjct: 1   MEKKSVVFQALLIASMLMASRITLTASIHPLLVRRTHSLSISISSPYQLPKSFPCPLWSS 60

Query: 61  SFSFCLHNLHNSVTSSSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVE 120
           SFSFCLHN   SVTSSS+H  S C S S+P+MAA   +D+ISQSNPLLQDFYFPPFD VE
Sbjct: 61  SFSFCLHNRRKSVTSSSVHYFSSCSSHSAPTMAAFGDIDQISQSNPLLQDFYFPPFDAVE 120

Query: 121 AKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDT 180
           A HVRPGIL LLKKLEGDLEELERTVEPSWSKLVEPLEKI+DRL VVWGIV+HLKSVKDT
Sbjct: 121 ANHVRPGILALLKKLEGDLEELERTVEPSWSKLVEPLEKIVDRLTVVWGIVSHLKSVKDT 180

Query: 181 TDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALL 240
            DLR AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS+W  L+DARKRIVESQIKEALL
Sbjct: 181 ADLRIAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESSKWNTLDDARKRIVESQIKEALL 240

Query: 241 HGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAA 300
            GVTLE DKR++FNKI+QELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAA
Sbjct: 241 RGVTLEGDKRDNFNKIEQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAA 300

Query: 301 QTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTA 360
           QTAVSKGHE ATAENGPWII+LDAPCYLSVMQHAKNRSLR+EIYYAY+TRASSGEMDNT+
Sbjct: 301 QTAVSKGHENATAENGPWIITLDAPCYLSVMQHAKNRSLREEIYYAYITRASSGEMDNTS 360

Query: 361 IIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKD 420
           IIDQILKLR EKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDL+D
Sbjct: 361 IIDQILKLRQEKAKILNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDVEDLQD 420

Query: 421 FAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEI 480
           FAK+QGAPE NDLNHWDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+I
Sbjct: 421 FAKRQGAPEANDLNHWDISFWSERLRESKFDINEEELRPFFSLPRVMDGLFSLAKTLFDI 480

Query: 481 DIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLA 540
           DIQPADGLAPVW+KDVKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLA
Sbjct: 481 DIQPADGLAPVWDKDVKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLA 540

Query: 541 QDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIR 600
           QDGAPARLPIAHMVCNQTPPVG +PSLMTFREVETVFHEFGHALQHMLTKQ EGLVAGIR
Sbjct: 541 QDGAPARLPIAHMVCNQTPPVGGKPSLMTFREVETVFHEFGHALQHMLTKQGEGLVAGIR 600

Query: 601 GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ 660
           GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ
Sbjct: 601 GIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQ 660

Query: 661 LRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY 720
           LRFAS+DLELHTKYVPGEPE IF VDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY
Sbjct: 661 LRFASVDLELHTKYVPGEPELIFAVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGY 720

Query: 721 YSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREP 780
           YSYKWAEVLSADAFSAFEDAGLND +AV+ETG RFRET+LALGGGRAP EVFVEFRGREP
Sbjct: 721 YSYKWAEVLSADAFSAFEDAGLNDIEAVKETGHRFRETVLALGGGRAPLEVFVEFRGREP 780

Query: 781 SPEPLLRHSGLLPGIAAA 796
           SPEPLLRHSGLL G+A A
Sbjct: 781 SPEPLLRHSGLLAGLATA 798

BLAST of MS009702 vs. ExPASy TrEMBL
Match: A0A5D3D7N4 (Putative cytosolic oligopeptidase A OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold416G00520 PE=3 SV=1)

HSP 1 Score: 1402.1 bits (3628), Expect = 0.0e+00
Identity = 701/783 (89.53%), Postives = 737/783 (94.13%), Query Frame = 0

Query: 16  MLMASRILLAASVHPLL-KTTYSL--SISSPNHLQKSIPCPLWSSSFSFCLHNLHNSVTS 75
           MLMASRI L AS+HPLL + T+SL  SISSP  L KS PCPLWSSSFSFCLHN   SVTS
Sbjct: 1   MLMASRITLTASIHPLLVRRTHSLSISISSPYQLPKSFPCPLWSSSFSFCLHNRRKSVTS 60

Query: 76  SSIHSSSPCFSLSSPSMAASAVVDEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKL 135
           SS+H  S C S S+P+MAA   +D+ISQSNPLLQDFYFPPFD VEA HVRPGIL LLKKL
Sbjct: 61  SSVHYFSSCSSHSAPTMAAFGDIDQISQSNPLLQDFYFPPFDAVEANHVRPGILALLKKL 120

Query: 136 EGDLEELERTVEPSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKV 195
           EGDLEELERTVEPSWSKLVEPLEKI+DRL VVWGIV+HLKSVKDT DLR AIEEVQPEKV
Sbjct: 121 EGDLEELERTVEPSWSKLVEPLEKIVDRLTVVWGIVSHLKSVKDTADLRIAIEEVQPEKV 180

Query: 196 KFQLRLGQSKPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNK 255
           KFQLRLGQSKPIYNAFKAIRESS+W  L+DARKRIVESQIKEALL GVTLE DKR++FNK
Sbjct: 181 KFQLRLGQSKPIYNAFKAIRESSKWNTLDDARKRIVESQIKEALLRGVTLEGDKRDNFNK 240

Query: 256 IQQELERLSQKFEENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAEN 315
           I+QELERLS KF+ENVLDATKKFEKL+VDK+E+DGLPSTALGMAAQTAVSKGHE ATAEN
Sbjct: 241 IEQELERLSHKFDENVLDATKKFEKLIVDKHEVDGLPSTALGMAAQTAVSKGHENATAEN 300

Query: 316 GPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKI 375
           GPWII+LDAPCYLSVMQHAKNRSLR+EIYYAY+TRASSGEMDNT+IIDQILKLR EKAKI
Sbjct: 301 GPWIITLDAPCYLSVMQHAKNRSLREEIYYAYITRASSGEMDNTSIIDQILKLRQEKAKI 360

Query: 376 LNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNH 435
           LNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQD+EDL+DFAK+QGAPE NDLNH
Sbjct: 361 LNFNNYAEVSMETKMATVEKAEELLEKLRSASWNAAVQDVEDLQDFAKRQGAPEANDLNH 420

Query: 436 WDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKD 495
           WDISFWSERLRESKFD+NEEELRPFFSLP VMDGLFSLAKTLF+IDIQPADGLAPVW+KD
Sbjct: 421 WDISFWSERLRESKFDINEEELRPFFSLPRVMDGLFSLAKTLFDIDIQPADGLAPVWDKD 480

Query: 496 VKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVC 555
           VKFYRVN+SSGSPIAY YFDPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVC
Sbjct: 481 VKFYRVNNSSGSPIAYFYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVC 540

Query: 556 NQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFME 615
           NQTPPVG +PSLMTFREVETVFHEFGHALQHMLTKQ EGLVAGIRGIEWDAVELPSQFME
Sbjct: 541 NQTPPVGGKPSLMTFREVETVFHEFGHALQHMLTKQGEGLVAGIRGIEWDAVELPSQFME 600

Query: 616 NWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYV 675
           NWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRFAS+DLELHTKYV
Sbjct: 601 NWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRFASVDLELHTKYV 660

Query: 676 PGEPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFS 735
           PGEPE IF VDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFS
Sbjct: 661 PGEPELIFAVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFS 720

Query: 736 AFEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGI 795
           AFEDAGLND +AV+ETG RFRET+LALGGGRAP EVFVEFRGREPSPEPLLRHSGLL G+
Sbjct: 721 AFEDAGLNDIEAVKETGHRFRETVLALGGGRAPLEVFVEFRGREPSPEPLLRHSGLLAGL 780

BLAST of MS009702 vs. TAIR 10
Match: AT5G65620.1 (Zincin-like metalloproteases family protein )

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 590/760 (77.63%), Postives = 668/760 (87.89%), Query Frame = 0

Query: 43  PNHLQKSIPCPLWSSSFSFCLHNLHNSVTSSSIHSSS-------PCFSLSSPSMAASAVV 102
           P+  +KS PCP+WSSSFSFCL     S TS+S+ SSS       P  S ++ +   S V 
Sbjct: 33  PSTFRKSYPCPIWSSSFSFCLPP-PRSTTSTSLSSSSFRPFSSPPSMSSAAAAAVESVVS 92

Query: 103 DEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLE 162
           DE   SNPLLQDF FPPFD V+A HVRPGI  LL+ LE +LEELE++VEP+W KLVEPLE
Sbjct: 93  DETLSSNPLLQDFDFPPFDSVDASHVRPGIRALLQHLEAELEELEKSVEPTWPKLVEPLE 152

Query: 163 KIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS 222
           KI+DRL VVWG++NHLK+VKDT +LR AIE+VQPEKVKFQLRLGQSKPIYNAFKAIRES 
Sbjct: 153 KIVDRLTVVWGMINHLKAVKDTPELRAAIEDVQPEKVKFQLRLGQSKPIYNAFKAIRESP 212

Query: 223 EWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKF 282
           +W  L++AR+R+VE+QIKEA+L G+ L+D+KRE FNKI+QELE+LS KF ENVLDATKKF
Sbjct: 213 DWSSLSEARQRLVEAQIKEAVLIGIALDDEKREEFNKIEQELEKLSHKFSENVLDATKKF 272

Query: 283 EKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRS 342
           EKL+ DK EI+GLP +ALG+ AQ AVSKGHE ATAENGPWII+LDAP YL VMQHAKNR+
Sbjct: 273 EKLITDKKEIEGLPPSALGLFAQAAVSKGHENATAENGPWIITLDAPSYLPVMQHAKNRA 332

Query: 343 LRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEE 402
           LR+E+Y AY++RASSG++DNTAIIDQILKLRLEKAK+L +NNYAEVSM  KMATVEKA E
Sbjct: 333 LREEVYRAYLSRASSGDLDNTAIIDQILKLRLEKAKLLGYNNYAEVSMAMKMATVEKAAE 392

Query: 403 LLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELR 462
           LLEKLRSASW+AAVQD+EDLK FAK QGA E + + HWD +FWSERLRESK+D+NEEELR
Sbjct: 393 LLEKLRSASWDAAVQDMEDLKSFAKNQGAAESDSMTHWDTTFWSERLRESKYDINEEELR 452

Query: 463 PFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYT 522
           P+FSLP VMDGLFSLAKTLF IDI+PADGLAPVWN DV+FYRV DSSG+PIAY YFDPY+
Sbjct: 453 PYFSLPKVMDGLFSLAKTLFGIDIEPADGLAPVWNNDVRFYRVKDSSGNPIAYFYFDPYS 512

Query: 523 RPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFH 582
           RPSEKRGGAWMDEVVSRS V+AQ G+  RLP+AHMVCNQTPPVG++PSLMTFREVETVFH
Sbjct: 513 RPSEKRGGAWMDEVVSRSRVMAQKGSSVRLPVAHMVCNQTPPVGDKPSLMTFREVETVFH 572

Query: 583 EFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLP 642
           EFGHALQHMLTKQDEGLVAGIR IEWDAVELPSQFMENWCYHRDTLM IAKHYETGE+LP
Sbjct: 573 EFGHALQHMLTKQDEGLVAGIRNIEWDAVELPSQFMENWCYHRDTLMSIAKHYETGETLP 632

Query: 643 EEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPP 702
           EEVY KLLAARTFRAGS SLRQL+FAS+DLELHTKYVPG PE+I+ VDQRV  +TQVIPP
Sbjct: 633 EEVYKKLLAARTFRAGSFSLRQLKFASVDLELHTKYVPGGPESIYDVDQRVSVKTQVIPP 692

Query: 703 LADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRET 762
           L +D+FLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGL+D KAV+ETG+RFR T
Sbjct: 693 LPEDRFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLDDIKAVKETGQRFRNT 752

Query: 763 ILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGIAAA 796
           ILALGGG+AP +VFVEFRGREPSPEPLLRH+GLL   A+A
Sbjct: 753 ILALGGGKAPLKVFVEFRGREPSPEPLLRHNGLLAASASA 791

BLAST of MS009702 vs. TAIR 10
Match: AT5G10540.1 (Zincin-like metalloproteases family protein )

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 549/698 (78.65%), Postives = 623/698 (89.26%), Query Frame = 0

Query: 96  DEISQSNPLLQDFYFPPFDVVEAKHVRPGILTLLKKLEGDLEELERTVEPSWSKLVEPLE 155
           ++   SNPLLQ+F FPPFD V+A HVRPGI  LL++LE +LE+LE+ VEPSW KLVEPLE
Sbjct: 4   EDTLSSNPLLQNFDFPPFDSVDAHHVRPGIRALLQQLEAELEQLEKAVEPSWPKLVEPLE 63

Query: 156 KIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESS 215
           KIIDRL+VVWG++NHLK+VKDT +LR AIEEVQPEKVKFQLRLGQSKPIYNAFKAIRES 
Sbjct: 64  KIIDRLSVVWGMINHLKAVKDTPELRAAIEEVQPEKVKFQLRLGQSKPIYNAFKAIRESP 123

Query: 216 EWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFEENVLDATKKF 275
           +W  L++AR+R+VE+QIKEA+L G+ LEDDKRE FNKI+QELE+LS KF ENVLDATKKF
Sbjct: 124 DWNSLSEARQRLVEAQIKEAVLSGIALEDDKREEFNKIEQELEKLSHKFSENVLDATKKF 183

Query: 276 EKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYLSVMQHAKNRS 335
           EKL+ DK EI+GLP +ALG+ AQ AVSKGHE ATA+ GPW+I+LDAP YL VMQHAKNR+
Sbjct: 184 EKLITDKKEIEGLPPSALGLFAQAAVSKGHETATADTGPWLITLDAPSYLPVMQHAKNRA 243

Query: 336 LRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMETKMATVEKAEE 395
           LR+E+Y AY++RASSG++DNTAIIDQILKLRLEKAK+L + NYAEVSM TKMATVEKA+E
Sbjct: 244 LREEVYRAYLSRASSGDLDNTAIIDQILKLRLEKAKLLGYRNYAEVSMATKMATVEKADE 303

Query: 396 LLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGNDLNHWDISFWSERLRESKFDVNEEELR 455
           LLEKLRSASW+ AVQDIEDLK FAK QGA E + L HWDI+FWSERLRESK+D+NEEELR
Sbjct: 304 LLEKLRSASWDPAVQDIEDLKSFAKNQGAAEADSLTHWDITFWSERLRESKYDINEEELR 363

Query: 456 PFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDSSGSPIAYLYFDPYT 515
           P+FSLP VMD LF LAKTLF ID+ PADG+APVWN DV+FY V DSSG+P AY YFDPY+
Sbjct: 364 PYFSLPKVMDALFGLAKTLFGIDVVPADGVAPVWNSDVRFYCVKDSSGNPTAYFYFDPYS 423

Query: 516 RPSEKRGGAWMDEVVSRSCVLAQDGAPARLPIAHMVCNQTPPVGERPSLMTFREVETVFH 575
           RPSEKR GAWMDEV SRS V+AQ G+  RLP+A MVCNQTPPVG++PSLMTFREVETVFH
Sbjct: 424 RPSEKRDGAWMDEVFSRSRVMAQKGSSVRLPVAQMVCNQTPPVGDKPSLMTFREVETVFH 483

Query: 576 EFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWCYHRDTLMGIAKHYETGESLP 635
           EFGHALQHMLTK+DEGLVAGIR IEWDAVELPSQFMENWCYHRDTLM IAKHY+TGE+LP
Sbjct: 484 EFGHALQHMLTKEDEGLVAGIRNIEWDAVELPSQFMENWCYHRDTLMSIAKHYQTGETLP 543

Query: 636 EEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPGEPEAIFVVDQRVCKQTQVIPP 695
           E VY KLLAARTFRAGSLSLRQL+FA++DLELHTKY+PG  E I+ VDQRV  +TQVIPP
Sbjct: 544 ENVYKKLLAARTFRAGSLSLRQLKFATVDLELHTKYMPGGAETIYEVDQRVSIKTQVIPP 603

Query: 696 LADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLNDRKAVEETGRRFRET 755
           L +D+FLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGL+D KAV+ETG+RFR T
Sbjct: 604 LPEDRFLCSFSHIFAGGYAAGYYSYKWAEVLSADAFSAFEDAGLDDIKAVKETGQRFRNT 663

Query: 756 ILALGGGRAPSEVFVEFRGREPSPEPLLRHSGLLPGIA 794
           ILALGGG+AP +VFVEFRGREPSPEPLLRH+GLL   A
Sbjct: 664 ILALGGGKAPLKVFVEFRGREPSPEPLLRHNGLLAASA 701

BLAST of MS009702 vs. TAIR 10
Match: AT5G51540.1 (Zincin-like metalloproteases family protein )

HSP 1 Score: 170.2 bits (430), Expect = 6.5e-42
Identity = 153/656 (23.32%), Postives = 287/656 (43.75%), Query Frame = 0

Query: 145 PSWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVK---FQLRLGQS 204
           PS  ++++ +++I D    V  +V+  +  + T   R  +EE     ++   +   L  +
Sbjct: 67  PSSPEIIKAMDEISD---TVCCVVDSAELCRQTHPDREFVEEANKAAIEMNDYLHHLNTN 126

Query: 205 KPIYNAFKAIRESSEWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLS 264
             +Y A K   + S   +L     R       +    G+ L+ +K +  N +   + +L 
Sbjct: 127 HTLYAAVKKAEQDS--NLLTKEASRTAHHLRMDFERGGIHLDPEKLDKVNNLTTNIFQLC 186

Query: 265 QKFEENVLDATKKFEKLVVDKNEIDGLPSTAL----------GMAAQTAVSKGHEKAT-- 324
           ++F EN+ D          D   +D  P + +             + +  S+G  ++   
Sbjct: 187 REFSENIAD----------DPGHVDIFPGSRIPRHLHHLLNPTYRSTSGGSRGSTRSAHK 246

Query: 325 AENGPWIISLDAPCYLSVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEK 384
           ++   + I+ D     S++Q   +  +RK +Y     + +S    N  ++++++  R E 
Sbjct: 247 SKQKGFRINTDPRTVSSILQWTSDEEVRKMVY----IQGNSVPHANHGVLEKLIAARHEL 306

Query: 385 AKILNFNNYAEVSMETKMATVEK-AEELLEKLRSASWNAAVQDIEDLKDFAKKQGAPEGN 444
           ++++  N+YA++ +E  +A   K     L++L       A ++   ++DF +++      
Sbjct: 307 SQMMGCNSYADIMVEPNLAKSPKVVTSFLQELSKTVKPKADEEFIAIRDFKREKCGNPSA 366

Query: 445 DLNHWDISFWSERLRESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAP- 504
           +L  WD ++++  ++ S  DV+   +  +F LP  ++GL  L ++LF         LAP 
Sbjct: 367 ELEPWDETYYTSMMKSSINDVDTAVVASYFPLPQCIEGLKVLVESLFGATFHTIP-LAPG 426

Query: 505 -VWNKDVKFYRVNDSSGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLAQDGAPARLP 564
             W+ +V    ++      + YLY D Y+R  +  G A       R           +LP
Sbjct: 427 ESWHPNVVKLSLHHPDEGDLGYLYLDLYSRKGKYPGCASFAIRGGRKI----SETEYQLP 486

Query: 565 IAHMVCNQTPPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVEL 624
           +  +VCN +         +   EVE +FHEFGHAL  +L++ D    +G R +  D  E+
Sbjct: 487 VIALVCNFSRACDSSIVKLNHSEVEVLFHEFGHALHSLLSRTDYQHFSGTR-VALDLAEM 546

Query: 625 PSQFMENWCYHRDTLMGIAKHYETGESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLE 684
           PS   E + +    L   A+HY TGE++PE++   L  AR   A +   RQ+ +A ID  
Sbjct: 547 PSNLFEYYAWDYRLLKRFARHYSTGETIPEKLVNSLQGARNMFAATEMQRQVFYALIDQM 606

Query: 685 LHTKYVPGEPEAIFVVDQRVC---KQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWA 744
           L  +    +PE    V   V    +Q      +    +   FSH+    Y AGYYSY +A
Sbjct: 607 LFGE----QPETARDVSHLVAELKRQHTSWNHVEGTHWYIRFSHLL--NYGAGYYSYLYA 666

Query: 745 EVLSADAFSAF---EDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGRE 777
           +  ++  + +    +   LN       TG   RE     GG + P+E+  +  G+E
Sbjct: 667 KCFASTIWQSICEEDPLSLN-------TGTLLREKFFKHGGAKDPAELLTDLAGKE 684

BLAST of MS009702 vs. TAIR 10
Match: AT1G67690.1 (Zincin-like metalloproteases family protein )

HSP 1 Score: 167.2 bits (422), Expect = 5.5e-41
Identity = 154/650 (23.69%), Postives = 299/650 (46.00%), Query Frame = 0

Query: 146 SWSKLVEPLEKIIDRLNVVWGIVNHLKSVKDTTDLRTAIEEVQPEKVKFQLRLGQSKPIY 205
           S+  +V PL ++  R   +       K +    ++R A  E + +     L   + + +Y
Sbjct: 97  SYENVVLPLAELEARQLSLIQCCVFPKMLSPHDNVRKASTEAEQKIDAHILSCRKREDVY 156

Query: 206 NAFKAIRESSEWKMLNDARKRIVESQIKEALLHGVTLEDDKREHFNKIQQELERLSQKFE 265
              K      E   ++   K  ++  +++   +G+ L   KRE   +++ E++ LS ++ 
Sbjct: 157 RIIKIYAAKGE--SISPEAKCYLQCLVRDFEDNGLNLTAIKREEVERLKYEIDELSLRYI 216

Query: 266 ENVLDATKKFEKLVVDKNEIDGLPSTALGMAAQTAVSKGHEKATAENGPWIISLDAPCYL 325
           +N+    +    L   ++E+ GLP   L    +T           +N  + ++L++    
Sbjct: 217 QNL---NEDSSCLFFTEDELAGLPLEFLQNLEKT-----------QNKEFKLTLESRHVA 276

Query: 326 SVMQHAKNRSLRKEIYYAYVTRASSGEMDNTAIIDQILKLRLEKAKILNFNNYAEVSMET 385
           ++++  K    RK +  AY  R       N  ++ ++++ R   A +  + ++A+ +++ 
Sbjct: 277 AILELCKIAKTRKTVAMAYGKRCGD---TNIPVLQRLVQSRHRLACVCGYAHFADYALDR 336

Query: 386 KMA-TVEKAEELLEKLRSASWNAAVQDIEDLKDFAKKQGA--PEGNDLNHWDISFWSERL 445
           +M+ T  +    LE + S+  + A+++   L+D  +K+    P G +    D+ ++ +R+
Sbjct: 337 RMSKTSMRVIRFLEDISSSLTDLAIREFSILEDLKRKEEGEIPFGVE----DLLYYIKRV 396

Query: 446 RESKFDVNEEELRPFFSLPNVMDGLFSLAKTLFEIDIQPADGLAPVWNKDVKFYRVNDS- 505
            E +FD++  ++R +F +  V+ G+F + + LF I  +    +  VW  D++ + V DS 
Sbjct: 397 EELQFDLDFGDIRQYFPVNLVLSGIFKICQDLFGIKFEEVTEV-DVWYHDIRAFAVFDSG 456

Query: 506 SGSPIAYLYFDPYTRPSEKRGGAWMDEVVSRSCVLA-QDGA-----PARLPIAHMVCNQT 565
           SG  + Y Y D +TR  +           + SCV+A Q+ A       ++P+A ++    
Sbjct: 457 SGKLLGYFYLDMFTREGK----------CNHSCVVALQNNALFSNGACQIPVALLIAQFA 516

Query: 566 PPVGERPSLMTFREVETVFHEFGHALQHMLTKQDEGLVAGIRGIEWDAVELPSQFMENWC 625
                    + F +V  +FHEFGH +QH+  +      +G+R ++ D  E+PSQ +ENWC
Sbjct: 517 KDGSGEAVPLGFSDVVNLFHEFGHVVQHICNRASFARFSGLR-VDPDFREIPSQLLENWC 576

Query: 626 YHRDTLMGIAKH-YETGESLPEEVYLKLLAARTFRAGSLSLRQLRFASIDLELHTKYVPG 685
           Y   TL  I+ +  +  + L +EV   L   R   +   SL+++ +   D  +   Y   
Sbjct: 577 YESFTLKLISGYRQDITKPLVDEVCKTLKRWRYSFSALKSLQEILYCLFDQII---YSDD 636

Query: 686 EPEAIFVVDQRVCKQTQVIPPLADDKFLCSFSHIFAGGYAAGYYSYKWAEVLSADAF-SA 745
           + + + ++     K    +P +        F     G  A   YS  W+EV +AD F S 
Sbjct: 637 DADLLQLIRSLHPKVMIGLPVVEGTNPASCFPRAVIGSEAT-CYSRLWSEVYAADIFASK 696

Query: 746 FEDAGLNDRKAVEETGRRFRETILALGGGRAPSEVFVEFRGREPSPEPLL 784
           F D   N        G +FR+ +LA GGG+ P E+   F GREPS +  +
Sbjct: 697 FGDGHPN-----LYAGLQFRDKVLAPGGGKEPMELLTNFLGREPSTQAFI 702

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022141290.10.0e+0099.50probable cytosolic oligopeptidase A [Momordica charantia][more]
XP_038895779.10.0e+0090.06LOW QUALITY PROTEIN: probable cytosolic oligopeptidase A [Benincasa hispida][more]
XP_022922757.10.0e+0090.06probable cytosolic oligopeptidase A [Cucurbita moschata][more]
XP_022984450.10.0e+0089.56probable cytosolic oligopeptidase A [Cucurbita maxima][more]
XP_008451681.10.0e+0088.85PREDICTED: probable cytosolic oligopeptidase A [Cucumis melo][more]
Match NameE-valueIdentityDescription
Q94AM10.0e+0077.63Organellar oligopeptidase A, chloroplastic/mitochondrial OS=Arabidopsis thaliana... [more]
Q949P20.0e+0078.65Probable cytosolic oligopeptidase A OS=Arabidopsis thaliana OX=3702 GN=CYOP PE=1... [more]
P445736.6e-14840.83Oligopeptidase A OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20... [more]
P272371.9e-13938.59Oligopeptidase A OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720)... [more]
P272982.8e-13838.74Oligopeptidase A OS=Escherichia coli (strain K12) OX=83333 GN=prlC PE=3 SV=3[more]
Match NameE-valueIdentityDescription
A0A6J1CHM90.0e+0099.50probable cytosolic oligopeptidase A OS=Momordica charantia OX=3673 GN=LOC1110117... [more]
A0A6J1E4790.0e+0090.06probable cytosolic oligopeptidase A OS=Cucurbita moschata OX=3662 GN=LOC11143065... [more]
A0A6J1J8N10.0e+0089.56probable cytosolic oligopeptidase A OS=Cucurbita maxima OX=3661 GN=LOC111482754 ... [more]
A0A1S3BS190.0e+0088.85probable cytosolic oligopeptidase A OS=Cucumis melo OX=3656 GN=LOC103492908 PE=3... [more]
A0A5D3D7N40.0e+0089.53Putative cytosolic oligopeptidase A OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
Match NameE-valueIdentityDescription
AT5G65620.10.0e+0077.63Zincin-like metalloproteases family protein [more]
AT5G10540.10.0e+0078.65Zincin-like metalloproteases family protein [more]
AT5G51540.16.5e-4223.32Zincin-like metalloproteases family protein [more]
AT1G67690.15.5e-4123.69Zincin-like metalloproteases family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 243..274
NoneNo IPR availableCOILSCoilCoilcoord: 383..403
NoneNo IPR availableGENE3D1.10.1370.40coord: 688..793
e-value: 8.6E-31
score: 109.2
coord: 98..268
e-value: 5.2E-45
score: 156.0
NoneNo IPR availableGENE3D1.10.1370.40coord: 294..679
e-value: 3.2E-104
score: 351.3
NoneNo IPR availablePANTHERPTHR11804:SF73CYTOSOLIC OLIGOPEPTIDASE A-RELATEDcoord: 99..790
NoneNo IPR availableSUPERFAMILY55486Metalloproteases ("zincins"), catalytic domaincoord: 101..787
IPR001567Peptidase M3A/M3B catalytic domainPFAMPF01432Peptidase_M3coord: 327..787
e-value: 3.2E-132
score: 442.0
IPR045090Peptidase M3A/M3BPANTHERPTHR11804PROTEASE M3 THIMET OLIGOPEPTIDASE-RELATEDcoord: 99..790
IPR034005Peptidyl-dipeptidase DCPCDDcd06456M3A_DCPcoord: 120..788
e-value: 0.0
score: 930.713

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS009702.1MS009702.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004222 metalloendopeptidase activity
molecular_function GO:0008233 peptidase activity