Sgr022199 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022199
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Metallopeptidase M24 family protein isoform 1
Location: tig00153962: 122293 .. 132849 (-)
RNA-Seq Expression: Sgr022199
Synteny: Sgr022199
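The Location field packs the contig name, start, end, and strand into one string. A minimal sketch of pulling those apart is shown below; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return {"contig": contig, "start": int(start), "end": int(end), "strand": strand}

loc = parse_location("tig00153962: 122293 .. 132849 (-)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = loc["end"] - loc["start"] + 1
print(loc["contig"], loc["strand"], span)
```

Under that assumption the gene spans 10,557 positions on the minus strand of tig00153962.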
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGATCGCAAACTTTCCAAGACAATTGTCTCTCCAGAAGCTCCCAGAGCTGGGCTTCCCGACCACGTCCATCACTTTCCACCGCGCTTTGAAAGCGCCGGCGACAATTTCCTGATCGGAAGCAACCGAGTGCCAGAGACGGCAAACGCAGCGGGAACGAGCCAAGTCGGGAATCGAAAGCTTCTCGAAGATAAGGCGCAGGATGTCGCGGCACGAGAGGGCGAAGAAGTGAGAGTTCATCGGAGATAGTACGGCGGTGGAGGAATCCGCCGCCGGAAGTCGGTCGGGTGGAGATTGAACGGCCGACGAGGGATTCAGGTGCCTCATGATCTCAACGTCGTCGTCTTGGTCGCAGCAGCATCCCATTTTAGAGAGAGAGAGAGAGAGAGAGAGGGGATGAATGGTGGTTGTTTGTTGGGGATTTTATTAGGAAATTACGATAGGATGTTGGGGAAGAATTGAAAATATCTTCAAAGTTAGTGTCATAATCGTCTATTCGCCAATTGATTTTTAAGAAATTATTAATATTCAATTCATCTTTTGAAGAAATAAGTTCAAACTTTTAAAAAAAATTAAGATGCATTGTTAATTAATAATTTTTATTATTGAGTTGTTTAATTTCAATTATTTTCCAATTTTATTTATACTTTATACCTAAAATTTACAAGCCAAGAGCTAAGTGACTTAAGACATTTCTATTTTCTTTGAGAAGTTGTAAGTTCAAATTCATATCTTATATTTATATTTGTAACGTCATATTCTAAAAAATATATATAATTAAAATTCATAATAATTATTTTTTTTTATGAGAAACATAATAATTAATTTTTGTAATCATACAAATTGATAACAATGTATTTGCTTTTATAATTCACTTTTGTGTAATTGTATCTATTTTCTAACATTAGTCTTAGCTGCACGAGCTAAAGTTTGTTTGTATATTCAGAACGACTACAAATAGCTAAATCTAGGCCCCCTTCATTCTATCATACCAAAACTATAATGTCTAAATTGTAAGATTGTTGTGCAGATATGAACACTTATTCTATGTCAACCTAAACAAAAAGCGACACACTCTTATCTCGAGAAAGCAATTGATTTATCTTCCACCCATATTATTGAGAACCATACCTGCAAAACATTTATCCTAATGTATAAATAGGTTCATTATAATGTATGCTAGGGGAGAGAGAAAAGGAAGATAAAAAATAAATAATAAAAAGAGAGAAGCTATTTTGAAACATTTATCATAACATTTACCACACAAGTATGATTACAAAATTTAAAAAAAAAAAAAAAAAAAAAAAAAAGCACGTCAACAAAGTTACGTCAGCATGAAAAATGGCAAATCAACTTTAGGTTATAGTTAGCTAATTGAGATTTTGAAATTTATTGACCTTCGAGAAATTATTTATAAAATCTTAGGATCTAAAGGTATGTTTATCTGTAATTTTAAGTAAATAACCATTATTTTTAGTTATATTACGAATATTAAATTACAAATTTGATTTTTATAATTTAGACAGAGTTTTAATTTAGTCCTTATAATTTTAAAAGTTTCATTTTAGTTCTTATGATTTAGATTTAGTTTTAATTTAGTCTATATGATTTTAAAAGTTTTAATTTGATCTTTATGATTTAGTCAAACCTCATAAATTGTCTTTATAGAAACTTGGCGCATTTTTTTTAAAGTTGACGTTTGTTATTTGGTTAATTGTATTTGTTGGCTATATTAGAATTTAAGAAAATAGATGAGTTTTTCGGGTGAGTAAAAAATAGTTTAGAGACAATTTATAAGATTCAACCAAATTATATAGACTAAATTGAAACTTTTAAAATTATAAAAATTAAATTAAAACTAATATCAAATTATAGAGAGTAAATTGAAGCTTTTAAAATTACAACAATTAAATTAAAATTTTATTTATATTATAAGATCAAATTTATAATTTAACGTGTCGTCAGTCAACCCAAAATTTCTGAAGAAAAAAAAAAGA
ATACAGAGATAAAAGTGGTGGCCGGACCGGCGGGACGCAGCTGCTGTCTCCAGGGACCTTATCTTCTCTCTCAGATCATGCACTCGCTACCGTCGCAAGCAATTCGTCCTCTTTCTCCCTCTTCTTCTTCACGCTCCCGATACCTCCGTTTCATCTCTTCTGCCTTCCCAATTTCCCCCTATTTCAATTCCCAATCCACAGTTTTCGCCACCATTTCCCGGAGACTACGACGTTCCACCATCAGAAACTGCTCCTCCATCACCGCCAAGCCTTCCTCGGAGCTCAAGAGGACCCGCCCTAAGCCCGAGCCCGATGCGAAGCTTCAGGCTCTCCGGGAGTTGTTCTCCAAGCCTAGCATCGGTGTCGACGCCTATATAATCCCCTCGCAGGACGCTCACCAGGTTGGTTTCATAATTTTGTTTAGCCTGGTGCTTCACAGTCGTTAGTATATGCAGCACTTGTTCTGATAATTACTTGAAAGGTTGCCAGTTTTCTAGTGACTGGAGGATATTTTTTGGGTACGAGAGATCTGGATAATTAAGACTTTATGCAGTTGAATTCTTACCAATTTCCTCTTTTGATTCTCTTCAGAGTGAATTCATCGCAGAATGTTACATGAGGAGGGCCTATATATCTGGATTTACCGGCAGTGCCGGCACTGCTGTTGTCACAAAGGATAAAGCAGCACTTTGGACAGATGGGCGGTATTTTCTTCAGGTTGAAGAATAAAATTTGCACATTTTCTTGTTATTTTCTGAATCTTTAGGTTGATATCTGTGGGAAACAAGCCACGAGCTATGCATATGTCAAACAAACTTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGGGGTTTCATATTGTTGCTTCAAATTTTGTGTACTTGGCCATGTTTAGGTAGTAAGAATAATTTCTCAATTATTGATGATCAGGCCGAGAAGCAGCTAAGCTCCAGTTGGATTCTCATGCGAGCCGGAAATCACGGAGTGCCCACCCCTGGTGAATGGCTTGCTGATATTTTAGCTCCTGGAGGTGTTGTCGGAATAGATCCTGTGAGCAATGCTTAATCTTTTGGAATTGGCTATCTTCATATCTATTGCTCTTGGTTTCCCATAAGTGAGAAAACAGAAAAGCACCTCGCGTGGACTTGAACTGATGTTTATGTGAATTATGGGATTCTTTTTTTCACTTTCATCATTGATGTTCACATATTAAGATGTTTTCCTCGTCCTTTTTTCCAAATTTAACTAGGTTTCAAAATTTGCTCCTCTATTCTATAACCACAGAAAATTTCAGTTTTTTCTACTCCTTCCTCTATCTAATGCCACCTATTGATTTCATGAAGGGAGCAGTTTCTGTTTTCTGCCGATGCTGCAGAAGATTTGAAAGAGATCATCTCTAGGAAGAATCATAAGTTGGTTTACCTATATGATTACAATCTCGTGGATGAGATATGGAAAGAATCCAGACCAAAGCCACCTAAGGGCCCTATGAGAGTGCATGATCTTAGATATGCTGGATTAGATGTTGCATCAAAGTTGACTTCTTTGAGGTCTGAGCTTGGAGAATCTGGTTCATCTGCAATCATTATATCTATGCTCGATGAAATTGCCTGGCTGTTGAACTTGGTAAAGTTCTATCTGTTCTTAGCCTTTTTATTTGATGGATTTTTTTAATTGAATATGAGAATTTTAGGGTTTATCAAAATTTTAGCTGTCATATTTTGATTTACAATTTTGAAACAAACTACAACGTTATCCTTATTGTTTTCATTACTTTTCTTTGAACTTCTAATCTTTAAGCTGGAACTAGAAAAACATTCTCCATTTTTGCTGTTGCGACTTATAATGAATGGTATTCAGTAGAACATTCTATATATAATTTTTCTATTCCTTTGCAGAGAGGAAATGATGTTCCAAACTCACCTGTTATGTATGCATACTTAATAGTTGAAATTGATGGAGCAAAACTGTTTGTAGATAATTCTAAAGTCACATCAGAG
GTGATGGATCACTTGAAAAGTGCAGGAATCGAGTTAAGACCATATGATTCTATTATTTCTGAAATTGAAAAGTAAGTGAAGTTGATTGAAACTTTTTCCTCCAAGTATTTTTTAGGATTGGGATTGTTTTCTATATTATTTTTTAAAATTTGAGCTCTCCGATACTTTTCCATATGGAAGACACATCAGCATAAAATTAGACATGGACCAATCTTAGCTTACATTTGAATGTGAAAGATCTGGAACTTGCATGAGGAAGTTTCTTTATCATGGTATGCTTAGGGGGTTATGGTAAAAGAATACAAAACGAAGTATGATATGCTTTCCGTGTTTTGCATTTTGAATGAAGATACCCCTTTGTGCCTCAGTTATCAAGTGCTATTTTTAATGTTGCAGTGATCAGAATGCTTCCTAGTGCTCATGTATATGGTCATACCTTCATTTCAATGATGGCTGCATTCTTTTTCAGTTTGGCAGAAAAGGGGGCTAACCTTTGGCTGGACACATCATCAGTTAATGCTGCAATTGCAAATGCTTATAGGACTGCATGTGATAGATACTTTATACGCCTTGGGAATAAAAGAAAAGGCAAGGGTAAGACTTTTGAGAGCTCAAATAGTCAGGTCGGACCTACTGGAGTCTATAAGGTATCTCCAGTTTCAATGGCAAAGGCCATAAAAAATCATGCTGAGTTAGAAGGGATGCGGAATTCTCATTTGAGGTAATACGATACTGTTTTATATCATCAACATTTTGCTTCGCTATTAGATTGTGTAATTCTGTAGTAAATATGCCATGCCACTTGAAATAGCTTTTGATTTGGTAACCGATGACCAGCTACAGCCTATAACTTATCCATTTTGGATATGCGTTACTTGCTAATAATTCTTGTTTTTCAGTACCAGATATCTGCTCTTAATATACTTAAACTAGTATTTTGTTTAGTGAATATGTGGTACGATATTTTCCATGTAGAGATGCAGCTGCTCTTGTTCAATTCTGGTTCTGGTTGGAGGAGGAAATTCTTAATGGTGTCAAACTAACGGAGGTAGAAGTTGCAGACAAACTTCTGGAATTTCGTAAGAGGCAAGATGGTTTTGTTGACACAAGTTTCGATACCATTAGTGGTACATAAAATGCCTTATGCAATTCTTATTTTTACAAATCCATTATTTAAATAAATGTTCCTATTCTCAGTTCTTCAGCTTCATTACCTGTTATTGGAATGATTTTTCAGCCTCTGGTGCAAATGGTGCAATCATACACTATAAACCAGAACCTAGTGATTGTTCTGTTGTGGATCCAAATAAACTCTTTCTATTGGACAGTGGAGCGCAATATGTAGACGGAACAACCGATATAACTCGTACAGTACATTTTGGTGAACCAACCACACATCAAAAAGAGTGCTTTACGAGAGTCCTACAAGTATGAAATCTCAATTTGCCTCATTTCTGTTATATGTACCAAATTGCATCTGGACCTAATTGAGTTCCAAGGAGGACTTATAATTATTGCTATTTATTTTCTGAAAGATCTCTGTTGGATGTGGAAAAACATTTATTCATTTACTTCGGTTTCCCCAAAGATGTTTTTCATATCTGCAGTTCAAAACCTATTAAGTTATAACTATTATCTTGGATCCTGTACTGCTTACCTCCCATCCTACATCTTGCTTCATGAAGTTTGTGATGTCAGTGGTATTTCTTGCTTCCTTCCAAAATAGTTTACAAATGAAAAAGAGAAACTACAACAGATGAAGTATCTATATGTTTACGGTCATGCTGTACTTCATAGTTGTATCTCTGTTCATTATGGAGGGGATGTGATTGTTTCCATGATTTATCTTTTCTACCATTGTGGAATAAAATATAAATATCAACTCATTTAGTGGAATCATAGAGAAAAGCCTTTTTCTGATGATGTAATTCACTGTATATTCTTTCTAAACAACCTTGTGCTAAAGATGTCTGCACTGCAATATTAGGCAGGGATAAAAA
TGTTGGTCGTCCATGTATATAGTTTGTAATTCTTTTTTCTTTGCATTCATTGGTATCTATAATTATTTGTTGCTGCCCAATTAGAATTGGAGAAGCTGAATGTATATCCCCTAAAATATCAAACAATTGCAGGGCCATATAGCTTTAGATCAAGCAGTGTTCCCTCAGAATACCCCTGGTTTCGTATTAGATGCATTTGCTCGTTCTTCTCTCTGGAAGATTGGGCTTGATTATCGGCATGGTATTTCTTCTTTCTGTTGACTAATGATCTTCTGGAAAAACACATATAATTTCACTTGATTAGAAATATACTGATTAGTTGTTTTTTTTTCTTCTCCTTTGTCTTTCAGGGACTGGGCATGGTGTAGGGGCTGCACTAAATGTTCATGAGGGGCCCCAAAGTATAAGCTTCCGATTTGGGAATATGACTGGCTTACAAAATGGCATGATCGTTAGTAATGAGCCAGGCTATTATGAAGACCACTCTTTTGGTATTAGGATTGAGGTAAAGCAAATAAAATGAGAATGTATAGGTGTGTACACTAGTTAATTGGAATTGATGCATAGACTAAATAAAGGACGAGTTGGATCATTGATATTTTTTTTTAGGATTTAGGGTTTAACACATAAAAATTTTCGAACTCTATATTTTAAATGTGAAATGGCCAAAAATTTTCTTCATAGAGTCCAAAGTCATTTGGTAATGGCCTTGCAGCTTCAACTTCTTGTTTTGTTAGACCTATTTGTGCTGTTTTTACTCTCCTAGTAGAGAAACAGAATGTTCAAGTTTACTCTCCCTATCTCATTTTACACAGATGTAAATATTATTATGTCTCTTGTAGCTTTCTGCTTTGTTTTTGCTCATTGTTCTAGCACTGATTTTGGTGGTCTTCTCGTGCAGAATCTCCTTATTGTGAGGGAGGTCGACACTCCAAACCGTTTTGGAGGCATTGGATATTTAGGATTTGAAAAACTCACGTTTGTACCCATGCAGGTAAACAAGTTATTCATAAATTATGAAAATTATACAGATGAGAAATAGAAGAGAAAAACATTTAGGAGAATATGATCATGCTGTTCTAGAGATCAACATACTAGCATCTATGCTTGGCCTGCAAGATAGAAAGCATTTAAGGGGAAATAATAACACTTTCATTGTGTAGCGTAAATGTAAGAACATGGACTGTCTGAGCTAGCCCATATGGATTTATGCTATTGTTTTGCCTCCAAGTAGGACGTAGCAGTTAGCACACATAGTGTATTCCCCCCATATTTAATGCAGGTTGCTTTAAGGCTTCAAAAGTAATTACTGGACTTAGCAAAAAAGAAAAAGACGTAATTTTCAGAGAAAGTTTCTGATGAGACGAATAGCCATGGGCTATATGTATGATAAACAATATCACAAGCAGAGCAAAGATGTTCTCCTTTATTTATTCCGAACTCTTCAGACTTCTTTTTGTTGAAGTACTAACTGGGGAAGTTAAACTTTTTGACCAGTTAAGCGCCTCCGCTTCATTCATAATAGATATCAAAAAAATTTGTTGTTTCTTGTCTCTGGTATTTAGCACTAATTATCCTTCTTATAGCAACATAAAAGTTCGTGTATTACATCCAACCCTTCATAACCTATTGCAAAATTTTGATCCTGTCAATTAGTAACAGGGAAGTCAGATTAGTTTATTAAGCTAGCATCAAATATAGAGGAAGCCTCATTATGTTCTTATTATTTTGAGTCATATGATTTATTGAAATAGTTTTGTTTCTAACGAAAAAACGCTTGCATCAATGTAATTTATAATTATTGAAGCCATTTGTGTGGGCTGCCTCTTGTCTGTTGATTGGAACTTTTTACTTTGAAATATGTTCACTTCAGAGAGAAATCCACGTCCTGCGACTATTTTTCTATTACGCCCTGTTTTGAACACTTTTTGCTGGTTTTCTTATGCAGACTAAACTGGTTGATTTCTCTTTGCTGTCTGCTGCGGAGGTCAATTGGCTTA
ATGATTACCATTCACAAGTCTGGGAAAAGGTTTGCCCTTCATACTACAATAGTTATCCGGTTCTAATTTTGCATCACTACGGAGCTTGACAAAACAAGTTTTCTGCTTTAATTTTTAAAACAAAATAAATGTTTTTAAAAACCATTGATTTGAGTAATATGTAATAATGTACAGATTAAGGTTATTGACACTTGATATCAATGTTTACCGACTTATAATTTCTTATCGGAAGAATATTTGAAGTTTGTTCGGGTTGCAACTAATGTTCCACGTTAAATTAGGGGTAGGAAAGATCACAGGTTTATAATGAAGGACCATATTTCTATTAGTACGAGGCATTTTGAATAGAATCTAAAAGTAAAATCATGCACATTTAGGTTTAAAGTGGACGATATCAATGTAGAGAGAAAAAGAGGCTCTGTTTTTCTAACAAACTTGTCTGAAAAAAGGTTATTTGTGAGAAGGAGAAAACAACTTTTACTAAATAATCATCCAATTTTTATTTTGAAAAACATAAAACTATTTCTGGAAACAGATTTCAAGTAGTCTCTTGATCCTTTAATCCTGGCCTTGGGGGCTGAGTTTGGAGTCACTATGGAGGAGTAGAAGACGGATTTGGGGCTTTATGAAAGCAGGAAGGAAAAGTTTGAGAGAGAGGACAAGAAAAAGGCAAGAAGGAATGGGTTGGCCCATTGGCCATGTGCTTAAAAAGAATGGGCTCAAACCATTGTAACCACCCACCTAAGATTTAATATCCTATGAGTTTTCTTGACAATCAAATATAGTAGGGTCAAGTAGTTGTCTCGTAAGATTAGCCAAGCCGTGAAAGTTGATATGGGGTTGGCAACTTCTACGTATTATGATATGTTGTTTGTGAATGTCTTTACTTCTCCAGTTTATTTGACTCATGGTTGTACAGGTTTCTCCATTGCTTGAAGGCTCTGCTCGCCAATGGCTATGGAATAACACTCGTCCGTTAGCAAAATCCTGATTTTCGTCCTTTTTCCTGTTGTTATTTGACATACAGTTGCAATTTGTGCCTATTGAATCAAATGTAGATAAAATATCAAACTGATCCTCCATTGTACTCTACCATGAATATGATGCTGAAATTGGGTTTGAACTTCACTTGGAATTTTACTGCTATAAGAAATGGGTTGGATTTGTATTCAAAAGGTCTGAGTTCTCTCGCTTCCTTTGTGAAGGATTTTTGTGGTTGTCAACTGTTCTTCCTCTTTCTTTCTTCTTTTTTTTTTTTCAAGTACAACAAATGTGGGGGGCAGGAATTAAACTTTCAACCGAAAGTATAGGTGCCTTAATTGCTAAACTATGATCAAGGGCTTGGTAATTGGTTTTTGAGTTTTTTTTTCAGATTTTTTTATAAATATAATGTCATTTTAAAATATAATTTCAAAAATAGAACTAATGACAAAAATATGGTAGTTTTGACTTAAAATCGTTGGTAAGATGTCCTTATTATAAACAGTTAGTATACTCCCTTGTCACTTATTCCAAATTATTACTACTTACACTACTTAGTTCAACGAATGTATCACTTCAATATTTACCCTGTAAAATAAAATGATTTGTGATCAAATGTGATTTCAAATTTTATTTTTACTTAAAAAATTTTCATACTAAAATGGTTGGATTTTGCAATATTTCATATTTTAGTGTTTTTATTTATAAATTAGAATTGAAGGGCCATATTAATAGTGTCAAATTAGCTCAATTAGTTAAGCACCCAGTAACCTCTCAAAAATAAGGGGTTAAAATCTTCAACCCCACCTCTGGTTGAACTAAAAAAAAGGCTATATTAATAAAAAAAATTTGTTTTCTTTCTTTTCCTCCATCGAATATCTAAACAAAAATATATTAATGACGCTTAATTCAAATTTTTTTCATTCCAAATAAAATAAAAAAGATTTTTTTTAAAAAATATTATATTATAAATACGAGTGAAAATTTGAACTTACGACTTTTTAGAGAAAGTAGAAATA
CTTTAACCTATGTTCATGTTGGAAAAAAATTTGTGATTTAAAATTATTTATTATGCTTAATGCCAAATGGAGGGCTAAGTAATGGTGTTACATATAATGTCTAACTAAAATAAAAATTTAGAGATGTAATTAAAATTTTTAAAATTGAAAACTTGTGACTAAATAAATATGTAGGGATTATATTTTGTAACTTAACCGTCATCCCGGCTATAGGAATTAACCGTACTTAATCCCTCCATCCAATCGGAGCCAGTTCGACGGCAGTCGCTCTTTCTCAACCAAAACAGGTCGCTGTCTCCGAGTCTCATTTCATGGATGAAACTCAAATGGGTATTCCAAAAACTGACCTCCCGACTGCCCTCTTGGGCTTCCTCTCCAATCTCCCCCTTCAGAAACCGATTCCATCAAAACCCATTTGCAGAAACCTCCTCAAGATTCGTTCTCAACCATGTAGACGTAAGCTTCCTTCTATCCATTTGTGGAAGGGAGGGGCACCTCCATCTGGGCTCTTCCCTCCATGCCTCCATCGTCAAGAGCTTCGAGCTCTGCAACCATGA

mRNA sequence

ATGGGAGATCGCAAACTTTCCAAGACAATTGTCTCTCCAGAAGCTCCCAGAGCTGGGCTTCCCGACCACGTCCATCACTTTCCACCGCGCTTTGAAAGCGCCGGCGACAATTTCCTGATCGGAAGCAACCGAGTGCCAGAGACGGCAAACGCAGCGGGAACGAGCCAAGTCGGGAATCGAAAGCTTCTCGAAGATAAGGCGCAGGATGTCGCGGCACGAGAGGGCGAAGAAGTGAGAGTTCATCGGAGATATCAACCCAAAATTTCTGAAGAAAAAAAAAAGAATACAGAGATAAAAGTGGTGGCCGGACCGGCGGGACGCAGCTGCTGTCTCCAGGGACCTTATCTTCTCTCTCAGATCATGCACTCGCTACCGTCGCAAGCAATTCGTCCTCTTTCTCCCTCTTCTTCTTCACGCTCCCGATACCTCCGTTTCATCTCTTCTGCCTTCCCAATTTCCCCCTATTTCAATTCCCAATCCACAGTTTTCGCCACCATTTCCCGGAGACTACGACGTTCCACCATCAGAAACTGCTCCTCCATCACCGCCAAGCCTTCCTCGGAGCTCAAGAGGACCCGCCCTAAGCCCGAGCCCGATGCGAAGCTTCAGGCTCTCCGGGAGTTGTTCTCCAAGCCTAGCATCGGTGTCGACGCCTATATAATCCCCTCGCAGGACGCTCACCAGAGTGAATTCATCGCAGAATGTTACATGAGGAGGGCCTATATATCTGGATTTACCGGCAGTGCCGGCACTGCTGTTGTCACAAAGGATAAAGCAGCACTTTGGACAGATGGGCGGTATTTTCTTCAGGCCGAGAAGCAGCTAAGCTCCAGTTGGATTCTCATGCGAGCCGGAAATCACGGAGTGCCCACCCCTGGTGAATGGCTTGCTGATATTTTAGCTCCTGGAGGTGTTGTCGGAATAGATCCTTTTCTGTTTTCTGCCGATGCTGCAGAAGATTTGAAAGAGATCATCTCTAGGAAGAATCATAAGTTGGTTTACCTATATGATTACAATCTCGTGGATGAGATATGGAAAGAATCCAGACCAAAGCCACCTAAGGGCCCTATGAGAGTGCATGATCTTAGATATGCTGGATTAGATGTTGCATCAAAGTTGACTTCTTTGAGGTCTGAGCTTGGAGAATCTGGTTCATCTGCAATCATTATATCTATGCTCGATGAAATTGCCTGGCTGTTGAACTTGAGAGGAAATGATGTTCCAAACTCACCTGTTATGTATGCATACTTAATAGTTGAAATTGATGGAGCAAAACTGTTTGTAGATAATTCTAAAGTCACATCAGAGGTGATGGATCACTTGAAAAGTGCAGGAATCGAGTTAAGACCATATGATTCTATTATTTCTGAAATTGAAAATTTGGCAGAAAAGGGGGCTAACCTTTGGCTGGACACATCATCAGTTAATGCTGCAATTGCAAATGCTTATAGGACTGCATGTGATAGATACTTTATACGCCTTGGGAATAAAAGAAAAGGCAAGGGTAAGACTTTTGAGAGCTCAAATAGTCAGGTCGGACCTACTGGAGTCTATAAGGTATCTCCAGTTTCAATGGCAAAGGCCATAAAAAATCATGCTGAGTTAGAAGGGATGCGGAATTCTCATTTGAGAGATGCAGCTGCTCTTGTTCAATTCTGGTTCTGGTTGGAGGAGGAAATTCTTAATGGTGTCAAACTAACGGAGGTAGAAGTTGCAGACAAACTTCTGGAATTTCGTAAGAGGCAAGATGGTTTTGTTGACACAAGTTTCGATACCATTAGTGCCTCTGGTGCAAATGGTGCAATCATACACTATAAACCAGAACCTAGTGATTGTTCTGTTGTGGATCCAAATAAACTCTTTCTATTGGACAGTGGAGCGCAATATGTAGACGGAACAACCGATATAACTCGTACAGTACATTTTGGTGAACCAACCACACATCAAAAAGAGTGCTTTACGAGAGTCCTACAAGGCCATATAGCTTTAGATCAAGCAGT
GTTCCCTCAGAATACCCCTGGTTTCGTATTAGATGCATTTGCTCGTTCTTCTCTCTGGAAGATTGGGCTTGATTATCGGCATGGGACTGGGCATGGTGTAGGGGCTGCACTAAATGTTCATGAGGGGCCCCAAAGTATAAGCTTCCGATTTGGGAATATGACTGGCTTACAAAATGGCATGATCGTTAGTAATGAGCCAGGCTATTATGAAGACCACTCTTTTGGTATTAGGATTGAGAATCTCCTTATTGTGAGGGAGGTCGACACTCCAAACCGTTTTGGAGGCATTGGATATTTAGGATTTGAAAAACTCACGTTTGTACCCATGCAGACTAAACTGGTTGATTTCTCTTTGCTGTCTGCTGCGGAGGTCAATTGGCTTAATGATTACCATTCACAAGTCTGGGAAAAGGTCGCTGTCTCCGAGTCTCATTTCATGGATGAAACTCAAATGGGTATTCCAAAAACTGACCTCCCGACTGCCCTCTTGGGCTTCCTCTCCAATCTCCCCCTTCAGAAACCGATTCCATCAAAACCCATTTGCAGAAACCTCCTCAAGATTCGTTCTCAACCATGTAGACGTAAGCTTCCTTCTATCCATTTGTGGAAGGGAGGGGCACCTCCATCTGGGCTCTTCCCTCCATGCCTCCATCGTCAAGAGCTTCGAGCTCTGCAACCATGA

Coding sequence (CDS)

ATGGGAGATCGCAAACTTTCCAAGACAATTGTCTCTCCAGAAGCTCCCAGAGCTGGGCTTCCCGACCACGTCCATCACTTTCCACCGCGCTTTGAAAGCGCCGGCGACAATTTCCTGATCGGAAGCAACCGAGTGCCAGAGACGGCAAACGCAGCGGGAACGAGCCAAGTCGGGAATCGAAAGCTTCTCGAAGATAAGGCGCAGGATGTCGCGGCACGAGAGGGCGAAGAAGTGAGAGTTCATCGGAGATATCAACCCAAAATTTCTGAAGAAAAAAAAAAGAATACAGAGATAAAAGTGGTGGCCGGACCGGCGGGACGCAGCTGCTGTCTCCAGGGACCTTATCTTCTCTCTCAGATCATGCACTCGCTACCGTCGCAAGCAATTCGTCCTCTTTCTCCCTCTTCTTCTTCACGCTCCCGATACCTCCGTTTCATCTCTTCTGCCTTCCCAATTTCCCCCTATTTCAATTCCCAATCCACAGTTTTCGCCACCATTTCCCGGAGACTACGACGTTCCACCATCAGAAACTGCTCCTCCATCACCGCCAAGCCTTCCTCGGAGCTCAAGAGGACCCGCCCTAAGCCCGAGCCCGATGCGAAGCTTCAGGCTCTCCGGGAGTTGTTCTCCAAGCCTAGCATCGGTGTCGACGCCTATATAATCCCCTCGCAGGACGCTCACCAGAGTGAATTCATCGCAGAATGTTACATGAGGAGGGCCTATATATCTGGATTTACCGGCAGTGCCGGCACTGCTGTTGTCACAAAGGATAAAGCAGCACTTTGGACAGATGGGCGGTATTTTCTTCAGGCCGAGAAGCAGCTAAGCTCCAGTTGGATTCTCATGCGAGCCGGAAATCACGGAGTGCCCACCCCTGGTGAATGGCTTGCTGATATTTTAGCTCCTGGAGGTGTTGTCGGAATAGATCCTTTTCTGTTTTCTGCCGATGCTGCAGAAGATTTGAAAGAGATCATCTCTAGGAAGAATCATAAGTTGGTTTACCTATATGATTACAATCTCGTGGATGAGATATGGAAAGAATCCAGACCAAAGCCACCTAAGGGCCCTATGAGAGTGCATGATCTTAGATATGCTGGATTAGATGTTGCATCAAAGTTGACTTCTTTGAGGTCTGAGCTTGGAGAATCTGGTTCATCTGCAATCATTATATCTATGCTCGATGAAATTGCCTGGCTGTTGAACTTGAGAGGAAATGATGTTCCAAACTCACCTGTTATGTATGCATACTTAATAGTTGAAATTGATGGAGCAAAACTGTTTGTAGATAATTCTAAAGTCACATCAGAGGTGATGGATCACTTGAAAAGTGCAGGAATCGAGTTAAGACCATATGATTCTATTATTTCTGAAATTGAAAATTTGGCAGAAAAGGGGGCTAACCTTTGGCTGGACACATCATCAGTTAATGCTGCAATTGCAAATGCTTATAGGACTGCATGTGATAGATACTTTATACGCCTTGGGAATAAAAGAAAAGGCAAGGGTAAGACTTTTGAGAGCTCAAATAGTCAGGTCGGACCTACTGGAGTCTATAAGGTATCTCCAGTTTCAATGGCAAAGGCCATAAAAAATCATGCTGAGTTAGAAGGGATGCGGAATTCTCATTTGAGAGATGCAGCTGCTCTTGTTCAATTCTGGTTCTGGTTGGAGGAGGAAATTCTTAATGGTGTCAAACTAACGGAGGTAGAAGTTGCAGACAAACTTCTGGAATTTCGTAAGAGGCAAGATGGTTTTGTTGACACAAGTTTCGATACCATTAGTGCCTCTGGTGCAAATGGTGCAATCATACACTATAAACCAGAACCTAGTGATTGTTCTGTTGTGGATCCAAATAAACTCTTTCTATTGGACAGTGGAGCGCAATATGTAGACGGAACAACCGATATAACTCGTACAGTACATTTTGGTGAACCAACCACACATCAAAAAGAGTGCTTTACGAGAGTCCTACAAGGCCATATAGCTTTAGATCAAGCAGT
GTTCCCTCAGAATACCCCTGGTTTCGTATTAGATGCATTTGCTCGTTCTTCTCTCTGGAAGATTGGGCTTGATTATCGGCATGGGACTGGGCATGGTGTAGGGGCTGCACTAAATGTTCATGAGGGGCCCCAAAGTATAAGCTTCCGATTTGGGAATATGACTGGCTTACAAAATGGCATGATCGTTAGTAATGAGCCAGGCTATTATGAAGACCACTCTTTTGGTATTAGGATTGAGAATCTCCTTATTGTGAGGGAGGTCGACACTCCAAACCGTTTTGGAGGCATTGGATATTTAGGATTTGAAAAACTCACGTTTGTACCCATGCAGACTAAACTGGTTGATTTCTCTTTGCTGTCTGCTGCGGAGGTCAATTGGCTTAATGATTACCATTCACAAGTCTGGGAAAAGGTCGCTGTCTCCGAGTCTCATTTCATGGATGAAACTCAAATGGGTATTCCAAAAACTGACCTCCCGACTGCCCTCTTGGGCTTCCTCTCCAATCTCCCCCTTCAGAAACCGATTCCATCAAAACCCATTTGCAGAAACCTCCTCAAGATTCGTTCTCAACCATGTAGACGTAAGCTTCCTTCTATCCATTTGTGGAAGGGAGGGGCACCTCCATCTGGGCTCTTCCCTCCATGCCTCCATCGTCAAGAGCTTCGAGCTCTGCAACCATGA

Protein sequence

MGDRKLSKTIVSPEAPRAGLPDHVHHFPPRFESAGDNFLIGSNRVPETANAAGTSQVGNRKLLEDKAQDVAAREGEEVRVHRRYQPKISEEKKKNTEIKVVAGPAGRSCCLQGPYLLSQIMHSLPSQAIRPLSPSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIRNCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQTKLVDFSLLSAAEVNWLNDYHSQVWEKVAVSESHFMDETQMGIPKTDLPTALLGFLSNLPLQKPIPSKPICRNLLKIRSQPCRRKLPSIHLWKGGAPPSGLFPPCLHRQELRALQP
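As a quick consistency check between the coding sequence and the protein sequence, the first codons of the CDS can be translated by hand. The sketch below assumes the standard genetic code and uses a hypothetical `translate` helper; it only covers the first 30 nucleotides of the CDS listed above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides of the CDS shown on this page.
cds_prefix = "ATGGGAGATCGCAAACTTTCCAAGACAATT"
peptide = translate(cds_prefix)
print(peptide)
```

The result can be compared against the first ten residues of the protein sequence above (MGDRKLSKTI).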
Homology
BLAST of Sgr022199 vs. NCBI nr
Match: XP_038877034.1 (aminopeptidase P2 [Benincasa hispida])

HSP 1 Score: 1278.5 bits (3307), Expect = 0.0e+00
Identity = 627/691 (90.74%), Positives = 658/691 (95.22%), Query Frame = 0

Query: 121 MHSLPSQAIRPLSPS-----SSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTI 180
           MHSLPSQAIRPLS S     SSS S YLRFISS FP+SPYFN QS VF  ISRRLRRSTI
Sbjct: 1   MHSLPSQAIRPLSLSSSSSFSSSSSLYLRFISSTFPVSPYFNPQSPVFTAISRRLRRSTI 60

Query: 181 RNCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAEC 240
           R+CSSITAKPSSE++RTRPK EPD+KL+ALR+LFSKP+IG+DAY+IPSQDAHQSEFIAEC
Sbjct: 61  RSCSSITAKPSSEIRRTRPKDEPDSKLRALRDLFSKPNIGIDAYVIPSQDAHQSEFIAEC 120

Query: 241 YMRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEW 300
           YMRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQL+SSW LMRAGNHGVPTP EW
Sbjct: 121 YMRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLNSSWTLMRAGNHGVPTPSEW 180

Query: 301 LADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKG 360
           +ADILAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVDEIWKESRP PPKG
Sbjct: 181 VADILAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDEIWKESRPNPPKG 240

Query: 361 PMRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYA 420
           P+RVHDLRY GLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRGNDVPNSPVMYA
Sbjct: 241 PIRVHDLRYGGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYA 300

Query: 421 YLIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSV 480
           YL+VEIDGAKLFVDN KV  EVMDHLK+AGIELRPYDSIISEIENLA+KGANLWLDTSS+
Sbjct: 301 YLLVEIDGAKLFVDNCKVAPEVMDHLKTAGIELRPYDSIISEIENLADKGANLWLDTSSI 360

Query: 481 NAAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAEL 540
           NAAIANAYR+ACD+YFIRLGNKRKGKGKT E+SNSQVGPTGVYK SP+SMAKAIKNHAEL
Sbjct: 361 NAAIANAYRSACDKYFIRLGNKRKGKGKTSETSNSQVGPTGVYKSSPISMAKAIKNHAEL 420

Query: 541 EGMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISA 600
           EGMRNSHLRDAAAL QFW WLE+EILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISA
Sbjct: 421 EGMRNSHLRDAAALAQFWLWLEQEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISA 480

Query: 601 SGANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTR 660
           SGANGAIIHYKPEP DCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTR
Sbjct: 481 SGANGAIIHYKPEPGDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTR 540

Query: 661 VLQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISF 720
           VLQGHIALDQAVFPQ+TPGFVLDAFARSSLWK+GLDYRHGTGHGVGAALNVHEGPQSISF
Sbjct: 541 VLQGHIALDQAVFPQHTPGFVLDAFARSSLWKVGLDYRHGTGHGVGAALNVHEGPQSISF 600

Query: 721 RFGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVP 780
           RFGNMTGLQNGMIVSNEPGYYED+SFGIRIENLLIVR+ DTPN FGGIGYLGFEKLTFVP
Sbjct: 601 RFGNMTGLQNGMIVSNEPGYYEDYSFGIRIENLLIVRDADTPNHFGGIGYLGFEKLTFVP 660

Query: 781 MQTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           +QTKLVD +LLS AEVNWLNDYHSQVWEKV+
Sbjct: 661 IQTKLVDITLLSVAEVNWLNDYHSQVWEKVS 691
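The percentages in an HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic, using the counts reported for the first two hits above, follows; the function name `percent` is an illustrative choice, and rounding to two decimals is assumed to mirror how the page displays the values.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP headers of the first two NCBI nr hits on this page.
hits = {
    "XP_038877034.1": {"identical": 627, "positive": 658, "length": 691},
    "XP_022150774.1": {"identical": 634, "positive": 658, "length": 690},
}

summary = {
    name: (percent(h["identical"], h["length"]), percent(h["positive"], h["length"]))
    for name, h in hits.items()
}

for name, (identity, positives) in summary.items():
    print(f"{name}: identity {identity}%, positives {positives}%")
```

"Positives" counts identical residues plus conservative substitutions (the `+` marks in the alignment midline), so it is never lower than the identity.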

BLAST of Sgr022199 vs. NCBI nr
Match: XP_022150774.1 (probable Xaa-Pro aminopeptidase P [Momordica charantia])

HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 634/690 (91.88%), Positives = 658/690 (95.36%), Query Frame = 0

Query: 121 MHSLPSQAIRPL----SPSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPL    S SSSSR RYLRFISS FPISP+FNSQSTVFA ISRRLRRS IR
Sbjct: 1   MHSLPSQAIRPLSLSSSSSSSSRFRYLRFISSTFPISPFFNSQSTVFAAISRRLRRSPIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CSSITAKPSSELK+  PK E D KL  LR+LFSKPSIG+DAY+IPSQDAHQSEFIAECY
Sbjct: 61  SCSSITAKPSSELKKVYPKSETDGKLLLLRDLFSKPSIGIDAYVIPSQDAHQSEFIAECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSWILMR+GN  VPTPGEWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDEAALWTDGRYFLQAEKQLSSSWILMRSGNQEVPTPGEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVY+YDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYIYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDLRYAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY
Sbjct: 241 IRVHDLRYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVEIDGAKLFVD+SKVT EVMDHLKSAGIELRPYDSIISEIE LAEKGANLWLDTSSVN
Sbjct: 301 LIVEIDGAKLFVDDSKVTPEVMDHLKSAGIELRPYDSIISEIEKLAEKGANLWLDTSSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYRTA DRY+IRLGNKRKGKGKT+E+SNSQVGPTGVYK+SP+S+AKAIKNHAELE
Sbjct: 361 AAIANAYRTAGDRYYIRLGNKRKGKGKTYETSNSQVGPTGVYKLSPISIAKAIKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMR+SHLRD AAL QFWFWLEE+ILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRSSHLRDGAALAQFWFWLEEKILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEP DCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV
Sbjct: 481 GANGAIIHYKPEPDDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ GMIVSNEPGYYEDHSFGIRIENLLIV+EVDTPNRFGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQTGMIVSNEPGYYEDHSFGIRIENLLIVKEVDTPNRFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           Q KLVD SLLSAAEVNWLNDYHS VWEKV+
Sbjct: 661 QAKLVDVSLLSAAEVNWLNDYHSLVWEKVS 690

BLAST of Sgr022199 vs. NCBI nr
Match: XP_022965604.1 (probable Xaa-Pro aminopeptidase P [Cucurbita maxima])

HSP 1 Score: 1256.5 bits (3250), Expect = 0.0e+00
Identity = 620/690 (89.86%), Positives = 652/690 (94.49%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS----PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPLS     SSSS S YLRFISS FPISPYFN QS VFA ISRRLRRSTIR
Sbjct: 1   MHSLPSQAIRPLSFSSFSSSSSSSLYLRFISSTFPISPYFNPQSPVFAAISRRLRRSTIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CS ITAKPSS+L+ TR K E D+KLQALR+LFSKP I +DAY+IPSQDAHQSEFI ECY
Sbjct: 61  SCSFITAKPSSDLRNTRSKDESDSKLQALRKLFSKPGIDIDAYVIPSQDAHQSEFIGECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSW+LMRAGNHGVPTP EWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDQAALWTDGRYFLQAEKQLSSSWVLMRAGNHGVPTPSEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDL+YAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRG+DVPNSPVMYAY
Sbjct: 241 IRVHDLKYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGSDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVE+DGAKLFVD SKV+SEVMDHLKSAG+ELRPYDSIISEIENLAEKGANLWLD  SVN
Sbjct: 301 LIVEMDGAKLFVDTSKVSSEVMDHLKSAGVELRPYDSIISEIENLAEKGANLWLDPFSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYR+ACD+YFIRLGNK+KGKGKT E+SNS+VGPTGVYK SPVS+AKA+KNHAELE
Sbjct: 361 AAIANAYRSACDKYFIRLGNKKKGKGKTSETSNSEVGPTGVYKSSPVSIAKAVKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMRNSHLRDAAAL QFW W EEEILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRNSHLRDAAALAQFWSWFEEEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEPSDCS VD NKLFLLDSGAQYVDGTTDITRTVHFGEPTT+QKECFTRV
Sbjct: 481 GANGAIIHYKPEPSDCSAVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTYQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQHTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ+GMIVSNEPGYYEDHSFGIRIENLL+VR+  TPN FGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQDGMIVSNEPGYYEDHSFGIRIENLLVVRDARTPNCFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           QTK+VD SLLS AEVNWLNDYHSQVWEKV+
Sbjct: 661 QTKMVDISLLSVAEVNWLNDYHSQVWEKVS 690

BLAST of Sgr022199 vs. NCBI nr
Match: KAG7021104.1 (AMPP protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1256.5 bits (3250), Expect = 0.0e+00
Identity = 623/690 (90.29%), Positives = 650/690 (94.20%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS----PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPLS     SSSS S YLRFISS FPISPYFN QS VFA ISRRLRRSTIR
Sbjct: 1   MHSLPSQAIRPLSFSSFSSSSSSSLYLRFISSTFPISPYFNPQSPVFAAISRRLRRSTIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CS ITAKPSS+L+ TR K E D+KLQALR+LFSKP I +DAY+IPSQDAHQSEFI ECY
Sbjct: 61  SCSFITAKPSSDLRNTRSKDESDSKLQALRKLFSKPGIDIDAYVIPSQDAHQSEFIGECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTP EWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDQAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPSEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDL+YAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRG+DVPNSPVMYAY
Sbjct: 241 IRVHDLKYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGSDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVEIDGAKLFVD SKV+SEVMDHLKSAG+ELRPYDSIISEIENLAEKGANLWLD  SVN
Sbjct: 301 LIVEIDGAKLFVDTSKVSSEVMDHLKSAGVELRPYDSIISEIENLAEKGANLWLDPVSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYR ACD+YFIRLGNKRK K KT E+SNS VGPTGVYK SPVS+AKA+KNHAELE
Sbjct: 361 AAIANAYRNACDKYFIRLGNKRKSKDKTSETSNSHVGPTGVYKSSPVSIAKAVKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMRNSHLRDAAAL QFW WLEEEILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRNSHLRDAAALAQFWSWLEEEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEPSDCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTT+QKECFTRV
Sbjct: 481 GANGAIIHYKPEPSDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTYQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQHTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ+GMIVSNEPGYYEDHSFGIRIENLL+VR+  TPN FGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQDGMIVSNEPGYYEDHSFGIRIENLLVVRDAKTPNCFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           QTK+VD SLLS AEVNWLNDYHSQVWEKV+
Sbjct: 661 QTKMVDISLLSVAEVNWLNDYHSQVWEKVS 690

BLAST of Sgr022199 vs. NCBI nr
Match: KAA0047987.1 (putative Xaa-Pro aminopeptidase P isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1254.2 bits (3244), Expect = 0.0e+00
Identity = 617/688 (89.68%), Positives = 654/688 (95.06%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS-PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIRNCS 180
           MHSLPSQAIRPLS  SSSS S YLR ISS F +SP+FN QS VFA IS RLRRST+R+CS
Sbjct: 1   MHSLPSQAIRPLSLSSSSSTSLYLRSISSTFSVSPFFNLQSPVFAAISSRLRRSTVRSCS 60

Query: 181 SITAKPSSELKRTRP-KPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMR 240
           SITAKPSSE++RTRP   EPD+KL+ALR+LFSKP IG+DAYIIPSQDAHQSEFIAECYMR
Sbjct: 61  SITAKPSSEIRRTRPNNDEPDSKLRALRDLFSKPDIGIDAYIIPSQDAHQSEFIAECYMR 120

Query: 241 RAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLAD 300
           RAYISGFTGSAGTAVVT DKAALWTDGRYFLQAEKQL+SSW LMRAGNHGVPTP EWLAD
Sbjct: 121 RAYISGFTGSAGTAVVTSDKAALWTDGRYFLQAEKQLNSSWTLMRAGNHGVPTPSEWLAD 180

Query: 301 ILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMR 360
           ILAPGGVVGIDPFLFSADAAEDLKE +SRKNHKLVYLYDYNLVDEIWK+SRPKPP+GP+R
Sbjct: 181 ILAPGGVVGIDPFLFSADAAEDLKETVSRKNHKLVYLYDYNLVDEIWKDSRPKPPRGPIR 240

Query: 361 VHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLI 420
           VHDLRYAGLDVASKL SLRSEL E+GSSAIIIS+LDEIAWLLNLRG+DVPNSPVMYAYL+
Sbjct: 241 VHDLRYAGLDVASKLASLRSELKEAGSSAIIISVLDEIAWLLNLRGSDVPNSPVMYAYLL 300

Query: 421 VEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAA 480
           VE+DGAKLFVDN KVTSEVMDHLK+AG+ELRPYDSIIS IENLAEKGANLWLDTSS+NAA
Sbjct: 301 VELDGAKLFVDNCKVTSEVMDHLKTAGVELRPYDSIISAIENLAEKGANLWLDTSSINAA 360

Query: 481 IANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGM 540
           IANAYR+ACD+YFIRLGNKRKGKGKT E+SNSQVGPTGVYK SP+SMAKAIKN+AELEGM
Sbjct: 361 IANAYRSACDKYFIRLGNKRKGKGKTSETSNSQVGPTGVYKSSPISMAKAIKNYAELEGM 420

Query: 541 RNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 600
           RNSHLRDAAAL QFWFWLE+EILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISASGA
Sbjct: 421 RNSHLRDAAALAQFWFWLEQEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISASGA 480

Query: 601 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 660
           NGAIIHYKPEPSDCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTT QKECFTRVLQ
Sbjct: 481 NGAIIHYKPEPSDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTRQKECFTRVLQ 540

Query: 661 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 720
           GHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG
Sbjct: 541 GHIALDQAVFPQDTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 600

Query: 721 NMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQT 780
           NMTGL +GMIVSNEPGYYEDHSFGIRIENLLIV++ DTPN FGGIGYLGFEKLTFVP+QT
Sbjct: 601 NMTGLHSGMIVSNEPGYYEDHSFGIRIENLLIVKDADTPNHFGGIGYLGFEKLTFVPIQT 660

Query: 781 KLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           KLVD +LLS  EVNWLNDYHSQVWEKV+
Sbjct: 661 KLVDITLLSVEEVNWLNDYHSQVWEKVS 688

BLAST of Sgr022199 vs. ExPASy Swiss-Prot
Match: Q8RY11 (Aminopeptidase P2 OS=Arabidopsis thaliana OX=3702 GN=APP2 PE=2 SV=1)

HSP 1 Score: 982.6 bits (2539), Expect = 2.9e-285
Identity = 489/688 (71.08%), Positives = 569/688 (82.70%), Query Frame = 0

Query: 131 PLSPSSSSRSRYL----RFISSAFPISPYFNSQSTVFATI-------SRRLRRSTIRNCS 190
           PL+ SS S +R +    R+  S F  +  FNS S +   +       +R    S+  + S
Sbjct: 3   PLTLSSPSLNRLVLSTSRYSHSLFLSN--FNSLSLIHRKLPYKPLFGARCHASSSSSSSS 62

Query: 191 SITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRR 250
           S TAK S E+++ + K   D KL ++R LFS+P +G+DAYIIPSQDAHQSEFIAECY RR
Sbjct: 63  SFTAKSSKEIRKAQTKVVVDEKLSSIRRLFSEPGVGIDAYIIPSQDAHQSEFIAECYARR 122

Query: 251 AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADI 310
           AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQL+SSWILMRAGN GVPT  EW+AD+
Sbjct: 123 AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLNSSWILMRAGNPGVPTASEWIADV 182

Query: 311 LAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRV 370
           LAPGG VGIDPFLFSADAAE+LKE+I++KNH+LVYLY+ NLVDEIWK+SRPKPP   +R+
Sbjct: 183 LAPGGRVGIDPFLFSADAAEELKEVIAKKNHELVYLYNVNLVDEIWKDSRPKPPSRQIRI 242

Query: 371 HDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIV 430
           HDL+YAGLDVASKL SLR+++ ++G+SAI+ISMLDEIAW+LNLRG+DVP+SPVMYAYLIV
Sbjct: 243 HDLKYAGLDVASKLLSLRNQIMDAGTSAIVISMLDEIAWVLNLRGSDVPHSPVMYAYLIV 302

Query: 431 EIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAI 490
           E+D A+LFVDNSKVT EV DHLK+AGIELRPYDSI+  I++LA +GA L +D S++N AI
Sbjct: 303 EVDQAQLFVDNSKVTVEVKDHLKNAGIELRPYDSILQGIDSLAARGAQLLMDPSTLNVAI 362

Query: 491 ANAYRTACDRYFIRLGNKRKGKGK-TFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGM 550
            + Y++AC+RY     ++ K K K T  SS     P+G+Y  SP+S AKAIKN AEL+GM
Sbjct: 363 ISTYKSACERYSRNFESEAKVKTKFTDSSSGYTANPSGIYMQSPISWAKAIKNDAELKGM 422

Query: 551 RNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 610
           +NSHLRDAAAL  FW WLEEE+     LTEV+VAD+LLEFR  QDGF+DTSFDTIS SGA
Sbjct: 423 KNSHLRDAAALAHFWAWLEEEVHKNANLTEVDVADRLLEFRSMQDGFMDTSFDTISGSGA 482

Query: 611 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 670
           NGAIIHYKPEP  CS VDP KLFLLDSGAQYVDGTTDITRTVHF EP+  +KECFTRVLQ
Sbjct: 483 NGAIIHYKPEPESCSRVDPQKLFLLDSGAQYVDGTTDITRTVHFSEPSAREKECFTRVLQ 542

Query: 671 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 730
           GHIALDQAVFP+ TPGFVLD FARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR+G
Sbjct: 543 GHIALDQAVFPEGTPGFVLDGFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRYG 602

Query: 731 NMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQT 790
           NMT LQNGMIVSNEPGYYEDH+FGIRIENLL VR+ +TPNRFGG  YLGFEKLTF P+QT
Sbjct: 603 NMTPLQNGMIVSNEPGYYEDHAFGIRIENLLHVRDAETPNRFGGATYLGFEKLTFFPIQT 662

Query: 791 KLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           K+VD SLLS  EV+WLN YH++VWEKV+
Sbjct: 663 KMVDVSLLSDTEVDWLNSYHAEVWEKVS 688

BLAST of Sgr022199 vs. ExPASy Swiss-Prot
Match: D1ZKF3 (Probable Xaa-Pro aminopeptidase P OS=Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / K-hell) OX=771870 GN=AMPP PE=3 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 3.3e-164
Identity = 309/608 (50.82%), Positives = 392/608 (64.47%), Query Frame = 0

Query: 201 KLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKAA 260
           +L ALR L  + S  VD Y++PS+D+H SE+I +C  RR +ISGF+GSAGTAVVT DKAA
Sbjct: 8   RLAALRSLMKERS--VDIYVVPSEDSHASEYITDCDARRTFISGFSGSAGTAVVTLDKAA 67

Query: 261 LWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAED 320
           L TDGRYF QA KQL  +W L++ G   VPT  EW AD  A G  VGIDP L S   AE 
Sbjct: 68  LATDGRYFNQASKQLDENWHLLKTGLQDVPTWQEWTADESAGGKTVGIDPTLISPAVAEK 127

Query: 321 LKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSEL 380
           L   I +     +     NLVD +W ESRP  P  P+ +   +YAG   A KLT LR EL
Sbjct: 128 LNGDIKKHGGSGLKAVTENLVDLVWGESRPPRPSEPVFLLGAKYAGKGAAEKLTDLRKEL 187

Query: 381 GESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMDH 440
            +  ++A ++SMLDEIAWL NLRGND+  +PV ++Y IV  D A L+VD SK+T EV  +
Sbjct: 188 EKKKAAAFVVSMLDEIAWLFNLRGNDITYNPVFFSYAIVTKDSATLYVDESKLTDEVKQY 247

Query: 441 LKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAIANAYRTACDRYFIRLGNKRKG 500
           L   G E++PY  +  + E LA             NAA + +      +Y +   NK   
Sbjct: 248 LAENGTEIKPYTDLFKDTEVLA-------------NAAKSTSESEKPTKYLV--SNKASW 307

Query: 501 KGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFWLEEEI 560
             K        V        SP+  AKAIKN  ELEGMR  H+RD AAL++++ WLE+++
Sbjct: 308 ALKLALGGEKHVDEVR----SPIGDAKAIKNETELEGMRKCHIRDGAALIKYFAWLEDQL 367

Query: 561 LN-GVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGANGAIIHYKPEPSDCSVVDPNK 620
           +N   KL EVE AD+L +FR  Q  FV  SFDTIS++G NGAIIHYKPE   CSV+DPN 
Sbjct: 368 VNKKAKLNEVEAADQLEKFRSEQSDFVGLSFDTISSTGPNGAIIHYKPERGACSVIDPNA 427

Query: 621 LFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQGHIALDQAVFPQNTPGFVLDA 680
           ++L DSGAQ+ DGTTD+TRT+HFG+PT  +K+ +T VL+G+IALD AVFP+ T GF LDA
Sbjct: 428 IYLCDSGAQFYDGTTDVTRTLHFGQPTAAEKKSYTLVLKGNIALDTAVFPKGTSGFALDA 487

Query: 681 FARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFGNM-TGLQNGMIVSNEPGYYED 740
            AR  LWK GLDYRHGTGHGVG+ LNVHEGP  I  R   +   L  G ++S EPGYYED
Sbjct: 488 LARQFLWKYGLDYRHGTGHGVGSFLNVHEGPIGIGTRKAYIDVPLAPGNVLSIEPGYYED 547

Query: 741 HSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQTKLVDFSLLSAAEVNWLNDYH 800
            ++GIRIENL IVREV T ++FG   YLGFE +T VP   KL+D SLL+  E +WLN  +
Sbjct: 548 GNYGIRIENLAIVREVKTEHQFGDKPYLGFEHITMVPYCRKLIDESLLTQEEKDWLNKSN 594

Query: 801 SQVWEKVA 807
            ++ + +A
Sbjct: 608 EEIRKNMA 594

BLAST of Sgr022199 vs. ExPASy Swiss-Prot
Match: Q7RYL6 (Probable Xaa-Pro aminopeptidase P OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) OX=367110 GN=ampp PE=3 SV=2)

HSP 1 Score: 572.8 bits (1475), Expect = 6.9e-162
Identity = 304/608 (50.00%), Positives = 395/608 (64.97%), Query Frame = 0

Query: 201 KLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKAA 260
           +L ALR L  + +  VD Y++PS+D+H SE+IAEC  RRA+ISGFTGSAGTAVVT DKAA
Sbjct: 78  RLAALRSLMKERN--VDIYVVPSEDSHASEYIAECDARRAFISGFTGSAGTAVVTLDKAA 137

Query: 261 LWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAED 320
           L TDGRYF QA KQL  +W L++ G   VPT  EW AD  A G  VGIDP L S   A+ 
Sbjct: 138 LATDGRYFNQASKQLDENWHLLKTGLQDVPTWQEWTADESAGGKSVGIDPTLISPAVADK 197

Query: 321 LKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSEL 380
           L   I +     +   + NLVD +W +SRP  P  P+ +   +Y+G   A KLT+LR EL
Sbjct: 198 LDGDIKKHGGAGLKAINENLVDLVWGDSRPPRPSEPVFLLGAKYSGKGTAEKLTNLRKEL 257

Query: 381 GESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMDH 440
            +  ++A ++SMLDE+AWL NLRGND+  +PV ++Y IV  D A L+VD SK+  EV  +
Sbjct: 258 EKKKAAAFVVSMLDEVAWLFNLRGNDITYNPVFFSYAIVTKDSATLYVDESKLNDEVKQY 317

Query: 441 LKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAIANAYRTACDRYFIRLGNKRKG 500
           L   G  ++PY+ +  + E LA             NAA + +      +Y +   NK   
Sbjct: 318 LAENGTGIKPYNDLFKDTEILA-------------NAAKSTSESDKPTKYLV--SNKASW 377

Query: 501 KGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFWLEEEI 560
             K        V        SP+  AKAIKN  ELEGMR  H+RD AAL++++ WLE+++
Sbjct: 378 ALKLALGGEKHVDEVR----SPIGDAKAIKNETELEGMRRCHIRDGAALIKYFAWLEDQL 437

Query: 561 LN-GVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGANGAIIHYKPEPSDCSVVDPNK 620
           +N   KL EVE AD+L +FR  Q  FV  SFDTIS++G NGAIIHYKPE   CSV+DP+ 
Sbjct: 438 INKKAKLDEVEAADQLEQFRSEQADFVGLSFDTISSTGPNGAIIHYKPERGACSVIDPDA 497

Query: 621 LFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQGHIALDQAVFPQNTPGFVLDA 680
           ++L DSGAQ+ DGTTD+TRT+HFG+PT  +++ +T VL+G+IALD AVFP+ T GF LDA
Sbjct: 498 IYLCDSGAQFCDGTTDVTRTLHFGQPTDAERKSYTLVLKGNIALDTAVFPKGTSGFALDA 557

Query: 681 FARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFGNM-TGLQNGMIVSNEPGYYED 740
            AR  LWK GLDYRHGTGHGVG+ LNVHEGP  I  R   +   L  G ++S EPGYYED
Sbjct: 558 LARQFLWKYGLDYRHGTGHGVGSFLNVHEGPIGIGTRKAYIDVPLAPGNVLSIEPGYYED 617

Query: 741 HSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQTKLVDFSLLSAAEVNWLNDYH 800
            ++GIRIENL IVREV T ++FG   YLGFE +T VP   KL+D SLL+  E +WLN  +
Sbjct: 618 GNYGIRIENLAIVREVKTEHQFGDKPYLGFEHVTMVPYCRKLIDESLLTQEEKDWLNKSN 664

Query: 801 SQVWEKVA 807
            ++ + +A
Sbjct: 678 EEIRKNMA 664

BLAST of Sgr022199 vs. ExPASy Swiss-Prot
Match: D5GAC6 (Probable Xaa-Pro aminopeptidase P OS=Tuber melanosporum (strain Mel28) OX=656061 GN=AMPP PE=3 SV=1)

HSP 1 Score: 570.5 bits (1469), Expect = 3.4e-161
Identity = 290/608 (47.70%), Positives = 390/608 (64.14%), Query Frame = 0

Query: 200 AKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKA 259
           ++L  LREL  +    VD Y++PS+DAH SE+I     RRA+ISGFTGSAG A+VT++KA
Sbjct: 8   SRLAKLRELMKRER--VDVYVVPSEDAHSSEYICAADARRAFISGFTGSAGCAIVTQEKA 67

Query: 260 ALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAE 319
           AL TDGRYF QA +QL  +W L++ G   VPT  EW+A     G  VG+D  + +A  A+
Sbjct: 68  ALSTDGRYFNQAARQLDENWELLKQGLPDVPTWQEWVAQQAEGGKNVGVDATVITAQQAK 127

Query: 320 DLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSE 379
            L+  I +K    +     NL+DE+W   RP  P  P+ V D +Y+G +   K+ ++R E
Sbjct: 128 SLETRIKKKGGTSLLGIPNNLIDEVWGADRPNRPNNPVMVLDEKYSGKEFPLKIEAVRKE 187

Query: 380 LGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMD 439
           L    S   ++SMLDEIAWL NLRG D+P +PV ++Y  +  +   L++D+SK+  +V+ 
Sbjct: 188 LENKKSPGFVVSMLDEIAWLFNLRGTDIPYNPVFFSYAFISPESTTLYIDSSKLDEKVIA 247

Query: 440 HLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAIANAYRTACDRYFIRLGNKRK 499
           HL SA +++RPY  I  EI+ LA+K                   +   D      G K  
Sbjct: 248 HLGSA-VKIRPYHEIFDEIDLLAQK---------------LKVGQPETDSKASEDGGKWL 307

Query: 500 GKGKTFESSNSQVGPTGVYKV--SPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFWLE 559
              KT  + +  +G     +V  SPV   KA+KN  E EGM+  H+RD AAL +++ WLE
Sbjct: 308 VSNKTSWALSKALGGDDAIEVIRSPVEEEKAVKNDTEKEGMKRCHIRDGAALTEYFAWLE 367

Query: 560 EEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGANGAIIHYKPEPSDCSVVDP 619
           +E+L G K+ EV+ ADKL + R R + F+  SFDTIS++G N A+IHYKPE  +CSV+DP
Sbjct: 368 DELLKGTKIDEVQAADKLEQIRSRGENFMGLSFDTISSTGPNAAVIHYKPEAGNCSVIDP 427

Query: 620 NKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQGHIALDQAVFPQNTPGFVL 679
             ++L DSGAQY+DGTTD TRT+HFGEPT  +++ +T VL+G IALD+A+FP+ T GF L
Sbjct: 428 KAIYLCDSGAQYLDGTTDTTRTLHFGEPTDMERKSYTLVLKGMIALDRAIFPKGTSGFAL 487

Query: 680 DAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG-NMTGLQNGMIVSNEPGYY 739
           D  AR  LW  GLDYRHGTGHGVG+ LNVHEGP  I  R   +   L  GM VSNEPGYY
Sbjct: 488 DILARQFLWSEGLDYRHGTGHGVGSFLNVHEGPFGIGTRIQYSEVALSPGMFVSNEPGYY 547

Query: 740 EDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQTKLVDFSLLSAAEVNWLND 799
           ED SFGIRIEN+++V+EV T + FG   Y GFE++T VPM  KL+D  LL+ AE  WLN 
Sbjct: 548 EDGSFGIRIENIIMVKEVKTSHSFGDRPYFGFERVTMVPMCRKLIDAGLLTPAETEWLNS 597

Query: 800 YHSQVWEK 805
           YH++V+EK
Sbjct: 608 YHAEVFEK 597

BLAST of Sgr022199 vs. ExPASy Swiss-Prot
Match: B0DZL3 (Probable Xaa-Pro aminopeptidase P OS=Laccaria bicolor (strain S238N-H82 / ATCC MYA-4686) OX=486041 GN=AMPP PE=3 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 2.2e-160
Identity = 294/612 (48.04%), Positives = 396/612 (64.71%), Query Frame = 0

Query: 201 KLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKAA 260
           +L  LREL  + S  V A+++PS+D H SE++A C  RRA+ISGF GSAG A++T DKA 
Sbjct: 45  RLAKLRELMKQHS--VQAFVVPSEDQHSSEYLANCDKRRAFISGFDGSAGCAIITTDKAY 104

Query: 261 LWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAED 320
           L+TDGRYFLQAEKQL  +W LM+ G   VPT  ++L   L P   +GID  L +A  AE 
Sbjct: 105 LFTDGRYFLQAEKQLDKNWKLMKQGLPDVPTWQDFLYKNLGPHTQIGIDATLLAASDAES 164

Query: 321 LKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSEL 380
           L + ++ K  KLV L + NLVD +W E RP  P+  +   D++Y+G     K+ +LR E+
Sbjct: 165 LTKQLTPKYSKLVSLKE-NLVDVVWGEDRPSRPQNSVFHLDVKYSGQSHLDKIATLREEM 224

Query: 381 GESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMDH 440
            +  + AI+++MLDE+AWLLNLRG+D+  +PV +AY +V +D   LF+D++++      +
Sbjct: 225 KKKKAEAIVVTMLDEVAWLLNLRGSDIEYNPVFFAYAVVTMDEVILFIDSAQLDDTARHN 284

Query: 441 LKSAGIELRPYDSIISEIENLAEKGANLWLDTSSV-----NAAIANAYRTACDRYFIRLG 500
           L+   +   PY++I   + +L+     L LD  S       A++A A     D Y I   
Sbjct: 285 LEH--VYTMPYEAIFEHLNSLSR---TLELDRDSKVLIGDRASLAVADAIGKDNYTI--- 344

Query: 501 NKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFW 560
                                    SP++  KAIKN  ELEG R SH+RD AALV+++ W
Sbjct: 345 -----------------------VRSPIADLKAIKNKTELEGFRQSHIRDGAALVRYFAW 404

Query: 561 LEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGANGAIIHYKPEPSDCSVV 620
           LEE++ +G  + E + ADKL  FR   D F   SFDTIS +G NGAIIHYKP+P+DC+++
Sbjct: 405 LEEQLNHGTVINESQGADKLEAFRSELDLFRGLSFDTISGTGPNGAIIHYKPDPNDCAII 464

Query: 621 DPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQGHIALDQAVFPQNTPGF 680
             ++++L DSG Q++DGTTD+TRT HFG PT  +K  FTRVLQGHIA+D AVFP  T G+
Sbjct: 465 KKDQVYLCDSGGQFLDGTTDVTRTWHFGTPTDEEKRAFTRVLQGHIAIDTAVFPNGTTGY 524

Query: 681 VLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG-NMTGLQNGMIVSNEPG 740
           V+DAFAR +LW+ GLDYRHGTGHGVG  LNVHEGP  I  R   N T L+ GM VSNEPG
Sbjct: 525 VIDAFARRALWQDGLDYRHGTGHGVGHFLNVHEGPHGIGVRIALNNTPLKAGMTVSNEPG 584

Query: 741 YYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQTKLVDFSLLSAAEVNWL 800
           YY D  FGIRIE++++VREV TPN FG  GYLGFE +T  P+   LVD SLL+  E  WL
Sbjct: 585 YYADGKFGIRIESIVLVREVKTPNNFGDKGYLGFENVTMCPIHKNLVDVSLLNEQEKKWL 622

Query: 801 NDYHSQVWEKVA 807
           ++YH++ W+KV+
Sbjct: 645 DEYHAETWDKVS 622

BLAST of Sgr022199 vs. ExPASy TrEMBL
Match: A0A6J1DB28 (probable Xaa-Pro aminopeptidase P OS=Momordica charantia OX=3673 GN=LOC111018840 PE=3 SV=1)

HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 634/690 (91.88%), Positives = 658/690 (95.36%), Query Frame = 0

Query: 121 MHSLPSQAIRPL----SPSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPL    S SSSSR RYLRFISS FPISP+FNSQSTVFA ISRRLRRS IR
Sbjct: 1   MHSLPSQAIRPLSLSSSSSSSSRFRYLRFISSTFPISPFFNSQSTVFAAISRRLRRSPIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CSSITAKPSSELK+  PK E D KL  LR+LFSKPSIG+DAY+IPSQDAHQSEFIAECY
Sbjct: 61  SCSSITAKPSSELKKVYPKSETDGKLLLLRDLFSKPSIGIDAYVIPSQDAHQSEFIAECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSWILMR+GN  VPTPGEWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDEAALWTDGRYFLQAEKQLSSSWILMRSGNQEVPTPGEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVY+YDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYIYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDLRYAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY
Sbjct: 241 IRVHDLRYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVEIDGAKLFVD+SKVT EVMDHLKSAGIELRPYDSIISEIE LAEKGANLWLDTSSVN
Sbjct: 301 LIVEIDGAKLFVDDSKVTPEVMDHLKSAGIELRPYDSIISEIEKLAEKGANLWLDTSSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYRTA DRY+IRLGNKRKGKGKT+E+SNSQVGPTGVYK+SP+S+AKAIKNHAELE
Sbjct: 361 AAIANAYRTAGDRYYIRLGNKRKGKGKTYETSNSQVGPTGVYKLSPISIAKAIKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMR+SHLRD AAL QFWFWLEE+ILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRSSHLRDGAALAQFWFWLEEKILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEP DCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV
Sbjct: 481 GANGAIIHYKPEPDDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ GMIVSNEPGYYEDHSFGIRIENLLIV+EVDTPNRFGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQTGMIVSNEPGYYEDHSFGIRIENLLIVKEVDTPNRFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           Q KLVD SLLSAAEVNWLNDYHS VWEKV+
Sbjct: 661 QAKLVDVSLLSAAEVNWLNDYHSLVWEKVS 690

BLAST of Sgr022199 vs. ExPASy TrEMBL
Match: A0A6J1HRG3 (probable Xaa-Pro aminopeptidase P OS=Cucurbita maxima OX=3661 GN=LOC111465452 PE=3 SV=1)

HSP 1 Score: 1256.5 bits (3250), Expect = 0.0e+00
Identity = 620/690 (89.86%), Positives = 652/690 (94.49%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS----PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPLS     SSSS S YLRFISS FPISPYFN QS VFA ISRRLRRSTIR
Sbjct: 1   MHSLPSQAIRPLSFSSFSSSSSSSLYLRFISSTFPISPYFNPQSPVFAAISRRLRRSTIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CS ITAKPSS+L+ TR K E D+KLQALR+LFSKP I +DAY+IPSQDAHQSEFI ECY
Sbjct: 61  SCSFITAKPSSDLRNTRSKDESDSKLQALRKLFSKPGIDIDAYVIPSQDAHQSEFIGECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSW+LMRAGNHGVPTP EWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDQAALWTDGRYFLQAEKQLSSSWVLMRAGNHGVPTPSEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDL+YAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRG+DVPNSPVMYAY
Sbjct: 241 IRVHDLKYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGSDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVE+DGAKLFVD SKV+SEVMDHLKSAG+ELRPYDSIISEIENLAEKGANLWLD  SVN
Sbjct: 301 LIVEMDGAKLFVDTSKVSSEVMDHLKSAGVELRPYDSIISEIENLAEKGANLWLDPFSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYR+ACD+YFIRLGNK+KGKGKT E+SNS+VGPTGVYK SPVS+AKA+KNHAELE
Sbjct: 361 AAIANAYRSACDKYFIRLGNKKKGKGKTSETSNSEVGPTGVYKSSPVSIAKAVKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMRNSHLRDAAAL QFW W EEEILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRNSHLRDAAALAQFWSWFEEEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEPSDCS VD NKLFLLDSGAQYVDGTTDITRTVHFGEPTT+QKECFTRV
Sbjct: 481 GANGAIIHYKPEPSDCSAVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTYQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQHTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ+GMIVSNEPGYYEDHSFGIRIENLL+VR+  TPN FGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQDGMIVSNEPGYYEDHSFGIRIENLLVVRDARTPNCFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           QTK+VD SLLS AEVNWLNDYHSQVWEKV+
Sbjct: 661 QTKMVDISLLSVAEVNWLNDYHSQVWEKVS 690

BLAST of Sgr022199 vs. ExPASy TrEMBL
Match: A0A5A7U190 (Putative Xaa-Pro aminopeptidase P isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold385G00500 PE=3 SV=1)

HSP 1 Score: 1254.2 bits (3244), Expect = 0.0e+00
Identity = 617/688 (89.68%), Positives = 654/688 (95.06%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS-PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIRNCS 180
           MHSLPSQAIRPLS  SSSS S YLR ISS F +SP+FN QS VFA IS RLRRST+R+CS
Sbjct: 1   MHSLPSQAIRPLSLSSSSSTSLYLRSISSTFSVSPFFNLQSPVFAAISSRLRRSTVRSCS 60

Query: 181 SITAKPSSELKRTRP-KPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMR 240
           SITAKPSSE++RTRP   EPD+KL+ALR+LFSKP IG+DAYIIPSQDAHQSEFIAECYMR
Sbjct: 61  SITAKPSSEIRRTRPNNDEPDSKLRALRDLFSKPDIGIDAYIIPSQDAHQSEFIAECYMR 120

Query: 241 RAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLAD 300
           RAYISGFTGSAGTAVVT DKAALWTDGRYFLQAEKQL+SSW LMRAGNHGVPTP EWLAD
Sbjct: 121 RAYISGFTGSAGTAVVTSDKAALWTDGRYFLQAEKQLNSSWTLMRAGNHGVPTPSEWLAD 180

Query: 301 ILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMR 360
           ILAPGGVVGIDPFLFSADAAEDLKE +SRKNHKLVYLYDYNLVDEIWK+SRPKPP+GP+R
Sbjct: 181 ILAPGGVVGIDPFLFSADAAEDLKETVSRKNHKLVYLYDYNLVDEIWKDSRPKPPRGPIR 240

Query: 361 VHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLI 420
           VHDLRYAGLDVASKL SLRSEL E+GSSAIIIS+LDEIAWLLNLRG+DVPNSPVMYAYL+
Sbjct: 241 VHDLRYAGLDVASKLASLRSELKEAGSSAIIISVLDEIAWLLNLRGSDVPNSPVMYAYLL 300

Query: 421 VEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAA 480
           VE+DGAKLFVDN KVTSEVMDHLK+AG+ELRPYDSIIS IENLAEKGANLWLDTSS+NAA
Sbjct: 301 VELDGAKLFVDNCKVTSEVMDHLKTAGVELRPYDSIISAIENLAEKGANLWLDTSSINAA 360

Query: 481 IANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGM 540
           IANAYR+ACD+YFIRLGNKRKGKGKT E+SNSQVGPTGVYK SP+SMAKAIKN+AELEGM
Sbjct: 361 IANAYRSACDKYFIRLGNKRKGKGKTSETSNSQVGPTGVYKSSPISMAKAIKNYAELEGM 420

Query: 541 RNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 600
           RNSHLRDAAAL QFWFWLE+EILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISASGA
Sbjct: 421 RNSHLRDAAALAQFWFWLEQEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISASGA 480

Query: 601 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 660
           NGAIIHYKPEPSDCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPTT QKECFTRVLQ
Sbjct: 481 NGAIIHYKPEPSDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTTRQKECFTRVLQ 540

Query: 661 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 720
           GHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG
Sbjct: 541 GHIALDQAVFPQDTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 600

Query: 721 NMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQT 780
           NMTGL +GMIVSNEPGYYEDHSFGIRIENLLIV++ DTPN FGGIGYLGFEKLTFVP+QT
Sbjct: 601 NMTGLHSGMIVSNEPGYYEDHSFGIRIENLLIVKDADTPNHFGGIGYLGFEKLTFVPIQT 660

Query: 781 KLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           KLVD +LLS  EVNWLNDYHSQVWEKV+
Sbjct: 661 KLVDITLLSVEEVNWLNDYHSQVWEKVS 688

BLAST of Sgr022199 vs. ExPASy TrEMBL
Match: A0A6J1FH61 (probable Xaa-Pro aminopeptidase P OS=Cucurbita moschata OX=3662 GN=LOC111443950 PE=3 SV=1)

HSP 1 Score: 1253.0 bits (3241), Expect = 0.0e+00
Identity = 621/690 (90.00%), Positives = 649/690 (94.06%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS----PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIR 180
           MHSLPSQAIRPLS     SSSS S YLRFISS FPISPYFN QS VFA ISRRLRRSTIR
Sbjct: 1   MHSLPSQAIRPLSFSSFSSSSSSSLYLRFISSTFPISPYFNPQSPVFAAISRRLRRSTIR 60

Query: 181 NCSSITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECY 240
           +CS ITAKPSS+L+ TR K E D+KLQALR+LFSKP I +DAY+IPSQDAHQSEFI ECY
Sbjct: 61  SCSFITAKPSSDLRNTRSKDESDSKLQALRKLFSKPGIDIDAYVIPSQDAHQSEFIGECY 120

Query: 241 MRRAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWL 300
           MRRAYISGFTGSAGTAVVTKD+AALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTP EWL
Sbjct: 121 MRRAYISGFTGSAGTAVVTKDQAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPSEWL 180

Query: 301 ADILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 360
           AD LAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP
Sbjct: 181 ADTLAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDEIWKESRPKPPKGP 240

Query: 361 MRVHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAY 420
           +RVHDL+YAGLDVASKL SLRSELGE+GSSAIIISMLDEIAWLLNLRG+DVPNSPVMYAY
Sbjct: 241 IRVHDLKYAGLDVASKLASLRSELGEAGSSAIIISMLDEIAWLLNLRGSDVPNSPVMYAY 300

Query: 421 LIVEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVN 480
           LIVE+DGAKLFVD SKV+SEVMDHLKSAG+ELRPYDSIISEIENLAEKGANLWLD  SVN
Sbjct: 301 LIVEMDGAKLFVDTSKVSSEVMDHLKSAGVELRPYDSIISEIENLAEKGANLWLDPVSVN 360

Query: 481 AAIANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELE 540
           AAIANAYR ACD+YFIRLGNKRK K KT E+SNS VGPTGVYK SPVS+AKA+KNHAELE
Sbjct: 361 AAIANAYRNACDKYFIRLGNKRKSKDKTSETSNSHVGPTGVYKSSPVSIAKAVKNHAELE 420

Query: 541 GMRNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISAS 600
           GMRNSHLRDAAAL QFW WLEEEILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISAS
Sbjct: 421 GMRNSHLRDAAALAQFWSWLEEEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISAS 480

Query: 601 GANGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRV 660
           GANGAIIHYKPEPSDCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEP T+QKECFTRV
Sbjct: 481 GANGAIIHYKPEPSDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPITYQKECFTRV 540

Query: 661 LQGHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 720
           LQGHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR
Sbjct: 541 LQGHIALDQAVFPQHTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR 600

Query: 721 FGNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPM 780
           FGNMTGLQ+GMIVSNEPGYYEDHSFGIRIENLL+VR+  TPN FGGIGYLGFEKLTFVP+
Sbjct: 601 FGNMTGLQDGMIVSNEPGYYEDHSFGIRIENLLVVRDAKTPNCFGGIGYLGFEKLTFVPI 660

Query: 781 QTKLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           QTK+VD SLLS AEVNWLNDYHSQVWEKV+
Sbjct: 661 QTKMVDISLLSVAEVNWLNDYHSQVWEKVS 690

BLAST of Sgr022199 vs. ExPASy TrEMBL
Match: A0A0A0LIP6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G022830 PE=3 SV=1)

HSP 1 Score: 1245.3 bits (3221), Expect = 0.0e+00
Identity = 615/688 (89.39%), Positives = 652/688 (94.77%), Query Frame = 0

Query: 121 MHSLPSQAIRPLS-PSSSSRSRYLRFISSAFPISPYFNSQSTVFATISRRLRRSTIRNCS 180
           MHS+PSQAIRPLS  SSSS S YLR ISS F ISPYFN QS VFA ISRRLRRST+R+CS
Sbjct: 1   MHSIPSQAIRPLSLSSSSSTSLYLRSISSTFSISPYFNLQSPVFAAISRRLRRSTLRSCS 60

Query: 181 SITAKPSSELKRTRP-KPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMR 240
           SITAKPSSE++R R    EPD+KL+ALR+LFSKP+IG+DAYIIPSQDAHQSEFIAECYMR
Sbjct: 61  SITAKPSSEIRRNRTNNDEPDSKLRALRDLFSKPNIGIDAYIIPSQDAHQSEFIAECYMR 120

Query: 241 RAYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLAD 300
           RAYISGFTGSAGTAVVT DKAALWTDGRYFLQAEKQL+SSW LMRAGNHGVPTP EWLAD
Sbjct: 121 RAYISGFTGSAGTAVVTNDKAALWTDGRYFLQAEKQLNSSWTLMRAGNHGVPTPSEWLAD 180

Query: 301 ILAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMR 360
           ILAPGGVVGIDPFLFSADAAEDLKE ISRKNHKLVYLYDYNLVD IWK+SR KPP+GP+R
Sbjct: 181 ILAPGGVVGIDPFLFSADAAEDLKETISRKNHKLVYLYDYNLVDAIWKDSRSKPPRGPIR 240

Query: 361 VHDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLI 420
           VHDLRYAGLDVASKL SLRSEL E+GSSAIIISMLDEIAWLLNLRG+DVPNSPVMYAYL+
Sbjct: 241 VHDLRYAGLDVASKLASLRSELKEAGSSAIIISMLDEIAWLLNLRGSDVPNSPVMYAYLL 300

Query: 421 VEIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAA 480
           VE+DGAKLFVD+ KVTSEVMDHLK+AG+ELRPYDSIIS IENLAEKGANLWLDTSS+NAA
Sbjct: 301 VELDGAKLFVDDCKVTSEVMDHLKTAGVELRPYDSIISAIENLAEKGANLWLDTSSINAA 360

Query: 481 IANAYRTACDRYFIRLGNKRKGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGM 540
           IANAYR+ACD+YFIRLGNKRKGK KT E+SNSQVGPTGVYK SP+SMAKAIKN+AELEGM
Sbjct: 361 IANAYRSACDKYFIRLGNKRKGKSKTSETSNSQVGPTGVYKSSPISMAKAIKNYAELEGM 420

Query: 541 RNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 600
           RNSHLRDAAAL QFWFWLE+EILNGVKLTEVEVADKLLEFRK+QDGFVDTSFDTISASGA
Sbjct: 421 RNSHLRDAAALAQFWFWLEQEILNGVKLTEVEVADKLLEFRKKQDGFVDTSFDTISASGA 480

Query: 601 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 660
           NGAIIHYKPEPSDCSVVD NKLFLLDSGAQYVDGTTDITRTVHFGEPT  QKECFTRVLQ
Sbjct: 481 NGAIIHYKPEPSDCSVVDANKLFLLDSGAQYVDGTTDITRTVHFGEPTARQKECFTRVLQ 540

Query: 661 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 720
           GHIALDQAVFPQ+TPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG
Sbjct: 541 GHIALDQAVFPQDTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 600

Query: 721 NMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQT 780
           NMTGL NGMIVSNEPGYYEDHSFGIRIENLLIV++ +TPN FGGIGYLGFEKLTFVP+QT
Sbjct: 601 NMTGLHNGMIVSNEPGYYEDHSFGIRIENLLIVKDANTPNHFGGIGYLGFEKLTFVPIQT 660

Query: 781 KLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           KLVD +LLSA+EVNWLNDYHSQVWEKV+
Sbjct: 661 KLVDITLLSASEVNWLNDYHSQVWEKVS 688

BLAST of Sgr022199 vs. TAIR 10
Match: AT3G05350.1 (Metallopeptidase M24 family protein)

HSP 1 Score: 982.6 bits (2539), Expect = 2.0e-286
Identity = 489/688 (71.08%), Positives = 569/688 (82.70%), Query Frame = 0

Query: 131 PLSPSSSSRSRYL----RFISSAFPISPYFNSQSTVFATI-------SRRLRRSTIRNCS 190
           PL+ SS S +R +    R+  S F  +  FNS S +   +       +R    S+  + S
Sbjct: 3   PLTLSSPSLNRLVLSTSRYSHSLFLSN--FNSLSLIHRKLPYKPLFGARCHASSSSSSSS 62

Query: 191 SITAKPSSELKRTRPKPEPDAKLQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRR 250
           S TAK S E+++ + K   D KL ++R LFS+P +G+DAYIIPSQDAHQSEFIAECY RR
Sbjct: 63  SFTAKSSKEIRKAQTKVVVDEKLSSIRRLFSEPGVGIDAYIIPSQDAHQSEFIAECYARR 122

Query: 251 AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADI 310
           AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQL+SSWILMRAGN GVPT  EW+AD+
Sbjct: 123 AYISGFTGSAGTAVVTKDKAALWTDGRYFLQAEKQLNSSWILMRAGNPGVPTASEWIADV 182

Query: 311 LAPGGVVGIDPFLFSADAAEDLKEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRV 370
           LAPGG VGIDPFLFSADAAE+LKE+I++KNH+LVYLY+ NLVDEIWK+SRPKPP   +R+
Sbjct: 183 LAPGGRVGIDPFLFSADAAEELKEVIAKKNHELVYLYNVNLVDEIWKDSRPKPPSRQIRI 242

Query: 371 HDLRYAGLDVASKLTSLRSELGESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIV 430
           HDL+YAGLDVASKL SLR+++ ++G+SAI+ISMLDEIAW+LNLRG+DVP+SPVMYAYLIV
Sbjct: 243 HDLKYAGLDVASKLLSLRNQIMDAGTSAIVISMLDEIAWVLNLRGSDVPHSPVMYAYLIV 302

Query: 431 EIDGAKLFVDNSKVTSEVMDHLKSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAI 490
           E+D A+LFVDNSKVT EV DHLK+AGIELRPYDSI+  I++LA +GA L +D S++N AI
Sbjct: 303 EVDQAQLFVDNSKVTVEVKDHLKNAGIELRPYDSILQGIDSLAARGAQLLMDPSTLNVAI 362

Query: 491 ANAYRTACDRYFIRLGNKRKGKGK-TFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGM 550
            + Y++AC+RY     ++ K K K T  SS     P+G+Y  SP+S AKAIKN AEL+GM
Sbjct: 363 ISTYKSACERYSRNFESEAKVKTKFTDSSSGYTANPSGIYMQSPISWAKAIKNDAELKGM 422

Query: 551 RNSHLRDAAALVQFWFWLEEEILNGVKLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 610
           +NSHLRDAAAL  FW WLEEE+     LTEV+VAD+LLEFR  QDGF+DTSFDTIS SGA
Sbjct: 423 KNSHLRDAAALAHFWAWLEEEVHKNANLTEVDVADRLLEFRSMQDGFMDTSFDTISGSGA 482

Query: 611 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 670
           NGAIIHYKPEP  CS VDP KLFLLDSGAQYVDGTTDITRTVHF EP+  +KECFTRVLQ
Sbjct: 483 NGAIIHYKPEPESCSRVDPQKLFLLDSGAQYVDGTTDITRTVHFSEPSAREKECFTRVLQ 542

Query: 671 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRFG 730
           GHIALDQAVFP+ TPGFVLD FARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR+G
Sbjct: 543 GHIALDQAVFPEGTPGFVLDGFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFRYG 602

Query: 731 NMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQT 790
           NMT LQNGMIVSNEPGYYEDH+FGIRIENLL VR+ +TPNRFGG  YLGFEKLTF P+QT
Sbjct: 603 NMTPLQNGMIVSNEPGYYEDHAFGIRIENLLHVRDAETPNRFGGATYLGFEKLTFFPIQT 662

Query: 791 KLVDFSLLSAAEVNWLNDYHSQVWEKVA 807
           K+VD SLLS  EV+WLN YH++VWEKV+
Sbjct: 663 KMVDVSLLSDTEVDWLNSYHAEVWEKVS 688

BLAST of Sgr022199 vs. TAIR 10
Match: AT4G36760.1 (aminopeptidase P1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.5e-151
Identity = 280/641 (43.68%), Positives = 395/641 (61.62%), Query Frame = 0

Query: 202 LQALRELFSKPSIGVDAYIIPSQDAHQSEFIAECYMRRAYISGFTGSAGTAVVTKDKAAL 261
           L +LR L +  S  +DA ++PS+D HQSE+++    RR ++SGF+GSAG A++TK +A L
Sbjct: 5   LSSLRSLMASHSPPLDALVVPSEDYHQSEYVSARDKRREFVSGFSGSAGLALITKKEARL 64

Query: 262 WTDGRYFLQAEKQLSSSWILMRAGNHGVPTPGEWLADILAPGGVVGIDPFLFSADAAEDL 321
           WTDGRYFLQA +QLS  W LMR G    P    W++D L     +G+D +  S D A   
Sbjct: 65  WTDGRYFLQALQQLSDEWTLMRMGED--PLVEVWMSDNLPEEANIGVDSWCVSVDTANRW 124

Query: 322 KEIISRKNHKLVYLYDYNLVDEIWKESRPKPPKGPMRVHDLRYAGLDVASKLTSLRSELG 381
            +  ++KN KL+     +LVDE+WK SRP     P+ VH L +AG  V+ K   LR++L 
Sbjct: 125 GKSFAKKNQKLI-TTTTDLVDEVWK-SRPPSEMSPVVVHPLEFAGRSVSHKFEDLRAKLK 184

Query: 382 ESGSSAIIISMLDEIAWLLNLRGNDVPNSPVMYAYLIVEIDGAKLFVDNSKVTSEVMDHL 441
           + G+  ++I+ LDE+AWL N+RG DV   PV++A+ I+  D A L+VD  KV+ E   + 
Sbjct: 185 QEGARGLVIAALDEVAWLYNIRGTDVAYCPVVHAFAILTTDSAFLYVDKKKVSDEANSYF 244

Query: 442 KSAGIELRPYDSIISEIENLAEKGANLWLDTSSVNAAIANAYRTAC---DRYFIRLGNKR 501
              G+E+R Y  +IS++  LA         + +V    A          DR ++   +  
Sbjct: 245 NGLGVEVREYTDVISDVALLASDRLISSFASKTVQHEAAKDMEIDSDQPDRLWVDPASCC 304

Query: 502 KGKGKTFESSNSQVGPTGVYKVSPVSMAKAIKNHAELEGMRNSHLRDAAALVQFWFWLEE 561
                  ++    + P      SP+S++KA+KN  ELEG++N+H+RD AA+VQ+  WL+ 
Sbjct: 305 YALYSKLDAEKVLLQP------SPISLSKALKNPVELEGIKNAHVRDGAAVVQYLVWLDN 364

Query: 562 EI--LNGV------------------KLTEVEVADKLLEFRKRQDGFVDTSFDTISASGA 621
           ++  L G                   KLTEV V+DKL   R  ++ F   SF TIS+ G+
Sbjct: 365 QMQELYGASGYFLEAEASKKKPSETSKLTEVTVSDKLESLRASKEHFRGLSFPTISSVGS 424

Query: 622 NGAIIHYKPEPSDCSVVDPNKLFLLDSGAQYVDGTTDITRTVHFGEPTTHQKECFTRVLQ 681
           N A+IHY PEP  C+ +DP+K++L DSGAQY+DGTTDITRTVHFG+P+ H+KEC+T V +
Sbjct: 425 NAAVIHYSPEPEACAEMDPDKIYLCDSGAQYLDGTTDITRTVHFGKPSAHEKECYTAVFK 484

Query: 682 GHIALDQAVFPQNTPGFVLDAFARSSLWKIGLDYRHGTGHGVGAALNVHEGPQSISFR-F 741
           GH+AL  A FP+ T G+ LD  AR+ LWK GLDYRHGTGHGVG+ L VHEGP  +SFR  
Sbjct: 485 GHVALGNARFPKGTNGYTLDILARAPLWKYGLDYRHGTGHGVGSYLCVHEGPHQVSFRPS 544

Query: 742 GNMTGLQNGMIVSNEPGYYEDHSFGIRIENLLIVREVDTPNRFGGIGYLGFEKLTFVPMQ 801
                LQ  M V++EPGYYED +FGIR+EN+L+V + +T   FG  GYL FE +T+ P Q
Sbjct: 545 ARNVPLQATMTVTDEPGYYEDGNFGIRLENVLVVNDAETEFNFGDKGYLQFEHITWAPYQ 604

Query: 802 TKLVDFSLLSAAEVNWLNDYHSQVWEKVAVSESHFMDETQM 819
            KL+D   L+  E++WLN YHS+  + +A     FM++T+M
Sbjct: 605 VKLIDLDELTREEIDWLNTYHSKCKDILA----PFMNQTEM 631

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038877034.1 | 0.0e+00 | 90.74 | aminopeptidase P2 [Benincasa hispida]
XP_022150774.1 | 0.0e+00 | 91.88 | probable Xaa-Pro aminopeptidase P [Momordica charantia]
XP_022965604.1 | 0.0e+00 | 89.86 | probable Xaa-Pro aminopeptidase P [Cucurbita maxima]
KAG7021104.1 | 0.0e+00 | 90.29 | AMPP protein, partial [Cucurbita argyrosperma subsp. argyrosperma]
KAA0047987.1 | 0.0e+00 | 89.68 | putative Xaa-Pro aminopeptidase P isoform X1 [Cucumis melo var. makuwa]

Match Name | E-value | Identity | Description
Q8RY11 | 2.9e-285 | 71.08 | Aminopeptidase P2 OS=Arabidopsis thaliana OX=3702 GN=APP2 PE=2 SV=1
D1ZKF3 | 3.3e-164 | 50.82 | Probable Xaa-Pro aminopeptidase P OS=Sordaria macrospora (strain ATCC MYA-333 / ...
Q7RYL6 | 6.9e-162 | 50.00 | Probable Xaa-Pro aminopeptidase P OS=Neurospora crassa (strain ATCC 24698 / 74-O...
D5GAC6 | 3.4e-161 | 47.70 | Probable Xaa-Pro aminopeptidase P OS=Tuber melanosporum (strain Mel28) OX=656061...
B0DZL3 | 2.2e-160 | 48.04 | Probable Xaa-Pro aminopeptidase P OS=Laccaria bicolor (strain S238N-H82 / ATCC M...

Match Name | E-value | Identity | Description
A0A6J1DB28 | 0.0e+00 | 91.88 | probable Xaa-Pro aminopeptidase P OS=Momordica charantia OX=3673 GN=LOC111018840...
A0A6J1HRG3 | 0.0e+00 | 89.86 | probable Xaa-Pro aminopeptidase P OS=Cucurbita maxima OX=3661 GN=LOC111465452 PE...
A0A5A7U190 | 0.0e+00 | 89.68 | Putative Xaa-Pro aminopeptidase P isoform X1 OS=Cucumis melo var. makuwa OX=1194...
A0A6J1FH61 | 0.0e+00 | 90.00 | probable Xaa-Pro aminopeptidase P OS=Cucurbita moschata OX=3662 GN=LOC111443950 ...
A0A0A0LIP6 | 0.0e+00 | 89.39 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G022830 PE=3 SV=1

Match Name | E-value | Identity | Description
AT3G05350.1 | 2.0e-286 | 71.08 | Metallopeptidase M24 family protein
AT4G36760.1 | 1.5e-151 | 43.68 | aminopeptidase P1
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036005 | Creatinase/aminopeptidase-like | GENE3D | 3.90.230.10 | Creatinase/methionine aminopeptidase superfamily | coord: 528..818; e-value: 2.7E-99; score: 334.1
IPR036005 | Creatinase/aminopeptidase-like | SUPERFAMILY | 55920 | Creatinase/aminopeptidase | coord: 531..790
None | No IPR available | PFAM | PF16189 | Creatinase_N_2 | coord: 349..534; e-value: 4.4E-46; score: 156.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..23
None | No IPR available | PANTHER | PTHR43763 | XAA-PRO AMINOPEPTIDASE 1 | coord: 172..806
None | No IPR available | PANTHER | PTHR43763:SF6 | XAA-PRO AMINOPEPTIDASE 1 | coord: 172..806
IPR000587 | Creatinase, N-terminal | PFAM | PF01321 | Creatinase_N | coord: 203..328; e-value: 3.0E-15; score: 57.0
IPR032416 | Peptidase M24, C-terminal domain | PFAM | PF16188 | Peptidase_M24_C | coord: 765..806; e-value: 1.2E-16; score: 60.5
IPR029149 | Creatinase/Aminopeptidase P/Spt16, N-terminal | GENE3D | 3.40.350.10 | | coord: 196..345; e-value: 4.2E-56; score: 190.8
IPR029149 | Creatinase/Aminopeptidase P/Spt16, N-terminal | GENE3D | 3.40.350.10 | | coord: 348..490; e-value: 1.6E-43; score: 150.1
IPR029149 | Creatinase/Aminopeptidase P/Spt16, N-terminal | SUPERFAMILY | 53092 | Creatinase/prolidase N-terminal domain | coord: 216..331
IPR000994 | Peptidase M24 | PFAM | PF00557 | Peptidase_M24 | coord: 536..753; e-value: 3.7E-43; score: 147.7
IPR033740 | Aminopeptidase P | CDD | cd01085 | APP | coord: 537..761; e-value: 6.03844E-132; score: 392.31
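
The four PFAM hits listed above tile the protein from the N-terminal creatinase-like region to the C-terminal M24 domain. A minimal sketch of how their order and spacing could be checked is given below. The coordinates are transcribed from the InterPro annotations and are assumed here to be 1-based and inclusive; the helper name `order_and_check` is illustrative only.

```python
# PFAM hits transcribed from the InterPro annotations: (name, start, end).
# Coordinates are assumed to be 1-based, inclusive residue positions.
pfam_hits = [
    ("Peptidase_M24", 536, 753),
    ("Creatinase_N", 203, 328),
    ("Peptidase_M24_C", 765, 806),
    ("Creatinase_N_2", 349, 534),
]

def order_and_check(hits):
    """Sort hits by start position and report neighbouring hits that overlap."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    overlaps = [
        (left[0], right[0])
        for left, right in zip(ordered, ordered[1:])
        if right[1] <= left[2]  # next hit starts before the previous one ends
    ]
    return ordered, overlaps

ordered, overlaps = order_and_check(pfam_hits)
for name, start, end in ordered:
    print(f"{name}: {start}..{end} ({end - start + 1} residues)")
print("overlapping neighbours:", overlaps)
```

Only neighbouring hits are compared after sorting, which is sufficient for a short, non-nested list like this one; a nested or much longer hit set would need a full interval comparison.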

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr022199.1 | Sgr022199.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0070006 | metalloaminopeptidase activity
molecular_function | GO:0016787 | hydrolase activity