Sgr020497 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr020497
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionCarboxypeptidase
Locationtig00153533: 813132 .. 830994 (-)
RNA-Seq ExpressionSgr020497
SyntenySgr020497
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCGGCCCACCTGGTTTTTTCTCGAGGTCTTCGCCATTTTCTGCTATCTCCTCCATCTGGGCATCGGCGTTTCAACTTCTTCGGAAGACCCACTTGTCCAGCAGGAATTGGACAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGAACACTTTTCTGGTTATATCACAGTGAATGAGGATTCTGGGAGAGCGCTGTTCTACTGGTTTGTTGAGGCTGTAGAGGACTCTGGCTCTAAACCTCTTCTTTTATGGCTTACTGGAGGTCCTCTACTTGAGCTATTTCCCACGATATTTTAGTAGTTCTTCTTTTATTTACTTTGAGCTCTTTAGTCGTTAAATTTGAATCCATTGTCTTTTCTTCTGAAAAATTAATTTTAGAATGTGTACTGTGTTTGTGTGTGTCTATATATATATAATTTTGTTTATTTAAATATATGGCGCCACCACCCCCCCGCCCAAATAATAATAATAATAATAATTGAAGTGAAATTTGTGTGGGAGTAAATCTCATTTGGTATTGAGCTTTCAAGAATATTCTACTTTTGGTTCCTAAACTTTTAATGTTTATTTTAGTCATTGAATTGTTAGGTTTGTTCTATTTTTTATCTTTTAAAATATATATTTTAATTCATGAACTTAAAAAAAAAAAGAATCACTTTGGTTCTTATCATTATTTTTCTTTTAATGAATGGATATTATAACTTCGAGTATATGTATTAATGTTATGATGTGGGCATGTAGAGCCATGTCGATGCTAGTATTGGGTGAACGGATATTAAATGGCCAAATAATAGCAATAACGAAAATGATTATTTTTGAAAGTTCAGTGATTAAAATAGACATTTTGAAAATTCAGAAACCAAAAAAATTATTTTTTTTAAAGTTCAAAAATAAAAATAGATATTTTAAAAGTTTAGGGATTGACATGAAATAAACATAAAAGTTTAGAGACCAAAATAAGATTTAAATATTTTAGAATGTATATTGTGTTTTTCCCTTTTAATTTTGTTTATTTAAATATGTTGTTCCTTTTTTTTAAAATTGAAAGTTCTTTTTCTCTCTCCCTCTCTCTTTTTTTCCCCTCAAGAAGAAAAAGAAAATCTGTGGGGGTAATTCTAGATTGTAAAATGAGGCAATTACAAAAATAGTAGAATAGAAATAGTAAATTATATTTATTATAGTTTTTAAATTTACAAAAATAATAAAATAAAATAGAGAAACTAAGCCAAAACATCTCCTATGCCATTGAGTTAAGAAAAAAAAAAATGCTCTGCTCGTCTTTCATACTTGTGACTATTGTCTGTTGGTAGATGGTTTCTAAGATAATTGATATTGCATATATAATAAAATGAGTTTTTAAATCTTGAGTTGTTTCTCTAGGACCTGGATGTTCATCAATTGCCTATGGTGAGGCAGAAGAAGTAGGCCCATTTCACATTAATGCTGATGGGAAATCCCTTTACTTGAACCCTTATTCTTGGAATGAAGGTAAAATTATATTTATATATTTATATATATATATACCTCTTTTAGAAATTAACTGATTATTTCACAAATGAGTTTGTAGAATACAGATATTGTATTTAACATCTTATAACTTCTCCTACCTATCTACATTATCTTTTGCAGTTGCCAACATTATATTTATTGAGTCCCCTGTTGGAGTTGGTTTTTTCATATTCGAACACATCTTCAGACTTGCTGAATAATGGAGATAAAAGGACTGGTATTACTGTTCTATTTGCTTTGATTGGACCTATTATGTCATAGCAAAGCCATCTAATTTGGTTTTGTTTCACCTTTCTATTTTTAGCTGAGGACTCTCTGAGTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAGTTTAAAGGAAGGGATTTTTATATTATAGGAGAGAGCTATGGGGGTAAGGAGGAAATCATTGCTCAAATTTATGACATAAAATAACAGTAGTTTTTATCTTTTAAGGATTCATGTCAAGATGTTTGGTTTCTCTAAGGATGTCATCCATTCCTGGCATCTTCTTCCATTTCTAACTTTTTTTTTTAATATGCACTGTTTATATTGTGCTTTCCCACATTAGTTTTAGTTGTATATCTCTTTCTTTTTAGACTAGCCTTAACTTTATTACCATAGCAGTCGGTTAATGAAATGAATATGTAGGACATTATGCCCCTCAGCTAAGCCAAGCAATTCTAAAGAACAACCTACTAACCATGGAAAAATCAATTAATTTGAAGGGTTATATGGTATTCACATCTTCTCCTCACAACATTCAAAATTTTATAGTAATATTATGGTATATGCAAAGTAGCCATATTCACTTCTATACATTACTGATGTAGGTGGGCAATGCTGTTTTTGATGATCATTACGATCGCATGGGAGTTTTCAATTTTTTGTGGTCGATTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACATTGTGAAAATCAGTCTTTCGTACATCACTCGAAATCTTGTGACAAGATACTAGATGTTGTGGATAAAGAACTTGGAAACATTGACCCTTTTAGCATCTACACTCCTTCATGCGCTGATGGCCCTTCAAATCGACTTTTGAAGCGAATGCACGTGAGTTTAGAAATACCAATCAATGTCTTTAATTATTCAATTTTTTCCTTGTCTGCTATACAATAATAACATGAATGAAGGAATTAAGCCAACATATAATTTTCTTAAAAAAATGGATGGAGGCTTCTCATTTGCTGAAAATTTAAATCATACCCCAATGGTAGACTTGTTAATCTTCTGCTTCATTGAATTTACAATTCAAATGATACCCTGAAATAAGCTTAAGAAGATTCTGTTAAGTGAACTAAGGATCACTTAGTGGCAATTTCCTGAAATAAGCTCGAGATTGTGACTTGTTTACTTCTTTTTCTCAATATTTATTTAATCTTTTAAGAGCTTAAGGTAAGTTACATAAATACATTTGAAGTTATGACATCTTCAAGTACTCCTTCAAATCAACATTGAAAATTTCATTTGCAATGGAGAGAGCTTCTCTCACTATTTTCGATTATAGAAATTCTTACATCCATTAATTTACTGTGACTGTGCTTCTTAGATGGTTGGGCGTGATGGTGAAAGCTTTGATCCTTGCACCAAGAAACACTCAATCATATACTTCAATCTACCTAAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAATCTGCAGGTACAACATTTAGAGAACAAGTATAAAATTTGTGGTGACATCTGTTGCATATCTCTCCAGAAACTCTATAGATTAGGTGTGAAGTTCACCAAGAAAATATAACTATTGACCTGAAACAAGATTCTTATTCTATTTTTTTTTTTATAGAATATTATATTACAAATGCGGAGGTAGAGATTCGAACCTACAACCTCTTAGAGGAGGAAGTGATGTCTTAACCACTAAGCTATGCTCAAGTTGATAGATTCTTATTCTATGTAAGTATATATTAGGTTTCAAACCACCTATCCCTAATAGGGAAAAGTATTGATATACACACAAAATAGATTAAAAGATAGTCACCTAGATGATGCTTTTTCTTCTTATTTTCCTATCCCCGCCCTTTCTTGTCCTTGGTGAGGGAGATTTGGCATTTAGGTGGGGATATAGGTCCAGAGCTCCCGAAAGAAACTCTTTTTTGTGAGGATATTTTGATTGCCACTTAAATTACTTCCTTGTTTAATAATAATTTTGTTTTTAGTTTTCTGTTTTAAAAAATTATGCATGCTGTCTTACAAGCCTTTTTTGCTGTGTTTATTACATTTCTTAAAAACGCATTTGAATTCCCAAATTTTAAAAACAAAAAGAGGTTTTTAAAAACTACTATTTTTAGTTTTTTAAATTTAGCTTAGATTTTGAAAACTATTTTAAAAAGTAGATAAAAAAACAAAGGAACACACAGGTGGAAGTAGTATTTATAAGATTAATTTTTGAAACAGAAAACCAAAAAATAAAATGGTTATCAAATGGGATCTAACAATCACATTTAGTTAAGTATAATAATGGTAGGCAACAAGCAACTTTTTATTGTCTGCCCAACATTGATGTAAACAGGACAAAAAAAGGTAAACATTTAAATGATTCCATTGACATCTTGGTCATGTTAGTTTTCACCCCTTTGAATTTTAAGGGCATCATGTGAGAAAATTTACAGATTTGATAAAAGTTCATTTGCGCATAAAAAAAAATCTTTTGCTTGCAAATGAACTTGTGAAAATCTTATGTTTATGTCATATCATTACTTTTTGTGGAACTTCTAAATTTTTATTAATTTTCCATTAGACATACTTCTTCCCTTGATTACATTATTTTCCAAGTTATGTGGTTAACCATAACTGGAAGGATGCTCCTGGCTCAGTGTTGGACATTTATCAGGAGGTGATACATTCAGGACTTCGAATTTGGGTGTTTAGGTTCGTCTTAAGATTTAATTTTTTATATCATAATAAGAAGCAATTAGTCTATGTTTTCACATACTAGTACATCTGTTGTTTACTAATTAGTTGGTTAAATTACAAGTTTGGTTTCTTAACTTTGAGCATTATTTTGAATAAGTCCTTGAATTTAAAAATTGTCTTAAGTCCCTAAACTTTTAATGATATATCTAATAGGTCCCTAAACTTTAAGAAGTGTTATAAGTCTCTAAATTTTCAATTTTGTGTCTAATAGATCTAGGACTTTAAAAAGTGTCTAATAAGTCAGAGATCTATTATACACAAAATTGAATGTTCAAGGATCTATTAGATACAAAATTGAAGGAAAGGCAAATATTTTGTGGAATTGTGTAGTGAGATCCTTGCTGTGGCTGATTTGGAAGGAGAGAAATCAAAGGATCTTTGAAGGAAAAAGCACGCCTGTGAATTCTTTTTGTTTAAATGTACAACATACAACCTCTTGGTGGTGCTCAAACCACAAGAAATTCTTTTGTAACTATAGTCTCCTTATGATCATAATTGATTCGAAGGCTTTGCTTTTGTAGTTTCTCTTGGGCGAGGGCTCCCTCTACTCCTGGCCCCTAGGCTGTTATTTTTTTTCTCTTTTAATTGATCTTTGTTTCTTATCAAAAAAAAAAAAAAAGATACTTTTTTAAGTACTAGGACCTATTAGACATAACATTGAAAGTTTAGGGACCTATTAGACATTTGCTAAAGTCTAAGAACTTATTAAACACAACACTAAAAGTTCAAAGATCAAACTTATAATTTAATCTAATTAGTTTTATAGAGGAAGAAGCATTATGGAGACCCTCAATTTAAAAAGAAAAAAACAAAAGGCTGACTTTGTGTTAAAAACTTAATATAAATTAACATATTAACTTTGGTTTTATACTCAAATTTTTGGATTAGTAATAATTTAACATGATTTCAAAATTAATAATAATCTGTCATTTAATGGTGGGATACTCATAATAGCGCAGTTTAGATTTAATTTTGTTTTATTTTAGTTTTTAAAGGAAACAAATATTTGATTAATGGTATGAAATATTACAAAAGATGGGAGGAGAGTCAAGCCCCAAAGTCAAGGAGAGTTTTAGAGTATAGCTTATGCGATCTTATGAAAAACTGGAAGCCTTTTTGTAATCCAGTTGACCAGGATCTGTTCCCTCTTTTGTATATTTCATTATGAAATATTCGTTTCTCATAAAAAAAAAGGAGAGTGACAAAAAAGCCTTCTATCTAGCCGAAAGAATACTAAGATTATAATCAATAAAGAGAGGATATTTACACCAAGAGAGGGCCTGTAAAAAAACCGAATTTAAAAATGAACCAAAAAGATTTTGAAGTGTTATTGAAGGTTCAGTTGTAGGGATCTGTTTAGATTTTAGATGTTAACCTTGTTGGACTTCTTCATTTAGTTGTCTTTTCTTTTTCTATATTCTAGAGTATTTCATCCCTCCATGAATATGAACTTTTGCCATGGCATGTAAGGAACAGTTATAATAAATTTGAGCATTACTAGAAGCCACAACCCCTTAGTCTCTAGTTTTCTGTCTCTCTCATCTTCCTCTCTCTTGATTTATCCTCCACTGTCTTGCAACATTATGTTTATTTGTTTTCTAACACCTTAAGTTGTCTCTTAATCTGCTTAACTGATGGCTTATCCATTTTTGACTTTGCCCTTGTTCTATTATTGACTCTGGTTAAGTTTAAACCGGTTTTTACAAAATGGACATTTCATTCGGTTTGCACATATAACAATCTGTATGGCACGTTTCGATTTTCGATTTTGAGAAAATGAACCAAACTGATCTGTGCACACCCCTAATTATTTTGATAGCATGACCATGATGTCTTAAATTTCTTACTAAACTAGTGATTAAAGGCTCTCTCCGGAAAACTGGTTCTAGGCTTTTTTTTTTTTTGCTGCTCCAAATTCTCAATCATCCATCAACTTTTCATTTTACTGAATTTGATTATTAGTTTTTGTTTCTCAATGACAGTGGCGACGCAGATACTTCACTCCCAACCACATCTACTCGTTATAGTATCAACGCCCTTAAGCTTCCACTTGCCGGACCTTGGCGTGCTTGGTATGGTGATGGCCAGGTAAATCATTTTCAAGAACTTCATATTCAAATCGTTCATGGCTCGGATCACCGATTTTCGTTATTCATTTTCAAAAATTTACTTTCAATAGTGATATCTTAAAACTATTTTTAGTTTGAAAAGCTTTTAGAATTTATGTTCTTATATTCATCTTCATTCGGTTTTTATCTGTTGAGAATATATGATACCTAAGAAGGCATTTATCAGTGAAATAACTTCAAATATAATAAAAGAAAAGACTTGACATCAGAATTTGATTTACAATAACTTCAACTACAATAAGGTAAGCTCTCAAAAACACTTGTAAAGATTAAGAATTATGTTTTTTTAAGGGAGAACAACTTTAATTACATTCTAAAAAATTAAGCCATTCATAAAAACACACCAAAGTAAAATAAAGCAAAAAACTCAGTTACCCTTTTTCACTACATTTTTGTTTCGAAACAGAAACAGTTGTCACCTCCCAAGCAACCCTTAAAACTTGTTTTCAAACAGGTGCTTCTCTGATTTTGAAATCGAAATATAATATTCAATCTTGTCGATTCGAAAAAATTTCTGTTCATTATCCAAAGAGTTAATACTCGTAAGGAGTCCTATTGAAACCAAGAAATATAGGCCTGCCTGCCATCCACACCAGAATAAATGAAGTTTTTCGAAAAAACCTACTAGAATACGCATGCAAGATGAAATGGTTGTTGCTATCTGTTACAATAACGAATCATTGTTTTAACTGAATAACTAAATAAAATAGATAGACCTTTCTCTTCGTCTTAGGTCGACGGATCTTCCTTTGATTCAAGAGATTGAGGGGATGTAATCTTTTCTCAAAACTGAGAGGCCTGATAACAAACCACCCCAACCCCCCCCCCCCCCCCCCGCGCTATAACTCAATCAATTCTATGAGAATCTTTAGCTCTATGTAATCGTAGTTTAAATAGATTTAAATAGGTCCTTATATGATTCAGAGTTCTATACGTGTTTACTTTTAAGGCTCTTTAGAGCTCTCATATTAGATGTAAACTTGCAATAGGACTATTTACATGCTATTTACATGTAGTTATTAATTTCTCTATAGCCCGTTGACGAATTGAGACACAAATTGGTAGAAAAATCACGCGAATTATATGAGATTAGTTGTTGTGTACAGGTTGGAGGATGGATTCAAGAATATGAAGGACCGACCTTGTGTTTGTGAGAGCAGGCCATAAAGCCCCTCTGCACAAACCCAAACTGGCTCTCCAAATCATCAAAGCCTATTTATCAGGAAACTCGCTCCCATCCCACAGTGACACTAGAAAAACTTCAAATCTCCTTTTCTGCTCTTTTTTGGTTTCTTCAGTCAAAAGGAGCACAAAAAGTGTTCATTGTTGGAATGTAATTAGGGAGTAGAATTCTTAGCCAGGCCATAATTGCTTGTTCAACATCAACATTAGTTCTAGTTGAAATTAATAAAGATGGCATTGGATTGGAAAAAATGTAACTAAATTGGGATTCATCTCAACTTCTTATTGATTTGTGCGTTCGGTTTGGCCGTTTGATCGATGAAATTGAATTGTTAGGTTCTGTTTGATATCAATTTTCTCGATGGATTTTAATAGCTGAGGTAGTTTTGCACTGTTAAAAAAATGTTGTATTTGGTTAAAATTTGCTGAATATAATTGATATGTGGAGTTATACTTGGGTTGGTGCCAAATCTAGAGAGACAGAAATGTTAAGTATTATAAATATGGGATATTATTTTTCTTGAGTTCAAAGAAAGGAGTAAGAATTCGAACCCACAACATAACGAAGGCTATGGGTTTGGTTTATCCAGGAGAGTATATAATGTATTTTTAAAAAAATTGATTTAAGTTTAATTGTAGTCTTTCAACTTTTGTTTTTACTTTATTTTAGTCCCTAAACTTTCAAACATTTTATTTAGTTCTTAGATTTTACATAGAAAATTATTTTAGTCTATATCATTAATTTTTTTAATTACATAATAAAATTTGTCAAAGGTATATATTTTTTTATTTACTTGACAAGCAAATATATACGTAAACAAATATTTAATGCTACATATAGTAGACCAGATATCAAGACATTAAGATTTTTATTAAATGATTAATGAAAATTAACTAAATTAACTAAAAAGAGTTATTTTTTAATGTAAAGTTTATGACCAAAACAAGTTTAAGAACTAAAACATAATAAATACAAAGTTTAAGCATCAAAATAGAATTTAATGTGTCATTTAATTTTTTTTTTTTTTTTTTGAGGTCAACAAGATGTGGGAGGATTCGAAGCTTCAATCTTAGAAAGGTAAGTAATGCTTTGATCAATTGAGCTATGCTTATGTCAGTACTTTTGGTTAGTTATTTAATAACTTTTTTTTTCTTCTTGACCCACATATTCTCTCTCTAATTCTCTCTCTCTCAAACATAGAAATTCGAATCTACAACTTTTTTTAAAAAGTCGAGATACCTTCATCAGTAAAATATACTTAAGTTGACACTTATTTCTACTCATCTATGTCTTTTTTTTTGTTTTGAGTTCAACAATTATGTAGTGGAGATTAAACCATCTATCTTATCTCTTGAGTTATATCCGAAATGGCCTCATCTATGTTTTCTCGACCACAAACTCACCCATCACATGCAAACCACCGAAGCACTCTCCAAAAAGGGGAAAAAAAAAAAGAGAAAAAGAAAAAGCAAAGTACCTAAATCCAGATTTTTATAGATTTTTGAAAATTCTCCGTAGAATCCAAATCGAAATTTGTCTCCAATTTCAAGGCTCCTCTCTGTCGCATCCGTCCAGTGTCAACTACGCCATCGAATTAAACAAAGAAAAAAAATAAAATAATGATTAAAGGTCACATCAATTACACTAAATTATATAAAATGCAAATGAACTATAACATTGGTTTGAAAAATGCATGTATTTTAAAAATTTTCATTTAATTTATCTAAAAAAACGGTTCATTAGATTCTTTTGTTAAAATCAAAACAAATTTCTAACTTAACGGTAAAATGTAAGTTTGATATTTAAGATTTTGTAGAAAAAATAAAACAGAGTACGTGTGGTTATTATTTTTCTTTACTTTTTTCACTTTCAAAGTCAACAGAAAAAAAAAAAATATATTAGAGCCTCCTTCTCACGATAAATGAGAAAAAAAAAATAAAATTAAGCTTCCGATTTAAAAAATGGTGGAGCATAACAACTTAACATGCATATCATTGAAGAGAAGAGACGAGAAGATGATCGTGCTATAATGGATGAATGCTCAATATTGGTAACTTCATAGATGAATAAAGGGTTTTTTTAAAACAAATTAAAATTTTAACGGTTAAAATGAAATTTTTGAAACGATGAAGATATAATTGAAACCAAGACTACAATTTTGGGGTATATTGTATAATTAAGTCAATGATTAAATTTTACATTTGGTTCCTCAACTTTCAAGACGGTGTCTAATAAGTCTATGTATTTAAAAGTTTTGTAATAAGTTTTTAACCTTTTAATTTCGTGTTTAATAAATTATTTTTATCAAATCTGTCAGTTTAACACTAATGATGCATGCTAATTGTATTATTTAAAGATTATATGATATATTAATTTCGGACAAGTTTAAGCATTTACGACGTGAGTGTTAAACTAACAAAGTTAATGGTAAGAACCTATTAGATATAAATTAAAAATTAAGACCTATTTAAAATTTTATAAGATATAAAAATTTATTAGATACAATTTTAGAATTTTTAGGTAACTTGTCATAAACTGTGTGAAGTTATAAAAACTTATTAGATTTATTGGAAGAAGTTCACACCCCAACTGGTGATTGAACTTTAAGCTTAGCCAATCAAATATGAGAGTATTTTGCACTTGACAAAGCTGTTCATTGGTGAATCAGAAAGCAGAAACGTAGTTGTCAGAGAAACCCCAACAAAAAGGGGATGGGGAGACAATCATTGCCTTGGCCTTTTATCAGTCTTCCCCTTTAAAAATTCCATCCATTTCCGCATTTCAAGGCACTCCTTCATCCGCCATCGCCAAGAAGAAGTGAGCTTGCTCAGAATGGCTCGACCCACCTGTTTTTTTCTCCAGGTCTTCGCCATTTTTGGCTATCTCCATCTGGGTATCGGCGTTCCAACTTCTTCGGACTACCCACTTGTCCAGCGGGAATTGGATAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGCACACTATTCTGGTTATATCACAGTCAATGAGGATTCTGGGAGAGTGCTGTTCTACTGGTTTATTGAGGCTGCAGAGGATTCTGGCTCTAAACCCCTTGTTCTATGGCTTAACGGAGGTCCTCTACTTGGAACTATCTCCTTTGGTTTATTTCTTCTTCTTCCTCTCTCTCATGTTTTTTATTTACTTTGAGCTCTTTAGCTGTTGTGTTGAATCCCTTGTCTTTTCTTCAGAAAAATTAATTTAAGAATCTTGTGTATAGTAGTTCATTTTTAAATTTAAAAACAATATTAAAATAGAAAAGAGAAAACAAGCCACATCTCCTTGTCACTGAGTTAAAAATAAAGAAGAAGAAGAAGAAGAAGCCCTGTTCTTCCTTTTATACTTGTGACTGTTGGTAGATGGTTTCTGAGATAATTGGTATTGCTTGTACAAAACACTGTGGCTCTAGTTTTTAAATCTTGAGTTGTTTCTCTAGGACCTGGATGTTCATCAATTGCTTATGGTGAGGCAGAAGAAATAGGCCCATTTCACATTAATGCTGATGGGAAGACCCTTTACTTGAACCCTTATTCTTGGAATGAAGGTAAGGTTTACATATATATATATATATATATAATCTCTGAAAATTGGATAGTTTCTCCTTTTCTTCTTCATCCTTTTCCCCCTCTTTTAGATTTTAGCTGATTATTTCACAGATGAGTAGGTGGAATACAGATATTGTATTTAACATATTCAGTTATGAACAAACAAATTGTTCTAAGCCCAGTTTTTATGTTCAGGATGATGGAAAAACAATGAGTTCTTGATCTATAGGGTGTGATGCCAGTTGTCCATCCCCGCAACATTTAGATGGTTAACAAAAATTAGATGTATTTGAACAAATATTCTATCACAAGTAGCTGACTGCTCTATTGGGTTCACTTTCTATGATTGTGTTACTTCAGTATATTTTTAGGCCTCCTGTTAGAAAGGGAAAAAAGGATAATGTACCCTCTGATATTTAGTAAGCTCAGTATGTGTTTTGTATCTATAGTTAGGAATGTCATGATGATGACATAGGAAACATTGTGTGGTGTATTTGGTAGGAGAGTAGCTCGACCTCTCGAACTGGCTAGGCTTGTGCTTGTATGGTGCTCTCTTGCACCCTCTGGTCACTGATAGTAAATAGTAGAATTGATTCTTAATATATACCTCATGTTGTTCTACTTTGAATAGTTAAATGAACTTCTTGTGATTTCTCCTACCTATCTACATTACCTTTTGCAGTTGCCAATGTTATATTTCTCGAGTCCCCTGTTGGAGTTGGTTTTTCATATACGAATACATCTTCAGACTTGCTAAATAATGGAGATAAAAGAACTGGTAATACTTACACAACATGTTCTATCTGCTTTGATTGGTCCTATTATGTCACGGAAAAGCTATCTAATTTGATTTTGTTTCAACTTCCCTTTTTGCAGCTGAGGACTCTCTGGTTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAATTCAAAGGAAGGGATTTCTATATTACAGGAGAGAGCTATGCAGGTAAGCAAGAAATCATTCCTCAATTTTATGACATAAAATAGCAGTAGTTTGTATCTTTAAGTGTTCATGTCATGCACTGTTTATATTGTGCTCTCCCACATCAGTTTTAGTTGTATATCTCTTTCTTTTTAGATTTGCCTTAACTTCATTATCGTAGGAGTTAGTTAATGACATGAAATGCATATGTAGGACATTATGTTCCTCAGCTAAGCCAAGCAATTGTAAGGAACAACCTGCTAACCAAGGAAAAATCAATTAATCTGAAGGGTTATATGGTATTTACATCTTCTCCTCATAGCATAAAAATTTTATAGTAATATTGTGTTAAATGCAAAGTAGCCATATTCACTTTTATACATTACTCATGTAGGTGGGGAATGCTCTTTTTGATGACCACCATGATCATGTGGGAGTTTTCGAATTCATGTGGTCTGCTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACAATGTGAAAATCAGTCTTTTGTACATCCCTCAGAATCTTGTGACAAGATACTAGAGGTTGTTGATAAAGAACTCGGAAACATTGACCCTTATAGCATCTTCACTCCTTCGTGCTCTGATGGCCCTTCAAATCGGCTCCTGAAGCGAATGCACGTGAGTTTAGAAATATTGATCAATTCTAATGATTCAATTTTTTTTCCTGTCTGCAATACAATAATAACACGAATGAAGGATTGAAGCCAACATATGATTTTCTTAAAAACTAGATGTTATATTCAGAGACTTACCCTTCCTGTTGGCACATGATTTTTTATCACACACTGTTGCAATCATGAGGGCTTCTCATTTGCTGAAAATTTAAATCATTCCCAATGGGAGACTTGTTAATCTTCTGCTTCATTGAATTTACACTTCTAATCACACCCTCAAATAAGCTCAAGATTTGGTTAAGTGAACCAAATGTTCACTTAGTTGGAATTTCCTGAAATAAGCTAAAAATTTATTTATTTAGGCTTTTAATAGTGTAAGCTATTATAAATACATTTGATGTTATGACATATTCATGTAGCATCCCTTCACCAAATCAACATTGAAAAATTCATTTGCAGTGGAAAGAGTTTCTCTTCTGTCAATTTTAGAAATTTTTACATCCATTAATTTACTGTGACTGTGTTTCTTAGATGGTTGGGCGTGTTGGTGAACGCTATGATCCTTGCACTGAGAAGCACTCGGTCGCATACTTCAATCTTCCTGAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAACTTGCAGGTACAACATTTAAAGAATTAAGTATAAAATTTGTGGTGGCATTTAGATATCTTTTCTCCTTTATTTCAATTTCCTAATTTCCTTCTTGGTGAGGGAGATTTGGCATTTAGGTGGGGATATAGGTCTAGAGCTCCCTGCAAGAAGCTCTATTGTGAGGATATTTATATTGCCACTTAAAGACAATTACAGTTTGTAAAGTAGAACAGTGGAAGGCAACAAGCATCTCTGTTCTTGTTAGCTCGACATTGATTTAAACAGGATATAGAGTAAATAATTAATACCCATTTGATAACTATTTTGATTTTGGTTTTTAGTTTTTGAAGATTAAGCTTATAAACACACTTCCACCTATATGTTTCTTTGTTTGGTTATCTACTTTTTAAAAACGTTTCAAAATATAAACAAATTTTAAAAACTAAAAAAAATAGTTTTTAAAAATCCATTTTTGTTTTTGAAATTTGGTTAAAAATTCAAATGTGTTTTTACTAAAAAATTTTTTTGAAGAAGTAGGCATGATTTTTAAAAATAGAAATCAGACAGGGCCTAAATGATTTTATTCACATCTCGGTCATGTTAGTTTTCACCTCGAATTTTAAGGGGATTATGTGGGAAGTTTTGTGGGTTTGATAAAAGTTCCACCTCATTGCACATAATTATAGCTGTTTCTTCTATAAAAAACAAAGAATTTCTTCTGCTTGCAAATGAACTTGCAAAAATCTTGTGTCTATGTCATATCATTACTTTTTTATGGAACTTCAAATTTTTTAATAATTTTCTATTGGACATTTTTTTTTTCTTTTTGAGTAAAGACATACTTCTGCCCTTCATAACATTATTCTCAAGCCAGGATGTTCTTATATTAGTGTTTGATGGAACTCATAATGTCCATAGTGACCTGGTTAACGCTCACTGGAAGGATTCTGTTGGCTCAGTACTGGACATATATCAGGAGCTGATACATTCAGGACTTCGTATTTGGGTGTTCAGGTTTGTCTTAAAATCCAAAGTCTTATCTCATAAAACGAAACAAAAGTCTGACTTCATGTTTAAAACTTGACATTGGTTAGGTTTAAACCAATTTGTACAAAAAGGACATCGCATTCGGTTTGCACATATGGCCAAGGATTAAAATATCGAGGTCTCAATTTTACGGATATATCAATGGATGCATCGACAAAATATCGACATGGACGGATATTTCTCAAAAATTATGGAAATGAAGAAACTAATATAAATTATTTCAATTGATAAATAAGCATTTTATACGTCTAAAACAAGTTATAAATCTTATTATTAGTATCTATATTCAAATCGGTGATTTTTTTTGTTGATATTTTCCATGTTTCATTGATATTTATACGATATATTAGAAATATTGATTGACCCCTAATATGGATGTCGAACCCTCCGATTTACGGATATATCAACATGTCGACAGATATTTTAATCCTTGCATATAGTAAACCATATAGCACGTTTTGGTTTTTGGTTCGGAGAAATGAATTGAAATACGTGCATACCCCTGATTGTTTTGATACATGACCACGATGTCTTGTATTTCTTACTAAACTAGTGATTAAGGATTCTCTGAAAAACTGGTTTTGGACTTTTTTTTTTTGCTGCTCCAGTTTCTCAATCATCCATCAACTTTTCATTTTACTGAATTTGATTATTGGTTTCTGTTTCTCAATGACAGTGGTGACACAGATTCTGTACTCCCAATCACATCTACCCGTTACAGTATAGACGCCCTTAAGCTTCCATCTATTGGGTCTTGGCGTGCTTGGTATGATGATGGTCAGGTAAATTATTACCAAGAACTTCATATTCATATAGTTCACGGCTTGCATTGTTGATTTTCGTTATTCGTTTTCAAAAAACTACTTTCGGTAGTCATATCTTAAAACTTTTTTTAGTTTGAAAATTTGTAGATTTTATGTTCTTATTTTTAATCTGTTAGAGGATATATGATGCCTAAGAAGGCATTTATCTGTGAAATAACTTCAAATATAATAAAGAAAACACTTGATGACATCAGAATATGATTTACAATAACTTCAACTACAATAAGAAAAACACTAAATAACATCTGAATATCATTCATGTTTTTAACTGTTCTTTTCTTTAGTAAAGTTATTCTATTTCTAGGAGGAGAAAGCAGAAGTTTGGGCTTGTTTAGGAGATTGTTTCCAAAATAGCTTTGTGCATTTGTTTTCAAACTTTTAAGAATATGATGGGGCGACTTGTAAAGATTAAGAATTATATTTTTTTAATGGATAACAACTTCAATTATTCTGAAAAAATATAAGTCATTTGCAAAAACACACCTAAGTAAAATAAGCGAGAGAAAACATTCCTTCTTTTTTTCTTTAACTTATCAATTCTACACGAATCTTTAGCTATGCATAATCATAGTTTAAAGAGGTTAGAAAAAGGACCCTATTTGATTCAGAGTTAAATACGTTTTAAAAGGCTCACATGCAGTTGTTATTTACTCTATAGCCCGCTGACCACTCGAGACGACGAGTTGGCAGGGAAAAAGACTGCAAATTAAATGAGATTAGTTGTTGTGTACAGGTTGGAGGATGGATTCAAGAATATGAAGGGCTGACCCTTGTATCTGTGAGAGGAGCAGGCCATGAAGTCCCTCTGCACAAACCCAAACTTGCTCTTCAACTCATCAAAGCCTTTCTAGCAGGAAACTCACTTCCTACCCTTCAACTCCACAGTGACACTTAAAAAAAAATTCAAATCTCTTTTTCTGCCCTTTTTTTGGTGTCTTCAGACAAAAAGGAGCACAAAAAGTGTTAATTGTTGGAATGTAATGAGTGGAGTTGAATTCTTAGCCAGGCCATTGCTTGTTCAACATCAACATTGATTCAGGTTATAGTTAATAAAGATGGCATTGGATTGGAAAATGTAACTAGATTTGGATGATTTCTTCTAATTTGCTTTCATCTCAACTTTTTTTTTTCTTGCTTTTTGGGCTAAGTTGTTTGCTGTTTGATGGATGAAATTGAATTCTTAGGTCCTATTTAATATCAATTTTGTTGAGATTTTTAATCGTTTTGTTACATAGAATTCAATCAAATTGTCCTTTAGATTAAGGTGGTGGGGAAAGGGGTTGAAATAAGAAAAAGTAAAAAGATAATTTTGTTGTGTGAAAATAAAAACAAACTTATTTAAGGATATTTCAAATATTACCCTTAAAAACAATTTAAAAAAATAATACATCCAACTATGAAAAATGGTTGAATTGAAATTTGATTGACTTCGAAATATGGATAAAAGTGATTGATTAGGGATGGCAACGAAAATAGGGCAGGATTGCGGATGCCTTCTCTGTCCCGGTGGAAAATTTTCAATATAGACAATATAGTCAAGCTGATGGATTTCTAGTATATCAACAAAAGGGAACCATAATTTAAAAAATGGTAGTTAGGGCATCCAAGTATTTTCATAAATACTTTATACCAGCGATGCTAAATTAAATCCAATGAGGACAATATAATCTTGTTGGGTAAAAAAAGATCTCTTGCCTTAATTATTGAAAGAATATTGTATGCAGTTTACTAGGGAAACTATACACAAACACAAAATGAAGTCTTCTAATAACTTACTTTTCCTTTCTATTTGATACAAAGAAAAGTTTATACAGAAGTTATGGGGGAAAGAATTACTCATTTGCAGCATAAAAATGAACCCTCAATCTGGCATCCACATGAAAAATGCTGCCAACTAGAAGGCATATGCTCAAGTCTCGGTTTTGGCTCAACCAGTATCTCAGCTCATAGATAGCTTGAATATTCGTCGAAATCAGCTTGAAGGACGACCCGAGATAACCTTCATTTCCCTGACAACAGGATATTTAGAGTGTGCTGACAGGATTACAGAGTGGGAAGAGGCAAGCAACGAATCGGAATGCTTTCATGATTCCTGCCATAATAGAAGCGGCTCCGGGCTCTGCTCCTGTTCCTCGTATCAACTTTACATCTATTTTCCAAAATCGCTTCCTATACAAGTATTCACTAAACCTATCTGCAGATTGTCATGGCTGCTGTTGGGCCAAGAAAAATCAGTTGAAGAAGTACGGAGATTTAAGAACCTCTTGCAAGTCAAAATCGCCTCTCTCGGGGAGGGAAGGGAATGTCAATCCTGGGTATGTTGTTTCGAAGCCTTCAAGCAACTGGTGA

mRNA sequence

ATGGCTCGGCCCACCTGGTTTTTTCTCGAGGTCTTCGCCATTTTCTGCTATCTCCTCCATCTGGGCATCGGCGTTTCAACTTCTTCGGAAGACCCACTTGTCCAGCAGGAATTGGACAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGAACACTTTTCTGGTTATATCACAGTGAATGAGGATTCTGGGAGAGCGCTGTTCTACTGGTTTGTTGAGGCTGTAGAGGACTCTGGCTCTAAACCTCTTCTTTTATGGCTTACTGGAGGACCTGGATGTTCATCAATTGCCTATGGTGAGGCAGAAGAAGTAGGCCCATTTCACATTAATGCTGATGGGAAATCCCTTTACTTGAACCCTTATTCTTGGAATGAAGCTGAGGACTCTCTGAGTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAGTTTAAAGGAAGGGATTTTTATATTATAGGAGAGAGCTATGGGGGACATTATGCCCCTCAGCTAAGCCAAGCAATTCTAAAGAACAACCTACTAACCATGGAAAAATCAATTAATTTGAAGGGTTATATGGTGGGCAATGCTGTTTTTGATGATCATTACGATCGCATGGGAGTTTTCAATTTTTTGTGGTCGATTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACATTGTGAAAATCAGTCTTTCGTACATCACTCGAAATCTTGTGACAAGATACTAGATGTTGTGGATAAAGAACTTGGAAACATTGACCCTTTTAGCATCTACACTCCTTCATGCGCTGATGGCCCTTCAAATCGACTTTTGAAGCGAATGCACATGGTTGGGCGTGATGGTGAAAGCTTTGATCCTTGCACCAAGAAACACTCAATCATATACTTCAATCTACCTAAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAATCTGCAGTGGCGACGCAGATACTTCACTCCCAACCACATCTACTCGTTATAGTATCAACGCCCTTAAGCTTCCACTTGCCGGACCTTGGCGTGCTTGGTATGGTGATGGCCAGCTGTTCATTGGTGAATCAGAAAGCAGAAACGTAGTTGTCAGAGAAACCCCAACAAAAAGGGGATGGGGAGACAATCATTGCCTTGGCCTTTTATCAGTCTTCCCCTTTAAAAATTCCATCCATTTCCGCATTTCAAGGCACTCCTTCATCCGCCATCGCCAAGAAGAAGTGAGCTTGCTCAGAATGGCTCGACCCACCTGTTTTTTTCTCCAGGTCTTCGCCATTTTTGGCTATCTCCATCTGGGTATCGGCGTTCCAACTTCTTCGGACTACCCACTTGTCCAGCGGGAATTGGATAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGCACACTATTCTGGTTATATCACAGTCAATGAGGATTCTGGGAGAGTGCTGTTCTACTGGTTTATTGAGGCTGCAGAGGATTCTGGCTCTAAACCCCTTGTTCTATGGCTTAACGGAGGACCTGGATGTTCATCAATTGCTTATGGTGAGGCAGAAGAAATAGGCCCATTTCACATTAATGCTGATGGGAAGACCCTTTACTTGAACCCTTATTCTTGGAATGAAGTTGCCAATGTTATATTTCTCGAGTCCCCTGTTGGAGTTGGTTTTTCATATACGAATACATCTTCAGACTTGCTAAATAATGGAGATAAAAGAACTGCTGAGGACTCTCTGGTTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAATTCAAAGGAAGGGATTTCTATATTACAGGAGAGAGCTATGCAGGACATTATGTTCCTCAGCTAAGCCAAGCAATTGTAAGGAACAACCTGCTAACCAAGGAAAAATCAATTAATCTGAAGGGTTATATGGTGGGGAATGCTCTTTTTGATGACCACCATGATCATGTGGGAGTTTTCGAATTCATGTGGTCTGCTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACAATGTGAAAATCAGTCTTTTGTACATCCCTCAGAATCTTGTGACAAGATACTAGAGGTTGTTGATAAAGAACTCGGAAACATTGACCCTTATAGCATCTTCACTCCTTCGTGCTCTGATGGCCCTTCAAATCGGCTCCTGAAGCGAATGCACATGGTTGGGCGTGTTGGTGAACGCTATGATCCTTGCACTGAGAAGCACTCGGTCGCATACTTCAATCTTCCTGAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAACTTGCAGTGACCTGGTTAACGCTCACTGGAAGGATTCTGTTGGCTCAGTACTGGACATATATCAGGAGCTGATACATTCAGGACTTCGTATTTGGGTGTTCAGTGGTGACACAGATTCTGTACTCCCAATCACATCTACCCGTTACAGTATAGACGCCCTTAAGCTTCCATCTATTGGGTCTTGGCGTGCTTGGTATGATGATGGTCAGGTTGGAGGATGGATTCAAGAATATGAAGGGCTGACCCTTGTATCTGTGAGAGGAGCAGGCCATGAAGTCCCTCTGCACAAACCCAAACTTGCTCTTCAACTCATCAAAGCCTTTCTAGCAGGAAACTCACTTCCTACCCTTCAACTCCACAGATTACAGAGTGGGAAGAGGCAAGCAACGAATCGGAATGCTTTCATGATTCCTGCCATAATAGAAGCGGCTCCGGGCTCTGCTCCTGTTCCTCGTATCAACTTTACATCTATTTTCCAAAATCGCTTCCTATACAAGTATTCACTAAACCTATCTGCAGATTGTCATGGCTGCTGTTGGGCCAAGAAAAATCAGTTGAAGAAGTACGGAGATTTAAGAACCTCTTGCAAGTCAAAATCGCCTCTCTCGGGGAGGGAAGGGAATGTCAATCCTGGGTATGTTGTTTCGAAGCCTTCAAGCAACTGGTGA

Coding sequence (CDS)

ATGGCTCGGCCCACCTGGTTTTTTCTCGAGGTCTTCGCCATTTTCTGCTATCTCCTCCATCTGGGCATCGGCGTTTCAACTTCTTCGGAAGACCCACTTGTCCAGCAGGAATTGGACAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGAACACTTTTCTGGTTATATCACAGTGAATGAGGATTCTGGGAGAGCGCTGTTCTACTGGTTTGTTGAGGCTGTAGAGGACTCTGGCTCTAAACCTCTTCTTTTATGGCTTACTGGAGGACCTGGATGTTCATCAATTGCCTATGGTGAGGCAGAAGAAGTAGGCCCATTTCACATTAATGCTGATGGGAAATCCCTTTACTTGAACCCTTATTCTTGGAATGAAGCTGAGGACTCTCTGAGTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAGTTTAAAGGAAGGGATTTTTATATTATAGGAGAGAGCTATGGGGGACATTATGCCCCTCAGCTAAGCCAAGCAATTCTAAAGAACAACCTACTAACCATGGAAAAATCAATTAATTTGAAGGGTTATATGGTGGGCAATGCTGTTTTTGATGATCATTACGATCGCATGGGAGTTTTCAATTTTTTGTGGTCGATTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACATTGTGAAAATCAGTCTTTCGTACATCACTCGAAATCTTGTGACAAGATACTAGATGTTGTGGATAAAGAACTTGGAAACATTGACCCTTTTAGCATCTACACTCCTTCATGCGCTGATGGCCCTTCAAATCGACTTTTGAAGCGAATGCACATGGTTGGGCGTGATGGTGAAAGCTTTGATCCTTGCACCAAGAAACACTCAATCATATACTTCAATCTACCTAAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAATCTGCAGTGGCGACGCAGATACTTCACTCCCAACCACATCTACTCGTTATAGTATCAACGCCCTTAAGCTTCCACTTGCCGGACCTTGGCGTGCTTGGTATGGTGATGGCCAGCTGTTCATTGGTGAATCAGAAAGCAGAAACGTAGTTGTCAGAGAAACCCCAACAAAAAGGGGATGGGGAGACAATCATTGCCTTGGCCTTTTATCAGTCTTCCCCTTTAAAAATTCCATCCATTTCCGCATTTCAAGGCACTCCTTCATCCGCCATCGCCAAGAAGAAGTGAGCTTGCTCAGAATGGCTCGACCCACCTGTTTTTTTCTCCAGGTCTTCGCCATTTTTGGCTATCTCCATCTGGGTATCGGCGTTCCAACTTCTTCGGACTACCCACTTGTCCAGCGGGAATTGGATAGGATCGTTGAGCTTCCGGGCCAGAACTTCGAGGTGAACTTCGCACACTATTCTGGTTATATCACAGTCAATGAGGATTCTGGGAGAGTGCTGTTCTACTGGTTTATTGAGGCTGCAGAGGATTCTGGCTCTAAACCCCTTGTTCTATGGCTTAACGGAGGACCTGGATGTTCATCAATTGCTTATGGTGAGGCAGAAGAAATAGGCCCATTTCACATTAATGCTGATGGGAAGACCCTTTACTTGAACCCTTATTCTTGGAATGAAGTTGCCAATGTTATATTTCTCGAGTCCCCTGTTGGAGTTGGTTTTTCATATACGAATACATCTTCAGACTTGCTAAATAATGGAGATAAAAGAACTGCTGAGGACTCTCTGGTTTTCTTATTGAAGTGGTTCGAACGCTTTCCCCAATTCAAAGGAAGGGATTTCTATATTACAGGAGAGAGCTATGCAGGACATTATGTTCCTCAGCTAAGCCAAGCAATTGTAAGGAACAACCTGCTAACCAAGGAAAAATCAATTAATCTGAAGGGTTATATGGTGGGGAATGCTCTTTTTGATGACCACCATGATCATGTGGGAGTTTTCGAATTCATGTGGTCTGCTGGTTTGATTTCTGATCAGACATACAAGCAACTTAATCTACAATGTGAAAATCAGTCTTTTGTACATCCCTCAGAATCTTGTGACAAGATACTAGAGGTTGTTGATAAAGAACTCGGAAACATTGACCCTTATAGCATCTTCACTCCTTCGTGCTCTGATGGCCCTTCAAATCGGCTCCTGAAGCGAATGCACATGGTTGGGCGTGTTGGTGAACGCTATGATCCTTGCACTGAGAAGCACTCGGTCGCATACTTCAATCTTCCTGAAGTTCAAAAGGCGCTTCATGTTGATCCTAAGCATGCACCATCTAAATGGGAAACTTGCAGTGACCTGGTTAACGCTCACTGGAAGGATTCTGTTGGCTCAGTACTGGACATATATCAGGAGCTGATACATTCAGGACTTCGTATTTGGGTGTTCAGTGGTGACACAGATTCTGTACTCCCAATCACATCTACCCGTTACAGTATAGACGCCCTTAAGCTTCCATCTATTGGGTCTTGGCGTGCTTGGTATGATGATGGTCAGGTTGGAGGATGGATTCAAGAATATGAAGGGCTGACCCTTGTATCTGTGAGAGGAGCAGGCCATGAAGTCCCTCTGCACAAACCCAAACTTGCTCTTCAACTCATCAAAGCCTTTCTAGCAGGAAACTCACTTCCTACCCTTCAACTCCACAGATTACAGAGTGGGAAGAGGCAAGCAACGAATCGGAATGCTTTCATGATTCCTGCCATAATAGAAGCGGCTCCGGGCTCTGCTCCTGTTCCTCGTATCAACTTTACATCTATTTTCCAAAATCGCTTCCTATACAAGTATTCACTAAACCTATCTGCAGATTGTCATGGCTGCTGTTGGGCCAAGAAAAATCAGTTGAAGAAGTACGGAGATTTAAGAACCTCTTGCAAGTCAAAATCGCCTCTCTCGGGGAGGGAAGGGAATGTCAATCCTGGGTATGTTGTTTCGAAGCCTTCAAGCAACTGGTGA

Protein sequence

MARPTWFFLEVFAIFCYLLHLGIGVSTSSEDPLVQQELDRIVELPGQNFEVNFEHFSGYITVNEDSGRALFYWFVEAVEDSGSKPLLLWLTGGPGCSSIAYGEAEEVGPFHINADGKSLYLNPYSWNEAEDSLSFLLKWFERFPQFKGRDFYIIGESYGGHYAPQLSQAILKNNLLTMEKSINLKGYMVGNAVFDDHYDRMGVFNFLWSIGLISDQTYKQLNLHCENQSFVHHSKSCDKILDVVDKELGNIDPFSIYTPSCADGPSNRLLKRMHMVGRDGESFDPCTKKHSIIYFNLPKVQKALHVDPKHAPSKWEICSGDADTSLPTTSTRYSINALKLPLAGPWRAWYGDGQLFIGESESRNVVVRETPTKRGWGDNHCLGLLSVFPFKNSIHFRISRHSFIRHRQEEVSLLRMARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRLLKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLHRLQSGKRQATNRNAFMIPAIIEAAPGSAPVPRINFTSIFQNRFLYKYSLNLSADCHGCCWAKKNQLKKYGDLRTSCKSKSPLSGREGNVNPGYVVSKPSSNW
Homology
BLAST of Sgr020497 vs. NCBI nr
Match: PWA58649.1 (serine carboxypeptidase-like 23 [Artemisia annua])

HSP 1 Score: 960.7 bits (2482), Expect = 9.8e-276
Identity = 488/1049 (46.52%), Postives = 623/1049 (59.39%), Query Frame = 0

Query: 25   VSTSSEDPLVQQELDRIVELPGQNFEVNFEHFSGYITVNEDSGRALFYWFVEAVEDSGSK 84
            +S    D +  Q+ D +  LPGQNF V+F  ++GY+TVNE++GRALFYW  EA +D  SK
Sbjct: 14   MSVCDADSVSDQKKDLVTNLPGQNFNVDFAQYAGYVTVNEENGRALFYWLTEATQDPASK 73

Query: 85   PLLLWLTGGPGCSSIAYGEAEEVGPFHINADGKSLYLNPYSWN----------------- 144
            PL+LWL GGPGCSSI YG +EEVGPFH++ DGKS+Y NPYSWN                 
Sbjct: 74   PLVLWLNGGPGCSSIGYGMSEEVGPFHVDKDGKSIYSNPYSWNTVANLLFIDSPVGVGYS 133

Query: 145  ---------------EAEDSLSFLLKWFERFPQFKGRDFYIIGESYGGHYAPQLSQAILK 204
                            AED L FLL W ERFPQ+KGRDFY+ GESY GHY PQLSQ I++
Sbjct: 134  YSNTSADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVR 193

Query: 205  NNLLTMEKSINLKGYMVGNAVFDDHYDRMGVFNFLWSIGLISDQTYKQLNLHCENQSFVH 264
             N       INL GY+VGNA+ D+H+D +G+F F W++GLISDQTYK+LN  C+   ++ 
Sbjct: 194  YNKENTGSPINLLGYLVGNALTDEHHDHLGLFQFWWTVGLISDQTYKKLNDVCDKAFYMR 253

Query: 265  HSKSCDKILDVVDKELGNIDPFSIYTPSC-ADGPSNRLLKRMHMVGRDGESFDPCTKKHS 324
             S+ C +I + + +E+GNID +SI+TPSC A+  +N+LL+R H  G  G S+DPCT++HS
Sbjct: 254  PSQECYQIRENLIQEVGNIDSYSIFTPSCTANDVNNKLLRRWHKFGYIGRSYDPCTEQHS 313

Query: 325  IIYFNLPKVQKALHVDPKHAPSKWEIC--------------------------------S 384
             IYFNLP+VQKALHV   +    WE+C                                S
Sbjct: 314  TIYFNLPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVYHELITAGLRIWMFS 373

Query: 385  GDADTSLPTTSTRYSINALKLPLAGPWRAWYGDGQL-----------FIGESESRNVVVR 444
            GD D  +P TSTRY+INAL L    PWRAWY DGQ+           F+    + + V  
Sbjct: 374  GDTDGVIPVTSTRYTINALNLTTISPWRAWYEDGQVGGWTQMYKGLTFVAVRGAGHKVPL 433

Query: 445  ETPTKRGWGDNHCLGLLSVFPFKNSI--HFRISRHSFIRHRQEEVSLLRMARPTCFF--L 504
              P          L LL  F     +    ++   S                 +CF+  +
Sbjct: 434  HKP-------KLALTLLKSFMAGTPMPSFEQVIEPSRCPKSNRRFQEFLGGSVSCFYVGV 493

Query: 505  QVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVL 564
            ++  +FG       V    D PL +++ D++ +LPGQNF V+FA Y+GY+TVNE++GR L
Sbjct: 494  RIGMMFGVCDDECNV---DDDPLTEQKQDQVFDLPGQNFNVDFAQYAGYVTVNEENGRAL 553

Query: 565  FYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVA 624
            FYW  EA +D  S+PLVLWLNGGPGCSSIAYG +EE+GPFH++ DGK +Y NPYSWN VA
Sbjct: 554  FYWLTEATQDPASQPLVLWLNGGPGCSSIAYGMSEEVGPFHVDKDGKFVYSNPYSWNTVA 613

Query: 625  NVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESY 684
            N++F++SPVGVG+SY+NTS+D+ +NGDKRTAED L FLL W ERFPQ+KGRDFY+TGESY
Sbjct: 614  NLLFIDSPVGVGYSYSNTSADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESY 673

Query: 685  AGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTY 744
            AGHYVPQLSQ IVR N       INL GYMVGNA+ D+++DH G+F++MW+ G+ISDQT+
Sbjct: 674  AGHYVPQLSQVIVRYNKENTGSPINLLGYMVGNAVTDEYNDHFGLFQYMWTVGMISDQTF 733

Query: 745  KQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSN-RLLKRMHMVG 804
            K+LN  C    ++ PS  C +I+E +  ELGNID YSIFTPSC+    N RLL++ H  G
Sbjct: 734  KKLNDVCAKSFYIRPSPECRQIIEYMFHELGNIDHYSIFTPSCTTNDVNDRLLRKWHKSG 793

Query: 805  RVGERYDPCTEKHSVAYFNLPEVQKALHV------------------------------- 864
             +G  YDPCTE+HS  YFNLPEVQKALHV                               
Sbjct: 794  YIGRSYDPCTEQHSTIYFNLPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVY 853

Query: 865  ----------------------------------------------------------DP 881
                                                                      DP
Sbjct: 854  HELITSGLRIWMFSGDTDAVIPVTSTRYTINALNLTAISPWRAWYEDGQKSGYIGRSYDP 913

BLAST of Sgr020497 vs. NCBI nr
Match: XP_022158143.1 (serine carboxypeptidase II-2 isoform X2 [Momordica charantia])

HSP 1 Score: 890.2 bits (2299), Expect = 1.6e-254
Identity = 414/467 (88.65%), Postives = 437/467 (93.58%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MARPT  FLQ FAIFG+LHLGI V T SD PL+Q+ELDR++ELPGQ F+VNFAHYSGYIT
Sbjct: 1   MARPTWVFLQAFAIFGFLHLGISVSTYSDNPLLQQELDRVIELPGQKFDVNFAHYSGYIT 60

Query: 476 VNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYL 535
           VNEDSGR LFYWF EA EDS SKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGK LYL
Sbjct: 61  VNEDSGRALFYWFFEATEDSASKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKFLYL 120

Query: 536 NPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGR 595
           NPYSWNEVANVIF++SPVGVGFSY+NTSSDLLNNGDKRTAEDSL FLLKWFERFPQFKGR
Sbjct: 121 NPYSWNEVANVIFIDSPVGVGFSYSNTSSDLLNNGDKRTAEDSLAFLLKWFERFPQFKGR 180

Query: 596 DFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWS 655
           +FYITGESYAGHYVPQLSQAIVRNNLL KEKSINLKGYMVGNALFDDHHDHVGVFEF+WS
Sbjct: 181 EFYITGESYAGHYVPQLSQAIVRNNLLFKEKSINLKGYMVGNALFDDHHDHVGVFEFLWS 240

Query: 656 AGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRL 715
            GLISDQTYKQLNLQC NQSF+H SESCD+IL+VV+KELGNIDPYSIFTP CSDG SNRL
Sbjct: 241 TGLISDQTYKQLNLQCRNQSFIHSSESCDEILDVVNKELGNIDPYSIFTPPCSDGSSNRL 300

Query: 716 LKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDS 775
            KRMHMVG V ERYDPCTEKHSVAYFNLPEVQKALHVDPKHAP+ WETCS+L+N +WKDS
Sbjct: 301 WKRMHMVGHVAERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPAMWETCSELINTNWKDS 360

Query: 776 VGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGG 835
            GSVLDIY+ELIHSGLRIWVFSGDTDS+LPITSTRYSIDAL+LP  G W AWYDDGQVGG
Sbjct: 361 AGSVLDIYRELIHSGLRIWVFSGDTDSILPITSTRYSIDALELPQTGPWHAWYDDGQVGG 420

Query: 836 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 883
           WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLP+LQLH
Sbjct: 421 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPSLQLH 467

BLAST of Sgr020497 vs. NCBI nr
Match: XP_038890726.1 (serine carboxypeptidase II-2 [Benincasa hispida])

HSP 1 Score: 870.5 bits (2248), Expect = 1.3e-248
Identity = 408/467 (87.37%), Postives = 434/467 (92.93%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MARPT  FLQVFAIFG+LHLGIGVP SS+ PL Q+ELDRI ELPGQNF V FAHYSGYIT
Sbjct: 1   MARPTWVFLQVFAIFGFLHLGIGVPISSENPLRQQELDRIAELPGQNFGVKFAHYSGYIT 60

Query: 476 VNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYL 535
           VNEDSGR LFYWF EA +DS SKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADG+++YL
Sbjct: 61  VNEDSGRALFYWFFEATQDSASKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGQSVYL 120

Query: 536 NPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGR 595
           NPYSWNEVANV+FLESPVGVGFSY+NTSSDL+NNGDKRTAEDSL FLLKWFERF QFKG 
Sbjct: 121 NPYSWNEVANVLFLESPVGVGFSYSNTSSDLVNNGDKRTAEDSLTFLLKWFERFSQFKGN 180

Query: 596 DFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWS 655
           DFYITGESY GHYVPQLSQAIVRNN L KEKSINLKGYMVGNALFDDHHDHVGVFEF+WS
Sbjct: 181 DFYITGESYGGHYVPQLSQAIVRNNHLFKEKSINLKGYMVGNALFDDHHDHVGVFEFLWS 240

Query: 656 AGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRL 715
            GLISDQTYKQLNL C NQSFVH SESCD+IL+V +KELGNID YSIFTP C+D  SNRL
Sbjct: 241 TGLISDQTYKQLNLLCANQSFVHSSESCDEILDVANKELGNIDHYSIFTPPCTDSSSNRL 300

Query: 716 LKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDS 775
            KRMHMVGRVGERYDPCTEKHSVAYFNLPEVQ+ALHVDPK+APSKW+TCS+LVN +WKDS
Sbjct: 301 RKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQQALHVDPKYAPSKWDTCSELVNLNWKDS 360

Query: 776 VGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGG 835
            GSVLDIY+ELI +GLRIWVFSGDTD+VLPITSTRYS+DALKLP IG WRAWYDDGQVGG
Sbjct: 361 AGSVLDIYRELIQAGLRIWVFSGDTDAVLPITSTRYSVDALKLPVIGPWRAWYDDGQVGG 420

Query: 836 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 883
           WIQEYEGLTLVSVRGAGHEVPLH+PKLALQLIKAFLAGNSLP+LQLH
Sbjct: 421 WIQEYEGLTLVSVRGAGHEVPLHQPKLALQLIKAFLAGNSLPSLQLH 467

BLAST of Sgr020497 vs. NCBI nr
Match: PWA58648.1 (serine carboxypeptidase-like 23 [Artemisia annua])

HSP 1 Score: 858.2 bits (2216), Expect = 6.9e-245
Identity = 443/970 (45.67%), Postives = 566/970 (58.35%), Query Frame = 0

Query: 104 AEEVGPFHINADGKSLYLNPYSWN--------------------------------EAED 163
           +EEVGPFH++ DGKS+Y NPYSWN                                 AED
Sbjct: 2   SEEVGPFHVDKDGKSIYSNPYSWNTVANLLFIDSPVGVGYSYSNTSADIESNGDKRTAED 61

Query: 164 SLSFLLKWFERFPQFKGRDFYIIGESYGGHYAPQLSQAILKNNLLTMEKSINLKGYMVGN 223
            L FLL W ERFPQ+KGRDFY+ GESY GHY PQLSQ I++ N       INL GY+VGN
Sbjct: 62  LLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVRYNKENTGSPINLLGYLVGN 121

Query: 224 AVFDDHYDRMGVFNFLWSIGLISDQTYKQLNLHCENQSFVHHSKSCDKILDVVDKELGNI 283
           A+ D+H+D +G+F F W++GLISDQTYK+LN  C+   ++  S+ C +I + + +E+GNI
Sbjct: 122 ALTDEHHDHLGLFQFWWTVGLISDQTYKKLNDVCDKAFYMRPSQECYQIRENLIQEVGNI 181

Query: 284 DPFSIYTPSC-ADGPSNRLLKRMHMVGRDGESFDPCTKKHSIIYFNLPKVQKALHVDPKH 343
           D +SI+TPSC A+  +N+LL+R H  G  G S+DPCT++HS IYFNLP+VQKALHV   +
Sbjct: 182 DSYSIFTPSCTANDVNNKLLRRWHKFGYIGRSYDPCTEQHSTIYFNLPEVQKALHVYHSN 241

Query: 344 APSKWEIC--------------------------------SGDADTSLPTTSTRYSINAL 403
               WE+C                                SGD D  +P TSTRY+INAL
Sbjct: 242 TSRSWEVCSDFIETSWQDSPLSVLDVYHELITAGLRIWMFSGDTDGVIPVTSTRYTINAL 301

Query: 404 KLPLAGPWRAWYGDGQL-----------FIGESESRNVVVRETPTKRGWGDNHCLGLLSV 463
            L    PWRAWY DGQ+           F+    + + V    P          L LL  
Sbjct: 302 NLTTISPWRAWYEDGQVGGWTQMYKGLTFVAVRGAGHKVPLHKP-------KLALTLLKS 361

Query: 464 FPFKNSI--HFRISRHSFIRHRQEEVSLLRMARPTCFF--LQVFAIFGYLHLGIGVPTSS 523
           F     +    ++   S                 +CF+  +++  +FG       V    
Sbjct: 362 FMAGTPMPSFEQVIEPSRCPKSNRRFQEFLGGSVSCFYVGVRIGMMFGVCDDECNV---D 421

Query: 524 DYPLVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLW 583
           D PL +++ D++ +LPGQNF V+FA Y+GY+TVNE++GR LFYW  EA +D  S+PLVLW
Sbjct: 422 DDPLTEQKQDQVFDLPGQNFNVDFAQYAGYVTVNEENGRALFYWLTEATQDPASQPLVLW 481

Query: 584 LNGGPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTS 643
           LNGGPGCSSIAYG +EE+GPFH++ DGK +Y NPYSWN VAN++F++SPVGVG+SY+NTS
Sbjct: 482 LNGGPGCSSIAYGMSEEVGPFHVDKDGKFVYSNPYSWNTVANLLFIDSPVGVGYSYSNTS 541

Query: 644 SDLLNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLT 703
           +D+ +NGDKRTAED L FLL W ERFPQ+KGRDFY+TGESYAGHYVPQLSQ IVR N   
Sbjct: 542 ADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVRYNKEN 601

Query: 704 KEKSINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESC 763
               INL GYMVGNA+ D+++DH G+F++MW+ G+ISDQT+K+LN  C    ++ PS  C
Sbjct: 602 TGSPINLLGYMVGNAVTDEYNDHFGLFQYMWTVGMISDQTFKKLNDVCAKSFYIRPSPEC 661

Query: 764 DKILEVVDKELGNIDPYSIFTPSCSDGPSN-RLLKRMHMVGRVGERYDPCTEKHSVAYFN 823
            +I+E +  ELGNID YSIFTPSC+    N RLL++ H  G +G  YDPCTE+HS  YFN
Sbjct: 662 RQIIEYMFHELGNIDHYSIFTPSCTTNDVNDRLLRKWHKSGYIGRSYDPCTEQHSTIYFN 721

Query: 824 LPEVQKALHV-------------------------------------------------- 881
           LPEVQKALHV                                                  
Sbjct: 722 LPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVYHELITSGLRIWMFSGDTDA 781

BLAST of Sgr020497 vs. NCBI nr
Match: XP_004144720.1 (serine carboxypeptidase II-2 [Cucumis sativus] >KAE8651642.1 hypothetical protein Csa_021201 [Cucumis sativus])

HSP 1 Score: 849.7 bits (2194), Expect = 2.4e-242
Identity = 399/467 (85.44%), Postives = 425/467 (91.01%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MARPT  FLQ+F IF +LHLGI V  S++ PL Q+ELDRI ELPGQNFEV F HYSGYIT
Sbjct: 1   MARPTWVFLQLFTIFAFLHLGIAV--STENPLRQQELDRIAELPGQNFEVKFGHYSGYIT 60

Query: 476 VNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYL 535
           VNE+SGR LFYWF EA EDS SKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGK +YL
Sbjct: 61  VNEESGRALFYWFFEATEDSASKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKPVYL 120

Query: 536 NPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGR 595
           NPYSWNEVANV+FL+SP GVGFSY+NTSSDL+NNGDKRTAEDSL FLLKWFERFPQFKGR
Sbjct: 121 NPYSWNEVANVLFLDSPAGVGFSYSNTSSDLMNNGDKRTAEDSLAFLLKWFERFPQFKGR 180

Query: 596 DFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWS 655
           DFYITGESY GHYVPQLSQAIVRNNLL KEKSINLKGYMVGNALFDDHHDHVGVFEF+WS
Sbjct: 181 DFYITGESYGGHYVPQLSQAIVRNNLLFKEKSINLKGYMVGNALFDDHHDHVGVFEFLWS 240

Query: 656 AGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRL 715
            GLISDQTYKQLNL C NQSFVH S SCD+ILEV DKE+GNID YSIFTP CS+  SNRL
Sbjct: 241 TGLISDQTYKQLNLLCANQSFVHSSASCDEILEVADKEIGNIDHYSIFTPPCSEASSNRL 300

Query: 716 LKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDS 775
            KRMHM+GRVGERYDPCTEKHSVAYFNLPEVQ+ALHVDPK APSKWETCS L+N +WKDS
Sbjct: 301 RKRMHMIGRVGERYDPCTEKHSVAYFNLPEVQQALHVDPKFAPSKWETCSYLINGNWKDS 360

Query: 776 VGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGG 835
            GSVLDIY+ELI +GLRIWVFSGDTD+VLPITSTRYS+DALKLP IGSWR WYD GQVGG
Sbjct: 361 AGSVLDIYRELIQAGLRIWVFSGDTDAVLPITSTRYSVDALKLPVIGSWRPWYDGGQVGG 420

Query: 836 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 883
           WIQEYEG+TLVSVRGAGHEVPLH+PKLALQLIK+FLAGNSL  LQLH
Sbjct: 421 WIQEYEGVTLVSVRGAGHEVPLHQPKLALQLIKSFLAGNSLSPLQLH 465

BLAST of Sgr020497 vs. ExPASy Swiss-Prot
Match: Q949Q7 (Serine carboxypeptidase-like 29 OS=Arabidopsis thaliana OX=3702 GN=SCPL29 PE=2 SV=1)

HSP 1 Score: 656.8 bits (1693), Expect = 4.0e-187
Identity = 297/434 (68.43%), Postives = 355/434 (81.80%), Query Frame = 0

Query: 447 LVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNG 506
           L Q+E D++ +LPGQNF V+FAHYSG++  NE  GR LFYW  EA ED+ SKPLVLWLNG
Sbjct: 30  LSQKEQDKVSKLPGQNFNVSFAHYSGFVATNEQLGRALFYWLFEAVEDAKSKPLVLWLNG 89

Query: 507 GPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDL 566
           GPGCSS+AYGEAEEIGPFHI ADGKTLYLN YSWN+ AN++FL++PVGVG+SY+NTSSDL
Sbjct: 90  GPGCSSVAYGEAEEIGPFHIKADGKTLYLNQYSWNQAANILFLDAPVGVGYSYSNTSSDL 149

Query: 567 LNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEK 626
            +NGDKRTAEDSL FLLKW ERFP++KGRDFYI GESYAGHY+PQLS+AIV++N  + + 
Sbjct: 150 KSNGDKRTAEDSLKFLLKWVERFPEYKGRDFYIVGESYAGHYIPQLSEAIVKHNQGSDKN 209

Query: 627 SINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKI 686
           SINLKGYMVGN L DD HD +G+F+++WS G ISDQTY  L LQC  +SF+H S+ C+KI
Sbjct: 210 SINLKGYMVGNGLMDDFHDRLGLFQYIWSLGFISDQTYSLLQLQCGFESFIHSSKQCNKI 269

Query: 687 LEVVDKELGNIDPYSIFTPSC--SDGPSNRLLKRMHMVGRVGERYDPCTEKHSVAYFNLP 746
           LE+ DKE+GNID YS+FTP+C  +   SN LLK+  M  RV E+YDPCTEKH+  YFNLP
Sbjct: 270 LEIADKEIGNIDQYSVFTPACVANASQSNMLLKKRPMTSRVSEQYDPCTEKHTTVYFNLP 329

Query: 747 EVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVL 806
           EVQKALHV P  APSKW+TCSD+V+ HW DS  SVL+IY ELI +GLRIWVFSGD D+V+
Sbjct: 330 EVQKALHVPPGLAPSKWDTCSDVVSEHWNDSPSSVLNIYHELIAAGLRIWVFSGDADAVV 389

Query: 807 PITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLAL 866
           P+TSTRYSIDAL L  + ++  WY DGQVGGW Q+Y GL  V+VRGAGHEVPLH+PK AL
Sbjct: 390 PVTSTRYSIDALNLRPLSAYGPWYLDGQVGGWSQQYAGLNFVTVRGAGHEVPLHRPKQAL 449

Query: 867 QLIKAFLAGNSLPT 879
            L KAF++G  L T
Sbjct: 450 ALFKAFISGTPLST 463

BLAST of Sgr020497 vs. ExPASy Swiss-Prot
Match: P55748 (Serine carboxypeptidase II-2 (Fragment) OS=Hordeum vulgare OX=4513 GN=CXP;2-2 PE=1 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 8.3e-185
Identity = 289/429 (67.37%), Postives = 356/429 (82.98%), Query Frame = 0

Query: 455 IVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIA 514
           +  +PGQ F+ +FAHY+GY+TV+ED G  LFYWF EAA D  SKPL+LWLNGGPGCSSIA
Sbjct: 1   VPRVPGQAFDASFAHYAGYVTVSEDRGAALFYWFFEAAHDPASKPLLLWLNGGPGCSSIA 60

Query: 515 YGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRT 574
           +G  EE+GPFH+NADGK +++NPYSWN+VAN++FL+SPVGVG+SY+NTS+D+L+NGD+RT
Sbjct: 61  FGVGEEVGPFHVNADGKGVHMNPYSWNQVANILFLDSPVGVGYSYSNTSADILSNGDERT 120

Query: 575 AEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYM 634
           A+DSLVFL KW ERFPQ+K R+FY+TGESYAGHYVPQL+QAI R++  T +KSINLKGYM
Sbjct: 121 AKDSLVFLTKWLERFPQYKEREFYLTGESYAGHYVPQLAQAIKRHHEATGDKSINLKGYM 180

Query: 635 VGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKEL 694
           VGNAL DD HDH G+F++MW+ GLISDQTYK LN+ C+ +SFVH S  CDKIL++   E 
Sbjct: 181 VGNALTDDFHDHYGIFQYMWTTGLISDQTYKLLNIFCDFESFVHTSPQCDKILDIASTEA 240

Query: 695 GNIDPYSIFTPSCSD---GPSNRLLKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALH 754
           GNID YSIFTP+C        N+++KR+  VG++GE+YDPCTEKHS+ YFNL EVQKALH
Sbjct: 241 GNIDSYSIFTPTCHSSFASSRNKVVKRLRSVGKMGEQYDPCTEKHSIVYFNLHEVQKALH 300

Query: 755 VDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRY 814
           V+P    SKWETCS+++N +WKD   SVL IY ELI  GLRIW+FSGDTD+V+P+TSTRY
Sbjct: 301 VNPVIGKSKWETCSEVINTNWKDCERSVLHIYHELIQYGLRIWMFSGDTDAVIPVTSTRY 360

Query: 815 SIDALKLPSIGSWRAWY-DDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAF 874
           SIDALKLP++  W AWY DDG+VGGW Q Y+GL  V+VRGAGHEVPLH+PK AL LIK+F
Sbjct: 361 SIDALKLPTVTPWHAWYDDDGEVGGWTQGYKGLNFVTVRGAGHEVPLHRPKQALTLIKSF 420

Query: 875 LAGNSLPTL 880
           LAG  +P L
Sbjct: 421 LAGRPMPVL 429

BLAST of Sgr020497 vs. ExPASy Swiss-Prot
Match: Q9ZQQ0 (Serine carboxypeptidase-like 26 OS=Arabidopsis thaliana OX=3702 GN=SCPL26 PE=2 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 6.9e-155
Identity = 261/433 (60.28%), Postives = 318/433 (73.44%), Query Frame = 0

Query: 449 QRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEA--AEDSGSKPLVLWLNG 508
           ++E DRI  LPG+  +V+F+H+SGYITVNE +GR LFYW  E+  +E+  SKPLVLWLNG
Sbjct: 24  EQEKDRIFHLPGEPNDVSFSHFSGYITVNESAGRALFYWLTESPPSENPESKPLVLWLNG 83

Query: 509 GPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDL 568
           GPGCSS+AYG AEEIGPF IN DGKTLY NPYSWN++AN++FLESP GVGFSY+NT+SDL
Sbjct: 84  GPGCSSVAYGAAEEIGPFRINPDGKTLYHNPYSWNKLANLLFLESPAGVGFSYSNTTSDL 143

Query: 569 LNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEK 628
              GD+RTAED+ VFL+KWFERFPQ+K R+FYI GESYAGHYVPQLSQ +       +  
Sbjct: 144 YTAGDQRTAEDAYVFLVKWFERFPQYKHREFYIAGESYAGHYVPQLSQIVYEK----RNP 203

Query: 629 SINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKI 688
           +IN KG++VGNA+ DD+HD+VG+FE+ W+ GLISD TY  L + CE  S  HPS  C K 
Sbjct: 204 AINFKGFIVGNAVIDDYHDYVGLFEYWWAHGLISDLTYHNLRITCEFGSSEHPSSKCTKA 263

Query: 689 LEVVDKELGNIDPYSIFTPSCSDGPSNRLLKRMHMVGR--VGERYDPCTEKHSVAYFNLP 748
           +E  D E GNIDPYSI+T +C    +  L  R   V    +   YDPCTEK+S  YFN P
Sbjct: 264 MEAADLEQGNIDPYSIYTVTCKK-EAAALRSRFSRVRHPWMWRAYDPCTEKYSGMYFNSP 323

Query: 749 EVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVL 808
           EVQKA+H +       W+ CSD+V   W DS  S+L IY+ELI +GLRIWVFSGDTDSV+
Sbjct: 324 EVQKAMHANITGLAYPWKGCSDIVGEKWADSPLSMLPIYKELIAAGLRIWVFSGDTDSVV 383

Query: 809 PITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLAL 868
           PIT TRYSI ALKL  +  W  W DDGQVGGW Q Y+GLTLV++ GAGHEVPL +P+ A 
Sbjct: 384 PITGTRYSIRALKLQPLSKWYPWNDDGQVGGWSQVYKGLTLVTIHGAGHEVPLFRPRRAF 443

Query: 869 QLIKAFLAGNSLP 878
            L ++FL    LP
Sbjct: 444 LLFQSFLDNKPLP 451

BLAST of Sgr020497 vs. ExPASy Swiss-Prot
Match: Q9SFB5 (Serine carboxypeptidase-like 27 OS=Arabidopsis thaliana OX=3702 GN=SCPL27 PE=2 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 4.6e-151
Identity = 252/430 (58.60%), Postives = 317/430 (73.72%), Query Frame = 0

Query: 453 DRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEA--AEDSGSKPLVLWLNGGPGC 512
           DRI  LPGQ   V+F  YSGY+TV+E+ GR LFYW +E+  A D  S+PLVLWLNGGPGC
Sbjct: 32  DRISNLPGQPSNVDFRQYSGYVTVHEERGRALFYWLVESPLARDPKSRPLVLWLNGGPGC 91

Query: 513 SSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNG 572
           SS+AYG AEEIGPF + +DGKTL+   Y+WN++AN++FLESP GVGFSY+NT+SDL   G
Sbjct: 92  SSVAYGAAEEIGPFRVGSDGKTLHSKLYAWNKLANLLFLESPAGVGFSYSNTTSDLYTTG 151

Query: 573 DKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINL 632
           D+RTAEDS +FL+ WFERFPQ+K R+FYI GESYAGH+VPQLS+ +   N   K  +INL
Sbjct: 152 DQRTAEDSYIFLVNWFERFPQYKHREFYIVGESYAGHFVPQLSKLVHERNKGFKNPAINL 211

Query: 633 KGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVV 692
           KG+MVGNA+ DD+HD++G FE+ W+ GLISD TY QL   C + S  HPS  C   L   
Sbjct: 212 KGFMVGNAVTDDYHDYIGTFEYWWNHGLISDSTYHQLKTACYSVSSQHPSMQCMVALRNA 271

Query: 693 DKELGNIDPYSIFTPSCSDGPSNRLLKRMHMVGR---VGERYDPCTEKHSVAYFNLPEVQ 752
           + E GNIDPYSIFT  C+   S   LKR  + GR   +   YDPCTE++S  YFN  +VQ
Sbjct: 272 ELEQGNIDPYSIFTKPCN---STVALKRF-LKGRYPWMSRAYDPCTERYSNVYFNRLDVQ 331

Query: 753 KALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPIT 812
           KALH +       W+ CSD+V ++W DS  S+L IY+ELI +GL+IWVFSGDTD+V+PIT
Sbjct: 332 KALHANVTRLSYPWKACSDIVGSYWDDSPLSMLPIYKELITAGLKIWVFSGDTDAVVPIT 391

Query: 813 STRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLI 872
           +TRYS+DALKL +I +W  WYD G+VGGW Q Y+GLTLV+V GAGHEVPLH+P+ A  L 
Sbjct: 392 ATRYSVDALKLATITNWYPWYDHGKVGGWSQVYKGLTLVTVAGAGHEVPLHRPRQAFILF 451

Query: 873 KAFLAGNSLP 878
           ++FL    +P
Sbjct: 452 RSFLESKPMP 457

BLAST of Sgr020497 vs. ExPASy Swiss-Prot
Match: P08819 (Serine carboxypeptidase 2 OS=Triticum aestivum OX=4565 GN=CBP2 PE=1 SV=2)

HSP 1 Score: 526.9 bits (1356), Expect = 4.8e-148
Identity = 254/434 (58.53%), Postives = 311/434 (71.66%), Query Frame = 0

Query: 453 DRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSS 512
           DRI  LPGQ   V+F  YSGYITV+E +GR LFY   EA ED+   PLVLWLNGGPGCSS
Sbjct: 9   DRIARLPGQP-AVDFDMYSGYITVDEGAGRSLFYLLQEAPEDAQPAPLVLWLNGGPGCSS 68

Query: 513 IAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDK 572
           +AYG +EE+G F +   G  L LN Y WN+VANV+FL+SP GVGFSYTNTSSD+  +GD 
Sbjct: 69  VAYGASEELGAFRVKPRGAGLVLNEYRWNKVANVLFLDSPAGVGFSYTNTSSDIYTSGDN 128

Query: 573 RTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKG 632
           RTA DS  FL KWFERFP +K RDFYI GESYAGHYVP+LSQ + R    +K   INLKG
Sbjct: 129 RTAHDSYAFLAKWFERFPHYKYRDFYIAGESYAGHYVPELSQLVHR----SKNPVINLKG 188

Query: 633 YMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDK 692
           +MVGN L DD+HD+VG FEF W+ G++SD TY++L   C + SF+HPS +CD   +V   
Sbjct: 189 FMVGNGLIDDYHDYVGTFEFWWNHGIVSDDTYRRLKEACLHDSFIHPSPACDAATDVATA 248

Query: 693 ELGNIDPYSIFTPSC-----SDGPSNRLLKRMHMVGR---VGERYDPCTEKHSVAYFNLP 752
           E GNID YS++TP C     S   S+ L ++    GR   +   YDPCTE++S AY+N  
Sbjct: 249 EQGNIDMYSLYTPVCNITSSSSSSSSSLSQQRRSRGRYPWLTGSYDPCTERYSTAYYNRR 308

Query: 753 EVQKALHVDPKHAPS-KWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSV 812
           +VQ ALH +   A +  W TCSD +N HW D+  S+L IY+ELI +GLRIWVFSGDTD+V
Sbjct: 309 DVQMALHANVTGAMNYTWATCSDTINTHWHDAPRSMLPIYRELIAAGLRIWVFSGDTDAV 368

Query: 813 LPITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLA 872
           +P+T+TRYSI AL LP+  SW  WYDD +VGGW Q Y+GLTLVSVRGAGHEVPLH+P+ A
Sbjct: 369 VPLTATRYSIGALGLPTTTSWYPWYDDQEVGGWSQVYKGLTLVSVRGAGHEVPLHRPRQA 428

Query: 873 LQLIKAFLAGNSLP 878
           L L + FL G  +P
Sbjct: 429 LVLFQYFLQGKPMP 437

BLAST of Sgr020497 vs. ExPASy TrEMBL
Match: A0A2U1MBL6 (Carboxypeptidase OS=Artemisia annua OX=35608 GN=CTI12_AA204490 PE=3 SV=1)

HSP 1 Score: 960.7 bits (2482), Expect = 4.8e-276
Identity = 488/1049 (46.52%), Postives = 623/1049 (59.39%), Query Frame = 0

Query: 25   VSTSSEDPLVQQELDRIVELPGQNFEVNFEHFSGYITVNEDSGRALFYWFVEAVEDSGSK 84
            +S    D +  Q+ D +  LPGQNF V+F  ++GY+TVNE++GRALFYW  EA +D  SK
Sbjct: 14   MSVCDADSVSDQKKDLVTNLPGQNFNVDFAQYAGYVTVNEENGRALFYWLTEATQDPASK 73

Query: 85   PLLLWLTGGPGCSSIAYGEAEEVGPFHINADGKSLYLNPYSWN----------------- 144
            PL+LWL GGPGCSSI YG +EEVGPFH++ DGKS+Y NPYSWN                 
Sbjct: 74   PLVLWLNGGPGCSSIGYGMSEEVGPFHVDKDGKSIYSNPYSWNTVANLLFIDSPVGVGYS 133

Query: 145  ---------------EAEDSLSFLLKWFERFPQFKGRDFYIIGESYGGHYAPQLSQAILK 204
                            AED L FLL W ERFPQ+KGRDFY+ GESY GHY PQLSQ I++
Sbjct: 134  YSNTSADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVR 193

Query: 205  NNLLTMEKSINLKGYMVGNAVFDDHYDRMGVFNFLWSIGLISDQTYKQLNLHCENQSFVH 264
             N       INL GY+VGNA+ D+H+D +G+F F W++GLISDQTYK+LN  C+   ++ 
Sbjct: 194  YNKENTGSPINLLGYLVGNALTDEHHDHLGLFQFWWTVGLISDQTYKKLNDVCDKAFYMR 253

Query: 265  HSKSCDKILDVVDKELGNIDPFSIYTPSC-ADGPSNRLLKRMHMVGRDGESFDPCTKKHS 324
             S+ C +I + + +E+GNID +SI+TPSC A+  +N+LL+R H  G  G S+DPCT++HS
Sbjct: 254  PSQECYQIRENLIQEVGNIDSYSIFTPSCTANDVNNKLLRRWHKFGYIGRSYDPCTEQHS 313

Query: 325  IIYFNLPKVQKALHVDPKHAPSKWEIC--------------------------------S 384
             IYFNLP+VQKALHV   +    WE+C                                S
Sbjct: 314  TIYFNLPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVYHELITAGLRIWMFS 373

Query: 385  GDADTSLPTTSTRYSINALKLPLAGPWRAWYGDGQL-----------FIGESESRNVVVR 444
            GD D  +P TSTRY+INAL L    PWRAWY DGQ+           F+    + + V  
Sbjct: 374  GDTDGVIPVTSTRYTINALNLTTISPWRAWYEDGQVGGWTQMYKGLTFVAVRGAGHKVPL 433

Query: 445  ETPTKRGWGDNHCLGLLSVFPFKNSI--HFRISRHSFIRHRQEEVSLLRMARPTCFF--L 504
              P          L LL  F     +    ++   S                 +CF+  +
Sbjct: 434  HKP-------KLALTLLKSFMAGTPMPSFEQVIEPSRCPKSNRRFQEFLGGSVSCFYVGV 493

Query: 505  QVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVL 564
            ++  +FG       V    D PL +++ D++ +LPGQNF V+FA Y+GY+TVNE++GR L
Sbjct: 494  RIGMMFGVCDDECNV---DDDPLTEQKQDQVFDLPGQNFNVDFAQYAGYVTVNEENGRAL 553

Query: 565  FYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVA 624
            FYW  EA +D  S+PLVLWLNGGPGCSSIAYG +EE+GPFH++ DGK +Y NPYSWN VA
Sbjct: 554  FYWLTEATQDPASQPLVLWLNGGPGCSSIAYGMSEEVGPFHVDKDGKFVYSNPYSWNTVA 613

Query: 625  NVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESY 684
            N++F++SPVGVG+SY+NTS+D+ +NGDKRTAED L FLL W ERFPQ+KGRDFY+TGESY
Sbjct: 614  NLLFIDSPVGVGYSYSNTSADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESY 673

Query: 685  AGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTY 744
            AGHYVPQLSQ IVR N       INL GYMVGNA+ D+++DH G+F++MW+ G+ISDQT+
Sbjct: 674  AGHYVPQLSQVIVRYNKENTGSPINLLGYMVGNAVTDEYNDHFGLFQYMWTVGMISDQTF 733

Query: 745  KQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSN-RLLKRMHMVG 804
            K+LN  C    ++ PS  C +I+E +  ELGNID YSIFTPSC+    N RLL++ H  G
Sbjct: 734  KKLNDVCAKSFYIRPSPECRQIIEYMFHELGNIDHYSIFTPSCTTNDVNDRLLRKWHKSG 793

Query: 805  RVGERYDPCTEKHSVAYFNLPEVQKALHV------------------------------- 864
             +G  YDPCTE+HS  YFNLPEVQKALHV                               
Sbjct: 794  YIGRSYDPCTEQHSTIYFNLPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVY 853

Query: 865  ----------------------------------------------------------DP 881
                                                                      DP
Sbjct: 854  HELITSGLRIWMFSGDTDAVIPVTSTRYTINALNLTAISPWRAWYEDGQKSGYIGRSYDP 913

BLAST of Sgr020497 vs. ExPASy TrEMBL
Match: A0A6J1DV98 (Carboxypeptidase OS=Momordica charantia OX=3673 GN=LOC111024700 PE=3 SV=1)

HSP 1 Score: 890.2 bits (2299), Expect = 7.9e-255
Identity = 414/467 (88.65%), Postives = 437/467 (93.58%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MARPT  FLQ FAIFG+LHLGI V T SD PL+Q+ELDR++ELPGQ F+VNFAHYSGYIT
Sbjct: 1   MARPTWVFLQAFAIFGFLHLGISVSTYSDNPLLQQELDRVIELPGQKFDVNFAHYSGYIT 60

Query: 476 VNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYL 535
           VNEDSGR LFYWF EA EDS SKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGK LYL
Sbjct: 61  VNEDSGRALFYWFFEATEDSASKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKFLYL 120

Query: 536 NPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGR 595
           NPYSWNEVANVIF++SPVGVGFSY+NTSSDLLNNGDKRTAEDSL FLLKWFERFPQFKGR
Sbjct: 121 NPYSWNEVANVIFIDSPVGVGFSYSNTSSDLLNNGDKRTAEDSLAFLLKWFERFPQFKGR 180

Query: 596 DFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWS 655
           +FYITGESYAGHYVPQLSQAIVRNNLL KEKSINLKGYMVGNALFDDHHDHVGVFEF+WS
Sbjct: 181 EFYITGESYAGHYVPQLSQAIVRNNLLFKEKSINLKGYMVGNALFDDHHDHVGVFEFLWS 240

Query: 656 AGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRL 715
            GLISDQTYKQLNLQC NQSF+H SESCD+IL+VV+KELGNIDPYSIFTP CSDG SNRL
Sbjct: 241 TGLISDQTYKQLNLQCRNQSFIHSSESCDEILDVVNKELGNIDPYSIFTPPCSDGSSNRL 300

Query: 716 LKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDS 775
            KRMHMVG V ERYDPCTEKHSVAYFNLPEVQKALHVDPKHAP+ WETCS+L+N +WKDS
Sbjct: 301 WKRMHMVGHVAERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPAMWETCSELINTNWKDS 360

Query: 776 VGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGG 835
            GSVLDIY+ELIHSGLRIWVFSGDTDS+LPITSTRYSIDAL+LP  G W AWYDDGQVGG
Sbjct: 361 AGSVLDIYRELIHSGLRIWVFSGDTDSILPITSTRYSIDALELPQTGPWHAWYDDGQVGG 420

Query: 836 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 883
           WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLP+LQLH
Sbjct: 421 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPSLQLH 467

BLAST of Sgr020497 vs. ExPASy TrEMBL
Match: A0A0A0LJ45 (Carboxypeptidase OS=Cucumis sativus OX=3659 GN=Csa_2G036620 PE=3 SV=1)

HSP 1 Score: 860.1 bits (2221), Expect = 8.8e-246
Identity = 405/480 (84.38%), Postives = 432/480 (90.00%), Query Frame = 0

Query: 403 FIRHRQEEVSLLRMARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQN 462
           F RH Q E S + MARPT  FLQ+F IF +LHLGI V  S++ PL Q+ELDRI ELPGQN
Sbjct: 62  FFRHPQGEASFIIMARPTWVFLQLFTIFAFLHLGIAV--STENPLRQQELDRIAELPGQN 121

Query: 463 FEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIG 522
           FEV F HYSGYITVNE+SGR LFYWF EA EDS SKPLVLWLNGGPGCSSIAYGEAEEIG
Sbjct: 122 FEVKFGHYSGYITVNEESGRALFYWFFEATEDSASKPLVLWLNGGPGCSSIAYGEAEEIG 181

Query: 523 PFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFL 582
           PFHINADGK +YLNPYSWNEVANV+FL+SP GVGFSY+NTSSDL+NNGDKRTAEDSL FL
Sbjct: 182 PFHINADGKPVYLNPYSWNEVANVLFLDSPAGVGFSYSNTSSDLMNNGDKRTAEDSLAFL 241

Query: 583 LKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDD 642
           LKWFERFPQFKGRDFYITGESY GHYVPQLSQAIVRNNLL KEKSINLKGYMVGNALFDD
Sbjct: 242 LKWFERFPQFKGRDFYITGESYGGHYVPQLSQAIVRNNLLFKEKSINLKGYMVGNALFDD 301

Query: 643 HHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSI 702
           HHDHVGVFEF+WS GLISDQTYKQLNL C NQSFVH S SCD+ILEV DKE+GNID YSI
Sbjct: 302 HHDHVGVFEFLWSTGLISDQTYKQLNLLCANQSFVHSSASCDEILEVADKEIGNIDHYSI 361

Query: 703 FTPSCSDGPSNRLLKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWE 762
           FTP CS+  SNRL KRMHM+GRVGERYDPCTEKHSVAYFNLPEVQ+ALHVDPK APSKWE
Sbjct: 362 FTPPCSEASSNRLRKRMHMIGRVGERYDPCTEKHSVAYFNLPEVQQALHVDPKFAPSKWE 421

Query: 763 TCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIG 822
           TCS L+N +WKDS GSVLDIY+ELI +GLRIWVFSGDTD+VLPITSTRYS+DALKLP IG
Sbjct: 422 TCSYLINGNWKDSAGSVLDIYRELIQAGLRIWVFSGDTDAVLPITSTRYSVDALKLPVIG 481

Query: 823 SWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 882
           SWR WYD GQVGGWIQEYEG+TLVSVRGAGHEVPLH+PKLALQLIK+FLAGNSL  LQLH
Sbjct: 482 SWRPWYDGGQVGGWIQEYEGVTLVSVRGAGHEVPLHQPKLALQLIKSFLAGNSLSPLQLH 539

BLAST of Sgr020497 vs. ExPASy TrEMBL
Match: A0A2U1MBM1 (Carboxypeptidase OS=Artemisia annua OX=35608 GN=CTI12_AA204490 PE=3 SV=1)

HSP 1 Score: 858.2 bits (2216), Expect = 3.3e-245
Identity = 443/970 (45.67%), Postives = 566/970 (58.35%), Query Frame = 0

Query: 104 AEEVGPFHINADGKSLYLNPYSWN--------------------------------EAED 163
           +EEVGPFH++ DGKS+Y NPYSWN                                 AED
Sbjct: 2   SEEVGPFHVDKDGKSIYSNPYSWNTVANLLFIDSPVGVGYSYSNTSADIESNGDKRTAED 61

Query: 164 SLSFLLKWFERFPQFKGRDFYIIGESYGGHYAPQLSQAILKNNLLTMEKSINLKGYMVGN 223
            L FLL W ERFPQ+KGRDFY+ GESY GHY PQLSQ I++ N       INL GY+VGN
Sbjct: 62  LLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVRYNKENTGSPINLLGYLVGN 121

Query: 224 AVFDDHYDRMGVFNFLWSIGLISDQTYKQLNLHCENQSFVHHSKSCDKILDVVDKELGNI 283
           A+ D+H+D +G+F F W++GLISDQTYK+LN  C+   ++  S+ C +I + + +E+GNI
Sbjct: 122 ALTDEHHDHLGLFQFWWTVGLISDQTYKKLNDVCDKAFYMRPSQECYQIRENLIQEVGNI 181

Query: 284 DPFSIYTPSC-ADGPSNRLLKRMHMVGRDGESFDPCTKKHSIIYFNLPKVQKALHVDPKH 343
           D +SI+TPSC A+  +N+LL+R H  G  G S+DPCT++HS IYFNLP+VQKALHV   +
Sbjct: 182 DSYSIFTPSCTANDVNNKLLRRWHKFGYIGRSYDPCTEQHSTIYFNLPEVQKALHVYHSN 241

Query: 344 APSKWEIC--------------------------------SGDADTSLPTTSTRYSINAL 403
               WE+C                                SGD D  +P TSTRY+INAL
Sbjct: 242 TSRSWEVCSDFIETSWQDSPLSVLDVYHELITAGLRIWMFSGDTDGVIPVTSTRYTINAL 301

Query: 404 KLPLAGPWRAWYGDGQL-----------FIGESESRNVVVRETPTKRGWGDNHCLGLLSV 463
            L    PWRAWY DGQ+           F+    + + V    P          L LL  
Sbjct: 302 NLTTISPWRAWYEDGQVGGWTQMYKGLTFVAVRGAGHKVPLHKP-------KLALTLLKS 361

Query: 464 FPFKNSI--HFRISRHSFIRHRQEEVSLLRMARPTCFF--LQVFAIFGYLHLGIGVPTSS 523
           F     +    ++   S                 +CF+  +++  +FG       V    
Sbjct: 362 FMAGTPMPSFEQVIEPSRCPKSNRRFQEFLGGSVSCFYVGVRIGMMFGVCDDECNV---D 421

Query: 524 DYPLVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLW 583
           D PL +++ D++ +LPGQNF V+FA Y+GY+TVNE++GR LFYW  EA +D  S+PLVLW
Sbjct: 422 DDPLTEQKQDQVFDLPGQNFNVDFAQYAGYVTVNEENGRALFYWLTEATQDPASQPLVLW 481

Query: 584 LNGGPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTS 643
           LNGGPGCSSIAYG +EE+GPFH++ DGK +Y NPYSWN VAN++F++SPVGVG+SY+NTS
Sbjct: 482 LNGGPGCSSIAYGMSEEVGPFHVDKDGKFVYSNPYSWNTVANLLFIDSPVGVGYSYSNTS 541

Query: 644 SDLLNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLT 703
           +D+ +NGDKRTAED L FLL W ERFPQ+KGRDFY+TGESYAGHYVPQLSQ IVR N   
Sbjct: 542 ADIESNGDKRTAEDLLQFLLNWLERFPQYKGRDFYMTGESYAGHYVPQLSQVIVRYNKEN 601

Query: 704 KEKSINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESC 763
               INL GYMVGNA+ D+++DH G+F++MW+ G+ISDQT+K+LN  C    ++ PS  C
Sbjct: 602 TGSPINLLGYMVGNAVTDEYNDHFGLFQYMWTVGMISDQTFKKLNDVCAKSFYIRPSPEC 661

Query: 764 DKILEVVDKELGNIDPYSIFTPSCSDGPSN-RLLKRMHMVGRVGERYDPCTEKHSVAYFN 823
            +I+E +  ELGNID YSIFTPSC+    N RLL++ H  G +G  YDPCTE+HS  YFN
Sbjct: 662 RQIIEYMFHELGNIDHYSIFTPSCTTNDVNDRLLRKWHKSGYIGRSYDPCTEQHSTIYFN 721

Query: 824 LPEVQKALHV-------------------------------------------------- 881
           LPEVQKALHV                                                  
Sbjct: 722 LPEVQKALHVYHSNTSRSWEVCSDFIETSWQDSPLSVLDVYHELITSGLRIWMFSGDTDA 781

BLAST of Sgr020497 vs. ExPASy TrEMBL
Match: A0A5A7ULU8 (Carboxypeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G00240 PE=3 SV=1)

HSP 1 Score: 846.3 bits (2185), Expect = 1.3e-241
Identity = 395/467 (84.58%), Postives = 427/467 (91.43%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MARPT  FLQ+F I  +LHLGI V  S++ PL Q+ELDRI ELPGQNFEV F HYSGYIT
Sbjct: 1   MARPTWVFLQLFTIVAFLHLGIAV--STENPLRQQELDRIAELPGQNFEVKFGHYSGYIT 60

Query: 476 VNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLYL 535
           VNE+SGR LFYWF EA EDS SKPLVLWLNGGPGCSSIA+GEAEEIGPFHINADGK++YL
Sbjct: 61  VNEESGRALFYWFFEATEDSASKPLVLWLNGGPGCSSIAFGEAEEIGPFHINADGKSVYL 120

Query: 536 NPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKGR 595
           NPYSWNEVANV+FL+SP GVGFSY+NTSSDL+NNGDKRTAEDSL FLLKWFERFPQFKGR
Sbjct: 121 NPYSWNEVANVLFLDSPAGVGFSYSNTSSDLMNNGDKRTAEDSLAFLLKWFERFPQFKGR 180

Query: 596 DFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMWS 655
           DFYITGESY GHYVPQLSQAIVRNNLL KEKSINLKGYMVGNALFDDHHDHVGVFEF+WS
Sbjct: 181 DFYITGESYGGHYVPQLSQAIVRNNLLFKEKSINLKGYMVGNALFDDHHDHVGVFEFLWS 240

Query: 656 AGLISDQTYKQLNLQCENQSFVHPSESCDKILEVVDKELGNIDPYSIFTPSCSDGPSNRL 715
            GLISDQTYKQLNL+C N+SFVHPS SCD+ILEV DKE+GNID YSIFTP CS+  SNRL
Sbjct: 241 TGLISDQTYKQLNLRCANESFVHPSASCDEILEVADKEIGNIDHYSIFTPPCSEVSSNRL 300

Query: 716 LKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDS 775
            KRMHMVGR+GERYDPCTE+HSVAYFNLPEVQ+ALHVDPK APSKWETCS ++N +WKDS
Sbjct: 301 RKRMHMVGRIGERYDPCTEQHSVAYFNLPEVQQALHVDPKFAPSKWETCSYVINGNWKDS 360

Query: 776 VGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGG 835
            GSVLDIY+ELI +GLRIWVFSGDTD+VLPITSTRYS+DALKLP IG WRAWYD+GQVGG
Sbjct: 361 AGSVLDIYRELIQAGLRIWVFSGDTDAVLPITSTRYSVDALKLPVIGPWRAWYDEGQVGG 420

Query: 836 WIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLPTLQLH 883
           WIQEYEG+TLVSVRGAGHEVPLH+PKLALQL KAFLAGNSL  LQLH
Sbjct: 421 WIQEYEGVTLVSVRGAGHEVPLHQPKLALQLFKAFLAGNSLSPLQLH 465

BLAST of Sgr020497 vs. TAIR 10
Match: AT4G30810.1 (serine carboxypeptidase-like 29 )

HSP 1 Score: 656.8 bits (1693), Expect = 2.8e-188
Identity = 297/434 (68.43%), Postives = 355/434 (81.80%), Query Frame = 0

Query: 447 LVQRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNG 506
           L Q+E D++ +LPGQNF V+FAHYSG++  NE  GR LFYW  EA ED+ SKPLVLWLNG
Sbjct: 30  LSQKEQDKVSKLPGQNFNVSFAHYSGFVATNEQLGRALFYWLFEAVEDAKSKPLVLWLNG 89

Query: 507 GPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDL 566
           GPGCSS+AYGEAEEIGPFHI ADGKTLYLN YSWN+ AN++FL++PVGVG+SY+NTSSDL
Sbjct: 90  GPGCSSVAYGEAEEIGPFHIKADGKTLYLNQYSWNQAANILFLDAPVGVGYSYSNTSSDL 149

Query: 567 LNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEK 626
            +NGDKRTAEDSL FLLKW ERFP++KGRDFYI GESYAGHY+PQLS+AIV++N  + + 
Sbjct: 150 KSNGDKRTAEDSLKFLLKWVERFPEYKGRDFYIVGESYAGHYIPQLSEAIVKHNQGSDKN 209

Query: 627 SINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKI 686
           SINLKGYMVGN L DD HD +G+F+++WS G ISDQTY  L LQC  +SF+H S+ C+KI
Sbjct: 210 SINLKGYMVGNGLMDDFHDRLGLFQYIWSLGFISDQTYSLLQLQCGFESFIHSSKQCNKI 269

Query: 687 LEVVDKELGNIDPYSIFTPSC--SDGPSNRLLKRMHMVGRVGERYDPCTEKHSVAYFNLP 746
           LE+ DKE+GNID YS+FTP+C  +   SN LLK+  M  RV E+YDPCTEKH+  YFNLP
Sbjct: 270 LEIADKEIGNIDQYSVFTPACVANASQSNMLLKKRPMTSRVSEQYDPCTEKHTTVYFNLP 329

Query: 747 EVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVL 806
           EVQKALHV P  APSKW+TCSD+V+ HW DS  SVL+IY ELI +GLRIWVFSGD D+V+
Sbjct: 330 EVQKALHVPPGLAPSKWDTCSDVVSEHWNDSPSSVLNIYHELIAAGLRIWVFSGDADAVV 389

Query: 807 PITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLAL 866
           P+TSTRYSIDAL L  + ++  WY DGQVGGW Q+Y GL  V+VRGAGHEVPLH+PK AL
Sbjct: 390 PVTSTRYSIDALNLRPLSAYGPWYLDGQVGGWSQQYAGLNFVTVRGAGHEVPLHRPKQAL 449

Query: 867 QLIKAFLAGNSLPT 879
            L KAF++G  L T
Sbjct: 450 ALFKAFISGTPLST 463

BLAST of Sgr020497 vs. TAIR 10
Match: AT2G35780.1 (serine carboxypeptidase-like 26 )

HSP 1 Score: 549.7 bits (1415), Expect = 4.9e-156
Identity = 261/433 (60.28%), Postives = 318/433 (73.44%), Query Frame = 0

Query: 449 QRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEA--AEDSGSKPLVLWLNG 508
           ++E DRI  LPG+  +V+F+H+SGYITVNE +GR LFYW  E+  +E+  SKPLVLWLNG
Sbjct: 24  EQEKDRIFHLPGEPNDVSFSHFSGYITVNESAGRALFYWLTESPPSENPESKPLVLWLNG 83

Query: 509 GPGCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDL 568
           GPGCSS+AYG AEEIGPF IN DGKTLY NPYSWN++AN++FLESP GVGFSY+NT+SDL
Sbjct: 84  GPGCSSVAYGAAEEIGPFRINPDGKTLYHNPYSWNKLANLLFLESPAGVGFSYSNTTSDL 143

Query: 569 LNNGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEK 628
              GD+RTAED+ VFL+KWFERFPQ+K R+FYI GESYAGHYVPQLSQ +       +  
Sbjct: 144 YTAGDQRTAEDAYVFLVKWFERFPQYKHREFYIAGESYAGHYVPQLSQIVYEK----RNP 203

Query: 629 SINLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKI 688
           +IN KG++VGNA+ DD+HD+VG+FE+ W+ GLISD TY  L + CE  S  HPS  C K 
Sbjct: 204 AINFKGFIVGNAVIDDYHDYVGLFEYWWAHGLISDLTYHNLRITCEFGSSEHPSSKCTKA 263

Query: 689 LEVVDKELGNIDPYSIFTPSCSDGPSNRLLKRMHMVGR--VGERYDPCTEKHSVAYFNLP 748
           +E  D E GNIDPYSI+T +C    +  L  R   V    +   YDPCTEK+S  YFN P
Sbjct: 264 MEAADLEQGNIDPYSIYTVTCKK-EAAALRSRFSRVRHPWMWRAYDPCTEKYSGMYFNSP 323

Query: 749 EVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVL 808
           EVQKA+H +       W+ CSD+V   W DS  S+L IY+ELI +GLRIWVFSGDTDSV+
Sbjct: 324 EVQKAMHANITGLAYPWKGCSDIVGEKWADSPLSMLPIYKELIAAGLRIWVFSGDTDSVV 383

Query: 809 PITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLAL 868
           PIT TRYSI ALKL  +  W  W DDGQVGGW Q Y+GLTLV++ GAGHEVPL +P+ A 
Sbjct: 384 PITGTRYSIRALKLQPLSKWYPWNDDGQVGGWSQVYKGLTLVTIHGAGHEVPLFRPRRAF 443

Query: 869 QLIKAFLAGNSLP 878
            L ++FL    LP
Sbjct: 444 LLFQSFLDNKPLP 451

BLAST of Sgr020497 vs. TAIR 10
Match: AT3G07990.1 (serine carboxypeptidase-like 27 )

HSP 1 Score: 537.0 bits (1382), Expect = 3.3e-152
Identity = 252/430 (58.60%), Postives = 317/430 (73.72%), Query Frame = 0

Query: 453 DRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEA--AEDSGSKPLVLWLNGGPGC 512
           DRI  LPGQ   V+F  YSGY+TV+E+ GR LFYW +E+  A D  S+PLVLWLNGGPGC
Sbjct: 32  DRISNLPGQPSNVDFRQYSGYVTVHEERGRALFYWLVESPLARDPKSRPLVLWLNGGPGC 91

Query: 513 SSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNG 572
           SS+AYG AEEIGPF + +DGKTL+   Y+WN++AN++FLESP GVGFSY+NT+SDL   G
Sbjct: 92  SSVAYGAAEEIGPFRVGSDGKTLHSKLYAWNKLANLLFLESPAGVGFSYSNTTSDLYTTG 151

Query: 573 DKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINL 632
           D+RTAEDS +FL+ WFERFPQ+K R+FYI GESYAGH+VPQLS+ +   N   K  +INL
Sbjct: 152 DQRTAEDSYIFLVNWFERFPQYKHREFYIVGESYAGHFVPQLSKLVHERNKGFKNPAINL 211

Query: 633 KGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILEVV 692
           KG+MVGNA+ DD+HD++G FE+ W+ GLISD TY QL   C + S  HPS  C   L   
Sbjct: 212 KGFMVGNAVTDDYHDYIGTFEYWWNHGLISDSTYHQLKTACYSVSSQHPSMQCMVALRNA 271

Query: 693 DKELGNIDPYSIFTPSCSDGPSNRLLKRMHMVGR---VGERYDPCTEKHSVAYFNLPEVQ 752
           + E GNIDPYSIFT  C+   S   LKR  + GR   +   YDPCTE++S  YFN  +VQ
Sbjct: 272 ELEQGNIDPYSIFTKPCN---STVALKRF-LKGRYPWMSRAYDPCTERYSNVYFNRLDVQ 331

Query: 753 KALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPIT 812
           KALH +       W+ CSD+V ++W DS  S+L IY+ELI +GL+IWVFSGDTD+V+PIT
Sbjct: 332 KALHANVTRLSYPWKACSDIVGSYWDDSPLSMLPIYKELITAGLKIWVFSGDTDAVVPIT 391

Query: 813 STRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLI 872
           +TRYS+DALKL +I +W  WYD G+VGGW Q Y+GLTLV+V GAGHEVPLH+P+ A  L 
Sbjct: 392 ATRYSVDALKLATITNWYPWYDHGKVGGWSQVYKGLTLVTVAGAGHEVPLHRPRQAFILF 451

Query: 873 KAFLAGNSLP 878
           ++FL    +P
Sbjct: 452 RSFLESKPMP 457

BLAST of Sgr020497 vs. TAIR 10
Match: AT4G30610.1 (alpha/beta-Hydrolases superfamily protein )

HSP 1 Score: 488.4 bits (1256), Expect = 1.3e-137
Identity = 240/471 (50.96%), Postives = 318/471 (67.52%), Query Frame = 0

Query: 416 MARPTCFFLQVFAIFGYLHLGIGVPTSSDYPLVQRELDRIVELPGQNFEVNFAHYSGYIT 475
           MAR    FL + A+     L    P+SS     ++E DRI  LPGQ  +V F+ YSGY+ 
Sbjct: 1   MARTHFIFLLLVAL-----LSTTFPSSSSSR--EQEKDRIKALPGQP-KVAFSQYSGYVN 60

Query: 476 VNEDSGRVLFYWFIEAAEDS-GSKPLVLWLNGGPGCSSIAYGEAEEIGPFHINADGKTLY 535
           VN+  GR LFYW  E++  S  +KPL+LWLNGGPGCSSIAYG +EEIGPF IN  G  LY
Sbjct: 61  VNQSHGRALFYWLTESSSPSPHTKPLLLWLNGGPGCSSIAYGASEEIGPFRINKTGSNLY 120

Query: 536 LNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLNNGDKRTAEDSLVFLLKWFERFPQFKG 595
           LN ++WN+ AN++FLESP GVG+SYTNTSSDL ++GD+RTA+D+L+FL+KW  RFPQ+K 
Sbjct: 121 LNKFAWNKDANLLFLESPAGVGYSYTNTSSDLKDSGDERTAQDNLIFLIKWLSRFPQYKY 180

Query: 596 RDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSINLKGYMVGNALFDDHHDHVGVFEFMW 655
           RDFYI GESYAGHYVPQL++ I   N    +  INLKG++VGNA+ D+ +D +G   + W
Sbjct: 181 RDFYIAGESYAGHYVPQLAKKINDYNKAFSKPIINLKGFLVGNAVTDNQYDSIGTVTYWW 240

Query: 656 SAGLISDQTYKQLNLQCENQSFVHPSESCDKILE-VVDKELGNIDPYSIFTPSCSDGPSN 715
           +  +ISD++YK +   C N +    S+ CD  +   ++ E G+ID YSI+TP+C      
Sbjct: 241 THAIISDKSYKSILKYC-NFTVERVSDDCDNAVNYAMNHEFGDIDQYSIYTPTCVAAQQK 300

Query: 716 R-------LLKRMHMVGRVGERYDPCTEKHSVAYFNLPEVQKALHVDPKHAPSKWETCSD 775
           +        +K   +  R+   YDPCTE ++  YFN P+VQ+A+H +      KW  CSD
Sbjct: 301 KNTTGFFVRMKNTLLRRRLVSGYDPCTESYAEKYFNRPDVQRAMHANVTGIRYKWTACSD 360

Query: 776 LVNAHWKDSVGSVLDIYQELIHSGLRIWVFSGDTDSVLPITSTRYSIDALKLPSIGSWRA 835
           ++   WKDS  ++L IY+EL  SGLRIW+FSGDTDSV+P+T+TR+S+  L LP    W  
Sbjct: 361 VLIKTWKDSDKTMLPIYKELAASGLRIWIFSGDTDSVVPVTATRFSLSHLNLPVKTRWYP 420

Query: 836 WYDDGQVGGWIQEYEGLTLVSVRGAGHEVPLHKPKLALQLIKAFLAGNSLP 878
           WY D QVGGW + Y+GLT  +VRGAGHEVPL +PK AL L ++FLAG  LP
Sbjct: 421 WYTDNQVGGWTEVYKGLTFATVRGAGHEVPLFEPKRALILFRSFLAGKELP 462

BLAST of Sgr020497 vs. TAIR 10
Match: AT3G02110.1 (serine carboxypeptidase-like 25 )

HSP 1 Score: 484.6 bits (1246), Expect = 1.9e-136
Identity = 231/443 (52.14%), Postives = 306/443 (69.07%), Query Frame = 0

Query: 449 QRELDRIVELPGQNFEVNFAHYSGYITVNEDSGRVLFYWFIEAAEDSGSKPLVLWLNGGP 508
           + E DRI  LPGQ   V F  +SGY+TV++ SGR LFYW  EA++   SKPLV+WLNGGP
Sbjct: 32  EAEADRITSLPGQP-NVTFEQFSGYVTVDKLSGRSLFYWLTEASDLPLSKPLVIWLNGGP 91

Query: 509 GCSSIAYGEAEEIGPFHINADGKTLYLNPYSWNEVANVIFLESPVGVGFSYTNTSSDLLN 568
           GCSS+AYG +EEIGPF I+  G  LYLN ++WN ++N++FLE+P GVGFSYTN SSDL N
Sbjct: 92  GCSSVAYGASEEIGPFRISKGGSGLYLNKFAWNSISNLLFLEAPAGVGFSYTNRSSDLFN 151

Query: 569 NGDKRTAEDSLVFLLKWFERFPQFKGRDFYITGESYAGHYVPQLSQAIVRNNLLTKEKSI 628
            GD+RTA+DSL FL++W  RFP++  R+ YITGESYAGHYVPQL++ I+  N  +K   +
Sbjct: 152 TGDRRTAKDSLQFLIQWLHRFPRYNHREIYITGESYAGHYVPQLAKEIMNYNKRSK-NPL 211

Query: 629 NLKGYMVGNALFDDHHDHVGVFEFMWSAGLISDQTYKQLNLQCENQSFVHPSESCDKILE 688
           NLKG MVGNA+ D+H+D++G   + WS  +ISD+TY QL   C+  S    S+ C+ +  
Sbjct: 212 NLKGIMVGNAVTDNHYDNLGTVSYWWSHAMISDRTYHQLISTCD-FSRQKESDECETLYS 271

Query: 689 -VVDKELGNIDPYSIFTPSCS---------DGPSNRLLKRM----HMVGRVGERYDPCTE 748
             +++E GNID Y+I+ P C+         +G S R   R+    H V R    YDPCTE
Sbjct: 272 YAMEQEFGNIDQYNIYAPPCNKSSDGGGSYNGSSGRRSMRLPHLPHSVLRKISGYDPCTE 331

Query: 749 KHSVAYFNLPEVQKALHVDPKHAPSKWETCSDLVNAHWKDSVGSVLDIYQELIHSGLRIW 808
           +++  Y+N P+VQKALH +    P KW  CS+++N +W D+  +VL IY+E+I  G+R+W
Sbjct: 332 RYAEIYYNRPDVQKALHANTTKIPYKWTACSEVLNRNWNDTDSTVLPIYREMIAGGIRVW 391

Query: 809 VFSGDTDSVLPITSTRYSIDALKLPSIGSWRAWYDDGQVGGWIQEYEGLTLVSVRGAGHE 868
           VFSGD DSV+P+T+TRYS+  L L +   W  WY   QVGGW + YEGLT V+VRGAGHE
Sbjct: 392 VFSGDVDSVVPVTATRYSLARLSLSTKLPWYPWYVKKQVGGWTEVYEGLTFVTVRGAGHE 451

Query: 869 VPLHKPKLALQLIKAFLAGNSLP 878
           VPL KP+ A +L K FL G  LP
Sbjct: 452 VPLFKPRAAFELFKYFLRGKPLP 471

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PWA58649.19.8e-27646.52serine carboxypeptidase-like 23 [Artemisia annua][more]
XP_022158143.11.6e-25488.65serine carboxypeptidase II-2 isoform X2 [Momordica charantia][more]
XP_038890726.11.3e-24887.37serine carboxypeptidase II-2 [Benincasa hispida][more]
PWA58648.16.9e-24545.67serine carboxypeptidase-like 23 [Artemisia annua][more]
XP_004144720.12.4e-24285.44serine carboxypeptidase II-2 [Cucumis sativus] >KAE8651642.1 hypothetical protei... [more]
Match NameE-valueIdentityDescription
Q949Q74.0e-18768.43Serine carboxypeptidase-like 29 OS=Arabidopsis thaliana OX=3702 GN=SCPL29 PE=2 S... [more]
P557488.3e-18567.37Serine carboxypeptidase II-2 (Fragment) OS=Hordeum vulgare OX=4513 GN=CXP;2-2 PE... [more]
Q9ZQQ06.9e-15560.28Serine carboxypeptidase-like 26 OS=Arabidopsis thaliana OX=3702 GN=SCPL26 PE=2 S... [more]
Q9SFB54.6e-15158.60Serine carboxypeptidase-like 27 OS=Arabidopsis thaliana OX=3702 GN=SCPL27 PE=2 S... [more]
P088194.8e-14858.53Serine carboxypeptidase 2 OS=Triticum aestivum OX=4565 GN=CBP2 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A2U1MBL64.8e-27646.52Carboxypeptidase OS=Artemisia annua OX=35608 GN=CTI12_AA204490 PE=3 SV=1[more]
A0A6J1DV987.9e-25588.65Carboxypeptidase OS=Momordica charantia OX=3673 GN=LOC111024700 PE=3 SV=1[more]
A0A0A0LJ458.8e-24684.38Carboxypeptidase OS=Cucumis sativus OX=3659 GN=Csa_2G036620 PE=3 SV=1[more]
A0A2U1MBM13.3e-24545.67Carboxypeptidase OS=Artemisia annua OX=35608 GN=CTI12_AA204490 PE=3 SV=1[more]
A0A5A7ULU81.3e-24184.58Carboxypeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G002... [more]
Match NameE-valueIdentityDescription
AT4G30810.12.8e-18868.43serine carboxypeptidase-like 29 [more]
AT2G35780.14.9e-15660.28serine carboxypeptidase-like 26 [more]
AT3G07990.13.3e-15258.60serine carboxypeptidase-like 27 [more]
AT4G30610.11.3e-13750.96alpha/beta-Hydrolases superfamily protein [more]
AT3G02110.11.9e-13652.14serine carboxypeptidase-like 25 [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001563Peptidase S10, serine carboxypeptidasePRINTSPR00724CRBOXYPTASECcoord: 843..856
score: 54.12
coord: 536..548
score: 54.88
coord: 585..610
score: 57.69
coord: 549..559
score: 71.15
IPR001563Peptidase S10, serine carboxypeptidasePFAMPF00450Peptidase_S10coord: 459..872
e-value: 1.2E-138
score: 463.0
coord: 128..319
e-value: 8.9E-42
score: 143.9
IPR001563Peptidase S10, serine carboxypeptidasePANTHERPTHR11802SERINE PROTEASE FAMILY S10 SERINE CARBOXYPEPTIDASEcoord: 317..355
IPR001563Peptidase S10, serine carboxypeptidasePANTHERPTHR11802SERINE PROTEASE FAMILY S10 SERINE CARBOXYPEPTIDASEcoord: 437..880
coord: 129..318
IPR001563Peptidase S10, serine carboxypeptidasePANTHERPTHR11802SERINE PROTEASE FAMILY S10 SERINE CARBOXYPEPTIDASEcoord: 12..128
IPR029058Alpha/Beta hydrolase foldGENE3D3.40.50.1820alpha/beta hydrolasecoord: 450..709
e-value: 1.9E-100
score: 337.8
coord: 131..263
e-value: 3.1E-38
score: 133.9
IPR029058Alpha/Beta hydrolase foldGENE3D3.40.50.1820alpha/beta hydrolasecoord: 33..130
e-value: 1.7E-35
score: 124.9
IPR029058Alpha/Beta hydrolase foldSUPERFAMILY53474alpha/beta-Hydrolasescoord: 39..354
IPR029058Alpha/Beta hydrolase foldSUPERFAMILY53474alpha/beta-Hydrolasescoord: 453..876
NoneNo IPR availableGENE3D3.40.50.11320coord: 779..873
e-value: 7.3E-29
score: 101.6
NoneNo IPR availableGENE3D6.10.250.940coord: 286..322
e-value: 2.2E-11
score: 45.3
coord: 732..770
e-value: 8.4E-14
score: 53.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 960..984
NoneNo IPR availablePANTHERPTHR11802:SF32SERINE CARBOXYPEPTIDASE-LIKE 29coord: 12..128
NoneNo IPR availablePANTHERPTHR11802:SF32SERINE CARBOXYPEPTIDASE-LIKE 29coord: 437..880
coord: 129..318
coord: 317..355
IPR033124Serine carboxypeptidases, histidine active sitePROSITEPS00560CARBOXYPEPT_SER_HIScoord: 843..860
IPR018202Serine carboxypeptidase, serine active sitePROSITEPS00131CARBOXYPEPT_SER_SERcoord: 599..606
IPR018202Serine carboxypeptidase, serine active sitePROSITEPS00131CARBOXYPEPT_SER_SERcoord: 153..160

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr020497.1Sgr020497.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004185 serine-type carboxypeptidase activity