Sgr020787 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020787
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Cysteine protease
Location: tig00153572: 1087478 .. 1099159 (-)
RNA-Seq Expression: Sgr020787
Synteny: Sgr020787
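
The location field packs a contig name, a start and end coordinate, and a strand into one string. A minimal sketch of how that string could be split apart is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the helper names `parse_location` and `span_length` are hypothetical, not part of any database tooling.

```python
import re

# Hypothetical helper pattern for strings like "contig: start .. end (strand)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153572: 1087478 .. 1099159 (-)")
print(contig, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 11,682 positions on the minus strand of tig00153572. With half-open coordinates the length would be one less.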
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAGGGGGAAAGATCTGAAATCAATATGCTCTTCTGAAAGTGCAACTGATATTGTAGATAGAAGTCAGCAATCTGTTTGTCCAGAGTCAGGGTCTAAAAATCGTATATCCTCCAAAGCCTCCTTATGGTCAGGCTTTTTTGCATCTACTTTCTCAATCTTTGAACGTCACAATGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCAATCTCGACAAAATGGCTGGACAACAACTGTGAGAAAAGTTATGACTAGTGGCTCAATGAGGAGAATTCACGAGCGCATACTGGGTTCGCGCAAGAGTGGTGTCTATAGCTCAGGTGGTGATATATGGCTTCTAGGTGTGTGCTATAAAGTTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGCAATGGCGTAGCAGCATATGAGCTAGATTTTTCATCTAGAATTCTCATGACCTATCGTAAAGGTTTCATGAAATATGCTCTCTTTAGTTTTATCATAAGCTTGCATAAATTTCCTGGACAACTTTCGGTAACTTATTTTTTGTCTCTCCATTTTTAGGTTTTAATGTTATTCAAGATTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTAAGAAGCAGCCAAATGCTTGTTGCTCAGGTAGATGACTTATTACTTTATTTATTTTGGTGAGATTTACGCTTTTGTCTTATCCACGGCTATTCTCAATCTCGGTTCTTCTTCTTCTTCTTCTTTTCTCTTAAAAAAAAAACCTTTGTATTGACCTTCTTGTTTTCTTTGGTTATGAGGTTTTCCAAAAAGTTATTGAGTTCTTTCCTTCTATTAATCTCCATATTATCTACAGGCATTACTTTTTCATAGATTAGGAAGGTCTTGGAGAAAAACTTCACAGAAGGTATGGCTGTATGGTGATGATACTGGAAGATTTGTTAGCATACTGATTCATTTTCCATGCAAAATTAGCGATAAACCTAATTGACTTTTATTGGCAGCCATTGGACAAAGAATATATTGAAATTTTGCACCTTTTTGGTGACTCGGAAATGTCGGCATTTTCAATCCACAATCTTCTTCAAGCAGGAAAGGCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATGTGTAGGTCATGGGAAACTCTAGTCCGCTTAAAGAGGGAGATCCCTAATCTTCAAGACCTGCAACTTCCAATGGCCATGTATATTGTTTCTGGAGATGAAGATGGAGAGAGAGGTGGTGCTCCAGTTTTATGCATTGATGATGCCTGTAGACATTGTTTAGAGTTTTCTAAAGGCCAACTTGATTGGACCCCCATTCTTTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAACCCAAGGTATGTTATAGTTGTTTACTGGTCTTGTATTTTTTAATGCTTTGGGCTTACCTACTAATTTTAATAATAAGAAATAGAATTTTCATTAAAGATCAACTACTTTTAGAGGCTATAGTTTTCCCCTAGTCATTGGTTTTGCATCTTCATGCTTTAGCGGTTTTATTGCAAATTGAAAGTTTTGGTAAGAAATTGTAAACTTTATCCCAATAGGTTTGGAAAGTGTGGCTGTATCGCAGCTCTTTTAAGTCTAGCATTGCTTCTTGGTGGGAGGAGTTGGGGGGAAGGGTGGGGGAAGATTTTACTATCAATTATTACATTGGATTTTGTGAAAAGAAAGTTGGCAAAATGAAATGTGAAGATGTTTGGAGGATTTGTTGCTTTTGGATGAGGCTAGTTGGAGACTAAAACCCAAGGTTAAATAGACTTAGAAAGGTAATTTTAATACCACATTCTTTGGAAATGTGGCTTCAACTAGAGGAAAATCAACTTGGTGAGCTTGTTAATTGCTGAAGATGAAACTATTCTTGTAACCTGTTAATTTTTTATTGAATTTCTTGAGCTTGCCTCTGGGTTAGGAATGAATCTAAAGAATTGACAGATTTATTCCTTTGCTCTTTAAGCACATTGCT
CCTTAAGTATCTAGGTTTCCCCACTTGGTGAAACTGGAAATGGTGAAATAGTTCTTAGAATCTATAATATATTTTGCTAATTACTTAGAATCAGTTCTATTTATGGCTTTATACTGTAAATGTCTGCGACTCTGCATCCTTTTAAGGACTATGACCTTACTACCCTTATAGCTCGGTGGAAAAGTTTTTTGTAATCCACCTTTGGTGGTGTAAGGCCTCATCCTTCCCCTTTTGTACATTTCATTTGATCAATGAAATTCTTGTTTCTTCTCCCAAAAAAAAAAAAAAGGGGAGAGAAAAAGAAAAACGGAAAAAGAAATGGTGAAATACTGACTTTTGGGGAGACCCTTATGTAAAAAATTCAGCTGAGAATCGAAAAAATGGGGAAAAATTCTAATCTCTAAGGGAGGTAGATAGACCTTGGTGTAGTCGGTTCTTAATAGTCTTCCACTTATTACTTATATGCGTTTAAAGCACCCATCACTGTGATAAAAACTTTGATGAAATTGCCAAGAGATAACCATTGCAAGTAATATAGCGGGAAGACATTAATAATTGACTGCACGAAGATCAATCTTCCCATAGACATTAGGGTGAAATTTTTAGAGATTTGTTGTGGGAAGGAACTGATGGAAGAATCTTCAAGAGTCTTAACTAGATTTCGGATGTTATTTTGCTCTCGTCCTTCTTGCAACCTCAAGTCTAAACTTGTTTTTTGTTATTAGTTATTATCTACTATATTGGCCTTGGATAGCTTTTTAATAACTTCTTTGGTCCTTTGTTTTGTAAATTTCATAGATCAATGAAATGTCTATAGTTACTTTTAGAGAAAAGAAGTATTATATCTATACTCTCCAAGGAAGCCATATTGAAGCTATGTATGAGAGAAAAATGTATTTCTTATATCTTGAGTGGAGAGGAAAAACTCAAATTTATACAAGAGGATGATTGAGAATTCTCAATCATGTATGCTAGAATTAAGCAATCAATCTAAACTTATATGGAAAGGAAAAAATACAAGGAAACTAACTTAGTATGCAAGGAAAAAATCCCTAATAGGAAAAATTCCTAATTTACAGGTTGATTTTCTAATCTCTAATTTATAGCTTGATTTTCTACACTCTCCCTCAAGCTGAATGGTATATGCTAGTCATTCCAAGCTTGGAAATTAATCCACCAAGACTGGTTCGATGCAGAGTCTTAGTTAATATATCAACGGCTTGCTGTTGGGAAGGCACATATCTCAGATTGATTATATTTGTCTCAATTTTCTCACTAATGAAGTGACGATCTATCTCGACATGCTTCATTCGATCATGATGTATCGGATTTTTTGCAATGACTATGACAGATTGATTATCACATAGTACTTCTATAGTACCTTCTATCTCAACCTTGAGCTCAGTAAGAAGACGTTTCAGCCAAATTCCTTCACAAATCCCATGGGCTAATGATCGAAATTCAGCTTCAGCAATGCTTCGGGCTACAACTTGTTGTTTCTTGCTGCGCCAGGTGACTAAATTTTCCCAAACATAGGAACAATATCCTGATGTTGACTTACGGTCAATAGGAGATCTAACCCAATCTGCATCAGTGTAAACTTTTAAAGATTGATTCGTGGTCTTCTTGAACATGAGGCCTTTTCCAGGATCATGCTTGAGATATCTCAATATTCTATACACAGCTTCTAGATGTTCCTTAGAAGGTTTGTTCAGAAATTGACTTACCACGCTAACAGCAAAGCAAATGTCTGATCGAGTGTGAGTCAAGTATATTAACTTTCCAACTAGTCGTTGATACCTGTCTCGATCAACTGCTTCATCTTTAGGGTTAACTCCAAGTTTTGAATTCGCATCCATAGGTGTATCAACTGGTTTGCAACAACTCATCCCTGTCTCTCTTAAGAGATCTAAAGTGTATTTTTGTTGAGTGACTGAAATCTCTTTGCTAGATCTTGCTACTTCCATTCCTAGGAAGTACCTCAGATGTCCCAAGTCTTT
TATCTCAAATTTCCTTGAAAGAAGTTGTTTCAACCGACTCATTTCTTCAAAGCATTTCTTGTGAGAACTATATCATCAACATATACCAGTGTTATAAAAAGCGCGAGGCGCACTAAAGCGCAATAGGCTTCTGGGGCTTAAGCGCGAGGCGCACAAAAAAGCGAGGGCTTTTTTTTGTGAGGCGCACAATAGAAAAATGTATATAAATTTTTTATATCTGAAAAAAATTAGTATTTTGAATAATAAGATATTGAAAAGTTTAAGAAATTAGGGTTTTTATAAGAGATTGCGGAGAAAAATGAACATTTTTTAAGAATATTTCTTAGTTTTTACCTAAAAAATTAAAAAATAAAAATAAAAAAGGGCTTCAAGCCCTGAGGCGCGCGCCTCATTAAGGGCGCCTCGCTTCTTTGAAGCGAGGCGCCTAGAGTGCGCCTTTCTTCGAAGCGCGCCTAGGCGAGCGCCTAGGCACGCTTTTTAAATCACTGACATATACTATTAACACAGTAATCTTACTTGGTGATAAATGCTTGATGAATAGGGTGTGATCGGATTGACACTGAGTATAACCATCTTGTTTCAACACATTTGTAAATTTTTCAAACCAAGCCTGAGGTGATTATTTTAGTCCATAGAGAGATTTCTTGAGTTTACATATCTTTCCTCTAGTGTATCTGTCCTCAAATCCAGGAGGTATGTCCATGTATACTTCCTCCACTAAATCATCATTCAAAAATGCATTCTTTACATTAAGTTGAAACAATGACAAATCTAGATTTGTTGCAATAGAGAGAAGTACTCAAATAGTATGTAATTTGGCAACTGGAGCAAAGGTTTTCTAATAGTCAATTCCATAAGATTGGGTGAATCCTTTAGCTACAAGACGAGCTTTGAACCGCTCTATACTACCGTATGGTTTGTGTTTGGTCGAAAAAATCCATTTACATCCAACCGTGCGCTTTCCTGGGGGCAAATCTGTAAGGATCCATGTCCTATTTTTCTCAAGAGCCTGTAGCTCTTCTAGAGTGGTTGCTTTTCATTCAGGTCTCTGTAAAGCTTCCTGAACTGTGTTGGGAGTTTGTATCTGATCTAGTGAAGTCACAATGCTCTAAAAGATGGTGACAGATTAGTGTAAGATAAGTGATAACGTATGGGATGTTGAGTACATGAGCGCACACATTTCCTTACAGCAATGGGTCTGTCCATATCATCTTCTGTTGTCTCTTGTACTATTTCAGGGACCTTGGAATTTTGGTTTAGATCTTGACTTTGCTGAATTTGTGTAAGTTGTATTTCGTCCCTCTCAGGTTGTCTTGCTCTTCGCGAATAAATCTGCAGTTTAGAGTCAGAGATGGATATGAGAGTCGGCTCAGAAGAGGGTACAGAAGAACTTTCGGATAGGCTTAGAGGAAAGTGAGGAATGGAGTCCCAATTCTGTAGTTCAAGATCATAATGCTCCCCTTGAATCACAGAATTGGAAAAATATGGTTGATTTTCAAAGAAGGTGACATCCATGGTATGAAAAAATTGGTGCATTGAATGATGATAACATTTGTATCCTTTCTTGTTTGGTGAGTATCCGAGAAAGATGCATTTCATAGATCGTGCATCAAGTTTACTACGGTGTTGAGGATATATATTAACAAATGTGGTACACCCAAATATTTTTAGTGGTAGTGGAGACACAAGATGGGATGTAGGATCTAGTGTAAGTACTCTGCAAGGGTGTGTTGAACTTGAGAATGCGGGAGGGCATTTGATTTATGAGATAGGTAGCAATTAGAACAGCGTCACCCCAAAAAGTTTTTGGAACGTGGGATGTGAGCATTAGGGATTAGGCTACATCAAGAAGATGACGATTTTTGCAATCAACTACACCATTTTGTTGAGGTGTGTCAACACAAGAACTCTGGTGAACAATACCATGAGATTGAAGATAGGGTCCAAGAGTAGAGTTGAAAAAATCACGAGCAATGTCAGTTTTAAGTACTTTGATATTG
ACTTGAAACTGATTTTGGATCATTTTATGAAAGACTGTGAACAGTTGGCTAGTTTCAGATTTATCTTTTATAAGGAATGTCCAACTCAAACGGGTGTGATCATCAACAAATAAAAGAAACCAACGTGCCCATTGATATTTTTTACCCTTGAAGGTCCCCAAATGTCTCCATGGATAAGAGAGAAAGGTTGAGATGGAGAATAAGTATGGGGTGAAAAATGAGCATGTGTGTGTTTGGAGAGTTGACAAATTTCACACTGGAAAAATTGACTCTTTTTATTGATAAATAAAGAGGGAAACAAATGCTCAAGATAGACAAAGTTGGGATGACCAAGACAATAGTGCAACATCATAACAAGACTATCTTTATTTGAATGATTTAAAGACTTGGAAACATATGCGGATCACGATTAGACTGAACTACAGTTTACTTGCTGCTTTGAGAAGATACAATCCTGCACATAATTTAGCATTGCCAATCATCTTCCCCGATCCCAAGTCCTGAAAAATACATGAATTTGCAAGGAATTTAGTTTCGCAATTCAAGTCTTTATTCAACTTACTCACAGATAGTAGGTTATATTCCAACTTTGGCACGAACAATACTGATTCAAGGGTTAGAGAGTTTGAAATTTGAATACTTCCAATCCCAGTTACTTAGCAGGTGTACCATCAGCAATTCGAACTGAAAAATTACCCGTATAAAGAGAAAAAGATGAGAATAAGGATCTATCACCTGTCATATGATCTGATGTTCCAGAGTCCACAATCCATTCAGATGGAAGGGTTTGAGTATGAAGGGCAAGGAAATTTTGATTACTTTGTTGTTGCACATTTCCAGAACCACTTATTGTTGTTGAATTGGTTTTGTTCAATAGTTGCTGTAGCATATCCAACAAATATTTGGTAAGTGGTTATTGGTTAGAAGTTTCGGAGACAGCAGCATTACCCCTAGATTCTCGATCTGATTTGGCAGCATTTGGCTTCCAATTTGCTGGTTTTCCATGTATTTTCCAACAATTGTCTTTGACATGTCCCAGTTTACAACAATGGTCACACCATGGTCGCCCTTTACATTGTCGACTTTCTCTAAATGATTGAGAATTTCCTCGAGCTGCAAGAGCAATTGGATCATTGGAGTGATCCAGTTGGGAAATAAGGGCAGATCCATTTTGAGCAGGAGGATGTTCGGTGGGTCCCAGCATTATTTGCTTTCGACTTTCTTCCCGGCGGACTTCAGCGAAAACTTCGCGAAGACTGGGTAATGGCTTAGTACCAAGGATTCGACAACGGACTTCATCCAAAGATTTATTGAGACCCATAAGAAACTTGAAAATTTTTTTGTCTCCACAATATTTTGGAAGAGAGCACCATCATGAGAACATTTTCATTCATGAGTTTCAAATAAGTCCAATGGTTGCTAATAGCGAGATAAAGTGGTGTAGTAAGTCGTAACAGATAAATCACCTTGTTTGAGATCTTGGAGAGTGGTCTCTATCTGGAAGAGTTCAGCAGTGTTTTCTTGATTGGAGAATGTGTCCCGAGCGCCATCCCAAATCTCTTTGGCAGTTGAAAATAGAAGAAAATTCTCAACCAGCTCATAACTTGATTGTTTTTGGCCCTCCATATGCAAAATTTTGGATCCTCGGGATTGGGTGAAGTTGCTTTACCTGTGATATAGTCTTCTCGTCCACGACCATACATGAACATCATCAGAGATTGGGACCACTGAAGATAGTTGTGGCCTTTGAGTTTGTGACATGTTATCTGAGTATTGTTGCTTTCGGTGGAGGGCGAAATGGTAATTTGCGATTCTGATCCTGATATTATATAAAAATAAGAGGAAGCAGAGAAGAAGAATTAGGTCACTGTTTGGAGAGAAAAACACAGAGGCAACAATGGCGGCGGTAGCGACGGTAGAGGCGGCGACAATAGCGGAGATGGCGGCGGCGGCAGTAGCGGAGATGGCGGTGCGGCAGCAATAGCGACGGTAGAGGCGGT
GGTAGCAGCGGAGATGGCGGCTGTTGCTGCAGAGACGATGGCTGCTGCGACAGAGACAACTGCTACTGCAGCGGTGGCGGCGGCGGTGAGATCTGCGAGTTCGAGCAACAGCTAGGGTTTTGTGAAGGATGCGTGACGGCTAGGGTTTTCTAGATAGGGAGGCTCGACGGAAGAAGGCTAGTCCAGCAACGACGGCTAGGGTTCCGACGAGGAGGATAAGATTTGAAGAAATTCAAGGCTCTGATACCATGAAGCTATGTATGAGAGAAAAATGTATTTCTTATATCTTGAGTGGAGAGAAAAAACTCAAATTTATACAAGATAATGATTGAGAATTCTCAATCATGTATGTTATAATTAAGCAATTAATCTAAACTTATATGAAAAGAAAAAAATACAAGGAAACAAACTTAGTATACAAGGAAAAAATCTCTAATAGAAAAAATTCCTAATTTACAGCTTGATTTTCTAATTCCTAATTTACAGCTTGATTTTCTACGCATATCATCAACTGGTTCAGTTTGCCATGCAAAGGGGAGTTTCCAAGCTCCTATTTACATGATACTCCTTCAAACGTTCATCTCTTGATCTATAATACACACCAAACAGGCTAAGGAAAAATTTTCCCTGTTTGACTTAATACTCTTTAGCATAGGATGAAGTTCAATTACCTCCTTCACAAATGTAATCCTGCCCTCTTGATTTGTAGACAAGTGTTTTAAGGCAATTAAGTGGGCACATTTACCTAGTCTATAGATATCCACAAGGGATAAGCCTTCTACAAAGCCTATAGGCCTTCCTAAACTTGAGCCAGTATAGGTACAGCCTACAATGAACCAGCATTGCTTGAATTGTTAATAAACTCTACTGGTGGGGTGGCTTCTTCGTTTTATAGATCCTTCGTCAATTCAACCCAAATATAGTACCTTGCCAATCAACTATTGCTCTTTAATGTGCCTGAGTACGCACCCACTATACTGTTGTGAAGCTCTTGTAAAATTTGGATCCAGTAGTTGAAGTTGATGGCAACACAATTATGTTTATGGACCTTAGAATTCATTGTATTAAACGATGAAGTGCATTAGTTAGAGTACCCTTTTGAAGAGCTGATTTATTAGTTGAGAGCTTTGCATCACCCTCCAGTTGTCGGTTGATCTCCTACAAGTTCACCTTATAAGATAGATCACTGTTCAAGGGCAGGTTTAAAAGGTAGCTTCTACTTGCTCATCTGTTCGAAAATAAAGTGAGAATTTTATGGATTTGTGCAACTAAAAATCTCTTTTGGGAAGTTTGGATGTGAAGACATCTAAAGACTTGCCCAGAACTCATCTAGAAAATAATTGGAACTCCTACACTTTAAAGCCTCTCTTTAGTGCTCCCTTCATAAATCTTTTTGTAGTTACAATTATTTTTCAGTTAATGCCAACCGTTACTTGGGGAGGAATCTTTAGATCTTTCGTTAGTTTGCACCACAAATCATCGGACAAGTAAGGAAGTATTGCCTCTAAATTAAGGACCATGATGAGTTTAAATTTTTTAATAGATGACCATCAACTCTCATAAATTGGTTTGGTTCGTTGAAGCAGAAGGTTCACTAAGAAGCAATGAGTCATTCGGTTTACACTGTACAACTCCCAAGCCATCCCTTAAAGAATTAGTCTCAACCATAACAAGTTTGGAAAGTTTGGAAAGGCCAATATTGGAATGATTGTCATGGCTTCTTTAGCATTTGAAAAGCTTGTTTGCTTGCTGTGACCAACCAAACTTATCTTCGATTAATTGCTAAGTTAATGGCTCGCTATACTTCCATAACTTGCCACACATGGTCACTTCACATAAGACTTAAAGAATTCATGCAATTCTTTTTGGTTTTCTGGGCACTAGCCATCGCAACTTGGCCTCCACCTTTGTTGAATCTACCACTGCTCCTTGCTCGAAAACTGACTATCCAAGCAGTTAGTGTAAAGACACGCAGTTTGAGGACTAGCTGTCTGAGGCAC
TTTCAACAATCCAAGCACTGGATTAAGAAGGGGAATGAGAGATTTCCGTAGGATGCGTACTTTGGAGTTTGACTCCTATTTTTGTTATTCTCAAACCATTTTGTGTGAGTAAAATATAAACAACGGGAGCAATTAAGTGGATGATATTATTTTTTAACTCTGCATCATAACATCTCACCCCGAATTAGGAACCTTCTCATTGCCTTGGGATTCATGCTCTTAGCAACAAAAATGCTGAGGGTTATTCACTTCATGGCCCCTGGTCACTAGCAGCCATGACCTCGTTGGTTGGCGTACCAATTTCCAAACACCCTTGACCAAGCCTTAGTAAAGGTCTTGAAATCTTTATTGATGTTGTACTCTGTTATTAACTGCCATAAAAATGTATTTGTGTGTGCGTTTGCTGGCTCGTTTCCTAAAATATAGCAGCCTCACTTGACTAATCTGGAGATCTACGTGTTATAACTATAGTTTAACTAGTAGAAATTGGAGCTTAGAAATTTACGCTCATCATATTTTTGCAACTTGTGAAGGTACATCCCATCATTGCGAACAACATTTACCTTTCCTCAAAGCCTTGGGATCTTGGGAGGGAAACCTGGTGCTTCAACATATATCATGGGCATTCAAGATGAAAATGCTTTTTACCTTGATCCACATGAAGTCCAGCCGGTACATTAATTTCCTTTGGACCTTCTCTTGGAAGAATATGTAAACTTTCCTTTTGTATGTGTTTGATATCATGATGGTGATTCTTACAGAGAAATCCTGAACTTTTCTATACATTGGTCTTCTAGTACTCTAGCTAGATGCTTTGAATCTGGAGCAAACATATGTACATGTTAGACACCTGAAAGATGTTTTATAGTACTCTTTCGAACTTACGAATGTTCTAATAAATTGTAGGCATGTGGACTCAGCTAATAAATTGATAAAAAATAATGATTATTTCAATATGAAACTATATTGTAATACTTTTCTTAACTCTAATCTGAAAAAATGTCTTCTCTATTTATTTACTTAAATTTTACATTTGTGTAGATAACCTTAGTTGTAATTATTAGATGGTCAATCCCTCAAAATATGTTTATTTGGTTTTGGATATACTTTATAACCCTTCCTACAATGCTTACATTTTAGGTAATGAATATCGATAAGGATGATCTAGAGGCCGATACTTCATCTTATCACTGCAAGTGAGTTTTTTCAGATTATAAATTTCTCGTAGTTCACTGATTTTTATTTCTTTTGCCTTCCTCTTTTTTTTTTGGTTTCTCATGTGTTGTTTTCTGTAGTGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGATTTTATTGTCGAGACAAAGGTTTGCAAATCACTCTAACGTCAAAATTGTGCATGGGAACTTGATTGAAGCTTTTCATATTGTAAAACTTAATCTTGCTTCCCTTTTTTTCTTGTGCCCAGATGATTTTGATGATTTCTGTTATCGAGCTACGAAGTTAGCTGAAGAGTCAGATGGAGCTCCACTGTTTACAGTTGCTGAAACACATTCTGCAAATTCAGTGAGACACAGCAATGCATTGGATGGCGGTAGTAGGTTAGTAGAGGACGATACCCTCGGCATGGTGCACATGCCGAATGAAGAGGGCACACATGAGGACGACTGGCAATTTCTCTGA

mRNA sequence

ATGGGAAGGGGGAAAGATCTGAAATCAATATGCTCTTCTGAAAGTGCAACTGATATTGTAGATAGAAGTCAGCAATCTGTTTGTCCAGAGTCAGGGTCTAAAAATCGTATATCCTCCAAAGCCTCCTTATGGTCAGGCTTTTTTGCATCTACTTTCTCAATCTTTGAACGTCACAATGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCAATCTCGACAAAATGGCTGGACAACAACTGTGAGAAAAGTTATGACTAGTGGCTCAATGAGGAGAATTCACGAGCGCATACTGGGTTCGCGCAAGAGTGGTGTCTATAGCTCAGGTGGTGATATATGGCTTCTAGGTGTGTGCTATAAAGTTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGCAATGGCGTAGCAGCATATGAGCTAGATTTTTCATCTAGAATTCTCATGACCTATCGTAAAGGTTTTAATGTTATTCAAGATTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTAAGAAGCAGCCAAATGCTTGTTGCTCAGGCATTACTTTTTCATAGATTAGGAAGGTCTTGGAGAAAAACTTCACAGAAGCCATTGGACAAAGAATATATTGAAATTTTGCACCTTTTTGGTGACTCGGAAATGTCGGCATTTTCAATCCACAATCTTCTTCAAGCAGGAAAGGCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATGTGTAGGTCATGGGAAACTCTAGTCCGCTTAAAGAGGGAGATCCCTAATCTTCAAGACCTGCAACTTCCAATGGCCATGTATATTGTTTCTGGAGATGAAGATGGAGAGAGAGGTGGTGCTCCAGTTTTATGCATTGATGATGCCTGTAGACATTGTTTAGAGTTTTCTAAAGGCCAACTTGATTGGACCCCCATTCTTTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAACCCAAGGTACATCCCATCATTGCGAACAACATTTACCTTTCCTCAAAGCCTTGGGATCTTGGGAGGGAAACCTGGTGCTTCAACATATATCATGGGCATTCAAGATGAAAATGCTTTTTACCTTGATCCACATGAAGTCCAGCCGGTAATGAATATCGATAAGGATGATCTAGAGGCCGATACTTCATCTTATCACTGCAATGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGATTTTATTGTCGAGACAAAGATGATTTTGATGATTTCTGTTATCGAGCTACGAAGTTAGCTGAAGAGTCAGATGGAGCTCCACTGTTTACAGTTGCTGAAACACATTCTGCAAATTCAGTGAGACACAGCAATGCATTGGATGGCGGTAGTAGGTTAGTAGAGGACGATACCCTCGGCATGGTGCACATGCCGAATGAAGAGGGCACACATGAGGACGACTGGCAATTTCTCTGA

Coding sequence (CDS)

ATGGGAAGGGGGAAAGATCTGAAATCAATATGCTCTTCTGAAAGTGCAACTGATATTGTAGATAGAAGTCAGCAATCTGTTTGTCCAGAGTCAGGGTCTAAAAATCGTATATCCTCCAAAGCCTCCTTATGGTCAGGCTTTTTTGCATCTACTTTCTCAATCTTTGAACGTCACAATGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCAATCTCGACAAAATGGCTGGACAACAACTGTGAGAAAAGTTATGACTAGTGGCTCAATGAGGAGAATTCACGAGCGCATACTGGGTTCGCGCAAGAGTGGTGTCTATAGCTCAGGTGGTGATATATGGCTTCTAGGTGTGTGCTATAAAGTTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGCAATGGCGTAGCAGCATATGAGCTAGATTTTTCATCTAGAATTCTCATGACCTATCGTAAAGGTTTTAATGTTATTCAAGATTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTAAGAAGCAGCCAAATGCTTGTTGCTCAGGCATTACTTTTTCATAGATTAGGAAGGTCTTGGAGAAAAACTTCACAGAAGCCATTGGACAAAGAATATATTGAAATTTTGCACCTTTTTGGTGACTCGGAAATGTCGGCATTTTCAATCCACAATCTTCTTCAAGCAGGAAAGGCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATGTGTAGGTCATGGGAAACTCTAGTCCGCTTAAAGAGGGAGATCCCTAATCTTCAAGACCTGCAACTTCCAATGGCCATGTATATTGTTTCTGGAGATGAAGATGGAGAGAGAGGTGGTGCTCCAGTTTTATGCATTGATGATGCCTGTAGACATTGTTTAGAGTTTTCTAAAGGCCAACTTGATTGGACCCCCATTCTTTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAACCCAAGGTACATCCCATCATTGCGAACAACATTTACCTTTCCTCAAAGCCTTGGGATCTTGGGAGGGAAACCTGGTGCTTCAACATATATCATGGGCATTCAAGATGAAAATGCTTTTTACCTTGATCCACATGAAGTCCAGCCGGTAATGAATATCGATAAGGATGATCTAGAGGCCGATACTTCATCTTATCACTGCAATGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGATTTTATTGTCGAGACAAAGATGATTTTGATGATTTCTGTTATCGAGCTACGAAGTTAGCTGAAGAGTCAGATGGAGCTCCACTGTTTACAGTTGCTGAAACACATTCTGCAAATTCAGTGAGACACAGCAATGCATTGGATGGCGGTAGTAGGTTAGTAGAGGACGATACCCTCGGCATGGTGCACATGCCGAATGAAGAGGGCACACATGAGGACGACTGGCAATTTCTCTGA

Protein sequence

MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL
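
The protein sequence can be cross-checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code, which the page does not specify, and uses a hypothetical `translate` helper. To keep it short, only the first 30 and last 18 nucleotides of the CDS listed above are used.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order (assumed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGGAAGGGGGAAAGATCTGAAATCAATA"  # first 30 nt of the CDS
cds_end = "GACTGGCAATTTCTCTGA"                # last 18 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MGRGKDLKSI and DWQFL, which agree with the start and end of the protein sequence listed above.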
Homology
BLAST of Sgr020787 vs. NCBI nr
Match: XP_022155850.1 (cysteine protease ATG4-like [Momordica charantia])

HSP 1 Score: 912.1 bits (2356), Expect = 2.0e-261
Identity = 446/484 (92.15%), Positives = 458/484 (94.63%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL SICSS +ATDIVDRSQQSVCPESGSKNR SSKASLW GFFASTFSIFE HNE
Sbjct: 1   MGRGKDLNSICSSGTATDIVDRSQQSVCPESGSKNRTSSKASLWPGFFASTFSIFEHHNE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKK VQSR NGWTTTVRK+MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKTVQSRHNGWTTTVRKIMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQD ASDDAVTSN VA YELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDLASDDAVTSNSVATYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALL HRLGRSWRKT QKPLDKEYIEILHLFGDSE SAFSIHNLLQAG AYDLAAGSWVG
Sbjct: 181 QALLSHRLGRSWRKTPQKPLDKEYIEILHLFGDSETSAFSIHNLLQAGMAYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYAMCRSWETLVRLKRE PNLQD QLPMAMYIVSGDEDGERGGAPVL IDDA RHC EFS
Sbjct: 241 PYAMCRSWETLVRLKRETPNLQDQQLPMAMYIVSGDEDGERGGAPVLFIDDASRHCFEFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQPV+N+DKDDLEADTSSYHCNVIRHIPLESIDPSLA+GFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQPVVNVDKDDLEADTSSYHCNVIRHIPLESIDPSLALGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
            RA+KLAEESDGAPLFTVAETHSANSVRHSNALDG SR VEDDT+  VH+P EEG HEDD
Sbjct: 421 SRASKLAEESDGAPLFTVAETHSANSVRHSNALDGSSRSVEDDTISTVHVPIEEGAHEDD 480

Query: 481 WQFL 485
           WQFL
Sbjct: 481 WQFL 484
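
The percentages in an HSP header are the match counts divided by the alignment length. A short sketch of that arithmetic follows, together with a per-row identity count taken from the first row of the alignment above. The helper names `percent` and `count_identities` are hypothetical, and treating gap characters as non-matching is an assumption.

```python
def percent(matches, aligned_length):
    """Matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query_row, sbjct_row):
    """Count columns where both rows carry the same residue (gaps excluded)."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query_row, sbjct_row))


# First alignment row of the XP_022155850.1 hit (residues 1-60).
query_row = "MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE"
sbjct_row = "MGRGKDLNSICSSGTATDIVDRSQQSVCPESGSKNRTSSKASLWPGFFASTFSIFEHHNE"

print(count_identities(query_row, sbjct_row), "of", len(query_row))
print(percent(446, 484), percent(458, 484))
```

The first row has 54 identical positions out of 60, and the header counts 446/484 and 458/484 reproduce the reported 92.15% and 94.63%.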

BLAST of Sgr020787 vs. NCBI nr
Match: XP_038874853.1 (cysteine protease ATG4-like [Benincasa hispida])

HSP 1 Score: 873.2 bits (2255), Expect = 1.0e-249
Identity = 427/484 (88.22%), Positives = 449/484 (92.77%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSSES T+++DR+ QSVCPE GSKN ISSKASLWSGFF STFSIFE H E
Sbjct: 1   MGRGKDLNSTCSSESTTNVIDRTHQSVCPELGSKNHISSKASLWSGFFTSTFSIFEHHKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKA  SR N W  TVRKVMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAFHSRHNVW-ATVRKVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQA DDAVTS G A YELDFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQAPDDAVTSTGTAGYELDFSSRILMTYRKGFNAIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+AYDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYAMCRSWETLVRLKRE PNLQ+ QLPMA+YIVSGDEDGERGGAPVLCIDDA RHC EFS
Sbjct: 241 PYAMCRSWETLVRLKRETPNLQEQQLPMAIYIVSGDEDGERGGAPVLCIDDASRHCFEFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDW+PILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYI+G+QDEN
Sbjct: 301 KGQLDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPH+VQ V+NIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FC
Sbjct: 361 AFYLDPHDVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA+ESDGAPLFTVAETHS NS RH +AL+  SRLVEDDT G+VHMPNEE  HEDD
Sbjct: 421 YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDRSRLVEDDTDGVVHMPNEE-AHEDD 480

Query: 481 WQFL 485
           WQFL
Sbjct: 481 WQFL 482

BLAST of Sgr020787 vs. NCBI nr
Match: XP_022974003.1 (cysteine protease ATG4-like [Cucurbita maxima] >XP_022974004.1 cysteine protease ATG4-like [Cucurbita maxima] >XP_022974005.1 cysteine protease ATG4-like [Cucurbita maxima] >XP_022974006.1 cysteine protease ATG4-like [Cucurbita maxima])

HSP 1 Score: 847.4 bits (2188), Expect = 6.0e-242
Identity = 415/484 (85.74%), Positives = 443/484 (91.53%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSSES TDIVDRSQQS+CP  GS+N ISSKASLWSGFF STFS+FE + E
Sbjct: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKAV SR N W TTVR+VMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAVHSRHNVW-TTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQASDD VTS+ +A +ELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQASDDVVTSDSIAGFELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+ YDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYA+CRSWETLVRLKRE P  QDLQLPMA+YIVSG+EDGERGGAPVLCIDDA RHC +FS
Sbjct: 241 PYAVCRSWETLVRLKRETPTPQDLQLPMAIYIVSGEEDGERGGAPVLCIDDASRHCFQFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
            GQLDWTPILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYI+G+QDEN
Sbjct: 301 NGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+N DKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNTDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA +S GAPLFTVAETHS+NSVRH NAL+ GSRLV D+    VH+P++EG  EDD
Sbjct: 421 YRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDN--ADVHVPDDEGAQEDD 480

Query: 481 WQFL 485
           WQ L
Sbjct: 481 WQLL 481

BLAST of Sgr020787 vs. NCBI nr
Match: XP_022929983.1 (cysteine protease ATG4-like [Cucurbita moschata] >XP_022929984.1 cysteine protease ATG4-like [Cucurbita moschata] >XP_022929986.1 cysteine protease ATG4-like [Cucurbita moschata] >XP_022929987.1 cysteine protease ATG4-like [Cucurbita moschata] >XP_022929988.1 cysteine protease ATG4-like [Cucurbita moschata])

HSP 1 Score: 847.4 bits (2188), Expect = 6.0e-242
Identity = 418/484 (86.36%), Positives = 443/484 (91.53%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSSES TDIVDRSQQS+CP  GS+N ISSKASLWSGFF STFS+FE + E
Sbjct: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKAV SR N W TTVR+VMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAVHSRHNVW-TTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQASDDAVTS+ VA +ELDFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+ YDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYA+CRSWETLVRLKRE P  QD QLPMA+YIVSGDEDGERGGAPVLCID A RHC +FS
Sbjct: 241 PYAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+NIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA +S GAPLFTVAETHS+NSVRH NAL+ GSRLV D+    VH+P+EEG  EDD
Sbjct: 421 YRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDN--ADVHVPDEEGAQEDD 480

Query: 481 WQFL 485
           WQ L
Sbjct: 481 WQLL 481

BLAST of Sgr020787 vs. NCBI nr
Match: XP_023539406.1 (cysteine protease ATG4-like [Cucurbita pepo subsp. pepo] >XP_023539414.1 cysteine protease ATG4-like [Cucurbita pepo subsp. pepo] >XP_023539422.1 cysteine protease ATG4-like [Cucurbita pepo subsp. pepo] >XP_023539429.1 cysteine protease ATG4-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 846.7 bits (2186), Expect = 1.0e-241
Identity = 417/484 (86.16%), Positives = 443/484 (91.53%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSS+S TDIVDRSQQS+CP  GS+N ISSKASLWSGFF STFS+FE + E
Sbjct: 1   MGRGKDLNSTCSSKSVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKAV SR N W TTVR+VMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAVHSRHNVW-TTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQASDDAVTS+ VA +ELDFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNDIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+ YDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYA+CRSWETLVRLKRE P  QD QLPMA+YIVSGDEDGERGGAPVLCIDDA RHC +FS
Sbjct: 241 PYAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDDASRHCFQFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+NIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA +S GAPLFTVAETHS+NSVRH NAL+ GSRLV D+    VH+P+E G  EDD
Sbjct: 421 YRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDN--ADVHVPDEGGAQEDD 480

Query: 481 WQFL 485
           WQ L
Sbjct: 481 WQIL 481

BLAST of Sgr020787 vs. ExPASy Swiss-Prot
Match: A2Q1V6 (Cysteine protease ATG4 OS=Medicago truncatula OX=3880 GN=ATG4 PE=3 SV=1)

HSP 1 Score: 652.9 bits (1683), Expect = 2.8e-186
Identity = 321/475 (67.58%), Positives = 386/475 (81.26%), Query Frame = 0

Query: 11  CSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKAVQ 70
           CSS+S+T+IVD +Q     ++GS +    KASLWS FF S FS+ E ++ESS SEKK V 
Sbjct: 15  CSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDETYSESSSSEKKTVH 74

Query: 71  SRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDAV 130
           SR +GW   VRKV++ GSMRR  ER+LGS ++ V SS GDIWLLGVC+K+SQ +++ D  
Sbjct: 75  SRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVD 134

Query: 131 TSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGR 190
             N  AA+E DF SRIL+TYRKGF+ I+DSKYTSDVNWGCMLRSSQMLVAQALLFH+LGR
Sbjct: 135 IRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGR 194

Query: 191 SWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWET 250
           SWRKT  KP+DKEYI+IL LFGDSE +AFSIHNLLQAGK Y LA GSWVGPYAMCR+WE 
Sbjct: 195 SWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEV 254

Query: 251 LVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPIL 310
           L R +RE     +  LPMA+Y+VSGDEDGERGGAPV+CI+DAC+ CLEFS+G + WTP+L
Sbjct: 255 LARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLL 314

Query: 311 LLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPHEVQ 370
           LLVPLVLGL+K+N RYIP L++TF FPQSLGILGGKPGASTYI+G+Q++ AFYLDPHEV+
Sbjct: 315 LLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVK 374

Query: 371 PVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLAEES 430
           PV+NI  D  E +TSSYHCN+ RH+PL+SIDPSLAIGFYCRDKDDFDDFC RATKLAEES
Sbjct: 375 PVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDDFCSRATKLAEES 434

Query: 431 DGAPLFTVAETHSANSVRHSNALDG-GSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
           +GAPLFTVA++ S      SN++ G  +R  EDD+L M ++ N+ G +EDDWQFL
Sbjct: 435 NGAPLFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSM-NLVNDAG-NEDDWQFL 487

BLAST of Sgr020787 vs. ExPASy Swiss-Prot
Match: Q8S929 (Cysteine protease ATG4a OS=Arabidopsis thaliana OX=3702 GN=ATG4A PE=2 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 4.9e-162
Identity = 285/475 (60.00%), Positives = 360/475 (75.79%), Query Frame = 0

Query: 11  CSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKAVQ 70
           CSS S +D  D+S   +  +SG  +   SK +LWS  F S+ S+ + + ESS S  K V 
Sbjct: 13  CSSSSKSDTHDKS--PLVSDSGPSDN-KSKFTLWSNVFTSSSSVSQPYRESSTSGHKQVC 72

Query: 71  SRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDA 130
           + +NGWT  V++V M SG++RR  ER+LG  ++G+ S+  D+WLLGVCYK+S D+ S + 
Sbjct: 73  TTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGET 132

Query: 131 VTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLG 190
            T   +AA +LDFSS+ILMTYRKGF   +D+ YTSDVNWGCM+RSSQML AQALLFHRLG
Sbjct: 133 DTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLG 192

Query: 191 RSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWE 250
           R+W K S+ P ++EY+E L  FGDSE SAFSIHNL+ AG +Y LAAGSWVGPYA+CR+WE
Sbjct: 193 RAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWE 252

Query: 251 TLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPI 310
           +L   KR+  + ++  LPMA++IVSG EDGERGGAP+LCI+DA + CLEFSKGQ +WTPI
Sbjct: 253 SLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPI 312

Query: 311 LLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPHEV 370
           +LLVPLVLGL+ +NPRYIPSL  TFTFPQS+GILGGKPGASTYI+G+Q++  FYLDPHEV
Sbjct: 313 ILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEV 372

Query: 371 QPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLAEE 430
           Q V+ ++K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFDDFC RA KLAEE
Sbjct: 373 QQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEE 432

Query: 431 SDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
           S+GAPLFTV +TH+A  +  SN         +DD+         E   EDDWQ L
Sbjct: 433 SNGAPLFTVTQTHTA--INQSN-----YGFADDDS---------EDEREDDWQML 467

BLAST of Sgr020787 vs. ExPASy Swiss-Prot
Match: Q9M1Y0 (Cysteine protease ATG4b OS=Arabidopsis thaliana OX=3702 GN=ATG4B PE=1 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 4.6e-152
Identity = 265/477 (55.56%), Positives = 342/477 (71.70%), Query Frame = 0

Query: 9   SICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKA 68
           S CSS S ++  D S  +      + +   S  +L S   AS+  + +   E+S S    
Sbjct: 11  SKCSSSSTSEKRDISSPTSLVSDSASSDNKSNLTLCSDVVASSSPVSQLCREASTSGHNP 70

Query: 69  VQSRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASD 128
           V +  + WT  ++   M SG++RR  +R+LG  ++G+ SS  +IWLLGVCYK+S+ ++S+
Sbjct: 71  VCTTHSSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSE 130

Query: 129 DAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A     +AA+  DFSS ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF R
Sbjct: 131 EADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQR 190

Query: 189 LGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRS 248
           LGRSWRK   +P D++Y+EIL LFGD+E SAFSIHNL+ AG++Y LAAGSWVGPYA+CRS
Sbjct: 191 LGRSWRKKDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRS 250

Query: 249 WETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWT 308
           WE+L R  +E  + +     MA++IVSG EDGERGGAP+LCI+D  + CLEFS+G+ +W 
Sbjct: 251 WESLARKNKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWP 310

Query: 309 PILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPH 368
           PILLLVPLVLGL+++NPRYIPSL  TFTFPQSLGILGGKPGASTYI+G+Q++  FYLDPH
Sbjct: 311 PILLLVPLVLGLDRVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPH 370

Query: 369 EVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLA 428
           +VQ V+ + K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFDDFC RATKLA
Sbjct: 371 DVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLA 430

Query: 429 EESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
            +S+GAPLFTV ++H  N        D G       T     +  EE  HEDDWQ L
Sbjct: 431 GDSNGAPLFTVTQSHRRN--------DCGIAETSSSTETSTEISGEE--HEDDWQLL 477

BLAST of Sgr020787 vs. ExPASy Swiss-Prot
Match: A2XHJ5 (Cysteine protease ATG4A OS=Oryza sativa subsp. indica OX=39946 GN=ATG4A PE=3 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 4.7e-149
Identity = 266/474 (56.12%), Positives = 336/474 (70.89%), Query Frame = 0

Query: 13  SESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKAVQSR 72
           S S++D +     +    S  ++   SK S+ S  F+S FSIFE H +SS        S 
Sbjct: 10  SPSSSDPLCEGNAAPSSSSSGQDLKQSKNSILSCVFSSPFSIFEAHQDSSAHRPLKPHSG 69

Query: 73  QNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDAVTS 132
              W+  +R++  +GSM     R LG+ K+    +  D+W LG CYK+S ++ S+ +   
Sbjct: 70  SYAWSRFLRRIACTGSM----WRFLGASKA---LTSSDVWFLGKCYKLSSEELSNSSDCE 129

Query: 133 NGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSW 192
           +G AA+  DFSSRI +TYRKGF+ I DSKYTSDVNWGCM+RSSQMLVAQAL+FH LGRSW
Sbjct: 130 SGNAAFLEDFSSRIWITYRKGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSW 189

Query: 193 RKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWETLV 252
           RK SQKP   EYI ILH+FGDSE  AFSIHNLLQAGK+Y LAAGSWVGPYAMCR+W+TLV
Sbjct: 190 RKPSQKPYSPEYIGILHMFGDSEACAFSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLV 249

Query: 253 RLKREIPNLQD--LQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPIL 312
           R  RE     D     PMA+Y+VSGDEDGERGGAPV+CID A + C +F+KGQ  W+PIL
Sbjct: 250 RTNREHHEAVDGNGNFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPIL 309

Query: 313 LLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPHEVQ 372
           LLVPLVLGL+K+NPRYIP L+ TFTFPQSLGILGGKPG STY+ G+QD+   YLDPHEVQ
Sbjct: 310 LLVPLVLGLDKLNPRYIPLLKETFTFPQSLGILGGKPGTSTYVAGVQDDRVLYLDPHEVQ 369

Query: 373 PVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLAEES 432
             ++I  D+LEADTSSYHC+ +R + L+ IDPSLAIGFYCRDKDDFDDFC RA++L +++
Sbjct: 370 LAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKA 429

Query: 433 DGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
           +GAPLFTV ++   +   ++     G  +   D + +  +     T E++WQ L
Sbjct: 430 NGAPLFTVMQSVQPSKQMYNEESSSGDGM---DIINVEGLDGSGETGEEEWQIL 473

BLAST of Sgr020787 vs. ExPASy Swiss-Prot
Match: Q2XPP4 (Cysteine protease ATG4B OS=Oryza sativa subsp. indica OX=39946 GN=ATG4B PE=1 SV=2)

HSP 1 Score: 527.7 bits (1358), Expect = 1.4e-148
Identity = 266/478 (55.65%), Positives = 339/478 (70.92%), Query Frame = 0

Query: 13  SESATDIVDRSQQSVCPESGSKNR----ISSKASLWSGFFASTFSIFERHNESSVSEKKA 72
           S S++D +     + C  S  +        SK S+ S  F S F+IFE H +SS ++   
Sbjct: 10  SSSSSDPLCEGNIAPCSSSSEQKEDCSLKQSKTSILSCVFNSPFNIFEAHQDSSANKSPK 69

Query: 73  VQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDD 132
             S    W   +R+++ SGSM     R LG+ K     +  D+W LG CYK+S +++S D
Sbjct: 70  SSSGSYDWLRVLRRIVCSGSM----WRFLGTSK---VLTSSDVWFLGKCYKLSSEESSSD 129

Query: 133 AVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 192
           + + +G A +  DFSSRI +TYR+GF+ I DSKYTSDVNWGCM+RSSQMLVAQAL+FH L
Sbjct: 130 SDSESGHATFLEDFSSRIWITYRRGFDAISDSKYTSDVNWGCMVRSSQMLVAQALIFHHL 189

Query: 193 GRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSW 252
           GRSWR+ S+KP + EYI ILH+FGDSE  AFSIHNLLQAG +Y LAAGSWVGPYAMCR+W
Sbjct: 190 GRSWRRPSEKPYNPEYIGILHMFGDSEACAFSIHNLLQAGNSYGLAAGSWVGPYAMCRAW 249

Query: 253 ETLVRLKREIPNLQD--LQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDW 312
           +TLVR  RE   + D     PMA+Y+VSGDEDGERGGAPV+CID A + C +F+KGQ  W
Sbjct: 250 QTLVRTNREQHEVVDGNESFPMALYVVSGDEDGERGGAPVVCIDVAAQLCCDFNKGQSTW 309

Query: 313 TPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDP 372
           +PILLLVPLVLGL+KINPRYIP L+ TFTFPQSLGILGGKPG STYI G+QD+ A YLDP
Sbjct: 310 SPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGTSTYIAGVQDDRALYLDP 369

Query: 373 HEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKL 432
           HEVQ  ++I  D++EADTSSYHC+ +R + L+ IDPSLAIGFYCRDKDDFDDFC RAT+L
Sbjct: 370 HEVQMAVDIAADNIEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDFCSRATEL 429

Query: 433 AEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
            ++++GAPLFTV ++   +   ++   D    +  D  + +  +     T E++WQ L
Sbjct: 430 VDKANGAPLFTVVQSVQPSKQMYNQ--DDVLGISGDGNINVEDLDASGETGEEEWQIL 478

BLAST of Sgr020787 vs. ExPASy TrEMBL
Match: A0A6J1DNK0 (Cysteine protease OS=Momordica charantia OX=3673 GN=LOC111022873 PE=3 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 9.6e-262
Identity = 446/484 (92.15%), Positives = 458/484 (94.63%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL SICSS +ATDIVDRSQQSVCPESGSKNR SSKASLW GFFASTFSIFE HNE
Sbjct: 1   MGRGKDLNSICSSGTATDIVDRSQQSVCPESGSKNRTSSKASLWPGFFASTFSIFEHHNE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKK VQSR NGWTTTVRK+MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKTVQSRHNGWTTTVRKIMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQD ASDDAVTSN VA YELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDLASDDAVTSNSVATYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALL HRLGRSWRKT QKPLDKEYIEILHLFGDSE SAFSIHNLLQAG AYDLAAGSWVG
Sbjct: 181 QALLSHRLGRSWRKTPQKPLDKEYIEILHLFGDSETSAFSIHNLLQAGMAYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYAMCRSWETLVRLKRE PNLQD QLPMAMYIVSGDEDGERGGAPVL IDDA RHC EFS
Sbjct: 241 PYAMCRSWETLVRLKRETPNLQDQQLPMAMYIVSGDEDGERGGAPVLFIDDASRHCFEFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQPV+N+DKDDLEADTSSYHCNVIRHIPLESIDPSLA+GFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQPVVNVDKDDLEADTSSYHCNVIRHIPLESIDPSLALGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
            RA+KLAEESDGAPLFTVAETHSANSVRHSNALDG SR VEDDT+  VH+P EEG HEDD
Sbjct: 421 SRASKLAEESDGAPLFTVAETHSANSVRHSNALDGSSRSVEDDTISTVHVPIEEGAHEDD 480

Query: 481 WQFL 485
           WQFL
Sbjct: 481 WQFL 484
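
As a rough illustration of how the Identity and Positives figures in an HSP header relate to the alignment rows, the sketch below counts them from the midline (the row between Query and Sbjct). It assumes the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper names `hsp_stats` and `percent` are hypothetical, not part of any BLAST tool, and the sample rows are columns 11 to 20 of the first block of the Momordica charantia alignment above.

```python
def hsp_stats(query: str, midline: str, subject: str) -> dict:
    """Count identities and positives for one aligned block.

    Assumes the three rows are already trimmed to the aligned columns
    (no "Query:" / "Sbjct:" labels or coordinates) and have equal length.
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("aligned rows must have equal length")
    # A letter in the midline means query and subject share that residue.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {"identities": identities, "positives": positives, "length": len(midline)}


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * count / total, 2)


# Columns 11-20 of the first block (query, midline, subject).
stats = hsp_stats("CSSESATDIV", "CSS +ATDIV", "CSSGTATDIV")
print(stats)

# Whole-HSP totals taken from the header line of the same hit.
print(percent(446, 484), percent(458, 484))
```

Summing the per-block counts over every block of an HSP should reproduce the totals in its header, provided the rows are sliced on exact column boundaries.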

BLAST of Sgr020787 vs. ExPASy TrEMBL
Match: A0A6J1ICU1 (Cysteine protease OS=Cucurbita maxima OX=3661 GN=LOC111472623 PE=3 SV=1)

HSP 1 Score: 847.4 bits (2188), Expect = 2.9e-242
Identity = 415/484 (85.74%), Positives = 443/484 (91.53%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSSES TDIVDRSQQS+CP  GS+N ISSKASLWSGFF STFS+FE + E
Sbjct: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKAV SR N W TTVR+VMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAVHSRHNVW-TTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQASDD VTS+ +A +ELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQASDDVVTSDSIAGFELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+ YDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYA+CRSWETLVRLKRE P  QDLQLPMA+YIVSG+EDGERGGAPVLCIDDA RHC +FS
Sbjct: 241 PYAVCRSWETLVRLKRETPTPQDLQLPMAIYIVSGEEDGERGGAPVLCIDDASRHCFQFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
            GQLDWTPILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYI+G+QDEN
Sbjct: 301 NGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+N DKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNTDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA +S GAPLFTVAETHS+NSVRH NAL+ GSRLV D+    VH+P++EG  EDD
Sbjct: 421 YRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDN--ADVHVPDDEGAQEDD 480

Query: 481 WQFL 485
           WQ L
Sbjct: 481 WQLL 481

BLAST of Sgr020787 vs. ExPASy TrEMBL
Match: A0A6J1ETR9 (Cysteine protease OS=Cucurbita moschata OX=3662 GN=LOC111436439 PE=3 SV=1)

HSP 1 Score: 847.4 bits (2188), Expect = 2.9e-242
Identity = 418/484 (86.36%), Positives = 443/484 (91.53%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSSES TDIVDRSQQS+CP  GS+N ISSKASLWSGFF STFS+FE + E
Sbjct: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKKAV SR N W TTVR+VMTSGSMRRI ERILGSR+SGVYSSGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKAVHSRHNVW-TTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQASDDAVTS+ VA +ELDFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTSQKPLDKEY+EILHLFGDSE SAFSIHNLLQAG+ YDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYA+CRSWETLVRLKRE P  QD QLPMA+YIVSGDEDGERGGAPVLCID A RHC +FS
Sbjct: 241 PYAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+NIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLA +S GAPLFTVAETHS+NSVRH NAL+ GSRLV D+    VH+P+EEG  EDD
Sbjct: 421 YRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDN--ADVHVPDEEGAQEDD 480

Query: 481 WQFL 485
           WQ L
Sbjct: 481 WQLL 481

BLAST of Sgr020787 vs. ExPASy TrEMBL
Match: A0A6J1ELN4 (Cysteine protease OS=Cucurbita moschata OX=3662 GN=LOC111435689 PE=3 SV=1)

HSP 1 Score: 844.7 bits (2181), Expect = 1.9e-241
Identity = 415/485 (85.57%), Positives = 441/485 (90.93%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDL S CSS S TD +DR+QQSVCPE GSK+ +SSKASLWS F  ST +IFE H E
Sbjct: 1   MGRGKDLNSTCSSASTTDSIDRTQQSVCPELGSKDHVSSKASLWSSFLTSTLTIFEHHKE 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
            SVSEKK   SRQN W TTVRKVMTSGSMRR+ ERILGS +SGVYSSGGDIWLLGVC+K+
Sbjct: 61  PSVSEKKTFHSRQNVW-TTVRKVMTSGSMRRLQERILGSHRSGVYSSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQDQA DD VTSNG A +ELDFSSRIL+TYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA
Sbjct: 121 SQDQAPDD-VTSNGAAGFELDFSSRILITYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRKTS+KPLDKEY+EILHLFGDSE SAFSIHN+LQAG+AYDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKTSRKPLDKEYVEILHLFGDSEKSAFSIHNILQAGRAYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYAMCRSWETLVRLKRE PNLQD QLPMA+YIVSGDEDGERGGAPVL IDDA RHC EFS
Sbjct: 241 PYAMCRSWETLVRLKRETPNLQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQLDWTPILLLVPLVLGLEKINPRYI SL+TTFTFPQSLGILGGKP  STYI+G+QDEN
Sbjct: 301 KGQLDWTPILLLVPLVLGLEKINPRYILSLKTTFTFPQSLGILGGKPSVSTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+NIDKDDL+ADTSSYHCNVIRHIPLES+DPSLAIGFYCRDKDDFDDFC
Sbjct: 361 AFYLDPHEVQQVVNIDKDDLDADTSSYHCNVIRHIPLESMDPSLAIGFYCRDKDDFDDFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPN-EEGTHED 480
           YRA+KLA++SDGAPLFTVAETHS NS RH NALD  SRLVEDDT G+VHMPN EE  HED
Sbjct: 421 YRASKLADKSDGAPLFTVAETHSTNSGRHGNALDNCSRLVEDDTDGVVHMPNEEEEAHED 480

Query: 481 DWQFL 485
           DWQFL
Sbjct: 481 DWQFL 483

BLAST of Sgr020787 vs. ExPASy TrEMBL
Match: A0A5A7SP82 (Cysteine protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold340G00200 PE=3 SV=1)

HSP 1 Score: 837.8 bits (2163), Expect = 2.3e-239
Identity = 410/484 (84.71%), Positives = 436/484 (90.08%), Query Frame = 0

Query: 1   MGRGKDLKSICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNE 60
           MGRGKDLKS CSSE+  D++DR+ +SVC E GSKN ISSKASLWSGFF+S FSI + H +
Sbjct: 1   MGRGKDLKSTCSSETTADVIDRTHRSVCSELGSKNHISSKASLWSGFFSSNFSICDHHKD 60

Query: 61  SSVSEKKAVQSRQNGWTTTVRKVMTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKV 120
           SSVSEKK   SR N W  TVRKVMTSGSMRRI ERILGSR+SGVY+SGGDIWLLGVC+K+
Sbjct: 61  SSVSEKKVFHSRHNVW-ATVRKVMTSGSMRRIQERILGSRRSGVYTSGGDIWLLGVCHKI 120

Query: 121 SQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVA 180
           SQD   DDA +S GVA +E DFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLV+
Sbjct: 121 SQDHLPDDAASSTGVAGFEQDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVS 180

Query: 181 QALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVG 240
           QALLFHRLGRSWRK SQKP DKEY+EILHLFGDSE SAFSIHNLLQAG+AYDLAAGSWVG
Sbjct: 181 QALLFHRLGRSWRKPSQKPFDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVG 240

Query: 241 PYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFS 300
           PYAMCRSWETLVR KRE P LQD QLPMA+YIVSGDEDGERGGAPVL IDDA RHC EFS
Sbjct: 241 PYAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFS 300

Query: 301 KGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDEN 360
           KGQ DW+PILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYI+G+QDEN
Sbjct: 301 KGQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDEN 360

Query: 361 AFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFC 420
           AFYLDPHEVQ V+NIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FC
Sbjct: 361 AFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 420

Query: 421 YRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDD 480
           YRA+KLAEESDGAPLFTVAETHS NS R S+AL+  SRLVEDD  G VHMPNEE  HEDD
Sbjct: 421 YRASKLAEESDGAPLFTVAETHSTNSGRQSSALNDHSRLVEDDADGAVHMPNEEEAHEDD 480

Query: 481 WQFL 485
           WQFL
Sbjct: 481 WQFL 483

BLAST of Sgr020787 vs. TAIR 10
Match: AT2G44140.1 (Peptidase family C54 protein)

HSP 1 Score: 572.4 bits (1474), Expect = 3.5e-163
Identity = 285/475 (60.00%), Positives = 360/475 (75.79%), Query Frame = 0

Query: 11  CSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKAVQ 70
           CSS S +D  D+S   +  +SG  +   SK +LWS  F S+ S+ + + ESS S  K V 
Sbjct: 13  CSSSSKSDTHDKS--PLVSDSGPSDN-KSKFTLWSNVFTSSSSVSQPYRESSTSGHKQVC 72

Query: 71  SRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDA 130
           + +NGWT  V++V M SG++RR  ER+LG  ++G+ S+  D+WLLGVCYK+S D+ S + 
Sbjct: 73  TTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGET 132

Query: 131 VTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLG 190
            T   +AA +LDFSS+ILMTYRKGF   +D+ YTSDVNWGCM+RSSQML AQALLFHRLG
Sbjct: 133 DTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLG 192

Query: 191 RSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWE 250
           R+W K S+ P ++EY+E L  FGDSE SAFSIHNL+ AG +Y LAAGSWVGPYA+CR+WE
Sbjct: 193 RAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWE 252

Query: 251 TLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPI 310
           +L   KR+  + ++  LPMA++IVSG EDGERGGAP+LCI+DA + CLEFSKGQ +WTPI
Sbjct: 253 SLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPI 312

Query: 311 LLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPHEV 370
           +LLVPLVLGL+ +NPRYIPSL  TFTFPQS+GILGGKPGASTYI+G+Q++  FYLDPHEV
Sbjct: 313 ILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEV 372

Query: 371 QPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLAEE 430
           Q V+ ++K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFDDFC RA KLAEE
Sbjct: 373 QQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEE 432

Query: 431 SDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
           S+GAPLFTV +TH+A  +  SN         +DD+         E   EDDWQ L
Sbjct: 433 SNGAPLFTVTQTHTA--INQSN-----YGFADDDS---------EDEREDDWQML 467

BLAST of Sgr020787 vs. TAIR 10
Match: AT2G44140.2 (Peptidase family C54 protein)

HSP 1 Score: 552.4 bits (1422), Expect = 3.7e-157
Identity = 268/426 (62.91%), Positives = 333/426 (78.17%), Query Frame = 0

Query: 60  ESSVSEKKAVQSRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCY 119
           ESS S  K V + +NGWT  V++V M SG++RR  ER+LG  ++G+ S+  D+WLLGVCY
Sbjct: 14  ESSTSGHKQVCTTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCY 73

Query: 120 KVSQDQASDDAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQML 179
           K+S D+ S +  T   +AA +LDFSS+ILMTYRKGF   +D+ YTSDVNWGCM+RSSQML
Sbjct: 74  KISADENSGETDTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQML 133

Query: 180 VAQALLFHRLGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSW 239
            AQALLFHRLGR+W K S+ P ++EY+E L  FGDSE SAFSIHNL+ AG +Y LAAGSW
Sbjct: 134 FAQALLFHRLGRAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSW 193

Query: 240 VGPYAMCRSWETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLE 299
           VGPYA+CR+WE+L   KR+  + ++  LPMA++IVSG EDGERGGAP+LCI+DA + CLE
Sbjct: 194 VGPYAICRAWESLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLE 253

Query: 300 FSKGQLDWTPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQD 359
           FSKGQ +WTPI+LLVPLVLGL+ +NPRYIPSL  TFTFPQS+GILGGKPGASTYI+G+Q+
Sbjct: 254 FSKGQSEWTPIILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQE 313

Query: 360 ENAFYLDPHEVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDD 419
           +  FYLDPHEVQ V+ ++K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFDD
Sbjct: 314 DKGFYLDPHEVQQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDD 373

Query: 420 FCYRATKLAEESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHE 479
           FC RA KLAEES+GAPLFTV +TH+A  +  SN         +DD+         E   E
Sbjct: 374 FCLRALKLAEESNGAPLFTVTQTHTA--INQSN-----YGFADDDS---------EDERE 422

Query: 480 DDWQFL 485
           DDWQ L
Sbjct: 434 DDWQML 422

BLAST of Sgr020787 vs. TAIR 10
Match: AT3G59950.1 (Peptidase family C54 protein)

HSP 1 Score: 539.3 bits (1388), Expect = 3.2e-153
Identity = 265/477 (55.56%), Positives = 342/477 (71.70%), Query Frame = 0

Query: 9   SICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKA 68
           S CSS S ++  D S  +      + +   S  +L S   AS+  + +   E+S S    
Sbjct: 11  SKCSSSSTSEKRDISSPTSLVSDSASSDNKSNLTLCSDVVASSSPVSQLCREASTSGHNP 70

Query: 69  VQSRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASD 128
           V +  + WT  ++   M SG++RR  +R+LG  ++G+ SS  +IWLLGVCYK+S+ ++S+
Sbjct: 71  VCTTHSSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSE 130

Query: 129 DAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A     +AA+  DFSS ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF R
Sbjct: 131 EADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQR 190

Query: 189 LGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRS 248
           LGRSWRK   +P D++Y+EIL LFGD+E SAFSIHNL+ AG++Y LAAGSWVGPYA+CRS
Sbjct: 191 LGRSWRKKDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRS 250

Query: 249 WETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWT 308
           WE+L R  +E  + +     MA++IVSG EDGERGGAP+LCI+D  + CLEFS+G+ +W 
Sbjct: 251 WESLARKNKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWP 310

Query: 309 PILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIMGIQDENAFYLDPH 368
           PILLLVPLVLGL+++NPRYIPSL  TFTFPQSLGILGGKPGASTYI+G+Q++  FYLDPH
Sbjct: 311 PILLLVPLVLGLDRVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPH 370

Query: 369 EVQPVMNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRATKLA 428
           +VQ V+ + K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFDDFC RATKLA
Sbjct: 371 DVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLA 430

Query: 429 EESDGAPLFTVAETHSANSVRHSNALDGGSRLVEDDTLGMVHMPNEEGTHEDDWQFL 485
            +S+GAPLFTV ++H  N        D G       T     +  EE  HEDDWQ L
Sbjct: 431 GDSNGAPLFTVTQSHRRN--------DCGIAETSSSTETSTEISGEE--HEDDWQLL 477

BLAST of Sgr020787 vs. TAIR 10
Match: AT3G59950.2 (Peptidase family C54 protein)

HSP 1 Score: 341.3 bits (874), Expect = 1.3e-93
Identity = 170/324 (52.47%), Positives = 228/324 (70.37%), Query Frame = 0

Query: 9   SICSSESATDIVDRSQQSVCPESGSKNRISSKASLWSGFFASTFSIFERHNESSVSEKKA 68
           S CSS S ++  D S  +      + +   S  +L S   AS+  + +   E+S S    
Sbjct: 11  SKCSSSSTSEKRDISSPTSLVSDSASSDNKSNLTLCSDVVASSSPVSQLCREASTSGHNP 70

Query: 69  VQSRQNGWTTTVRKV-MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASD 128
           V +  + WT  ++   M SG++RR  +R+LG  ++G+ SS  +IWLLGVCYK+S+ ++S+
Sbjct: 71  VCTTHSSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSE 130

Query: 129 DAVTSNGVAAYELDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A     +AA+  DFSS ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF R
Sbjct: 131 EADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQR 190

Query: 189 LGRSWRKTSQKPLDKEYIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRS 248
           LGRSWRK   +P D++Y+EIL LFGD+E SAFSIHNL+ AG++Y LAAGSWVGPYA+CRS
Sbjct: 191 LGRSWRKKDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRS 250

Query: 249 WETLVRLKREIPNLQDLQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWT 308
           WE+L R  +E  + +     MA++IVSG EDGERGGAP+LCI+D  + CLEFS+G+ +W 
Sbjct: 251 WESLARKNKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWP 310

Query: 309 PILLLVPLVLGLEKINPR--YIPS 330
           PILLLVPLVLGL+++NP   ++PS
Sbjct: 311 PILLLVPLVLGLDRVNPSHFHVPS 334

BLAST of Sgr020787 vs. TAIR 10
Match: AT3G59950.3 (Peptidase family C54 protein)

HSP 1 Score: 332.0 bits (850), Expect = 7.9e-91
Identity = 151/244 (61.89%), Positives = 195/244 (79.92%), Query Frame = 0

Query: 84  MTSGSMRRIHERILGSRKSGVYSSGGDIWLLGVCYKVSQDQASDDAVTSNGVAAYELDFS 143
           M SG++RR  +R+LG  ++G+ SS  +IWLLGVCYK+S+ ++S++A     +AA+  DFS
Sbjct: 1   MASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSEEADAGRVLAAFRQDFS 60

Query: 144 SRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKTSQKPLDKE 203
           S ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF RLGRSWRK   +P D++
Sbjct: 61  SLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQRLGRSWRKKDSEPADEK 120

Query: 204 YIEILHLFGDSEMSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSWETLVRLKREIPNLQD 263
           Y+EIL LFGD+E SAFSIHNL+ AG++Y LAAGSWVGPYA+CRSWE+L R  +E  + + 
Sbjct: 121 YLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRSWESLARKNKEETDDKH 180

Query: 264 LQLPMAMYIVSGDEDGERGGAPVLCIDDACRHCLEFSKGQLDWTPILLLVPLVLGLEKIN 323
               MA++IVSG EDGERGGAP+LCI+D  + CLEFS+G+ +W PILLLVPLVLGL+++N
Sbjct: 181 KSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWPPILLLVPLVLGLDRVN 240

Query: 324 PRYI 328
           PR++
Sbjct: 241 PRFV 244

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022155850.1  2.0e-261  92.15         cysteine protease ATG4-like [Momordica charantia]
XP_038874853.1  1.0e-249  88.22         cysteine protease ATG4-like [Benincasa hispida]
XP_022974003.1  6.0e-242  85.74         cysteine protease ATG4-like [Cucurbita maxima] >XP_022974004.1 cysteine protease...
XP_022929983.1  6.0e-242  86.36         cysteine protease ATG4-like [Cucurbita moschata] >XP_022929984.1 cysteine protea...
XP_023539406.1  1.0e-241  86.16         cysteine protease ATG4-like [Cucurbita pepo subsp. pepo] >XP_023539414.1 cystein...

Match Name  E-value   Identity (%)  Description
A2Q1V6      2.8e-186  67.58         Cysteine protease ATG4 OS=Medicago truncatula OX=3880 GN=ATG4 PE=3 SV=1
Q8S929      4.9e-162  60.00         Cysteine protease ATG4a OS=Arabidopsis thaliana OX=3702 GN=ATG4A PE=2 SV=1
Q9M1Y0      4.6e-152  55.56         Cysteine protease ATG4b OS=Arabidopsis thaliana OX=3702 GN=ATG4B PE=1 SV=1
A2XHJ5      4.7e-149  56.12         Cysteine protease ATG4A OS=Oryza sativa subsp. indica OX=39946 GN=ATG4A PE=3 SV=...
Q2XPP4      1.4e-148  55.65         Cysteine protease ATG4B OS=Oryza sativa subsp. indica OX=39946 GN=ATG4B PE=1 SV=...

Match Name  E-value   Identity (%)  Description
A0A6J1DNK0  9.6e-262  92.15         Cysteine protease OS=Momordica charantia OX=3673 GN=LOC111022873 PE=3 SV=1
A0A6J1ICU1  2.9e-242  85.74         Cysteine protease OS=Cucurbita maxima OX=3661 GN=LOC111472623 PE=3 SV=1
A0A6J1ETR9  2.9e-242  86.36         Cysteine protease OS=Cucurbita moschata OX=3662 GN=LOC111436439 PE=3 SV=1
A0A6J1ELN4  1.9e-241  85.57         Cysteine protease OS=Cucurbita moschata OX=3662 GN=LOC111435689 PE=3 SV=1
A0A5A7SP82  2.3e-239  84.71         Cysteine protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold340G00...

Match Name   E-value   Identity (%)  Description
AT2G44140.1  3.5e-163  60.00         Peptidase family C54 protein
AT2G44140.2  3.7e-157  62.91         Peptidase family C54 protein
AT3G59950.1  3.2e-153  55.56         Peptidase family C54 protein
AT3G59950.2  1.3e-93   52.47         Peptidase family C54 protein
AT3G59950.3  7.9e-91   61.89         Peptidase family C54 protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                             Source       Source Term     Source Description       Alignment
IPR005078  Peptidase C54                               PFAM         PF03416         Peptidase_C54            coord: 139..421; e-value: 8.2E-87; score: 291.2
IPR005078  Peptidase C54                               PANTHER      PTHR22624       CYSTEINE PROTEASE ATG4   coord: 56..484
None       No IPR available                            PANTHER      PTHR22624:SF54  CYSTEINE PROTEASE ATG4B  coord: 56..484
IPR038765  Papain-like cysteine peptidase superfamily  SUPERFAMILY  54001           Cysteine proteinases     coord: 99..440
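
The `coord:` values in the Alignment column can be turned into span lengths with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this explicitly). The names `COORD_RE`, `span_length`, and `hits` are hypothetical and used for illustration only.

```python
import re

# Matches strings such as "coord: 139..421" and captures start and end.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment: str) -> int:
    """Length of a match, assuming 1-based inclusive coordinates."""
    match = COORD_RE.search(alignment)
    if match is None:
        raise ValueError(f"no coordinates found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Source term -> Alignment column text, copied from the table above.
hits = {
    "PF03416": "coord: 139..421",
    "PTHR22624": "coord: 56..484",
    "PTHR22624:SF54": "coord: 56..484",
    "54001": "coord: 99..440",
}

lengths = {term: span_length(text) for term, text in hits.items()}
for term, length in lengths.items():
    print(f"{term}: {length} residues")
```

Under that assumption the Pfam Peptidase_C54 match spans 283 residues, and the PANTHER family match covers most of the protein.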

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr020787.1   Sgr020787.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006914      autophagy
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005737      cytoplasm
molecular_function  GO:0008234      cysteine-type peptidase activity
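
For downstream use, the GO assignments above can be grouped by category. The sketch below is one possible way to do that, assuming each row is whitespace-separated with the category first, the accession second, and the term name as the remainder. `GO_ROWS` and `group_go_terms` are hypothetical names introduced for this example.

```python
from collections import defaultdict

# Rows copied from the GO Assignments table (header line omitted).
GO_ROWS = """
biological_process GO:0006914 autophagy
biological_process GO:0006508 proteolysis
cellular_component GO:0005737 cytoplasm
molecular_function GO:0008234 cysteine-type peptidase activity
"""


def group_go_terms(text: str) -> dict:
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two whitespace runs only, so multi-word
        # term names such as "cysteine-type peptidase activity" stay intact.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name.strip()))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)
for category, terms in go_by_category.items():
    print(category, terms)
```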