CmoCh12G001520 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh12G001520
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionCysteine proteinases superfamily protein
LocationCmo_Chr12: 986368 .. 995113 (+)
RNA-Seq ExpressionCmoCh12G001520
SyntenyCmoCh12G001520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACCCAGTACGTTGAACACGGGTTTTTAGGAACCGGAGGGCTACTGATTCCGCGCCAAACAGCCCTAATTGGCTCCATTCTCAGCAACCAAGTGGCCCAGAGCTTTGACACGATCTTCGTTTACCAGAGCTTTACTTCGTCCTCCTGTGATTACGATGGCAGTTTGATTTCTTGGGATCGTTTGATCTCTCAGTGATCCGCTCTCATCTCGCCATGAACGACGCCTCTGTCAAAGGCCTCGAAGTCTTCGATTTCACCGAAGAAGACGAGCTTCCTGAATTGATCTCCGAGAAGTGTCTCAGCAAATTCAAAAACCCTAATCTCGAAACTAACTCTGTTTTGAAGTATGGATGTCTTGAATTAGGTAGGTATTCCTGGACTGGTACTACTTTTCGTTTTATTTATTTATTTGTCTGCAAACACTAGTTGGAGATGGATGGATGGATGGATGGATAAGAGTCTTTAATTTTGATTGTAAACTGAATATCTCAGAGAAGGTTGATCACTATATTGGTTCTGTTAGCCTTTTTCATTTGGTAGATCGTGGAGTAGTCTGTAGATTGCTAAACGTCTATTTCTTGTAGATTTACGGTGAAATACAGACGAATTTGAAATATGTTGAATGGCATCTCAATGAGTTGAATTGTTATTTCTGGCGTCTTGTAGCATATCGCCGATTGCTGAAAATTATTTACTTCCAATGGGGAATTAAAGTGAATATGTGATCTGATAACTCTGGAGATTTACGTTTGAAAGAGGATTTCTGGAATGATTTATTTTTCATCTTTCCTTATGGAGCACAAATTTCTTTACTTCTGGTATCATAAATTAGTGAAAATTGCTATGGTAACTTTTTTCCGCACACGTTTGATGATATCATTCGTCTTGTACTCTTATAGCTGGAAACAAACATTAACTACCGTGATGTTCATCTCTATTGTTGTGCTTTTTATTTAATTCTCAATATGGCTTGTCCTTGTCCAAGTTACCATTTCAGAATTGCTTTCTTTTTGTGCCCTTTCTTTAGTGAATGTCCGTTCAACAATGCTGCTGCAAGTTTCTTCTTGCTATCATTATCAAAATTAGTTTCTTTGTTGATGGCCATCCATTTTAATAATTAATGAAGATTAATTAACAGTGTCCAATGTGTTGAAGGGAAAGAGGTTGAAAATCCACATATGGATGTTGACTTGGATGAATGTAATCGTGGTTGTGACAATGGTATTTCACATATCCCCCCGGGCATATCTAAAGAGCAATTGATCATGGAAGAAGAGAAATATCAATTGGATGTCAATACAGAATCAGAATGGAATACTCATTCACAAGACATGTTTGTGCAAGTAAATAATCACGTGACAGGATGCTTTGGCTCTGAGCTTGGAAAAGTTGGATCTAGCTCTCAAAGTTCTATCCTGGGGTTAAATTGCTCTCTTCCTGAGTTTGCCGCTAAAGTACCCTAACTTATTGCTTTCTGAACATTACTTTCTGTTGAGATATATAGGAGTATTGGTCATGTTTGATTAACATTCTCTTTTTCAATGTACAGAGGGAGCAAGTGGATGCACTTTCATATCCTAATGGAAGCATGAATGGGAGTTCTCCAATGAGCTCTCCTTCGGAGCTTGTAGAGGATAGTGGTATGTTCTGTACACTGTGTCCACTTCAACATTCTTTTGGTTCTTTTCCTGAGGAGAACAATTGCATCAATAAGGAAGATCAGGGGACACACACACACACGCACACTATAACTAATAACTTACCCAAGTTCTATGCCTGACTATTATTATTTGGACAATTTCAGTTTCGCCGAATGGGAAGTCTTCAGATAAGTGCTCATCTGACAATGAAATGGTAAGTTTCTTCCTTGTGATCATTTAAACAGAAAAAGAAAAGGGCCACACACTTATGTCTTAAAATTTTCCTGAAATTTATTTCCAATTTTCATTTTTCGTTTAATCTCCTCCAGGACTTTTGAAGTCTTAACTACTTTTATCAAGAAGTATAGGCTGTCCGGATGTTCATGTTGTTCATACATATTATGAATTCTACATGATTGGTCCTTGTCAAGGGTTTTTGATCTTGAAGGAGACTATAGTTTCCTCTTATTCTTGAAGAGTGTTTGAAGAGAAAAACACTGGTTAGGAAGAAAGAATTGAAATCGTGAAATATGAGGCACCTTCTTGGATGCTGTAGTGTTTTATCAAGTTATTTTTGTAATTATAATTCTAAGGTGTCTGATTTCATTTGAGGAGCTTCTCTTTCTTTCTCTCTGTTTCTTTTAGTAATTGAATTGCTTACAACTTCGGATGTATAGTTGATTACTTAAGTGGAATATGTTTTTTGTCTTTTTTTTAATGATTATTTTCTGTAGGCAGATTCAACTTATCAAAATTCATTGTCCAGAATGTAGGCATTTGGGTACTGAAGCTTTTCTCTGTTACTTTTATAGGATGATCTTAACCAAGAGGTTGTTCTATGTCCCGATTATATTATATATGGAGATTCTTACTGTACCAGTTCGCAGTTAACCTTTTCACATAATGGCGTCAAAATTAATGGTTTTTCTGATTATGGAAGCAACGAGTTCCTTAACCTTGAATGGGGAGTTGATGATCTCATTAATATTGAGTGTCATTGGTTTCAAAGGGTGAGTTCTTATATCTTTTCTTTAGAAACTAAATTTTATGATACTTGAGTTCTATTTTTTTGTGAATTGCCAGTTTTTCATTTGGCTACAGGTTGAATTTGTGACGATTAAGCTCCATACTATATCAAAGGATGTTGGTCAACGTGATAATGGATGTGACACTTCTGGTCAGTGGACTTCTACAAGTCTATTTTGAGTTTTTTTTGGAATTAGATTTCAGTAGCATAAACATTTGATAGTATTTCAATGTATATCTTTGCTACTTTGACAGCTCATTAGAGAAGCCAGGTTCAGTTTAAGTTTTTTTTTCTTAAATTTTAATTTCAATTATGTTGACATGGCTAAGATTATTAGTAACTTCACGACATAACTTATGGCCTAACATCTACGTTCTTCTCGACATCCTCATTGCCTTTTTCTTGTGTCATTTCATCAGTGTTCTCAATAATGTTATGATAGAAATCTTTCTTGTTGTTAGTTTTTCTAATTCATATTTATGAACTTGCTTAAGGCCTTAAGGAAGTGAAGATTGTTCTAGTTGATCCTTGCTGGTCTGAGAAACAACAAAAAATCAGATCTTTGGATTCCAGATACATGGCTATTTGGAATATGTCTCTTGAGTAAGTTATCTGTGTGTTCTTATCATTATAGCTTTTATCACATGTATTTTTTTAAAAACAGAGTTATAAACCTAGCAACACTTCAAGATGTCAGTTTATGTGGCATATCTTAAAGAATTATCGGGGGTTACATTCTTCATTTCATGGTTAACACAGTCATGTGTAAAATGTCTCCCAAGTCACACGCACACACTGGTTGCATTCTCGAATAAACTTTGTTTTGATTCCTTTTACGTATATTGGTCTCCCTTGTTCATAGAATTATTGGTTACCTCTATTATGTCTTGTAGTGTGGGCGATGATGATTTGGGTGGACCAAGACAATATTTCCCCAAGTGAGTGGAATTTTCTACTTTTTTTTTTCCTTTTAAAGCTTATTTCATGTTTTGTAAACTAGTTTGTTTGAAATTCATTTCTAACTGCTTCTTAAAAACTATATATATTTCTTTTAATTTTGAGTAAAAGATTCATGTTTACCATGCTATGATATTCAAAATGTCATGCATGCAGATCATTCTTCTTTCAACTTCAGTTTTCAATTGTTATATGGTCAATACATGAACTCGAGTACATTATCGTTAATACTTTTTGAGATTGGCGGTGGCCACCTTTTTCAATGAGTATGAATCTGTGTCCAGCTTTGATGAGCATTTTGAAGAAGTTGTCTATCCCAAAGGAGATCCAGATGCCGTTTCCATTAGTAAGAGGGATGTTGACTTGTTGCAACCAGAGACATTTGTCAATGATACAATCATTGACTTTTACATCCAGTGAGTCAAATTATTTAGTGATTGCTTGTAATGCATGATTGAATACCATTACTGCCAGGAGGTTTTCCCCCTGTATATTTTCCTAGTGATGTAGGAATTTTCAGTTTCTCATTTAAAGAAGTTCTGATCCACAGTTCTTTTATTTTCTCCATTTCCTTGCTTGGATGACTGGTTTTCTTATTAACAAACTTTTATTGCAAATATTGAATGACCACCTTTCTAAATAGAATTGATTGACAGGTATTTGAAGAGCCAGATAGACCCTAAGGAGAAGCATAGATTTCACTTCTTCAATAGCTTTTTCTTTAGGAAGCTAGCTGACCTTGATAAAGATCCCTCAAGTGCTTCTGATGGCAAAGCTGCTTTTCTTCGTGTTCGAAAATGGACTCGGAAAGTGAATTTATTTGACAAGGATTATATCTTCATTCCTATAAACTTCAAGTAAGCCTTTTCTCTTGAAGGAAAGATTCTTTTGTGTGATAAGCTAAACAGCATCGGAAGTGGGTATCCTTTTCAACTTTATATGTTTGTTATAACAGCCTTCATTGGAGCTTAATGGTCATATGCCATCCTGGTGAAGTGGCTAGATATAGCGGTAAGTTCTTCACTCATTGGGTCAATTTGTGTCTCCCACCAGTCTTAACTTCTGTCTTGTCACAGATGAAAACCTGATGAAGTCAACAAAAGTACCGTGTATATTGCATATGGATTCTATTAAGGGAAGTCACGCAGGCCTTAAAAATCTCATTCAAAGGTATTCCTTTGTTTTTATAGCTTTTATCATGTTCAAGTTATTGGTTCGAACAGTGGATCTTAGATATCCGAACAAAGGTTAATCTTCCATGTCATGGTTAGTTTGACATTGCATCTTATTTTTATTTACTGACATAGAATTAGTCTGTTCCATGTTGAGAAGTTTGATGAAGACTTCAGAAGAGTTTTCAGTATTGCCTTCATTTGCATATTGCAACCAACCTCTTTTAATCTCTCTGTTAAGTAATTTTTTTGTTGCTTTTTATTTTTATAATTTTTTTTTATTGCTAGAGTAATGTGTGTATATATATAGGGGGCGTTTGTTTCTAAAGGTTGGGATGAGATTTGAGATGGGAAATGACATGAGTTTGTTTGGGGGTGGAGATTTTACTTATTGCCTCGGAAAACTCTTTTTCCAGGCAACAAAAAAGTTGCTATGGAAGTGATTTTAAAACGCATCCTTTCTTTCTTTCTTTTTTTTTTTTTTTGTTCCTTTTTCCTTTTTTCCTTTTGTATTATTTCTTCCTCCACTTATTTATTTATTATTTTTCATTTTTTTTACTTACTTACCATTTGGTATGGCAATTGGAGGAACTTATTCACAAAAAATGTCTCCCGGGGGTTTTGGTTGCCACCATTTGATATAGCAGCTTGAAGTAGGATCTTTCATAATCCTATCAAAAAAGAAGGATTATTCACAATGTTTTCCTCTTTTTGACCAAGAATACAACATTGAGTTGATACCTGCAAGTTTGAATTGAGGCTATCGATAGGTTCAACTTTCATTCTCTGTTCAATGTGTTAATCTTGCGATGGAGAGCCCAGAAAAAAGGATTATTCAATGTGTTAATTTTTAGGTATCTTCCATCTGCGAGCAAACACCACAGTTCTTTTAGTTGGTATTCTGTCCTTTTTAAAATGAAGGTAGTTTGCCGTCTTATAAAAAAAGGGTTTTTAGTAATTTGCTGACGCTGTTTGTTCGTTCTTACTACTTTTTGGCTTTTGCATTTTGTTTTCATTATTGGTTATAAAGCAAGCCAATTCCATCATTTCTGCGTTTATAGTTTCAATTTTTTTTTTCTTTTGCAAGTTCCTTCTTAATCACTGGGACCTCGTGATGCAATATATGACAGTTATTTGCTGGAAGAATGGAACGAAAGAAACAAGGATGCATCTGAAGATATTTCATCAAAGTTTAAGAACCTTCGGTTTCTCCCACTCGAGGTACTTCTGGCTGTGAACTTCATTATATTGATATTCAAGAACTTTCGGTTTTAGGATACTCATGCATCTGGTTCCTTTATTGTAACACATGATGTTTGGTAATATGGTCCTACGTTTATTTCTCTTTATCTTTCTTTTTTCGGGGGTTATCTTACACATCCATGTTTCATGTCAGCTTCCACAGCAGGAAAATTCATTTGATTGCGGATTATTTTTGCTCCACTATCTAGAACTCTTTTTGGCAGAAGCTCCTCTTGATTTCAGTCCGTTCAAAATCTCCAAGCTTTCAAAATTTGTAAGTTGTCCGCTTTGAACCCTTATTTGGAAATTTTAACTAATATATCTTAGATATAAGAAAATTTCTAGTTAACCACCTCGACTTAATAAATTTGTTTTTTTTTTTCAAACAATTGTGTGAGTTGCGGAAAAGATGCACGAGATTGGGTTTATTATTCTTATGGCTTATGTCTAATTTATAAATCCTTAGTTCTAATTCCTTTATTATCTTTTTATTGATGCCGTCGCATTTCATTTCGTCTCGATGAAATCCCTTTCCAGATTAGTCTCGAAGCTTTTTCCACATTATGAACTTTTAACTCTTCATCCTTCTGTTTTGTTAATTGGACATGCGGTGTGAAAGCCTTGTAGGGTGCCATTTCTGATCTCACCTTCACAATTGAAAGCATTGATGACATTCTTAGATCACCTCGACATGGAATTCTCACATGACTCATACTCAATAGGATGATTGACACGTGACACAACTTTGGGACTGGGATAAATTTGAACGTTTATTATTATTAAGAAAATGTTAGAAAGATAATATAGCTTTTAGATTTTTTCTGCGTTGTGTGTGTGTGTGTTCCATCAAAGTTTCTGAGTTCACAAAAATGAACTGTTATAAAAAAGAGAGATATGAACTAATCTTCCTTGGATTTACTTTTATCTTCTGAATTAGACGTTAATAATTTGATTGTTTTTTTTCAATGAATCTCCCCAGCTTAATGTGGATTGGTTCCCACCTGCTGAGGCTTACCTCAAGCGAACTTTAATACAGAGATTAATTTTTGAAATCCTTGAAAAACGATCTCGACAAATGTCCACCACTGCTTGCAGTGATGATCTCCTTTCTAAATTTCCATCAAACAATGAGGATGAAGCTGGTGTGGAGCTTGTTCCGGAAAGTGGAAGACTGGCAGAGACATTCAATCACAATTTGTCAAGCTCACAAGCTGCTGATGGGATTGAAATTACTCTGTTATCTGAATCTTCGAGTAGACATAACCACTTTATGGACGGTTCTGGCTTGGTTGTCAGAGAACTATTTGAACCAGGCACATCGAATGGATCATTACTTGGACACTATCAATCTTTTGCCCAGACATCATCGTATTTTGATACAAATGTTACCGTGTTGGAGGTATGTTTTAGTTTCTCTTATTATCTTTTGCCCAACAATAAAAGAAGCAAATTGATGCAATCTCTTACACAGTTTTTTATATTTTGTGTATCGTCACTTAGGCAAATGTGTATATATATATATTTCAATTATGATTGGTAATAATGGTTCATACTTGTGCCAACGTTAAGAAACATGATGAAAAATGGACAAAATATTTCCTTGCTATCTTCATGCAATTCCAGTTCGTTAGTTCTAGATGATACCTGTTCCTGTAGAGAACAATTGTTAGTATGCCTACAGCTCTGATGCTATAAAATTTTCTCATTAATAGGAAGATGCAGATACGGAAACTGGAGACCGTTTCATGTACTTGCCTTCAGAACAGGATGGTTTGCAGCCAGTTGATGCAATGACATCTCAAGCTTGTCGTTTTCCGGGTTCGTCAAGAGGCCTAGAATCCGAGACTGCTTTTGACTTGTGCATGTCTATTCAACCAGAGCATGGTAGTGGCATTGCCTCATCCCCCTCAAGTCACTCAGATGTCTTAGAAGATGTGGGAATCATTGAAAGTTGCGATGTTAGGGAACCAAGCCCTGGTAACAAAGAAGAAATTAACAGAAAAAGGCCCTTACCAATTGAGAACCTGGAACCTATAGCAGAATGCCCCACTTCTGCTGCTACCACGACGCAAGATACCGACACCATTGTCGTTTCTAAAGATACTAATGATACTTGTGAAGACATGGAAAATGATGGTTCTGATCCTCATTGTAAAGAAACTGTCGTTGCATCACCGTGTCTAGATGAGGATATAACAACCAAAACAGACATCGAACATGATGATGCAGTGTTGGTTGCTTCGGTTACTGAGGTCGACTTACACGAGCAACCTCCAGCCAAGAAATCGAGGCATTTGCCATACCCCGAAGAAGCAAGCAACGGTCTCGGAGACGTTCTGCTAGAGGATGCAGACTTATAAGATGGAAGGTTGGCGTGTTCTTTTTCGACATATTTGTAAATGATCATGTCAGTAGGGTTTGGCTTGCCATGCTGTCAATAGTGTAATGGCTCCATATCTGCTTAATTAGTACCGATAGATAATTCATTTGGGTAACGTTGATGTAACCCATGCTAGTTATATCAAGGATTAGTTTTAATTCCTCGCCATCACCAACATCACACTGTTCTTTTCTCTTCTTTTCCTCTAATTTTGCCATTTTATTTTATTTTTATTTTTATTTTTTATTGGAAGAATCAAAGTTTTGTGTAACTATTTGCTCTACAGCTGCAAGCTCCTTCCTAAACTCAACGCATATTGTCAATAGCG

mRNA sequence

AACCCAGTACGTTGAACACGGGTTTTTAGGAACCGGAGGGCTACTGATTCCGCGCCAAACAGCCCTAATTGGCTCCATTCTCAGCAACCAAGTGGCCCAGAGCTTTGACACGATCTTCGTTTACCAGAGCTTTACTTCGTCCTCCTGTGATTACGATGGCAGTTTGATTTCTTGGGATCGTTTGATCTCTCAGTGATCCGCTCTCATCTCGCCATGAACGACGCCTCTGTCAAAGGCCTCGAAGTCTTCGATTTCACCGAAGAAGACGAGCTTCCTGAATTGATCTCCGAGAAGTGTCTCAGCAAATTCAAAAACCCTAATCTCGAAACTAACTCTGTTTTGAAGTATGGATGTCTTGAATTAGGGAAAGAGGTTGAAAATCCACATATGGATGTTGACTTGGATGAATGTAATCGTGGTTGTGACAATGGTATTTCACATATCCCCCCGGGCATATCTAAAGAGCAATTGATCATGGAAGAAGAGAAATATCAATTGGATGTCAATACAGAATCAGAATGGAATACTCATTCACAAGACATGTTTGTGCAAGTAAATAATCACGTGACAGGATGCTTTGGCTCTGAGCTTGGAAAAGTTGGATCTAGCTCTCAAAGTTCTATCCTGGGGTTAAATTGCTCTCTTCCTGAGTTTGCCGCTAAAAGGGAGCAAGTGGATGCACTTTCATATCCTAATGGAAGCATGAATGGGAGTTCTCCAATGAGCTCTCCTTCGGAGCTTGTAGAGGATAGTGTTTCGCCGAATGGGAAGTCTTCAGATAAGTGCTCATCTGACAATGAAATGGATGATCTTAACCAAGAGGTTGTTCTATGTCCCGATTATATTATATATGGAGATTCTTACTGTACCAGTTCGCAGTTAACCTTTTCACATAATGGCGTCAAAATTAATGGTTTTTCTGATTATGGAAGCAACGAGTTCCTTAACCTTGAATGGGGAGTTGATGATCTCATTAATATTGAGTGTCATTGGTTTCAAAGGGTTGAATTTGTGACGATTAAGCTCCATACTATATCAAAGGATGTTGGTCAACGTGATAATGGATGTGACACTTCTGGCCTTAAGGAAGTGAAGATTGTTCTAGTTGATCCTTGCTGGTCTGAGAAACAACAAAAAATCAGATCTTTGGATTCCAGATACATGGCTATTTGGAATATGTCTCTTGACTTTGATGAGCATTTTGAAGAAGTTGTCTATCCCAAAGGAGATCCAGATGCCGTTTCCATTAGTAAGAGGGATGTTGACTTGTTGCAACCAGAGACATTTGTCAATGATACAATCATTGACTTTTACATCCAGTATTTGAAGAGCCAGATAGACCCTAAGGAGAAGCATAGATTTCACTTCTTCAATAGCTTTTTCTTTAGGAAGCTAGCTGACCTTGATAAAGATCCCTCAAGTGCTTCTGATGGCAAAGCTGCTTTTCTTCGTGTTCGAAAATGGACTCGGAAAGTGAATTTATTTGACAAGGATTATATCTTCATTCCTATAAACTTCAACCTTCATTGGAGCTTAATGGTCATATGCCATCCTGGTGAAGTGGCTAGATATAGCGATGAAAACCTGATGAAGTCAACAAAAGTACCGTGTATATTGCATATGGATTCTATTAAGGGAAGTCACGCAGGCCTTAAAAATCTCATTCAAAGTTATTTGCTGGAAGAATGGAACGAAAGAAACAAGGATGCATCTGAAGATATTTCATCAAAGTTTAAGAACCTTCGGTTTCTCCCACTCGAGCTTCCACAGCAGGAAAATTCATTTGATTGCGGATTATTTTTGCTCCACTATCTAGAACTCTTTTTGGCAGAAGCTCCTCTTGATTTCAGTCCGTTCAAAATCTCCAAGCTTTCAAAATTTCTTAATGTGGATTGGTTCCCACCTGCTGAGGCTTACCTCAAGCGAACTTTAATACAGAGATTAATTTTTGAAATCCTTGAAAAACGATCTCGACAAATGTCCACCACTGCTTGCAGTGATGATCTCCTTTCTAAATTTCCATCAAACAATGAGGATGAAGCTGGTGTGGAGCTTGTTCCGGAAAGTGGAAGACTGGCAGAGACATTCAATCACAATTTGTCAAGCTCACAAGCTGCTGATGGGATTGAAATTACTCTGTTATCTGAATCTTCGAGTAGACATAACCACTTTATGGACGGTTCTGGCTTGGTTGTCAGAGAACTATTTGAACCAGGCACATCGAATGGATCATTACTTGGACACTATCAATCTTTTGCCCAGACATCATCGTATTTTGATACAAATGTTACCGTGTTGGAGGAAGATGCAGATACGGAAACTGGAGACCGTTTCATGTACTTGCCTTCAGAACAGGATGGTTTGCAGCCAGTTGATGCAATGACATCTCAAGCTTGTCGTTTTCCGGGTTCGTCAAGAGGCCTAGAATCCGAGACTGCTTTTGACTTGTGCATGTCTATTCAACCAGAGCATGGTAGTGGCATTGCCTCATCCCCCTCAAGTCACTCAGATGTCTTAGAAGATGTGGGAATCATTGAAAGTTGCGATGTTAGGGAACCAAGCCCTGGTAACAAAGAAGAAATTAACAGAAAAAGGCCCTTACCAATTGAGAACCTGGAACCTATAGCAGAATGCCCCACTTCTGCTGCTACCACGACGCAAGATACCGACACCATTGTCGTTTCTAAAGATACTAATGATACTTGTGAAGACATGGAAAATGATGGTTCTGATCCTCATTGTAAAGAAACTGTCGTTGCATCACCGTGTCTAGATGAGGATATAACAACCAAAACAGACATCGAACATGATGATGCAGTGTTGGTTGCTTCGGTTACTGAGGTCGACTTACACGAGCAACCTCCAGCCAAGAAATCGAGGCATTTGCCATACCCCGAAGAAGCAAGCAACGGTCTCGGAGACGTTCTGCTAGAGGATGCAGACTTATAAGATGGAAGGTTGGCGTGTTCTTTTTCGACATATTTGTAAATGATCATGTCAGTAGGGTTTGGCTTGCCATGCTGTCAATAGTGTAATGGCTCCATATCTGCTTAATTAGTACCGATAGATAATTCATTTGGGTAACGTTGATGTAACCCATGCTAGTTATATCAAGGATTAGTTTTAATTCCTCGCCATCACCAACATCACACTGTTCTTTTCTCTTCTTTTCCTCTAATTTTGCCATTTTATTTTATTTTTATTTTTATTTTTTATTGGAAGAATCAAAGTTTTGTGTAACTATTTGCTCTACAGCTGCAAGCTCCTTCCTAAACTCAACGCATATTGTCAATAGCG

Coding sequence (CDS)

ATGAACGACGCCTCTGTCAAAGGCCTCGAAGTCTTCGATTTCACCGAAGAAGACGAGCTTCCTGAATTGATCTCCGAGAAGTGTCTCAGCAAATTCAAAAACCCTAATCTCGAAACTAACTCTGTTTTGAAGTATGGATGTCTTGAATTAGGGAAAGAGGTTGAAAATCCACATATGGATGTTGACTTGGATGAATGTAATCGTGGTTGTGACAATGGTATTTCACATATCCCCCCGGGCATATCTAAAGAGCAATTGATCATGGAAGAAGAGAAATATCAATTGGATGTCAATACAGAATCAGAATGGAATACTCATTCACAAGACATGTTTGTGCAAGTAAATAATCACGTGACAGGATGCTTTGGCTCTGAGCTTGGAAAAGTTGGATCTAGCTCTCAAAGTTCTATCCTGGGGTTAAATTGCTCTCTTCCTGAGTTTGCCGCTAAAAGGGAGCAAGTGGATGCACTTTCATATCCTAATGGAAGCATGAATGGGAGTTCTCCAATGAGCTCTCCTTCGGAGCTTGTAGAGGATAGTGTTTCGCCGAATGGGAAGTCTTCAGATAAGTGCTCATCTGACAATGAAATGGATGATCTTAACCAAGAGGTTGTTCTATGTCCCGATTATATTATATATGGAGATTCTTACTGTACCAGTTCGCAGTTAACCTTTTCACATAATGGCGTCAAAATTAATGGTTTTTCTGATTATGGAAGCAACGAGTTCCTTAACCTTGAATGGGGAGTTGATGATCTCATTAATATTGAGTGTCATTGGTTTCAAAGGGTTGAATTTGTGACGATTAAGCTCCATACTATATCAAAGGATGTTGGTCAACGTGATAATGGATGTGACACTTCTGGCCTTAAGGAAGTGAAGATTGTTCTAGTTGATCCTTGCTGGTCTGAGAAACAACAAAAAATCAGATCTTTGGATTCCAGATACATGGCTATTTGGAATATGTCTCTTGACTTTGATGAGCATTTTGAAGAAGTTGTCTATCCCAAAGGAGATCCAGATGCCGTTTCCATTAGTAAGAGGGATGTTGACTTGTTGCAACCAGAGACATTTGTCAATGATACAATCATTGACTTTTACATCCAGTATTTGAAGAGCCAGATAGACCCTAAGGAGAAGCATAGATTTCACTTCTTCAATAGCTTTTTCTTTAGGAAGCTAGCTGACCTTGATAAAGATCCCTCAAGTGCTTCTGATGGCAAAGCTGCTTTTCTTCGTGTTCGAAAATGGACTCGGAAAGTGAATTTATTTGACAAGGATTATATCTTCATTCCTATAAACTTCAACCTTCATTGGAGCTTAATGGTCATATGCCATCCTGGTGAAGTGGCTAGATATAGCGATGAAAACCTGATGAAGTCAACAAAAGTACCGTGTATATTGCATATGGATTCTATTAAGGGAAGTCACGCAGGCCTTAAAAATCTCATTCAAAGTTATTTGCTGGAAGAATGGAACGAAAGAAACAAGGATGCATCTGAAGATATTTCATCAAAGTTTAAGAACCTTCGGTTTCTCCCACTCGAGCTTCCACAGCAGGAAAATTCATTTGATTGCGGATTATTTTTGCTCCACTATCTAGAACTCTTTTTGGCAGAAGCTCCTCTTGATTTCAGTCCGTTCAAAATCTCCAAGCTTTCAAAATTTCTTAATGTGGATTGGTTCCCACCTGCTGAGGCTTACCTCAAGCGAACTTTAATACAGAGATTAATTTTTGAAATCCTTGAAAAACGATCTCGACAAATGTCCACCACTGCTTGCAGTGATGATCTCCTTTCTAAATTTCCATCAAACAATGAGGATGAAGCTGGTGTGGAGCTTGTTCCGGAAAGTGGAAGACTGGCAGAGACATTCAATCACAATTTGTCAAGCTCACAAGCTGCTGATGGGATTGAAATTACTCTGTTATCTGAATCTTCGAGTAGACATAACCACTTTATGGACGGTTCTGGCTTGGTTGTCAGAGAACTATTTGAACCAGGCACATCGAATGGATCATTACTTGGACACTATCAATCTTTTGCCCAGACATCATCGTATTTTGATACAAATGTTACCGTGTTGGAGGAAGATGCAGATACGGAAACTGGAGACCGTTTCATGTACTTGCCTTCAGAACAGGATGGTTTGCAGCCAGTTGATGCAATGACATCTCAAGCTTGTCGTTTTCCGGGTTCGTCAAGAGGCCTAGAATCCGAGACTGCTTTTGACTTGTGCATGTCTATTCAACCAGAGCATGGTAGTGGCATTGCCTCATCCCCCTCAAGTCACTCAGATGTCTTAGAAGATGTGGGAATCATTGAAAGTTGCGATGTTAGGGAACCAAGCCCTGGTAACAAAGAAGAAATTAACAGAAAAAGGCCCTTACCAATTGAGAACCTGGAACCTATAGCAGAATGCCCCACTTCTGCTGCTACCACGACGCAAGATACCGACACCATTGTCGTTTCTAAAGATACTAATGATACTTGTGAAGACATGGAAAATGATGGTTCTGATCCTCATTGTAAAGAAACTGTCGTTGCATCACCGTGTCTAGATGAGGATATAACAACCAAAACAGACATCGAACATGATGATGCAGTGTTGGTTGCTTCGGTTACTGAGGTCGACTTACACGAGCAACCTCCAGCCAAGAAATCGAGGCATTTGCCATACCCCGAAGAAGCAAGCAACGGTCTCGGAGACGTTCTGCTAGAGGATGCAGACTTATAA

Protein sequence

MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMDVDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTGCFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDPCWSEKQQKIRSLDSRYMAIWNMSLDFDEHFEEVVYPKGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKRSRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSESSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGDRFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHSDVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSKDTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPAKKSRHLPYPEEASNGLGDVLLEDADL
Homology
BLAST of CmoCh12G001520 vs. ExPASy Swiss-Prot
Match: Q8L7S0 (Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=ULP2B PE=1 SV=3)

HSP 1 Score: 609.8 bits (1571), Expect = 5.2e-173
Identity = 378/908 (41.63%), Postives = 530/908 (58.37%), Query Frame = 0

Query: 7   KGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKY--------------------G 66
           K  EVFDF EEDEL E  + K L KF NP+   + VL+                      
Sbjct: 3   KNFEVFDFKEEDELAESAAGKLLEKFTNPSPCNSPVLQRQRIQSFCNEKRVEEEEMEGPS 62

Query: 67  CLELGKEVENPH-------------------MDVDLDECNRGCDNGISHIPPGISKEQLI 126
           C E    VE+                     +   L+  +   +    H+  G+    L 
Sbjct: 63  CAEPATAVESDDHQCEDDSTLVTEAKESRTILTFGLETTDHLEETDAEHVNQGLML-GLK 122

Query: 127 MEEEKYQLDVNTESE---WNTHSQDMFVQVN-NHVTGCFGSELGK---VGSSSQSSILGL 186
            E+   + D++ ++    +  +S+D   + + +H    F  +LG       +S  S   L
Sbjct: 123 TEDLAKETDIDHDNHGLMFGLNSEDDIEETDVDHRVESFSCQLGGNSFYAETSSYSQRQL 182

Query: 187 NCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDSVSPNGKSSDKCSSDNEMDDL 246
           N    + ++  EQ+D +S  + S++  S +S  S+  +D        ++ C +D E  DL
Sbjct: 183 NSPFSDSSSSEEQIDMMSAIDESLSDRSALSEASDSEDDE---EDWMTEHCFNDEEKIDL 242

Query: 247 NQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGSNEFLNLEWGVDDLINIECHW 306
           +  V++  +Y+I  D +C +S + FS NG+KI  F         + E+GV+D+++I+ +W
Sbjct: 243 STAVIMTSEYVILKDMHCAASLVIFSCNGIKIKSFLANNEEVPFSCEFGVEDIVSIQYNW 302

Query: 307 FQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDPCWSEKQQKIRSLDSRYMAIW 366
           +Q V  + +++  + KD    ++      ++E+KI + +  W  KQQKI SL  +Y A+W
Sbjct: 303 YQNVGLIILRIRVLLKDENCHED------MEELKIAVKEHNWPNKQQKINSLHVKYPAVW 362

Query: 367 NMSLD-------------------FDEHFEEVVYPKGDPDAVSISKRDVDLLQPETFVND 426
           N  L+                   FDE FE+VVYPKGDPDAVSI KRDV+LLQPETFVND
Sbjct: 363 NTDLEDDVEVSGYNLNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVND 422

Query: 427 TIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKV 486
           TIIDFYI YLK+QI  +EKHRFHFFNSFFFRKLADLDKDPSS +DGKAAFLRVRKWTRKV
Sbjct: 423 TIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKV 482

Query: 487 NLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAGLK 546
           ++F KDYIF+P+N+NLHWSL+VICHPGEVA  +D +L  S KVPCILHMDSIKGSHAGLK
Sbjct: 483 DMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLK 542

Query: 547 NLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFLAEA 606
           NL+Q+YL EEW ER+K+ S+DISS+F NLRF+ LELPQQENSFDCGLFLLHYLELFLAEA
Sbjct: 543 NLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEA 602

Query: 607 PLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKRSRQMSTTACSDDLLSK 666
           PL+FSPFKI   S FL ++WFPPAEA LKRTLIQ+LIFE+LE RSR++S      +   +
Sbjct: 603 PLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLENRSREVSN---EQNQSCE 662

Query: 667 FPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSESSSRHNHFMDGSGLVV 726
            P    D+ G+E++ E        N +++ +Q   GIE+TLL  SS RH    + SG+V+
Sbjct: 663 SPVAVNDDMGIEVLSERCSPLIDCNGDMTQTQDDQGIEMTLLERSSMRHIQAANDSGMVL 722

Query: 727 RELFEPGTSN-GSLLGHYQS-FAQTSSYFD-TNVTVLEEDADTETGDRFMYLPSEQDGLQ 786
           R+LF+ G++N GSLL   Q  F   SS++  +N +   E  D ETG++FM L + +   Q
Sbjct: 723 RDLFDSGSNNTGSLLEQLQQPFEDPSSFYHLSNDSSAREQVDMETGEQFMCLNAGEGNFQ 782

Query: 787 PVDAMTSQACRFPGSSRGLESETAFDLCMS-IQPEHGSGIASSPSSHSDVLEDVGIIESC 838
            +   T        S R   S ++++L +  +Q E  + + S  S+ S+  + +GIIE  
Sbjct: 783 CITETT--------SPRASNSFSSWNLGIPLVQKEDETDLLSETSNSSN--DSIGIIEDN 842

BLAST of CmoCh12G001520 vs. ExPASy Swiss-Prot
Match: Q0WKV8 (Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana OX=3702 GN=ULP2A PE=2 SV=2)

HSP 1 Score: 400.6 bits (1028), Expect = 4.8e-110
Identity = 209/447 (46.76%), Postives = 290/447 (64.88%), Query Frame = 0

Query: 154 VDALSYPNGSMNGSSPMSSPSELVEDSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIY 213
           +D +S  +    G   ++S S    D VS  G++++  S  +E+D  N +V++ PD IIY
Sbjct: 99  IDVISNGSHRRIGIDSLTSSSLSENDEVS-TGEATNPASDPHEVDPENAQVLIIPDVIIY 158

Query: 214 GDSYCTSSQLTFSHNGVKINGFSDYGSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHT 273
           GD YCT+S+LTFS N + +   S   +    + +W ++D+I IE  W   VE   + +  
Sbjct: 159 GDIYCTNSKLTFSRNCMNVESSSVNATKGTFSCQWTIEDIIKIESQWCLEVETAFVNVLL 218

Query: 274 ISKDVGQRDNGCDTSGLKEVKIVLVDPCWSEKQQKIRSLDSRYMAIW------NMSLDFD 333
            S+     D   D SG+  +K  + DP WS++ + IRSLDSRY  IW      +  + F 
Sbjct: 219 KSRKPEGVDIAKDISGIDLLKFSVYDPKWSKEVETIRSLDSRYKNIWFDTITESEEIAFS 278

Query: 334 EH------------FEEVVYPKGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQI 393
            H            FE++VYP+G+PDAV + K+D++LL+P  F+NDTIIDFYI+YLK++I
Sbjct: 279 GHDLGTSLTNLADSFEDLVYPQGEPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRI 338

Query: 394 DPKEKHRFHFFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINF 453
            PKE+ RFHFFN FFFRKLA+LDK   S   G+ A+ RV+KWT+ V+LF+KDYIFIPIN 
Sbjct: 339 SPKERGRFHFFNCFFFRKLANLDKGTPSTCGGREAYQRVQKWTKNVDLFEKDYIFIPINC 398

Query: 454 NLHWSLMVICHPGEVARYSDENLMKSTKVPCILHMDSIKGSH-AGLKNLIQSYLLEEWNE 513
           + HWSL++ICHPGE+     EN     +VPCILH+DSIKGSH  GL N+  SYL EEW  
Sbjct: 399 SFHWSLVIICHPGELVPSHVEN---PQRVPCILHLDSIKGSHKGGLINIFPSYLREEWKA 458

Query: 514 RNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLS 573
           R+++ + D SS+  N++ + LELPQQENSFDCGLFLLHYL+LF+A+AP  F+P  IS+ +
Sbjct: 459 RHENTTND-SSRAPNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSA 518

Query: 574 KFLNVDWFPPAEAYLKRTLIQRLIFEI 582
            FL  +WFP  EA LKR  I  L++ +
Sbjct: 519 NFLTRNWFPAKEASLKRRNILELLYNL 540

BLAST of CmoCh12G001520 vs. ExPASy Swiss-Prot
Match: Q8RWN0 (Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana OX=3702 GN=ULP1C PE=1 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 1.7e-35
Identity = 93/269 (34.57%), Postives = 148/269 (55.02%), Query Frame = 0

Query: 331 EEVVYPKGDP----DAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEK--HRFH 390
           E++ YP  D     D V +S +D+  L P  ++   +I+FYI+Y++  +   +K     H
Sbjct: 315 EDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIRYVQHHVFSADKTAANCH 374

Query: 391 FFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVI 450
           FFN+FF++KL +        +D  A F++ R+W +  +LF K YIFIPI+ +LHWSL++I
Sbjct: 375 FFNTFFYKKLTEAVS--YKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHEDLHWSLVII 434

Query: 451 CHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAG--LKNLIQSYLLEEWNERNKDASED 510
           C P +     DE+ +       I+H+DS+ G H    + N ++ +L EEWN  N+DA  D
Sbjct: 435 CIPDK----EDESGL------TIIHLDSL-GLHPRNLIFNNVKRFLREEWNYLNQDAPLD 494

Query: 511 ISSKFKNLRFLP-------LELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSK 570
           +    K  R LP       +++PQQ+N FDCGLFLL ++  F+ EAP   +   +    K
Sbjct: 495 LPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL----K 554

Query: 571 FLNVDWFPPAEAYLKRTLIQRLIFEILEK 585
            ++  WF P EA   R  I  ++ ++  K
Sbjct: 555 MIHKKWFKPEEASALRIKIWNILVDLFRK 566

BLAST of CmoCh12G001520 vs. ExPASy Swiss-Prot
Match: Q2PS26 (Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana OX=3702 GN=ULP1D PE=1 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 1.4e-32
Identity = 89/264 (33.71%), Postives = 149/264 (56.44%), Query Frame = 0

Query: 331 EEVVYP-KGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEK--HRFHFFN 390
           E++ YP + DP  V +  +D++ L P  ++   +++FY+++L+ QI    +     HFFN
Sbjct: 330 EDICYPTRDDPHFVQVCLKDLECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFN 389

Query: 391 SFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHP 450
           ++F++KL+  D      +D  A F+R R+W + ++LF K YIFIPI+ +LHWSL+++C P
Sbjct: 390 TYFYKKLS--DAVTYKGNDKDAFFVRFRRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCIP 449

Query: 451 GEVARYSDENLMKSTKVPCILHMDSIKGSHA--GLKNLIQSYLLEEWNERNK-DASEDIS 510
            +     DE+ +       ILH+DS+ G H+   +   ++ +L +EWN  N+ D S D+ 
Sbjct: 450 DK----KDESGL------TILHLDSL-GLHSRKSIVENVKRFLKDEWNYLNQDDYSLDLP 509

Query: 511 SKFKNLRFLP-------LELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFL 570
              K  + LP       +++PQQ+N FDCG F+L +++ F+ EAP      K   L  F 
Sbjct: 510 ISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFIKRFIEEAP---QRLKRKDLGMF- 569

Query: 571 NVDWFPPAEAYLKRTLIQRLIFEI 582
           +  WF P EA   R  I+  + E+
Sbjct: 570 DKKWFRPDEASALRIKIRNTLIEL 576

BLAST of CmoCh12G001520 vs. ExPASy Swiss-Prot
Match: O13769 (Ubiquitin-like-specific protease 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=ulp2 PE=1 SV=2)

HSP 1 Score: 137.1 bits (344), Expect = 9.8e-31
Identity = 83/231 (35.93%), Postives = 123/231 (53.25%), Query Frame = 0

Query: 333 VVYPKGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQI---DPKEKHRFHFFNSF 392
           +VYP    ++++I+  D+  L    F+NDTI+DFY++YL  ++   +P   +  H FN+F
Sbjct: 337 LVYPFSGTNSIAITNTDLTRLNEGEFLNDTIVDFYLRYLYCKLQTQNPSLANDTHIFNTF 396

Query: 393 FFRKLADLDKDPSSASDGKAAFLR-VRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPG 452
           F+ +L   DK      DGK    R VRKWT+KV+LF K YI +PIN   HW L +IC+  
Sbjct: 397 FYNRLTSKDK------DGKRLGHRGVRKWTQKVDLFHKKYIIVPINETFHWYLAIICNID 456

Query: 453 EV------ARYSDENLMKS---------------TKVPCILHMDSIKGSHAGLKNLIQSY 512
            +          DE +M S               +  P IL  DS+   H G  N ++ Y
Sbjct: 457 RLMPVDTKLEEQDEIVMSSVEQPSASKTRQAELTSNSPAILIFDSLANLHKGALNYLREY 516

Query: 513 LLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFL 539
           LLEE  ER     +++  K  ++R    ++PQQ N  DCG++ LH++ELFL
Sbjct: 517 LLEEAFER-----KNVHLKSTDIRGFHAKVPQQSNFSDCGIYALHFVELFL 556

BLAST of CmoCh12G001520 vs. ExPASy TrEMBL
Match: A0A6J1GHC8 (probable ubiquitin-like-specific protease 2B isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454203 PE=3 SV=1)

HSP 1 Score: 1831.2 bits (4742), Expect = 0.0e+00
Identity = 911/926 (98.38%), Postives = 911/926 (98.38%), Query Frame = 0

Query: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60
           MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD
Sbjct: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60

Query: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120
           VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG
Sbjct: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120

Query: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180
           CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS
Sbjct: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180

Query: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240
           VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS
Sbjct: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240

Query: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300
           NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP
Sbjct: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300

Query: 301 CWSEKQQKIRSLDSRYMAIWNMSLD---------------FDEHFEEVVYPKGDPDAVSI 360
           CWSEKQQKIRSLDSRYMAIWNMSLD               FDEHFEEVVYPKGDPDAVSI
Sbjct: 301 CWSEKQQKIRSLDSRYMAIWNMSLDVGDDDLGGPRQYFPNFDEHFEEVVYPKGDPDAVSI 360

Query: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420
           SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS
Sbjct: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420

Query: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480
           DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP
Sbjct: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480

Query: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540
           CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD
Sbjct: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540

Query: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR 600
           CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR
Sbjct: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR 600

Query: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660
           SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE
Sbjct: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660

Query: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720
           SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD
Sbjct: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720

Query: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780
           RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS
Sbjct: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780

Query: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840
           DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK
Sbjct: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840

Query: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 900
           DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA
Sbjct: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 900

Query: 901 KKSRHLPYPEEASNGLGDVLLEDADL 912
           KKSRHLPYPEEASNGLGDVLLEDADL
Sbjct: 901 KKSRHLPYPEEASNGLGDVLLEDADL 926

BLAST of CmoCh12G001520 vs. ExPASy TrEMBL
Match: A0A6J1KHS5 (probable ubiquitin-like-specific protease 2B isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495917 PE=3 SV=1)

HSP 1 Score: 1773.4 bits (4592), Expect = 0.0e+00
Identity = 882/926 (95.25%), Postives = 897/926 (96.87%), Query Frame = 0

Query: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60
           MN+ASVKGLEVFDFTEEDELPELISE+CLSKFKNPNLE N+VLKYGCLELGKEVENPHMD
Sbjct: 1   MNNASVKGLEVFDFTEEDELPELISERCLSKFKNPNLEINTVLKYGCLELGKEVENPHMD 60

Query: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120
           VDLDECNRGCDNGISHIP GISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG
Sbjct: 61  VDLDECNRGCDNGISHIPSGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120

Query: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180
           CFGSELGKVGSSSQSSILGLNC+LPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS
Sbjct: 121 CFGSELGKVGSSSQSSILGLNCTLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180

Query: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240
           VSPNG+SSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNG+KINGFSDYGS
Sbjct: 181 VSPNGRSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGIKINGFSDYGS 240

Query: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300
           NEFLNLEW VDDL+NIECHWFQRVEFVTIKLHTISKDVGQRDN C TSGLKEVKIVLVDP
Sbjct: 241 NEFLNLEWRVDDLVNIECHWFQRVEFVTIKLHTISKDVGQRDNACGTSGLKEVKIVLVDP 300

Query: 301 CWSEKQQKIRSLDSRYMAIWNMSLD---------------FDEHFEEVVYPKGDPDAVSI 360
           CWSEKQQKIRSLDSRYMAIWNMSLD               FDEHFEEVVYPKGDPDAVSI
Sbjct: 301 CWSEKQQKIRSLDSRYMAIWNMSLDAGDDDLGGPRQYFPNFDEHFEEVVYPKGDPDAVSI 360

Query: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420
           SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS
Sbjct: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420

Query: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480
           DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENL+KSTKVP
Sbjct: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLIKSTKVP 480

Query: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540
           CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD
Sbjct: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540

Query: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR 600
           CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILE R
Sbjct: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILENR 600

Query: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660
           SR+MSTTACSDD LSKFPSNNEDEAGVELVPESGRLAETF+HNLSSSQAADGIEITLLSE
Sbjct: 601 SREMSTTACSDD-LSKFPSNNEDEAGVELVPESGRLAETFDHNLSSSQAADGIEITLLSE 660

Query: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720
           SSSRHNHF+DGSGLVVRELFEPGTSNGSLLGHYQ FAQTSSYFDTNVTVLEEDADTETGD
Sbjct: 661 SSSRHNHFIDGSGLVVRELFEPGTSNGSLLGHYQPFAQTSSYFDTNVTVLEEDADTETGD 720

Query: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780
           RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS
Sbjct: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780

Query: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840
           DVLED+GIIESCDVREPSPGNKEEI RKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK
Sbjct: 781 DVLEDMGIIESCDVREPSPGNKEEIIRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840

Query: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 900
           DTNDTCEDMENDGSDPH KETV ASPCLDEDI+TKTD+EHDDAV+ ASVTEVDLHEQPPA
Sbjct: 841 DTNDTCEDMENDGSDPHRKETVFASPCLDEDISTKTDVEHDDAVVDASVTEVDLHEQPPA 900

Query: 901 KKSRHLPYPEEASNGLGDVLLEDADL 912
           KK RH PYPEEASNGLGDVLLEDADL
Sbjct: 901 KKPRHFPYPEEASNGLGDVLLEDADL 925

BLAST of CmoCh12G001520 vs. ExPASy TrEMBL
Match: A0A6J1GIE5 (probable ubiquitin-like-specific protease 2B isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454203 PE=3 SV=1)

HSP 1 Score: 1731.1 bits (4482), Expect = 0.0e+00
Identity = 871/926 (94.06%), Postives = 871/926 (94.06%), Query Frame = 0

Query: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60
           MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD
Sbjct: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60

Query: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120
           VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG
Sbjct: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120

Query: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180
           CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS
Sbjct: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180

Query: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240
           VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS
Sbjct: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240

Query: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300
           NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP
Sbjct: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300

Query: 301 CWSEKQQKIRSLDSRYMAIWNMSLD---------------FDEHFEEVVYPKGDPDAVSI 360
           CWSEKQQKIRSLDSRYMAIWNMSLD               FDEHFEEVVYPKGDPDAVSI
Sbjct: 301 CWSEKQQKIRSLDSRYMAIWNMSLDVGDDDLGGPRQYFPNFDEHFEEVVYPKGDPDAVSI 360

Query: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420
           SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS
Sbjct: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420

Query: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480
           DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP
Sbjct: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480

Query: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540
           CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLE         
Sbjct: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLE--------- 540

Query: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR 600
                                          LNVDWFPPAEAYLKRTLIQRLIFEILEKR
Sbjct: 541 -------------------------------LNVDWFPPAEAYLKRTLIQRLIFEILEKR 600

Query: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660
           SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE
Sbjct: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660

Query: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720
           SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD
Sbjct: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720

Query: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780
           RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS
Sbjct: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780

Query: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840
           DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK
Sbjct: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840

Query: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 900
           DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA
Sbjct: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 886

Query: 901 KKSRHLPYPEEASNGLGDVLLEDADL 912
           KKSRHLPYPEEASNGLGDVLLEDADL
Sbjct: 901 KKSRHLPYPEEASNGLGDVLLEDADL 886

BLAST of CmoCh12G001520 vs. ExPASy TrEMBL
Match: A0A6J1GIH9 (probable ubiquitin-like-specific protease 2B isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111454203 PE=3 SV=1)

HSP 1 Score: 1713.4 bits (4436), Expect = 0.0e+00
Identity = 853/868 (98.27%), Postives = 853/868 (98.27%), Query Frame = 0

Query: 59  MDVDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHV 118
           MDVDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHV
Sbjct: 1   MDVDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHV 60

Query: 119 TGCFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVE 178
           TGCFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVE
Sbjct: 61  TGCFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVE 120

Query: 179 DSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDY 238
           DSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDY
Sbjct: 121 DSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDY 180

Query: 239 GSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLV 298
           GSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLV
Sbjct: 181 GSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLV 240

Query: 299 DPCWSEKQQKIRSLDSRYMAIWNMSLD---------------FDEHFEEVVYPKGDPDAV 358
           DPCWSEKQQKIRSLDSRYMAIWNMSLD               FDEHFEEVVYPKGDPDAV
Sbjct: 241 DPCWSEKQQKIRSLDSRYMAIWNMSLDVGDDDLGGPRQYFPNFDEHFEEVVYPKGDPDAV 300

Query: 359 SISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSS 418
           SISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSS
Sbjct: 301 SISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSS 360

Query: 419 ASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTK 478
           ASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTK
Sbjct: 361 ASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTK 420

Query: 479 VPCILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENS 538
           VPCILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENS
Sbjct: 421 VPCILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENS 480

Query: 539 FDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILE 598
           FDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILE
Sbjct: 481 FDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILE 540

Query: 599 KRSRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLL 658
           KRSRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLL
Sbjct: 541 KRSRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLL 600

Query: 659 SESSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTET 718
           SESSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTET
Sbjct: 601 SESSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTET 660

Query: 719 GDRFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSS 778
           GDRFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSS
Sbjct: 661 GDRFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSS 720

Query: 779 HSDVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVV 838
           HSDVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVV
Sbjct: 721 HSDVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVV 780

Query: 839 SKDTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQP 898
           SKDTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQP
Sbjct: 781 SKDTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQP 840

Query: 899 PAKKSRHLPYPEEASNGLGDVLLEDADL 912
           PAKKSRHLPYPEEASNGLGDVLLEDADL
Sbjct: 841 PAKKSRHLPYPEEASNGLGDVLLEDADL 868

BLAST of CmoCh12G001520 vs. ExPASy TrEMBL
Match: A0A6J1KRR7 (probable ubiquitin-like-specific protease 2B isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495917 PE=3 SV=1)

HSP 1 Score: 1673.7 bits (4333), Expect = 0.0e+00
Identity = 842/926 (90.93%), Postives = 857/926 (92.55%), Query Frame = 0

Query: 1   MNDASVKGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKYGCLELGKEVENPHMD 60
           MN+ASVKGLEVFDFTEEDELPELISE+CLSKFKNPNLE N+VLKYGCLELGKEVENPHMD
Sbjct: 1   MNNASVKGLEVFDFTEEDELPELISERCLSKFKNPNLEINTVLKYGCLELGKEVENPHMD 60

Query: 61  VDLDECNRGCDNGISHIPPGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120
           VDLDECNRGCDNGISHIP GISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG
Sbjct: 61  VDLDECNRGCDNGISHIPSGISKEQLIMEEEKYQLDVNTESEWNTHSQDMFVQVNNHVTG 120

Query: 121 CFGSELGKVGSSSQSSILGLNCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180
           CFGSELGKVGSSSQSSILGLNC+LPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS
Sbjct: 121 CFGSELGKVGSSSQSSILGLNCTLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDS 180

Query: 181 VSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGS 240
           VSPNG+SSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNG+KINGFSDYGS
Sbjct: 181 VSPNGRSSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGIKINGFSDYGS 240

Query: 241 NEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDP 300
           NEFLNLEW VDDL+NIECHWFQRVEFVTIKLHTISKDVGQRDN C TSGLKEVKIVLVDP
Sbjct: 241 NEFLNLEWRVDDLVNIECHWFQRVEFVTIKLHTISKDVGQRDNACGTSGLKEVKIVLVDP 300

Query: 301 CWSEKQQKIRSLDSRYMAIWNMSLD---------------FDEHFEEVVYPKGDPDAVSI 360
           CWSEKQQKIRSLDSRYMAIWNMSLD               FDEHFEEVVYPKGDPDAVSI
Sbjct: 301 CWSEKQQKIRSLDSRYMAIWNMSLDAGDDDLGGPRQYFPNFDEHFEEVVYPKGDPDAVSI 360

Query: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420
           SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS
Sbjct: 361 SKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSAS 420

Query: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVP 480
           DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENL+KSTKVP
Sbjct: 421 DGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLIKSTKVP 480

Query: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFD 540
           CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLE         
Sbjct: 481 CILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLE--------- 540

Query: 541 CGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKR 600
                                          LNVDWFPPAEAYLKRTLIQRLIFEILE R
Sbjct: 541 -------------------------------LNVDWFPPAEAYLKRTLIQRLIFEILENR 600

Query: 601 SRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSE 660
           SR+MSTTACSDD LSKFPSNNEDEAGVELVPESGRLAETF+HNLSSSQAADGIEITLLSE
Sbjct: 601 SREMSTTACSDD-LSKFPSNNEDEAGVELVPESGRLAETFDHNLSSSQAADGIEITLLSE 660

Query: 661 SSSRHNHFMDGSGLVVRELFEPGTSNGSLLGHYQSFAQTSSYFDTNVTVLEEDADTETGD 720
           SSSRHNHF+DGSGLVVRELFEPGTSNGSLLGHYQ FAQTSSYFDTNVTVLEEDADTETGD
Sbjct: 661 SSSRHNHFIDGSGLVVRELFEPGTSNGSLLGHYQPFAQTSSYFDTNVTVLEEDADTETGD 720

Query: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780
           RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS
Sbjct: 721 RFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMSIQPEHGSGIASSPSSHS 780

Query: 781 DVLEDVGIIESCDVREPSPGNKEEINRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840
           DVLED+GIIESCDVREPSPGNKEEI RKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK
Sbjct: 781 DVLEDMGIIESCDVREPSPGNKEEIIRKRPLPIENLEPIAECPTSAATTTQDTDTIVVSK 840

Query: 841 DTNDTCEDMENDGSDPHCKETVVASPCLDEDITTKTDIEHDDAVLVASVTEVDLHEQPPA 900
           DTNDTCEDMENDGSDPH KETV ASPCLDEDI+TKTD+EHDDAV+ ASVTEVDLHEQPPA
Sbjct: 841 DTNDTCEDMENDGSDPHRKETVFASPCLDEDISTKTDVEHDDAVVDASVTEVDLHEQPPA 885

Query: 901 KKSRHLPYPEEASNGLGDVLLEDADL 912
           KK RH PYPEEASNGLGDVLLEDADL
Sbjct: 901 KKPRHFPYPEEASNGLGDVLLEDADL 885

BLAST of CmoCh12G001520 vs. TAIR 10
Match: AT1G09730.2 (Cysteine proteinases superfamily protein )

HSP 1 Score: 609.8 bits (1571), Expect = 3.7e-174
Identity = 378/908 (41.63%), Postives = 530/908 (58.37%), Query Frame = 0

Query: 7   KGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKY--------------------G 66
           K  EVFDF EEDEL E  + K L KF NP+   + VL+                      
Sbjct: 3   KNFEVFDFKEEDELAESAAGKLLEKFTNPSPCNSPVLQRQRIQSFCNEKRVEEEEMEGPS 62

Query: 67  CLELGKEVENPH-------------------MDVDLDECNRGCDNGISHIPPGISKEQLI 126
           C E    VE+                     +   L+  +   +    H+  G+    L 
Sbjct: 63  CAEPATAVESDDHQCEDDSTLVTEAKESRTILTFGLETTDHLEETDAEHVNQGLML-GLK 122

Query: 127 MEEEKYQLDVNTESE---WNTHSQDMFVQVN-NHVTGCFGSELGK---VGSSSQSSILGL 186
            E+   + D++ ++    +  +S+D   + + +H    F  +LG       +S  S   L
Sbjct: 123 TEDLAKETDIDHDNHGLMFGLNSEDDIEETDVDHRVESFSCQLGGNSFYAETSSYSQRQL 182

Query: 187 NCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDSVSPNGKSSDKCSSDNEMDDL 246
           N    + ++  EQ+D +S  + S++  S +S  S+  +D        ++ C +D E  DL
Sbjct: 183 NSPFSDSSSSEEQIDMMSAIDESLSDRSALSEASDSEDDE---EDWMTEHCFNDEEKIDL 242

Query: 247 NQEVVLCPDYIIYGDSYCTSSQLTFSHNGVKINGFSDYGSNEFLNLEWGVDDLINIECHW 306
           +  V++  +Y+I  D +C +S + FS NG+KI  F         + E+GV+D+++I+ +W
Sbjct: 243 STAVIMTSEYVILKDMHCAASLVIFSCNGIKIKSFLANNEEVPFSCEFGVEDIVSIQYNW 302

Query: 307 FQRVEFVTIKLHTISKDVGQRDNGCDTSGLKEVKIVLVDPCWSEKQQKIRSLDSRYMAIW 366
           +Q V  + +++  + KD    ++      ++E+KI + +  W  KQQKI SL  +Y A+W
Sbjct: 303 YQNVGLIILRIRVLLKDENCHED------MEELKIAVKEHNWPNKQQKINSLHVKYPAVW 362

Query: 367 NMSLD-------------------FDEHFEEVVYPKGDPDAVSISKRDVDLLQPETFVND 426
           N  L+                   FDE FE+VVYPKGDPDAVSI KRDV+LLQPETFVND
Sbjct: 363 NTDLEDDVEVSGYNLNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVND 422

Query: 427 TIIDFYIQYLKSQIDPKEKHRFHFFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKV 486
           TIIDFYI YLK+QI  +EKHRFHFFNSFFFRKLADLDKDPSS +DGKAAFLRVRKWTRKV
Sbjct: 423 TIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKV 482

Query: 487 NLFDKDYIFIPINFNLHWSLMVICHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAGLK 546
           ++F KDYIF+P+N+NLHWSL+VICHPGEVA  +D +L  S KVPCILHMDSIKGSHAGLK
Sbjct: 483 DMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHAGLK 542

Query: 547 NLIQSYLLEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFLAEA 606
           NL+Q+YL EEW ER+K+ S+DISS+F NLRF+ LELPQQENSFDCGLFLLHYLELFLAEA
Sbjct: 543 NLVQTYLCEEWKERHKETSDDISSRFMNLRFVSLELPQQENSFDCGLFLLHYLELFLAEA 602

Query: 607 PLDFSPFKISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEILEKRSRQMSTTACSDDLLSK 666
           PL+FSPFKI   S FL ++WFPPAEA LKRTLIQ+LIFE+LE RSR++S      +   +
Sbjct: 603 PLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLENRSREVSN---EQNQSCE 662

Query: 667 FPSNNEDEAGVELVPESGRLAETFNHNLSSSQAADGIEITLLSESSSRHNHFMDGSGLVV 726
            P    D+ G+E++ E        N +++ +Q   GIE+TLL  SS RH    + SG+V+
Sbjct: 663 SPVAVNDDMGIEVLSERCSPLIDCNGDMTQTQDDQGIEMTLLERSSMRHIQAANDSGMVL 722

Query: 727 RELFEPGTSN-GSLLGHYQS-FAQTSSYFD-TNVTVLEEDADTETGDRFMYLPSEQDGLQ 786
           R+LF+ G++N GSLL   Q  F   SS++  +N +   E  D ETG++FM L + +   Q
Sbjct: 723 RDLFDSGSNNTGSLLEQLQQPFEDPSSFYHLSNDSSAREQVDMETGEQFMCLNAGEGNFQ 782

Query: 787 PVDAMTSQACRFPGSSRGLESETAFDLCMS-IQPEHGSGIASSPSSHSDVLEDVGIIESC 838
            +   T        S R   S ++++L +  +Q E  + + S  S+ S+  + +GIIE  
Sbjct: 783 CITETT--------SPRASNSFSSWNLGIPLVQKEDETDLLSETSNSSN--DSIGIIEDN 842

BLAST of CmoCh12G001520 vs. TAIR 10
Match: AT1G09730.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 599.4 bits (1544), Expect = 5.0e-171
Identity = 380/937 (40.55%), Postives = 532/937 (56.78%), Query Frame = 0

Query: 7   KGLEVFDFTEEDELPELISEKCLSKFKNPNLETNSVLKY--------------------G 66
           K  EVFDF EEDEL E  + K L KF NP+   + VL+                      
Sbjct: 3   KNFEVFDFKEEDELAESAAGKLLEKFTNPSPCNSPVLQRQRIQSFCNEKRVEEEEMEGPS 62

Query: 67  CLELGKEVENPH-------------------MDVDLDECNRGCDNGISHIPPGISKEQLI 126
           C E    VE+                     +   L+  +   +    H+  G+    L 
Sbjct: 63  CAEPATAVESDDHQCEDDSTLVTEAKESRTILTFGLETTDHLEETDAEHVNQGLML-GLK 122

Query: 127 MEEEKYQLDVNTESE---WNTHSQDMFVQVN-NHVTGCFGSELGK---VGSSSQSSILGL 186
            E+   + D++ ++    +  +S+D   + + +H    F  +LG       +S  S   L
Sbjct: 123 TEDLAKETDIDHDNHGLMFGLNSEDDIEETDVDHRVESFSCQLGGNSFYAETSSYSQRQL 182

Query: 187 NCSLPEFAAKREQVDALSYPNGSMNGSSPMSSPSELVEDSV----------------SPN 246
           N    + ++  EQ+D +S  + S++  S +S  S+  +D                  SP 
Sbjct: 183 NSPFSDSSSSEEQIDMMSAIDESLSDRSALSEASDSEDDEEGTCYTFSLYFLIDMLGSPM 242

Query: 247 GK-------------SSDKCSSDNEMDDLNQEVVLCPDYIIYGDSYCTSSQLTFSHNGVK 306
                           ++ C +D E  DL+  V++  +Y+I  D +C +S + FS NG+K
Sbjct: 243 SDRVLISMLYALKDWMTEHCFNDEEKIDLSTAVIMTSEYVILKDMHCAASLVIFSCNGIK 302

Query: 307 INGFSDYGSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHTISKDVGQRDNGCDTSGLK 366
           I  F         + E+GV+D+++I+ +W+Q V  + +++  + KD    ++      ++
Sbjct: 303 IKSFLANNEEVPFSCEFGVEDIVSIQYNWYQNVGLIILRIRVLLKDENCHED------ME 362

Query: 367 EVKIVLVDPCWSEKQQKIRSLDSRYMAIWNMSLD-------------------FDEHFEE 426
           E+KI + +  W  KQQKI SL  +Y A+WN  L+                   FDE FE+
Sbjct: 363 ELKIAVKEHNWPNKQQKINSLHVKYPAVWNTDLEDDVEVSGYNLNQQKRYFPSFDEPFED 422

Query: 427 VVYPKGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEKHRFHFFNSFFFR 486
           VVYPKGDPDAVSI KRDV+LLQPETFVNDTIIDFYI YLK+QI  +EKHRFHFFNSFFFR
Sbjct: 423 VVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQIQTEEKHRFHFFNSFFFR 482

Query: 487 KLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVICHPGEVAR 546
           KLADLDKDPSS +DGKAAFLRVRKWTRKV++F KDYIF+P+N+NLHWSL+VICHPGEVA 
Sbjct: 483 KLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPVNYNLHWSLIVICHPGEVAN 542

Query: 547 YSDENLMKSTKVPCILHMDSIKGSHAGLKNLIQSYLLEEWNERNKDASEDISSKFKNLRF 606
            +D +L  S KVPCILHMDSIKGSHAGLKNL+Q+YL EEW ER+K+ S+DISS+F NLRF
Sbjct: 543 RTDLDLDDSKKVPCILHMDSIKGSHAGLKNLVQTYLCEEWKERHKETSDDISSRFMNLRF 602

Query: 607 LPLELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSKFLNVDWFPPAEAYLKRT 666
           + LELPQQENSFDCGLFLLHYLELFLAEAPL+FSPFKI   S FL ++WFPPAEA LKRT
Sbjct: 603 VSLELPQQENSFDCGLFLLHYLELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRT 662

Query: 667 LIQRLIFEILEKRSRQMSTTACSDDLLSKFPSNNEDEAGVELVPESGRLAETFNHNLSSS 726
           LIQ+LIFE+LE RSR++S      +   + P    D+ G+E++ E        N +++ +
Sbjct: 663 LIQKLIFELLENRSREVSN---EQNQSCESPVAVNDDMGIEVLSERCSPLIDCNGDMTQT 722

Query: 727 QAADGIEITLLSESSSRHNHFMDGSGLVVRELFEPGTSN-GSLLGHYQS-FAQTSSYFD- 786
           Q   GIE+TLL  SS RH    + SG+V+R+LF+ G++N GSLL   Q  F   SS++  
Sbjct: 723 QDDQGIEMTLLERSSMRHIQAANDSGMVLRDLFDSGSNNTGSLLEQLQQPFEDPSSFYHL 782

Query: 787 TNVTVLEEDADTETGDRFMYLPSEQDGLQPVDAMTSQACRFPGSSRGLESETAFDLCMS- 838
           +N +   E  D ETG++FM L + +   Q +   T        S R   S ++++L +  
Sbjct: 783 SNDSSAREQVDMETGEQFMCLNAGEGNFQCITETT--------SPRASNSFSSWNLGIPL 842

BLAST of CmoCh12G001520 vs. TAIR 10
Match: AT4G33620.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 399.8 bits (1026), Expect = 5.8e-111
Identity = 208/453 (45.92%), Postives = 291/453 (64.24%), Query Frame = 0

Query: 154 VDALSYPNGSMNGSSPMSSPSELVEDSVSPNGKSSDKCSSDNEMDDLNQEVVLCPDYIIY 213
           +D +S  +    G   ++S S    D VS  G++++  S  +E+D  N +V++ PD IIY
Sbjct: 99  IDVISNGSHRRIGIDSLTSSSLSENDEVS-TGEATNPASDPHEVDPENAQVLIIPDVIIY 158

Query: 214 GDSYCTSSQLTFSHNGVKINGFSDYGSNEFLNLEWGVDDLINIECHWFQRVEFVTIKLHT 273
           GD YCT+S+LTFS N + +   S   +    + +W ++D+I IE  W   VE   + +  
Sbjct: 159 GDIYCTNSKLTFSRNCMNVESSSVNATKGTFSCQWTIEDIIKIESQWCLEVETAFVNVLL 218

Query: 274 ISKDVGQRDNGCDTSGLKEVKIVLVDPCWSEKQQKIRSLDSRYMAIW------NMSLDFD 333
            S+     D   D SG+  +K  + DP WS++ + IRSLDSRY  IW      +  + F 
Sbjct: 219 KSRKPEGVDIAKDISGIDLLKFSVYDPKWSKEVETIRSLDSRYKNIWFDTITESEEIAFS 278

Query: 334 EH------------FEEVVYPKGDPDAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQI 393
            H            FE++VYP+G+PDAV + K+D++LL+P  F+NDTIIDFYI+YLK++I
Sbjct: 279 GHDLGTSLTNLADSFEDLVYPQGEPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRI 338

Query: 394 DPKEKHRFHFFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINF 453
            PKE+ RFHFFN FFFRKLA+LDK   S   G+ A+ RV+KWT+ V+LF+KDYIFIPIN 
Sbjct: 339 SPKERGRFHFFNCFFFRKLANLDKGTPSTCGGREAYQRVQKWTKNVDLFEKDYIFIPINC 398

Query: 454 NLHWSLMVICHPGEVA------RYSDENLMKSTKVPCILHMDSIKGSH-AGLKNLIQSYL 513
           + HWSL++ICHPGE+          D+ +    +VPCILH+DSIKGSH  GL N+  SYL
Sbjct: 399 SFHWSLVIICHPGELVPSHVNFHSFDDEVENPQRVPCILHLDSIKGSHKGGLINIFPSYL 458

Query: 514 LEEWNERNKDASEDISSKFKNLRFLPLELPQQENSFDCGLFLLHYLELFLAEAPLDFSPF 573
            EEW  R+++ + D SS+  N++ + LELPQQENSFDCGLFLLHYL+LF+A+AP  F+P 
Sbjct: 459 REEWKARHENTTND-SSRAPNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPS 518

Query: 574 KISKLSKFLNVDWFPPAEAYLKRTLIQRLIFEI 582
            IS+ + FL  +WFP  EA LKR  I  L++ +
Sbjct: 519 LISRSANFLTRNWFPAKEASLKRRNILELLYNL 549

BLAST of CmoCh12G001520 vs. TAIR 10
Match: AT1G10570.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 152.9 bits (385), Expect = 1.2e-36
Identity = 93/269 (34.57%), Postives = 148/269 (55.02%), Query Frame = 0

Query: 331 EEVVYPKGDP----DAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEK--HRFH 390
           E++ YP  D     D V +S +D+  L P  ++   +I+FYI+Y++  +   +K     H
Sbjct: 315 EDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIRYVQHHVFSADKTAANCH 374

Query: 391 FFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVI 450
           FFN+FF++KL +        +D  A F++ R+W +  +LF K YIFIPI+ +LHWSL++I
Sbjct: 375 FFNTFFYKKLTEAVS--YKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHEDLHWSLVII 434

Query: 451 CHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAG--LKNLIQSYLLEEWNERNKDASED 510
           C P +     DE+ +       I+H+DS+ G H    + N ++ +L EEWN  N+DA  D
Sbjct: 435 CIPDK----EDESGL------TIIHLDSL-GLHPRNLIFNNVKRFLREEWNYLNQDAPLD 494

Query: 511 ISSKFKNLRFLP-------LELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSK 570
           +    K  R LP       +++PQQ+N FDCGLFLL ++  F+ EAP   +   +    K
Sbjct: 495 LPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL----K 554

Query: 571 FLNVDWFPPAEAYLKRTLIQRLIFEILEK 585
            ++  WF P EA   R  I  ++ ++  K
Sbjct: 555 MIHKKWFKPEEASALRIKIWNILVDLFRK 566

BLAST of CmoCh12G001520 vs. TAIR 10
Match: AT1G10570.2 (Cysteine proteinases superfamily protein )

HSP 1 Score: 152.9 bits (385), Expect = 1.2e-36
Identity = 93/269 (34.57%), Postives = 148/269 (55.02%), Query Frame = 0

Query: 331 EEVVYPKGDP----DAVSISKRDVDLLQPETFVNDTIIDFYIQYLKSQIDPKEK--HRFH 390
           E++ YP  D     D V +S +D+  L P  ++   +I+FYI+Y++  +   +K     H
Sbjct: 314 EDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIRYVQHHVFSADKTAANCH 373

Query: 391 FFNSFFFRKLADLDKDPSSASDGKAAFLRVRKWTRKVNLFDKDYIFIPINFNLHWSLMVI 450
           FFN+FF++KL +        +D  A F++ R+W +  +LF K YIFIPI+ +LHWSL++I
Sbjct: 374 FFNTFFYKKLTEAVS--YKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHEDLHWSLVII 433

Query: 451 CHPGEVARYSDENLMKSTKVPCILHMDSIKGSHAG--LKNLIQSYLLEEWNERNKDASED 510
           C P +     DE+ +       I+H+DS+ G H    + N ++ +L EEWN  N+DA  D
Sbjct: 434 CIPDK----EDESGL------TIIHLDSL-GLHPRNLIFNNVKRFLREEWNYLNQDAPLD 493

Query: 511 ISSKFKNLRFLP-------LELPQQENSFDCGLFLLHYLELFLAEAPLDFSPFKISKLSK 570
           +    K  R LP       +++PQQ+N FDCGLFLL ++  F+ EAP   +   +    K
Sbjct: 494 LPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLTLQDL----K 553

Query: 571 FLNVDWFPPAEAYLKRTLIQRLIFEILEK 585
            ++  WF P EA   R  I  ++ ++  K
Sbjct: 554 MIHKKWFKPEEASALRIKIWNILVDLFRK 565

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8L7S05.2e-17341.63Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q0WKV84.8e-11046.76Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q8RWN01.7e-3534.57Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana OX=3702 GN=ULP1C PE=... [more]
Q2PS261.4e-3233.71Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana OX=3702 GN=ULP1D PE=... [more]
O137699.8e-3135.93Ubiquitin-like-specific protease 2 OS=Schizosaccharomyces pombe (strain 972 / AT... [more]
Match NameE-valueIdentityDescription
A0A6J1GHC80.0e+0098.38probable ubiquitin-like-specific protease 2B isoform X1 OS=Cucurbita moschata OX... [more]
A0A6J1KHS50.0e+0095.25probable ubiquitin-like-specific protease 2B isoform X1 OS=Cucurbita maxima OX=3... [more]
A0A6J1GIE50.0e+0094.06probable ubiquitin-like-specific protease 2B isoform X2 OS=Cucurbita moschata OX... [more]
A0A6J1GIH90.0e+0098.27probable ubiquitin-like-specific protease 2B isoform X3 OS=Cucurbita moschata OX... [more]
A0A6J1KRR70.0e+0090.93probable ubiquitin-like-specific protease 2B isoform X2 OS=Cucurbita maxima OX=3... [more]
Match NameE-valueIdentityDescription
AT1G09730.23.7e-17441.63Cysteine proteinases superfamily protein [more]
AT1G09730.15.0e-17140.55Cysteine proteinases superfamily protein [more]
AT4G33620.15.8e-11145.92Cysteine proteinases superfamily protein [more]
AT1G10570.11.2e-3634.57Cysteine proteinases superfamily protein [more]
AT1G10570.21.2e-3634.57Cysteine proteinases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.30.310.130coord: 383..514
e-value: 2.9E-82
score: 277.4
NoneNo IPR availableGENE3D1.10.418.20coord: 335..579
e-value: 2.9E-82
score: 277.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 158..190
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 158..195
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 879..911
NoneNo IPR availablePANTHERPTHR47764UBIQUITIN-LIKE-SPECIFIC PROTEASE 2B-RELATEDcoord: 8..866
NoneNo IPR availablePANTHERPTHR47764:SF2UBIQUITIN-LIKE-SPECIFIC PROTEASE 2B-RELATEDcoord: 8..866
IPR003653Ulp1 protease family, C-terminal catalytic domainPFAMPF02902Peptidase_C48coord: 358..547
e-value: 1.5E-39
score: 136.1
IPR003653Ulp1 protease family, C-terminal catalytic domainPROSITEPS50600ULP_PROTEASEcoord: 343..537
score: 28.425362
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 335..579

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh12G001520.1CmoCh12G001520.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity