HG10018823 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018823
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ULP_PROTEASE domain-containing protein
Location: Chr04: 9226045 .. 9240365 (+)
RNA-Seq Expression: HG10018823
Synteny: HG10018823
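
The Location field packs a chromosome, a start, an end, and a strand into one string. The sketch below shows one way to split it and derive the genomic span. It assumes the coordinates are 1-based and inclusive (the page does not state its convention), and the function name `parse_location` is hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr04: 9226045 .. 9240365 (+)' style string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Chr04: 9226045 .. 9240365 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span works out to 14321 bp; a half-open convention would give one base fewer.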
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGTCTTCTTTTGTTTTTCCGGAAAAGGATATTATCCAGCCGAGTAGATGTGCGTCTTTAGACATTCCATTCGAAGGAACTCAGCGGTTTGACAGCCCCTCAAAGGTTGGAGACTCGGTCATTTCCAACAGCCCAGCACCAGCCGATATTTTGAACTCTAGGCTCTCCAATTTGATTGGCGTTTCCTATTCTGAGGAAAAAGATGGATGCCAGCAGCGTATCTTTCTAGACCATCCTCTGTCTATAAAGGCCAAAGATCATGTTTCAGTTGTTGAGACACCTGCTCAACTCCACTCTCTCCCAAAAGAATTCTCAAGATTGTTGAAGCCTTATCCTAAGCATTTTTCCAGGAGAAAGAAATCTTTTCTCAAACCTCGAGTAATGGCTACCAACCCAGACTTGTTGGAAGAATGCTGCACCCGAGCTCTTGTCCCAAACCTTTCCATTAGGTCAGTTCATCAATCCCCCTACTCAACAATTCAAGCATTTAGTTTTCCACATCCCAACCATAAGACCAAATTTCTGAGAGGTACTCGGCGAATCTCCCCTATTTGTTAGTAAAAGAGGACTTATGATTTAGACAACGACTCAATAGTGAGCGCCAGCAGCGAAGAGTTAGAAAGATATGGGGAAGAAGATAATCAAGAAATTTCATAGCAAGTTGACAACTTTGTGGAGGGGCTTAATTCTATGTTTCAAATCGAAGAAGGGCAACAAACAGAAGAGGGTTGGAACTCTTGTCTTAAGCCACTGCCTCAGTTGGATATTCCTGCACATTTAAAGTCCATAATTGCAGATTGTGGACTTGTTATGGGATAATCCAACCCCTCTCAGTCTGTCTGGTTGTGGAGTTTGACAGAGTTTTAATATGAAATTTGTTTCTTGGTTCACATAGGGACTAAGTGTTGCACTTAAGCGGTAGGTCCTCAACAAAGTTCTTAAGAATTGTAACCCGGATATAGTTCTATCTCAAATTTCATAGAGGGACGACTTAGTTGTTGTTTTCACTAAAGCTTTTTGGAGAACAAATGAAATAGGGTGGGAGTCTGTTGAATTCTATGGCAAATCAGATTGGCGCTGGTTCCAAGTTGTCTCAGTTGAAGATGGTAGGGTTCTTCATAAGCTTTAGACGGTTTCACTTTAAATTCAGCCTTTTGGGTTTGGTCTACGGTTTGATTTGATTTGTTTTTCAGACTCGTGAAAGTTGAATCGGGCTTCCTCACCCTTGGAGTTTATCCTACTAGTTGTCCCCATCATAGTGTTGATTTGTTCGGTTAGTTTTGAATTGGTGGTTTCTGGGGCTTGATGTAAAAAGTCAATTTCTTCCAATTCGGGTCGAATTTGTTCATATTTGTTCGTATTTGTTTTTTCGGACTTGTCCAGCTTTGGTCTGTTTGGTCCGTTTGGTTGTATTTGTTTGTATTTGTTTTTCCGGACTTGTCCAGCTTTGATCTTTTTGGTTGTATTTGTTTTAAGTTGCTTTTTCCTTTTCTGTCACATTGTAATCTTGAGCATTAGTCTCTTTTCATTACATCAATGAGAAACGTGTTTCCTTTTCAATCAAGTTTCTTTTGGATTCTTCAAGTTTTCTTGGGGTTTCTCTTGGTTCTTTCAAGTTTCTTTAATTTATTGTGCTTTAACACGGTGTAAACATTCTCGAAAAAAAAGGAAAAGTTATTGCTCTTGATTCCATTCCAGAAACTTCCAATTATAACCCTGACACAATAGAAGAGTCATGTACACGACTTTTGAATCCTTTGGATCAATCTGAAGCTAGTCTTTCCCTGGATCATTCAGTGGAAACTCCAATGGTAGGCTGTTCAGCTTCACAGAGGAGGAATTATTGAATACTCAAGTGGAAGAGGCTCTGGTTAAGCAAAATATTCCTTGCTCATCTTCCGTTCCAAAGCAGGGGATGAAGATTATTACCTGGAACACCAGAAGTTCTTTCAGTCTTAGTGGCAAACATTCATTCTAAAGGATTGTTTGAAGGA
TTTATGGTTGAGGCAGCCAAGATACATGTACTGATTCTTCAATTTGTTGATGACACTTTGCTATTTTGCAAATATGATGATTCTATGTTGGAAACTCTCATGAAGACAGTAAAGGTGTTTGAGAGAATCTCAGGTTTAAAAGTTAATTGGCAGAAGTCATCCCTTTGTGGCATTGATGTTGAGAACTCCAAGCTTTCAGCCATGGCCATCTAAAACTAAGAAGGCTTATGGGTTAAAAAACAAGAGATGGAGTTTTGTCTCCACTTTCCTAACAATTGATATCTAAAGTTCTACACTCCATGGATGAAGAATCCGAAGAATCAAAAGAAGAAAAATTCAAATCAATTCAAAGGAAATCAAAAGCAAATCTTAGGGTTAAAAGTTTGTATTGAAAACAAAACGAAATCGAAACTTGAAAAGAACTCCGAGTTTCGGAAATGAAGGTTTCGGTTAAGCAGAGAAGCAGAGGGGATGAAGGAGAAGATGAAGAAGAGGGAAGATGAAAGAGATGAGAGAAGGTGAGAAGGTGAGAGAAACGTACAGACTTCACGTGTGAGAGGAGTAAATGAGGGGAGTAATGAGGGGAGTTGAGATTCTTGAGGGAATATTGAGGGAGGAGAAATTGGTGGGTTTAATAACCCACTGTTTTTAATTAACAGTGAATTGGGTGAGTTTTTTTTACCCACCCAACCCAACCCATGGGGTCAAACGCCCCCTTAGAGTTCTTCCTTCATGAATTCACTCCCAAGAAACACACACAAATAGAGAAAGAACAACACTTTTCTATCGATATTCAACCGATAACCATCCACAGTTACCCAAGAACTAGCCAAACAACGATACAATAAGAAAAGCACAACTATTAGCTATTTCGAGAGAGCTAGGAATCTCTCCAACAAAGACACTTAGCCAAACAGTTCGCAGAGACCTTCAGACTCCCTCTCTTCCAAATATAACGACATTTCCCTTCTTCTATACTCTCTGACTCATTTAAGATGCATTGGGTCATAGATTTAGGTGAGAGATCCATGATCTTTTGGGCCAGTTTTAGCAGGCTAAAGATCTGTTTGGAATTTTTAGGAATTGAACACTGAACCTGATGCACGCATGGAGATAAAACGTGCCCCTCCTCACCACTGAAACCAAGCTAAACCATTGATCTGTTTGGAATTGAAAGGACTCCTAATTAAACTTCAGTTGATGGCTACTAAATAAAGTTAAACTGGGATAAAAGGTAGGTGACTAAAAAGATAACCTCTTTCTTGGACGGTATTAGGAGACTACCTAGATTGGAAAAATTTGTACTATTTACAAATAATATATTTCTGGTTGTTAGAACTGACTGGAAGTAGTTTCAGAACTTTTGTTTTGCATTGTGAATCAATGTGAGTTCGTGTTAATGATTTTTTTTTTTCTGCCTTTTGTTCTCATTTTCCTGTTTTCTCCTTGAATTTTATATCATAGTTTATCTAATTATTATAGATCTCTTGTGATGTTAGTTTAAAAGCCTATATTTTGTAATATTCTTTGTAGTTTGTACTTTGAGATTCTAATGTAAAAGGTTATCGCATAATCATTTTTTAGATTCTAATGTTACAGTTATTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATGACAACGAGGACGTTTCTGCAATGTTCTTAACCTTGCCATTTTTCCCTCTAGAGGTATTGAAAATTTTCAAAAATTTTAGTCCTGGGAAACTAAAATATTGAATGTATGCTGCCATTGTCAAGATCCTCCGTTTTCTCTCCCACCCCACCCCTCTCAACACAAAATAATAATAATAATAATAATGACAAGCTGTTAATTTATGATGACAAGCTGTTAGTTTATGAATTTAATAATAATGATAAGCTGTTAGTTTATGAATTTGTTACTGTCTTGCTGTTGTTGCAGCTTCCGCATGTAATAGGGTCAAGCTGTTTTGCTTTTCATTTGGTCTTGGTGGTTGCTGTTTCTTTTTTTTTT
TTTTTCCCTTTTTGGTCTAAGAGGTTTTTTCCCCATTGTATTTGGGGTTCTTATTACTTGTTCCAGTTTTAGTTTGTTATCGTCCTTCTGGATTTTTGCTATTTGTGCGGAAATGATGAGAGTGCTAAGGGGGTGTCAACCTAGTTGAGATGTCCGGGTGCACTCTCTAATCCATTGGTTTTATGCTCTTGTATCTCTCTTTGTAACTTAAGCATTAGTCTCATTTCATTAATTCAATGAAGAGACTCGTTTCTTTTTCAGAAAAAAATAAAATAATAATAATAATAATAAAATAAAAGAACGGAAATTGAAAGAATAATACTTCTCATATTGTATAATACCATGTAATCTTTCTTTTTCTTTCTAAAATTAACTTGAGAAGTTTCTTTTGGAGGTTGAGTGCATGTCTCAAATGGGACAGGAAATGTCCCAAATTTTAAATGTTGATTAATATGTGGAGGATATGGAGTCAGGCCTCATTCTCCTAGGTTCCATTTCCCCCCCTCCTGAATAGCACTGGTTGATTTAGGTCTTCAGGTATCTAATTAAGCAACCAAATGTTTTGACTGAGCCTACCGATGACAGAATCTTGAAGATAAATTCTCTTGCTGCATACGTACTTCTAGCTTGAGAATGTGATAAATGATATTTATCTTACATTGAGATGTGTGTTTATTGGTCAGACACTCACGGTTATTGATAAAACCTTGATTAATTTGGGGATTCTATGATGGAAGATTTCCCAGGTTTTGTGTTTGATTGATGCTCTAGGTGAAGAGCTTAATAAATGAAACTTCTTGTATTAGTGTTGTGCCACATGGGTGTTCTTGTGGAGTATAATGGGGTGGAGCATAATGCCACAATGACCAAAGAACAAAGATGTTACTTTTGTGCACCTGAAATGTCAATAGCATATACACAACGGTTTTTCTTTCTCTCTTTTTCTTGGTGAGGGGAGGAGGGGGGTTGCTTTTTCAAACATTTGTCTGATAGTAGATAATTCTGCATATTGTGTTCCCATGTCTAACCTTGACTTTTGTTTTTGCTGCTTGTTCAGTTGCCACAACAGGAAAATTCATTTGATTGTGGTCTCTTCTTACTCCATTATGTGGAACTTTTTTTGGAGGGTGCGCCAATTAACTTCAGCCCACTCAAAATCTTGAAGTCCTCAAACTTTGTAGGTCATTTGTCCTTCTATATGATCAAATTACAAATCCTTTATATGTGGCATATCCTGTTGGTTCTACCCACTTAATTAAATTTTATCTTTTTCATATCTTTTTCATATTTGCTACCTTATATTCAAGATTGTTAACTTGTGAATCGAAATGTGTATTGGAATTAAAATTTACTGTATCGCGTATCATAAATGTTGAAGACAATGTAACCTCTACCTATCTCCATGATGGTGTGGTACTATCTACTTTGAGCATAAACTCTCACTGTTTATGTTTGTTTCCCTCATTGTAACTTGAGCATTAGTCTCATTTCATTATTTTAATGAAAATACTCGTTTTTTTTTTTAAAAAAAAGCATAAACTCTCACCGTTTTCCTTTTGCTTTAAAAAAAAAAGACTCATTCCAATAGAGATAATTGCCCTTACTTATATACCATGAATCTCTTATCTTTTTCAACCAATGCGGGACTTTGTTTGAACCTAACAGTTCTCCCCTCAAACAAATTTCCACACTTGAGCCTCCCCTCAAAAAAATCTATCCACTCCCAACAGGCTTGCTCTTCATCTTGAGACACAACCAATCATTCTCAAGGCAACATTTGAATAGAGCATGAGCACTGTTTACATTGATTGACAATGAGAGTTCTATTTTCTCTTATTAAGGCGAGTTGGGAACATACAGAGAGAGCACGAGCATAGATCACATTGTGGGACCACCGTGAATCTCTCTTTTCTCTAGCTAATGTGAGACTTTGTTTGCACCTAACAATCCTTCCTTTAAACAAAGTACCACAAATGAGCCTCCTCTTGAACAAATTTGTG
CAAAGTATATCTTAGAGCTTCATCTGGAGGCATAACAAATTCATCTCAATGCTACATCCAACTTACGTATGAGCTGGAATCACATCAATTGACAATGATAGTTTTATTTTTGTTATTATCGAGGCAAGTTAGGAATCCATTGTGGGATCTCCTTCGCTCAAACAAAGATATACTCCTCTCTACTCAACCTTTTAAAGGCTATGTCCCCATCTTCAGTTTCTATTGTCAGTCATGTTCTTTGTCTTATTCATCATCTCATCAAAAAAAAAAAAAAAAAAAAAATCAACATCATTGTCGTGTCCTTTCTTATTCACCATTCTCATAAGCCAGACACTTCTCGTAATTACTCAATACAAGCACAACCCAAGTTTAATCTTCTCACAAGCATACTTAACATCATCAATCAAAACATGTTGCATTCTTAGTCAATACAAACTATCAAGTGTACGACATAGAATCATGGATTCTCTCACAAGACTTCCAGCATACTAGCACACCGGTCTTAACCAAAACTCGACATCAAGAACTCAAAAGGACACGGTTGCGTACATTACAACTCATCAATTTATGTACTCTTATCAGTTACTGTAACATGCTATTTCATCAATAATTTCCATCATGCAATTTCTCAACATTATATTTATTTAGCACTTGGGCAGTTGGTACCTTAATGCGTACCTCCTTTAGTTCCGGTGCCTTTAGTTTTTTTGTAGAATTTTTATATGTATGATACAAGTACTGAACCTCAATCCTGTAATGGCCTTACATTGGTAACAATCATTGTTTTGTCTTTATTTGTGTAAGTTTCTGCATGTCTAACATTAATACAATCTCCAGCTAAGTCAGGATTGGTTCCATCCTGTGGAGGCTTCTCAAAAACGTGCACATATCCTACAATTAATTTATGAAATCATGGTCAGTAATCAAGCAGAGGAACTTTCTGGTAGCATCGGTAAATATCCTTCTTCTGATGCTGGTGACTCGGACAAAGTTTTATCCAGACATGTGTATCTTAGAGAGAGACATATTTTCACAATGACCCATTCTGACAACTTCTCGAGTATTGGAAAAGAATTTGGATCGGTGTCTAAAGTATCATCTGATACAAATTATCAACGAATAGAAAGACAATCAGGGAGTGTCATGCCACCCATTGAGGTAATTTGATGTTTTCTTTCATATGATGACTATACTTCTCGTTTGCTTATTGATTCAAAGTCATTGGTTATGAGAGGTGATTGTATATTATTTGTATACTAGTTTAATAATGAAACATTAGTTTGGCTAGTCTCTGCCAGCATTCCATGTGGTCTTCAATTTAGATGCTATGATAATTTATGGGATGGAGAGGAGAATGGGTGCTAAATAGGTTATGATAATGTCCCTGGATTTAGGGTAGATTTGAATTATGTGCCAGTTATTTCAAAAGTCTCTTTGAGGAGCATTGTCAATTTGTGTTACAAAGATAATTGTTTATACTCTTTCAAGTTGTTGAGAAGAAACTAGAACAGATACAAAAATTGATAAGTCAACAAATGGTACCTTATGACTAGGAGAAACAGTAGTATAGACAAGAAAAGAACGCCATCCTCCTCCTGATGCTCCTCTTGCTCTCGCTTGGGGTGATGAAAAGAAAAAAGTAAATAAAAAAAAGGTTAAAATACCATTTTGCTCTAGTTTGAGGTTCTTTTTATTTTAGTCCTTGTATTTTCAATTGGGTTTGATAAAAGACATAGACCAAGTAGAAAACACCAAACACTTTGGGTTTAATAAAATACAAAGACCAAGTAGAAAACAAAGTATGAAAAGGAATCTCACCGTAAAGCACAAAGGGGGGCATTCAATTAATGAAGAAGCAAGTTGTAGAAGCATCAACTTAGATGTGTTTTGGAAGAATAAACATCTTCTGGAAACAACAAGGGCATTGGTAGTATATACAACTTCATTTCCGAATACACGGATTGATGAAGAATTTCATGTTTGTCTAAGTAACAACTAAG
GATATCAGAGAAGCATTATTATTGAATTTTTATGAGAAACATCGGCATTAAAAATTATAAGAATGAGAAAGCAGCAACTTGTCATTGGATATAATTAAGTACATGAGAGTAGTTGTAAAAAAATGTAACAAAATGTCATTGGACATAATTTCGTTTGTGGTCTGATTTATCGCTTAAAGTCTTTGAGTGTACTGCTTATGTTCACATCCCCAATGTTTCTCGATCTAAACTTGATCCTAGAGCTCTTAAATGTATTTTTCTCGAATATGCCTCTGACAAAAAGGGATATAGGTGTTTTGATCCCCTAACTAGAAAATACATTGAAAGTATGGATGTGGTGTTTTTGGAAAACGAATCATTTTTTGACAAAAATTCTCTTCAGGGGGAGATATCCAACTTATAAGATCATTTTTGGGATACATCTCCTCTTCACTCTATCATCCATCTCGAAACTAGTAGTCTCGTAATGCCAAACATGGGAGAGTCAACTTCAGGGGAGAAACACTTCGAAGAGATCTCACTGATCGACATCCTGAACTTCAGGTTTATACTAGAATGGCAGTCAATCAAAATAGTCAAGATCAAAGAGGCGACGTGCCACAAACCCAATCAGATGCCCTGACGAATGATGCTGAAAATTCAGGTAATTCTCCTTCTATTCTTATTTCTTCACCTTCTCAAAATCTCTTGCCTGATCTTGACATTCCAATAGCCCATAGGAAGGTATTTGTCGGTGCACAAAATATCCTATTGCAAACTACCTCTCTTACCATAGTGTCTGATGATTATAAAGCTTTCACCTCTAAGATAATAAATCTTTTTGTGCCAAGGAATATATCGGAAGCCTTGAATGATCCGAATTGAAATTTAGCAATTATGGATGATATGAATGCTCTAAAAGAAAATGGCACTTGGGAAATTGTTGAGCTTCCAAACGATAAGAAAATAGTGGGATGTAAATGGGTGTTCACTATAAAGTGCAAAGCTGATGGGAGCGTCGAAAGATACAAAGCGAGACTGGTTGCTAAAGGCTTCACACAAACCTTTGGAATTGATTATCAAGAAACATTTGCTCCTGTTGCAAGAATCAATTCCATTAGAGTGTAGTTATCTATCGCAGTTAATTTTGATTGGAGTCTCTATCAATTGGATATAAAAAATGCCTTCCTTAGTGGCGAGCTTGAAGAGAAGGTATTTATGAACTTGCCATCAGGTTTTGAAGCCAACCTTGGAACCAACAAAGTGTGCAAGTTAAAAGAAATTCTTTTATGGCCTTAGGCAGTCTCCTAGAGCGTGGTTTGAACATTTTGGGAAAGCAGTGACAGGTTATGGATTCTGTCAGAGTCAAGCAGATCACACCATTTTTTATAGACACACTCGTCCAGATATTGCTTTTGCTGTTAGTATGGTAAGCCAGTTCATGCATGCCCCAGGGCCAACCCACTTTAAAGCTGCATACAGAATTCTAAGATATTTGAAAGGGGACTCCAGGAAAAGGTATATTGTTTAAGAAGAATGGTCATTTACATGTGGATATTTACACCGACGTTGATTGGGTAGGTAGTACCACAAATAGGAGATCAACTTCTAGCTATTGTTCCTTTATTAGAGGCAATCTAGTTACATGGCGAAGCAAAAAACAAAGTGTTGTAGCCAGGAGTAATGCTGAAGCAGAATTTAGGGCTCTAGCACATGGTATTTGTTAGGGTATATGGATAAAAAGATTATTGGAAGAATTGAAATTTATGTTTATTGTGGCAATAAGGCTGTCATCTCCATAGCGCACAACCCAGTCCTGCATGATAGGACAAAACATATAGAGATTGATAAACACTTTATAAAGAAGAAGATTGATACTGGAGTGATATGCACTACCTATCTCCCTACAATAGAGCAAATTGCTGATTTACTGCAAAGGGGCTTTCCCAATTACAATTCGACAAATTGATTTACAAGCTAACAATGGAAGACATCTTTAAACCAGCTTGAGGGGGAGTGTT
GATTTATGCCACAATATTTGATTTTATTTACTTTTGTTTGCTGTAATTATTCTGTAGGATTCTTTTCTTTTATATCAATATTTTCTTATATCATTAAGTCTGTATTTTATTTTATAGAATCTTGTATTTAGTTGATTATCTTTTGTATATTTATTTATATCTTTATTTAGGTGTATCTAGTTAGGTGAGATAAGTTGTTATTTATATCTTTATTTAGGTGTATCTAGTTAGTTGAGTAGTTGTTTCAATTTTATTTTTAGATCGTTATTCCTAGTTTTTTGCTAACCCTAGTTTAGCTTAATTGTATCTTTAAAAGGCCTTATAGCTCTTTGAAGTAAATGAAAAAGTATAATTCTACTCAAATTCATATATTTAAATTAGGGCTTTGTACCCTTGTGTAATAAGAAAGTGAGAAATTCAATAATATTTGAGAAATATTTACACAATCTTTTCTGTAAGATTAACTACTATGTTTTTACCATATATTTAAATGCTTTTGTGTGTTTGTGTGTGTCATATATATATATATATATAGGATCTTATTAGTGTGGCTCATTTCACTCAGACGATGCTTTTGTCGTCTTTTGTCTTGAGGCAATTACAAAACTTGTCACCTTAACGGTGCGTCTTGTACTTTGAAAGCACTGCTACATCTTGATTGAAAGGGAAAAATTCAAATTAGCTCCAAGGATTGGAGCTACAAAATGCGTAGGGCAGAAAACGAAAAAGGCAGCTAGCTGACCAGAGGGGGCTAAAGAGTCTTGAACAGCTTTCTTCAGTTAAATATGAATTCAAATTATTCTCCTCAAGAATCGGAGGAAATGAAGAATTGGTCTTTTTTTTTTTTTTTAATTTTTATTTTAAACTATAGGATAAATGATGGGTGGGCTACAACCCGCAACTAAACAATTCTTTATCTCTTTAACACTCCGAGGCTTCGAAGACTCAAAAGTCTTAAGGGAAACTTCAAAGAGTGTCTTGGAGTCGACACCTGAAACAATAACTCACTTGGCAATTGACTATCACGAATACAAAAACTCTCCTCGATCAACCGATTGTCATTAAGATGATTAATGAGATCAATTTGCCCCATTTTTTTTTTTTAAAAAAAAGGAAACAAACTTTTCATTGATATAATGAAAAGAGACTATTGCTCAAAATACACGGAAACATAGAAGCAGAGAAAACATAAAAATGAGGGGCTAGCAGATGCACCTAGACATCTTCACTAGGAAAATACACCCATAACACTCTTAACATATCCTTTTACATGAATTCCAAGAGTCTCGAGTACATGATTACACAGAAAAAATAAATACATTCTAATTAAGAAGTCGTAATGAGATGATCTGAAAATCCAATGTAAGTTGAAGTGGGAGAATGATTTTCACTAACACTTCCTGGAGGATGAACAAGTGAACAACAATACTGCCAATCTTTATCAATAACTGAACACAATTGACATATGAATGGGATAAAGAGCTTCACAAATTGACTTTCAAAAAGCTTAATATATCTTCTAATCAATAGAAGAAGAGGTAAACAAATGCTTGGACAATGATTTCACCAAAAACTCTCTGATGCATCCAAAGACCAAAACCTTGAATCTGTAACTTGATTAATCCTTAAGCCTACTGTTTTGCACAGTGGAAGAAGAACCGAAATTGCTCCAAATCTTTGATATGCTTTGATGGCCAAATGAACAACCAAGACTAGGGGTAGAAAGCCTTCTTTTTTCAGCTTTTAAAACAAAGAAATCCATATCTTATGGTAGAACTCTCCCCATTCTTTTTCCGTTTTATTTCATAATTCACAAAGCAAGTCTTCATCTAACATTCACCTTTCACAGCCTAACTTATTATAAAACCAAACCAGCCATTTACCTTTTTTTTTTTTTTTTGAAACAAGAAACAACCTCTTCATTAAGATAATGAAATGAGATAAAATCTCAAGTACAAAATGATACAATAGAAAAAGAACCTAAGGATCAACAGGTGCAC
CCAGGCATCTCAACTAGGTTGACACCCCTTTAGCACCCTCATCATATCCAAACAAAACATAATAGAAGAACAAAACCTAGCCCATAACAATTTAATCACATAAAAACAATATCTGCCACATCTAGAAACAAGCTAATCAATCTAGAAAAATAAAAGCTCTCCAATTCAAGCTGATGTCTGAAATGGAATAGTTTGTGAAGGACTTGGAAAGTGAGCACCACAAAGAAGCATTCAACTTTGCTGATTCAAAACGAGTAAACTAAGGAGTTTCCTGGTCATGGAACACTTTTTGATTCCTCTCAAACCAAAGATCAGCAAGAAGAGCTTTAACTGCGTTATTCCAAAGGATTTTTTATGCAGCCTTGAGCTTTGGCCCCACAAGAACCTGTAAAACATTATCACGAAAAAATTTTCCAAAAATCAACTAATGTTAAAACATTCAAAAACCGAATTCTAACAATTTCCAGAATATGGACAGTTAAAGAAAAGTGCTGGAGCCATTTATTTGTTTATCCATCAAGATTCTGGGATTATCAACAATCAGCCATGCCTTCTTAAAAACTGTCAATAAGACACCTCCTATGGAGATGCTATTTCCTGCATTTGAAAACCACATGCTGCAACTAAAGTTGAAAATTTTGAAGGAATTTTAGATGGAGACAGATGTGCAACACTTGCAGAATTTTGTTGCTGTTGAATGAAAAGTGTTCTATATGCTTCCCCCAAAGTATCGGGGGTGGATTCTTCAACAATTATTGGTTCATCTTGATCAAATTCCTCACTGGTGAACCATCAAATCTGTCATCCGAATCTTGGGATGTTTGCTTCTTATAATCCTTAGTAAAGGTTCCTTGGACAAAAACAACAATTGAACCCGGGATTGTGAATTTGGGATGAACAATTACAGAAGAAGAATGTCATTTTCTTGAGGAGCTGGCTCCTTGTGGAACTAATGCTTGTACACAAGCACCCTCTAGGGCATCTAGATGAGACAAATGGTTCACCAATGAATCAGCCGATTGCTTGAACTTGTGAATCTTGGCAAAATTTGGAGAAAATTTGTGCCTTGAATAATGGTGAGGAAGTGGTGGGGTTAAATTTGCCCCATTAAACCGCCATTTACCCAATTGTCTTTAAGATAGAATCTGTCCCCTGAGAAATGCTTCTACTTCTAATAAGCGTAAGAACAAGTTATCCCAAAAGGTCAATCTAGGTTCAAATCTTAGATCTCACTCAAATGGAAACCATACATTTTCAAGAAAGTCCTAAATCTCTAATACTATGCGGTAAATAGATATTTGGGAGAAAATTCCAGACTCTTGTTTACTAGAATTGTATATAATATGACAATAAAAGGAGAATATGATTAATAGACATATCGTTAAACTGCTTATAATATTCTCATTTGTAAACCTAACAGAAAGCATACAACTATAACTCACACACATCTATTATAGCGTGTTAAATGATAATGCACACAATACTAATGAACATCATACGTAATCTACAGCCACGATATAATAATTGTTTTGTTTTCTTACCCAAAAATTCCAATCAGTTTTATTATATTTAGTTTGTTCTTGGAATATTATCAGATTGGAGGCAATGTGTGGAGCACCTTGGCTATCACATACATCAGAGTTTGGTTGGAATTATGAATATTAATGTTCTTGGTCAGATCTGTATTCATATAGCTCTACATAAAAGTTATCCATAGTTCCATATTAAATTATATAAAAGATGTGCTTTTACATTTGTTTTTCGTTTCTATTTTTGGGGCACAGGAAGATGAGAATGGTGAAATAGCTGATTCACCACAATTCTTGGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCAGCCTTCAGCTTTGGCCAACAATTTACAGAATTAGAAATATCTCGGGAGGGGAGATTTTCTAGAAACGTAAACGATACAGGTAGAACATTGTCGCCTCGGCCATTGCTTGGTGAGTCGCAAACCACATTGGAA
TTGAGACGAGATTGCACACCTCAAGCAACAAAGAGTCTGAACCATCCAACTGAAGCAGATAATGAACCAGAGATCTTAACCAGCTCAAGTGAAGAACTTGTCACTTGTGTAGTAGAAGACTCAGAGGAGGAGGGAAATGAAAGGAATGATGGAATCGAGATTGATGTTTCTTCCTCCTCAAGGAACGACTTGTTCCTGTCAAGGCAAGTGGTTGAATCTATTGCAAACTTAAGTGATAATAGACAACATGATCTGATACTAAGCAATGAACATCCAACGCTTGATTCTAGCAAGCAGCATTGTGCCAATAGGCCTAGCTAG

mRNA sequence

ATGAAGAAGTCTTCTTTTGTTTTTCCGGAAAAGGATATTATCCAGCCGAGTAGATGTGCGTCTTTAGACATTCCATTCGAAGGAACTCAGCGGTTTGACAGCCCCTCAAAGGTTGGAGACTCGGTCATTTCCAACAGCCCAGCACCAGCCGATATTTTGAACTCTAGGCTCTCCAATTTGATTGGCGTTTCCTATTCTGAGGAAAAAGATGGATGCCAGCAGCGTATCTTTCTAGACCATCCTCTGTCTATAAAGGCCAAAGATCATGTTTCAGTTGTTGAGACACCTGCTCAACTCCACTCTCTCCCAAAAGAATTCTCAAGATTGTTGAAGCCTTATCCTAAGCATTTTTCCAGGAGAAAGAAATCTTTTCTCAAACCTCGAGTAATGGCTACCAACCCAGACTTGTTGGAAGAATGCTGCACCCGAGCTCTTGTCCCAAACCTTTCCATTAGTTATTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATGACAACGAGGACGTTTCTGCAATGTTCTTAACCTTGCCATTTTTCCCTCTAGAGCTAAGTCAGGATTGGTTCCATCCTGTGGAGGCTTCTCAAAAACGTGCACATATCCTACAATTAATTTATGAAATCATGGTCAGTAATCAAGCAGAGGAACTTTCTGGTAGCATCGGTAAATATCCTTCTTCTGATGCTGGTGACTCGGACAAAGTTTTATCCAGACATGTGTATCTTAGAGAGAGACATATTTTCACAATGACCCATTCTGACAACTTCTCGAGTATTGGAAAAGAATTTGGATCGGTGTCTAAAGTATCATCTGATACAAATTATCAACGAATAGAAAGACAATCAGGGAGTGTCATGCCACCCATTGAGGAAGATGAGAATGGTGAAATAGCTGATTCACCACAATTCTTGGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCAGCCTTCAGCTTTGGCCAACAATTTACAGAATTAGAAATATCTCGGGAGGGGAGATTTTCTAGAAACGTAAACGATACAGGTAGAACATTGTCGCCTCGGCCATTGCTTGGTGAGTCGCAAACCACATTGGAATTGAGACGAGATTGCACACCTCAAGCAACAAAGAGTCTGAACCATCCAACTGAAGCAGATAATGAACCAGAGATCTTAACCAGCTCAAGTGAAGAACTTGTCACTTGTGTAGTAGAAGACTCAGAGGAGGAGGGAAATGAAAGGAATGATGGAATCGAGATTGATGTTTCTTCCTCCTCAAGGAACGACTTGTTCCTGTCAAGGCAAGTGGTTGAATCTATTGCAAACTTAAGTGATAATAGACAACATGATCTGATACTAAGCAATGAACATCCAACGCTTGATTCTAGCAAGCAGCATTGTGCCAATAGGCCTAGCTAG

Coding sequence (CDS)

ATGAAGAAGTCTTCTTTTGTTTTTCCGGAAAAGGATATTATCCAGCCGAGTAGATGTGCGTCTTTAGACATTCCATTCGAAGGAACTCAGCGGTTTGACAGCCCCTCAAAGGTTGGAGACTCGGTCATTTCCAACAGCCCAGCACCAGCCGATATTTTGAACTCTAGGCTCTCCAATTTGATTGGCGTTTCCTATTCTGAGGAAAAAGATGGATGCCAGCAGCGTATCTTTCTAGACCATCCTCTGTCTATAAAGGCCAAAGATCATGTTTCAGTTGTTGAGACACCTGCTCAACTCCACTCTCTCCCAAAAGAATTCTCAAGATTGTTGAAGCCTTATCCTAAGCATTTTTCCAGGAGAAAGAAATCTTTTCTCAAACCTCGAGTAATGGCTACCAACCCAGACTTGTTGGAAGAATGCTGCACCCGAGCTCTTGTCCCAAACCTTTCCATTAGTTATTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATGACAACGAGGACGTTTCTGCAATGTTCTTAACCTTGCCATTTTTCCCTCTAGAGCTAAGTCAGGATTGGTTCCATCCTGTGGAGGCTTCTCAAAAACGTGCACATATCCTACAATTAATTTATGAAATCATGGTCAGTAATCAAGCAGAGGAACTTTCTGGTAGCATCGGTAAATATCCTTCTTCTGATGCTGGTGACTCGGACAAAGTTTTATCCAGACATGTGTATCTTAGAGAGAGACATATTTTCACAATGACCCATTCTGACAACTTCTCGAGTATTGGAAAAGAATTTGGATCGGTGTCTAAAGTATCATCTGATACAAATTATCAACGAATAGAAAGACAATCAGGGAGTGTCATGCCACCCATTGAGGAAGATGAGAATGGTGAAATAGCTGATTCACCACAATTCTTGGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCAGCCTTCAGCTTTGGCCAACAATTTACAGAATTAGAAATATCTCGGGAGGGGAGATTTTCTAGAAACGTAAACGATACAGGTAGAACATTGTCGCCTCGGCCATTGCTTGGTGAGTCGCAAACCACATTGGAATTGAGACGAGATTGCACACCTCAAGCAACAAAGAGTCTGAACCATCCAACTGAAGCAGATAATGAACCAGAGATCTTAACCAGCTCAAGTGAAGAACTTGTCACTTGTGTAGTAGAAGACTCAGAGGAGGAGGGAAATGAAAGGAATGATGGAATCGAGATTGATGTTTCTTCCTCCTCAAGGAACGACTTGTTCCTGTCAAGGCAAGTGGTTGAATCTATTGCAAACTTAAGTGATAATAGACAACATGATCTGATACTAAGCAATGAACATCCAACGCTTGATTCTAGCAAGCAGCATTGTGCCAATAGGCCTAGCTAG

Protein sequence

MKKSSFVFPEKDIIQPSRCASLDIPFEGTQRFDSPSKVGDSVISNSPAPADILNSRLSNLIGVSYSEEKDGCQQRIFLDHPLSIKAKDHVSVVETPAQLHSLPKEFSRLLKPYPKHFSRRKKSFLKPRVMATNPDLLEECCTRALVPNLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLELSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAGDSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIEEDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTGRTLSPRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNERNDGIEIDVSSSSRNDLFLSRQVVESIANLSDNRQHDLILSNEHPTLDSSKQHCANRPS
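
The protein sequence above should correspond to the CDS read codon by codon until the stop codon. The following is a minimal sketch of that translation using the standard genetic code. The helper name `translate` is hypothetical, and the sketch assumes the CDS starts in frame 0 and contains only A, C, G, and T (any other codon is reported as `X`).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGAAGAAGTCTTCTTTTGTTTTTCCGGAAAAGGAT"))
```

Running it on the first 36 bases of the CDS gives `MKKSSFVFPEKD`, which matches the start of the listed protein sequence; passing the full CDS string is the natural consistency check.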
Homology
BLAST of HG10018823 vs. NCBI nr
Match: XP_038888439.1 (probable ubiquitin-like-specific protease 2A isoform X1 [Benincasa hispida])

HSP 1 Score: 510.0 bits (1312), Expect = 2.2e-140
Identity = 279/363 (76.86%), Positives = 290/363 (79.89%), Query Frame = 0

Query: 148 NLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE----------------------- 207
           NL  +YLCEEWKERYGDGDD+EDVSAMFLTLPFFPLE                       
Sbjct: 473 NLFQNYLCEEWKERYGDGDDDEDVSAMFLTLPFFPLELPQQENSFDCGLFLLHYVELFLE 532

Query: 208 -----------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPS 267
                            LSQ+WFHP+E S KRAHILQLIYEIMVSNQA+ELSGSIGKYPS
Sbjct: 533 GAPVNFNPLKILKFSNFLSQNWFHPMEVSLKRAHILQLIYEIMVSNQAKELSGSIGKYPS 592

Query: 268 SDAGDSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVM 327
           SDA DSD        L E HIFTMTHSDNFSS+GKEFGSV KVSSDTNYQRI RQSGSVM
Sbjct: 593 SDASDSD--------LGEAHIFTMTHSDNFSSVGKEFGSVYKVSSDTNYQRIGRQSGSVM 652

Query: 328 PPIEEDENGEIAD-SPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTG 387
           PPIEE+E GEIAD SPQ LEDR QASAVSECSSAFSFGQQFTELEIS EGRFSRN+ DTG
Sbjct: 653 PPIEEEEKGEIADESPQCLEDRHQASAVSECSSAFSFGQQFTELEISWEGRFSRNIKDTG 712

Query: 388 RTLSPRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEE 447
           RT SPRPL  ESQTT EL RDCTPQ TKSLNHPTEADNEP ILTSSSEELVTCVVEDSEE
Sbjct: 713 RTPSPRPLHRESQTTFELGRDCTPQETKSLNHPTEADNEPLILTSSSEELVTCVVEDSEE 772

Query: 448 EGNERNDGIEIDVSSSSRNDLFLSRQVVESIANLSDNRQHDLILSNEHPTLDSSKQHCAN 470
           EGNERNDGIEIDVSSSSRN+LFLS+QVVES AN SDNRQHDLILSNEHPT DSSKQHC+N
Sbjct: 773 EGNERNDGIEIDVSSSSRNNLFLSKQVVESTANSSDNRQHDLILSNEHPTFDSSKQHCSN 827
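
The percentages on an HSP statistics line are the matched counts divided by the alignment length, which includes gap columns. A small sketch, assuming the line layout shown in the hits on this page, can recompute them; the function name `parse_hsp_stats` is hypothetical and is not part of BLAST itself.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, percent)} for each 'x = n/m' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, m = int(num), int(den)
        stats[label.lower()] = (n, m, round(100 * n / m, 2))
    return stats

line = "Identity = 279/363 (76.86%), Positives = 290/363 (79.89%), Query Frame = 0"
for label, (n, m, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {n}/{m} = {pct}%")
```

Fields without a fraction, such as the query frame, are skipped by the pattern.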

BLAST of HG10018823 vs. NCBI nr
Match: XP_038888441.1 (probable ubiquitin-like-specific protease 2A isoform X3 [Benincasa hispida])

HSP 1 Score: 510.0 bits (1312), Expect = 2.2e-140
Identity = 279/363 (76.86%), Positives = 290/363 (79.89%), Query Frame = 0

Query: 148 NLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE----------------------- 207
           NL  +YLCEEWKERYGDGDD+EDVSAMFLTLPFFPLE                       
Sbjct: 420 NLFQNYLCEEWKERYGDGDDDEDVSAMFLTLPFFPLELPQQENSFDCGLFLLHYVELFLE 479

Query: 208 -----------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPS 267
                            LSQ+WFHP+E S KRAHILQLIYEIMVSNQA+ELSGSIGKYPS
Sbjct: 480 GAPVNFNPLKILKFSNFLSQNWFHPMEVSLKRAHILQLIYEIMVSNQAKELSGSIGKYPS 539

Query: 268 SDAGDSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVM 327
           SDA DSD        L E HIFTMTHSDNFSS+GKEFGSV KVSSDTNYQRI RQSGSVM
Sbjct: 540 SDASDSD--------LGEAHIFTMTHSDNFSSVGKEFGSVYKVSSDTNYQRIGRQSGSVM 599

Query: 328 PPIEEDENGEIAD-SPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTG 387
           PPIEE+E GEIAD SPQ LEDR QASAVSECSSAFSFGQQFTELEIS EGRFSRN+ DTG
Sbjct: 600 PPIEEEEKGEIADESPQCLEDRHQASAVSECSSAFSFGQQFTELEISWEGRFSRNIKDTG 659

Query: 388 RTLSPRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEE 447
           RT SPRPL  ESQTT EL RDCTPQ TKSLNHPTEADNEP ILTSSSEELVTCVVEDSEE
Sbjct: 660 RTPSPRPLHRESQTTFELGRDCTPQETKSLNHPTEADNEPLILTSSSEELVTCVVEDSEE 719

Query: 448 EGNERNDGIEIDVSSSSRNDLFLSRQVVESIANLSDNRQHDLILSNEHPTLDSSKQHCAN 470
           EGNERNDGIEIDVSSSSRN+LFLS+QVVES AN SDNRQHDLILSNEHPT DSSKQHC+N
Sbjct: 720 EGNERNDGIEIDVSSSSRNNLFLSKQVVESTANSSDNRQHDLILSNEHPTFDSSKQHCSN 774

BLAST of HG10018823 vs. NCBI nr
Match: XP_038888442.1 (uncharacterized protein LOC120078284 isoform X4 [Benincasa hispida])

HSP 1 Score: 508.8 bits (1309), Expect = 4.9e-140
Identity = 277/360 (76.94%), Positives = 289/360 (80.28%), Query Frame = 0

Query: 151 ISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE-------------------------- 210
           ++YLCEEWKERYGDGDD+EDVSAMFLTLPFFPLE                          
Sbjct: 359 VNYLCEEWKERYGDGDDDEDVSAMFLTLPFFPLELPQQENSFDCGLFLLHYVELFLEGAP 418

Query: 211 --------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDA 270
                         LSQ+WFHP+E S KRAHILQLIYEIMVSNQA+ELSGSIGKYPSSDA
Sbjct: 419 VNFNPLKILKFSNFLSQNWFHPMEVSLKRAHILQLIYEIMVSNQAKELSGSIGKYPSSDA 478

Query: 271 GDSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPI 330
            DSD        L E HIFTMTHSDNFSS+GKEFGSV KVSSDTNYQRI RQSGSVMPPI
Sbjct: 479 SDSD--------LGEAHIFTMTHSDNFSSVGKEFGSVYKVSSDTNYQRIGRQSGSVMPPI 538

Query: 331 EEDENGEIAD-SPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTGRTL 390
           EE+E GEIAD SPQ LEDR QASAVSECSSAFSFGQQFTELEIS EGRFSRN+ DTGRT 
Sbjct: 539 EEEEKGEIADESPQCLEDRHQASAVSECSSAFSFGQQFTELEISWEGRFSRNIKDTGRTP 598

Query: 391 SPRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGN 450
           SPRPL  ESQTT EL RDCTPQ TKSLNHPTEADNEP ILTSSSEELVTCVVEDSEEEGN
Sbjct: 599 SPRPLHRESQTTFELGRDCTPQETKSLNHPTEADNEPLILTSSSEELVTCVVEDSEEEGN 658

Query: 451 ERNDGIEIDVSSSSRNDLFLSRQVVESIANLSDNRQHDLILSNEHPTLDSSKQHCANRPS 470
           ERNDGIEIDVSSSSRN+LFLS+QVVES AN SDNRQHDLILSNEHPT DSSKQHC+NRPS
Sbjct: 659 ERNDGIEIDVSSSSRNNLFLSKQVVESTANSSDNRQHDLILSNEHPTFDSSKQHCSNRPS 710

BLAST of HG10018823 vs. NCBI nr
Match: XP_038888440.1 (probable ubiquitin-like-specific protease 2A isoform X2 [Benincasa hispida])

HSP 1 Score: 508.8 bits (1309), Expect = 4.9e-140
Identity = 278/359 (77.44%), Positives = 288/359 (80.22%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDGDD+EDVSAMFLTLPFFPLE                           
Sbjct: 450 SYLCEEWKERYGDGDDDEDVSAMFLTLPFFPLELPQQENSFDCGLFLLHYVELFLEGAPV 509

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQ+WFHP+E S KRAHILQLIYEIMVSNQA+ELSGSIGKYPSSDA 
Sbjct: 510 NFNPLKILKFSNFLSQNWFHPMEVSLKRAHILQLIYEIMVSNQAKELSGSIGKYPSSDAS 569

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD        L E HIFTMTHSDNFSS+GKEFGSV KVSSDTNYQRI RQSGSVMPPIE
Sbjct: 570 DSD--------LGEAHIFTMTHSDNFSSVGKEFGSVYKVSSDTNYQRIGRQSGSVMPPIE 629

Query: 332 EDENGEIAD-SPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTGRTLS 391
           E+E GEIAD SPQ LEDR QASAVSECSSAFSFGQQFTELEIS EGRFSRN+ DTGRT S
Sbjct: 630 EEEKGEIADESPQCLEDRHQASAVSECSSAFSFGQQFTELEISWEGRFSRNIKDTGRTPS 689

Query: 392 PRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNE 451
           PRPL  ESQTT EL RDCTPQ TKSLNHPTEADNEP ILTSSSEELVTCVVEDSEEEGNE
Sbjct: 690 PRPLHRESQTTFELGRDCTPQETKSLNHPTEADNEPLILTSSSEELVTCVVEDSEEEGNE 749

Query: 452 RNDGIEIDVSSSSRNDLFLSRQVVESIANLSDNRQHDLILSNEHPTLDSSKQHCANRPS 470
           RNDGIEIDVSSSSRN+LFLS+QVVES AN SDNRQHDLILSNEHPT DSSKQHC+NRPS
Sbjct: 750 RNDGIEIDVSSSSRNNLFLSKQVVESTANSSDNRQHDLILSNEHPTFDSSKQHCSNRPS 800

BLAST of HG10018823 vs. NCBI nr
Match: XP_008455146.1 (PREDICTED: probable ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo] >KAA0031447.1 putative ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 416.4 bits (1069), Expect = 3.3e-112
Identity = 237/338 (70.12%), Positives = 258/338 (76.33%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDGDD+ED+SA+FLTLPF PLE                           
Sbjct: 481 SYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 540

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQDWFHP EAS KRAHIL+LIYEIMV NQA+ELSGS+GKYPSSDA 
Sbjct: 541 NFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSVGKYPSSDAN 600

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD  LS+HV   E  IFTMTHSDNFSS+GKEFGSVSKVSSDTNYQRI  +  SVMPPIE
Sbjct: 601 DSDNDLSKHV-SGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGGRE-SVMPPIE 660

Query: 332 EDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNV-NDTGRTLS 391
           EDENGE ADSPQ L+DR QASAV E SSAFSFGQQFTELEIS EGR+S+NV  D  R  S
Sbjct: 661 EDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNVKEDMCRKPS 720

Query: 392 PRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNE 448
           PRP L E QTTL+L +D TPQATK+ NHPTEADN+ EILTSSS+EL+ CVVEDSEEEGNE
Sbjct: 721 PRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVVEDSEEEGNE 780

BLAST of HG10018823 vs. ExPASy Swiss-Prot
Match: Q8L7S0 (Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=ULP2B PE=1 SV=3)

HSP 1 Score: 53.1 bits (126), Expect = 9.6e-06
Identity = 37/118 (31.36%), Positives = 53/118 (44.92%), Query Frame = 0

Query: 142 TRALVPNLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLELSQ-------------- 201
           + A + NL  +YLCEEWKER+ +  D  D+S+ F+ L F  LEL Q              
Sbjct: 527 SHAGLKNLVQTYLCEEWKERHKETSD--DISSRFMNLRFVSLELPQQENSFDCGLFLLHY 586

Query: 202 --------------------------DWFHPVEASQKRAHILQLIYEIMVSNQAEELS 220
                                     +WF P EAS KR  I +LI+E++  N++ E+S
Sbjct: 587 LELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELL-ENRSREVS 641

BLAST of HG10018823 vs. ExPASy TrEMBL
Match: A0A5A7SPN7 (Putative ubiquitin-like-specific protease 2A isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold139G002040 PE=3 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 1.6e-112
Identity = 237/338 (70.12%), Positives = 258/338 (76.33%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDGDD+ED+SA+FLTLPF PLE                           
Sbjct: 481 SYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 540

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQDWFHP EAS KRAHIL+LIYEIMV NQA+ELSGS+GKYPSSDA 
Sbjct: 541 NFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSVGKYPSSDAN 600

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD  LS+HV   E  IFTMTHSDNFSS+GKEFGSVSKVSSDTNYQRI  +  SVMPPIE
Sbjct: 601 DSDNDLSKHV-SGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGGRE-SVMPPIE 660

Query: 332 EDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNV-NDTGRTLS 391
           EDENGE ADSPQ L+DR QASAV E SSAFSFGQQFTELEIS EGR+S+NV  D  R  S
Sbjct: 661 EDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNVKEDMCRKPS 720

Query: 392 PRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNE 448
           PRP L E QTTL+L +D TPQATK+ NHPTEADN+ EILTSSS+EL+ CVVEDSEEEGNE
Sbjct: 721 PRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVVEDSEEEGNE 780

BLAST of HG10018823 vs. ExPASy TrEMBL
Match: A0A1S3C1G7 (probable ubiquitin-like-specific protease 2A isoform X2 OS=Cucumis melo OX=3656 GN=LOC103495387 PE=3 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 1.6e-112
Identity = 237/338 (70.12%), Positives = 258/338 (76.33%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDGDD+ED+SA+FLTLPF PLE                           
Sbjct: 481 SYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 540

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQDWFHP EAS KRAHIL+LIYEIMV NQA+ELSGS+GKYPSSDA 
Sbjct: 541 NFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSVGKYPSSDAN 600

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD  LS+HV   E  IFTMTHSDNFSS+GKEFGSVSKVSSDTNYQRI  +  SVMPPIE
Sbjct: 601 DSDNDLSKHV-SGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGGRE-SVMPPIE 660

Query: 332 EDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNV-NDTGRTLS 391
           EDENGE ADSPQ L+DR QASAV E SSAFSFGQQFTELEIS EGR+S+NV  D  R  S
Sbjct: 661 EDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNVKEDMCRKPS 720

Query: 392 PRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNE 448
           PRP L E QTTL+L +D TPQATK+ NHPTEADN+ EILTSSS+EL+ CVVEDSEEEGNE
Sbjct: 721 PRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVVEDSEEEGNE 780

BLAST of HG10018823 vs. ExPASy TrEMBL
Match: A0A1S3C0Z2 (probable ubiquitin-like-specific protease 2A isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495387 PE=3 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 1.6e-112
Identity = 237/338 (70.12%), Positives = 258/338 (76.33%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDGDD+ED+SA+FLTLPF PLE                           
Sbjct: 492 SYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 551

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQDWFHP EAS KRAHIL+LIYEIMV NQA+ELSGS+GKYPSSDA 
Sbjct: 552 NFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSVGKYPSSDAN 611

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD  LS+HV   E  IFTMTHSDNFSS+GKEFGSVSKVSSDTNYQRI  +  SVMPPIE
Sbjct: 612 DSDNDLSKHV-SGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGGRE-SVMPPIE 671

Query: 332 EDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNV-NDTGRTLS 391
           EDENGE ADSPQ L+DR QASAV E SSAFSFGQQFTELEIS EGR+S+NV  D  R  S
Sbjct: 672 EDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNVKEDMCRKPS 731

Query: 392 PRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNE 448
           PRP L E QTTL+L +D TPQATK+ NHPTEADN+ EILTSSS+EL+ CVVEDSEEEGNE
Sbjct: 732 PRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVVEDSEEEGNE 791

BLAST of HG10018823 vs. ExPASy TrEMBL
Match: A0A0A0K633 (ULP_PROTEASE domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G065160 PE=3 SV=1)

HSP 1 Score: 358.6 bits (919), Expect = 3.9e-95
Identity = 215/339 (63.42%), Positives = 231/339 (68.14%), Query Frame = 0

Query: 152 SYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE--------------------------- 211
           SYLCEEWKERYGDG D +D+SA+FLTLPF PLE                           
Sbjct: 473 SYLCEEWKERYGDG-DYKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 532

Query: 212 -------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPSSDAG 271
                        LSQDWFHP EAS KRAHIL+LIYEIM  NQA+ELSGSIGKYPSSDA 
Sbjct: 533 NFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMACNQAKELSGSIGKYPSSDAN 592

Query: 272 DSDKVLSRHVYLRERHIFTMTHSDNFSSIGKEFGSVSKVSSDTNYQRIERQSGSVMPPIE 331
           DSD  LS+HV   + HIFTMTHSDNFSS+GKE GSVSKVSSDTNYQ I R   SVMPPIE
Sbjct: 593 DSDNDLSKHV-SGQAHIFTMTHSDNFSSVGKEVGSVSKVSSDTNYQPIGRWE-SVMPPIE 652

Query: 332 EDENGEIADSPQFLEDRPQASAVSECSSAFSFGQQFTELEISREGRFSRNVNDTGRTLSP 391
           EDENGE ADSPQ LEDRPQAS VSECSSAFSFGQQFTELEI  EGR+S+NV +  R  SP
Sbjct: 653 EDENGERADSPQCLEDRPQASTVSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSP 712

Query: 392 RPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEILTSSSEELVTCVVEDSEEEGNER 447
           R  L E QT LEL                    +PEILTSSS+EL+ CVVEDSEEEGNER
Sbjct: 713 RLSLHELQTPLEL-------------------GQPEILTSSSDELINCVVEDSEEEGNER 772

BLAST of HG10018823 vs. ExPASy TrEMBL
Match: A0A6J1CM38 (probable ubiquitin-like-specific protease 2A isoform X6 OS=Momordica charantia OX=3673 GN=LOC111012217 PE=3 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 1.2e-91
Identity = 219/401 (54.61%), Positives = 243/401 (60.60%), Query Frame = 0

Query: 148 NLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLE----------------------- 207
           NL  +YLCEEWKER+GDGD    +SA FL L F PLE                       
Sbjct: 470 NLFQNYLCEEWKERHGDGD--VGISAKFLALQFVPLELPQQENSFDCGLFLLHYVERFLE 529

Query: 208 -----------------LSQDWFHPVEASQKRAHILQLIYEIMVSNQAEELSGSIGKYPS 267
                            L+QDWF P+EAS KRAHILQLIY+IMV++  +E SGSIGKYPS
Sbjct: 530 GAPVNFSPFKISRFSNFLNQDWFPPIEASLKRAHILQLIYDIMVNDHGKEFSGSIGKYPS 589

Query: 268 SDAGDSDKVLSRHVYLRERHIFTMTHSDNFSSIGK------------------EFGSVSK 327
           S  G SD  LSRHVYL E H  T T SD F+S  K                  E G VSK
Sbjct: 590 SIVGHSDSDLSRHVYLEEVHTSTTTCSDKFTSGEKEKENEMSTLLACPPKRFRELGLVSK 649

Query: 328 VSSDTNYQRIERQSGSVMPPIEEDENGEIADSPQFLEDRPQASA-VSECSSAFSFGQQFT 387
           VSSDTNYQ+I  QS SVM PIEEDENGEI+DSPQ LEDR  ASA VSECSSA SFGQQF 
Sbjct: 650 VSSDTNYQQIGGQSRSVMSPIEEDENGEISDSPQCLEDRNHASAIVSECSSASSFGQQFR 709

Query: 388 ELEISREGRFSRNVNDTGRTLSPRPLLGESQTTLELRRDCTPQATKSLNHPTEADNEPEI 447
           ELEIS EGRFSRN  D  R  S +P LGES T     RD +PQA K L+HPTEAD EPE 
Sbjct: 710 ELEISTEGRFSRNAKDKDRRPSSQPSLGESHTISVSGRDYSPQANKRLDHPTEAD-EPET 769

Query: 448 LTSSSEELVTCVVEDSEEE---------------------GNERNDGIEIDVSSSSRNDL 469
           LT+SSEEL TCVVEDSEEE                     GN  +D  EI++SSS +ND 
Sbjct: 770 LTTSSEELATCVVEDSEEEYIVEDSEEADEMYNGIEVVQSGNSMHDSKEIEISSSLKNDS 829

BLAST of HG10018823 vs. TAIR 10
Match: AT1G09730.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 53.1 bits (126), Expect = 6.8e-07
Identity = 37/118 (31.36%), Positives = 53/118 (44.92%), Query Frame = 0

Query: 142 TRALVPNLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLELSQ-------------- 201
           + A + NL  +YLCEEWKER+ +  D  D+S+ F+ L F  LEL Q              
Sbjct: 559 SHAGLKNLVQTYLCEEWKERHKETSD--DISSRFMNLRFVSLELPQQENSFDCGLFLLHY 618

Query: 202 --------------------------DWFHPVEASQKRAHILQLIYEIMVSNQAEELS 220
                                     +WF P EAS KR  I +LI+E++  N++ E+S
Sbjct: 619 LELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELL-ENRSREVS 673

BLAST of HG10018823 vs. TAIR 10
Match: AT1G09730.2 (Cysteine proteinases superfamily protein)

HSP 1 Score: 53.1 bits (126), Expect = 6.8e-07
Identity = 37/118 (31.36%), Positives = 53/118 (44.92%), Query Frame = 0

Query: 142 TRALVPNLSISYLCEEWKERYGDGDDNEDVSAMFLTLPFFPLELSQ-------------- 201
           + A + NL  +YLCEEWKER+ +  D  D+S+ F+ L F  LEL Q              
Sbjct: 527 SHAGLKNLVQTYLCEEWKERHKETSD--DISSRFMNLRFVSLELPQQENSFDCGLFLLHY 586

Query: 202 --------------------------DWFHPVEASQKRAHILQLIYEIMVSNQAEELS 220
                                     +WF P EAS KR  I +LI+E++  N++ E+S
Sbjct: 587 LELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELL-ENRSREVS 641

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038888439.1  2.2e-140  76.86         probable ubiquitin-like-specific protease 2A isoform X1 [Benincasa hispida]
XP_038888441.1  2.2e-140  76.86         probable ubiquitin-like-specific protease 2A isoform X3 [Benincasa hispida]
XP_038888442.1  4.9e-140  76.94         uncharacterized protein LOC120078284 isoform X4 [Benincasa hispida]
XP_038888440.1  4.9e-140  77.44         probable ubiquitin-like-specific protease 2A isoform X2 [Benincasa hispida]
XP_008455146.1  3.3e-112  70.12         PREDICTED: probable ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo...
Match Name  E-value  Identity (%)  Description
Q8L7S0      9.6e-06  31.36         Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=...
Match Name  E-value   Identity (%)  Description
A0A5A7SPN7  1.6e-112  70.12         Putative ubiquitin-like-specific protease 2A isoform X2 OS=Cucumis melo var. mak...
A0A1S3C1G7  1.6e-112  70.12         probable ubiquitin-like-specific protease 2A isoform X2 OS=Cucumis melo OX=3656 ...
A0A1S3C0Z2  1.6e-112  70.12         probable ubiquitin-like-specific protease 2A isoform X1 OS=Cucumis melo OX=3656 ...
A0A0A0K633  3.9e-95   63.42         ULP_PROTEASE domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G06516...
A0A6J1CM38  1.2e-91   54.61         probable ubiquitin-like-specific protease 2A isoform X6 OS=Momordica charantia O...
Match Name   E-value  Identity (%)  Description
AT1G09730.1  6.8e-07  31.36         Cysteine proteinases superfamily protein
AT1G09730.2  6.8e-07  31.36         Cysteine proteinases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term    Source Description                           Alignment
None      No IPR available  PANTHER  PTHR47764:SF5  UBIQUITIN-LIKE-SPECIFIC PROTEASE 2A-RELATED  coord: 185..280
None      No IPR available  PANTHER  PTHR47764      UBIQUITIN-LIKE-SPECIFIC PROTEASE 2B-RELATED  coord: 148..188
None      No IPR available  PANTHER  PTHR47764      UBIQUITIN-LIKE-SPECIFIC PROTEASE 2B-RELATED  coord: 185..280
None      No IPR available  PANTHER  PTHR47764:SF5  UBIQUITIN-LIKE-SPECIFIC PROTEASE 2A-RELATED  coord: 148..188
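The PANTHER hits above fall in two overlapping coordinate ranges (148..188 and 185..280). A short Python sketch, assuming these are 1-based inclusive residue positions on the predicted protein (the page does not state the convention), merges them into a single annotated span. The helper name `merge_intervals` is illustrative only.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or abuts the previous range: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


# Alignment coordinates copied from the four PANTHER rows above.
coords = [(185, 280), (148, 188), (185, 280), (148, 188)]

merged = merge_intervals(coords)
# Inclusive coordinates, so each span covers end - start + 1 residues.
covered = sum(end - start + 1 for start, end in merged)

print(merged)   # [(148, 280)]
print(covered)  # 133
```

Under that assumption the family and subfamily hits together annotate one contiguous region of 133 residues.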

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10018823.1  HG10018823.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0008233      peptidase activity