HG10007296 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007296
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Thiol protease aleurain-like
Location: Chr10: 3417902 .. 3430985 (-)
RNA-Seq Expression: HG10007296
Synteny: HG10007296
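
The Location value packs the chromosome, start and end coordinates, and strand into one string. A minimal sketch of how it could be unpacked follows. The helper name `parse_location` is hypothetical (not part of the database), and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr10: 3417902 .. 3430985 (-)'.

    Hypothetical helper; assumes 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive coordinates: both end points count toward the span.
        "length": end - start + 1,
    }

loc = parse_location("Chr10: 3417902 .. 3430985 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 13084 bp on Chr10, and the (-) marker places it on the reverse strand.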
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCCTCCACGGCTGTTCTTCGTTTCCTCTGTTCTTCTGGTGCTTTGCTGTGCAGTTGCCGGATCCGTCTTCGATGATTCGAATCCCATTCGGATGGTCTCTGACCGTCTTCGTGAGTTGGAGTTGGAGGTCGTTCGAGTCATCGGTCAAGTTCCTCACGCTCTCCGATTCGCTCGATTCGCTCACAGGTTTGTTCGTTCGTTCGATCGTGTCCCTGATTGATTTCTAGTCGTTTCGCTTCTTTTTCTCTCGCTATCTGGTTTTTGTACTGAGGTACTAGTTGTGGATTGTTTCAGGTATGGGAAGAAGTACGAGACGGCGGAGGAGATGAAAATCCGATTTGGAATTTTCTTGGAGAGTTTGGAACTGATCAAATCGACTAATAGACAAGGCCTTTCTTACAAGCTTGGTGTTAATCGTAAGTCATGTGATTGATGTCTGATTTTAAATAGTTCACTTGAATTGTAGGATTATGAGAACTGGTCGATTGATTCTTTGTGAAAATAATCCGATTGTTTGCACTTCGGATTTGGATGACTATCTGATTCTACTGATTGATCGAGTCATCGAGAAGAATATCGTTTAGCGTAATTAGTATAGAAAGTCTGCAAAAAAGATGACGTTGAATCGATGGCGGTGTTTGTATTCTCAATTGCTGCAGAATTTGCGGATTGGACGTGGGAAGAGTTCAAGAAACACAGGCTAGGAGCCGCTCAAAACTGCTCTGCCACCACAAAGGGCAACCACAAACTTACTGATGTTGTTCTTCCTGAATCGGTACATCCTTGCATTTACTCTCGACAAGTTGAATCCTCATTTTCCTGATATTTTATTTGAACAGACTGGAGAGTGTCAGCGTCTATTGAATTCTGACCCTAATGATTCTGAGTCAAATTTTGATTTGAATCTGTGACAAACTTCCTTCTTGTTTTTGGTTCTACTTCTTCCCCTTTTCATCCCACATTTTATCAGCAATTTGGGATGAACATAATCAAATGCGTTCATCGTCTTATTGCCGAAAAGGTTATTTAAAACTAATTTAAAAGATGGCTTTACTAGATGAAAGGGTATCCCTGAGTAGAAGTGAGTGCTTGCTCTATTTCGTTTTTAAGGCCCTCCTTTTGTCAAATCATGAGCTACTTAGTATGGAACACTAATTTCATATCATACAATTGATCAGATGAAACTTTGTATTCAAATTGTGTTTAGTTTAGAAGTACACGCATTCAAACAAATTTGTGATGCTGAGACTTTGTTTACAACCTTCCTACGGCCGTTTTGCAGAAAGATTGGAGGGAAGATGGCATAGTTAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCTTGCTGGACATTCAGGTTAGCTATATCTTCTGTACTTGGTCAAGGACGGAAGATAGGGATAAGGTTGCTCATTGCTACTCAATATTTATTCATATAATGGTTTTTTTCCCTTTCTCTTTTGCTGTTGCAGTACAACTGGAGCGCTTGAAGCGGCCTATGCACAAGCACATGGAAAGGGTATCTCCCTGTCCGAGCAGCAGCTGGTGGATTGTGCAGGTGCTTTTAACAACTTTGGCTGCAATGGTGGACTGCCTTCCCAAGCTTTTGAATACATCAAGTACAATGGTGGCCTTGACACTGAAACAGCATATCCTTACACTGGAAAAAACGGCCAATGCAAATTCTTATCTGAGAATGTTGGAGTACAAGTCATTGATTCTGTGAACATAACTCTTGTAAGAGCCAAGGACTAATTCCTTTACTCTTCTTTTTGATTCCTCCCGGCAGAGTTGCCATATTTTACTCCTGATTGTGATGAAGTATCCAAAATTCGTTTCACGATCATCTTCTAATTGTTGGGGGGGTTATATGCAGGGTGCTGAAGATGAATTGAAGCATGCAGTTGCTTTTGTTCGACCAGTAAGCGTAGCATTTGAGGTGGTTAGCGGTTTTCGCTTGTATTCAAAAGGAGTTTACACCAGTAACTCATGCGGC
AGCACTCCTATGGTTAGCATCAACACCTTTTGGCTTTCTAAAAATTCTGACATTATTGTTCATCATCTTCAGTCAGCAGTGGCATGATAACGTGATAAGATCATGATGTCTGACAGTTTCTTGTTTCACTGGCAGGATGTAAACCACGCCGTGCTTGCAGTTGGTTATGGGGTCGAAGATGGTATCCCATACTGGCTTATAAAGAACTCATGGGGAGGAAACTGGGGTGACAAGGGCTACTTCAAGATGGAGCTGGGCAAGAACATGTGTGGTAAGTTTTCTTCACATAGTAATACTAGCGTAAAATATGTGTTCAATTGGAGTTATTATCTAGCGAGGCTGTCATAATGTTATCTGATTTTATATCTTTCATAACTAATTAGTTTTTGAGATGAAACTCATGTTATCTAATATGGTATCAGAGTCCAAACGAGCATTCAAATCTAAAAAGAAAATTGAGATCCAATCCAAGATTTGTGAACTCAGAGAAGTACCATCTCTAAGGGGCATGTTGAGGATTCTATATTGGAGAAATCAAGAAACCTCACAATCTTTTTAAGATAGATGAACTACTTTTCTCATTGCCAATTGATTTTAAGATGACAAAAATTGGGTTCCAATCCAAGAATAGTAAATTCATACATGAACTATTTTTGTCATAGCCAATTAATTTTGAGATGAAACTCTATGTTATCTAATAAATGCGACTACCTTGTAATATAACTATTAGACATAGATTCAAATTTGTATGTTTTATAACAACCACGAGATATTGATTCAAAAAATGTCCAACGGTTTCTTTCTTGTAAATTGAACAAGTGTAAAACTACGAGCTACATCAATAATGTTTCCTGCTAATTAAGTAGTTAGGTATTCTTCATAGATTTTTTTTCCTCTCTGTACTGATAAAGAAGTTTGGGCTTTTAGGTGTTGCAACTTGTGCTTCATACCCCATTGTTGCTTAGAGTATGAGAATGGTAAAGCTTCTTGCAAGGAAAGATACTGTGGTTTTCAGGTACGGCTATGGTGGCTAGAATGATGTGCAAATTAAAGTTACATATTCTTGTAGTTGATCATTTAATGCAGACTGTAACGTAATGTACTTGTGGTTGTAAATTTGCATACTCAGAGATGCTTGGATAAGTTTTTATAATGTACTTGCTGGTAGCTCTCTTGAGGTTATGTAAATTTACTGGCAGCTCCTCTATTAATTTCTATACTATGTGCCACTGTTTAGATATTAGAACCTTTCAATAAAAATATGCTTTTTTTTTTTTTTCTAATTTATTATTATTATTATTATTATTATTATTATTTTAAGGAGAATTTTTAAAAATAGAAAAATAAGAGAAACTATTTACATAAAATAGCAAAATTTTTAGATAGTTGTGATAGACGTGGATAGAAGTCTATTAGGGTCAGTGATAGAAATGATAGAAGTCTATCATCGATGGAGTCTATTAGTATTTTTTTTTGCTATTTTTTTGTAAATAGTTTGAAATTTTTTCTATTTGTGAAAATTTTTCTTATTTTAATATAAGTTTTAAACTAGTTAGGTTAAAAGAGACCTTTGGTTCATTGGTAACTACTTAATCTTGTACTTTTAAATTTGTAACAATTTAGTTTTAAACGTTAATAAATAATGATTTAGTTCTTGTATTTTACAATTTCTAACGATTTGGTTCTTAATGTGAAAAATATTATTGAGATTTGATGAAATAATACATTTAGATCTATAAATTGCTATCAAATGTTTTATAAAACAAAAAAAACTAAGTCGTTATATAATAGAAGTTGACACTTAATTTTGAAGAGATCTTTCACGATAGAGATTGAATTGTTACAAATTTGAATTATAGGGACTAAATTTGTATCCCTCATTCCGATAAATTTAAAACGAAGACTAAATCGTTACCGACTAAAGTTTAAGGACTAAATTATTATTTCTATAAAAGTTTAAGGACCAAAAATGTTTTTGAACCTTTTAATTTTATTGTTTC
TATATTAATTAAGTCTTAGTTAAACCACCATTTTGGTCCTACTTTGGGTTTTACATTTCTATGAGCAATCAATATTTTTATTATATTGTTAAAAATTTTGAAAAATAGAAAAAATAAATAATAATATGTCACTAATTAAAAAGCATAAAATTTTTATTTATTAATTTAAATATTTTATTTTTATAATCTCGTTGTGTAAGACTAAGATCTCAATATTTTTTGGACATCAACATTTTAAACCTTACTGGTACCACAAAACTTACCAATGTTGTTCTTCCTGAATTGGTAAATCATTACATCTACTCACCACAACTTCGATCAGTATGGTTTCAATAGTTAATTTTTTTAAAAATAGTGAATTTTTAGCGTGTAATAGATAAATTTTATATAGTTTTGTTTTAATTATCACAAACTTTTAAAAGTTAGATAACCATTTTTATCATAATATTTTCATATTATCGCATCAGATCTTGAATATTTGCGTCTTACACTAATTGCAATTTCGATCTCGAATTTAAGTTATGAAACATATAAGTTTATTTCGTAGACTAAATATTAGAAAAGTTACATTCATTGTCTTTCAAATGAATACTTACAATCGAAAAGTCGGATCCATTGACAAAGGGCAACATATAAGTTTACTAAAAATGTAAGAGCAAAATTGGTATATAAAGATTTTAATCCCCTCAAAACCCCATTTATGTGTTGTATAGAATGTAGATTATGATTATTTTATGATTCAACATGAGGAGTTAGCTAAAGTGTTGACCTTATATGAAGGCATCTCCATAGTCATTGAATCAAGGTTACACCCTCTTTGAGATGAGACCGACTCCTCCATCCAATGAAAGCTCTTTTTGAGCTCTATTACTATGAAAATGAGTCCCTTAGTTTAACACATAATTATAGGAAAAAAGTACCTTGTTAGTCTCCAAATTTTATGAAACATGTATATTTGGTCTCTGAATTTAAGATCTAAAAGCAATTTTTGTTATTATTTTAAATAATAATAATTTTTTTTTTTTAAAAAAAAACTTCAAATAATCATTCCTTCTTTTCCTTCCCATTTTTCTCCTCTCTCTCCTACCTCCATCATCGACCCTTGTTTCCTTTTCGTCTTACATACCATCCTCTCTTCTTCCTTAATACTTAATAAAAAGTGATAATTAAAAATAATATTGAGAAATTTTTAAAAATAAAAATATAAGAAAAATTATTTACACAAAATAACAAAATTTTTAGATAGTTTTGGTAAACGCGGATAAAAGTCTATAGGATCTATCAGCGATAAAAATGATAGAAGTCTATCACTTTAAAACTAAATGTCATATCATTATTCGAGTTTGGAAGAATATGTACATCTACTAAAATCACTTTTGTCATTTTCAAAATATCATGAAATATGTTTTTAATTGGGTTGTTTTCAAATATAGAAAAATGAGTCAAACTATTTACAAAAATAGAAAAATTATTTTACGGTCTATCAGTGATAGAATTTATTTGTTTGAGCGATATAATTCTATCGCGGTGTATCACTGATAGGCAGTGAAATTTTTCATATTTGTAAATAGTTTGATATTTTTTCTGTTTATAGTAATTTCCCTTTTTAATTCTTCAAAACTAATTTTGATGATTAAAAATTATATTTAAAAGTATAAAATTAAAAATTAAATTAATTTCTTTAGTTATTTTTAAAATCACTTCTAAAACATGATTTAATCCGGAGTTATAAAGTAACTTATAAGTTACTCGTCATTTTCTACTAAAAAAAATGAAAAAGAAAAAAGCTATAACTCGTCATTCATTTTCATTCCCCTACGACAGAAAATACTCCTATGCTGAAATTAGTTTCTCTCCGGAAAGTCACCGATCATCGGAATCGGAGAAGCCAAATTGACGGTTTCGGAAGGCTTCTTCAGCGCAATTCAGCATCCAAGTCATTTTTCATCTCCGTTTAAGCCTCTACATTTCAAACATGGCAGCGGAGAGGCACGCCTCTT
CGCGCGCAACATCATCTGAAGACAACGCGATGTAAAATTCTCCACATATTTCATATGTTTGATTGCGTTCATTCTGTTTGTTTCAATTGTCCGATTGTGTTTGATTTCAGACTCTGCCATTTCTATGTTTTTTGAATTTTCATTTCTAGTTATTCCTCTGTAGTCTTCGTGATTTCAGTACGAATCCGTCTGTCGAATGTTTAATTGACTTGCCTGAATGTTTTGTACTGTCTAGCTTCCTTCGGTGTTGATAATTCCTTCATTCGACTGGTACAGGTTTCTTGATATACTGCATGAGGCACCGTTATTTGGTCATCGGAAGCCTGCAAGAACAGTTGGGAGCATAATTTATTGTTTTGTTTTGGCAAGTATACTCTCCATCTAAGATGAACCATAATCCAGTCCTAATGCTAGTTAGGAAAAGTTTTATAGTTCTTGAATTGGCTGGCTTACTGTATAACTTCGTTTTGAAAGGTTGAAACGATTGAACTTCTCTCATAATTTAGGTAGGCTATGCTGCTCTAGCTATTGGAGCGCCATGGATTTTTCATCCTATAAAGCACTTGGTTGAACCATTGCTCTGCAGTTGTGATGTTGTTCTCTTGATGCTCACAGGTATTCTTTTAAACTCCAGAGCTATGCTTATTAAGATAAATAGAAATCAATTATTTTCTGGCCAGCAAGGTACTGGAGAATTTTATTTTCTGCTACTCGAGTAGCCAGTGTACTTTTCATCATTGGATGGTTTGAAATTTTAATTCAGTTCATTCTTAGGACTACAAGCTATTGAACTCAATCATTTAAGCACAACTTTACTCATCTTGACATATCAAAAGGCTTAGCTGAATAATTCTACCAAAGAGGTATGATTATGAAATGATCTTTATGCTTTAGAAATTAAACCGCTGTCTCTATTATTTCAGGCATCTTTCAGCAATATCTAGTATACCAAGTCCACAAAATTCGTTTGCAGGTACTTATATTTAGTTAGACAAAGCTTCTAAATAGTTTCTTTATGGAAGGAATTTTTTTGCTTCCTGTTTCCTTTGCAAGTTTGAAGAGGTTGAAGATTTTAGATATTGAACTTGCAATATGAACTAAAAGTATTGGTTGTTATGGTGATGAAAAGAAAATGTTTTCATGAGCCATGACTCATTGTTTTTTTTTTTATTGTACGTTTGCAATCTTGTTAGATTAGGTCTTTCCATTGCAATGAATTTTCTTGAAAGAAACGAGCTTTACTTGGACAAGATCTATTCTATTGGTTGTACAACTAAGTCCTTGAGTTTTAATGATTCTTTTTTAATGAATTCTGTATTTAGGTCAATCAGTTTCTTTTCAGTCTGGTGCTTGCATTTCATGTTCAGTGGTTACAATGCAATTGAATGGATTATCTCTCTTTTTTATTTGTAGGGATATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGTATGTTAGATAGTTGTGTTGATCCAGTCTTTATAGATCTTCACATTTTAGTGCTGTTGCTATATGCCAGTTGGCAATGAAGCCTTTTTACCTTTTTCTGCAATTATGGATCCCAATCTTAAATAATGGTTCAATATCAGGAACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAAATCAGTGCACTTTCGATCCCCATAATTTTAAGGTAGTAAGGTTCTACAACATGTGAATGCTGAGTCTAATAGTTTTTCCAATTAACTGTTTTTCCTTTGTTTCATTTTTCCCTAGTTCCCTTCATCTTCACCATCCATTACTGATCTCTTCTTATTGTGATCATTTATCTGATCACCTAAAGATGTTCTATTTATATTTTCTGTATTTTATTGCATCCATGGATGTTCCAGTCAAATGTCCTACCTTAATTTTCTTTTTTTTGTAGATTGCATTAGGAACTATAGATACCAAACGTGCATATCATTTTCTACCTTATTTTCATTTTGTTGCATATGGAT
AGTATCGTTAAAAGCTTCTCCCTTCAAAGATTACTCCATAGTGTACGTGCTGAATCCATGTTCCCATCTTTACCTAGCTGTACGTTACTGACTTAATGTGGTCCTTGATGCACCTTTTATATCTGCAGGTTGATTATGCTAATTGAAGCGGTGTGCGCTGGATCGTTTATGATTATATATATCAGTAAGTTTTTTGTATTCATGGGATGGGATCTCTTCAGGAGTCAATTGATACCTTGTGTTTTCGTTTATTCAGTAAGTTTTTTGTATTCATGGGATGGGATCTCTTCAGGAGTCAATTGATACCTTGTGTTTTCGTTTATTGATTGGAATTGTTTTGTTCTTTTAAATATTCTCTATTATTGAATTATTTACTATGTTTTACTCAATTACTTGGGCCGGTACATTATTTTCATTGTGATTCTAGAAACTGTGATTTTGTAATTGGAAGGGTAACATGTTATTGGATATAGATGGCTACATAGGGTACTGTATTGTGTTGCTCACAAGTCCCAAGCTTCATCCAGTTGACTGAAACATAACTAATTCTTAAAGCTTTTCTGAAACAGATATTTCCAGTCTTTAAATCATATTTCCTTGTATTTATTCATCTTTTTGTTTTATGTGCCCCAAAAACGTTGTGAATAATATTGAAATGCATGGAATGCAGGCTACATACAGAAGTACAATTCATTAAATTCTCAGCCTGATGTTCTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATCTAAGGTCAAGTTATTAAACTTATCACAATACGCTGCTCTTTAGTCTGTATCTAATGCTAGATGGTCACTGTCTTAGAATTCTTCTTGTTCAAATACATTTTCACCATATTTATCTAATTGGTTGAGTTTTTTTTGGAAAGTTACATTTTACTCCAAATAATATCAAGCTGAAATCTACAACTTTTCCTCTCTATTTTTTTTTTAAAAACTTCTCCATCTGTTTTTTCCCCCCATTTCTTTCTAAGAACGAACTCCATTAAAAAAGAATAACTCCAAGTTAATAATTACAAAAAGAATAACACCGAGTTAATAACTACAAAAAGTTTGATTAACGGATGCCCATAAAGAAGCACTAAACCTCACAACCTCCCCCACCTCCTCCAAAGACTTGTCTAACCCTTTAAAAGTTCTAGCCGCATTCCCCAGAATAGCACAAAAGCAAGCTTGCCACAAGATTTTTCTTTATCTTGAAAGGGAGAATTCAACAGAAACTCCTCCATCAAAGAACGACAGTTCCTATTATGCACCACCACCACACCAAATGCCTGAAAAAACTGGCTCCAAAGGGAATAAGTAAATTGGCAACCCCCACAAAAGATGATCCAGATTCTCCTTCTGCTAACAAGGGACACACTACGGTGGACAGAAAACAAATGATGAATTTCATTGAATATGATCTCTTGGTGTTCACTCTACCATTTAAAACTTGCCAAGCAAAGAGCCTCACTTTCCTAGAAACTTCCCCATTCCAAAGAGGGGGGAAAATGAGGTTTCCTGAGTGTTAGAAGATCCCCTTAGAATGAGGAAAAAAGAGTGACACGGGAAATCCGCAAACAGATTAGGAGCTGAAACCTGAACATCCCTTCTCCCCACGAAAAGTCTCTAAACAGAGATAGTGAAAAGAAGGCCCACCATCTCCAGTGACTTACAATCAGTGAGGGGACAATGAAAACTCAAAGGCCTTCAGAATTTCTTCTTGTTCTGTTTTGATAAAGTTGCCTTTCACAATGTAAATGTTTGATTTCTTTCTCCATAAAAGGGCAACATAATTTATATCTGCATTGATGAAATTACGAAACTAAGAACGAAGAAGCCAAGATTAAAGGTTAATGAACAGTATATTTGTGTGGTAGTTATTGAAGTACAAATTCCCTATGGTTGTATTTTGAAATTCTGTAAATATCAAGCTAACATCCATAGTCTTTCCTGCTTTTGTGTGATCAATTTGTCTTTATAA
TGTAAAGTTGTGTCTATCTCTGCAAAAGAGTGGCAAGATTTAGATTTGCAATGAGAAAGCAAAGAATGAAGCTGAGGTTCATGAACCAAAAAAAAAATGTTTTGTTAGTTATTTAAACCATTTGATGGCAACCCATGAGGGAGGACAACAGTTTCTCAAAAAAAAAAAAAAACAAAACAAAAGGGAGGACAACAGCTAGGAGGAAAAAAAGAGTTTTGTATAGATTTACTGCTTGTGTGACCAAACTTTAGTTCTGCCTAGATCTTTTTGAATGATGCTCAAGATTGGTTGACTTTCAATATCCAAATGTTCTAGTTTAATCACATGCAGTATGTGGCTCCTACGAAAACCATGTCCCTTCAAATGGACACATTATTTTATTCACACTATTTAAAATAAGAATTCTTTGGTGGCACAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAATCTTCATTTTCTGAATGAGGAGGTATATAATATCCTCACATACATACTTGGTTTGGTACACAGCCGCTTCATTTAGTCATGCTCTTTCCTCAATTCCTCTTGAAACTTTGAAGGAGCTAAATCAATTGCTAAGGTAGATATAGATATTTAGTACCAATTAGCTGTAACCTTCGTTCCTGTTCATCTCAGATTCTTCGGTTGCAAGAGTGCTTAAGTAAATATGAACGGTCCAGTGATGGAAGCACGCCTCAGGTGATCATTTGAATTTGATGTTCGACATTCTTTTTGATCATTTATATAATCAAAGTTTGTACTGATTTGTAAGATTTCATTCATATGTGTGAAGTTAAGAAACCTCACTTTCTAAACAAAGCAGGACACGGACATGTTGAAACATGTTTGTTCTTAGAACATATGTTTATTATTATAGGGACATAAAAAGAATCGTTAAAAGGTATATAGTGTGTCAAGTGGGAGCACATTGGAGTATGATTTGAAAAAAGTGTAAGAGCATTGGTCGATCGAAGATTAATTCCATTATACAGGATTTAAAAGTCAATATGCCTAATTATTTAGCACCTTTAAAAGTACTCATGGATAGTTTTGGTTGGTTTATAATTCCTGAATGTTGGACTATGCGTATTTTTAATTTGGTGTTTGTGTGGAATAACGTTTTAGACAATATCGTCATTGGATGTTGAAGCTGGTTTTGTTTTGTTTTGTTTGTTTGTTTTTTTTTTTTTTTTTATATATATGTAATTATTATTATTATTTTAGTACTCCTGGTTTTATTATCTACTTCATTGGAGTATATGAAATGAAAACTATCTCATATGATAGATAGTCAATGATGAAGGAATAGGTCAAGGAGTGTAATATTATTGAATGAAAATGGCTAAAAAAAAAGTGGCTGCAGTTACCAATGGTTCCTTTTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGTGATCAGGAATTGAGGACACTTTCGGCTGAGGTATGTCTTACATACAGTACTGTGGATATGTTTTGATTGGATTTTTAATAACATCCAACTTATGTGCATCTTAATTTTTTGTCATTAGATGTTGGTTCCTACTTATAAATTTGTATATCATAGGTAAGCCACCACTAGGATGTGAATTCATGTATTCTAAGAATTTTGGTTTTACCGTCTCTGTCTTTTGATCACTCCTGGGTGTGTGTGTGTGTATATTAACTGAAACTGAACCTTCGTTGTCATGCAAAAAAATATACAAGGACATACAAAAAAAACAAGTCAAGCAAAAGGGAACCCCAAAAAACAACCTACATAAAGTCTAAAAAGACACCTATTCAACAAAATCAAACCAAGCTTATAATTACAAAATGGCCTAGTGACCAACATCTCCCAAGGATACATTAAACCTCTCTGCCTTCCAAACCTTGTCACACGACCTCTCCAGTCTCCACTGTCAAAAAATCTATTGTTTCTCTCGAGCTACATGCTCCACCAGATGGTGGAAAAAAACAAGATTGC
CATGAAATTGTGCCCTTCTCTTGAAAAGGTGAACTAACGAGTACCTCCTCAATCATATTATAGCATACCAATCACTATTCCATCAAACATATATCGAAGGAGTTCAACCACTGAGACTGGAGAGTCTGAACAAATTGACGGCTCCACAACAAACTATGCAAGTTCTTTGTCTGAGCATACAAAGGATACACCATTGCGGGAGCACCACAAAGGAAGAATGCCATGAACACGATCCATAGTATTGACCCTCTCATGTAAAACTTGCCACCCAAAAAAATTTACCTTTGTAGGCATTAACCTTCTAAAGTGAGCAGAAAGCAGAGGCAGTTCGGGTGGGGAAGGGATAAACAACAAAATGGAAGAAAGAATTACAAGAGAAGCCCCTAGAAAAATATGCGGTCTAAAACCTAAAGTCCCTCCTCCCTTGAAACAAACGCTGATCATGGAGAATAGAAAGGAGACCCGCCACATCTAACAAGTAACACCTCTCTATCAGAAAGATGGTGCTTGAAGCCTGATATCTTGTCAACTTCTGAAATTACCTTGATCGCACATGCATTCAAAATTTCCATAAATAACATGAATTATATGTTACAGATGAATCAGGTGACATCAGAACTTGGGCTTGCTCGATCGGTAATAGCTGAGAGGGATACTGAGATTCAGAAATTACTCACCACCAACAAGCAGGTACTTGTTTTATCCAAACCATAGTCCTCACACATGATCTTTTACCTCGAATTTAACACCTTGCTTACTTTAACTTTTCTTTTACATTGACTATCCTTAAAATAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGTACACGGGCAGCAAAGGTATCACTGCTTTTTCTATTTGCTGCGTTTGATTGAAAAATAATATCTTCAAATGGAATGGAAGCACTCAACTTTTTCACTAAAATTATGACCTTTGATCGTTGTGCGAAAGCTCGAGAGAGCACTTGAAGCTGAGCGTATATCAAATCTTGAACTGCAAAAGAGGATTTCAGCACTAAAAAAGCAACCACATGCATCTGAAACATCAGAGCGGCAAGGGAGTTGA

mRNA sequence

ATGGCTCCTCCACGGCTGTTCTTCGTTTCCTCTGTTCTTCTGGTGCTTTGCTGTGCAGTTGCCGGATCCGTCTTCGATGATTCGAATCCCATTCGGATGGTCTCTGACCGTCTTCGTGAGTTGGAGTTGGAGGTCGTTCGAGTCATCGGTCAAGTTCCTCACGCTCTCCGATTCGCTCGATTCGCTCACAGGTATGGGAAGAAGTACGAGACGGCGGAGGAGATGAAAATCCGATTTGGAATTTTCTTGGAGAGTTTGGAACTGATCAAATCGACTAATAGACAAGGCCTTTCTTACAAGCTTGGTGTTAATCAATTTGCGGATTGGACGTGGGAAGAGTTCAAGAAACACAGGCTAGGAGCCGCTCAAAACTGCTCTGCCACCACAAAGGGCAACCACAAACTTACTGATGTTGTTCTTCCTGAATCGAAAGATTGGAGGGAAGATGGCATAGTTAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCTTGCTGGACATTCAGTACAACTGGAGCGCTTGAAGCGGCCTATGCACAAGCACATGGAAAGGGTATCTCCCTGTCCGAGCAGCAGCTGGTGGATTGTGCAGGTGCTTTTAACAACTTTGGCTGCAATGGTGGACTGCCTTCCCAAGCTTTTGAATACATCAAGTACAATGGTGGCCTTGACACTGAAACAGCATATCCTTACACTGGAAAAAACGGCCAATGCAAATTCTTATCTGAGAATGTTGGAGTACAAGTCATTGATTCTGTGAACATAACTCTTGGTGCTGAAGATGAATTGAAGCATGCAGTTGCTTTTGTTCGACCAGTAAGCGTAGCATTTGAGGTGGTTAGCGGTTTTCGCTTGTATTCAAAAGGAGTTTACACCAGTAACTCATGCGGCAGCACTCCTATGGATGTAAACCACGCCGTGCTTGCAGTTGGTTATGGGGTCGAAGATGGTATCCCATACTGGCTTATAAAGAACTCATGGGGAGGAAACTGGGGTGACAAGGGCTACTTCAAGATGGAGCTGGGCAAGAACATGTGTGAGTATGAGAATGGTAAAGCTTCTTGCAAGGAAAGATACTGTGGTTTTCAGGTAGGCTATGCTGCTCTAGCTATTGGAGCGCCATGGATTTTTCATCCTATAAAGCACTTGGTTGAACCATTGCTCTGCAGTTGTGATGTTGTTCTCTTGATGCTCACAGGCATCTTTCAGCAATATCTAGTATACCAAGTCCACAAAATTCGTTTGCAGGGATATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGAACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAAATCAGTGCACTTTCGATCCCCATAATTTTAAGGTTGATTATGCTAATTGAAGCGGTGTGCGCTGGATCGTTTATGATTATATATATCAGCTACATACAGAAGTACAATTCATTAAATTCTCAGCCTGATGTTCTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATCTAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAATCTTCATTTTCTGAATGAGGAGATTCTTCGGTTGCAAGAGTGCTTAAGTAAATATGAACGGTCCAGTGATGGAAGCACGCCTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGTGATCAGGAATTGAGGACACTTTCGGCTGAGATGAATCAGGTGACATCAGAACTTGGGCTTGCTCGATCGGTAATAGCTGAGAGGGATACTGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGTACACGGGCAGCAAAGCTCGAGAGAGCACTTGAAGCTGAGCGTATATCAAATCTTGAACTGCAAAAGAGGATTTCAGCACTAAAAAAGCAACCACATGCATCTGAAACATCAGA
GCGGCAAGGGAGTTGA

Coding sequence (CDS)

ATGGCTCCTCCACGGCTGTTCTTCGTTTCCTCTGTTCTTCTGGTGCTTTGCTGTGCAGTTGCCGGATCCGTCTTCGATGATTCGAATCCCATTCGGATGGTCTCTGACCGTCTTCGTGAGTTGGAGTTGGAGGTCGTTCGAGTCATCGGTCAAGTTCCTCACGCTCTCCGATTCGCTCGATTCGCTCACAGGTATGGGAAGAAGTACGAGACGGCGGAGGAGATGAAAATCCGATTTGGAATTTTCTTGGAGAGTTTGGAACTGATCAAATCGACTAATAGACAAGGCCTTTCTTACAAGCTTGGTGTTAATCAATTTGCGGATTGGACGTGGGAAGAGTTCAAGAAACACAGGCTAGGAGCCGCTCAAAACTGCTCTGCCACCACAAAGGGCAACCACAAACTTACTGATGTTGTTCTTCCTGAATCGAAAGATTGGAGGGAAGATGGCATAGTTAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCTTGCTGGACATTCAGTACAACTGGAGCGCTTGAAGCGGCCTATGCACAAGCACATGGAAAGGGTATCTCCCTGTCCGAGCAGCAGCTGGTGGATTGTGCAGGTGCTTTTAACAACTTTGGCTGCAATGGTGGACTGCCTTCCCAAGCTTTTGAATACATCAAGTACAATGGTGGCCTTGACACTGAAACAGCATATCCTTACACTGGAAAAAACGGCCAATGCAAATTCTTATCTGAGAATGTTGGAGTACAAGTCATTGATTCTGTGAACATAACTCTTGGTGCTGAAGATGAATTGAAGCATGCAGTTGCTTTTGTTCGACCAGTAAGCGTAGCATTTGAGGTGGTTAGCGGTTTTCGCTTGTATTCAAAAGGAGTTTACACCAGTAACTCATGCGGCAGCACTCCTATGGATGTAAACCACGCCGTGCTTGCAGTTGGTTATGGGGTCGAAGATGGTATCCCATACTGGCTTATAAAGAACTCATGGGGAGGAAACTGGGGTGACAAGGGCTACTTCAAGATGGAGCTGGGCAAGAACATGTGTGAGTATGAGAATGGTAAAGCTTCTTGCAAGGAAAGATACTGTGGTTTTCAGGTAGGCTATGCTGCTCTAGCTATTGGAGCGCCATGGATTTTTCATCCTATAAAGCACTTGGTTGAACCATTGCTCTGCAGTTGTGATGTTGTTCTCTTGATGCTCACAGGCATCTTTCAGCAATATCTAGTATACCAAGTCCACAAAATTCGTTTGCAGGGATATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTTGCAGTTACTGCATATGGAACTGCTGCCCTTTTACTTGTCATGGTATGGGAACCTCAAATCAGTGCACTTTCGATCCCCATAATTTTAAGGTTGATTATGCTAATTGAAGCGGTGTGCGCTGGATCGTTTATGATTATATATATCAGCTACATACAGAAGTACAATTCATTAAATTCTCAGCCTGATGTTCTGAAGTCATTGTATTCTCCACTTCAGCAATCAAGTTCTTTGGAAGATCTAAGGTATCATGATGTTGGTCGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAATCTTCATTTTCTGAATGAGGAGATTCTTCGGTTGCAAGAGTGCTTAAGTAAATATGAACGGTCCAGTGATGGAAGCACGCCTCAGGTTGATCTTGCCCATATGCTAGCTGCTCGTGATCAGGAATTGAGGACACTTTCGGCTGAGATGAATCAGGTGACATCAGAACTTGGGCTTGCTCGATCGGTAATAGCTGAGAGGGATACTGAGATTCAGAAATTACTCACCACCAACAAGCAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGTACACGGGCAGCAAAGCTCGAGAGAGCACTTGAAGCTGAGCGTATATCAAATCTTGAACTGCAAAAGAGGATTTCAGCACTAAAAAAGCAACCACATGCATCTGAAACATCAGA
GCGGCAAGGGAGTTGA

Protein sequence

MAPPRLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASCKERYCGFQVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYLVYQVHKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMIIYISYIQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELGLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLERALEAERISNLELQKRISALKKQPHASETSERQGS
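
The protein sequence above should be the translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which translation table was used). The function name `translate` is illustrative. The two excerpts are the first 24 and the last 33 nucleotides of the CDS as listed above.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption: the page does not
# state which translation table was used for this gene model).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTCCTCCACGGCTGTTCTTC"           # first 8 codons of the CDS
cds_end = "GCATCTGAAACATCAGAGCGGCAAGGGAGTTGA"    # last 10 codons plus stop

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MAPPRLFF and ASETSERQGS, which match the first and last residues of the listed protein sequence.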
Homology
BLAST of HG10007296 vs. NCBI nr
Match: KAG6428559.1 (hypothetical protein SASPL_112811 [Salvia splendens])

HSP 1 Score: 914.1 bits (2361), Expect = 7.2e-262
Identity = 474/697 (68.01%), Positives = 547/697 (78.48%), Query Frame = 0

Query: 1   MAPPRLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFAR 60
           MA   L F+  ++     A +GS   D NPIR V D L ELE  +++ +G    A+ FAR
Sbjct: 83  MARLLLLFIGVLIASATVARSGSELLD-NPIRQVVDGLHELESSILKAVGDSRRAVSFAR 142

Query: 61  FAHRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLG 120
           FAHRYGK YE++EE++ RF +F E+L +I+S NR+GLSY +GVN+F D TW+EFKKHRLG
Sbjct: 143 FAHRYGKMYESSEEIQRRFQVFSENLRMIRSHNRKGLSYSMGVNEFTDLTWDEFKKHRLG 202

Query: 121 AAQNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQA 180
           AAQNCSAT  GNHKLTD VLP   DWR+ GIVSPVK+QG CGSCWTFS+TGALEAAYAQA
Sbjct: 203 AAQNCSATRSGNHKLTDAVLPTLIDWRKSGIVSPVKNQGSCGSCWTFSSTGALEAAYAQA 262

Query: 181 HGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFL 240
            GK ISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTE AYPYTGK+G CK+ 
Sbjct: 263 FGKSISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEAAYPYTGKDGVCKYS 322

Query: 241 SENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPM 300
           SENV V+V+DSVNITLGAEDELKHAVAFVRPVSVAFEVV GF+ Y+ GVYTS +CGS PM
Sbjct: 323 SENVAVKVVDSVNITLGAEDELKHAVAFVRPVSVAFEVVDGFKAYNGGVYTSTTCGSDPM 382

Query: 301 DVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKAS--CKERY 360
           DVNHAVLAVGYGVE+G+PYWLIKNSWG +WGD GYFKME+GKNMC   +  +S    +R+
Sbjct: 383 DVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCATSDNFSSEMSMDRH 442

Query: 361 -----------------------CGFQV------GYAALAIGAPWIFHPIKHLVEPLLCS 420
                                  CG Q        YA  A+G  WI   ++ L   LLCS
Sbjct: 443 GSSYAASPEETNLFLDILQEAPLCGHQKRTRILGSYAIFALGITWILKSLEDLTVSLLCS 502

Query: 421 CDVVLLMLTGIFQQYLVYQVHKIR-----------LQGYYSFSQKLKHIVRLPFAVTAYG 480
           C+++LL++TGIF QYLVYQVHKIR           LQGYY FSQKLKHI+RLPFA  AYG
Sbjct: 503 CNILLLVVTGIFLQYLVYQVHKIRLQVIYAFDDWSLQGYYGFSQKLKHIIRLPFATIAYG 562

Query: 481 TAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMIIYISYIQKYNSLNSQPDVLKS 540
           TAA+LLVM W+  IS LSI ++LR+IML+EAVCAG FM +Y+ Y+ +YNSL+SQPD L S
Sbjct: 563 TAAMLLVMAWKHHISFLSISMLLRIIMLVEAVCAGFFMSVYVGYVHQYNSLDSQPDALNS 622

Query: 541 LYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDG 600
           LYSPLQQ+S LE LRYHD GRLSDQQMALLQYQRENLHFL+EEILRLQE LSKYER++DG
Sbjct: 623 LYSPLQQASPLEGLRYHDGGRLSDQQMALLQYQRENLHFLSEEILRLQESLSKYERTNDG 682

Query: 601 STPQVDLAHMLAARDQELRTLSAEMNQVTSELGLARSVIAERDTEIQKLLTTNKQYVEEN 656
           STPQVDLAH+LA RDQELRTLSAEMNQ+ SEL LARS+IAERD+EIQ +  TN QY+EEN
Sbjct: 683 STPQVDLAHLLATRDQELRTLSAEMNQLQSELRLARSLIAERDSEIQGVRNTNNQYIEEN 742
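
The percentages in each HSP header are the listed counts divided by the alignment length. A minimal sketch reproducing the figures for the Salvia splendens hit follows. The helper name `percent` is illustrative, and rounding to two decimals is an assumption about how the report formats its values.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the KAG6428559.1 hit.
identity = percent(474, 697)
positives = percent(547, 697)
print(identity, positives)
```

This gives 68.01 and 78.48, in agreement with the values reported for that HSP.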

BLAST of HG10007296 vs. NCBI nr
Match: KAG5559120.1 (hypothetical protein RHGRI_008890 [Rhododendron griersonianum])

HSP 1 Score: 883.6 bits (2282), Expect = 1.0e-252
Identity = 490/769 (63.72%), Positives = 551/769 (71.65%), Query Frame = 0

Query: 12  VLLVLCCAVAG-SVFDDSNPIR-MVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKY 71
           +LL +  AVAG S FD+ NPIR +VS+ LRE E  VV V+G    AL FARFAHRYGK Y
Sbjct: 13  LLLAVAVAVAGASTFDEENPIRTVVSNVLREFETSVVNVVGYSRQALSFARFAHRYGKSY 72

Query: 72  ETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATT 131
           ET EEMK+RF IF E+L+LIKS NR+GLSY + VN+FADWTWEEF + RLGAAQNCSAT 
Sbjct: 73  ETEEEMKLRFSIFSENLKLIKSHNRKGLSYTMAVNKFADWTWEEFHRLRLGAAQNCSATK 132

Query: 132 KGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSE 191
           KGNHKLTD +LPE KDWRE GIVSPVKDQGHCGSCWTFSTTGALEAAY QA GK +SLSE
Sbjct: 133 KGNHKLTDDLLPEMKDWREIGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAFGKEVSLSE 192

Query: 192 QQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVI 251
           QQLVDCAGAFNNFGC+GGLPSQAFEYIK+NGGLDTE AYPYT K+G+CKF SENVGVQV+
Sbjct: 193 QQLVDCAGAFNNFGCSGGLPSQAFEYIKHNGGLDTEEAYPYTAKDGECKFSSENVGVQVL 252

Query: 252 DSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAV 311
           +SVNITLGAEDELKHAVAFVRPVSVAFEVV+GFRLY  GVYTS+SCG+TPMDVNHAVLAV
Sbjct: 253 ESVNITLGAEDELKHAVAFVRPVSVAFEVVNGFRLYKDGVYTSDSCGTTPMDVNHAVLAV 312

Query: 312 GYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC---------EYENGKA-------- 371
           GYGVE+G+ YWL+KNSWG +WGD GYFKMELGKNMC          Y+  KA        
Sbjct: 313 GYGVENGVSYWLVKNSWGEDWGDNGYFKMELGKNMCGRFLLLILPVYKLTKAAHARLGHA 372

Query: 372 ------------------------------------------------------------ 431
                                                                       
Sbjct: 373 LQLVHHTPLLVDCCLRKRGLLSRDFWFMSSFFYLECSVSPIYKLKGHVVMLIAQAQCTNK 432

Query: 432 -------------------SCKERYCGFQVG---------------YAALAIGAPWIFHP 491
                              S + R   +Q                 YA LA  APWIFH 
Sbjct: 433 NRPSIPIVTVNWFEKLKNPSTRRRRRRWQQKDTPLRTSHLLKKTPCYAILAAAAPWIFHS 492

Query: 492 IKHLVEPLLCSCDVVLLMLTGIFQQYLVYQVHKIRLQGYYSFSQKLKHIVRLPFAVTAYG 551
           I+ L+ PLLCSC V+LL++TGIFQQYLVYQV KIRLQGYY FSQKLKHIVRLPFA TAYG
Sbjct: 493 IQPLLSPLLCSCGVILLIVTGIFQQYLVYQVQKIRLQGYYVFSQKLKHIVRLPFAATAYG 552

Query: 552 TAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMIIYISYIQKYNSLNSQPDVLKS 611
           TAA+LLVMVW+P IS LSI ++LR+IMLIE VCAG FM  YI YI +YNSL+SQPDVLKS
Sbjct: 553 TAAMLLVMVWKPHISILSISVLLRIIMLIEVVCAGFFMSAYIGYIYQYNSLDSQPDVLKS 612

Query: 612 LYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDG 668
           LYSPLQ SSSLE L            +A +    E +  L   ILRLQE LSKYERS+DG
Sbjct: 613 LYSPLQPSSSLEGL----------SMIAAVDCVLELM--LQLWILRLQESLSKYERSNDG 672

BLAST of HG10007296 vs. NCBI nr
Match: KAG5559119.1 (hypothetical protein RHGRI_008890 [Rhododendron griersonianum])

HSP 1 Score: 881.7 bits (2277), Expect = 4.0e-252
Identity = 492/801 (61.42%), Positives = 553/801 (69.04%), Query Frame = 0

Query: 12  VLLVLCCAVAG-SVFDDSNPIR-MVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKY 71
           +LL +  AVAG S FD+ NPIR +VS+ LRE E  VV V+G    AL FARFAHRYGK Y
Sbjct: 13  LLLAVAVAVAGASTFDEENPIRTVVSNVLREFETSVVNVVGYSRQALSFARFAHRYGKSY 72

Query: 72  ETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATT 131
           ET EEMK+RF IF E+L+LIKS NR+GLSY + VN+FADWTWEEF + RLGAAQNCSAT 
Sbjct: 73  ETEEEMKLRFSIFSENLKLIKSHNRKGLSYTMAVNKFADWTWEEFHRLRLGAAQNCSATK 132

Query: 132 KGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSE 191
           KGNHKLTD +LPE KDWRE GIVSPVKDQGHCGSCWTFSTTGALEAAY QA GK +SLSE
Sbjct: 133 KGNHKLTDDLLPEMKDWREIGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAFGKEVSLSE 192

Query: 192 QQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVI 251
           QQLVDCAGAFNNFGC+GGLPSQAFEYIK+NGGLDTE AYPYT K+G+CKF SENVGVQV+
Sbjct: 193 QQLVDCAGAFNNFGCSGGLPSQAFEYIKHNGGLDTEEAYPYTAKDGECKFSSENVGVQVL 252

Query: 252 DSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAV 311
           +SVNITLGAEDELKHAVAFVRPVSVAFEVV+GFRLY  GVYTS+SCG+TPMDVNHAVLAV
Sbjct: 253 ESVNITLGAEDELKHAVAFVRPVSVAFEVVNGFRLYKDGVYTSDSCGTTPMDVNHAVLAV 312

Query: 312 GYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC---------EYENGKAS------- 371
           GYGVE+G+ YWL+KNSWG +WGD GYFKMELGKNMC          Y+  KA+       
Sbjct: 313 GYGVENGVSYWLVKNSWGEDWGDNGYFKMELGKNMCGRFLLLILPVYKLTKAAHARLGHA 372

Query: 372 ------------------------------------------------------------ 431
                                                                       
Sbjct: 373 LQLVHHTPLLVDCCLRKRGLLSRDFWFMSSFFYLECSVSPIYKLKGHVVMLIAQAQCTNK 432

Query: 432 ---------------------------------------------CKERYCG-------- 491
                                                        C   YC         
Sbjct: 433 NRPSIPIVTVNWFEKLKNPSTRRRRRRWQQKDTPLRTSHLLKKTPCSLTYCTSLPYLVIG 492

Query: 492 --------------FQVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYLV 551
                         +Q GYA LA  APWIFH I+ L+ PLLCSC V+LL++TGIFQQYLV
Sbjct: 493 SLQALSEVFYTVFYWQAGYAILAAAAPWIFHSIQPLLSPLLCSCGVILLIVTGIFQQYLV 552

Query: 552 YQVHKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRLIML 611
           YQV KIRLQGYY FSQKLKHIVRLPFA TAYGTAA+LLVMVW+P IS LSI ++LR+IML
Sbjct: 553 YQVQKIRLQGYYVFSQKLKHIVRLPFAATAYGTAAMLLVMVWKPHISILSISVLLRIIML 612

Query: 612 IEAVCAGSFMIIYISYIQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMA 668
           IE VCAG FM  YI YI +YNSL+SQPDVLKSLYSPLQ SSSLE L            +A
Sbjct: 613 IEVVCAGFFMSAYIGYIYQYNSLDSQPDVLKSLYSPLQPSSSLEGL----------SMIA 672

BLAST of HG10007296 vs. NCBI nr
Match: KAF7149320.1 (hypothetical protein RHSIM_Rhsim03G0227400 [Rhododendron simsii])

HSP 1 Score: 876.3 bits (2263), Expect = 1.7e-250
Identity = 487/797 (61.10%), Positives = 551/797 (69.13%), Query Frame = 0

Query: 9   VSSVLLVLCCAVAG-SVFDDSNPIR-MVSDRLRELELEVVRVIGQVPHALRFARFAHRYG 68
           +S  LL+L  AVAG S FD+ NPIR +VS+ LRE E  VV V+G    AL FARFAHRYG
Sbjct: 8   LSLSLLLLAVAVAGASTFDEENPIRTVVSNVLREFETSVVNVVGYSRQALSFARFAHRYG 67

Query: 69  KKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCS 128
           K YET EEMK+RF IF E+L+LIKS NR+GLSY + VN+FADWTWE+F++HRLGAAQNCS
Sbjct: 68  KSYETEEEMKLRFSIFSENLKLIKSHNRKGLSYTMAVNKFADWTWEDFRRHRLGAAQNCS 127

Query: 129 ATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGIS 188
           AT KGNHKLTD +LPE KDWRE GIVSPVKDQGHCGSCWTFSTTGALEAAY QA GK +S
Sbjct: 128 ATKKGNHKLTDDLLPEMKDWREIGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAFGKEVS 187

Query: 189 LSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGV 248
           LSEQQLVDCAGAFNNFGC+GGLPSQAFEYIK+NGGLDTE AYPYT K+G+CKF SENVGV
Sbjct: 188 LSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKHNGGLDTEEAYPYTAKDGECKFSSENVGV 247

Query: 249 QVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAV 308
           QV+DSVNITLGAEDELKHAVAFVRPVSVAFEVV GFRLY  GVYTS+SCG+TPMDVNHAV
Sbjct: 248 QVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVDGFRLYKDGVYTSDSCGTTPMDVNHAV 307

Query: 309 LAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC--------------------- 368
           LAVGYGVE+G+ YWLIKNSWG +WGD GYFKME+GKNMC                     
Sbjct: 308 LAVGYGVENGVSYWLIKNSWGEDWGDNGYFKMEMGKNMCGRRCNLCIIPHCCLTAVRGKR 367

Query: 369 --------------------------------------------EYENGKASCKERYCGF 428
                                                       + +  +    ER+   
Sbjct: 368 CLLSRDFWFMSSCFYLECSVSPIYKLKGLVVILLTASKNSKTQAQQQQREEMATERHAST 427

Query: 429 QV----------------------------GYAALAIGAPWIFHPIKHLVEPLLCSCDVV 488
            V                            GYA LA  APWIFH I+ L+ PLLCSC V+
Sbjct: 428 HVPPAEENALFLDILHESPLFGHRKPTSIAGYAILAAAAPWIFHSIQSLLSPLLCSCGVI 487

Query: 489 LLMLTGIFQQYLVYQVHKIRLQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVM------- 548
           LL++T              R  GYY FSQKLKHIVRLPFA TAYG    +L++       
Sbjct: 488 LLIVTD--------DDSNSRNMGYYVFSQKLKHIVRLPFATTAYGGLTFVLMLDCTYKNC 547

Query: 549 --------------------VWEPQISALS---------IPIILRLIMLIEAVCAGSFMI 608
                                 +  +S+LS         I   LR+IMLIE VCAG FM 
Sbjct: 548 CNAACHGVEAPYQYPFHLRPAQDENVSSLSDIAAFMASQIEFTLRIIMLIEVVCAGFFMS 607

Query: 609 IYISYIQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHF 668
            YI YI +YNSL+SQPDVLKSLYSPLQ SSSLE LRYHD GRL+DQQMALLQYQRENLHF
Sbjct: 608 AYIGYIYQYNSLDSQPDVLKSLYSPLQPSSSLEGLRYHDGGRLADQQMALLQYQRENLHF 667

BLAST of HG10007296 vs. NCBI nr
Match: KAG5559117.1 (hypothetical protein RHGRI_008890 [Rhododendron griersonianum])

HSP 1 Score: 851.7 bits (2199), Expect = 4.4e-243
Identity = 492/857 (57.41%), Positives = 553/857 (64.53%), Query Frame = 0

Query: 12  VLLVLCCAVAG-SVFDDSNPIR-MVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKY 71
           +LL +  AVAG S FD+ NPIR +VS+ LRE E  VV V+G    AL FARFAHRYGK Y
Sbjct: 13  LLLAVAVAVAGASTFDEENPIRTVVSNVLREFETSVVNVVGYSRQALSFARFAHRYGKSY 72

Query: 72  ETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATT 131
           ET EEMK+RF IF E+L+LIKS NR+GLSY + VN+FADWTWEEF + RLGAAQNCSAT 
Sbjct: 73  ETEEEMKLRFSIFSENLKLIKSHNRKGLSYTMAVNKFADWTWEEFHRLRLGAAQNCSATK 132

Query: 132 KGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSE 191
           KGNHKLTD +LPE KDWRE GIVSPVKDQGHCGSCWTFSTTGALEAAY QA GK +SLSE
Sbjct: 133 KGNHKLTDDLLPEMKDWREIGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAFGKEVSLSE 192

Query: 192 QQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVI 251
           QQLVDCAGAFNNFGC+GGLPSQAFEYIK+NGGLDTE AYPYT K+G+CKF SENVGVQV+
Sbjct: 193 QQLVDCAGAFNNFGCSGGLPSQAFEYIKHNGGLDTEEAYPYTAKDGECKFSSENVGVQVL 252

Query: 252 DSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAV 311
           +SVNITLGAEDELKHAVAFVRPVSVAFEVV+GFRLY  GVYTS+SCG+TPMDVNHAVLAV
Sbjct: 253 ESVNITLGAEDELKHAVAFVRPVSVAFEVVNGFRLYKDGVYTSDSCGTTPMDVNHAVLAV 312

Query: 312 GYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC---------EYENGKAS------- 371
           GYGVE+G+ YWL+KNSWG +WGD GYFKMELGKNMC          Y+  KA+       
Sbjct: 313 GYGVENGVSYWLVKNSWGEDWGDNGYFKMELGKNMCGRFLLLILPVYKLTKAAHARLGHA 372

Query: 372 ------------------------------------------------------------ 431
                                                                       
Sbjct: 373 LQLVHHTPLLVDCCLRKRGLLSRDFWFMSSFFYLECSVSPIYKLKGHVVMLIAQAQCTNK 432

Query: 432 ---------------------------------------------CKERYCG-------- 491
                                                        C   YC         
Sbjct: 433 NRPSIPIVTVNWFEKLKNPSTRRRRRRWQQKDTPLRTSHLLKKTPCSLTYCTSLPYLVIG 492

Query: 492 --------------FQVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYLV 551
                         +Q GYA LA  APWIFH I+ L+ PLLCSC V+LL++TGIFQQYLV
Sbjct: 493 SLQALSEVFYTVFYWQAGYAILAAAAPWIFHSIQPLLSPLLCSCGVILLIVTGIFQQYLV 552

Query: 552 YQVHKIR-------------------------------------------------LQGY 611
           YQV KIR                                                 LQGY
Sbjct: 553 YQVQKIRLQWPVRYLMVVMHPLEIVTLFIIFAFGFFVLLLESLILWVYVCLIGLDILQGY 612

Query: 612 YSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMI 668
           Y FSQKLKHIVRLPFA TAYGTAA+LLVMVW+P IS LSI ++LR+IMLIE VCAG FM 
Sbjct: 613 YVFSQKLKHIVRLPFAATAYGTAAMLLVMVWKPHISILSISVLLRIIMLIEVVCAGFFMS 672

BLAST of HG10007296 vs. ExPASy Swiss-Prot
Match: Q8RWQ9 (Thiol protease aleurain-like OS=Arabidopsis thaliana OX=3702 GN=At3g45310 PE=2 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 8.5e-157
Identity = 261/352 (74.15%), Positives = 303/352 (86.08%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAGSV--FDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFA 64
           +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++GQ  H L F+RF 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 65  HRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAA 124
           HRYGKKY++ EEMK+RF +F E+L+LI+STN++GLSYKL +NQFAD TW+EF++++LGAA
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 125 QNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHG 184
           QNCSAT KG+HK+T+  +P++KDWREDGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G
Sbjct: 124 QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 185 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSE 244
           KGISLSEQQLVDCAG FNNFGC+GGLPSQAFEYIKYNGGLDTE AYPYTGK+G CKF ++
Sbjct: 184 KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 245 NVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDV 304
           N+GVQV DSVNITLGAEDELKHAV  VRPVSVAFEVV  FR Y KGV+TSN+CG+TPMDV
Sbjct: 244 NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 305 NHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           NHAVLAVGYGVED +PYWLIKNSWGG WGD GYFKME+GKNMC    G A+C
Sbjct: 304 NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC----GVATC 351

BLAST of HG10007296 vs. ExPASy Swiss-Prot
Match: Q8H166 (Thiol protease aleurain OS=Arabidopsis thaliana OX=3702 GN=ALEU PE=1 SV=2)

HSP 1 Score: 554.7 bits (1428), Expect = 1.5e-156
Identity = 263/343 (76.68%), Positives = 296/343 (86.30%), Query Frame = 0

Query: 12  VLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYET 71
           V+LV   A A   FD+SNPIRMVSD LRE+E  V +++GQ  H L FARF HRYGKKY+ 
Sbjct: 13  VVLVAASAAANIGFDESNPIRMVSDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQN 72

Query: 72  AEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKG 131
            EEMK+RF IF E+L+LI+STN++GLSYKLGVNQFAD TW+EF++ +LGAAQNCSAT KG
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKG 132

Query: 132 NHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQ 191
           +HK+T+  LPE+KDWREDGIVSPVKDQG CGSCWTFSTTGALEAAY QA GKGISLSEQQ
Sbjct: 133 SHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQ 192

Query: 192 LVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDS 251
           LVDCAGAFNN+GCNGGLPSQAFEYIK NGGLDTE AYPYTGK+  CKF +ENVGVQV++S
Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNS 252

Query: 252 VNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 311
           VNITLGAEDELKHAV  VRPVS+AFEV+  FRLY  GVYT + CGSTPMDVNHAVLAVGY
Sbjct: 253 VNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312

Query: 312 GVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           GVEDG+PYWLIKNSWG +WGDKGYFKME+GKNMC    G A+C
Sbjct: 313 GVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMC----GIATC 351

BLAST of HG10007296 vs. ExPASy Swiss-Prot
Match: A0A072UTP9 (Pro-cathepsin H OS=Medicago truncatula OX=3880 GN=CP PE=1 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 7.2e-156
Identity = 264/347 (76.08%), Positives = 298/347 (85.88%), Query Frame = 0

Query: 11  SVLLVLCC---AVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGK 70
           ++L+V  C   A AG  F DSNPIRMVSD    +E ++++VIG+  HA+ FARFA+RYGK
Sbjct: 5   TLLIVFFCVATAAAGLSFHDSNPIRMVSD----MEEQLLQVIGESRHAVSFARFANRYGK 64

Query: 71  KYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSA 130
           +Y+T +EMK RF IF E+L+LIKSTN++ L Y LGVN FADWTWEEF+ HRLGAAQNCSA
Sbjct: 65  RYDTVDEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSA 124

Query: 131 TTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISL 190
           T KGNH++TDVVLP  KDWR++GIVS VKDQGHCGSCWTFSTTGALE+AYAQA GK ISL
Sbjct: 125 TLKGNHRITDVVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNISL 184

Query: 191 SEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQ 250
           SEQQLVDCAGA+NNFGCNGGLPSQAFEYIKYNGGL+TE AYPYTG+NG CKF SENV VQ
Sbjct: 185 SEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQNGLCKFTSENVAVQ 244

Query: 251 VIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVL 310
           V+ SVNITLGAEDELKHAVAF RPVSVAF+VV  FRLY KGVYTS +CGSTPMDVNHAVL
Sbjct: 245 VLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTSTTCGSTPMDVNHAVL 304

Query: 311 AVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           AVGYG+EDG+PYWLIKNSWGG WGD GYFKME+GKNMC    G A+C
Sbjct: 305 AVGYGIEDGVPYWLIKNSWGGEWGDHGYFKMEMGKNMC----GVATC 343

BLAST of HG10007296 vs. ExPASy Swiss-Prot
Match: Q10717 (Cysteine proteinase 2 OS=Zea mays OX=4577 GN=CCP2 PE=2 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 4.2e-148
Identity = 259/357 (72.55%), Positives = 293/357 (82.07%), Query Frame = 0

Query: 1   MAPPRLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRE-LELEVVRVIGQVPHALRFA 60
           M P RLF ++ V+L    AV  S F DSNPIR V+DR    LE  V   +G+   ALRFA
Sbjct: 1   MVPRRLFVLAVVVLADTAAVVNSGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFA 60

Query: 61  RFAHRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRL 120
           RFA RYGK YE+A E+  RF IF ESL+L++STNR+GLSY+LG+N+FAD +WEEF+  RL
Sbjct: 61  RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRATRL 120

Query: 121 GAAQNCSATTKGNHKL--TDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAY 180
           GAAQNCSAT  GNH++    V LPE+KDWREDGIVSPVK+QGHCGSCWTFSTTGALEAAY
Sbjct: 121 GAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAY 180

Query: 181 AQAHGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQC 240
            QA GK ISLSEQQLVDC  AFNNFGCNGGLPSQAFEYIKYNGGLDTE +YPY G NG C
Sbjct: 181 TQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGIC 240

Query: 241 KFLSENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGS 300
           KF +ENVGV+V+DSVNITLGAEDELK AV  VRPVSVAFEV++GFRLY  GVYTS+ CG+
Sbjct: 241 KFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGT 300

Query: 301 TPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           TPMDVNHAVLAVGYGVEDG+PYWLIKNSWG +WGD+GYFKME+GKNMC    G A+C
Sbjct: 301 TPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMC----GVATC 353

BLAST of HG10007296 vs. ExPASy Swiss-Prot
Match: Q40143 (Cysteine proteinase 3 OS=Solanum lycopersicum OX=4081 GN=CYP-3 PE=2 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 8.3e-144
Identity = 249/337 (73.89%), Positives = 285/337 (84.57%), Query Frame = 0

Query: 19  AVAG-SVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYETAEEMKI 78
           A+AG + F D NPIR V     ELE  +++V+GQ   AL FARFA R+ K+Y++ EE+K 
Sbjct: 18  ALAGPATFADKNPIRQVVFP-DELENGILQVVGQTRSALSFARFAIRHRKRYDSVEEIKQ 77

Query: 79  RFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKGNHKLTD 138
           RF IFL++L++I+S NR+GLSYKLG+N+F D TW+EF+KH+LGA+QNCSATTKGN KLT+
Sbjct: 78  RFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRKHKLGASQNCSATTKGNLKLTN 137

Query: 139 VVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCAG 198
           VVLPE+KDWR+DGIVSPVK QG CGSCWTFSTTGALEAAYAQA GKGISLSEQQLVDCAG
Sbjct: 138 VVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAG 197

Query: 199 AFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDSVNITLG 258
           AFNNFGCNGGLPSQAFEYIK+NGGLDTE AYPYTGKNG CKF   N+GV+VI SVNITLG
Sbjct: 198 AFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKFSQANIGVKVISSVNITLG 257

Query: 259 AEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGYGVEDGI 318
           AE ELK+AVA VRPVSVAFEVV GF+ Y  GVY S  CG TPMDVNHAVLAVGYGVE+G 
Sbjct: 258 AEYELKYAVALVRPVSVAFEVVKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVENGT 317

Query: 319 PYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           PYWLIKNSWG +WG+ GYFKME+GKNMC    G A+C
Sbjct: 318 PYWLIKNSWGADWGEDGYFKMEMGKNMC----GVATC 349

BLAST of HG10007296 vs. ExPASy TrEMBL
Match: A0A4D9BI93 (Uncharacterized protein OS=Salvia splendens OX=180675 GN=Saspl_016945 PE=3 SV=1)

HSP 1 Score: 922.9 bits (2384), Expect = 7.5e-265
Identity = 475/678 (70.06%), Positives = 545/678 (80.38%), Query Frame = 0

Query: 1   MAPPRLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFAR 60
           MA   L F+  ++     A +GS   D NPIR V D L ELE  +++ +G    A+ FAR
Sbjct: 1   MARLLLLFIGVLIASATVARSGSELLD-NPIRQVVDGLHELESSILKAVGDSRRAVSFAR 60

Query: 61  FAHRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLG 120
           FAHRYGK YE++EE++ RF +F E+L +I+S NR+GLSY +GVN+F D TW+EFKKHRLG
Sbjct: 61  FAHRYGKMYESSEEIQRRFQVFSENLRMIRSHNRKGLSYSMGVNEFTDLTWDEFKKHRLG 120

Query: 121 AAQNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQA 180
           AAQNCSAT  GNHKLTD VLP   DWR+ GIVSPVK+QG CGSCWTFS+TGALEAAYAQA
Sbjct: 121 AAQNCSATRSGNHKLTDAVLPTLIDWRKSGIVSPVKNQGSCGSCWTFSSTGALEAAYAQA 180

Query: 181 HGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFL 240
            GK ISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTE AYPYTGK+G CK+ 
Sbjct: 181 FGKSISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEAAYPYTGKDGVCKYS 240

Query: 241 SENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPM 300
           SENV V+V+DSVNITLGAEDELKHAVAFVRPVSVAFEVV GF+ Y+ GVYTS +CGS PM
Sbjct: 241 SENVAVKVVDSVNITLGAEDELKHAVAFVRPVSVAFEVVDGFKAYNGGVYTSTTCGSDPM 300

Query: 301 DVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASCK----- 360
           DVNHAVLAVGYGVE+G+PYWLIKNSWG +WGD GYFKME+GKNMC        CK     
Sbjct: 301 DVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMC---GSLTYCKRLPYV 360

Query: 361 -----ERYCG--FQVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYLVYQ 420
                + Y G  F  GYA  A+G  WI   ++ L   LLCSC+++LL++TGIF QYLVYQ
Sbjct: 361 ATRSEQGYLGVSFIAGYAIFALGITWILKSLEDLTVSLLCSCNILLLVVTGIFLQYLVYQ 420

Query: 421 VHKIR-----------LQGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSI 480
           VHKIR           LQGYY FSQKLKHI+RLPFA  AYGTAA+LLVM W+  IS LSI
Sbjct: 421 VHKIRLQVIYAFDDWSLQGYYGFSQKLKHIIRLPFATIAYGTAAMLLVMAWKHHISFLSI 480

Query: 481 PIILRLIMLIEAVCAGSFMIIYISYIQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDV 540
            ++LR+IML+EAVCAG FM +Y+ Y+ +YNSL+SQPD L SLYSPLQQ+S LE LRYHD 
Sbjct: 481 SMLLRIIMLVEAVCAGFFMSVYVGYVHQYNSLDSQPDALNSLYSPLQQASPLEGLRYHDG 540

Query: 541 GRLSDQQMALLQYQRENLHFLNEEILRLQECLSKYERSSDGSTPQVDLAHMLAARDQELR 600
           GRLSDQQMALLQYQRENLHFL+EEILRLQE LSKYER++DGSTPQVDLAH+LA RDQELR
Sbjct: 541 GRLSDQQMALLQYQRENLHFLSEEILRLQESLSKYERTNDGSTPQVDLAHLLATRDQELR 600

Query: 601 TLSAEMNQVTSELGLARSVIAERDTEIQKLLTTNKQYVEENERLRAILGEWSTRAAKLER 656
           TLSAEMNQ+ SEL LARS+IAERD+EIQ +  TN QY+EENERLRAILGEWS RAAKLER
Sbjct: 601 TLSAEMNQLQSELRLARSLIAERDSEIQGVRNTNNQYIEENERLRAILGEWSARAAKLER 660

BLAST of HG10007296 vs. ExPASy TrEMBL
Match: A0A4D9A883 (Uncharacterized protein OS=Salvia splendens OX=180675 GN=Saspl_019390 PE=3 SV=1)

HSP 1 Score: 863.6 bits (2230), Expect = 5.4e-247
Identity = 449/647 (69.40%), Positives = 512/647 (79.13%), Query Frame = 0

Query: 13  LLVLCCAVA--GSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYE 72
           LL+   AVA  GS   D NPIR V D L ELE  +++ +G  P A+ FARFAHRYGK YE
Sbjct: 49  LLIASAAVARSGSELLD-NPIRQVVDGLHELESSILKAVGDSPRAVSFARFAHRYGKMYE 108

Query: 73  TAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTK 132
           ++EE++ RF +F E+L +I+S NR+GLSY +GVN+F D TW+EFKKHRLGAAQNCSAT  
Sbjct: 109 SSEEIQRRFQVFSENLRMIRSHNRKGLSYSMGVNEFTDLTWDEFKKHRLGAAQNCSATRS 168

Query: 133 GNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQ 192
           GNHKLTD VLP   DWR+ GIVSPVK+QG CGSCWTFS+TGALEAAYAQA GK ISLSEQ
Sbjct: 169 GNHKLTDAVLPTLIDWRKSGIVSPVKNQGSCGSCWTFSSTGALEAAYAQAFGKSISLSEQ 228

Query: 193 QLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVID 252
           QLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTE AYPYTGK+G CK+ SENV V+V+D
Sbjct: 229 QLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEAAYPYTGKDGVCKYSSENVAVKVVD 288

Query: 253 SVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVG 312
           SVNITLGAEDELKHAVAFVRPVSVAFEVV GF+ Y+ GVYTS +CGS PMDVNHAVLAVG
Sbjct: 289 SVNITLGAEDELKHAVAFVRPVSVAFEVVDGFKAYNGGVYTSTTCGSDPMDVNHAVLAVG 348

Query: 313 YGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASCKERYCGFQVGYAALAI 372
           YGVE+G+PYWLIKNSWG +WGD GYFKME+GKNMC                         
Sbjct: 349 YGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMC------------------------- 408

Query: 373 GAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYL--VYQVHKIRLQGYYSFSQKLKHIV 432
                           L  C  +  ++TG  Q+YL  +Y      LQGYY FSQKLK I+
Sbjct: 409 --------------GSLTYCKRLPYVVTGSEQRYLGAIYAFDDWSLQGYYGFSQKLKRII 468

Query: 433 RLPFAVTAYGTAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMIIYISYIQKYNS 492
           RLPFA  AYGTAA+LLVM W+  IS LSI ++LR+IML+EAVCAG FM +Y+ Y+ +YNS
Sbjct: 469 RLPFATIAYGTAAMLLVMAWKHHISFLSISMLLRIIMLVEAVCAGFFMSVYVGYVHQYNS 528

Query: 493 LNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEILRLQEC 552
           L+SQPDVL SLYSPLQQ+S LE LRYHD GRLSDQQMALLQYQRENLHFL+EEILRLQE 
Sbjct: 529 LDSQPDVLNSLYSPLQQASPLEGLRYHDGGRLSDQQMALLQYQRENLHFLSEEILRLQES 588

Query: 553 LSKYERSSDGSTPQVDLAHMLAARDQELRTLSAEMNQVTSELGLARSVIAERDTEIQKLL 612
           LSKYER++DGSTPQVDLAH+LA RDQELRTLSAEMNQ+ SEL LARS+IAERD+E+Q + 
Sbjct: 589 LSKYERTNDGSTPQVDLAHLLATRDQELRTLSAEMNQLQSELRLARSLIAERDSEVQGVR 648

Query: 613 TTNKQYVEENERLRAILGEWSTRAAKLERALEAERISNLELQKRISA 656
            TN QY+EENERLRAILGEWS RAAKLERALEA  +SNLELQK+IS+
Sbjct: 649 NTNNQYIEENERLRAILGEWSARAAKLERALEAANMSNLELQKKISS 655

BLAST of HG10007296 vs. ExPASy TrEMBL
Match: A0A7J7DQ01 (Thiol protease aleurain-like OS=Tripterygium wilfordii OX=458696 GN=HS088_TW04G00375 PE=3 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 3.7e-195
Identity = 358/561 (63.81%), Positives = 403/561 (71.84%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAG---SVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARF 64
           R  F S++LL+LC A A    S F DSNPIRMVS+ L ELE  V++VIG   HAL FARF
Sbjct: 3   RFPFASAILLLLCSAAAAGGVSSFLDSNPIRMVSEGLGELEASVLKVIGNTHHALFFARF 62

Query: 65  AHRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGA 124
           AHR+GKKYE   EMK+RF IF E+++ I++TNR+ L YKL VN+FAD TWEEF++HRLGA
Sbjct: 63  AHRFGKKYEGDNEMKLRFAIFSENVDFIRATNRKRLPYKLAVNKFADLTWEEFRRHRLGA 122

Query: 125 AQNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAH 184
           AQNCSAT KGNHKLTDV+LPE KDWRE+GIVSPVKDQG+CGSCWTFS+TGALEAAY QA 
Sbjct: 123 AQNCSATLKGNHKLTDVILPEKKDWREEGIVSPVKDQGNCGSCWTFSSTGALEAAYKQAF 182

Query: 185 GKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLS 244
           G GISLSEQQLVDCAGAF+NFGC+GGLPS AFEYIKYNGGL+TE AYPYTG+NG CKF S
Sbjct: 183 GTGISLSEQQLVDCAGAFDNFGCDGGLPSHAFEYIKYNGGLETEKAYPYTGQNGLCKFSS 242

Query: 245 ENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMD 304
           ENV VQV+ SVNITLGAEDELKHAV  VRPVSVAF+    F  Y +GV+TSN+CGSTPMD
Sbjct: 243 ENVAVQVLSSVNITLGAEDELKHAVGLVRPVSVAFQANKEFSFYKRGVFTSNTCGSTPMD 302

Query: 305 VNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASCKERYCGF 364
           VNHAV+AVGYGVEDG+PYW+IKNSWG  WG+ GYFKME+GKNMC                
Sbjct: 303 VNHAVVAVGYGVEDGVPYWIIKNSWGAEWGENGYFKMEMGKNMC---------------- 362

Query: 365 QVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTGIFQQYLVYQVHKIRLQGYYSFS 424
                                                GIFQQ+LVYQV KIRLQGYYSFS
Sbjct: 363 -------------------------------------GIFQQHLVYQVEKIRLQGYYSFS 422

Query: 425 QKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPIILRLIMLIEAVCAGSFMIIYIS 484
           QKL HIVRLPFA+TAYG                                           
Sbjct: 423 QKLNHIVRLPFAITAYG------------------------------------------- 467

Query: 485 YIQKYNSLNSQPDVLKSLYSPLQQSSSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEE 544
           Y+ +YNSL+SQPDVLKSLYSPLQ SSSLE LRY+D GRLSDQQ+ LLQ QR+NLHFL+E 
Sbjct: 483 YVHQYNSLDSQPDVLKSLYSPLQPSSSLEVLRYNDGGRLSDQQIGLLQCQRKNLHFLSEN 467

Query: 545 ILRLQECLSKYERSSDGSTPQ 563
           IL+LQECLSKYERS DGSTPQ
Sbjct: 543 ILQLQECLSKYERSDDGSTPQ 467

BLAST of HG10007296 vs. ExPASy TrEMBL
Match: A0A5A7UIP1 (Thiol protease aleurain-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold181G001590 PE=3 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 8.5e-192
Identity = 328/350 (93.71%), Positives = 339/350 (96.86%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHR 64
           RLFFVSSVLL+L CAVAGSVFDDSNPIRMVSDRLRELELEVVRV+GQVPHALRFARFAHR
Sbjct: 4   RLFFVSSVLLLLSCAVAGSVFDDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFAHR 63

Query: 65  YGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQN 124
           YGKKYETAEEMK RFGIFLESLELIKSTN+QGLSYKLGVNQFADWTWEEFKKHRLGAAQN
Sbjct: 64  YGKKYETAEEMKRRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFKKHRLGAAQN 123

Query: 125 CSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG 184
           CSATTKG+HKLTD V PESKDWR+DGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG
Sbjct: 124 CSATTKGSHKLTDAVPPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG 183

Query: 185 ISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENV 244
           +SLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTE AYPYTGK+G+CKF+SENV
Sbjct: 184 VSLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEAAYPYTGKDGKCKFVSENV 243

Query: 245 GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH 304
           GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH
Sbjct: 244 GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH 303

Query: 305 AVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           AVLAVGYGVEDGIPYWLIKNSWGGNWGD GYFKME+GKNMC    G A+C
Sbjct: 304 AVLAVGYGVEDGIPYWLIKNSWGGNWGDDGYFKMEMGKNMC----GVATC 349

BLAST of HG10007296 vs. ExPASy TrEMBL
Match: A0A1S3BPS0 (thiol protease aleurain-like OS=Cucumis melo OX=3656 GN=LOC103492370 PE=3 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 8.5e-192
Identity = 328/350 (93.71%), Positives = 339/350 (96.86%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHR 64
           RLFFVSSVLL+L CAVAGSVFDDSNPIRMVSDRLRELELEVVRV+GQVPHALRFARFAHR
Sbjct: 4   RLFFVSSVLLLLSCAVAGSVFDDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFAHR 63

Query: 65  YGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQN 124
           YGKKYETAEEMK RFGIFLESLELIKSTN+QGLSYKLGVNQFADWTWEEFKKHRLGAAQN
Sbjct: 64  YGKKYETAEEMKRRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFKKHRLGAAQN 123

Query: 125 CSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG 184
           CSATTKG+HKLTD V PESKDWR+DGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG
Sbjct: 124 CSATTKGSHKLTDAVPPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKG 183

Query: 185 ISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENV 244
           +SLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTE AYPYTGK+G+CKF+SENV
Sbjct: 184 VSLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTEAAYPYTGKDGKCKFVSENV 243

Query: 245 GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH 304
           GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH
Sbjct: 244 GVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNH 303

Query: 305 AVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           AVLAVGYGVEDGIPYWLIKNSWGGNWGD GYFKME+GKNMC    G A+C
Sbjct: 304 AVLAVGYGVEDGIPYWLIKNSWGGNWGDDGYFKMEMGKNMC----GVATC 349

BLAST of HG10007296 vs. TAIR 10
Match: AT3G45310.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 555.4 bits (1430), Expect = 6.1e-158
Identity = 261/352 (74.15%), Positives = 303/352 (86.08%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAGSV--FDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFA 64
           +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++GQ  H L F+RF 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 65  HRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAA 124
           HRYGKKY++ EEMK+RF +F E+L+LI+STN++GLSYKL +NQFAD TW+EF++++LGAA
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 125 QNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHG 184
           QNCSAT KG+HK+T+  +P++KDWREDGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G
Sbjct: 124 QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 185 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSE 244
           KGISLSEQQLVDCAG FNNFGC+GGLPSQAFEYIKYNGGLDTE AYPYTGK+G CKF ++
Sbjct: 184 KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 245 NVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDV 304
           N+GVQV DSVNITLGAEDELKHAV  VRPVSVAFEVV  FR Y KGV+TSN+CG+TPMDV
Sbjct: 244 NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 305 NHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           NHAVLAVGYGVED +PYWLIKNSWGG WGD GYFKME+GKNMC    G A+C
Sbjct: 304 NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC----GVATC 351

BLAST of HG10007296 vs. TAIR 10
Match: AT5G60360.1 (aleurain-like protease )

HSP 1 Score: 554.7 bits (1428), Expect = 1.0e-157
Identity = 263/343 (76.68%), Positives = 296/343 (86.30%), Query Frame = 0

Query: 12  VLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYET 71
           V+LV   A A   FD+SNPIRMVSD LRE+E  V +++GQ  H L FARF HRYGKKY+ 
Sbjct: 13  VVLVAASAAANIGFDESNPIRMVSDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQN 72

Query: 72  AEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKG 131
            EEMK+RF IF E+L+LI+STN++GLSYKLGVNQFAD TW+EF++ +LGAAQNCSAT KG
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKG 132

Query: 132 NHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQ 191
           +HK+T+  LPE+KDWREDGIVSPVKDQG CGSCWTFSTTGALEAAY QA GKGISLSEQQ
Sbjct: 133 SHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQ 192

Query: 192 LVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDS 251
           LVDCAGAFNN+GCNGGLPSQAFEYIK NGGLDTE AYPYTGK+  CKF +ENVGVQV++S
Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNS 252

Query: 252 VNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 311
           VNITLGAEDELKHAV  VRPVS+AFEV+  FRLY  GVYT + CGSTPMDVNHAVLAVGY
Sbjct: 253 VNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312

Query: 312 GVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMCEYENGKASC 355
           GVEDG+PYWLIKNSWG +WGDKGYFKME+GKNMC    G A+C
Sbjct: 313 GVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMC----GIATC 351

BLAST of HG10007296 vs. TAIR 10
Match: AT3G45310.2 (Cysteine proteinases superfamily protein )

HSP 1 Score: 554.3 bits (1427), Expect = 1.3e-157
Identity = 258/343 (75.22%), Positives = 299/343 (87.17%), Query Frame = 0

Query: 5   RLFFVSSVLLVLCCAVAGSV--FDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFA 64
           +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++GQ  H L F+RF 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 65  HRYGKKYETAEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAA 124
           HRYGKKY++ EEMK+RF +F E+L+LI+STN++GLSYKL +NQFAD TW+EF++++LGAA
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 125 QNCSATTKGNHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHG 184
           QNCSAT KG+HK+T+  +P++KDWREDGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G
Sbjct: 124 QNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFG 183

Query: 185 KGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSE 244
           KGISLSEQQLVDCAG FNNFGC+GGLPSQAFEYIKYNGGLDTE AYPYTGK+G CKF ++
Sbjct: 184 KGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSAK 243

Query: 245 NVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDV 304
           N+GVQV DSVNITLGAEDELKHAV  VRPVSVAFEVV  FR Y KGV+TSN+CG+TPMDV
Sbjct: 244 NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDV 303

Query: 305 NHAVLAVGYGVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC 346
           NHAVLAVGYGVED +PYWLIKNSWGG WGD GYFKME+GKNMC
Sbjct: 304 NHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMC 346

BLAST of HG10007296 vs. TAIR 10
Match: AT5G60360.2 (aleurain-like protease )

HSP 1 Score: 554.3 bits (1427), Expect = 1.3e-157
Identity = 260/334 (77.84%), Positives = 292/334 (87.43%), Query Frame = 0

Query: 12  VLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYET 71
           V+LV   A A   FD+SNPIRMVSD LRE+E  V +++GQ  H L FARF HRYGKKY+ 
Sbjct: 13  VVLVAASAAANIGFDESNPIRMVSDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQN 72

Query: 72  AEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKG 131
            EEMK+RF IF E+L+LI+STN++GLSYKLGVNQFAD TW+EF++ +LGAAQNCSAT KG
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKG 132

Query: 132 NHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQ 191
           +HK+T+  LPE+KDWREDGIVSPVKDQG CGSCWTFSTTGALEAAY QA GKGISLSEQQ
Sbjct: 133 SHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQ 192

Query: 192 LVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDS 251
           LVDCAGAFNN+GCNGGLPSQAFEYIK NGGLDTE AYPYTGK+  CKF +ENVGVQV++S
Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNS 252

Query: 252 VNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 311
           VNITLGAEDELKHAV  VRPVS+AFEV+  FRLY  GVYT + CGSTPMDVNHAVLAVGY
Sbjct: 253 VNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312

Query: 312 GVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC 346
           GVEDG+PYWLIKNSWG +WGDKGYFKME+GKNMC
Sbjct: 313 GVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMC 346

BLAST of HG10007296 vs. TAIR 10
Match: AT5G60360.3 (aleurain-like protease )

HSP 1 Score: 554.3 bits (1427), Expect = 1.3e-157
Identity = 260/334 (77.84%), Positives = 292/334 (87.43%), Query Frame = 0

Query: 12  VLLVLCCAVAGSVFDDSNPIRMVSDRLRELELEVVRVIGQVPHALRFARFAHRYGKKYET 71
           V+LV   A A   FD+SNPIRMVSD LRE+E  V +++GQ  H L FARF HRYGKKY+ 
Sbjct: 13  VVLVAASAAANIGFDESNPIRMVSDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQN 72

Query: 72  AEEMKIRFGIFLESLELIKSTNRQGLSYKLGVNQFADWTWEEFKKHRLGAAQNCSATTKG 131
            EEMK+RF IF E+L+LI+STN++GLSYKLGVNQFAD TW+EF++ +LGAAQNCSAT KG
Sbjct: 73  VEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATLKG 132

Query: 132 NHKLTDVVLPESKDWREDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQ 191
           +HK+T+  LPE+KDWREDGIVSPVKDQG CGSCWTFSTTGALEAAY QA GKGISLSEQQ
Sbjct: 133 SHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQ 192

Query: 192 LVDCAGAFNNFGCNGGLPSQAFEYIKYNGGLDTETAYPYTGKNGQCKFLSENVGVQVIDS 251
           LVDCAGAFNN+GCNGGLPSQAFEYIK NGGLDTE AYPYTGK+  CKF +ENVGVQV++S
Sbjct: 193 LVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNS 252

Query: 252 VNITLGAEDELKHAVAFVRPVSVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 311
           VNITLGAEDELKHAV  VRPVS+AFEV+  FRLY  GVYT + CGSTPMDVNHAVLAVGY
Sbjct: 253 VNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312

Query: 312 GVEDGIPYWLIKNSWGGNWGDKGYFKMELGKNMC 346
           GVEDG+PYWLIKNSWG +WGDKGYFKME+GKNMC
Sbjct: 313 GVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMC 346

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG6428559.1 | 7.2e-262 | 68.01 | hypothetical protein SASPL_112811 [Salvia splendens]
KAG5559120.1 | 1.0e-252 | 63.72 | hypothetical protein RHGRI_008890 [Rhododendron griersonianum]
KAG5559119.1 | 4.0e-252 | 61.42 | hypothetical protein RHGRI_008890 [Rhododendron griersonianum]
KAF7149320.1 | 1.7e-250 | 61.10 | hypothetical protein RHSIM_Rhsim03G0227400 [Rhododendron simsii]
KAG5559117.1 | 4.4e-243 | 57.41 | hypothetical protein RHGRI_008890 [Rhododendron griersonianum]

Match Name | E-value | Identity | Description
Q8RWQ9 | 8.5e-157 | 74.15 | Thiol protease aleurain-like OS=Arabidopsis thaliana OX=3702 GN=At3g45310 PE=2 S...
Q8H166 | 1.5e-156 | 76.68 | Thiol protease aleurain OS=Arabidopsis thaliana OX=3702 GN=ALEU PE=1 SV=2
A0A072UTP9 | 7.2e-156 | 76.08 | Pro-cathepsin H OS=Medicago truncatula OX=3880 GN=CP PE=1 SV=1
Q10717 | 4.2e-148 | 72.55 | Cysteine proteinase 2 OS=Zea mays OX=4577 GN=CCP2 PE=2 SV=1
Q40143 | 8.3e-144 | 73.89 | Cysteine proteinase 3 OS=Solanum lycopersicum OX=4081 GN=CYP-3 PE=2 SV=1

Match Name | E-value | Identity | Description
A0A4D9BI93 | 7.5e-265 | 70.06 | Uncharacterized protein OS=Salvia splendens OX=180675 GN=Saspl_016945 PE=3 SV=1
A0A4D9A883 | 5.4e-247 | 69.40 | Uncharacterized protein OS=Salvia splendens OX=180675 GN=Saspl_019390 PE=3 SV=1
A0A7J7DQ01 | 3.7e-195 | 63.81 | Thiol protease aleurain-like OS=Tripterygium wilfordii OX=458696 GN=HS088_TW04G0...
A0A5A7UIP1 | 8.5e-192 | 93.71 | Thiol protease aleurain-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3BPS0 | 8.5e-192 | 93.71 | thiol protease aleurain-like OS=Cucumis melo OX=3656 GN=LOC103492370 PE=3 SV=1

Match Name | E-value | Identity | Description
AT3G45310.1 | 6.1e-158 | 74.15 | Cysteine proteinases superfamily protein
AT5G60360.1 | 1.0e-157 | 76.68 | aleurain-like protease
AT3G45310.2 | 1.3e-157 | 75.22 | Cysteine proteinases superfamily protein
AT5G60360.2 | 1.3e-157 | 77.84 | aleurain-like protease
AT5G60360.3 | 1.3e-157 | 77.84 | aleurain-like protease
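
The identity and positives percentages reported for each HSP follow directly from the match counts and the alignment length (for example, 261/352 gives 74.15%). The sketch below is an illustration only and not part of the database record: it parses one pair of HSP statistics lines in the layout used on this page and recomputes both percentages. The function name `parse_hsp` and the regular expression are assumptions made for this example; the label before the positives count is matched loosely so that spelling variants are tolerated.

```python
import re

# Pattern for the two statistics lines that precede each alignment.
# "Pos\w*" accepts spelling variants of the positives label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w*\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP statistics found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        # Percentages are counts divided by the alignment length.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Statistics of the Q8RWQ9 hit, copied from the alignment section.
example = (
    "HSP 1 Score: 555.4 bits (1430), Expect = 8.5e-157\n"
    "Identity = 261/352 (74.15%), Positives = 303/352 (86.08%), Query Frame = 0"
)
stats = parse_hsp(example)
print(stats)
```

The recomputed values can be compared against the Identity column of the summary tables as a consistency check.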
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 535..555
None | No IPR available | COILS | Coil | Coil | coord: 604..624
None | No IPR available | COILS | Coil | Coil | coord: 632..659
None | No IPR available | GENE3D | 3.90.70.10 | Cysteine proteinases | coord: 51..353, e-value: 2.0E-108, score: 364.6
None | No IPR available | PANTHER | PTHR12411:SF777 | THIOL PROTEASE ALEURAIN | coord: 22..348
None | No IPR available | PANTHER | PTHR12411 | CYSTEINE PROTEASE FAMILY C1-RELATED | coord: 22..348
IPR000668 | Peptidase C1A, papain C-terminal | PRINTS | PR00705 | PAPAIN | coord: 158..173, score: 62.89; coord: 304..314, score: 63.06; coord: 319..325, score: 71.82
IPR000668 | Peptidase C1A, papain C-terminal | SMART | SM00645 | pept_c1 | coord: 140..355, e-value: 5.2E-101, score: 351.5
IPR000668 | Peptidase C1A, papain C-terminal | PFAM | PF00112 | Peptidase_C1 | coord: 140..350, e-value: 6.3E-74, score: 248.7
IPR013201 | Cathepsin propeptide inhibitor domain (I29) | SMART | SM00848 | Inhibitor_I29_2 | coord: 58..114, e-value: 1.0E-20, score: 84.8
IPR013201 | Cathepsin propeptide inhibitor domain (I29) | PFAM | PF08246 | Inhibitor_I29 | coord: 58..114, e-value: 1.1E-12, score: 48.2
IPR029399 | TMEM192 family | PFAM | PF14802 | TMEM192 | coord: 388..544, e-value: 1.2E-13, score: 50.8
IPR000169 | Cysteine peptidase, cysteine active site | PROSITE | PS00139 | THIOL_PROTEASE_CYS | coord: 158..169
IPR025660 | Cysteine peptidase, histidine active site | PROSITE | PS00639 | THIOL_PROTEASE_HIS | coord: 302..312
IPR025661 | Cysteine peptidase, asparagine active site | PROSITE | PS00640 | THIOL_PROTEASE_ASN | coord: 319..338
IPR039417 | Papain-like cysteine endopeptidase | CDD | cd02248 | Peptidase_C1A | coord: 141..345, e-value: 3.48041E-108, score: 324.192
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 55..352
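
The `coord: start..end` entries above give residue ranges on the predicted protein, so the three PROSITE active-site signatures can be checked against the Pfam Peptidase_C1 domain (140..350). The following is a minimal sketch, not part of the database record; the helper names `parse_coord` and `contains` are hypothetical and introduced only for illustration, and the coordinate strings are copied from the InterPro annotations.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coordinate range found")
    return int(m.group(1)), int(m.group(2))


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Pfam PF00112 (Peptidase_C1) domain range.
domain = parse_coord("coord: 140..350")

# PROSITE active-site signatures.
sites = {
    "PS00139 THIOL_PROTEASE_CYS": parse_coord("coord: 158..169"),
    "PS00639 THIOL_PROTEASE_HIS": parse_coord("coord: 302..312"),
    "PS00640 THIOL_PROTEASE_ASN": parse_coord("coord: 319..338"),
}

inside = {name: contains(domain, span) for name, span in sites.items()}
for name, ok in inside.items():
    print(name, "within Peptidase_C1:", ok)
```

The same containment test can be applied to any pair of ranges from the annotations, for instance to see that the TMEM192 hit (388..544) lies outside the Peptidase_C1 domain.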

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10007296.1 | HG10007296.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0051603 | proteolysis involved in cellular protein catabolic process
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0005615 | extracellular space
cellular_component | GO:0005764 | lysosome
molecular_function | GO:0004197 | cysteine-type endopeptidase activity
molecular_function | GO:0008234 | cysteine-type peptidase activity