HG10003804 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10003804
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionaspartic proteinase-like protein 2
LocationChr08: 9379914 .. 9389256 (-)
RNA-Seq ExpressionHG10003804
SyntenyHG10003804
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCACCGCCGGCATCGGTACTTCACGCCCTTTGACTCTGTTGCTTTTTGTCATCATCAACTTACTTTCCAACACCATCACCGGCGGCGGAGGAGGAGTTTATGCTGATAACGGTGTGTTCAGTGTCAAATACAAGTATGCCGGACGGGAACGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCTGGCGTCGATATCCCTCTCGGTGGCTCTGGCCGGCCTGATGCGGTCGGGTTAGTGGTATCGGTTTTTCAATTTTGTTTGGTATTGTATGTGTTGATTGTGTTGTGGTCTTAATTGGAGGGTTCACTTCGTTTCCAGGCTTTACTATGCCAAAATTGGAATTGGAACTCCTCCGAAGGATTATTATGTTCAAGTTGATACAGGGAGCGATATCGTATGGGTGAATTGTATTCAGTGTAGGGAGTGTCCGAGGAAAAGCTCTCTTGGGGTATTAAATATCTTGTTATTTTCACAGCATGTTGTCTTTATTTTTGATTGATGATGTGAAGGGGTTAATTGGAGTGTTTTTCCTTGGCATCAGATGGAACTCACTCCTTATGATTTAGGGGAGTCCACGACTGGGAAATTGGTTTCTTGTGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACGACTAATATGTCCTGCCCATATCTTCAGATATATGGTGACGGGAGCTCGACTGCAGGGTATTTTGTCAAGGACTACGTACAATATGATCGAGTTTCTGGAGACCTTGAAACTACAGCTGCCAATGGAAGTATCAAGTTTGGGTGAGTTGTTTTTATCCTGCTCTTTTGTACTCTGTAGTTTAGCTCCAACACACTGTAGGGGAATGATGAACGAGGCAGGAAACTATAATCTAGAAGGATGAAGAAACTATAGAGAGATCATCATGCATCAATATCCTCATAAACTTTTGTCCCTCGCAAAACTCTAAAAGAACAGAATAGGTGTCGAAGGAGAGTGCAAACTGTCCTTGTTCCTATAGCTTGATCTCTAGGCATGGTATTCAAGCCACAGTACTAGGTAGCAGTCAGAAAAATGGCAAAAGAGAATTAGGCACAATTTGAGATCTCCACAACCAAATAATAACTGGAGCTTCCTTCCCTCCATCCCCACCCCACCACCTTACTTTAAGACTTCCTATGAGTTGTGCAAATAAAGGATCTTTTGGGAGGTTCTTATAATAGTTTATGTACGAAAAGGAAATCTTCTTATACAGGAGGCTAGAGAAAAATTTATTGAATGATGAAGGTACCCAAGTTAGACAGTGGTGACTTGCTATGGTCATTTATGTACTGGAGGGACTACGGATAGATTTTATTGTCCATTTTTCATTTATCTAAATCGTCCTCCATTCTTTTACTGAAAGGAGCATGGGTTATGAGAGTGCTGTGCTGGAGGCTTTGGATCTAGAGGTAGCGAGTCCTCGTTAGAGACTCGATTATTCAAGAAAGTTCTGATTTGGTTATCCTCAAGGAAACAGAGTAGCACAATCTGGCTAGGAGGTAGGTCAAATTCATTAGGAGTTTCCACATTGCTTGGATGTCCTTTGAGCATTGGTGCTTAAAAGATACATTTTGGTTTATGGTATTGCCCCTTCTTTGAATGTTCTTATGCCTGTGGAGGGTTATTTTGTCTATCAACCCTGCTATTTTGGTTGGCTTCTCCTTTGGGATTCCTGGGGTATGGTACCAACTCTAACATTTATAGGAAACTGCATTGACAAGAAATTGAATACCTCTTTCATCTTCATTTGGATGGGTTCTTTGATTGTCTGGAGATTTTTATGTGGCAAAATAGATGAAAAATCTCCTAGTTCTTCATATATAAAGAGTAGGCAGCTGACATGCGATCTCCCTCTTATCTTTAGCGATGAACCCTATGCTCATGCTATTGGTGTCTTTTTGGGGTCAAACCTGAGGGGGATGGTTGTTCTTTCAAGATTCGTTGATTTCCAATACTTGGATTTGATTGGAGCGAGTTACATTTCATCTTTTTGGAAGTTAATTGCTTTATTATTATTATTATTATTATTTTTTTTGGGGGGGGGGGGGGGGGGGGGGGATTATTTCAACTACATGAATATATGATTATCTTACCCACGTTTCCTGATGTTAGAGATGATCATTTTGGTAGGTAATGGATGATTCTTTTATTGGGGTTTGGTCCTTTTATCTATGTGAATGTTGGTTAGCTTTTACCCATGTTTCCTGATAGGTTTCTAAAGCTAGTGGAAGTGATTGCTCGTATGCAATGGGCACCTTGGAGTCATATTATTAGTTCTTTCCCATGTCTAAGTAGCAATTTCATCACTATTTTTAGTTGGGTGATGATTCCTACATCAAATTTTGGGAGGATAACTAACGAGTGTAGTTCATTGAAATGTGAATATTACAAGTGGATGCATTCATTAATAGATAATGAAGGTACGTTGGCTTGATATTTGGGTCTTTGGACCTTGGGGGCTTGGGCATTGGTAATTTGAGAATGTGCAATGAAGTTATGTTGGCGAAGTGCTTTTGGCTTTTCCCCTTGGAGCTTGGCGTCCTATGGCATAGGGTCATTGTAAATAAGTATGGACTTCATCCTCGTATGTTTGATTGGGCTTCGAGTTGCAAGTCGAAAGGCACATCCAATAACCCTTTTTTTTATTTTTTTAGTGTGTATATATATATATATATAAATAAAATCGACTGTTTTCATTGAGAAAAAATGAGATAATACTAGGGCAAACAAGAAAAACCAAGCCCACAGGAAAAACCTCACTGGAGAAAGGGTTTCCAACTAAGTACCTATCTATCATTTGCCTTGTTTGGGATGATTCAAGTCCCTACTTTGAGGATCATTGGTGAGGTAAAAGTACATATCTATCATTTGCACTCTATAAAGTTGGTTTGTCTTGTTTATCTTTCTATATGCCTTTGTGTTGTTTCATGATTCTCAATGAAAGCTATGCCTCATTTAAAAAAAAAGGGAAAAGAAAAGAAAAAAAAAGAAAAGAAAAGATATTTGGGTCTACAGATTTGGAATCTTTATTTAGGAGATGCTCATGAGAAAGAGTGGGCTTTCCTTGTTGGCAATCTTGATGATGTTCATTTATCCATGAGGATGTGGAGATTAACCTTCAGTTCTTTGCTTCATAAGCTCATTGTTTGGATATTAATGGCTATGGTTTCTTGTACTCACCTTTGGCGGGATTTCAATTTCATTCATCAACGATATTAATGGCTATGGAAAGTGTTACCATTAAATTTATTTAGTTCTTTGCTTTGTAAGCTCGTTGATGAAAGCTCATCAACTTGAAATTCCATGTTGCTTTCTTGTTTGGCAACGTCACACATTGCATGTTAGCCCACGTTATTATCAACCACTTCAACATGCACCTTTTCATATGATCAAATATCGCTTCAACCTTGGGATTACATTTGTAGAGCCAGGCAAGTTTATTAGCTCAATAGTTAAACGAGTGAATTTATTAGATACCATCATGAAAGGTGTTTGCTCACCAATCTGTTCTTATTATTAAATATGAACTCTTCTTGTGGGAGACACAAGTCCTACTTCTTAGGCTTTTCATGTTAAGGCATCATGTTAAATTTCGTAATGTCTCATTAATCACCTCTATTTGACCATCAGGATGCAATGACATTTTCTATATCTCAACGGTGTGTCAAAACTTCTCTCACAAGCTTTTTCTAAACTGGCTTGAGAATTTGAATCCCAATCGCTTGTAATCAAATTCTGAATCCATGCAAGTCAACAATTTTCTTGCAGAACAGGTTGGCTACATGTACAAGGTCATTGGTTTTTCAGCAAGGAATGAATGTGTTATTTTGCTTAATTGATCTACAGGGGCAAAATTGAAACTATGTCTAAGAGATGTCAATTGCCACGTTGTAGTTTTTGTCAACATTGGCCCTCTGAGTTCATTCTCCTGTCCTGTTAATTTCATTTAATCTATTTGTTTCTTATGATTTTAGAATTTTAGAATTGATTTATTTGTTAATTTATTTCTGTTTGATAGATGTGGTGCCAGACAATCTGGGGATCTTGGCTCTTCTGGTGAAGAAGCACTTGATGGCATACTTGGTTTTGGAAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCAAGAAAAGTGAAGAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGGGGTGGTATCTTTGCGATGGGACATGTTGTGCAGCCAAAAGTTAATATGACTCCTTTGGTACCAAATCAGTATGTGATTTGCTTTCTTAGCTCTAAACTGTTGCTTTTAATAATCTCTTGATTTCCATCCTTTTCCCATCTGTAGAATTAAGTTGGTTTCTATGGTCCTTGCATTAACTTGTTGGTTTGTTCTATGAGTTTTATAAATCAAAATTTATGCAAAATGATATATATTTTGTTCTAATGAAATATATTAGCTATTAAAAACTAATTTGAAAGTGTTATGGCAAACCCAAAACTATTTTACTGTTTATGACTAGATATATTTTATTGTTATTATTACGATTTTTTAGTGCTTTCAACTTGCGAAGCATTGACACAGATACGTCTGGATATGCTTTGGACACGTGTTCGGTACACTTTAAAACATGTCTATAATATACACATATTTTCTCTAATTTCGGACACAACTTTGGAAAATATCTGACTTAAAAATTAGGAAAAAAATTGAGATTCTCTTATGGATTGAATCCTTATATATTGGTCAATGTTAAAGTAAGGATGTCATCTTCATCCTTTAGTCATAGCTAACTAAATCTTTTTGTATTTTTTAAGATTTTAAATGTCGTTAATTGACTTAGCTTTGTATGTTTTTAAATTTTAGTGAAGTATATTTTGTTCTTATGTTAAATTTACATGTTTTTATCTATAAAGAAGTATTAATATAATAATATAGATACATATATTTTATGAAAAAATGTGTTCCCAATGTGTCCGTGTCCTACTTTTTAAGAAATTGACATGTCACCATGTCCATATATCATGTAGTGTCTTGTGTCCGTGCTTCTTAGCTTTCAACTATGTTAAAAAAAATATGAATATCGACTGTTAATGATTCAGAACTGTAACTTTTTGCACCACTTGTGACTTCTTCTATTGATATCATATTATGATTATTTTATCTTTATTATGTGCATGATTCTATCTTTTATTAATATTCATTAATTAAATTTAGGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCTGCCGATGTATTCGAGGCAGGAGACAGAAAAGGGACAATCATTGATAGTGGTACAACCTTGGCATATCTTCCTGAACTGATATACGAGCCTTTAGTGACCATGGTACTAATAATGCCGAGTTCTATCCATTGGAGAATTTTATTACAAATGTGAAATGAGTCAAATTACCATATTTGATTCTGTTCTTGTAGATACTCTCTCAGCAACACAATTTGGAAGTTCAAACCATTCATGGAGAATATAAATGTTTTCAGTACTCAGAGAGGTATGGAGCATTCTGAATTTGGTTACTTATAATTTATATGAAGTTCTTTCTGTGTAATTCTGTCTTGTGTTAATCAGGTCAAACCAGAGTATTTGCTTAATTGGTGTATTTTTTATAGAAAAAAATCCAAATAAAACCTAGAGTGATGTTACCATTTGACCTAAAAACCAAACCAGCAAGATAGATCTCTACTTGTAATTTTGCAGTGTCATAGTCCATCATTGTTCACTTGTTGATTTGAGATATTGAGGTGAAGAATCAAGGCGATCTTGTCTTATGCAACTGTTCTCCTGATAAGTCTCATTATTTTTCCTGGAAAAGTTTGCCTGATAACATTTTTTACTCTGTTTTTTAGTGTCAGAAAAAATGTTATAGCCACTTTCTTTCATTAAGATAGATATCAAGAGATTCAGCTGTTATGATTGTCTTTGACTTGTATATCGATGGTGTAGAATGCTTTATTTATTATATTTTGCTTTGTTTGCAGGGTCGACGATGGATTTCCTCCAGTTATTTTCCATTTTGAGAACTCACTCTTGTTGAGGGTTTATCCTCATGAGTATCTATTCCAATATGTAAGTATGCTGGACTTGCGATTCTGTGTGCACTAACCCTTGAACTTGCTGCAATAATAGGTGTTTTAAAATTTCAATGTCTAACAATGCCGTTAGGCCACCTTTGTTAATTGCTTTGGTGCACCGTACTTCTAGATTCATAATGTATATTAACGTTTAAATCCAGTTTGTTGTTACAATTTTCTATTGGCAATCTTTCTTTATCTTAGGACTTTTTTGACACTGTTATTAATTGATCTGTGGTCTTGTTCTCATCTCATGAGATTTAGATGAAATTTTTTAGCATGAATTTAATTACCTAGTGAGCTTTCACTATTTTCCCTATCAATCATCTAATATAAGTTTGACAATTAAATGTATTCGACAATTGAAATCAAACGGTCAACCTAAATTGGTTAGGAAATGTGCCCTCAACCAAGAGGTTAAAAATTTGAGTGTCCCACCCCATATGTTAAACTAAAAAAAAACCATTGAAATCAAATGAAATGTCATTGTGAGCAAAACCTTGATGGACCCGTTCTTTATAATCTAGCTTTCTAGACCAGATAGACATTTTTATTTATCAGTATTATTGTTTTGTAGTTTGCTATTGTTGTTTCATTTAATGTAATTCATACATATACTTTGGCTATGTCAGCTTGCTAATGACGAATGTGATGTTTTTGACCCAAATGATTACTTTGTAGAGTTTATCCCATGTACTGAAATTTTATTTTCAAATCGGGTTGCTTCTTTCTTGGAGTAGATGGATGAGTTGGTTGACTAATATAGACTTCTACAGGAGGGCTTGTGGTGTATTGGTTGGCAAAACAGTGGGATGCAATCTAGAGATAGGAAGAACGTTACACTCTTTGGAGGTAAACTCCAAAATTTGTCCACGTGGATATTAATTCGTCTTAAAATTCTAGCTTGCCTTCTGGCAGGAAGGTTGCTAACTTTTAGAGTATCCATAATCCCTTATCTTCCCGTCGATTGTATGAGCTGTATATTCAGGCAGTGTGCTATGTAGAGCTCTTGATTACTGACAAGTTTATGGGTTAGAAATCATACCCTAAGAAAAAATTAGGTGCTGGTTCAATACTGTGCTCATTGTGGATGACATTACTTAGCTTTTTTTTATCTGTTGCTATAACGAATGACGAGCAGGTAATTCAATTTTTCAAAAAAGATTATTACTAAAATGAAGGGTGAGGATCCAAAACCTATCTATAAGGAAAAAATATTTAATACCTTAGACTAGTTTATTCACAGGCCGCCAAGTTTATAATTATTTTGGGTTTTACGCCAATCTCTCTCTGCAGGAGAAACGTGAAAGCATGATAGCTCCTCCCAGATGTTACATGTAATGGGGAGAAATGAAATAATCAGTTCTTATGACATTTGACCTTGGAAATAAAATGATAGTGAAAATGATTATAAAGCATAACCACCAACAGTTTTATATGGAGATAAAAGAGGCGAGTCCTTAATTTATGTAGTAAAGGTCCATACGAGGAGAAATTAATGATTATTAATCTCATTTTTTACTGTCTATTTGCTGCTGTTGGTTAAGTATGTGATCATCTATGAGTCATCTTTAATTCTTCGTACTATATTTTGCATAAAAGAGAATCGTATTTTAATTCCTAGTCCTAGCTGAACAGTGTATGATTTTCTTTTCTTATTGTTTGTGTTAAATTCAGCAAAGGGGGACTTTTCTTATTTTCTTTCTTTTCATTTTTCCTATCTTCTAGGCCTTCATTTATATCTCTTTTGCTTGCAGATTTAGTGCTTTCAAATAAGCTAGTTTTATATGATCTTGAAAATCAGACAATCGGGTGGACCGAATACAACTGTAAGTACAAAGTTTTTGTTTTGTTGTTTTCGAAGTTATCTCTGAATGACATTTCATGTGTTTGCTGCCCAACTTTTCCAGTATTTGCTGTAATTTGTCTTCACTTCAGTGTTGGTTTTTCGCTCTCTTTTTTTAATCCTCATGATGGAAGATTCTGATAAAATTCAGTTGCATAAATACGACACAAGCATACATGAAAATATTTATACATCAACATATTTTACATCTTCACACATGCCTCATATCTCATGGATTATTTATCAACAATAGGTTTAAATTCTATTATGGGACTTGAACTTCTATGTGATTCCTAAACCATTCTATATTAGTTTCTAAATTTTCAAATAGTTCTACTTTGCAGAAATAGCCATTAAGTTGGAACCTAATACTTGTTCAACTGAAGTAGCTTAACAAATTTGTACATGTAGACATCTGATCACCAAATACATGAACAGAAGTGTGCCCTTAAAAATTTTATTCAAGTGATTTATTGAAAAAAATCATCAGCAAGGACTAAAACGATTTTCTTTTGCAATATTTAAAAACCAAAATTGAATCTTTTGGGTTTTATTTGTTAACTATTTGGTTTTTGGTTTTTGATATTTGAAAATTAATCTAAACACTACTTCCACTTATGATTTTCTTTGTTTTGTTATCTACTTTTAGAGATGCTTTGAAAATCTGAGATAAGTTTTGAAAACTAGAAAAAGTATTTATTTTTGAAAATTTCACTTCATAATTATTTAAAATTAATTAACTGAAAGCTAACGCTAACGATCAAAGTGAACCTTAGCAAAATCTCAAGAGTAAAAGTGTAAAGTATTAAAACCTAGGGACCTCAAATACTCAAGGGTAAAAATGTAACATTTTGAAACCTAGGGACTAAATGGAAATCAAACTCTAAACTTGGGGTTAAAAGTTTGGCATTTTGAAATTTAGAGACCAAATGGAAACTACGTCCAAAACTTAGGGACCAAAAAAGTATTTTTCCTAAATAATTTGCGAGAAAACAAGGACAAATTTTAAAAACAAAAAACCAAATGGTTATCAAAGTAAGATTTAAACCTTAATAATATTTCTTCTTTTGTTCACTAGTGAACAAACATTTCTGAGATCTTCGACAAGAAATTTGATACGTTAGGGAGATAATTCACTTTAGTCCATCAATATCTCACATTTTTAGTAATTGATATTCAGGTTCTTCAAGCATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTACATTTCTTCAGCCCACAGATTGAATACCAAATGGGGTGTGATCTTGCTATTCCTAATCTTGCCGATGCATTGGTCAGCTCATTTCAGATGCCTTAGGTAA

mRNA sequence

ATGGCCACCGCCGGCATCGGTACTTCACGCCCTTTGACTCTGTTGCTTTTTGTCATCATCAACTTACTTTCCAACACCATCACCGGCGGCGGAGGAGGAGTTTATGCTGATAACGGTGTGTTCAGTGTCAAATACAAGTATGCCGGACGGGAACGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCTGGCGTCGATATCCCTCTCGGTGGCTCTGGCCGGCCTGATGCGGTCGGGCTTTACTATGCCAAAATTGGAATTGGAACTCCTCCGAAGGATTATTATGTTCAAGTTGATACAGGGAGCGATATCGTATGGGTGAATTGTATTCAGTGTAGGGAGTGTCCGAGGAAAAGCTCTCTTGGGATGGAACTCACTCCTTATGATTTAGGGGAGTCCACGACTGGGAAATTGGTTTCTTGTGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACGACTAATATGTCCTGCCCATATCTTCAGATATATGGTGACGGGAGCTCGACTGCAGGGTATTTTGTCAAGGACTACGTACAATATGATCGAGTTTCTGGAGACCTTGAAACTACAGCTGCCAATGGAAGTATCAAGTTTGGATGTGGTGCCAGACAATCTGGGGATCTTGGCTCTTCTGGTGAAGAAGCACTTGATGGCATACTTGGTTTTGGAAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCAAGAAAAGTGAAGAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGGGGTGGTATCTTTGCGATGGGACATGTTGTGCAGCCAAAAGTTAATATGACTCCTTTGGTACCAAATCAGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCTGCCGATGTATTCGAGGCAGGAGACAGAAAAGGGACAATCATTGATAGTGGTACAACCTTGGCATATCTTCCTGAACTGATATACGAGCCTTTAGTGACCATGATACTCTCTCAGCAACACAATTTGGAAGTTCAAACCATTCATGGAGAATATAAATGTTTTCAGTACTCAGAGAGGGTCGACGATGGATTTCCTCCAGTTATTTTCCATTTTGAGAACTCACTCTTGTTGAGGGTTTATCCTCATGAGTATCTATTCCAATATGAGGGCTTGTGGTGTATTGGTTGGCAAAACAGTGGGATGCAATCTAGAGATAGGAAGAACGTTACACTCTTTGGAGATTTAGTGCTTTCAAATAAGCTAGTTTTATATGATCTTGAAAATCAGACAATCGGGTGGACCGAATACAACTGTTCTTCAAGCATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTACATTTCTTCAGCCCACAGATTGAATACCAAATGGGGTGTGATCTTGCTATTCCTAATCTTGCCGATGCATTGGTCAGCTCATTTCAGATGCCTTAGGTAA

Coding sequence (CDS)

ATGGCCACCGCCGGCATCGGTACTTCACGCCCTTTGACTCTGTTGCTTTTTGTCATCATCAACTTACTTTCCAACACCATCACCGGCGGCGGAGGAGGAGTTTATGCTGATAACGGTGTGTTCAGTGTCAAATACAAGTATGCCGGACGGGAACGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCTGGCGTCGATATCCCTCTCGGTGGCTCTGGCCGGCCTGATGCGGTCGGGCTTTACTATGCCAAAATTGGAATTGGAACTCCTCCGAAGGATTATTATGTTCAAGTTGATACAGGGAGCGATATCGTATGGGTGAATTGTATTCAGTGTAGGGAGTGTCCGAGGAAAAGCTCTCTTGGGATGGAACTCACTCCTTATGATTTAGGGGAGTCCACGACTGGGAAATTGGTTTCTTGTGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACGACTAATATGTCCTGCCCATATCTTCAGATATATGGTGACGGGAGCTCGACTGCAGGGTATTTTGTCAAGGACTACGTACAATATGATCGAGTTTCTGGAGACCTTGAAACTACAGCTGCCAATGGAAGTATCAAGTTTGGATGTGGTGCCAGACAATCTGGGGATCTTGGCTCTTCTGGTGAAGAAGCACTTGATGGCATACTTGGTTTTGGAAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCAAGAAAAGTGAAGAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGGGGTGGTATCTTTGCGATGGGACATGTTGTGCAGCCAAAAGTTAATATGACTCCTTTGGTACCAAATCAGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCTGCCGATGTATTCGAGGCAGGAGACAGAAAAGGGACAATCATTGATAGTGGTACAACCTTGGCATATCTTCCTGAACTGATATACGAGCCTTTAGTGACCATGATACTCTCTCAGCAACACAATTTGGAAGTTCAAACCATTCATGGAGAATATAAATGTTTTCAGTACTCAGAGAGGGTCGACGATGGATTTCCTCCAGTTATTTTCCATTTTGAGAACTCACTCTTGTTGAGGGTTTATCCTCATGAGTATCTATTCCAATATGAGGGCTTGTGGTGTATTGGTTGGCAAAACAGTGGGATGCAATCTAGAGATAGGAAGAACGTTACACTCTTTGGAGATTTAGTGCTTTCAAATAAGCTAGTTTTATATGATCTTGAAAATCAGACAATCGGGTGGACCGAATACAACTGTTCTTCAAGCATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTACATTTCTTCAGCCCACAGATTGAATACCAAATGGGGTGTGATCTTGCTATTCCTAATCTTGCCGATGCATTGGTCAGCTCATTTCAGATGCCTTAGGTAA

Protein sequence

MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGVILLFLILPMHWSAHFRCLR
Homology
BLAST of HG10003804 vs. NCBI nr
Match: XP_038886489.1 (aspartic proteinase 39 [Benincasa hispida] >XP_038886490.1 aspartic proteinase 39 [Benincasa hispida])

HSP 1 Score: 998.8 bits (2581), Expect = 1.7e-287
Identity = 487/497 (97.99%), Postives = 492/497 (98.99%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAGIGTSRPLTLLLFVIIN LSNTIT GGGGVYADNGVFSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGIGTSRPLTLLLFVIINSLSNTIT-GGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR
Sbjct: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPRKSSLGMELTPYDL ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS
Sbjct: 121 ECPRKSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQP+VNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPRVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIY+PLVTMILSQQHNLEVQTIHGEYK
Sbjct: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYKPLVTMILSQQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSERVDDGFPPVIFHFENSLLL+VYPHEYLFQYE LWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYESLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQ+IGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA+RLNTKWGV
Sbjct: 421 GDLVLSNKLVLYDLENQSIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSANRLNTKWGV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           ILLFLIL MHWSAHFRC
Sbjct: 481 ILLFLILLMHWSAHFRC 496

BLAST of HG10003804 vs. NCBI nr
Match: XP_016900131.1 (PREDICTED: aspartic proteinase-like protein 2 isoform X1 [Cucumis melo])

HSP 1 Score: 979.5 bits (2531), Expect = 1.0e-281
Identity = 479/497 (96.38%), Postives = 483/497 (97.18%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAGIGTSRPLTLLLF+IINLLSNTIT GGG VYADNGVFSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTIT-GGGRVYADNGVFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQCR
Sbjct: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCR 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPR SSLGMELTPYDL ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS
Sbjct: 121 ECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYFVKDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLASSRKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASSRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           G VMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLV  ILSQQHNLEVQTIHGEYK
Sbjct: 301 GHVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSERVDDGFPPVIFHFENSLLL+VYPHEYLFQYE LWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHY+SSA RLNTKWGV
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYLSSAKRLNTKWGV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           I LFLIL MHWSAH RC
Sbjct: 481 IFLFLILLMHWSAHSRC 496

BLAST of HG10003804 vs. NCBI nr
Match: XP_004140876.1 (aspartic proteinase-like protein 2 [Cucumis sativus] >KGN45989.1 hypothetical protein Csa_005288 [Cucumis sativus])

HSP 1 Score: 979.2 bits (2530), Expect = 1.4e-281
Identity = 476/497 (95.77%), Postives = 484/497 (97.38%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAGIGTSRPLTLLLF+IINLLSNTI  GGGGVYADNG+FSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTI-NGGGGVYADNGIFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAG+DIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQCR
Sbjct: 61  DISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCR 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPR SSLGMELTPYDL ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS
Sbjct: 121 ECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYFVKDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLAS+RKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           G ++LNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLV  ILSQQHNLEVQTIHGEYK
Sbjct: 301 GHIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSERVDDGFPPVIFHFENSLLL+VYPHEYLFQYE LWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA RLNTKWGV
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAKRLNTKWGV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           ILLFLIL MHWSAH RC
Sbjct: 481 ILLFLILLMHWSAHSRC 496

BLAST of HG10003804 vs. NCBI nr
Match: XP_023514824.1 (aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo] >XP_023514825.1 aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 972.2 bits (2512), Expect = 1.7e-279
Identity = 471/498 (94.58%), Postives = 484/498 (97.19%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATA IGTSRP T+LLFVIINLLS+TI GGGGGVYADNGVFSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAAIGTSRPFTVLLFVIINLLSSTILGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DI+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNCIQC+
Sbjct: 61  DINRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNCIQCK 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPR+SSLGMELT YDL +STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGS
Sbjct: 121 ECPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAG FVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           GRV+LNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILS+QHNLEVQTIHGEYK
Sbjct: 301 GRVILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYS  VDDGFPPV FHFENSLLL+VYPHEYLFQ+EGLWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA+RLNTKW V
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNTKWAV 480

Query: 481 ILLFLILPMHWSAHFRCL 499
           +LLFLIL MHWSAH RCL
Sbjct: 481 MLLFLILVMHWSAHLRCL 498

BLAST of HG10003804 vs. NCBI nr
Match: XP_023005003.1 (aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005004.1 aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005005.1 aspartic proteinase-like protein 2 [Cucurbita maxima])

HSP 1 Score: 968.4 bits (2502), Expect = 2.4e-278
Identity = 472/502 (94.02%), Postives = 485/502 (96.61%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGG----GVYADNGVFSVKYKYAGRERSLST 60
           MA+A IGTSRP T+LLFVIINLLS+TI GGGG    GVYADNGVFSVKYKYAGRERSLST
Sbjct: 1   MASAAIGTSRPFTVLLFVIINLLSSTILGGGGGVAVGVYADNGVFSVKYKYAGRERSLST 60

Query: 61  LKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNC 120
           LKAHDI+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNC
Sbjct: 61  LKAHDINRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNC 120

Query: 121 IQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIY 180
           IQC+ECPR+SSLGMELT YDL +STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIY
Sbjct: 121 IQCKECPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIY 180

Query: 181 GDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240
           GDGSSTAG FVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG
Sbjct: 181 GDGSSTAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240

Query: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMT 300
           KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMT
Sbjct: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMT 300

Query: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIH 360
           GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILS+QHNLEVQ+IH
Sbjct: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQSIH 360

Query: 361 GEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKN 420
           GEYKCFQYS  VDDGFPPV FHFENSLLL+VYPHEYLFQ+EGLWCIGWQNSGMQSRDRKN
Sbjct: 361 GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKN 420

Query: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNT 480
           VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA+RLNT
Sbjct: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNT 480

Query: 481 KWGVILLFLILPMHWSAHFRCL 499
           KW VILLFLIL MHWSAHFRCL
Sbjct: 481 KWAVILLFLILVMHWSAHFRCL 502

BLAST of HG10003804 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 5.4e-124
Identity = 219/462 (47.40%), Postives = 303/462 (65.58%), Query Frame = 0

Query: 34  VYADNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIG 93
           V + N VF+V +K+AG+E+ LS LK+HD  R  R LA +D+PLGG  R D++GLY+ KI 
Sbjct: 24  VVSGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIK 83

Query: 94  IGTPPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFC 153
           +G+PPK+YYVQVDTGSDI+WVNC  C +CP K+ LG+ L+ YD   S+T K V C++ FC
Sbjct: 84  LGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC 143

Query: 154 LEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGC 213
             +       C     C Y  +YGDGS++ G F+KD +  ++V+G+L T      + FGC
Sbjct: 144 SFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGC 203

Query: 214 GARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGH 273
           G  QSG LG + + A+DGI+GFG+SN+SIISQLA+    K++F+HCLD +NGGGIFA+G 
Sbjct: 204 GKNQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE 263

Query: 274 VVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPE 333
           V  P V  TP+VPNQ HYNV + G+ V    +++   +       GTIIDSGTTLAYLP+
Sbjct: 264 VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQ 323

Query: 334 LIYEPLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYL 393
            +Y  L+  I ++Q  +++  +   + CF ++   D  FP V  HFE+SL L VYPH+YL
Sbjct: 324 NLYNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYL 383

Query: 394 FQY-EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKV 453
           F   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+YDLEN+ IGW ++NCSSSIKV
Sbjct: 384 FSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKV 443

Query: 454 QDEQTGTVHLVGSHYISSAHRLNTKWGVILLFLILPMHWSAH 495
           +D       L   + IS+A  +     V LL +++   W  H
Sbjct: 444 KDGSGAAYQLGAENLISAASSVMNGTLVTLLSILI---WVFH 478

BLAST of HG10003804 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 444.9 bits (1143), Expect = 1.2e-123
Identity = 209/438 (47.72%), Postives = 306/438 (69.86%), Query Frame = 0

Query: 38  NGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP 97
           N VF  ++K+AG++++L   K+HD  R  R LA +D+PLGG  R D+VGLY+ KI +G+P
Sbjct: 24  NFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSP 83

Query: 98  PKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVN 157
           PK+Y+VQVDTGSDI+W+NC  C +CP K++L   L+ +D+  S+T K V CD+ FC  ++
Sbjct: 84  PKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFIS 143

Query: 158 GGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQ 217
                 C   + C Y  +Y D S++ G F++D +  ++V+GDL+T      + FGCG+ Q
Sbjct: 144 QS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQ 203

Query: 218 SGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQP 277
           SG LG +G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  P
Sbjct: 204 SGQLG-NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSP 263

Query: 278 KVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYE 337
           KV  TP+VPNQ HYNV + G+ V    L++   +   G   GTI+DSGTTLAY P+++Y+
Sbjct: 264 KVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPKVLYD 323

Query: 338 PLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQY- 397
            L+  IL++Q  +++  +   ++CF +S  VD+ FPPV F FE+S+ L VYPH+YLF   
Sbjct: 324 SLIETILARQ-PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLE 383

Query: 398 EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQ 457
           E L+C GWQ  G+ + +R  V L GDLVLSNKLV+YDL+N+ IGW ++NCSSSIK++D  
Sbjct: 384 EELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKD-G 443

Query: 458 TGTVHLVGSHYISSAHRL 475
           +G V+ VG+  +SSA RL
Sbjct: 444 SGGVYSVGADNLSSAPRL 453

BLAST of HG10003804 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 2.9e-32
Identity = 111/426 (26.06%), Postives = 182/426 (42.72%), Query Frame = 0

Query: 37  DNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGT 96
           D+G    KY+   R       +   I+  L+  +G++ P+         G Y   + IGT
Sbjct: 50  DSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGD-----GEYLMNVAIGT 109

Query: 97  PPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEV 156
           P   +   +DTGSD++W  C  C +C            ++  +S++   + C+ Q+C ++
Sbjct: 110 PDSSFSAIMDTGSDLIWTQCEPCTQC-----FSQPTPIFNPQDSSSFSTLPCESQYCQDL 169

Query: 157 NGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGAR 216
              P   C  N  C Y   YGDGS+T GY   +   ++        T++  +I FGCG  
Sbjct: 170 ---PSETCNNN-ECQYTYGYGDGSTTQGYMATETFTFE--------TSSVPNIAFGCGED 229

Query: 217 QSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLD--GINGGGIFAMGHV 276
             G    +G     G++G G    S+ SQL   +     F++C+   G +     A+G  
Sbjct: 230 NQGFGQGNGA----GLIGMGWGPLSLPSQLGVGQ-----FSYCMTSYGSSSPSTLALGSA 289

Query: 277 V------QPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGD--RKGTIIDSGT 336
                   P   +     N  +Y + + G+ VG   L I +  F+  D    G IIDSGT
Sbjct: 290 ASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGT 349

Query: 337 TLAYLPELIYEPLVTMILSQQHNLEV--QTIHGEYKCFQY-SERVDDGFPPVIFHFENSL 396
           TL YLP+  Y   V    + Q NL    ++  G   CFQ  S+      P +   F+  +
Sbjct: 350 TLTYLPQDAYN-AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV 409

Query: 397 LLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTE 450
           L     +  +   EG+ C+      M S  +  +++FG++      VLYDL+N  + +  
Sbjct: 410 LNLGEQNILISPAEGVICL-----AMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 438

BLAST of HG10003804 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.5e-28
Identity = 112/414 (27.05%), Postives = 181/414 (43.72%), Query Frame = 0

Query: 62  ISRQLRF---LAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQ 121
           +SR  RF   L+  D+    SG   A G ++  I IGTPP   +   DTGSD+ WV C  
Sbjct: 59  VSRSRRFNHQLSQTDLQ---SGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 118

Query: 122 CRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGD 181
           C++C +++        +D  +S+T K   CD + C  ++        +N  C Y   YGD
Sbjct: 119 CQQCYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGD 178

Query: 182 GSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKS 241
            S + G    + V  D  SG     +  G++ FGCG    G    +G     GI+G G  
Sbjct: 179 QSFSKGDVATETVSIDSASG--SPVSFPGTV-FGCGYNNGGTFDETG----SGIIGLGGG 238

Query: 242 NSSIISQLASSRKVKKMFAHCLD----GINGGGIFAMGHVVQPK-------VNMTPLVPN 301
           + S+ISQL SS  + K F++CL       NG  +  +G    P        V  TPLV  
Sbjct: 239 HLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDK 298

Query: 302 QP--HYNVNMTGVQVGRVMLNISADVFEAGD-------RKGTIIDSGTTLAYLPELIYEP 361
           +P  +Y + +  + VG+  +  +   +   D           IIDSGTTL  L    ++ 
Sbjct: 299 EPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDK 358

Query: 362 LVTMILSQQHNLE-VQTIHGEYK-CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQY 421
             + +       + V    G    CF+ S   + G P +  HF  + +     + ++   
Sbjct: 359 FSSAVEESVTGAKRVSDPQGLLSHCFK-SGSAEIGLPEITVHFTGADVRLSPINAFVKLS 418

Query: 422 EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSI 451
           E + C+    +         V ++G+    + LV YDLE +T+ +   +CS+++
Sbjct: 419 EDMVCLSMVPT-------TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCSANL 447

BLAST of HG10003804 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 9.5e-28
Identity = 110/405 (27.16%), Postives = 179/405 (44.20%), Query Frame = 0

Query: 62  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCRE 121
           ISR+     GV + L GSG       Y+ +I +GTP K + V VDTGS++ WVNC     
Sbjct: 81  ISRKRNSTVGVKMDL-GSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNC----- 140

Query: 122 CPRKSSLGME-LTPYDLGESTTGKLVSCDEQFCLE--VNGGPLSGC-TTNMSCPYLQIYG 181
             R  + G +    +   ES + K V C  Q C    +N   L+ C T +  C Y   Y 
Sbjct: 141 --RYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYA 200

Query: 182 DGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGK 241
           DGS+  G F K+ +     +G +     +     GC +  +G       +  DG+LG   
Sbjct: 201 DGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTG----QSFQGADGVLGLAF 260

Query: 242 SNSSIISQLASSRKVKKMFAHCL------DGINGGGIFAMGHVVQPKVNMT---PLVPNQ 301
           S+ S  S   S    K  F++CL        ++   IF      +     T    L    
Sbjct: 261 SDFSFTSTATSLYGAK--FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIP 320

Query: 302 PHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQH 361
           P Y +N+ G+ +G  ML+I + V++A    GTI+DSGT+L  L +  Y+ +VT +   ++
Sbjct: 321 PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGL--ARY 380

Query: 362 NLEVQTIHGEYKCFQYSERVDDGF-----PPVIFHFENSLLLRVYPHEYLFQ-YEGLWCI 421
            +E++ +  E    +Y      GF     P + FH +       +   YL     G+ C+
Sbjct: 381 LVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCL 440

Query: 422 GWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 448
           G+ ++G  +       + G+++  N L  +DL   T+ +    C+
Sbjct: 441 GFVSAGTPA-----TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461

BLAST of HG10003804 vs. ExPASy TrEMBL
Match: A0A1S4DVW7 (aspartic proteinase-like protein 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488659 PE=3 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 5.0e-282
Identity = 479/497 (96.38%), Postives = 483/497 (97.18%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAGIGTSRPLTLLLF+IINLLSNTIT GGG VYADNGVFSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTIT-GGGRVYADNGVFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQCR
Sbjct: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCR 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPR SSLGMELTPYDL ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS
Sbjct: 121 ECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYFVKDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLASSRKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASSRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           G VMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLV  ILSQQHNLEVQTIHGEYK
Sbjct: 301 GHVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSERVDDGFPPVIFHFENSLLL+VYPHEYLFQYE LWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHY+SSA RLNTKWGV
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYLSSAKRLNTKWGV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           I LFLIL MHWSAH RC
Sbjct: 481 IFLFLILLMHWSAHSRC 496

BLAST of HG10003804 vs. ExPASy TrEMBL
Match: A0A0A0KAX9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G041740 PE=3 SV=1)

HSP 1 Score: 979.2 bits (2530), Expect = 6.6e-282
Identity = 476/497 (95.77%), Postives = 484/497 (97.38%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAGIGTSRPLTLLLF+IINLLSNTI  GGGGVYADNG+FSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTI-NGGGGVYADNGIFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAG+DIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQCR
Sbjct: 61  DISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCR 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPR SSLGMELTPYDL ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS
Sbjct: 121 ECPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYFVKDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLAS+RKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           G ++LNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLV  ILSQQHNLEVQTIHGEYK
Sbjct: 301 GHIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSERVDDGFPPVIFHFENSLLL+VYPHEYLFQYE LWCIGWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA RLNTKWGV
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAKRLNTKWGV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           ILLFLIL MHWSAH RC
Sbjct: 481 ILLFLILLMHWSAHSRC 496

BLAST of HG10003804 vs. ExPASy TrEMBL
Match: A0A6J1KXX5 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498125 PE=3 SV=1)

HSP 1 Score: 968.4 bits (2502), Expect = 1.2e-278
Identity = 472/502 (94.02%), Postives = 485/502 (96.61%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGG----GVYADNGVFSVKYKYAGRERSLST 60
           MA+A IGTSRP T+LLFVIINLLS+TI GGGG    GVYADNGVFSVKYKYAGRERSLST
Sbjct: 1   MASAAIGTSRPFTVLLFVIINLLSSTILGGGGGVAVGVYADNGVFSVKYKYAGRERSLST 60

Query: 61  LKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNC 120
           LKAHDI+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNC
Sbjct: 61  LKAHDINRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNC 120

Query: 121 IQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIY 180
           IQC+ECPR+SSLGMELT YDL +STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIY
Sbjct: 121 IQCKECPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIY 180

Query: 181 GDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240
           GDGSSTAG FVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG
Sbjct: 181 GDGSSTAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240

Query: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMT 300
           KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMT
Sbjct: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMT 300

Query: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIH 360
           GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILS+QHNLEVQ+IH
Sbjct: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQSIH 360

Query: 361 GEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKN 420
           GEYKCFQYS  VDDGFPPV FHFENSLLL+VYPHEYLFQ+EGLWCIGWQNSGMQSRDRKN
Sbjct: 361 GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKN 420

Query: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNT 480
           VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA+RLNT
Sbjct: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNT 480

Query: 481 KWGVILLFLILPMHWSAHFRCL 499
           KW VILLFLIL MHWSAHFRCL
Sbjct: 481 KWAVILLFLILVMHWSAHFRCL 502

BLAST of HG10003804 vs. ExPASy TrEMBL
Match: A0A6J1H6J5 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460583 PE=3 SV=1)

HSP 1 Score: 966.1 bits (2496), Expect = 5.8e-278
Identity = 468/497 (94.16%), Postives = 481/497 (96.78%), Query Frame = 0

Query: 2   ATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAHD 61
           A A IGTSRP T+LLFVIINLLS+TI GGGGGVYADNGVFSVKYKYAGR+RSLSTLKAHD
Sbjct: 3   AAAAIGTSRPFTVLLFVIINLLSSTILGGGGGVYADNGVFSVKYKYAGRQRSLSTLKAHD 62

Query: 62  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCRE 121
           I+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNCIQC+E
Sbjct: 63  INRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNCIQCKE 122

Query: 122 CPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 181
           CPR+SSLGMELT YDL +STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 123 CPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIYGDGSS 182

Query: 182 TAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 241
           TAG FVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGS GEEALDGILGFGKSNSS
Sbjct: 183 TAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSPGEEALDGILGFGKSNSS 242

Query: 242 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 301
           IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQVG
Sbjct: 243 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVG 302

Query: 302 RVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYKC 361
           RVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILS+QHNLEVQTIHGEYKC
Sbjct: 303 RVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKC 362

Query: 362 FQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLFG 421
           FQYS  VDDGFPPV FHFENSLLL+VYPHEYLFQ+EGLWCIGWQNSGMQSRDRKNVTLFG
Sbjct: 363 FQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKNVTLFG 422

Query: 422 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGVI 481
           DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSA+RLNTKW V+
Sbjct: 423 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNTKWAVM 482

Query: 482 LLFLILPMHWSAHFRCL 499
           LLFLIL MHWSAH RCL
Sbjct: 483 LLFLILVMHWSAHSRCL 499

BLAST of HG10003804 vs. ExPASy TrEMBL
Match: A0A6J1DAP8 (aspartic proteinase-like protein 2 OS=Momordica charantia OX=3673 GN=LOC111018586 PE=3 SV=1)

HSP 1 Score: 960.3 bits (2481), Expect = 3.2e-276
Identity = 468/497 (94.16%), Postives = 482/497 (96.98%), Query Frame = 0

Query: 1   MATAGIGTSRPLTLLLFVIINLLSNTITGGGGGVYADNGVFSVKYKYAGRERSLSTLKAH 60
           MATAG+GTSRPLTLLL VI NLLSNTIT GGGGV A++GVFSVKYKYAGRERSLSTLKAH
Sbjct: 1   MATAGMGTSRPLTLLLLVIFNLLSNTIT-GGGGVSAESGVFSVKYKYAGRERSLSTLKAH 60

Query: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCR 120
           DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQC 
Sbjct: 61  DISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCM 120

Query: 121 ECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGS 180
           ECPRKS+LGMELTPY++ ESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGS
Sbjct: 121 ECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGS 180

Query: 181 STAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240
           STAGYF+KDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS
Sbjct: 181 STAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNS 240

Query: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300
           SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV
Sbjct: 241 SIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQV 300

Query: 301 GRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSQQHNLEVQTIHGEYK 360
           GRVMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLVTMI+S+Q NLEVQTIHGEYK
Sbjct: 301 GRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYK 360

Query: 361 CFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYEGLWCIGWQNSGMQSRDRKNVTLF 420
           CFQYSE VDDGFPPVIFHFENSLLL+VYPHEYLFQYEGLWC+GWQNSGMQSRDRKNVTLF
Sbjct: 361 CFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLF 420

Query: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAHRLNTKWGV 480
           GDLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN++W V
Sbjct: 421 GDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAV 480

Query: 481 ILLFLILPMHWSAHFRC 498
           ILLFLIL MHWSAHFRC
Sbjct: 481 ILLFLILLMHWSAHFRC 496

BLAST of HG10003804 vs. TAIR 10
Match: AT1G05840.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 709.5 bits (1830), Expect = 1.9e-204
Identity = 336/454 (74.01%), Postives = 387/454 (85.24%), Query Frame = 0

Query: 34  VYADNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIG 93
           V  + GVF+VKY+Y   + SL+ LK HD  RQL  LAG+D+PLGG+GRPD  GLYYAKIG
Sbjct: 26  VSCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIG 85

Query: 94  IGTPPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFC 153
           IGTP K YYVQVDTGSDI+WVNCIQC++CPR+S+LG+ELT Y++ ES +GKLVSCD+ FC
Sbjct: 86  IGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFC 145

Query: 154 LEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGC 213
            +++GGPLSGC  NMSCPYL+IYGDGSSTAGYFVKD VQYD V+GDL+T  ANGS+ FGC
Sbjct: 146 YQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGC 205

Query: 214 GARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGH 273
           GARQSGDL SS EEALDGILGFGK+NSS+ISQLASS +VKK+FAHCLDG NGGGIFA+G 
Sbjct: 206 GARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGR 265

Query: 274 VVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPE 333
           VVQPKVNMTPLVPNQPHYNVNMT VQVG+  L I AD+F+ GDRKG IIDSGTTLAYLPE
Sbjct: 266 VVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPE 325

Query: 334 LIYEPLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYL 393
           +IYEPLV  I SQ+  L+V  +  +YKCFQYS RVD+GFP V FHFENS+ LRVYPH+YL
Sbjct: 326 IIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYL 385

Query: 394 FQYEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQ 453
           F +EG+WCIGWQNS MQSRDR+N+TL GDLVLSNKLVLYDLENQ IGWTEYNCSSSIKV+
Sbjct: 386 FPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445

Query: 454 DEQTGTVHLVGSHYISSAHRLNTKWGVILLFLIL 488
           DE TGTVHLVGSH+ISSA  L+T   ++   L+L
Sbjct: 446 DEGTGTVHLVGSHFISSALPLDTSMCLLFSLLLL 479

BLAST of HG10003804 vs. TAIR 10
Match: AT3G02740.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 501.1 bits (1289), Expect = 1.0e-141
Identity = 238/455 (52.31%), Postives = 326/455 (71.65%), Query Frame = 0

Query: 36  ADNGVFSVKYKYAG-RERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGI 95
           ++N VF V+ K+AG R + L  L+AHD+ R  R L+ +DIPLGG  +P+++GLY+AKIG+
Sbjct: 32  SENLVFEVRSKFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGL 91

Query: 96  GTPPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCL 155
           GTP +D++VQVDTGSDI+WVNC  C  CPRKS L +ELTPYD+  S+T K VSC + FC 
Sbjct: 92  GTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFCS 151

Query: 156 EVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCG 215
            VN    S C +  +C Y+ +YGDGSST GY VKD V  D V+G+ +T + NG+I FGCG
Sbjct: 152 YVN--QRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCG 211

Query: 216 ARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHV 275
           ++QSG LG S + A+DGI+GFG+SNSS ISQLAS  KVK+ FAHCLD  NGGGIFA+G V
Sbjct: 212 SKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEV 271

Query: 276 VQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPEL 335
           V PKV  TP++    HY+VN+  ++VG  +L +S++ F++GD KG IIDSGTTL YLP+ 
Sbjct: 272 VSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDA 331

Query: 336 IYEPLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLF 395
           +Y PL+  IL+    L + T+   + CF Y++++ D FP V F F+ S+ L VYP EYLF
Sbjct: 332 VYNPLLNEILASHPELTLHTVQESFTCFHYTDKL-DRFPTVTFQFDKSVSLAVYPREYLF 391

Query: 396 QY-EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQ 455
           Q  E  WC GWQN G+Q++   ++T+ GD+ LSNKLV+YD+ENQ IGWT +NCS  I+V+
Sbjct: 392 QVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQVK 451

Query: 456 DEQTGTVHLVGSHYISSAHRLNTKWGVILLFLILP 489
           DE++G ++ VG+H +S +  L     + L+ L++P
Sbjct: 452 DEESGAIYTVGAHNLSWSSSLAITKLLTLVSLLIP 481

BLAST of HG10003804 vs. TAIR 10
Match: AT5G36260.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 446.0 bits (1146), Expect = 3.8e-125
Identity = 219/462 (47.40%), Postives = 303/462 (65.58%), Query Frame = 0

Query: 34  VYADNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIG 93
           V + N VF+V +K+AG+E+ LS LK+HD  R  R LA +D+PLGG  R D++GLY+ KI 
Sbjct: 24  VVSGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIK 83

Query: 94  IGTPPKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFC 153
           +G+PPK+YYVQVDTGSDI+WVNC  C +CP K+ LG+ L+ YD   S+T K V C++ FC
Sbjct: 84  LGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC 143

Query: 154 LEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGC 213
             +       C     C Y  +YGDGS++ G F+KD +  ++V+G+L T      + FGC
Sbjct: 144 SFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGC 203

Query: 214 GARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGH 273
           G  QSG LG + + A+DGI+GFG+SN+SIISQLA+    K++F+HCLD +NGGGIFA+G 
Sbjct: 204 GKNQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE 263

Query: 274 VVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPE 333
           V  P V  TP+VPNQ HYNV + G+ V    +++   +       GTIIDSGTTLAYLP+
Sbjct: 264 VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQ 323

Query: 334 LIYEPLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYL 393
            +Y  L+  I ++Q  +++  +   + CF ++   D  FP V  HFE+SL L VYPH+YL
Sbjct: 324 NLYNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYL 383

Query: 394 FQY-EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKV 453
           F   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+YDLEN+ IGW ++NCSSSIKV
Sbjct: 384 FSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKV 443

Query: 454 QDEQTGTVHLVGSHYISSAHRLNTKWGVILLFLILPMHWSAH 495
           +D       L   + IS+A  +     V LL +++   W  H
Sbjct: 444 KDGSGAAYQLGAENLISAASSVMNGTLVTLLSILI---WVFH 478

BLAST of HG10003804 vs. TAIR 10
Match: AT1G65240.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 444.9 bits (1143), Expect = 8.6e-125
Identity = 209/438 (47.72%), Postives = 306/438 (69.86%), Query Frame = 0

Query: 38  NGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP 97
           N VF  ++K+AG++++L   K+HD  R  R LA +D+PLGG  R D+VGLY+ KI +G+P
Sbjct: 24  NFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSP 83

Query: 98  PKDYYVQVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVN 157
           PK+Y+VQVDTGSDI+W+NC  C +CP K++L   L+ +D+  S+T K V CD+ FC  ++
Sbjct: 84  PKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFIS 143

Query: 158 GGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQ 217
                 C   + C Y  +Y D S++ G F++D +  ++V+GDL+T      + FGCG+ Q
Sbjct: 144 QS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQ 203

Query: 218 SGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQP 277
           SG LG +G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  P
Sbjct: 204 SGQLG-NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSP 263

Query: 278 KVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYE 337
           KV  TP+VPNQ HYNV + G+ V    L++   +   G   GTI+DSGTTLAY P+++Y+
Sbjct: 264 KVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPKVLYD 323

Query: 338 PLVTMILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQY- 397
            L+  IL++Q  +++  +   ++CF +S  VD+ FPPV F FE+S+ L VYPH+YLF   
Sbjct: 324 SLIETILARQ-PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLE 383

Query: 398 EGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQ 457
           E L+C GWQ  G+ + +R  V L GDLVLSNKLV+YDL+N+ IGW ++NCSSSIK++D  
Sbjct: 384 EELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKD-G 443

Query: 458 TGTVHLVGSHYISSAHRL 475
           +G V+ VG+  +SSA RL
Sbjct: 444 SGGVYSVGADNLSSAPRL 453

BLAST of HG10003804 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 336.7 bits (862), Expect = 3.3e-92
Identity = 175/416 (42.07%), Postives = 240/416 (57.69%), Query Frame = 0

Query: 48  AGRERSLSTLKAHDISRQLRFLAG----VDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYV 107
           A  E  LS LKA D +R  R L      +D P+ G+  P  VGLYY K+ +GTPP+D+YV
Sbjct: 37  ANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYV 96

Query: 108 QVDTGSDIVWVNCIQCRECPRKSSLGMELTPYDLGESTTGKLVSCDEQFCLEVNGGPLSG 167
           QVDTGSD++WV+C  C  CP+ S L ++L  +D G S T   +SC +Q C        SG
Sbjct: 97  QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156

Query: 168 CTT-NMSCPYLQIYGDGSSTAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLG 227
           C+  N  C Y   YGDGS T+G++V D +Q+D + G      +   + FGC   Q+GDL 
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV 216

Query: 228 SSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGIN-GGGIFAMGHVVQPKVNM 287
            S + A+DGI GFG+   S+ISQLAS     ++F+HCL G N GGGI  +G +V+P +  
Sbjct: 217 KS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 276

Query: 288 TPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVT 347
           TPLVP+QPHYNVN+  + V    L I+  VF   + +GTIID+GTTLAYL E  Y P V 
Sbjct: 277 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 336

Query: 348 MILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLRVYPHEYLFQYE---- 407
            I +         +    +C+  +  V D FPPV  +F     + + P +YL Q      
Sbjct: 337 AITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGG 396

Query: 408 -GLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKV 453
             +WCIG+Q         + +T+ GDLVL +K+ +YDL  Q IGW  Y+CS+S+ V
Sbjct: 397 TAVWCIGFQRI-----QNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNV 446

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886489.11.7e-28797.99aspartic proteinase 39 [Benincasa hispida] >XP_038886490.1 aspartic proteinase 3... [more]
XP_016900131.11.0e-28196.38PREDICTED: aspartic proteinase-like protein 2 isoform X1 [Cucumis melo][more]
XP_004140876.11.4e-28195.77aspartic proteinase-like protein 2 [Cucumis sativus] >KGN45989.1 hypothetical pr... [more]
XP_023514824.11.7e-27994.58aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo] >XP_023514825.1 ... [more]
XP_023005003.12.4e-27894.02aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005004.1 aspartic p... [more]
Match NameE-valueIdentityDescription
Q4V3D25.4e-12447.40Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K41.2e-12347.72Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q766C22.9e-3226.06Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q3EBM51.5e-2827.05Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q9LTW49.5e-2827.16Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE... [more]
Match NameE-valueIdentityDescription
A0A1S4DVW75.0e-28296.38aspartic proteinase-like protein 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1034... [more]
A0A0A0KAX96.6e-28295.77Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G04174... [more]
A0A6J1KXX51.2e-27894.02aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498125 P... [more]
A0A6J1H6J55.8e-27894.16aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460583... [more]
A0A6J1DAP83.2e-27694.16aspartic proteinase-like protein 2 OS=Momordica charantia OX=3673 GN=LOC11101858... [more]
Match NameE-valueIdentityDescription
AT1G05840.11.9e-20474.01Eukaryotic aspartyl protease family protein [more]
AT3G02740.11.0e-14152.31Eukaryotic aspartyl protease family protein [more]
AT5G36260.13.8e-12547.40Eukaryotic aspartyl protease family protein [more]
AT1G65240.18.6e-12547.72Eukaryotic aspartyl protease family protein [more]
AT5G22850.13.3e-9242.07Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 320..331
score: 42.62
coord: 418..433
score: 25.0
coord: 94..114
score: 48.74
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 12..464
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 79..272
e-value: 4.3E-47
score: 162.6
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 273..452
e-value: 1.3E-41
score: 144.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 83..450
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 290..442
e-value: 2.0E-18
score: 66.7
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 88..272
e-value: 1.3E-36
score: 126.5
NoneNo IPR availablePANTHERPTHR13683:SF685EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 12..464
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 88..442
score: 38.714745
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 88..446
e-value: 9.58481E-58
score: 190.552

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10003804.1HG10003804.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity