HG10021500 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021500
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPeptidase A1 domain-containing protein
LocationChr05: 10131748 .. 10133295 (+)
RNA-Seq ExpressionHG10021500
SyntenyHG10021500
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTTCAATATCAACCTCCATTGCAACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTTCAAACCCTAAAACCAATAATTTCCCCACAGATTCTCTAGTTATTGGTCTTGCTCATTCAAGAACATCCCTCCTTACCCCTAAAAAAGGCTATAATTCCATGTCAAGGAAGAGAATGAAGGCAATGGAAATGGATAGTGATGATAATGTAATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACAGGAAGTGATCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCTATTAGAGACACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGACCATGCCCTTCATTTGCTTACACTTATGGGTCAAGTGGGGTTGTAATTGGAACTTTAACAAGAGATATCCTTTTAATGCATGGAAATAATATTAATTCTCCAATTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGTATAGCTGGTTTTGGTAGAGGTTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTGCTATTTCTTCAAAAGATGGCCATTTACAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAGTCAATCACTATAGGAAATGGGAATAATAATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACCTATACTCATTTACCTGAACCTTTGTATTCACAACTTATTTCAAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAACTCAATACAGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGATTAAATGCTTGTTGTTTCAAAGCATGGATGACGGTGTTGGTGGCGATAACGATGACGATAACGGCCGAGATGGGCCGGCGGGCATTTTCGGAAGCTTCCAACAACAAAATTTAGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTGAAGGACTCCACAAGAATGTTTGA

mRNA sequence

ATGCCTTCAATATCAACCTCCATTGCAACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTTCAAACCCTAAAACCAATAATTTCCCCACAGATTCTCTAGTTATTGGTCTTGCTCATTCAAGAACATCCCTCCTTACCCCTAAAAAAGGCTATAATTCCATGTCAAGGAAGAGAATGAAGGCAATGGAAATGGATAGTGATGATAATGTAATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACAGGAAGTGATCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCTATTAGAGACACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGACCATGCCCTTCATTTGCTTACACTTATGGGTCAAGTGGGGTTGTAATTGGAACTTTAACAAGAGATATCCTTTTAATGCATGGAAATAATATTAATTCTCCAATTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGTATAGCTGGTTTTGGTAGAGGTTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTGCTATTTCTTCAAAAGATGGCCATTTACAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAGTCAATCACTATAGGAAATGGGAATAATAATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACCTATACTCATTTACCTGAACCTTTGTATTCACAACTTATTTCAAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAACTCAATACAGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGATTAAATGCTTGTTGTTTCAAAGCATGGATGACGGTGTTGGTGGCGATAACGATGACGATAACGGCCGAGATGGGCCGGCGGGCATTTTCGGAAGCTTCCAACAACAAAATTTAGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTGAAGGACTCCACAAGAATGTTTGA

Coding sequence (CDS)

ATGCCTTCAATATCAACCTCCATTGCAACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTTCAAACCCTAAAACCAATAATTTCCCCACAGATTCTCTAGTTATTGGTCTTGCTCATTCAAGAACATCCCTCCTTACCCCTAAAAAAGGCTATAATTCCATGTCAAGGAAGAGAATGAAGGCAATGGAAATGGATAGTGATGATAATGTAATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACAGGAAGTGATCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCTATTAGAGACACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGACCATGCCCTTCATTTGCTTACACTTATGGGTCAAGTGGGGTTGTAATTGGAACTTTAACAAGAGATATCCTTTTAATGCATGGAAATAATATTAATTCTCCAATTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGTATAGCTGGTTTTGGTAGAGGTTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTGCTATTTCTTCAAAAGATGGCCATTTACAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAGTCAATCACTATAGGAAATGGGAATAATAATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACCTATACTCATTTACCTGAACCTTTGTATTCACAACTTATTTCAAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAACTCAATACAGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGATTAAATGCTTGTTGTTTCAAAGCATGGATGACGGTGTTGGTGGCGATAACGATGACGATAACGGCCGAGATGGGCCGGCGGGCATTTTCGGAAGCTTCCAACAACAAAATTTAGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTGAAGGACTCCACAAGAATGTTTGA

Protein sequence

MPSISTSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV
Homology
BLAST of HG10021500 vs. NCBI nr
Match: XP_038893627.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 968.4 bits (2502), Expect = 2.5e-278
Identity = 480/516 (93.02%), Postives = 491/516 (95.16%), Query Frame = 0

Query: 1   MPSISTSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYN 60
           M SISTS A K LS+FLLLVYVSRKTLA+NPKTN  P DSLVIGL HSRT+LLTPKKGYN
Sbjct: 1   MASISTSFAKKILSYFLLLVYVSRKTLATNPKTNG-PKDSLVIGLVHSRTTLLTPKKGYN 60

Query: 61  SMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL 120
            +SRKRMKAMEM  DDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL
Sbjct: 61  FISRKRMKAMEM--DDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL 120

Query: 121 SFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 180
           SFDCQDCEEYQNNV GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL
Sbjct: 121 SFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 180

Query: 181 ATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSP-ISTKQIPRFCFGCVG 240
           ATLVK TCPRPCPSFAYTYG+SGVVIGTLTRD+LLMH NNINSP  STK+ PRFCFGCVG
Sbjct: 181 ATLVKATCPRPCPSFAYTYGASGVVIGTLTRDVLLMHINNINSPNSSTKKTPRFCFGCVG 240

Query: 241 ASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300
           ASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+SSKD
Sbjct: 241 ASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAVSSKD 300

Query: 301 GHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360
            HLQFTPLLKSPIYPNYYYIGLESITIGNGN+NFRFGVSF LREIDTKGNGGMLIDSGTT
Sbjct: 301 EHLQFTPLLKSPIYPNYYYIGLESITIGNGNSNFRFGVSFNLREIDTKGNGGMLIDSGTT 360

Query: 361 YTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITF 420
           YTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITF
Sbjct: 361 YTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITF 420

Query: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSF 480
           HFLNNVSVVLPQ NNFYAMAAPINSTV+KCLLFQSM DGVGGD DDD  RDGPAGIFGSF
Sbjct: 421 HFLNNVSVVLPQENNFYAMAAPINSTVVKCLLFQSM-DGVGGDTDDD--RDGPAGIFGSF 480

Query: 481 QQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           QQQNLEVVYDLEKERLGFQPMDCA VAAT+GLHKNV
Sbjct: 481 QQQNLEVVYDLEKERLGFQPMDCAYVAATQGLHKNV 510

BLAST of HG10021500 vs. NCBI nr
Match: XP_008459091.1 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043307.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa] >TYK29371.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 940.3 bits (2429), Expect = 7.2e-270
Identity = 462/521 (88.68%), Postives = 490/521 (94.05%), Query Frame = 0

Query: 1   MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGY 60
           MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGY
Sbjct: 1   MPSISSTSIATKFLSLFLLLVHASKQTLATNPKT-NFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCG 120
           N +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFC 240
           SLATLVKGTCPRPCPSFAYTYG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFC
Sbjct: 181 SLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFC 240

Query: 241 FGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA 300
           FGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Sbjct: 241 FGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLA 300

Query: 301 ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360
           ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI
Sbjct: 301 ISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360

Query: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQL 420
           DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN SF+DDSQL
Sbjct: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQL 420

Query: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAG 480
           PSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAG
Sbjct: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSM-DGVGDDNDSDD--NGPAG 480

Query: 481 IFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           IFGSFQQQNL+VVYDLEKERLGFQ MDC SVAA +GLHKNV
Sbjct: 481 IFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNV 517

BLAST of HG10021500 vs. NCBI nr
Match: XP_004145478.2 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN66888.1 hypothetical protein Csa_007266 [Cucumis sativus])

HSP 1 Score: 932.9 bits (2410), Expect = 1.2e-267
Identity = 455/517 (88.01%), Postives = 486/517 (94.00%), Query Frame = 0

Query: 1   MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGY 60
           MPSIS+ S ATKFLS FLLLV+VS +TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGY
Sbjct: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKT-NFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCG 120
           N +S+KRMKAM+  D DDNVIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCV 240
           SLA+LVKGTCPRPCPSFAYTYG+SGVV G+LTRD+L  HGN  N+  + KQIPRFCFGCV
Sbjct: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCV 240

Query: 241 GASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300
           GA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Sbjct: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300

Query: 301 DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT 360
           D +LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGT
Sbjct: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360

Query: 361 TYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT 420
           TYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKVPCKNNN SF+DD+QLPSIT
Sbjct: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420

Query: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGS 480
           FHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGS
Sbjct: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSM-DGVGDDNDSDD--NGPAGIFGS 480

Query: 481 FQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           FQQQN+EVVYDLEKERLGFQPMDC SVAA +GLHKNV
Sbjct: 481 FQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNV 513

BLAST of HG10021500 vs. NCBI nr
Match: XP_022142611.1 (probable aspartyl protease At4g16563 [Momordica charantia])

HSP 1 Score: 852.8 bits (2202), Expect = 1.5e-243
Identity = 430/524 (82.06%), Postives = 463/524 (88.36%), Query Frame = 0

Query: 3   SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYN 62
           S +T+I++K L+FFLLL+  +S    A+ P  NNFP TDSLV+GL HSRTSLLTPK+GY 
Sbjct: 6   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 65

Query: 63  S---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPC 122
           S    S    K ME    DNVIEPLREIRDGYL+SLTLGTPPQVIQVYMDTGSDLTWVPC
Sbjct: 66  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 125

Query: 123 GNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 182
           GNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 126 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 185

Query: 183 CSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC 242
           CSLATLVKGTCPRPCPSFAYTYG+SGVV GTLT+D++LMHG    SP ST QIPRFCFGC
Sbjct: 186 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGC 245

Query: 243 VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 302
           VGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Sbjct: 246 VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS 305

Query: 303 KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLID 362
           KD  LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNN RFGVS KLREIDTKGNGGMLID
Sbjct: 306 KDHSLQFTPLLKSPLYPNYYYIGLESVTIGDGIGNNNSRFGVSLKLREIDTKGNGGMLID 365

Query: 363 SGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNN-NF-SFIDD-- 422
           SGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN NF S +DD  
Sbjct: 366 SGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYC 425

Query: 423 -SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRD 482
            + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTV+KCLLFQSMD G GGD D +   D
Sbjct: 426 SNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGG-GGDGDGNGDDD 485

Query: 483 GPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKN 515
           GPAGIFGSFQQQN+EVVYDL+KER+GFQ MDCAS AA++GLHKN
Sbjct: 486 GPAGIFGSFQQQNMEVVYDLQKERIGFQTMDCASSAASQGLHKN 525

BLAST of HG10021500 vs. NCBI nr
Match: XP_023520027.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 844.0 bits (2179), Expect = 7.0e-241
Identity = 419/513 (81.68%), Postives = 460/513 (89.67%), Query Frame = 0

Query: 4   ISTSIATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYN 63
           +++ +A  F    L+LV VS + +    +NPKT  F  DSLV+GL HSRTSLLTPK+GYN
Sbjct: 1   MASIVAKSFFVLVLVLVLVSGEAMGQTLANPKT-KFLKDSLVLGLVHSRTSLLTPKRGYN 60

Query: 64  SMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL 123
           S+SRKR+K MEM +DD VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL
Sbjct: 61  SLSRKRIKPMEMGNDD-VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNL 120

Query: 124 SFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSL 183
           SFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSL
Sbjct: 121 SFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSL 180

Query: 184 ATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGA 243
           ATLVKGTCPRPCPSF+YTYG+SG+VIGTLT+D++ +HG   NSP S+++IP+FCFGCVGA
Sbjct: 181 ATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHG---NSPNSSRKIPKFCFGCVGA 240

Query: 244 SYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDG 303
           +YREPIGIAGFGRGLLSLP+QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+ 
Sbjct: 241 TYREPIGIAGFGRGLLSLPYQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKE- 300

Query: 304 HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTY 363
           HL+FTPLLKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTY
Sbjct: 301 HLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTY 360

Query: 364 THLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFH 423
           THLPEPLYSQLISNLES+ISYPRAK+ ELNTGFDLCYKVP KNN F F D+ +LPSITFH
Sbjct: 361 THLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTF-FSDEFELPSITFH 420

Query: 424 FLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQ 483
           FLNNVSVVLPQGN+FYAMAAP NSTV+KCLLFQSMD    GD       DGPAGIFGSFQ
Sbjct: 421 FLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMD----GDG------DGPAGIFGSFQ 480

Query: 484 QQNLEVVYDLEKERLGFQPMDCASVAATEGLHK 514
           QQNLEVVYDLEKERLGF+ MDCASVA ++GLHK
Sbjct: 481 QQNLEVVYDLEKERLGFEGMDCASVAVSQGLHK 496

BLAST of HG10021500 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 2.7e-54
Identity = 161/492 (32.72%), Postives = 227/492 (46.14%), Query Frame = 0

Query: 47  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYM 106
           HS + L   K   +  S +  +         +  P+    D YL+SL++G+    + +Y+
Sbjct: 42  HSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSD-YLISLSVGSSSSAVSLYL 101

Query: 107 DTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSS 166
           DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS
Sbjct: 102 DTGSDLVWFPC--RPFTCILCE---SKPLPPSPPSSL---SSSATTVSCSSPSCSAAHSS 161

Query: 167 DNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINS 226
               D C I+ C L  +  G C     PCP F Y YG  G ++  L  D L         
Sbjct: 162 LPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSLVAKLYSDSL--------- 221

Query: 227 PISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF- 286
            + +  +  F FGC   +  EPIG+AGFGRG LSLP QL     H G  FS+C +   F 
Sbjct: 222 SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFD 281

Query: 287 SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG 346
           S+     SPLILG         + + D H              FT +L++P +P +Y + 
Sbjct: 282 SDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVS 341

Query: 347 LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS 406
           L+ I+IG  N          LR ID  G GG+++DSGTT+T LP   Y+ ++   +S + 
Sbjct: 342 LQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVG 401

Query: 407 --YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYA 466
             + RA +VE ++G   CY +             ++P++  HF  N  SV LP+ N FY 
Sbjct: 402 RVHERADRVEPSSGMSPCYYLN---------QTVKVPALVLHFAGNRSSVTLPRRNYFYE 461

Query: 467 MA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKE 506
                        I CL+  +     GGD  +  G  G   I G++QQQ  EVVYDL   
Sbjct: 462 FMDGGDGKEEKRKIGCLMLMN-----GGDESELRG--GTGAILGNYQQQGFEVVYDLLNR 494

BLAST of HG10021500 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 3.5e-33
Identity = 121/447 (27.07%), Postives = 177/447 (39.60%), Query Frame = 0

Query: 65  KRMKAME--MDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF 124
           +RM+++   + S   +  P+      YLM++ +GTP       MDTGSDL W  C     
Sbjct: 70  RRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE---- 129

Query: 125 DCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT 184
            C  C      +       F P  SS+     C S +C D+     P + C    C    
Sbjct: 130 PCTQCFSQPTPI-------FNPQDSSSFSTLPCESQYCQDL-----PSETCNNNEC---- 189

Query: 185 LVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----V 244
                       + Y YG      G +  +              T  +P   FGC     
Sbjct: 190 -----------QYTYGYGDGSTTQGYMATETF---------TFETSSVPNIAFGCGEDNQ 249

Query: 245 GASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 304
           G       G+ G G G LSLP QLG     FS+C   +  S+     S L LG+ A    
Sbjct: 250 GFGQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PSTLALGSAASGVP 309

Query: 305 DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT 364
           +G    T L+ S + P YYYI L+ IT+G  N     G+     ++   G GGM+IDSGT
Sbjct: 310 EGS-PSTTLIHSSLNPTYYYITLQGITVGGDN----LGIPSSTFQLQDDGTGGMIIDSGT 369

Query: 365 TYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT 424
           T T+LP+  Y+ +       I+ P     E ++G   C++ P   +        Q+P I+
Sbjct: 370 TLTYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLSTCFQQPSDGSTV------QVPEIS 429

Query: 425 FHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDD-GVGGDNDDDNGRDGPAGIFG 484
             F   V  +  Q      + +P    +  CL   S    G+               IFG
Sbjct: 430 MQFDGGVLNLGEQN----ILISPAEGVI--CLAMGSSSQLGI--------------SIFG 437

Query: 485 SFQQQNLEVVYDLEKERLGFQPMDCAS 505
           + QQQ  +V+YDL+   + F P  C +
Sbjct: 490 NIQQQETQVLYDLQNLAVSFVPTQCGA 437

BLAST of HG10021500 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.7e-32
Identity = 124/438 (28.31%), Postives = 184/438 (42.01%), Query Frame = 0

Query: 77  NVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLG 136
           +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +  
Sbjct: 130 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPI-- 189

Query: 137 PKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFA 196
                F P  S T     C S  C  + S          AGC+     + TC      + 
Sbjct: 190 -----FDPRKSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----LYQ 249

Query: 197 YTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC--------VGASYREPIGI 256
            +YG     +G  + + L    N +              GC        VGA+     G+
Sbjct: 250 VSYGDGSFTVGDFSTETLTFRRNRVKG---------VALGCGHDNEGLFVGAA-----GL 309

Query: 257 AGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTP 316
            G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S      +FTP
Sbjct: 310 LGLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVSR---IARFTP 369

Query: 317 LLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP 376
           LL +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T L  P
Sbjct: 370 LLSNPKLDTFYYVGLLGISVGGTRVP---GVTASLFKLDQIGNGGVIIDSGTSVTRLIRP 429

Query: 377 LYSQLISNLE-SVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNV 436
            Y  +         +  RA    L   FD C+       + S +++ ++P++  HF    
Sbjct: 430 AYIAMRDAFRVGAKTLKRAPDFSL---FDTCF-------DLSNMNEVKVPTVVLHF-RGA 485

Query: 437 SVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLE 496
            V LP  N       P+++    C  F               G  G   I G+ QQQ   
Sbjct: 490 DVSLPATN----YLIPVDTNGKFCFAFA--------------GTMGGLSIIGNIQQQGFR 485

Query: 497 VVYDLEKERLGFQPMDCA 504
           VVYDL   R+GF P  CA
Sbjct: 550 VVYDLASSRVGFAPGGCA 485

BLAST of HG10021500 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 2.3e-32
Identity = 130/447 (29.08%), Postives = 180/447 (40.27%), Query Frame = 0

Query: 63  SRKRMKAMEMDSDDNVIEPLREIRDG-YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLS 122
           SR+  +   M +  + +E      DG YLM+L++GTP Q     MDTGSDL W       
Sbjct: 68  SRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWT------ 127

Query: 123 FDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA 182
             CQ C +  N         F P  SS+     C S  C  + S                
Sbjct: 128 -QCQPCTQCFNQ----STPIFNPQGSSSFSTLPCSSQLCQALSSP--------------- 187

Query: 183 TLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC---- 242
                TC      + Y YG      G++  + L            +  IP   FGC    
Sbjct: 188 -----TCSNNFCQYTYGYGDGSETQGSMGTETL---------TFGSVSIPNITFGCGENN 247

Query: 243 VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 302
            G       G+ G GRG LSLP QL  +   FS+C  P   S   N    L+LG+LA S 
Sbjct: 248 QGFGQGNGAGLVGMGRGPLSLPSQLDVTK--FSYCMTPIGSSTPSN----LLLGSLANSV 307

Query: 303 KDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSG 362
             G    T L++S   P +YYI L  +++G+         +F L      G GG++IDSG
Sbjct: 308 TAGSPN-TTLIQSSQIPTFYYITLNGLSVGSTRLPID-PSAFALN--SNNGTGGIIIDSG 367

Query: 363 TTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSI 422
           TT T+     Y  +     S I+ P       ++GFDLC++ P   +N       Q+P+ 
Sbjct: 368 TTLTYFVNNAYQSVRQEFISQINLPVVN--GSSSGFDLCFQTPSDPSNL------QIPTF 427

Query: 423 TFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFG 482
             HF +   + LP  N F    +P N  +  CL   S   G+               IFG
Sbjct: 428 VMHF-DGGDLELPSENYF---ISPSNGLI--CLAMGSSSQGM--------------SIFG 436

Query: 483 SFQQQNLEVVYDLEKERLGFQPMDCAS 505
           + QQQN+ VVYD     + F    C +
Sbjct: 488 NIQQQNMLVVYDTGNSVVSFASAQCGA 436

BLAST of HG10021500 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 8.6e-32
Identity = 131/455 (28.79%), Postives = 193/455 (42.42%), Query Frame = 0

Query: 60  NSMSRKRMKAMEMDSDDNVIEPLREIRDG---YLMSLTLGTPPQVIQVYMDTGSDLTWVP 119
           N++ R   +       DN  +P  ++      YLM++++GTPP  I    DTGSDL W  
Sbjct: 58  NAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQ 117

Query: 120 CGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIA 179
           C      C DC    + +  PK        SST    +C SS C  + +          A
Sbjct: 118 CA----PCDDCYTQVDPLFDPKT-------SSTYKDVSCSSSQCTALENQ---------A 177

Query: 180 GCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFG 239
            CS       TC     S++ +YG +    G +  D L + G++   P+  K I     G
Sbjct: 178 SCSTN---DNTC-----SYSLSYGDNSYTKGNIAVDTLTL-GSSDTRPMQLKNI---IIG 237

Query: 240 C----VGASYREPIGIAGFGRGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILG 299
           C     G   ++  GI G G G +SL  QLG S  G FS+C +P   ++  + +S +  G
Sbjct: 238 CGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP--LTSKKDQTSKINFG 297

Query: 300 NLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGG 359
             AI S  G +  TPL+       +YY+ L+SI++G+    +    S           G 
Sbjct: 298 TNAIVSGSGVVS-TPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS-------ESSEGN 357

Query: 360 MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDD 419
           ++IDSGTT T LP   YS+L   + S  S    K+ +  +G  LCY         S   D
Sbjct: 358 IIIDSGTTLTLLPTEFYSELEDAVAS--SIDAEKKQDPQSGLSLCY---------SATGD 417

Query: 420 SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDG 479
            ++P IT HF +   V L   N F  +     S  + C  F                R  
Sbjct: 418 LKVPVITMHF-DGADVKLDSSNAFVQV-----SEDLVCFAF----------------RGS 437

Query: 480 PA-GIFGSFQQQNLEVVYDLEKERLGFQPMDCASV 506
           P+  I+G+  Q N  V YD   + + F+P DCA +
Sbjct: 478 PSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of HG10021500 vs. ExPASy TrEMBL
Match: A0A5A7TNC9 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold129G00970 PE=3 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 3.5e-270
Identity = 462/521 (88.68%), Postives = 490/521 (94.05%), Query Frame = 0

Query: 1   MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGY 60
           MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGY
Sbjct: 1   MPSISSTSIATKFLSLFLLLVHASKQTLATNPKT-NFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCG 120
           N +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFC 240
           SLATLVKGTCPRPCPSFAYTYG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFC
Sbjct: 181 SLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFC 240

Query: 241 FGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA 300
           FGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Sbjct: 241 FGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLA 300

Query: 301 ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360
           ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI
Sbjct: 301 ISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360

Query: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQL 420
           DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN SF+DDSQL
Sbjct: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQL 420

Query: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAG 480
           PSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAG
Sbjct: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSM-DGVGDDNDSDD--NGPAG 480

Query: 481 IFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           IFGSFQQQNL+VVYDLEKERLGFQ MDC SVAA +GLHKNV
Sbjct: 481 IFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNV 517

BLAST of HG10021500 vs. ExPASy TrEMBL
Match: A0A1S3CAK9 (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 3.5e-270
Identity = 462/521 (88.68%), Postives = 490/521 (94.05%), Query Frame = 0

Query: 1   MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGY 60
           MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGY
Sbjct: 1   MPSISSTSIATKFLSLFLLLVHASKQTLATNPKT-NFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCG 120
           N +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFC 240
           SLATLVKGTCPRPCPSFAYTYG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFC
Sbjct: 181 SLATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFC 240

Query: 241 FGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA 300
           FGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Sbjct: 241 FGCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLA 300

Query: 301 ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360
           ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI
Sbjct: 301 ISSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLI 360

Query: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQL 420
           DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN SF+DDSQL
Sbjct: 361 DSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQL 420

Query: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAG 480
           PSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAG
Sbjct: 421 PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSM-DGVGDDNDSDD--NGPAG 480

Query: 481 IFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           IFGSFQQQNL+VVYDLEKERLGFQ MDC SVAA +GLHKNV
Sbjct: 481 IFGSFQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNV 517

BLAST of HG10021500 vs. ExPASy TrEMBL
Match: A0A0A0LYP0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G704590 PE=3 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 5.6e-268
Identity = 455/517 (88.01%), Postives = 486/517 (94.00%), Query Frame = 0

Query: 1   MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGY 60
           MPSIS+ S ATKFLS FLLLV+VS +TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGY
Sbjct: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKT-NFPKDSLVLGLVHSRTSLLTPKKGY 60

Query: 61  NSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCG 120
           N +S+KRMKAM+  D DDNVIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCG
Sbjct: 61  NFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCG 120

Query: 121 NLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180
           NLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC
Sbjct: 121 NLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGC 180

Query: 181 SLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCV 240
           SLA+LVKGTCPRPCPSFAYTYG+SGVV G+LTRD+L  HGN  N+  + KQIPRFCFGCV
Sbjct: 181 SLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCV 240

Query: 241 GASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300
           GA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Sbjct: 241 GATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 300

Query: 301 DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT 360
           D +LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGT
Sbjct: 301 DENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGT 360

Query: 361 TYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT 420
           TYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKVPCKNNN SF+DD+QLPSIT
Sbjct: 361 TYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSIT 420

Query: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGS 480
           FHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGS
Sbjct: 421 FHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSM-DGVGDDNDSDD--NGPAGIFGS 480

Query: 481 FQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV 516
           FQQQN+EVVYDLEKERLGFQPMDC SVAA +GLHKNV
Sbjct: 481 FQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNV 513

BLAST of HG10021500 vs. ExPASy TrEMBL
Match: A0A6J1CMP8 (probable aspartyl protease At4g16563 OS=Momordica charantia OX=3673 GN=LOC111012684 PE=3 SV=1)

HSP 1 Score: 852.8 bits (2202), Expect = 7.3e-244
Identity = 430/524 (82.06%), Postives = 463/524 (88.36%), Query Frame = 0

Query: 3   SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYN 62
           S +T+I++K L+FFLLL+  +S    A+ P  NNFP TDSLV+GL HSRTSLLTPK+GY 
Sbjct: 6   SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYF 65

Query: 63  S---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPC 122
           S    S    K ME    DNVIEPLREIRDGYL+SLTLGTPPQVIQVYMDTGSDLTWVPC
Sbjct: 66  SRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPC 125

Query: 123 GNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 182
           GNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG
Sbjct: 126 GNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAG 185

Query: 183 CSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC 242
           CSLATLVKGTCPRPCPSFAYTYG+SGVV GTLT+D++LMHG    SP ST QIPRFCFGC
Sbjct: 186 CSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGC 245

Query: 243 VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 302
           VGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Sbjct: 246 VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS 305

Query: 303 KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLID 362
           KD  LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNN RFGVS KLREIDTKGNGGMLID
Sbjct: 306 KDHSLQFTPLLKSPLYPNYYYIGLESVTIGDGIGNNNSRFGVSLKLREIDTKGNGGMLID 365

Query: 363 SGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNN-NF-SFIDD-- 422
           SGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDLCYK+PCKNN NF S +DD  
Sbjct: 366 SGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYC 425

Query: 423 -SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRD 482
            + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTV+KCLLFQSMD G GGD D +   D
Sbjct: 426 SNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGG-GGDGDGNGDDD 485

Query: 483 GPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKN 515
           GPAGIFGSFQQQN+EVVYDL+KER+GFQ MDCAS AA++GLHKN
Sbjct: 486 GPAGIFGSFQQQNMEVVYDLQKERIGFQTMDCASSAASQGLHKN 525

BLAST of HG10021500 vs. ExPASy TrEMBL
Match: A0A6J1EHM1 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111434252 PE=3 SV=1)

HSP 1 Score: 834.3 bits (2154), Expect = 2.7e-238
Identity = 415/508 (81.69%), Postives = 452/508 (88.98%), Query Frame = 0

Query: 9   ATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRK 68
           A  +    L+LV VS + +    +NPKT  F  DSLV+GL HSRTSLLTPK+GYNS+  K
Sbjct: 6   ARSYFVLVLVLVLVSGEAMGQTLANPKT-KFLKDSLVLGLVHSRTSLLTPKRGYNSLLTK 65

Query: 69  RMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ 128
           R+K MEM +DD VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ
Sbjct: 66  RIKPMEMGNDD-VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ 125

Query: 129 DCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK 188
           DC+EYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVK
Sbjct: 126 DCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVK 185

Query: 189 GTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREP 248
           GTCPRPCPSF+YTYG+SG+VIGTLT+D++ +HG   NSP S+++IP+FCFGCVGA+YREP
Sbjct: 186 GTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHG---NSPNSSRKIPKFCFGCVGATYREP 245

Query: 249 IGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFT 308
           IGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+ HL+FT
Sbjct: 246 IGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKE-HLKFT 305

Query: 309 PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 368
           P LKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPE
Sbjct: 306 PFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPE 365

Query: 369 PLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNV 428
           PLYSQLISNLES+ISYPRAK+ ELNTGFDLCYKVP KNN F F D+ +LPSITFHFLNNV
Sbjct: 366 PLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTF-FSDEFELPSITFHFLNNV 425

Query: 429 SVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLE 488
           SVVLPQGN+FYAMAAP NSTV+KCLLFQSMD    GD       DGPAGIFGSFQQQNLE
Sbjct: 426 SVVLPQGNSFYAMAAPSNSTVVKCLLFQSMD----GDG------DGPAGIFGSFQQQNLE 485

Query: 489 VVYDLEKERLGFQPMDCASVAATEGLHK 514
           VVYDLEKERLGF+ MDCASVA ++GLHK
Sbjct: 486 VVYDLEKERLGFEAMDCASVAVSQGLHK 496

BLAST of HG10021500 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 597.4 bits (1539), Expect = 1.1e-170
Identity = 302/512 (58.98%), Postives = 375/512 (73.24%), Query Frame = 0

Query: 10  TKFLSFFLLLVYVSRKTLASNPKTNNFPTDS----LVIGLAHSRTSLLTPKKGYNSMSRK 69
           T  L  FLL+  +   T  +  + +  P+ S    LV+ L  S  SL TPK    S +++
Sbjct: 5   THVLFLFLLITLLLNTTNKTQARQHKNPSSSSSSFLVLTLTKSSVSLPTPK----SQTQE 64

Query: 70  RMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQ 129
           R+K   + S D V+EPLRE+RDGYL++L +GTPPQ +QVY+DTGSDLTWVPCGNLSFDC 
Sbjct: 65  RIK-KPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCI 124

Query: 130 DCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK 189
           +C + +NN L    + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K
Sbjct: 125 ECYDLKNNDL-KSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLK 184

Query: 190 GTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREP 249
            TC RPCPSFAYTYG  G++ G LTRDIL            T+ +PRF FGCV ++YREP
Sbjct: 185 STCVRPCPSFAYTYGEGGLISGILTRDIL---------KARTRDVPRFSFGCVTSTYREP 244

Query: 250 IGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDGHLQF 309
           IGIAGFGRGLLSLP QLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +    LQF
Sbjct: 245 IGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQF 304

Query: 310 TPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLP 369
           TP+L +P+YPN YYIGLESITI  G N     V   LR+ D++GNGGML+DSGTTYTHLP
Sbjct: 305 TPMLNTPMYPNSYYIGLESITI--GTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLP 364

Query: 370 EPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQL---PSITFHF 429
           EP YSQL++ L+S I+YPRA + E  TGFDLCYKVPC NNN + +++  +   PSITFHF
Sbjct: 365 EPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHF 424

Query: 430 LNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQ 489
           LNN +++LPQGN+FYAM+AP + +V++CLLFQ+M+D          G  GPAG+FGSFQQ
Sbjct: 425 LNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMED----------GDYGPAGVFGSFQQ 484

Query: 490 QNLEVVYDLEKERLGFQPMDCASVAATEGLHK 514
           QN++VVYDLEKER+GFQ MDC   AA+ GL++
Sbjct: 485 QNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 489

BLAST of HG10021500 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 214.5 bits (545), Expect = 1.9e-55
Identity = 161/492 (32.72%), Postives = 227/492 (46.14%), Query Frame = 0

Query: 47  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYM 106
           HS + L   K   +  S +  +         +  P+    D YL+SL++G+    + +Y+
Sbjct: 42  HSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSD-YLISLSVGSSSSAVSLYL 101

Query: 107 DTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSS 166
           DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS
Sbjct: 102 DTGSDLVWFPC--RPFTCILCE---SKPLPPSPPSSL---SSSATTVSCSSPSCSAAHSS 161

Query: 167 DNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINS 226
               D C I+ C L  +  G C     PCP F Y YG  G ++  L  D L         
Sbjct: 162 LPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSLVAKLYSDSL--------- 221

Query: 227 PISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF- 286
            + +  +  F FGC   +  EPIG+AGFGRG LSLP QL     H G  FS+C +   F 
Sbjct: 222 SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFD 281

Query: 287 SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG 346
           S+     SPLILG         + + D H              FT +L++P +P +Y + 
Sbjct: 282 SDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVS 341

Query: 347 LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS 406
           L+ I+IG  N          LR ID  G GG+++DSGTT+T LP   Y+ ++   +S + 
Sbjct: 342 LQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVG 401

Query: 407 --YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYA 466
             + RA +VE ++G   CY +             ++P++  HF  N  SV LP+ N FY 
Sbjct: 402 RVHERADRVEPSSGMSPCYYLN---------QTVKVPALVLHFAGNRSSVTLPRRNYFYE 461

Query: 467 MA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKE 506
                        I CL+  +     GGD  +  G  G   I G++QQQ  EVVYDL   
Sbjct: 462 FMDGGDGKEEKRKIGCLMLMN-----GGDESELRG--GTGAILGNYQQQGFEVVYDLLNR 494

BLAST of HG10021500 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 178.3 bits (451), Expect = 1.5e-44
Identity = 141/426 (33.10%), Postives = 197/426 (46.24%), Query Frame = 0

Query: 88  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-AAFLPTH 147
           GY +SL+ GTP Q I    DTGS L W+PC +  + C  C+    + L P L   F+P +
Sbjct: 89  GYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTS-RYLCSGCD---FSGLDPTLIPRFIPKN 148

Query: 148 SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVI 207
           SS+S    C S  C  ++    P   C   GC   T     C   CP +   YG  G   
Sbjct: 149 SSSSKIIGCQSPKCQFLY---GPNVQC--RGCDPNT---RNCTVGCPPYILQYG-LGSTA 208

Query: 208 GTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSH 267
           G L  +        ++ P  T  +P F  GC   S R+P GIAGFGRG +SLP Q+    
Sbjct: 209 GVLITE-------KLDFPDLT--VPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL-- 268

Query: 268 KGFSHCFLPFKFSNNPNFSSPLILGNLA---ISSKDGHLQFTPLLKSPIYPN-----YYY 327
           K FSHC +  +F ++ N ++ L L   +     SK   L +TP  K+P   N     YYY
Sbjct: 269 KRFSHCLVSRRF-DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYY 328

Query: 328 IGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV 387
           + L  I +G  +      + +K     T G+GG ++DSG+T+T +  P++  +     S 
Sbjct: 329 LNLRRIYVGRKH----VKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQ 388

Query: 388 IS-YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYA 447
           +S Y R K +E  TG   C+ +  K        D  +P + F F     + LP  N F  
Sbjct: 389 MSNYTREKDLEKETGLGPCFNISGKG-------DVTVPELIFEFKGGAKLELPLSNYF-- 448

Query: 448 MAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGF 504
               + +T   CL        V     + +G  GPA I GSFQQQN  V YDLE +R GF
Sbjct: 449 --TFVGNTDTVCLTV------VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGF 468

BLAST of HG10021500 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 161.0 bits (406), Expect = 2.6e-39
Identity = 128/428 (29.91%), Postives = 177/428 (41.36%), Query Frame = 0

Query: 89  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSS 148
           Y + L +G PPQ + +  DTGSDL WV C      C++C  +           F P HSS
Sbjct: 84  YFVDLRIGQPPQSLLLIADTGSDLVWVKCS----ACRNCSHHS------PATVFFPRHSS 143

Query: 149 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGT 208
           T     C    C  +   D        A     T +  TC      + Y Y    +  G 
Sbjct: 144 TFSPAHCYDPVCRLVPKPDR-------APICNHTRIHSTC-----HYEYGYADGSLTSGL 203

Query: 209 LTRDILLMHGNNINSPISTKQIPRFCFGC---------VGASYREPIGIAGFGRGLLSLP 268
             R+   +      S     ++    FGC          G S+    G+ G GRG +S  
Sbjct: 204 FARETTSLK----TSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFA 263

Query: 269 FQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDG--HLQFTPLLKSPIYPN 328
            QLG  F +K FS+C + +  S  P  +S LI+GN      DG   L FTPLL +P+ P 
Sbjct: 264 SQLGRRFGNK-FSYCLMDYTLSPPP--TSYLIIGN----GGDGISKLFFTPLLTNPLSPT 323

Query: 329 YYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL 388
           +YY+ L+S+ +    N  +  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +
Sbjct: 324 FYYVKLKSVFV----NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAV 383

Query: 389 ESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNF 448
              +  P A    L  GFDLC  V           +  LP + F F      V P  N F
Sbjct: 384 RRRVKLPIAD--ALTPGFDLCVNVSGVTK-----PEKILPRLKFEFSGGAVFVPPPRNYF 443

Query: 449 YAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERL 504
                      I+CL  QS+D  VG              + G+  QQ     +D ++ RL
Sbjct: 444 IE-----TEEQIQCLAIQSVDPKVG------------FSVIGNLMQQGFLFEFDRDRSRL 450

BLAST of HG10021500 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 156.0 bits (393), Expect = 8.2e-38
Identity = 153/533 (28.71%), Postives = 220/533 (41.28%), Query Frame = 0

Query: 5   STSIATKFLSFFLLL------VYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPK-- 64
           S+S ++    FFL+L      V  SR++L       N P     + L H  +     K  
Sbjct: 3   SSSSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQ 62

Query: 65  -------KGYNSMSRKRMKAM-----EMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQV 124
                  +G++ ++R    A+     + D  +N+  P       +LM L++G P      
Sbjct: 63  KIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSA 122

Query: 125 YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIH 184
            +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  + 
Sbjct: 123 IVDTGSDLIWTQCK----PCTECFDQPTPI-------FDPEKSSSYSKVGCSSGLCNALP 182

Query: 185 SSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSP 244
            S+   D             K  C      + YTYG      G L  +       N  S 
Sbjct: 183 RSNCNED-------------KDAC-----EYLYTYGDYSSTRGLLATETFTFEDENSISG 242

Query: 245 ISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSN 304
           I         FGC     G  + +  G+ G GRG LSL  QL      FS+C    + S 
Sbjct: 243 IG--------FGCGVENEGDGFSQGSGLVGLGRGPLSLISQL--KETKFSYCLTSIEDS- 302

Query: 305 NPNFSSPLILGNLAI-------SSKDGHLQFT-PLLKSPIYPNYYYIGLESITIGNGNNN 364
               SS L +G+LA        +S DG +  T  LL++P  P++YY+ L+ IT+G     
Sbjct: 303 --EASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK--- 362

Query: 365 FRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTG 424
            R  V     E+   G GGM+IDSGTT T+L E  +  L     S +S P       +TG
Sbjct: 363 -RLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLP--VDDSGSTG 422

Query: 425 FDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLF 484
            DLC+K+P    N +      +P + FHF     + LP G N+  M A  +ST + CL  
Sbjct: 423 LDLCFKLPDAAKNIA------VPKMIFHF-KGADLELP-GENY--MVAD-SSTGVLCLAM 461

Query: 485 QSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV 506
                          G      IFG+ QQQN  V++DLEKE + F P +C  +
Sbjct: 483 ---------------GSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038893627.12.5e-27893.02probable aspartyl protease At4g16563 [Benincasa hispida][more]
XP_008459091.17.2e-27088.68PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043307.1 aspart... [more]
XP_004145478.21.2e-26788.01probable aspartyl protease At4g16563 [Cucumis sativus] >KGN66888.1 hypothetical ... [more]
XP_022142611.11.5e-24382.06probable aspartyl protease At4g16563 [Momordica charantia][more]
XP_023520027.17.0e-24181.68probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q940R42.7e-5432.72Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q766C23.5e-3327.07Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LNJ31.7e-3228.31Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q766C32.3e-3229.08Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q6XBF88.6e-3228.79Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7TNC93.5e-27088.68Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A1S3CAK93.5e-27088.68aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 S... [more]
A0A0A0LYP05.6e-26888.01Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G70459... [more]
A0A6J1CMP87.3e-24482.06probable aspartyl protease At4g16563 OS=Momordica charantia OX=3673 GN=LOC111012... [more]
A0A6J1EHM12.7e-23881.69probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114342... [more]
Match NameE-valueIdentityDescription
AT5G45120.11.1e-17058.98Eukaryotic aspartyl protease family protein [more]
AT4G16563.11.9e-5532.72Eukaryotic aspartyl protease family protein [more]
AT3G52500.11.5e-4433.10Eukaryotic aspartyl protease family protein [more]
AT3G25700.12.6e-3929.91Eukaryotic aspartyl protease family protein [more]
AT2G03200.18.2e-3828.71Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 317..498
e-value: 1.3E-25
score: 90.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 292..509
e-value: 2.0E-45
score: 156.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 83..291
e-value: 3.9E-30
score: 107.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 88..504
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 89..292
e-value: 4.2E-27
score: 95.5
NoneNo IPR availablePANTHERPTHR47967:SF47CHLOROPLAST NUCLEOID DNA-BINDING PROTEIN-LIKEcoord: 51..507
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 51..507
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 352..363
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 89..498
score: 33.669682
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 88..502
e-value: 1.40324E-70
score: 224.449

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021500.1HG10021500.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity