Lsi01G019440 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi01G019440
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: Eukaryotic aspartyl protease family protein
Location: chr01: 25034345 .. 25041461 (-)
RNA-Seq Expression: Lsi01G019440
Synteny: Lsi01G019440
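
The Location field above can be turned into structured coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'chr01: 25034345 .. 25041461 (-)'.

    Returns (seqid, start, end, strand). The layout of the string is taken
    from this record; other records may be formatted differently.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr01: 25034345 .. 25041461 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 7117 bp on the minus strand of chr01.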
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGGTAGTTTCATACGTAATGCATGATATTAATTAATTGAACAAAGTTGTTGAAATTTATGAAAGAAAAAGGGATTTTTACCTAAATAATTATTTTTTCGCCATATATTTCAAAATTATCTTTAAATTTAAATTTTTTTAAGAAAGGTTTTAGTGTTACTTTTAGATAAATTTACCTTTTTCTTTTGTTATTTTTATTTTTGATTTTTATATATTTTTCCTATTTACGATGCTCCTTTAACGAAAGAGAGAGAAAATGCATTTAATATAAATTTTTTTTATTTTCATTTGGAGAGAGAAAATGATATTTAATATGATTTTATTTTTCATTTCTTTTTTCATGTTAATTTTTTTATATTATATAATATATTCCGTTGTTTATTTGGAAATGTTCTAACAAATATATGAAATATACTGGTAAATATATTTGATATGTACTCAACGATTCAGGATTATTTTTAAATATACCTTAGAACATTGTATTAGATGTATGAAATATAAATTTACTCAAATAGATTGTTGTACCATTAAAAAGTTTTGATATAACCATATACATATGACAGTAGTTATCATAAATTTCATTTATATCACAATATATATCATAAATTCAAAGCAGTCGAAAATAAACAAAAATATCTACCACAGTATATAAATCACATATGAACTTAATGTTCAATACAAAACATATATATAAGCGTATATATATCATAAAACAAGAAATCTAAATACAAAAACCATCATCGTTTATATAAAATAAACGATTATATATATCACAATACATTTATTCCTTGGTTTGTATGTCCAAGAATTCTCCCCACCTTCATCTCTTATTCTCTCTATCTTCATCTTCATCTCGTTCAAAACCTGCGCCATCGACCAGATCTACTCGGCCTGCGCCGTCAACCAGATATTCTTGCCCCACGCTTGATTTGCTTGCACCCTCACCAGATCATCTGCTCGCCCTTGCCATCGATTCACCCCATGTTGTCCGTTTCCCATCGCCAGCCATTGCAAGTCACGCCCTTGAGAAGGAGAAAAAGAGAGGGAGGAAGAAGAAAAGGAAGAGGGAAGGAGAAAGAATGCCACACCCGATTTGTTTACGCCCATACCAAATTTACTCGCCCCGTTGCTGATTCGCCCACGTCGTCCATCTTCCGTCGCTATCTGCTGCAACTCACGCCATCGGGAAGGAGAAAAAGAGAGGGAGGAGGAAGAATGAAATTCCAACGTTTTTTTTTAAAAAAAAGAATACAATGATGCGAAGCCCTAGGGCTGTGCATGGAGGAGAGAGAATCAATAATTTTAAATTTTTTGTTTTATAAAAAATATGGGTGAGATCTTAATGGTAAATACACACCAATATTAACGAAAATTTGTATTCTATTAGCCTATTTATGAAATAAATTATGTGTTGAAGACAATAAAGTAATATAAGCCTATAATTATTAAGGCTATATATATATATGTAGGAATCCCAAAAAAAATAACAGAAAAAGATTATTGTAAATACTCCAGTTCTTGTTGGTAAACGATGATAACAAATAATAATGAAATTAACAAATTAAACTACAAATTTCATCATTCTAACTTTGAGTATTATATTATAGTTTTTAGAATCAAAGAAATTAAATGAATTCTCTTAAATTTTAATTACTATACTTATCCCTTTGAAAAACTTATACATATGTGTGTATATAAACACTAGTCTCAATCAATCTTTTATTTGTGTGTTGTTCAATTTGAAATACGGCAAAAGGAAGGAAAATCCCCACCGTAAGTCTTTCTTCGTTTGAACCTCGCCCTTAACTTTAAGGCATCTTTCCTATCTTTCTCTTTCTCGAATCAATTATATATATATATATATATATATATTACCTTAAATAATGTTTGGATTAAATAATCTTGGATGGAATGGAGCACACAAACTAATTAAAAATAATTGCATACATACTTTAACCATAAACTCGATATTCCATCGTTAGACACACCTATTATGT
AATGCCTCATATCTAGGATTCAAACCAAACTTTGAAATTTGGATTAGGTATATTATGGCTTCAGCATTCCCCAACATCCCCTGGCGTTGTTATCCTTGCTCATCTTGAATGTTTTTTAGAGTGAAAGTCGTCCCCACAAATCAACACAAGTCCTTTCAGCATATTTTGTCCTCACTCACATGCTTCCTAAAAAAATTTCCCGAATGTCACGCAACATAGAATTGCTCCAAACTAAGCACCACCTTTGAAGTTCCTATGATTGAGCAACCGAAAAGAAAGTTGCATCTTGTTGGTATAGGTAGTAACTTTTAATTCTTTTAATCCTTTCTTAATAATACTCTCATATCTTCAGGATTCCTCTCATTCATATGTGATCTTACTTCTTTCATGTCCACCTCTTAAACTTGAGACGTTACATACTCATCCCAAATCTAATTCATTAATTAGAATTACGTTTGTCCATTCATTTCTAAAATCAATCTAATTGGTTGTCAAATTTCATTACCATATCCAAATTTTAACTAATAAAGTAACACTTCAAAATTCCATCCATCAAAATTCATAATATAGAGTTTATTTTATTTTTTTATTTTTATTCTTTGCGAAGTTCAACAACTGTGTGAGTATGGTATTAAAGGGTCAACTTTAGGGAAAGATATATATTAAGAACTTTATATATCCATATTAATCTACAATAAATCAAGTTGATTAAAAATATTGATGTATTAAAGATAAAGCATATATTTGTAACTTTAGCTTATAAAAACACACCAAAGATATAAAACCAACAATCATGGAGTTAAAAGTGGATTAAAAAACAAAAATGCACAAGAAGTGTGTTGAATTATCTCTCCTAAACAAGAGGATAATTTTCAGGTAGAAATCAAAGTGTGTGAGAGACTTCTAATTTTTACCATTTTATCACACCAAAGTTTGTTGTAATGTTCAAATCAAAGAAATAATAATCCCTCTTCTCTTCTTCTCTTCCATTTCTTGTATTCCTTTTCTCACCAACCCACCAATTAAAAGACTTTTCCAAGTTAGCTGTGATCTTACTCTACATCCCCTCATTTCCAAAGTTTGTTTGCCTATCCTTTCTTAAATATTCTCTCCTACTCTCTTTCCTTGTTCCTATTTCAACTCCTCAATCCCTTCATTGTTCTCTTGTTATGAAAAAGGGGTCAAGAACTTAATTCCTTTATTTTACCCTTCTTTTGGTATATCGTTTTCTTTGCATGCTAGTATGACGGTTTTGTTACATTCATTTCGATATCTAATTCTCTTGATTGTAGTATTGAGTACAACATTGTTTATTGATACATCTGCTTTGAGTTCGACTCTCTCAAGGCGAGCTCTACAGAAACCGAATAAGTTGCCTAGTAATGGCTTTAGGGTGAAGCTTAACCATGTGGATCATGTGAAGAATTTGACGAGATTCGAGCGGTTGCGGCGAGGAGTGGCACGTGGGAAGAATAGATTGCAAAGACTAAATGCCATGGTGTTGGCCGCCAATGCTGCGGTTGGTGACCAAGTGAAGGCGCCTGTGGTTGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGAACTCCGCCGAAAAGCTTCTCGGCAATTATGGACACTGGTAGTGATCTGATCTGGACACAGTGCAAGCCTTGTCAACAGTGTTTTGATCAAGCAACACCTATTTTTGATCCGAAAAAATCTTCTTCTTTCTCTAAGATTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACATCGACGTGCAGCAACAACGGGTGTGAGTATTTGTACACGTATGGAGATTCTTCCTCCACCGAAGGTGTTTTGGCTTTTGAGACCTTCACGTTTGGAGATTCAAGTGAAGATCAGGTATTTTCCACAACAAAACTCCCCACAAAAAATGAAAGAAAATTATAAAGTTTGATAAACCAGTAGACCTCGCTTGATGATGATTTGATTTTTAAAACTTTAAGCTACTTCTACCTATCTTTAT
TTTTGTTAATTATTCACTTTCTGTTAATATTTTAAAAACTAAATTAAAATTTGAAACTATAAAAAAATAGTTTTCAAAAATATATTTTTATTTTTTGAATTTAACTAAAAATTCAAGTATTTTTCAAGAATCGTGAGAATCACAGTATGAAATTTAGTAGAAAATTAAACAAAACATACCTTTCAAAAACAGAAAATTAGAAACAAAATCGTTACCAAACTAATTTTAATATACTGTTTTTTAAAATTTTAAATATTATTTTTTTCAGGGTTTTTTCAAATTCTAGTCCAAAACTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTGAGGAAAGTCCAAAACTTATTAAATGTGCATGAATTTTTTTTTCTCTCTTTTTTGGGGCCTTTGATACTATTTTGTTGTTTAAAAAAAAAGATAATATTGCTTGTGGATCATTGGGTTGATTGAGGTGGTTGTTGGGTGCAGGTGAAAGTTAGATCATAAATAATAGCAACAATTAAAATAGTTTATTTCAAAAATCTTAAGAGACTAAACAGAATATTAATTTTGAAAATTTATTCAACAAAATGTTTTTTTTTTAAGTTCAATTCATCAAATAAATATTTTGAAAATTTAGGGGAGGATTTAATATATATATATATATATATATATATATAATATGCTTTATCGAGTTTGTTTTAATAACGTGTTAAATCACTACTAAACCAAAAATCTTAAGCTGATAATTAAGCCTCCATTTGTAAACAATTTTGTTTATTATCTTAAAATTTTGAAATATTTTATGGATTTTATTTTTTTGATAAAAAAATAAACATTTAAATTCTTAGTCAAATTCGAAAAACAAAAACAACTTTTCTTAATTAGTTCTCAAAACTTGACTTAGTTTTTATAAACAATGGTAAAAAGTAGATAACAGAATAAAGAAAGTTAAAGCGAAAGTAGTGCTTATTATAAGCTTAATTTTCAAAAATTAAAAACAAAAAACCAAATGGTTATCAAATGGGACCTAAATTATAGTATATTTAATTTTTTTTATACATATTCTTAATCTAGGTCTCGATCTCCGAACTCGGGTTCGGATGCGGAGACGATAACGAAGGAGACGGGTTCAGCCAAGGCGAGGGGTTGGTGGGGCTCGGGCGAGGACCGTTATCGTTGGTTTCTCAACTAAAAGAACAAAAGTTTGCTTATTGTTTAACAGCCATTGATGACTCAAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAAC
ATAAAACCTAAAACAACAAAAGATGAAATGAAAACAACCCCATTGATCAGAAATCCTTCTCAGCCATCTTTTTACTATCTTTCTCTCCAAGGAATATCAGTTGGTGACACTCAATTACCAATACCAAAGTCCACTTTTGAGCTCCATAATGATGGGAGTGGAGGAGTAATCATAGACTCAGGCACAACAATCACTTACATTGAGAACACAGCTTTTACTTTACTCAAAAATGAGTTCATTGCTCAAATGAATCTTCCCGTCGACGACTCCGGTACTGGTGGCCTTGACCTCTGCTTTAACCTACCAGCCGAGGCAACTCAGGTATGTTTACCTAAAAAGTTTAAGTCGATAGGTTACAGTACTATAATATTAGTGTATGTTTGAAAATGGTTTTGAAAATTTTAATTTAATTTAATTTAATTTAATCTTTCCAAAACATACTTTTAATCCTTCAAAATTTAACTTTTTATTATTAAACCTGATTTTCGATGATTAAAAGCATTTTTCAAAGTGATTTAAAAAATTTCATAATCATTCCCAAACATGTCATTAATCCTTTATGTTTATGTTTTTTTTAACAAAGGTGGAGGTTCCGAAGTTGACGTTTCATTTCAAGGGTGCGGATTTGGAGCTTCCCGGGGAGAACTACATGATCGGCGACTCGAGGGCGGGATTGATATGCTTGGCCATTGGGAGTTCTAGAGGAATGTCCATCTTTGGAAATCTTCAGCAACAAAACTTTATGGTTGTTCATGATCTTCAGGAAGAAACCCTGTCGTTTTTGCCCACTCAATGTGATAGTATATAAAAGAACTTGAAAGGAATTGTTTCATTAAAATGAAGGAAGAATATCAAAGTTTGGAGTGAAATGATACAAAGTTGTTGTTGATTAACCTAAAGAATTAATATTGTTTCCTGGATTTTTTTTTTTTTGAACAAATTATTTCGTGGATCTAAGGCTAGGTAGATCGTTCTTCTTTATGTTATTAATTCAAAAGAGGCAATTTGTATGATTATTAGTGCTGTTTGAATTGGAGTTGTTTTTAAGCTTTGAATTTTATTGGGGAAGACATTGATTCATTTGATAATGAATTTGTTTACGGTCTTCTGTATTTTT

mRNA sequence

ATGGAAGTATTGAGTACAACATTGTTTATTGATACATCTGCTTTGAGTTCGACTCTCTCAAGGCGAGCTCTACAGAAACCGAATAAGTTGCCTAGTAATGGCTTTAGGGTGAAGCTTAACCATGTGGATCATGTGAAGAATTTGACGAGATTCGAGCGGTTGCGGCGAGGAGTGGCACGTGGGAAGAATAGATTGCAAAGACTAAATGCCATGGTGTTGGCCGCCAATGCTGCGGTTGGTGACCAAGTGAAGGCGCCTGTGGTTGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGAACTCCGCCGAAAAGCTTCTCGGCAATTATGGACACTGGTAGTGATCTGATCTGGACACAGTGCAAGCCTTGTCAACAGTGTTTTGATCAAGCAACACCTATTTTTGATCCGAAAAAATCTTCTTCTTTCTCTAAGATTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACATCGACGTGCAGCAACAACGGGTGTGAGTATTTGTACACGTATGGAGATTCTTCCTCCACCGAAGGTGTTTTGGCTTTTGAGACCTTCACGTTTGGAGATTCAAGTGAAGATCAGAATATTAATTTTGAAAATTTATTCAACAAAATCCAAGGCGAGGGGTTGGTGGGGCTCGGGCGAGGACCGTTATCGTTGGTTTCTCAACTAAAAGAACAAAAGTTTGCTTATTGTTTAACAGCCATTGATGACTCAAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAACATAAAACCTAAAACAACAAAAGATGAAATGAAAACAACCCCATTGATCAGAAATCCTTCTCAGCCATCTTTTTACTATCTTTCTCTCCAAGGAATATCAGTTGGTGACACTCAATTACCAATACCAAAGTCCACTTTTGAGCTCCATAATGATGGGAGTGGAGGAGTAATCATAGACTCAGGCACAACAATCACTTACATTGAGAACACAGCTTTTACTTTACTCAAAAATGAGTTCATTGCTCAAATGAATCTTCCCGTCGACGACTCCGGTACTGGTGGCCTTGACCTCTGCTTTAACCTACCAGCCGAGGCAACTCAGGTGGAGGTTCCGAAGTTGACGTTTCATTTCAAGGGTGCGGATTTGGAGCTTCCCGGGGAGAACTACATGATCGGCGACTCGAGGGCGGGATTGATATGCTTGGCCATTGGGAGTTCTAGAGGAATGTCCATCTTTGGAAATCTTCAGCAACAAAACTTTATGGTTGTTCATGATCTTCAGGAAGAAACCCTGTCGTTTTTGCCCACTCAATGTGATAGTATATAAAAGAACTTGAAAGGAATTGTTTCATTAAAATGAAGGAAGAATATCAAAGTTTGGAGTGAAATGATACAAAGTTGTTGTTGATTAACCTAAAGAATTAATATTGTTTCCTGGATTTTTTTTTTTTTGAACAAATTATTTCGTGGATCTAAGGCTAGGTAGATCGTTCTTCTTTATGTTATTAATTCAAAAGAGGCAATTTGTATGATTATTAGTGCTGTTTGAATTGGAGTTGTTTTTAAGCTTTGAATTTTATTGGGGAAGACATTGATTCATTTGATAATGAATTTGTTTACGGTCTTCTGTATTTTT

Coding sequence (CDS)

ATGGAAGTATTGAGTACAACATTGTTTATTGATACATCTGCTTTGAGTTCGACTCTCTCAAGGCGAGCTCTACAGAAACCGAATAAGTTGCCTAGTAATGGCTTTAGGGTGAAGCTTAACCATGTGGATCATGTGAAGAATTTGACGAGATTCGAGCGGTTGCGGCGAGGAGTGGCACGTGGGAAGAATAGATTGCAAAGACTAAATGCCATGGTGTTGGCCGCCAATGCTGCGGTTGGTGACCAAGTGAAGGCGCCTGTGGTTGCGGGTAATGGTGAGTTTCTTATGAAGTTGGCTATCGGAACTCCGCCGAAAAGCTTCTCGGCAATTATGGACACTGGTAGTGATCTGATCTGGACACAGTGCAAGCCTTGTCAACAGTGTTTTGATCAAGCAACACCTATTTTTGATCCGAAAAAATCTTCTTCTTTCTCTAAGATTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACATCGACGTGCAGCAACAACGGGTGTGAGTATTTGTACACGTATGGAGATTCTTCCTCCACCGAAGGTGTTTTGGCTTTTGAGACCTTCACGTTTGGAGATTCAAGTGAAGATCAGAATATTAATTTTGAAAATTTATTCAACAAAATCCAAGGCGAGGGGTTGGTGGGGCTCGGGCGAGGACCGTTATCGTTGGTTTCTCAACTAAAAGAACAAAAGTTTGCTTATTGTTTAACAGCCATTGATGACTCAAAACCAAGCTCACTTTTGTTGGGATCTCTAGCAAACATAAAACCTAAAACAACAAAAGATGAAATGAAAACAACCCCATTGATCAGAAATCCTTCTCAGCCATCTTTTTACTATCTTTCTCTCCAAGGAATATCAGTTGGTGACACTCAATTACCAATACCAAAGTCCACTTTTGAGCTCCATAATGATGGGAGTGGAGGAGTAATCATAGACTCAGGCACAACAATCACTTACATTGAGAACACAGCTTTTACTTTACTCAAAAATGAGTTCATTGCTCAAATGAATCTTCCCGTCGACGACTCCGGTACTGGTGGCCTTGACCTCTGCTTTAACCTACCAGCCGAGGCAACTCAGGTGGAGGTTCCGAAGTTGACGTTTCATTTCAAGGGTGCGGATTTGGAGCTTCCCGGGGAGAACTACATGATCGGCGACTCGAGGGCGGGATTGATATGCTTGGCCATTGGGAGTTCTAGAGGAATGTCCATCTTTGGAAATCTTCAGCAACAAAACTTTATGGTTGTTCATGATCTTCAGGAAGAAACCCTGTCGTTTTTGCCCACTCAATGTGATAGTATATAA

Protein sequence

MEVLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNRLQRLNAMVLAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDSSEDQNINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI
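
The protein sequence above corresponds to a translation of the coding sequence. The relationship can be sketched as follows, assuming the standard nuclear genetic code (the record does not name the translation table). The `translate` function is a hypothetical helper written for illustration; it stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last four codons of the CDS listed above.
print(translate("ATGGAAGTATTGAGTACA"))
print(translate("GATAGTATATAA"))
```

The two short inputs are the start and end of the CDS in this record; their translations match the first six residues (MEVLST) and the last three residues (DSI) of the protein sequence.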
Homology
BLAST of Lsi01G019440 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 4.3e-117
Identity = 230/426 (53.99%), Positives = 292/426 (68.54%), Query Frame = 0

Query: 14  ALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNRLQRLNAMVL 73
           A + + SR AL   ++    GF++ L HVD  KNLT+F+ L R + RG  RLQRL AM+ 
Sbjct: 20  APTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAML- 79

Query: 74  AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQAT 133
             N   G  V+  V AG+GE+LM L+IGTP + FSAIMDTGSDLIWTQC+PC QCF+Q+T
Sbjct: 80  --NGPSG--VETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQST 139

Query: 134 PIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDS 193
           PIF+P+ SSSFS + CSS+LC AL + TCSNN C+Y Y YGD S T+G +  ET TFG S
Sbjct: 140 PIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-S 199

Query: 194 SEDQNINF---ENL--FNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLL 253
               NI F   EN   F +  G GLVG+GRGPLSL SQL   KF+YC+T I  S PS+LL
Sbjct: 200 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLL 259

Query: 254 LGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELH-NDGS 313
           LGSLAN     T     TT LI++   P+FYY++L G+SVG T+LPI  S F L+ N+G+
Sbjct: 260 LGSLAN---SVTAGSPNTT-LIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGT 319

Query: 314 GGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKL 373
           GG+IIDSGTT+TY  N A+  ++ EFI+Q+NLPV +  + G DLCF  P++ + +++P  
Sbjct: 320 GGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTF 379

Query: 374 TFHFKGADLELPGENYMIGDSRAGLICLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLS 433
             HF G DLELP ENY I  S  GLICLA+G SS+GMSIFGN+QQQN +VV+D     +S
Sbjct: 380 VMHFDGGDLELPSENYFISPSN-GLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVS 434
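
The percentages in the HSP header are the matched-position counts divided by the alignment length. A minimal sketch of that arithmetic, using the first hit's header line as input; `parse_hsp_stats` is a hypothetical helper and the pattern only covers the header layout used in this record.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP header line.

    Returns a dict with the raw counts, the alignment length, and the
    recomputed percentages rounded to two decimals.
    """
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w* = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"unrecognized HSP header: {line!r}")
    identical, length, positives, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 230/426 (53.99%), Positives = 292/426 (68.54%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the nepenthesin-1 hit this reproduces the reported 53.99% identity and 68.54% positives over 426 aligned columns.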

BLAST of Lsi01G019440 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 8.1e-108
Identity = 205/424 (48.35%), Positives = 282/424 (66.51%), Query Frame = 0

Query: 16  SSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNRLQRLNAMVLAA 75
           SST     L    K P  G RV L  VD  KNLT++E ++R + RG+ R++ +NAM+ ++
Sbjct: 23  SSTSRGTLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSS 82

Query: 76  NAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPI 135
           +      ++ PV AG+GE+LM +AIGTP  SFSAIMDTGSDLIWTQC+PC QCF Q TPI
Sbjct: 83  SG-----IETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPI 142

Query: 136 FDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDSSE 195
           F+P+ SSSFS + C S+ C  LP+ TC+NN C+Y Y YGD S+T+G +A ETFTF ++S 
Sbjct: 143 FNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF-ETSS 202

Query: 196 DQNINF-----ENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLG 255
             NI F        F +  G GL+G+G GPLSL SQL   +F+YC+T+   S PS+L LG
Sbjct: 203 VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALG 262

Query: 256 SLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGV 315
           S A+  P+ +     +T LI +   P++YY++LQGI+VG   L IP STF+L +DG+GG+
Sbjct: 263 SAASGVPEGS----PSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGM 322

Query: 316 IIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTFH 375
           IIDSGTT+TY+   A+  +   F  Q+NLP  D  + GL  CF  P++ + V+VP+++  
Sbjct: 323 IIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQ 382

Query: 376 FKGADLELPGENYMIGDSRAGLICLAIGSSR--GMSIFGNLQQQNFMVVHDLQEETLSFL 433
           F G  L L  +N +I  +  G+ICLA+GSS   G+SIFGN+QQQ   V++DLQ   +SF+
Sbjct: 383 FDGGVLNLGEQNILISPAE-GVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFV 435

BLAST of Lsi01G019440 vs. ExPASy Swiss-Prot
Match: Q7XV21 (Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2)

HSP 1 Score: 255.4 bits (651), Expect = 1.2e-66
Identity = 155/446 (34.75%), Positives = 235/446 (52.69%), Query Frame = 0

Query: 31  PSNGFRVKLNHVD----HVKNLTRFERLRRGVARGKNRLQRLN-AMVLAANAAVGDQVKA 90
           P   FR++L  VD       NLT  E LRR + R + RL  +  A   AA+A      + 
Sbjct: 21  PPRSFRLELASVDASAADAANLTEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAET 80

Query: 91  PVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPKKSSSFS 150
           P++   GE+L+KL IGTPP  F+A +DT SDLIWTQC+PC  C+ Q  P+F+P+ SS+++
Sbjct: 81  PIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYA 140

Query: 151 KISCSSELCGALPTSTCSNN---GCEYLYTYGDSSSTEGVLAFETFTFGDSSEDQNINFE 210
            + CSS+ C  L    C ++    C+Y YTY  +++TEG LA +    G+ +  + + F 
Sbjct: 141 ALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDA-FRGVAFG 200

Query: 211 NLFNKI------QGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANIK 270
              +        Q  G+VGLGRGPLSLVSQL  ++FAYCL       P  L+LG+ A+  
Sbjct: 201 CSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASRIPGKLVLGADADAA 260

Query: 271 PKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKST----------------- 330
              T       P+ R+P  PS+YYL+L G+ +GD  + +P +T                 
Sbjct: 261 RNAT--NRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTP 320

Query: 331 ------FELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCF 390
                   + +    G+IID  +TIT++E + +  L N+   ++ LP     + GLDLCF
Sbjct: 321 SPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCF 380

Query: 391 NLP--AEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSR--GMSIFGNL 436
            LP      +V VP +   F G  L L        D  +G++CL +G +    +SI GN 
Sbjct: 381 ILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNF 440

BLAST of Lsi01G019440 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 254.2 bits (648), Expect = 2.6e-66
Identity = 162/439 (36.90%), Positives = 235/439 (53.53%), Query Frame = 0

Query: 19  LSRRALQKPNKLPSNGFRVKLNHVDHVKN------LTRFERLRRGVARGKNRLQRLNAMV 78
           LS   L   N  P  GF   L H D  K+       T  +RLR  + R  NR+       
Sbjct: 15  LSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHF---- 74

Query: 79  LAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQA 138
                    Q +  + + +GE+LM ++IGTPP    AI DTGSDL+WTQC PC  C+ Q 
Sbjct: 75  --TEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQV 134

Query: 139 TPIFDPKKSSSFSKISCSSELCGALPT-STCS--NNGCEYLYTYGDSSSTEGVLAFETFT 198
            P+FDPK SS++  +SCSS  C AL   ++CS  +N C Y  +YGD+S T+G +A +T T
Sbjct: 135 DPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLT 194

Query: 199 FGDSSEDQNINFENL-----------FNKIQGEGLVGLGRGPLSLVSQLKEQ---KFAYC 258
            G SS+ + +  +N+           FNK +G G+VGLG GP+SL+ QL +    KF+YC
Sbjct: 195 LG-SSDTRPMQLKNIIIGCGHNNAGTFNK-KGSGIVGLGGGPVSLIKQLGDSIDGKFSYC 254

Query: 259 LTAIDDSK--PSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQL 318
           L  +   K   S +  G+ A +    +   + +TPLI   SQ +FYYL+L+ ISVG  Q+
Sbjct: 255 LVPLTSKKDQTSKINFGTNAIV----SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI 314

Query: 319 PIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCF 378
               S  E      G +IIDSGTT+T +    ++ L++   + ++         GL LC+
Sbjct: 315 QYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 374

Query: 379 NLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQQN 433
           +   +   ++VP +T HF GAD++L   N  +  S   L+C A   S   SI+GN+ Q N
Sbjct: 375 SATGD---LKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRGSPSFSIYGNVAQMN 434

BLAST of Lsi01G019440 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 6.5e-65
Identity = 135/365 (36.99%), Positives = 205/365 (56.16%), Query Frame = 0

Query: 81  DQVKAPVVA----GNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIF 140
           + +  PVV+    G+GE+  ++ +GTP K    ++DTGSD+ W QC+PC  C+ Q+ P+F
Sbjct: 145 EDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVF 204

Query: 141 DPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDSSED 200
           +P  SS++  ++CS+  C  L TS C +N C Y  +YGD S T G LA +T TFG+S + 
Sbjct: 205 NPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKI 264

Query: 201 QNI------NFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLG 260
            N+      + E LF      GL+GLG G LS+ +Q+K   F+YCL   D  K SSL   
Sbjct: 265 NNVALGCGHDNEGLFT--GAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFN 324

Query: 261 SLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGV 320
           S+             T PL+RN    +FYY+ L G SVG  ++ +P + F++   GSGGV
Sbjct: 325 SV------QLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 384

Query: 321 IIDSGTTITYIENTAFTLLKNEFI-AQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTF 380
           I+D GT +T ++  A+  L++ F+   +NL    S     D C++  + +T V+VP + F
Sbjct: 385 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAF 444

Query: 381 HFKGA-DLELPGENYMIGDSRAGLICLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSF 433
           HF G   L+LP +NY+I    +G  C A   +S  +SI GN+QQQ   + +DL +  +  
Sbjct: 445 HFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGL 500

BLAST of Lsi01G019440 vs. ExPASy TrEMBL
Match: A0A0A0KYT9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G554680 PE=3 SV=1)

HSP 1 Score: 763.5 bits (1970), Expect = 4.9e-217
Identity = 390/445 (87.64%), Positives = 412/445 (92.58%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           +L TTLFI+T A SS+LSRRALQKPNKLPS+GFRV+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  ILITTLFINTLAFSSSLSRRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQ+TPIFDPK+SSSF KISCSSELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMKTTPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITY+EN+AFT LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA   QVEVPKLTFHFKGADLELPGENYMIGDS+AGL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. ExPASy TrEMBL
Match: A0A5A7TD10 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold128G001020 PE=3 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.2e-215
Identity = 389/445 (87.42%), Positives = 410/445 (92.13%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           V  TTLFI+T A SS+LS RALQKPNKLPS+GFRV+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  VFITTLFINTLAFSSSLSTRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA+VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQATPIFDPK+SSSFSKISC SELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMK TPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITYIE+TAF+ LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA  TQVEVPKLTFHFKGADLELPGENYMIGDS+ GL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. ExPASy TrEMBL
Match: A0A5D3BTY9 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1738G00580 PE=3 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.2e-215
Identity = 389/445 (87.42%), Positives = 410/445 (92.13%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           V  TTLFI+T A SS+LSRRALQKPNKLPS+GF V+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  VFITTLFINTLAFSSSLSRRALQKPNKLPSHGFMVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA+VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQATPIFDPK+SSSFSKISC SELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMK TPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITYIE+TAF+ LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA  TQVEVPKLTFHFKGADLELPGENYMIGDS+ GL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. ExPASy TrEMBL
Match: A0A1S3B573 (aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103486136 PE=3 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.2e-215
Identity = 389/445 (87.42%), Positives = 410/445 (92.13%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           V  TTLFI+T A SS+LS RALQKPNKLPS+GFRV+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  VFITTLFINTLAFSSSLSTRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA+VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQATPIFDPK+SSSFSKISC SELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMK TPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITYIE+TAF+ LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA  TQVEVPKLTFHFKGADLELPGENYMIGDS+ GL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. ExPASy TrEMBL
Match: A0A6J1EPU6 (aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 PE=3 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 2.2e-201
Identity = 364/441 (82.54%), Positives = 392/441 (88.89%), Query Frame = 0

Query: 6   TTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNRL 65
           T LFI TSA SS+LSRRAL +P KLPS+GFRV LNHVDHVKNLTRFERL+RGVARGK RL
Sbjct: 17  TMLFIHTSASSSSLSRRALWQP-KLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRL 76

Query: 66  QRLNAMVLAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPC 125
            RLNAMVLAAN  VG +V+APVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQCKPC
Sbjct: 77  HRLNAMVLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC 136

Query: 126 QQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAF 185
           QQCFDQATPIFDPK+SSSFSKISCSSELC ALPTSTCS++ CEY YTYGD SST GVLA 
Sbjct: 137 QQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAA 196

Query: 186 ETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAY 245
           ETFTFGDSS+DQ           + N  + F+  QGEGLVGLGRGPLSLVSQLKEQKF+Y
Sbjct: 197 ETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFS--QGEGLVGLGRGPLSLVSQLKEQKFSY 256

Query: 246 CLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLP 305
           CLTAIDD+KPSSLL+GSLAN+KPK ++ E+KTTPLIRNPSQPSFYYLSLQGISVG TQLP
Sbjct: 257 CLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLP 316

Query: 306 IPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFN 365
           IPK+TFELH+DGSGGVIIDSGTTITYIE  AFTLLK EF++QM LPVDDSGT GLDLCFN
Sbjct: 317 IPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFN 376

Query: 366 LPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQQNF 425
           LP E TQVEVPKLTFHFKGADLELPGENYMIGDS+A LICLAIGSS GMSIFGNLQQQN 
Sbjct: 377 LPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSSGMSIFGNLQQQNI 436

Query: 426 MVVHDLQEETLSFLPTQCDSI 436
           MVVHDLQEET+SFLPTQC  I
Sbjct: 437 MVVHDLQEETVSFLPTQCSEI 454

BLAST of Lsi01G019440 vs. NCBI nr
Match: XP_038883313.1 (aspartic proteinase nepenthesin-1 [Benincasa hispida])

HSP 1 Score: 772.7 bits (1994), Expect = 1.7e-219
Identity = 398/444 (89.64%), Positives = 412/444 (92.79%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           VL TTLFIDT A    LSRRALQK N+LPSNGF+VKLNHVDHVKNLTRFERLRRGVARGK
Sbjct: 17  VLITTLFIDTLA----LSRRALQKSNELPSNGFKVKLNHVDHVKNLTRFERLRRGVARGK 76

Query: 63  NRLQRLNAMVLAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQC 122
           NRL RLNAMVLAANAAVG+QV+APVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQC
Sbjct: 77  NRLHRLNAMVLAANAAVGEQVQAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQC 136

Query: 123 KPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGV 182
           KPCQQCFDQATPIFDPK SSSFSKISCSSELCG LPTSTCS++GCEYLYTYGDSSST+GV
Sbjct: 137 KPCQQCFDQATPIFDPKASSSFSKISCSSELCGPLPTSTCSSDGCEYLYTYGDSSSTQGV 196

Query: 183 LAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQK 242
           LA ETFTFGDSS+DQ           + N  + F+  QG GLVGLGRGPLSLVSQLKEQK
Sbjct: 197 LALETFTFGDSSDDQVSITGLGFGCGDDNEGDGFS--QGAGLVGLGRGPLSLVSQLKEQK 256

Query: 243 FAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDT 302
           FAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVG T
Sbjct: 257 FAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGST 316

Query: 303 QLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDL 362
           QL IPK+TFELH+DGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDL
Sbjct: 317 QLSIPKTTFELHDDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDL 376

Query: 363 CFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQ 422
           CFNLPAEATQVEVPKLTFHF+GADLELPGENYMIGDSR GLICLAIGSSRGMSIFGNLQQ
Sbjct: 377 CFNLPAEATQVEVPKLTFHFEGADLELPGENYMIGDSRTGLICLAIGSSRGMSIFGNLQQ 436

Query: 423 QNFMVVHDLQEETLSFLPTQCDSI 436
           QNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 437 QNFMVVHDLQEETLSFLPTQCDSI 454

BLAST of Lsi01G019440 vs. NCBI nr
Match: XP_011653928.1 (aspartic proteinase nepenthesin-1 [Cucumis sativus] >KGN54860.1 hypothetical protein Csa_012038 [Cucumis sativus])

HSP 1 Score: 763.5 bits (1970), Expect = 1.0e-216
Identity = 390/445 (87.64%), Positives = 412/445 (92.58%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           +L TTLFI+T A SS+LSRRALQKPNKLPS+GFRV+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  ILITTLFINTLAFSSSLSRRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQ+TPIFDPK+SSSF KISCSSELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMKTTPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITY+EN+AFT LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA   QVEVPKLTFHFKGADLELPGENYMIGDS+AGL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. NCBI nr
Match: TYK02448.1 (aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa])

HSP 1 Score: 758.8 bits (1958), Expect = 2.5e-215
Identity = 389/445 (87.42%), Positives = 410/445 (92.13%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           V  TTLFI+T A SS+LSRRALQKPNKLPS+GF V+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  VFITTLFINTLAFSSSLSRRALQKPNKLPSHGFMVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA+VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQATPIFDPK+SSSFSKISC SELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMK TPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITYIE+TAF+ LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA  TQVEVPKLTFHFKGADLELPGENYMIGDS+ GL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. NCBI nr
Match: XP_008442220.1 (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo] >KAA0041170.1 aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa])

HSP 1 Score: 758.8 bits (1958), Expect = 2.5e-215
Identity = 389/445 (87.42%), Positives = 410/445 (92.13%), Query Frame = 0

Query: 3   VLSTTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGK 62
           V  TTLFI+T A SS+LS RALQKPNKLPS+GFRV+L HVDHVKNLTRFERLRRGVARGK
Sbjct: 19  VFITTLFINTLAFSSSLSTRALQKPNKLPSHGFRVRLKHVDHVKNLTRFERLRRGVARGK 78

Query: 63  NRLQRLNAMVL-AANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 122
           NRL RLNAMVL AANA+VGDQVKAPVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQ
Sbjct: 79  NRLHRLNAMVLAAANASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQ 138

Query: 123 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEG 182
           CKPCQQCFDQATPIFDPK+SSSFSKISC SELCGALPTSTCS++GCEYLYTYGDSSST+G
Sbjct: 139 CKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYGDSSSTQG 198

Query: 183 VLAFETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ 242
           VLAFETFTFGDS+EDQ           N N  + F+  QG GLVGLGRGPLSLVSQLKEQ
Sbjct: 199 VLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFS--QGAGLVGLGRGPLSLVSQLKEQ 258

Query: 243 KFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGD 302
           KFAYCLTAIDDSKPSSLLLGSLANI PKT+KDEMK TPLI+NPSQPSFYYLSLQGISVG 
Sbjct: 259 KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQGISVGG 318

Query: 303 TQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLD 362
           TQL IPKSTFELH+DGSGGVIIDSGTTITYIE+TAF+ LKNEFIAQMNLPVDDSGTGGLD
Sbjct: 319 TQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSGTGGLD 378

Query: 363 LCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQ 422
           LCFNLPA  TQVEVPKLTFHFKGADLELPGENYMIGDS+ GL+CLAIGSSRGMSIFGNLQ
Sbjct: 379 LCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSIFGNLQ 438

Query: 423 QQNFMVVHDLQEETLSFLPTQCDSI 436
           QQNFMVVHDLQEETLSFLPTQCDSI
Sbjct: 439 QQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Lsi01G019440 vs. NCBI nr
Match: XP_022928703.1 (aspartic proteinase nepenthesin-1 [Cucurbita moschata])

HSP 1 Score: 711.4 bits (1835), Expect = 4.6e-201
Identity = 364/441 (82.54%), Positives = 392/441 (88.89%), Query Frame = 0

Query: 6   TTLFIDTSALSSTLSRRALQKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNRL 65
           T LFI TSA SS+LSRRAL +P KLPS+GFRV LNHVDHVKNLTRFERL+RGVARGK RL
Sbjct: 17  TMLFIHTSASSSSLSRRALWQP-KLPSDGFRVSLNHVDHVKNLTRFERLQRGVARGKTRL 76

Query: 66  QRLNAMVLAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPC 125
            RLNAMVLAAN  VG +V+APVVAGNGEFLMKLAIG+PP+SFSAIMDTGSDLIWTQCKPC
Sbjct: 77  HRLNAMVLAANVGVGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC 136

Query: 126 QQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAF 185
           QQCFDQATPIFDPK+SSSFSKISCSSELC ALPTSTCS++ CEY YTYGD SST GVLA 
Sbjct: 137 QQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYFYTYGDYSSTHGVLAA 196

Query: 186 ETFTFGDSSEDQ-----------NINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAY 245
           ETFTFGDSS+DQ           + N  + F+  QGEGLVGLGRGPLSLVSQLKEQKF+Y
Sbjct: 197 ETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFS--QGEGLVGLGRGPLSLVSQLKEQKFSY 256

Query: 246 CLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLP 305
           CLTAIDD+KPSSLL+GSLAN+KPK ++ E+KTTPLIRNPSQPSFYYLSLQGISVG TQLP
Sbjct: 257 CLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYLSLQGISVGGTQLP 316

Query: 306 IPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFN 365
           IPK+TFELH+DGSGGVIIDSGTTITYIE  AFTLLK EF++QM LPVDDSGT GLDLCFN
Sbjct: 317 IPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPVDDSGTSGLDLCFN 376

Query: 366 LPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQQNF 425
           LP E TQVEVPKLTFHFKGADLELPGENYMIGDS+A LICLAIGSS GMSIFGNLQQQN 
Sbjct: 377 LPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSSGMSIFGNLQQQNI 436

Query: 426 MVVHDLQEETLSFLPTQCDSI 436
           MVVHDLQEET+SFLPTQC  I
Sbjct: 437 MVVHDLQEETVSFLPTQCSEI 454

BLAST of Lsi01G019440 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 509.6 bits (1311), Expect = 2.5e-144
Identity = 267/447 (59.73%), Positives = 333/447 (74.50%), Query Frame = 0

Query: 8   LFIDTSALSSTLSRRAL---QKPNKLPSNGFRVKLNHVDHVKNLTRFERLRRGVARGKNR 67
           L + +  +S + SRR+L     P  LP +GFR+ L HVD  KNLT+ ++++RG+ RG +R
Sbjct: 15  LILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQKIQRGINRGFHR 74

Query: 68  LQRLNAMVLAANAAVGD---QVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQ 127
           L RL A+ + A A+  D    +KAP   G+GEFLM+L+IG P   +SAI+DTGSDLIWTQ
Sbjct: 75  LNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQ 134

Query: 128 CKPCQQCFDQATPIFDPKKSSSFSKISCSSELCGALPTSTCS--NNGCEYLYTYGDSSST 187
           CKPC +CFDQ TPIFDP+KSSS+SK+ CSS LC ALP S C+   + CEYLYTYGD SST
Sbjct: 135 CKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSST 194

Query: 188 EGVLAFETFTFGDSSEDQNINF----ENLFNKI-QGEGLVGLGRGPLSLVSQLKEQKFAY 247
            G+LA ETFTF D +    I F    EN  +   QG GLVGLGRGPLSL+SQLKE KF+Y
Sbjct: 195 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 254

Query: 248 CLTAIDDSK-PSSLLLGSLAN-IKPKT----TKDEMKTTPLIRNPSQPSFYYLSLQGISV 307
           CLT+I+DS+  SSL +GSLA+ I  KT      +  KT  L+RNP QPSFYYL LQGI+V
Sbjct: 255 CLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 314

Query: 308 GDTQLPIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGG 367
           G  +L + KSTFEL  DG+GG+IIDSGTTITY+E TAF +LK EF ++M+LPVDDSG+ G
Sbjct: 315 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTG 374

Query: 368 LDLCFNLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGN 427
           LDLCF LP  A  + VPK+ FHFKGADLELPGENYM+ DS  G++CLA+GSS GMSIFGN
Sbjct: 375 LDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGN 434

Query: 428 LQQQNFMVVHDLQEETLSFLPTQCDSI 436
           +QQQNF V+HDL++ET+SF+PT+C  +
Sbjct: 435 VQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of Lsi01G019440 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 254.2 bits (648), Expect = 1.9e-67
Identity = 162/439 (36.90%), Positives = 235/439 (53.53%), Query Frame = 0

Query: 19  LSRRALQKPNKLPSNGFRVKLNHVDHVKN------LTRFERLRRGVARGKNRLQRLNAMV 78
           LS   L   N  P  GF   L H D  K+       T  +RLR  + R  NR+       
Sbjct: 15  LSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHF---- 74

Query: 79  LAANAAVGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQA 138
                    Q +  + + +GE+LM ++IGTPP    AI DTGSDL+WTQC PC  C+ Q 
Sbjct: 75  --TEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQV 134

Query: 139 TPIFDPKKSSSFSKISCSSELCGALPT-STCS--NNGCEYLYTYGDSSSTEGVLAFETFT 198
            P+FDPK SS++  +SCSS  C AL   ++CS  +N C Y  +YGD+S T+G +A +T T
Sbjct: 135 DPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLT 194

Query: 199 FGDSSEDQNINFENL-----------FNKIQGEGLVGLGRGPLSLVSQLKEQ---KFAYC 258
            G SS+ + +  +N+           FNK +G G+VGLG GP+SL+ QL +    KF+YC
Sbjct: 195 LG-SSDTRPMQLKNIIIGCGHNNAGTFNK-KGSGIVGLGGGPVSLIKQLGDSIDGKFSYC 254

Query: 259 LTAIDDSK--PSSLLLGSLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQL 318
           L  +   K   S +  G+ A +    +   + +TPLI   SQ +FYYL+L+ ISVG  Q+
Sbjct: 255 LVPLTSKKDQTSKINFGTNAIV----SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI 314

Query: 319 PIPKSTFELHNDGSGGVIIDSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCF 378
               S  E      G +IIDSGTT+T +    ++ L++   + ++         GL LC+
Sbjct: 315 QYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 374

Query: 379 NLPAEATQVEVPKLTFHFKGADLELPGENYMIGDSRAGLICLAIGSSRGMSIFGNLQQQN 433
           +   +   ++VP +T HF GAD++L   N  +  S   L+C A   S   SI+GN+ Q N
Sbjct: 375 SATGD---LKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRGSPSFSIYGNVAQMN 434

BLAST of Lsi01G019440 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 251.5 bits (641), Expect = 1.2e-66
Identity = 157/422 (37.20%), Positives = 221/422 (52.37%), Query Frame = 0

Query: 36  RVKLNHVDH--VKNLTRFERLRRGVARGKNRLQRLNAMVLAANAA-----------VGDQ 95
           RV +   +H   K+LT   RL R  AR K+ + RL+  +   + A               
Sbjct: 74  RVSVRGTEHSDYKSLT-LARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQD 133

Query: 96  VKAPVVA----GNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDP 155
           ++AP+++    G+GE+  ++ IG P +    ++DTGSD+ W QC PC  C+ Q  PIF+P
Sbjct: 134 IEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEP 193

Query: 156 KKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDSSEDQN 215
             SSS+  +SC +  C AL  S C N  C Y  +YGD S T G  A ET T G S+  QN
Sbjct: 194 SSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIG-STLVQN 253

Query: 216 I------NFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSL 275
           +      + E LF  +   GL+GLG G L+L SQL    F+YCL   D    S++  G+ 
Sbjct: 254 VAVGCGHSNEGLF--VGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDFGT- 313

Query: 276 ANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGVII 335
                 +   +    PL+RN    +FYYL L GISVG   L IP+S+FE+   GSGG+II
Sbjct: 314 ------SLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 373

Query: 336 DSGTTITYIENTAFTLLKNEFIAQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTFHFK 395
           DSGT +T ++   +  L++ F+         +G    D C+NL A+ T VEVP + FHF 
Sbjct: 374 DSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTT-VEVPTVAFHFP 433

Query: 396 GAD-LELPGENYMIGDSRAGLICLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSFLPT 433
           G   L LP +NYMI     G  CLA   ++  ++I GN+QQQ   V  DL    + F   
Sbjct: 434 GGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSN 483

BLAST of Lsi01G019440 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 249.6 bits (636), Expect = 4.6e-66
Identity = 135/365 (36.99%), Positives = 205/365 (56.16%), Query Frame = 0

Query: 81  DQVKAPVVA----GNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIF 140
           + +  PVV+    G+GE+  ++ +GTP K    ++DTGSD+ W QC+PC  C+ Q+ P+F
Sbjct: 145 EDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVF 204

Query: 141 DPKKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDSSED 200
           +P  SS++  ++CS+  C  L TS C +N C Y  +YGD S T G LA +T TFG+S + 
Sbjct: 205 NPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKI 264

Query: 201 QNI------NFENLFNKIQGEGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLG 260
            N+      + E LF      GL+GLG G LS+ +Q+K   F+YCL   D  K SSL   
Sbjct: 265 NNVALGCGHDNEGLFT--GAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFN 324

Query: 261 SLANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGV 320
           S+             T PL+RN    +FYY+ L G SVG  ++ +P + F++   GSGGV
Sbjct: 325 SV------QLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 384

Query: 321 IIDSGTTITYIENTAFTLLKNEFI-AQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTF 380
           I+D GT +T ++  A+  L++ F+   +NL    S     D C++  + +T V+VP + F
Sbjct: 385 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAF 444

Query: 381 HFKGA-DLELPGENYMIGDSRAGLICLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSF 433
           HF G   L+LP +NY+I    +G  C A   +S  +SI GN+QQQ   + +DL +  +  
Sbjct: 445 HFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGL 500

BLAST of Lsi01G019440 vs. TAIR 10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 242.7 bits (618), Expect = 5.6e-64
Identity = 140/423 (33.10%), Positives = 223/423 (52.72%), Query Frame = 0

Query: 32  SNGFRVKLNHVDHVKNLT-------RFERLRRGVARGKNRLQRLNAMVLAANAA------ 91
           S+ + ++L H D   ++T          R+RR   R    L+R++  V+ ++ +      
Sbjct: 56  SSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVND 115

Query: 92  VGDQVKAPVVAGNGEFLMKLAIGTPPKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDP 151
            G  + + +  G+GE+ +++ +G+PP+    ++D+GSD++W QC+PC+ C+ Q+ P+FDP
Sbjct: 116 FGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDP 175

Query: 152 KKSSSFSKISCSSELCGALPTSTCSNNGCEYLYTYGDSSSTEGVLAFETFTFGDS---SE 211
            KS S++ +SC S +C  +  S C + GC Y   YGD S T+G LA ET TF  +   + 
Sbjct: 176 AKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNV 235

Query: 212 DQNINFENLFNKIQGEGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSL 271
                  N    I   GL+G+G G +S V QL  Q    F YCL +       SL+ G  
Sbjct: 236 AMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR- 295

Query: 272 ANIKPKTTKDEMKTTPLIRNPSQPSFYYLSLQGISVGDTQLPIPKSTFELHNDGSGGVII 331
                +         PL+RNP  PSFYY+ L+G+ VG  ++P+P   F+L   G GGV++
Sbjct: 296 -----EALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVM 355

Query: 332 DSGTTITYIENTAFTLLKNEFIAQ-MNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTFHF 391
           D+GT +T +   A+   ++ F +Q  NLP   SG    D C++L +    V VP ++F+F
Sbjct: 356 DTGTAVTRLPTAAYVAFRDGFKSQTANLP-RASGVSIFDTCYDL-SGFVSVRVPTVSFYF 415

Query: 392 -KGADLELPGENYMIGDSRAGLICLAIGSS-RGMSIFGNLQQQNFMVVHDLQEETLSFLP 433
            +G  L LP  N+++    +G  C A  +S  G+SI GN+QQ+   V  D     + F P
Sbjct: 416 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q766C3 | 4.3e-117 | 53.99 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q766C2 | 8.1e-108 | 48.35 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q7XV21 | 1.2e-66 | 34.75 | Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2
Q6XBF8 | 2.6e-66 | 36.90 | Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q9LS40 | 6.5e-65 | 36.99 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...

Match Name | E-value | Identity (%) | Description
A0A0A0KYT9 | 4.9e-217 | 87.64 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G55468...
A0A5A7TD10 | 1.2e-215 | 87.42 | Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5D3BTY9 | 1.2e-215 | 87.42 | Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3B573 | 1.2e-215 | 87.42 | aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103486136 PE=3 S...
A0A6J1EPU6 | 2.2e-201 | 82.54 | aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 ...

Match Name | E-value | Identity (%) | Description
XP_038883313.1 | 1.7e-219 | 89.64 | aspartic proteinase nepenthesin-1 [Benincasa hispida]
XP_011653928.1 | 1.0e-216 | 87.64 | aspartic proteinase nepenthesin-1 [Cucumis sativus] >KGN54860.1 hypothetical pro...
TYK02448.1 | 2.5e-215 | 87.42 | aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa]
XP_008442220.1 | 2.5e-215 | 87.42 | PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo] >KAA0041170.1 aspart...
XP_022928703.1 | 4.6e-201 | 82.54 | aspartic proteinase nepenthesin-1 [Cucurbita moschata]

Match Name | E-value | Identity (%) | Description
AT2G03200.1 | 2.5e-144 | 59.73 | Eukaryotic aspartyl protease family protein
AT5G33340.1 | 1.9e-67 | 36.90 | Eukaryotic aspartyl protease family protein
AT1G25510.1 | 1.2e-66 | 37.20 | Eukaryotic aspartyl protease family protein
AT3G18490.1 | 4.6e-66 | 36.99 | Eukaryotic aspartyl protease family protein
AT3G20015.1 | 5.6e-64 | 33.10 | Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 310..321, score: 38.11; coord: 404..419, score: 26.0; coord: 100..120, score: 45.8
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 252..435, e-value: 4.1E-56, score: 191.6
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 77..250, e-value: 8.6E-49, score: 168.1
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 88..433
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 94..251, e-value: 6.4E-44, score: 150.3
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 278..427, e-value: 1.1E-35, score: 122.9
None | No IPR available | PANTHER | PTHR47967:SF23 | OS08G0469000 PROTEIN | coord: 19..434
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 19..434
IPR001969 | Aspartic peptidase, active site | PROSITE | PS00141 | ASP_PROTEASE | coord: 310..321
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 94..428, score: 40.199642
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 93..432, e-value: 1.37647E-101, score: 301.489
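
The `coord` values in the InterPro table above can be compared directly to see how the annotated regions nest. The sketch below is illustrative only and not part of the original record: it assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, and the helper names `span_length` and `contains` are hypothetical. It computes the length of a few regions and checks whether the PROSITE active-site pattern falls inside the Pfam TAXi_C domain.

```python
# Selected rows from the InterPro table: (source term, start, end).
# Coordinates are assumed to be 1-based and inclusive.
hits = [
    ("PF14543", 94, 251),   # TAXi_N
    ("PF14541", 278, 427),  # TAXi_C
    ("PS00141", 310, 321),  # ASP_PROTEASE active-site pattern
    ("cd05476", 93, 432),   # pepsin_A_like_plant
]


def span_length(start, end):
    """Number of residues in an inclusive interval."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner hit lies entirely within the outer hit."""
    return outer[1] <= inner[1] and inner[2] <= outer[2]


lengths = {name: span_length(start, end) for name, start, end in hits}
by_name = {hit[0]: hit for hit in hits}
active_in_taxi_c = contains(by_name["PF14541"], by_name["PS00141"])

print(lengths)
print("PS00141 inside PF14541:", active_in_taxi_c)
```

The same containment test can be applied to any pair of rows, for example to confirm that both Pfam domains lie within the CDD pepsin-like region.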

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi01G019440.1 | Lsi01G019440.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity