Bhi01G000358 (gene) Wax gourd

NameBhi01G000358
Typegene
OrganismBenincasa hispida (Wax gourd)
DescriptionAspartic protease
Locationchr1 : 8780286 .. 8782244 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATAAAATAAAATAACACAGATGTCAAACTCCACCACTTTAATTTCTTATGCTTCTCCCTTATTTATAGTGTCTGAGCTCTGAACAAATTGTATCTCTGTCTTTCTTTCTAACAAAAAATGGAAGCAATGACCAAACTGTTCTCCTTTATCTTCTTCCTTCTCACTCTTCTTTCTCTCTCCACCGCCATCTCCGATTTCCAAACCCTAATTCCCACCTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCCGATTCTGAGTCTTTCATCTCCTCCGACACCACCGAATCGGACCTCGGCTTAACACTGCACCTCCACCATTTGGACGCTCTGTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGGGATGCTCTCCGAGTCAAGAAGCTGAGTTCACTCCATGATTCATCTCAAAATGTGAGCCAAACCAGTGGGACCGGGTTCAGTAGCTCCGTCATCTCGGGACTCGCTCAGGGCAGCGGCGAGTACTTCACGCGCATCGGCGTCGGCACGCCACCCAAGTATGTTTACATGGTGCTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTACTCTCAGACTGACCCTGTTTTCAACCCGGTTAAGTCTGGATCCTTCGCCAAGGTCCTCTGCAGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGATGCAACCAGCGTCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACCACCGGCGAGTTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTTGGCCGTGGAGGGTTGTCGTTTCCGTCGCAAACCGGTCGGACTTTCAATCAGAAATTCTCTTACTGTTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTAACGCCGCTGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTCACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGCACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAATGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGGTTGAACCGACCAGCGTACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCATCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTTTTCGATACTTGCTACGATCTTTCCGGGAAGACGACGGTGAAGGTCCCGACGGTTGTGCTGCATTTCAGAGGCGCTGATGTATCGTTACCGGCGTCCAATTATCTGATCCCTGTCGACGGCAGCGGCCGATTCTGCTTCGCCTTCGCCGGAACGACCAGTGGGCTATCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAGTTCTCGGGTCGGATTTTCTCCTCGTGGTTGCGCCTAATTTCATCGGACCAACGTAATGCGGTTGCCGGGAAAATTCCTCCGCCGCCGTCTGTCCTAAAACCCTTCTTCCTTTTTTCAGACGTGATAATAAAAAAGAAAGAAAAAGAAAAAAGGTAGAAAGAAAAAGGAGTGCATAATAGACTCGAACGGTGTCGTTTAACTTTTACTATTTTAATTGACTTACTTTTTTCATTCCAATTATCTCTTTTCCTTTGCTTATTTAATTATATTATCTATTTGGAGTGTCGGTTATTTTATATATTAATTTCTTAAAAACATAAATGTCATAAACATAAATAATAAGAATTGTTCAAATTCTTTTTGGTTTCTGAATTATGTGTGCCTTTAATATGTTCTTTTTAATGTACATATTTTAGTATTTGAATTATATGTTAGTCATTATTTGGACTATGATTTTCTTTTTTAA

mRNA sequence

ATAAAATAAAATAACACAGATGTCAAACTCCACCACTTTAATTTCTTATGCTTCTCCCTTATTTATAGTGTCTGAGCTCTGAACAAATTGTATCTCTGTCTTTCTTTCTAACAAAAAATGGAAGCAATGACCAAACTGTTCTCCTTTATCTTCTTCCTTCTCACTCTTCTTTCTCTCTCCACCGCCATCTCCGATTTCCAAACCCTAATTCCCACCTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCCGATTCTGAGTCTTTCATCTCCTCCGACACCACCGAATCGGACCTCGGCTTAACACTGCACCTCCACCATTTGGACGCTCTGTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGGGATGCTCTCCGAGTCAAGAAGCTGAGTTCACTCCATGATTCATCTCAAAATGTGAGCCAAACCAGTGGGACCGGGTTCAGTAGCTCCGTCATCTCGGGACTCGCTCAGGGCAGCGGCGAGTACTTCACGCGCATCGGCGTCGGCACGCCACCCAAGTATGTTTACATGGTGCTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTACTCTCAGACTGACCCTGTTTTCAACCCGGTTAAGTCTGGATCCTTCGCCAAGGTCCTCTGCAGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGATGCAACCAGCGTCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACCACCGGCGAGTTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTTGGCCGTGGAGGGTTGTCGTTTCCGTCGCAAACCGGTCGGACTTTCAATCAGAAATTCTCTTACTGTTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTAACGCCGCTGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTCACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGCACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAATGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGGTTGAACCGACCAGCGTACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCATCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTTTTCGATACTTGCTACGATCTTTCCGGGAAGACGACGGTGAAGGTCCCGACGGTTGTGCTGCATTTCAGAGGCGCTGATGTATCGTTACCGGCGTCCAATTATCTGATCCCTGTCGACGGCAGCGGCCGATTCTGCTTCGCCTTCGCCGGAACGACCAGTGGGCTATCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAGTTCTCGGGTCGGATTTTCTCCTCGTGGTTGCGCCTAATTTCATCGGACCAACGTAATGCGGTTGCCGGGAAAATTCCTCCGCCGCCGTCTGTCCTAAAACCCTTCTTCCTTTTTTCAGACGTGATAATAAAAAAGAAAGAAAAAGAAAAAAGGTAGAAAGAAAAAGGAGTGCATAATAGACTCGAACGGTGTCGTTTAACTTTTACTATTTTAATTGACTTACTTTTTTCATTCCAATTATCTCTTTTCCTTTGCTTATTTAATTATATTATCTATTTGGAGTGTCGGTTATTTTATATATTAATTTCTTAAAAACATAAATGTCATAAACATAAATAATAAGAATTGTTCAAATTCTTTTTGGTTTCTGAATTATGTGTGCCTTTAATATGTTCTTTTTAATGTACATATTTTAGTATTTGAATTATATGTTAGTCATTATTTGGACTATGATTTTCTTTTTTAA

Coding sequence (CDS)

ATGGAAGCAATGACCAAACTGTTCTCCTTTATCTTCTTCCTTCTCACTCTTCTTTCTCTCTCCACCGCCATCTCCGATTTCCAAACCCTAATTCCCACCTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCCGATTCTGAGTCTTTCATCTCCTCCGACACCACCGAATCGGACCTCGGCTTAACACTGCACCTCCACCATTTGGACGCTCTGTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGGGATGCTCTCCGAGTCAAGAAGCTGAGTTCACTCCATGATTCATCTCAAAATGTGAGCCAAACCAGTGGGACCGGGTTCAGTAGCTCCGTCATCTCGGGACTCGCTCAGGGCAGCGGCGAGTACTTCACGCGCATCGGCGTCGGCACGCCACCCAAGTATGTTTACATGGTGCTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTACTCTCAGACTGACCCTGTTTTCAACCCGGTTAAGTCTGGATCCTTCGCCAAGGTCCTCTGCAGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGATGCAACCAGCGTCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACCACCGGCGAGTTCGTCACCGAAACCCTAACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTTGGCCGTGGAGGGTTGTCGTTTCCGTCGCAAACCGGTCGGACTTTCAATCAGAAATTCTCTTACTGTTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCCTCCGTCGTCTTCGGTAACGCCGCTGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTCACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGCACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAATGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGGTTGAACCGACCAGCGTACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCATCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTTTTCGATACTTGCTACGATCTTTCCGGGAAGACGACGGTGAAGGTCCCGACGGTTGTGCTGCATTTCAGAGGCGCTGATGTATCGTTACCGGCGTCCAATTATCTGATCCCTGTCGACGGCAGCGGCCGATTCTGCTTCGCCTTCGCCGGAACGACCAGTGGGCTATCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAGTTCTCGGGTCGGATTTTCTCCTCGTGGTTGCGCCTAA

Protein sequence

MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
BLAST of Bhi01G000358 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 626.7 bits (1615), Expect = 1.1e-179
Identity = 333/478 (69.67%), Postives = 378/478 (79.08%), Query Frame = 0

Query: 7   LFSFIFFLLTLLSLSTAISDFQTLIPT--SLP-SSPSFLPSDSE-------SFISSDTTE 66
           LFS  FF L+L S S      QTL P   SLP +SP     DS+                
Sbjct: 9   LFSLCFFFLSLPSFS-XXXXXQTLFPNSHSLPCASPVSFQPDSDXXXXXXXXXXXXXXXX 68

Query: 67  SDLGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSL--HDSSQNVSQTSGT-GF 126
                  +L H+DALS N+TP+ELF  RLQRD+ RVK +++L      +NV+      GF
Sbjct: 69  XXXXXXXNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGF 128

Query: 127 SSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPV 186
           SSSV+SGL+QGSGEYFTR+GVGTP +YVYMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P 
Sbjct: 129 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 188

Query: 187 KSGSFAKVLCRTPLCRRLESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVER 246
           KS ++A + C +P CRRL+S GCN +R+TCLYQVSYGDGS+T G+F TETLTFRR +V+ 
Sbjct: 189 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG 248

Query: 247 VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFG 306
           VALGCGHDNEGLFVGAAGLLGLG+G LSFP QTG  FNQKFSYCLVDRSASSKPSSVVFG
Sbjct: 249 VALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG 308

Query: 307 NAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCG 366
           NAAVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS FKLD  GNGGVIID G
Sbjct: 309 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 368

Query: 367 TSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV 426
           TSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADV
Sbjct: 369 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 428

Query: 427 SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           SLPA+NYLIPVD +G+FCFAFAGT  GLSIIGNIQQQGFRVVYDLASSRVGF+P GCA
Sbjct: 429 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Bhi01G000358 vs. TAIR10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 588.6 bits (1516), Expect = 3.5e-168
Identity = 309/478 (64.64%), Postives = 369/478 (77.20%), Query Frame = 0

Query: 7   LFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLH 66
           +F+ +FF       S+A S +QTL+  +LPSS +    +SES      +ES   L++HL 
Sbjct: 13  VFAVLFF------TSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVHLS 72

Query: 67  HLDALS--LNRTPEELFHLRLQRDALRVKKLSSL------HDSSQNVSQTSGTGFSSSVI 126
           H+DALS   + +P +LF+LRLQRD+LRVK ++SL       ++++   +T+G GFS +VI
Sbjct: 73  HVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRTAG-GFSGAVI 132

Query: 127 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 186
           SGL+QGSGEYF R+GVGTP   VYMVLDTGSD+VWLQC+PCK CY+QTD +F+P KS +F
Sbjct: 133 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTF 192

Query: 187 AKVLCRTPLCRRL-ESPGCNQR--QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVAL 246
           A V C + LCRRL +S  C  R  +TCLYQVSYGDGS+T G+F TETLTF   +V+ V L
Sbjct: 193 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPL 252

Query: 247 GCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDR----SASSKPSSVVF 306
           GCGHDNEGLFVGAAGLLGLGRGGLSFPSQT   +N KFSYCLVDR    S+S  PS++VF
Sbjct: 253 GCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVF 312

Query: 307 GNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDC 366
           GNAAV +T+ FTPLLTNP+LDTFYY++LLGISVGG+ V G+S S FKLD TGNGGVIID 
Sbjct: 313 GNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDS 372

Query: 367 GTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 426
           GTSVTRL +PAY+ALRDAFR GA+ LK AP +SLFDTC+DLSG TTVKVPTVV HF G +
Sbjct: 373 GTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGE 432

Query: 427 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
           VSLPASNYLIPV+  GRFCFAFAGT   LSIIGNIQQQGFRV YDL  SRVGF  R C
Sbjct: 433 VSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Bhi01G000358 vs. TAIR10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 400.6 bits (1028), Expect = 1.3e-111
Identity = 242/495 (48.89%), Postives = 306/495 (61.82%), Query Frame = 0

Query: 4   MTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFL-PSDS---ESFISS------- 63
           M+  +SF FF+  L S S   S F  ++P +  ++ S L  +DS     + SS       
Sbjct: 1   MSPNYSFFFFIFFLTSHS---SVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQE 60

Query: 64  -DTTESDLGLTLHLHHLDALSLNRTP----EELFHLRLQRDALRVKKLSSLHD-SSQNVS 123
             T  +    +L LH    +S+  T     + L   RL RD  RVK L +  D +  N+S
Sbjct: 61  EQTHSASSSFSLQLH--SRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNIS 120

Query: 124 Q-----------TSGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQ 183
           +           T      + +ISG  QGSGEYFTR+G+G P + VYMVLDTGSD+ WLQ
Sbjct: 121 KADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQ 180

Query: 184 CAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTT 243
           C PC +CY QT+P+F P  S S+  + C TP C  LE   C +  TCLY+VSYGDGSYT 
Sbjct: 181 CTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC-RNATCLYEVSYGDGSYTV 240

Query: 244 GEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSY 303
           G+F TETLT   T V+ VA+GCGH NEGLFVGAAGLLGLG G L+ PSQ   T    FSY
Sbjct: 241 GDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSY 300

Query: 304 CLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISAS 363
           CLVDR + S  S+V FG  ++S  A   PLL N +LDTFYY+ L GISVGG  +  I  S
Sbjct: 301 CLVDRDSDS-ASTVDFG-TSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQ-IPQS 360

Query: 364 HFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 423
            F++D +G+GG+IID GT+VTRL    Y +LRD+F  G   L+ A   ++FDTCY+LS K
Sbjct: 361 SFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAK 420

Query: 424 TTVKVPTVVLHFRGAD-VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVV 470
           TTV+VPTV  HF G   ++LPA NY+IPVD  G FC AFA T S L+IIGN+QQQG RV 
Sbjct: 421 TTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVT 480

BLAST of Bhi01G000358 vs. TAIR10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 397.5 bits (1020), Expect = 1.1e-110
Identity = 230/470 (48.94%), Postives = 285/470 (60.64%), Query Frame = 0

Query: 16  TLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDALSLNR 75
           T+LSL    S   T  P SL S P F  S            S L L LH       S ++
Sbjct: 49  TILSLDPTRSSLTTTKPESL-SDPVFFNS-----------SSPLSLELHSRDTFVASQHK 108

Query: 76  TPEELFHLRLQRDALRVKKL-------------SSLHDSSQNVSQTSGTGFSSSVISGLA 135
             + L   RL+RD+ RV  +             S L       ++      ++ V+SG +
Sbjct: 109 DYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGAS 168

Query: 136 QGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVL 195
           QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S ++  + 
Sbjct: 169 QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLT 228

Query: 196 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHDN 255
           C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHDN
Sbjct: 229 CSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 288

Query: 256 EGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTAR 315
           EGLF GAAGLLGLG G LS  +Q   T    FSYCLVDR  S K SS+ F +  +     
Sbjct: 289 EGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDA 348

Query: 316 FTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRP 375
             PLL N ++DTFYYV L G SVGG  V  +  + F +D +G+GGVI+DCGT+VTRL   
Sbjct: 349 TAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQ 408

Query: 376 AYIALRDAF-RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNY 435
           AY +LRDAF +   +  K +   SLFDTCYD S  +TVKVPTV  HF G   + LPA NY
Sbjct: 409 AYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNY 468

Query: 436 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
           LIPVD SG FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 469 LIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Bhi01G000358 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 387.9 bits (995), Expect = 9.0e-108
Identity = 215/452 (47.57%), Postives = 280/452 (61.95%), Query Frame = 0

Query: 26  DFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDALS--LNRTPEELFHL 85
           DFQ +     P + +    D  +   SD  ES    TL L H D       R      H 
Sbjct: 26  DFQIIDVLQPPLTVTATLPDFNNTHFSD--ESSSKYTLRLLHRDRFPSVTYRNHHHRLHA 85

Query: 86  RLQRDALRVKKLSSLHDSSQNVSQTSGT-----GFSSSVISGLAQGSGEYFTRIGVGTPP 145
           R++RD  RV  +  L   S  V  +S +      F S ++SG+ QGSGEYF RIGVG+PP
Sbjct: 86  RMRRDTDRVSAI--LRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPP 145

Query: 146 KYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQ 205
           +  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KSGS+  V C + +C R+E+ GC+ 
Sbjct: 146 RDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS 205

Query: 206 RQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGG 265
              C Y+V YGDGSYT G    ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G 
Sbjct: 206 -GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGS 265

Query: 266 LSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVE 325
           +SF  Q        F YCLV R   S   S+VFG  A+   A + PL+ NPR  +FYYV 
Sbjct: 266 MSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVG 325

Query: 326 LLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLK 385
           L G+ VGG  +  +    F L  TG+GGV++D GT+VTRL   AY+A RD F++  ++L 
Sbjct: 326 LKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLP 385

Query: 386 SAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTT 445
            A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LPA N+L+PVD SG +CFAFA + 
Sbjct: 386 RASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASP 445

Query: 446 SGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
           +GLSIIGNIQQ+G +V +D A+  VGF P  C
Sbjct: 446 TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Bhi01G000358 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 626.7 bits (1615), Expect = 2.1e-178
Identity = 333/478 (69.67%), Postives = 378/478 (79.08%), Query Frame = 0

Query: 7   LFSFIFFLLTLLSLSTAISDFQTLIPT--SLP-SSPSFLPSDSE-------SFISSDTTE 66
           LFS  FF L+L S S      QTL P   SLP +SP     DS+                
Sbjct: 9   LFSLCFFFLSLPSFS-XXXXXQTLFPNSHSLPCASPVSFQPDSDXXXXXXXXXXXXXXXX 68

Query: 67  SDLGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSL--HDSSQNVSQTSGT-GF 126
                  +L H+DALS N+TP+ELF  RLQRD+ RVK +++L      +NV+      GF
Sbjct: 69  XXXXXXXNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGF 128

Query: 127 SSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPV 186
           SSSV+SGL+QGSGEYFTR+GVGTP +YVYMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P 
Sbjct: 129 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 188

Query: 187 KSGSFAKVLCRTPLCRRLESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVER 246
           KS ++A + C +P CRRL+S GCN +R+TCLYQVSYGDGS+T G+F TETLTFRR +V+ 
Sbjct: 189 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG 248

Query: 247 VALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFG 306
           VALGCGHDNEGLFVGAAGLLGLG+G LSFP QTG  FNQKFSYCLVDRSASSKPSSVVFG
Sbjct: 249 VALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG 308

Query: 307 NAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCG 366
           NAAVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS FKLD  GNGGVIID G
Sbjct: 309 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 368

Query: 367 TSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV 426
           TSVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADV
Sbjct: 369 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 428

Query: 427 SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           SLPA+NYLIPVD +G+FCFAFAGT  GLSIIGNIQQQGFRVVYDLASSRVGF+P GCA
Sbjct: 429 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Bhi01G000358 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.0e-109
Identity = 230/470 (48.94%), Postives = 285/470 (60.64%), Query Frame = 0

Query: 16  TLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDALSLNR 75
           T+LSL    S   T  P SL S P F  S            S L L LH       S ++
Sbjct: 49  TILSLDPTRSSLTTTKPESL-SDPVFFNS-----------SSPLSLELHSRDTFVASQHK 108

Query: 76  TPEELFHLRLQRDALRVKKL-------------SSLHDSSQNVSQTSGTGFSSSVISGLA 135
             + L   RL+RD+ RV  +             S L       ++      ++ V+SG +
Sbjct: 109 DYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGAS 168

Query: 136 QGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVL 195
           QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S ++  + 
Sbjct: 169 QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLT 228

Query: 196 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHDN 255
           C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHDN
Sbjct: 229 CSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 288

Query: 256 EGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTAR 315
           EGLF GAAGLLGLG G LS  +Q   T    FSYCLVDR  S K SS+ F +  +     
Sbjct: 289 EGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDA 348

Query: 316 FTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRP 375
             PLL N ++DTFYYV L G SVGG  V  +  + F +D +G+GGVI+DCGT+VTRL   
Sbjct: 349 TAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQ 408

Query: 376 AYIALRDAF-RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNY 435
           AY +LRDAF +   +  K +   SLFDTCYD S  +TVKVPTV  HF G   + LPA NY
Sbjct: 409 AYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNY 468

Query: 436 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
           LIPVD SG FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 469 LIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Bhi01G000358 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 387.9 bits (995), Expect = 1.6e-106
Identity = 215/452 (47.57%), Postives = 280/452 (61.95%), Query Frame = 0

Query: 26  DFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDALS--LNRTPEELFHL 85
           DFQ +     P + +    D  +   SD  ES    TL L H D       R      H 
Sbjct: 26  DFQIIDVLQPPLTVTATLPDFNNTHFSD--ESSSKYTLRLLHRDRFPSVTYRNHHHRLHA 85

Query: 86  RLQRDALRVKKLSSLHDSSQNVSQTSGT-----GFSSSVISGLAQGSGEYFTRIGVGTPP 145
           R++RD  RV  +  L   S  V  +S +      F S ++SG+ QGSGEYF RIGVG+PP
Sbjct: 86  RMRRDTDRVSAI--LRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPP 145

Query: 146 KYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQ 205
           +  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KSGS+  V C + +C R+E+ GC+ 
Sbjct: 146 RDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS 205

Query: 206 RQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGG 265
              C Y+V YGDGSYT G    ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G 
Sbjct: 206 -GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGS 265

Query: 266 LSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVE 325
           +SF  Q        F YCLV R   S   S+VFG  A+   A + PL+ NPR  +FYYV 
Sbjct: 266 MSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYVG 325

Query: 326 LLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLK 385
           L G+ VGG  +  +    F L  TG+GGV++D GT+VTRL   AY+A RD F++  ++L 
Sbjct: 326 LKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLP 385

Query: 386 SAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTT 445
            A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LPA N+L+PVD SG +CFAFA + 
Sbjct: 386 RASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASP 445

Query: 446 SGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
           +GLSIIGNIQQ+G +V +D A+  VGF P  C
Sbjct: 446 TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Bhi01G000358 vs. Swiss-Prot
Match: sp|Q8S9J6|ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 279.3 bits (713), Expect = 8.1e-74
Identity = 175/446 (39.24%), Postives = 240/446 (53.81%), Query Frame = 0

Query: 40  SFLPSDSESFISS---DTTESDLGLTLHLHHLDALSLNRTPEELFHLRLQR-DALRVKKL 99
           S LPS S S + S    TT+S L +T H H   +   N       H+ + R D  RV  +
Sbjct: 40  SLLPSSSSSCVLSPRASTTKSSLHVT-HRHGTCSRLNNGKATSPDHVEILRLDQARVNSI 99

Query: 100 SSLHD---SSQNVSQTSGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIV 159
            S      ++ +VS++  T   +    G   GSG Y   +G+GTP   + ++ DTGSD+ 
Sbjct: 100 HSKLSKKLATDHVSESKSTDLPAK--DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 159

Query: 160 WLQCAPC-KNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLES----PGCNQRQTCLYQVS 219
           W QC PC + CY Q +P+FNP KS S+  V C +  C  L S     G      C+Y + 
Sbjct: 160 WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 219

Query: 220 YGDGSYTTGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTG 279
           YGD S++ G    E  T   + V + V  GCG +N+GLF G AGLLGLGR  LSFPSQT 
Sbjct: 220 YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 279

Query: 280 RTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGG 339
             +N+ FSYCL   S++S    + FG+A +SR+ +FTP+ T     +FY + ++ I+VGG
Sbjct: 280 TAYNKIFSYCL--PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGG 339

Query: 340 TPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLF 399
             +  I ++ F        G +ID GT +TRL   AY ALR +F+A  S   +    S+ 
Sbjct: 340 QKLP-IPSTVF-----STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSIL 399

Query: 400 DTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTT--SGLSIIG 459
           DTC+DLSG  TV +P V   F G  V    S  +  V    + C AFAG +  S  +I G
Sbjct: 400 DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFG 459

Query: 460 NIQQQGFRVVYDLASSRVGFSPRGCA 471
           N+QQQ   VVYD A  RVGF+P GC+
Sbjct: 460 NVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of Bhi01G000358 vs. Swiss-Prot
Match: sp|Q9LEW3|AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 6.9e-73
Identity = 190/486 (39.09%), Postives = 259/486 (53.29%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLL----------SLSTAISDFQTLIPTSL-PSSPSFLPSDSESF 60
           M  M    S I  L   L          S S  + D  T+  +SL PSS S +PS   S 
Sbjct: 1   MSIMRNFLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKAS- 60

Query: 61  ISSDTTESDLGLTLHLHHLDALSLNRTPEELFHLR-LQRDALRVKKLSS--LHDSSQNVS 120
                T+S L + +H+H   A S   +   + H   ++RD  RV+ + S    +S+  VS
Sbjct: 61  ----NTKSSLRV-VHMH--GACSHLSSDARVDHDEIIRRDQARVESIYSKLSKNSANEVS 120

Query: 121 QTSGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC-KNCYSQ 180
           +   T   +   SG+  GSG Y   IG+GTP   + +V DTGSD+ W QC PC  +CYSQ
Sbjct: 121 EAKSTELPAK--SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQ 180

Query: 181 TDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTF 240
            +P FNP  S ++  V C +P+C   ES  C+    C+Y + YGD S+T G    E  T 
Sbjct: 181 KEPKFNPSSSSTYQNVSCSSPMCEDAES--CS-ASNCVYSIVYGDKSFTQGFLAKEKFTL 240

Query: 241 RRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASS 300
             + V E V  GCG +N+GLF G AGLLGLG G LS P+QT  T+N  FSYCL   +++S
Sbjct: 241 TNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS 300

Query: 301 KPSSVVFGNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGN 360
               + FG+A +S + +FTP+ + P     Y ++++GISVG   ++ I+ + F  +    
Sbjct: 301 -TGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELA-ITPNSFSTE---- 360

Query: 361 GGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVV 420
            G IID GT  TRL    Y  LR  F+   SS KS   + LFDTCYD +G  TV  PT+ 
Sbjct: 361 -GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIA 420

Query: 421 LHFRGAD-VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVG 470
             F G+  V L  S   +P+  S + C AFAG     +I GN+QQ    VVYD+A  RVG
Sbjct: 421 FSFAGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVG 464

BLAST of Bhi01G000358 vs. TrEMBL
Match: tr|A0A0A0L8K0|A0A0A0L8K0_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_3G119540 PE=3 SV=1)

HSP 1 Score: 861.3 bits (2224), Expect = 1.0e-246
Identity = 436/471 (92.57%), Postives = 451/471 (95.75%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLG 60
           ME  T    FIFFLLT+LSL+TA SDFQTL  TSLPSSPSFLPSDS SF+SS+ T+S+LG
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 101

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSL  +S+N+S+  G TGFSSSVIS
Sbjct: 102 LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 161

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 162 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 221

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 222 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 281

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGN+AVSRT
Sbjct: 282 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 341

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 342 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 401

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 402 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 461

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
Sbjct: 462 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512

BLAST of Bhi01G000358 vs. TrEMBL
Match: tr|A0A1S3AV66|A0A1S3AV66_CUCME (aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103483183 PE=3 SV=1)

HSP 1 Score: 860.1 bits (2221), Expect = 2.3e-246
Identity = 438/472 (92.80%), Postives = 452/472 (95.76%), Query Frame = 0

Query: 1   MEAMTKLFSFIFF-LLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDL 60
           MEA T    FIFF LL +LSLSTA SDFQTLI  SLPSSPSFLPSDS SF+SS+ TE++L
Sbjct: 3   MEANTISLPFIFFLLLAILSLSTAFSDFQTLILRSLPSSPSFLPSDSNSFLSSEATETEL 62

Query: 61  GLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-TGFSSSVI 120
           GL LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSL  +S+N+S+ SG TGFSSSVI
Sbjct: 63  GLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSRPSGTTGFSSSVI 122

Query: 121 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 180
           SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF
Sbjct: 123 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 182

Query: 181 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCG 240
           AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCG
Sbjct: 183 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCG 242

Query: 241 HDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 300
           HDNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGN+AVSR
Sbjct: 243 HDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 302

Query: 301 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRL 360
           TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGIS+SHFKLD TGNGGVIIDCGTSVTRL
Sbjct: 303 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGGVIIDCGTSVTRL 362

Query: 361 NRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 420
           N+PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN
Sbjct: 363 NKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 422

Query: 421 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
Sbjct: 423 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 474

BLAST of Bhi01G000358 vs. TrEMBL
Match: tr|A0A2I4EG53|A0A2I4EG53_9ROSI (aspartyl protease family protein 2 OS=Juglans regia OX=51240 GN=LOC108989274 PE=3 SV=1)

HSP 1 Score: 689.1 bits (1777), Expect = 6.9e-195
Identity = 353/463 (76.24%), Postives = 394/463 (85.10%), Query Frame = 0

Query: 11  IFFLLTLLSLSTAISDFQTLIPTSLPSSP---SFLPSDSESFISSDTTESDLGLTLHLHH 70
           IFF    +S ST++  +QTL+   L ++P   S+  S+SES +S  T  +    TL LHH
Sbjct: 15  IFFXXXSISSSTSLR-YQTLVLNPLSTTPHSLSWPESESESVVSDSTVAT---TTLELHH 74

Query: 71  LDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSGTGFSSSVISGLAQGSGE 130
           LD+LSLN+TPE+LFHLRLQRDA RVK L+SL  +  N S+  G GFSSSVISGLAQGSGE
Sbjct: 75  LDSLSLNKTPEQLFHLRLQRDAFRVKALTSL-AAVGNRSRAHGAGFSSSVISGLAQGSGE 134

Query: 131 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 190
           YFTRIGVGTPPKYVYMVLDTGSD+VW+QCAPC+ CYSQ DPVF+P KS SFA + C +PL
Sbjct: 135 YFTRIGVGTPPKYVYMVLDTGSDVVWVQCAPCRKCYSQVDPVFDPRKSRSFAGISCGSPL 194

Query: 191 CRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVG 250
           C +L+SPGCN R+TCLYQVSYGDGS+TTG+F TETLTFR T+V RVALGCGH+N+GLFVG
Sbjct: 195 CLKLDSPGCNSRKTCLYQVSYGDGSFTTGDFSTETLTFRGTRVGRVALGCGHNNQGLFVG 254

Query: 251 AAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLT 310
           AAGLLGLGRG LSFPSQTGR FN+KFSYCLVDRSASS+PSS+VFG+ AVSRTARFTPL+ 
Sbjct: 255 AAGLLGLGRGRLSFPSQTGRQFNRKFSYCLVDRSASSRPSSIVFGDPAVSRTARFTPLIA 314

Query: 311 NPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALR 370
           NP+LDTFYYVEL+GISVGGTPV GISAS FKLD TGNGGVIID GTSVTRL RPAY ALR
Sbjct: 315 NPKLDTFYYVELVGISVGGTPVPGISASFFKLDRTGNGGVIIDSGTSVTRLTRPAYNALR 374

Query: 371 DAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSG 430
           DAFR G SSLK A +FSLFDTCYDLSGKT VKVPTVVLHFRGADV LPA+NYLIPVD  G
Sbjct: 375 DAFRIGTSSLKRASDFSLFDTCYDLSGKTEVKVPTVVLHFRGADVPLPATNYLIPVDSDG 434

Query: 431 RFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
            FCFAFAGT SGLSI+GNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 435 TFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAGSRVGFSPRGCA 472

BLAST of Bhi01G000358 vs. TrEMBL
Match: tr|B9SBG8|B9SBG8_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis OX=3988 GN=RCOM_0717990 PE=3 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 2.6e-194
Identity = 349/461 (75.70%), Postives = 391/461 (84.82%), Query Frame = 0

Query: 11  IFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLGLTLHLHHLDA 70
           +FF  T+    +   ++QTL+   L S P+   +DSES   +DT ES    ++ LHH+DA
Sbjct: 12  LFFSFTIFFSHSTSLNYQTLVANPLRSQPTLSWTDSES--PTDTAESSATFSVQLHHVDA 71

Query: 71  LSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSGTGFSSSVISGLAQGSGEYFT 130
           LS N TPE LF  RLQRDA RV+ +S L +++    +  GTGFSSSVISGLAQGSGEYFT
Sbjct: 72  LSFNSTPETLFTTRLQRDAARVEAISYLAETA-GTGKRVGTGFSSSVISGLAQGSGEYFT 131

Query: 131 RIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRR 190
           RIGVGTPP+YVYMVLDTGSDIVW+QCAPCK CY+Q+DPVF+P KS SFA + CR+PLC R
Sbjct: 132 RIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCHR 191

Query: 191 LESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAA 250
           L+SPGCN Q+QTC+YQVSYGDGS+T G+F TETLTFRRT+V RVALGCGHDNEGLFVGAA
Sbjct: 192 LDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARVALGCGHDNEGLFVGAA 251

Query: 251 GLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTARFTPLLTNP 310
           GLLGLGRG LSFPSQTGR FN KFSYCLVDRSASSKPSS+VFG++AVSRTARFTPL++NP
Sbjct: 252 GLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNP 311

Query: 311 RLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDA 370
           +LDTFYYVELLGISVGGT V GI+AS FKLD TGNGGVIID GTSVTRL RPAYIA RDA
Sbjct: 312 KLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDA 371

Query: 371 FRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRF 430
           FRAGAS+LK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGADVSLPASNYLIPVD SG F
Sbjct: 372 FRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDTSGNF 431

Query: 431 CFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           C AFAGT  GLSIIGNIQQQGFRVVYDLA SRVGF+P GCA
Sbjct: 432 CLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469

BLAST of Bhi01G000358 vs. TrEMBL
Match: tr|A0A2P5AIW4|A0A2P5AIW4_PARAD (Aspartic peptidase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_327890 PE=3 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 2.6e-194
Identity = 352/470 (74.89%), Postives = 400/470 (85.11%), Query Frame = 0

Query: 8   FSFIFFLLTLLSLSTAISD---FQTLIPTSLPSSPSFLPSDSESFIS--SDTTESDLGLT 67
           F F  F    ++LSTA++D   +QTL+  +L + P+    +S+   S     +E++  L+
Sbjct: 10  FFFFSFSAIFVTLSTALTDPIQYQTLVVNTLSTPPTLSWPESQLSGSDPGPDSETESTLS 69

Query: 68  LHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSS--QNVSQTSGTGFSSSVISG 127
           L LHHLDALS +++PE+LF LRLQRDA+RVK L SL  S+    V   SG+GFSSSVISG
Sbjct: 70  LQLHHLDALSTDQSPEQLFDLRLQRDAMRVKSLYSLVASTNGSRVGYGSGSGFSSSVISG 129

Query: 128 LAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAK 187
           LAQGSGEYFTR+GVGTPP+YVYMVLDTGSD+VWLQCAPCK CY+Q DPVF+P KS SFA 
Sbjct: 130 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWLQCAPCKKCYTQADPVFDPAKSRSFAG 189

Query: 188 VLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHD 247
           + C +PLCR+L+SPGCNQR+ CLYQVSYGDGS+TTGEF TETLTFRRT+V RVALGCGHD
Sbjct: 190 IPCGSPLCRKLDSPGCNQRKQCLYQVSYGDGSFTTGEFSTETLTFRRTRVARVALGCGHD 249

Query: 248 NEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRTA 307
           NEGLFVGAAGLLGLGRG LSFPSQTG  FN+KFSYCLVDRSA+SKPSSVVFG++AVSRTA
Sbjct: 250 NEGLFVGAAGLLGLGRGRLSFPSQTGYRFNRKFSYCLVDRSATSKPSSVVFGDSAVSRTA 309

Query: 308 RFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNR 367
           RFTPLL NP+LDTFYY+EL+GISVGG  V GISA+ FKLD  GNGGVIID GTSVTRL R
Sbjct: 310 RFTPLLANPKLDTFYYLELVGISVGGARVPGISAALFKLDNAGNGGVIIDSGTSVTRLTR 369

Query: 368 PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYL 427
           PAY+ALRD+FRAGAS+LK APEFSLFDTCYDLSGK+ VKVPTVVLHFRGADVSLPA+NYL
Sbjct: 370 PAYLALRDSFRAGASNLKRAPEFSLFDTCYDLSGKSEVKVPTVVLHFRGADVSLPATNYL 429

Query: 428 IPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           IPVD SG FCFAFAGT SGLSIIGNIQQQGFRVVYDLA SRVGF+PRGCA
Sbjct: 430 IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAPRGCA 479

BLAST of Bhi01G000358 vs. NCBI nr
Match: XP_004133810.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus])

HSP 1 Score: 861.3 bits (2224), Expect = 1.5e-246
Identity = 436/471 (92.57%), Postives = 451/471 (95.75%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLG 60
           ME  T    FIFFLLT+LSL+TA SDFQTL  TSLPSSPSFLPSDS SF+SS+ T+S+LG
Sbjct: 1   MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 60

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSL  +S+N+S+  G TGFSSSVIS
Sbjct: 61  LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 120

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 240

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGN+AVSRT
Sbjct: 241 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 300

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 360

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 361 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
Sbjct: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471

BLAST of Bhi01G000358 vs. NCBI nr
Match: KGN56421.1 (Aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 861.3 bits (2224), Expect = 1.5e-246
Identity = 436/471 (92.57%), Postives = 451/471 (95.75%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDLG 60
           ME  T    FIFFLLT+LSL+TA SDFQTL  TSLPSSPSFLPSDS SF+SS+ T+S+LG
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 101

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSL  +S+N+S+  G TGFSSSVIS
Sbjct: 102 LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 161

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 162 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 221

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 222 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 281

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGN+AVSRT
Sbjct: 282 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 341

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 342 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 401

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 402 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 461

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
Sbjct: 462 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512

BLAST of Bhi01G000358 vs. NCBI nr
Match: XP_008437888.1 (PREDICTED: aspartyl protease family protein 2 [Cucumis melo])

HSP 1 Score: 860.1 bits (2221), Expect = 3.4e-246
Identity = 438/472 (92.80%), Postives = 452/472 (95.76%), Query Frame = 0

Query: 1   MEAMTKLFSFIFF-LLTLLSLSTAISDFQTLIPTSLPSSPSFLPSDSESFISSDTTESDL 60
           MEA T    FIFF LL +LSLSTA SDFQTLI  SLPSSPSFLPSDS SF+SS+ TE++L
Sbjct: 3   MEANTISLPFIFFLLLAILSLSTAFSDFQTLILRSLPSSPSFLPSDSNSFLSSEATETEL 62

Query: 61  GLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-TGFSSSVI 120
           GL LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSL  +S+N+S+ SG TGFSSSVI
Sbjct: 63  GLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSRPSGTTGFSSSVI 122

Query: 121 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 180
           SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF
Sbjct: 123 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 182

Query: 181 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCG 240
           AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCG
Sbjct: 183 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCG 242

Query: 241 HDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 300
           HDNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGN+AVSR
Sbjct: 243 HDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 302

Query: 301 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRL 360
           TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGIS+SHFKLD TGNGGVIIDCGTSVTRL
Sbjct: 303 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGGVIIDCGTSVTRL 362

Query: 361 NRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 420
           N+PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN
Sbjct: 363 NKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 422

Query: 421 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA
Sbjct: 423 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 474

BLAST of Bhi01G000358 vs. NCBI nr
Match: XP_022924595.1 (aspartyl protease family protein 2-like [Cucurbita moschata])

HSP 1 Score: 837.0 bits (2161), Expect = 3.1e-239
Identity = 427/474 (90.08%), Postives = 440/474 (92.83%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFLP----SDSESFISSDTTE 60
           M A T  F+FIF LLTLLSLSTA SDFQTL+P  LP+SPS L      DS+SF SS+ TE
Sbjct: 1   MVAKTSPFTFIFVLLTLLSLSTAFSDFQTLVPRPLPTSPSSLAPESNEDSDSFFSSEATE 60

Query: 61  SDLGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSGTGFSSS 120
           S+ GL LHLHHLD+LSL+RTPEELFHLRLQRDALRV KLS L   S NVS+ SGTGFSSS
Sbjct: 61  SEPGLALHLHHLDSLSLSRTPEELFHLRLQRDALRVNKLSLLAAVSPNVSRASGTGFSSS 120

Query: 121 VISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSG 180
           VISGLAQGSGEYFTRIGVGTPP+YVYMVLDTGSDIVWLQCAPCKNCYSQTDPVF+PVKSG
Sbjct: 121 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFDPVKSG 180

Query: 181 SFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240
           SF+KVLCRTPLC RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG
Sbjct: 181 SFSKVLCRTPLCGRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240

Query: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNAAV 300
           CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGN+AV
Sbjct: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV 300

Query: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVT 360
           SRTARFTPLLTNPRLDTFYYVELLGISVGG PVSGIS  HFKLD TGNGGVIIDCGTSVT
Sbjct: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGRPVSGISPLHFKLDSTGNGGVIIDCGTSVT 360

Query: 361 RLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPA 420
           RLNRPAYIALRDAFRAGASSLKSA EFSLFDTCYDLSGKTTVKVPTVVLHFR ADVSLPA
Sbjct: 361 RLNRPAYIALRDAFRAGASSLKSAAEFSLFDTCYDLSGKTTVKVPTVVLHFRNADVSLPA 420

Query: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 474

BLAST of Bhi01G000358 vs. NCBI nr
Match: XP_022945440.1 (aspartyl protease family protein 2-like [Cucurbita moschata])

HSP 1 Score: 832.8 bits (2150), Expect = 5.8e-238
Identity = 429/479 (89.56%), Postives = 441/479 (92.07%), Query Frame = 0

Query: 1   MEAMTKLFSFIFFLLTLLSLSTAISDFQTLIPTSLPSSPSFL----PSDSESFISSDTTE 60
           MEA T  F FI FLLTLLSLSTA SDFQTLIP SLP+SPS L     +DSESFISS+   
Sbjct: 1   MEAKTSAFPFIGFLLTLLSLSTAFSDFQTLIPKSLPASPSLLSPESATDSESFISSEA-- 60

Query: 61  SDLGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLHDSSQNVSQTSG-----T 120
              GL L LHHLDALSLNRTPEELFHLRLQRDALRV KLSSL   SQN+SQ SG     T
Sbjct: 61  ---GLELQLHHLDALSLNRTPEELFHLRLQRDALRVTKLSSLSGGSQNLSQASGTSHGTT 120

Query: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180
           GFSSSVISGLAQGSGEYFTRIGVGTPPKY+YMVLDTGSDIVWLQCAPCKNCYSQTDPVFN
Sbjct: 121 GFSSSVISGLAQGSGEYFTRIGVGTPPKYIYMVLDTGSDIVWLQCAPCKNCYSQTDPVFN 180

Query: 181 PVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240
           PVKS S++KVLCRTPLC RLESPGCNQ+QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE
Sbjct: 181 PVKSRSYSKVLCRTPLCLRLESPGCNQKQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 240

Query: 241 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVF 300
           RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQ GR+FNQKFSYCLVDRSASSKPSSVVF
Sbjct: 241 RVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRSFNQKFSYCLVDRSASSKPSSVVF 300

Query: 301 GNAAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDC 360
           G++AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDP GNGGVIIDC
Sbjct: 301 GDSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPNGNGGVIIDC 360

Query: 361 GTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420
           GTSVTRLNRPAYIALRDAFRAGASSLK APEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD
Sbjct: 361 GTSVTRLNRPAYIALRDAFRAGASSLKLAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 420

Query: 421 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
           VSLPASNYLIPVD +GRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 421 VSLPASNYLIPVDDNGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 474

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT1G01300.11.1e-17969.67Eukaryotic aspartyl protease family protein[more]
AT3G61820.13.5e-16864.64Eukaryotic aspartyl protease family protein[more]
AT1G25510.11.3e-11148.89Eukaryotic aspartyl protease family protein[more]
AT3G18490.11.1e-11048.94Eukaryotic aspartyl protease family protein[more]
AT3G20015.19.0e-10847.57Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q9LNJ3|APF2_ARATH2.1e-17869.67Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q9LS40|ASPG1_ARATH2.0e-10948.94Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LHE3|ASPG2_ARATH1.6e-10647.57Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q8S9J6|ASPA_ARATH8.1e-7439.24Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At... [more]
sp|Q9LEW3|AED1_ARATH6.9e-7339.09Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0L8K0|A0A0A0L8K0_CUCSA1.0e-24692.57Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_3G119540 PE=... [more]
tr|A0A1S3AV66|A0A1S3AV66_CUCME2.3e-24692.80aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103483183 PE=3 ... [more]
tr|A0A2I4EG53|A0A2I4EG53_9ROSI6.9e-19576.24aspartyl protease family protein 2 OS=Juglans regia OX=51240 GN=LOC108989274 PE=... [more]
tr|B9SBG8|B9SBG8_RICCO2.6e-19475.70Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis OX=3988 GN=RCOM_... [more]
tr|A0A2P5AIW4|A0A2P5AIW4_PARAD2.6e-19474.89Aspartic peptidase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_327890 PE=3 SV... [more]
Match NameE-valueIdentityDescription
XP_004133810.11.5e-24692.57PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus][more]
KGN56421.11.5e-24692.57Aspartic proteinase nepenthesin-1 [Cucumis sativus][more]
XP_008437888.13.4e-24692.80PREDICTED: aspartyl protease family protein 2 [Cucumis melo][more]
XP_022924595.13.1e-23990.08aspartyl protease family protein 2-like [Cucurbita moschata][more]
XP_022945440.15.8e-23889.56aspartyl protease family protein 2-like [Cucurbita moschata][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR033873CND41-like
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR032799TAXi_C
IPR032861TAXi_N
IPR021109Peptidase_aspartic_dom_sf
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030163 protein catabolic process
biological_process GO:0006508 proteolysis
biological_process GO:0080167 response to karrikin
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0009505 plant-type cell wall
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0008233 peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi01M000358Bhi01M000358mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 347..358
score: 39.97
coord: 134..154
score: 40.7
coord: 441..456
score: 26.83
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 7..469
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 101..289
e-value: 2.0E-52
score: 180.1
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 290..470
e-value: 1.6E-54
score: 186.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 121..469
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 128..293
e-value: 3.0E-55
score: 187.1
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 315..465
e-value: 1.9E-36
score: 125.2
NoneNo IPR availablePANTHERPTHR13683:SF308ASPARTYL PROTEASE-RELATEDcoord: 7..469
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 143..154
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 128..465
score: 46.64
IPR033873CND41-likeCDDcd05472cnd41_likecoord: 127..469
e-value: 2.72362E-136
score: 396.642