Bhi04G001952 (gene) Wax gourd (B227) v1

Overview
NameBhi04G001952
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
Descriptionaspartic proteinase PCS1
Locationchr4: 65352015 .. 65353874 (+)
RNA-Seq ExpressionBhi04G001952
SyntenyBhi04G001952
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTCGTTGTTCGTGTAAAAAATTTAAATGTGGGTGTCGCATTACATACCTCAAATTACAAAAATTATAAGAACGTAAATTAATAAGACAACATTTTTGCCTCTCTCTTAATGCACGTTTTCGTATTCAAAACGCACTGTTTTAACGACACACATAACCCACTGTTTCCTCTGTTATTTCAGTCCCAAACCCAACGAACAAATCCATACCATTTCAAACCCCTTCCCCCTTTTCCTTCACTTCTCTGTGATTTTAGATTCGATTCCATCACGAAATCATGGCCTTCTTCTTCCTCCTCCGTCTACTGCAACTTCTCATCTACGGTCTGTCTTTAAAACAGAGCCTCTGTTTTTCTGCAACTCAGATGACCCTGGTTTTGCCTCTCAAAACACAGATGGGTTTGATTTCTCAGCCTTCCAATAAGCTCAGTTTCCACCATAATGTCACTTTGACTGTTTCCTTAACTGTTGGCTCGCCTCCTCAACAAGTCACCATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAATCCCCAAATTTAACCTCTGTTTTTAACCCACTTTCCTCTTCTTCTTACTCACCAATCCCCTGTTCGTCCCCCATTTGCCGCACCCGAACCCGAGATTTACCCAACCCGGTTATCTGCGACCCGAAAAAGCTCTGCCACGTCTTTGTCTCTTACGCCGATGCATCGTCGCTCGAGGGCAACCTCGCATCGGACACGTTCCGAATCGGTTCATCGGCTCAACCCGGAACTTTATTCGGGTGTATGGATTCGGGTTTCAGTTCGAATTCAGAGGAGGACGCGAAGACCACTGGATTAATGGGTATGAACAGAGGATCGCTCTCGTTTGTTACGCAATTGGGTTTGCCGAAATTCTCTTATTGCATATCGGGTAGTGATTCTTCAGGGGTTCTTCTTTTCGGCGACGCACCTGTTTCGTGGCTTGGGAATTTAACCTACACGCCTTTAGTTCAAATCTCAACCCCATTACCCTATTACGACCGAGTCGCTTACACCGTCCAACTCGACGGAATCAGAGTAGGGAACAAAATCCTCCCACTCCCGAAATCAATCTTCGCACCAGATCACACCGGCGCCGGACAAACCATGGTGGATTCAGGTACCCAGTTCACGTTTCTTCTGGGACCAGTCTACACAGCTTTGAAAAACGAGTTTCTAGAGCAAACGAAAGGGGTTTTAATCCCACTGGGTGATCCAAACTTCGTGTTCCAAGGAGCAATGGACTTATGCTACAGAGTAGCAGCGAAAGAAGGGAAGCTACCGCCGTTACCGGCGGTGAGTCTGATGTTCCATGGGGCGGAGATGGTAGTCGGAGGGGAGGTTCTGCTGTACAGAGTACCGGGAATGATGAAGGGAAACGAGTTGGTGTATTGCCTAACATTTGGGAATTCGGATTTGTTAGGAATAGAAGCGTTTGTGATAGGGCATCATCATCAACAAAACGTGTGGATGGAATTTGATTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGACTTGGCAGGTCAAAGATTGGGATTGGGCCTTTAAAAAAAGCCCAATTTAAAAAAGGACTGTCCAATCCATGGCCCACCTTAAGGGAGGGGGTGGCCCATTGATCGGACAGAAAAAGTTTTTTTTTTTTTTTTTTATTTTTTTTATTGCTATGTGAGATTCTCTGAACAACAACATTGTGGACACGTCAGGTTCTTTTTTTTTTCTTTTTATTATTATTTTTACAATGTTTTGTTTATTGGGTTTGGTGTTGAGTCAGTGAGTTAGTACAAATTTGAATTGTATATAATAATGTATATTAAATATGACCAAACATTAGTGCTG

mRNA sequence

CCTCGTTGTTCGTGTAAAAAATTTAAATGTGGGTGTCGCATTACATACCTCAAATTACAAAAATTATAAGAACGTAAATTAATAAGACAACATTTTTGCCTCTCTCTTAATGCACGTTTTCGTATTCAAAACGCACTGTTTTAACGACACACATAACCCACTGTTTCCTCTGTTATTTCAGTCCCAAACCCAACGAACAAATCCATACCATTTCAAACCCCTTCCCCCTTTTCCTTCACTTCTCTGTGATTTTAGATTCGATTCCATCACGAAATCATGGCCTTCTTCTTCCTCCTCCGTCTACTGCAACTTCTCATCTACGGTCTGTCTTTAAAACAGAGCCTCTGTTTTTCTGCAACTCAGATGACCCTGGTTTTGCCTCTCAAAACACAGATGGGTTTGATTTCTCAGCCTTCCAATAAGCTCAGTTTCCACCATAATGTCACTTTGACTGTTTCCTTAACTGTTGGCTCGCCTCCTCAACAAGTCACCATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAATCCCCAAATTTAACCTCTGTTTTTAACCCACTTTCCTCTTCTTCTTACTCACCAATCCCCTGTTCGTCCCCCATTTGCCGCACCCGAACCCGAGATTTACCCAACCCGGTTATCTGCGACCCGAAAAAGCTCTGCCACGTCTTTGTCTCTTACGCCGATGCATCGTCGCTCGAGGGCAACCTCGCATCGGACACGTTCCGAATCGGTTCATCGGCTCAACCCGGAACTTTATTCGGGTGTATGGATTCGGGTTTCAGTTCGAATTCAGAGGAGGACGCGAAGACCACTGGATTAATGGGTATGAACAGAGGATCGCTCTCGTTTGTTACGCAATTGGGTTTGCCGAAATTCTCTTATTGCATATCGGGTAGTGATTCTTCAGGGGTTCTTCTTTTCGGCGACGCACCTGTTTCGTGGCTTGGGAATTTAACCTACACGCCTTTAGTTCAAATCTCAACCCCATTACCCTATTACGACCGAGTCGCTTACACCGTCCAACTCGACGGAATCAGAGTAGGGAACAAAATCCTCCCACTCCCGAAATCAATCTTCGCACCAGATCACACCGGCGCCGGACAAACCATGGTGGATTCAGGTACCCAGTTCACGTTTCTTCTGGGACCAGTCTACACAGCTTTGAAAAACGAGTTTCTAGAGCAAACGAAAGGGGTTTTAATCCCACTGGGTGATCCAAACTTCGTGTTCCAAGGAGCAATGGACTTATGCTACAGAGTAGCAGCGAAAGAAGGGAAGCTACCGCCGTTACCGGCGGTGAGTCTGATGTTCCATGGGGCGGAGATGGTAGTCGGAGGGGAGGTTCTGCTGTACAGAGTACCGGGAATGATGAAGGGAAACGAGTTGGTGTATTGCCTAACATTTGGGAATTCGGATTTGTTAGGAATAGAAGCGTTTGTGATAGGGCATCATCATCAACAAAACGTGTGGATGGAATTTGATTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGACTTGGCAGGTCAAAGATTGGGATTGGGCCTTTAAAAAAAGCCCAATTTAAAAAAGGACTGTCCAATCCATGGCCCACCTTAAGGGAGGGGGTGGCCCATTGATCGGACAGAAAAAGTTTTTTTTTTTTTTTTTTATTTTTTTTATTGCTATGTGAGATTCTCTGAACAACAACATTGTGGACACGTCAGGTTCTTTTTTTTTTCTTTTTATTATTATTTTTACAATGTTTTGTTTATTGGGTTTGGTGTTGAGTCAGTGAGTTAGTACAAATTTGAATTGTATATAATAATGTATATTAAATATGACCAAACATTAGTGCTG

Coding sequence (CDS)

ATGGCCTTCTTCTTCCTCCTCCGTCTACTGCAACTTCTCATCTACGGTCTGTCTTTAAAACAGAGCCTCTGTTTTTCTGCAACTCAGATGACCCTGGTTTTGCCTCTCAAAACACAGATGGGTTTGATTTCTCAGCCTTCCAATAAGCTCAGTTTCCACCATAATGTCACTTTGACTGTTTCCTTAACTGTTGGCTCGCCTCCTCAACAAGTCACCATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAATCCCCAAATTTAACCTCTGTTTTTAACCCACTTTCCTCTTCTTCTTACTCACCAATCCCCTGTTCGTCCCCCATTTGCCGCACCCGAACCCGAGATTTACCCAACCCGGTTATCTGCGACCCGAAAAAGCTCTGCCACGTCTTTGTCTCTTACGCCGATGCATCGTCGCTCGAGGGCAACCTCGCATCGGACACGTTCCGAATCGGTTCATCGGCTCAACCCGGAACTTTATTCGGGTGTATGGATTCGGGTTTCAGTTCGAATTCAGAGGAGGACGCGAAGACCACTGGATTAATGGGTATGAACAGAGGATCGCTCTCGTTTGTTACGCAATTGGGTTTGCCGAAATTCTCTTATTGCATATCGGGTAGTGATTCTTCAGGGGTTCTTCTTTTCGGCGACGCACCTGTTTCGTGGCTTGGGAATTTAACCTACACGCCTTTAGTTCAAATCTCAACCCCATTACCCTATTACGACCGAGTCGCTTACACCGTCCAACTCGACGGAATCAGAGTAGGGAACAAAATCCTCCCACTCCCGAAATCAATCTTCGCACCAGATCACACCGGCGCCGGACAAACCATGGTGGATTCAGGTACCCAGTTCACGTTTCTTCTGGGACCAGTCTACACAGCTTTGAAAAACGAGTTTCTAGAGCAAACGAAAGGGGTTTTAATCCCACTGGGTGATCCAAACTTCGTGTTCCAAGGAGCAATGGACTTATGCTACAGAGTAGCAGCGAAAGAAGGGAAGCTACCGCCGTTACCGGCGGTGAGTCTGATGTTCCATGGGGCGGAGATGGTAGTCGGAGGGGAGGTTCTGCTGTACAGAGTACCGGGAATGATGAAGGGAAACGAGTTGGTGTATTGCCTAACATTTGGGAATTCGGATTTGTTAGGAATAGAAGCGTTTGTGATAGGGCATCATCATCAACAAAACGTGTGGATGGAATTTGATTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGACTTGGCAGGTCAAAGATTGGGATTGGGCCTTTAA

Protein sequence

MAFFFLLRLLQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL
Homology
BLAST of Bhi04G001952 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 606.3 bits (1562), Expect = 1.9e-173
Identity = 314/429 (73.19%), Postives = 347/429 (80.89%), Query Frame = 0

Query: 5   FLLRLLQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTVSLTV 64
           FL   + LLI+ L+  ++   S+T  TL+  LKTQ  L    S+KLSF HNVTLTV+L V
Sbjct: 16  FLRISVLLLIFPLTFCKT---SSTNQTLLFSLKTQK-LPQSSSDKLSFRHNVTLTVTLAV 75

Query: 65  GSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNP 124
           G PPQ ++MVLDTGSELSWLHCKKSPNL SVFNP+SSS+YSP+PCSSPICRTRTRDLP P
Sbjct: 76  GDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTRDLPIP 135

Query: 125 VICDPK-KLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEEDAKT 184
             CDPK  LCHV +SYADA+S+EGNLA +TF IGS  +PGTLFGCMDSG SSNSEEDAK+
Sbjct: 136 ASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKS 195

Query: 185 TGLMGMNRGSLSFVTQLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQISTPL 244
           TGLMGMNRGSLSFV QLG  KFSYCISGSDSSG LL GDA  SWLG + YTPLV  STPL
Sbjct: 196 TGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPL 255

Query: 245 PYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 304
           PY+DRVAYTVQL+GIRVG+KIL LPKS+F PDHTGAGQTMVDSGTQFTFL+GPVYTALKN
Sbjct: 256 PYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKN 315

Query: 305 EFLEQTKGVLIPLGDPNFVFQGAMDLCYRV-AAKEGKLPPLPAVSLMFHGAEMVVGGEVL 364
           EF+ QTK VL  + DP+FVFQG MDLCY+V +        LP VSLMF GAEM V G+ L
Sbjct: 316 EFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKL 375

Query: 365 LYRVPGM-MKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFV-ETRC 424
           LYRV G   +G E VYC TFGNSDLLGIEAFVIGHHHQQNVWMEFDL KSRVGF    RC
Sbjct: 376 LYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 435

Query: 425 DLAGQRLGL 430
           DLA QRLGL
Sbjct: 436 DLASQRLGL 440

BLAST of Bhi04G001952 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 542.3 bits (1396), Expect = 3.4e-154
Identity = 265/417 (63.55%), Postives = 322/417 (77.22%), Query Frame = 0

Query: 22  SLCFSATQMTLVLPLKTQMGLIS-QPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSE 81
           S   S++  TLVLPLKT++     +P++KL FHHNVTLTV+LTVG+PPQ ++MV+DTGSE
Sbjct: 36  SFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSE 95

Query: 82  LSWLHCKKS--PNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVS 141
           LSWL C +S  PN  + F+P  SSSYSPIPCSSP CRTRTRD   P  CD  KLCH  +S
Sbjct: 96  LSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLS 155

Query: 142 YADASSLEGNLASDTFRIGSSAQPGTL-FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFV 201
           YADASS EGNLA++ F  G+S     L FGCM S   S+ EED KTTGL+GMNRGSLSF+
Sbjct: 156 YADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFI 215

Query: 202 TQLGLPKFSYCISGSDS-SGVLLFGDAPVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLD 261
           +Q+G PKFSYCISG+D   G LL GD+  +WL  L YTPL++ISTPLPY+DRVAYTVQL 
Sbjct: 216 SQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLT 275

Query: 262 GIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPL 321
           GI+V  K+LP+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ FL +T G+L   
Sbjct: 276 GIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVY 335

Query: 322 GDPNFVFQGAMDLCYRVA---AKEGKLPPLPAVSLMFHGAEMVVGGEVLLYRVPGMMKGN 381
            DP+FVFQG MDLCYR++    + G L  LP VSL+F GAE+ V G+ LLYRVP +  GN
Sbjct: 336 EDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGN 395

Query: 382 ELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLG 431
           + VYC TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +SR+G     CD++GQRLG+G
Sbjct: 396 DSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGIG 452

BLAST of Bhi04G001952 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 242.7 bits (618), Expect = 5.6e-64
Identity = 150/403 (37.22%), Postives = 212/403 (52.61%), Query Frame = 0

Query: 44  SQPSNKLSFHHNV----TLTVSLTVGSPPQQVTMVLDTGSELSWLHC------KKSPNLT 103
           S PS+  +F  N+     L +SL +G+P Q   +VLDTGS+LSW+ C      K  P  T
Sbjct: 62  SPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPT 121

Query: 104 SVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDT 163
           + F+P  SSS+S +PCS P+C+ R  D   P  CD  +LCH    YAD +  EGNL  + 
Sbjct: 122 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEK 181

Query: 164 FRIGSS-AQPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI--- 223
           F   +S   P  + GC        ++E     G++GMN G LSF++Q  + KFSYCI   
Sbjct: 182 FTFSNSQTTPPLILGC--------AKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTR 241

Query: 224 ---SGSDSSGVLLFGDAPVSWLGNLTYTPLVQI--STPLPYYDRVAYTVQLDGIRVGNKI 283
               G  S+G    GD P S      Y  L+    S  +P  D +AYTV L GIR+G K 
Sbjct: 242 SNRPGLASTGSFYLGDNPNS--RGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKR 301

Query: 284 LPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQ 343
           L +P S+F PD  G+GQTMVDSG++FT L+   Y  +K E +      L       +V+ 
Sbjct: 302 LNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRL----KKGYVYG 361

Query: 344 GAMDLCY--RVAAKEGKLPPLPAVSLMF---HGAEMVVGGEVLLYRVPGMMKGNELVYCL 403
              D+C+    + + G+L       L+F    G E++V  + LL  V G       ++C+
Sbjct: 362 STADMCFDGNHSMEIGRL----IGDLVFEFGRGVEILVEKQSLLVNVGGG------IHCV 421

Query: 404 TFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDL 423
             G S +LG  + +IG+ HQQN+W+EFD+   RVGF +  C L
Sbjct: 422 GIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECRL 440

BLAST of Bhi04G001952 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 233.0 bits (593), Expect = 4.4e-61
Identity = 156/442 (35.29%), Postives = 225/442 (50.90%), Query Frame = 0

Query: 3   FFFLLRL------LQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSN-KLSFHHN 62
           FFF L        L L +   SL  S   ++ + T  L  +      S P N +  F ++
Sbjct: 10  FFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSRFKYS 69

Query: 63  VTLTVSLTVGSPPQQVTMVLDTGSELSWLHC---KKSPNLTSVFNPLSSSSYSPIPCSSP 122
           + L +SL +G+PPQ   MVLDTGS+LSW+ C   K  P   + F+P  SSS+S +PCS P
Sbjct: 70  MALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHP 129

Query: 123 ICRTRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSS-AQPGTLFGCMDS 182
           +C+ R  D   P  CD  +LCH    YAD +  EGNL  +     ++   P  + GC   
Sbjct: 130 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGC--- 189

Query: 183 GFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI------SGSDSSGVLLFGDAPV 242
                + E +   G++GMNRG LSFV+Q  + KFSYCI       G   +G    GD P 
Sbjct: 190 -----ATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPN 249

Query: 243 SWLGNLTYTPLVQI--STPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 302
           S      Y  L+    S  +P  D +AYTV + GIR G K L +  S+F PD  G+GQTM
Sbjct: 250 S--HGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTM 309

Query: 303 VDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPL 362
           VDSG++FT L+   Y  ++ E + +    L       +V+ G  D+C+     +G +  +
Sbjct: 310 VDSGSEFTHLVDAAYDKVRAEIMTRVGRRL----KKGYVYGGTADMCF-----DGNVAMI 369

Query: 363 P-----AVSLMFHGAEMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHH 421
           P      V +   G E++V  E +L  V G       ++C+  G S +LG  + +IG+ H
Sbjct: 370 PRLIGDLVFVFTRGVEILVPKERVLVNVGGG------IHCVGIGRSSMLGAASNIIGNVH 426

BLAST of Bhi04G001952 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 158.3 bits (399), Expect = 1.4e-38
Identity = 122/377 (32.36%), Postives = 180/377 (47.75%), Query Frame = 0

Query: 60  VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNL----TSVFNPLSSSSYSPIPCSSPICR 119
           + L++G+P  + + ++DTGS+L W  CK         T +F+P  SSSYS + CSS +C 
Sbjct: 109 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 168

Query: 120 TRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRI-GSSAQPGTLFGCMDSGFS 179
                LP     + K  C    +Y D SS  G LA++TF     ++  G  FGC   G  
Sbjct: 169 A----LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGC---GVE 228

Query: 180 SNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYC---ISGSDSSGVLLFGD--------A 239
           +  +  ++ +GL+G+ RG LS ++QL   KFSYC   I  S++S  L  G          
Sbjct: 229 NEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKT 288

Query: 240 PVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 299
             S  G +T T  +  +   P +    Y ++L GI VG K L + KS F     G G  +
Sbjct: 289 GASLDGEVTKTMSLLRNPDQPSF----YYLELQGITVGAKRLSVEKSTFELAEDGTGGMI 348

Query: 300 VDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPL 359
           +DSGT  T+L    +  LK EF   T  + +P+ D        +DLC+++     K   +
Sbjct: 349 IDSGTTITYLEETAFKVLKEEF---TSRMSLPVDDSG---STGLDLCFKL-PDAAKNIAV 408

Query: 360 PAVSLMFHGAEMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVW 419
           P +   F GA++ + GE   Y V     G   V CL  G+S+ + I     G+  QQN  
Sbjct: 409 PKMIFHFKGADLELPGE--NYMVADSSTG---VLCLAMGSSNGMSI----FGNVQQQNFN 458

Query: 420 MEFDLVKSRVGFVETRC 421
           +  DL K  V FV T C
Sbjct: 469 VLHDLEKETVSFVPTEC 458

BLAST of Bhi04G001952 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 542.3 bits (1396), Expect = 4.8e-153
Identity = 265/417 (63.55%), Postives = 322/417 (77.22%), Query Frame = 0

Query: 22  SLCFSATQMTLVLPLKTQMGLIS-QPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSE 81
           S   S++  TLVLPLKT++     +P++KL FHHNVTLTV+LTVG+PPQ ++MV+DTGSE
Sbjct: 36  SFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSE 95

Query: 82  LSWLHCKKS--PNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVS 141
           LSWL C +S  PN  + F+P  SSSYSPIPCSSP CRTRTRD   P  CD  KLCH  +S
Sbjct: 96  LSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLS 155

Query: 142 YADASSLEGNLASDTFRIGSSAQPGTL-FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFV 201
           YADASS EGNLA++ F  G+S     L FGCM S   S+ EED KTTGL+GMNRGSLSF+
Sbjct: 156 YADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFI 215

Query: 202 TQLGLPKFSYCISGSDS-SGVLLFGDAPVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLD 261
           +Q+G PKFSYCISG+D   G LL GD+  +WL  L YTPL++ISTPLPY+DRVAYTVQL 
Sbjct: 216 SQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLT 275

Query: 262 GIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPL 321
           GI+V  K+LP+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ FL +T G+L   
Sbjct: 276 GIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVY 335

Query: 322 GDPNFVFQGAMDLCYRVA---AKEGKLPPLPAVSLMFHGAEMVVGGEVLLYRVPGMMKGN 381
            DP+FVFQG MDLCYR++    + G L  LP VSL+F GAE+ V G+ LLYRVP +  GN
Sbjct: 336 EDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGN 395

Query: 382 ELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLG 431
           + VYC TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +SR+G     CD++GQRLG+G
Sbjct: 396 DSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGIG 452

BLAST of Bhi04G001952 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 5.5e-40
Identity = 115/367 (31.34%), Postives = 174/367 (47.41%), Query Frame = 0

Query: 60  VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNL----TSVFNPLSSSSYSPIPCSSPICR 119
           +++ +G+P    + ++DTGS+L W  C+         T +FNP  SSS+S +PC S  C 
Sbjct: 98  MNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC- 157

Query: 120 TRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSS 179
              +DLP+   C+  + C     Y D S+ +G +A++TF   +S+ P   FGC   G  +
Sbjct: 158 ---QDLPSET-CNNNE-CQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGC---GEDN 217

Query: 180 NSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GSDSSGVLLFGDAPVSWLGNLTY 239
                    GL+GM  G LS  +QLG+ +FSYC++  GS S   L  G A          
Sbjct: 218 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 277

Query: 240 TPLVQISTPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFL 299
           T L+  S    Y     Y + L GI VG   L +P S F     G G  ++DSGT  T+L
Sbjct: 278 TTLIHSSLNPTY-----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 337

Query: 300 LGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGA 359
               Y A+   F +Q   + +P  D +      +  C++    +G    +P +S+ F G 
Sbjct: 338 PQDAYNAVAQAFTDQ---INLPTVDES---SSGLSTCFQ-QPSDGSTVQVPEISMQFDGG 397

Query: 360 EMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRV 419
            + +G + +L      +   E V CL  G+S  LGI  F  G+  QQ   + +DL    V
Sbjct: 398 VLNLGEQNIL------ISPAEGVICLAMGSSSQLGISIF--GNIQQQETQVLYDLQNLAV 435

Query: 420 GFVETRC 421
            FV T+C
Sbjct: 458 SFVPTQC 435

BLAST of Bhi04G001952 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.0e-38
Identity = 112/368 (30.43%), Postives = 183/368 (49.73%), Query Frame = 0

Query: 60  VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNL----TSVFNPLSSSSYSPIPCSSPICR 119
           ++L++G+P Q  + ++DTGS+L W  C+         T +FNP  SSS+S +PCSS +C+
Sbjct: 97  MNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQ 156

Query: 120 TRTRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSS 179
                L +P   +    C     Y D S  +G++ ++T   GS + P   FGC   G ++
Sbjct: 157 A----LSSPTCSN--NFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC---GENN 216

Query: 180 NSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GSDSSGVLLFGDAPVSWLGNLTY 239
                    GL+GM RG LS  +QL + KFSYC++  GS +   LL G    S       
Sbjct: 217 QGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPN 276

Query: 240 TPLVQISTPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFA-PDHTGAGQTMVDSGTQFTF 299
           T L+Q S+ +P +    Y + L+G+ VG+  LP+  S FA   + G G  ++DSGT  T+
Sbjct: 277 TTLIQ-SSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 336

Query: 300 LLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHG 359
            +   Y +++ EF+ Q   + +P+ + +       DLC++  +    L  +P   + F G
Sbjct: 337 FVNNAYQSVRQEFISQ---INLPVVNGS---SSGFDLCFQTPSDPSNL-QIPTFVMHFDG 396

Query: 360 AEMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSR 419
            ++ +  E        +   N L+ CL  G+S   G+  F  G+  QQN+ + +D   S 
Sbjct: 397 GDLELPSENYF-----ISPSNGLI-CLAMGSSS-QGMSIF--GNIQQQNMLVVYDTGNSV 434

Query: 420 VGFVETRC 421
           V F   +C
Sbjct: 457 VSFASAQC 434

BLAST of Bhi04G001952 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 8.5e-33
Identity = 123/371 (33.15%), Postives = 171/371 (46.09%), Query Frame = 0

Query: 62  LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPICRTR 121
           L VG+P + V MVLDTGS++ WL C       S    +F+P  S +Y+ IPCSSP CR  
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRR- 205

Query: 122 TRDLPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNS 181
              L +      +K C   VSY D S   G+ +++T     +   G   GC       N 
Sbjct: 206 ---LDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGC----GHDNE 265

Query: 182 EEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCI---SGSDSSGVLLFGDAPVSWLGNL 241
                  GL+G+ +G LSF  Q G     KFSYC+   S S     ++FG+A VS +   
Sbjct: 266 GLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIAR- 325

Query: 242 TYTPLVQISTPLPYYDRVAYTVQLDGIRVGNKILP-LPKSIFAPDHTGAGQTMVDSGTQF 301
            +TPL+      P  D   Y V L GI VG   +P +  S+F  D  G G  ++DSGT  
Sbjct: 326 -FTPLLS----NPKLD-TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSV 385

Query: 302 TFLLGPVYTALKNEFLEQTKGVLIPLGDPNF-VFQGAMDLCYRVAAKEGKLPPLPAVSLM 361
           T L+ P Y A+++ F    K +      P+F +F    DL      K      +P V L 
Sbjct: 386 TRLIRPAYIAMRDAFRVGAKTL---KRAPDFSLFDTCFDLSNMNEVK------VPTVVLH 445

Query: 362 FHGAEMVVGGEVLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLV 421
           F GA+  V      Y +P    G    +C  F  + + G+   +IG+  QQ   + +DL 
Sbjct: 446 FRGAD--VSLPATNYLIPVDTNGK---FCFAFAGT-MGGLS--IIGNIQQQGFRVVYDLA 484

BLAST of Bhi04G001952 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 7.9e-31
Identity = 124/377 (32.89%), Postives = 181/377 (48.01%), Query Frame = 0

Query: 64  VGSPPQQVTMVLDTGSELSWLHCKKSPNL--TSVFNPLSSSSYSPIPCSSPICRT-RTRD 123
           +GSP QQ+ + LDT ++ +W HC        +S+F P +SSSY+ +PCSS  C   + + 
Sbjct: 85  LGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQA 144

Query: 124 LPNP-----VICDPKKL--CHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGF 183
            P P         P  L  C     +ADA S +  LASDT R+G  A P   FGC+ S  
Sbjct: 145 CPAPQGGGDAAPPPATLPTCAFSKPFADA-SFQAALASDTLRLGKDAIPNYTFGCVSS-- 204

Query: 184 SSNSEEDAKTTGLMGMNRGSLSFVTQLGL---PKFSYCISGSDS---SGVLLFGDAPVSW 243
            +    +    GL+G+ RG ++ ++Q G      FSYC+    S   SG L  G A    
Sbjct: 205 VTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLG-AGGGQ 264

Query: 244 LGNLTYTPLVQISTPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPD-HTGAGQTMVDS 303
             ++ YTP+++     P+   + Y V + G+ VG+  + +P   FA D  TGAG T+VDS
Sbjct: 265 PRSVRYTPMLR----NPHRSSL-YYVNVTGLSVGHAWVKVPAGSFAFDAATGAG-TVVDS 324

Query: 304 GTQFTFLLGPVYTALKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAV 363
           GT  T    PVY AL+ EF  Q   V  P G   +   GA D C+     E      PAV
Sbjct: 325 GTVITRWTAPVYAALREEFRRQ---VAAPSG---YTSLGAFDTCFN--TDEVAAGGAPAV 384

Query: 364 SL-MFHGAEMVVGGEVLLYRVPGMMKGNELVYCLTFGNS-DLLGIEAFVIGHHHQQNVWM 422
           ++ M  G ++ +  E  L     +      + CL    +   +     VI +  QQN+ +
Sbjct: 385 TVHMDGGVDLALPMENTL-----IHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRV 438

BLAST of Bhi04G001952 vs. ExPASy TrEMBL
Match: A0A1S3B3Y8 (aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103485512 PE=3 SV=1)

HSP 1 Score: 790.0 bits (2039), Expect = 4.9e-225
Identity = 394/431 (91.42%), Postives = 405/431 (93.97%), Query Frame = 0

Query: 1   MAFFFLLRLLQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTV 60
           M+FFF       L+  LS+KQSLCFSAT  T+VLPL+TQMGLISQPSNKLSFHHNVTLTV
Sbjct: 1   MSFFF---FFFFLVVSLSMKQSLCFSATPTTMVLPLQTQMGLISQPSNKLSFHHNVTLTV 60

Query: 61  SLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRD 120
           SLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSP+CRTRTRD
Sbjct: 61  SLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPVCRTRTRD 120

Query: 121 LPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEED 180
           LPNPV CDPKKLCH  VSYADASSLEGNLASD FRIGSSA PGTLFGCMDSGFSSNSEED
Sbjct: 121 LPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEED 180

Query: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQIS 240
           AKTTGLMGMNRGSLSFVTQLGLPKFSYCISG DSSGVLLFGD+ +SWLGNLTYTPLVQIS
Sbjct: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQIS 240

Query: 241 TPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300
           TPLPY+DRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA
Sbjct: 241 TPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300

Query: 301 LKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGE 360
           L+NEFLEQTKGVL PLGDPNFVFQGAMDLCYRV A  GKLP LPAVSLMF GAEMVVGGE
Sbjct: 301 LRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPA-GGKLPELPAVSLMFRGAEMVVGGE 360

Query: 361 VLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420
           VLLY+VPGMMKG E VYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
Sbjct: 361 VLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420

Query: 421 DLAGQRLGLGL 432
           DLAGQRLGLGL
Sbjct: 421 DLAGQRLGLGL 427

BLAST of Bhi04G001952 vs. ExPASy TrEMBL
Match: A0A5D3DLY6 (Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G00670 PE=3 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 7.0e-224
Identity = 387/413 (93.70%), Postives = 396/413 (95.88%), Query Frame = 0

Query: 19  LKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG 78
           +KQSLCFSAT  T+VLPL+TQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG
Sbjct: 1   MKQSLCFSATPTTMVLPLQTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG 60

Query: 79  SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVS 138
           SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSP+CRTRTRDLPNPV CDPKKLCH  VS
Sbjct: 61  SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVS 120

Query: 139 YADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT 198
           YADASSLEGNLASD FRIGSSA PGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT
Sbjct: 121 YADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT 180

Query: 199 QLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLDGI 258
           QLGLPKFSYCISG DSSGVLLFGD+ +SWLGNLTYTPLVQISTPLPY+DRVAYTVQLDGI
Sbjct: 181 QLGLPKFSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGI 240

Query: 259 RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGD 318
           RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEFLEQTKGVL PLGD
Sbjct: 241 RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGD 300

Query: 319 PNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGEVLLYRVPGMMKGNELVYC 378
           PNFVFQGAMDLCYRV A  GKLP LPAVSLMF GAEMVVGGEVLLY+VPGMMKG E VYC
Sbjct: 301 PNFVFQGAMDLCYRVPA-GGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYC 360

Query: 379 LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL 432
           LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL
Sbjct: 361 LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL 412

BLAST of Bhi04G001952 vs. ExPASy TrEMBL
Match: E5GC71 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 7.0e-224
Identity = 387/413 (93.70%), Postives = 396/413 (95.88%), Query Frame = 0

Query: 19  LKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG 78
           +KQSLCFSAT  T+VLPL+TQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG
Sbjct: 1   MKQSLCFSATPTTMVLPLQTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTG 60

Query: 79  SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVICDPKKLCHVFVS 138
           SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSP+CRTRTRDLPNPV CDPKKLCH  VS
Sbjct: 61  SELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVS 120

Query: 139 YADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT 198
           YADASSLEGNLASD FRIGSSA PGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT
Sbjct: 121 YADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVT 180

Query: 199 QLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQISTPLPYYDRVAYTVQLDGI 258
           QLGLPKFSYCISG DSSGVLLFGD+ +SWLGNLTYTPLVQISTPLPY+DRVAYTVQLDGI
Sbjct: 181 QLGLPKFSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGI 240

Query: 259 RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLEQTKGVLIPLGD 318
           RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEFLEQTKGVL PLGD
Sbjct: 241 RVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGD 300

Query: 319 PNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGEVLLYRVPGMMKGNELVYC 378
           PNFVFQGAMDLCYRV A  GKLP LPAVSLMF GAEMVVGGEVLLY+VPGMMKG E VYC
Sbjct: 301 PNFVFQGAMDLCYRVPA-GGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYC 360

Query: 379 LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL 432
           LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL
Sbjct: 361 LTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGLGL 412

BLAST of Bhi04G001952 vs. ExPASy TrEMBL
Match: A0A6J1FDS6 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 9.2e-224
Identity = 385/430 (89.53%), Postives = 408/430 (94.88%), Query Frame = 0

Query: 1   MAFFFLLRLLQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTV 60
           MAFF  LRLLQLLI  +S KQ LCFSATQ T+VLPLKTQMG+ S+PSNKLSFHHNVTLTV
Sbjct: 1   MAFF--LRLLQLLICCVSFKQGLCFSATQ-TMVLPLKTQMGVTSRPSNKLSFHHNVTLTV 60

Query: 61  SLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRD 120
           SLT+GSPPQ VTMVLDTGSELSWLHCKK+PNL SVFNPLSSSSYSP+PC+SP+CRTRTRD
Sbjct: 61  SLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRD 120

Query: 121 LPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEED 180
           LPNPV CDPKKLCHVFVSYADASSLEGNLASDTFR+GSSAQPGT FGCMDSGFSSNSEED
Sbjct: 121 LPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEED 180

Query: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQIS 240
           AKTTGLMGMNRGSLSFVTQLGLPKFSYCISG DSSGVLLFGDA +SWLGNLTYTPLVQ+S
Sbjct: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMS 240

Query: 241 TPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300
           TPLPYYDRVAYTVQLDGIRVGNKIL LPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA
Sbjct: 241 TPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300

Query: 301 LKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGE 360
           LKNEF+ QTKG+L+PLGDPNFVFQGAMDLCYRV  K+GKLPPLP VSLMF GAEMVVGGE
Sbjct: 301 LKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGE 360

Query: 361 VLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420
           VL+Y+VPGM++G + V+CLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
Sbjct: 361 VLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420

Query: 421 DLAGQRLGLG 431
           DLAG+RLGLG
Sbjct: 421 DLAGERLGLG 427

BLAST of Bhi04G001952 vs. ExPASy TrEMBL
Match: A0A6J1JZR1 (aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111488906 PE=3 SV=1)

HSP 1 Score: 784.6 bits (2025), Expect = 2.0e-223
Identity = 385/430 (89.53%), Postives = 407/430 (94.65%), Query Frame = 0

Query: 1   MAFFFLLRLLQLLIYGLSLKQSLCFSATQMTLVLPLKTQMGLISQPSNKLSFHHNVTLTV 60
           MAFF  LRLL LLI  +S KQSLCFSA Q T+VLPLKTQMG+ SQPSNKLSFHHNVTLTV
Sbjct: 1   MAFF--LRLLHLLICCVSFKQSLCFSAIQ-TMVLPLKTQMGVTSQPSNKLSFHHNVTLTV 60

Query: 61  SLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTRD 120
           SLT+GSPPQ VTMVLDTGSELSWLHCKK+PNL SVFNPLSSSSYSP+PC+SP+CRTRTRD
Sbjct: 61  SLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRD 120

Query: 121 LPNPVICDPKKLCHVFVSYADASSLEGNLASDTFRIGSSAQPGTLFGCMDSGFSSNSEED 180
           LPNPV CDPKKLCHVFVSYADASSLEGNLASDTFR+GSSAQPGT FGCMDSGFSSNSEED
Sbjct: 121 LPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEED 180

Query: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGSDSSGVLLFGDAPVSWLGNLTYTPLVQIS 240
           AKTTGLMGMNRGSLSFVTQLGLPKFSYCISG DSSGVLLFGDA +SWLGNLTYTPLVQ+S
Sbjct: 181 AKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMS 240

Query: 241 TPLPYYDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300
           TPLPYYDRVAYTVQLDGIRVGNKIL LPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA
Sbjct: 241 TPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 300

Query: 301 LKNEFLEQTKGVLIPLGDPNFVFQGAMDLCYRVAAKEGKLPPLPAVSLMFHGAEMVVGGE 360
           LKNEF+ QTKG+L+PLGDPNFVFQGAMDLCYRV  K+GKLPPLP VSLMF GAEMVVGGE
Sbjct: 301 LKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGE 360

Query: 361 VLLYRVPGMMKGNELVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420
           VL+YRVPGM++G + V+C+TFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC
Sbjct: 361 VLMYRVPGMVRGGDQVHCVTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 420

Query: 421 DLAGQRLGLG 431
           DLAG+RLGLG
Sbjct: 421 DLAGERLGLG 427

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT2G39710.11.9e-17373.19Eukaryotic aspartyl protease family protein [more]
AT5G02190.13.4e-15463.55Eukaryotic aspartyl protease family protein [more]
AT5G37540.15.6e-6437.22Eukaryotic aspartyl protease family protein [more]
AT1G66180.14.4e-6135.29Eukaryotic aspartyl protease family protein [more]
AT2G03200.11.4e-3832.36Eukaryotic aspartyl protease family protein [more]
Match NameE-valueIdentityDescription
Q9LZL34.8e-15363.55Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
Q766C25.5e-4031.34Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C31.0e-3830.43Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LNJ38.5e-3333.15Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q6F4N57.9e-3132.89Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3B3Y84.9e-22591.42aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103485512 PE=3 SV=1[more]
A0A5D3DLY67.0e-22493.70Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol... [more]
E5GC717.0e-22493.70Aspartic proteinase nepenthesin-1 OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=... [more]
A0A6J1FDS69.2e-22489.53aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3... [more]
A0A6J1JZR12.0e-22389.53aspartic proteinase PCS1-like OS=Cucurbita maxima OX=3661 GN=LOC111488906 PE=3 S... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 226..423
e-value: 1.1E-42
score: 147.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 51..221
e-value: 1.3E-36
score: 128.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 58..420
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 250..416
e-value: 5.8E-36
score: 123.7
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 60..222
e-value: 3.3E-39
score: 134.9
NoneNo IPR availablePANTHERPTHR47965:SF49EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 10..429
IPR001461Aspartic peptidase A1 familyPANTHERPTHR47965ASPARTYL PROTEASE-RELATEDcoord: 10..429
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 73..84
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 58..416
score: 35.619724
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 57..420
e-value: 8.37736E-72
score: 225.22

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M001952Bhi04M001952mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity