Bhi04G000762 (gene) Wax gourd (B227) v1

Overview
NameBhi04G000762
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionEukaryotic aspartyl protease family protein
Locationchr4: 23458119 .. 23460746 (-)
RNA-Seq ExpressionBhi04G000762
SyntenyBhi04G000762
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTAGCCATTACACCAAAGAAATTAATTGCAACCCCATTTTCGTAATTTCACAGCCAATCTTGGTATAAATGCACCCCATCTCCATGCAAAACTCCGGTAACCGGTAACCTTCAAATAATTTCCCTGCCCACTTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAATCATGGCCCTCAAAATTTTTTTCTTCTTCTTTTTAGCCCTTCTGGTACTAAATACCAATGCTTCCGATCTCTGCGCCTCCGGATCGGATGGCGATCTCTCCGTCATCCCCATCTATGGCAAATGCTCGCCGTTCACGGCTCCAAAGTCAGAATCTTGGGTGAACACGGTGATAGATATGGCTTCAAAGGACCCGGCCCGAATTAAATACTTGTCAACCCTCGCCGCCCAGAAGACGGTGGCAGCTCCGATAGCCTCCGGGCAGCAGGTTCTTAATATTGGGAATTATGTCGTGAGAGTTCAACTTGGTACTCCTGGTCAAGTAATGTACATGGTTCTTGATACCAGTAATGACGCCGCCTGGGCTCCGTGCTCCGGCTGCACCGGTTGCTCCGCCACCACTTTTTCGTCTAATTATTCCTCAACTTTTGCGACTTTAGACTGCTCTAAACCGCAGTGTAGTCAGGTTTGTTAAATCACGGCTCAACTCAAAAAGTTTAAAGGGGTAGTTACTCTATTTTTTTTTTTTTTATTATTTTGTCAGATTTTAAAAATATCAATTTGTCTCTAAAATTGTTTGTAATTTTGATTTCTTTTAGAAAAATACCTTTTTGTCTCCTATAGTTTCTGCTCTTAAATTATGAGTTTATTAATTTCACGTGGTTCTTAAATTTTAAAATATTACATTTTTATCCAATAATTTTGAATTCAATTTTCATTTAATGTTTAGGTTTTAGTATATTTACAGTTTTACACTCGAATGTTTGCTAATGTGGGCTTTGGTGTTTGGCATTATGTTTCAGTTAATTAATTTAAAATAATTATAGTGAAATTTAAACTGAGTTAGATGAAAAGTTAGTGAAGAATATAGGTATTCAAATGAGTATTTAATAAAAAATCAAAAGTTCAAAATATAAATCCTCAAAACTTAGGAAAGAAATAAAAACAAAACTCATTGTAAAAATGTAACCAAGAAACTCACTCAATTCGAAATAGTATTTACAAATTTAATTTTTAAAAATTAAAATAGAAAACCAAAAGATGTTAGTTGGTAGGTAACTATTATAACCCCAAGATAATTGCAAAAATTTAATTACTTTTTCTAAATTCAAATTTAATTTTAAAAGATTAAGATATGAAAATGTAAAGTTTAGAGTCAAATAGAAAAACAAAAACTTTTGTTTTATTGGCAGCAATTAAATTGTTACGGTAAATTCAATTTCTTTATTCATTTTCAACAGGCTCGGGGTCTTTCCTGTCCGACCACCGGCAACGCCGATTGCCTATTCAATCAAACATACGGCGGCGACTCATCGTTTTCAGCCACCTTGGTTCAAGACTCTCTCCATTTAGGCACTGACGTCATTCCAAATTTCTCATTCGGCTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCCTAATGGGTCTCGGCCGTGGTCCCCTCTCCCTCATCTCTCAATCCGGTTCACTCTACTCCGGTCTATTCTCCTACTGCCTCCCAAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGACCGGCCGGTCAACCAAAATTAATCCGAACCACCCCACTCCTCCACAACCCACACCGACCATCGCTCTATTATGTCAATCTAACCGGCATCAGCGTCGGTCGGGTCCTCGTCCCAATTTCCCCTGAGCATCTCGCATTCGACCCAAACACCGGCGCGGGAACCATCATTGACTCAGGCACAGTAATAACCCGGTTCGTGTCCCCGGTCTACACAGCGGTTCGAGATGAGTTTAGAAAGCAAGTGGGGGGTTCGTTTTCGCCTTTGGGGGCTTTCGACACGTGTTTCGCAACGAGCAATGAGGTTTCGGCACCTGGCATTACGTTTCATTTGAGTGGATTGGACTTGAAATTGCCAATGGAAAATAGTTTGATTCATAGTAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCGAATAATGTGAACTCTGTGTTGAATGTGATTGCAAATTTGCAGCAACAAAATCATAGGATTTTGTTTGATATTACAAATTCTAAATTGGGGATTGCTCGTGAGCTTTGTAATTAGGCTTTGAATCAACAATAATGGTGACTAAGGATTTTGATTTCCTTCTTTGGTTGAAAGAGTGGAAGTTTGGAAGATTAAAGAACTTATGAATAATTTGGAAAGTGTATCAATATTATTTTAGTAGGATTGATTTGTGAGTTTCTTTCAATTTGACGATCAACATCTAAAGTATCGTTTAAACTGTGTTTAAAGATGGTGGTTTCAGCATCCATTTTAATGGGCTTGGCCCTAACCCATTTTGGGCTCCAATTGTTTGTTTGTTTATTAGTATTATTATTATTATTATTGGAAAATGGGAGGAAATATAAAAAACAGGGGATGTCTTTTGATTTTTGTCCAAAAAGAAAATAAGAAGATTTTTG

mRNA sequence

CCTAGCCATTACACCAAAGAAATTAATTGCAACCCCATTTTCGTAATTTCACAGCCAATCTTGGTATAAATGCACCCCATCTCCATGCAAAACTCCGGTAACCGGTAACCTTCAAATAATTTCCCTGCCCACTTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAATCATGGCCCTCAAAATTTTTTTCTTCTTCTTTTTAGCCCTTCTGGTACTAAATACCAATGCTTCCGATCTCTGCGCCTCCGGATCGGATGGCGATCTCTCCGTCATCCCCATCTATGGCAAATGCTCGCCGTTCACGGCTCCAAAGTCAGAATCTTGGGTGAACACGGTGATAGATATGGCTTCAAAGGACCCGGCCCGAATTAAATACTTGTCAACCCTCGCCGCCCAGAAGACGGTGGCAGCTCCGATAGCCTCCGGGCAGCAGGTTCTTAATATTGGGAATTATGTCGTGAGAGTTCAACTTGGTACTCCTGGTCAAGTAATGTACATGGTTCTTGATACCAGTAATGACGCCGCCTGGGCTCCGTGCTCCGGCTGCACCGGTTGCTCCGCCACCACTTTTTCGTCTAATTATTCCTCAACTTTTGCGACTTTAGACTGCTCTAAACCGCAGTGTAGTCAGGCTCGGGGTCTTTCCTGTCCGACCACCGGCAACGCCGATTGCCTATTCAATCAAACATACGGCGGCGACTCATCGTTTTCAGCCACCTTGGTTCAAGACTCTCTCCATTTAGGCACTGACGTCATTCCAAATTTCTCATTCGGCTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCCTAATGGGTCTCGGCCGTGGTCCCCTCTCCCTCATCTCTCAATCCGGTTCACTCTACTCCGGTCTATTCTCCTACTGCCTCCCAAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGACCGGCCGGTCAACCAAAATTAATCCGAACCACCCCACTCCTCCACAACCCACACCGACCATCGCTCTATTATGTCAATCTAACCGGCATCAGCGTCGGTCGGGTCCTCGTCCCAATTTCCCCTGAGCATCTCGCATTCGACCCAAACACCGGCGCGGGAACCATCATTGACTCAGGCACAGTAATAACCCGGTTCGTGTCCCCGGTCTACACAGCGGTTCGAGATGAGTTTAGAAAGCAAGTGGGGGGTTCGTTTTCGCCTTTGGGGGCTTTCGACACGTGTTTCGCAACGAGCAATGAGGTTTCGGCACCTGGCATTACGTTTCATTTGAGTGGATTGGACTTGAAATTGCCAATGGAAAATAGTTTGATTCATAGTAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCGAATAATGTGAACTCTGTGTTGAATGTGATTGCAAATTTGCAGCAACAAAATCATAGGATTTTGTTTGATATTACAAATTCTAAATTGGGGATTGCTCGTGAGCTTTGTAATTAGGCTTTGAATCAACAATAATGGTGACTAAGGATTTTGATTTCCTTCTTTGGTTGAAAGAGTGGAAGTTTGGAAGATTAAAGAACTTATGAATAATTTGGAAAGTGTATCAATATTATTTTAGTAGGATTGATTTGTGAGTTTCTTTCAATTTGACGATCAACATCTAAAGTATCGTTTAAACTGTGTTTAAAGATGGTGGTTTCAGCATCCATTTTAATGGGCTTGGCCCTAACCCATTTTGGGCTCCAATTGTTTGTTTGTTTATTAGTATTATTATTATTATTATTGGAAAATGGGAGGAAATATAAAAAACAGGGGATGTCTTTTGATTTTTGTCCAAAAAGAAAATAAGAAGATTTTTG

Coding sequence (CDS)

ATGGCCCTCAAAATTTTTTTCTTCTTCTTTTTAGCCCTTCTGGTACTAAATACCAATGCTTCCGATCTCTGCGCCTCCGGATCGGATGGCGATCTCTCCGTCATCCCCATCTATGGCAAATGCTCGCCGTTCACGGCTCCAAAGTCAGAATCTTGGGTGAACACGGTGATAGATATGGCTTCAAAGGACCCGGCCCGAATTAAATACTTGTCAACCCTCGCCGCCCAGAAGACGGTGGCAGCTCCGATAGCCTCCGGGCAGCAGGTTCTTAATATTGGGAATTATGTCGTGAGAGTTCAACTTGGTACTCCTGGTCAAGTAATGTACATGGTTCTTGATACCAGTAATGACGCCGCCTGGGCTCCGTGCTCCGGCTGCACCGGTTGCTCCGCCACCACTTTTTCGTCTAATTATTCCTCAACTTTTGCGACTTTAGACTGCTCTAAACCGCAGTGTAGTCAGGCTCGGGGTCTTTCCTGTCCGACCACCGGCAACGCCGATTGCCTATTCAATCAAACATACGGCGGCGACTCATCGTTTTCAGCCACCTTGGTTCAAGACTCTCTCCATTTAGGCACTGACGTCATTCCAAATTTCTCATTCGGCTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCCTAATGGGTCTCGGCCGTGGTCCCCTCTCCCTCATCTCTCAATCCGGTTCACTCTACTCCGGTCTATTCTCCTACTGCCTCCCAAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGACCGGCCGGTCAACCAAAATTAATCCGAACCACCCCACTCCTCCACAACCCACACCGACCATCGCTCTATTATGTCAATCTAACCGGCATCAGCGTCGGTCGGGTCCTCGTCCCAATTTCCCCTGAGCATCTCGCATTCGACCCAAACACCGGCGCGGGAACCATCATTGACTCAGGCACAGTAATAACCCGGTTCGTGTCCCCGGTCTACACAGCGGTTCGAGATGAGTTTAGAAAGCAAGTGGGGGGTTCGTTTTCGCCTTTGGGGGCTTTCGACACGTGTTTCGCAACGAGCAATGAGGTTTCGGCACCTGGCATTACGTTTCATTTGAGTGGATTGGACTTGAAATTGCCAATGGAAAATAGTTTGATTCATAGTAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCGAATAATGTGAACTCTGTGTTGAATGTGATTGCAAATTTGCAGCAACAAAATCATAGGATTTTGTTTGATATTACAAATTCTAAATTGGGGATTGCTCGTGAGCTTTGTAATTAG

Protein sequence

MALKIFFFFFLALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAWAPCSGCTGCSATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLGIARELCN
Homology
BLAST of Bhi04G000762 vs. TAIR 10
Match: AT1G09750.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 557.0 bits (1434), Expect = 1.3e-158
Identity = 296/446 (66.37%), Postives = 351/446 (78.70%), Query Frame = 0

Query: 5   IFFFFFLALLV---LNTNASDLCAS----GSDGDLSVIPIYGKCSPFTAPK-SESWVNTV 64
           + FFFFL LL+     T   D CA+    GSD DLS+IPI  KCSPF     S S ++TV
Sbjct: 6   LHFFFFLTLLLPFTFTTATRDTCATAAPDGSD-DLSIIPINAKCSPFAPTHVSASVIDTV 65

Query: 65  IDMASKDPARIKYLSTLAA--QKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDT 124
           + MAS D  R+ YLS+L A   K  + P+ASG Q L+IGNYVVR +LGTP Q+M+MVLDT
Sbjct: 66  LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQ-LHIGNYVVRAKLGTPPQLMFMVLDT 125

Query: 125 SNDAAWAPCSGCTGCS--ATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTG--NADCLF 184
           SNDA W PCSGC+GCS  +T+F++N SST++T+ CS  QC+QARGL+CP++    + C F
Sbjct: 126 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 185

Query: 185 NQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQ 244
           NQ+YGGDSSFSA+LVQD+L L  DVIPNFSFGCI+SASG+S+PPQGLMGLGRGP+SL+SQ
Sbjct: 186 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 245

Query: 245 SGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISV 304
           + SLYSG+FSYCLPSF+S+YFSGSLKLG  GQPK IR TPLL NP RPSLYYVNLTG+SV
Sbjct: 246 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 305

Query: 305 GRVLVPISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVG-GSFSPLGAF 364
           G V VP+ P +L FD N+GAGTIIDSGTVITRF  PVY A+RDEFRKQV   SFS LGAF
Sbjct: 306 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 365

Query: 365 DTCFATSNEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 424
           DTCF+  NE  AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA    N N+VLNVIA
Sbjct: 366 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 425

Query: 425 NLQQQNHRILFDITNSKLGIARELCN 436
           NLQQQN RILFD+ NS++GIA E CN
Sbjct: 426 NLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of Bhi04G000762 vs. TAIR 10
Match: AT3G54400.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 441.8 bits (1135), Expect = 6.3e-124
Identity = 239/426 (56.10%), Postives = 296/426 (69.48%), Query Frame = 0

Query: 11  LALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPARIKYL 70
           ++LL+L + + +        DL V  I   CSPF    S SW +T++    +D AR  YL
Sbjct: 10  ISLLILKSESINCNEKSHSSDLRVFHINSLCSPFKT--SVSWADTLL----QDKARFLYL 69

Query: 71  STLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAWAPCSGCTGCS 130
           S+LA  +  + PIASG+ ++    Y+VR  +GTP Q M + LDTSNDAAW PCSGC GCS
Sbjct: 70  SSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCS 129

Query: 131 ATT-FSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSSFSATLVQDSL 190
           ++  F  + SS+  TL C  PQC QA   SC  T +  C FN TYGG S+  A L QD+L
Sbjct: 130 SSVLFDPSKSSSSRTLQCEAPQCKQAPNPSC--TVSKSCGFNMTYGG-STIEAYLTQDTL 189

Query: 191 HLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSY 250
            L +DVIPN++FGCI+ ASG+S+P QGLMGLGRGPLSLISQS +LY   FSYCLP+ KS 
Sbjct: 190 TLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSS 249

Query: 251 YFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPEHLAFDPNTG 310
            FSGSL+LGP  QP  I+TTPLL NP R SLYYVNL GI VG  +V I    LAFDP TG
Sbjct: 250 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 309

Query: 311 AGTIIDSGTVITRFVSPVYTAVRDEFRKQV-GGSFSPLGAFDTCFATSNEVSAPGITFHL 370
           AGTI DSGTV TR V P Y AVR+EFR++V   + + LG FDTC+  S  V  P +TF  
Sbjct: 310 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCY--SGSVVFPSVTFMF 369

Query: 371 SGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLG 430
           +G+++ LP +N LIHSSAG+L+CLAMAAAP NVNSVLNVIA++QQQNHR+L D+ NS+LG
Sbjct: 370 AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLG 424

Query: 431 IARELC 435
           I+RE C
Sbjct: 430 ISRETC 424

BLAST of Bhi04G000762 vs. TAIR 10
Match: AT5G07030.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 409.8 bits (1052), Expect = 2.7e-114
Identity = 220/439 (50.11%), Postives = 290/439 (66.06%), Query Frame = 0

Query: 3   LKIFFFFFLALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASK 62
           L++F    LAL + + N            L +  I   CSPF +    SW   V+   ++
Sbjct: 24  LQLFSILPLALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQ 83

Query: 63  DPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAWAP 122
           D AR++YLS+L A ++V  PIASG+Q+L    Y+V+  +GTP Q + + +DTS+D AW P
Sbjct: 84  DQARLQYLSSLVAGRSV-VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIP 143

Query: 123 CSGCTGC-SATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSSFS 182
           CSGC GC S T FS   S++F  + CS PQC Q      PT G   C FN TY G SS +
Sbjct: 144 CSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPN---PTCGARACSFNLTY-GSSSIA 203

Query: 183 ATLVQDSLHLGTDVIPNFSFGCISSASGSSI--PPQGLMGLGRGPLSLISQSGSLYSGLF 242
           A L QD++ L  D I  F+FGC++  +G     PPQGL+GLGRGPLSL+SQ+ S+Y   F
Sbjct: 204 ANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTF 263

Query: 243 SYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP 302
           SYCLPSF+S  FSGSL+LGP  QP+ ++ T LL NP R SLYYVNL  I VGR +V + P
Sbjct: 264 SYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPP 323

Query: 303 EHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGS---FSPLGAFDTCFATS 362
             +AF+P+TGAGTI DSGTV TR   PVY AVR+EFRK+V  +    + LG FDTC+  S
Sbjct: 324 AAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY--S 383

Query: 363 NEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNH 422
            +V  P ITF   G+++ +P +N ++HS+AGS +CLAMAAAP NVNSV+NVIA++QQQNH
Sbjct: 384 GQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNH 443

Query: 423 RILFDITNSKLGIARELCN 436
           R+L D+ N +LG+ARE C+
Sbjct: 444 RVLIDVPNGRLGLARERCS 455

BLAST of Bhi04G000762 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 205.3 bits (521), Expect = 1.0e-52
Identity = 139/394 (35.28%), Postives = 188/394 (47.72%), Query Frame = 0

Query: 62  KDPARIKYLSTLAAQ----KTVAAPIASG--QQVLN-----IGNYVVRVQLGTPGQVMYM 121
           +D  R+K ++TLAAQ        AP   G    V++      G Y  R+ +GTP + +YM
Sbjct: 98  RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157

Query: 122 VLDTSNDAAW---APCSGCTGCSATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNAD 181
           VLDT +D  W   APC  C   S   F    S T+AT+ CS P C +     C  T    
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGC-NTRRKT 217

Query: 182 CLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSL 241
           CL+  +YG  S        ++L    + +   + GC     G  +   GL+GLG+G LS 
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277

Query: 242 ISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTG 301
             Q+G  ++  FSYCL    +     S+  G A   ++ R TPLL NP   + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337

Query: 302 ISVGRVLVP-ISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGS---- 361
           ISVG   VP ++      D     G IIDSGT +TR + P Y A+RD FR  VG      
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397

Query: 362 FSPLGAFDTCFATS--NEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 421
                 FDTCF  S  NEV  P +  H  G D+ LP  N LI        C A A     
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457

Query: 422 VNSVLNVIANLQQQNHRILFDITNSKLGIARELC 435
               L++I N+QQQ  R+++D+ +S++G A   C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Bhi04G000762 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 191.8 bits (486), Expect = 1.1e-48
Identity = 137/402 (34.08%), Postives = 185/402 (46.02%), Query Frame = 0

Query: 62  KDPARIKYLSTLAAQKT--------------VAAPIASGQQVLNIGNYVVRVQLGTPGQV 121
           +D  R+K +++LAA  T               +  + SG      G Y +R+ +GTP   
Sbjct: 89  RDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLS-QGSGEYFMRLGVGTPATN 148

Query: 122 MYMVLDTSNDAAWAPCSGCTGCSATT---FSSNYSSTFATLDCSKPQCSQARGLS-CPTT 181
           +YMVLDT +D  W  CS C  C   T   F    S TFAT+ C    C +    S C T 
Sbjct: 149 VYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTR 208

Query: 182 GNADCLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRG 241
            +  CL+  +YG  S        ++L      + +   GC     G  +   GL+GLGRG
Sbjct: 209 RSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRG 268

Query: 242 PLSLISQSGSLYSGLFSYCL----PSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPS 301
            LS  SQ+ + Y+G FSYCL     S  S     ++  G A  PK    TPLL NP   +
Sbjct: 269 GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDT 328

Query: 302 LYYVNLTGISVGRVLVP-ISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQ 361
            YY+ L GISVG   VP +S      D     G IIDSGT +TR   P Y A+RD FR  
Sbjct: 329 FYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-- 388

Query: 362 VGGS----FSPLGAFDTCFATS--NEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACL 421
           +G +          FDTCF  S    V  P + FH  G ++ LP  N LI  +     C 
Sbjct: 389 LGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCF 448

Query: 422 AMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLGIARELC 435
           A A    +    L++I N+QQQ  R+ +D+  S++G     C
Sbjct: 449 AFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Bhi04G000762 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 1.9e-157
Identity = 296/446 (66.37%), Postives = 351/446 (78.70%), Query Frame = 0

Query: 5   IFFFFFLALLV---LNTNASDLCAS----GSDGDLSVIPIYGKCSPFTAPK-SESWVNTV 64
           + FFFFL LL+     T   D CA+    GSD DLS+IPI  KCSPF     S S ++TV
Sbjct: 6   LHFFFFLTLLLPFTFTTATRDTCATAAPDGSD-DLSIIPINAKCSPFAPTHVSASVIDTV 65

Query: 65  IDMASKDPARIKYLSTLAA--QKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDT 124
           + MAS D  R+ YLS+L A   K  + P+ASG Q L+IGNYVVR +LGTP Q+M+MVLDT
Sbjct: 66  LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQ-LHIGNYVVRAKLGTPPQLMFMVLDT 125

Query: 125 SNDAAWAPCSGCTGCS--ATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTG--NADCLF 184
           SNDA W PCSGC+GCS  +T+F++N SST++T+ CS  QC+QARGL+CP++    + C F
Sbjct: 126 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 185

Query: 185 NQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQ 244
           NQ+YGGDSSFSA+LVQD+L L  DVIPNFSFGCI+SASG+S+PPQGLMGLGRGP+SL+SQ
Sbjct: 186 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 245

Query: 245 SGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISV 304
           + SLYSG+FSYCLPSF+S+YFSGSLKLG  GQPK IR TPLL NP RPSLYYVNLTG+SV
Sbjct: 246 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 305

Query: 305 GRVLVPISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVG-GSFSPLGAF 364
           G V VP+ P +L FD N+GAGTIIDSGTVITRF  PVY A+RDEFRKQV   SFS LGAF
Sbjct: 306 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 365

Query: 365 DTCFATSNEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 424
           DTCF+  NE  AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA    N N+VLNVIA
Sbjct: 366 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 425

Query: 425 NLQQQNHRILFDITNSKLGIARELCN 436
           NLQQQN RILFD+ NS++GIA E CN
Sbjct: 426 NLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of Bhi04G000762 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.3e-110
Identity = 219/410 (53.41%), Postives = 289/410 (70.49%), Query Frame = 0

Query: 47  PKSESWVNTVIDMASKDPARIKYLSTLAAQKTV-AAPIASGQQVLNIGNYVVRVQLGTPG 106
           P S S + ++I +A  D AR+ +LS+ AA   V +AP+ASGQ      +YVVR  LG+P 
Sbjct: 33  PSSPSPLESIIALARDDDARLLFLSSKAATAGVSSAPVASGQAP---PSYVVRAGLGSPS 92

Query: 107 QVMYMVLDTSNDAAWAPCSGCTGC-SATTFSSNYSSTFATLDCSKPQCSQARGLSCPT-T 166
           Q + + LDTS DA WA CS C  C S++ F+   SS++A+L CS   C   +G +CP   
Sbjct: 93  QQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQ 152

Query: 167 GNAD----------CLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASG--SS 226
           G  D          C F++ +  D+SF A L  D+L LG D IPN++FGC+SS +G  ++
Sbjct: 153 GGGDAAPPPATLPTCAFSKPF-ADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTN 212

Query: 227 IPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGP-AGQPKLIRTTP 286
           +P QGL+GLGRGP++L+SQ+GSLY+G+FSYCLPS++SYYFSGSL+LG   GQP+ +R TP
Sbjct: 213 MPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTP 272

Query: 287 LLHNPHRPSLYYVNLTGISVGRVLVPISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTA 346
           +L NPHR SLYYVN+TG+SVG   V +     AFD  TGAGT++DSGTVITR+ +PVY A
Sbjct: 273 MLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAA 332

Query: 347 VRDEFRKQVG--GSFSPLGAFDTCFATSNEVS--APGITFHL-SGLDLKLPMENSLIHSS 406
           +R+EFR+QV     ++ LGAFDTCF T    +  AP +T H+  G+DL LPMEN+LIHSS
Sbjct: 333 LREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSS 392

Query: 407 AGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLGIARELCN 436
           A  LACLAMA AP NVNSV+NVIANLQQQN R++FD+ NS++G A+E CN
Sbjct: 393 ATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438

BLAST of Bhi04G000762 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.4e-51
Identity = 139/394 (35.28%), Postives = 188/394 (47.72%), Query Frame = 0

Query: 62  KDPARIKYLSTLAAQ----KTVAAPIASG--QQVLN-----IGNYVVRVQLGTPGQVMYM 121
           +D  R+K ++TLAAQ        AP   G    V++      G Y  R+ +GTP + +YM
Sbjct: 98  RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157

Query: 122 VLDTSNDAAW---APCSGCTGCSATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNAD 181
           VLDT +D  W   APC  C   S   F    S T+AT+ CS P C +     C  T    
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGC-NTRRKT 217

Query: 182 CLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSL 241
           CL+  +YG  S        ++L    + +   + GC     G  +   GL+GLG+G LS 
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277

Query: 242 ISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTG 301
             Q+G  ++  FSYCL    +     S+  G A   ++ R TPLL NP   + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337

Query: 302 ISVGRVLVP-ISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGS---- 361
           ISVG   VP ++      D     G IIDSGT +TR + P Y A+RD FR  VG      
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397

Query: 362 FSPLGAFDTCFATS--NEVSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 421
                 FDTCF  S  NEV  P +  H  G D+ LP  N LI        C A A     
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457

Query: 422 VNSVLNVIANLQQQNHRILFDITNSKLGIARELC 435
               L++I N+QQQ  R+++D+ +S++G A   C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Bhi04G000762 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.6e-47
Identity = 136/360 (37.78%), Postives = 181/360 (50.28%), Query Frame = 0

Query: 93  GNYVVRVQLGTPGQVMYMVLDTSNDAAWAPCSGCTGC---SATTFSSNYSSTFATLDCSK 152
           G Y++ + +GTP Q    ++DT +D  W  C  CT C   S   F+   SS+F+TL CS 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSS 152

Query: 153 PQCSQARGLSCPTTGNADCLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASG 212
             C   + LS PT  N  C +   YG  S    ++  ++L  G+  IPN +FGC  +  G
Sbjct: 153 QLC---QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 212

Query: 213 -SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCL----PSFKSYYFSGSLKLG-PAGQP 272
                  GL+G+GRGPLSL SQ   L    FSYC+     S  S    GSL     AG P
Sbjct: 213 FGQGNGAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSP 272

Query: 273 KLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPEHLAFDPNTG-AGTIIDSGTVITR 332
                T L+ +   P+ YY+ L G+SVG   +PI P   A + N G  G IIDSGT +T 
Sbjct: 273 ----NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 332

Query: 333 FVSPVYTAVRDEFRKQ-----VGGSFSPLGAFDTCFATSNEVS---APGITFHLSGLDLK 392
           FV+  Y +VR EF  Q     V GS S    FD CF T ++ S    P    H  G DL+
Sbjct: 333 FVNNAYQSVRQEFISQINLPVVNGSSS---GFDLCFQTPSDPSNLQIPTFVMHFDGGDLE 392

Query: 393 LPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLGIARELC 435
           LP EN  I  S G L CLAM ++       +++  N+QQQN  +++D  NS +  A   C
Sbjct: 393 LPSENYFISPSNG-LICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Bhi04G000762 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.3e-41
Identity = 111/350 (31.71%), Postives = 160/350 (45.71%), Query Frame = 0

Query: 93  GNYVVRVQLGTPGQVMYMVLDTSNDAAWAPCSGCTGC---SATTFSSNYSSTFATLDCSK 152
           G Y VR+ +G+P +  YMV+D+ +D  W  C  C  C   S   F    S ++  + C  
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGS 188

Query: 153 PQCSQARGLSCPTTGNADCLFNQTYGGDSSFSATLVQDSLHLGTDVIPNFSFGCISSASG 212
             C +     C + G   C +   YG  S    TL  ++L     V+ N + GC     G
Sbjct: 189 SVCDRIENSGCHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRG 248

Query: 213 SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKLIRTT 272
             I   GL+G+G G +S + Q      G F YCL S +    +GSL  G    P      
Sbjct: 249 MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWV 308

Query: 273 PLLHNPHRPSLYYVNLTGISVGRVLVPISPEHLAFDPNTGAGTIIDSGTVITRFVSPVYT 332
           PL+ NP  PS YYV L G+ VG V +P+             G ++D+GT +TR  +  Y 
Sbjct: 309 PLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYV 368

Query: 333 AVRDEFRKQVGG--SFSPLGAFDTCFATSNEVS--APGITFHLS-GLDLKLPMENSLIHS 392
           A RD F+ Q       S +  FDTC+  S  VS   P ++F+ + G  L LP  N L+  
Sbjct: 369 AFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 428

Query: 393 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDITNSKLGIARELC 435
                 C A AA+P      L++I N+QQ+  ++ FD  N  +G    +C
Sbjct: 429 DDSGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Bhi04G000762 vs. ExPASy TrEMBL
Match: A0A6J1K469 (aspartyl protease AED3 OS=Cucurbita maxima OX=3661 GN=LOC111492140 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 6.2e-228
Identity = 405/437 (92.68%), Postives = 416/437 (95.19%), Query Frame = 0

Query: 1   MALKIFFFFFLALLVLNTNASDLCASGSD--GDLSVIPIYGKCSPFTAPKSESWVNTVID 60
           MA+K FFF FLALL LN+NASDLCA+GSD  GDLSVIPIYGKCSPFTAPKSESWVNTVID
Sbjct: 1   MAVKFFFFVFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVID 60

Query: 61  MASKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDA 120
           MASKDPARIKYLS+LAAQKTVAAPIASGQ  LNIGNYVVRVQLGTPGQ MYMVLDTS+DA
Sbjct: 61  MASKDPARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDA 120

Query: 121 AWAPCSGCTGCSATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDS 180
           AWAPCSGC+GCSATTF S  SSTFATLDCSKPQCSQARGLSCPTTG+ DCLFNQTYGGDS
Sbjct: 121 AWAPCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDS 180

Query: 181 SFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGL 240
           SFSATLVQD+LHLGTDVIPNFSFGCISSASGSSIPPQGL+GLGRGPLSLISQS SLYSGL
Sbjct: 181 SFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGL 240

Query: 241 FSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPIS 300
           FSYCLPSFKSYYFSGSLKLGP GQPK IRTTPLL NPHRPSLYYVNLTGISVGRVLVPI 
Sbjct: 241 FSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIP 300

Query: 301 PEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNE 360
           PE LAFDPNTGAGTIIDSGTVITRFV PVYTAVRDEFRKQVGGSFSPLGAFDTCF TSNE
Sbjct: 301 PETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGSFSPLGAFDTCFTTSNE 360

Query: 361 VSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 420
           ++APGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI
Sbjct: 361 MAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 420

Query: 421 LFDITNSKLGIARELCN 436
           LFDI NSKLGIARELCN
Sbjct: 421 LFDIANSKLGIARELCN 437

BLAST of Bhi04G000762 vs. ExPASy TrEMBL
Match: A0A6J1HAN7 (aspartyl protease AED3 OS=Cucurbita moschata OX=3662 GN=LOC111462342 PE=4 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 2.4e-227
Identity = 403/437 (92.22%), Postives = 416/437 (95.19%), Query Frame = 0

Query: 1   MALKIFFFFFLALLVLNTNASDLCASGSD--GDLSVIPIYGKCSPFTAPKSESWVNTVID 60
           MA+K  FFFFLALL LN+NASDLCA+GSD  GDLSVIPIYGKCSPFTAPKSESWVNTVID
Sbjct: 1   MAVKFVFFFFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVID 60

Query: 61  MASKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDA 120
           MASKDPARIKYLS+LAAQKTVAAPIASGQ  LNIGNYVVRVQLGTPGQ MYMVLDTS+DA
Sbjct: 61  MASKDPARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDA 120

Query: 121 AWAPCSGCTGCSATTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDS 180
           AWAPCSGC+GCSATTF +  SSTFATLDCSKPQCSQARGLSCPTTG+ DCLFNQTYGGDS
Sbjct: 121 AWAPCSGCSGCSATTFLAKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDS 180

Query: 181 SFSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGL 240
           SFSATLVQD+LHLG+DVIPNFSFGCISSASGSSIPPQGL+GLGRGPLSLISQS SLYSGL
Sbjct: 181 SFSATLVQDTLHLGSDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGL 240

Query: 241 FSYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPIS 300
           FSYCLPSFKSYYFSGSLKLGP GQPK IRTTPLL NPHRPSLYYVNLTGISVGRVLVPI 
Sbjct: 241 FSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIP 300

Query: 301 PEHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNE 360
           PE LAFDPNTGAGTIIDSGTVITRFV PVYTAVRDEFRKQVGGSFSPLGAFDTCF TSNE
Sbjct: 301 PETLAFDPNTGAGTIIDSGTVITRFVDPVYTAVRDEFRKQVGGSFSPLGAFDTCFTTSNE 360

Query: 361 VSAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 420
           ++APGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI
Sbjct: 361 MAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 420

Query: 421 LFDITNSKLGIARELCN 436
           LFDI NSKLGIARELCN
Sbjct: 421 LFDIANSKLGIARELCN 437

BLAST of Bhi04G000762 vs. ExPASy TrEMBL
Match: A0A0A0LRW0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G371100 PE=4 SV=1)

HSP 1 Score: 795.4 bits (2053), Expect = 1.2e-226
Identity = 400/436 (91.74%), Postives = 416/436 (95.41%), Query Frame = 0

Query: 1   MALKIFFFFFLALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMA 60
           MALK+FFF  LALL+L +NA DLCASGSDGDLSVIPIYGKCSPFTAPKSESW+NTVIDMA
Sbjct: 1   MALKLFFFLSLALLLLTSNAFDLCASGSDGDLSVIPIYGKCSPFTAPKSESWMNTVIDMA 60

Query: 61  SKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAW 120
           SKDPARI+YLS+L AQKTVAAPIASGQQVLN+GNYVVRVQLGTPGQ MYMVLDTSNDAAW
Sbjct: 61  SKDPARIRYLSSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAW 120

Query: 121 APCSGCTGCSA-TTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS 180
           APCSGC GCS+ TTFS+  SSTFATLDCSKP+C+QARGLSCPTTGN DCLFNQTYGGDS+
Sbjct: 121 APCSGCIGCSSTTTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDST 180

Query: 181 FSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLF 240
           FSATLVQDSLHLG +VIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLF
Sbjct: 181 FSATLVQDSLHLGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLF 240

Query: 241 SYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP 300
           SYCLPSFKSYYFSGSLKLGP GQPK IRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP
Sbjct: 241 SYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP 300

Query: 301 EHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNEV 360
           E LAFDPNTGAGTIIDSGTVITRFV  +YTAVRDEFRKQVGGSFSPLGAFDTCFAT+NEV
Sbjct: 301 ELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLGAFDTCFATNNEV 360

Query: 361 SAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRIL 420
           SAP IT HLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSV+NVIANLQQQNHRIL
Sbjct: 361 SAPAITLHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRIL 420

Query: 421 FDITNSKLGIARELCN 436
           FDI NSKLGIARELCN
Sbjct: 421 FDINNSKLGIARELCN 436

BLAST of Bhi04G000762 vs. ExPASy TrEMBL
Match: A0A5A7VCS8 (Aspartyl protease AED3 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G004150 PE=4 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 2.6e-226
Identity = 399/436 (91.51%), Postives = 416/436 (95.41%), Query Frame = 0

Query: 1   MALKIFFFFFLALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMA 60
           MALK FF   LALL+LN+NA DLCASGSDGDLSVIPIYGKCSPFTAPKSESW+NTVIDMA
Sbjct: 1   MALKFFFVLSLALLLLNSNAFDLCASGSDGDLSVIPIYGKCSPFTAPKSESWMNTVIDMA 60

Query: 61  SKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAW 120
           SKDPARI+YLS+L AQK+VAAPIASGQQVLNIGNYVVRVQLGTPGQ MYMVLDTSNDAAW
Sbjct: 61  SKDPARIRYLSSLTAQKSVAAPIASGQQVLNIGNYVVRVQLGTPGQTMYMVLDTSNDAAW 120

Query: 121 APCSGCTGCSA-TTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS 180
           APCSGCTGCS   TFS+  SSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS
Sbjct: 121 APCSGCTGCSGPATFSAKNSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS 180

Query: 181 FSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLF 240
           FSATLV+DSLHLG++VIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSG+F
Sbjct: 181 FSATLVEDSLHLGSNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGIF 240

Query: 241 SYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP 300
           SYCLPSFKSYYFSGSLKLGP GQPK IRTTPLL NPHRPSLYYVNLTGISVGRVLVPISP
Sbjct: 241 SYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLRNPHRPSLYYVNLTGISVGRVLVPISP 300

Query: 301 EHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNEV 360
           E LAFDPNTGAGTIIDSGTVITRFV PVYTA+RDEFRKQVGGSFSPLGAFDTCFATSNEV
Sbjct: 301 ELLAFDPNTGAGTIIDSGTVITRFVPPVYTAIRDEFRKQVGGSFSPLGAFDTCFATSNEV 360

Query: 361 SAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRIL 420
           SAP +T HLSGLDL+LPMENSLIHSSAGSLACLAMAAAPNNVN+V+NVIANLQQQNHRIL
Sbjct: 361 SAPAVTLHLSGLDLRLPMENSLIHSSAGSLACLAMAAAPNNVNAVVNVIANLQQQNHRIL 420

Query: 421 FDITNSKLGIARELCN 436
           FDI N+KLGIARELCN
Sbjct: 421 FDINNAKLGIARELCN 436

BLAST of Bhi04G000762 vs. ExPASy TrEMBL
Match: A0A1S3BCB0 (aspartyl protease AED3 OS=Cucumis melo OX=3656 GN=LOC103488136 PE=4 SV=1)

HSP 1 Score: 794.3 bits (2050), Expect = 2.6e-226
Identity = 399/436 (91.51%), Postives = 416/436 (95.41%), Query Frame = 0

Query: 1   MALKIFFFFFLALLVLNTNASDLCASGSDGDLSVIPIYGKCSPFTAPKSESWVNTVIDMA 60
           MALK FF   LALL+LN+NA DLCASGSDGDLSVIPIYGKCSPFTAPKSESW+NTVIDMA
Sbjct: 1   MALKFFFVLSLALLLLNSNAFDLCASGSDGDLSVIPIYGKCSPFTAPKSESWMNTVIDMA 60

Query: 61  SKDPARIKYLSTLAAQKTVAAPIASGQQVLNIGNYVVRVQLGTPGQVMYMVLDTSNDAAW 120
           SKDPARI+YLS+L AQK+VAAPIASGQQVLNIGNYVVRVQLGTPGQ MYMVLDTSNDAAW
Sbjct: 61  SKDPARIRYLSSLTAQKSVAAPIASGQQVLNIGNYVVRVQLGTPGQTMYMVLDTSNDAAW 120

Query: 121 APCSGCTGCSA-TTFSSNYSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS 180
           APCSGCTGCS   TFS+  SSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS
Sbjct: 121 APCSGCTGCSGPATFSAKNSSTFATLDCSKPQCSQARGLSCPTTGNADCLFNQTYGGDSS 180

Query: 181 FSATLVQDSLHLGTDVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLF 240
           FSATLV+DSLHLG++VIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSG+F
Sbjct: 181 FSATLVEDSLHLGSNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGIF 240

Query: 241 SYCLPSFKSYYFSGSLKLGPAGQPKLIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISP 300
           SYCLPSFKSYYFSGSLKLGP GQPK IRTTPLL NPHRPSLYYVNLTGISVGRVLVPISP
Sbjct: 241 SYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLRNPHRPSLYYVNLTGISVGRVLVPISP 300

Query: 301 EHLAFDPNTGAGTIIDSGTVITRFVSPVYTAVRDEFRKQVGGSFSPLGAFDTCFATSNEV 360
           E LAFDPNTGAGTIIDSGTVITRFV PVYTA+RDEFRKQVGGSFSPLGAFDTCFATSNEV
Sbjct: 301 ELLAFDPNTGAGTIIDSGTVITRFVPPVYTAIRDEFRKQVGGSFSPLGAFDTCFATSNEV 360

Query: 361 SAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRIL 420
           SAP +T HLSGLDL+LPMENSLIHSSAGSLACLAMAAAPNNVN+V+NVIANLQQQNHRIL
Sbjct: 361 SAPAVTLHLSGLDLRLPMENSLIHSSAGSLACLAMAAAPNNVNAVVNVIANLQQQNHRIL 420

Query: 421 FDITNSKLGIARELCN 436
           FDI N+KLGIARELCN
Sbjct: 421 FDINNAKLGIARELCN 436

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT1G09750.11.3e-15866.37Eukaryotic aspartyl protease family protein [more]
AT3G54400.16.3e-12456.10Eukaryotic aspartyl protease family protein [more]
AT5G07030.12.7e-11450.11Eukaryotic aspartyl protease family protein [more]
AT1G01300.11.0e-5235.28Eukaryotic aspartyl protease family protein [more]
AT3G61820.11.1e-4834.08Eukaryotic aspartyl protease family protein [more]
Match NameE-valueIdentityDescription
O044961.9e-15766.37Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Q6F4N51.3e-11053.41Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1[more]
Q9LNJ31.4e-5135.28Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q766C31.6e-4737.78Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LHE31.3e-4131.71Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
A0A6J1K4696.2e-22892.68aspartyl protease AED3 OS=Cucurbita maxima OX=3661 GN=LOC111492140 PE=4 SV=1[more]
A0A6J1HAN72.4e-22792.22aspartyl protease AED3 OS=Cucurbita moschata OX=3662 GN=LOC111462342 PE=4 SV=1[more]
A0A0A0LRW01.2e-22691.74Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G37110... [more]
A0A5A7VCS82.6e-22691.51Aspartyl protease AED3 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold8... [more]
A0A1S3BCB02.6e-22691.51aspartyl protease AED3 OS=Cucumis melo OX=3656 GN=LOC103488136 PE=4 SV=1[more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 95..258
e-value: 1.2E-33
score: 116.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 260..435
e-value: 2.8E-44
score: 153.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 70..258
e-value: 9.6E-35
score: 122.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 91..434
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 281..430
e-value: 8.1E-31
score: 107.0
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 14..435
NoneNo IPR availablePANTHERPTHR13683:SF839ASPARTYL PROTEASE AED3-LIKEcoord: 14..435
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 95..430
score: 35.673393

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M000762Bhi04M000762mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0043067 regulation of programmed cell death
molecular_function GO:0004190 aspartic-type endopeptidase activity