Bhi05G001543 (gene) Wax gourd (B227) v1

Overview
Name: Bhi05G001543
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: cysteine proteinase RD19a-like
Location: chr5: 56900172 .. 56903886 (+)
RNA-Seq Expression: Bhi05G001543
Synteny: Bhi05G001543
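
The Location field packs the chromosome, start, end, and strand into a single string. The sketch below shows one way to split it and derive the span of the gene. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates (the usual GFF convention), which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr5: 56900172 .. 56903886 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the locus spans 3,715 bp on the plus strand of chr5.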
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGCGTGGAATTATCCTTTAAAACCACCGCTACCCTATAGCCACCGTGAAACCCCCTGTAATTTTTTTAGTTAAACATGGATTGGAGTTTCTCCTTGTTCGCCGTGATAGCCGCTGCCACCGCCACCCTCTGCTCATCGGAATCTTTGACCTCACCTCGATCCATCGAACGTGATGGTGATCCACTGATTCGTCAAGTGGTGGATGACGGAAATTTCAATAATCGTCTCCCTCTCGGAGCAGAGCACCATTTTTCGCTATTCAAGCAAAGGTTCGGGAAATCGTATGCCACAGAGGAAGAGCACGATCGTAGATTCAAGATTTTCGAGGCTAATATGCGACGAGCTCAACGCCATCAGTCATTTGATCCGTCCGCCATTCATGGTATTACTCAGTTTTCCGACTTGACCCCTTTCGAGTTTCGGAAGGCATTTCTAGGGCTCAGAGGTCACCGTCTCAGGCTTCCTGTTGATACAAATGCGGCTCCGATTCTTCCTACGGAGAATCTTCCGATCGATTTTGATTGGAGAGAACGTGGTGCCGTAACTCCGGTAAAAAATCAGGTATCGTTCTATATTTCAACTTAATTCTTCTGTTTATTGCAATTTTTCAAGTTTTAGGCGTGTTTTGATTCTTGTTCTTGTTCTTTCTTTCTGGAGGATTCACCTATTGAATTTGGTTACCTTTGATTTTTATTTTTAAACGCATTTTAATTTGTGGAAGGTTGAAGAACAATTGTTAATCTACCATTCACCTGTACATTTGTTTTCATCCTTAGTTCATGTGGCATATATGAATTCTTTGAAACAATCCCACTGTAGTTTAATGCATTTTATTTATGACTAGTCTATGACATGTTGATGCATATGAAATTTATAGTTTATTTTTATCTATAGATGAAAAGATTTAAGAAGATTGTAATTAATATATCAAATAGGTTTTTACATTAGTTGATTAATTGTAATTATTTGAAACGAAAAGACTAATTAAGAAATTTATGAAGTTTTAAAAGGCTGAAGAAGATTGTCCGTCCTAACATATTTGTTAGATCGTAATTTTTTATATCTAAGGTCTACATCACACCCATACACGAAAAGATATAATACTTCCATCTGAATAGTTGTAATCGAGAAATGATGTGATTATTCGTTCAAACATATCACTACAATTAGATCATGATGTTTTATATTTAAGTGTTCTAGATCACCCCGTACATAAGCATCACGTTTCTAGCATATGATTTTGGATGCTTTTTTGTTGGTCCTTTGTCTAGTCTGATAATATTTTTGACATTTTGACATCTCTTTTGGTGGGTCATCCGTTACAAAGAATGTTGTCATAATTCATGCCTTTTTCTAGATTGTATGGGACGTGTGTGATAAACGTCTCTTTCAAGATTCCTTTTCTTTTTTTCTTTTTTGATAGTTTCATGGATTTGGTTCGTCTACTGGGTATAAAACGCGACATCCTTTTAAGCATTTTACTCTTTCCTATTTAATTTTCAATTGGGAGTCTCTTTTGTAATCACCTATAGGTGTTTGGGGTCTTCCCTATTTCATTTTTTCAATGAAATATTTCTCATCTAAAAACAAAGATCACCTCGTATATAAAACAATACACTACTCTGTGTTTTGACCACCTCGACCCGAATAGTTGTGATGAAAGACAAGATAGAAATTATGGAAAGTTGTAATGGCAATATGAGGATGTAAAGATATATATTATAAGATTATTTTCTACCTTACCATTTGGGGTTTATTAAAAGTTCAAAACTAAGGTATGAATATCTATATTGGCACAGGGATCTTGCGGATCCTGCTGGAGTTTCAGCACGACCGGTGCCCTTGAAGGTGCTAACTTCCTTGCGACGGGGGAACTTGTTAGCTTAAGTGAACAGCAGCTGGTAGATTGTGATCACGAGGTTTGATTCCTTCACTGTTTTTAATTAGCTTTGTGATAGCTTTATTTAAATTTAAGATGTGATTTATCTCTTGTTTTGTTG
ATGTCAGGATGTTTAATCAACTCTCTTAGATGCATTGATTGTTGTATTGGTATTGGGTTACAGTTAATTTTCTCATTCATTAGTCGGAAAGGATTGTTATTGTGGTACAGTTAATTTAAACTAAAGGGTCCGTGTTTCTTGTTTGCGGAGCGATGTTAAATTTTCATCAAGTTCAAAATCACTTCTAGACATGCTTTAAATTATTTAAAGTCAATTTTGAATGTATGCAAATCGCATTCAAATGTGTTGAATCAAACGTGAAATTGATTTTGAATGATTAAAAACATGTTCGTTATAGATTTTAAACTTAACGGAAGTGAGATTACCATTTCAAAATCACTTCAAACATATCTTACACTGTGGCTGAATCTTTTCAAGTGGCATTTCTAAATTTTGCACATAATCTTTCTACATTCTGCAGTGTGATCCAGAGGAAGCCGATTCCTGTGACTCTGGTTGCAATGGTGGTCTGATGAACAGTGCATTTGAATACACATTAAAAGCCGGAGGTTTGATGAGAGAGGAAGATTATCCCTATACTGGAACTGATCGTGGAAATTGCAACTTTGACAAATCCAAGATTGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAAGATCAAATTGCTGCAAATCTAGTGAAAAATGGCCCACTTGCAAGTAAGATCACCTGCCTACCTCCTAACAATAACACTCAAGAACAACATGAAATATGTTGAAATAACAAACCGATAACCTAGCTTTTTTAGAGGATTAGTCTCTCCCAAATTCTCCAAAGGAAAAAAAATCATGGGTTAGCCTAATGGTAAGTAGGGCACATGACCTTGATAAAGGGTTAAGAGGTCACAGGTTAAATACCTGGTAGTCACATATCTAGAATTTAATATCTTACAAGTTTTCTTAACACCCAAATATTGTAGGGTCATATGGGCTGTCCCGTGAGATTAATTGAGGTGCACATAAGCTGACCCAGATACTCACGGATATTAAAAAGAAAAAAAAGAAAAATCCTAACAAACTTACTATCGGATTACTAACATACCACTTCTAATAACCATACTAATATTCTACTATATCTCTAATTAGGGTTCTCAAAGGTTCTGAAATAAATTCAATACGTTCTTAAAAACTTGCTATATGGATCTAAATTCTGGTTTGTTCTTTTACCTCAATGCAGTTGCCATCAATGCAGTGTTCATGCAAACATACATAGGTGGAGTATCTTGTCCATTCATATGTTCAAGGCGCTTGGATCATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGGGTATGCTCCCATCAGAATGAGAGACAAAGATTACTGGATCATAAAGAATTCCTGGGGCGAAAACTGGGGAGAAAATGGCTATTATAAGATTTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTAGTCTCAACTGTTGCTGCAGTTCATACTCATGATACCTCCATTGCAGCAGCAGGTCAATAGAGATGGGATTTTTCTCTTAAAAGTTTGTAGTTGTTGATGAAAATGTGGGGATTCCTAGAACAGCCTTTAGCTAACTTTCATTTGTTAAAGGCACAGATGCATTTGTATCAGCTACTACAGCCAATGCATATATGTTTAGTTACATTTACACCAATAAATTCAAAAGAAATACAGATGAAAAGTTGGTTCTTATTTATGAAGAAATTAAATCTTTTTATTGAAAAAAGGGGC

mRNA sequence

TGCGTGGAATTATCCTTTAAAACCACCGCTACCCTATAGCCACCGTGAAACCCCCTGTAATTTTTTTAGTTAAACATGGATTGGAGTTTCTCCTTGTTCGCCGTGATAGCCGCTGCCACCGCCACCCTCTGCTCATCGGAATCTTTGACCTCACCTCGATCCATCGAACGTGATGGTGATCCACTGATTCGTCAAGTGGTGGATGACGGAAATTTCAATAATCGTCTCCCTCTCGGAGCAGAGCACCATTTTTCGCTATTCAAGCAAAGGTTCGGGAAATCGTATGCCACAGAGGAAGAGCACGATCGTAGATTCAAGATTTTCGAGGCTAATATGCGACGAGCTCAACGCCATCAGTCATTTGATCCGTCCGCCATTCATGGTATTACTCAGTTTTCCGACTTGACCCCTTTCGAGTTTCGGAAGGCATTTCTAGGGCTCAGAGGTCACCGTCTCAGGCTTCCTGTTGATACAAATGCGGCTCCGATTCTTCCTACGGAGAATCTTCCGATCGATTTTGATTGGAGAGAACGTGGTGCCGTAACTCCGGTAAAAAATCAGGGATCTTGCGGATCCTGCTGGAGTTTCAGCACGACCGGTGCCCTTGAAGGTGCTAACTTCCTTGCGACGGGGGAACTTGTTAGCTTAAGTGAACAGCAGCTGGTAGATTGTGATCACGAGTGTGATCCAGAGGAAGCCGATTCCTGTGACTCTGGTTGCAATGGTGGTCTGATGAACAGTGCATTTGAATACACATTAAAAGCCGGAGGTTTGATGAGAGAGGAAGATTATCCCTATACTGGAACTGATCGTGGAAATTGCAACTTTGACAAATCCAAGATTGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAAGATCAAATTGCTGCAAATCTAGTGAAAAATGGCCCACTTGCAATTGCCATCAATGCAGTGTTCATGCAAACATACATAGGTGGAGTATCTTGTCCATTCATATGTTCAAGGCGCTTGGATCATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGGGTATGCTCCCATCAGAATGAGAGACAAAGATTACTGGATCATAAAGAATTCCTGGGGCGAAAACTGGGGAGAAAATGGCTATTATAAGATTTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTAGTCTCAACTGTTGCTGCAGTTCATACTCATGATACCTCCATTGCAGCAGCAGGTCAATAGAGATGGGATTTTTCTCTTAAAAGTTTGTAGTTGTTGATGAAAATGTGGGGATTCCTAGAACAGCCTTTAGCTAACTTTCATTTGTTAAAGGCACAGATGCATTTGTATCAGCTACTACAGCCAATGCATATATGTTTAGTTACATTTACACCAATAAATTCAAAAGAAATACAGATGAAAAGTTGGTTCTTATTTATGAAGAAATTAAATCTTTTTATTGAAAAAAGGGGC

Coding sequence (CDS)

ATGGATTGGAGTTTCTCCTTGTTCGCCGTGATAGCCGCTGCCACCGCCACCCTCTGCTCATCGGAATCTTTGACCTCACCTCGATCCATCGAACGTGATGGTGATCCACTGATTCGTCAAGTGGTGGATGACGGAAATTTCAATAATCGTCTCCCTCTCGGAGCAGAGCACCATTTTTCGCTATTCAAGCAAAGGTTCGGGAAATCGTATGCCACAGAGGAAGAGCACGATCGTAGATTCAAGATTTTCGAGGCTAATATGCGACGAGCTCAACGCCATCAGTCATTTGATCCGTCCGCCATTCATGGTATTACTCAGTTTTCCGACTTGACCCCTTTCGAGTTTCGGAAGGCATTTCTAGGGCTCAGAGGTCACCGTCTCAGGCTTCCTGTTGATACAAATGCGGCTCCGATTCTTCCTACGGAGAATCTTCCGATCGATTTTGATTGGAGAGAACGTGGTGCCGTAACTCCGGTAAAAAATCAGGGATCTTGCGGATCCTGCTGGAGTTTCAGCACGACCGGTGCCCTTGAAGGTGCTAACTTCCTTGCGACGGGGGAACTTGTTAGCTTAAGTGAACAGCAGCTGGTAGATTGTGATCACGAGTGTGATCCAGAGGAAGCCGATTCCTGTGACTCTGGTTGCAATGGTGGTCTGATGAACAGTGCATTTGAATACACATTAAAAGCCGGAGGTTTGATGAGAGAGGAAGATTATCCCTATACTGGAACTGATCGTGGAAATTGCAACTTTGACAAATCCAAGATTGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAAGATCAAATTGCTGCAAATCTAGTGAAAAATGGCCCACTTGCAATTGCCATCAATGCAGTGTTCATGCAAACATACATAGGTGGAGTATCTTGTCCATTCATATGTTCAAGGCGCTTGGATCATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGGGTATGCTCCCATCAGAATGAGAGACAAAGATTACTGGATCATAAAGAATTCCTGGGGCGAAAACTGGGGAGAAAATGGCTATTATAAGATTTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTAGTCTCAACTGTTGCTGCAGTTCATACTCATGATACCTCCATTGCAGCAGCAGGTCAATAG

Protein sequence

MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVAAVHTHDTSIAAAGQ
Homology
BLAST of Bhi05G001543 vs. TAIR 10
Match: AT4G39090.1 (Papain family cysteine protease )

HSP 1 Score: 545.4 bits (1404), Expect = 3.6e-155
Identity = 263/339 (77.58%), Positives = 294/339 (86.73%), Query Frame = 0

Query: 33  DGDPL-IRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQ 92
           DGD L IRQVV  G    ++ L +E HFSLFK++FGK YA+ EEHD RF +F+AN+RRA+
Sbjct: 27  DGDDLVIRQVV--GGAEPQV-LTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRAR 86

Query: 93  RHQSFDPSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDFDWR 152
           RHQ  DPSA HG+TQFSDLT  EFRK  LG+R    +LP D N APILPTENLP DFDWR
Sbjct: 87  RHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-GFKLPKDANKAPILPTENLPEDFDWR 146

Query: 153 ERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADSC 212
           + GAVTPVKNQGSCGSCWSFS TGALEGANFLATG+LVSLSEQQLVDCDHECDPEEADSC
Sbjct: 147 DHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSC 206

Query: 213 DSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDED 272
           DSGCNGGLMNSAFEYTLK GGLM+EEDYPYTG D   C  DKSKI ASV+NFSV+S+DE+
Sbjct: 207 DSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEE 266

Query: 273 QIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDK 332
           QIAANLVKNGPLA+AINA +MQTYIGGVSCP+IC+RRL+HGVLLVGYG+AGYAP R ++K
Sbjct: 267 QIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEK 326

Query: 333 DYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVAA 371
            YWIIKNSWGE WGENG+YKIC+GRNICGVDS+VSTVAA
Sbjct: 327 PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAA 361

BLAST of Bhi05G001543 vs. TAIR 10
Match: AT2G21430.1 (Papain family cysteine protease )

HSP 1 Score: 538.1 bits (1385), Expect = 5.7e-153
Identity = 256/342 (74.85%), Positives = 295/342 (86.26%), Query Frame = 0

Query: 29  SIERDGDPLIRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMR 88
           S+  D D LIRQVVD+        L +E HF+LFK++FGK Y + EEH  RF +F+AN+ 
Sbjct: 21  SVCGDEDVLIRQVVDE---TEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLL 80

Query: 89  RAQRHQSFDPSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDF 148
           RA RHQ  DPSA HG+TQFSDLT  EFR+  LG++G   +LP D N APILPT+NLP +F
Sbjct: 81  RAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG-GFKLPKDANQAPILPTQNLPEEF 140

Query: 149 DWRERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEA 208
           DWR+RGAVTPVKNQGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDHECDPEE 
Sbjct: 141 DWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEE 200

Query: 209 DSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSL 268
            SCDSGCNGGLMNSAFEYTLK GGLMRE+DYPYTGTD G+C  D+SKI ASV+NFSVVS+
Sbjct: 201 GSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSI 260

Query: 269 DEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRM 328
           +EDQIAANL+KNGPLA+AINA +MQTYIGGVSCP+ICSRRL+HGVLLVGYGSAG++  R+
Sbjct: 261 NEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARL 320

Query: 329 RDKDYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVAA 371
           ++K YWIIKNSWGE+WGENG+YKIC+GRNICGVDSLVSTVAA
Sbjct: 321 KEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of Bhi05G001543 vs. TAIR 10
Match: AT4G16190.1 (Papain family cysteine protease )

HSP 1 Score: 528.9 bits (1361), Expect = 3.5e-150
Identity = 248/337 (73.59%), Positives = 292/337 (86.65%), Query Frame = 0

Query: 38  IRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFD 97
           IRQVV +   N+   L AEHHF+LFK ++ K+YAT+ EHD RF++F+AN+RRA+R+Q  D
Sbjct: 36  IRQVVPEE--NDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLD 95

Query: 98  PSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVT 157
           PSA+HG+TQFSDLTP EFR+ FLGL+    RLP DT  APILPT +LP +FDWRE+GAVT
Sbjct: 96  PSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVT 155

Query: 158 PVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNG 217
           PVKNQG CGSCWSFS  GALEGA+FLAT ELVSLSEQQLVDCDHECDP +A+SCDSGC+G
Sbjct: 156 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 215

Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANL 277
           GLMN+AFEY LKAGGLM+EEDYPYTG D   C FDKSKI ASV+NFSVVS DEDQIAANL
Sbjct: 216 GLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANL 275

Query: 278 VKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIK 337
           V++GPLAIAINA++MQTYIGGVSCP++CS+  DHGVLLVG+GS+GYAPIR+++K YWIIK
Sbjct: 276 VQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIK 335

Query: 338 NSWGENWGENGYYKICRG-RNICGVDSLVSTVAAVHT 374
           NSWG  WGE+GYYKICRG  N+CG+D++VSTVAAVHT
Sbjct: 336 NSWGAMWGEHGYYKICRGPHNMCGMDTMVSTVAAVHT 370

BLAST of Bhi05G001543 vs. TAIR 10
Match: AT3G54940.2 (Papain family cysteine protease )

HSP 1 Score: 411.8 bits (1057), Expect = 6.2e-115
Identity = 198/344 (57.56%), Positives = 253/344 (73.55%), Query Frame = 0

Query: 35  DPLIRQVVDDGN--FNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQR 94
           D  IRQV  D      N L    E  F LF   +GK+Y+T EE+  R  IF  N+ +A  
Sbjct: 25  DLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAE 84

Query: 95  HQSFDPSAIHGITQFSDLTPFEFRKAFLGL------RGHRLRLPVDTNAAPILPTENLPI 154
           HQ  DPSA+HG+TQFSDLT  EF++ + G+      RG  +        AP++  + LP 
Sbjct: 85  HQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGA-----EAPMVEVDGLPE 144

Query: 155 DFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPE 214
           DFDWRE+G VT VKNQG+CGSCW+FSTTGA EGA+F++TG+L+SLSEQQLVDCD  CDP+
Sbjct: 145 DFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPK 204

Query: 215 EADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVV 274
           +  +CD+GC GGLM +A+EY ++AGGL  E  YPYTG  RG+C FD  K+A  V NF+ +
Sbjct: 205 DKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTG-KRGHCKFDPEKVAVRVLNFTTI 264

Query: 275 SLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRR-LDHGVLLVGYGSAGYAP 334
            LDE+QIAANLV++GPLA+ +NAVFMQTYIGGVSCP ICS+R ++HGVLLVGYGS G++ 
Sbjct: 265 PLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSI 324

Query: 335 IRMRDKDYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVA 370
           +R+ +K YWIIKNSWG+ WGENGYYK+CRG +ICG++S+VS VA
Sbjct: 325 LRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVA 362

BLAST of Bhi05G001543 vs. TAIR 10
Match: AT3G19390.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 235.0 bits (598), Expect = 1.0e-61
Identity = 132/309 (42.72%), Positives = 180/309 (58.25%), Query Frame = 0

Query: 68  KSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIH-GITQFSDLTPFEFRKAFLGLRGHR 127
           K+Y    E +RRF+IF+ N++  + H S        G+T+F+DLT  EFR  +L  +  R
Sbjct: 52  KNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMER 111

Query: 128 LRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATG 187
            R+PV          ++LP   DWR +GAV PVK+QGSCGSCW+FS  GA+EG N + TG
Sbjct: 112 TRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTG 171

Query: 188 ELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 247
           EL+SLSEQ+LVDCD         S + GC GGLM+ AF++ ++ GG+  EEDYPY  TD 
Sbjct: 172 ELISLSEQELVDCD--------TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDV 231

Query: 248 GNCNFDKSKI-AASVANFSVVSLDEDQIAANLVKNGPLAIAINA--VFMQTYIGGVSCPF 307
             CN DK      ++  +  V  ++++     + N P+++AI A     Q Y  GV    
Sbjct: 232 NVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTG- 291

Query: 308 ICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNI----- 367
            C   LDHGV+ VGYGS G        +DYWI++NSWG NWGE+GY+K+   RNI     
Sbjct: 292 TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFKL--ERNIKESSG 342

BLAST of Bhi05G001543 vs. ExPASy Swiss-Prot
Match: P43296 (Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 5.0e-154
Identity = 263/339 (77.58%), Positives = 294/339 (86.73%), Query Frame = 0

Query: 33  DGDPL-IRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQ 92
           DGD L IRQVV  G    ++ L +E HFSLFK++FGK YA+ EEHD RF +F+AN+RRA+
Sbjct: 27  DGDDLVIRQVV--GGAEPQV-LTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRAR 86

Query: 93  RHQSFDPSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDFDWR 152
           RHQ  DPSA HG+TQFSDLT  EFRK  LG+R    +LP D N APILPTENLP DFDWR
Sbjct: 87  RHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-GFKLPKDANKAPILPTENLPEDFDWR 146

Query: 153 ERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADSC 212
           + GAVTPVKNQGSCGSCWSFS TGALEGANFLATG+LVSLSEQQLVDCDHECDPEEADSC
Sbjct: 147 DHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSC 206

Query: 213 DSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDED 272
           DSGCNGGLMNSAFEYTLK GGLM+EEDYPYTG D   C  DKSKI ASV+NFSV+S+DE+
Sbjct: 207 DSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEE 266

Query: 273 QIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDK 332
           QIAANLVKNGPLA+AINA +MQTYIGGVSCP+IC+RRL+HGVLLVGYG+AGYAP R ++K
Sbjct: 267 QIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEK 326

Query: 333 DYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVAA 371
            YWIIKNSWGE WGENG+YKIC+GRNICGVDS+VSTVAA
Sbjct: 327 PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAA 361

BLAST of Bhi05G001543 vs. ExPASy Swiss-Prot
Match: P43295 (Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 SV=2)

HSP 1 Score: 538.1 bits (1385), Expect = 8.0e-152
Identity = 256/342 (74.85%), Positives = 295/342 (86.26%), Query Frame = 0

Query: 29  SIERDGDPLIRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMR 88
           S+  D D LIRQVVD+        L +E HF+LFK++FGK Y + EEH  RF +F+AN+ 
Sbjct: 21  SVCGDEDVLIRQVVDE---TEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLL 80

Query: 89  RAQRHQSFDPSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDF 148
           RA RHQ  DPSA HG+TQFSDLT  EFR+  LG++G   +LP D N APILPT+NLP +F
Sbjct: 81  RAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG-GFKLPKDANQAPILPTQNLPEEF 140

Query: 149 DWRERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEA 208
           DWR+RGAVTPVKNQGSCGSCWSFSTTGALEGA+FLATG+LVSLSEQQLVDCDHECDPEE 
Sbjct: 141 DWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEE 200

Query: 209 DSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSL 268
            SCDSGCNGGLMNSAFEYTLK GGLMRE+DYPYTGTD G+C  D+SKI ASV+NFSVVS+
Sbjct: 201 GSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSI 260

Query: 269 DEDQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRM 328
           +EDQIAANL+KNGPLA+AINA +MQTYIGGVSCP+ICSRRL+HGVLLVGYGSAG++  R+
Sbjct: 261 NEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARL 320

Query: 329 RDKDYWIIKNSWGENWGENGYYKICRGRNICGVDSLVSTVAA 371
           ++K YWIIKNSWGE+WGENG+YKIC+GRNICGVDSLVSTVAA
Sbjct: 321 KEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of Bhi05G001543 vs. ExPASy Swiss-Prot
Match: Q9SUL1 (Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 4.9e-149
Identity = 248/337 (73.59%), Positives = 292/337 (86.65%), Query Frame = 0

Query: 38  IRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFD 97
           IRQVV +   N+   L AEHHF+LFK ++ K+YAT+ EHD RF++F+AN+RRA+R+Q  D
Sbjct: 36  IRQVVPEE--NDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLD 95

Query: 98  PSAIHGITQFSDLTPFEFRKAFLGLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVT 157
           PSA+HG+TQFSDLTP EFR+ FLGL+    RLP DT  APILPT +LP +FDWRE+GAVT
Sbjct: 96  PSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVT 155

Query: 158 PVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNG 217
           PVKNQG CGSCWSFS  GALEGA+FLAT ELVSLSEQQLVDCDHECDP +A+SCDSGC+G
Sbjct: 156 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 215

Query: 218 GLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANL 277
           GLMN+AFEY LKAGGLM+EEDYPYTG D   C FDKSKI ASV+NFSVVS DEDQIAANL
Sbjct: 216 GLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANL 275

Query: 278 VKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIK 337
           V++GPLAIAINA++MQTYIGGVSCP++CS+  DHGVLLVG+GS+GYAPIR+++K YWIIK
Sbjct: 276 VQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIK 335

Query: 338 NSWGENWGENGYYKICRG-RNICGVDSLVSTVAAVHT 374
           NSWG  WGE+GYYKICRG  N+CG+D++VSTVAAVHT
Sbjct: 336 NSWGAMWGEHGYYKICRGPHNMCGMDTMVSTVAAVHT 370

BLAST of Bhi05G001543 vs. ExPASy Swiss-Prot
Match: P25804 (Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 3.6e-144
Identity = 251/375 (66.93%), Positives = 306/375 (81.60%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFS 60
           MD  F     + AA AT  + ++         + D +IRQVVD  N  + L L AEHHF+
Sbjct: 1   MDRRFLFALFLFAAVATAVTDDT--------NNDDFIIRQVVD--NEEDHL-LNAEHHFT 60

Query: 61  LFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFL 120
            FK +F KSYAT+EEHD RF +F++N+ +A+ HQ+ DP+A HGIT+FSDLT  EFR+ FL
Sbjct: 61  SFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFL 120

Query: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGA 180
           GL+  RLRLP     APILPT NLP DFDWRE+GAVTPVK+QGSCGSCW+FSTTGALEGA
Sbjct: 121 GLK-KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGA 180

Query: 181 NFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
           ++LATG+LVSLSEQQLVDCDH CDPE+A SCDSGCNGGLMN+AFEY L++GG+++E+DY 
Sbjct: 181 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYA 240

Query: 241 YTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS 300
           YTG D G+C FDKSK+ ASV+NFSVV+LDEDQIAANLVKNGPLA+AINA +MQTY+ GVS
Sbjct: 241 YTGRD-GSCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVS 300

Query: 301 CPFICSR-RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC 360
           CP++C++ RLDHGVLLVG+G   YAPIR+++K YWIIKNSWG+NWGE GYYKICRGRN+C
Sbjct: 301 CPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVC 360

Query: 361 GVDSLVSTVAAVHTH 375
           GVDS+VSTVAA  ++
Sbjct: 361 GVDSMVSTVAAAQSN 362

BLAST of Bhi05G001543 vs. ExPASy Swiss-Prot
Match: Q10716 (Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 9.2e-140
Identity = 242/345 (70.14%), Positives = 277/345 (80.29%), Query Frame = 0

Query: 35  DPLIRQVVDDGNFNNRLPLGAEHHFSLFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQ 94
           DPLIRQVV  G+ +N L L AE HF  F QRFGKSY   +EH  R  +F+ N+RRA+RHQ
Sbjct: 25  DPLIRQVVPGGD-DNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQ 84

Query: 95  SFDPSAIHGITQFSDLTPFEFRKAFLGLRGHR----LRLPVDTNAAPILPTENLPIDFDW 154
             DPSA HG+T+FSDLTP EFR+ +LGLR  R      L    + AP+LPT+ LP DFDW
Sbjct: 85  LLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDW 144

Query: 155 RERGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEADS 214
           R+ GAV PVKNQGSCGSCWSFS +GALEGA++LATG+L  LSEQQ VDCDHECD  E DS
Sbjct: 145 RDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDS 204

Query: 215 CDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGNCNFDKSKIAASVANFSVVSLDE 274
           CDSGCNGGLM +AF Y  KAGGL  E+DYPYTG+D G C FDKSKI ASV NFSVVS+DE
Sbjct: 205 CDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSVDE 264

Query: 275 DQIAANLVKNGPLAIAINAVFMQTYIGGVSCPFICSRRLDHGVLLVGYGSAGYAPIRMRD 334
            QI+ANL+K+GPLAI INA +MQTYIGGVSCP+IC R LDHGVLLVGYG++G+APIR++D
Sbjct: 265 AQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRLKD 324

Query: 335 KDYWIIKNSWGENWGENGYYKICRG---RNICGVDSLVSTVAAVH 373
           K YWIIKNSWGENWGENGYYKICRG   RN CGVDS+VSTV+AVH
Sbjct: 325 KPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVH 367

BLAST of Bhi05G001543 vs. ExPASy TrEMBL
Match: A0A1S3BIU9 (cysteine proteinase RD19a-like OS=Cucumis melo OX=3656 GN=LOC103490359 PE=3 SV=1)

HSP 1 Score: 696.0 bits (1795), Expect = 8.5e-197
Identity = 341/382 (89.27%), Positives = 355/382 (92.93%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFS 60
           MD +F LFAVI A  ATLC SESLTSP S++ D DP IRQVV+D    NR  LGAEHHFS
Sbjct: 1   MDLNFFLFAVITA--ATLCLSESLTSPHSVQHDRDPFIRQVVEDDGDFNRHALGAEHHFS 60

Query: 61  LFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFL 120
           LFK+RFGKSYATEEEHDRRFKIF+ANMRRA+RHQSFDPSAIHGITQFSDLTPFEFRKAFL
Sbjct: 61  LFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGITQFSDLTPFEFRKAFL 120

Query: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGA 180
           GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVT VKNQGSCGSCWSFSTTGA+EGA
Sbjct: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTRVKNQGSCGSCWSFSTTGAIEGA 180

Query: 181 NFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
           NFLATG+LVSLSEQQLVDCDHECDPEE D+CDSGCNGGLMNSAFEYTLKAGGLM+E+DYP
Sbjct: 181 NFLATGKLVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYP 240

Query: 241 YTGTDRGNCNFDKSKIAASVANFSVV-SLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV 300
           YTGTD   CNFDKSKIAAS+ANFSVV SLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV
Sbjct: 241 YTGTDHSTCNFDKSKIAASIANFSVVNSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV 300

Query: 301 SCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC 360
           SCPFICS RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC
Sbjct: 301 SCPFICSNRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC 360

Query: 361 GVDSLVSTVAAVHTHDTSIAAA 382
           GVDSLVSTVAAVH H TS  A+
Sbjct: 361 GVDSLVSTVAAVHIHHTSSIAS 380

BLAST of Bhi05G001543 vs. ExPASy TrEMBL
Match: A0A0A0K2B5 (Papain-like cysteine proteinase isoform I OS=Cucumis sativus OX=3659 GN=Csa_7G004120 PE=3 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 3.6e-195
Identity = 339/385 (88.05%), Positives = 359/385 (93.25%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVD-DGNFNNRLPLGAEHHF 60
           MD +F LFAVI A TATLCSSE L S  S+E DGDPLIRQVV+ DG+FN+   LGAEHHF
Sbjct: 1   MDRNFFLFAVITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNHH-ALGAEHHF 60

Query: 61  SLFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAF 120
           SLFK+RFGKSYATEEEHDRRFKIF+ANMRRA+RHQSFDPSAIHG+TQFSDLTPFEFRKAF
Sbjct: 61  SLFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAF 120

Query: 121 LGLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEG 180
           LGLRGHRLRLPVDTNAAPILPTENLPIDFDWR+ G VT VKNQGSCGSCWSFSTTGALEG
Sbjct: 121 LGLRGHRLRLPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEG 180

Query: 181 ANFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDY 240
           ANFLATGELVSLSEQQLVDCDHECDPEE D+CDSGCNGGLMNSAFEYTLKAGGLM+E+DY
Sbjct: 181 ANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDY 240

Query: 241 PYTGTDRGNCNFDKSKIAASVANFSVV-SLDEDQIAANLVKNGPLAIAINAVFMQTYIGG 300
           PY G DR  CNFDKSKIAAS+A+FSVV S+DEDQIAANLVKNGPLAIAINAVFMQTYIGG
Sbjct: 241 PYAGIDRNTCNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIGG 300

Query: 301 VSCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNI 360
           VSCPFICS+RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGE+WGENGYYKICRGRNI
Sbjct: 301 VSCPFICSKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKICRGRNI 360

Query: 361 CGVDSLVSTVAAVHT--HDTSIAAA 382
           CGVDSLVSTVAAVH   H +SIA+A
Sbjct: 361 CGVDSLVSTVAAVHIHHHSSSIASA 384

BLAST of Bhi05G001543 vs. ExPASy TrEMBL
Match: A0A5A7SS65 (Cysteine proteinase RD19a-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold142G00480 PE=3 SV=1)

HSP 1 Score: 680.6 bits (1755), Expect = 3.7e-192
Identity = 337/382 (88.22%), Positives = 352/382 (92.15%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFS 60
           MD +F LFAVI A  ATLC SESLTSP S++ D DP IRQVV+D +  NR  LGAEHHFS
Sbjct: 1   MDLNFFLFAVITA--ATLCLSESLTSPHSVQHDRDPFIRQVVEDDSDFNRHALGAEHHFS 60

Query: 61  LFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFL 120
           LFK+RFGKSYATEEEHDRRFKIF+ANMRRA+RHQSFDPSAIHGITQFSDLTPFEFRKAFL
Sbjct: 61  LFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGITQFSDLTPFEFRKAFL 120

Query: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGA 180
           GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVT VKNQGSCGSCWSFSTTGA+EGA
Sbjct: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTRVKNQGSCGSCWSFSTTGAIEGA 180

Query: 181 NFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
           NFLATG+LVSLSEQQLVDCDH    EE D+CDSGCNGGLMNSAFEYTLKAGGLM+E+DYP
Sbjct: 181 NFLATGKLVSLSEQQLVDCDH----EEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYP 240

Query: 241 YTGTDRGNCNFDKSKIAASVANFSVV-SLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV 300
           YTGTD   CNFDKSKIAAS+ANFSVV SLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV
Sbjct: 241 YTGTDHSTCNFDKSKIAASIANFSVVNSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGV 300

Query: 301 SCPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC 360
           SCPFICS RLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC
Sbjct: 301 SCPFICSNRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNIC 360

Query: 361 GVDSLVSTVAAVHTHDTSIAAA 382
           GVDSLVSTVAAVH H TS  A+
Sbjct: 361 GVDSLVSTVAAVHIHHTSSIAS 376

BLAST of Bhi05G001543 vs. ExPASy TrEMBL
Match: A0A6J1F4R7 (probable cysteine protease RD19B OS=Cucurbita moschata OX=3662 GN=LOC111440857 PE=3 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 1.2e-182
Identity = 317/373 (84.99%), Positives = 338/373 (90.62%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFS 60
           MD SF LF+V+ AA A LCSSESL S  S+     PLIRQV+DDG  +N LPL AEHHF 
Sbjct: 1   MDRSFFLFSVVIAAAAALCSSESLASTYSV----SPLIRQVIDDGESSN-LPLQAEHHFL 60

Query: 61  LFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFL 120
           LFK++FGKSYATEEEH+RRF+IF+ANMRRA RHQSFDPSAIHG+TQFSDLT  EF+K FL
Sbjct: 61  LFKRKFGKSYATEEEHNRRFRIFKANMRRALRHQSFDPSAIHGVTQFSDLTVSEFQKTFL 120

Query: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGA 180
           GLRGHRL+LP+D N APILPTENLP  FDWRERGAVTPVKNQG+CGSCWSFSTTGALEGA
Sbjct: 121 GLRGHRLKLPLDANQAPILPTENLPGGFDWRERGAVTPVKNQGTCGSCWSFSTTGALEGA 180

Query: 181 NFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
           NFLATGELVSLSEQQLVDCDHECD EEA SCDSGCNGGLMNSA EYTLK GGLMRE+DYP
Sbjct: 181 NFLATGELVSLSEQQLVDCDHECDSEEAGSCDSGCNGGLMNSALEYTLKVGGLMREQDYP 240

Query: 241 YTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS 300
           YTGTDR  C FD+SKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS
Sbjct: 241 YTGTDRETCKFDRSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS 300

Query: 301 CPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNICG 360
           CPFICS+RLDHGVLLVGYGSAGYAPIRMRD+DYWIIKNSWG NWGENGYY+IC+GRNICG
Sbjct: 301 CPFICSKRLDHGVLLVGYGSAGYAPIRMRDRDYWIIKNSWGPNWGENGYYRICKGRNICG 360

Query: 361 VDSLVSTVAAVHT 374
           VDSLVSTVAAV T
Sbjct: 361 VDSLVSTVAAVQT 368

BLAST of Bhi05G001543 vs. ExPASy TrEMBL
Match: A0A6J1I0L8 (probable cysteine protease RD19B OS=Cucurbita maxima OX=3661 GN=LOC111469766 PE=3 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 1.6e-182
Identity = 319/373 (85.52%), Positives = 337/373 (90.35%), Query Frame = 0

Query: 1   MDWSFSLFAVIAAATATLCSSESLTSPRSIERDGDPLIRQVVDDGNFNNRLPLGAEHHFS 60
           MD SF LF V+ AA A LCSSESL S  S+     PLIRQV+DDG  NN LPL AEHHF 
Sbjct: 1   MDRSFLLFPVVIAA-AVLCSSESLASTYSV----SPLIRQVIDDGESNN-LPLQAEHHFL 60

Query: 61  LFKQRFGKSYATEEEHDRRFKIFEANMRRAQRHQSFDPSAIHGITQFSDLTPFEFRKAFL 120
           LFK++FGKSYATEEEH+RRF+IF ANMRRA RHQSFDPSAIHG+TQFSDLT  EF+K FL
Sbjct: 61  LFKRKFGKSYATEEEHNRRFRIFRANMRRALRHQSFDPSAIHGVTQFSDLTVSEFQKTFL 120

Query: 121 GLRGHRLRLPVDTNAAPILPTENLPIDFDWRERGAVTPVKNQGSCGSCWSFSTTGALEGA 180
           GLRGHRL+LP+  N APILPTENLP DFDWRERGAVTPVKNQG+CGSCWSFSTTGALEGA
Sbjct: 121 GLRGHRLKLPLHANQAPILPTENLPGDFDWRERGAVTPVKNQGTCGSCWSFSTTGALEGA 180

Query: 181 NFLATGELVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKAGGLMREEDYP 240
           NFLATGELVSLSEQQLVDCDHECD EEA SCDSGCNGGLMNSA EYTLK GGLMRE+DYP
Sbjct: 181 NFLATGELVSLSEQQLVDCDHECDSEEAGSCDSGCNGGLMNSALEYTLKVGGLMREQDYP 240

Query: 241 YTGTDRGNCNFDKSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS 300
           YTGTDR  C FD+SKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS
Sbjct: 241 YTGTDRETCKFDRSKIAASVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYIGGVS 300

Query: 301 CPFICSRRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGENWGENGYYKICRGRNICG 360
           CPFICS+RLDHGVLLVGYGSAGYAPIRMRD+DYWIIKNSWG NWGENGYY+IC+GRNICG
Sbjct: 301 CPFICSKRLDHGVLLVGYGSAGYAPIRMRDRDYWIIKNSWGPNWGENGYYRICKGRNICG 360

Query: 361 VDSLVSTVAAVHT 374
           VDSLVSTVAAVHT
Sbjct: 361 VDSLVSTVAAVHT 367

The following BLAST results are available for this feature:

BLAST vs. TAIR 10
Match Name   E-value    Identity (%)  Description
AT4G39090.1  3.6e-155   77.58         Papain family cysteine protease
AT2G21430.1  5.7e-153   74.85         Papain family cysteine protease
AT4G16190.1  3.5e-150   73.59         Papain family cysteine protease
AT3G54940.2  6.2e-115   57.56         Papain family cysteine protease
AT3G19390.1  1.0e-61    42.72         Granulin repeat cysteine protease family protein

BLAST vs. ExPASy Swiss-Prot
Match Name   E-value    Identity (%)  Description
P43296       5.0e-154   77.58         Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1
P43295       8.0e-152   74.85         Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 S...
Q9SUL1       4.9e-149   73.59         Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 S...
P25804       3.6e-144   66.93         Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1
Q10716       9.2e-140   70.14         Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1

BLAST vs. ExPASy TrEMBL
Match Name   E-value    Identity (%)  Description
A0A1S3BIU9   8.5e-197   89.27         cysteine proteinase RD19a-like OS=Cucumis melo OX=3656 GN=LOC103490359 PE=3 SV=1
A0A0A0K2B5   3.6e-195   88.05         Papain-like cysteine proteinase isoform I OS=Cucumis sativus OX=3659 GN=Csa_7G00...
A0A5A7SS65   3.7e-192   88.22         Cysteine proteinase RD19a-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A6J1F4R7   1.2e-182   84.99         probable cysteine protease RD19B OS=Cucurbita moschata OX=3662 GN=LOC111440857 P...
A0A6J1I0L8   1.6e-182   85.52         probable cysteine protease RD19B OS=Cucurbita maxima OX=3661 GN=LOC111469766 PE=...

InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term   IPR Description                              Source       Source Term      Source Description                   Alignment
IPR000668  Peptidase C1A, papain C-terminal             PRINTS       PR00705          PAPAIN                               coord: 311..321; score: 58.8
IPR000668  Peptidase C1A, papain C-terminal             PRINTS       PR00705          PAPAIN                               coord: 333..339; score: 75.14
IPR000668  Peptidase C1A, papain C-terminal             PRINTS       PR00705          PAPAIN                               coord: 162..177; score: 64.12
IPR000668  Peptidase C1A, papain C-terminal             SMART        SM00645          pept_c1                              coord: 144..369; e-value: 1.7E-108; score: 376.4
IPR000668  Peptidase C1A, papain C-terminal             PFAM         PF00112          Peptidase_C1                         coord: 144..367; e-value: 2.2E-73; score: 246.9
IPR013201  Cathepsin propeptide inhibitor domain (I29)  SMART        SM00848          Inhibitor_I29_2                      coord: 59..115; e-value: 2.6E-19; score: 80.1
IPR013201  Cathepsin propeptide inhibitor domain (I29)  PFAM         PF08246          Inhibitor_I29                        coord: 59..115; e-value: 6.2E-10; score: 39.4
None       No IPR available                             GENE3D       3.90.70.10       Cysteine proteinases                 coord: 41..369; e-value: 6.7E-97; score: 327.1
None       No IPR available                             PANTHER      PTHR12411:SF783  CYSTEINE PROTEASE RD19C-RELATED      coord: 31..370
None       No IPR available                             PANTHER      PTHR12411        CYSTEINE PROTEASE FAMILY C1-RELATED  coord: 31..370
IPR000169  Cysteine peptidase, cysteine active site     PROSITE      PS00139          THIOL_PROTEASE_CYS                   coord: 162..173
IPR025660  Cysteine peptidase, histidine active site    PROSITE      PS00639          THIOL_PROTEASE_HIS                   coord: 309..319
IPR025661  Cysteine peptidase, asparagine active site   PROSITE      PS00640          THIOL_PROTEASE_ASN                   coord: 333..352
IPR039417  Papain-like cysteine endopeptidase           CDD          cd02248          Peptidase_C1A                        coord: 145..368; e-value: 2.74694E-104; score: 304.547
IPR038765  Papain-like cysteine peptidase superfamily   SUPERFAMILY  54001            Cysteine proteinases                 coord: 57..368

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi05M001543  Bhi05M001543  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0051603      proteolysis involved in cellular protein catabolic process
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005615      extracellular space
cellular_component  GO:0005764      lysosome
molecular_function  GO:0004197      cysteine-type endopeptidase activity
molecular_function  GO:0008234      cysteine-type peptidase activity