Lsi10G000950 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi10G000950
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Cathepsin B-like cysteine proteinase 1
Location: chr10 : 1518714 .. 1520941 (-)
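
The span covered by the coordinates above can be checked with a one-line calculation. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the page does not state the convention); `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed 1-based)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
print(feature_length(1518714, 1520941))  # prints 2228
```

Under that assumption the gene spans 2228 bp on chr10. The "(-)" presumably marks the reverse strand, which does not affect the length.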
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGGTCAGAGGAGAGGGAGAGATGATTGAGAAAGAAAAGAAAAATATTTAAAATAATATAAAAAATACTTAATTAATGTAGTAGAAAAGTTTTGGCTAACTTGGCCAAATTAATACTAAGGGACCAAATATACCTAGTCCTAAAAATTCGACGACTAAAAAGATATTTTTTTTCCTAGAATTAAACTTTATGATACCAACCTAACTGAGATATTTAAACACTATGAGTATCCTAAAACTAAGGGGTATTTGGGCGTTGGGTTGGTAATTATAATCAATAGTTTTTATTGTCTATGGGTTATGATAGTTTGTAGGTTATAATAGTCTATGTTTGGGATGCATACTATTTGAATTGTTTGAGTAGGAAATAACAAACATGGTAACAAAGAGAGAAAAAGAGGATGAGTAGGAAATATTAAACACTATAACAAAAAGCAATAATCGCAGGTTATAATAGTTGAAAATACCAACTATTATAATTGGAACCTTAAACATAAAGTGAACTATAATAGTTCACTCCACCTACTTGAAGTTGAAACCCAAACACCCCCTAAATATGGTAGGTTCAACCATTTGTGCTAACCTTTGTTCAATGTCATTTTTAATTGTTAGATTTCTTATTTCTCATTTTTTGTTTAATTATTTACGGGAAAAATGTGATTATTCATTTCTCACCTTTTTTTATGAACTTTAATCATTTTACTTACAAGAACACTTGCATTATTATGTAGTACACCCAAATCATTTTCTACCACAGAAGTGAAGCACAATTGATCTATTTTCTTTTCGTGGGTATCTATCTTAAGCAGAAATGTAGTGTTTCCACCCTTTTTTGCCGTGCTGGAAAGATGAAACAACAGACTAGCCCCTACTGAAACTTCAAGGTGTGAAATACATGAATCCAAGAATTTCTTAAGTATCAAAACTTCATCCTCTTGCAGTTGGGAATGGATCAATATATAACTAGAATTGATGTGGAGAATGAAAACATAATTAGCTCGATCCGCAATCGGAAAAAGGCCATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGATGTCTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGTATCCATGGCAATGGACTACCCTCCAGGATCTGATTCCAGTGACTTACAAGACAGGTACCAGAATTGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGCGGGAGCGGAGATTCACAATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAACTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTATCGAACTGATTGGCTTCCTGATACATGCTTCAGATATGGAAATATGGTAAATTTGCCTACTAATGTTGACTGGAGAAAGGAAGGTGCAGTTACTCCAATAAAGAATCAAGGCCAATGTGGTATGAACTCATAATACTCATTCATCTAACTAATCAGAAATCAGAATGAAAATGTTCTCTTTTCGTCGGATTTTGAGTTCTTTTTCCTAACAACTACCCTACCCAACTCTATCCATTGTAAGTCATCCCTTCGTATTTGAACAAACTCACCAGTTTTGAATTTTTGACACAGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATAAAAACAGGCAAATTGATGTCTCTATCAGAACAAGAGCTTGTGGACTGCGATGTGGTCTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCGTCAAGAAAAGTGGAATCACTACCGAAACAGAATATCCATACAGGGGAACTGAATCTGTATGCAACAAACAAAAAGTGAGATACCACACTGTGACAATAACTGGATATGAAAAAGTACCTGTCAATGATGAGAAAAGCTTAAAAGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTT
CAGTTCTATTCTGGTGGAGTCTACTCAGGCAATTGTGGAAAGCAACTCAATCATGGAGTGTCAATAGTTGGGTATGGGGAAGCTAGTAGTAAACCTTATTGGCTTGTTAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTATATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

mRNA sequence

ATGACGTTGGGAATGGATCAATATATAACTAGAATTGATGTGGAGAATGAAAACATAATTAGCTCGATCCGCAATCGGAAAAAGGCCATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGATGTCTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGTATCCATGGCAATGGACTACCCTCCAGGATCTGATTCCAGTGACTTACAAGACAGGTACCAGAATTGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGCGGGAGCGGAGATTCACAATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAACTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTATCGAACTGATTGGCTTCCTGATACATGCTTCAGATATGGAAATATGGTAAATTTGCCTACTAATGTTGACTGGAGAAAGGAAGGTGCAGTTACTCCAATAAAGAATCAAGGCCAATGTGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATAAAAACAGGCAAATTGATGTCTCTATCAGAACAAGAGCTTGTGGACTGCGATGTGGTCTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCGTCAAGAAAAGTGGAATCACTACCGAAACAGAATATCCATACAGGGGAACTGAATCTGTATGCAACAAACAAAAAGTGAGATACCACACTGTGACAATAACTGGATATGAAAAAGTACCTGTCAATGATGAGAAAAGCTTAAAAGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAGTTCTATTCTGGTGGAGTCTACTCAGGCAATTGTGGAAAGCAACTCAATCATGGAGTGTCAATAGTTGGGTATGGGGAAGCTAGTAGTAAACCTTATTGGCTTGTTAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTATATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

Coding sequence (CDS)

ATGACGTTGGGAATGGATCAATATATAACTAGAATTGATGTGGAGAATGAAAACATAATTAGCTCGATCCGCAATCGGAAAAAGGCCATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGATGTCTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGTATCCATGGCAATGGACTACCCTCCAGGATCTGATTCCAGTGACTTACAAGACAGGTACCAGAATTGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGCGGGAGCGGAGATTCACAATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAACTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTATCGAACTGATTGGCTTCCTGATACATGCTTCAGATATGGAAATATGGTAAATTTGCCTACTAATGTTGACTGGAGAAAGGAAGGTGCAGTTACTCCAATAAAGAATCAAGGCCAATGTGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATAAAAACAGGCAAATTGATGTCTCTATCAGAACAAGAGCTTGTGGACTGCGATGTGGTCTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCGTCAAGAAAAGTGGAATCACTACCGAAACAGAATATCCATACAGGGGAACTGAATCTGTATGCAACAAACAAAAAGTGAGATACCACACTGTGACAATAACTGGATATGAAAAAGTACCTGTCAATGATGAGAAAAGCTTAAAAGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAGTTCTATTCTGGTGGAGTCTACTCAGGCAATTGTGGAAAGCAACTCAATCATGGAGTGTCAATAGTTGGGTATGGGGAAGCTAGTAGTAAACCTTATTGGCTTGTTAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTATATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

Protein sequence

MTLGMDQYITRIDVENENIISSIRNRKKAMEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD
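
The protein sequence above can be related back to the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first seven and last four codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGACGTTGGGAATGGATCAA"))  # first 7 codons of the CDS: MTLGMDQ
print(translate("ATCAAAGACTGA"))           # last 4 codons of the CDS: IKD*
```

The two fragments reproduce the start (MTLGMDQ) and end (IKD, followed by the stop codon) of the listed protein sequence.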
BLAST of Lsi10G000950 vs. Swiss-Prot
Match: CYSEP_VIGMU (Vignain OS=Vigna mungo PE=1 SV=1)

HSP 1 Score: 348.2 bits (892), Expect = 1.1e-94
Identity = 183/346 (52.89%), Positives = 234/346 (67.63%), Query Frame = 1

Query: 42  MSLILWVFWTPSMV---SMAMDYPPGSDSSD--LQDRYQNWMSKYGREYKSREERERRFT 101
           M  +LWV  + S+V   + + D+      S+  L D Y+ W S +    +S  E+ +RF 
Sbjct: 3   MKKLLWVVLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHHTVS-RSLGEKHKRFN 62

Query: 102 IYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YRTDWLPDTCFR 161
           +++ NV ++ N N ++  Y L  N FAD+TN EF++TY G        +R        F 
Sbjct: 63  VFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFM 122

Query: 162 YGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELV 221
           Y  + ++P +VDWRK+GAVT +K+QGQCGSCWAFS + AVEGIN+IKT KL+SLSEQELV
Sbjct: 123 YEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELV 182

Query: 222 DCDVVSGNQGCNGGYMYKAFEFVK-KSGITTETEYPYRGTESVCNKQKVRYHTVTITGYE 281
           DCD    NQGCNGG M  AFEF+K K GITTE+ YPY   E  C++ KV    V+I G+E
Sbjct: 183 DCDKEE-NQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHE 242

Query: 282 KVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYG-EAS 341
            VPVNDE +L  AVANQPVSVAIDAGG DFQFYS GV++G+C   LNHGV+IVGYG    
Sbjct: 243 NVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD 302

Query: 342 SKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
              YW+V+NSWG +WGE GYIRM+R+ + K G CGIAMMASYPIK+
Sbjct: 303 GTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKN 346
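
The percentages on each HSP statistics line are the ratios of identical and positive-scoring positions to the alignment length. As a rough sketch (the regular expression and the `hsp_percentages` helper are assumptions made for illustration, not part of any BLAST tool), they can be recomputed from the counts:

```python
import re

# Captures the identity and positives counts; "Pos\w+" tolerates
# spelling variants of the second label.
STATS = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives), rounded to two decimals."""
    match = STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, length, positives, pos_length = map(int, match.groups())
    return round(100 * ident / length, 2), round(100 * positives / pos_length, 2)


line = "Identity = 183/346 (52.89%), Positives = 234/346 (67.63%), Query Frame = 1"
print(hsp_percentages(line))  # prints (52.89, 67.63)
```

The recomputed values agree with the percentages reported for the CYSEP_VIGMU hit.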

BLAST of Lsi10G000950 vs. Swiss-Prot
Match: CYSEP_PHAVU (Vignain OS=Phaseolus vulgaris PE=2 SV=2)

HSP 1 Score: 343.2 bits (879), Expect = 3.6e-93
Identity = 181/355 (50.99%), Positives = 237/355 (66.76%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSSD--LQDRYQNWMSKYGREYKS 89
           M   +++W V   SL+L V       + + D+     +S+  L D Y+ W S +    +S
Sbjct: 1   MATKKLLWVVLSFSLVLGV-------ANSFDFHDKDLASEESLWDLYERWRSHHTVS-RS 60

Query: 90  REERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YRT 149
             E+ +RF +++ N+ ++ N N ++  Y L  N FAD+TN EF++TY G        +R 
Sbjct: 61  LGEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRG 120

Query: 150 DWLPDTCFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKL 209
               +  F Y  +V++P +VDWRK+GAVT +K+QGQCGSCWAFS V AVEGIN+IKT KL
Sbjct: 121 TPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKL 180

Query: 210 MSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVK-KSGITTETEYPYRGTESVCNKQKVRY 269
           ++LSEQELVDCD    NQGCNGG M  AFEF+K K GITTE+ YPY+  E  C+  KV  
Sbjct: 181 VALSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVND 240

Query: 270 HTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVS 329
             V+I G+E VP NDE +L  AVANQPVSVAIDAGG DFQFYS GV++G+C   LNHGV+
Sbjct: 241 LAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVA 300

Query: 330 IVGYG-EASSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
           IVGYG       YW+V+NSWG +WGE GYIRM+R+ + K G CGIAM+ SYPIK+
Sbjct: 301 IVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKN 346

BLAST of Lsi10G000950 vs. Swiss-Prot
Match: CEP2_ARATH (KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana GN=CEP2 PE=1 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 1.8e-92
Identity = 177/313 (56.55%), Positives = 215/313 (68.69%), Query Frame = 1

Query: 70  LQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDE 129
           L   Y  W S +    +S  ERE+RF +++ NV ++ N N  N SY L  N FADLT +E
Sbjct: 34  LSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINE 93

Query: 130 FKTTYLG-----YRTDWLPDT-----CFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSC 189
           FK  Y G     +R    P        + + N+  LP++VDWRK+GAVT IKNQG+CGSC
Sbjct: 94  FKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSC 153

Query: 190 WAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVKKSG-ITT 249
           WAFS VAAVEGINKIKT KL+SLSEQELVDCD    N+GCNGG M  AFEF+KK+G ITT
Sbjct: 154 WAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIAFEFIKKNGGITT 213

Query: 250 ETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQ 309
           E  YPY G +  C+  K     VTI G+E VP NDE +L  AVANQPVSVAIDAG  DFQ
Sbjct: 214 EDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQ 273

Query: 310 FYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRG 369
           FYS GV++G+CG +LNHGV+ VGYG    K YW+V+NSWG +WGE GYI+++R+  +  G
Sbjct: 274 FYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEG 333

Query: 370 TCGIAMMASYPIK 372
            CGIAM ASYPIK
Sbjct: 334 RCGIAMEASYPIK 344

BLAST of Lsi10G000950 vs. Swiss-Prot
Match: SAG39_ORYSJ (Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. japonica GN=SAG39 PE=2 SV=2)

HSP 1 Score: 340.9 bits (873), Expect = 1.8e-92
Identity = 171/315 (54.29%), Positives = 223/315 (70.79%), Query Frame = 1

Query: 65  SDSSDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFAD 124
           SD + +  R++ WM++YGR Y+   E+ RRF +++ NV +I++FN+ NH++ L  N FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87

Query: 125 LTNDEFKTTYLGYRTDWLPDTC-----FRYGNMVN---LPTNVDWRKEGAVTPIKNQGQC 184
           LTNDEF+  ++     ++P T      FRY N VN   LP  VDWR +GAVTPIK+QGQC
Sbjct: 88  LTNDEFR--WMKTNKGFIPSTTRVPTGFRYEN-VNIDALPATVDWRTKGAVTPIKDQGQC 147

Query: 185 GSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEF-VKKSG 244
           G CWAFSAVAA+EGI K+ TGKL+SLSEQELVDCDV   +QGC GG M  AF+F +K  G
Sbjct: 148 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 207

Query: 245 ITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGY 304
           +TTE+ YPY   +  C  + V     +I GYE VP N+E +L  AVANQPVSVA+D G  
Sbjct: 208 LTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDM 267

Query: 305 DFQFYSGGVYSGNCGKQLNHGVSIVGYGEAS-SKPYWLVKNSWGTDWGESGYIRMKRDST 364
            FQFY GGV +G+CG  L+HG+  +GYG+AS    YWL+KNSWGT WGE+G++RM++D +
Sbjct: 268 TFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDIS 327

Query: 365 DKRGTCGIAMMASYP 370
           DKRG CG+AM  SYP
Sbjct: 328 DKRGMCGLAMEPSYP 337

BLAST of Lsi10G000950 vs. Swiss-Prot
Match: SAG39_ORYSI (Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica GN=OsI_14861 PE=3 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 2.3e-92
Identity = 172/315 (54.60%), Positives = 222/315 (70.48%), Query Frame = 1

Query: 65  SDSSDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFAD 124
           SD + +  R++ WM++YGR Y+   E+ RRF +++ NV +I++FN+ NH++ L  N FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87

Query: 125 LTNDEFKTTYLGYRTDWLPDTC-----FRYGNMVN---LPTNVDWRKEGAVTPIKNQGQC 184
           LTNDEF+ T       ++P T      FRY N VN   LP  VDWR +GAVTPIK+QGQC
Sbjct: 88  LTNDEFRWTKTN--KGFIPSTTRVPTGFRYEN-VNIDALPATVDWRTKGAVTPIKDQGQC 147

Query: 185 GSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEF-VKKSG 244
           G CWAFSAVAA+EGI K+ TGKL+SLSEQELVDCDV   +QGC GG M  AF+F +K  G
Sbjct: 148 GCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGG 207

Query: 245 ITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGY 304
           +TTE+ YPY   +  C  + V     +I GYE VP N+E +L  AVANQPVSVA+D G  
Sbjct: 208 LTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDM 267

Query: 305 DFQFYSGGVYSGNCGKQLNHGVSIVGYGEAS-SKPYWLVKNSWGTDWGESGYIRMKRDST 364
            FQFY GGV +G+CG  L+HG+  +GYG+AS    YWL+KNSWGT WGE+G++RM++D +
Sbjct: 268 TFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDIS 327

Query: 365 DKRGTCGIAMMASYP 370
           DKRG CG+AM  SYP
Sbjct: 328 DKRGMCGLAMEPSYP 337

BLAST of Lsi10G000950 vs. TrEMBL
Match: A0A0A0LJV6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G292830 PE=3 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 1.7e-166
Identity = 282/340 (82.94%), Positives = 310/340 (91.18%), Query Frame = 1

Query: 35  MIW-NVGLMSLILWVFWTPSMVSMAMDYPPGSD-SSDLQDRYQNWMSKYGREYKSREERE 94
           M W NV L+ LILWVFWTP +VSMAMDY  GS  SSD+QDRYQ WM KYGR+YKSREE E
Sbjct: 1   MTWINVSLIFLILWVFWTPRLVSMAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWE 60

Query: 95  RRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRYGNM 154
           RRFTIYQ NVQYIDNFNS+NHS+TLAENNFADLTN+EFK TYLGY+T  +PDTCFRYGNM
Sbjct: 61  RRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNM 120

Query: 155 VNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDV 214
           VNLPTNVDWR+EGAVTPIKNQGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV
Sbjct: 121 VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 180

Query: 215 VSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVN 274
            SGNQGCNGGYMYKAFEF+K++G+TTE EYPY+G ES CN+QK +Y  V+I+GYEKVPVN
Sbjct: 181 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVN 240

Query: 275 DEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWL 334
           DEKSLKAAVANQPVSVAIDA G +FQFYSGG++SGNCG QLNHGV+IVGYGE S++ YWL
Sbjct: 241 DEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWL 300

Query: 335 VKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
           VKNSWGTDWGESGYIRMKRDSTD++GTCGIAMMASYP KD
Sbjct: 301 VKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMASYPTKD 340

BLAST of Lsi10G000950 vs. TrEMBL
Match: M5X1M2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023515mg PE=3 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 1.8e-120
Identity = 205/342 (59.94%), Positives = 257/342 (75.15%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSRE 89
           ME   ++    L  L++W+F   S        P  +D   +++RY+ W+ KYGR YK+RE
Sbjct: 1   METSMVLTRASLTFLMVWIFCISSTACSETYKPLRTDPKAMKERYERWLQKYGRIYKNRE 60

Query: 90  ERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRY 149
           E   RF +Y+ N++++D  NS N SY L +N FAD+TN EF  T++G++T   P T F Y
Sbjct: 61  EAAYRFGVYKSNIEFVDFVNSQNLSYKLTDNKFADITNLEFTKTFMGFQTRSHPKTKFSY 120

Query: 150 GNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 209
                LPT VDWRK GAVTPIKNQGQCGSCWAFSAVAAVEGIN+IKTGKL+SLSEQELVD
Sbjct: 121 DKDEELPTAVDWRKHGAVTPIKNQGQCGSCWAFSAVAAVEGINQIKTGKLVSLSEQELVD 180

Query: 210 CDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKV 269
           CDV +GN+GCNGGYM KAF F+K +G++TE +YPY+G++ +C++  ++   V I+GYE +
Sbjct: 181 CDVKTGNEGCNGGYMEKAFSFIKDNGLSTEKDYPYKGSDGICDEDSLKNSAVNISGYESI 240

Query: 270 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKP 329
           P N EKSL+AAVA+QPVSVA+DA GY FQFYS G ++G CGK LNHGV+ VGYGE S K 
Sbjct: 241 PANSEKSLQAAVAHQPVSVAVDAAGYAFQFYSSGTFTGQCGKNLNHGVTAVGYGEDSGKK 300

Query: 330 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 372
           YW+VKNSWG DWGESGYIRM RDS DK+GTCGIAM ASYP+K
Sbjct: 301 YWIVKNSWGPDWGESGYIRMTRDSVDKKGTCGIAMQASYPVK 342

BLAST of Lsi10G000950 vs. TrEMBL
Match: G7ZUL7_MEDTR (Cysteine proteinase OS=Medicago truncatula GN=MTR_2g090890 PE=3 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 2.8e-113
Identity = 195/331 (58.91%), Positives = 250/331 (75.53%), Query Frame = 1

Query: 42  MSLILWVFWTPSMVSMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSREERERRFTIYQLN 101
           +S+++   W  +     +     ++ + ++ RY+ W+ +YGR Y+ REE E RF IYQ N
Sbjct: 7   LSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSN 66

Query: 102 VQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRYGNMVNLPTNVDW 161
           VQYI+ +NS N+SY L +N FAD+TN+EFK+TYLGY   +   T FRY     LP ++DW
Sbjct: 67  VQYIEFYNSQNYSYKLIDNRFADITNEEFKSTYLGYLPRFRVQTEFRYHKHGELPKSIDW 126

Query: 162 RKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNG 221
           RK+GAVT +K+QG+CGSCWAFSAVAAVEGINKIKT  L+SLSEQ+L+DCD+ SGN+GC G
Sbjct: 127 RKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEG 186

Query: 222 GYMYKAFEFVKK-SGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAA 281
           G MY AF ++KK  GI T  EYPY+G +  CNK K + + VTI+GYE VP  +EK LKAA
Sbjct: 187 GDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESVPARNEKMLKAA 246

Query: 282 VANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTD 341
           VA+QPVS+A DAGGY FQFYS G++SG+CGK LNHG++IVGYGE +   YW+VKNSW  D
Sbjct: 247 VAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYGEENGDKYWIVKNSWAND 306

Query: 342 WGESGYIRMKRDSTDKRGTCGIAMMASYPIK 372
           WGESGY+RMKRD+ DK GTCGIAM A+YP+K
Sbjct: 307 WGESGYVRMKRDTKDKDGTCGIAMDATYPVK 337

BLAST of Lsi10G000950 vs. TrEMBL
Match: W9RAD3_9ROSA (KDEL-tailed cysteine endopeptidase CEP2 OS=Morus notabilis GN=L484_001450 PE=3 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 1.1e-112
Identity = 203/347 (58.50%), Positives = 252/347 (72.62%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPP----GSDSSDLQDRYQNWMSKYGREY 89
           ME   ++    L  LIL     PS V  + +Y P      +   ++ RY  W  +YGR Y
Sbjct: 1   MEIPIVLRGASLALLILLTLLPPSRV-YSTEYRPLWREEHNRQAVRQRYDRWAEQYGRNY 60

Query: 90  KSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDT 149
            S EE+E RF IY +N+ +I+  NS N SY L +N FAD+ N EF+   LGYR      T
Sbjct: 61  GSEEEKELRFQIYHMNLLFIEQVNSQNFSYKLTDNKFADMMNAEFRLRLLGYRPLLHNQT 120

Query: 150 CFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQ 209
            FR+G  + +P  VDWRK GAVTP+K+QGQCGSCWAFS+VAAVEG+N+IKTG+L+SLSEQ
Sbjct: 121 SFRFGGPMLVPKQVDWRKNGAVTPVKDQGQCGSCWAFSSVAAVEGVNQIKTGELVSLSEQ 180

Query: 210 ELVDCDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITG 269
           ELVDCDV +GNQGCNGGYM KAF+F+K++GITT  +YPYRG    C++ K+R   V I+G
Sbjct: 181 ELVDCDVNTGNQGCNGGYMEKAFQFIKRNGITTNGKYPYRGANGRCDEDKLRGRRVKISG 240

Query: 270 YEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEA 329
           YEKVP NDE+ L+A VA+QPVSVAIDAGG +FQFYS G+++G CG  LNHGV++VGYGE 
Sbjct: 241 YEKVPHNDEERLQATVAHQPVSVAIDAGGSEFQFYSHGIFNGRCGTDLNHGVTVVGYGEE 300

Query: 330 SSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
             K YWLVKNSWGT+WGESGY+R+ R S D RGTCGIAM ASYP+KD
Sbjct: 301 DGKTYWLVKNSWGTEWGESGYVRIHRGSVDGRGTCGIAMEASYPVKD 346

BLAST of Lsi10G000950 vs. TrEMBL
Match: B9SGM8_RICCO (Cysteine protease, putative OS=Ricinus communis GN=RCOM_0554360 PE=3 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 2.4e-112
Identity = 209/347 (60.23%), Positives = 250/347 (72.05%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMV-SMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSR 89
           MEA  MI N GLM + L   W PS+  S     P  S  + ++ RY  W+ +YGR+Y ++
Sbjct: 1   MEAPTMIKNAGLMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTK 60

Query: 90  EERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGY--RTDWLPDTC 149
           +E   RF IY  N+Q+I+  NS N S+ L +N FADLTNDEF + YLGY  R+    +  
Sbjct: 61  DEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSIYLGYQIRSYKRRNLS 120

Query: 150 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQE 209
             + N  +LP  VDWR+ GAVTPIK+QGQCGSCWAFSAVAAVEGINKIKTG L+SLSEQE
Sbjct: 121 HMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQE 180

Query: 210 LVDCDVVSGNQGCNGGYMYKAFEFVKK-SGITTETEYPYRGTESVCNKQKVRYHTVTITG 269
           LVDCDV   N+GCNGG+M KAF F+K   G+TTE +YPY+GT+  C K K   H V I G
Sbjct: 181 LVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTDGSCEKAKTDNHAVIIGG 240

Query: 270 YEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEA 329
           YE VP N+E SLK AV+ QPVSVAIDA GY+FQ YS GV+SG CG QLNHGV+IVGYG+ 
Sbjct: 241 YETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCGIQLNHGVTIVGYGDN 300

Query: 330 SSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
           + + YWLVKNSWG  WGESGYIRMKRDS+D +G CGIAM  SYPIKD
Sbjct: 301 NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAMEPSYPIKD 347

BLAST of Lsi10G000950 vs. TAIR10
Match: AT1G06260.1 (AT1G06260.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 344.7 bits (883), Expect = 6.9e-95
Identity = 180/342 (52.63%), Positives = 229/342 (66.96%), Query Frame = 1

Query: 38  NVGLMSLILWVFWTPSMVSMAMD-YPPGSDSSDLQDRYQNWMSKYGREYKSREERERRFT 97
           N+ L  LI +V     + S+    Y P      L+ R++ W+  + + Y  R+E   RF 
Sbjct: 9   NLTLAVLICFVLIASKLCSVDSSVYDP---HKTLKQRFEKWLKTHSKLYGGRDEWMLRFG 68

Query: 98  IYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLP------DTCFRYG 157
           IYQ NVQ ID  NSL+  + L +N FAD+TN EFK  +LG  T  L         C   G
Sbjct: 69  IYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAG 128

Query: 158 NMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDC 217
           N+   P  VDWR +GAVTPI+NQG+CG CWAFSAVAA+EGINKIKTG L+SLSEQ+L+DC
Sbjct: 129 NV---PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDC 188

Query: 218 DVVSGNQGCNGGYMYKAFEFVKKSG-ITTETEYPYRGTESVCNKQKVRYHTVTITGYEKV 277
           DV + N+GC+GG M  AFEF+K +G + TET+YPY G E  C+++K +   VTI GY+KV
Sbjct: 189 DVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKV 248

Query: 278 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKP 337
             N E SL+ A A QPVSV IDAGG+ FQ YS GV++  CG  LNHGV++VGYG    + 
Sbjct: 249 AQN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQK 308

Query: 338 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 372
           YW+VKNSWGT WGE GYIRM+R  ++  G CGIAMMASYP++
Sbjct: 309 YWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343

BLAST of Lsi10G000950 vs. TAIR10
Match: AT3G48340.1 (AT3G48340.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 340.9 bits (873), Expect = 1.0e-93
Identity = 177/313 (56.55%), Positives = 215/313 (68.69%), Query Frame = 1

Query: 70  LQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDE 129
           L   Y  W S +    +S  ERE+RF +++ NV ++ N N  N SY L  N FADLT +E
Sbjct: 34  LSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINE 93

Query: 130 FKTTYLG-----YRTDWLPDT-----CFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSC 189
           FK  Y G     +R    P        + + N+  LP++VDWRK+GAVT IKNQG+CGSC
Sbjct: 94  FKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSC 153

Query: 190 WAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVKKSG-ITT 249
           WAFS VAAVEGINKIKT KL+SLSEQELVDCD    N+GCNGG M  AFEF+KK+G ITT
Sbjct: 154 WAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIAFEFIKKNGGITT 213

Query: 250 ETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQ 309
           E  YPY G +  C+  K     VTI G+E VP NDE +L  AVANQPVSVAIDAG  DFQ
Sbjct: 214 EDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQ 273

Query: 310 FYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRG 369
           FYS GV++G+CG +LNHGV+ VGYG    K YW+V+NSWG +WGE GYI+++R+  +  G
Sbjct: 274 FYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEG 333

Query: 370 TCGIAMMASYPIK 372
            CGIAM ASYPIK
Sbjct: 334 RCGIAMEASYPIK 344

BLAST of Lsi10G000950 vs. TAIR10
Match: AT5G45890.1 (AT5G45890.1 senescence-associated gene 12)

HSP 1 Score: 338.2 bits (866), Expect = 6.5e-93
Identity = 167/316 (52.85%), Positives = 226/316 (71.52%), Query Frame = 1

Query: 70  LQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSL--NHSYTLAENNFADLTN 129
           +Q R+  WM+K+GR Y   +E   R+ +++ NV+ I++ NS+    ++ LA N FADLTN
Sbjct: 34  MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93

Query: 130 DEFKTTYLGYR----------TDWLPDTCFRYGNMVN--LPTNVDWRKEGAVTPIKNQGQ 189
           DEF++ Y G++          T   P   FRY N+ +  LP +VDWRK+GAVTPIKNQG 
Sbjct: 94  DEFRSMYTGFKGVSALSSQSQTKMSP---FRYQNVSSGALPVSVDWRKKGAVTPIKNQGS 153

Query: 190 CGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVKKSG 249
           CG CWAFSAVAA+EG  +IK GKL+SLSEQ+LVDCD  + + GC GG M  AFE +K +G
Sbjct: 154 CGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD--TNDFGCEGGLMDTAFEHIKATG 213

Query: 250 -ITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGG 309
            +TTE+ YPY+G ++ CN +K      +ITGYE VPVNDE++L  AVA+QPVSV I+ GG
Sbjct: 214 GLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGG 273

Query: 310 YDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASS-KPYWLVKNSWGTDWGESGYIRMKRDS 369
           +DFQFYS GV++G C   L+H V+ +GYGE+++   YW++KNSWGT WGESGY+R+++D 
Sbjct: 274 FDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 333

BLAST of Lsi10G000950 vs. TAIR10
Match: AT5G50260.1 (AT5G50260.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 337.4 bits (864), Expect = 1.1e-92
Identity = 171/315 (54.29%), Positives = 216/315 (68.57%), Query Frame = 1

Query: 68  SDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTN 127
           + L + Y+ W S +    +S EE+ +RF +++ NV++I   N  + SY L  N F D+T+
Sbjct: 32  NSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTS 91

Query: 128 DEFKTTYLG--------YRTDWLPDTCFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSC 187
           +EF+ TY G        ++ +      F Y N+  LPT+VDWRK GAVTP+KNQGQCGSC
Sbjct: 92  EEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 151

Query: 188 WAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVK-KSGITT 247
           WAFS V AVEGIN+I+T KL SLSEQELVDCD  + NQGCNGG M  AFEF+K K G+T+
Sbjct: 152 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCDT-NQNQGCNGGLMDLAFEFIKEKGGLTS 211

Query: 248 ETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQ 307
           E  YPY+ ++  C+  K     V+I G+E VP N E  L  AVANQPVSVAIDAGG DFQ
Sbjct: 212 ELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQ 271

Query: 308 FYSGGVYSGNCGKQLNHGVSIVGYGEA-SSKPYWLVKNSWGTDWGESGYIRMKRDSTDKR 367
           FYS GV++G CG +LNHGV++VGYG       YW+VKNSWG +WGE GYIRM+R    K 
Sbjct: 272 FYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE 331

Query: 368 GTCGIAMMASYPIKD 373
           G CGIAM ASYP+K+
Sbjct: 332 GLCGIAMEASYPLKN 344

BLAST of Lsi10G000950 vs. TAIR10
Match: AT4G35350.1 (AT4G35350.1 xylem cysteine peptidase 1)

HSP 1 Score: 333.6 bits (854), Expect = 1.6e-91
Identity = 165/313 (52.72%), Positives = 218/313 (69.65%), Query Frame = 1

Query: 65  SDSSDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFAD 124
           +++  L + +++WMS++ + YKS EE+  RF +++ N+ +ID  N+  +SY L  N FAD
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFAD 101

Query: 125 LTNDEFKTTYLG-----YRTDWLPDTCFRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSC 184
           LT++EFK  YLG     +     P   FRY ++ +LP +VDWRK+GAV P+K+QGQCGSC
Sbjct: 102 LTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSC 161

Query: 185 WAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEF-VKKSGITT 244
           WAFS VAAVEGIN+I TG L SLSEQEL+DCD  + N GCNGG M  AF++ +   G+  
Sbjct: 162 WAFSTVAAVEGINQITTGNLSSLSEQELIDCD-TTFNSGCNGGLMDYAFQYIISTGGLHK 221

Query: 245 ETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQ 304
           E +YPY   E +C +QK     VTI+GYE VP ND++SL  A+A+QPVSVAI+A G DFQ
Sbjct: 222 EDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 281

Query: 305 FYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTDWGESGYIRMKRDSTDKRG 364
           FY GGV++G CG  L+HGV+ VGYG +    Y +VKNSWG  WGE G+IRMKR++    G
Sbjct: 282 FYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 341

Query: 365 TCGIAMMASYPIK 372
            CGI  MASYP K
Sbjct: 342 LCGINKMASYPTK 353

BLAST of Lsi10G000950 vs. NCBI nr
Match: gi|659117224|ref|XP_008458487.1| (PREDICTED: ervatamin-B-like [Cucumis melo])

HSP 1 Score: 601.7 bits (1550), Expect = 8.9e-169
Identity = 289/346 (83.53%), Positives = 313/346 (90.46%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSS-DLQDRYQNWMSKYGREYKSR 89
           MEAY+MIW+V L+SLILWVFWTP+ VSMAMDYP GS +S +LQDRYQ WM KYGR+YKSR
Sbjct: 1   MEAYKMIWSVSLISLILWVFWTPTRVSMAMDYPSGSSNSGELQDRYQKWMDKYGRQYKSR 60

Query: 90  EERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDT--C 149
           EE E+RFTIYQ NVQYIDNFNSLNHSYTLAENNF DLTN+EF  TYLGY T  LPDT   
Sbjct: 61  EEWEQRFTIYQANVQYIDNFNSLNHSYTLAENNFTDLTNEEFMATYLGYETVSLPDTDTF 120

Query: 150 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQE 209
           FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIK GKLMSLSEQE
Sbjct: 121 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLMSLSEQE 180

Query: 210 LVDCDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGY 269
           LVDCDV SGNQGCNGGYMYKAFEF+KK+G+TTE EYPY  T S C+KQK +Y +V+I+GY
Sbjct: 181 LVDCDVTSGNQGCNGGYMYKAFEFIKKTGLTTELEYPYTATRSACDKQKEKYQSVSISGY 240

Query: 270 EKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEAS 329
           EKVPVNDEKSL+AAVA QPVSVAIDAGG DFQFYSGG++SGNCGKQLNHGV+IVGYGE S
Sbjct: 241 EKVPVNDEKSLQAAVAKQPVSVAIDAGGNDFQFYSGGIFSGNCGKQLNHGVAIVGYGEDS 300

Query: 330 SKPYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
           ++ YWLVKNSWGT WGESGYIRM RDSTDK+GTCGIAMMASYPIKD
Sbjct: 301 NQAYWLVKNSWGTSWGESGYIRMMRDSTDKQGTCGIAMMASYPIKD 346

BLAST of Lsi10G000950 vs. NCBI nr
Match: gi|700206934|gb|KGN62053.1| (hypothetical protein Csa_2G292830 [Cucumis sativus])

HSP 1 Score: 593.6 bits (1529), Expect = 2.4e-166
Identity = 282/340 (82.94%), Positives = 310/340 (91.18%), Query Frame = 1

Query: 35  MIW-NVGLMSLILWVFWTPSMVSMAMDYPPGSD-SSDLQDRYQNWMSKYGREYKSREERE 94
           M W NV L+ LILWVFWTP +VSMAMDY  GS  SSD+QDRYQ WM KYGR+YKSREE E
Sbjct: 1   MTWINVSLIFLILWVFWTPRLVSMAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWE 60

Query: 95  RRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRYGNM 154
           RRFTIYQ NVQYIDNFNS+NHS+TLAENNFADLTN+EFK TYLGY+T  +PDTCFRYGNM
Sbjct: 61  RRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNM 120

Query: 155 VNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDV 214
           VNLPTNVDWR+EGAVTPIKNQGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV
Sbjct: 121 VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 180

Query: 215 VSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVN 274
            SGNQGCNGGYMYKAFEF+K++G+TTE EYPY+G ES CN+QK +Y  V+I+GYEKVPVN
Sbjct: 181 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVN 240

Query: 275 DEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWL 334
           DEKSLKAAVANQPVSVAIDA G +FQFYSGG++SGNCG QLNHGV+IVGYGE S++ YWL
Sbjct: 241 DEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWL 300

Query: 335 VKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 373
           VKNSWGTDWGESGYIRMKRDSTD++GTCGIAMMASYP KD
Sbjct: 301 VKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMASYPTKD 340

BLAST of Lsi10G000950 vs. NCBI nr
Match: gi|449460678|ref|XP_004148072.1| (PREDICTED: ervatamin-B-like [Cucumis sativus])

HSP 1 Score: 563.9 bits (1452), Expect = 2.1e-157
Identity = 266/317 (83.91%), Positives = 292/317 (92.11%), Query Frame = 1

Query: 57  MAMDYPPGSD-SSDLQDRYQNWMSKYGREYKSREERERRFTIYQLNVQYIDNFNSLNHSY 116
           MAMDY  GS  SSD+QDRYQ WM KYGR+YKSREE ERRFTIYQ NVQYIDNFNS+NHS+
Sbjct: 1   MAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSH 60

Query: 117 TLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRYGNMVNLPTNVDWRKEGAVTPIKNQGQ 176
           TLAENNFADLTN+EFK TYLGY+T  +PDTCFRYGNMVNLPTNVDWR+EGAVTPIKNQGQ
Sbjct: 61  TLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQ 120

Query: 177 CGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVVSGNQGCNGGYMYKAFEFVKKSG 236
           CGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV SGNQGCNGGYMYKAFEF+K++G
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRTG 180

Query: 237 ITTETEYPYRGTESVCNKQKVRYHTVTITGYEKVPVNDEKSLKAAVANQPVSVAIDAGGY 296
           +TTE EYPY+G ES CN+QK +Y  V+I+GYEKVPVNDEKSLKAAVANQPVSVAIDA G 
Sbjct: 181 LTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGN 240

Query: 297 DFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKPYWLVKNSWGTDWGESGYIRMKRDSTD 356
           +FQFYSGG++SGNCG QLNHGV+IVGYGE S++ YWLVKNSWGTDWGESGYIRMKRDSTD
Sbjct: 241 NFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTD 300

Query: 357 KRGTCGIAMMASYPIKD 373
           ++GTCGIAMMASYP KD
Sbjct: 301 RQGTCGIAMMASYPTKD 317
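
The percentages on an HSP summary line follow directly from the residue counts printed next to them: matching residues divided by alignment length. The sketch below is a minimal illustration of that arithmetic, not part of the database output. The function name `parse_hsp_stats` and the regular expression are assumptions made for this example, and the pattern tolerates spelling variants of the positives label.

```python
import re

# Hypothetical helper for illustration only: it assumes the summary line has
# the shape "Identity = a/b (p%), Positives = c/d (q%)" as shown on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ves = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_hits, id_len, id_reported, pos_hits, pos_len, pos_reported = match.groups()
    return {
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(id_hits) / int(id_len), 2),
        "positives_pct": round(100 * int(pos_hits) / int(pos_len), 2),
        # Percentages as printed on the line, kept for comparison.
        "identity_reported": float(id_reported),
        "positives_reported": float(pos_reported),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 266/317 (83.91%), Positives = 292/317 (92.11%), Query Frame = 1"
    )
    print(stats)
```

For the Cucumis sativus hit above, 266/317 recomputes to 83.91 and 292/317 to 92.11, matching the reported values.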

BLAST of Lsi10G000950 vs. NCBI nr
Match: gi|645245974|ref|XP_008229136.1| (PREDICTED: zingipain-2 [Prunus mume])

HSP 1 Score: 445.7 bits (1145), Expect = 8.1e-122
Identity = 205/342 (59.94%), Positives = 259/342 (75.73%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSRE 89
           ME   ++    L   ++W+F   S        P  +D   +++RY+ W+ KYGR YK+RE
Sbjct: 1   METSMVLTRASLTFFMVWIFCISSTACSETYKPLRTDPKAMKERYERWLQKYGRIYKNRE 60

Query: 90  ERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRY 149
           E E RF +Y+ N++++D  NS N SY L +N FAD+TN EF  T++G++T   P T F Y
Sbjct: 61  EAEYRFGVYKSNIEFVDFVNSQNQSYKLTDNKFADITNLEFTNTFMGFQTRSHPKTKFSY 120

Query: 150 GNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 209
               +LPT VDWRK GAVTPIKNQGQCGSCWAFSAVAAVEGIN+IKTGKL+SLSEQELVD
Sbjct: 121 DKDEDLPTAVDWRKNGAVTPIKNQGQCGSCWAFSAVAAVEGINQIKTGKLVSLSEQELVD 180

Query: 210 CDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKV 269
           CDV +GN+GCNGGYM KAF F+K +G++TE +YPY+G++ +C++  ++ H V I+GYE +
Sbjct: 181 CDVKTGNEGCNGGYMEKAFSFIKDNGLSTEKDYPYKGSDGICDEDSLKNHAVNISGYESI 240

Query: 270 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKP 329
           P N EKSL+AAVA+QPVSVA+DA  Y FQFYS G+++G CGK LNHGV+ VGYGE S K 
Sbjct: 241 PANSEKSLQAAVAHQPVSVAVDAASYAFQFYSSGIFTGQCGKNLNHGVTAVGYGEDSGKK 300

Query: 330 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 372
           YW+VKNSWG DWGESGYIRM RDS DK+GTCGIAM ASYP+K
Sbjct: 301 YWIVKNSWGPDWGESGYIRMTRDSVDKKGTCGIAMQASYPVK 342

BLAST of Lsi10G000950 vs. NCBI nr
Match: gi|595908837|ref|XP_007214244.1| (hypothetical protein PRUPE_ppa023515mg [Prunus persica])

HSP 1 Score: 440.7 bits (1132), Expect = 2.6e-120
Identity = 205/342 (59.94%), Positives = 257/342 (75.15%), Query Frame = 1

Query: 30  MEAYRMIWNVGLMSLILWVFWTPSMVSMAMDYPPGSDSSDLQDRYQNWMSKYGREYKSRE 89
           ME   ++    L  L++W+F   S        P  +D   +++RY+ W+ KYGR YK+RE
Sbjct: 1   METSMVLTRASLTFLMVWIFCISSTACSETYKPLRTDPKAMKERYERWLQKYGRIYKNRE 60

Query: 90  ERERRFTIYQLNVQYIDNFNSLNHSYTLAENNFADLTNDEFKTTYLGYRTDWLPDTCFRY 149
           E   RF +Y+ N++++D  NS N SY L +N FAD+TN EF  T++G++T   P T F Y
Sbjct: 61  EAAYRFGVYKSNIEFVDFVNSQNLSYKLTDNKFADITNLEFTKTFMGFQTRSHPKTKFSY 120

Query: 150 GNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 209
                LPT VDWRK GAVTPIKNQGQCGSCWAFSAVAAVEGIN+IKTGKL+SLSEQELVD
Sbjct: 121 DKDEELPTAVDWRKHGAVTPIKNQGQCGSCWAFSAVAAVEGINQIKTGKLVSLSEQELVD 180

Query: 210 CDVVSGNQGCNGGYMYKAFEFVKKSGITTETEYPYRGTESVCNKQKVRYHTVTITGYEKV 269
           CDV +GN+GCNGGYM KAF F+K +G++TE +YPY+G++ +C++  ++   V I+GYE +
Sbjct: 181 CDVKTGNEGCNGGYMEKAFSFIKDNGLSTEKDYPYKGSDGICDEDSLKNSAVNISGYESI 240

Query: 270 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVYSGNCGKQLNHGVSIVGYGEASSKP 329
           P N EKSL+AAVA+QPVSVA+DA GY FQFYS G ++G CGK LNHGV+ VGYGE S K 
Sbjct: 241 PANSEKSLQAAVAHQPVSVAVDAAGYAFQFYSSGTFTGQCGKNLNHGVTAVGYGEDSGKK 300

Query: 330 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 372
           YW+VKNSWG DWGESGYIRM RDS DK+GTCGIAM ASYP+K
Sbjct: 301 YWIVKNSWGPDWGESGYIRMTRDSVDKKGTCGIAMQASYPVK 342

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
CYSEP_VIGMU  1.1e-94  52.89         Vignain OS=Vigna mungo PE=1 SV=1
CYSEP_PHAVU  3.6e-93  50.99         Vignain OS=Phaseolus vulgaris PE=2 SV=2
CEP2_ARATH   1.8e-92  56.55         KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana GN=CEP2 PE=1 SV=...
SAG39_ORYSJ  1.8e-92  54.29         Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. japonica GN=S...
SAG39_ORYSI  2.3e-92  54.60         Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica GN=OsI...

Match Name        E-value   Identity (%)  Description
A0A0A0LJV6_CUCSA  1.7e-166  82.94         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G292830 PE=3 SV=1
M5X1M2_PRUPE      1.8e-120  59.94         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023515mg PE=3 SV=1
G7ZUL7_MEDTR      2.8e-113  58.91         Cysteine proteinase OS=Medicago truncatula GN=MTR_2g090890 PE=3 SV=1
W9RAD3_9ROSA      1.1e-112  58.50         KDEL-tailed cysteine endopeptidase CEP2 OS=Morus notabilis GN=L484_001450 PE=3 S...
B9SGM8_RICCO      2.4e-112  60.23         Cysteine protease, putative OS=Ricinus communis GN=RCOM_0554360 PE=3 SV=1

Match Name   E-value  Identity (%)  Description
AT1G06260.1  6.9e-95  52.63         Cysteine proteinases superfamily protein
AT3G48340.1  1.0e-93  56.55         Cysteine proteinases superfamily protein
AT5G45890.1  6.5e-93  52.85         senescence-associated gene 12
AT5G50260.1  1.1e-92  54.29         Cysteine proteinases superfamily protein
AT4G35350.1  1.6e-91  52.72         xylem cysteine peptidase 1

Match Name                        E-value   Identity (%)  Description
gi|659117224|ref|XP_008458487.1|  8.9e-169  83.53         PREDICTED: ervatamin-B-like [Cucumis melo]
gi|700206934|gb|KGN62053.1|       2.4e-166  82.94         hypothetical protein Csa_2G292830 [Cucumis sativus]
gi|449460678|ref|XP_004148072.1|  2.1e-157  83.91         PREDICTED: ervatamin-B-like [Cucumis sativus]
gi|645245974|ref|XP_008229136.1|  8.1e-122  59.94         PREDICTED: zingipain-2 [Prunus mume]
gi|595908837|ref|XP_007214244.1|  2.6e-120  59.94         hypothetical protein PRUPE_ppa023515mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008234  cysteine-type peptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: INTERPRO
Term        Definition
IPR025661   Pept_asp_AS
IPR013201   Prot_inhib_I29
IPR013128   Peptidase_C1A
IPR000668   Peptidase_C1A_C
IPR000169   Pept_cys_AS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0016491 oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi10G000950.1  Lsi10G000950.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                              Source   Source Term       Source Description                      Alignment
IPR000169  Cysteine peptidase, cysteine active site     PROSITE  PS00139           THIOL_PROTEASE_CYS                      coord: 173..184, scor
IPR000668  Peptidase C1A, papain C-terminal             PRINTS   PR00705           PAPAIN                                  coord: 173..188, score: 2.3E-10; coord: 330..336, score: 2.3E-10; coord: 315..325, score: 2.3
IPR000668  Peptidase C1A, papain C-terminal             PFAM     PF00112           Peptidase_C1                            coord: 155..370, score: 3.2
IPR000668  Peptidase C1A, papain C-terminal             SMART    SM00645           pept_c1                                 coord: 155..370, score: 1.3E
IPR013128  Peptidase C1A                                PANTHER  PTHR12411         CYSTEINE PROTEASE FAMILY C1-RELATED     coord: 1..372, score: 3.2E
IPR013201  Cathepsin propeptide inhibitor domain (I29)  PFAM     PF08246           Inhibitor_I29                           coord: 74..130, score: 7.0
IPR013201  Cathepsin propeptide inhibitor domain (I29)  SMART    SM00848           Inhibitor_I29_2                         coord: 74..130, score: 1.4
IPR025661  Cysteine peptidase, asparagine active site   PROSITE  PS00640           THIOL_PROTEASE_ASN                      coord: 330..349, scor
None       No IPR available                             GENE3D   G3DSA:3.90.70.10                                          coord: 51..370, score: 3.0E
None       No IPR available                             PANTHER  PTHR12411:SF333   CYSTEINE PROTEASE-LIKE PROTEIN-RELATED  coord: 1..372, score: 3.2E
None       No IPR available                             unknown  SSF54001          Cysteine proteinases                    coord: 70..371, score: 1.83E

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                   Block
Lsi10G000950  CmaCh20G005860   Cucurbita maxima (Rimu)    cmalsiB501
Lsi10G000950  CmoCh20G006490   Cucurbita moschata (Rifu)  cmolsiB493
Lsi10G000950  Cp4.1LG16g04150  Cucurbita pepo (Zucchini)  cpelsiB221
Lsi10G000950  Carg03113        Silver-seed gourd          carlsiB304

The following gene(s) are paralogous to this gene:

None