Lsi09G010720 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi09G010720
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Cysteine protease ATG4B
Location: chr09 : 16640654 .. 16642756 (+)
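As a rough illustration of how the location field can be read programmatically, the sketch below parses the string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which is common for genome annotations but is not stated on this page; `parse_location` and `span_length` are hypothetical helper names.

```python
import re


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr09 : 16640654 .. 16642756 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2103 positions on the forward strand of chr09.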
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGTTGAGGTTATCGAGAACTTGGTGATTCTTTATGTGGAGGACCAGGAGGTTGATGGAGTTGTGGTGAGGGAGTGGTGAGAGGAAGAGGTGTACTTCAAGGCTTGCGATAATGTCAATGGGTTATACTACCATTGAACAAGTGCTCCTCTGTTTGCCTTTTCAGGGGAAGGGTAATTTTTTGTGGCAAGCTAGTGTCTGCTATTTTATGGTGAGTTTAGTTCGAGAGGAACAATAGGGTTTTTAAAGAGGCGGTGTGGTCTTGTGATGAGGTTTGGGAAATGGTGAGGGTTCAATGCATCCCTGTGGGCTTTTGGTCGCTAACCTTAATTTGATTTTATTGGATTGGAGTCCTTTTTTTGTAGTTTCTTGGGGTCTCCTTGTTTTGTTGGGATTTTTTTCCCCTCTTTTTGTATGCCTTTTACATTATTTCATTTCTATCATAATGGAGAGATCATGAGCCAGCGATATTTTGGTTTCTAGTGCTCCAGCTAGATCTTTGAATCTGGAACAAGCATAGGTACATTTTTGACACTTGAATTAAAGTCTGAAATGTTTTACAGTACCCCTTAGAACTTACAGGCGCTGTGATCCATTGTAGGCATGTGGACTCAGCTAATAAAATGACAAAATGTACAGAATACTATGATGTAGTACCTTTTTTTCTTTTAAAAGAAAAAAAAGCTTAATTATCCAGAACAAATGTCTCCCTGAGCTGTAGGATAATTTTTACATTTTTGTAAATAAATCTTTAGTTGTAATGATAAAATATTCAATCCCTCAAATATATTTTGTTTGGTTTTGGATGTACTCTATAACATTTGTTACAAACTATGGCATCCTGAAAAAAGAAAAGTTTTTTACATCATGATTGCGGTTGCTAACTTTGTTTCGTTATTTGAGAATTGCTATGCGTAAGAAAATTCACATGTTAACTTTTGTTTTATCGCATTGTGTTTTGGTTTTGATTCTTCAAACAAGATTGCTTCTGTTTTCTACGCTGAATGTTTATTGTCGCGGTTTCCTTGTTCATGTCAATTTACAGAATGGGATAGGTAATGACTGTCTAACAATGTTTTCATTTTAGGTAGTCAATATTGATAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAAGTGAGTTTTTCCATATCATAAATGTCTTGTGGTTTACTGTTTTTCATTTTTCGTCTTTCATTTTTGTTTGGGTTCTCATATATATATCTCTGTACATTGTTTTCTGTAGTGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGGTTTTATTGTCGAGACAAAGGTTTGCACACCCCTCTAGCTTCATAATTGTGGATGTGAGCTTGATCGATTCCTTTCATGTTGTAAAACCTAATCTTGATTTCCTGTTTTGCTCCCTCGTTTTTTCTTTTGTGTTCTTTTTTCCTGGTGCCAGATGATTTCGACAATTTCTGTTATCGGGCATCGAAGTTGGCAGACGAGTCGGATGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCACGAATTCAGGGAGACACGGCAGTGCATTGAATGACTGTAGTAGATTAGTGGAGGATACCGATGGCGTGGTGCACATGCCGAACGAAGAGGAGGCGCATGAGGACGACTGGCAATTTCTTTGAAGAAAATAGAACCGGTTTTGGAATCTGGGGGTGGGTTGGTATTCTTCCTTAATTACTTCTCTAGGCTTCCAAGTTCCCTTGAGCTTTCTGGCTAATATGATTGAAATGGATGGGGAGTGATGCCCAATATGTCAAATGTTTTAGTTTATTTAGTGTTAATTGTGCAATAGGCGTTGAGTTTATTTGTTGAATTTGTTTATTCTTGGTCCACTTTCACGTGAAATTAGCCTACATTCGTGTGTATAATAAGCAGGTTCAATTCAAATTCATAAATGAATTATAAATGCTCTGATTCTATGAAAAATTTAGCTTACCATTCTTAACCGTTTACGG
CCACCATGGCCACCGCTGCACAACCTCCTCTACCGATCTTCTCCCCTCCATCAATTCTTATTGATCAAAGTAACATTTCAATTGCAATTGACGAATATCCCAG

mRNA sequence

ATGGTTGAGGTTATCGAGAACTTGGTGATTCTTTATGTGGAGGACCAGGAGGTTGATGGAGTTGTGGTAGTCAATATTGATAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAATGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGGTTTTATTGTCGAGACAAAGATGATTTCGACAATTTCTGTTATCGGGCATCGAAGTTGGCAGACGAGTCGGATGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCACGAATTCAGGGAGACACGGCAGTGCATTGAATGACTGTAGTAGATTAGTGGAGGATACCGATGGCGTGGTGCACATGCCGAACGAAGAGGAGGCGCATGAGGACGACTGGCAATTTCTTTGAAGAAAATAGAACCGGTTTTGGAATCTGGGGGTGGGTTGGTATTCTTCCTTAATTACTTCTCTAGGCTTCCAAGTTCCCTTGAGCTTTCTGGCTAATATGATTGAAATGGATGGGGAGTGATGCCCAATATGTCAAATGTTTTAGTTTATTTAGTGTTAATTGTGCAATAGGCGTTGAGTTTATTTGTTGAATTTGTTTATTCTTGGTCCACTTTCACGTGAAATTAGCCTACATTCGTGTGTATAATAAGCAGGTTCAATTCAAATTCATAAATGAATTATAAATGCTCTGATTCTATGAAAAATTTAGCTTACCATTCTTAACCGTTTACGGCCACCATGGCCACCGCTGCACAACCTCCTCTACCGATCTTCTCCCCTCCATCAATTCTTATTGATCAAAGTAACATTTCAATTGCAATTGACGAATATCCCAG

Coding sequence (CDS)

ATGGTTGAGGTTATCGAGAACTTGGTGATTCTTTATGTGGAGGACCAGGAGGTTGATGGAGTTGTGGTAGTCAATATTGATAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAATGTCATCCGGCACATCCCCTTAGAATCCATAGATCCTTCTCTAGCAATTGGGTTTTATTGTCGAGACAAAGATGATTTCGACAATTTCTGTTATCGGGCATCGAAGTTGGCAGACGAGTCGGATGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCACGAATTCAGGGAGACACGGCAGTGCATTGAATGACTGTAGTAGATTAGTGGAGGATACCGATGGCGTGGTGCACATGCCGAACGAAGAGGAGGCGCATGAGGACGACTGGCAATTTCTTTGA

Protein sequence

MVEVIENLVILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCYRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDDWQFL
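
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch, assuming the standard nuclear genetic code; `CODON_TABLE` and `translate` are illustrative names, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGTTGAGGTTATCGAGAACTTGGTGATT"))  # MVEVIENLVI
# Last six codons of the CDS, ending in the TGA stop codon.
print(translate("GACTGGCAATTTCTTTGA"))  # DWQFL
```

Both fragments agree with the start (MVEVIENLVI) and end (DWQFL) of the protein sequence shown on this page.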
BLAST of Lsi09G010720 vs. Swiss-Prot
Match: ATG4_MEDTR (Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1)

HSP 1 Score: 129.8 bits (325), Expect = 2.2e-29
Identity = 71/131 (54.20%), Positives = 91/131 (69.47%), Query Frame = 1

Query: 5   IENLVILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDK 64
           ++N    Y++  EV  VV  NI  D  E +TSSYHCN+ RH+PL+SIDPSLAIGFYCRDK
Sbjct: 360 VQNDKAFYLDPHEVKPVV--NITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDK 419

Query: 65  DDFDNFCYRASKLADESDGAPLFTVAETHS-TNSGRHGSALNDCSRLVEDTDGVVHMPNE 124
           DDFD+FC RA+KLA+ES+GAPLFTVA++ S        S   D +R  ED    +++ N 
Sbjct: 420 DDFDDFCSRATKLAEESNGAPLFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVN- 479

Query: 125 EEAHEDDWQFL 135
           +  +EDDWQFL
Sbjct: 480 DAGNEDDWQFL 487
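
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below reproduces the figures for the first Swiss-Prot hit; `percent` is a hypothetical helper, and the numbers are taken from the header above.

```python
def percent(count, aligned_length):
    """Return count/aligned_length as a percentage string with two decimals."""
    return f"{100.0 * count / aligned_length:.2f}"


# ATG4_MEDTR: 71 identical and 91 similar positions over 131 aligned columns.
print(percent(71, 131))  # 54.20
print(percent(91, 131))  # 69.47
```

The same calculation applies to the other hits on this page, for example 107/124 for the Cucumis sativus TrEMBL match.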

BLAST of Lsi09G010720 vs. Swiss-Prot
Match: ATG4A_ARATH (Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 5.0e-29
Identity = 67/124 (54.03%), Positives = 84/124 (67.74%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV  VV VN  K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFD+FC
Sbjct: 362 YLDPHEVQQVVTVN--KETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFC 421

Query: 72  YRASKLADESDGAPLFTVAETHST-NSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDD 131
            RA KLA+ES+GAPLFTV +TH+  N   +G A +D                 E+  EDD
Sbjct: 422 LRALKLAEESNGAPLFTVTQTHTAINQSNYGFADDD----------------SEDEREDD 467

Query: 132 WQFL 135
           WQ L
Sbjct: 482 WQML 467

BLAST of Lsi09G010720 vs. Swiss-Prot
Match: ATG4B_ARATH (Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 1.0e-26
Identity = 60/123 (48.78%), Positives = 79/123 (64.23%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  +V  VV V   K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFD+FC
Sbjct: 366 YLDPHDVQQVVTVK--KENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFC 425

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDDW 131
            RA+KLA +S+GAPLFTV ++H           NDC      +          E HEDDW
Sbjct: 426 IRATKLAGDSNGAPLFTVTQSHRR---------NDCGIAETSSSTETSTEISGEEHEDDW 477

Query: 132 QFL 135
           Q L
Sbjct: 486 QLL 477

BLAST of Lsi09G010720 vs. Swiss-Prot
Match: ATG4A_ORYSI (Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 8.2e-24
Identity = 58/125 (46.40%), Positives = 82/125 (65.60%), Query Frame = 1

Query: 10  ILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDN 69
           +LY++  EV   + V+I  D+LEADTSSYHC+ +R + L+ IDPSLAIGFYCRDKDDFD+
Sbjct: 353 VLYLDPHEVQ--LAVDIAADNLEADTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDD 412

Query: 70  FCYRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHED 129
           FC RAS+L D+++GAPLFTV ++   +   +    +    +  D   V  +    E  E+
Sbjct: 413 FCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESSSGDGM--DIINVEGLDGSGETGEE 472

Query: 130 DWQFL 135
           +WQ L
Sbjct: 473 EWQIL 473

BLAST of Lsi09G010720 vs. Swiss-Prot
Match: ATG4A_ORYSJ (Cysteine protease ATG4A OS=Oryza sativa subsp. japonica GN=ATG4A PE=2 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 4.1e-23
Identity = 57/124 (45.97%), Positives = 81/124 (65.32%), Query Frame = 1

Query: 11  LYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNF 70
           LY++  EV   + V+I  D+LEA TSSYHC+ +R + L+ IDPSLAIGFYCRDKDDFD+F
Sbjct: 355 LYLDPHEVQ--LAVDIAADNLEAGTSSYHCSTVRDLALDLIDPSLAIGFYCRDKDDFDDF 414

Query: 71  CYRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDD 130
           C RAS+L D+++GAPLFTV ++   +   +    +    +  D+  V  +    E  E++
Sbjct: 415 CSRASELVDKANGAPLFTVVQSVQPSKQMYNEESSSGDGM--DSINVEGLDGSGETGEEE 474

Query: 131 WQFL 135
           WQ L
Sbjct: 475 WQIL 474

BLAST of Lsi09G010720 vs. TrEMBL
Match: A0A0A0LK81_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G128640 PE=3 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 1.3e-52
Identity = 107/124 (86.29%), Positives = 112/124 (90.32%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV    VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC
Sbjct: 362 YLDPHEVQQ--VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 421

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDT-DGVVHMPNEEEAHEDD 131
           +RASKLA+ESDGAPLFTVAETHSTN GR  SALND SRLVED  DGVVHMPNEEE+HEDD
Sbjct: 422 HRASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDD 481

Query: 132 WQFL 135
           WQFL
Sbjct: 482 WQFL 483

BLAST of Lsi09G010720 vs. TrEMBL
Match: A0A061DVN9_THECC (Peptidase family C54 protein isoform 2 OS=Theobroma cacao GN=TCM_003165 PE=3 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.4e-33
Identity = 76/126 (60.32%), Positives = 96/126 (76.19%), Query Frame = 1

Query: 10  ILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDN 69
           + Y++  +V   +VVN+ +D+ EADTSSYHC++IRHIPL+SIDPSLAIGF+CRDKDDFD+
Sbjct: 273 VFYLDPHDVQ--LVVNLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKDDFDD 332

Query: 70  FCYRASKLADESDGAPLFTVAETHST-NSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHE 129
           FC RASKLADES+GAPLFTVA+THS+     HG+AL+D   + ED    V    +   HE
Sbjct: 333 FCLRASKLADESNGAPLFTVAQTHSSFKPISHGNALDDTGEVREDDSLGVVPDMDGSIHE 392

Query: 130 DDWQFL 135
           DDWQ L
Sbjct: 393 DDWQLL 396

BLAST of Lsi09G010720 vs. TrEMBL
Match: A0A061DMN0_THECC (Peptidase family C54 protein isoform 3 OS=Theobroma cacao GN=TCM_003165 PE=3 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 1.4e-33
Identity = 76/126 (60.32%), Positives = 96/126 (76.19%), Query Frame = 1

Query: 10  ILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDN 69
           + Y++  +V   +VVN+ +D+ EADTSSYHC++IRHIPL+SIDPSLAIGF+CRDKDDFD+
Sbjct: 363 VFYLDPHDVQ--LVVNLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKDDFDD 422

Query: 70  FCYRASKLADESDGAPLFTVAETHST-NSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHE 129
           FC RASKLADES+GAPLFTVA+THS+     HG+AL+D   + ED    V    +   HE
Sbjct: 423 FCLRASKLADESNGAPLFTVAQTHSSFKPISHGNALDDTGEVREDDSLGVVPDMDGSIHE 482

Query: 130 DDWQFL 135
           DDWQ L
Sbjct: 483 DDWQLL 486

BLAST of Lsi09G010720 vs. TrEMBL
Match: M5XJB4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004885mg PE=3 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 4.4e-32
Identity = 76/127 (59.84%), Positives = 93/127 (73.23%), Query Frame = 1

Query: 11  LYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNF 70
           LY++  EV     +NI +DDLEADT SYHCNVIRHIPL+SIDPSLAIGFYCRD+DDFD+F
Sbjct: 363 LYLDPHEVQPA--INIRRDDLEADTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDF 422

Query: 71  CYRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEE---EAH 130
           C+RASKLAD S+GAPLFTV ++H+     + S + D S  V++ D  V  P  +    AH
Sbjct: 423 CFRASKLADGSNGAPLFTVTQSHNFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAH 482

Query: 131 EDDWQFL 135
           EDDWQ L
Sbjct: 483 EDDWQLL 487

BLAST of Lsi09G010720 vs. TrEMBL
Match: A0A0D2SXT5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G001300 PE=3 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 7.5e-32
Identity = 76/136 (55.88%), Positives = 97/136 (71.32%), Query Frame = 1

Query: 10  ILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDN 69
           + Y++  +V  VV  N+  ++LEADTSSYHCN+IR+IPLES+DPSLAIGF+CRDKDDFD+
Sbjct: 363 VFYLDPHDVQPVV--NLSTENLEADTSSYHCNIIRYIPLESLDPSLAIGFFCRDKDDFDD 422

Query: 70  FCYRASKLADESDGAPLFTVAETHS---------TNSGRHGSALNDCSRLVE--DTDGVV 129
           FC+RASKLADES+GAPLFTVA+THS         T +   G  ++D  R++   D DG  
Sbjct: 423 FCFRASKLADESNGAPLFTVAQTHSVFKPINHGDTMANAGGDRMDDSVRVLPTGDVDG-- 482

Query: 130 HMPNEEEAHEDDWQFL 135
                  +HEDDWQFL
Sbjct: 483 ------NSHEDDWQFL 488

BLAST of Lsi09G010720 vs. TAIR10
Match: AT2G44140.1 (AT2G44140.1 Peptidase family C54 protein)

HSP 1 Score: 128.6 bits (322), Expect = 2.8e-30
Identity = 67/124 (54.03%), Positives = 84/124 (67.74%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV  VV VN  K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFD+FC
Sbjct: 362 YLDPHEVQQVVTVN--KETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFC 421

Query: 72  YRASKLADESDGAPLFTVAETHST-NSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDD 131
            RA KLA+ES+GAPLFTV +TH+  N   +G A +D                 E+  EDD
Sbjct: 422 LRALKLAEESNGAPLFTVTQTHTAINQSNYGFADDD----------------SEDEREDD 467

Query: 132 WQFL 135
           WQ L
Sbjct: 482 WQML 467

BLAST of Lsi09G010720 vs. TAIR10
Match: AT3G59950.1 (AT3G59950.1 Peptidase family C54 protein)

HSP 1 Score: 120.9 bits (302), Expect = 5.8e-28
Identity = 60/123 (48.78%), Positives = 79/123 (64.23%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  +V  VV V   K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFD+FC
Sbjct: 366 YLDPHDVQQVVTVK--KENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFC 425

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHEDDW 131
            RA+KLA +S+GAPLFTV ++H           NDC      +          E HEDDW
Sbjct: 426 IRATKLAGDSNGAPLFTVTQSHRR---------NDCGIAETSSSTETSTEISGEEHEDDW 477

Query: 132 QFL 135
           Q L
Sbjct: 486 QLL 477

BLAST of Lsi09G010720 vs. NCBI nr
Match: gi|659082126|ref|XP_008441684.1| (PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis melo])

HSP 1 Score: 217.6 bits (553), Expect = 1.3e-53
Identity = 109/124 (87.90%), Positives = 112/124 (90.32%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV    VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC
Sbjct: 362 YLDPHEVQQ--VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 421

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVE-DTDGVVHMPNEEEAHEDD 131
           YRASKLA+ESDGAPLFTVAETHSTNSGR  SALND SRLVE D DG VHMPNEEEAHEDD
Sbjct: 422 YRASKLAEESDGAPLFTVAETHSTNSGRQSSALNDHSRLVEDDADGAVHMPNEEEAHEDD 481

Query: 132 WQFL 135
           WQFL
Sbjct: 482 WQFL 483

BLAST of Lsi09G010720 vs. NCBI nr
Match: gi|659082128|ref|XP_008441686.1| (PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis melo])

HSP 1 Score: 217.6 bits (553), Expect = 1.3e-53
Identity = 109/124 (87.90%), Positives = 112/124 (90.32%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV    VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC
Sbjct: 334 YLDPHEVQQ--VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 393

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVE-DTDGVVHMPNEEEAHEDD 131
           YRASKLA+ESDGAPLFTVAETHSTNSGR  SALND SRLVE D DG VHMPNEEEAHEDD
Sbjct: 394 YRASKLAEESDGAPLFTVAETHSTNSGRQSSALNDHSRLVEDDADGAVHMPNEEEAHEDD 453

Query: 132 WQFL 135
           WQFL
Sbjct: 454 WQFL 455

BLAST of Lsi09G010720 vs. NCBI nr
Match: gi|449442361|ref|XP_004138950.1| (PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis sativus])

HSP 1 Score: 213.8 bits (543), Expect = 1.9e-52
Identity = 107/124 (86.29%), Positives = 112/124 (90.32%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV    VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC
Sbjct: 362 YLDPHEVQQ--VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 421

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDT-DGVVHMPNEEEAHEDD 131
           +RASKLA+ESDGAPLFTVAETHSTN GR  SALND SRLVED  DGVVHMPNEEE+HEDD
Sbjct: 422 HRASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDD 481

Query: 132 WQFL 135
           WQFL
Sbjct: 482 WQFL 483

BLAST of Lsi09G010720 vs. NCBI nr
Match: gi|778668355|ref|XP_011649086.1| (PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis sativus])

HSP 1 Score: 213.8 bits (543), Expect = 1.9e-52
Identity = 107/124 (86.29%), Positives = 112/124 (90.32%), Query Frame = 1

Query: 12  YVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 71
           Y++  EV    VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC
Sbjct: 362 YLDPHEVQQ--VVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFC 421

Query: 72  YRASKLADESDGAPLFTVAETHSTNSGRHGSALNDCSRLVEDT-DGVVHMPNEEEAHEDD 131
           +RASKLA+ESDGAPLFTVAETHSTN GR  SALND SRLVED  DGVVHMPNEEE+HEDD
Sbjct: 422 HRASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDD 481

Query: 132 WQFL 135
           WQFL
Sbjct: 482 WQFL 483

BLAST of Lsi09G010720 vs. NCBI nr
Match: gi|590714446|ref|XP_007049916.1| (Peptidase family C54 protein isoform 2 [Theobroma cacao])

HSP 1 Score: 150.6 bits (379), Expect = 2.0e-33
Identity = 76/126 (60.32%), Positives = 96/126 (76.19%), Query Frame = 1

Query: 10  ILYVEDQEVDGVVVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDN 69
           + Y++  +V   +VVN+ +D+ EADTSSYHC++IRHIPL+SIDPSLAIGF+CRDKDDFD+
Sbjct: 273 VFYLDPHDVQ--LVVNLSQDNQEADTSSYHCDIIRHIPLDSIDPSLAIGFFCRDKDDFDD 332

Query: 70  FCYRASKLADESDGAPLFTVAETHST-NSGRHGSALNDCSRLVEDTDGVVHMPNEEEAHE 129
           FC RASKLADES+GAPLFTVA+THS+     HG+AL+D   + ED    V    +   HE
Sbjct: 333 FCLRASKLADESNGAPLFTVAQTHSSFKPISHGNALDDTGEVREDDSLGVVPDMDGSIHE 392

Query: 130 DDWQFL 135
           DDWQ L
Sbjct: 393 DDWQLL 396

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ATG4_MEDTR   2.2e-29  54.20     Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1
ATG4A_ARATH  5.0e-29  54.03     Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1
ATG4B_ARATH  1.0e-26  48.78     Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1
ATG4A_ORYSI  8.2e-24  46.40     Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3 SV=1
ATG4A_ORYSJ  4.1e-23  45.97     Cysteine protease ATG4A OS=Oryza sativa subsp. japonica GN=ATG4A PE=2 SV=1

Match Name        E-value  Identity  Description
A0A0A0LK81_CUCSA  1.3e-52  86.29     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G128640 PE=3 SV=1
A0A061DVN9_THECC  1.4e-33  60.32     Peptidase family C54 protein isoform 2 OS=Theobroma cacao GN=TCM_003165 PE=3 SV=...
A0A061DMN0_THECC  1.4e-33  60.32     Peptidase family C54 protein isoform 3 OS=Theobroma cacao GN=TCM_003165 PE=3 SV=...
M5XJB4_PRUPE      4.4e-32  59.84     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004885mg PE=3 SV=1
A0A0D2SXT5_GOSRA  7.5e-32  55.88     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G001300 PE=3 SV=1

Match Name   E-value  Identity  Description
AT2G44140.1  2.8e-30  54.03     Peptidase family C54 protein
AT3G59950.1  5.8e-28  48.78     Peptidase family C54 protein

Match Name                        E-value  Identity  Description
gi|659082126|ref|XP_008441684.1|  1.3e-53  87.90     PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis melo]
gi|659082128|ref|XP_008441686.1|  1.3e-53  87.90     PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis melo]
gi|449442361|ref|XP_004138950.1|  1.9e-52  86.29     PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis sativus]
gi|778668355|ref|XP_011649086.1|  1.9e-52  86.29     PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis sativus]
gi|590714446|ref|XP_007049916.1|  2.0e-33  60.32     Peptidase family C54 protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005078  Peptidase_C54
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process GO:0000045 autophagosome assembly
biological_process GO:0006501 C-terminal protein lipidation
biological_process GO:0000422 mitophagy
biological_process GO:0044804 nucleophagy
biological_process GO:0051697 protein delipidation
biological_process GO:0016485 protein processing
biological_process GO:0006612 protein targeting to membrane
biological_process GO:0006914 autophagy
biological_process GO:0015031 protein transport
biological_process GO:0006508 proteolysis
cellular_component GO:0005829 cytosol
cellular_component GO:0005737 cytoplasm
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi09G010720.1  Lsi09G010720.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term     Source Description                    Alignment
IPR005078  Peptidase C54     PANTHER  PTHR22624       APG4 AUTOPHAGY 4-RELATED              coord: 23..134, score: 2.7
IPR005078  Peptidase C54     PFAM     PF03416         Peptidase_C54                         coord: 27..72, score: 2.1
None       No IPR available  PANTHER  PTHR22624:SF34  AUTOPHAGY-SPECIFIC GENE 4, ISOFORM A  coord: 23..134, score: 2.7
None       No IPR available  unknown  SSF54001        Cysteine proteinases                  coord: 27..93, score: 1.08