Csa3G131900 (gene) Cucumber (Chinese Long) v2

Name: Csa3G131900
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Cysteine protease, putative; contains IPR000668 (Peptidase C1A, papain C-terminal)
Location: Chr3 : 8509728 .. 8510066 (-)
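
The length of the feature follows from the location coordinates. The sketch below is a minimal illustration in Python, assuming the location string always has the layout shown on this page and that the coordinates are 1-based and inclusive; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chr3 : 8509728 .. 8510066 (-)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr3 : 8509728 .. 8510066 (-)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, strand, span)  # Chr3 - 339
```

Under that assumption the gene spans 339 bp on the minus strand of Chr3.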
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGCGCATTAAACATCCACACATCTTCCAAACACGTGTAAGTATGTTTTCTAAATAATATGCGTATAATATATACATGTTTGTCTATTGATTTTGTTAGGATGGTTTGCCCAAGAAAGTGGATTGGGTTGAGAGCGCTGCTGTTACTCCGCCAAGAGATCAAGGACCCTCTCCAACTTGTAGGGCATATAGTGGAGTAGCAGCAATAGAATCGATGAATAAAATAAAAAGAGGGCAATTAGTTAACCTGACCGTTGTTGATGTTATTATTGACAACATACGGTTTTGGATGGATGGAGCATGGCCGGACGTTGTATTTCACGATGGAGTAAAGTAA

mRNA sequence

ATGTGGCGCATTAAACATCCACACATCTTCCAAACACGTGATGGTTTGCCCAAGAAAGTGGATTGGGTTGAGAGCGCTGCTGTTACTCCGCCAAGAGATCAAGGACCCTCTCCAACTTGTAGGGCATATAGTGGAGTAGCAGCAATAGAATCGATGAATAAAATAAAAAGAGGGCAATTAGTTAACCTGACCGTTGTTGATGTTATTATTGACAACATACGGTTTTGGATGGATGGAGCATGGCCGGACGTTGTATTTCACGATGGAGTAAAGTAA

Coding sequence (CDS)

ATGTGGCGCATTAAACATCCACACATCTTCCAAACACGTGATGGTTTGCCCAAGAAAGTGGATTGGGTTGAGAGCGCTGCTGTTACTCCGCCAAGAGATCAAGGACCCTCTCCAACTTGTAGGGCATATAGTGGAGTAGCAGCAATAGAATCGATGAATAAAATAAAAAGAGGGCAATTAGTTAACCTGACCGTTGTTGATGTTATTATTGACAACATACGGTTTTGGATGGATGGAGCATGGCCGGACGTTGTATTTCACGATGGAGTAAAGTAA

Protein sequence

MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK*
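
The protein sequence is the codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first 13 codons of the CDS above can be translated as follows; the function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 39 nucleotides (13 codons) of the CDS listed above.
cds_prefix = "ATGTGGCGCATTAAACATCCACACATCTTCCAAACACGT"
print(translate(cds_prefix))  # MWRIKHPHIFQTR
```

The result matches the first 13 residues of the protein sequence; passing the full CDS works the same way and ends with `*` for the terminal TAA stop codon.
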
BLAST of Csa3G131900 vs. Swiss-Prot
Match: RDL5_ARATH (Probable cysteine protease RDL5 OS=Arabidopsis thaliana GN=RDL5 PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 7.4e-08
Identity = 30/62 (48.39%), Positives = 38/62 (61.29%), Query Frame = 1

Query: 10  FQTRDG--LPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVD 69
           ++T DG  LPK VDW    AVT  +DQG   +C A+S V A+E +NKI  G+LV L+  D
Sbjct: 136 YKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 195
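
The percentages in the HSP statistics line are simply matches divided by alignment length. A minimal sketch, assuming the statistics line has the `label = matches/length` layout shown above (the helper `percent_from_stats` is hypothetical):

```python
import re

def percent_from_stats(line, label):
    """Return (matches, length, recomputed percent) for e.g. 'Identity = 30/62'."""
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"{label!r} not found in statistics line")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100.0 * matches / length, 2)

stats = "Identity = 30/62 (48.39%), Query Frame = 1"
print(percent_from_stats(stats, "Identity"))  # (30, 62, 48.39)
```

Here 30 identical positions over a 62-column alignment give the reported 48.39%.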

BLAST of Csa3G131900 vs. Swiss-Prot
Match: RD21C_ARATH (Probable cysteine protease RD21C OS=Arabidopsis thaliana GN=RD21C PE=1 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 9.7e-08
Identity = 22/62 (35.48%), Positives = 39/62 (62.90%), Query Frame = 1

Query: 8   HIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVD 67
           ++++  D LP  +DW    AV P +DQG   +C A+S + A+E +N+IK G+L++L+  +
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query: 68  VI 70
           ++
Sbjct: 181 LV 182

BLAST of Csa3G131900 vs. Swiss-Prot
Match: RDL4_ARATH (Probable cysteine protease RDL4 OS=Arabidopsis thaliana GN=RDL4 PE=2 SV=1)

HSP 1 Score: 57.0 bits (136), Expect = 1.3e-07
Identity = 31/71 (43.66%), Positives = 38/71 (53.52%), Query Frame = 1

Query: 8   HIFQTR---------DGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRG 67
           H+F T          D LPK VDW    AVT  +DQG   +C A+S V A+E +NKI  G
Sbjct: 120 HVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTG 179

Query: 68  QLVNLTVVDVI 70
           +LV L+  D+I
Sbjct: 180 ELVTLSEQDLI 190

BLAST of Csa3G131900 vs. Swiss-Prot
Match: ANAN_ANACO (Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2)

HSP 1 Score: 56.6 bits (135), Expect = 1.6e-07
Identity = 25/66 (37.88%), Positives = 40/66 (60.61%), Query Frame = 1

Query: 16  LPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVIIDNIRF 75
           +P+ +DW +S AVT  ++QG   +C A++ +A +ES+ KIKRG LV+L+   V+   + +
Sbjct: 123 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSY 182

Query: 76  WMDGAW 82
              G W
Sbjct: 183 GCKGGW 188

BLAST of Csa3G131900 vs. Swiss-Prot
Match: PAPA3_CARPA (Caricain OS=Carica papaya PE=1 SV=2)

HSP 1 Score: 55.8 bits (133), Expect = 2.8e-07
Identity = 25/54 (46.30%), Positives = 35/54 (64.81%), Query Frame = 1

Query: 16  LPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVI 70
           LP+ VDW +  AVTP R QG   +C A+S VA +E +NKI+ G+LV L+  +++
Sbjct: 133 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELV 186

BLAST of Csa3G131900 vs. TrEMBL
Match: A0A0A0L7Y4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G131900 PE=4 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 1.9e-47
Identity = 91/91 (100.00%), Positives = 91/91 (100.00%), Query Frame = 1

Query: 1  MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL 60
          MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL
Sbjct: 1  MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL 60

Query: 61 VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK 92
          VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK
Sbjct: 61 VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK 91

BLAST of Csa3G131900 vs. TrEMBL
Match: A0A0D3A5L6_BRAOL (Uncharacterized protein OS=Brassica oleracea var. oleracea PE=3 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 1.5e-07
Identity = 29/62 (46.77%), Positives = 42/62 (67.74%), Query Frame = 1

Query: 3   RIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVN 62
           R+ H ++   RD LP+ VDW +  AVT  +DQG   +C A+S VAA+E +NKI  G+LV+
Sbjct: 121 RVSHRYVSLPRDQLPESVDWRKEGAVTAVKDQGSCSSCWAFSSVAAVEGINKIVTGELVS 180

Query: 63  LT 65
           L+
Sbjct: 181 LS 182

BLAST of Csa3G131900 vs. TrEMBL
Match: B9SGM8_RICCO (Cysteine protease, putative OS=Ricinus communis GN=RCOM_0554360 PE=3 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 2.0e-07
Identity = 29/62 (46.77%), Positives = 42/62 (67.74%), Query Frame = 1

Query: 8   HIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVD 67
           H+ +    LP  VDW E+ AVTP +DQG   +C A+S VAA+E +NKIK G LV+L+  +
Sbjct: 121 HMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQE 180

Query: 68  VI 70
           ++
Sbjct: 181 LV 182

BLAST of Csa3G131900 vs. TrEMBL
Match: R0H3G6_9BRAS (Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006494mg PE=3 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 2.6e-07
Identity = 32/73 (43.84%), Positives = 48/73 (65.75%), Query Frame = 1

Query: 3   RIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVN 62
           R+ H ++    D LP+ VDW +  AV+  +DQG   +C A+S VAA+ES+NKI  G+L++
Sbjct: 118 RVTHRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRCNSCWAFSTVAAVESINKIVTGELIS 177

Query: 63  LT---VVDVIIDN 73
           L+   +VD  IDN
Sbjct: 178 LSEQELVDCSIDN 190

BLAST of Csa3G131900 vs. TrEMBL
Match: K4BG42_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 4.4e-07
Identity = 30/74 (40.54%), Positives = 47/74 (63.51%), Query Frame = 1

Query: 6   HPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTV 65
           H + F+  D +PK VDW +  AV P +DQG   +C A+S VAA+E +N+I  G+++ L+ 
Sbjct: 122 HHYDFRASDSVPKSVDWRKKGAVAPIKDQGTCGSCWAFSTVAAVEGINQIATGEMITLSE 181

Query: 66  VDVIIDNIRFWMDG 80
            + +ID  R + DG
Sbjct: 182 QE-LIDCDRMYNDG 194

BLAST of Csa3G131900 vs. TAIR10
Match: AT4G11320.1 (AT4G11320.1 Papain family cysteine protease)

HSP 1 Score: 57.8 bits (138), Expect = 4.2e-09
Identity = 30/62 (48.39%), Positives = 38/62 (61.29%), Query Frame = 1

Query: 10  FQTRDG--LPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVD 69
           ++T DG  LPK VDW    AVT  +DQG   +C A+S V A+E +NKI  G+LV L+  D
Sbjct: 136 YKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 195

BLAST of Csa3G131900 vs. TAIR10
Match: AT3G19390.1 (AT3G19390.1 Granulin repeat cysteine protease family protein)

HSP 1 Score: 57.4 bits (137), Expect = 5.4e-09
Identity = 22/62 (35.48%), Positives = 39/62 (62.90%), Query Frame = 1

Query: 8   HIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVD 67
           ++++  D LP  +DW    AV P +DQG   +C A+S + A+E +N+IK G+L++L+  +
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query: 68  VI 70
           ++
Sbjct: 181 LV 182

BLAST of Csa3G131900 vs. TAIR10
Match: AT4G11310.1 (AT4G11310.1 Papain family cysteine protease)

HSP 1 Score: 57.0 bits (136), Expect = 7.1e-09
Identity = 31/71 (43.66%), Positives = 38/71 (53.52%), Query Frame = 1

Query: 8   HIFQTR---------DGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRG 67
           H+F T          D LPK VDW    AVT  +DQG   +C A+S V A+E +NKI  G
Sbjct: 120 HVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTG 179

Query: 68  QLVNLTVVDVI 70
           +LV L+  D+I
Sbjct: 180 ELVTLSEQDLI 190

BLAST of Csa3G131900 vs. TAIR10
Match: AT4G23520.1 (AT4G23520.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 55.1 bits (131), Expect = 2.7e-08
Identity = 25/60 (41.67%), Positives = 39/60 (65.00%), Query Frame = 1

Query: 14  DGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVIIDNI 73
           D LP+ VDW +  AV+  +DQG   +C A+S VAA+E +NKI  G+L++L+  +++  N+
Sbjct: 131 DQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNL 190

BLAST of Csa3G131900 vs. TAIR10
Match: AT1G06260.1 (AT1G06260.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 54.7 bits (130), Expect = 3.5e-08
Identity = 28/54 (51.85%), Positives = 33/54 (61.11%), Query Frame = 1

Query: 16  LPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVI 70
           +P  VDW    AVTP R+QG    C A+S VAAIE +NKIK G LV+L+   +I
Sbjct: 127 VPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLI 180

BLAST of Csa3G131900 vs. NCBI nr
Match: gi|700201604|gb|KGN56737.1| (hypothetical protein Csa_3G131900 [Cucumis sativus])

HSP 1 Score: 196.1 bits (497), Expect = 2.8e-47
Identity = 91/91 (100.00%), Positives = 91/91 (100.00%), Query Frame = 1

Query: 1  MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL 60
          MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL
Sbjct: 1  MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQL 60

Query: 61 VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK 92
          VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK
Sbjct: 61 VNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK 91

BLAST of Csa3G131900 vs. NCBI nr
Match: gi|778688440|ref|XP_011652750.1| (PREDICTED: ananain-like [Cucumis sativus])

HSP 1 Score: 167.2 bits (422), Expect = 1.4e-38
Identity = 79/82 (96.34%), Positives = 79/82 (96.34%), Query Frame = 1

Query: 10  FQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVI 69
           F   DGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVI
Sbjct: 168 FNRLDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVI 227

Query: 70  IDNIRFWMDGAWPDVVFHDGVK 92
           IDNIRFWMDGAWPDVVFHDGVK
Sbjct: 228 IDNIRFWMDGAWPDVVFHDGVK 249

BLAST of Csa3G131900 vs. NCBI nr
Match: gi|922414518|ref|XP_013590665.1| (PREDICTED: zingipain-2 [Brassica oleracea var. oleracea])

HSP 1 Score: 63.5 bits (153), Expect = 2.2e-07
Identity = 29/62 (46.77%), Positives = 42/62 (67.74%), Query Frame = 1

Query: 3   RIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVN 62
           R+ H ++   RD LP+ VDW +  AVT  +DQG   +C A+S VAA+E +NKI  G+LV+
Sbjct: 121 RVSHRYVSLPRDQLPESVDWRKEGAVTAVKDQGSCSSCWAFSSVAAVEGINKIVTGELVS 180

Query: 63  LT 65
           L+
Sbjct: 181 LS 182

BLAST of Csa3G131900 vs. NCBI nr
Match: gi|802640836|ref|XP_012079031.1| (PREDICTED: ervatamin-B [Jatropha curcas])

HSP 1 Score: 63.5 bits (153), Expect = 2.2e-07
Identity = 30/67 (44.78%), Positives = 43/67 (64.18%), Query Frame = 1

Query: 3   RIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVN 62
           R K  H       LP  VDW++  AVTP +DQG   +C A+S VAA+E +NKIK G+LV+
Sbjct: 115 RKKPSHEHGNSSDLPTSVDWIKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLVS 174

Query: 63  LTVVDVI 70
           L+  +++
Sbjct: 175 LSEQELV 181

BLAST of Csa3G131900 vs. NCBI nr
Match: gi|970015214|ref|XP_015068628.1| (PREDICTED: cysteine proteinase COT44-like [Solanum pennellii])

HSP 1 Score: 63.2 bits (152), Expect = 2.8e-07
Identity = 29/74 (39.19%), Positives = 48/74 (64.86%), Query Frame = 1

Query: 6   HPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTV 65
           H ++F+  D +PK +DW +  AV P +DQG   +C A+S VAA+E +N+I  G+++ L+ 
Sbjct: 122 HRYVFRASDSVPKSIDWRKKGAVAPIKDQGTCGSCWAFSTVAAVEGINQIATGEMMTLSE 181

Query: 66  VDVIIDNIRFWMDG 80
            + +ID  R + DG
Sbjct: 182 QE-LIDCDRMYNDG 194

The following BLAST results are available for this feature:

Swiss-Prot:
Match Name        E-value  Identity (%)  Description
RDL5_ARATH        7.4e-08  48.39         Probable cysteine protease RDL5 OS=Arabidopsis thaliana GN=RDL5 PE=2 SV=1
RD21C_ARATH       9.7e-08  35.48         Probable cysteine protease RD21C OS=Arabidopsis thaliana GN=RD21C PE=1 SV=1
RDL4_ARATH        1.3e-07  43.66         Probable cysteine protease RDL4 OS=Arabidopsis thaliana GN=RDL4 PE=2 SV=1
ANAN_ANACO        1.6e-07  37.88         Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
PAPA3_CARPA       2.8e-07  46.30         Caricain OS=Carica papaya PE=1 SV=2

TrEMBL:
Match Name        E-value  Identity (%)  Description
A0A0A0L7Y4_CUCSA  1.9e-47  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_3G131900 PE=4 SV=1
A0A0D3A5L6_BRAOL  1.5e-07  46.77         Uncharacterized protein OS=Brassica oleracea var. oleracea PE=3 SV=1
B9SGM8_RICCO      2.0e-07  46.77         Cysteine protease, putative OS=Ricinus communis GN=RCOM_0554360 PE=3 SV=1
R0H3G6_9BRAS      2.6e-07  43.84         Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006494mg PE=3 SV=1
K4BG42_SOLLC      4.4e-07  40.54         Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1

TAIR10:
Match Name        E-value  Identity (%)  Description
AT4G11320.1       4.2e-09  48.39         Papain family cysteine protease
AT3G19390.1       5.4e-09  35.48         Granulin repeat cysteine protease family protein
AT4G11310.1       7.1e-09  43.66         Papain family cysteine protease
AT4G23520.1       2.7e-08  41.67         Cysteine proteinases superfamily protein
AT1G06260.1       3.5e-08  51.85         Cysteine proteinases superfamily protein

NCBI nr:
Match Name                        E-value  Identity (%)  Description
gi|700201604|gb|KGN56737.1|       2.8e-47  100.00        hypothetical protein Csa_3G131900 [Cucumis sativus]
gi|778688440|ref|XP_011652750.1|  1.4e-38  96.34         PREDICTED: ananain-like [Cucumis sativus]
gi|922414518|ref|XP_013590665.1|  2.2e-07  46.77         PREDICTED: zingipain-2 [Brassica oleracea var. oleracea]
gi|802640836|ref|XP_012079031.1|  2.2e-07  44.78         PREDICTED: ervatamin-B [Jatropha curcas]
gi|970015214|ref|XP_015068628.1|  2.8e-07  39.19         PREDICTED: cysteine proteinase COT44-like [Solanum pennellii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000668   Peptidase_C1A_C
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0008234  cysteine-type peptidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0008234 cysteine-type peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa3G131900.1  Csa3G131900.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                   Source   Source Term       Source Description    Alignment
IPR000668  Peptidase C1A, papain C-terminal  PFAM     PF00112           Peptidase_C1          coord: 16..69; score: 4.
None       No IPR available                  GENE3D   G3DSA:3.90.70.10  -                     coord: 8..68; score: 9.2
None       No IPR available                  unknown  SSF54001          Cysteine proteinases  coord: 4..86; score: 9.12
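
The `coord` ranges in the InterPro annotations can be mapped back onto the protein sequence. The sketch below assumes these coordinates are 1-based, inclusive residue positions on the 91-residue protein listed earlier on this page; the helper `subsequence` is hypothetical.

```python
# Protein sequence from this page, without the trailing '*' stop symbol.
PROTEIN = "MWRIKHPHIFQTRDGLPKKVDWVESAAVTPPRDQGPSPTCRAYSGVAAIESMNKIKRGQLVNLTVVDVIIDNIRFWMDGAWPDVVFHDGVK"

def subsequence(seq, start, end):
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# PF00112 (Peptidase_C1) is annotated at coord 16..69.
domain = subsequence(PROTEIN, 16, 69)
print(len(PROTEIN), len(domain), domain[:7])
```

Under that assumption the Pfam match covers 54 of the 91 residues and begins at the `LPKKVDW` motif that also opens several of the BLAST alignments above.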