CSPI04G08230 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI04G08230
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Subtilisin-like protease
Location: Chr4: 5976305 .. 5978842 (-)
RNA-Seq Expression: CSPI04G08230
Synteny: CSPI04G08230
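
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page) and that a minus-strand feature is obtained by reverse-complementing the chromosome slice. The function names are hypothetical and the chromosome string in the usage note is a toy example, not real data.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr4: 5976305 .. 5978842 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Translation table used to complement DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def extract_feature(chrom_seq, start, end, strand):
    """Slice the interval; reverse-complement it for a minus-strand feature."""
    piece = chrom_seq[start - 1:end]
    if strand == "-":
        piece = piece.translate(_COMPLEMENT)[::-1]
    return piece

chrom, start, end, strand = parse_location("Chr4: 5976305 .. 5978842 (-)")
print(chrom, span_length(start, end), strand)
```

Under these assumptions the gene spans 2538 bp on the minus strand of Chr4. With a toy chromosome, extract_feature("AACCGGTT", 2, 4, "-") returns "GGT".
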
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACTTTTGAACTTTAGGGTTCTCTGATGTTAAGGATTTTGTTGATTTTCATTGTTTGATTACAACCCTTTTTTGTTACGATTTCGTAATTGAATTTCATTGTTTTGTTTGATTTTTCAGGCATCTCTGCTAATTGTTTTGTGTTAACTGTTCGCTGGAATCGTCTTTGAAGTTTAGACGCGTTTGATAACTGTCCGCTGGAGTCGTCTTTGAAGTTTAGACGCGTTTGATCACGTAATTTGTTCATGATTTCGAGGTACCTGAATTTGGTTTGAGAGTGATTTCTCTCTATCTCTTTTTGTTTTCAGTTTATTGTTGCGGTTTTTTCTTTTTATCGTTGGATTGGTGGGTGCCATTGAAGATTTGAGAAGAATTTCGTGAAGCCTTGCAATATAGATTTATCTACTGCGATTATTTGCATCTTTTATCCTTTTGATTGCTTAGTTCATATTTGTGGTGGTTTATGTTAGAATTTGTATATATGGAAAAAATATTTATGCAAAACTTACCATAAATATTTCTTTTCCTTTTCTTCTTTGCTCTTTACATTCTATTTAAGCATCTTTTATCCTTTTGATTGCTTAGTTCATATTTTTGGTGGTTTATGTTAGAATTTGTATATATGGAAAAAATATTTATGCAAAACTTACCATAAATATTTCTTTTCCTTTTCTTCTTTGCTCTTTACATTCTATTTAAGCATCTTTTATCCTTTTGATTGCTTAGTTCATATTTTCAGGTAAGCCTTGTTTCCTGTTTATTGCTTTCAATAGTTTATATTCGCTTGGATGTTTGGATTCTGGATGTTAGAGCAGTGGTTGGATGAATTACAATATACAACAATTTCCTTATGATACATTGACCCAAGTAAAGTCATGCGTTTTTTTTTGTTTGGGATCATGGTTACTTATTCATGTCAGTGAAAGACTTTGTCTAAGAAATCGTATAGTAGTTGGTCATAGAAAGACTAGATCCATAGGTTGTACATCTGTATCACTGTCTTGATACTTGCTGTAGAGAGCATGTCTGGCCAATGCCAATGCTTAGTATTTACGATCAAAGATGATTTTTGCAGTTTCTTATGAGGCTGGTGGCTTCTTGGTATCAGTAGATGCCGTTTTTAATCTGATATCTAGTTTAGGCTTTGAGAGATAGAATTCTTATTCTTCTCTTTTTACTGCATTTTTGGTATAATGGTATTGATTTGAAGTTCTGAAAAGACAGGTTGTTACTGTTAGAAATGAGGATTCTGTCTCTGCGCCTAGTTGTTGAAATAGACTTATTTGTTTCTATGTTAGAAGACCACGAAGATTGAATGTATCTACTCCATCCTAGTTCATGGTTTCCTTTCTACCATCTTGTTCTCTATTTCTTACAGTGATAAGAAATTTGGTTGATCATCCTCTTTGCCCTTGTTTCCCTCTCTTCTGCTTTATTCAATTTGGTTTTCTTTTGTTCTCTTTCCGATTTGCCAGAAAAGTACGCATTATTTCTTTATTTTACAATACATTCTTTGTGGTTGAGCCTGATCTCTACATATCTGAATTCTTCCTTGGTTTCTCTTTTGTTTCATCTCTTGCTCTTTACCATTCCTAGCAGCAGTTGGAGGAGATCATTAGGGAATGTCAGGTCCTTCATCGGCAATTCGATGGGCGGTCTCAGGGGTGGTGCCAATCTCGCTTCTTGGGTTGTGGCTGGAACGCTCGCTTACTACCTCTGGGTCAAGCCTTCCCAAGACCTCAAACGGGAACAGCAGGAAAGGGCTGCTCTTGCAGCTGTGGATCCTCATCGGTATATTGAGAAGAGAAAACCTATTCCTGATCCCCAGGAAACTGGTTTGATATATGGCAACAAGAATACACCTCGAAAACCCGAGGAATGAGCTACATGAGTGAAATCTTTGGAGAAATGTAACGGCTTAAGCTTTTCACTGATCAAAGGTGAAGACGGGGGTCAGAAATTATAATCTAGGTAGGAGCTAGAGCCTAAAACAGAGAAGATT
AGAAGTTGATACATGAACAAAACTGTTGTTCAGTAAAGAGGTGTGGCCAAGTGATTTGGAATTTTGGATCCTTTTAATGGAACCCTTAGTTGAAGATTGAAAAAGTTGTGTATTTTGCGTTTGGCACTGATGAACAATTGAATTGTCTACCATAGATTTATATTTCTAATTTTCTAAAAAGATTGTCTACCATGAGAATTTGGATATTACTTTAAGAATTCAAACTTAAAATTTGTGTTTCTATCAGTCAATATATACTGTGAAAAAAGGATTGATTAAAATCAGTAGAGTTTTGGTTCTTTCATATGTTATTGGGACCATTTTGATTGGGTGAGAGCATTAACGAACGTTGATCAACCTTT

mRNA sequence

ACTTTTGAACTTTAGGGTTCTCTGATGTTAAGGATTTTGTTGATTTTCATTGTTTGATTACAACCCTTTTTTGTTACGATTTCGTAATTGAATTTCATTGTTTTGTTTGATTTTTCAGGCATCTCTGCTAATTGTTTTGTGTTAACTGTTCGCTGGAATCGTCTTTGAAGTTTAGACGCGTTTGATAACTGTCCGCTGGAGTCGTCTTTGAAGTTTAGACGCGTTTGATCACGTAATTTGTTCATGATTTCGAGCAGTTGGAGGAGATCATTAGGGAATGTCAGGTCCTTCATCGGCAATTCGATGGGCGGTCTCAGGGGTGGTGCCAATCTCGCTTCTTGGGTTGTGGCTGGAACGCTCGCTTACTACCTCTGGGTCAAGCCTTCCCAAGACCTCAAACGGGAACAGCAGGAAAGGGCTGCTCTTGCAGCTGTGGATCCTCATCGGTATATTGAGAAGAGAAAACCTATTCCTGATCCCCAGGAAACTGGTTTGATATATGGCAACAAGAATACACCTCGAAAACCCGAGGAATGAGCTACATGAGTGAAATCTTTGGAGAAATGTAACGGCTTAAGCTTTTCACTGATCAAAGGTGAAGACGGGGGTCAGAAATTATAATCTAGGTAGGAGCTAGAGCCTAAAACAGAGAAGATTAGAAGTTGATACATGAACAAAACTGTTGTTCAGTAAAGAGGTGTGGCCAAGTGATTTGGAATTTTGGATCCTTTTAATGGAACCCTTAGTTGAAGATTGAAAAAGTTGTGTATTTTGCGTTTGGCACTGATGAACAATTGAATTGTCTACCATAGATTTATATTTCTAATTTTCTAAAAAGATTGTCTACCATGAGAATTTGGATATTACTTTAAGAATTCAAACTTAAAATTTGTGTTTCTATCAGTCAATATATACTGTGAAAAAAGGATTGATTAAAATCAGTAGAGTTTTGGTTCTTTCATATGTTATTGGGACCATTTTGATTGGGTGAGAGCATTAACGAACGTTGATCAACCTTT

Coding sequence (CDS)

ATGATTTCGAGCAGTTGGAGGAGATCATTAGGGAATGTCAGGTCCTTCATCGGCAATTCGATGGGCGGTCTCAGGGGTGGTGCCAATCTCGCTTCTTGGGTTGTGGCTGGAACGCTCGCTTACTACCTCTGGGTCAAGCCTTCCCAAGACCTCAAACGGGAACAGCAGGAAAGGGCTGCTCTTGCAGCTGTGGATCCTCATCGGTATATTGAGAAGAGAAAACCTATTCCTGATCCCCAGGAAACTGGTTTGATATATGGCAACAAGAATACACCTCGAAAACCCGAGGAATGA
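
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name is hypothetical, and only short excerpts of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest), one amino-acid letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons and last four codons of the CDS shown above.
print(translate("ATGATTTCGAGCAGTTGGAGGAGA"))
print(translate("CCCGAGGAATGA"))
```

The two excerpts translate to MISSSWRR and PEE*, which agree with the start and end of the protein sequence listed for this gene.
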

Protein sequence

MISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE*
Homology
BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match: A0A0A0KZ01 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G089280 PE=4 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 3.2e-47
Identity = 96/97 (98.97%), Positives = 96/97 (98.97%), Query Frame = 0

Query: 1  MISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60
          M SSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA
Sbjct: 1  MTSSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60

Query: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE
Sbjct: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 97
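
The Identity and Positives figures reported for each HSP can be recomputed from the alignment rows. The following is a minimal sketch, assuming the query and subject rows have been concatenated into equal-length aligned strings and that the midline marks identical columns with a letter and similar columns with '+'. The function names are hypothetical, and the input is a 13-residue excerpt of the alignment above.

```python
def identity_stats(query, sbjct):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    # A column counts as identical when both residues match and are not gaps.
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return matches, len(query), round(100.0 * matches / len(query), 2)

def positives(midline):
    """Count midline columns marked as identical (letter) or similar ('+')."""
    return sum(1 for ch in midline if ch != " ")

# First 13 columns of the alignment above (one mismatch at position 2).
print(identity_stats("MISSSWRRSLGNV", "MTSSSWRRSLGNV"))
print(positives("M SSSWRRSLGNV"))

# The reported percentage follows from the reported counts.
print(round(100.0 * 96 / 97, 2))
```

For the excerpt this gives 12 identical columns out of 13, and the last line reproduces the reported 98.97% from the counts 96/97.
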

BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match: A0A1S3BT82 (uncharacterized protein LOC103492913 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492913 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 2.7e-46
Identity = 94/97 (96.91%), Positives = 95/97 (97.94%), Query Frame = 0

Query: 1  MISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60
          M SSSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA
Sbjct: 1  MTSSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60

Query: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 97

BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match: A0A1S3BS24 (uncharacterized protein LOC103492913 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492913 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 7.8e-46
Identity = 93/95 (97.89%), Positives = 94/95 (98.95%), Query Frame = 0

Query: 3  SSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA 62
          SSSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA
Sbjct: 4  SSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA 63

Query: 63 AVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          AVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 64 AVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 98

BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match: A0A1S4DYR4 (uncharacterized protein LOC103492913 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103492913 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.3e-45
Identity = 92/96 (95.83%), Positives = 95/96 (98.96%), Query Frame = 0

Query: 2  ISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 61
          ++SSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL
Sbjct: 1  MTSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 60

Query: 62 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          AAVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 61 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 96

BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match: A0A6J1DP57 (uncharacterized protein LOC111022900 OS=Momordica charantia OX=3673 GN=LOC111022900 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 5.6e-44
Identity = 88/96 (91.67%), Positives = 93/96 (96.88%), Query Frame = 0

Query: 2  ISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 61
          ++SSWRRS GNVRSFIGNSMGGLRGG+NLASW+VAGTLAY+LWVKPSQDLKREQQERAAL
Sbjct: 1  MASSWRRSFGNVRSFIGNSMGGLRGGSNLASWIVAGTLAYFLWVKPSQDLKREQQERAAL 60

Query: 62 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          A  DPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE
Sbjct: 61 ADTDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 96

BLAST of CSPI04G08230 vs. NCBI nr
Match: XP_004137209.1 (uncharacterized protein LOC101205325 isoform X1 [Cucumis sativus])

HSP 1 Score: 197.2 bits (500), Expect = 6.6e-47
Identity = 96/97 (98.97%), Positives = 96/97 (98.97%), Query Frame = 0

Query: 1  MISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60
          M SSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA
Sbjct: 1  MTSSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60

Query: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE
Sbjct: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 97

BLAST of CSPI04G08230 vs. NCBI nr
Match: XP_011653322.1 (uncharacterized protein LOC101205325 isoform X2 [Cucumis sativus])

HSP 1 Score: 194.9 bits (494), Expect = 3.3e-46
Identity = 94/96 (97.92%), Positives = 96/96 (100.00%), Query Frame = 0

Query: 2  ISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 61
          ++SSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL
Sbjct: 1  MTSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 60

Query: 62 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE
Sbjct: 61 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 96

BLAST of CSPI04G08230 vs. NCBI nr
Match: XP_008451690.1 (PREDICTED: uncharacterized protein LOC103492913 isoform X2 [Cucumis melo])

HSP 1 Score: 194.1 bits (492), Expect = 5.6e-46
Identity = 94/97 (96.91%), Positives = 95/97 (97.94%), Query Frame = 0

Query: 1  MISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60
          M SSSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA
Sbjct: 1  MTSSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAA 60

Query: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 61 LAAVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 97

BLAST of CSPI04G08230 vs. NCBI nr
Match: XP_008451686.1 (PREDICTED: uncharacterized protein LOC103492913 isoform X1 [Cucumis melo] >XP_008451687.1 PREDICTED: uncharacterized protein LOC103492913 isoform X1 [Cucumis melo] >XP_008451688.1 PREDICTED: uncharacterized protein LOC103492913 isoform X1 [Cucumis melo] >XP_008451689.1 PREDICTED: uncharacterized protein LOC103492913 isoform X1 [Cucumis melo])

HSP 1 Score: 192.6 bits (488), Expect = 1.6e-45
Identity = 93/95 (97.89%), Positives = 94/95 (98.95%), Query Frame = 0

Query: 3  SSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA 62
          SSSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA
Sbjct: 4  SSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAALA 63

Query: 63 AVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          AVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 64 AVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 98

BLAST of CSPI04G08230 vs. NCBI nr
Match: XP_016901126.1 (PREDICTED: uncharacterized protein LOC103492913 isoform X3 [Cucumis melo])

HSP 1 Score: 191.8 bits (486), Expect = 2.8e-45
Identity = 92/96 (95.83%), Positives = 95/96 (98.96%), Query Frame = 0

Query: 2  ISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 61
          ++SSWRRS GNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL
Sbjct: 1  MTSSWRRSFGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 60

Query: 62 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
          AAVDPHRYIEKRKPIPDPQETGLIYGNKNTP+KPEE
Sbjct: 61 AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPQKPEE 96

BLAST of CSPI04G08230 vs. TAIR 10
Match: AT2G33585.1 (unknown protein; Has 31 Blast hits to 31 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 149.8 bits (377), Expect = 1.1e-36
Identity = 70/96 (72.92%), Positives = 83/96 (86.46%), Query Frame = 0

Query: 2   ISSSWRRSLGNVRSFIGNSMGGLRGGANLASWVVAGTLAYYLWVKPSQDLKREQQERAAL 61
           ++SSWRRS+GNVRSFIGNSMGGLRGG + ASWVVAGT+AY+LW+KP QDLK+EQ+ RAAL
Sbjct: 98  MASSWRRSIGNVRSFIGNSMGGLRGGQSAASWVVAGTIAYFLWIKPEQDLKKEQEARAAL 157

Query: 62  AAVDPHRYIEKRKPIPDPQETGLIYGNKNTPRKPEE 98
           A  D ++Y+EKRKPI DPQ TGLIYGNKN   K E+
Sbjct: 158 AMADTNQYVEKRKPIADPQVTGLIYGNKNGTDKSED 193

The following BLAST results are available for this feature:
BLAST of CSPI04G08230 vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0KZ01      3.2e-47  98.97         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G089280 PE=4 SV=1
A0A1S3BT82      2.7e-46  96.91         uncharacterized protein LOC103492913 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3BS24      7.8e-46  97.89         uncharacterized protein LOC103492913 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4DYR4      1.3e-45  95.83         uncharacterized protein LOC103492913 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1DP57      5.6e-44  91.67         uncharacterized protein LOC111022900 OS=Momordica charantia OX=3673 GN=LOC111022...

BLAST of CSPI04G08230 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_004137209.1  6.6e-47  98.97         uncharacterized protein LOC101205325 isoform X1 [Cucumis sativus]
XP_011653322.1  3.3e-46  97.92         uncharacterized protein LOC101205325 isoform X2 [Cucumis sativus]
XP_008451690.1  5.6e-46  96.91         PREDICTED: uncharacterized protein LOC103492913 isoform X2 [Cucumis melo]
XP_008451686.1  1.6e-45  97.89         PREDICTED: uncharacterized protein LOC103492913 isoform X1 [Cucumis melo] >XP_00...
XP_016901126.1  2.8e-45  95.83         PREDICTED: uncharacterized protein LOC103492913 isoform X3 [Cucumis melo]

BLAST of CSPI04G08230 vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G33585.1     1.1e-36  72.92         unknown protein; Has 31 Blast hits to 31 proteins in 12 species: Archae - 0; Bac...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description        Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction       coord: 70..97
None      No IPR available  PANTHER      PTHR37213:SF1  SUBTILISIN-LIKE PROTEASE  coord: 2..96
None      No IPR available  PANTHER      PTHR37213      SUBTILISIN-LIKE PROTEASE  coord: 2..96
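
The alignment coordinates above can be converted into residue counts and coverage of the protein. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and taking the protein length as 97 residues (the protein sequence listed for this gene, excluding the stop symbol). The function names are hypothetical.

```python
import re

PROTEIN_LENGTH = 97  # residues in the listed protein, stop symbol excluded

def parse_coord(text):
    """Parse a string such as 'coord: 2..96' into (start, end)."""
    m = re.fullmatch(r"\s*coord:\s*(\d+)\.\.(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2))

def coverage(start, end, length=PROTEIN_LENGTH):
    """Fraction of the protein covered by a 1-based inclusive interval."""
    return (end - start + 1) / length

for label, coord in [("mobidb-lite", "coord: 70..97"),
                     ("PTHR37213", "coord: 2..96")]:
    start, end = parse_coord(coord)
    print(label, end - start + 1, round(coverage(start, end), 4))
```

Under these assumptions the predicted disordered region covers 28 residues (about 29% of the protein), and the PANTHER family match covers 95 residues (about 98%).
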

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G08230.2  CSPI04G08230.2  mRNA
CSPI04G08230.1  CSPI04G08230.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane