ClCG02G012910.1 (mRNA) Watermelon (Charleston Gray)

NameClCG02G012910.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionPR-6 proteinase inhibitor family protein
LocationCG_Chr02 : 26657941 .. 26658445 (+)
Sequence length282
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGGTATTCGTACTTGTAGCTCTCCTCCTTACCCACCTTGTGTAAGTGATGGTTGCATGGATCCTCAATGTTGCGGTTAGAGCCTTTATCCCTCTTTTAATCAAAATTCGTTCCACCTCTAATTTTCTTATTAGATGAGATAATATTACATTTTAAGCAATTGTTTAAATTAGAATACAAATTATGAATATGGTACGAAAAATATCAAATACAGTTATATATGTGTTTATATTGAATCTCTTGTTTTTTAAATTGTCCATCAAATTGGAGGAATAATTACTAACGTTGAGTTTTGTAGATGCAGGAGGGAATTTTAGATGGCCTAAACTAGTAGGTAAGAAGGGAGATTATGCTAAAAGTGAAATAGAAAAGGATGTGCGTTGTGTGAGGGTTGTAATTTTAAGACAAGGCACGGTTAGAATTGAAAATTTCTGTTGCAATCGTGTTTTTGTTTACGTGGATGATAGTGGCCACGTGGTTGATGTCCCAACCATTGGTTGA

mRNA sequence

ATGGCAGGTATTCGTACTTGTAGCTCTCCTCCTTACCCACCTTGTGTAAGTGATGGTTGCATGGATCCTCAATGTTGCGATGCAGGAGGGAATTTTAGATGGCCTAAACTAGTAGGTAAGAAGGGAGATTATGCTAAAAGTGAAATAGAAAAGGATGTGCGTTGTGTGAGGGTTGTAATTTTAAGACAAGGCACGGTTAGAATTGAAAATTTCTGTTGCAATCGTGTTTTTGTTTACGTGGATGATAGTGGCCACGTGGTTGATGTCCCAACCATTGGTTGA

Coding sequence (CDS)

ATGGCAGGTATTCGTACTTGTAGCTCTCCTCCTTACCCACCTTGTGTAAGTGATGGTTGCATGGATCCTCAATGTTGCGATGCAGGAGGGAATTTTAGATGGCCTAAACTAGTAGGTAAGAAGGGAGATTATGCTAAAAGTGAAATAGAAAAGGATGTGCGTTGTGTGAGGGTTGTAATTTTAAGACAAGGCACGGTTAGAATTGAAAATTTCTGTTGCAATCGTGTTTTTGTTTACGTGGATGATAGTGGCCACGTGGTTGATGTCCCAACCATTGGTTGA

Protein sequence

MAGIRTCSSPPYPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDVPTIG
BLAST of ClCG02G012910.1 vs. Swiss-Prot
Match: ICI_LINUS (Proteinase inhibitor OS=Linum usitatissimum PE=1 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 3.4e-08
Identity = 26/63 (41.27%), Postives = 40/63 (63.49%), Query Frame = 1

Query: 30 GNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDV 89
          G   WP+LVGK G+ A + +E++ R V  ++L++G+   ++F C+RV+V V+D G V  V
Sbjct: 6  GKNAWPELVGKSGNMAAATVERENRNVHAIVLKEGSAMTKDFRCDRVWVIVNDHGVVTSV 65

Query: 90 PTI 93
          P I
Sbjct: 66 PHI 68

BLAST of ClCG02G012910.1 vs. Swiss-Prot
Match: ITH5_CUCMA (Inhibitor of trypsin and hageman factor OS=Cucurbita maxima PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 8.3e-07
Identity = 29/64 (45.31%), Postives = 37/64 (57.81%), Query Frame = 1

Query: 30 GNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDV 89
          G   WP LVG  G  AK+ IE+    V+ VIL +GT   ++F CNRV ++V+  G VV  
Sbjct: 5  GKSSWPHLVGVGGSVAKAIIERQNPNVKAVILEEGTPVTKDFRCNRVRIWVNKRGLVVSP 64

Query: 90 PTIG 94
          P IG
Sbjct: 65 PRIG 68

BLAST of ClCG02G012910.1 vs. Swiss-Prot
Match: BGIA_MOMCH (Glu S.griseus protease inhibitor OS=Momordica charantia PE=1 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 9.1e-06
Identity = 28/64 (43.75%), Postives = 36/64 (56.25%), Query Frame = 1

Query: 30 GNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDV 89
          G   WP+LVG  G  AK+ IE++   VR VI+R G+    +F C+RV V+V + G V   
Sbjct: 5  GKRSWPQLVGSTGAAAKAVIERENPRVRAVIVRVGSPVTADFRCDRVRVWVTERGIVARP 64

Query: 90 PTIG 94
          P IG
Sbjct: 65 PAIG 68

BLAST of ClCG02G012910.1 vs. TrEMBL
Match: V4NI49_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10011055mg PE=4 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 1.2e-20
Identity = 47/82 (57.32%), Postives = 57/82 (69.51%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S GC DP+CC  GG F WP+LVGK G+ AK  IE++   V   I+  GT+RIE+F
Sbjct: 7  YPPCESSGCTDPRCCALGGKFEWPELVGKSGEIAKMTIERENPNVLGFIMPYGTLRIEDF 66

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVF+ V   GHVV +P IG
Sbjct: 67 CCNRVFIVVGSKGHVVMIPKIG 88

BLAST of ClCG02G012910.1 vs. TrEMBL
Match: Q9STF8_ARATH (PR-6 proteinase inhibitor family protein OS=Arabidopsis thaliana GN=T6H20.110 PE=2 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 3.5e-20
Identity = 46/82 (56.10%), Postives = 55/82 (67.07%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S  C DP+CC  G  +RWP+LVGK G  AK  IE++   V  ++LR G  RIENF
Sbjct: 4  YPPCWSGSCEDPECCAIGKKYRWPELVGKNGQLAKMTIERENPNVLAIVLRYGEKRIENF 63

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVFVY+  +G V D P IG
Sbjct: 64 CCNRVFVYLGSNGQVADAPMIG 85

BLAST of ClCG02G012910.1 vs. TrEMBL
Match: A0MF11_ARATH (Putative uncharacterized protein (Fragment) OS=Arabidopsis thaliana PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 8.6e-19
Identity = 45/82 (54.88%), Postives = 54/82 (65.85%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S  C DP+C   G  +RWP+LVGK G  AK  IE++   V  ++LR G  RIENF
Sbjct: 4  YPPCWSGSCEDPECRAIGKKYRWPELVGKNGQLAKMTIERENPNVLAIVLRYGEKRIENF 63

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVFVY+  +G V D P IG
Sbjct: 64 CCNRVFVYLGSNGQVADAPMIG 85

BLAST of ClCG02G012910.1 vs. TrEMBL
Match: R0G9B1_9BRAS (Uncharacterized protein OS=Capsella rubella GN=CARUB_v10015502mg PE=4 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 1.6e-17
Identity = 42/83 (50.60%), Postives = 54/83 (65.06%), Query Frame = 1

Query: 11 PYPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIEN 70
          PYPPC+   C DP+CC  G   RWP+LVGK G+ AK  IE++   V  ++L  G  RI +
Sbjct: 3  PYPPCLVTACEDPECCANGKKCRWPELVGKSGEVAKMTIERENPNVLAILLHAGDGRISD 62

Query: 71 FCCNRVFVYVDDSGHVVDVPTIG 94
          FCCN+VFV +D  GHV  +P IG
Sbjct: 63 FCCNKVFVVLDIHGHVSTIPKIG 85

BLAST of ClCG02G012910.1 vs. TrEMBL
Match: A0A0A0LJT7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G287050 PE=4 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 3.6e-17
Identity = 42/66 (63.64%), Postives = 54/66 (81.82%), Query Frame = 1

Query: 28 AGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVV 87
          AGGN+RWP LVGK+  YAK +IE+++  V V ++R+G +RIE+FCCNRV VYVDDSG VV
Sbjct: 7  AGGNYRWPNLVGKRWQYAKRKIEEELPRVGVAVMRRGAIRIEDFCCNRVIVYVDDSGIVV 66

Query: 88 DVPTIG 94
          +VP IG
Sbjct: 67 EVPVIG 72

BLAST of ClCG02G012910.1 vs. TAIR10
Match: AT3G46860.1 (AT3G46860.1 Serine protease inhibitor, potato inhibitor I-type family protein)

HSP 1 Score: 105.5 bits (262), Expect = 1.8e-23
Identity = 46/82 (56.10%), Postives = 53/82 (64.63%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S  C DP+CC  G  +RWP+LVGK G  AK  IE++   V  ++LR G  RIENF
Sbjct: 4  YPPCWSGSCEDPECCAIGKKYRWPELVGKNGQLAKMTIERENPNVLAIVLRYGEKRIENF 63

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVFVY+  +G V D P IG
Sbjct: 64 CCNRVFVYLGSNGQVADAPMIG 85

BLAST of ClCG02G012910.1 vs. TAIR10
Match: AT5G43580.1 (AT5G43580.1 Serine protease inhibitor, potato inhibitor I-type family protein)

HSP 1 Score: 64.7 bits (156), Expect = 3.4e-11
Identity = 31/68 (45.59%), Postives = 43/68 (63.24%), Query Frame = 1

Query: 26 CDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGH 85
          C+  G   WP+L+G KG+ AK  IE++   ++ VI+  GTV  E F C+RV+V+V+D G 
Sbjct: 32 CEDPGKSSWPELLGAKGEDAKEVIERENPKMKAVIILDGTVVPEIFICSRVYVWVNDCGI 91

Query: 86 VVDVPTIG 94
          VV +P IG
Sbjct: 92 VVQIPIIG 99

BLAST of ClCG02G012910.1 vs. TAIR10
Match: AT2G38900.2 (AT2G38900.2 Serine protease inhibitor, potato inhibitor I-type family protein)

HSP 1 Score: 49.7 bits (117), Expect = 1.1e-06
Identity = 26/64 (40.62%), Postives = 34/64 (53.12%), Query Frame = 1

Query: 30 GNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDV 89
          G   WP+L+G  GDYA S I+ +   + VV++  G    E+  C RV V+VD+   VV  
Sbjct: 25 GKNSWPELLGTNGDYAASVIKGENSSLNVVVVSDGNYVTEDLSCYRVRVWVDEIRIVVRN 84

Query: 90 PTIG 94
          PT G
Sbjct: 85 PTAG 88

BLAST of ClCG02G012910.1 vs. TAIR10
Match: AT2G38870.1 (AT2G38870.1 Serine protease inhibitor, potato inhibitor I-type family protein)

HSP 1 Score: 46.6 bits (109), Expect = 9.7e-06
Identity = 23/60 (38.33%), Postives = 31/60 (51.67%), Query Frame = 1

Query: 34 WPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVVDVPTIG 93
          WP+L G  GDYA   IE++   V   ++  G+    +F C+RV V+VD +  VV  P  G
Sbjct: 11 WPELTGTNGDYAAVVIERENPTVNAAVILDGSPVTADFRCDRVRVFVDGNRIVVKTPKSG 70

BLAST of ClCG02G012910.1 vs. NCBI nr
Match: gi|567189624|ref|XP_006404448.1| (hypothetical protein EUTSA_v10011055mg [Eutrema salsugineum])

HSP 1 Score: 107.1 bits (266), Expect = 1.7e-20
Identity = 47/82 (57.32%), Postives = 57/82 (69.51%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S GC DP+CC  GG F WP+LVGK G+ AK  IE++   V   I+  GT+RIE+F
Sbjct: 7  YPPCESSGCTDPRCCALGGKFEWPELVGKSGEIAKMTIERENPNVLGFIMPYGTLRIEDF 66

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVF+ V   GHVV +P IG
Sbjct: 67 CCNRVFIVVGSKGHVVMIPKIG 88

BLAST of ClCG02G012910.1 vs. NCBI nr
Match: gi|15232657|ref|NP_190270.1| (PR-6 proteinase inhibitor family protein [Arabidopsis thaliana])

HSP 1 Score: 105.5 bits (262), Expect = 5.0e-20
Identity = 46/82 (56.10%), Postives = 55/82 (67.07%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S  C DP+CC  G  +RWP+LVGK G  AK  IE++   V  ++LR G  RIENF
Sbjct: 4  YPPCWSGSCEDPECCAIGKKYRWPELVGKNGQLAKMTIERENPNVLAIVLRYGEKRIENF 63

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVFVY+  +G V D P IG
Sbjct: 64 CCNRVFVYLGSNGQVADAPMIG 85

BLAST of ClCG02G012910.1 vs. NCBI nr
Match: gi|116831272|gb|ABK28590.1| (unknown [Arabidopsis thaliana])

HSP 1 Score: 100.9 bits (250), Expect = 1.2e-18
Identity = 45/82 (54.88%), Postives = 54/82 (65.85%), Query Frame = 1

Query: 12 YPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENF 71
          YPPC S  C DP+C   G  +RWP+LVGK G  AK  IE++   V  ++LR G  RIENF
Sbjct: 4  YPPCWSGSCEDPECRAIGKKYRWPELVGKNGQLAKMTIERENPNVLAIVLRYGEKRIENF 63

Query: 72 CCNRVFVYVDDSGHVVDVPTIG 94
          CCNRVFVY+  +G V D P IG
Sbjct: 64 CCNRVFVYLGSNGQVADAPMIG 85

BLAST of ClCG02G012910.1 vs. NCBI nr
Match: gi|565483408|ref|XP_006299344.1| (hypothetical protein CARUB_v10015502mg [Capsella rubella])

HSP 1 Score: 96.7 bits (239), Expect = 2.3e-17
Identity = 42/83 (50.60%), Postives = 54/83 (65.06%), Query Frame = 1

Query: 11 PYPPCVSDGCMDPQCCDAGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIEN 70
          PYPPC+   C DP+CC  G   RWP+LVGK G+ AK  IE++   V  ++L  G  RI +
Sbjct: 3  PYPPCLVTACEDPECCANGKKCRWPELVGKSGEVAKMTIERENPNVLAILLHAGDGRISD 62

Query: 71 FCCNRVFVYVDDSGHVVDVPTIG 94
          FCCN+VFV +D  GHV  +P IG
Sbjct: 63 FCCNKVFVVLDIHGHVSTIPKIG 85

BLAST of ClCG02G012910.1 vs. NCBI nr
Match: gi|700206904|gb|KGN62023.1| (hypothetical protein Csa_2G287050 [Cucumis sativus])

HSP 1 Score: 95.5 bits (236), Expect = 5.2e-17
Identity = 42/66 (63.64%), Postives = 54/66 (81.82%), Query Frame = 1

Query: 28 AGGNFRWPKLVGKKGDYAKSEIEKDVRCVRVVILRQGTVRIENFCCNRVFVYVDDSGHVV 87
          AGGN+RWP LVGK+  YAK +IE+++  V V ++R+G +RIE+FCCNRV VYVDDSG VV
Sbjct: 7  AGGNYRWPNLVGKRWQYAKRKIEEELPRVGVAVMRRGAIRIEDFCCNRVIVYVDDSGIVV 66

Query: 88 DVPTIG 94
          +VP IG
Sbjct: 67 EVPVIG 72

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ICI_LINUS3.4e-0841.27Proteinase inhibitor OS=Linum usitatissimum PE=1 SV=1[more]
ITH5_CUCMA8.3e-0745.31Inhibitor of trypsin and hageman factor OS=Cucurbita maxima PE=1 SV=1[more]
BGIA_MOMCH9.1e-0643.75Glu S.griseus protease inhibitor OS=Momordica charantia PE=1 SV=1[more]
Match NameE-valueIdentityDescription
V4NI49_EUTSA1.2e-2057.32Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10011055mg PE=4 SV=1[more]
Q9STF8_ARATH3.5e-2056.10PR-6 proteinase inhibitor family protein OS=Arabidopsis thaliana GN=T6H20.110 PE... [more]
A0MF11_ARATH8.6e-1954.88Putative uncharacterized protein (Fragment) OS=Arabidopsis thaliana PE=2 SV=1[more]
R0G9B1_9BRAS1.6e-1750.60Uncharacterized protein OS=Capsella rubella GN=CARUB_v10015502mg PE=4 SV=1[more]
A0A0A0LJT7_CUCSA3.6e-1763.64Uncharacterized protein OS=Cucumis sativus GN=Csa_2G287050 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G46860.11.8e-2356.10 Serine protease inhibitor, potato inhibitor I-type family protein[more]
AT5G43580.13.4e-1145.59 Serine protease inhibitor, potato inhibitor I-type family protein[more]
AT2G38900.21.1e-0640.63 Serine protease inhibitor, potato inhibitor I-type family protein[more]
AT2G38870.19.7e-0638.33 Serine protease inhibitor, potato inhibitor I-type family protein[more]
Match NameE-valueIdentityDescription
gi|567189624|ref|XP_006404448.1|1.7e-2057.32hypothetical protein EUTSA_v10011055mg [Eutrema salsugineum][more]
gi|15232657|ref|NP_190270.1|5.0e-2056.10PR-6 proteinase inhibitor family protein [Arabidopsis thaliana][more]
gi|116831272|gb|ABK28590.1|1.2e-1854.88unknown [Arabidopsis thaliana][more]
gi|565483408|ref|XP_006299344.1|2.3e-1750.60hypothetical protein CARUB_v10015502mg [Capsella rubella][more]
gi|700206904|gb|KGN62023.1|5.2e-1763.64hypothetical protein Csa_2G287050 [Cucumis sativus][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR000864Prot_inh_pot1
Vocabulary: Molecular Function
TermDefinition
GO:0004867serine-type endopeptidase inhibitor activity
Vocabulary: Biological Process
TermDefinition
GO:0009611response to wounding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009611 response to wounding
biological_process GO:0010951 negative regulation of endopeptidase activity
cellular_component GO:0005575 cellular_component
molecular_function GO:0004867 serine-type endopeptidase inhibitor activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
ClCG02G012910ClCG02G012910gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
ClCG02G012910.1ClCG02G012910.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
ClCG02G012910.1.cds1ClCG02G012910.1.cds1CDS
ClCG02G012910.1.cds2ClCG02G012910.1.cds2CDS


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000864Proteinase inhibitor I13, potato inhibitor IPRODOMPD002604coord: 34..92
score: 3.
IPR000864Proteinase inhibitor I13, potato inhibitor IPFAMPF00280potato_inhibitcoord: 33..93
score: 2.0
IPR000864Proteinase inhibitor I13, potato inhibitor IunknownSSF54654CI-2 family of serine protease inhibitorscoord: 28..93
score: 3.79
NoneNo IPR availableGENE3DG3DSA:3.30.10.10coord: 32..93
score: 2.1
NoneNo IPR availablePANTHERPTHR33091FAMILY NOT NAMEDcoord: 1..93
score: 4.5
NoneNo IPR availablePANTHERPTHR33091:SF13PR-6 PROTEINASE INHIBITOR FAMILY PROTEIN-RELATEDcoord: 1..93
score: 4.5