Cla97C06G113530.1 (mRNA) Watermelon (97103) v2

NameCla97C06G113530.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionHydrolase, hydrolyzing O-glycosyl compounds, putative
LocationCla97Chr06 : 4432048 .. 4432549 (+)
Sequence length351
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGCAAGGTAATTATATTAATTATTCTAGTAATAACTTGTGATCCCATACAACAAAAGAATCATCGTGAAGAAATTGTTACATACAAACCATAAATTACTAAACCAATTTGTAAGTTTAAAATAGAATTTAATTGTGTTGCATTTTTGCGGCATATAGATCCAAGTTCCAATGCATCGTCTTCTTATGTTATGTATCATCCACAAAGTGGTCAATGTGCCCAAGTTTCAAATGACAATACTGGAATTTTTTTGAGTAATTGTTCCACCTCAAGTCAATGGAGTCATGGTGATGATGGCACTCCAATCAAAAGGAGGACAAATGGTTTGTGTTTAAAGGCTAATGGCGAAGGTGTTGGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAAAGTGTTTGGAGAGCAATTTCTAACGTAATCTTCATTTGGCCACTGTCACTCAAGATGGGAAGAGTCTTTGCTTGTAGGTTGAAAGCTCCAATTCTTCAAAAATTGTGA

mRNA sequence

ATGTTGCAAGATCCAAGTTCCAATGCATCGTCTTCTTATGTTATGTATCATCCACAAAGTGGTCAATGTGCCCAAGTTTCAAATGACAATACTGGAATTTTTTTGAGTAATTGTTCCACCTCAAGTCAATGGAGTCATGGTGATGATGGCACTCCAATCAAAAGGAGGACAAATGGTTTGTGTTTAAAGGCTAATGGCGAAGGTGTTGGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAAAGTGTTTGGAGAGCAATTTCTAACGTAATCTTCATTTGGCCACTGTCACTCAAGATGGGAAGAGTCTTTGCTTGTAGGTTGAAAGCTCCAATTCTTCAAAAATTGTGA

Coding sequence (CDS)

ATGTTGCAAGATCCAAGTTCCAATGCATCGTCTTCTTATGTTATGTATCATCCACAAAGTGGTCAATGTGCCCAAGTTTCAAATGACAATACTGGAATTTTTTTGAGTAATTGTTCCACCTCAAGTCAATGGAGTCATGGTGATGATGGCACTCCAATCAAAAGGAGGACAAATGGTTTGTGTTTAAAGGCTAATGGCGAAGGTGTTGGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAAAGTGTTTGGAGAGCAATTTCTAACGTAATCTTCATTTGGCCACTGTCACTCAAGATGGGAAGAGTCTTTGCTTGTAGGTTGAAAGCTCCAATTCTTCAAAAATTGTGA

Protein sequence

MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGLCLKANGEGVGASLSSDCLGQQSVWRAISNVIFIWPLSLKMGRVFACRLKAPILQKL
BLAST of Cla97C06G113530.1 vs. NCBI nr
Match: XP_022158418.1 (uncharacterized protein LOC111024912, partial [Momordica charantia])

HSP 1 Score: 150.6 bits (379), Expect = 3.3e-33
Identity = 73/89 (82.02%), Postives = 77/89 (86.52%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNASSSYVMYHPQSGQC   SNDN GIFLS+CSTSS+WSHG DGT IK  T GL
Sbjct: 101 MLQDPTSNASSSYVMYHPQSGQCILASNDNKGIFLSSCSTSSRWSHGGDGTSIKITTTGL 160

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKANGEG+  SLSSDCL QQSVWRAISN
Sbjct: 161 CLKANGEGLRVSLSSDCLSQQSVWRAISN 189

BLAST of Cla97C06G113530.1 vs. NCBI nr
Match: XP_022157219.1 (uncharacterized protein LOC111023982 [Momordica charantia])

HSP 1 Score: 147.1 bits (370), Expect = 3.6e-32
Identity = 70/89 (78.65%), Postives = 75/89 (84.27%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNASSSYVMYHPQSGQC   SNDN  +FLSNC TSS+WSHG DGTPIK  T GL
Sbjct: 335 MLQDPNSNASSSYVMYHPQSGQCVLSSNDNREMFLSNCXTSSRWSHGGDGTPIKMTTTGL 394

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           C+KANGE +G SLSSDCLGQ S WRAISN
Sbjct: 395 CVKANGESLGXSLSSDCLGQLSAWRAISN 423

BLAST of Cla97C06G113530.1 vs. NCBI nr
Match: XP_022958844.1 (uncharacterized protein LOC111459997 [Cucurbita moschata])

HSP 1 Score: 147.1 bits (370), Expect = 3.6e-32
Identity = 69/89 (77.53%), Postives = 77/89 (86.52%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNAS SY++YHPQSGQCAQ SN++T IFLS+CSTSS+WSHGD G  IK  T GL
Sbjct: 391 MLQDPNSNASFSYILYHPQSGQCAQTSNNDTQIFLSDCSTSSRWSHGDGGNSIKMATTGL 450

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKANGEG+G SLSSDCL QQSVWR ISN
Sbjct: 451 CLKANGEGLGVSLSSDCLSQQSVWRTISN 479

BLAST of Cla97C06G113530.1 vs. NCBI nr
Match: XP_008445780.1 (PREDICTED: major extracellular endoglucanase-like [Cucumis melo])

HSP 1 Score: 143.3 bits (360), Expect = 5.2e-31
Identity = 67/89 (75.28%), Postives = 77/89 (86.52%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNAS SYV+YHPQSGQC +VSNDN  IFL+NCSTSS+WSH +D TPIK    GL
Sbjct: 302 MLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTGL 361

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKA+GEG+ ASLS+DCLG+QSVW AISN
Sbjct: 362 CLKASGEGLAASLSNDCLGKQSVWSAISN 390

BLAST of Cla97C06G113530.1 vs. NCBI nr
Match: XP_011658389.1 (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hypothetical protein Csa_6G028440 [Cucumis sativus])

HSP 1 Score: 136.0 bits (341), Expect = 8.4e-29
Identity = 64/89 (71.91%), Postives = 76/89 (85.39%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP SNAS SYV+YH QSGQC +VSNDN  IFL+NCSTSS+WSH +D TPIK  + GL
Sbjct: 391 MLQDPYSNASFSYVIYHVQSGQCIEVSNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGL 450

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKA+GEG+ ASLS+DC+G+QS+W AISN
Sbjct: 451 CLKASGEGLEASLSTDCIGKQSLWSAISN 479

BLAST of Cla97C06G113530.1 vs. TrEMBL
Match: tr|A0A1S3BDI2|A0A1S3BDI2_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 PE=3 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 3.5e-31
Identity = 67/89 (75.28%), Postives = 77/89 (86.52%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNAS SYV+YHPQSGQC +VSNDN  IFL+NCSTSS+WSH +D TPIK    GL
Sbjct: 302 MLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTGL 361

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKA+GEG+ ASLS+DCLG+QSVW AISN
Sbjct: 362 CLKASGEGLAASLSNDCLGKQSVWSAISN 390

BLAST of Cla97C06G113530.1 vs. TrEMBL
Match: tr|A0A0A0K853|A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 5.5e-29
Identity = 64/89 (71.91%), Postives = 76/89 (85.39%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP SNAS SYV+YH QSGQC +VSNDN  IFL+NCSTSS+WSH +D TPIK  + GL
Sbjct: 391 MLQDPYSNASFSYVIYHVQSGQCIEVSNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGL 450

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
           CLKA+GEG+ ASLS+DC+G+QS+W AISN
Sbjct: 451 CLKASGEGLEASLSTDCIGKQSLWSAISN 479

BLAST of Cla97C06G113530.1 vs. TrEMBL
Match: tr|A0A0A0L644|A0A0A0L644_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 3.0e-27
Identity = 63/89 (70.79%), Postives = 74/89 (83.15%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSNDNTGIFLSNCSTSSQWSHGDDGTPIKRRTNGL 60
           MLQDP+SNAS SYV+YHPQS QC QVSNDN  IFL+NCST ++WSH +DGTPI+  + GL
Sbjct: 202 MLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGL 261

Query: 61  CLKANGEGVGASLSSDCLGQQSVWRAISN 90
            LKA+G+G+ ASLSSD L QQSVW AISN
Sbjct: 262 YLKASGKGLEASLSSDTLSQQSVWSAISN 290

BLAST of Cla97C06G113530.1 vs. TrEMBL
Match: tr|A0A1S3CTF8|A0A1S3CTF8_CUCME (major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 PE=3 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 5.0e-22
Identity = 54/90 (60.00%), Postives = 67/90 (74.44%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSN-DNTGIFLSNCSTSSQWSHGDDGTPIKRRTNG 60
           MLQDP+SN+S++Y+MYHPQSGQC QV +     IFL+NCS +S WS+  DGTPI   +  
Sbjct: 391 MLQDPNSNSSNTYLMYHPQSGQCVQVHDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTN 450

Query: 61  LCLKANGEGVGASLSSDCLGQQSVWRAISN 90
            CLKANG G+  SLS DC G+QSVW AIS+
Sbjct: 451 FCLKANGNGLPPSLSRDCFGEQSVWTAISD 480

BLAST of Cla97C06G113530.1 vs. TrEMBL
Match: tr|A0A0A0KKZ0|A0A0A0KKZ0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G168960 PE=3 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 1.6e-20
Identity = 51/90 (56.67%), Postives = 68/90 (75.56%), Query Frame = 0

Query: 1   MLQDPSSNASSSYVMYHPQSGQCAQVSN-DNTGIFLSNCSTSSQWSHGDDGTPIKRRTNG 60
           MLQDP+SN+S++YVMYHPQSGQC  V +  +  I+L++CS +S WS+  DGTPI   +  
Sbjct: 195 MLQDPNSNSSNTYVMYHPQSGQCVLVQDMKHMQIYLNDCSNASHWSYEGDGTPIMLASTN 254

Query: 61  LCLKANGEGVGASLSSDCLGQQSVWRAISN 90
            CLKA+G+G+  SLS DC G+QSVW AIS+
Sbjct: 255 FCLKASGDGLPPSLSRDCFGEQSVWTAISD 284

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022158418.13.3e-3382.02uncharacterized protein LOC111024912, partial [Momordica charantia][more]
XP_022157219.13.6e-3278.65uncharacterized protein LOC111023982 [Momordica charantia][more]
XP_022958844.13.6e-3277.53uncharacterized protein LOC111459997 [Cucurbita moschata][more]
XP_008445780.15.2e-3175.28PREDICTED: major extracellular endoglucanase-like [Cucumis melo][more]
XP_011658389.18.4e-2971.91PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus] >KGN45940.1 hy... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3BDI2|A0A1S3BDI2_CUCME3.5e-3175.28major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103488703 P... [more]
tr|A0A0A0K853|A0A0A0K853_CUCSA5.5e-2971.91Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028440 PE=3 SV=1[more]
tr|A0A0A0L644|A0A0A0L644_CUCSA3.0e-2770.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G186670 PE=4 SV=1[more]
tr|A0A1S3CTF8|A0A1S3CTF8_CUCME5.0e-2260.00major extracellular endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504686 P... [more]
tr|A0A0A0KKZ0|A0A0A0KKZ0_CUCSA1.6e-2056.67Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G168960 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR035992Ricin_B-like_lectins
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C06G113530Cla97C06G113530gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C06G113530.1.exon.1Cla97C06G113530.1.exon.1exon
Cla97C06G113530.1.exon.2Cla97C06G113530.1.exon.2exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C06G113530.1.CDS.1Cla97C06G113530.1.CDS.1CDS
Cla97C06G113530.1.CDS.2Cla97C06G113530.1.CDS.2CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C06G113530.1Cla97C06G113530.1-proteinpolypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:2.80.10.50coord: 4..90
e-value: 1.5E-6
score: 30.1
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 1..89
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 1..89
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 10..73