Cla97C11G210670 (gene) Watermelon (97103) v2

Name: Cla97C11G210670
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Beta-D-glucosidase
Location: Cla97Chr11 : 4023043 .. 4023321 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGAAGGTGACACTGCTTTTACTCTGTTGCTGGGCGGCTTTGGTGGTTGCTGATGAAGATTATGTCAAGTATAAGGACCCGATACAACCGCTTAACGTCCGGATCAAGGATCTAATGGATAGAATGACTCTAGCAGGGAAGATTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAATGTGCTTAGTGTCACAATCGCTCACTTCAGAATACGGGACGGCGATTGTGCGGCACTGCTCTAA

mRNA sequence

ATGATGAAGGTGACACTGCTTTTACTCTGTTGCTGGGCGGCTTTGGTGGTTGCTGATGAAGATTATGTCAAGTATAAGGACCCGATACAACCGCTTAACGTCCGGATCAAGGATCTAATGGATAGAATGACTCTAGCAGGGAAGATTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAATGTGCTTAGTGTCACAATCGCTCACTTCAGAATACGGGACGGCGATTGTGCGGCACTGCTCTAA

Coding sequence (CDS)

ATGATGAAGGTGACACTGCTTTTACTCTGTTGCTGGGCGGCTTTGGTGGTTGCTGATGAAGATTATGTCAAGTATAAGGACCCGATACAACCGCTTAACGTCCGGATCAAGGATCTAATGGATAGAATGACTCTAGCAGGGAAGATTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTAACACCAGAGATCATGAGAGATTACTCCATTGGCAATGTGCTTAGTGTCACAATCGCTCACTTCAGAATACGGGACGGCGATTGTGCGGCACTGCTCTAA

Protein sequence

MMKVTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMRDYSIGNVLSVTIAHFRIRDGDCAALL
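
The location, CDS, and protein fields of this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates in the Location field; the helper name `translate` is illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# CDS and protein sequence copied from this record (CDS split into chunks for readability).
CDS = (
    "ATGATGAAGGTGACACTGCTTTTACTCTGTTGCTGGGCGGCTTTGGTGGTTGCTGATGAA"
    "GATTATGTCAAGTATAAGGACCCGATACAACCGCTTAACGTCCGGATCAAGGATCTAATG"
    "GATAGAATGACTCTAGCAGGGAAGATTGGGCAGATGGCACAGTTGGATCGTTCGGTTGTA"
    "ACACCAGAGATCATGAGAGATTACTCCATTGGCAATGTGCTTAGTGTCACAATCGCTCAC"
    "TTCAGAATACGGGACGGCGATTGTGCGGCACTGCTCTAA"
)
PROTEIN = (
    "MMKVTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVV"
    "TPEIMRDYSIGNVLSVTIAHFRIRDGDCAALL"
)


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Coordinates from the Location field (assumed 1-based, inclusive).
start, end = 4023043, 4023321
span = end - start + 1

print("genomic span:", span, "CDS length:", len(CDS), "protein length:", len(PROTEIN))
print("translation matches protein:", translate(CDS) == PROTEIN)
```

The genomic span equals the CDS length, which is consistent with the gene, mRNA, and CDS sequences being identical in this record.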
BLAST of Cla97C11G210670 vs. NCBI nr
Match: XP_016902614.1 (PREDICTED: LOW QUALITY PROTEIN: lysosomal beta glucosidase-like [Cucumis melo])

HSP 1 Score: 128.3 bits (321), Expect = 1.4e-26
Identity = 60/74 (81.08%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 2  MKVTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVT 61
          MKV ++LLCCWAALV  DEDYV YKDPIQPLN+RIKDLMDRMTLA K+GQMAQLD S +T
Sbjct: 1  MKVMVVLLCCWAALVAVDEDYVMYKDPIQPLNIRIKDLMDRMTLADKVGQMAQLDSSAIT 60

Query: 62 PEIMRDYSIGNVLS 76
          PEI+RDYSIG+VLS
Sbjct: 61 PEIIRDYSIGSVLS 74

BLAST of Cla97C11G210670 vs. NCBI nr
Match: XP_022943425.1 (uncharacterized protein LOC111448193 [Cucurbita moschata])

HSP 1 Score: 123.2 bits (308), Expect = 4.5e-25
Identity = 63/80 (78.75%), Positives = 68/80 (85.00%), Query Frame = 0

Query: 1  MMKVT-----LLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQL 60
          MMKVT     +LLLCCW AL  A ED+VKYKDP QPLN+RIKDLMDRMTLA KIGQM QL
Sbjct: 1  MMKVTVASILVLLLCCWPALAAAREDHVKYKDPAQPLNIRIKDLMDRMTLAEKIGQMTQL 60

Query: 61 DRSVVTPEIMRDYSIGNVLS 76
          DRSVVTPEI+RDYSIG+VLS
Sbjct: 61 DRSVVTPEIVRDYSIGSVLS 80

BLAST of Cla97C11G210670 vs. NCBI nr
Match: XP_023512240.1 (uncharacterized protein LOC111777028 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 118.2 bits (295), Expect = 1.4e-23
Identity = 62/80 (77.50%), Positives = 67/80 (83.75%), Query Frame = 0

Query: 1  MMKVT-----LLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQL 60
          MMKVT     +LLLC W AL  A +D+VKYKDP QPLNVRIKDLMDRMTLA KIGQM QL
Sbjct: 1  MMKVTVSSILVLLLCGWPALAAAHDDHVKYKDPAQPLNVRIKDLMDRMTLAEKIGQMTQL 60

Query: 61 DRSVVTPEIMRDYSIGNVLS 76
          DRSVVTPEI+RDYSIG+VLS
Sbjct: 61 DRSVVTPEIVRDYSIGSVLS 80

BLAST of Cla97C11G210670 vs. NCBI nr
Match: XP_002523937.1 (uncharacterized protein LOC8281990 [Ricinus communis] >EEF38424.1 hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus communis])

HSP 1 Score: 107.1 bits (266), Expect = 3.3e-20
Identity = 53/69 (76.81%), Positives = 61/69 (88.41%), Query Frame = 0

Query: 7  LLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMR 66
          L+LCC+AA   AD +Y+KYKDP QPLNVRI+D+M RMTLA KIGQM QLDRSVVTPEIMR
Sbjct: 13 LVLCCFAA---ADAEYLKYKDPSQPLNVRIRDVMKRMTLAEKIGQMVQLDRSVVTPEIMR 72

Query: 67 DYSIGNVLS 76
          DYSIG++LS
Sbjct: 73 DYSIGSILS 78

BLAST of Cla97C11G210670 vs. NCBI nr
Match: XP_022146225.1 (uncharacterized protein LOC111015489 [Momordica charantia])

HSP 1 Score: 104.4 bits (259), Expect = 2.1e-19
Identity = 50/58 (86.21%), Positives = 54/58 (93.10%), Query Frame = 0

Query: 18 ADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMRDYSIGNVLS 76
          AD DYVKYKDP QPLN+RIKDLMDRMTLA KIGQM QLDR+VVTPEIMRDYS+G+VLS
Sbjct: 23 ADGDYVKYKDPGQPLNIRIKDLMDRMTLAEKIGQMTQLDRTVVTPEIMRDYSLGSVLS 80
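
The identity figure reported for an HSP can be recomputed directly from the aligned strings. The sketch below does this for the Momordica charantia hit (XP_022146225.1); the function name `identity` is illustrative. It counts identical columns only. The "Positives" count additionally depends on the substitution matrix used by BLAST, so it is not recomputed here.

```python
# Aligned segments copied from the XP_022146225.1 HSP in this record.
QUERY = "ADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMRDYSIGNVLS"
SBJCT = "ADGDYVKYKDPGQPLNIRIKDLMDRMTLAEKIGQMTQLDRTVVTPEIMRDYSLGSVLS"


def identity(query, sbjct):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, len(query), 100.0 * matches / len(query)


matches, length, pct = identity(QUERY, SBJCT)
print(f"Identity = {matches}/{length} ({pct:.2f}%)")
```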

BLAST of Cla97C11G210670 vs. TrEMBL
Match: tr|A0A1S4E304|A0A1S4E304_CUCME (LOW QUALITY PROTEIN: lysosomal beta glucosidase-like OS=Cucumis melo OX=3656 GN=LOC103499742 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 9.2e-27
Identity = 60/74 (81.08%), Positives = 67/74 (90.54%), Query Frame = 0

Query: 2  MKVTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVT 61
          MKV ++LLCCWAALV  DEDYV YKDPIQPLN+RIKDLMDRMTLA K+GQMAQLD S +T
Sbjct: 1  MKVMVVLLCCWAALVAVDEDYVMYKDPIQPLNIRIKDLMDRMTLADKVGQMAQLDSSAIT 60

Query: 62 PEIMRDYSIGNVLS 76
          PEI+RDYSIG+VLS
Sbjct: 61 PEIIRDYSIGSVLS 74

BLAST of Cla97C11G210670 vs. TrEMBL
Match: tr|B9SD68|B9SD68_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis OX=3988 GN=RCOM_1162600 PE=4 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 2.2e-20
Identity = 53/69 (76.81%), Positives = 61/69 (88.41%), Query Frame = 0

Query: 7  LLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMR 66
          L+LCC+AA   AD +Y+KYKDP QPLNVRI+D+M RMTLA KIGQM QLDRSVVTPEIMR
Sbjct: 13 LVLCCFAA---ADAEYLKYKDPSQPLNVRIRDVMKRMTLAEKIGQMVQLDRSVVTPEIMR 72

Query: 67 DYSIGNVLS 76
          DYSIG++LS
Sbjct: 73 DYSIGSILS 78

BLAST of Cla97C11G210670 vs. TrEMBL
Match: tr|A0A251NQP2|A0A251NQP2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G150200 PE=4 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 1.6e-18
Identity = 49/76 (64.47%), Positives = 61/76 (80.26%), Query Frame = 0

Query: 1  MMKVTLLLLCCWAALVV-ADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSV 60
          ++ V LL LCCW A++   + +Y+ YKDP +P+N+RIKDLMDRMTLA KIGQM QLDR  
Sbjct: 8  VLLVGLLCLCCWEAMITKVEAEYMAYKDPNKPINIRIKDLMDRMTLAEKIGQMTQLDRQN 67

Query: 61 VTPEIMRDYSIGNVLS 76
          VT EIMRDYSIG++LS
Sbjct: 68 VTAEIMRDYSIGSLLS 83

BLAST of Cla97C11G210670 vs. TrEMBL
Match: tr|A0A251NQP7|A0A251NQP7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G150400 PE=4 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 1.0e-17
Identity = 49/72 (68.06%), Positives = 56/72 (77.78%), Query Frame = 0

Query: 4  VTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPE 63
          V LL LCCW A+  A  +Y+ YKDP QPLN RIKDLM RMTL  KIGQM QLDR+ VT E
Sbjct: 12 VGLLWLCCWGAMARAGAEYMAYKDPKQPLNRRIKDLMGRMTLEEKIGQMTQLDRANVTAE 71

Query: 64 IMRDYSIGNVLS 76
          IMRD+S+G+VLS
Sbjct: 72 IMRDFSLGSVLS 83

BLAST of Cla97C11G210670 vs. TrEMBL
Match: tr|M5W6P3|M5W6P3_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa020836mg PE=4 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 1.0e-17
Identity = 49/72 (68.06%), Positives = 56/72 (77.78%), Query Frame = 0

Query: 4  VTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPE 63
          V LL LCCW A+  A  +Y+ YKDP QPLN RIKDLM RMTL  KIGQM QLDR+ VT E
Sbjct: 11 VGLLWLCCWGAMARAGAEYMAYKDPKQPLNRRIKDLMGRMTLEEKIGQMTQLDRANVTAE 70

Query: 64 IMRDYSIGNVLS 76
          IMRD+S+G+VLS
Sbjct: 71 IMRDFSLGSVLS 82

BLAST of Cla97C11G210670 vs. TAIR10
Match: AT5G20950.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 87.4 bits (215), Expect = 4.9e-18
Identity = 44/70 (62.86%), Positives = 54/70 (77.14%), Query Frame = 0

Query: 6  LLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIM 65
          L+LLCC   +V A E  +KYKDP QPL  RI+DLM+RMTL  KIGQM Q++RSV TPE+M
Sbjct: 10 LMLLCC---IVAAAEGTLKYKDPKQPLGARIRDLMNRMTLQEKIGQMVQIERSVATPEVM 69

Query: 66 RDYSIGNVLS 76
          + Y IG+VLS
Sbjct: 70 KKYFIGSVLS 76

BLAST of Cla97C11G210670 vs. TAIR10
Match: AT5G04885.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 75.5 bits (184), Expect = 1.9e-14
Identity = 39/70 (55.71%), Positives = 47/70 (67.14%), Query Frame = 0

Query: 6  LLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIM 65
          LL +C W      D +Y+ YKDP Q ++ R+ DL  RMTL  KIGQM Q+DRSV T  IM
Sbjct: 12 LLWMCMWVC-CYGDGEYLLYKDPKQTVSDRVADLFGRMTLEEKIGQMVQIDRSVATVNIM 71

Query: 66 RDYSIGNVLS 76
          RDY IG+VLS
Sbjct: 72 RDYFIGSVLS 80

BLAST of Cla97C11G210670 vs. TAIR10
Match: AT5G20940.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 71.2 bits (173), Expect = 3.6e-13
Identity = 37/70 (52.86%), Positives = 45/70 (64.29%), Query Frame = 0

Query: 6  LLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIM 65
          LLLLCC  A         KYKDP +PL VRIK+LM  MTL  KIGQM Q++R   T E+M
Sbjct: 13 LLLLCCTVAANKVPLANAKYKDPKEPLGVRIKNLMSHMTLEEKIGQMVQVERVNATTEVM 72

Query: 66 RDYSIGNVLS 76
          + Y +G+V S
Sbjct: 73 QKYFVGSVFS 82

BLAST of Cla97C11G210670 vs. TAIR10
Match: AT3G47050.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 61.2 bits (147), Expect = 3.8e-10
Identity = 29/60 (48.33%), Positives = 43/60 (71.67%), Query Frame = 0

Query: 16 VVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVVTPEIMRDYSIGNVLS 75
          +V +E    YK+   P+  R+KDL+ RMTLA KIGQM  ++RSV +  ++RD+SIG+VL+
Sbjct: 1  MVDNEKSYVYKNREAPVEARVKDLLSRMTLAEKIGQMTLIERSVASEAVIRDFSIGSVLN 60

BLAST of Cla97C11G210670 vs. TAIR10
Match: AT3G62710.1 (Glycosyl hydrolase family protein)

HSP 1 Score: 60.5 bits (145), Expect = 6.4e-10
Identity = 33/82 (40.24%), Positives = 45/82 (54.88%), Query Frame = 0

Query: 4  VTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDR------ 63
          + +L    +     AD  Y+KYKDP   +  R++DL+ RMTL  K+GQM Q+DR      
Sbjct: 17 IVILFAGRYGEATAADRGYIKYKDPKVAVEERVEDLLIRMTLPEKLGQMCQIDRFNFSQV 76

Query: 64 ----SVVTPEIMRDYSIGNVLS 76
              + V PEI   Y IG+VLS
Sbjct: 77 TGGVATVVPEIFTKYMIGSVLS 98

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_016902614.1  1.4e-26  81.08         PREDICTED: LOW QUALITY PROTEIN: lysosomal beta glucosidase-like [Cucumis melo]
XP_022943425.1  4.5e-25  78.75         uncharacterized protein LOC111448193 [Cucurbita moschata]
XP_023512240.1  1.4e-23  77.50         uncharacterized protein LOC111777028 [Cucurbita pepo subsp. pepo]
XP_002523937.1  3.3e-20  76.81         uncharacterized protein LOC8281990 [Ricinus communis] >EEF38424.1 hydrolase, hyd...
XP_022146225.1  2.1e-19  86.21         uncharacterized protein LOC111015489 [Momordica charantia]

vs. TrEMBL
Match Name                      E-value  Identity (%)  Description
tr|A0A1S4E304|A0A1S4E304_CUCME  9.2e-27  81.08         LOW QUALITY PROTEIN: lysosomal beta glucosidase-like OS=Cucumis melo OX=3656 GN=...
tr|B9SD68|B9SD68_RICCO          2.2e-20  76.81         Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis OX=398...
tr|A0A251NQP2|A0A251NQP2_PRUPE  1.6e-18  64.47         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G150200 PE=4 SV=1
tr|A0A251NQP7|A0A251NQP7_PRUPE  1.0e-17  68.06         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G150400 PE=4 SV=1
tr|M5W6P3|M5W6P3_PRUPE          1.0e-17  68.06         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_ppa020836mg PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT5G20950.1  4.9e-18  62.86         Glycosyl hydrolase family protein
AT5G04885.1  1.9e-14  55.71         Glycosyl hydrolase family protein
AT5G20940.1  3.6e-13  52.86         Glycosyl hydrolase family protein
AT3G47050.1  3.8e-10  48.33         Glycosyl hydrolase family protein
AT3G62710.1  6.4e-10  40.24         Glycosyl hydrolase family protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
Vocabulary: INTERPRO
Term       Definition
IPR017853  Glycoside_hydrolase_SF
IPR036962  Glyco_hydro_3_N_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0005975      carbohydrate metabolic process
biological_process  GO:0006468      protein phosphorylation
cellular_component  GO:0005886      plasma membrane
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0008422      beta-glucosidase activity
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C11G210670.1  Cla97C11G210670.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
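IPR Term: IPR036962
IPR Description: Glycoside hydrolase, family 3, N-terminal domain superfamily
Source: GENE3D
Source Term: G3DSA:3.20.20.300
Alignment: coord: 18..89; e-value: 3.5E-12; score: 47.8

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR30620:SF35
Source Description: GLYCOSYL HYDROLASE FAMILY PROTEIN
Alignment: coord: 3..75

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR30620
Source Description: PERIPLASMIC BETA-GLUCOSIDASE-RELATED
Alignment: coord: 3..75

IPR Term: IPR017853
IPR Description: Glycoside hydrolase superfamily
Source: SUPERFAMILY
Source Term: SSF51445
Source Description: (Trans)glycosidases
Alignment: coord: 23..77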
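
The `coord` values in the InterPro annotations can be mapped back onto the protein sequence. This is a minimal sketch, assuming the coordinates are 1-based and inclusive on the protein sequence of this record; the `HITS` list and the `region` helper are illustrative names, not part of the database.

```python
# Protein sequence copied from this record.
PROTEIN = (
    "MMKVTLLLLCCWAALVVADEDYVKYKDPIQPLNVRIKDLMDRMTLAGKIGQMAQLDRSVV"
    "TPEIMRDYSIGNVLSVTIAHFRIRDGDCAALL"
)

# (source, source term, start, end) taken from the annotation entries above.
# Coordinates are assumed to be 1-based and inclusive.
HITS = [
    ("GENE3D", "G3DSA:3.20.20.300", 18, 89),
    ("PANTHER", "PTHR30620:SF35", 3, 75),
    ("PANTHER", "PTHR30620", 3, 75),
    ("SUPERFAMILY", "SSF51445", 23, 77),
]


def region(seq, start, end):
    """Slice a 1-based inclusive interval out of a sequence."""
    return seq[start - 1:end]


for source, term, start, end in HITS:
    segment = region(PROTEIN, start, end)
    print(f"{source}\t{term}\t{start}..{end}\t{len(segment)} aa\t{segment}")
```

Under that assumption, every annotated region lies within the 92-residue protein, and the GENE3D match covers most of it.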

The following gene(s) are orthologous to this gene:
Gene             Orthologue  Organism               Block
Cla97C11G210670  Cla016604   Watermelon (97103) v1  wmwmbB361
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cla97C11G210670  Watermelon (97103) v2         wmbwmbB065
Cla97C11G210670  Silver-seed gourd             carwmbB0311
Cla97C11G210670  Silver-seed gourd             carwmbB0323
Cla97C11G210670  Silver-seed gourd             carwmbB0454
Cla97C11G210670  Silver-seed gourd             carwmbB0993
Cla97C11G210670  Cucumber (Gy14) v2            cgybwmbB105
Cla97C11G210670  Cucumber (Gy14) v2            cgybwmbB492
Cla97C11G210670  Cucumber (Gy14) v1            cgywmbB150
Cla97C11G210670  Cucumber (Gy14) v1            cgywmbB646
Cla97C11G210670  Cucurbita maxima (Rimu)       cmawmbB457
Cla97C11G210670  Cucurbita maxima (Rimu)       cmawmbB526
Cla97C11G210670  Cucurbita maxima (Rimu)       cmawmbB567
Cla97C11G210670  Cucurbita maxima (Rimu)       cmawmbB613
Cla97C11G210670  Cucurbita moschata (Rifu)     cmowmbB438
Cla97C11G210670  Cucurbita moschata (Rifu)     cmowmbB506
Cla97C11G210670  Cucurbita moschata (Rifu)     cmowmbB544
Cla97C11G210670  Cucurbita moschata (Rifu)     cmowmbB587
Cla97C11G210670  Wild cucumber (PI 183967)     cpiwmbB112
Cla97C11G210670  Wild cucumber (PI 183967)     cpiwmbB543
Cla97C11G210670  Cucumber (Chinese Long) v3    cucwmbB115
Cla97C11G210670  Cucumber (Chinese Long) v3    cucwmbB539
Cla97C11G210670  Cucumber (Chinese Long) v2    cuwmbB110
Cla97C11G210670  Cucumber (Chinese Long) v2    cuwmbB519
Cla97C11G210670  Bottle gourd (USVL1VR-Ls)     lsiwmbB043
Cla97C11G210670  Bottle gourd (USVL1VR-Ls)     lsiwmbB080
Cla97C11G210670  Melon (DHL92) v3.6.1          medwmbB082
Cla97C11G210670  Melon (DHL92) v3.6.1          medwmbB162
Cla97C11G210670  Melon (DHL92) v3.5.1          mewmbB089
Cla97C11G210670  Melon (DHL92) v3.5.1          mewmbB176
Cla97C11G210670  Watermelon (Charleston Gray)  wcgwmbB065
Cla97C11G210670  Watermelon (Charleston Gray)  wcgwmbB137
Cla97C11G210670  Watermelon (97103) v1         wmwmbB315
Cla97C11G210670  Wax gourd                     wgowmbB352