Tan0017084 (gene) Snake gourd v1

Overview
Name: Tan0017084
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glycine-rich cell wall structural protein 2-like
Location: LG04: 5545600 .. 5546668 (+)
RNA-Seq Expression: Tan0017084
Synteny: Tan0017084
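
The Location field packs a linkage group, a coordinate range and a strand into one string. The following is a minimal sketch of how it could be split apart and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'LG04: 5545600 .. 5546668 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("LG04: 5545600 .. 5546668 (+)")
print(seqid, strand, span_length(start, end))  # LG04 + 1069
```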
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTCTCTTACTTCCTTCCAATTGCACTACAAGTTTACGTAAGGCTTCCCTTTCTTCATATCCTCACATAAAAAAGCTACGTCCAAGCTCTGTTTTTTTCTCTACATTATTAATTATAATCAGCAATGGCCAGCACCAGGGCGATCGGTTTGGTGCTGTTGGTTTTGTTAATGGTGGACTTAACTCTTGCTGCTAGATTCTTGAAGGGGTATGGAAGTGGTGGTGGAGGTGGTAGTGGTGGCGGTGGCGGTGGAGGCAGCGGAGAAGGGTCAGGACCTTGGTCGGGTTCGGGATATGGTTCGGGGTATGGCTCGGGGTATGGCTCGGGATATGGCGATGAAGGCTATTCTAGATATGGAGATGAAGGCTATGGGCCGTATGGGCCATATGGGAGTGGAGGGGGAGGAGGAGGAGGAGGCGGCGGCGGCGGCGGCGCAAATGGAGCTGGGTATGGTCGCGGGTTTGGATCGGGAAGTGGGTCCGGGTATGGGTCTGGGTCTGGTGGTGGAGCAGGCCGTGGTGGAGGAGGCGGCGGTGGTAGAGGTGGTGGAGGAGGTGGAGGCTCGGGTATCGGAAGTGGGTCGGGATATGGCTCGGGATATGGAAGCGGAGGAGGCTATGGGAGCGGAGGGGGTAGAGGCAGTGGCGGAGGCGGAGGCGGGGGCGGGGGCGGAGGCGGAGGCGGCGGTGGAGGAGGAATGGCTGGAGGATCTGGCTATGGTTCTGGATATGGGTCGGGTTATGGGTCTGGATATGGAGGAGGGGGTGAAGAATCACCATGAACATATTATAATGTTACTTAGGATTTACTTTGAAAGCATTTGAAATAAAGTGCAATGGCAGCCTCCGCTTAGGAAGATGATACTGAAATTTGGTGCGTTGGTATGATGGTCTAGAATATATCTTCCAATTATTACCAAATTCTCCTTATTCTAAATAAATAAATTAGTACCGACCTTTTTTCTTAATTACATTATAATCTCCCATCAAATATTTTTCTTGTTTGTCAATTGGTGTGGGAGTATTTTTCTACTCCAACTTTACATGTTGAATCAATTAAAATTGATACAA

mRNA sequence

GTCTCTTACTTCCTTCCAATTGCACTACAAGTTTACGTAAGGCTTCCCTTTCTTCATATCCTCACATAAAAAAGCTACGTCCAAGCTCTGTTTTTTTCTCTACATTATTAATTATAATCAGCAATGGCCAGCACCAGGGCGATCGGTTTGGTGCTGTTGGTTTTGTTAATGGTGGACTTAACTCTTGCTGCTAGATTCTTGAAGGGGTATGGAAGTGGTGGTGGAGGTGGTAGTGGTGGCGGTGGCGGTGGAGGCAGCGGAGAAGGGTCAGGACCTTGGTCGGGTTCGGGATATGGTTCGGGGTATGGCTCGGGGTATGGCTCGGGATATGGCGATGAAGGCTATTCTAGATATGGAGATGAAGGCTATGGGCCGTATGGGCCATATGGGAGTGGAGGGGGAGGAGGAGGAGGAGGCGGCGGCGGCGGCGGCGCAAATGGAGCTGGGTATGGTCGCGGGTTTGGATCGGGAAGTGGGTCCGGGTATGGGTCTGGGTCTGGTGGTGGAGCAGGCCGTGGTGGAGGAGGCGGCGGTGGTAGAGGTGGTGGAGGAGGTGGAGGCTCGGGTATCGGAAGTGGGTCGGGATATGGCTCGGGATATGGAAGCGGAGGAGGCTATGGGAGCGGAGGGGGTAGAGGCAGTGGCGGAGGCGGAGGCGGGGGCGGGGGCGGAGGCGGAGGCGGCGGTGGAGGAGGAATGGCTGGAGGATCTGGCTATGGTTCTGGATATGGGTCGGGTTATGGGTCTGGATATGGAGGAGGGGGTGAAGAATCACCATGAACATATTATAATGTTACTTAGGATTTACTTTGAAAGCATTTGAAATAAAGTGCAATGGCAGCCTCCGCTTAGGAAGATGATACTGAAATTTGGTGCGTTGGTATGATGGTCTAGAATATATCTTCCAATTATTACCAAATTCTCCTTATTCTAAATAAATAAATTAGTACCGACCTTTTTTCTTAATTACATTATAATCTCCCATCAAATATTTTTCTTGTTTGTCAATTGGTGTGGGAGTATTTTTCTACTCCAACTTTACATGTTGAATCAATTAAAATTGATACAA

Coding sequence (CDS)

ATGGCCAGCACCAGGGCGATCGGTTTGGTGCTGTTGGTTTTGTTAATGGTGGACTTAACTCTTGCTGCTAGATTCTTGAAGGGGTATGGAAGTGGTGGTGGAGGTGGTAGTGGTGGCGGTGGCGGTGGAGGCAGCGGAGAAGGGTCAGGACCTTGGTCGGGTTCGGGATATGGTTCGGGGTATGGCTCGGGGTATGGCTCGGGATATGGCGATGAAGGCTATTCTAGATATGGAGATGAAGGCTATGGGCCGTATGGGCCATATGGGAGTGGAGGGGGAGGAGGAGGAGGAGGCGGCGGCGGCGGCGGCGCAAATGGAGCTGGGTATGGTCGCGGGTTTGGATCGGGAAGTGGGTCCGGGTATGGGTCTGGGTCTGGTGGTGGAGCAGGCCGTGGTGGAGGAGGCGGCGGTGGTAGAGGTGGTGGAGGAGGTGGAGGCTCGGGTATCGGAAGTGGGTCGGGATATGGCTCGGGATATGGAAGCGGAGGAGGCTATGGGAGCGGAGGGGGTAGAGGCAGTGGCGGAGGCGGAGGCGGGGGCGGGGGCGGAGGCGGAGGCGGCGGTGGAGGAGGAATGGCTGGAGGATCTGGCTATGGTTCTGGATATGGGTCGGGTTATGGGTCTGGATATGGAGGAGGGGGTGAAGAATCACCATGA

Protein sequence

MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSGYGSGYGSGYGDEGYSRYGDEGYGPYGPYGSGGGGGGGGGGGGGANGAGYGRGFGSGSGSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGGGGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSGYGGGGEESP
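
The protein sequence above is the conceptual translation of the CDS. As a rough illustration, the sketch below translates the first and last few codons of the CDS with the standard genetic code. The `translate` helper is hypothetical, and the choice of the standard code (rather than an alternative table) is an assumption that fits a nuclear plant gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCAGCACCAGGGCG"))  # first six codons of the CDS: MASTRA
print(translate("GAAGAATCACCATGA"))     # last five codons of the CDS: EESP
```

Passing the full CDS string to `translate` in the same way gives a sequence that can be compared against the listed protein.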
Homology
BLAST of Tan0017084 vs. NCBI nr
Match: KAG6601193.1 (hypothetical protein SDJN03_06426, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 190.3 bits (482), Expect = 1.8e-44
Identity = 187/212 (88.21%), Positives = 190/212 (89.62%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T  IGLVLLVLLM DLTLAARFL+GYG GGGGGSGGGGGG  GEGS PWSGSGYGSG
Sbjct: 1   MATTGTIGLVLLVLLMADLTLAARFLRGYGGGGGGGSGGGGGG--GEGSRPWSGSGYGSG 60

Query: 61  YGSGYGSGYGDEGYSRYGDEGYGPYGPYGS---GGGGGGGGGGGGGANGAGYGRGFGSGS 120
           YGSGYGSGYGD GY RYGDE YGPYGPYGS   GGGGGGGGGGGGGANGAGYG GFGSGS
Sbjct: 61  YGSGYGSGYGDGGYPRYGDEEYGPYGPYGSGGGGGGGGGGGGGGGGANGAGYGHGFGSGS 120

Query: 121 GSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGG 180
           GSGYGSG G G G GGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYG+GGGRG GGGG
Sbjct: 121 GSGYGSGGGVGRGGGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGNGGGRG-GGGG 180

Query: 181 GGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSG 210
           GGGGGGGGGGGG GM GGSGYGSGYGSGYG G
Sbjct: 181 GGGGGGGGGGGGEGMGGGSGYGSGYGSGYGGG 209
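
Each HSP statistics line reports counts as matches over alignment length, with the percentage in parentheses. The sketch below recomputes those percentages from the raw counts. The `percentages` helper is hypothetical and simply assumes the `label = n/total` layout seen in this record.

```python
import re

# Matches any "label = n/total" pair; "Query Frame = 0" has no slash and is skipped.
COUNT_PAIR = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")

def percentages(stats_line):
    """Return {label: percent} for every n/total pair on an HSP statistics line."""
    return {
        label: round(100 * int(n) / int(total), 2)
        for label, n, total in COUNT_PAIR.findall(stats_line)
    }

line = "Identity = 187/212 (88.21%), Positives = 190/212 (89.62%), Query Frame = 0"
print(percentages(line))  # {'Identity': 88.21, 'Positives': 89.62}
```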

BLAST of Tan0017084 vs. NCBI nr
Match: XP_022990525.1 (putative glycine-rich cell wall structural protein 1 [Cucurbita maxima])

HSP 1 Score: 189.1 bits (479), Expect = 4.0e-44
Identity = 185/209 (88.52%), Positives = 188/209 (89.95%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+  AIGLVLLVLLM DLTLAARFL+GYG GGGGGSGGGGGG  GEGS PWSGSGYGSG
Sbjct: 1   MATNGAIGLVLLVLLMADLTLAARFLRGYGGGGGGGSGGGGGG--GEGSRPWSGSGYGSG 60

Query: 61  YGSGYGSGYGDEGYSRYGDEGYGPYGPYGSGGGGGGGGGGGGGANGAGYGRGFGSGSGSG 120
           YGSGYGSGYGD GY RYGDEGYGPYG  G GGGGGGGGGGGG ANGAGYG+GFGSGSGSG
Sbjct: 61  YGSGYGSGYGDGGYPRYGDEGYGPYGSGGGGGGGGGGGGGGGSANGAGYGQGFGSGSGSG 120

Query: 121 YGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGGGGG 180
           YGSG G G G GGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG GGGGGGG
Sbjct: 121 YGSGGGVGRGGGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG-GGGGGGG 180

Query: 181 GGGGGGGGGGGMAGGSGYGSGYGSGYGSG 210
           GGGGGGGGG GM GGSGYGSGYGSGYG G
Sbjct: 181 GGGGGGGGGEGMGGGSGYGSGYGSGYGGG 206

BLAST of Tan0017084 vs. NCBI nr
Match: XP_023550088.1 (glycine-rich cell wall structural protein 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 181.8 bits (460), Expect = 6.4e-42
Identity = 185/216 (85.65%), Positives = 188/216 (87.04%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T AIGLVLLVLLM DLTLAARFL+GY    GGGSGGGGGG  GEGS PWSGSGYGSG
Sbjct: 1   MATTGAIGLVLLVLLMADLTLAARFLRGY----GGGSGGGGGG--GEGSRPWSGSGYGSG 60

Query: 61  YGSGYGSGYGDEGYSRYGDEGYGPYGPY-------GSGGGGGGGGGGGGGANGAGYGRGF 120
           YGSGYGSGYGD GY RYGDEGYGPYGPY       G GGGGGGGGGGGGGANGAGYG GF
Sbjct: 61  YGSGYGSGYGDGGYPRYGDEGYGPYGPYGPYGSGGGGGGGGGGGGGGGGGANGAGYGHGF 120

Query: 121 GSGSGSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGS 180
           GSGSGSGYGSG G G G GGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYG+GGGRG 
Sbjct: 121 GSGSGSGYGSGGGVGRGGGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGNGGGRG- 180

Query: 181 GGGGGGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSG 210
           GGGGGGGGGGGGGGGG GM GGSGYGSGYGSGYG G
Sbjct: 181 GGGGGGGGGGGGGGGGEGMGGGSGYGSGYGSGYGGG 209

BLAST of Tan0017084 vs. NCBI nr
Match: XP_008447018.1 (PREDICTED: glycine-rich cell wall structural protein 2-like [Cucumis melo])

HSP 1 Score: 175.3 bits (443), Expect = 5.9e-40
Identity = 184/221 (83.26%), Positives = 192/221 (86.88%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T+A+G V+LVLLM+DL LAARF +GYG GGGGGSGGGGGGG G G GPW GSGYGSG
Sbjct: 1   MATTKAVGFVVLVLLMLDLALAARFFRGYGGGGGGGSGGGGGGGEGSG-GPWPGSGYGSG 60

Query: 61  YGSGYGSGY-GDEGYSRYGDEGYGPYGPY--GSGGGGGGGGGGGGGANGAGYGRGFGSGS 120
           YGSGYGSGY G+EGYSRYGDEGYGPYG Y  G GGGGGGGGGGGG A GA YGRGFG G 
Sbjct: 61  YGSGYGSGYGGEEGYSRYGDEGYGPYGRYSGGGGGGGGGGGGGGGSAIGAEYGRGFGMGG 120

Query: 121 GSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGG 180
           GSGY  GSGGG GRGGGGGGG GGGGGGGSG+GSGSGYGSGYGSGGGYGSGGGRG GGGG
Sbjct: 121 GSGY--GSGGGVGRGGGGGGGSGGGGGGGSGVGSGSGYGSGYGSGGGYGSGGGRG-GGGG 180

Query: 181 GGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSGYGGGGEESP 219
            GGGGGGGGGGGGGM GGSGYGSGYGSGYG   GGG EESP
Sbjct: 181 VGGGGGGGGGGGGGMGGGSGYGSGYGSGYG---GGGDEESP 214

BLAST of Tan0017084 vs. NCBI nr
Match: XP_031741764.1 (glycine-rich cell wall structural protein 2 [Cucumis sativus])

HSP 1 Score: 168.3 bits (425), Expect = 7.3e-38
Identity = 181/222 (81.53%), Positives = 188/222 (84.68%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T+A GLV+LVLLM+DL LAARF +GY    GGGSGGGGGG  GEGSGPWSGSGYGSG
Sbjct: 1   MATTKAAGLVILVLLMLDLALAARFFRGY----GGGSGGGGGG--GEGSGPWSGSGYGSG 60

Query: 61  YGSGYGSGY-GDEGYSRYGDEG-YGPYGPY--GSGGGGGGGGGGGGGANGAGYGRGFGSG 120
           YGSGYGSGY G+EGYSRYGDEG YGPYGPY  G GGGGGGGGGGGG A GA YGRGFG+G
Sbjct: 61  YGSGYGSGYGGEEGYSRYGDEGRYGPYGPYSGGGGGGGGGGGGGGGSAIGAEYGRGFGAG 120

Query: 121 SGSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGG 180
           SGSGY  GSGGG GRGGGGGGG GGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG GGG
Sbjct: 121 SGSGY--GSGGGVGRGGGGGGGSGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGGGGG 180

Query: 181 GGGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSGYGGGGEESP 219
           GGGGGGGGG        GGSGYGSGYGSGYG   GGG EESP
Sbjct: 181 GGGGGGGGG--------GGSGYGSGYGSGYG---GGGDEESP 203

BLAST of Tan0017084 vs. ExPASy TrEMBL
Match: A0A6J1JQB8 (putative glycine-rich cell wall structural protein 1 OS=Cucurbita maxima OX=3661 GN=LOC111487374 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 1.9e-44
Identity = 185/209 (88.52%), Positives = 188/209 (89.95%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+  AIGLVLLVLLM DLTLAARFL+GYG GGGGGSGGGGGG  GEGS PWSGSGYGSG
Sbjct: 1   MATNGAIGLVLLVLLMADLTLAARFLRGYGGGGGGGSGGGGGG--GEGSRPWSGSGYGSG 60

Query: 61  YGSGYGSGYGDEGYSRYGDEGYGPYGPYGSGGGGGGGGGGGGGANGAGYGRGFGSGSGSG 120
           YGSGYGSGYGD GY RYGDEGYGPYG  G GGGGGGGGGGGG ANGAGYG+GFGSGSGSG
Sbjct: 61  YGSGYGSGYGDGGYPRYGDEGYGPYGSGGGGGGGGGGGGGGGSANGAGYGQGFGSGSGSG 120

Query: 121 YGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGGGGG 180
           YGSG G G G GGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG GGGGGGG
Sbjct: 121 YGSGGGVGRGGGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG-GGGGGGG 180

Query: 181 GGGGGGGGGGGMAGGSGYGSGYGSGYGSG 210
           GGGGGGGGG GM GGSGYGSGYGSGYG G
Sbjct: 181 GGGGGGGGGEGMGGGSGYGSGYGSGYGGG 206

BLAST of Tan0017084 vs. ExPASy TrEMBL
Match: A0A1S3BGF1 (glycine-rich cell wall structural protein 2-like OS=Cucumis melo OX=3656 GN=LOC103489565 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 2.9e-40
Identity = 184/221 (83.26%), Positives = 192/221 (86.88%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T+A+G V+LVLLM+DL LAARF +GYG GGGGGSGGGGGGG G G GPW GSGYGSG
Sbjct: 1   MATTKAVGFVVLVLLMLDLALAARFFRGYGGGGGGGSGGGGGGGEGSG-GPWPGSGYGSG 60

Query: 61  YGSGYGSGY-GDEGYSRYGDEGYGPYGPY--GSGGGGGGGGGGGGGANGAGYGRGFGSGS 120
           YGSGYGSGY G+EGYSRYGDEGYGPYG Y  G GGGGGGGGGGGG A GA YGRGFG G 
Sbjct: 61  YGSGYGSGYGGEEGYSRYGDEGYGPYGRYSGGGGGGGGGGGGGGGSAIGAEYGRGFGMGG 120

Query: 121 GSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGG 180
           GSGY  GSGGG GRGGGGGGG GGGGGGGSG+GSGSGYGSGYGSGGGYGSGGGRG GGGG
Sbjct: 121 GSGY--GSGGGVGRGGGGGGGSGGGGGGGSGVGSGSGYGSGYGSGGGYGSGGGRG-GGGG 180

Query: 181 GGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSGYGGGGEESP 219
            GGGGGGGGGGGGGM GGSGYGSGYGSGYG   GGG EESP
Sbjct: 181 VGGGGGGGGGGGGGMGGGSGYGSGYGSGYG---GGGDEESP 214

BLAST of Tan0017084 vs. ExPASy TrEMBL
Match: A0A0A0KUV7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G622440 PE=4 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 7.8e-38
Identity = 179/222 (80.63%), Positives = 186/222 (83.78%), Query Frame = 0

Query: 1   MASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSG 60
           MA+T+A GLV+LVLLM+DL LAARF +GY    GGGSGGGGGG  GEGSGPWSGSGYGSG
Sbjct: 1   MATTKAAGLVILVLLMLDLALAARFFRGY----GGGSGGGGGG--GEGSGPWSGSGYGSG 60

Query: 61  YGSGYGSGY-GDEGYSRYGDEG-YGPYGPY--GSGGGGGGGGGGGGGANGAGYGRGFGSG 120
           YGSGYGSGY G+EGYSRYGDEG YGPYGPY  G GGGGGGGGGGGG A GA YGRGFG+G
Sbjct: 61  YGSGYGSGYGGEEGYSRYGDEGRYGPYGPYSGGGGGGGGGGGGGGGSAIGAEYGRGFGAG 120

Query: 121 SGSGYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGG 180
           SGSGY  GSGGG GRGGGGGGG GGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRG GGG
Sbjct: 121 SGSGY--GSGGGVGRGGGGGGGSGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGGGGG 180

Query: 181 GGGGGGGGGGGGGGGMAGGSGYGSGYGSGYGSGYGGGGEESP 219
           GGGGGGG          GGSGYGSGYGSGYG   GGG EESP
Sbjct: 181 GGGGGGG----------GGSGYGSGYGSGYG---GGGDEESP 201

BLAST of Tan0017084 vs. ExPASy TrEMBL
Match: A0A6J1CF67 (putative glycine-rich cell wall structural protein 1 OS=Momordica charantia OX=3673 GN=LOC111010186 PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 8.1e-27
Identity = 145/180 (80.56%), Positives = 151/180 (83.89%), Query Frame = 0

Query: 2   ASTRAIGLVLLVLLMVDLTLAARFLKGYGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSGY 61
           +STRAIGLV+LVLLMVDLTLAAR L+GY    GGG+GGGGGG  GEG GP SGSGYGSGY
Sbjct: 4   SSTRAIGLVVLVLLMVDLTLAARSLRGY----GGGTGGGGGG--GEGFGP-SGSGYGSGY 63

Query: 62  GSGYGSGYGDEGYSRYGDEGYGPYGPYGS--GGGGGGGGGGGGGANGAGYGRGFGSGSGS 121
           GSGYGSG+GDE Y  Y  E YGPYGPYGS  GGGGGGGGGGGGGA GAGYGRGF    GS
Sbjct: 64  GSGYGSGFGDEAYEPY--EPYGPYGPYGSGGGGGGGGGGGGGGGATGAGYGRGF----GS 123

Query: 122 GYGSGSGGGAGRGGGGGGGRGGGGGGGSGIGSGSGYGSGYGSGGGYGSGGGRGSGGGGGG 180
           GYGSG G G G GGGGGGGRGGGGGGGSG GSGSGYGSGYGSGGGYG+G GRG GGGGGG
Sbjct: 124 GYGSGGGIGRGGGGGGGGGRGGGGGGGSGTGSGSGYGSGYGSGGGYGNGRGRGGGGGGGG 170

BLAST of Tan0017084 vs. ExPASy TrEMBL
Match: B9T304 (Glycine-rich cell wall structural protein 1, putative OS=Ricinus communis OX=3988 GN=RCOM_0350510 PE=4 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 1.5e-17
Identity = 153/218 (70.18%), Positives = 164/218 (75.23%), Query Frame = 0

Query: 3   STRAIGLVLLVLLMVDLTLAARFLKG-YGSGGGGGSGGGGGGGSGEGSGPWSGSGYGSGY 62
           S R +G   LVLL++DL  AAR  K  +G  GGGG GGG GGG G GS   SGSGYGSGY
Sbjct: 5   SPRVLGAAFLVLLLMDLCFAARSSKDLFGRSGGGGGGGGQGGGGGGGSALGSGSGYGSGY 64

Query: 63  GSGYGSGYGDEGYSRYGDEGYGPYGPYGSGGGGGGGGGGGGGANGAGYGRGFGSGSGSGY 122
           GSG G GYG  G       GYG  G  G GGGG GGGGGGG A+G+G G G+GSGSGSGY
Sbjct: 65  GSGGGEGYGGAG-------GYGGLGGGGGGGGGSGGGGGGGSASGSGSGSGYGSGSGSGY 124

Query: 123 GSGSGGGAGRGGGGGGGRGGGGGGGSGI--GSGSGYGSGY--GSGGGYGSGGGRGSGGGG 182
           GSGSGGG G GGGGGGG+GGGGGGG G+  G+GSGYGSGY  GSG GYGSGGG+G GGGG
Sbjct: 125 GSGSGGGKGGGGGGGGGKGGGGGGGGGVGNGNGSGYGSGYGSGSGSGYGSGGGKGGGGGG 184

Query: 183 GGGGGGGGGGGGGGMAGGSGYGSGY--GSGYGSGYGGG 214
           GGGGGGGGGGGGGG   GSGYGSGY  GSGYGSGYGGG
Sbjct: 185 GGGGGGGGGGGGGGSGSGSGYGSGYGSGSGYGSGYGGG 215

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
KAG6601193.1      1.8e-44    88.21          hypothetical protein SDJN03_06426, partial [Cucurbita argyrosperma subsp. sorori...
XP_022990525.1    4.0e-44    88.52          putative glycine-rich cell wall structural protein 1 [Cucurbita maxima]
XP_023550088.1    6.4e-42    85.65          glycine-rich cell wall structural protein 2-like [Cucurbita pepo subsp. pepo]
XP_008447018.1    5.9e-40    83.26          PREDICTED: glycine-rich cell wall structural protein 2-like [Cucumis melo]
XP_031741764.1    7.3e-38    81.53          glycine-rich cell wall structural protein 2 [Cucumis sativus]

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A6J1JQB8        1.9e-44    88.52          putative glycine-rich cell wall structural protein 1 OS=Cucurbita maxima OX=3661...
A0A1S3BGF1        2.9e-40    83.26          glycine-rich cell wall structural protein 2-like OS=Cucumis melo OX=3656 GN=LOC1...
A0A0A0KUV7        7.8e-38    80.63          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G622440 PE=4 SV=1
A0A6J1CF67        8.1e-27    80.56          putative glycine-rich cell wall structural protein 1 OS=Momordica charantia OX=3...
B9T304            1.5e-17    70.18          Glycine-rich cell wall structural protein 1, putative OS=Ricinus communis OX=398...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term     Source Description                       Alignment
None       No IPR available   PRINTS        PR01228         EGGSHELL                                 coord: 126..136, score: 39.77
                                                                                                     coord: 6..22, score: 27.94
                                                                                                     coord: 39..50, score: 54.17
                                                                                                     coord: 99..114, score: 55.47
                                                                                                     coord: 162..180, score: 35.53
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                      coord: 167..186
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                      coord: 120..141
None       No IPR available   PANTHER       PTHR37612       FIBROIN HEAVY CHAIN FIB-H LIKE PROTEIN   coord: 1..215
None       No IPR available   PANTHER       PTHR37612:SF6   FIBROIN HEAVY CHAIN FIB-H LIKE PROTEIN   coord: 1..215
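
The `coord` values in the Alignment column give start and end positions on the protein. The sketch below shows one way to pull the matching residues out of a sequence. The `region` helper is hypothetical, and it assumes the coordinates are 1-based and inclusive; the example uses only the N-terminal part of the protein sequence listed earlier.

```python
def region(seq, start, end):
    """Return residues start..end of seq, treating both bounds as 1-based inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return seq[start - 1:end]

# N-terminal 28 residues of the Tan0017084 protein.
n_terminus = "MASTRAIGLVLLVLLMVDLTLAARFLKG"
print(region(n_terminus, 6, 22))  # PRINTS match at coord 6..22: AIGLVLLVLLMVDLTLA
```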

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0017084.1    Tan0017084.1    mRNA