Tan0012170 (gene) Snake gourd v1

Overview
Name: Tan0012170
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: glycine-rich protein 5
Location: LG05: 82988039 .. 82988629 (-)
RNA-Seq Expression: Tan0012170
Synteny: Tan0012170
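The Location field can be split into a linkage group, start, end, and strand, and the span length follows from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as is conventional for GFF-style annotations; the record itself does not state this. The function name `parse_location` is hypothetical.

```python
import re

# Pattern for a location string such as "LG05: 82988039 .. 82988629 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Return (sequence id, start, end, strand) from a location string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seq_id, start, end, strand = match.groups()
    return seq_id, int(start), int(end), strand

seq_id, start, end, strand = parse_location("LG05: 82988039 .. 82988629 (-)")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(seq_id, strand, span_length)
```

Under that assumption the gene spans 591 bp on the minus strand of LG05.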
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAAGAGCAAACCCATATTCCTCAAGAATTATTGCTCATCAACAAGAAATGCATGGGAAAGTCCCAACTAAAAGGCAAGTTTGGGCTCTGTAAACTGCCCAAGCACGGATAATGAATGATATTTTTATGTCATTTCATCCATTTCTCTTCCTTTTTTCAGTTCAAAATGTTCTATAAATTCCTACACTCACAATTCCCAATTCCCATCAAAGAAATCATCCTTTAATACAAAAATCATGGGTTCTTTGGCTTTCAAATTCCTCTGGCTGGTGGTTCTGTCTGCACTAATTTGGGGCTCAGAGCCCAGAAGCCTTGTTAGCAATGGGGGAGAGATTGGAAACGAAAAGACAGGGTATTTCCAGTTCTTTCCTGGTTATGGCGGTGGAGGCTTTGGTGGCGGTGGTGGCGGCGGCGGCGGTGGATTAGGTGGTGGTTCGGGCTTTGGCAGTGGAGGTGGAGGGGGGTTTGGGACTGGAATTGGGGGGCTCGGCGGTGGAGGATTCGGCGGAGGTGGTGGCGGTGGCAGTGGCGTTTTAGGCAGTGGAGCGGGTGGAGGAGCTGGTGGTGGATTTGGAGGTGGACTGCCTTGA

mRNA sequence

AAAAAGAGCAAACCCATATTCCTCAAGAATTATTGCTCATCAACAAGAAATGCATGGGAAAGTCCCAACTAAAAGGCAAGTTTGGGCTCTGTAAACTGCCCAAGCACGGATAATGAATGATATTTTTATGTCATTTCATCCATTTCTCTTCCTTTTTTCAGTTCAAAATGTTCTATAAATTCCTACACTCACAATTCCCAATTCCCATCAAAGAAATCATCCTTTAATACAAAAATCATGGGTTCTTTGGCTTTCAAATTCCTCTGGCTGGTGGTTCTGTCTGCACTAATTTGGGGCTCAGAGCCCAGAAGCCTTGTTAGCAATGGGGGAGAGATTGGAAACGAAAAGACAGGGTATTTCCAGTTCTTTCCTGGTTATGGCGGTGGAGGCTTTGGTGGCGGTGGTGGCGGCGGCGGCGGTGGATTAGGTGGTGGTTCGGGCTTTGGCAGTGGAGGTGGAGGGGGGTTTGGGACTGGAATTGGGGGGCTCGGCGGTGGAGGATTCGGCGGAGGTGGTGGCGGTGGCAGTGGCGTTTTAGGCAGTGGAGCGGGTGGAGGAGCTGGTGGTGGATTTGGAGGTGGACTGCCTTGA

Coding sequence (CDS)

ATGGGTTCTTTGGCTTTCAAATTCCTCTGGCTGGTGGTTCTGTCTGCACTAATTTGGGGCTCAGAGCCCAGAAGCCTTGTTAGCAATGGGGGAGAGATTGGAAACGAAAAGACAGGGTATTTCCAGTTCTTTCCTGGTTATGGCGGTGGAGGCTTTGGTGGCGGTGGTGGCGGCGGCGGCGGTGGATTAGGTGGTGGTTCGGGCTTTGGCAGTGGAGGTGGAGGGGGGTTTGGGACTGGAATTGGGGGGCTCGGCGGTGGAGGATTCGGCGGAGGTGGTGGCGGTGGCAGTGGCGTTTTAGGCAGTGGAGCGGGTGGAGGAGCTGGTGGTGGATTTGGAGGTGGACTGCCTTGA

Protein sequence

MGSLAFKFLWLVVLSALIWGSEPRSLVSNGGEIGNEKTGYFQFFPGYGGGGFGGGGGGGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGGGGGGGSGVLGSGAGGGAGGGFGGGLP
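The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the translation can be reproduced as follows. The helper name `translate` is illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First six codons and last five codons of the CDS listed above.
print(translate("ATGGGTTCTTTGGCTTTC"))  # start of the protein
print(translate("GGTGGACTGCCTTGA"))     # end of the protein, including the stop
```

Passing the full CDS string to `translate` in the same way should give the complete protein sequence, provided the standard-code assumption holds for this annotation.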
Homology
BLAST of Tan0012170 vs. NCBI nr
Match: XP_038895952.1 (glycine-rich cell wall structural protein-like [Benincasa hispida])

HSP 1 Score: 116.3 bits (290), Expect = 1.8e-22
Identity = 102/123 (82.93%), Positives = 107/123 (86.99%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGSLAFKFLWLVVL ALIW SEPR LV +NGGEI +EKT  FQFFPG+GG  GG GG GG
Sbjct: 1   MGSLAFKFLWLVVLCALIWASEPRKLVITNGGEIESEKTLAFQFFPGFGGGLGGGGGYGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGL---GGGGFGGGGGGGSGVLGSGAGGGAGGGFGG 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGL   GGGGFGGGGGGG G+LG GAGGGAGGGFGG
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGGGILGGGAGGGAGGGFGG 120
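The percentages in each HSP header are the reported match counts divided by the alignment length. A small sketch of that arithmetic, using the counts from the Benincasa hispida hit above (the function name `percent` is illustrative):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100 * matches / alignment_length, 2)

# Counts taken from the first HSP header: 102 identical and 107 positive
# positions over an alignment of 123 columns.
identity = percent(102, 123)
positives = percent(107, 123)
print(identity, positives)
```

The same calculation applies to the other hits, each with its own alignment length (121, 123, or 126 columns).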

BLAST of Tan0012170 vs. NCBI nr
Match: XP_022994117.1 (glycine-rich protein 5 [Cucurbita maxima])

HSP 1 Score: 109.4 bits (272), Expect = 2.1e-20
Identity = 100/123 (81.30%), Positives = 103/123 (83.74%), Query Frame = 0

Query: 1   MGS-LAFKFLWLVVLSALIWGSEPRSLVSNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGS LA KFLWL+VL ALI  SEPR LV  GG  G+EKTGYFQFFPGYGG  GG GG GG
Sbjct: 1   MGSFLALKFLWLLVLFALICASEPRRLVGTGGGFGSEKTGYFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGL---GGGGFGGGGGGGSGVLGSGAGGGAGGGFGG 118
           GGGGGLGGGSGFGSGGG GFG+GIGGL   GGGGFGGGGG GSGVLG GAGGGAGGGFGG
Sbjct: 61  GGGGGLGGGSGFGSGGGAGFGSGIGGLGGGGGGGFGGGGGSGSGVLGGGAGGGAGGGFGG 120

BLAST of Tan0012170 vs. NCBI nr
Match: XP_011648421.1 (glycine-rich cell wall structural protein [Cucumis sativus] >KGN64508.1 hypothetical protein Csa_013856 [Cucumis sativus])

HSP 1 Score: 109.0 bits (271), Expect = 2.8e-20
Identity = 99/121 (81.82%), Positives = 104/121 (85.95%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGS AFK+LWLVVL ALIW SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSSAFKYLWLVVLCALIWASEPRKLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGGAGGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGAGGGFGGGL 120

BLAST of Tan0012170 vs. NCBI nr
Match: XP_023542188.1 (glycine-rich protein 23 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 106.3 bits (264), Expect = 1.8e-19
Identity = 101/126 (80.16%), Positives = 104/126 (82.54%), Query Frame = 0

Query: 1   MGS-LAFKFLWLVVLSALIWGSEPRSLVSNGGEIGNEKTGYFQFFPGYG---GGGFGGGG 60
           MGS LA KFLWL+VL ALI  SEPR LV  GG  G+EKTGYFQFFPGYG   GGG G GG
Sbjct: 1   MGSFLALKFLWLLVLFALICASEPRRLVGTGGGFGSEKTGYFQFFPGYGSGLGGGGGFGG 60

Query: 61  GGGGGGLGGGSGFGSGGGGGFGTGIGGL---GGGGFGGGGGGGSGVLGSGA--GGGAGGG 118
           GGGGGGLGGGSGFGSGGGGGFG+GIGGL   GGGGFGGGGG GSGVLG GA  GGGAGGG
Sbjct: 61  GGGGGGLGGGSGFGSGGGGGFGSGIGGLGGGGGGGFGGGGGSGSGVLGGGAGGGGGAGGG 120

BLAST of Tan0012170 vs. NCBI nr
Match: XP_008446362.1 (PREDICTED: glycine-rich cell wall structural protein [Cucumis melo] >ADN34062.1 hypothetical protein [Cucumis melo subsp. melo] >KAA0056845.1 glycine-rich cell wall structural protein [Cucumis melo var. makuwa] >TYJ99348.1 glycine-rich cell wall structural protein [Cucumis melo var. makuwa])

HSP 1 Score: 104.4 bits (259), Expect = 6.9e-19
Identity = 98/121 (80.99%), Positives = 103/121 (85.12%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGSLAFK+LWLVVL ALI  SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSLAFKYLWLVVLCALICASEPRRLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGG GGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGGGGGFGGGL 120

BLAST of Tan0012170 vs. ExPASy TrEMBL
Match: A0A6J1JUT9 (glycine-rich protein 5 OS=Cucurbita maxima OX=3661 GN=LOC111489946 PE=4 SV=1)

HSP 1 Score: 109.4 bits (272), Expect = 1.0e-20
Identity = 100/123 (81.30%), Positives = 103/123 (83.74%), Query Frame = 0

Query: 1   MGS-LAFKFLWLVVLSALIWGSEPRSLVSNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGS LA KFLWL+VL ALI  SEPR LV  GG  G+EKTGYFQFFPGYGG  GG GG GG
Sbjct: 1   MGSFLALKFLWLLVLFALICASEPRRLVGTGGGFGSEKTGYFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGL---GGGGFGGGGGGGSGVLGSGAGGGAGGGFGG 118
           GGGGGLGGGSGFGSGGG GFG+GIGGL   GGGGFGGGGG GSGVLG GAGGGAGGGFGG
Sbjct: 61  GGGGGLGGGSGFGSGGGAGFGSGIGGLGGGGGGGFGGGGGSGSGVLGGGAGGGAGGGFGG 120

BLAST of Tan0012170 vs. ExPASy TrEMBL
Match: A0A0A0LS00 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062340 PE=4 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 1.4e-20
Identity = 99/121 (81.82%), Positives = 104/121 (85.95%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGS AFK+LWLVVL ALIW SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSSAFKYLWLVVLCALIWASEPRKLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGGAGGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGAGGGFGGGL 120

BLAST of Tan0012170 vs. ExPASy TrEMBL
Match: A0A5D3BMG3 (Glycine-rich cell wall structural protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005650 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.3e-19
Identity = 98/121 (80.99%), Positives = 103/121 (85.12%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGSLAFK+LWLVVL ALI  SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSLAFKYLWLVVLCALICASEPRRLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGG GGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGGGGGFGGGL 120

BLAST of Tan0012170 vs. ExPASy TrEMBL
Match: E5GC63 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.3e-19
Identity = 98/121 (80.99%), Positives = 103/121 (85.12%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGSLAFK+LWLVVL ALI  SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSLAFKYLWLVVLCALICASEPRRLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGG GGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGGGGGFGGGL 120

BLAST of Tan0012170 vs. ExPASy TrEMBL
Match: A0A1S3BEW4 (glycine-rich cell wall structural protein OS=Cucumis melo OX=3656 GN=LOC103489125 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.3e-19
Identity = 98/121 (80.99%), Positives = 103/121 (85.12%), Query Frame = 0

Query: 1   MGSLAFKFLWLVVLSALIWGSEPRSLV-SNGGEIGNEKTGYFQFFPGYGG--GGFGGGGG 60
           MGSLAFK+LWLVVL ALI  SEPR LV +NGGEI +EKT  FQFFPGYGG  GG GG GG
Sbjct: 1   MGSLAFKYLWLVVLCALICASEPRRLVITNGGEIESEKTLPFQFFPGYGGGLGGGGGFGG 60

Query: 61  GGGGGLGGGSGFGSGGGGGFGTGIGGLGGGGFGG-GGGGGSGVLGSGAGGGAGGGFGGGL 118
           GGGGGLGGGSGFGSGGGGGFG+GIGGLG GG GG GGGGG G+LG GAGGG GGGFGGGL
Sbjct: 61  GGGGGLGGGSGFGSGGGGGFGSGIGGLGSGGGGGFGGGGGGGILGGGAGGGGGGGFGGGL 120

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038895952.1  1.8e-22  82.93         glycine-rich cell wall structural protein-like [Benincasa hispida]
XP_022994117.1  2.1e-20  81.30         glycine-rich protein 5 [Cucurbita maxima]
XP_011648421.1  2.8e-20  81.82         glycine-rich cell wall structural protein [Cucumis sativus] >KGN64508.1 hypothet...
XP_023542188.1  1.8e-19  80.16         glycine-rich protein 23 [Cucurbita pepo subsp. pepo]
XP_008446362.1  6.9e-19  80.99         PREDICTED: glycine-rich cell wall structural protein [Cucumis melo] >ADN34062.1 ...

ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A6J1JUT9  1.0e-20  81.30         glycine-rich protein 5 OS=Cucurbita maxima OX=3661 GN=LOC111489946 PE=4 SV=1
A0A0A0LS00  1.4e-20  81.82         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G062340 PE=4 SV=1
A0A5D3BMG3  3.3e-19  80.99         Glycine-rich cell wall structural protein OS=Cucumis melo var. makuwa OX=1194695...
E5GC63      3.3e-19  80.99         Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A1S3BEW4  3.3e-19  80.99         glycine-rich cell wall structural protein OS=Cucumis melo OX=3656 GN=LOC10348912...
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0012170.1  Tan0012170.1  mRNA