Tan0007177 (gene) Snake gourd v1

Overview
Name: Tan0007177
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: serine-rich protein-related
Location: LG06: 2436045 .. 2436614 (-)
RNA-Seq Expression: Tan0007177
Synteny: Tan0007177
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTTCTCTTTTCCTCTCTTTTAAGATAACTCTACCATTTTCCCTTCCATTAACATTTAATTTAACACACCATTTTCACAATTACTCCAAATCAACCTTTTCCCATTTCCTTCGTCTTCAAGCCATTTACAAAATCACCGCCTTTAAAATAAATGTGTTACCCGGCCGTCGTTCCCGTTTACAGAGCTGCCGCCTTGCGCGGAGTTGCGGCGGAGCAGGAAGCCGTCCGGCAGCAGAGAAGTGGCGGTGGAAACAAGTGTATTTGCTCGCCGACGACCCACCCAGGTTCTTTCAGATGTAGATTTCATCATAGAGAGTATAAATGGGTTAGCCGATCGAGAATTTCAAAACCCCACATGCTTTAATAACAATTTATATATGTTTATATATATACAAAATCTGTATATATGAAATATGTGGATTGGGTCTGTTTGTTGTGATTGTTTTTTAAACTAAATAGTCTGCAAATAGGCTGTGTTGTGAAATGTGATGTTTCTGTTCGTTTTGGGACATTGGTTTTTAATTTTTATATGGTGTAATTATCTGACTCAGAATTGTGTTGAAATCAAG

mRNA sequence

CTCTTCTCTTTTCCTCTCTTTTAAGATAACTCTACCATTTTCCCTTCCATTAACATTTAATTTAACACACCATTTTCACAATTACTCCAAATCAACCTTTTCCCATTTCCTTCGTCTTCAAGCCATTTACAAAATCACCGCCTTTAAAATAAATGTGTTACCCGGCCGTCGTTCCCGTTTACAGAGCTGCCGCCTTGCGCGGAGTTGCGGCGGAGCAGGAAGCCGTCCGGCAGCAGAGAAGTGGCGGTGGAAACAAGTGTATTTGCTCGCCGACGACCCACCCAGGTTCTTTCAGATGTAGATTTCATCATAGAGAGTATAAATGGGTTAGCCGATCGAGAATTTCAAAACCCCACATGCTTTAATAACAATTTATATATGTTTATATATATACAAAATCTGTATATATGAAATATGTGGATTGGGTCTGTTTGTTGTGATTGTTTTTTAAACTAAATAGTCTGCAAATAGGCTGTGTTGTGAAATGTGATGTTTCTGTTCGTTTTGGGACATTGGTTTTTAATTTTTATATGGTGTAATTATCTGACTCAGAATTGTGTTGAAATCAAG

Coding sequence (CDS)

ATGTGTTACCCGGCCGTCGTTCCCGTTTACAGAGCTGCCGCCTTGCGCGGAGTTGCGGCGGAGCAGGAAGCCGTCCGGCAGCAGAGAAGTGGCGGTGGAAACAAGTGTATTTGCTCGCCGACGACCCACCCAGGTTCTTTCAGATGTAGATTTCATCATAGAGAGTATAAATGGGTTAGCCGATCGAGAATTTCAAAACCCCACATGCTTTAA

Protein sequence

MCYPAVVPVYRAAALRGVAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFHHREYKWVSRSRISKPHML
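
The CDS and protein sequence above can be cross-checked by translating the CDS in frame 0. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database page.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: the nuclear code applies.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# CDS and protein copied from the Tan0007177 record above.
CDS = (
    "ATGTGTTACCCGGCCGTCGTTCCCGTTTACAGAGCTGCCGCCTTGCGCGGAGTTGCGGCGGAGCAGGAAGCC"
    "GTCCGGCAGCAGAGAAGTGGCGGTGGAAACAAGTGTATTTGCTCGCCGACGACCCACCCAGGTTCTTTCAGA"
    "TGTAGATTTCATCATAGAGAGTATAAATGGGTTAGCCGATCGAGAATTTCAAAACCCCACATGCTTTAA"
)
PROTEIN = "MCYPAVVPVYRAAALRGVAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFHHREYKWVSRSRISKPHML"

if __name__ == "__main__":
    print(translate(CDS) == PROTEIN)
```

The sketch only handles unambiguous A/C/G/T codons; a sequence containing N or other IUPAC ambiguity codes would raise a `KeyError`.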
Homology
BLAST of Tan0007177 vs. NCBI nr
Match: KGN45357.1 (hypothetical protein Csa_016243 [Cucumis sativus])

HSP 1 Score: 92.0 bits (227), Expect = 2.1e-15
Identity = 47/73 (64.38%), Positives = 52/73 (71.23%), Query Frame = 0

Query: 1  MCYPAVVPVYRAAAL-------RGVAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFHH 60
          MCYP  +P +R  A          VAA +EAVRQQRSGGG KCICSPTTHPGSF+CRFH 
Sbjct: 1  MCYP-TLPFHRPTAFSFASHGRSVVAAVEEAVRQQRSGGGYKCICSPTTHPGSFKCRFHQ 60

Query: 61 REYKWVSRSRISK 67
           +YKWVSRS  SK
Sbjct: 61 GDYKWVSRSTTSK 72
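
The percentages in each HSP header are the match counts divided by the alignment length, gaps included. A small sketch of that arithmetic, assuming two-decimal rounding as shown in the report (the helper name `percent` is hypothetical):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


if __name__ == "__main__":
    # Counts taken from the KGN45357.1 HSP header above: 47 identical and
    # 52 similar positions over an alignment of 73 columns.
    print(percent(47, 73), percent(52, 73))
```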

BLAST of Tan0007177 vs. NCBI nr
Match: CAA7043877.1 (unnamed protein product [Microthlaspi erraticum])

HSP 1 Score: 64.7 bits (156), Expect = 3.6e-07
Identity = 32/77 (41.56%), Positives = 43/77 (55.84%), Query Frame = 0

Query: 1  MCYPAVVPV----------YRAAALRGVAAEQEAVRQQRSGGG--NKCICSPTTHPGSFR 60
          MCY  ++P+            AA +  V  E  A+   R GGG   KC+CSP+ HPGSF+
Sbjct: 1  MCYKKILPLVIPPESIHVNMPAAEVTVVTTEVGAIEGHRDGGGGKKKCVCSPSKHPGSFK 60

Query: 61 CRFHHREYKWVSRSRIS 66
          CR+HH EY+W+  S  S
Sbjct: 61 CRYHHHEYQWLPSSSSS 77

BLAST of Tan0007177 vs. NCBI nr
Match: CAF2214202.1 (unnamed protein product [Brassica napus])

HSP 1 Score: 64.3 bits (155), Expect = 4.7e-07
Identity = 32/78 (41.03%), Positives = 43/78 (55.13%), Query Frame = 0

Query: 1  MCYPAVVPV-------YRAAALRGVAAEQEAVR---QQRSGGG--NKCICSPTTHPGSFR 60
          MCY  ++P+       +      GV    E V    + R GGG   KC+CSP+THP SF+
Sbjct: 1  MCYKKIIPLVIPPESFHENVPAAGVTVATEVVATVGRHRDGGGEKKKCVCSPSTHPRSFK 60

Query: 61 CRFHHREYKWVSRSRISK 67
          CR+HH EY+WV  S + K
Sbjct: 61 CRYHHHEYQWVPSSSLHK 78

BLAST of Tan0007177 vs. NCBI nr
Match: PLY76495.1 (hypothetical protein LSAT_4X103641 [Lactuca sativa])

HSP 1 Score: 63.5 bits (153), Expect = 8.1e-07
Identity = 33/66 (50.00%), Positives = 39/66 (59.09%), Query Frame = 0

Query: 1  MCYPAVVPVYRA----AALRGVAAEQEAVRQQRSGGG---NKCICSPTTHPGSFRCRFHH 60
          MCYPA V           +R +AA   AV  + +GGG    +C+CSPT HPGSFRCR HH
Sbjct: 1  MCYPAGVSSISGHESEIRIRRLAALATAVNHREAGGGGVKKQCLCSPTIHPGSFRCRHHH 60

BLAST of Tan0007177 vs. NCBI nr
Match: XP_022846371.1 (uncharacterized protein LOC111369102 [Olea europaea var. sylvestris] >CAA2987260.1 Hypothetical predicted protein [Olea europaea subsp. europaea])

HSP 1 Score: 63.5 bits (153), Expect = 8.1e-07
Identity = 31/65 (47.69%), Positives = 39/65 (60.00%), Query Frame = 0

Query: 6  VVPVYRAAALRGVAAEQEAVRQQRSGGGN---------KCICSPTTHPGSFRCRFHHREY 62
          V P   +AA+  VA +QE V    + GG+         +C+CSPT HPGSFRCR HH EY
Sbjct: 25 VTPPPPSAAVVEVARQQETVTTAEANGGSGGVGGGGMRRCVCSPTNHPGSFRCRHHHAEY 84

BLAST of Tan0007177 vs. ExPASy TrEMBL
Match: A0A0A0KAE1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G446750 PE=4 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 1.0e-15
Identity = 47/73 (64.38%), Positives = 52/73 (71.23%), Query Frame = 0

Query: 1  MCYPAVVPVYRAAAL-------RGVAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFHH 60
          MCYP  +P +R  A          VAA +EAVRQQRSGGG KCICSPTTHPGSF+CRFH 
Sbjct: 1  MCYP-TLPFHRPTAFSFASHGRSVVAAVEEAVRQQRSGGGYKCICSPTTHPGSFKCRFHQ 60

Query: 61 REYKWVSRSRISK 67
           +YKWVSRS  SK
Sbjct: 61 GDYKWVSRSTTSK 72

BLAST of Tan0007177 vs. ExPASy TrEMBL
Match: A0A6D2K7L8 (Uncharacterized protein OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS31112 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 1.8e-07
Identity = 32/77 (41.56%), Positives = 43/77 (55.84%), Query Frame = 0

Query: 1  MCYPAVVPV----------YRAAALRGVAAEQEAVRQQRSGGG--NKCICSPTTHPGSFR 60
          MCY  ++P+            AA +  V  E  A+   R GGG   KC+CSP+ HPGSF+
Sbjct: 1  MCYKKILPLVIPPESIHVNMPAAEVTVVTTEVGAIEGHRDGGGGKKKCVCSPSKHPGSFK 60

Query: 61 CRFHHREYKWVSRSRIS 66
          CR+HH EY+W+  S  S
Sbjct: 61 CRYHHHEYQWLPSSSSS 77

BLAST of Tan0007177 vs. ExPASy TrEMBL
Match: A0A2J6KN41 (Uncharacterized protein OS=Lactuca sativa OX=4236 GN=LSAT_4X103641 PE=4 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 3.9e-07
Identity = 33/66 (50.00%), Positives = 39/66 (59.09%), Query Frame = 0

Query: 1  MCYPAVVPVYRA----AALRGVAAEQEAVRQQRSGGG---NKCICSPTTHPGSFRCRFHH 60
          MCYPA V           +R +AA   AV  + +GGG    +C+CSPT HPGSFRCR HH
Sbjct: 1  MCYPAGVSSISGHESEIRIRRLAALATAVNHREAGGGGVKKQCLCSPTIHPGSFRCRHHH 60

BLAST of Tan0007177 vs. ExPASy TrEMBL
Match: A0A078IDV4 (BnaC03g69280D protein OS=Brassica napus OX=3708 GN=BnaC03g69280D PE=4 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 8.7e-07
Identity = 31/74 (41.89%), Positives = 41/74 (55.41%), Query Frame = 0

Query: 1  MCYPAVVPV-------YRAAALRGVAAEQEAVR---QQRSGGG--NKCICSPTTHPGSFR 60
          MCY   +P+       +    + GV    E V    + R GGG   KC+CSP+THP SF+
Sbjct: 1  MCYKKTIPLVIPPESFHENVPVAGVTVSTEVVATVGRHRDGGGEKKKCVCSPSTHPRSFK 60

Query: 61 CRFHHREYKWVSRS 63
          CR+HH EY+WV  S
Sbjct: 61 CRYHHHEYQWVPSS 74

BLAST of Tan0007177 vs. ExPASy TrEMBL
Match: A0A3N6Q757 (Uncharacterized protein OS=Brassica cretica OX=69181 GN=DY000_00007316 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.1e-06
Identity = 31/74 (41.89%), Positives = 41/74 (55.41%), Query Frame = 0

Query: 1  MCYPAVVPV-------YRAAALRGVAAEQE---AVRQQRSGGG--NKCICSPTTHPGSFR 60
          MCY  ++P+       +      GV    E    V + R GGG   KC+CSP+THP SF+
Sbjct: 1  MCYKKIIPLVIPPESFHENVPAAGVTVATEVVVTVGRHRDGGGEKKKCVCSPSTHPRSFK 60

Query: 61 CRFHHREYKWVSRS 63
          CR+HH EY+WV  S
Sbjct: 61 CRYHHHEYQWVPSS 74

BLAST of Tan0007177 vs. TAIR 10
Match: AT1G52342.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 54.3 bits (129), Expect = 4.6e-08
Identity = 30/79 (37.97%), Positives = 40/79 (50.63%), Query Frame = 0

Query: 1  MCYPAVVPVY---------RAAALRGVAAEQEAVRQQRSGGG-----NKCICSPTTHPGS 60
          MCY  ++P+             A   VA + EA    RSGG       KC+CSP+ HP S
Sbjct: 1  MCYKILLPLVIPPESFHENTPVAEVTVATDVEATAGHRSGGDGGGGKKKCVCSPSKHPRS 60

Query: 61 FRCRFHHREYKWVSRSRIS 66
          F+CR+H  EY+W+  S  S
Sbjct: 61 FKCRYHQHEYQWLPSSSSS 79

BLAST of Tan0007177 vs. TAIR 10
Match: AT5G20370.1 (serine-rich protein-related )

HSP 1 Score: 46.6 bits (109), Expect = 9.5e-06
Identity = 17/20 (85.00%), Positives = 18/20 (90.00%), Query Frame = 0

Query: 35 KCICSPTTHPGSFRCRFHHR 55
          KC+CSPTTHPGSFRC FH R
Sbjct: 70 KCLCSPTTHPGSFRCSFHRR 89

BLAST of Tan0007177 vs. TAIR 10
Match: AT3G13227.1 (serine-rich protein-related )

HSP 1 Score: 44.7 bits (104), Expect = 3.6e-05
Identity = 20/43 (46.51%), Positives = 24/43 (55.81%), Query Frame = 0

Query: 26 RQQRSGGG--------------NKCICSPTTHPGSFRCRFHHR 55
          R +R GGG                C+C+PTTHPGSFRCR+H R
Sbjct: 41 RVRRGGGGGGGSVGMSKSSSVRQNCLCAPTTHPGSFRCRYHRR 83

BLAST of Tan0007177 vs. TAIR 10
Match: AT1G67910.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G24577.1); Has 167 Blast hits to 167 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 43.1 bits (100), Expect = 1.1e-04
Identity = 18/35 (51.43%), Positives = 21/35 (60.00%), Query Frame = 0

Query: 18 VAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFH 53
          V      + +Q S     C+CSPTTHPGSFRCR H
Sbjct: 26 VEVGSRGLSRQTSMTKTNCLCSPTTHPGSFRCRIH 60

BLAST of Tan0007177 vs. TAIR 10
Match: AT1G67910.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G24577.1). )

HSP 1 Score: 43.1 bits (100), Expect = 1.1e-04
Identity = 18/35 (51.43%), Positives = 21/35 (60.00%), Query Frame = 0

Query: 18 VAAEQEAVRQQRSGGGNKCICSPTTHPGSFRCRFH 53
          V      + +Q S     C+CSPTTHPGSFRCR H
Sbjct: 26 VEVGSRGLSRQTSMTKTNCLCSPTTHPGSFRCRIH 60

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
KGN45357.1       2.1e-15   64.38          hypothetical protein Csa_016243 [Cucumis sativus]
CAA7043877.1     3.6e-07   41.56          unnamed protein product [Microthlaspi erraticum]
CAF2214202.1     4.7e-07   41.03          unnamed protein product [Brassica napus]
PLY76495.1       8.1e-07   50.00          hypothetical protein LSAT_4X103641 [Lactuca sativa]
XP_022846371.1   8.1e-07   47.69          uncharacterized protein LOC111369102 [Olea europaea var. sylvestris] >CAA2987260...

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A0A0KAE1       1.0e-15   64.38          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G446750 PE=4 SV=1
A0A6D2K7L8       1.8e-07   41.56          Uncharacterized protein OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS31112 ...
A0A2J6KN41       3.9e-07   50.00          Uncharacterized protein OS=Lactuca sativa OX=4236 GN=LSAT_4X103641 PE=4 SV=1
A0A078IDV4       8.7e-07   41.89          BnaC03g69280D protein OS=Brassica napus OX=3708 GN=BnaC03g69280D PE=4 SV=1
A0A3N6Q757       1.1e-06   41.89          Uncharacterized protein OS=Brassica cretica OX=69181 GN=DY000_00007316 PE=4 SV=1

TAIR 10
Match Name       E-value   Identity (%)   Description
AT1G52342.1      4.6e-08   37.97          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G20370.1      9.5e-06   85.00          serine-rich protein-related
AT3G13227.1      3.6e-05   46.51          serine-rich protein-related
AT1G67910.1      1.1e-04   51.43          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G67910.2      1.1e-04   51.43          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source    Source Term      Source Description        Alignment
None       No IPR available   PANTHER   PTHR33132        OSJNBB0118P14.9 PROTEIN   coord: 16..62
None       No IPR available   PANTHER   PTHR33132:SF72   OS01G0778900 PROTEIN      coord: 16..62

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0007177.1   Tan0007177.1   mRNA