Tan0000472 (gene) Snake gourd v1

Overview
NameTan0000472
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG05: 73276930 .. 73277395 (-)
RNA-Seq ExpressionTan0000472
SyntenyTan0000472
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCAATTGGGTCTCTTCTTTCAAACCCATCTAGAGAGAGAGAGGTTGTGAAAATGGCGAGACTGATCAGAGAAATTGGTAGTGTTAATGGAAGTTTTGGACTCGCCGGCGACATTTATTTGCTGTTTTGGGCGGCTCTGTTTACACTCTGCATAATCTCAACCATAATTTTCTCTTGCTCCGACGGCATGTCAAAAGAGAGGAATTCCACGGCGGACGTCGAGCTTTACGGCGGCGGTTGCGCCGCCGGGTGTGGCGCAGGATGTGGTGCTTGAAAAATAGAGGACCCTTTTTTTATATATAGATGAATTTCACCGGCAACAAATTATGTCTCGGATTTTATCTGAGAGAAACAGAGAGCCTGTTGGATACTGACATCATTTTTGTAACATTTAATTGTGATTTAGCAGGCTGAGTTTAGTGTTTTATAATTTTTTGAGACAAAAATGATTATATATCAATACGAG

mRNA sequence

CTCAATTGGGTCTCTTCTTTCAAACCCATCTAGAGAGAGAGAGGTTGTGAAAATGGCGAGACTGATCAGAGAAATTGGTAGTGTTAATGGAAGTTTTGGACTCGCCGGCGACATTTATTTGCTGTTTTGGGCGGCTCTGTTTACACTCTGCATAATCTCAACCATAATTTTCTCTTGCTCCGACGGCATGTCAAAAGAGAGGAATTCCACGGCGGACGTCGAGCTTTACGGCGGCGGTTGCGCCGCCGGGTGTGGCGCAGGATGTGGTGCTTGAAAAATAGAGGACCCTTTTTTTATATATAGATGAATTTCACCGGCAACAAATTATGTCTCGGATTTTATCTGAGAGAAACAGAGAGCCTGTTGGATACTGACATCATTTTTGTAACATTTAATTGTGATTTAGCAGGCTGAGTTTAGTGTTTTATAATTTTTTGAGACAAAAATGATTATATATCAATACGAG

Coding sequence (CDS)

ATGGCGAGACTGATCAGAGAAATTGGTAGTGTTAATGGAAGTTTTGGACTCGCCGGCGACATTTATTTGCTGTTTTGGGCGGCTCTGTTTACACTCTGCATAATCTCAACCATAATTTTCTCTTGCTCCGACGGCATGTCAAAAGAGAGGAATTCCACGGCGGACGTCGAGCTTTACGGCGGCGGTTGCGCCGCCGGGTGTGGCGCAGGATGTGGTGCTTGA

Protein sequence

MARLIREIGSVNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYGGGCAAGCGAGCGA
Homology
BLAST of Tan0000472 vs. NCBI nr
Match: KAG6584341.1 (hypothetical protein SDJN03_20273, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 133.3 bits (334), Expect = 8.7e-28
Identity = 68/74 (91.89%), Postives = 70/74 (94.59%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMS-KERNSTADVELY 60
          MARL+RE GS NGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMS K+RNSTADVELY
Sbjct: 1  MARLMREFGSQNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMSTKDRNSTADVELY 60

Query: 61 GGGCAAGCGAGCGA 74
          G GCAAGCGAGCGA
Sbjct: 61 GAGCAAGCGAGCGA 74

BLAST of Tan0000472 vs. NCBI nr
Match: KAE8652552.1 (hypothetical protein Csa_013579 [Cucumis sativus])

HSP 1 Score: 125.2 bits (313), Expect = 2.4e-25
Identity = 63/74 (85.14%), Postives = 67/74 (90.54%), Query Frame = 0

Query: 1   MARLIREIGSVNGSFGLA-GDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELY 60
           MARL+REI S NGSFGLA G+ Y LFW ALFTLCIIST+IFSCSDGMSK+RNST DVELY
Sbjct: 29  MARLMREISSQNGSFGLAGGETYWLFWVALFTLCIISTLIFSCSDGMSKDRNSTVDVELY 88

Query: 61  GGGCAAGCGAGCGA 74
           GGGCAAGCGAGCGA
Sbjct: 89  GGGCAAGCGAGCGA 102

BLAST of Tan0000472 vs. NCBI nr
Match: KAG6572949.1 (hypothetical protein SDJN03_26836, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 117.1 bits (292), Expect = 6.4e-23
Identity = 60/69 (86.96%), Postives = 63/69 (91.30%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYG 60
          MARLI EIGSV+GSFGL+GDI  LF AALFTLCIIS I FSCSDG+SKERNSTADVELYG
Sbjct: 1  MARLIGEIGSVDGSFGLSGDISCLFLAALFTLCIISLITFSCSDGISKERNSTADVELYG 60

Query: 61 GGCAAGCGA 70
          GGCAAGCGA
Sbjct: 61 GGCAAGCGA 69

BLAST of Tan0000472 vs. NCBI nr
Match: XP_015882794.1 (uncharacterized protein LOC107418606 [Ziziphus jujuba])

HSP 1 Score: 94.4 bits (233), Expect = 4.5e-16
Identity = 49/73 (67.12%), Postives = 59/73 (80.82%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYG 60
          M RL R++ SV+GS G AG  +++ W AL  +CIIS IIFSC+DG+SKE+ STAD ELYG
Sbjct: 1  MVRLWRDLASVHGS-GSAG--FVVLWLALLGVCIISAIIFSCADGVSKEKTSTADTELYG 60

Query: 61 GGCAAGCGAGCGA 74
          GGCAAGCGAGCGA
Sbjct: 61 GGCAAGCGAGCGA 70

BLAST of Tan0000472 vs. NCBI nr
Match: PON38946.1 (hypothetical protein PanWU01x14_308610 [Parasponia andersonii])

HSP 1 Score: 88.6 bits (218), Expect = 2.5e-14
Identity = 46/74 (62.16%), Postives = 53/74 (71.62%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGL--AGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVEL 60
          M RL RE   V+G  G   +G   + F   L TLCI+STIIFSC+DG+SKE+ S AD EL
Sbjct: 1  MVRLWREAAGVDGGVGSSHSGTFLVFFLGLLVTLCILSTIIFSCTDGVSKEKTSQADTEL 60

Query: 61 YGGGCAAGCGAGCG 73
          YGGGCAAGCGAGCG
Sbjct: 61 YGGGCAAGCGAGCG 74

BLAST of Tan0000472 vs. ExPASy TrEMBL
Match: A0A0A0LQU7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G042580 PE=4 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.1e-25
Identity = 63/74 (85.14%), Postives = 67/74 (90.54%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLA-GDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELY 60
          MARL+REI S NGSFGLA G+ Y LFW ALFTLCIIST+IFSCSDGMSK+RNST DVELY
Sbjct: 1  MARLMREISSQNGSFGLAGGETYWLFWVALFTLCIISTLIFSCSDGMSKDRNSTVDVELY 60

Query: 61 GGGCAAGCGAGCGA 74
          GGGCAAGCGAGCGA
Sbjct: 61 GGGCAAGCGAGCGA 74

BLAST of Tan0000472 vs. ExPASy TrEMBL
Match: A0A6P3ZT85 (uncharacterized protein LOC107418606 OS=Ziziphus jujuba OX=326968 GN=LOC107418606 PE=4 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 2.2e-16
Identity = 49/73 (67.12%), Postives = 59/73 (80.82%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYG 60
          M RL R++ SV+GS G AG  +++ W AL  +CIIS IIFSC+DG+SKE+ STAD ELYG
Sbjct: 1  MVRLWRDLASVHGS-GSAG--FVVLWLALLGVCIISAIIFSCADGVSKEKTSTADTELYG 60

Query: 61 GGCAAGCGAGCGA 74
          GGCAAGCGAGCGA
Sbjct: 61 GGCAAGCGAGCGA 70

BLAST of Tan0000472 vs. ExPASy TrEMBL
Match: A0A803R005 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 1.8e-15
Identity = 46/74 (62.16%), Postives = 55/74 (74.32%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAA-LFTLCIISTIIFSCSDGMSKERNSTADVELY 60
          M RL RE+ +V+G  G     +L+F    L TLCI+STIIFSC+DG+SKE+ S  D ELY
Sbjct: 1  MVRLWREVANVDGGVGTGSGAFLIFLLGFLVTLCILSTIIFSCADGVSKEKTSQGDTELY 60

Query: 61 GGGCAAGCGAGCGA 74
          GGGCAAGCGAGCGA
Sbjct: 61 GGGCAAGCGAGCGA 74

BLAST of Tan0000472 vs. ExPASy TrEMBL
Match: A0A803R004 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 1.8e-15
Identity = 46/74 (62.16%), Postives = 55/74 (74.32%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGLAGDIYLLFWAA-LFTLCIISTIIFSCSDGMSKERNSTADVELY 60
          M RL RE+ +V+G  G     +L+F    L TLCI+STIIFSC+DG+SKE+ S  D ELY
Sbjct: 1  MVRLWREVANVDGGVGTGSGAFLIFLLGFLVTLCILSTIIFSCADGVSKEKTSQGDTELY 60

Query: 61 GGGCAAGCGAGCGA 74
          GGGCAAGCGAGCGA
Sbjct: 61 GGGCAAGCGAGCGA 74

BLAST of Tan0000472 vs. ExPASy TrEMBL
Match: A0A2P5AQX4 (Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_308610 PE=4 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 1.2e-14
Identity = 46/74 (62.16%), Postives = 53/74 (71.62%), Query Frame = 0

Query: 1  MARLIREIGSVNGSFGL--AGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVEL 60
          M RL RE   V+G  G   +G   + F   L TLCI+STIIFSC+DG+SKE+ S AD EL
Sbjct: 1  MVRLWREAAGVDGGVGSSHSGTFLVFFLGLLVTLCILSTIIFSCTDGVSKEKTSQADTEL 60

Query: 61 YGGGCAAGCGAGCG 73
          YGGGCAAGCGAGCG
Sbjct: 61 YGGGCAAGCGAGCG 74

BLAST of Tan0000472 vs. TAIR 10
Match: AT1G68238.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 48.5 bits (114), Expect = 2.6e-06
Identity = 24/54 (44.44%), Postives = 33/54 (61.11%), Query Frame = 0

Query: 19 GDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYGGGCAAGCGAGCG 73
          G++ L  WAA+    +I+ +IFSCSD  SK   +    ++ G  CAAGCG GCG
Sbjct: 14 GEVSLYIWAAVVAFSVIAAVIFSCSDRASKPHTND---DVNGSSCAAGCGGGCG 64

BLAST of Tan0000472 vs. TAIR 10
Match: AT3G18250.1 (Putative membrane lipoprotein )

HSP 1 Score: 44.3 bits (103), Expect = 4.9e-05
Identity = 24/56 (42.86%), Postives = 36/56 (64.29%), Query Frame = 0

Query: 18 AGDIYLLFWAALFTLCIISTIIFSCSDGMSKERNSTADVELYGGGC-AAGCGAGCG 73
          A  ++ + + A+   CI+S ++FSC+DG+S  R +T+     GGGC  AGCG GCG
Sbjct: 21 ASYLFHVVFLAVIGCCILSALLFSCADGVSDNR-ATSGTSTGGGGCGGAGCGGGCG 75

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6584341.18.7e-2891.89hypothetical protein SDJN03_20273, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAE8652552.12.4e-2585.14hypothetical protein Csa_013579 [Cucumis sativus][more]
KAG6572949.16.4e-2386.96hypothetical protein SDJN03_26836, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_015882794.14.5e-1667.12uncharacterized protein LOC107418606 [Ziziphus jujuba][more]
PON38946.12.5e-1462.16hypothetical protein PanWU01x14_308610 [Parasponia andersonii][more]
Match NameE-valueIdentityDescription
A0A0A0LQU71.1e-2585.14Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G042580 PE=4 SV=1[more]
A0A6P3ZT852.2e-1667.12uncharacterized protein LOC107418606 OS=Ziziphus jujuba OX=326968 GN=LOC10741860... [more]
A0A803R0051.8e-1562.16Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803R0041.8e-1562.16Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A2P5AQX41.2e-1462.16Uncharacterized protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_308610 PE... [more]
Match NameE-valueIdentityDescription
AT1G68238.12.6e-0644.44unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT3G18250.14.9e-0542.86Putative membrane lipoprotein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR37199TRANSMEMBRANE PROTEINcoord: 1..73
NoneNo IPR availablePANTHERPTHR37199:SF2MEMBRANE LIPOPROTEIN-RELATEDcoord: 1..73

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0000472.1Tan0000472.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane