Tan0004400 (gene) Snake gourd v1

Overview
Name: Tan0004400
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Patatin
Location: LG03: 77281548 .. 77282142 (-)
RNA-Seq Expression: Tan0004400
Synteny: Tan0004400
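
The Location field packs the linkage group, coordinates, and strand into one string. As a minimal sketch, it can be split and the feature length derived as follows. The helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive (GFF-style), which this record does not state explicitly.

```python
import re

# Location string as shown in the Overview. Coordinates are ASSUMED to be
# 1-based and inclusive (GFF-style); the record itself does not say so.
LOCATION = "LG03: 77281548 .. 77282142 (-)"


def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its parts (hypothetical helper)."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both end points count towards the length.
        "length": end - start + 1,
    }


loc = parse_location(LOCATION)
print(loc["seqid"], loc["strand"], loc["length"])  # LG03 - 595
```

Under that assumption the feature spans 595 bp on the minus strand of LG03.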
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTAAACTCAAAGCAATCCGAAAACAAAAGTCCTCGAGAAAAGGACCCTGTGAAGGCAGTGCCACTTTCACTTCCTTTTGAGTAAAATTTATAAAGGGATGAGCAGAAGCAGTAGCTTCTCTGAGAAAATGGGTGCTGTTGCTCCACTTGCAATTGGCACGCGAGGCACTGTGGGGTCTCTGGTCATGAAGGAGATTGAGTACTTCACCAAGCTTGAGCTCGAACGCCATGGCGCCTCTCACAGGATAAGTGGAGATGCTTCCAGGAGAAGCGATTCTAAGACAAGCTTTTGGCTTTTGTCGTTGACTTGGAAGTGGAAGAAGAGAAGAACCAACAATGGGATTTTACCTAACATTTGCTCTGCTGTTGAATTGTCGAAAAGCAATCGGTTTAATGGGATTCCTGGTTTTGGTTACAGGATCCTCAAAGACGATGTCAACCACTTTCCCATTTGATTTCAGCCTTCAAATTCAAATGGGTCTTGTTTGCTTTTTGTTGAAGTGCTTTTCCTTGCTCTGTTTGTATTTCATTTTAAGGTTTGCTACGTTTTGACTGGAAAAATGAAATAAGAGTTCGATTGCTGATTTGATTGAAA

mRNA sequence

CCTAAACTCAAAGCAATCCGAAAACAAAAGTCCTCGAGAAAAGGACCCTGTGAAGGCAGTGCCACTTTCACTTCCTTTTGAGTAAAATTTATAAAGGGATGAGCAGAAGCAGTAGCTTCTCTGAGAAAATGGGTGCTGTTGCTCCACTTGCAATTGGCACGCGAGGCACTGTGGGGTCTCTGGTCATGAAGGAGATTGAGTACTTCACCAAGCTTGAGCTCGAACGCCATGGCGCCTCTCACAGGATAAGTGGAGATGCTTCCAGGAGAAGCGATTCTAAGACAAGCTTTTGGCTTTTGTCGTTGACTTGGAAGTGGAAGAAGAGAAGAACCAACAATGGGATTTTACCTAACATTTGCTCTGCTGTTGAATTGTCGAAAAGCAATCGGTTTAATGGGATTCCTGGTTTTGGTTACAGGATCCTCAAAGACGATGTCAACCACTTTCCCATTTGATTTCAGCCTTCAAATTCAAATGGGTCTTGTTTGCTTTTTGTTGAAGTGCTTTTCCTTGCTCTGTTTGTATTTCATTTTAAGGTTTGCTACGTTTTGACTGGAAAAATGAAATAAGAGTTCGATTGCTGATTTGATTGAAA

Coding sequence (CDS)

ATGAGCAGAAGCAGTAGCTTCTCTGAGAAAATGGGTGCTGTTGCTCCACTTGCAATTGGCACGCGAGGCACTGTGGGGTCTCTGGTCATGAAGGAGATTGAGTACTTCACCAAGCTTGAGCTCGAACGCCATGGCGCCTCTCACAGGATAAGTGGAGATGCTTCCAGGAGAAGCGATTCTAAGACAAGCTTTTGGCTTTTGTCGTTGACTTGGAAGTGGAAGAAGAGAAGAACCAACAATGGGATTTTACCTAACATTTGCTCTGCTGTTGAATTGTCGAAAAGCAATCGGTTTAATGGGATTCCTGGTTTTGGTTACAGGATCCTCAAAGACGATGTCAACCACTTTCCCATTTGA

Protein sequence

MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDSKTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI
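
The protein sequence above is what the CDS yields when read codon by codon. The sketch below illustrates this on the first and last codons of the CDS only, using the standard genetic code (assumed to apply to this nuclear gene). The function name `translate` and the two excerpt constants are introduced here for illustration and are not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS above: its first 30 and last 21 nucleotides.
CDS_START = "ATGAGCAGAAGCAGTAGCTTCTCTGAGAAA"
CDS_END = "GTCAACCACTTTCCCATTTGA"

print(translate(CDS_START))  # MSRSSSFSEK
print(translate(CDS_END))    # VNHFPI
```

Both results match the corresponding ends of the protein sequence listed above; passing the full CDS to `translate` would be the natural way to check the whole record.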
Homology
BLAST of Tan0004400 vs. NCBI nr
Match: XP_022926407.1 (uncharacterized protein LOC111433571 [Cucurbita moschata] >KAG7026623.1 hypothetical protein SDJN02_10625, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 227.6 bits (579), Expect = 5.5e-56
Identity = 110/118 (93.22%), Positives = 115/118 (97.46%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFSEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           KTSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDV HFPI
Sbjct: 65  KTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVTHFPI 122
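
The Identity and Positives figures reported for this hit can be recovered from the middle line of the alignment: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a rough sketch, assuming the midline strings are copied without their leading padding; `score_midline` is a hypothetical helper, not part of BLAST.

```python
def score_midline(midline):
    """Count identities, positives, and columns in one BLAST midline segment."""
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    conservative = midline.count("+")                 # '+' = conservative substitution
    return identities, identities + conservative, len(midline)


# Midline segments of the XP_022926407.1 alignment, leading padding removed.
MIDLINES = [
    "+SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS",
    "KTSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDV HFPI",
]

identities = positives = length = 0
for segment in MIDLINES:
    i, p, n = score_midline(segment)
    identities += i
    positives += p
    length += n

print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
# Identity = 110/118 (93.22%)
# Positives = 115/118 (97.46%)
```

This hit has no gaps, so the alignment length equals the number of aligned residues; for gapped alignments the denominator is the number of alignment columns, gaps included.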

BLAST of Tan0004400 vs. NCBI nr
Match: XP_023003961.1 (uncharacterized protein LOC111497407 [Cucurbita maxima])

HSP 1 Score: 226.1 bits (575), Expect = 1.6e-55
Identity = 108/118 (91.53%), Positives = 115/118 (97.46%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFSEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           +TSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDD+ HFPI
Sbjct: 65  RTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDITHFPI 122

BLAST of Tan0004400 vs. NCBI nr
Match: KAG6594656.1 (hypothetical protein SDJN03_11209, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 225.7 bits (574), Expect = 2.1e-55
Identity = 109/118 (92.37%), Positives = 115/118 (97.46%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFSEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           KTSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILK+DV HFPI
Sbjct: 65  KTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKNDVTHFPI 122

BLAST of Tan0004400 vs. NCBI nr
Match: XP_023518155.1 (uncharacterized protein LOC111781699 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 225.7 bits (574), Expect = 2.1e-55
Identity = 109/118 (92.37%), Positives = 114/118 (96.61%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSF E+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFCEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           KTSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDV HFPI
Sbjct: 65  KTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVTHFPI 122

BLAST of Tan0004400 vs. NCBI nr
Match: XP_038882225.1 (uncharacterized protein LOC120073447 [Benincasa hispida])

HSP 1 Score: 211.8 bits (538), Expect = 3.1e-51
Identity = 106/118 (89.83%), Positives = 107/118 (90.68%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           MSRS SFSEKMG VAPLAIGTRGTVGSLVMKEIEYFTKLELERHG SH ISGDA RRSDS
Sbjct: 5   MSRSRSFSEKMGGVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGGSHTISGDALRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           K SFWLLSLTWKWKKR+ NNGILPNICSAVE SKSNRFNGIPGFGYRILKDD   FPI
Sbjct: 65  KGSFWLLSLTWKWKKRKNNNGILPNICSAVEFSKSNRFNGIPGFGYRILKDD---FPI 119

BLAST of Tan0004400 vs. ExPASy TrEMBL
Match: A0A6J1EEE9 (uncharacterized protein LOC111433571 OS=Cucurbita moschata OX=3662 GN=LOC111433571 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 2.6e-56
Identity = 110/118 (93.22%), Positives = 115/118 (97.46%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFSEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           KTSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDV HFPI
Sbjct: 65  KTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVTHFPI 122

BLAST of Tan0004400 vs. ExPASy TrEMBL
Match: A0A6J1KQQ8 (uncharacterized protein LOC111497407 OS=Cucurbita maxima OX=3661 GN=LOC111497407 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 7.7e-56
Identity = 108/118 (91.53%), Positives = 115/118 (97.46%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +SRSSSFSE+MGAVAPLAIGTRGT+GSLVMKEIEYFTKLELERHG S R+SGDASRRSDS
Sbjct: 5   ISRSSSFSEEMGAVAPLAIGTRGTMGSLVMKEIEYFTKLELERHGGSQRLSGDASRRSDS 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           +TSFWLLSL+WKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDD+ HFPI
Sbjct: 65  RTSFWLLSLSWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDITHFPI 122

BLAST of Tan0004400 vs. ExPASy TrEMBL
Match: A0A6J1CKN5 (uncharacterized protein LOC111012477 OS=Momordica charantia OX=3673 GN=LOC111012477 PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 9.1e-49
Identity = 101/120 (84.17%), Positives = 108/120 (90.00%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           +S S SFSEKMG+VAPLAIGTRGTVGSLVMKEIEYFTKLELERHG S RI+ +ASRR DS
Sbjct: 5   ISTSRSFSEKMGSVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGGSQRITAEASRRGDS 64

Query: 61  KTSFWLLSLTWKWKKRR--TNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           K SFWLLS TWKWKKRR  + NGILPNICSAVELSKSNRFNGIPGFGYRILK+DVN+F I
Sbjct: 65  KPSFWLLSSTWKWKKRRSGSTNGILPNICSAVELSKSNRFNGIPGFGYRILKEDVNYFTI 124

BLAST of Tan0004400 vs. ExPASy TrEMBL
Match: A0A6J1IJL8 (uncharacterized protein LOC111477686 OS=Cucurbita maxima OX=3661 GN=LOC111477686 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 3.5e-48
Identity = 100/118 (84.75%), Positives = 105/118 (88.98%), Query Frame = 0

Query: 1   MSRSSSFSEKMGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSDS 60
           MSRS SFSEKM  +APLAIGTRGTVGSLVMKEIEYFTK E+ERHG+S RISGDASRRSD 
Sbjct: 5   MSRSRSFSEKMDGIAPLAIGTRGTVGSLVMKEIEYFTKFEVERHGSSQRISGDASRRSDC 64

Query: 61  KTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           K SFWL+SLTWKWKKR+ NNGILPNICSAVELSKSNRFNGIPGF YRILK D   FPI
Sbjct: 65  KGSFWLVSLTWKWKKRKGNNGILPNICSAVELSKSNRFNGIPGFSYRILKHD---FPI 119

BLAST of Tan0004400 vs. ExPASy TrEMBL
Match: A0A0A0KL84 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514830 PE=4 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 4.5e-48
Identity = 103/119 (86.55%), Positives = 107/119 (89.92%), Query Frame = 0

Query: 1   MSRSSSFSEKM-GAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGASHRISGDASRRSD 60
           MSRS SFSEKM GAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHG SH ISG+A RRSD
Sbjct: 5   MSRSRSFSEKMGGAVAPLAIGTRGTVGSLVMKEIEYFTKLELERHGISHTISGNALRRSD 64

Query: 61  SKTSFWLLSLTWKWKKRRTNNGILPNICSAVELSKSNRFNGIPGFGYRILKDDVNHFPI 119
           S+ SFWLLSLTWKWKKR+ NNG+LPNI SAVE SKSNRFNGIPGFGYRILKDD   FPI
Sbjct: 65  SRGSFWLLSLTWKWKKRKGNNGVLPNISSAVEFSKSNRFNGIPGFGYRILKDD---FPI 120

BLAST of Tan0004400 vs. TAIR 10
Match: AT4G21780.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 83.2 bits (204), Expect = 1.6e-16
Identity = 49/109 (44.95%), Positives = 60/109 (55.05%), Query Frame = 0

Query: 15  APLAIGTRGTVGSLVMKEIEYFTKLEL------ERHGASHRISGDASR--RSDSKTSFWL 74
           AP+AIGTRGT+GSLV KEI+YF            R G S        +  RS S+   W 
Sbjct: 3   APIAIGTRGTIGSLVRKEIDYFKNFSTCHPQFDPRRGNSEENKNTFKQRDRSSSRLGSWF 62

Query: 75  LSLTWKWKKRRTNNG---ILPNICSAVELSKSNRFNGIPGFGYRILKDD 113
               W+ KKR+T  G     P++CSAVE+S  NR   +PGF YRILK D
Sbjct: 63  SKTKWRKKKRQTRGGGGKFFPSMCSAVEVSGENR---VPGFNYRILKSD 108

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022926407.1  5.5e-56  93.22         uncharacterized protein LOC111433571 [Cucurbita moschata] >KAG7026623.1 hypothet...
XP_023003961.1  1.6e-55  91.53         uncharacterized protein LOC111497407 [Cucurbita maxima]
KAG6594656.1    2.1e-55  92.37         hypothetical protein SDJN03_11209, partial [Cucurbita argyrosperma subsp. sorori...
XP_023518155.1  2.1e-55  92.37         uncharacterized protein LOC111781699 [Cucurbita pepo subsp. pepo]
XP_038882225.1  3.1e-51  89.83         uncharacterized protein LOC120073447 [Benincasa hispida]

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1EEE9      2.6e-56  93.22         uncharacterized protein LOC111433571 OS=Cucurbita moschata OX=3662 GN=LOC1114335...
A0A6J1KQQ8      7.7e-56  91.53         uncharacterized protein LOC111497407 OS=Cucurbita maxima OX=3661 GN=LOC111497407...
A0A6J1CKN5      9.1e-49  84.17         uncharacterized protein LOC111012477 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1IJL8      3.5e-48  84.75         uncharacterized protein LOC111477686 OS=Cucurbita maxima OX=3661 GN=LOC111477686...
A0A0A0KL84      4.5e-48  86.55         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G514830 PE=4 SV=1

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G21780.1     1.6e-16  44.95         unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description  Alignment
None      No IPR available  PANTHER  PTHR35131:SF1  EXPRESSED PROTEIN   coord: 3..116
None      No IPR available  PANTHER  PTHR35131      EXPRESSED PROTEIN   coord: 3..116

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0004400.1  Tan0004400.1  mRNA