Tan0016788 (gene) Snake gourd v1

Overview
Name: Tan0016788
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Location: LG01: 88109492 .. 88110061 (+)
RNA-Seq Expression: Tan0016788
Synteny: Tan0016788
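
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome browsers, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(text):
    """Parse a location string such as 'LG01: 88109492 .. 88110061 (+)'.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),    # linkage group / chromosome name
        "start": start,
        "end": end,
        "strand": m.group(4),   # '+' or '-'
        "length": end - start + 1,
    }


loc = parse_location("LG01: 88109492 .. 88110061 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # LG01 + 570
```

Under that assumption the gene spans 570 bp on the plus strand of LG01.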
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGAATGCAAGGTCAGAACTCTAAAAAAACAATACAATACTATTGCAGAGATGCTTAGTAATGCATGCAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGAAGAGAAGGAGGTGTTCGATGCATGGGTTAAGGTGAGATAATAATATTATAGCATATTGTTATTATCGTTCATTGTACGTATGTGTAATATATCTATTCACATGCAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGCCATTTCCGCAATATGATGACCTCGCATTTGTGTTCGAAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGTAGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACACTCCTACTAGCATGCGTAATACATCTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAATGA

mRNA sequence

ATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGAATGCAAGGTCAGAACTCTAAAAAAACAATACAATACTATTGCAGAGATGCTTAGTAATGCATGCAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGAAGAGAAGGAGGTGTTCGATGCATGGGTTAAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGCCATTTCCGCAATATGATGACCTCGCATTTGTGTTCGAAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGTAGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACACTCCTACTAGCATGCGTAATACATCTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAATGA

Coding sequence (CDS)

ATGCTAGCTGAGAAATTACCAAATTCATGCCTAGAACAAAACACAATCGAATGCAAGGTCAGAACTCTAAAAAAACAATACAATACTATTGCAGAGATGCTTAGTAATGCATGCAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTTGAGGAAGAGAAGGAGGTGTTCGATGCATGGGTTAAGAGCCATACAAACGCAAAGGGGATGAGGAATAAGCCATTTCCGCAATATGATGACCTCGCATTTGTGTTCGAAAAAGATAGAGCTACAGGAATAGGCGCAGAGACCCCAATGGAAATGGCATCTAGCGCTGTAGAACAAATGGAGGAGGAGATTCGTTTGGGATCACAAGACTTCATGGGAGTGGAACAACGAACAATGGAGAATCTAAGAATTGGTGACATAGGGGAAGATGACTTGCCAGACACTCCTACTAGCATGCGTAATACATCTGGCATGTCTTCTAGATGTATTGGGAGCAAAAGAAAATGA

Protein sequence

MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQDFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTSGMSSRCIGSKRK
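
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last five codons of the CDS listed above.
print(translate("ATGCTAGCTGAGAAA"))  # MLAEK
print(translate("AGCAAAAGAAAATGA"))  # SKRK
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.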
Homology
BLAST of Tan0016788 vs. NCBI nr
Match: XP_038902479.1 (uncharacterized protein At2g29880-like [Benincasa hispida])

HSP 1 Score: 161.0 bits (406), Expect = 8.7e-36
Identity = 88/164 (53.66%), Positives = 108/164 (65.85%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAW 60
           +L EK+P   L QNTIECKVR+LKKQYN ++EMLS   SGF WNEEFKCV+ E+E+FD W
Sbjct: 50  ILHEKVPGCTLNQNTIECKVRSLKKQYNIVSEMLSQ--SGFDWNEEFKCVQVEREIFDLW 109

Query: 61  VKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQDF 120
           V SH NAK M NKPFP YDD + VF KDR  G  +E P  MA++A  + E+EIRLGSQD 
Sbjct: 110 VLSHPNAKRMWNKPFPHYDDFSTVFGKDRVVGKSSEDPYVMATNAFREFEDEIRLGSQDC 169

Query: 121 MGVEQRTMENLRIGDIGEDDLPDTPTSMRNTSGMSSRCIGSKRK 165
              E R  E+    D  +++  +  T   +    SSR  GSKRK
Sbjct: 170 QTPEVRQTESPLNQDEIDEEPAEQSTGRASVPAKSSR--GSKRK 209
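
The percentages in the HSP summary line follow from the counts shown beside them. A small sketch, assuming each percentage is the count divided by the alignment length (gaps included), rounded to two decimals; `percent` is an illustrative helper.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the first HSP against XP_038902479.1.
identity = percent(88, 164)    # identical residues / alignment length
positives = percent(108, 164)  # identical plus conservative substitutions

print(identity, positives)  # 53.66 65.85
```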

BLAST of Tan0016788 vs. NCBI nr
Match: XP_038880837.1 (uncharacterized protein LOC120072528 [Benincasa hispida])

HSP 1 Score: 152.1 bits (383), Expect = 4.1e-33
Identity = 74/119 (62.18%), Positives = 89/119 (74.79%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAW 60
           +L EK+P   L QNTIECKVR+LKKQYN ++EMLS   SGFGWNEEFKCV+ E+E+ D W
Sbjct: 13  ILHEKVPGCALNQNTIECKVRSLKKQYNAVSEMLSQ--SGFGWNEEFKCVQVEREILDLW 72

Query: 61  VKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQD 120
           V+SH NAK M NK F  YDDL+ VF KDR  G  +E P  MA++A  + E+EIRLGSQD
Sbjct: 73  VRSHPNAKEMWNKSFSHYDDLSTVFGKDRVVGQSSEDPYVMATNAFREFEDEIRLGSQD 129

BLAST of Tan0016788 vs. NCBI nr
Match: XP_038889264.1 (uncharacterized protein At2g29880-like [Benincasa hispida])

HSP 1 Score: 150.6 bits (379), Expect = 1.2e-32
Identity = 71/118 (60.17%), Positives = 90/118 (76.27%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAW 60
           +  EK+P+  +  +TIECKVR LK+QY  I EMLSNAC+GFGWN+EFKCV+ EKEVFD W
Sbjct: 50  LFTEKIPSCSIRLSTIECKVRFLKRQYCAIVEMLSNACNGFGWNDEFKCVQVEKEVFDVW 109

Query: 61  VKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQ 119
           V SH N KG+R+KPFP  D+L+ VF KDRAT  G++TP + AS+  E + E+IRL SQ
Sbjct: 110 VWSHPNVKGLRHKPFPHCDELSIVFGKDRATSEGSKTPFDQASATDEHL-EDIRLKSQ 166

BLAST of Tan0016788 vs. NCBI nr
Match: XP_038875070.1 (uncharacterized protein LOC120067596 [Benincasa hispida])

HSP 1 Score: 150.6 bits (379), Expect = 1.2e-32
Identity = 77/145 (53.10%), Positives = 100/145 (68.97%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAW 60
           +L EK+P   L QNTI+CKVR+LKKQYN ++EMLS   S F WNEEFKCV+ E+E+F+ W
Sbjct: 50  ILHEKVPGCTLNQNTIKCKVRSLKKQYNAVSEMLSQ--SRFDWNEEFKCVQVEREIFNLW 109

Query: 61  VKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQDF 120
           V+SH N KGM NK F  YDDL+ VF KDRA G  +E P  MA++A  + E+EIRLGSQD 
Sbjct: 110 VQSHPNLKGMWNKSFSHYDDLSTVFRKDRAVGQSSEDPYVMATNAFREFEDEIRLGSQDC 169

Query: 121 MGVEQRTMENLRIGDIGEDDLPDTP 146
              E R  E+     + +D++ + P
Sbjct: 170 HTPEVRQTES----PLNQDEIDEEP 188

BLAST of Tan0016788 vs. NCBI nr
Match: XP_038895773.1 (uncharacterized protein LOC120083935 [Benincasa hispida])

HSP 1 Score: 147.9 bits (372), Expect = 7.6e-32
Identity = 75/128 (58.59%), Positives = 90/128 (70.31%), Query Frame = 0

Query: 4   EKLPNSCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKS 63
           EK+    L QNTIECKVR+LKKQ N ++EMLS   SGF WNEEFKCV+ E+E+FD WV+S
Sbjct: 91  EKVLGCALNQNTIECKVRSLKKQCNAVSEMLSQ--SGFDWNEEFKCVQVEREIFDPWVRS 150

Query: 64  HTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQDFMGV 123
           H NAKGM NKPFP YDDL+ VF K +A G  +E P  M ++A  + E+EIRLGSQD    
Sbjct: 151 HPNAKGMWNKPFPHYDDLSTVFGKYKAVGQSSEDPYVMTTNAFREFEDEIRLGSQDCHTP 210

Query: 124 EQRTMENL 132
           E   M  L
Sbjct: 211 ESTHMGRL 216

BLAST of Tan0016788 vs. ExPASy TrEMBL
Match: A0A5A7U0H7 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G002060 PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 6.1e-27
Identity = 70/168 (41.67%), Positives = 103/168 (61.31%), Query Frame = 0

Query: 1   MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDA 60
           M+AEKLP + + E +TI+C V++LKK Y+ IAEM   +CSGFGWNEEF+C+  E+++FD+
Sbjct: 51  MMAEKLPGTNIQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDS 110

Query: 61  WVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQD 120
           W+KSH  AKG+ +K FP YDDL++VF KDRATG  +ET   + S+      + I LG   
Sbjct: 111 WIKSHPAAKGLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSH 170

Query: 121 FMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR 164
              +     + +    +  D++        +  RN S +S R  GS+R
Sbjct: 171 DEDIPTMYSQGVH---MSPDEMFGIRAGQASERRNCSSVSKRKRGSER 215

BLAST of Tan0016788 vs. ExPASy TrEMBL
Match: A0A1S3B4L3 (uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 6.1e-27
Identity = 70/168 (41.67%), Positives = 103/168 (61.31%), Query Frame = 0

Query: 1   MLAEKLPNSCL-EQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDA 60
           M+AEKLP + + E +TI+C V++LKK Y+ IAEM   +CSGFGWNEEF+C+  E+++FD+
Sbjct: 51  MMAEKLPGTNIQESSTIDCHVKSLKKTYHAIAEMRGPSCSGFGWNEEFQCIIAERDLFDS 110

Query: 61  WVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRLGSQD 120
           W+KSH  AKG+ +K FP YDDL++VF KDRATG  +ET   + S+      + I LG   
Sbjct: 111 WIKSHPAAKGLLHKSFPYYDDLSYVFGKDRATGARSETFPNVGSNVSNMFNDTIPLGDSH 170

Query: 121 FMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR 164
              +     + +    +  D++        +  RN S +S R  GS+R
Sbjct: 171 DEDIPTMYSQGVH---MSPDEMFGIRAGQASERRNCSSVSKRKRGSER 215

BLAST of Tan0016788 vs. ExPASy TrEMBL
Match: A0A5D3D9Q6 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold708G00360 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 3.0e-26
Identity = 64/154 (41.56%), Positives = 101/154 (65.58%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNTI-ECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDA 60
           M+AEKLP   +   T+ +C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD 
Sbjct: 51  MMAEKLPGCQVRATTVFDCRIKTLKRIFQAIAEMQGQACSGFGWNDEEKCIIAEKELFDN 110

Query: 61  WVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEE-EIRLGSQ 120
           WV+SH  AKG+ NKPFP YD+L +VF++DRATG  A+T  ++ S+  ++ +  ++R G++
Sbjct: 111 WVRSHPVAKGLLNKPFPYYDELTYVFDRDRATGRFAKTFADVGSNEPDEYDRFDMRNGNE 170

Query: 121 DFMGVEQRTMENLRIGDIGEDDLPDTPTSMRNTS 153
           DF  V + +    + G   + D+     ++  T+
Sbjct: 171 DFPPVTESSGSKRKRGSPRDLDVEGIHLALDQTN 204

BLAST of Tan0016788 vs. ExPASy TrEMBL
Match: A0A5A7TC56 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold84G00560 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 3.0e-26
Identity = 74/170 (43.53%), Positives = 103/170 (60.59%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDA 60
           M+AEKLP   +   T I+C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD 
Sbjct: 51  MMAEKLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDN 110

Query: 61  WVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRL--GS 120
           WV+SH  AKG+ NKPFP YD+L +VF +DRATG  AET  ++ S+      +   +  G+
Sbjct: 111 WVRSHPAAKGLLNKPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGN 170

Query: 121 QDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR 164
           +DF  V  + +      DI +DD+    P   +  R  S  S R  GS+R
Sbjct: 171 EDFSPVYSQGV------DISQDDVRASRPSRASEGRTGSSGSKRKRGSQR 214

BLAST of Tan0016788 vs. ExPASy TrEMBL
Match: A0A5D3CWL2 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G00640 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 4.0e-26
Identity = 74/170 (43.53%), Positives = 103/170 (60.59%), Query Frame = 0

Query: 1   MLAEKLPNSCLEQNT-IECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDA 60
           M+AEKLP   +   T I+C+++TLK+ +  IAEM   ACSGFGWN+E KC+  EKE+FD 
Sbjct: 51  MMAEKLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDN 110

Query: 61  WVKSHTNAKGMRNKPFPQYDDLAFVFEKDRATGIGAETPMEMASSAVEQMEEEIRL--GS 120
           WV+SH  AKG+ NKPFP YD+L +VF +DRATG  AET  ++ S+      +   +  G+
Sbjct: 111 WVRSHPAAKGLLNKPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDMGDGN 170

Query: 121 QDFMGVEQRTMENLRIGDIGEDDL----PDTPTSMRNTSGMSSRCIGSKR 164
           +DF  V  + +      DI +DD+    P   +  R  S  S R  GS+R
Sbjct: 171 EDFPPVYSQGV------DISQDDVRASRPSRASDGRTGSSGSKRKRGSQR 214

BLAST of Tan0016788 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 1.7e-05
Identity = 20/73 (27.40%), Positives = 42/73 (57.53%), Query Frame = 0

Query: 9   SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAK 68
           S  + + ++ + ++L++Q+N I  +L +   GF W+ E + V  +  V+  ++K+H +A+
Sbjct: 230 SNFDVDVLKNRYKSLRRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDAR 289

Query: 69  GMRNKPFPQYDDL 82
               +P P Y DL
Sbjct: 290 QFMTRPIPYYKDL 300

BLAST of Tan0016788 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 47.0 bits (110), Expect = 1.7e-05
Identity = 20/73 (27.40%), Positives = 42/73 (57.53%), Query Frame = 0

Query: 9   SCLEQNTIECKVRTLKKQYNTIAEMLSNACSGFGWNEEFKCVEEEKEVFDAWVKSHTNAK 68
           S  + + ++ + ++L++Q+N I  +L +   GF W+ E + V  +  V+  ++K+H +A+
Sbjct: 230 SNFDVDVLKNRYKSLRRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDAR 289

Query: 69  GMRNKPFPQYDDL 82
               +P P Y DL
Sbjct: 290 QFMTRPIPYYKDL 300

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name       E-value   Identity (%)   Description
XP_038902479.1   8.7e-36   53.66          uncharacterized protein At2g29880-like [Benincasa hispida]
XP_038880837.1   4.1e-33   62.18          uncharacterized protein LOC120072528 [Benincasa hispida]
XP_038889264.1   1.2e-32   60.17          uncharacterized protein At2g29880-like [Benincasa hispida]
XP_038875070.1   1.2e-32   53.10          uncharacterized protein LOC120067596 [Benincasa hispida]
XP_038895773.1   7.6e-32   58.59          uncharacterized protein LOC120083935 [Benincasa hispida]

vs. ExPASy TrEMBL
Match Name   E-value   Identity (%)   Description
A0A5A7U0H7   6.1e-27   41.67          Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B4L3   6.1e-27   41.67          uncharacterized protein LOC103485953 OS=Cucumis melo OX=3656 GN=LOC103485953 PE=...
A0A5D3D9Q6   3.0e-26   41.56          Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7TC56   3.0e-26   43.53          Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3CWL2   4.0e-26   43.53          Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

vs. TAIR 10
Match Name    E-value   Identity (%)   Description
AT4G02210.1   1.7e-05   27.40          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G02210.2   1.7e-05   27.40          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term   Source Description                                 Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction                                coord: 135..164
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction                                coord: 144..158
None       No IPR available   PANTHER       PTHR46250     MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED   coord: 1..127

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0016788.1   Tan0016788.1   mRNA