Tan0013547 (gene) Snake gourd v1

Overview
Name: Tan0013547
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: LG06: 69648246 .. 69649266 (+)
RNA-Seq Expression: Tan0013547
Synteny: Tan0013547
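The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotation, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split a location such as 'LG06: 69648246 .. 69649266 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("LG06: 69648246 .. 69649266 (+)")
length = span_length(start, end)
print(seqid, strand, length)  # LG06 + 1021
```

Under that assumption the feature spans 1021 positions on the plus strand of LG06.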
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCTTGATCAAGCTCGAGAACCTTGCTCCTCTTTATCATGCAACCTCCCTATTCTCTCAAATTGCTAACAAAGTCGACCTGAAATTCTCGCGGTTGGCGTTCTCGATCATTGCTCGGCACCCGTCCCCTCGGTTCGATGCAGTTATGTTCATGATGCATCAATTATTTGCCAACTATTCTGTCGATCATCATCACATTTCAACTGTTTCCCTCCAAAACTTCCACAAGGCTATATTGGAAAGCCAAAAGTTTTCTTCACTGACCATCCAGCTTGCGGAACAAGCAAGTTGCATAAGCCTTTCATTTGACACTTCAAGTTAGTATATATAATAATACCTTTCATTCTCACTCTGGTTTCTTCTCTTCAAAATGAATCTTGATTTTTTTTTTCTTTTTTTGGGTCATTTCTTGATTCTTTTATGCATTCTGAGAATTTCTTTATGAACATGTTTGACCATTGAAGTCCTGTGGCTCTAAATATGAATCTTGTTTATCTGTTTCTAAGAACAATGTTTACCTTAAAAATCTGATGAAATTGGATTCATTTTTTCTTGATCGATCTTCCTTCAACTTCTTTAAAGCCAATCATTTCCTTGCTGTCTTTAGAATTAATATTTTCATTTTTTAATTGATCCCTCCCCATATAACTTTCTTTTTACTGTTTAACATTATTTTAGGGTATGTGCCAAGACTCGGCCATGAATTGACATTGTCACCCACAGAAAACGTGGATTTTGGTGAAATCTCCAATGCAAAATCTTTTTCAATTGACACAGAAGAGTTTAAACGCATTATAATAGCACTATCTAACTACGATGATCATACAAGTAACACTAATTCCTGATTTCTAATTTCCTTTTTCATATATTTAATTACTTAGTAAAAAAAGGTGGTGTATCCTTGTAAAACTTTGTCATCTTATATTGCAGTTTGTATTACTATAACCCATTCACAAGTCAAGTTCTCTGTTGCATCTGAGGAGATAATTCTTAGCAAAGAGGTATATGTTCACAAATAA

mRNA sequence

ATGTTCTTGATCAAGCTCGAGAACCTTGCTCCTCTTTATCATGCAACCTCCCTATTCTCTCAAATTGCTAACAAAGTCGACCTGAAATTCTCGCGGTTGGCGTTCTCGATCATTGCTCGGCACCCGTCCCCTCGGTTCGATGCAGTTATGTTCATGATGCATCAATTATTTGCCAACTATTCTGTCGATCATCATCACATTTCAACTGTTTCCCTCCAAAACTTCCACAAGGCTATATTGGAAAGCCAAAAGTTTTCTTCACTGACCATCCAGCTTGCGGAACAAGCAAGTTGCATAAGCCTTTCATTTGACACTTCAAGGTATGTGCCAAGACTCGGCCATGAATTGACATTGTCACCCACAGAAAACGTGGATTTTGGTGAAATCTCCAATGCAAAATCTTTTTCAATTGACACAGAAGAGTTTAAACGCATTATAATAGCACTATCTAACTACGATGATCATACAATTTGTATTACTATAACCCATTCACAAGTCAAGTTCTCTGTTGCATCTGAGGAGATAATTCTTAGCAAAGAGGTATATGTTCACAAATAA

Coding sequence (CDS)

ATGTTCTTGATCAAGCTCGAGAACCTTGCTCCTCTTTATCATGCAACCTCCCTATTCTCTCAAATTGCTAACAAAGTCGACCTGAAATTCTCGCGGTTGGCGTTCTCGATCATTGCTCGGCACCCGTCCCCTCGGTTCGATGCAGTTATGTTCATGATGCATCAATTATTTGCCAACTATTCTGTCGATCATCATCACATTTCAACTGTTTCCCTCCAAAACTTCCACAAGGCTATATTGGAAAGCCAAAAGTTTTCTTCACTGACCATCCAGCTTGCGGAACAAGCAAGTTGCATAAGCCTTTCATTTGACACTTCAAGGTATGTGCCAAGACTCGGCCATGAATTGACATTGTCACCCACAGAAAACGTGGATTTTGGTGAAATCTCCAATGCAAAATCTTTTTCAATTGACACAGAAGAGTTTAAACGCATTATAATAGCACTATCTAACTACGATGATCATACAATTTGTATTACTATAACCCATTCACAAGTCAAGTTCTCTGTTGCATCTGAGGAGATAATTCTTAGCAAAGAGGTATATGTTCACAAATAA

Protein sequence

MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANYSVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHELTLSPTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKEVYVHK
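
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; the function name `translate` is hypothetical. It is applied to the first and last few codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTTCTTGATCAAGCTCGAGAAC"  # first 8 codons of the CDS above
cds_end = "GTATATGTTCACAAATAA"          # last 6 codons, including the stop
print(translate(cds_start))  # MFLIKLEN
print(translate(cds_end))    # VYVHK
```

Both fragments agree with the start (MFLIKLEN...) and the end (...VYVHK) of the protein sequence listed above.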
Homology
BLAST of Tan0013547 vs. NCBI nr
Match: XP_016903187.1 (PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo])

HSP 1 Score: 161.4 bits (407), Expect = 7.5e-36
Identity = 83/180 (46.11%), Positives = 124/180 (68.89%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L++  PL+ ATS  +QIA + D+KF+ L FSIIA + SPRF A + M H  F NY
Sbjct: 3   MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINY 62

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHELTLSP 120
            VD+ H S +SL++FH A+L+     S+TI L      + L F++S + P++ HEL+L+P
Sbjct: 63  KVDNDHTSRISLESFHDALLDGGASPSMTIHLLANIKQLILRFESSSHAPKVHHELSLTP 122

Query: 121 TENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE 180
           ++  D GE+  AK FSID+++ +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Sbjct: 123 SQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKE 182
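
The percentages in each HSP summary line are the reported counts divided by the alignment length. As a minimal sketch (the helper name `percent` is hypothetical), the figures for the hit above and for the Benincasa hispida hit further down can be reproduced as follows:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


# XP_016903187.1: 83 identical and 124 positive positions over 180 aligned columns.
print(percent(83, 180))   # 46.11
print(percent(124, 180))  # 68.89

# XP_038875055.1: the alignment contains gaps, so the length is 185 columns.
print(percent(84, 185))   # 45.41
print(percent(118, 185))  # 63.78
```

The denominator is the number of alignment columns, gaps included, which is why it can exceed the number of query residues aligned.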

BLAST of Tan0013547 vs. NCBI nr
Match: XP_031744160.1 (uncharacterized protein LOC116404808 [Cucumis sativus])

HSP 1 Score: 154.5 bits (389), Expect = 9.2e-34
Identity = 80/180 (44.44%), Positives = 119/180 (66.11%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L+N  P +HATS  + IA + D+KF+ L FSI   +  PRF A + M +  F NY
Sbjct: 1   MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINY 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHELTLSP 120
            VD+ H S +SL++FH A+L+     S+TI L    + + L F++S + P++ HEL+L P
Sbjct: 61  KVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMILRFESSSHAPQVRHELSLKP 120

Query: 121 TENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE 180
           ++  D GEI  AK FSID++  +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Sbjct: 121 SQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKE 180

BLAST of Tan0013547 vs. NCBI nr
Match: XP_038875055.1 (uncharacterized protein LOC120067580 [Benincasa hispida])

HSP 1 Score: 148.7 bits (374), Expect = 5.1e-32
Identity = 84/185 (45.41%), Positives = 118/185 (63.78%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL+KL N  PL  ATS  +QI+N  D+KF+ L F +IA +PSPRF A + +  + F NY
Sbjct: 1   MFLVKLTNFEPLLDATSYLAQISNYADVKFTPLEFYLIAPYPSPRFVATLQLSQKCFTNY 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDT-SRYVPRLGHELTLS 120
           SVDH H S V L++FH AIL+   F+S+TI L E+ + + L F T S  +P L HELT S
Sbjct: 61  SVDHEHTSKVDLESFHDAILDGGSFASMTIHLLEKPNQMILRFQTPSSEIPPLHHELTFS 120

Query: 121 PTENVD---FGEISNAKSFSIDTEEFKRIIIALSNY-DDHTICITITHSQVKFSVASEEI 180
           P +  D    G++   K F + +E  +RII  L  + DD  +C+ +T SQ+KFS+AS+EI
Sbjct: 121 PPQLADNNIGGQLEEGKFFIVKSEALRRIIKELPIFQDDSVVCVGVTSSQIKFSIASKEI 180

BLAST of Tan0013547 vs. NCBI nr
Match: XP_008458682.1 (PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo])

HSP 1 Score: 144.4 bits (363), Expect = 9.5e-31
Identity = 81/181 (44.75%), Positives = 116/181 (64.09%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++LE   PL  ATSL +Q+A   D+KF+ L   II  + SP+F A + +  +LF N+
Sbjct: 1   MFLVRLEQFEPLIDATSLLAQVAKDADVKFTPLMLMIIVSNRSPQFVATLQLSRRLFTNF 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDT-SRYVPRLGHELTLS 120
           SVDH+  S VSLQ FH A+L+   FSS+TI L +  + + L F+T S  VP L HEL LS
Sbjct: 61  SVDHNKSSKVSLQPFHDAMLDGGSFSSMTIHLLDTTNQMVLRFETPSHDVPPLHHELALS 120

Query: 121 PTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSK 180
           P +  + G++     F++ + E +RII  L  +   T+ +T+T SQVKFS+ S+EIIL+K
Sbjct: 121 PPQAENLGQVEYGNFFTVTSRELRRIIKELPLFHQDTVSVTVTGSQVKFSIQSKEIILTK 180

BLAST of Tan0013547 vs. NCBI nr
Match: XP_023006010.1 (uncharacterized protein LOC111498887 [Cucurbita maxima])

HSP 1 Score: 141.7 bits (356), Expect = 6.2e-30
Identity = 80/181 (44.20%), Positives = 120/181 (66.30%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L +  PL  ATSL +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANY
Sbjct: 1   MFLVRLHHFDPLREATSLLAQISNEADLKFSSSKFSLITSYPSRRFVATFQISHRFFANY 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHE-LTLS 120
           SVD +H S VSLQ+F+ A+ +   FSS+TI   E  S + L F++S +     H  L LS
Sbjct: 61  SVDRNHSSRVSLQSFYDAMYDGIFFSSMTIHFPETTSRMVLQFESSNHTKLKMHRVLKLS 120

Query: 121 PTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSK 180
           P++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+K
Sbjct: 121 PSQEEELGQIQHDRFFSIISQDFRDIITGLPSFPNNSIFVSLTSSRVKFCCASEERILTK 180

BLAST of Tan0013547 vs. ExPASy TrEMBL
Match: A0A1S4E4N8 (uncharacterized protein LOC103502263 OS=Cucumis melo OX=3656 GN=LOC103502263 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 3.6e-36
Identity = 83/180 (46.11%), Positives = 124/180 (68.89%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L++  PL+ ATS  +QIA + D+KF+ L FSIIA + SPRF A + M H  F NY
Sbjct: 3   MFLVRLKDFDPLFDATSRLAQIAREADIKFTPLFFSIIASNRSPRFVAYLQMTHHCFINY 62

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHELTLSP 120
            VD+ H S +SL++FH A+L+     S+TI L      + L F++S + P++ HEL+L+P
Sbjct: 63  KVDNDHTSRISLESFHDALLDGGASPSMTIHLLANIKQLILRFESSSHAPKVHHELSLTP 122

Query: 121 TENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE 180
           ++  D GE+  AK FSID+++ +R+I  L  +   +IC+T T SQVKFS+AS+EI+L+KE
Sbjct: 123 SQEEDLGEVDYAKFFSIDSKDLRRVIRNLPIFHGDSICVTATGSQVKFSIASKEIVLTKE 182

BLAST of Tan0013547 vs. ExPASy TrEMBL
Match: A0A1S3C8J1 (uncharacterized protein LOC103498010 OS=Cucumis melo OX=3656 GN=LOC103498010 PE=4 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 4.6e-31
Identity = 81/181 (44.75%), Positives = 116/181 (64.09%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++LE   PL  ATSL +Q+A   D+KF+ L   II  + SP+F A + +  +LF N+
Sbjct: 1   MFLVRLEQFEPLIDATSLLAQVAKDADVKFTPLMLMIIVSNRSPQFVATLQLSRRLFTNF 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDT-SRYVPRLGHELTLS 120
           SVDH+  S VSLQ FH A+L+   FSS+TI L +  + + L F+T S  VP L HEL LS
Sbjct: 61  SVDHNKSSKVSLQPFHDAMLDGGSFSSMTIHLLDTTNQMVLRFETPSHDVPPLHHELALS 120

Query: 121 PTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSK 180
           P +  + G++     F++ + E +RII  L  +   T+ +T+T SQVKFS+ S+EIIL+K
Sbjct: 121 PPQAENLGQVEYGNFFTVTSRELRRIIKELPLFHQDTVSVTVTGSQVKFSIQSKEIILTK 180

BLAST of Tan0013547 vs. ExPASy TrEMBL
Match: A0A6J1KZ05 (uncharacterized protein LOC111498887 OS=Cucurbita maxima OX=3661 GN=LOC111498887 PE=4 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 3.0e-30
Identity = 80/181 (44.20%), Positives = 120/181 (66.30%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L +  PL  ATSL +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANY
Sbjct: 1   MFLVRLHHFDPLREATSLLAQISNEADLKFSSSKFSLITSYPSRRFVATFQISHRFFANY 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHE-LTLS 120
           SVD +H S VSLQ+F+ A+ +   FSS+TI   E  S + L F++S +     H  L LS
Sbjct: 61  SVDRNHSSRVSLQSFYDAMYDGIFFSSMTIHFPETTSRMVLQFESSNHTKLKMHRVLKLS 120

Query: 121 PTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSK 180
           P++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+K
Sbjct: 121 PSQEEELGQIQHDRFFSIISQDFRDIITGLPSFPNNSIFVSLTSSRVKFCCASEERILTK 180

BLAST of Tan0013547 vs. ExPASy TrEMBL
Match: A0A6J1CUU8 (uncharacterized protein LOC111014988 OS=Momordica charantia OX=3673 GN=LOC111014988 PE=4 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 3.9e-30
Identity = 77/180 (42.78%), Positives = 113/180 (62.78%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFLI+L+ +APL+ A    ++IA + D+KFS   F II    SP F A + M  + F ++
Sbjct: 1   MFLIRLQPIAPLFDAICSLTRIATRADVKFSPTKFCIIVSQISPPFIAALQMSPEFFTSF 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHELTLSP 120
           +VD +H S + L + H  +++ + + ++T  L E  + + L F+ SR +PR   EL LSP
Sbjct: 61  AVDGNHTSRICLDSLHSILMDGRLYPAMTFHLLENQNRLLLRFENSRNLPRGRRELDLSP 120

Query: 121 TENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSKE 180
           +E  D GEI      SI ++EF+ I+  LS Y +H IC T+T SQVKFSVA+EEIIL+KE
Sbjct: 121 SEEEDVGEIDYGNCVSIGSDEFRSIVTKLSAYFNHRICATLTDSQVKFSVANEEIILTKE 180

BLAST of Tan0013547 vs. ExPASy TrEMBL
Match: A0A6J1H2Z8 (uncharacterized protein LOC111460011 OS=Cucurbita moschata OX=3662 GN=LOC111460011 PE=4 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 4.3e-29
Identity = 78/181 (43.09%), Positives = 118/181 (65.19%), Query Frame = 0

Query: 1   MFLIKLENLAPLYHATSLFSQIANKVDLKFSRLAFSIIARHPSPRFDAVMFMMHQLFANY 60
           MFL++L +  PL  ATS+ +QI+N+ DLKFS   FS+I  +PS RF A   + H+ FANY
Sbjct: 1   MFLVRLHHFDPLMEATSILAQISNEADLKFSSSKFSLITSYPSHRFVATFQISHRFFANY 60

Query: 61  SVDHHHISTVSLQNFHKAILESQKFSSLTIQLAEQASCISLSFDTSRYVPRLGHE-LTLS 120
            VD +H S VSLQ+F+ A+     FSS+TI   E  S + L F++S +     H  L LS
Sbjct: 61  FVDRNHSSRVSLQSFYNAMYAGIVFSSMTIHFPETTSRMVLQFESSNHTRMQMHRVLKLS 120

Query: 121 PTENVDFGEISNAKSFSIDTEEFKRIIIALSNYDDHTICITITHSQVKFSVASEEIILSK 180
           P++  + G+I + + FSI +++F+ II  L ++ +++I +++T S+VKF  ASEE IL+K
Sbjct: 121 PSQEEELGQIQHDRFFSIISQDFRDIITGLPSFPNNSIFVSLTSSRVKFCWASEERILTK 180

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity  Description
XP_016903187.1  7.5e-36  46.11     PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo]
XP_031744160.1  9.2e-34  44.44     uncharacterized protein LOC116404808 [Cucumis sativus]
XP_038875055.1  5.1e-32  45.41     uncharacterized protein LOC120067580 [Benincasa hispida]
XP_008458682.1  9.5e-31  44.75     PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]
XP_023006010.1  6.2e-30  44.20     uncharacterized protein LOC111498887 [Cucurbita maxima]

ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A1S4E4N8  3.6e-36  46.11     uncharacterized protein LOC103502263 OS=Cucumis melo OX=3656 GN=LOC103502263 PE=...
A0A1S3C8J1  4.6e-31  44.75     uncharacterized protein LOC103498010 OS=Cucumis melo OX=3656 GN=LOC103498010 PE=...
A0A6J1KZ05  3.0e-30  44.20     uncharacterized protein LOC111498887 OS=Cucurbita maxima OX=3661 GN=LOC111498887...
A0A6J1CUU8  3.9e-30  42.78     uncharacterized protein LOC111014988 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A6J1H2Z8  4.3e-29  43.09     uncharacterized protein LOC111460011 OS=Cucurbita moschata OX=3662 GN=LOC1114600...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description  Alignment
None      No IPR available  GENE3D       3.70.10.10   -                   coord: 1..185; e-value: 4.4E-17; score: 64.1
None      No IPR available  SUPERFAMILY  55979        DNA clamp           coord: 1..118

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0013547.1  Tan0013547.1  mRNA