Tan0007079 (gene) Snake gourd v1

Overview
Name: Tan0007079
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA/RNA polymerases superfamily protein
Location: LG02: 52459300 .. 52464960 (+)
RNA-Seq Expression: Tan0007079
Synteny: Tan0007079
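
The Location field can be split into linkage group, coordinates, and strand with a small parser. This is a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'LG02: 52459300 .. 52464960 (+)'."""
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG02: 52459300 .. 52464960 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 5661 bp on the plus strand of LG02.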
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCTAAGAGACCCTATCCATTTCTGGAGTTTGAGTTGTTCTGAATTCCATGTTACATCCCTCCGAATGGATGGTGCTACTTCGGGCAACCTGCAGGCGCATTATGGGAATCTCACGGTCCAACCTAATGAAGGAGACCGTAGGATGTGTTGACACACTTCCTTCGCTCACTTACTATAAATACTTTCTCCATTCACCTTGCTCTTGATTTATGCAAACAGCATCCGAATGGGGATGCCCTATTGACATCACAAGGCCGAGCATAAATCTCACGGTGTGAACTTCGTGGAGAGCGTAAGAGGAAATTGATATATCATATATACCTTTTCACTCCCACTAAGTATTTTAAACTGTTTGGGTATACTTTTACTTAGGCATATTTCGGCTAAAACAATAAGACCTAAGTCTACCAATATAACAATTATACTGGAATTACCTAAGTCTATTAACCTTTTAATAAACCTAGGTCTATGAGTTTAATTTATTAACTTTTAATAAACGCCACTCATGAAAGATAATTTATAAACCTATAGATTAAATTCACTCCTAGGTCGGTTCTGAGGTGTAGGGGTGTTCATTTAAAATCCACAACCTTAAGTAGTACCAACCTAGCAGAACCTTCCTTAGTCAGAGTCCTTCTTTAATCAAAGTTTATAAATTATTGGTTTCCTAATTCTAATCTTATTAGAACTAGATAACCTAAGGTCTAATTAATTCCAGTTCTAAAATCATTTTAAAACTAATTAATTAACCTTAGGTTACATGCATTTCTATGTTATAAACAAATTGAACATTCATCACATGAATAAACATACATTTATAACAAATCCATCACATGAAAAATGCATAGCAATTATATGTTTAATTTTTAATTTATAACCCTTATATTAATTATAATTAAAACATATAATATTTGTTTCATAAATTAATATAACACTTATATTAATTCTTATGAAACCCAATAAACATGCATAAAATATATTATAACACTTATAACATACTTTCAATGCATGGAGCATGCTTCCTATGGTGGGATTGTAAAATCTATATGGCATACTTTATGCACATACAATCAGGGATTTTAATTAAAACATACATTCACAATGCATAATTAATTAAAGACACTAAAGTGGACTGGTTTTGGCATCTAATGCAAGCAAACTACTAAATTATTACAATAATAATCTAGAATTAAGATCGAGCCACTTCGAACTCTTCGAATTTGGGTCGAACCAGCTTGCAAATGTGTGAACCGGACCTTCAAAAGGTTGAACCGGGCTGAAAAAAGGCTGAACCGGGCCTCAAATTGGTGAAACGGACCTTTAAAATGAAAAAATTGGCCTCGAAATAGTCCGAATCGGTCTAAAAATACCTGAACCAGACCAAAATCGAGCCGAATCGGTCCAAAATTAATTGAACCGGTCAAAGTCAACCTAAAATAGTCAAAGTCAACAAACCAAGCCGCTGACGTCATCGTGACATCACCATCTTCTTCATTTTGAAAAGAAAAAGGCGGGCCGAAGGATGAAATTACTGTAGCTAATTGCAGAATGTGGAAAAAGTGCCTCTGTTCATTGATTTTATGACCTTTCATGCTTCTCTCTTAAAATTGAAGGTCTTCTCGAACTCCAATCGACTCGAAATTTTTACAGCTTATTCTACACACATTAATGCTCAAACACACTTGAGCAATTTGAAAATAAAGCACAAATTGATGCTCAATTTAAATGCCAAAACAAGAAGAACACTTTAGACGCCCAGAAAAATTGTATTTCAAAGAACACCCACCACAGAACAAAATAGATCAAATTGAGTGCTCAAAATTGCTCTGATACCAATTGAAAGAGTCTAGATTAAGAAAGCGGAAGCAATTTTGGATCTTGAACACTCTAAGTTGATCAATTTTGTAAAATAAAAAAACATGCTCAAAGCTGGAAAAACAGAGGGATAGAAAACTCAACTTTGAAGATCAAATTCTCTTCACCCGGTGGATGGTT
TTCAACACGAATACTTCGTGGACAACCGTTAGAGTCTTTCCTACTATCCTCAGGCCTTAGAACGGATTGTGGGATCCTTGGTTGAGAGAATTTTGAAGAAATGACCTCAAAGGTCACTAATTCGGGAATATTGAATTTTCTGTGTAATTCTCACAAAACACCTCAAATCACCATTATATAAAGTTTCAAAAAGCTTCAATGGAGTGACAACTCAAGGGTGATGAGGATGCTATGAGATGGAGGTGGCCAATTTTGGTGAATTTTGAAGATGAGTAAGATTGGGCAAGTTGTCAAGTTGGGTTTCATGGAAATCCTTGAATTTTCATTTAATTTTAATTCAAAAATCAATATTAAATTGATTTTTAATTAAAATTAATTTAATTAATTAAATTATTAAATAATTATTTTAATTAATTAATTTTTTAATTTAATTTAATATCAAATATTAAATTAAATTAATTGCCCGATCTCGATCGTTTTTCCGAACATGAATCCTAATTCATGTTATAAACCGATCCGATATTTAAATCATATTTAAATATATCCACCTCTAATTAATCATAGTTTAATTCATAATTAAGCTATGATTCGTTAAATATATCACATATAATTAACACTTTCTTCCAAACTCGAATTTGAATAATTCAAATTCTTTTCTCTCAAAATGTTATAAGGCTAAGTCCGAGCTAGTAAGGAGGACCTAATGGACCTACAGATCATGAGCTCCAACGATACGAGATTAACTGGTCAAACTCTTTAACCTAGCTAATCAACATTCGCTAACCACCGGGACACTCCACTAAAGCCCAGTGGTTGCACTCTCCTCACTATAGATATATTTCTGTCCACTCGATATAACCATGATTAGTAAGTCGATCCTTCACAGGTCGTTCGTAATTACATCTGGGTCAAAATTACCGTTTTACCCCTGTAATTACCTCTTGTTCCTTAAGTACCATTGATCCTCTAATGAACAATTGGTTTGTGGTCCAACCAGCAAACCGAATCCCTCTCGAGCCAATGAGAGGGTGGACCCATTGTTCAAGACACGAGTCGAGACTTAAGGGAACAACCTCTCTACTTATCCCTAAAGCGCGTAGGAGTGAATTCCATCTTGCTGAACTATGTTCACAGCTATCTACCCGATTTTATCCCTGAAATGGGAGGCTTATTGAGTCGGCAATCTCGAGCCACTCTCACCCATGCAAATCTAAGGATAATCCGAATAAACAGGAGTCCATAGAACGCTTAGGATTAAGATCGAGTTACCTAGGTCATCGTATGAATATAGTCAGTTAAGACAGTAAACGAAGTTATAAAGTCTAAGTGACTATTTCGCGGTCCAGTCTTATGCAAACTCATTGCATAGGGCGCCCCCACTCACATGGTCTCCACATGAACGATTTAGGATCACATCGTTTGTACTCTACAAAGTGGGTCGCATCCATAGTGTCCCAAGGATAAGGTACCCAGCCCTATCCTTATACTATAGACTGTTCTGGCTATAACTTGAACTTGATCCACTTTTATGTCACACATAAAGTTCAAGTATTCATCCTATAGCCAAAGGTTCTTTATTGGATTATGGTTACACAATACACAATAGTCAATAACACCTTTACTGAAAAAAATTCAATAACAACTTTATTGTGAAATAGAATATATATTCAGTTTACAAAACCACGAGTTTTAGGACATAAAACCCAACAGTTTAAATGTTTTTATGTGTTTTTTCTGAGACATTGAGCCCGAGGCTATATGGTACCGTGTGCACACAGGTCATTATCTGTTGTCGACGTTGAGTGTTCTCCGTGACAACAATGTTGTCGTGAGTGCTGGGCGGGCCCCACTACGACAAAGACGATGGGAGTGCTGGGCGGGCCCCACTACATCGTAGAGTAAACGTTGGTTGTACTGGGCGTGCCCTACAATACGTGGATTGTGACATGTTTTAAAAGAATTTCTATCAATCATATGCCTTACGAGATTTTAACGATATA
CTTGCTGGTTTTTCTGAAAAGCTATTATGATTACACAGTTACTTTTAACGTTTGATTACAGATTACATATGCTCATAGATGGTCAGGTGATATCATTTACGAGTTAATAGTTTTCTTTTAACTCAAGTCACTCACTGAGCTTCATAGCTCATTCTTTCAGTGTTTTTCCACTTTGCAGGTAGAGATCGAGCTCCCAGTGCCTGACATCCTGCCATAGTCTACTAGAAGCTCAACGAGCTTGATATTTTGTACGTGAGTGGTGTTGTGTAGAAAGTCTATATGTGTTGTAGTAAATGTTTGAGGGGACTATGTAGTTATAGTTGTATGTGCTTTAGGGGTTGTGGTTGTGTTATTAATGGTTTGGCTTATTGTGGTTTTGATGTTTTTGCCCGATTAGTTTTCAAGAGACTTAAGAAGTGTTTCTCCTTATGTATGCTTAGTAGTTGTCAAGTTCCGCTGTGTTATGCTAAAGTGTTCTATAAGCGAACATAGTATGTTTATCAGGAAAAATAGGGTCTACAGGTATCGTTAGAGAGGTGAACGATGTCTGTTGGCTTCACGCCGTCTTCTGGGTTAAGTAATGAGTAGTTCGGGAGAGGGTGTGACAACTTGGTATCAGAGCAGTTAGCTCCATGGGAATGAGACAGAGCAAGTTAGCTCTAGAAGAAACTGAAAAGTAAATAGCAAGTTATTGAGTTAGTTTAATAGAACTAGCATACCATGATCCAAGTAAGGATAATTGTAAGTCCAGTCAGGACTAGAATGTAAATACAAGTCTAGAATTGTGCATTTGAGAGTATTCATTACATGTCTAGTATGTTGATATGTTATAGAAGTCGTGCCACCATGTACAAGCAGACGATGCAGGCAGGATCAGGACGGGACGCAGGATCCTACCCAAGATCAATCTCAGTGGGGATCTAGTGCCCCGAGAGTCCAGATAGGGGCCAGAATTAAACGACATGCTAATTCCTCACAGGATGTAGGTAAGCCATAGAGAGCAGAGTTAAGTGATCCAGACAAGACATATAAAATAGATCACCTAAAAGAATTAGGTGCCACAGTGTTTGAGGGTACCAAAGATCCAGCTAATGCTGAGGTTGGTTGAATAAGCTTATAAATGTTTTGATGTGATGAGTTGCTCTGAGGAGCGAAATGTTAAGTTGGCCACATTCTTGCTGCTGAAGAAGACAGAAGGATGATGAAAATCGATGTTAGCCAGTTGTAGTGATGCACGTACTTTAGACTGGCAAACTTTCAGAAGTATATTTGAAGATAAATATTATCCTAGCATGTACCGAGAGGCAAAGGGGGATGAATTTTTAGAACTGAAGCAAGGGACACTTTCAGTGGTTGAATACGAGAGAAAGTATACTGAGTTGTCGCGGTATGCTGAAGTAATTGTGGCATCTGAGAGTGACAGGTGTCGAAGGTTTGAAAGAGGATTACATTCTGAGATACGTACCCAGTCACAACTATTTCTAAGTGGGCCGACCTTTTCTCGGCTAGTAGAGACTGCCCTACATGTTGAGCAGACTATAGTAGAGGAGAAGTCAGTAGTGAAGCCTAGTTGTGGGGCTTCGACAACCAGCAGTTTCCAAGGTCGTGAGCAGCGGAGGTTCACACCTGGAGTAAATGTTTCCAAGTCAATAAGACTTTAA

mRNA sequence

ATGGACCTAAGAGACCCTATCCATTTCTGGAGTTTGAGTTGTTCTGAATTCCATGTTACATCCCTCCGAATGGATGGTGCTACTTCGGGCAACCTGCAGGCGCATTATGGGAATCTCACGGACGGGACGCAGGATCCTACCCAAGATCAATCTCAGTGGGGATCTAGTGCCCCGAGAGTCCAGATAGGGGCCAGAATTAAACGACATGCTAATTCCTCACAGGATACTATAGTAGAGGAGAAGTCAGTAGTGAAGCCTAGTTGTGGGGCTTCGACAACCAGCAGTTTCCAAGGTCGTGAGCAGCGGAGGTTCACACCTGGAGTAAATGTTTCCAAGTCAATAAGACTTTAA

Coding sequence (CDS)

ATGGACCTAAGAGACCCTATCCATTTCTGGAGTTTGAGTTGTTCTGAATTCCATGTTACATCCCTCCGAATGGATGGTGCTACTTCGGGCAACCTGCAGGCGCATTATGGGAATCTCACGGACGGGACGCAGGATCCTACCCAAGATCAATCTCAGTGGGGATCTAGTGCCCCGAGAGTCCAGATAGGGGCCAGAATTAAACGACATGCTAATTCCTCACAGGATACTATAGTAGAGGAGAAGTCAGTAGTGAAGCCTAGTTGTGGGGCTTCGACAACCAGCAGTTTCCAAGGTCGTGAGCAGCGGAGGTTCACACCTGGAGTAAATGTTTCCAAGTCAATAAGACTTTAA

Protein sequence

MDLRDPIHFWSLSCSEFHVTSLRMDGATSGNLQAHYGNLTDGTQDPTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVSKSIRL
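
The protein sequence above is the conceptual translation of the CDS. A short sketch of that translation follows, assuming the standard genetic code; the helper name `translate` is illustrative, and only the first 30 nucleotides of the CDS are used to keep the example compact.

```python
# Build the standard genetic code table in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt of the CDS listed above.
cds_prefix = "ATGGACCTAAGAGACCCTATCCATTTCTGG"
print(translate(cds_prefix))
```

The printed residues can be compared with the start of the protein sequence; passing the full CDS string to the same function allows a complete check.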
Homology
BLAST of Tan0007079 vs. NCBI nr
Match: TYK00849.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 55.8 bits (133), Expect = 2.8e-04
Identity = 25/35 (71.43%), Positives = 30/35 (85.71%), Query Frame = 0

Query: 77  IVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVS 112
           I EEKS V+ SCG ST+S F+GREQRRFTPG+N+S
Sbjct: 165 ITEEKSAVELSCGTSTSSGFRGREQRRFTPGINIS 199
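
The Identity figure in an HSP is the number of alignment columns with identical residues divided by the alignment length. The sketch below recomputes it from the Query and Sbjct strings of the HSP above; the function name `percent_identity` is hypothetical. The Positives figure is not reproduced here because it depends on the substitution matrix used by BLAST.

```python
def percent_identity(query, sbjct):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    # Gap columns ('-') count toward the length but never as identities.
    identical = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != "-"
    )
    length = len(query)
    return identical, length, round(100.0 * identical / length, 2)

query = "IVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVS"
sbjct = "ITEEKSAVELSCGTSTSSGFRGREQRRFTPGINIS"
print(percent_identity(query, sbjct))
```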

BLAST of Tan0007079 vs. NCBI nr
Match: KAA0054814.1 (reverse transcriptase [Cucumis melo var. makuwa])

HSP 1 Score: 54.3 bits (129), Expect = 8.1e-04
Identity = 29/66 (43.94%), Positives = 41/66 (62.12%), Query Frame = 0

Query: 46  PTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFT 105
           P    ++W + +  V+   R+       + +I EEKS V+ S G STTS F+GREQRRFT
Sbjct: 211 PVTAIAKWTNFSQLVETALRV-------EQSITEEKSAVELSRGTSTTSGFRGREQRRFT 269

Query: 106 PGVNVS 112
           PG+N+S
Sbjct: 271 PGINIS 269

BLAST of Tan0007079 vs. NCBI nr
Match: KAA0061627.1 (putative polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 54.3 bits (129), Expect = 8.1e-04
Identity = 30/66 (45.45%), Positives = 40/66 (60.61%), Query Frame = 0

Query: 46  PTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFT 105
           P    ++W + +  V+   R+       Q +I EEKS V+ S G ST S F+GREQRRFT
Sbjct: 196 PVTAVAKWTNFSQLVETSLRV-------QQSITEEKSAVELSRGTSTASGFRGREQRRFT 254

Query: 106 PGVNVS 112
           PGVN+S
Sbjct: 256 PGVNIS 254

BLAST of Tan0007079 vs. ExPASy TrEMBL
Match: A0A5D3BRY3 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold509G00100 PE=4 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.4e-04
Identity = 25/35 (71.43%), Positives = 30/35 (85.71%), Query Frame = 0

Query: 77  IVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVS 112
           I EEKS V+ SCG ST+S F+GREQRRFTPG+N+S
Sbjct: 165 ITEEKSAVELSCGTSTSSGFRGREQRRFTPGINIS 199

BLAST of Tan0007079 vs. ExPASy TrEMBL
Match: A0A5A7UI35 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold437G001040 PE=4 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.9e-04
Identity = 29/66 (43.94%), Positives = 41/66 (62.12%), Query Frame = 0

Query: 46  PTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFT 105
           P    ++W + +  V+   R+       + +I EEKS V+ S G STTS F+GREQRRFT
Sbjct: 211 PVTAIAKWTNFSQLVETALRV-------EQSITEEKSAVELSRGTSTTSGFRGREQRRFT 269

Query: 106 PGVNVS 112
           PG+N+S
Sbjct: 271 PGINIS 269

BLAST of Tan0007079 vs. ExPASy TrEMBL
Match: A0A5A7V411 (Putative polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold3980G00110 PE=4 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.9e-04
Identity = 30/66 (45.45%), Positives = 40/66 (60.61%), Query Frame = 0

Query: 46  PTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFT 105
           P    ++W + +  V+   R+       Q +I EEKS V+ S G ST S F+GREQRRFT
Sbjct: 196 PVTAVAKWTNFSQLVETSLRV-------QQSITEEKSAVELSRGTSTASGFRGREQRRFT 254

Query: 106 PGVNVS 112
           PGVN+S
Sbjct: 256 PGVNIS 254

BLAST of Tan0007079 vs. ExPASy TrEMBL
Match: A0A5A7UT17 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G006090 PE=4 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 5.1e-04
Identity = 28/48 (58.33%), Positives = 35/48 (72.92%), Query Frame = 0

Query: 64  ARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVS 112
           A++   A   + +I+EEKS +  S G STTS F+GREQRRFTPGVNVS
Sbjct: 358 AKLVETALPVEQSIIEEKSTMDLSRGVSTTSGFRGREQRRFTPGVNVS 405

BLAST of Tan0007079 vs. ExPASy TrEMBL
Match: A0A5D3BTP3 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00020 PE=4 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 5.1e-04
Identity = 31/66 (46.97%), Positives = 42/66 (63.64%), Query Frame = 0

Query: 46  PTQDQSQWGSSAPRVQIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFT 105
           P    ++W + +  V+   R+K+       +IVEEKS ++ S G STTS  +GREQRRFT
Sbjct: 423 PVTAIAKWMNFSQLVETALRVKQ-------SIVEEKSAMELSRGVSTTSGIRGREQRRFT 481

Query: 106 PGVNVS 112
           PGVNVS
Sbjct: 483 PGVNVS 481

The following BLAST results are available for this feature:

NCBI nr
Match Name    E-value  Identity (%)  Description
TYK00849.1    2.8e-04  71.43         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]
KAA0054814.1  8.1e-04  43.94         reverse transcriptase [Cucumis melo var. makuwa]
KAA0061627.1  8.1e-04  45.45         putative polyprotein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name    E-value  Identity (%)  Description
A0A5D3BRY3    1.4e-04  71.43         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7UI35    3.9e-04  43.94         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43...
A0A5A7V411    3.9e-04  45.45         Putative polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold398...
A0A5A7UT17    5.1e-04  58.33         CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A5D3BTP3    5.1e-04  46.97         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold37...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 35..58
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 87..116
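
The two mobidb-lite coordinate ranges can be mapped back onto the protein sequence to see which residues are predicted to be disordered. This is a sketch that assumes the `coord` values are 1-based and inclusive; the helper name `region` is illustrative.

```python
PROTEIN = (
    "MDLRDPIHFWSLSCSEFHVTSLRMDGATSGNLQAHYGNLTDGTQDPTQDQSQWGSSAPRV"
    "QIGARIKRHANSSQDTIVEEKSVVKPSCGASTTSSFQGREQRRFTPGVNVSKSIRL"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# Coordinate pairs taken from the mobidb-lite rows above.
for start, end in [(35, 58), (87, 116)]:
    print(f"{start}..{end}: {region(PROTEIN, start, end)}")
```

Under this assumption the second region runs to the last residue of the 116-residue protein.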

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0007079.1  Tan0007079.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding