Tan0015602 (gene) Snake gourd v1

Overview
Name: Tan0015602
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Location: LG01: 15617976 .. 15620213 (-)
RNA-Seq Expression: Tan0015602
Synteny: Tan0015602
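
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and that the trailing sign in parentheses is the strand; the function name `parse_location` and the regular expression are illustrative and not part of the database.

```python
import re

# Pattern for strings shaped like "LG01: 15617976 .. 15620213 (-)".
LOCATION_RE = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text):
    """Split a location string into sequence id, start, end, strand and length."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }


location = parse_location("LG01: 15617976 .. 15620213 (-)")
print(location)
```

Under the inclusive-coordinate assumption the gene spans 2238 bp on the minus strand of LG01.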
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAACAAAGTCCCACTCACCTAGTCCCCTAACTACGTTCTCTCTGTCCAGCTCCTCTCTCTCTCTCCTCTTTCTCTCTCTACCTATCTGCATGGCCTTCGTCTTCTCTCTCTCTCCCTCTGCGGATTAGGGTTCTCTGGTTCCTTCCGCGACCTCCCTTTCTGCATGGTAGATCAAGATTCCCCTCGCTGTAAGCTTCCGATTGACCTTCTTTTCGTTTGCATTTCCTTCGCCTGCTTAAATCACTGGACATCGCGTTCGTTTTTGCGCCCATTTTGTTTATGTGCCTCTGTTTGTATTGTTTTTGTGTTATTTTCTCTCTTTTTTTTTTTTTGGATTTTTTGTTTTTCCTTTGACTTCGCGTTGTGGCATGCATGGTTTGGTTTTCTGTGATTGTTTGATTTGGTAGAAAATGTTTTTTTTCGTTGTCTTTCTTCTGGTTTTATTGGTTAAGCGTTTCGTTTTCTGGTTTTCTGTTCTCGTGCATTTGTTGGGGAGGGATTTTTAATAACTGCATTTTTCAATTTTGGAATATTAAGGGTTTAGCAGATTAACCTCGCATGGGTGAGTATTTCTTATTGTTTCGTTTACGTTTTACTTTTCAGGAGGATTTGGAGTTCAATGTTGCGTTAAACCCTGCTTTTGTGTCTTGTTTTGAATCTCGAACCAAGTGGCGTCGTTGATATAATACTATGACCAACAATGCTGGTTATTATATGGAGTGTGTAATGCCTGGAGTGAAAACTCGCTTCTGTTTTTTCTTAAGTTGCAATTATTTAGTTTGGTTTGAACGTGTTATTGTAACTTCCCTTCATTTTTATAGGATTAATCATACTAATGCAAGTTTATTTCTTATGGTGTCCATATTAGTTTTACATACTTTACGTTGTAGTTGTAGTTTGTAAATCCACCCTAAACTAGAGTTAATGTACGAATACTATTTTAAGCGTAGCACGGTACTGAAAGCTAAAAATCTAAACTAGAGTTAATGTACGAATACTCGTTTCTATTTCTTATCTTTACGCCTTCCCAGTTTGGTCTGTTCTTTGAAAGGCTTTTTTGGGGTCGTTTTCGTATTGGTATGTTGCCTTCACTGATGGGTGAAATCTTCAATAAAATAAGGATATGGAAAAGAGAAAAACTTGCCCAAGTTTCTTGCTTTGATTTGAGAGAGAGGGAAAAGAGAGACACCCAAGTTTTTCTCTAGGGAAAATTGCCCACAATTTCTTGCTTTAACTCTGAGTTTTCAGGGTCTCTTTTGAAAGAAGATAGTTATAGAGAGTTATGTCTTGCACTACTTTTAATCTTGGAGGAGGATCACGTCCGTTTTGTAACATTGATAAAAAATCATCTTCGATTCTTTAATGGGTTTGTGCCCAATGGCCGATGCAATTTAGTTTGTAACTCTGCAATTTCTTCTTCTGTTGGATCCCTCAGCTTCAATTCTCTTCCATTTTCTCTTCCACAATGGCCCGCGTCTCTCCTATCATGTCATCAACTGCTCTAGTGTAAGCTTAGAGGCGTCAATCTCTTCAAGAAAGCCGATTCTTTGGAGCCAGACTCAGTACGACAACATACATGGAAACCTCCAGCTTTTCCTAACTTTAAATTAAATACAGATGCAACTATATTTAGAAGTGAAAATCGAGGTGGAATGGGTGCAGTGATACGGGATGAAAGGGGAAATGAGATGTTTACCGTATCCAAACCAATTCCATGGATTACAGGCGTGGTGACTGTGGAAGCAATCACAGTTAGAGATGCGTTGATTATGGCAAGAGAACTGGGATTTTATCAACTGGAAGTAGAAACGGACTCAACGATGGTGATCAACCTCATACATCAAGAACGTCAAAATCAATCGGAGATAGGGCAGATAATCGAAGAAGTAAAGGAATTGGAAAGGACGATGAAGAACTGTACCCACGATCAAATGCCCTAGCTCATTCCTTCCCTAGCAAGACATGCAAGCACTCTGAAGGAGGAGACGATCTGGATGGAA
GAAATTCCTGACCCTTCGAGAGCTCTCTACGAAATGGAGAGAAATGAAGTTTGCACTGAAGACAACTTTTATTTTCTTTAATTATTTCTTGTAATCCGATATTCATCAAAAAAAAGAAAAAAAAATCAATCTCAGTCTATCTATAAAATATTAGACATATTCATTAAATCTATGAAATATACTAAAAGAAAAATAGAAAATATCCACCTTCATTGAAGAGAAAATATTTTAAAATAAA

mRNA sequence

CAACAAAGTCCCACTCACCTAGTCCCCTAACTACGTTCTCTCTGTCCAGCTCCTCTCTCTCTCTCCTCTTTCTCTCTCTACCTATCTGCATGGCCTTCGTCTTCTCTCTCTCTCCCTCTGCGGATTAGGGTTCTCTGGTTCCTTCCGCGACCTCCCTTTCTGCATGGTAGATCAAGATTCCCCTCGCTATGCAACTATATTTAGAAGTGAAAATCGAGGTGGAATGGGTGCAGTGATACGGGATGAAAGGGGAAATGAGATGTTTACCGTATCCAAACCAATTCCATGGATTACAGGCGTGGTGACTGTGGAAGCAATCACAGTTAGAGATGCGTTGATTATGGCAAGAGAACTGGGATTTTATCAACTGGAAGTAGAAACGGACTCAACGATGGTGATCAACCTCATACATCAAGAACGTCAAAATCAATCGGAGATAGGGCAGATAATCGAAGAAGTAAAGGAATTGGAAAGGACGATGAAGAACTGTACCCACGATCAAATGCCCTAGCTCATTCCTTCCCTAGCAAGACATGCAAGCACTCTGAAGGAGGAGACGATCTGGATGGAAGAAATTCCTGACCCTTCGAGAGCTCTCTACGAAATGGAGAGAAATGAAGTTTGCACTGAAGACAACTTTTATTTTCTTTAATTATTTCTTGTAATCCGATATTCATCAAAAAAAAGAAAAAAAAATCAATCTCAGTCTATCTATAAAATATTAGACATATTCATTAAATCTATGAAATATACTAAAAGAAAAATAGAAAATATCCACCTTCATTGAAGAGAAAATATTTTAAAATAAA

Coding sequence (CDS)

ATGGTAGATCAAGATTCCCCTCGCTATGCAACTATATTTAGAAGTGAAAATCGAGGTGGAATGGGTGCAGTGATACGGGATGAAAGGGGAAATGAGATGTTTACCGTATCCAAACCAATTCCATGGATTACAGGCGTGGTGACTGTGGAAGCAATCACAGTTAGAGATGCGTTGATTATGGCAAGAGAACTGGGATTTTATCAACTGGAAGTAGAAACGGACTCAACGATGGTGATCAACCTCATACATCAAGAACGTCAAAATCAATCGGAGATAGGGCAGATAATCGAAGAAGTAAAGGAATTGGAAAGGACGATGAAGAACTGTACCCACGATCAAATGCCCTAG

Protein sequence

MVDQDSPRYATIFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEVETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNCTHDQMP
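
The protein sequence above is the translation of the coding sequence. As a rough check, the sketch below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The helper name `translate` is illustrative, and only the first ten codons and the last three codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGTAGATCAAGATTCCCCTCGCTATGCA"))  # MVDQDSPRYA
# Last three codons of the CDS, including the TAG stop.
print(translate("ATGCCCTAG"))  # MP
```

Passing the full CDS string to `translate` should give the 115-residue protein shown above.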
Homology
BLAST of Tan0015602 vs. NCBI nr
Match: XP_023916941.1 (uncharacterized protein LOC112028476 [Quercus suber])

HSP 1 Score: 66.2 bits (160), Expect = 2.1e-07
Identity = 36/98 (36.73%), Positives = 57/98 (58.16%), Query Frame = 0

Query: 12  IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
           +F   +  G+G VIRDE G  +  +S+ IP    V  VEA+  R ALI A+E+  ++ EV
Sbjct: 85  VFEDRSLAGLGIVIRDESGLIIAALSQKIPLPRSVDMVEALDARQALIFAQEISIFKAEV 144

Query: 72  ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
           E DS  VI  ++  + N++ +G II +++ L   M+ C
Sbjct: 145 EGDSLNVIQALNNPKPNRTLMGHIISDIQCLGAAMQKC 182

BLAST of Tan0015602 vs. NCBI nr
Match: XP_030969974.1 (uncharacterized protein LOC115990267 [Quercus lobata])

HSP 1 Score: 65.9 bits (159), Expect = 2.7e-07
Identity = 33/98 (33.67%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 12  IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
           IF+  N+ G+G V+RD +G  +  +++ +  +     +EA+ +R A+  A E  F  + +
Sbjct: 28  IFKESNKAGIGVVVRDSQGWVLAALTEKVDGVQDAEVIEALAIRRAIRFAIETSFNCVII 87

Query: 72  ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
           E+DS  V+  I    +    IG IIE+VK L +TMK+C
Sbjct: 88  ESDSLSVVKAIQDTAEPTCHIGNIIEDVKLLSKTMKSC 125

BLAST of Tan0015602 vs. NCBI nr
Match: XP_023899902.1 (uncharacterized protein LOC112011793 [Quercus suber])

HSP 1 Score: 64.7 bits (156), Expect = 6.0e-07
Identity = 33/89 (37.08%), Positives = 55/89 (61.80%), Query Frame = 0

Query: 12  IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
           +F      G+G +IR+++G  M  +S+ IP +  +  VE I  R AL+ A+ELGF ++EV
Sbjct: 40  LFSQAELAGIGVIIRNDQGLAMAALSQQIPSLASMEMVEVIAARRALMFAKELGFDKVEV 99

Query: 72  ETDSTMVINLIHQERQNQSEIGQIIEEVK 101
           E DS  V+N I  +  + S +G ++++VK
Sbjct: 100 EGDSETVVNAILGDYMDNSFMGHVLQDVK 128

BLAST of Tan0015602 vs. NCBI nr
Match: XP_030925021.1 (uncharacterized protein LOC115952076 [Quercus lobata])

HSP 1 Score: 64.3 bits (155), Expect = 7.8e-07
Identity = 35/91 (38.46%), Positives = 55/91 (60.44%), Query Frame = 0

Query: 12  IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
           IF +++   +G V+RD+ G  + T+S+ IP  T V TVE I  R AL  A+ELGF ++ V
Sbjct: 10  IFSTQSSASLGMVVRDQAGLVLATLSQKIPMPTSVETVEVIAARRALEFAKELGFERIMV 69

Query: 72  ETDSTMVINLIHQERQNQSEIGQIIEEVKEL 103
           E D  ++I  I ++    S +G I+E++  L
Sbjct: 70  EGDFEIIIKTIREKTLLSSVLGHILEDIHVL 100

BLAST of Tan0015602 vs. NCBI nr
Match: XP_030964152.1 (uncharacterized protein LOC115985344 [Quercus lobata])

HSP 1 Score: 63.9 bits (154), Expect = 1.0e-06
Identity = 33/90 (36.67%), Positives = 51/90 (56.67%), Query Frame = 0

Query: 10  ATIFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQL 69
           A  FR+ N  G+G ++RD  G  +  +S PIP    V  VEA+  R A+  A E+G  ++
Sbjct: 100 AATFRTTNSAGIGVIVRDCAGEVIGALSMPIPMPQSVAAVEALACRRAVKFAAEIGLTRV 159

Query: 70  EVETDSTMVINLIHQERQNQSEIGQIIEEV 100
             E DS +VIN I   + +Q+  G +IE++
Sbjct: 160 VFEGDSAVVINAISSTKGDQTSYGNVIEDI 189

BLAST of Tan0015602 vs. ExPASy TrEMBL
Match: A0A7N2KZZ0 (RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 4.0e-09
Identity = 36/97 (37.11%), Positives = 62/97 (63.92%), Query Frame = 0

Query: 12  IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
           IF +++  G+  V+RD+ G  + ++S+ IP  T V  VE I  R AL++ARELGF ++ V
Sbjct: 167 IFSTQSSAGLAMVVRDQAGLVLASLSQKIPLSTSVEIVEVIAARRALLLARELGFERVMV 226

Query: 72  ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKN 109
           E DS ++I  I ++    S++G I+E+++ L R+  +
Sbjct: 227 EGDSEIIIKAIKEKALPSSDLGHILEDIRVLSRSFNS 263

BLAST of Tan0015602 vs. ExPASy TrEMBL
Match: A0A7N2N1F3 (RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 3.4e-08
Identity = 34/100 (34.00%), Positives = 57/100 (57.00%), Query Frame = 0

Query: 10  ATIFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQL 69
           A +F++ +  G+G +IRD +G+ +  +S P P  T V  +EA+  R A++ A+E+G  Q+
Sbjct: 69  AAVFKASSSAGIGVIIRDNKGDAIGALSVPTPLSTSVAAMEALACRRAVLFAKEIGLRQV 128

Query: 70  EVETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
             E DS MVI  + Q     +E G II++++ L      C
Sbjct: 129 LFEGDSAMVIQALIQGDSASAEYGNIIDDIRALAADFDFC 168

BLAST of Tan0015602 vs. ExPASy TrEMBL
Match: A0A2N9EUT1 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS6520 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 9.9e-08
Identity = 33/98 (33.67%), Positives = 57/98 (58.16%), Query Frame = 0

Query: 12   IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
            +F      G+G VIRDE G  M ++S+ +P+      VEA+ +R A+ +A E+GFY + V
Sbjct: 1481 VFLETMEAGLGVVIRDELGRPMVSLSQKVPFPGSSTAVEALALRRAMFLAIEMGFYSVIV 1540

Query: 72   ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
            E DS M++  +     + +  G ++E+V+ L + M +C
Sbjct: 1541 EGDSEMLVRAVTSLGGSATVYGHVVEDVQYLTQQMTHC 1578

BLAST of Tan0015602 vs. ExPASy TrEMBL
Match: A0A2N9I6L8 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49538 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 9.9e-08
Identity = 33/98 (33.67%), Positives = 57/98 (58.16%), Query Frame = 0

Query: 12   IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
            +F      G+G VIRDE G  M ++S+ +P+      VEA+ +R A+ +A E+GFY + V
Sbjct: 1403 VFLETMEAGLGVVIRDELGRPMVSLSQKVPFPGSSTAVEALALRRAMFLAIEMGFYSVIV 1462

Query: 72   ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
            E DS M++  +     + +  G ++E+V+ L + M +C
Sbjct: 1463 EGDSEMLVRAVTSLGGSATVYGHVVEDVQYLTQQMTHC 1500

BLAST of Tan0015602 vs. ExPASy TrEMBL
Match: A0A2N9HKV4 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40213 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 9.9e-08
Identity = 33/98 (33.67%), Positives = 57/98 (58.16%), Query Frame = 0

Query: 12   IFRSENRGGMGAVIRDERGNEMFTVSKPIPWITGVVTVEAITVRDALIMARELGFYQLEV 71
            +F      G+G VIRDE G  M ++S+ +P+      VEA+ +R A+ +A E+GFY + V
Sbjct: 1471 VFLETMEAGLGVVIRDELGRPMVSLSQKVPFPGSSTAVEALALRRAMFLAIEMGFYSVIV 1530

Query: 72   ETDSTMVINLIHQERQNQSEIGQIIEEVKELERTMKNC 110
            E DS M++  +     + +  G ++E+V+ L + M +C
Sbjct: 1531 EGDSEMLVRAVTSLGGSATVYGHVVEDVQYLTQQMTHC 1568
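
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows; the function name `percent` is illustrative, and the counts are copied from the first NCBI nr hit (XP_023916941.1).

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)


# First NCBI nr hit: 36 identical and 57 positive positions over 98 aligned columns.
print(percent(36, 98))  # 36.73
print(percent(57, 98))  # 58.16
```

Identity counts exact residue matches, while Positives also counts the conservative substitutions marked with "+" in the alignment midline.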

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_023916941.1  2.1e-07  36.73         uncharacterized protein LOC112028476 [Quercus suber]
XP_030969974.1  2.7e-07  33.67         uncharacterized protein LOC115990267 [Quercus lobata]
XP_023899902.1  6.0e-07  37.08         uncharacterized protein LOC112011793 [Quercus suber]
XP_030925021.1  7.8e-07  38.46         uncharacterized protein LOC115952076 [Quercus lobata]
XP_030964152.1  1.0e-06  36.67         uncharacterized protein LOC115985344 [Quercus lobata]

ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A7N2KZZ0  4.0e-09  37.11         RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2N1F3  3.4e-08  34.00         RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A2N9EUT1  9.9e-08  33.67         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS6520 PE=4 SV=1
A0A2N9I6L8  9.9e-08  33.67         Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9HKV4  9.9e-08  33.67         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40213 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                  Source       Source Term  Source Description   Alignment
None       No IPR available                 COILS        Coil         Coil                 coord: 72..92
IPR002156  Ribonuclease H domain            PFAM         PF13456      RVT_3                coord: 2..90; e-value: 2.3E-14; score: 53.2
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10  -                    coord: 1..95; e-value: 1.2E-8; score: 37.1
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098        Ribonuclease H-like  coord: 2..69

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0015602.1  Tan0015602.1  mRNA
Tan0015602.2  Tan0015602.2  mRNA
Tan0015602.3  Tan0015602.3  mRNA
Tan0015602.4  Tan0015602.4  mRNA
Tan0015602.5  Tan0015602.5  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity