Tan0014468 (gene) Snake gourd v1

Overview
NameTan0014468
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG08: 68313561 .. 68314728 (+)
RNA-Seq ExpressionTan0014468
SyntenyTan0014468
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAGGCCAGAATGTGTGCACTTTGCTTCGTGATTATGGTTTTTGGGACGAAGATAAGGTACAGGAGCACTTTAACCAGGACGATGCAGAATACATTCTCTCAATCCCGCGAACTGGAGAACTGACCACTGACGAAATTGTTTGGAAGTGCACCATGAAAGGGGTCTTTTCAGTTAAGGTGCCTACCTTTAGGTATGGACGTTAGAGCACAAAATGAGGCTTCAAGATCGGACAACTCCCTATCAAGACAAATGTGGAAGTTAGTATGGAGTGCGCTTATCCCAAGCAAAATCAAGATATGTTGTTAGAAGATCATTCACGACATTTTTCCGACTCGAGCTAATTTGCTAAGAAGGGGCATCATCCTGAACCCAGTCTGTCCTTTTTGTTCAAAAAAGATGGAGATGAGTAATCATATCTTTTGGGGTTGCAAGGTATCTACTAAATTTTGGGACCTTTTTTTACCTTCTACCTCTGTTTTGTTTTATGATTGCAGGGACACCTAGAGTGCAGTGGATTATTTCTGTTGGATGTTGGATCGGCATAATCGTATGGACCAAGCGTTGTTCATGACAATTCTTTTGAAGATTTGGTCCTGTTGGAATGTATTATTACAGAACCAGGGCAGACTTGACTGGGAAAGGGTGTTCCTCAACACACAACTCCAGTTTCAGGAGTTCACTCAATCAGAGGCTACAAGGATCCAAGTTCCAGATCATATGACAACGACGACAGAAGTTTGGGAGTCGTCGGATGAAGGTTAGTGGAAATTAAATATCGATGCCTCATGGTGCTCAAACACTAATTACAGGGGAGTAGGTTGGACCTTACAGGATTGGGCAGGAAGGATAGTGCGCGCAGGGCATCACCACATCATAGACAGGTGGTCAATCACCATTTTGGAACTTTGTGATATTCTCAAGGGTATGGACTTCATTCAGGAGTATAACATACCTCTCTTGGTGGAATATGATTCTTGGGAGGTTATACAACTTATTAATAGTGTTGACAATGATCAAACGGAGGCAAGAGACTTTGCGAGGAAGATCAGACAACGGACAAATTCTTGGCCCACTATTTCTTTTCGCCACACTAGACAGGAGACAAATATGGTCGCCCACAAGCTGGCGCAACGTGGAAAACACCTACAGGGAGAAGAACATTAG

mRNA sequence

ATGACAGGCCAGAATGTGTGCACTTTGCTTCGTGATTATGGTTTTTGGGACGAAGATAAGGTACAGGAGCACTTTAACCAGGACGATGCAGAATACATTCTCTCAATCCCGCGAACTGGAGAACTGACCACTGACGAAATTGTTTGGAAGTGCACCATGAAAGGGGTCTTTTCAGTTAAGAACCAGGGCAGACTTGACTGGGAAAGGGTGTTCCTCAACACACAACTCCAGTTTCAGGAGTTCACTCAATCAGAGGCTACAAGGATCCAAGTTCCAGATCATATGACAACGACGACAGAAGTTTGGGAGTCGTCGGATGAAGGAAGGATAGTGCGCGCAGGGCATCACCACATCATAGACAGGTGGTCAATCACCATTTTGGAACTTTGTGATATTCTCAAGGGTATGGACTTCATTCAGGAGTATAACATACCTCTCTTGGTGGAATATGATTCTTGGGAGGTTATACAACTTATTAATAGTGTTGACAATGATCAAACGGAGGCAAGAGACTTTGCGAGGAAGATCAGACAACGGACAAATTCTTGGCCCACTATTTCTTTTCGCCACACTAGACAGGAGACAAATATGGTCGCCCACAAGCTGGCGCAACGTGGAAAACACCTACAGGGAGAAGAACATTAG

Coding sequence (CDS)

ATGACAGGCCAGAATGTGTGCACTTTGCTTCGTGATTATGGTTTTTGGGACGAAGATAAGGTACAGGAGCACTTTAACCAGGACGATGCAGAATACATTCTCTCAATCCCGCGAACTGGAGAACTGACCACTGACGAAATTGTTTGGAAGTGCACCATGAAAGGGGTCTTTTCAGTTAAGAACCAGGGCAGACTTGACTGGGAAAGGGTGTTCCTCAACACACAACTCCAGTTTCAGGAGTTCACTCAATCAGAGGCTACAAGGATCCAAGTTCCAGATCATATGACAACGACGACAGAAGTTTGGGAGTCGTCGGATGAAGGAAGGATAGTGCGCGCAGGGCATCACCACATCATAGACAGGTGGTCAATCACCATTTTGGAACTTTGTGATATTCTCAAGGGTATGGACTTCATTCAGGAGTATAACATACCTCTCTTGGTGGAATATGATTCTTGGGAGGTTATACAACTTATTAATAGTGTTGACAATGATCAAACGGAGGCAAGAGACTTTGCGAGGAAGATCAGACAACGGACAAATTCTTGGCCCACTATTTCTTTTCGCCACACTAGACAGGAGACAAATATGGTCGCCCACAAGCTGGCGCAACGTGGAAAACACCTACAGGGAGAAGAACATTAG

Protein sequence

MTGQNVCTLLRDYGFWDEDKVQEHFNQDDAEYILSIPRTGELTTDEIVWKCTMKGVFSVKNQGRLDWERVFLNTQLQFQEFTQSEATRIQVPDHMTTTTEVWESSDEGRIVRAGHHHIIDRWSITILELCDILKGMDFIQEYNIPLLVEYDSWEVIQLINSVDNDQTEARDFARKIRQRTNSWPTISFRHTRQETNMVAHKLAQRGKHLQGEEH
Homology
BLAST of Tan0014468 vs. NCBI nr
Match: KAF7821943.1 (RVT_3 domain-containing protein [Senna tora])

HSP 1 Score: 63.2 bits (152), Expect = 3.2e-06
Identity = 57/220 (25.91%), Postives = 89/220 (40.45%), Query Frame = 0

Query: 6   VCTLLRDYGFWDEDKVQEHFNQDDAEYILSIPRTGELTTDEIVWKCTMKGVFSVKNQG-- 65
           VC L+   G W+ + +   F    A  ILSIP       D   W  T  G ++VK++   
Sbjct: 50  VCMLISSPGVWNHEALSVFFPPSVANNILSIPLARAPKDDSWFWTLTPNGQYTVKSESAE 109

Query: 66  -----RLDW-ERVFLNTQLQFQEFTQSEATRIQVP----DHMTTTTEVWESSDEGRIVRA 125
                 L W  R  +   +   E       ++ V       + T       + +GR++ A
Sbjct: 110 KFYLFALTWLRRELMRILVVCYEKRMRRRWKVNVDSCKRSEVATGVGCVIRNFQGRVLGA 169

Query: 126 GHHHIIDRWSITILELCDILKGMDFIQEYNIPLL-VEYDSWEVIQLINSVDNDQTEARDF 185
                    S+ +LE   +L GM+F ++     + +E D+  V  L+N            
Sbjct: 170 IARRAPPCASVELLEATAVLAGMEFARDLRCSCVEIEGDAQSVFNLVNGQTCSLFWVGTV 229

Query: 186 ARKIRQRTNSWPTISFRHTRQETNMVAHKLAQRGKHLQGE 213
              I    + + +ISFR   + TNMVAHKLA+ G  L GE
Sbjct: 230 VDSILSIISEFTSISFRWVPRGTNMVAHKLARVGSSLTGE 269

BLAST of Tan0014468 vs. ExPASy TrEMBL
Match: A0A6J1CP26 (uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013412 PE=4 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 5.6e-04
Identity = 39/106 (36.79%), Postives = 55/106 (51.89%), Query Frame = 0

Query: 102 WESSDE-GRIVRAGHHHIIDRWSITILELCDILKGMDFI-QEYNIPLLVEYDSWEVIQLI 161
           W   DE G +++A    I    +IT LE+  I +G+  I QE+  P+ +E DS E I L+
Sbjct: 106 WILRDEKGEVIKASCRIIRAERNITYLEVMAICEGLRAIRQEHCRPIHLESDSLEAIHLL 165

Query: 162 NSVDNDQTEARDFARKIRQRTNSWPTISFRHTRQETNMVAHKLAQR 206
           +    DQTE      +I Q       +S RH  +E N VAH LA+R
Sbjct: 166 HRQCQDQTEIIWLLEEIWQMMKDMEIVSMRHISREANKVAHGLARR 211

BLAST of Tan0014468 vs. ExPASy TrEMBL
Match: A0A803QE56 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 7.3e-04
Identity = 26/59 (44.07%), Postives = 36/59 (61.02%), Query Frame = 0

Query: 3    GQNVCTLLRDYGFWDEDKVQEHFNQDDAEYILSIPRTGELTTDEIVWKCTMKGVFSVKN 62
            G  V  L R  G WDE+ V+  FN++DA+ IL +P TG    D+I+W  T  G +SVK+
Sbjct: 1133 GLYVVDLKRPNGCWDEEFVRVVFNEEDADIILKLPSTGWDIEDKIMWHYTKNGEYSVKS 1191

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAF7821943.13.2e-0625.91RVT_3 domain-containing protein [Senna tora][more]
Match NameE-valueIdentityDescription
A0A6J1CP265.6e-0436.79uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A803QE567.3e-0444.07Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 96..210
e-value: 8.2E-7
score: 31.0
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 106..205
e-value: 3.1E-12
score: 46.4
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 114..207

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0014468.1Tan0014468.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity