Tan0004509 (gene) Snake gourd v1

Overview
Name: Tan0004509
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Location: LG07: 73855112 .. 73856204 (-)
RNA-Seq Expression: Tan0004509
Synteny: Tan0004509
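The Location field can be turned into a gene span with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'LG07: 73855112 .. 73856204 (-)'.

    Returns (seqid, start, end, strand). The layout of the string is taken
    from the Location field on this page; other pages may differ.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1

seqid, start, end, strand = parse_location("LG07: 73855112 .. 73856204 (-)")
print(seqid, strand, span_length(start, end))  # LG07 - 1093
```

Under the inclusive-coordinate assumption the gene spans 1093 bp on the minus strand of LG07.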
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTTCGAATTCGGACACAATGCCATCTAACCATAAGTGGAAGCGCTTCTCGAAGCTGGATGGGCCACCTAAAGCTAAAATTACAGCTTGGAGGCTCATTCGTGACTCTGTACCTACTAAAATGAACATCCTTAACAAAAATATACATACTAATCCTTTGTGCCCGTTTTGCAGGAAATATCCTGAGTCCACAGTCCATATTTTGTGGGAGTGCAAATTAGCAAAAGAGGTCTGGAAGAATTTTCTTCCCTATATGATAATTCCATCAGTTCATCCTCACAAACAATGTTGGAGCTTGGAAGACTGGTGGGATTGGGTGGTCCAAAATATCAAAGAAGAAGATCAACACAAAGTCATTATTCTAATCCTATGGAATATATGGACGCAACAAAATTCAACTCTTCAAACGAACAAAACTCCCCCCCCCCTCCAAATTCTTAGTTAACTAAAAAGGCTATTAATCTGTCAATTCAGAGGTACCTGAATGAAATGCACCTGCAGGACGAGGCTTGAGGGAGAACCAAACGAGTCATGAAGGTTGGCGTCCTACTCCAGAAGGATGCTGGAAACTCAACACCGACGCCTCCTGGAACGAAGCGGCCTCACAAGGGGGATTGGGTTGGACAATTCGTGACTCTCGGGTTCTCTCATCTGCGCAGGAATTAAAAAAACTCAAACGCCAATTACCAATTAAATATCTCGAAGGGAAAGCCATCCTTGAAGGCCATCACATTTATCTGAAGATATCTAAGGAACATGCACAACGTTTGATGGTGGAATCTGACTTTGTAGAAGTAATTAAGGTGCTAAATGTTGAAGCGTTTGATCTCTCCGAATTGAATGACATCGCAAACGAAATTCACTCAATAGTTGGCGATGTTGGTATAATTTCCTTCACCAAATGTCCCAGATCGGGCAACCAATCGACCCAAAAGTTGGCGAGAGTAGCTGCTTTCAATTTCCTTTTGGAAGAAAACGCCTCTTCTTCTTTTGTCGAAGAACACTCTCTCTTTTGGATTGTAAATATCCCCTCTTGGGTGTATTCCCTCGTTGAGGAAGTTGGTGTATTGGTTTCCGTTGTCAATTAA

mRNA sequence

ATGGAAGCTTCGAATTCGGACACAATGCCATCTAACCATAAGTGGAAGCGCTTCTCGAAGCTGGATGGGCCACCTAAAGCTAAAATTACAGCTTGGAGGCTCATTCGACGAGGCTTGAGGGAGAACCAAACGAGTCATGAAGGTTGGCGTCCTACTCCAGAAGGATGCTGGAAACTCAACACCGACGCCTCCTGGAACGAAGCGGCCTCACAAGGGGGATTGGGTTGGACAATTCGTGACTCTCGGGTTCTCTCATCTGCGCAGGAATTAAAAAAACTCAAACGCCAATTACCAATTAAATATCTCGAAGGGAAAGCCATCCTTGAAGGCCATCACATTTATCTGAAGATATCTAAGGAACATGCACAACGTTTGATGGTGGAATCTGACTTTGTAGAAGTAATTAAGGTGCTAAATGTTGAAGCGTTTGATCTCTCCGAATTGAATGACATCGCAAACGAAATTCACTCAATAGTTGGCGATGTTGGTATAATTTCCTTCACCAAATGTCCCAGATCGGGCAACCAATCGACCCAAAAGTTGGCGAGAGTAGCTGCTTTCAATTTCCTTTTGGAAGAAAACGCCTCTTCTTCTTTTGTCGAAGAACACTCTCTCTTTTGGATTGTAAATATCCCCTCTTGGGTGTATTCCCTCGTTGAGGAAGTTGGTGTATTGGTTTCCGTTGTCAATTAA

Coding sequence (CDS)

ATGGAAGCTTCGAATTCGGACACAATGCCATCTAACCATAAGTGGAAGCGCTTCTCGAAGCTGGATGGGCCACCTAAAGCTAAAATTACAGCTTGGAGGCTCATTCGACGAGGCTTGAGGGAGAACCAAACGAGTCATGAAGGTTGGCGTCCTACTCCAGAAGGATGCTGGAAACTCAACACCGACGCCTCCTGGAACGAAGCGGCCTCACAAGGGGGATTGGGTTGGACAATTCGTGACTCTCGGGTTCTCTCATCTGCGCAGGAATTAAAAAAACTCAAACGCCAATTACCAATTAAATATCTCGAAGGGAAAGCCATCCTTGAAGGCCATCACATTTATCTGAAGATATCTAAGGAACATGCACAACGTTTGATGGTGGAATCTGACTTTGTAGAAGTAATTAAGGTGCTAAATGTTGAAGCGTTTGATCTCTCCGAATTGAATGACATCGCAAACGAAATTCACTCAATAGTTGGCGATGTTGGTATAATTTCCTTCACCAAATGTCCCAGATCGGGCAACCAATCGACCCAAAAGTTGGCGAGAGTAGCTGCTTTCAATTTCCTTTTGGAAGAAAACGCCTCTTCTTCTTTTGTCGAAGAACACTCTCTCTTTTGGATTGTAAATATCCCCTCTTGGGTGTATTCCCTCGTTGAGGAAGTTGGTGTATTGGTTTCCGTTGTCAATTAA

Protein sequence

MEASNSDTMPSNHKWKRFSKLDGPPKAKITAWRLIRRGLRENQTSHEGWRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAILEGHHIYLKISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVGIISFTKCPRSGNQSTQKLARVAAFNFLLEENASSSFVEEHSLFWIVNIPSWVYSLVEEVGVLVSVVN
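The protein sequence can be reproduced from the CDS by reading it codon by codon. The following is a minimal sketch in plain Python, assuming the standard genetic code applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, not a function of any particular library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_terminator=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_terminator:
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nucleotides of the CDS listed above.
print(translate("ATGGAAGCTTCGAATTCGGACACAATG"))  # MEASNSDTM
```

Passing the full CDS string from this page to `translate` should yield the protein sequence shown above, ending at the terminal TAA stop codon.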
Homology
BLAST of Tan0004509 vs. NCBI nr
Match: XP_022156777.1 (uncharacterized protein LOC111023608 [Momordica charantia])

HSP 1 Score: 83.6 bits (205), Expect = 2.5e-12
Identity = 50/150 (33.33%), Positives = 78/150 (52.00%), Query Frame = 0

Query: 49  WRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAIL 108
           W+P     WKLNTDA+W    + GG+GW +RD +      + + ++ +  I YLE  AI 
Sbjct: 79  WKPPTSNSWKLNTDAAWRADTNTGGIGWILRDEKGEVIKADCRIIRTERNITYLEVMAIC 138

Query: 109 EGHHIYLK-----ISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVG 168
           EG     +     I +EH + + +ESD +E I +L+ +  D +E+  +  EI  ++ D+ 
Sbjct: 139 EGLRAIRQEHCRPIQQEHCRPIHLESDSLEAIHLLHRQCQDQTEIIWLLEEIWQMMEDMK 198

Query: 169 IISFTKCPRSGNQSTQKLARVAAFNFLLEE 194
           I+S     R  N+    LAR A  N L EE
Sbjct: 199 IVSMRHISREANKVAHDLARRAMENDLREE 228

BLAST of Tan0004509 vs. NCBI nr
Match: XP_022143535.1 (uncharacterized protein LOC111013412 [Momordica charantia])

HSP 1 Score: 82.8 bits (203), Expect = 4.2e-12
Identity = 55/160 (34.38%), Positives = 84/160 (52.50%), Query Frame = 0

Query: 34  LIRRGLRENQTSHEGWRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKL 93
           LIRR   E+ T  + W+P     WKLNT+A+W    + GG+GW +RD +        + +
Sbjct: 67  LIRR--IEDNTGAQ-WKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASCRII 126

Query: 94  KRQLPIKYLEGKAILEGHHIYLKISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIAN 153
           + +  I YLE  AI EG      I +EH + + +ESD +E I +L+ +  D +E+  +  
Sbjct: 127 RAERNITYLEVMAICEG---LRAIRQEHCRPIHLESDSLEAIHLLHRQCQDQTEIIWLLE 186

Query: 154 EIHSIVGDVGIISFTKCPRSGNQSTQKLARVAAFNFLLEE 194
           EI  ++ D+ I+S     R  N+    LAR A  N L EE
Sbjct: 187 EIWQMMKDMEIVSMRHISREANKVAHGLARRAMENDLREE 220

BLAST of Tan0004509 vs. NCBI nr
Match: KAG6599977.1 (hypothetical protein SDJN03_05210, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 77.8 bits (190), Expect = 1.4e-10
Identity = 47/116 (40.52%), Positives = 72/116 (62.07%), Query Frame = 0

Query: 58  KLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAILEGHHIYLKI 117
           KLN+ ASW+E A +GG+ W I DS   S   E K+L+R+  +K LEGKA++EG   YL I
Sbjct: 7   KLNSYASWSEEAGKGGVSWVIHDSLGSSICIECKELRRKWLVKMLEGKAMIEGLKTYLLI 66

Query: 118 SK--EHAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVGIISFTKCP 172
            +   ++ R ++   F+E+ ++LN    DL EL+++ +EI  +    GIISF + P
Sbjct: 67  RETVSYSHRPII---FLELARILNNTHMDLIELSNVVDEIFDLELLAGIISFFQVP 119

BLAST of Tan0004509 vs. NCBI nr
Match: XP_038886170.1 (uncharacterized protein LOC120076417 [Benincasa hispida])

HSP 1 Score: 71.6 bits (174), Expect = 9.8e-09
Identity = 49/129 (37.98%), Positives = 64/129 (49.61%), Query Frame = 0

Query: 48  GWRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAI 107
           GW P+    WKLN DASWN      GLGW   D         LK + R   +  LE  AI
Sbjct: 68  GWHPSKPFYWKLNMDASWNSKIDACGLGWVFLDHLYRVCMAGLKFVLRCQKVNILEAIAI 127

Query: 108 LEGHHIYLKISKEHAQRLMVESDFV-EVIKVLNVEAFDLSELNDIANEIHSIVGDVGIIS 167
             G  I   +S      +MVESD + EVI +LN +  DLSE++  + E     G++G+IS
Sbjct: 128 CFGLEI---LSSIDISNIMVESDCLEEVINLLNDDVVDLSEVSFCSEEAKDRGGNLGVIS 187

Query: 168 FTKCPRSGN 176
           F+   R  N
Sbjct: 188 FSHVRRYRN 193

BLAST of Tan0004509 vs. NCBI nr
Match: XP_024042448.1 (uncharacterized protein LOC112099303 [Citrus clementina])

HSP 1 Score: 65.5 bits (158), Expect = 7.0e-07
Identity = 46/165 (27.88%), Positives = 79/165 (47.88%), Query Frame = 0

Query: 26  KAKITAWRLIRRGLR-----ENQTSHEGWRPTPEGCWKLNTDASWNEAASQGGLGWTIRD 85
           +A + ++R I  G       + Q   + W+P P GC+K+N DA+ N +  +GG+G  +RD
Sbjct: 66  EAVVESYRRINPGKNIALAGQQQNGQQVWKPPPPGCFKVNVDAATNLSKQRGGIGAVVRD 125

Query: 86  SRVLSSAQELKKLKRQLPIKYLEGKAILEGHHIYLKISKEHAQRLMVESDFVEVIKVLNV 145
           SR    A   ++   +  +  +E +A+L G  +  K +  H    ++ESD  EV++++  
Sbjct: 126 SRGDCVAAAAQRTTLKGNVADMEAEAVLLGIQVARKANCPH---FVIESDSKEVVELVLK 185

Query: 146 EAFDLSELNDIANEIHSIVGDVGIISFTKCPRSGNQSTQKLARVA 186
               L E++    EI   +      S    PR  N     LA+VA
Sbjct: 186 RKRSLVEISWNIEEIQECLKGQNTASILYVPRRCNVRAHNLAKVA 227

BLAST of Tan0004509 vs. ExPASy TrEMBL
Match: A0A6J1DSV1 (uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023608 PE=4 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 1.2e-12
Identity = 50/150 (33.33%), Positives = 78/150 (52.00%), Query Frame = 0

Query: 49  WRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAIL 108
           W+P     WKLNTDA+W    + GG+GW +RD +      + + ++ +  I YLE  AI 
Sbjct: 79  WKPPTSNSWKLNTDAAWRADTNTGGIGWILRDEKGEVIKADCRIIRTERNITYLEVMAIC 138

Query: 109 EGHHIYLK-----ISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVG 168
           EG     +     I +EH + + +ESD +E I +L+ +  D +E+  +  EI  ++ D+ 
Sbjct: 139 EGLRAIRQEHCRPIQQEHCRPIHLESDSLEAIHLLHRQCQDQTEIIWLLEEIWQMMEDMK 198

Query: 169 IISFTKCPRSGNQSTQKLARVAAFNFLLEE 194
           I+S     R  N+    LAR A  N L EE
Sbjct: 199 IVSMRHISREANKVAHDLARRAMENDLREE 228

BLAST of Tan0004509 vs. ExPASy TrEMBL
Match: A0A6J1CP26 (uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013412 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 2.1e-12
Identity = 55/160 (34.38%), Positives = 84/160 (52.50%), Query Frame = 0

Query: 34  LIRRGLRENQTSHEGWRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKL 93
           LIRR   E+ T  + W+P     WKLNT+A+W    + GG+GW +RD +        + +
Sbjct: 67  LIRR--IEDNTGAQ-WKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASCRII 126

Query: 94  KRQLPIKYLEGKAILEGHHIYLKISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIAN 153
           + +  I YLE  AI EG      I +EH + + +ESD +E I +L+ +  D +E+  +  
Sbjct: 127 RAERNITYLEVMAICEG---LRAIRQEHCRPIHLESDSLEAIHLLHRQCQDQTEIIWLLE 186

Query: 154 EIHSIVGDVGIISFTKCPRSGNQSTQKLARVAAFNFLLEE 194
           EI  ++ D+ I+S     R  N+    LAR A  N L EE
Sbjct: 187 EIWQMMKDMEIVSMRHISREANKVAHGLARRAMENDLREE 220

BLAST of Tan0004509 vs. ExPASy TrEMBL
Match: A0A6J1D4B6 (uncharacterized protein LOC111017181 OS=Momordica charantia OX=3673 GN=LOC111017181 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 2.2e-06
Identity = 41/139 (29.50%), Positives = 64/139 (46.04%), Query Frame = 0

Query: 48  GWRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSR-VLSSAQELKKLKRQLPIKYLEGKA 107
           GW P  +  WKLN DA+W ++   GGLGW +RDS      A+ LK L + L +       
Sbjct: 77  GWTPPAQHLWKLNVDATWMDSLHAGGLGWIVRDSEGRFIMAECLKALSQSLTL------- 136

Query: 108 ILEGHHIYLKISKEHAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVGIIS 167
                        E   ++ +ESD +EV+ ++N  +  L+E++ I  +I   +  + I  
Sbjct: 137 -------------EAGIKIEMESDCLEVVNIINKSSMVLTEVSLIVEDIWKEMESLPIEG 195

Query: 168 FTKCPRSGNQSTQKLARVA 186
           F   P   N     +AR A
Sbjct: 197 FKHLPMKANGVAHGIARRA 195

BLAST of Tan0004509 vs. ExPASy TrEMBL
Match: A0A6J1CQG0 (uncharacterized protein LOC111013216 OS=Momordica charantia OX=3673 GN=LOC111013216 PE=4 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 2.9e-06
Identity = 36/98 (36.73%), Positives = 51/98 (52.04%), Query Frame = 0

Query: 49  WRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAIL 108
           W   P  CWKLNTDASW+E    GG+GW + D R         K++ +  I  LE   I+
Sbjct: 105 WSAPPTNCWKLNTDASWSEEREVGGIGWILCDCRGEIVLAGNCKIREKKEINALELMTII 164

Query: 109 EGHHIYLKISKEHAQRLMVESDFVEVIKVLNVEAFDLS 147
            G      I+ +    + +ESD VEVI+++  E  DL+
Sbjct: 165 RGLQF---INMQSRSPIYLESDSVEVIRLMKKEDVDLT 199

BLAST of Tan0004509 vs. ExPASy TrEMBL
Match: A0A5B7BI33 (Uncharacterized protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_037968 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 3.7e-06
Identity = 53/168 (31.55%), Positives = 76/168 (45.24%), Query Frame = 0

Query: 49  WRPTPEGCWKLNTDASWNEAASQGGLGWTIRDSRVLSSAQELKKLKRQLPIKYLEGKAIL 108
           W P PE  +KLN D SW   +  GG+G  IRDSR L  A   K LK      Y E  A+ 
Sbjct: 214 WSPPPEDLFKLNVDGSWVPGSFSGGIGGVIRDSRGLVIAGFAKPLKWCGSADYAEACAMF 273

Query: 109 EGHHIYLKISKE-HAQRLMVESDFVEVIKVLNVEAFDLSELNDIANEIHSIVGDVGIISF 168
            G    +  +KE     +++ESD + ++  +   + D S +  I ++I    G VG+ SF
Sbjct: 274 FG----VAFAKEIGIVDVLIESDCLSLVNSVGCSSPDFSHIGHITDDIRR--GMVGLRSF 333

Query: 169 --TKCPRSGNQSTQKLARVAAFNFLLEENASSSFVEEHSLFWIVNIPS 214
                 RS NQ+  ++A  A                +  L WI N+PS
Sbjct: 334 QVRHVRRSANQTAHEIASFAR-------------DVDEELLWIENLPS 362

The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022156777.1 | 2.5e-12 | 33.33 | uncharacterized protein LOC111023608 [Momordica charantia]
XP_022143535.1 | 4.2e-12 | 34.38 | uncharacterized protein LOC111013412 [Momordica charantia]
KAG6599977.1 | 1.4e-10 | 40.52 | hypothetical protein SDJN03_05210, partial [Cucurbita argyrosperma subsp. sorori...
XP_038886170.1 | 9.8e-09 | 37.98 | uncharacterized protein LOC120076417 [Benincasa hispida]
XP_024042448.1 | 7.0e-07 | 27.88 | uncharacterized protein LOC112099303 [Citrus clementina]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DSV1 | 1.2e-12 | 33.33 | uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1CP26 | 2.1e-12 | 34.38 | uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A6J1D4B6 | 2.2e-06 | 29.50 | uncharacterized protein LOC111017181 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A6J1CQG0 | 2.9e-06 | 36.73 | uncharacterized protein LOC111013216 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A5B7BI33 | 3.7e-06 | 31.55 | Uncharacterized protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_037968...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 57..188; e-value: 1.4E-7; score: 33.5
IPR002156 | Ribonuclease H domain | PFAM | PF13456 | RVT_3 | coord: 60..183; e-value: 5.4E-9; score: 35.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..22
None | No IPR available | PANTHER | PTHR33033 | POLYNUCLEOTIDYL TRANSFERASE, RIBONUCLEASE H-LIKE SUPERFAMILY PROTEIN-RELATED | coord: 47..217
None | No IPR available | PANTHER | PTHR33033:SF67 | SUBFAMILY NOT NAMED | coord: 47..217
IPR044730 | Ribonuclease H-like domain, plant type | CDD | cd06222 | RNase_H_like | coord: 59..183; e-value: 4.99226E-10; score: 53.472
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 56..189
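Several of the InterPro sources place the RNase H domain at slightly different residue ranges. The following is a minimal sketch that extracts the `coord: start..end` values and computes the region on which all given predictions agree. It assumes the coordinates are 1-based, inclusive protein positions; `domain_spans` and `intersect` are hypothetical helper names.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def domain_spans(text):
    """Return every (start, end) pair found in 'coord: start..end' strings."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]

def intersect(spans):
    """Return the residue range shared by all spans, or None if they do not overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None

# RNase H-related predictions from the table above
# (GENE3D, PFAM, CDD and SUPERFAMILY rows).
annotations = """
coord: 57..188
coord: 60..183
coord: 59..183
coord: 56..189
"""

spans = domain_spans(annotations)
core = intersect(spans)
print(core)                    # (60, 183)
print(core[1] - core[0] + 1)   # 124
```

Taking the intersection is only one possible way to summarise overlapping predictions; a union or a per-source listing may be more appropriate depending on the question.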

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0004509.1 | Tan0004509.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0004523 | RNA-DNA hybrid ribonuclease activity