Tan0000563 (gene) Snake gourd v1

Overview
Name: Tan0000563
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Location: LG01: 8778435 .. 8779866 (-)
RNA-Seq Expression: Tan0000563
Synteny: Tan0000563
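
As a minimal sketch, the Location field can be split into its parts and the genomic span computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG01: 8778435 .. 8779866 (-)'
    into (sequence id, start, end, strand)."""
    m = re.fullmatch(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG01: 8778435 .. 8779866 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1432 positions on the minus strand of LG01.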
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGCATCCGAAAAAGTTGGAGGTTACCACTGACCCCTCTGAATATGCTCCTCATTAGATGTCTGACCTGTGTTTCCTATTCCTTTCGTGAAATCATTAATTTGACTTTGTGTAGTATGATAGAACTTTCCCCAATCAGTAACATCGCCATTTTATACTTTCATATTACGCCACGACCAAATTCTCCATAAAATAATCAAGAAGATATTCAGATCTTTTGGTTGTCCATTCTCCAACGTTCACATGAAAAAATCTATTGCATCCCATGATAGAAGAGAAATACAAGCTAACATTAGAGGGCAAGAACAGACATCTATGAAGTTAAGTCATCATCTTAATATTGTTGAAATCAAGCACATCATGTTGCCATCTTCTAGTAGGAATTTAATTAAAGACATCTATGAAGTTTTTAAGCTTACAAAATCTAATAATCTCTAATTGAAAAGTTGGAAGTTATGGAAAGGATATTATGGAATTGTTATGCGACTATGACCATGCAAAAAGGGAGAAAGTTGCAATAAAGAAAGACTTATCAATAAAATTGTTTCTAATATAAATAAGCACAATTTAACTACATGAAAATTTTGAAGGGCATCTAAGCAATTAGGTCAACATCATAATTTTTTTTTATCGCAACATTTTCCCTAGTTTGAATGTCAATAGTTGTACAAATACAAACAACTCCATAACTTCTAAAACTTCTAATAAAAATGAATAAAGACCTGTCAGATTCCAAAAGCTCGAAAACTACATATATGTATTTGAATTCCCTTTTAAAAGTTGGCGCTACGATTGGCTCTAAGTCGAGTCGTGGATTCGGATCCTTCTAGTCTTCTCTCAGGCCAGGTTGACATATCTCCTTCTAGCCCATCTACCTCTGTCCCTTTTATCTCCACAAGGGGTTCCCCCAGTTGGGTGAACACCATTTTGGTTGCTTCGGGGCGTTGGAAGCCGCCCAACTCGAGTCAGTGGAAGTTGAATACTGATGCATCCTGGTCACCAAAATGGAACCGCGGAGGCTTGGACTGGTTAGTTCGTGACGAGTTTGGTTCTCCCATAATCAGCGGATGCAAAGTGGTTACTACCAAATGGTCGATCAAAGTTCTTGAAGCTGCTGCTGTGGTGGAACGTCTTGAAAGCATCCTCTATTTGCAGCTTTTGTGTGTTCCTCCCGTGGAGGTAGAGTTGGATTCTTCAAAAGTGGGTGTTCTTCTGAACCAACAAGAAGTTGACTTGTCAGAAGTGAGGTTGTTTATTGTGGAAGCTTTGGCTTTAGCTTCGATTGTAAATGTGGCTTCTTTTTGTAAAGTTCCGAGAAAGGAGAACCAAGCGGCCCACGAGCTGCCGCTTTGGCATCTTCTTTTGGGCATTTGGACTGGAAAGAGGGGTTTTCGGCTGATGTATTATCCTCCATCCTTGAGATGGAATAG

mRNA sequence

ATGTTGCATCCGAAAAAGTTGGAGTTGGCGCTACGATTGGCTCTAAGTCGAGTCGTGGATTCGGATCCTTCTAGTCTTCTCTCAGGCCAGGTTGACATATCTCCTTCTAGCCCATCTACCTCTGTCCCTTTTATCTCCACAAGGGGTTCCCCCAGTTGGGTGAACACCATTTTGGTTGCTTCGGGGCGTTGGAAGCCGCCCAACTCGAGTCAGTGGAAGTTGAATACTGATGCATCCTGGTCACCAAAATGGAACCGCGGAGGCTTGGACTGGTTAGTTCGTGACGAGTTTGGTTCTCCCATAATCAGCGGATGCAAAGTGGTTACTACCAAATGGTCGATCAAAGTTCTTGAAGCTGCTGCTGTGGTGGAACGTCTTGAAAGCATCCTCTATTTGCAGCTTTTGTGTGTTCCTCCCGTGGAGGTAGAGTTGGATTCTTCAAAAGTGGGTGTTCTTCTGAACCAACAAGAAGTTGACTTGTCAGAAGTGAGTTCCGAGAAAGGAGAACCAAGCGGCCCACGAGCTGCCGCTTTGGCATCTTCTTTTGGGCATTTGGACTGGAAAGAGGGGTTTTCGGCTGATGTATTATCCTCCATCCTTGAGATGGAATAG

Coding sequence (CDS)

ATGTTGCATCCGAAAAAGTTGGAGTTGGCGCTACGATTGGCTCTAAGTCGAGTCGTGGATTCGGATCCTTCTAGTCTTCTCTCAGGCCAGGTTGACATATCTCCTTCTAGCCCATCTACCTCTGTCCCTTTTATCTCCACAAGGGGTTCCCCCAGTTGGGTGAACACCATTTTGGTTGCTTCGGGGCGTTGGAAGCCGCCCAACTCGAGTCAGTGGAAGTTGAATACTGATGCATCCTGGTCACCAAAATGGAACCGCGGAGGCTTGGACTGGTTAGTTCGTGACGAGTTTGGTTCTCCCATAATCAGCGGATGCAAAGTGGTTACTACCAAATGGTCGATCAAAGTTCTTGAAGCTGCTGCTGTGGTGGAACGTCTTGAAAGCATCCTCTATTTGCAGCTTTTGTGTGTTCCTCCCGTGGAGGTAGAGTTGGATTCTTCAAAAGTGGGTGTTCTTCTGAACCAACAAGAAGTTGACTTGTCAGAAGTGAGTTCCGAGAAAGGAGAACCAAGCGGCCCACGAGCTGCCGCTTTGGCATCTTCTTTTGGGCATTTGGACTGGAAAGAGGGGTTTTCGGCTGATGTATTATCCTCCATCCTTGAGATGGAATAG

Protein sequence

MLHPKKLELALRLALSRVVDSDPSSLLSGQVDISPSSPSTSVPFISTRGSPSWVNTILVASGRWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAVVERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSEVSSEKGEPSGPRAAALASSFGHLDWKEGFSADVLSSILEME
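
The relationship between the coding sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard genetic code (the page does not name the translation table used), and `translate` is a hypothetical helper. Only short excerpts from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)

# First 24 nt and last 15 nt of the CDS listed above.
print(translate("ATGTTGCATCCGAAAAAGTTGGAG"))
print(translate("CTTGAGATGGAATAG"))
```

The two excerpts translate to the first eight and last four residues of the protein sequence shown above.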
Homology
BLAST of Tan0000563 vs. NCBI nr
Match: XP_022156777.1 (uncharacterized protein LOC111023608 [Momordica charantia])

HSP 1 Score: 80.1 bits (196), Expect = 2.4e-11
Identity = 44/108 (40.74%), Positives = 64/108 (59.26%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N GG+ W++RDE G  I + C+++ T+ +I  LE  A+
Sbjct: 78  RWKPPTSNSWKLNTDAAWRADTNTGGIGWILRDEKGEVIKADCRIIRTERNITYLEVMAI 137

Query: 123 VERLESILYLQLLCVP-------PVEVELDSSKVGVLLNQQEVDLSEV 164
            E L +I   Q  C P       P+ +E DS +   LL++Q  D +E+
Sbjct: 138 CEGLRAI--RQEHCRPIQQEHCRPIHLESDSLEAIHLLHRQCQDQTEI 183
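
The percentages on the HSP statistics line follow directly from the match counts and the alignment length (aligned columns, gaps included). A minimal sketch, with the hypothetical helper `hsp_percentages` pulling the two fractions off the line and recomputing them:

```python
import re

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the 'n/length'
    fractions on a BLAST HSP statistics line, rounded to 2 decimals."""
    fractions = re.findall(r"(\d+)/(\d+)", stats_line)
    if len(fractions) < 2:
        raise ValueError("expected identity and positives fractions")
    return tuple(round(100 * int(n) / int(total), 2) for n, total in fractions[:2])

line = "Identity = 44/108 (40.74%), Positives = 64/108 (59.26%), Query Frame = 0"
print(hsp_percentages(line))
```

For this HSP the recomputed values agree with the percentages printed in parentheses.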

BLAST of Tan0000563 vs. NCBI nr
Match: XP_022155262.1 (uncharacterized protein LOC111022403 [Momordica charantia])

HSP 1 Score: 76.6 bits (187), Expect = 2.7e-10
Identity = 38/100 (38.00%), Positives = 65/100 (65.00%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           +W+PP    W LN DASWS   +RGG+ W++R   G  +++G + V    ++K+LEA+A+
Sbjct: 70  KWEPPPMHIWTLNADASWSDSTHRGGIGWIIRSWDGDIVLAGNRFVEACNNVKLLEASAI 129

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSE 163
           +E L ++  L +L   P+ +E DS++V  LLN++  DL++
Sbjct: 130 LEGLRNLTNLGVL--RPLHIETDSAEVESLLNRKREDLTK 167

BLAST of Tan0000563 vs. NCBI nr
Match: XP_022143535.1 (uncharacterized protein LOC111013412 [Momordica charantia])

HSP 1 Score: 75.1 bits (183), Expect = 7.8e-10
Identity = 40/101 (39.60%), Positives = 62/101 (61.39%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           +WKPP S+ WKLNT+A+W    N GG+ W++RDE G  I + C+++  + +I  LE  A+
Sbjct: 78  QWKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASCRIIRAERNITYLEVMAI 137

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSEV 164
            E L +I   Q  C  P+ +E DS +   LL++Q  D +E+
Sbjct: 138 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHRQCQDQTEI 175

BLAST of Tan0000563 vs. NCBI nr
Match: XP_022154990.1 (uncharacterized protein LOC111022134 isoform X1 [Momordica charantia])

HSP 1 Score: 74.3 bits (181), Expect = 1.3e-09
Identity = 39/93 (41.94%), Positives = 57/93 (61.29%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N  G+ W++RDE G  I +GC+++  + +I  LE  A+
Sbjct: 254 RWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEKGEVIKTGCRIIRAERNITYLEVMAI 313

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQ 156
            E L +I   Q  C  P+ +E DS +   LL++
Sbjct: 314 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHR 343

BLAST of Tan0000563 vs. NCBI nr
Match: XP_022154991.1 (uncharacterized protein LOC111022134 isoform X2 [Momordica charantia])

HSP 1 Score: 74.3 bits (181), Expect = 1.3e-09
Identity = 39/93 (41.94%), Positives = 57/93 (61.29%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N  G+ W++RDE G  I +GC+++  + +I  LE  A+
Sbjct: 219 RWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEKGEVIKTGCRIIRAERNITYLEVMAI 278

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQ 156
            E L +I   Q  C  P+ +E DS +   LL++
Sbjct: 279 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHR 308

BLAST of Tan0000563 vs. ExPASy TrEMBL
Match: A0A6J1DSV1 (uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023608 PE=4 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 1.2e-11
Identity = 44/108 (40.74%), Positives = 64/108 (59.26%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N GG+ W++RDE G  I + C+++ T+ +I  LE  A+
Sbjct: 78  RWKPPTSNSWKLNTDAAWRADTNTGGIGWILRDEKGEVIKADCRIIRTERNITYLEVMAI 137

Query: 123 VERLESILYLQLLCVP-------PVEVELDSSKVGVLLNQQEVDLSEV 164
            E L +I   Q  C P       P+ +E DS +   LL++Q  D +E+
Sbjct: 138 CEGLRAI--RQEHCRPIQQEHCRPIHLESDSLEAIHLLHRQCQDQTEI 183

BLAST of Tan0000563 vs. ExPASy TrEMBL
Match: A0A6J1DNV9 (uncharacterized protein LOC111022403 OS=Momordica charantia OX=3673 GN=LOC111022403 PE=4 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.3e-10
Identity = 38/100 (38.00%), Positives = 65/100 (65.00%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           +W+PP    W LN DASWS   +RGG+ W++R   G  +++G + V    ++K+LEA+A+
Sbjct: 70  KWEPPPMHIWTLNADASWSDSTHRGGIGWIIRSWDGDIVLAGNRFVEACNNVKLLEASAI 129

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSE 163
           +E L ++  L +L   P+ +E DS++V  LLN++  DL++
Sbjct: 130 LEGLRNLTNLGVL--RPLHIETDSAEVESLLNRKREDLTK 167

BLAST of Tan0000563 vs. ExPASy TrEMBL
Match: A0A6J1CP26 (uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013412 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 3.8e-10
Identity = 40/101 (39.60%), Positives = 62/101 (61.39%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           +WKPP S+ WKLNT+A+W    N GG+ W++RDE G  I + C+++  + +I  LE  A+
Sbjct: 78  QWKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASCRIIRAERNITYLEVMAI 137

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSEV 164
            E L +I   Q  C  P+ +E DS +   LL++Q  D +E+
Sbjct: 138 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHRQCQDQTEI 175

BLAST of Tan0000563 vs. ExPASy TrEMBL
Match: A0A6J1DL64 (uncharacterized protein LOC111022134 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022134 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 6.4e-10
Identity = 39/93 (41.94%), Positives = 57/93 (61.29%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N  G+ W++RDE G  I +GC+++  + +I  LE  A+
Sbjct: 254 RWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEKGEVIKTGCRIIRAERNITYLEVMAI 313

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQ 156
            E L +I   Q  C  P+ +E DS +   LL++
Sbjct: 314 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHR 343

BLAST of Tan0000563 vs. ExPASy TrEMBL
Match: A0A6J1DQC9 (uncharacterized protein LOC111022134 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022134 PE=4 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 6.4e-10
Identity = 39/93 (41.94%), Positives = 57/93 (61.29%), Query Frame = 0

Query: 63  RWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAAV 122
           RWKPP S+ WKLNTDA+W    N  G+ W++RDE G  I +GC+++  + +I  LE  A+
Sbjct: 219 RWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEKGEVIKTGCRIIRAERNITYLEVMAI 278

Query: 123 VERLESILYLQLLCVPPVEVELDSSKVGVLLNQ 156
            E L +I   Q  C  P+ +E DS +   LL++
Sbjct: 279 CEGLRAI--RQEHC-RPIHLESDSLEAIHLLHR 308

BLAST of Tan0000563 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein)

HSP 1 Score: 43.5 bits (101), Expect = 2.3e-04
Identity = 33/97 (34.02%), Positives = 47/97 (48.45%), Query Frame = 0

Query: 62  GRWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAAA 121
           GRW+PP     K NTDA+W+    R G+ W++R+E G     G + +    S  VLEA  
Sbjct: 418 GRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKS--VLEAEL 477

Query: 122 VVERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEV 159
              R  ++L L       V  E DS  +  +LN  E+
Sbjct: 478 EAMRW-AVLSLSRFQYNYVIFESDSQVLIEILNNDEI 511

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022156777.1  2.4e-11  40.74         uncharacterized protein LOC111023608 [Momordica charantia]
XP_022155262.1  2.7e-10  38.00         uncharacterized protein LOC111022403 [Momordica charantia]
XP_022143535.1  7.8e-10  39.60         uncharacterized protein LOC111013412 [Momordica charantia]
XP_022154990.1  1.3e-09  41.94         uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]
XP_022154991.1  1.3e-09  41.94         uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1DSV1      1.2e-11  40.74         uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DNV9      1.3e-10  38.00         uncharacterized protein LOC111022403 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1CP26      3.8e-10  39.60         uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A6J1DL64      6.4e-10  41.94         uncharacterized protein LOC111022134 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DQC9      6.4e-10  41.94         uncharacterized protein LOC111022134 isoform X2 OS=Momordica charantia OX=3673 G...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G29090.1     2.3e-04  34.02         Ribonuclease H-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description        Source  Source Term  Source Description  Alignment
IPR002156  Ribonuclease H domain  PFAM    PF13456      RVT_3               coord: 75..162; e-value: 1.6E-5; score: 24.7
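
The alignment coordinates of the Pfam hit can be mapped back onto the protein sequence of this gene. The sketch below assumes the coordinates 75..162 are 1-based and inclusive residue positions (the page does not state the convention), and `slice_region` is a hypothetical helper.

```python
# Protein sequence of Tan0000563 as listed on this page.
PROTEIN = (
    "MLHPKKLELALRLALSRVVDSDPSSLLSGQVDISPSSPSTSVPFISTRGSPSWVNTILVA"
    "SGRWKPPNSSQWKLNTDASWSPKWNRGGLDWLVRDEFGSPIISGCKVVTTKWSIKVLEAA"
    "AVVERLESILYLQLLCVPPVEVELDSSKVGVLLNQQEVDLSEVSSEKGEPSGPRAAALAS"
    "SFGHLDWKEGFSADVLSSILEME"
)

def slice_region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# Region matched by PF13456 (RVT_3) according to the InterPro table.
domain = slice_region(PROTEIN, 75, 162)
print(len(domain), domain)
```

Under that assumption the matched region is 88 residues long and overlaps most of the segment aligned in the BLAST hits (query positions from about 62 onward).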

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0000563.1  Tan0000563.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity