Tan0006189 (gene) Snake gourd v1

Overview
Name: Tan0006189
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Location: LG05: 9557054 .. 9557530 (-)
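
As a quick sanity check on the Location field, the span length can be computed from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `span_length` is a hypothetical helper name.

```python
# Hypothetical helper: length of a genomic interval, assuming
# 1-based inclusive coordinates (an assumption, not stated in the record).
def span_length(start: int, end: int) -> int:
    return end - start + 1

length = span_length(9557054, 9557530)
print(length)      # 477
print(length % 3)  # 0, i.e. a whole number of codons
```

Under that assumption the gene spans 477 bp, a multiple of three.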
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTTGGTGTGTTTGGAATGAAAGAAATAGGGATTTGATTCTCAATAAGACTAATGTCAGAGAATGTGTTAATGAATGGACCTATTGCCAAACGTATATTCAACCGTTTAGGGAATTTAATCTAAGAAGCCAAGACAAACAAAAAATACCCCCAATTCAGGCACAACCTAATTGGAAACCTCCTGGGTACCCTTTCTTCAAAATTAACACAGATGATGCATTAAACAGAGAGAATCAAAATTGTGGAATAGGCGTGGTAGTCAGAAATGAAAAGGGAGAGGTGATGCTTACTCTAGCTAAGTCGATTGTTGGAATCATGGAAATTGATGTCATCGAAGCGCTGGCAATTTGGAAGGGTTGCATATGGCAAAAGAGATGGGATTCCGGCAGGTTGAGGTTGAGTCGGATTCGACCAAGGTCATTCAGCTCCTACAACAAAATCGCCAAAACTTATCAGATCTGGGTCAGATTATAG

mRNA sequence

ATGTGTTGGTGTGTTTGGAATGAAAGAAATAGGGATTTGATTCTCAATAAGACTAATGTCAGAGAATGTGTTAATGAATGGACCTATTGCCAAACGTATATTCAACCGTTTAGGGAATTTAATCTAAGAAGCCAAGACAAACAAAAAATACCCCCAATTCAGGCACAACCTAATTGGAAACCTCCTGGGTACCCTTTCTTCAAAATTAACACAGATGATGCATTAAACAGAGAGAATCAAAATTGTGGAATAGGCGTGGTAGTCAGAAATGAAAAGGGAGAGGTGATGCTTACTCTAGCTAAGTCGATTGTTGGAATCATGGAAATTGATGTCATCGAAGCGCTGGCAATTTGGAAGGGTTGCATATGGCAAAAGAGATGGGATTCCGGCAGGTTGAGGTTGAGTCGGATTCGACCAAGGTCATTCAGCTCCTACAACAAAATCGCCAAAACTTATCAGATCTGGGTCAGATTATAG

Coding sequence (CDS)

ATGTGTTGGTGTGTTTGGAATGAAAGAAATAGGGATTTGATTCTCAATAAGACTAATGTCAGAGAATGTGTTAATGAATGGACCTATTGCCAAACGTATATTCAACCGTTTAGGGAATTTAATCTAAGAAGCCAAGACAAACAAAAAATACCCCCAATTCAGGCACAACCTAATTGGAAACCTCCTGGGTACCCTTTCTTCAAAATTAACACAGATGATGCATTAAACAGAGAGAATCAAAATTGTGGAATAGGCGTGGTAGTCAGAAATGAAAAGGGAGAGGTGATGCTTACTCTAGCTAAGTCGATTGTTGGAATCATGGAAATTGATGTCATCGAAGCGCTGGCAATTTGGAAGGGTTGCATATGGCAAAAGAGATGGGATTCCGGCAGGTTGAGGTTGAGTCGGATTCGACCAAGGTCATTCAGCTCCTACAACAAAATCGCCAAAACTTATCAGATCTGGGTCAGATTATAG

Protein sequence

MCWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKPPGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGCIWQKRWDSGRLRLSRIRPRSFSSYNKIAKTYQIWVRL
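
The relationship between the coding sequence and the protein sequence above can be checked by translating codon by codon. The sketch below assumes the standard genetic code and builds the codon table in plain Python; `translate` is a hypothetical helper, not a tool used by the database. Only short fragments taken from the start and end of the CDS are translated here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGTGTTGGTGTGTTTGGAATGAAAGAAAT"))  # MCWCVWNERN
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("GTCAGATTATAG"))  # VRL
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.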
Homology
BLAST of Tan0006189 vs. NCBI nr
Match: TXG48260.1 (hypothetical protein EZV62_027554 [Acer yangbiense])

HSP 1 Score: 76.6 bits (187), Expect = 2.1e-10
Identity = 47/121 (38.84%), Positives = 66/121 (54.55%), Query Frame = 0

Query: 3   WCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKPP 62
           W +W +RN+ +  N+      V EW +  +Y+  +R    R  D  K+   +  P WKPP
Sbjct: 11  WRIWYKRNQFVHSNEIPNDIEVLEWAW--SYLHDYRVAFAR--DNPKLTKEREPPKWKPP 70

Query: 63  GYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGCI 122
               FKINTD ALN  +   GI VV+RN +G VM +L + I    +  +IEA+AI KG I
Sbjct: 71  DRGVFKINTDAALNENDFQFGISVVIRNYQGHVMASLCQIIKASYQPQIIEAMAILKG-I 126

Query: 123 W 124
           W
Sbjct: 131 W 126
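
The Identity and Positives percentages in each HSP header are simple ratios over the alignment length. A minimal sketch using the figures from the hit above (`percent` is a hypothetical helper name):

```python
# Percentage of aligned positions, rounded to two decimals
# as in the HSP header lines.
def percent(count: int, aligned_length: int) -> float:
    return round(100 * count / aligned_length, 2)

print(percent(47, 121))  # 38.84 (identical residues)
print(percent(66, 121))  # 54.55 (identical or conservatively substituted)
```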

BLAST of Tan0006189 vs. NCBI nr
Match: XP_030969909.1 (uncharacterized protein LOC115990203 [Quercus lobata])

HSP 1 Score: 72.0 bits (175), Expect = 5.1e-09
Identity = 39/123 (31.71%), Positives = 70/123 (56.91%), Query Frame = 0

Query: 3   WCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKPP 62
           W +WN RN     N  +  +C +     +  ++ ++E    S  ++K P     P W PP
Sbjct: 63  WSLWNNRN-----NVRHGGQCKSHEVIAREAVEYWKEVQTTSPTQEKTPAPDEHP-WAPP 122

Query: 63  GYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSI---VGIMEIDVIEALAIWK 122
              ++K+NTD A+  + + CGIGVV+RNE+G++M  ++K++   +G++E   IEA A+ +
Sbjct: 123 KQGWYKVNTDGAIFEDIKCCGIGVVIRNERGQLMEAMSKNVELPLGVLE---IEAKAVEE 176

BLAST of Tan0006189 vs. NCBI nr
Match: KAF3966504.1 (hypothetical protein CMV_009401 [Castanea mollissima])

HSP 1 Score: 71.2 bits (173), Expect = 8.8e-09
Identity = 40/89 (44.94%), Positives = 54/89 (60.67%), Query Frame = 0

Query: 45  QDKQKIPPIQAQP---NWKPPGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAK 104
           Q KQ   P Q +P   NW+PP    +KIN D A+  E+   GIG VVRNE+GEVM +LA+
Sbjct: 183 QTKQTDRPTQNKPAIQNWRPPPKDTYKINYDSAVFSESDEAGIGAVVRNERGEVMASLAE 242

Query: 105 SIV-GIMEIDVIEALAIWKGCIWQKRWDS 130
            IV     ++VIEA+   +  +WQ+ W S
Sbjct: 243 KIVMPSGGVEVIEAMTARRATLWQQSWFS 271

BLAST of Tan0006189 vs. NCBI nr
Match: KAF4382136.1 (hypothetical protein G4B88_009426 [Cannabis sativa])

HSP 1 Score: 68.6 bits (166), Expect = 5.7e-08
Identity = 40/129 (31.01%), Positives = 58/129 (44.96%), Query Frame = 0

Query: 1   MCWCVWNERNRDLILNKTNVRECVNEW--TYCQTYIQPFREFNLRSQDKQKIPPIQAQPN 60
           + W +W  RN  +  +K         W      T+++P        Q  +KIP IQ   +
Sbjct: 92  LTWSIWQRRNSFVFKHKIIDERIWTSWDLDLISTHLEP-------HQQTRKIPAIQPNSS 151

Query: 61  WKPPGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIW 120
           W PP   FF INTD +LN   Q C I  V+R+  G +++     I G + + + EA AI 
Sbjct: 152 WIPPPQKFFLINTDASLNSGQQGCAISAVIRDPNGVLVVAETTYIPGCLSVLLAEATAIL 211

Query: 121 KGCIWQKRW 128
            G     RW
Sbjct: 212 LGIQLAIRW 213

BLAST of Tan0006189 vs. NCBI nr
Match: XP_030483626.1 (uncharacterized protein LOC115700202 [Cannabis sativa])

HSP 1 Score: 67.8 bits (164), Expect = 9.7e-08
Identity = 37/126 (29.37%), Positives = 59/126 (46.83%), Query Frame = 0

Query: 3   WCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQ-PNWKP 62
           W +W++RN    ++   V+  +  WT    Y+  +R     +         QA   +WKP
Sbjct: 2   WFIWSDRNN--FIHGKKVKTPLQMWTQSVAYMDQYRSITSAATPAASNCTSQASVASWKP 61

Query: 63  PGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGC 122
           P    FK+N D AL+      GIGV++RN  G+V   L+   +G  +   +EA A+  G 
Sbjct: 62  PPENTFKLNVDAALDSSRSKIGIGVIIRNSAGQVKAALSTPAIGNFKSQEMEAKAMSVGL 121

Query: 123 IWQKRW 128
            W K +
Sbjct: 122 SWAKTY 125

BLAST of Tan0006189 vs. ExPASy TrEMBL
Match: A0A5C7GUI2 (RNase H domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_027554 PE=4 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 1.0e-10
Identity = 47/121 (38.84%), Positives = 66/121 (54.55%), Query Frame = 0

Query: 3   WCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKPP 62
           W +W +RN+ +  N+      V EW +  +Y+  +R    R  D  K+   +  P WKPP
Sbjct: 11  WRIWYKRNQFVHSNEIPNDIEVLEWAW--SYLHDYRVAFAR--DNPKLTKEREPPKWKPP 70

Query: 63  GYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGCI 122
               FKINTD ALN  +   GI VV+RN +G VM +L + I    +  +IEA+AI KG I
Sbjct: 71  DRGVFKINTDAALNENDFQFGISVVIRNYQGHVMASLCQIIKASYQPQIIEAMAILKG-I 126

Query: 123 W 124
           W
Sbjct: 131 W 126

BLAST of Tan0006189 vs. ExPASy TrEMBL
Match: A0A2N9F7C1 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10641 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 2.9e-10
Identity = 37/124 (29.84%), Positives = 69/124 (55.65%), Query Frame = 0

Query: 2    CWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKP 61
            CW +WN+RN D   +     +    WT  Q+ +  +   N   + ++  PP   Q  W+ 
Sbjct: 1470 CWLIWNKRNHD--RHHPPSEQYSQLWTRAQSVLHEYLAVNTEEKAQKPKPP---QARWRL 1529

Query: 62   PGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGC 121
            P   ++K+N D A+ +++ + GIGVV+R+  G+V+ TL++ + G   +++IEALA  +  
Sbjct: 1530 PVNHYYKMNFDGAIFKDSNSGGIGVVIRDNTGQVIATLSQKVFGTHTVEMIEALAARRAI 1588

Query: 122  IWQK 126
            I+ +
Sbjct: 1590 IFAR 1588

BLAST of Tan0006189 vs. ExPASy TrEMBL
Match: A0A2N9J3G2 (Fe2OG dioxygenase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59614 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 2.9e-10
Identity = 37/124 (29.84%), Positives = 69/124 (55.65%), Query Frame = 0

Query: 2    CWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKP 61
            CW +WN+RN D   +     +    WT  Q+ +  +   N   + ++  PP   Q  W+ 
Sbjct: 1296 CWLIWNKRNHD--RHHPPSEQYSQLWTRAQSVLHEYLAVNTEEKAQKPKPP---QARWRL 1355

Query: 62   PGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGC 121
            P   ++K+N D A+ +++ + GIGVV+R+  G+V+ TL++ + G   +++IEALA  +  
Sbjct: 1356 PVNHYYKMNFDGAIFKDSNSGGIGVVIRDNTGQVIATLSQKVFGTHTVEMIEALAARRAI 1414

Query: 122  IWQK 126
            I+ +
Sbjct: 1416 IFAR 1414

BLAST of Tan0006189 vs. ExPASy TrEMBL
Match: A0A2N9GPD6 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29161 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 2.9e-10
Identity = 37/124 (29.84%), Positives = 69/124 (55.65%), Query Frame = 0

Query: 2    CWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKP 61
            CW +WN+RN D   +     +    WT  Q+ +  +   N   + ++  PP   Q  W+ 
Sbjct: 1437 CWLIWNKRNHD--RHHPPSEQYSQLWTRAQSVLHEYLAVNTEEKAQKPKPP---QARWRL 1496

Query: 62   PGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGC 121
            P   ++K+N D A+ +++ + GIGVV+R+  G+V+ TL++ + G   +++IEALA  +  
Sbjct: 1497 PVNHYYKMNFDGAIFKDSNSGGIGVVIRDNTGQVIATLSQKVFGTHTVEMIEALAARRAI 1555

Query: 122  IWQK 126
            I+ +
Sbjct: 1557 IFAR 1555

BLAST of Tan0006189 vs. ExPASy TrEMBL
Match: A0A2N9J7E4 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61034 PE=4 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 3.8e-10
Identity = 39/124 (31.45%), Positives = 69/124 (55.65%), Query Frame = 0

Query: 2    CWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKP 61
            CW +WN+RN D     ++    +  WT  Q  +Q +       + +++ PP   Q  W+ 
Sbjct: 1434 CWLLWNKRNHDRHHPPSDQYSQI--WTRAQIVLQEYLAVTTEEKAEKQTPP---QTRWRL 1493

Query: 62   PGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGC 121
            P   ++K+N D A+ +E+ + GIGVV+R+  G  + TL++ + GI  +++IEALA  +  
Sbjct: 1494 PVTNYYKMNFDGAIFKESNSGGIGVVIRDHTGMAIATLSQKVHGIHTVEMIEALAARRAI 1552

Query: 122  IWQK 126
            I+ K
Sbjct: 1554 IFAK 1552

BLAST of Tan0006189 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 53.9 bits (128), Expect = 1.3e-07
Identity = 23/45 (51.11%), Positives = 32/45 (71.11%), Query Frame = 0

Query: 59  WKPPGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSI 104
           W+PP + + K NTD   NR+N+ CGIG V+RNEKGEV    A+++
Sbjct: 420 WRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARAL 464

BLAST of Tan0006189 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 51.6 bits (122), Expect = 6.7e-07
Identity = 31/107 (28.97%), Positives = 50/107 (46.73%), Query Frame = 0

Query: 1   MCWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPN-- 60
           + W +W  RN  +   K       +     +  ++ F E++ R + + K    Q + N  
Sbjct: 80  LLWRLWKSRNELMFKGKE-----YDAPEVLRRAMEDFEEWSTRRELEGKASGPQVERNLS 139

Query: 61  --WKPPGYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSI 104
             WK P Y + K NTD     EN  CGIG ++RNE G V+   A+++
Sbjct: 140 VQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMGARAL 181

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value   Identity (%)   Description
TXG48260.1       2.1e-10   38.84          hypothetical protein EZV62_027554 [Acer yangbiense]
XP_030969909.1   5.1e-09   31.71          uncharacterized protein LOC115990203 [Quercus lobata]
KAF3966504.1     8.8e-09   44.94          hypothetical protein CMV_009401 [Castanea mollissima]
KAF4382136.1     5.7e-08   31.01          hypothetical protein G4B88_009426 [Cannabis sativa]
XP_030483626.1   9.7e-08   29.37          uncharacterized protein LOC115700202 [Cannabis sativa]

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A5C7GUI2       1.0e-10   38.84          RNase H domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_027554 ...
A0A2N9F7C1       2.9e-10   29.84          Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS10641 PE=4 SV=1
A0A2N9J3G2       2.9e-10   29.84          Fe2OG dioxygenase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_L...
A0A2N9GPD6       2.9e-10   29.84          RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29161 ...
A0A2N9J7E4       3.8e-10   31.45          Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61034 PE=4 SV=1

TAIR 10
Match Name       E-value   Identity (%)   Description
AT4G29090.1      1.3e-07   51.11          Ribonuclease H-like superfamily protein
AT2G34320.1      6.7e-07   28.97          Polynucleotidyl transferase, ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term    IPR Description         Source    Source Term   Source Description      Alignment
IPR002156   Ribonuclease H domain   PFAM      PF13456       RVT_3                   coord: 70..120; e-value: 1.4E-5; score: 24.9
None        No IPR available        PANTHER   PTHR47074     BNAC02G40300D PROTEIN   coord: 56..137
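
The Pfam RVT_3 hit is reported at coord 70..120 on the protein. The sketch below extracts that region from the protein sequence listed under Sequences, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). `slice_1based` is a hypothetical helper name.

```python
# Protein sequence of Tan0006189, copied from the Sequences section.
PROTEIN = (
    "MCWCVWNERNRDLILNKTNVRECVNEWTYCQTYIQPFREFNLRSQDKQKIPPIQAQPNWKPP"
    "GYPFFKINTDDALNRENQNCGIGVVVRNEKGEVMLTLAKSIVGIMEIDVIEALAIWKGCI"
    "WQKRWDSGRLRLSRIRPRSFSSYNKIAKTYQIWVRL"
)

def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end], assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

domain = slice_1based(PROTEIN, 70, 120)
print(len(domain))  # 51
print(domain)
```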

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0006189.1    Tan0006189.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity