Tan0014073 (gene) Snake gourd v1

Overview
NameTan0014073
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
LocationLG04: 5485272 .. 5485769 (-)
RNA-Seq ExpressionTan0014073
SyntenyTan0014073
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAACTTCCCGGTTTTCCAATTTTAAATTAAATACTGATGCAACGATCTTTAGAAATGAGAATCGAAGTGGAATGGGTGCAGTGATACGAGATGAAAATGAGAATGTGATGTTTACCGTGACCAAACCTATCCCATGGATTATAGAAGTGGTGACTGTTGAAGCAATTGCGGTTCGAGATGTGTTGATTATGGCAAGGGAACTGGGATTTTATCAGCTAAAAGTGGAAACAGACTCATCGTTGGTGATCAACCTCATACATCAAAATCGTCAAAATCAGTCGGAGTTAGGGTACATAATCGAAGAAATAAAGGAAATGGCAAGGAAGATGAAGAATTGTACTTTCTCATGGTGTGATCGAAGATCTAATGCCCTAGCACATCCCCTAGCAAGACATGCAAGCAATCTGACGGAAGAGACGGCATGGATGGAACGAGAAGCAAGTTTTATTTCCTTTAATTTTTATTGTATTTCATTATTCATAAAAAAAGGGTAA

mRNA sequence

ATGGAAACTTCCCGGTTTTCCAATTTTAAATTAAATACTGATGCAACGATCTTTAGAAATGAGAATCGAAGTGGAATGGGTGCAGTGATACGAGATGAAAATGAGAATGTGATGTTTACCGTGACCAAACCTATCCCATGGATTATAGAAGTGGTGACTGTTGAAGCAATTGCGGTTCGAGATGTGTTGATTATGGCAAGGGAACTGGGATTTTATCAGCTAAAAGTGGAAACAGACTCATCGTTGGTGATCAACCTCATACATCAAAATCGTCAAAATCAGTCGGAGTTAGGGTACATAATCGAAGAAATAAAGGAAATGGCAAGGAAGATGAAGAATTGTACTTTCTCATGGTGTGATCGAAGATCTAATGCCCTAGCACATCCCCTAGCAAGACATGCAAGCAATCTGACGGAAGAGACGGCATGGATGGAACGAGAAGCAAGTTTTATTTCCTTTAATTTTTATTGTATTTCATTATTCATAAAAAAAGGGTAA

Coding sequence (CDS)

ATGGAAACTTCCCGGTTTTCCAATTTTAAATTAAATACTGATGCAACGATCTTTAGAAATGAGAATCGAAGTGGAATGGGTGCAGTGATACGAGATGAAAATGAGAATGTGATGTTTACCGTGACCAAACCTATCCCATGGATTATAGAAGTGGTGACTGTTGAAGCAATTGCGGTTCGAGATGTGTTGATTATGGCAAGGGAACTGGGATTTTATCAGCTAAAAGTGGAAACAGACTCATCGTTGGTGATCAACCTCATACATCAAAATCGTCAAAATCAGTCGGAGTTAGGGTACATAATCGAAGAAATAAAGGAAATGGCAAGGAAGATGAAGAATTGTACTTTCTCATGGTGTGATCGAAGATCTAATGCCCTAGCACATCCCCTAGCAAGACATGCAAGCAATCTGACGGAAGAGACGGCATGGATGGAACGAGAAGCAAGTTTTATTTCCTTTAATTTTTATTGTATTTCATTATTCATAAAAAAAGGGTAA

Protein sequence

METSRFSNFKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMARELGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHPLARHASNLTEETAWMEREASFISFNFYCISLFIKKG
Homology
BLAST of Tan0014073 vs. NCBI nr
Match: XP_023887924.1 (uncharacterized protein LOC112000051 [Quercus suber])

HSP 1 Score: 99.8 bits (247), Expect = 2.4e-17
Identity = 56/139 (40.29%), Postives = 83/139 (59.71%), Query Frame = 0

Query: 7   SNFKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMA 66
           S +KLN DA IF + N SG GA+IR+E   VM  ++   P  +     EA+A R  +  A
Sbjct: 14  STYKLNFDAAIFADLNCSGFGAIIRNEEGEVMAGMSVKGPLSLNSAEAEALACRRAVQFA 73

Query: 67  RELGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNAL 126
            E GF +L +E D++LV+N I  + +N S LG+I E+I+ + R ++  + S   R  N +
Sbjct: 74  LEAGFSRLVIEGDNALVMNAISSSAENNSLLGHIFEDIQHLVRGLQYVSISCIKRDGNMV 133

Query: 127 AHPLARHASNLTEETAWME 146
           AH LARHA  +++E  WME
Sbjct: 134 AHSLARHARTISDEMYWME 152

BLAST of Tan0014073 vs. NCBI nr
Match: XP_023885847.1 (keratin, type I cytoskeletal 13-like [Quercus suber])

HSP 1 Score: 99.8 bits (247), Expect = 2.4e-17
Identity = 56/139 (40.29%), Postives = 83/139 (59.71%), Query Frame = 0

Query: 7   SNFKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMA 66
           S +KLN DA IF + N SG GA+IR+E   VM  ++   P  +     EA+A R  +  A
Sbjct: 148 STYKLNFDAAIFADLNCSGFGAIIRNEEGEVMAGMSVKGPLSLNSAEAEALACRKAVQFA 207

Query: 67  RELGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNAL 126
            E GF +L +E D++LV+N I  + +N S LG+I E+I+ + R ++  + S   R  N +
Sbjct: 208 LEAGFSRLVIEGDNALVMNAISSSAENNSLLGHIFEDIQHLIRGLQYVSISCIKRDGNMV 267

Query: 127 AHPLARHASNLTEETAWME 146
           AH LARHA  +++E  WME
Sbjct: 268 AHSLARHARTISDEMYWME 286

BLAST of Tan0014073 vs. NCBI nr
Match: XP_023899877.1 (uncharacterized protein LOC112011763 [Quercus suber])

HSP 1 Score: 98.6 bits (244), Expect = 5.4e-17
Identity = 56/139 (40.29%), Postives = 83/139 (59.71%), Query Frame = 0

Query: 7   SNFKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMA 66
           S +KLN DA IF + N SG GA+IR+E   VM  ++   P  +     EA+A R  +  A
Sbjct: 43  STYKLNFDAAIFADLNCSGFGAIIRNEEGEVMAGMSVKGPLSLNSAEAEALACRRAVHFA 102

Query: 67  RELGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNAL 126
            E GF +L +E D++LV+N I  + +N S LG+I E+I+ + R ++  + S   R  N +
Sbjct: 103 LEAGFSRLVIEGDNALVMNAISCSAENNSLLGHIFEDIQHLVRGLQYVSISCIKRDGNMV 162

Query: 127 AHPLARHASNLTEETAWME 146
           AH LARHA  +++E  WME
Sbjct: 163 AHSLARHARTISDEMYWME 181

BLAST of Tan0014073 vs. NCBI nr
Match: XP_023881690.1 (uncharacterized protein LOC111994061 [Quercus suber])

HSP 1 Score: 98.2 bits (243), Expect = 7.0e-17
Identity = 51/137 (37.23%), Postives = 81/137 (59.12%), Query Frame = 0

Query: 9   FKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMARE 68
           FKLN DA +F   N SG+G ++R+    VM  ++   P +      EA+A R  +  A +
Sbjct: 3   FKLNFDAAVFDGTNSSGVGVIVRNSLGEVMAGLSARGPAVANSEEAEALACRKAVEFAMD 62

Query: 69  LGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAH 128
            GF  L +E D++ V+  I  +R ++S LG+I ++I+ +A + ++C F    R +NA+AH
Sbjct: 63  AGFMDLVIEGDNAAVMKAITSSRLDRSRLGHIYDDIRTLAARFRSCNFGCVKRSANAVAH 122

Query: 129 PLARHASNLTEETAWME 146
            LAR ASNL +E  W+E
Sbjct: 123 SLARFASNLVDELVWLE 139

BLAST of Tan0014073 vs. NCBI nr
Match: XP_023928118.1 (uncharacterized protein LOC112039474 [Quercus suber])

HSP 1 Score: 97.8 bits (242), Expect = 9.1e-17
Identity = 54/139 (38.85%), Postives = 86/139 (61.87%), Query Frame = 0

Query: 7   SNFKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMA 66
           S FKLN DA +F   + SG+GA+IR+EN  VM T+   +P +++ V  E IA R  +  A
Sbjct: 160 STFKLNFDAVVFSKLSCSGVGAMIRNENGEVMATMLARVPHVVDSVVAEVIACRRAMKFA 219

Query: 67  RELGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNAL 126
            + GF  L VE D+  V+  +  +  + S LG+II++IK + R  +  +FS+  R +N++
Sbjct: 220 CKAGFTDLVVEGDNLSVMKSLTTSETDLSWLGHIIQDIKWLTRSFRRVSFSYVRRAANSV 279

Query: 127 AHPLARHASNLTEETAWME 146
           A+ LAR+A ++ E+  WME
Sbjct: 280 AYGLARYAKDIHEDMYWME 298

BLAST of Tan0014073 vs. ExPASy TrEMBL
Match: A0A2N9I6L8 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49538 PE=4 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 8.3e-16
Identity = 44/136 (32.35%), Postives = 84/136 (61.76%), Query Frame = 0

Query: 10   KLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMAREL 69
            K+N D  +F     +G+G VIRDE    M ++++ +P+      VEA+A+R  + +A E+
Sbjct: 1396 KVNYDGAVFLETMEAGLGVVIRDELGRPMVSLSQKVPFPGSSTAVEALALRRAMFLAIEM 1455

Query: 70   GFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHP 129
            GFY + VE DS +++  +     + +  G+++E+++ + ++M +C F++  R+ N +AHP
Sbjct: 1456 GFYSVIVEGDSEMLVRAVTSLGGSATVYGHVVEDVQYLTQQMTHCEFTYVRRQLNQIAHP 1515

Query: 130  LARHASNLTEETAWME 146
            LAR A+++ +   WME
Sbjct: 1516 LARRANSVYDFATWME 1531

BLAST of Tan0014073 vs. ExPASy TrEMBL
Match: A0A2N9EZZ2 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12168 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 2.1e-14
Identity = 44/137 (32.12%), Postives = 79/137 (57.66%), Query Frame = 0

Query: 9   FKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMARE 68
           +K+N D  +F+  N +G+G ++RD    VM ++T+ + + + V ++EA AV+  +  A E
Sbjct: 319 YKINYDGAVFKETNEAGIGVIVRDSQGLVMASLTQKVLFPLSVPSIEAWAVKRSIQFALE 378

Query: 69  LGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAH 128
           +G  + + E DS  ++N ++    + +  G ++ + KE+A K++N +FS   R  N LAH
Sbjct: 379 IGITEAEFEGDSQTIVNALNAQHPSLAPFGLLLADAKELASKLQNFSFSHVKREGNRLAH 438

Query: 129 PLARHASNLTEETAWME 146
            LAR A +      WME
Sbjct: 439 ALARKAHSCNSLEIWME 455

BLAST of Tan0014073 vs. ExPASy TrEMBL
Match: A0A2N9FY90 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19997 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 2.1e-14
Identity = 47/136 (34.56%), Postives = 79/136 (58.09%), Query Frame = 0

Query: 10  KLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMAREL 69
           K+N D  +F   N+ G+G +IR+E    M  +++ IP+      +EA+A+R  L++A E+
Sbjct: 366 KVNYDGAVFSEANKGGIGVIIRNEMGLPMIALSQKIPYPGSSAVMEALALRRALLLAIEM 425

Query: 70  GFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHP 129
           GF  + +E DS +V+          S  G+II +++++A +M  C FS   R++N +AH 
Sbjct: 426 GFQSVVMEGDSEMVVREASMWGAFLSSYGHIIADVQQLAAQMDVCVFSHTRRQANQVAHA 485

Query: 130 LARHASNLTEETAWME 146
           LAR A N+ +   WME
Sbjct: 486 LARRACNVLDYETWME 501

BLAST of Tan0014073 vs. ExPASy TrEMBL
Match: A0A2N9INS6 (RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS53651 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 2.1e-14
Identity = 44/137 (32.12%), Postives = 79/137 (57.66%), Query Frame = 0

Query: 9   FKLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMARE 68
           +K+N D  +F+  N +G+G ++RD    VM ++T+ + + + V ++EA AV+  +  A E
Sbjct: 8   YKINYDGAVFKETNEAGIGVIVRDSQGLVMASLTQKVLFPLSVPSIEAWAVKRSIQFALE 67

Query: 69  LGFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAH 128
           +G  + + E DS  ++N ++    + +  G ++ + KE+A K++N +FS   R  N LAH
Sbjct: 68  IGITEAEFEGDSQTIVNALNAQHPSLAPFGLLLADAKELASKLQNFSFSHVKREGNRLAH 127

Query: 129 PLARHASNLTEETAWME 146
            LAR A +      WME
Sbjct: 128 ALARKAHSCNSLEIWME 144

BLAST of Tan0014073 vs. ExPASy TrEMBL
Match: A0A7N2KZZ0 (RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 2.1e-14
Identity = 50/136 (36.76%), Postives = 79/136 (58.09%), Query Frame = 0

Query: 10  KLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMAREL 69
           K+N D  IF  ++ +G+  V+RD+   V+ ++++ IP    V  VE IA R  L++AREL
Sbjct: 160 KVNFDGAIFSTQSSAGLAMVVRDQAGLVLASLSQKIPLSTSVEIVEVIAARRALLLAREL 219

Query: 70  GFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHP 129
           GF ++ VE DS ++I  I +     S+LG+I+E+I+ ++R   + +F    R  N +AH 
Sbjct: 220 GFERVMVEGDSEIIIKAIKEKALPSSDLGHILEDIRVLSRSFNSISFHHIKRMGNCVAHH 279

Query: 130 LARHASNLTEETAWME 146
           LA H S       WME
Sbjct: 280 LA-HRSFCNPLLVWME 294

BLAST of Tan0014073 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 48.5 bits (114), Expect = 5.9e-06
Identity = 33/125 (26.40%), Postives = 63/125 (50.40%), Query Frame = 0

Query: 10  KLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMAREL 69
           K NTDAT  R+  R G+G V+R+E   V +   + +P +  V+  E  A+R  ++     
Sbjct: 429 KCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSVLEAELEAMRWAVLSLSRF 488

Query: 70  GFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHP 129
            +  +  E+DS ++I +++ N +    L   I++++ +  +     F +  R  N LA  
Sbjct: 489 QYNYVIFESDSQVLIEILN-NDEIWPSLKPTIQDLQRLLSQFTEVKFVFIPREGNTLAER 548

Query: 130 LARHA 135
           +AR +
Sbjct: 549 VARES 552

BLAST of Tan0014073 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 45.8 bits (107), Expect = 3.8e-05
Identity = 30/125 (24.00%), Postives = 63/125 (50.40%), Query Frame = 0

Query: 10  KLNTDATIFRNENRSGMGAVIRDENENVMFTVTKPIPWIIEVVTVEAIAVRDVLIMAREL 69
           K NTDAT      R G+G ++R+E+  V++   + +P    V+  E  A+R  ++     
Sbjct: 146 KCNTDATWQLENPRCGIGWILRNESGGVLWMGARALPRTKNVLEAELEALRWAVLTMSRF 205

Query: 70  GFYQLKVETDSSLVINLIHQNRQNQSELGYIIEEIKEMARKMKNCTFSWCDRRSNALAHP 129
            + ++  E+D+  ++NL++ +      L   +E+I+++    +   F +  R  N +A  
Sbjct: 206 NYKRIIFESDAQALVNLLNSD-DFWPTLQPALEDIQQLLHHFEEVKFEFTPRGGNKVADR 265

Query: 130 LARHA 135
           +AR +
Sbjct: 266 IARES 269

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023887924.12.4e-1740.29uncharacterized protein LOC112000051 [Quercus suber][more]
XP_023885847.12.4e-1740.29keratin, type I cytoskeletal 13-like [Quercus suber][more]
XP_023899877.15.4e-1740.29uncharacterized protein LOC112011763 [Quercus suber][more]
XP_023881690.17.0e-1737.23uncharacterized protein LOC111994061 [Quercus suber][more]
XP_023928118.19.1e-1738.85uncharacterized protein LOC112039474 [Quercus suber][more]
Match NameE-valueIdentityDescription
A0A2N9I6L88.3e-1632.35Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9EZZ22.1e-1432.12Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS12168 PE=4 SV=1[more]
A0A2N9FY902.1e-1434.56RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19997 ... [more]
A0A2N9INS62.1e-1432.12RNase H domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS53651 ... [more]
A0A7N2KZZ02.1e-1436.76RNase H domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G29090.15.9e-0626.40Ribonuclease H-like superfamily protein [more]
AT2G34320.13.8e-0524.00Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 7..142
e-value: 1.6E-19
score: 72.2
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 12..134
e-value: 9.2E-25
score: 86.9
NoneNo IPR availablePANTHERPTHR47723OS05G0353850 PROTEINcoord: 4..145
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 11..132
e-value: 4.66836E-21
score: 81.2064
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 9..137

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0014073.1Tan0014073.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity