Tan0000935.1 (mRNA) Snake gourd v1

Overview
Name: Tan0000935.1
Type: mRNA
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Location: LG11:59197814..59198848 (+)
Sequence length: 954 bp
RNA-Seq Expression: Tan0000935.1
Synteny: Tan0000935.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGTGTGTGTGTGGAGAGTTCTGAATAATGTAATTCCCACCAACCTAAATATCATAAACAAAAATATTGACACTAATCCTTATTGTTTTTTCTGCAGGGGAGTCGTTGAGTCAACTAGTCATGCCTTATGGAATTGCAGGTTTGTTAAGAAAATTTGGAGTCTTTTCTGTCCTTACTTATCCAGACTCTTGATGATTGCAGGGACTGGTGGCAACCATGGGATTGCTGGCTTAGGGTAAAAGCAGAGATGGGAACTAAAGATTTATCAAAAGCAGTTGCTATCATTTGGAGTTTATGGAACTGCTATCAAAAGCAGTTGCTATCAAAAGCAGTTGCTATCATTTGGAGTTTATGGAATTGCTATCAAAGCTGACTTCAGTACAATTTGCTCACATATTGATCACATCCTAACTGAATTAGAAGGAATTTCGAGAAATTACCAGGAGTTTGAGAGTTGTGAGAACCAAGCGAGTCAAGGACGTTGGTTGCTGCCACCGGAAGGAAAATGGAAGTTGAATTGTGATGCCACCTGGAATGAGAAGCTTAAAGTGGGTGGCATTGGTTGGGTAATCCGTGACTCTCTAGGGTCTCCGATGGGGGCAGGCTATAAAAAAATTCAACGGAATTGGCGAATTAAATTCATAGAGGCAGAAGCGATTAAAGCAGGGCTTATGTATATTGTTGAATTCTTACCTCTGCCAAGAGCTCCGGTTGTCGTTGAATCAGATTCGGCGGAAATCATCAGCGGCCTCAACCGAATTAGAATTGATCTCACCGAAACCAAGGATCTCATAGAAAAAATTTGGGTTTTTGCGAAGCAGGTTGGAAGTGTCTCCTTCCACAAGTGCCATAGATCGAGGAATTCGCTTGCTCACAAGTTAGCGCGCAAGGGCGCGTTGTTTTCCTCTTTTTGTATCCCACATTCTTCTTCCTTCTGTGAAGAAGAGGGGGGGTTTTTGTCCGTTCCTTACCCTTTGTGGGTTACCTTTGCTTTGAAGGAGGATTTTGGTTGTAACAACCCATTGTTTTAA

mRNA sequence

ATGAAAGTGTGTGTGTGGAGAGTTCTGAATAATGTAATTCCCACCAACCTAAATATCATAAACAAAAATATTGACACTAATCCTTATTGTTTTTTCTGCAGGGGAGTCGTTGAGTCAACTAGTCATGCCTTATGGAATTGCAGGTTTGTTAAGAAAATTTGGAGTCTTTTCTGTCCTTACTTATCCAGACTCTTGATGATTGCAGGGACTGGTGGCAACCATGGGATTGCTGGCTTAGGTTGCTATCAAAAGCAGTTGCTATCATTTGGAGTTTATGGAATTGCTATCAAAGCTGACTTCAGTACAATTTGCTCACATATTGATCACATCCTAACTGAATTAGAAGGAATTTCGAGAAATTACCAGGAGTTTGAGAGTTGTGAGAACCAAGCGAGTCAAGGACGTTGGTTGCTGCCACCGGAAGGAAAATGGAAGTTGAATTGTGATGCCACCTGGAATGAGAAGCTTAAAGTGGGTGGCATTGGTTGGGTAATCCGTGACTCTCTAGGGTCTCCGATGGGGGCAGGCTATAAAAAAATTCAACGGAATTGGCGAATTAAATTCATAGAGGCAGAAGCGATTAAAGCAGGGCTTATGTATATTGTTGAATTCTTACCTCTGCCAAGAGCTCCGGTTGTCGTTGAATCAGATTCGGCGGAAATCATCAGCGGCCTCAACCGAATTAGAATTGATCTCACCGAAACCAAGGATCTCATAGAAAAAATTTGGGTTTTTGCGAAGCAGGTTGGAAGTGTCTCCTTCCACAAGTGCCATAGATCGAGGAATTCGCTTGCTCACAAGTTAGCGCGCAAGGGCGCGTTGTTTTCCTCTTTTTGTATCCCACATTCTTCTTCCTTCTGTGAAGAAGAGGGGGGGTTTTTGTCCGTTCCTTACCCTTTGTGGGTTACCTTTGCTTTGAAGGAGGATTTTGGTTGTAACAACCCATTGTTTTAA

Coding sequence (CDS)

ATGAAAGTGTGTGTGTGGAGAGTTCTGAATAATGTAATTCCCACCAACCTAAATATCATAAACAAAAATATTGACACTAATCCTTATTGTTTTTTCTGCAGGGGAGTCGTTGAGTCAACTAGTCATGCCTTATGGAATTGCAGGTTTGTTAAGAAAATTTGGAGTCTTTTCTGTCCTTACTTATCCAGACTCTTGATGATTGCAGGGACTGGTGGCAACCATGGGATTGCTGGCTTAGGTTGCTATCAAAAGCAGTTGCTATCATTTGGAGTTTATGGAATTGCTATCAAAGCTGACTTCAGTACAATTTGCTCACATATTGATCACATCCTAACTGAATTAGAAGGAATTTCGAGAAATTACCAGGAGTTTGAGAGTTGTGAGAACCAAGCGAGTCAAGGACGTTGGTTGCTGCCACCGGAAGGAAAATGGAAGTTGAATTGTGATGCCACCTGGAATGAGAAGCTTAAAGTGGGTGGCATTGGTTGGGTAATCCGTGACTCTCTAGGGTCTCCGATGGGGGCAGGCTATAAAAAAATTCAACGGAATTGGCGAATTAAATTCATAGAGGCAGAAGCGATTAAAGCAGGGCTTATGTATATTGTTGAATTCTTACCTCTGCCAAGAGCTCCGGTTGTCGTTGAATCAGATTCGGCGGAAATCATCAGCGGCCTCAACCGAATTAGAATTGATCTCACCGAAACCAAGGATCTCATAGAAAAAATTTGGGTTTTTGCGAAGCAGGTTGGAAGTGTCTCCTTCCACAAGTGCCATAGATCGAGGAATTCGCTTGCTCACAAGTTAGCGCGCAAGGGCGCGTTGTTTTCCTCTTTTTGTATCCCACATTCTTCTTCCTTCTGTGAAGAAGAGGGGGGGTTTTTGTCCGTTCCTTACCCTTTGTGGGTTACCTTTGCTTTGAAGGAGGATTTTGGTTGTAACAACCCATTGTTTTAA

Protein sequence

MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPYLSRLLMIAGTGGNHGIAGLGCYQKQLLSFGVYGIAIKADFSTICSHIDHILTELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSFCIPHSSSFCEEEGGFLSVPYPLWVTFALKEDFGCNNPLF
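The correspondence between the coding sequence and the protein sequence above can be spot-checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first ten codons of the CDS are used as sample input.

```python
# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGAAAGTGTGTGTGTGGAGAGTTCTGAAT"
print(translate(cds_start))
```

Passing the full CDS string instead of `cds_start` should give the complete polypeptide up to the terminal stop codon, which can then be compared with the protein sequence shown above.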
Homology
BLAST of Tan0000935.1 vs. NCBI nr
Match: GAY61101.1 (hypothetical protein CUMW_207140, partial [Citrus unshiu])

HSP 1 Score: 104.8 bits (260), Expect = 1.4e-18
Identity = 73/278 (26.26%), Positives = 130/278 (46.76%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIW--SLF- 60
           +++ VWR   N++P++ N+  + I   P C  C+  +E+  HAL +C+  KK+W  SLF 
Sbjct: 162 IRIFVWRAAKNLLPSDENLWKRKIVQEPTCQLCKMGIENVFHALVDCKAAKKVWRLSLFD 221

Query: 61  -----CPYLSRLLMIAGTGGNHGIAGLGCYQKQLLSFGVYGIAIKADFSTICSHIDHILT 120
                 P    L ++ G       A +  +   L  +  +    +  F     +   ++ 
Sbjct: 222 NDIQAAPGQDILSLLHGVKRMRSNADVDLFAAML--WAKWNARNQWLFKGKRENPQSVVA 281

Query: 121 ELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLGSP 180
           + E +   Y+  +   ++ +Q  W  P EG  K+N DA  N +  + G+G VIRD  G  
Sbjct: 282 KAEAVMEAYKRVQPSADKVAQLGWNPPQEGFVKINTDAATNSEKNLAGLGAVIRDENGQV 341

Query: 181 MGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRIDL 240
                K  + +  + + EAEA++ GL    +        V++ESDS E++S +N  +   
Sbjct: 342 TATAIKVSKFHGSVAYAEAEAMEWGLQVAKD---AHVKDVIMESDSQEVVSLVNNRQGSR 401

Query: 241 TETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLAR 271
           +E   ++ +I    +    VS    HRS N++AH LA+
Sbjct: 402 SEIYWVVLEIQKLKESFDHVSCVYTHRSCNAIAHSLAK 434
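The percentages in the HSP header are the matching-position counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for this HSP (73 identical and 130 similar positions over 278 aligned columns); the helper name `percent` is hypothetical:

```python
def percent(count: int, alignment_length: int) -> str:
    """Return count / alignment_length as a percentage with two decimals."""
    return f"{100 * count / alignment_length:.2f}"


# Counts taken from the HSP header of the GAY61101.1 match.
identical, similar, aligned = 73, 130, 278
print(f"Identity  = {percent(identical, aligned)}%")
print(f"Positives = {percent(similar, aligned)}%")
```

The same calculation applies to every HSP header on this page, since each one reports its counts in the `matches/length` form.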

BLAST of Tan0000935.1 vs. NCBI nr
Match: XP_042962672.1 (uncharacterized protein LOC122296942 [Carya illinoinensis])

HSP 1 Score: 101.3 bits (251), Expect = 1.6e-17
Identity = 75/289 (25.95%), Positives = 127/289 (43.94%), Query Frame = 0

Query: 1    MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPY 60
            MKV  WR     +PT LN+  K++  +  C  C   +E T+HAL+ C  V+ +W +FC  
Sbjct: 1030 MKVFAWRACQEKLPTFLNLKKKHVLEDATCGLCNQGMEDTAHALFFCSEVRSVWGVFCSQ 1089

Query: 61   LSRLLMIAGTGGNHGIAGLGCYQKQL-----LSFGVYGIAIKADFSTICSHIDHILTELE 120
            +  +           +A +      L     +++G++    K  +  I  HI   +    
Sbjct: 1090 MDNIQTALSFWDLANLARVRGSDSLLARFIAITWGLWYRRNKRIYEDISIHIHVSVNNAL 1149

Query: 121  GISRNYQEFE----SCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLGS 180
             + + Y + +    S +      RW  PP    KLN D     +  V GIG V+RD  G 
Sbjct: 1150 SLQQEYAQVQLFDGSNQKITKVVRWHPPPNDFLKLNIDGATFPEHSVAGIGVVLRDQYGE 1209

Query: 181  PMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRID 240
             + A  K  +     +FIEA A+  GL    ++  +P+  +++E+D   +++ LN   + 
Sbjct: 1210 VIVACSKVEKEVSSAEFIEAVALLRGLQLCAQW-GVPK--IMLETDCLVLVNALNENSVC 1269

Query: 241  LTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSFCI 281
            LT+   +++ I         V     +R  N +AH+LAR   L    CI
Sbjct: 1270 LTDIAFILQDIRRLMVGFQEVQVVHVNRLGNLVAHRLARHAWLIDDICI 1315

BLAST of Tan0000935.1 vs. NCBI nr
Match: PRQ45142.1 (putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-containing protein [Rosa chinensis])

HSP 1 Score: 100.9 bits (250), Expect = 2.1e-17
Identity = 74/280 (26.43%), Positives = 127/280 (45.36%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPY-CFFCRGVVESTSHALWNCRFVKKIWSLFCP 60
           +KV VWR+++ ++PT L ++++++      C FC+   ES+ H    C  ++  W L   
Sbjct: 32  IKVLVWRLVHGIVPTRLALLSRHLHIQDVACVFCKSTNESSLHVFKECDALQCFWRLGPL 91

Query: 61  YLSRLLMIAGTGGNHGIAGLGCYQKQLLSF------GVYGIAIKADFSTICSHIDHIL-- 120
            L      AG   N     L       + F       V+    K  ++  C    H++  
Sbjct: 92  KLKAKEQAAGDLKNWLFDVLDMLNSNQVDFFFMALWSVWTERNKIVWNDGCFQPMHMIQW 151

Query: 121 --TELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSL 180
             + LE   + +Q+  +C+ +    +W  PP G+ K+N D  +     VGGIG V+RD L
Sbjct: 152 CTSSLEEFQKYHQKV-ACKKKRPLTKWQCPPRGRLKINIDGAFQVDSGVGGIGVVVRDDL 211

Query: 181 GSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIR 240
           G+ + A  +          +EAEA +AGL+  +       + VV+ESDSA +I+ L    
Sbjct: 212 GTGIAAIARPFLHAHSAINMEAEACRAGLLLGIH---QGWSDVVIESDSALLIAALKSEE 271

Query: 241 IDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLA 270
            + +E   + +    +     SV     +R  N +AH+LA
Sbjct: 272 DNFSEVSRVFDDCKDYLSAFQSVEIRHIYREANGVAHRLA 307

BLAST of Tan0000935.1 vs. NCBI nr
Match: XP_006491472.1 (uncharacterized protein LOC102626455 [Citrus sinensis])

HSP 1 Score: 100.5 bits (249), Expect = 2.7e-17
Identity = 75/290 (25.86%), Positives = 125/290 (43.10%), Query Frame = 0

Query: 1    MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPY 60
            +K+ +WR L N++PT  N+  +     P C  C+  VE+ SH L  C+  +KIW L    
Sbjct: 1145 VKIFMWRALKNILPTAENLWKRRSLQEPICQRCKLQVETVSHVLIECKAARKIWDL---- 1204

Query: 61   LSRLLMIAGTGGNHGIAGLGCYQKQLLSFG--------VYGIAI------------KADF 120
                 +I     +H        Q+              VY   I            K+D 
Sbjct: 1205 ---APLIVQPSKDHNQDFFSAIQEMWSRSSTAEAELMIVYCWVIWSARNKFIFEGKKSDS 1264

Query: 121  STICSHIDHILTELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGG 180
              + +  D +L   + +S+      + +    Q +W  P +   KLN DA  + K +  G
Sbjct: 1265 RFLAAKADSVLKAYQRVSKPGNVHGAKDRGIDQQKWKPPSQNVLKLNVDAAVSTKDQKVG 1324

Query: 181  IGWVIRDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAE 240
            +G ++RD+ G  +  G K+ Q   R+   EAEAI  GL    +   +  + ++VESD  E
Sbjct: 1325 LGAIVRDAEGKILAVGIKQAQFRERVSLAEAEAIHWGLQVANQ---ISSSSLIVESDCKE 1384

Query: 241  IISGLNRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLAR 271
            ++  LN  +   TE   ++  +   +K+   V F    R+ N+ AH LA+
Sbjct: 1385 VVELLNNTKGSRTEIHWILSDVRRESKEFKQVQFSFIPRTCNTYAHALAK 1424

BLAST of Tan0000935.1 vs. NCBI nr
Match: XP_024956542.1 (uncharacterized protein LOC112498908 [Citrus sinensis])

HSP 1 Score: 98.6 bits (244), Expect = 1.0e-16
Identity = 84/326 (25.77%), Positives = 140/326 (42.94%), Query Frame = 0

Query: 1    MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIW--SLF- 60
            +++ VWR   N++P+  N+  + I   P C  C+  +E+  HAL +C+  KK+W  SLF 
Sbjct: 821  IRIFVWRAAKNLLPSAENLWKRKIVQEPTCQLCKMGIENVFHALVDCKAAKKVWRLSLFY 880

Query: 61   -----CPYLSRLLMIAGTGGNHGIAGLGCYQKQLLSFGVYGIAIKADFSTICSHIDHILT 120
                  P    L ++ G       A +  +   L  +  +    +  F     +   ++ 
Sbjct: 881  IDIQAAPGQDILSLLHGVKRMRSNADVDLFAVML--WAKWNARNQWLFKGKRENPQSVVA 940

Query: 121  ELEGISRNYQEFESC-------ENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVI 180
            + E +   Y+  +         + + +Q  W  P EG  K+N DA  N +  + G+G VI
Sbjct: 941  KAEAVMEAYKRVQPSADVSHGKQQKVAQLGWNPPQEGFVKINTDAATNSEKNLAGLGAVI 1000

Query: 181  RDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGL 240
            RD  G       K  + +  + + EAEA++ GL    +        V++ESDS E++S +
Sbjct: 1001 RDENGQVTATAIKVSKFHGSVAYAEAEAMEWGLQVAKD---AHVKDVIMESDSQEVVSLV 1060

Query: 241  NRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSFCIPHSSS 300
            N  +   +E   ++ +I    +    VS    HRS N++AH L  K AL     +    S
Sbjct: 1061 NNRQGSRSEIYWVVLEIQKLKESFDHVSCVYTHRSCNAIAHSLV-KIALEKCETVVWKGS 1120

Query: 301  FCEEEGGFLSVPYPLWVTFALKEDFG 312
                        YPL V FA   D G
Sbjct: 1121 ------------YPLQVIFASSSDIG 1128

BLAST of Tan0000935.1 vs. ExPASy TrEMBL
Match: A0A2N9EEE3 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48306 PE=4 SV=1)

HSP 1 Score: 109.4 bits (272), Expect = 2.8e-20
Identity = 80/286 (27.97%), Positives = 123/286 (43.01%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPY 60
           +K  +WRV NN +P   N++++ + ++  C  C G  E+ +HALW C F K++WS     
Sbjct: 34  IKQFLWRVCNNALPVKTNLVHRKVISDATCDECLGASETITHALWTCPFAKQVWS----- 93

Query: 61  LSRLLMIAGTGGNHGIAGLGCYQKQLLS------FGVYGIAI-----KADFSTICSHIDH 120
           L + L   G    H +  L  Y  ++ +      F V   AI        F       D 
Sbjct: 94  LEKSLAELGQLTVHSVTELVWYILEVSTTMDIEIFAVIAWAIWQRRNSLKFQVTSESPDR 153

Query: 121 ILTELEGISRNYQEFESCENQASQG----RWLLPPEGKWKLNCDATWNEKLKVGGIGWVI 180
           +      + + +Q     +N  +Q      W  PP G   +N D    ++    GIG ++
Sbjct: 154 VYHRALDLLQEFQNAHVSQNIPAQSYIPCAWHPPPTGVMNINFDGAMFKEENAAGIGVIV 213

Query: 181 RDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGL 240
           R   G P+    KKI     ++ IEA A +   + +   L L R  V+ E DS+ IIS L
Sbjct: 214 RSDTGDPIATLSKKIALPHSVEAIEARAAREAAI-LAHHLQLKR--VIFEGDSSIIISAL 273

Query: 241 NRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARK 272
               + +    ++IE   V      S SF    R  N+ AH LARK
Sbjct: 274 QNPDVCMASYGNIIEDTQVIVSNFESHSFVHIKRQGNAAAHFLARK 311

BLAST of Tan0000935.1 vs. ExPASy TrEMBL
Match: A0A2P6RFF7 (Putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0486981 PE=4 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 1.0e-17
Identity = 74/280 (26.43%), Positives = 127/280 (45.36%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPY-CFFCRGVVESTSHALWNCRFVKKIWSLFCP 60
           +KV VWR+++ ++PT L ++++++      C FC+   ES+ H    C  ++  W L   
Sbjct: 32  IKVLVWRLVHGIVPTRLALLSRHLHIQDVACVFCKSTNESSLHVFKECDALQCFWRLGPL 91

Query: 61  YLSRLLMIAGTGGNHGIAGLGCYQKQLLSF------GVYGIAIKADFSTICSHIDHIL-- 120
            L      AG   N     L       + F       V+    K  ++  C    H++  
Sbjct: 92  KLKAKEQAAGDLKNWLFDVLDMLNSNQVDFFFMALWSVWTERNKIVWNDGCFQPMHMIQW 151

Query: 121 --TELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSL 180
             + LE   + +Q+  +C+ +    +W  PP G+ K+N D  +     VGGIG V+RD L
Sbjct: 152 CTSSLEEFQKYHQKV-ACKKKRPLTKWQCPPRGRLKINIDGAFQVDSGVGGIGVVVRDDL 211

Query: 181 GSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIR 240
           G+ + A  +          +EAEA +AGL+  +       + VV+ESDSA +I+ L    
Sbjct: 212 GTGIAAIARPFLHAHSAINMEAEACRAGLLLGIH---QGWSDVVIESDSALLIAALKSEE 271

Query: 241 IDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLA 270
            + +E   + +    +     SV     +R  N +AH+LA
Sbjct: 272 DNFSEVSRVFDDCKDYLSAFQSVEIRHIYREANGVAHRLA 307

BLAST of Tan0000935.1 vs. ExPASy TrEMBL
Match: A0A2N9EMZ0 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8088 PE=4 SV=1)

HSP 1 Score: 100.5 bits (249), Expect = 1.3e-17
Identity = 75/288 (26.04%), Positives = 129/288 (44.79%), Query Frame = 0

Query: 5   VWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPYLSRL 64
           +WR  +N +PT  N+ +++I  +P C  C   +EST HALW C+ +K +W    P+  +L
Sbjct: 640 LWRACHNSLPTRSNLHHRHILADPSCSSCTNQIESTIHALWQCKEIKPVWQSI-PWGRKL 699

Query: 65  LMIAGTG----GNHGIAGLGCYQKQLLSFGVYGIAIKADFSTICSHIDHILTELEGISRN 124
             I+  G           L   + QL S   +GI  + +   +   +D++   +      
Sbjct: 700 REISYAGFIDLMYQCFQTLSTNELQLFSMTSWGIWHRRNRLRLQQPVDNLSQLIPRALDT 759

Query: 125 YQEFESCENQASQ----------GRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLG 184
             EF++ +N   Q            W  P EG++K+N D     +    G+G +IR+  G
Sbjct: 760 LLEFQTAQNSDPQPSPKPNHTKSTTWKPPEEGRYKVNYDGAVFSERNEAGVGVIIRNYRG 819

Query: 185 SPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRI 244
             MG+   +I     ++ +EA A    + +  +   L    + +E DS  I+  L     
Sbjct: 820 EVMGSLSHRIPYPHSVEAVEASAASCAIQFAKD---LGFMLIDLEGDSKIIVEALLLKAP 879

Query: 245 DLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSF 279
             T   ++IE I   A+ + SV F   +R  N++AH LA++  L   F
Sbjct: 880 CTTIYGNVIEDIKQSAQNLQSVHFLHINREGNAMAHLLAKRARLNKPF 923

BLAST of Tan0000935.1 vs. ExPASy TrEMBL
Match: A0A2N9HYE3 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS44563 PE=4 SV=1)

HSP 1 Score: 100.5 bits (249), Expect = 1.3e-17
Identity = 75/288 (26.04%), Positives = 129/288 (44.79%), Query Frame = 0

Query: 5    VWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPYLSRL 64
            +WR  +N +PT  N+ +++I  +P C  C   +EST HALW C+ +K +W    P+  +L
Sbjct: 1521 LWRACHNSLPTRSNLHHRHILADPSCSSCTNQIESTIHALWQCKEIKPVWQSI-PWGRKL 1580

Query: 65   LMIAGTG----GNHGIAGLGCYQKQLLSFGVYGIAIKADFSTICSHIDHILTELEGISRN 124
              I+  G           L   + QL S   +GI  + +   +   +D++   +      
Sbjct: 1581 REISYAGFIDLMYQCFQTLSTNELQLFSMTSWGIWHRRNRLRLQQPVDNLSQLIPRALDT 1640

Query: 125  YQEFESCENQASQ----------GRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLG 184
              EF++ +N   Q            W  P EG++K+N D     +    G+G +IR+  G
Sbjct: 1641 LLEFQTAQNSDPQPSPKPNHTKSTTWKPPEEGRYKVNYDGAVFSERNEAGVGVIIRNYRG 1700

Query: 185  SPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRI 244
              MG+   +I     ++ +EA A    + +  +   L    + +E DS  I+  L     
Sbjct: 1701 EVMGSLSHRIPYPHSVEAVEASAASCAIQFAKD---LGFMLIDLEGDSKIIVEALLLKAP 1760

Query: 245  DLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSF 279
              T   ++IE I   A+ + SV F   +R  N++AH LA++  L   F
Sbjct: 1761 CTTIYGNVIEDIKQSAQNLQSVHFLHINREGNAMAHLLAKRARLNKPF 1804

BLAST of Tan0000935.1 vs. ExPASy TrEMBL
Match: A0A2N9F5C6 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14049 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 5.0e-17
Identity = 77/294 (26.19%), Positives = 130/294 (44.22%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPY 60
           +K  +WR  ++ +PT   +  + + +NP+C  CR   E + HALWNC  V ++WSL  P 
Sbjct: 183 IKTFLWRACHDSLPTKSGLFKRQVTSNPFCDTCREQSEDSLHALWNCPAVSQVWSL-APE 242

Query: 61  LSRLLMIAGTGGNHGIAGLGCYQKQLL--SFGVYG-----------IAIKAD-FSTICSH 120
            S L  +A    +  +  +      LL   F +             + + +D  S I   
Sbjct: 243 FSDLQKLAPMSLSDLMRQVIQSNSNLLFEKFAITSWLLWHKRNQDRLRLPSDPHSQILPR 302

Query: 121 IDHILTELEGISRNYQEFESCENQASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIR 180
              +L+E   ++    E +  + Q  Q RW  P    +K+N D         GG+G VIR
Sbjct: 303 AHALLSEYLAVT---TENKPQKPQPPQVRWKPPSSNLFKVNFDGAIFRDSNTGGLGVVIR 362

Query: 181 DSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLN 240
           D+ G  +    +K+  N  ++ IEA A +  + + +E   +       E D+  +I  L 
Sbjct: 363 DNTGMVIATLSQKVTGNHTVEMIEALAARRAIRFAME---VGVTNAEFEGDAETVIRDLC 422

Query: 241 RIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSFCI 281
           R     T    +IE   V   ++ + S     RS NS+AH LAR+ +  +S+ +
Sbjct: 423 RTDPIYTPYGLVIEDAKVMLAEIQNFSLSHTRRSGNSVAHALARRASKCNSYLV 469

BLAST of Tan0000935.1 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 68.6 bits (166), Expect = 1.1e-11
Identity = 68/299 (22.74%), Positives = 123/299 (41.14%), Query Frame = 0

Query: 1   MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKKIWSLFCPY 60
           +K  +WR L+  + T   +  + +  +P C  C    ES +HAL+ C F    W L    
Sbjct: 171 LKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSS 230

Query: 61  LSRLLMIAGTGGNHGIAGLGCYQKQLLS-----------FGVYGIAIKADFSTICSHIDH 120
           L R  +++     +    L   Q   +S           + ++       F+        
Sbjct: 231 LIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSK 290

Query: 121 ILTELE-------GISRNYQEFESCENQASQGR--WLLPPEGKWKLNCDATWN-EKLKVG 180
            +   +         ++++++  S   Q ++ +  W  PP    K N DA ++ +KL+  
Sbjct: 291 TVLSAKAETHDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEAT 350

Query: 181 GIGWVIRDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSA 240
           G GW+IR+  G+P+  G  K+         E +A+ A L    +        V +E D  
Sbjct: 351 G-GWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQ---QTWIRGYTQVFMEGDCQ 410

Query: 241 EIISGLNRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHRSRNSLAHKLARKGALFSSF 279
            +I+ +N I    +   + +E I  +A +  S+ F    R  N LAH LA+ G  +S+F
Sbjct: 411 TLINLINGISFH-SSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYGCTYSTF 464

BLAST of Tan0000935.1 vs. TAIR 10
Match: AT2G04420.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 50.1 bits (118), Expect = 3.9e-06
Identity = 41/152 (26.97%), Positives = 70/152 (46.05%), Query Frame = 0

Query: 120 NYQEFESCEN-QASQGRWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLGSPMGAGYK 179
           +Y   E C N +++  +W  PP G  K N D ++N + +    GW+IRD  G   GA   
Sbjct: 46  HYTVREGCNNRESTHQKWEQPPMGWIKCNYDGSFNYRTQQTNSGWLIRDDKGFYKGAA-- 105

Query: 180 KIQRNWRIKFIEAEAIKAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRIDLTETKDL 239
           +         +E+E ++A +M +          V+ E DS ++   LNR ++      + 
Sbjct: 106 QAVGGTMNNALESE-LQALVMAMQHTWSQGYRKVIFEGDSKQVEELLNRKQMHF-GAFNW 165

Query: 240 IEKIWVFAKQVGSVSFHKCHRSRNSLAHKLAR 271
           I + W ++K+   V F    R+ N  A  LA+
Sbjct: 166 IREAWSWSKRFEEVIFSWTPRTNNQPADMLAK 193

BLAST of Tan0000935.1 vs. TAIR 10
Match: AT5G65005.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 49.7 bits (117), Expect = 5.1e-06
Identity = 34/137 (24.82%), Positives = 59/137 (43.07%), Query Frame = 0

Query: 135 RWLLPPEGKWKLNCDATWNEKLKVGGIGWVIRDSLGSPMGAGYKKIQRNWRIKFIEAEAI 194
           +W  P   K K N DA+ +E+  V G+GW++R+S G+ +  G  K Q     +  E   +
Sbjct: 104 KWSPPGRDKLKCNYDASHHERNTVSGLGWILRNSQGTVIECGMGKFQGRMTTEEAECSTL 163

Query: 195 KAGLMYIVEFLPLPRAPVVVESDSAEIISGLNRIRIDLTETKDLIEKIWVFAKQVGSVSF 254
              +  I          V+ E D+  I   +N  +      +  ++ I  +     S+ F
Sbjct: 164 ---IWAIQASYGFGHKKVIFEGDNQTITRMIN-TKSSNPRLQHFLDTIQSWIPSFESIEF 223

Query: 255 HKCHRSRNSLAHKLARK 272
              HR +N  A  LA++
Sbjct: 224 SFKHREQNGCADFLAKQ 236

BLAST of Tan0000935.1 vs. TAIR 10
Match: AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 49.3 bits (116), Expect = 6.7e-06
Identity = 18/52 (34.62%), Positives = 33/52 (63.46%), Query Frame = 0

Query: 1  MKVCVWRVLNNVIPTNLNIINKNIDTNPYCFFCRGVVESTSHALWNCRFVKK 53
          +K+ +W+ LNN +P    ++++NI   P+C  CR   E+ +H L+NC F ++
Sbjct: 18 IKLLIWKALNNALPVGAQLLSRNISIEPFCTRCRD-FETITHILFNCPFAQR 68

BLAST of Tan0000935.1 vs. TAIR 10
Match: AT1G52990.1 (thioredoxin family protein )

HSP 1 Score: 48.1 bits (113), Expect = 1.5e-05
Identity = 39/148 (26.35%), Positives = 62/148 (41.89%), Query Frame = 0

Query: 140 PEGKWKLNCDATWNEKLKVGGIGWVIRDSLGSPMGAGYKKIQRNWRIKFIEAEAIKAGLM 199
           P  + K N DA+ +E   V G+GW+IR+S G+ +  G  K Q     +  E  A+   + 
Sbjct: 44  PSCRVKCNYDASHHEGDVVSGLGWLIRNSQGTVLECGMGKFQGRMTPEEAECSALIWAIQ 103

Query: 200 YIVEFLPLPRAPVVVESDSAEIISGLNRIRIDLTETKDLIEKIWVFAKQVGSVSFHKCHR 259
               F       V+ E D++  ++ L   + D    K  ++ I  +     S  F   HR
Sbjct: 104 ATSAF---GYTKVIFEGDNSN-VNRLINTKSDNPRLKHYLDTIKSWIPSFTSTEFIFTHR 163

Query: 260 SRNSLAHKLARKGALFSS-----FCIPH 283
            +N  A  L +K    S+      C PH
Sbjct: 164 EQNQCADTLVKKAIKSSTQWSLFNCCPH 187

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
GAY61101.1      1.4e-18  26.26         hypothetical protein CUMW_207140, partial [Citrus unshiu]
XP_042962672.1  1.6e-17  25.95         uncharacterized protein LOC122296942 [Carya illinoinensis]
PRQ45142.1      2.1e-17  26.43         putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-c...
XP_006491472.1  2.7e-17  25.86         uncharacterized protein LOC102626455 [Citrus sinensis]
XP_024956542.1  1.0e-16  25.77         uncharacterized protein LOC112498908 [Citrus sinensis]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A2N9EEE3      2.8e-20  27.97         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48306 PE=4 SV=1
A0A2P6RFF7      1.0e-17  26.43         Putative ribonuclease H-like domain, reverse transcriptase zinc-binding domain-c...
A0A2N9EMZ0      1.3e-17  26.04         Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9HYE3      1.3e-17  26.04         Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9F5C6      5.0e-17  26.19         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14049 PE=4 SV=1

TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G09510.1     1.1e-11  22.74         Ribonuclease H-like superfamily protein
AT2G04420.1     3.9e-06  26.97         Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT5G65005.1     5.1e-06  24.82         Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT3G26855.1     6.7e-06  34.62         RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT1G52990.1     1.5e-05  26.35         thioredoxin family protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term  Source Description    Alignment
IPR036397  Ribonuclease H superfamily                 GENE3D       3.30.420.10  -                     coord: 142..275; e-value: 6.8E-13; score: 50.7
IPR026960  Reverse transcriptase zinc-binding domain  PFAM         PF13966      zf-RVT                coord: 1..54; e-value: 1.1E-9; score: 38.8
IPR002156  Ribonuclease H domain                      PFAM         PF13456      RVT_3                 coord: 147..271; e-value: 1.2E-17; score: 63.9
None       No IPR available                           PANTHER      PTHR47723    OS05G0353850 PROTEIN  coord: 131..273
IPR044730  Ribonuclease H-like domain, plant type     CDD          cd06222      RNase_H_like          coord: 146..270; e-value: 4.87709E-17; score: 73.8876
IPR012337  Ribonuclease H-like superfamily            SUPERFAMILY  53098        Ribonuclease H-like   coord: 143..275

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Tan0000935    Tan0000935   gene


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name                                Type
Tan0000935.1-exon  Tan0000935.1-exon-LG11:59197814..59198052  exon
Tan0000935.1-exon  Tan0000935.1-exon-LG11:59198134..59198848  exon
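
The exon coordinates above can be reconciled with the reported sequence length by simple interval arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (as is usual for GFF-style annotation); the function names are illustrative only.

```python
# Exon coordinates on LG11 as listed above (assumed 1-based, inclusive).
EXONS = [(59197814, 59198052), (59198134, 59198848)]


def exon_lengths(exons):
    """Length of each exon for inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Length of each gap between consecutive exons."""
    ordered = sorted(exons)
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]


lengths = exon_lengths(EXONS)
print("exon lengths:   ", lengths)                    # per-exon lengths
print("spliced length: ", sum(lengths))               # mRNA length
print("intron lengths: ", intron_lengths(EXONS))
print("genomic span:   ", EXONS[-1][1] - EXONS[0][0] + 1)
```

Under that assumption the two exons sum to 954 nucleotides, matching the sequence length given in the overview, and the genomic span equals the exon total plus the single intron.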


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name                               Type
Tan0000935.1-cds  Tan0000935.1-cds-LG11:59197814..59198052  CDS
Tan0000935.1-cds  Tan0000935.1-cds-LG11:59198134..59198848  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name           Type
Tan0000935.1  Tan0000935.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity