Tan0022014 (gene) Snake gourd v1

Overview
Name: Tan0022014
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Location: LG07: 32616257 .. 32616595 (-)
RNA-Seq Expression: Tan0022014
Synteny: Tan0022014
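The Location field can be read as a start and end coordinate on the minus strand of LG07. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the function name `parse_location` is hypothetical and the pattern only covers the layout used on this page.

```python
import re

def parse_location(text):
    """Split a string like 'LG07: 32616257 .. 32616595 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("LG07: 32616257 .. 32616595 (-)")
length = end - start + 1          # assumes inclusive coordinates
print(seqid, strand, length, length // 3)
```

Under that assumption the feature spans 339 bp, that is 113 codons: 112 residues plus the stop codon, which agrees with the length of the protein sequence on this page.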
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGAGGTGAACATGGAGGAACATATCTTTATGTGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAAGCTTTAGGTGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGCTAAGGTTTGGTTAAATCTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCTCAAAGAGGGAAGGTCAATTTAG

mRNA sequence

ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGAGGTGAACATGGAGGAACATATCTTTATGTGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAAGCTTTAGGTGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGCTAAGGTTTGGTTAAATCTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCTCAAAGAGGGAAGGTCAATTTAG

Coding sequence (CDS)

ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGAGGTGAACATGGAGGAACATATCTTTATGTGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAAGCTTTAGGTGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGCTAAGGTTTGGTTAAATCTGATTGAGAAGTGTTTTAGGGTCATGCGATGCCTCAAAGAGGGAAGGTCAATTTAG

Protein sequence

MARGGKRGRQVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKEGRSI
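
The protein sequence follows from the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

print(translate("ATGGCTCGAGGTGGCAAACGAGGCAGG"))  # first nine codons of the CDS
print(translate("GGAAGGTCAATTTAG"))              # last five codons, ending in TAG
```

The two fragments translate to the start (MARGGKRGR) and end (GRSI) of the protein sequence shown above.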
Homology
BLAST of Tan0022014 vs. NCBI nr
Match: XP_038896416.1 (uncharacterized protein LOC120084680 [Benincasa hispida])

HSP 1 Score: 111.3 bits (277), Expect = 5.4e-21
Identity = 51/66 (77.27%), Positives = 58/66 (87.88%), Query Frame = 0

Query: 40  MEEHIFMWITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKC 99
           ME+ +F  ITQRLA SVGS Q DPEKK+GIERLKALGATTF+GTTDP DA++WL LIEKC
Sbjct: 1   MEDRVFDRITQRLAASVGSIQNDPEKKFGIERLKALGATTFDGTTDPLDAEIWLGLIEKC 60

Query: 100 FRVMRC 106
           F+VMRC
Sbjct: 61  FKVMRC 66
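
The Identity figure reported for this HSP can be reproduced by counting identical columns in the alignment. This is a minimal sketch, not the BLAST implementation; `identity_stats` is a hypothetical helper, and the two strings are the Query and Sbjct rows of the alignment above joined across both blocks. The Positives figure is not recomputed here because it depends on a substitution matrix.

```python
def identity_stats(query, subject):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical when both rows carry the same residue.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100 * matches / len(query), 2)

query = ("MEEHIFMWITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKC"
         "FRVMRC")
sbjct = ("MEDRVFDRITQRLAASVGSIQNDPEKKFGIERLKALGATTFDGTTDPLDAEIWLGLIEKC"
         "FKVMRC")

print(identity_stats(query, sbjct))
```

The result matches the 51/66 (77.27%) identity reported for the hit.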

BLAST of Tan0022014 vs. NCBI nr
Match: KAA0035225.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK21839.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 109.0 bits (271), Expect = 2.7e-20
Identity = 60/106 (56.60%), Positives = 68/106 (64.15%), Query Frame = 0

Query: 1   MARGGKRGR-QVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSA 60
           M RG  R     +T         G   S+ ESS P  E NMEE +   + QRL   + SA
Sbjct: 20  MPRGRPRKHPDAKTSNAAREAAMGSGESDAESSRPHVEGNMEEQLLDRLAQRLISGIRSA 79

Query: 61  QADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRC 106
           Q+DPEKKYGIERLKALGATTF GTT+PADA+ WL LIEKCFRV RC
Sbjct: 80  QSDPEKKYGIERLKALGATTFVGTTNPADAEAWLTLIEKCFRVTRC 125

BLAST of Tan0022014 vs. NCBI nr
Match: XP_038882393.1 (uncharacterized protein LOC120073661 [Benincasa hispida])

HSP 1 Score: 106.7 bits (265), Expect = 1.3e-19
Identity = 51/85 (60.00%), Positives = 66/85 (77.65%), Query Frame = 0

Query: 26  VSEGESSHPQ--QEVNMEEHIFMWITQRLAESVGSAQADPEKKYGIERLKALGATTFEGT 85
           +SEGES  PQ   +  +E+ +F  I QRL  S+GSA+AD EKKYGIER KALGA TFEGT
Sbjct: 1   MSEGESCTPQARADTQLEDVVFDKIEQRLVASMGSARADSEKKYGIERFKALGAVTFEGT 60

Query: 86  TDPADAKVWLNLIEKCFRVMRCLKE 109
           TDPA+ ++WL+++EKCF VM CL++
Sbjct: 61  TDPAEVELWLDVVEKCFNVMSCLED 85

BLAST of Tan0022014 vs. NCBI nr
Match: KAA0025769.1 (hypothetical protein E6C27_scaffold34G00120 [Cucumis melo var. makuwa] >TYK09684.1 hypothetical protein E5676_scaffold447G00920 [Cucumis melo var. makuwa])

HSP 1 Score: 104.4 bits (259), Expect = 6.6e-19
Identity = 56/97 (57.73%), Positives = 68/97 (70.10%), Query Frame = 0

Query: 12  ETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSAQADPEKKYGIER 71
           E GT +AT    RE + G+S+H   + NMEE IF  I QRLA+ V  AQAD EKKYGIER
Sbjct: 81  EFGTCKATRS-PREFTVGDSNHSLAQENMEERIFNKIAQRLADGVRLAQADLEKKYGIER 140

Query: 72  LKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKE 109
           +KALGAT F+GT D  + + WL LIEKCF VM CL++
Sbjct: 141 MKALGATPFKGTVDSVEVEAWLTLIEKCFWVMHCLED 176

BLAST of Tan0022014 vs. NCBI nr
Match: KAA0036813.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 102.4 bits (254), Expect = 2.5e-18
Identity = 56/109 (51.38%), Positives = 67/109 (61.47%), Query Frame = 0

Query: 1   MARGGKRGR-QVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSA 60
           M RG  R     E          G   S+ ESS P+ E N+EE +   + QRL   + SA
Sbjct: 1   MPRGKPRKHPDAEASNAAKEAAMGSGESDAESSRPRVEENVEEQLLDRLAQRLVSGIRSA 60

Query: 61  QADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKE 109
           Q+DPEKKYG ERLKALGATTF GTT+P D + WL LIEKCFRV R L++
Sbjct: 61  QSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLED 109

BLAST of Tan0022014 vs. ExPASy TrEMBL
Match: A0A5D3DES5 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold991G00660 PE=4 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 1.3e-20
Identity = 60/106 (56.60%), Positives = 68/106 (64.15%), Query Frame = 0

Query: 1   MARGGKRGR-QVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSA 60
           M RG  R     +T         G   S+ ESS P  E NMEE +   + QRL   + SA
Sbjct: 20  MPRGRPRKHPDAKTSNAAREAAMGSGESDAESSRPHVEGNMEEQLLDRLAQRLISGIRSA 79

Query: 61  QADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRC 106
           Q+DPEKKYGIERLKALGATTF GTT+PADA+ WL LIEKCFRV RC
Sbjct: 80  QSDPEKKYGIERLKALGATTFVGTTNPADAEAWLTLIEKCFRVTRC 125

BLAST of Tan0022014 vs. ExPASy TrEMBL
Match: A0A5A7SKG0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold447G00920 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 3.2e-19
Identity = 56/97 (57.73%), Positives = 68/97 (70.10%), Query Frame = 0

Query: 12  ETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSAQADPEKKYGIER 71
           E GT +AT    RE + G+S+H   + NMEE IF  I QRLA+ V  AQAD EKKYGIER
Sbjct: 81  EFGTCKATRS-PREFTVGDSNHSLAQENMEERIFNKIAQRLADGVRLAQADLEKKYGIER 140

Query: 72  LKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKE 109
           +KALGAT F+GT D  + + WL LIEKCF VM CL++
Sbjct: 141 MKALGATPFKGTVDSVEVEAWLTLIEKCFWVMHCLED 176

BLAST of Tan0022014 vs. ExPASy TrEMBL
Match: A0A5D3BB91 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G001760 PE=4 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.2e-18
Identity = 56/109 (51.38%), Positives = 67/109 (61.47%), Query Frame = 0

Query: 1   MARGGKRGR-QVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSA 60
           M RG  R     E          G   S+ ESS P+ E N+EE +   + QRL   + SA
Sbjct: 1   MPRGKPRKHPDAEASNAAKEAAMGSGESDAESSRPRVEENVEEQLLDRLAQRLVSGIRSA 60

Query: 61  QADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKE 109
           Q+DPEKKYG ERLKALGATTF GTT+P D + WL LIEKCFRV R L++
Sbjct: 61  QSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLED 109

BLAST of Tan0022014 vs. ExPASy TrEMBL
Match: A0A5A7T1M0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold20G001070 PE=4 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.2e-18
Identity = 56/109 (51.38%), Positives = 67/109 (61.47%), Query Frame = 0

Query: 1   MARGGKRGR-QVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSA 60
           M RG  R     E          G   S+ ESS P+ E N+EE +   + QRL   + SA
Sbjct: 1   MPRGKPRKHPDAEASNAAKEAAMGSGESDAESSRPRVEENVEEQLLDRLAQRLVSGIRSA 60

Query: 61  QADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKE 109
           Q+DPEKKYG ERLKALGATTF GTT+P D + WL LIEKCFRV R L++
Sbjct: 61  QSDPEKKYGFERLKALGATTFAGTTNPTDVEAWLTLIEKCFRVTRYLED 109

BLAST of Tan0022014 vs. ExPASy TrEMBL
Match: A0A5A7SJ99 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold305G00100 PE=4 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 3.5e-18
Identity = 50/86 (58.14%), Positives = 61/86 (70.93%), Query Frame = 0

Query: 23  GREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSAQADPEKKYGIERLKALGATTFEG 82
           G   S+ ESS P  E N+EE +   + QRL   +  AQ+D EKKYGIERLKALGATTF G
Sbjct: 24  GSGESDAESSRPHVEGNVEEQLLDRLAQRLILGIRLAQSDSEKKYGIERLKALGATTFVG 83

Query: 83  TTDPADAKVWLNLIEKCFRVMRCLKE 109
           TT+P DA+ WL LIEKCF+V RC ++
Sbjct: 84  TTNPVDAEEWLTLIEKCFKVTRCSED 109

The following BLAST results are available for this feature:

Match Name      E-value  Identity (%)  Description
XP_038896416.1  5.4e-21  77.27         uncharacterized protein LOC120084680 [Benincasa hispida]
KAA0035225.1    2.7e-20  56.60         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK21839.1 D...
XP_038882393.1  1.3e-19  60.00         uncharacterized protein LOC120073661 [Benincasa hispida]
KAA0025769.1    6.6e-19  57.73         hypothetical protein E6C27_scaffold34G00120 [Cucumis melo var. makuwa] >TYK09684...
KAA0036813.1    2.5e-18  51.38         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]

Match Name      E-value  Identity (%)  Description
A0A5D3DES5      1.3e-20  56.60         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7SKG0      3.2e-19  57.73         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5D3BB91      1.2e-18  51.38         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...
A0A5A7T1M0      1.2e-18  51.38         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold20...
A0A5A7SJ99      3.5e-18  58.14         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25

IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 18..37
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..39
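
The two mobidb-lite entries mark predicted disordered stretches of the protein. As a small sketch, assuming the `coord` values are 1-based and inclusive residue positions (an assumption, not stated on this page), the corresponding residues can be pulled out of the protein sequence; `region` is a hypothetical helper.

```python
# Protein sequence of Tan0022014 as listed in the Sequences section.
PROTEIN = (
    "MARGGKRGRQVETGTQEATGDRGREVSEGESSHPQQEVNMEEHIFMWITQRLAESVGSAQ"
    "ADPEKKYGIERLKALGATTFEGTTDPADAKVWLNLIEKCFRVMRCLKEGRSI"
)

def region(seq, start, end):
    """Return residues start..end, treating both ends as 1-based inclusive."""
    return seq[start - 1:end]

for start, end in [(18, 37), (1, 39)]:
    print(f"{start}..{end}: {region(PROTEIN, start, end)}")
```

Both stretches fall in the N-terminal part of the protein, upstream of the segment that aligns most consistently in the BLAST hits.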

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0022014.1  Tan0022014.1  mRNA