Tan0020304 (gene) Snake gourd v1

Overview
NameTan0020304
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
LocationLG10: 37210990 .. 37211397 (+)
RNA-Seq ExpressionTan0020304
SyntenyTan0020304
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTGGACAACCGTCTGGACAGATTCGACAAAAATCCCTCAAATACGTTTATAACTCCCGTATAAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAAAGTCAGGTATCATTCATCCTGGAGTCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCAGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTCCCAGTCTCTTATGAAGAATAAGGGACAGGCTAATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATCCTCTGGGACTAAGTCCTGTAGCTCCTCTTCTGGGCTTAAGAAGACCTAA

mRNA sequence

ATGTTTGGACAACCGTCTGGACAGATTCGACAAAAATCCCTCAAATACGTTTATAACTCCCGTATAAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAAAGTCAGGTATCATTCATCCTGGAGTCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCAGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTCCCAGTCTCTTATGAAGAATAAGGGACAGGCTAATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATCCTCTGGGACTAAGTCCTGTAGCTCCTCTTCTGGGCTTAAGAAGACCTAA

Coding sequence (CDS)

ATGTTTGGACAACCGTCTGGACAGATTCGACAAAAATCCCTCAAATACGTTTATAACTCCCGTATAAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAAAGTCAGGTATCATTCATCCTGGAGTCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCAGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTCCCAGTCTCTTATGAAGAATAAGGGACAGGCTAATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATCCTCTGGGACTAAGTCCTGTAGCTCCTCTTCTGGGCTTAAGAAGACCTAA

Protein sequence

MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSSSGTKSCSSSSGLKKT
Homology
BLAST of Tan0020304 vs. NCBI nr
Match: KAA0044955.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.4 bits (459), Expect = 5.1e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.4 bits (459), Expect = 5.1e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 100 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 159

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 160 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 219

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 220 SGTKSMPSSSGNKK 232

BLAST of Tan0020304 vs. NCBI nr
Match: TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.4 bits (459), Expect = 5.1e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. NCBI nr
Match: KAA0054490.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.4 bits (459), Expect = 5.1e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. NCBI nr
Match: KAA0061339.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 181.4 bits (459), Expect = 5.1e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNIMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.5e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.5e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. ExPASy TrEMBL
Match: A0A5A7V4M1 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G00930 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.5e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 218 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 277

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 278 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 337

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 338 SGTKSMPSSSGNKK 350

BLAST of Tan0020304 vs. ExPASy TrEMBL
Match: A0A5A7TU93 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002590 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.5e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

BLAST of Tan0020304 vs. ExPASy TrEMBL
Match: A0A5A7TWB9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G00310 PE=4 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 2.5e-42
Identity = 99/134 (73.88%), Postives = 115/134 (85.82%), Query Frame = 0

Query: 1   MFGQPSGQIRQKSLKYVYNSRIKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILE 60
           MFGQ S QI+  +LKY+YN+R+ EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILE
Sbjct: 101 MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILE 160

Query: 61  SLPKSFLQFRNNAVMNKIEYNLTTLLNELQTSQSLMKNKGQANGEANLFAHSRRFQKGSS 120
           SLP+SFLQFR+NAVMNKI Y LTTLLNELQT +SLMK KGQ  GEAN+   +R+F +GS+
Sbjct: 161 SLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ-KGEANVATSTRKFHRGST 220

Query: 121 SGTKSCSSSSGLKK 135
           SGTKS  SSSG KK
Sbjct: 221 SGTKSMPSSSGNKK 233

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAA0044955.15.1e-4273.88gag/pol protein [Cucumis melo var. makuwa][more]
KAA0048404.15.1e-4273.88gag/pol protein [Cucumis melo var. makuwa][more]
TYK14550.15.1e-4273.88gag/pol protein [Cucumis melo var. makuwa][more]
KAA0054490.15.1e-4273.88gag/pol protein [Cucumis melo var. makuwa][more]
KAA0061339.15.1e-4273.88gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5A7SMH82.5e-4273.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5D3CPJ62.5e-4273.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
A0A5A7V4M12.5e-4273.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G0093... [more]
A0A5A7TU932.5e-4273.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G00259... [more]
A0A5A7TWB92.5e-4273.88Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G0031... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 16..97
e-value: 7.7E-10
score: 38.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 113..135
NoneNo IPR availablePANTHERPTHR35317:SF8POLYPROTEIN-LIKE PROTEINcoord: 1..125
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 1..125

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020304.1Tan0020304.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding