Tan0013200 (gene) Snake gourd v1

Overview
Name: Tan0013200
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG01: 60627866 .. 60628441 (-)
RNA-Seq Expression: Tan0013200
Synteny: Tan0013200
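
As a quick consistency check on the Location field, the span length can be computed from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly); the variable names are illustrative only.

```python
# Coordinates copied from the Location field (LG01, minus strand).
# Assumption: 1-based, inclusive coordinates.
start, end = 60627866, 60628441

span_bp = end - start + 1   # inclusive span in base pairs
codons = span_bp // 3       # whole codons, if the entire span is coding

print(span_bp, codons)
```

If the whole span is coding, the last codon would be the stop codon, so the translated product would be one residue shorter than the codon count.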
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTTCATACTTTTGAGTCCCTGATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACTAAATTCAGTCCTTCCTTTTCTAAGAATAAGGGTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCGCATGCAAGGCCAAAGCCACAGGAAAATGTTTCCACTGTGGTGCAGACGGGCACTGGAAGAGGAACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAAATATGATTTACTCGTTATTGAAACATGTTTAGTGGAACATGATGATTCCGCCTGGATATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTCAGAAACTAGTTCTGGGCAAGAAATTGTCGATGGAGAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGTCTCAGCCAAAGCAGTGGGAGAAGTGA

mRNA sequence

ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTTCATACTTTTGAGTCCCTGATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACTAAATTCAGTCCTTCCTTTTCTAAGAATAAGGGTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCGCATGCAAGGCCAAAGCCACAGGAAAATGTTTCCACTGTGGTGCAGACGGGCACTGGAAGAGGAACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAAATATGATTTACTCGTTATTGAAACATGTTTAGTGGAACATGATGATTCCGCCTGGATATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTCAGAAACTAGTTCTGGGCAAGAAATTGTCGATGGAGAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGTCTCAGCCAAAGCAGTGGGAGAAGTGA

Coding sequence (CDS)

ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTTCATACTTTTGAGTCCCTGATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACTAAATTCAGTCCTTCCTTTTCTAAGAATAAGGGTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCGCATGCAAGGCCAAAGCCACAGGAAAATGTTTCCACTGTGGTGCAGACGGGCACTGGAAGAGGAACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAAATATGATTTACTCGTTATTGAAACATGTTTAGTGGAACATGATGATTCCGCCTGGATATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTCAGAAACTAGTTCTGGGCAAGAAATTGTCGATGGAGAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGTCTCAGCCAAAGCAGTGGGAGAAGTGA

Protein sequence

MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTSKKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKAKATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKLSMERYLSGLERERLSQPKQWEK
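
The relationship between the coding sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under the assumption that the standard genetic code applies; `translate` and `CODON_TABLE` are hypothetical helper names, and only the first 24 nucleotides of the CDS are copied here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":          # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
cds_start = "ATGGAATCTCTTCCGAAGAGTTTC"
print(translate(cds_start))  # MESLPKSF
```

The output matches the first eight residues of the protein sequence; passing the full CDS to the same function should reproduce the whole polypeptide.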
Homology
BLAST of Tan0013200 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 209.9 bits (533), Expect = 1.9e-50
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 158 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 217

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 218 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 277

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 278 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 330

BLAST of Tan0013200 vs. NCBI nr
Match: TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 209.9 bits (533), Expect = 1.9e-50
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. NCBI nr
Match: KAA0054490.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 209.9 bits (533), Expect = 1.9e-50
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. NCBI nr
Match: KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 209.9 bits (533), Expect = 1.9e-50
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. NCBI nr
Match: KAA0047792.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 209.9 bits (533), Expect = 1.9e-50
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.2e-51
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.2e-51
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. ExPASy TrEMBL
Match: A0A5A7V4M1 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G00930 PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.2e-51
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 276 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 335

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 336 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 395

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 396 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 448

BLAST of Tan0013200 vs. ExPASy TrEMBL
Match: A0A5A7TWB9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G00310 PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.2e-51
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

BLAST of Tan0013200 vs. ExPASy TrEMBL
Match: A0A5D3CSZ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold552G00320 PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 9.2e-51
Identity = 117/175 (66.86%), Positives = 131/175 (74.86%), Query Frame = 0

Query: 1   MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGS 60
           +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS
Sbjct: 159 LESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGS 218

Query: 61  PSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----KATGKCFHCGADGHWKRNCPK 120
            SGTK  PS S NK  +KKK     K    A K       A G CFHC  +GHWKRNCPK
Sbjct: 219 TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPK 278

Query: 121 YLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL 170
           YLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Sbjct: 279 YLAEKK--KAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQL 331

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0048404.1 | 1.9e-50 | 66.86 | gag/pol protein [Cucumis melo var. makuwa]
TYK14550.1 | 1.9e-50 | 66.86 | gag/pol protein [Cucumis melo var. makuwa]
KAA0054490.1 | 1.9e-50 | 66.86 | gag/pol protein [Cucumis melo var. makuwa]
KAA0035879.1 | 1.9e-50 | 66.86 | gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi...
KAA0047792.1 | 1.9e-50 | 66.86 | gag/pol protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7SMH8 | 9.2e-51 | 66.86 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025...
A0A5D3CPJ6 | 9.2e-51 | 66.86 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004...
A0A5A7V4M1 | 9.2e-51 | 66.86 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G0093...
A0A5A7TWB9 | 9.2e-51 | 66.86 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G0031...
A0A5D3CSZ6 | 9.2e-51 | 66.86 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold552G0032...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 98..114; e-value: 0.0019; score: 27.5
IPR001878 | Zinc finger, CCHC-type | PFAM | PF00098 | zf-CCHC | coord: 97..114; e-value: 1.4E-5; score: 24.9
IPR001878 | Zinc finger, CCHC-type | PROSITE | PS50158 | ZF_CCHC | coord: 98..114; score: 10.246099
None | No IPR available | GENE3D | 4.10.60.10 | | coord: 92..126; e-value: 1.6E-6; score: 29.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..89
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..72
IPR036875 | Zinc finger, CCHC-type superfamily | SUPERFAMILY | 57756 | Retrovirus zinc finger-like domains | coord: 86..117
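
The CCHC-type zinc finger reported at roughly residues 97..114 can be located by eye with a simple pattern search. This is only an illustrative sketch: it assumes the common zinc-knuckle spacing C-X2-C-X4-H-X4-C and uses a plain regular expression, whereas the member databases listed above rely on profile and HMM models rather than a regex. The name `CCHC_PATTERN` is hypothetical.

```python
import re

# Protein sequence copied from the "Protein sequence" section.
protein = (
    "MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTSKKFLRGSPSGTKFSPSFS"
    "KNKGIQKKKKKDKGKGQAPACKAKATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVE"
    "HDDSAWILDSGATNHVCSSFQKLVLGKKLSMERYLSGLERERLSQPKQWEK"
)

# Assumed zinc-knuckle spacing: Cys, 2 residues, Cys, 4 residues, His, 4 residues, Cys.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

match = CCHC_PATTERN.search(protein)
# Convert the 0-based, end-exclusive regex span to 1-based inclusive coordinates.
motif_start, motif_end = match.start() + 1, match.end()
print(motif_start, motif_end, match.group())
```

The matched stretch falls inside the coordinate ranges reported for SM00343, PF00098, and PS50158, which is consistent with those annotations but does not replace them.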

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0013200.1 | Tan0013200.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0043170 | macromolecule metabolic process
biological_process | GO:0006807 | nitrogen compound metabolic process
biological_process | GO:0044238 | primary metabolic process
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding