Tan0001662 (gene) Snake gourd v1

Overview
Name: Tan0001662
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heavy metal-associated isoprenylated plant protein 16
Location: LG05: 8389263 .. 8390115 (+)
RNA-Seq Expression: Tan0001662
Synteny: Tan0001662
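The Location field packs the linkage group, coordinates, and strand into one string. The following is a minimal sketch of how it could be unpacked and the genomic span computed. The helper name `parse_location` is hypothetical, and the span arithmetic assumes 1-based, inclusive coordinates (the usual GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a string like 'LG05: 8389263 .. 8390115 (+)' into its parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("LG05: 8389263 .. 8390115 (+)")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end - start + 1
print(seqid, strand, span_bp)
```

Under that assumption the feature spans 853 bp on the plus strand of LG05.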
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTTTATTCTCTCTCTCTACAAGTTCAAGAAAAATGAAGGTAATTAACGTACAGATTCAACCATGTTTTTAATGGTAATTAAGGATTTTGAAAACATATAAAAAAAATATATATATATTTTAAAATCGGTTATTTCACTTTGGCTTTTGCAGCAAAAGATTGTGGTTCGAGTGTCAATGAAGAGAGGCCCGAAGTACCGCAGCAAAGCCTTGAAGATTGCAGCATCCGTTTCTGGTAATCAAACATATAATATATAATAATCTGAATTAGAATGATCAATGTAAGAATGGAAAAATATGGTGATTGATTATTGTTTTTTCAGGCTTTTTATTCATTTTGTGTGTGTTGGAATGATGGAAATTGTAGGGAATGTTGAGAAGATTAGCTTGGAAGGCGATGACAAAGATAAAGTGGAGCTCGTCGGAGATGTCGATCCGATCGAACTCACCGAACTGCTGCGGAAGGGTTTCGGTTCTGCACAGTTAGAGTCTGTGAGCGCCGTTGAAGATAAAGATAAAGATAAAGATGATAAAGATAAACCTGAGGTTGTTTGGACTTATGGTTGGGGTGTACCTCATCAACCGTCTTACTACCATTGCTACCTTGACCCCTACTGGTGATTTATATTTGATGACGACAAAATAGAAATTAAGTTGTTTTCAAATATTTCGTACCAATAGAATTGTGATGGATATATATACTTATATCTGGCATAGATCTTTTTTTTTTCGTTTATTTTGGTTGATTTGGAATTATTGTTATTGCCTTCTTTTTAGAATAATTTGATGTTTGAAAATTTCTAGAAGAACAATTATGTTTGATGGATTTTTTAGTTTAATAATGTGTAGAGTGAG

mRNA sequence

GTTTATTCTCTCTCTCTACAAGTTCAAGAAAAATGAAGCAAAAGATTGTGGTTCGAGTGTCAATGAAGAGAGGCCCGAAGTACCGCAGCAAAGCCTTGAAGATTGCAGCATCCGTTTCTGGGAATGTTGAGAAGATTAGCTTGGAAGGCGATGACAAAGATAAAGTGGAGCTCGTCGGAGATGTCGATCCGATCGAACTCACCGAACTGCTGCGGAAGGGTTTCGGTTCTGCACAGTTAGAGTCTGTGAGCGCCGTTGAAGATAAAGATAAAGATAAAGATGATAAAGATAAACCTGAGGTTGTTTGGACTTATGGTTGGGGTGTACCTCATCAACCGTCTTACTACCATTGCTACCTTGACCCCTACTGGTGATTTATATTTGATGACGACAAAATAGAAATTAAGTTGTTTTCAAATATTTCGTACCAATAGAATTGTGATGGATATATATACTTATATCTGGCATAGATCTTTTTTTTTTCGTTTATTTTGGTTGATTTGGAATTATTGTTATTGCCTTCTTTTTAGAATAATTTGATGTTTGAAAATTTCTAGAAGAACAATTATGTTTGATGGATTTTTTAGTTTAATAATGTGTAGAGTGAG

Coding sequence (CDS)

ATGAAGCAAAAGATTGTGGTTCGAGTGTCAATGAAGAGAGGCCCGAAGTACCGCAGCAAAGCCTTGAAGATTGCAGCATCCGTTTCTGGGAATGTTGAGAAGATTAGCTTGGAAGGCGATGACAAAGATAAAGTGGAGCTCGTCGGAGATGTCGATCCGATCGAACTCACCGAACTGCTGCGGAAGGGTTTCGGTTCTGCACAGTTAGAGTCTGTGAGCGCCGTTGAAGATAAAGATAAAGATAAAGATGATAAAGATAAACCTGAGGTTGTTTGGACTTATGGTTGGGGTGTACCTCATCAACCGTCTTACTACCATTGCTACCTTGACCCCTACTGGTGA

Protein sequence

MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELLRKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCYLDPYW
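The protein sequence is the conceptual translation of the coding sequence. The sketch below shows how that relationship could be checked with the standard genetic code; it is an illustration rather than the pipeline used to build this record. The names `CODON_TABLE` and `translate` are introduced here for the example, and only the first 30 nucleotides and the last 9 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above, then its last 9 nt (ending in the stop codon).
print(translate("ATGAAGCAAAAGATTGTGGTTCGAGTGTCA"))  # MKQKIVVRVS
print(translate("TACTGGTGA"))                       # YW
```

Passing the complete CDS to `translate` should reproduce the 113-residue protein listed above, ending in YW.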
Homology
BLAST of Tan0001662 vs. ExPASy Swiss-Prot
Match: Q5PNZ7 (Heavy metal-associated isoprenylated plant protein 46 OS=Arabidopsis thaliana OX=3702 GN=HIPP46 PE=2 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 1.5e-05
Identity = 34/86 (39.53%), Positives = 54/86 (62.79%), Query Frame = 0

Query: 1  MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVG-DVDPIELTEL 60
          MKQKI++RV+M    K R+KA+  A    G V  + ++GD ++++E+ G +VD I L ++
Sbjct: 1  MKQKILIRVTM-TDDKTRAKAMTKAVQFKG-VSAVEIKGDHRNQIEVTGVEVDMIPLIQI 60

Query: 61 LRKGFGSAQLESVSAVEDKDKDKDDK 86
          LRK    A+L SV+ VE   K+ + K
Sbjct: 61 LRKKVAFAELVSVTKVEPPKKEDEKK 84
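The Identity figure in an HSP header is the number of alignment columns where query and subject carry the same residue, divided by the alignment length including gap columns. The sketch below recomputes it for the Swiss-Prot hit from the two aligned rows. The function name `percent_identity` is made up for this example. The Positives count is not reproduced because it depends on the scoring matrix used in the search.

```python
def percent_identity(query: str, subject: str):
    """Return (identical columns, alignment length, percent identity).

    Both strings must be rows of the same alignment, with '-' for gaps.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    length = len(query)
    return identical, length, 100.0 * identical / length


# Query and Sbjct rows of the Q5PNZ7 alignment, both blocks joined.
query = ("MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVG-DVDPIELTEL"
         "LRKGFGSAQLESVSAVEDKDKDKDDK")
subject = ("MKQKILIRVTM-TDDKTRAKAMTKAVQFKG-VSAVEIKGDHRNQIEVTGVEVDMIPLIQI"
           "LRKKVAFAELVSVTKVEPPKKEDEKK")

identical, length, pct = percent_identity(query, subject)
print(f"Identity = {identical}/{length} ({pct:.2f}%)")  # Identity = 34/86 (39.53%)
```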

BLAST of Tan0001662 vs. NCBI nr
Match: XP_038900437.1 (uncharacterized protein LOC120087662 [Benincasa hispida])

HSP 1 Score: 169.9 bits (429), Expect = 1.3e-38
Identity = 86/112 (76.79%), Positives = 96/112 (85.71%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV+RVSMKRGPKYRSKALKIA+SV GNVE ISL GDDKDKVE+VGD+DPIELT+LL
Sbjct: 1   MKQKIVIRVSMKRGPKYRSKALKIASSVKGNVETISLVGDDKDKVEVVGDLDPIELTKLL 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCYLDPY 113
           RK FGSAQLESV+AVE+KDKDKD   +  + WT  WGVPHQ S  +CYL PY
Sbjct: 61  RKSFGSAQLESVNAVEEKDKDKDKDKELGITWTCTWGVPHQSSQCYCYLGPY 112

BLAST of Tan0001662 vs. NCBI nr
Match: XP_008448137.1 (PREDICTED: uncharacterized protein LOC103490424 [Cucumis melo])

HSP 1 Score: 168.7 bits (426), Expect = 2.9e-38
Identity = 89/115 (77.39%), Positives = 100/115 (86.96%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV+RVSMKRGPKYRSKALKIAASV G++E ISL GDDKDKVE+VGD+DP+ELTELL
Sbjct: 1   MKQKIVIRVSMKRGPKYRSKALKIAASVKGSIETISLVGDDKDKVEVVGDIDPMELTELL 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKD-DKDKP-EVVWTYGWGVPHQPSYYHCY-LDPY 113
           RKGFGSAQLE+V+AVEDKDKDKD DK+K   + WT  WGVPH  SY +CY L PY
Sbjct: 61  RKGFGSAQLETVNAVEDKDKDKDKDKNKDGGITWTCSWGVPHHSSYCYCYNLGPY 115

BLAST of Tan0001662 vs. NCBI nr
Match: XP_004140067.1 (uncharacterized protein LOC101216311 [Cucumis sativus] >KGN46623.1 hypothetical protein Csa_005620 [Cucumis sativus])

HSP 1 Score: 162.2 bits (409), Expect = 2.7e-36
Identity = 86/113 (76.11%), Positives = 95/113 (84.07%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV++VSMKRGPKYRSKALKIAASV G++E ISL GD KDKVE+VGD+DPIELTELL
Sbjct: 1   MKQKIVIQVSMKRGPKYRSKALKIAASVKGSIETISLVGDHKDKVEVVGDLDPIELTELL 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCY-LDPY 113
           RKGFGSAQLESVSAVEDK+K KD  D   + WT  WGVPH  SY +CY L PY
Sbjct: 61  RKGFGSAQLESVSAVEDKEKKKDKDD--GITWTCSWGVPHHSSYCYCYNLRPY 111

BLAST of Tan0001662 vs. NCBI nr
Match: KAG7010798.1 (Heavy metal-associated isoprenylated plant protein 46, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 159.8 bits (403), Expect = 1.3e-35
Identity = 81/117 (69.23%), Positives = 97/117 (82.91%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV+RVSMKRG KYRSKALKIAASVSG++E ISL GDDKDK+E+VGD+DPIELTE+L
Sbjct: 1   MKQKIVIRVSMKRGAKYRSKALKIAASVSGDIETISLVGDDKDKIEVVGDLDPIELTEML 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKD-----DKDKPEVVWTYGWGVPHQPSYYHCYLDPY 113
           RK FG AQLESVS VE K +D+D     D+++P++ WT  WGVPHQ +Y +CY  PY
Sbjct: 61  RKRFGCAQLESVSVVEAKKEDEDMNKDKDENEPQITWTCSWGVPHQTAYCYCYHCPY 117

BLAST of Tan0001662 vs. NCBI nr
Match: XP_022140688.1 (uncharacterized protein LOC111011290 [Momordica charantia])

HSP 1 Score: 156.8 bits (395), Expect = 1.1e-34
Identity = 84/117 (71.79%), Positives = 99/117 (84.62%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSG------NVEKISLEGDDKDKVELVGDVDPI 60
           MKQKIV+RVSMK+G KYRSK LKIAASVSG      NVEKISLEG+DKDK+E++G+VD I
Sbjct: 1   MKQKIVIRVSMKKGQKYRSKVLKIAASVSGNHQFIRNVEKISLEGEDKDKLEVIGEVDAI 60

Query: 61  ELTELLRKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCYLDP 112
           ELT+LLRKGFGSAQLESVSAV+ KDKDK DK++P + WT  WG+P+Q SY +CYL P
Sbjct: 61  ELTDLLRKGFGSAQLESVSAVDGKDKDK-DKEEPGICWTNAWGLPYQ-SYCNCYLHP 115

BLAST of Tan0001662 vs. ExPASy TrEMBL
Match: A0A1S3BIE5 (uncharacterized protein LOC103490424 OS=Cucumis melo OX=3656 GN=LOC103490424 PE=4 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 1.4e-38
Identity = 89/115 (77.39%), Positives = 100/115 (86.96%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV+RVSMKRGPKYRSKALKIAASV G++E ISL GDDKDKVE+VGD+DP+ELTELL
Sbjct: 1   MKQKIVIRVSMKRGPKYRSKALKIAASVKGSIETISLVGDDKDKVEVVGDIDPMELTELL 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKD-DKDKP-EVVWTYGWGVPHQPSYYHCY-LDPY 113
           RKGFGSAQLE+V+AVEDKDKDKD DK+K   + WT  WGVPH  SY +CY L PY
Sbjct: 61  RKGFGSAQLETVNAVEDKDKDKDKDKNKDGGITWTCSWGVPHHSSYCYCYNLGPY 115

BLAST of Tan0001662 vs. ExPASy TrEMBL
Match: A0A0A0KAU7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G113600 PE=4 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 1.3e-36
Identity = 86/113 (76.11%), Positives = 95/113 (84.07%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV++VSMKRGPKYRSKALKIAASV G++E ISL GD KDKVE+VGD+DPIELTELL
Sbjct: 1   MKQKIVIQVSMKRGPKYRSKALKIAASVKGSIETISLVGDHKDKVEVVGDLDPIELTELL 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCY-LDPY 113
           RKGFGSAQLESVSAVEDK+K KD  D   + WT  WGVPH  SY +CY L PY
Sbjct: 61  RKGFGSAQLESVSAVEDKEKKKDKDD--GITWTCSWGVPHHSSYCYCYNLRPY 111

BLAST of Tan0001662 vs. ExPASy TrEMBL
Match: A0A6J1CIJ8 (uncharacterized protein LOC111011290 OS=Momordica charantia OX=3673 GN=LOC111011290 PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 5.5e-35
Identity = 84/117 (71.79%), Positives = 99/117 (84.62%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSG------NVEKISLEGDDKDKVELVGDVDPI 60
           MKQKIV+RVSMK+G KYRSK LKIAASVSG      NVEKISLEG+DKDK+E++G+VD I
Sbjct: 1   MKQKIVIRVSMKKGQKYRSKVLKIAASVSGNHQFIRNVEKISLEGEDKDKLEVIGEVDAI 60

Query: 61  ELTELLRKGFGSAQLESVSAVEDKDKDKDDKDKPEVVWTYGWGVPHQPSYYHCYLDP 112
           ELT+LLRKGFGSAQLESVSAV+ KDKDK DK++P + WT  WG+P+Q SY +CYL P
Sbjct: 61  ELTDLLRKGFGSAQLESVSAVDGKDKDK-DKEEPGICWTNAWGLPYQ-SYCNCYLHP 115

BLAST of Tan0001662 vs. ExPASy TrEMBL
Match: A0A6J1FYA9 (uncharacterized protein LOC111449069 OS=Cucurbita moschata OX=3662 GN=LOC111449069 PE=4 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 1.6e-34
Identity = 79/117 (67.52%), Positives = 95/117 (81.20%), Query Frame = 0

Query: 1   MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELL 60
           MKQKIV+RVSMKRG KYRS+ALKIAASV G++E ISL GDDKDK+E+VGD+DPIELTE+L
Sbjct: 1   MKQKIVIRVSMKRGAKYRSQALKIAASVLGDIETISLVGDDKDKIEVVGDLDPIELTEML 60

Query: 61  RKGFGSAQLESVSAVEDKDKDKD-----DKDKPEVVWTYGWGVPHQPSYYHCYLDPY 113
           RK FG AQLESVS VE K +D+D     D+++P++ WT  WGVPHQ  Y +CY  PY
Sbjct: 61  RKRFGCAQLESVSVVEAKKEDEDVNKDKDENEPQITWTCSWGVPHQTPYCYCYHCPY 117

BLAST of Tan0001662 vs. ExPASy TrEMBL
Match: A0A5A7UY93 (Putative Heavy metal transport/detoxification superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold264G001130 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 2.7e-34
Identity = 81/105 (77.14%), Positives = 90/105 (85.71%), Query Frame = 0

Query: 11  MKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVGDVDPIELTELLRKGFGSAQLE 70
           MKRGPKYRSKALKIAASV G++E ISL GDDKDKVE+VGD+DPIELTELLRKGFGSAQLE
Sbjct: 1   MKRGPKYRSKALKIAASVKGSIETISLVGDDKDKVEVVGDIDPIELTELLRKGFGSAQLE 60

Query: 71  SVSAVEDKDKDKD-DKDKP-EVVWTYGWGVPHQPSYYHCY-LDPY 113
           +V+AVEDKDKDKD DK+K   + WT  WGVPH  SY +CY L PY
Sbjct: 61  TVNAVEDKDKDKDKDKNKDGGITWTCSWGVPHHSSYCYCYNLGPY 105

BLAST of Tan0001662 vs. TAIR 10
Match: AT5G48290.1 (Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 50.4 bits (119), Expect = 1.1e-06
Identity = 34/86 (39.53%), Positives = 54/86 (62.79%), Query Frame = 0

Query: 1  MKQKIVVRVSMKRGPKYRSKALKIAASVSGNVEKISLEGDDKDKVELVG-DVDPIELTEL 60
          MKQKI++RV+M    K R+KA+  A    G V  + ++GD ++++E+ G +VD I L ++
Sbjct: 1  MKQKILIRVTM-TDDKTRAKAMTKAVQFKG-VSAVEIKGDHRNQIEVTGVEVDMIPLIQI 60

Query: 61 LRKGFGSAQLESVSAVEDKDKDKDDK 86
          LRK    A+L SV+ VE   K+ + K
Sbjct: 61 LRKKVAFAELVSVTKVEPPKKEDEKK 84

BLAST of Tan0001662 vs. TAIR 10
Match: AT5G48290.2 (Heavy metal transport/detoxification superfamily protein )

HSP 1 Score: 41.6 bits (96), Expect = 5.0e-04
Identity = 26/71 (36.62%), Positives = 43/71 (60.56%), Query Frame = 0

Query: 16 KYRSKALKIAASVSGNVEKISLEGDDKDKVELVG-DVDPIELTELLRKGFGSAQLESVSA 75
          K R+KA+  A    G V  + ++GD ++++E+ G +VD I L ++LRK    A+L SV+ 
Sbjct: 5  KTRAKAMTKAVQFKG-VSAVEIKGDHRNQIEVTGVEVDMIPLIQILRKKVAFAELVSVTK 64

Query: 76 VEDKDKDKDDK 86
          VE   K+ + K
Sbjct: 65 VEPPKKEDEKK 74

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q5PNZ7 | 1.5e-05 | 39.53 | Heavy metal-associated isoprenylated plant protein 46 OS=Arabidopsis thaliana OX...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038900437.1 | 1.3e-38 | 76.79 | uncharacterized protein LOC120087662 [Benincasa hispida]
XP_008448137.1 | 2.9e-38 | 77.39 | PREDICTED: uncharacterized protein LOC103490424 [Cucumis melo]
XP_004140067.1 | 2.7e-36 | 76.11 | uncharacterized protein LOC101216311 [Cucumis sativus] >KGN46623.1 hypothetical ...
KAG7010798.1 | 1.3e-35 | 69.23 | Heavy metal-associated isoprenylated plant protein 46, partial [Cucurbita argyro...
XP_022140688.1 | 1.1e-34 | 71.79 | uncharacterized protein LOC111011290 [Momordica charantia]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3BIE5 | 1.4e-38 | 77.39 | uncharacterized protein LOC103490424 OS=Cucumis melo OX=3656 GN=LOC103490424 PE=...
A0A0A0KAU7 | 1.3e-36 | 76.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G113600 PE=4 SV=1
A0A6J1CIJ8 | 5.5e-35 | 71.79 | uncharacterized protein LOC111011290 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A6J1FYA9 | 1.6e-34 | 67.52 | uncharacterized protein LOC111449069 OS=Cucurbita moschata OX=3662 GN=LOC1114490...
A0A5A7UY93 | 2.7e-34 | 77.14 | Putative Heavy metal transport/detoxification superfamily protein OS=Cucumis mel...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G48290.1 | 1.1e-06 | 39.53 | Heavy metal transport/detoxification superfamily protein
AT5G48290.2 | 5.0e-04 | 36.62 | Heavy metal transport/detoxification superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.30.70.100 | - | coord: 1..78; e-value: 6.1E-22; score: 79.6
None | No IPR available | PANTHER | PTHR46371:SF7 | HEAVY METAL-ASSOCIATED ISOPRENYLATED PLANT PROTEIN 46 | coord: 3..89
IPR044296 | Heavy metal-associated isoprenylated plant protein 16/46/47 | PANTHER | PTHR46371 | OS04G0464100 PROTEIN | coord: 3..89

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0001662.1 | Tan0001662.1 | mRNA