Tan0015549 (gene) Snake gourd v1

Overview
Name: Tan0015549
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: BEST Arabidopsis thaliana protein match is: NHL domain-containing protein.
Location: LG01: 103659715 .. 103660138 (+)
RNA-Seq Expression: Tan0015549
Synteny: Tan0015549
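
The Location field packs a linkage group, a start, an end, and a strand into one string. A minimal Python sketch of pulling those parts out is shown here. It assumes the coordinates are 1-based and inclusive (as in GFF3), which the page does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

LOCATION = "LG01: 103659715 .. 103660138 (+)"

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    m = re.search(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(LOCATION)
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # LG01 + 424
```

Under that assumption the feature spans 424 bp on the forward strand of LG01.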
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTATTGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTTCTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAAAATAGAAAGATTCCGAAATTTCGCATATGGTAATTTAATCTAATGGGCTTCTTTATTATATATATATGTATCCTCTCTTCTTCTTCTTTTTACAGATAGATTTATATAGGACTTGACAATATGGTATATATAATATAATATAATATACAGAAAGAAG

mRNA sequence

ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTATTGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTTCTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAAAATAGAAAGATTCCGAAATTTCGCATATGGTAATTTAATCTAATGGGCTTCTTTATTATATATATATGTATCCTCTCTTCTTCTTCTTTTTACAGATAGATTTATATAGGACTTGACAATATGGTATATATAATATAATATAATATACAGAAAGAAG

Coding sequence (CDS)

ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTATTGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTTCTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAA

Protein sequence

MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASASKPLAEKK
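
The protein sequence is the conceptual translation of the CDS. A minimal Python sketch of that step follows. It assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not name. `translate` is a hypothetical helper written for this illustration, and the two constants are copied from the CDS and protein sequences of this record.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Any trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

CDS = "ATGGAGAAATCGGAGGGGCACAAAAGCAAAGAAAAAGAGTTATTGTCGTGTTGGGGGCGTTTGAAGTTGAAGCTTTCGGATATAAAAAAAGAGGGCAATAATCTGTGTATTGGAGAGACAAAGCCATTGAATGGCGGATTCAGATACGACGCCTTAAGCTACGCTCAGAACTTCGATGACGGATTGAAGGATGCTGAAGAAGGACGTTATCGAGGTTTTTCTGCTAGATATGCTTCTGCTTCCAAACCGCTTGCTGAGAAAAAGTAA"
PROTEIN = "MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASASKPLAEKK"

# Compare the translation of the CDS with the listed protein sequence.
print(translate(CDS) == PROTEIN)
```

If the record is internally consistent, the comparison should confirm that the listed protein is the translation of the CDS up to its stop codon.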
Homology
BLAST of Tan0015549 vs. NCBI nr
Match: KAG6608120.1 (hypothetical protein SDJN03_01462, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 131.0 bits (328), Expect = 5.2e-27
Identity = 69/89 (77.53%), Positives = 75/89 (84.27%), Query Frame = 0

Query: 1  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDD 60
          MEKSEG KS E EL+SCWGRLKLKL  +K+EGNN CIG+TKPLNGGFRYDALSYAQNFD+
Sbjct: 1  MEKSEGQKSSE-ELVSCWGRLKLKLLLMKREGNNPCIGDTKPLNGGFRYDALSYAQNFDE 60

Query: 61 GLKDA-EEGRYRGFSARYASASKPLAEKK 89
          GL D  EE R RGFSARY SASKPL + K
Sbjct: 61 GLDDEDEELRCRGFSARYVSASKPLPKNK 88
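
The percentages on each HSP summary line are the raw counts divided by the alignment length (89 columns for this hit, gap column included). A minimal Python sketch that recomputes them from the counts follows. `hsp_percentages` is a hypothetical helper, and the sample line is the summary line of the hit above.

```python
import re

HSP_LINE = "Identity = 69/89 (77.53%), Positives = 75/89 (84.27%), Query Frame = 0"

def hsp_percentages(line):
    """Recompute identity and positive percentages from the raw counts."""
    # 'Posi?tives' also accepts the misspelling 'Postives' found in some exports.
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

print(hsp_percentages(HSP_LINE))  # (77.53, 84.27)
```

The same function can be applied to the summary line of any other hit on this page to check its reported percentages.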

BLAST of Tan0015549 vs. NCBI nr
Match: KGN60281.1 (hypothetical protein Csa_002688 [Cucumis sativus])

HSP 1 Score: 84.3 bits (207), Expect = 5.6e-13
Identity = 54/101 (53.47%), Positives = 70/101 (69.31%), Query Frame = 0

Query: 1   MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGF 60
           M+K+E  K+KE E +SCW RL++K+   KKEGN       N+ C+ GET  LN    G F
Sbjct: 1   MKKAEKQKNKE-ETMSCWERLRMKILSRKKEGNINNKNDTNITCMGGETSGLNNRSGGLF 60

Query: 61  RYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAEK 88
           +YDALSYA+NFD+GL +A+ EG +R FSARYA  SKP A+K
Sbjct: 61  KYDALSYAKNFDEGLANADGEGSFRSFSARYAVPSKPPAKK 100

BLAST of Tan0015549 vs. NCBI nr
Match: XP_007137662.1 (hypothetical protein PHAVU_009G145200g [Phaseolus vulgaris] >ESW09656.1 hypothetical protein PHAVU_009G145200g [Phaseolus vulgaris])

HSP 1 Score: 78.6 bits (192), Expect = 3.1e-11
Identity = 44/81 (54.32%), Positives = 55/81 (67.90%), Query Frame = 0

Query: 1  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDD 60
          MEKS   +SK    LSCWG LK+KL   K+  +N      KP+ GGF+YD LSYAQNFD+
Sbjct: 1  MEKSSSSRSKLNGWLSCWGCLKMKLPWAKRTSSN------KPV-GGFKYDPLSYAQNFDE 60

Query: 61 GLKDAEEGRYRGFSARYASAS 82
          GL + +E  +RGFSARYA+ S
Sbjct: 61 GLVEDDEELHRGFSARYAAPS 74

BLAST of Tan0015549 vs. NCBI nr
Match: OMO50828.1 (hypothetical protein CCACVL1_30223 [Corchorus capsularis])

HSP 1 Score: 78.2 bits (191), Expect = 4.0e-11
Identity = 51/98 (52.04%), Positives = 60/98 (61.22%), Query Frame = 0

Query: 1  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDAL 60
          ME+S   K+  +E LLSCWGRLKLKL   K+   NL    T P         GGFRYD L
Sbjct: 1  MERSSPSKNTNEEGLLSCWGRLKLKLPCTKRRMRNLGNTITAPFKAKNPKPAGGFRYDPL 60

Query: 61 SYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK 88
          SYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Sbjct: 61 SYAQNFDDGCLDDDIEGSLYRGFSSRYAAPSLRSVADK 98

BLAST of Tan0015549 vs. NCBI nr
Match: KAG7024589.1 (hypothetical protein SDJN02_13407, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 76.3 bits (186), Expect = 1.5e-10
Identity = 48/89 (53.93%), Positives = 55/89 (61.80%), Query Frame = 0

Query: 1  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDD 60
          ME     K K KE  S W  LK K+S  KKEG       +K  +G F YDA+SYAQNFDD
Sbjct: 1  MEIEGPEKKKSKETTSIWKCLKSKISSGKKEGTT-----SKNEHGKFSYDAVSYAQNFDD 60

Query: 61 GLKDA-EEGRYRGFSARYASASKPLAEKK 89
          GL +A +EG  R FSARYA ASKP  +KK
Sbjct: 61 GLANADDEGSSRSFSARYAVASKPPPKKK 84

BLAST of Tan0015549 vs. ExPASy TrEMBL
Match: A0A0A0LJR5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G893310 PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.7e-13
Identity = 54/101 (53.47%), Positives = 70/101 (69.31%), Query Frame = 0

Query: 1   MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGN-------NL-CI-GETKPLN----GGF 60
           M+K+E  K+KE E +SCW RL++K+   KKEGN       N+ C+ GET  LN    G F
Sbjct: 1   MKKAEKQKNKE-ETMSCWERLRMKILSRKKEGNINNKNDTNITCMGGETSGLNNRSGGLF 60

Query: 61  RYDALSYAQNFDDGLKDAE-EGRYRGFSARYASASKPLAEK 88
           +YDALSYA+NFD+GL +A+ EG +R FSARYA  SKP A+K
Sbjct: 61  KYDALSYAKNFDEGLANADGEGSFRSFSARYAVPSKPPAKK 100

BLAST of Tan0015549 vs. ExPASy TrEMBL
Match: V7AZM1 (Uncharacterized protein OS=Phaseolus vulgaris OX=3885 GN=PHAVU_009G145200g PE=4 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 1.5e-11
Identity = 44/81 (54.32%), Positives = 55/81 (67.90%), Query Frame = 0

Query: 1  MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDD 60
          MEKS   +SK    LSCWG LK+KL   K+  +N      KP+ GGF+YD LSYAQNFD+
Sbjct: 1  MEKSSSSRSKLNGWLSCWGCLKMKLPWAKRTSSN------KPV-GGFKYDPLSYAQNFDE 60

Query: 61 GLKDAEEGRYRGFSARYASAS 82
          GL + +E  +RGFSARYA+ S
Sbjct: 61 GLVEDDEELHRGFSARYAAPS 74

BLAST of Tan0015549 vs. ExPASy TrEMBL
Match: A0A1R3FYB4 (Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_30223 PE=4 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 1.9e-11
Identity = 51/98 (52.04%), Positives = 60/98 (61.22%), Query Frame = 0

Query: 1  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDAL 60
          ME+S   K+  +E LLSCWGRLKLKL   K+   NL    T P         GGFRYD L
Sbjct: 1  MERSSPSKNTNEEGLLSCWGRLKLKLPCTKRRMRNLGNTITAPFKAKNPKPAGGFRYDPL 60

Query: 61 SYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK 88
          SYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Sbjct: 61 SYAQNFDDGCLDDDIEGSLYRGFSSRYAAPSLRSVADK 98

BLAST of Tan0015549 vs. ExPASy TrEMBL
Match: A0A1R3GHI4 (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_35304 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 9.6e-11
Identity = 50/98 (51.02%), Positives = 59/98 (60.20%), Query Frame = 0

Query: 1  MEKSEGHKSKEKE-LLSCWGRLKLKLSDIKKEGNNLCIGETKPLN-------GGFRYDAL 60
          ME+S   K+  +E  LSCWGRLKLKL   K+   NL    T P         GGFRYD L
Sbjct: 1  MERSSPSKNTNEEGSLSCWGRLKLKLPCTKRRMRNLGNTITAPFKAKNPKPAGGFRYDPL 60

Query: 61 SYAQNFDDGL--KDAEEGRYRGFSARYASAS-KPLAEK 88
          SYAQNFDDG    D E   YRGFS+RYA+ S + +A+K
Sbjct: 61 SYAQNFDDGCLDDDIEGSLYRGFSSRYAAPSLRSVADK 98

BLAST of Tan0015549 vs. ExPASy TrEMBL
Match: A0A0B2RTQ8 (Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_001296 PE=4 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 6.2e-10
Identity = 46/83 (55.42%), Positives = 55/83 (66.27%), Query Frame = 0

Query: 1  MEKSEGHKSKEK-ELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFD 60
          MEKS   +SK     LSCWGRLKLKL   K+  +       KP+ GGF YD LSYAQNFD
Sbjct: 1  MEKSSSLRSKTNGGRLSCWGRLKLKLPWAKRTSS------YKPI-GGFNYDPLSYAQNFD 60

Query: 61 DG-LKDAEEGRYRGFSARYASAS 82
          +G ++D EE  +RGFSARYA+ S
Sbjct: 61 EGWVEDDEESLHRGFSARYAAPS 76

BLAST of Tan0015549 vs. TAIR 10
Match: AT5G14890.1 (NHL domain-containing protein )

HSP 1 Score: 43.9 bits (102), Expect = 7.8e-05
Identity = 26/69 (37.68%), Positives = 39/69 (56.52%), Query Frame = 0

Query: 28  IKKEGNNLCI------GETKPLNGGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYA 87
           I++ G N C       G  +P +  FRYD+ SY+ NFDDG +     +E  YR +S R+A
Sbjct: 670 IRRFGRNHCCNGGIDGGCNRPEHVSFRYDSWSYSLNFDDGKQTGHFEDEFPYRDYSMRFA 729

BLAST of Tan0015549 vs. TAIR 10
Match: AT3G01430.1 (BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1); Has 98 Blast hits to 98 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 42.0 bits (97), Expect = 3.0e-04
Identity = 22/46 (47.83%), Positives = 30/46 (65.22%), Query Frame = 0

Query: 45  GGFRYDALSYAQNFDDGLKDA---EEGRYRGFSARYASASKPLAEK 88
           G FRYD LSY+ NFDDG +     +E  YR +S R+A+ S P++ K
Sbjct: 123 GKFRYDQLSYSLNFDDGNQTGHFDDEFPYRDYSMRFAAPSLPVSTK 168

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name        E-value   Identity  Description
KAG6608120.1      5.2e-27   77.53     hypothetical protein SDJN03_01462, partial [Cucurbita argyrosperma subsp. sorori...
KGN60281.1        5.6e-13   53.47     hypothetical protein Csa_002688 [Cucumis sativus]
XP_007137662.1    3.1e-11   54.32     hypothetical protein PHAVU_009G145200g [Phaseolus vulgaris] >ESW09656.1 hypothet...
OMO50828.1        4.0e-11   52.04     hypothetical protein CCACVL1_30223 [Corchorus capsularis]
KAG7024589.1      1.5e-10   53.93     hypothetical protein SDJN02_13407, partial [Cucurbita argyrosperma subsp. argyro...

vs. ExPASy TrEMBL
Match Name        E-value   Identity  Description
A0A0A0LJR5        2.7e-13   53.47     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G893310 PE=4 SV=1
V7AZM1            1.5e-11   54.32     Uncharacterized protein OS=Phaseolus vulgaris OX=3885 GN=PHAVU_009G145200g PE=4 ...
A0A1R3FYB4        1.9e-11   52.04     Uncharacterized protein OS=Corchorus capsularis OX=210143 GN=CCACVL1_30223 PE=4 ...
A0A1R3GHI4        9.6e-11   51.02     Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_35304 PE=4 SV=1
A0A0B2RTQ8        6.2e-10   55.42     Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_001296 PE=4 SV=1

vs. TAIR 10
Match Name        E-value   Identity  Description
AT5G14890.1       7.8e-05   37.68     NHL domain-containing protein
AT3G01430.1       3.0e-04   47.83     BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term     Source Description              Alignment
None      No IPR available  PANTHER  PTHR33168       STRESS INDUCED PROTEIN-RELATED  coord: 8..83
None      No IPR available  PANTHER  PTHR33168:SF39  SUBFAMILY NOT NAMED             coord: 8..83
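
The `coord: 8..83` entries give the stretch of the protein covered by each PANTHER match. A minimal Python sketch of extracting that stretch follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `slice_1based` is a hypothetical helper name.

```python
PROTEIN = "MEKSEGHKSKEKELLSCWGRLKLKLSDIKKEGNNLCIGETKPLNGGFRYDALSYAQNFDDGLKDAEEGRYRGFSARYASASKPLAEKK"

def slice_1based(seq, start, end):
    """Return residues start..end, treating both positions as 1-based and inclusive."""
    return seq[start - 1:end]

# Assumption: 'coord: 8..83' is 1-based inclusive, so it covers 83 - 8 + 1 = 76 residues.
region = slice_1based(PROTEIN, 8, 83)
print(len(region), region[:8])
```

Under that assumption the match covers 76 residues of the protein, beginning at the lysine at position 8.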

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0015549.1    Tan0015549.1    mRNA