Tan0007649 (gene) Snake gourd v1

Overview
NameTan0007649
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGTP cyclohydrolase II isoform 1
LocationLG01: 14412943 .. 14413722 (+)
RNA-Seq ExpressionTan0007649
SyntenyTan0007649
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAAGAGGCCAATTTTGAAAGGAATCAGTGTCTTGAAGCAAAAATAAACCGCAAGTTTATGAGAAGAAGCAAGCAAAGAATTCTTTCACAATAAATACCTTTTTGTCATTTTCCTGCTCTCTCTAAAAATCCAATCCCCTTCCTTCAGAATTCCGAGATCATGAAGCCGTCGACGATCGCCGCCTGGCCGACCAAGCCCAATCCAAATCACAAAGCCCTAGATTCCACCGGCGTTGACCTAATTCGGAATTGCGATCTCCCGCCGCCGCAGAAGGTATTCACGGCGATGGCGTGGGTGGGTAGAGGTAGAGGTCGGGAAGAGGTGGAATCGGGGGGGATGGAGGAGAAATTGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAAGCGGAGAGAAAGGCGGCGAAATTGATGAAGGAGAGGGATTGTATAAGTAGGGCTTTTGAAGATGAGGCCAGATTGATCTTCTGTTACAAACAGTGGCTGAAATTGATGGAGCTTAGGGTTTCGAAGTTGCAGAAGGGCAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAATTGCAATGGCGGAGAAATGAAATGGGTTTGGGCATTGGCGATTTGTCTGAGTGTTGTGGGAGTGGGCTTTCTCTTGGGCTATACATGTAATGTCGATGAACATCCATTTCTTATCAACAATACCTGACTATATTACCTTTTCCCCCTTTATATATATATTTTTTTTCTTTTTCCAACCATGTATTTATTTATTGCGAAAATTTCTC

mRNA sequence

AAAAAAAAGAGGCCAATTTTGAAAGGAATCAGTGTCTTGAAGCAAAAATAAACCGCAAGTTTATGAGAAGAAGCAAGCAAAGAATTCTTTCACAATAAATACCTTTTTGTCATTTTCCTGCTCTCTCTAAAAATCCAATCCCCTTCCTTCAGAATTCCGAGATCATGAAGCCGTCGACGATCGCCGCCTGGCCGACCAAGCCCAATCCAAATCACAAAGCCCTAGATTCCACCGGCGTTGACCTAATTCGGAATTGCGATCTCCCGCCGCCGCAGAAGGTATTCACGGCGATGGCGTGGGTGGGTAGAGGTAGAGGTCGGGAAGAGGTGGAATCGGGGGGGATGGAGGAGAAATTGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAAGCGGAGAGAAAGGCGGCGAAATTGATGAAGGAGAGGGATTGTATAAGTAGGGCTTTTGAAGATGAGGCCAGATTGATCTTCTGTTACAAACAGTGGCTGAAATTGATGGAGCTTAGGGTTTCGAAGTTGCAGAAGGGCAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAATTGCAATGGCGGAGAAATGAAATGGGTTTGGGCATTGGCGATTTGTCTGAGTGTTGTGGGAGTGGGCTTTCTCTTGGGCTATACATGTAATGTCGATGAACATCCATTTCTTATCAACAATACCTGACTATATTACCTTTTCCCCCTTTATATATATATTTTTTTTCTTTTTCCAACCATGTATTTATTTATTGCGAAAATTTCTC

Coding sequence (CDS)

ATGAAGCCGTCGACGATCGCCGCCTGGCCGACCAAGCCCAATCCAAATCACAAAGCCCTAGATTCCACCGGCGTTGACCTAATTCGGAATTGCGATCTCCCGCCGCCGCAGAAGGTATTCACGGCGATGGCGTGGGTGGGTAGAGGTAGAGGTCGGGAAGAGGTGGAATCGGGGGGGATGGAGGAGAAATTGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAAGCGGAGAGAAAGGCGGCGAAATTGATGAAGGAGAGGGATTGTATAAGTAGGGCTTTTGAAGATGAGGCCAGATTGATCTTCTGTTACAAACAGTGGCTGAAATTGATGGAGCTTAGGGTTTCGAAGTTGCAGAAGGGCAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAATTGCAATGGCGGAGAAATGAAATGGGTTTGGGCATTGGCGATTTGTCTGAGTGTTGTGGGAGTGGGCTTTCTCTTGGGCTATACATGTAATGTCGATGAACATCCATTTCTTATCAACAATACCTGA

Protein sequence

MKPSTIAAWPTKPNPNHKALDSTGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCNVDEHPFLINNT
Homology
BLAST of Tan0007649 vs. NCBI nr
Match: XP_022970087.1 (uncharacterized protein LOC111469054 [Cucurbita maxima])

HSP 1 Score: 250.4 bits (638), Expect = 1.2e-62
Identity = 143/193 (74.09%), Postives = 153/193 (79.27%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---A 60
           MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A
Sbjct: 43  MKQSTIAAWLNNYNNLKSNPEALDSTDMLLEFPYKPGVDLIRNCDLPPPQKLFTASDKGA 102

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
              RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 103 MAARGRGREEVEAGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 162

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGY 177
           IFCY+Q LKLMELRVSKL+K K   EEEE+  N NGG +KWVWALAICLSVVGVG LLGY
Sbjct: 163 IFCYRQSLKLMELRVSKLKKRK---EEEEDNGNGNGGGVKWVWALAICLSVVGVGILLGY 222

BLAST of Tan0007649 vs. NCBI nr
Match: KAG6600670.1 (hypothetical protein SDJN03_05903, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 247.7 bits (631), Expect = 7.7e-62
Identity = 141/193 (73.06%), Postives = 151/193 (78.24%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---A 60
           MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A
Sbjct: 1   MKQSTIAAWLNNYNNLKSNPEALDSTDMLLEFPYKPGVDLIRNCDLPPPQKLFTASDKGA 60

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
             GRGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 61  MAGRGRGREEVETGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 120

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGY 177
           IFCY+Q LKLMELRVSKL+K KEEEE+           +KWVWALAICLSVVGVG LLGY
Sbjct: 121 IFCYRQSLKLMELRVSKLKKRKEEEEDNGNGNGNGNRGVKWVWALAICLSVVGVGILLGY 180

BLAST of Tan0007649 vs. NCBI nr
Match: KAG7031309.1 (hypothetical protein SDJN02_05349, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 247.7 bits (631), Expect = 7.7e-62
Identity = 143/194 (73.71%), Postives = 153/194 (78.87%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---A 60
           MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A
Sbjct: 1   MKQSTIAAWLNNYNNLKSNPEALDSTDMLLEFPYKPGVDLIRNCDLPPPQKLFTASDKGA 60

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
             GRGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 61  MAGRGRGREEVETGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 120

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLG 177
           IFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLG
Sbjct: 121 IFCYRQSLKLMELRVSKLKKRK-----EEEEDNGNGNRGVKWVWALAICLSVVGVGILLG 180

BLAST of Tan0007649 vs. NCBI nr
Match: XP_022943244.1 (uncharacterized protein LOC111448032 [Cucurbita moschata])

HSP 1 Score: 240.0 bits (611), Expect = 1.6e-59
Identity = 140/194 (72.16%), Postives = 150/194 (77.32%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---A 60
           MK STIA W    N    N +ALDST           VDLIRNCDLPPPQK+FTA    A
Sbjct: 1   MKQSTIAGWLNNYNNLKSNPEALDSTDMLLEFPYKPCVDLIRNCDLPPPQKLFTASDKGA 60

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
              RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 61  MAARGRGREEVETGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 120

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLG 177
           IFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLG
Sbjct: 121 IFCYRQSLKLMELRVSKLKKRK-----EEEEDNGNGNRGVKWVWALAICLSVVGVGILLG 180

BLAST of Tan0007649 vs. NCBI nr
Match: XP_022136673.1 (uncharacterized protein LOC111008325 [Momordica charantia])

HSP 1 Score: 210.7 bits (535), Expect = 1.0e-50
Identity = 120/180 (66.67%), Postives = 135/180 (75.00%), Query Frame = 0

Query: 16  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEE 75
           N KA+DS              GVDLIRNCDLPPPQK+FT MA V + R R+E ESGG+EE
Sbjct: 2   NPKAMDSADMLLMGLEFSHKPGVDLIRNCDLPPPQKIFTGMARV-KDRERDEAESGGVEE 61

Query: 76  KLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKL 135
           KLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARLIF Y+Q +KL++LR+S L
Sbjct: 62  KLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARLIFFYRQSVKLLQLRLSNL 121

Query: 136 QKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL 175
           QK  +EEE     ++         GGE MKWVWALAIC +VVGVGFL GYTCNVDE P L
Sbjct: 122 QKMHDEEESRRNGDSDAPAAGGGGGGEAMKWVWALAICFTVVGVGFLYGYTCNVDEDPIL 180

BLAST of Tan0007649 vs. ExPASy TrEMBL
Match: A0A6J1HY55 (uncharacterized protein LOC111469054 OS=Cucurbita maxima OX=3661 GN=LOC111469054 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 5.7e-63
Identity = 143/193 (74.09%), Postives = 153/193 (79.27%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---A 60
           MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A
Sbjct: 43  MKQSTIAAWLNNYNNLKSNPEALDSTDMLLEFPYKPGVDLIRNCDLPPPQKLFTASDKGA 102

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
              RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 103 MAARGRGREEVEAGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 162

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGY 177
           IFCY+Q LKLMELRVSKL+K K   EEEE+  N NGG +KWVWALAICLSVVGVG LLGY
Sbjct: 163 IFCYRQSLKLMELRVSKLKKRK---EEEEDNGNGNGGGVKWVWALAICLSVVGVGILLGY 222

BLAST of Tan0007649 vs. ExPASy TrEMBL
Match: A0A6J1FTQ9 (uncharacterized protein LOC111448032 OS=Cucurbita moschata OX=3662 GN=LOC111448032 PE=4 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 7.8e-60
Identity = 140/194 (72.16%), Postives = 150/194 (77.32%), Query Frame = 0

Query: 1   MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---A 60
           MK STIA W    N    N +ALDST           VDLIRNCDLPPPQK+FTA    A
Sbjct: 1   MKQSTIAGWLNNYNNLKSNPEALDSTDMLLEFPYKPCVDLIRNCDLPPPQKLFTASDKGA 60

Query: 61  WVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARL 120
              RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL
Sbjct: 61  MAARGRGREEVETGGMEEKLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARL 120

Query: 121 IFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLG 177
           IFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLG
Sbjct: 121 IFCYRQSLKLMELRVSKLKKRK-----EEEEDNGNGNRGVKWVWALAICLSVVGVGILLG 180

BLAST of Tan0007649 vs. ExPASy TrEMBL
Match: A0A6J1C873 (uncharacterized protein LOC111008325 OS=Momordica charantia OX=3673 GN=LOC111008325 PE=4 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 5.0e-51
Identity = 120/180 (66.67%), Postives = 135/180 (75.00%), Query Frame = 0

Query: 16  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEE 75
           N KA+DS              GVDLIRNCDLPPPQK+FT MA V + R R+E ESGG+EE
Sbjct: 2   NPKAMDSADMLLMGLEFSHKPGVDLIRNCDLPPPQKIFTGMARV-KDRERDEAESGGVEE 61

Query: 76  KLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKL 135
           KLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARLIF Y+Q +KL++LR+S L
Sbjct: 62  KLELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARLIFFYRQSVKLLQLRLSNL 121

Query: 136 QKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL 175
           QK  +EEE     ++         GGE MKWVWALAIC +VVGVGFL GYTCNVDE P L
Sbjct: 122 QKMHDEEESRRNGDSDAPAAGGGGGGEAMKWVWALAICFTVVGVGFLYGYTCNVDEDPIL 180

BLAST of Tan0007649 vs. ExPASy TrEMBL
Match: A0A5D3BEN7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold220G00050 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 3.1e-48
Identity = 114/163 (69.94%), Postives = 124/163 (76.07%), Query Frame = 0

Query: 16  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKA 75
           N ++LDS       GV+LIRNCDLPPPQKVF                  GMEEK+ELLKA
Sbjct: 8   NRQSLDSRDMVQRDGVELIRNCDLPPPQKVF----------------KSGMEEKIELLKA 67

Query: 76  LRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKG---- 135
           LRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL+FCY+Q LKL+ELRV KLQK     
Sbjct: 68  LRLSQTRAREAERKAAKLMEERDCISRAFEDEARLMFCYRQSLKLLELRVCKLQKQEVEE 127

Query: 136 -KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN 168
            +EEEEEEE +EN   G MKWVWALAICLSVVGVGFLLGYTCN
Sbjct: 128 VEEEEEEEERDENGGNGGMKWVWALAICLSVVGVGFLLGYTCN 154

BLAST of Tan0007649 vs. ExPASy TrEMBL
Match: A0A1S3C2P7 (uncharacterized protein LOC103495794 OS=Cucumis melo OX=3656 GN=LOC103495794 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 3.1e-48
Identity = 114/163 (69.94%), Postives = 124/163 (76.07%), Query Frame = 0

Query: 16  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKA 75
           N ++LDS       GV+LIRNCDLPPPQKVF                  GMEEK+ELLKA
Sbjct: 8   NRQSLDSRDMVQRDGVELIRNCDLPPPQKVF----------------KSGMEEKIELLKA 67

Query: 76  LRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKG---- 135
           LRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL+FCY+Q LKL+ELRV KLQK     
Sbjct: 68  LRLSQTRAREAERKAAKLMEERDCISRAFEDEARLMFCYRQSLKLLELRVCKLQKQEVEE 127

Query: 136 -KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN 168
            +EEEEEEE +EN   G MKWVWALAICLSVVGVGFLLGYTCN
Sbjct: 128 VEEEEEEEERDENGGNGGMKWVWALAICLSVVGVGFLLGYTCN 154

BLAST of Tan0007649 vs. TAIR 10
Match: AT1G01240.1 (unknown protein; INVOLVED IN: N-terminal protein myristoylation; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G46550.1); Has 95 Blast hits to 78 proteins in 16 species: Archae - 0; Bacteria - 2; Metazoa - 11; Fungi - 0; Plants - 80; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 77.4 bits (189), Expect = 1.3e-14
Identity = 62/184 (33.70%), Postives = 87/184 (47.28%), Query Frame = 0

Query: 28  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE------- 87
           I+NCDLPPPQK+  ++                W   V + R    +   G  E       
Sbjct: 141 IQNCDLPPPQKLHKSIHSSSGEKGFKTAVKSPWKQGVWKDRFERSLSYNGSTESKNTSPM 200

Query: 88  ---------KLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLK 147
                    K +LL+ALR SQTRAREAER A +   E+D +      +A  +  YKQWLK
Sbjct: 201 SSPRSDDLSKGQLLEALRHSQTRAREAERAAREACAEKDRVITILLKQASQMLAYKQWLK 260

Query: 148 LMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFL 166
           L+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G L
Sbjct: 261 LLEMEALYLQMKKEEEQEEQVKGMNLKKRKQRGEKKKKGETGRYMMAFALGFSLIGAGLL 320

BLAST of Tan0007649 vs. TAIR 10
Match: AT1G01240.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G46550.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 77.4 bits (189), Expect = 1.3e-14
Identity = 62/184 (33.70%), Postives = 87/184 (47.28%), Query Frame = 0

Query: 28  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE------- 87
           I+NCDLPPPQK+  ++                W   V + R    +   G  E       
Sbjct: 141 IQNCDLPPPQKLHKSIHSSSGEKGFKTAVKSPWKQGVWKDRFERSLSYNGSTESKNTSPM 200

Query: 88  ---------KLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLK 147
                    K +LL+ALR SQTRAREAER A +   E+D +      +A  +  YKQWLK
Sbjct: 201 SSPRSDDLSKGQLLEALRHSQTRAREAERAAREACAEKDRVITILLKQASQMLAYKQWLK 260

Query: 148 LMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFL 166
           L+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G L
Sbjct: 261 LLEMEALYLQMKKEEEQEEQVKGMNLKKRKQRGEKKKKGETGRYMMAFALGFSLIGAGLL 320

BLAST of Tan0007649 vs. TAIR 10
Match: AT1G01240.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G46550.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 77.4 bits (189), Expect = 1.3e-14
Identity = 62/184 (33.70%), Postives = 87/184 (47.28%), Query Frame = 0

Query: 28  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE------- 87
           I+NCDLPPPQK+  ++                W   V + R    +   G  E       
Sbjct: 141 IQNCDLPPPQKLHKSIHSSSGEKGFKTAVKSPWKQGVWKDRFERSLSYNGSTESKNTSPM 200

Query: 88  ---------KLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLK 147
                    K +LL+ALR SQTRAREAER A +   E+D +      +A  +  YKQWLK
Sbjct: 201 SSPRSDDLSKGQLLEALRHSQTRAREAERAAREACAEKDRVITILLKQASQMLAYKQWLK 260

Query: 148 LMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFL 166
           L+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G L
Sbjct: 261 LLEMEALYLQMKKEEEQEEQVKGMNLKKRKQRGEKKKKGETGRYMMAFALGFSLIGAGLL 320

BLAST of Tan0007649 vs. TAIR 10
Match: AT2G46550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G01240.3); Has 72 Blast hits to 68 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 3.0e-11
Identity = 59/189 (31.22%), Postives = 84/189 (44.44%), Query Frame = 0

Query: 25  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGG 84
           +D + NCDLP PQK+  +     RG                         + R E  S  
Sbjct: 200 LDYVENCDLPTPQKMKRSYYGSPRGFDSDGLRDYSVSGQTIKGTSKGSSCKNRPEASSES 259

Query: 85  MEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRV 144
              K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F YKQWL+L++L  
Sbjct: 260 DLSKSELLEALRRSQTRAREAENMAKEAYAEKEHLVKILLKQAAELFGYKQWLQLLQLEA 319

Query: 145 SKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVV 166
             LQ K KE + +  ++         NG                   K+   LA+ +S+V
Sbjct: 320 LYLQIKNKEIDNKNNDDPGVSIPCWSNGKARKEGRKRRSKRGKPNGAKYAVGLALGMSLV 379

BLAST of Tan0007649 vs. TAIR 10
Match: AT2G46550.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G01240.3). )

HSP 1 Score: 66.2 bits (160), Expect = 3.0e-11
Identity = 59/189 (31.22%), Postives = 84/189 (44.44%), Query Frame = 0

Query: 25  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGG 84
           +D + NCDLP PQK+  +     RG                         + R E  S  
Sbjct: 60  LDYVENCDLPTPQKMKRSYYGSPRGFDSDGLRDYSVSGQTIKGTSKGSSCKNRPEASSES 119

Query: 85  MEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQWLKLMELRV 144
              K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F YKQWL+L++L  
Sbjct: 120 DLSKSELLEALRRSQTRAREAENMAKEAYAEKEHLVKILLKQAAELFGYKQWLQLLQLEA 179

Query: 145 SKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVV 166
             LQ K KE + +  ++         NG                   K+   LA+ +S+V
Sbjct: 180 LYLQIKNKEIDNKNNDDPGVSIPCWSNGKARKEGRKRRSKRGKPNGAKYAVGLALGMSLV 239

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022970087.11.2e-6274.09uncharacterized protein LOC111469054 [Cucurbita maxima][more]
KAG6600670.17.7e-6273.06hypothetical protein SDJN03_05903, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7031309.17.7e-6273.71hypothetical protein SDJN02_05349, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022943244.11.6e-5972.16uncharacterized protein LOC111448032 [Cucurbita moschata][more]
XP_022136673.11.0e-5066.67uncharacterized protein LOC111008325 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1HY555.7e-6374.09uncharacterized protein LOC111469054 OS=Cucurbita maxima OX=3661 GN=LOC111469054... [more]
A0A6J1FTQ97.8e-6072.16uncharacterized protein LOC111448032 OS=Cucurbita moschata OX=3662 GN=LOC1114480... [more]
A0A6J1C8735.0e-5166.67uncharacterized protein LOC111008325 OS=Momordica charantia OX=3673 GN=LOC111008... [more]
A0A5D3BEN73.1e-4869.94Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3C2P73.1e-4869.94uncharacterized protein LOC103495794 OS=Cucumis melo OX=3656 GN=LOC103495794 PE=... [more]
Match NameE-valueIdentityDescription
AT1G01240.11.3e-1433.70unknown protein; INVOLVED IN: N-terminal protein myristoylation; EXPRESSED IN: 1... [more]
AT1G01240.21.3e-1433.70unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G01240.31.3e-1433.70unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G46550.13.0e-1131.22unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G46550.23.0e-1131.22unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 112..139
NoneNo IPR availableCOILSCoilCoilcoord: 70..97
NoneNo IPR availablePANTHERPTHR33868:SF10OS08G0483100 PROTEINcoord: 21..167
NoneNo IPR availablePANTHERPTHR33868EXPRESSED PROTEINcoord: 21..167

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007649.1Tan0007649.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane