Tan0001891 (gene) Snake gourd v1

Overview
Name: Tan0001891
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF761 domain-containing protein
Location: LG08: 61345451 .. 61346142 (-)
RNA-Seq Expression: Tan0001891
Synteny: Tan0001891
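The location string packs a sequence ID, a coordinate range, and a strand into one field. A minimal sketch of splitting it apart is shown here; the function names are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for strings such as "LG08: 61345451 .. 61346142 (-)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG08: 61345451 .. 61346142 (-)")
print(seqid, start, end, strand, span_length(start, end))
```

Under that assumption the feature spans 692 bp on the minus strand of LG08.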
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AACTCCTTTACAACATAACTCAAAAAAAGAAAAAAAAAATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCAACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCAGAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGCCAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGAAAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGACTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGACGAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTGGCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGAGGATCTTTTTTTTAAAAAAAAATTAATTGTGGGGTTTTGTTTTGGTGGGAGGAAAAAAATAAATTTGAATTTCTTGAGGATGGGGATGGTGGTGATATTAATCTGTAAGATTTTTTTTCTTTTTTTTTTTTTACCAGTAATTGAAATTAATAAGGG

mRNA sequence

AACTCCTTTACAACATAACTCAAAAAAAGAAAAAAAAAATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCAACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCAGAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGCCAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGAAAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGACTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGACGAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTGGCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGAGGATCTTTTTTTTAAAAAAAAATTAATTGTGGGGTTTTGTTTTGGTGGGAGGAAAAAAATAAATTTGAATTTCTTGAGGATGGGGATGGTGGTGATATTAATCTGTAAGATTTTTTTTCTTTTTTTTTTTTTACCAGTAATTGAAATTAATAAGGG

Coding sequence (CDS)

ATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAATTCATCATCTTCAACCTTAACCCAGCTGGTGAAATTCAAAACCCTATTGCAGAGTCTCATTCTATCTCTGGCTAAAGCCATTTCCAGAGCCAAAACGACGGCGCTTCACATCTTCAAACAGGCCAATTACCAATCCACCGCCATGGCTAATTGGAAGAAGAAAAAGAATAAGCTTCTCTTCGGATCCTTCAGACTTCATTACAACTGGTGCTCTTCGTCGTCATCGCACGTGACTCCGGCGCCGGTCACGTGGGAGGGGGACTCCGGCGACGAGCTTTCTGGGTATTTGCAGTGGCTGGAGGAGAGAGATGAAAAAAAAGAAGTGAATGAGATTGATAAATTGGCAGAGATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTGGAAAAACAGGAGTCTTATAGGAGGTTTCAACAATTGATGGCTACAAGCTTGTGA

Protein sequence

MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDSGDELSGYLQWLEERDEKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL
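The protein sequence is the translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used. The function name `translate` is hypothetical, and only the first 60 and last 9 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First 60 nt and last 9 nt of the CDS shown above.
cds_start = "ATGAAAAAGGGTTCCCTTTCCCTTTCACCATCTTCACTCCAAGTAATTTTTGGCTCAAAT"
cds_end = "AGCTTGTGA"

print(translate(cds_start))  # MKKGSLSLSPSSLQVIFGSN
print(translate(cds_end))    # SL*
```

The first 20 residues and the final two residues agree with the protein sequence above; the trailing `*` is the TGA stop codon, which is not part of the polypeptide.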
Homology
BLAST of Tan0001891 vs. NCBI nr
Match: XP_022935250.1 (uncharacterized protein LOC111442186 [Cucurbita moschata])

HSP 1 Score: 193.0 bits (489), Expect = 2.1e-45
Identity = 115/170 (67.65%), Positives = 135/170 (79.41%), Query Frame = 0

Query: 1   MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQ 60
           MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQ
Sbjct: 1   MKMVSLSPSSSSLQ-IFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQ 60

Query: 61  ANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTW-EGDSGDELSGYLQWL 120
           AN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW +  + D L+GYLQWL
Sbjct: 61  ANHQS-AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADHLAGYLQWL 120

Query: 121 EERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           E+RD+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 121 EDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 158
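The percentages in each HSP header can be rechecked from the counts printed next to them. A minimal sketch is given here, assuming the header keeps the layout shown above; the function name is hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 115/170 (67.65%), Positives = 135/170 (79.41%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(header_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(header_line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {header_line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 115/170 (67.65%), Positives = 135/170 (79.41%), Query Frame = 0"
print(recompute_percentages(line))  # (67.65, 79.41)
```

For the Cucurbita moschata hit this reproduces the reported 67.65% identity and 79.41% positives.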

BLAST of Tan0001891 vs. NCBI nr
Match: XP_022983138.1 (uncharacterized protein LOC111481779 [Cucurbita maxima])

HSP 1 Score: 192.6 bits (488), Expect = 2.7e-45
Identity = 111/167 (66.47%), Positives = 133/167 (79.64%), Query Frame = 0

Query: 6   LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQS 65
           +SLSPSS   IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS
Sbjct: 4   VSLSPSSSLQIFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQANHQS 63

Query: 66  TAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDS---GDELSGYLQWLEER 125
            A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW+ +S    D L+GYLQWLE+R
Sbjct: 64  -AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWDDNSSGAADHLAGYLQWLEDR 123

Query: 126 DEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           D+++E+    NEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 124 DKEEELCRHANEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 159

BLAST of Tan0001891 vs. NCBI nr
Match: XP_023526429.1 (uncharacterized protein LOC111789933 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 191.0 bits (484), Expect = 7.9e-45
Identity = 115/173 (66.47%), Positives = 135/173 (78.03%), Query Frame = 0

Query: 1   MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQ 60
           MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQ
Sbjct: 1   MKMVSLSPSSSSLQ-IFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQ 60

Query: 61  ANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDS----GDELSGYL 120
           AN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW+ +S     D L+GYL
Sbjct: 61  ANHQS-AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWDDNSAAGAADHLTGYL 120

Query: 121 QWLEERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           QWLE +D+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 121 QWLEHKDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 161

BLAST of Tan0001891 vs. NCBI nr
Match: KAG6580813.1 (hypothetical protein SDJN03_20815, partial [Cucurbita argyrosperma subsp. sororia] >KAG7017563.1 hypothetical protein SDJN02_19429, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 189.9 bits (481), Expect = 1.8e-44
Identity = 115/173 (66.47%), Positives = 134/173 (77.46%), Query Frame = 0

Query: 1   MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQ 60
           MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQ
Sbjct: 1   MKMVSLSPSSSSLQ-IFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQ 60

Query: 61  ANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDS----GDELSGYL 120
           AN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW  +S     D L+GYL
Sbjct: 61  ANHQS-AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAAGAADHLAGYL 120

Query: 121 QWLEERDEK----KEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           QWLE+RD++    + VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 121 QWLEDRDKEEKLCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 161

BLAST of Tan0001891 vs. NCBI nr
Match: XP_038906153.1 (uncharacterized protein LOC120092033 [Benincasa hispida])

HSP 1 Score: 174.9 bits (442), Expect = 5.9e-40
Identity = 107/177 (60.45%), Positives = 124/177 (70.06%), Query Frame = 0

Query: 4   GSL--SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQA 63
           GSL  S SP S   I  S+SS S  + LVKFK +LQ+LILSLA+AISRAKTTA HI KQA
Sbjct: 2   GSLNSSSSPFSSLQILPSSSSPSPSSSLVKFKAVLQTLILSLARAISRAKTTAFHILKQA 61

Query: 64  NYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSS----SHVTPAPVTWE-------GDSGD 123
           N+Q       K+ K KLL+GSFRLHYNWCS SS    SHVTP  +TW+       G  GD
Sbjct: 62  NHQYAIAL--KRNKKKLLYGSFRLHYNWCSVSSNYYNSHVTPPVITWDHEYSGGGGGGGD 121

Query: 124 ELSGYLQWLEERDEKKE------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLM 162
           +L GYL+WLEER+   +      VNEIDKLAEIFIAR HEKF+LEKQESYRRFQ ++
Sbjct: 122 QLGGYLEWLEERENNNKIKNEEGVNEIDKLAEIFIARSHEKFKLEKQESYRRFQDMI 176

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match: A0A6J1FA41 (uncharacterized protein LOC111442186 OS=Cucurbita moschata OX=3662 GN=LOC111442186 PE=4 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 1.0e-45
Identity = 115/170 (67.65%), Positives = 135/170 (79.41%), Query Frame = 0

Query: 1   MKKGSLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQ 60
           MK  SLS S SSLQ IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQ
Sbjct: 1   MKMVSLSPSSSSLQ-IFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQ 60

Query: 61  ANYQSTAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTW-EGDSGDELSGYLQWL 120
           AN+QS A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW +  + D L+GYLQWL
Sbjct: 61  ANHQS-AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADHLAGYLQWL 120

Query: 121 EERDEKKE----VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           E+RD+++E    VNEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 121 EDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 158

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match: A0A6J1J6X1 (uncharacterized protein LOC111481779 OS=Cucurbita maxima OX=3661 GN=LOC111481779 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 1.3e-45
Identity = 111/167 (66.47%), Positives = 133/167 (79.64%), Query Frame = 0

Query: 6   LSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQS 65
           +SLSPSS   IF S+SS          K LLQ+LILSLA+AISRAKTTALHI KQAN+QS
Sbjct: 4   VSLSPSSSLQIFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQANHQS 63

Query: 66  TAMANWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAPVTWEGDS---GDELSGYLQWLEER 125
            A+A +K+ KNKLLFGSFRLHYNWCSSS+ HV P P+TW+ +S    D L+GYLQWLE+R
Sbjct: 64  -AIA-FKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWDDNSSGAADHLAGYLQWLEDR 123

Query: 126 DEKKEV----NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
           D+++E+    NEIDKLA+IFIARCHEKFRLEKQESYR+FQ++ A SL
Sbjct: 124 DKEEELCRHANEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL 159

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match: A0A0A0LBV5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G809400 PE=4 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.2e-38
Identity = 111/182 (60.99%), Positives = 126/182 (69.23%), Query Frame = 0

Query: 7   SLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQST 66
           S S SSLQV+   + SSSTL   +KFK LLQ+LILSLA+AISRAKTTA   F+ AN   T
Sbjct: 6   SSSSSSLQVL--PSPSSSTLRLAIKFKALLQTLILSLARAISRAKTTA---FQSAN---T 65

Query: 67  AMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAPVTWE------GDSGDELSGYLQ 126
           A+   K+ K KLL+GSFRLHYNWCS SS   SHVTPA +T +      G  GD+L GYLQ
Sbjct: 66  AL---KRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTCDHGIGGGGGGGDQLGGYLQ 125

Query: 127 WLEERD---------------EKKEVNEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMA 165
           WLEERD               E + VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +MA
Sbjct: 126 WLEERDVNKKSNHNSNVEDDHEDQSVNEIDKLAEIFIARCHEKFKLEKQESYRRFQDMMA 176

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match: A0A5A7TJT8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G00520 PE=4 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 5.9e-38
Identity = 109/184 (59.24%), Positives = 125/184 (67.93%), Query Frame = 0

Query: 5   SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQ 64
           S S S SSLQV+   + SSSTL   +KFK LLQ+LI SLA+AISRAKTTA        +Q
Sbjct: 9   SSSSSSSSLQVL--PSPSSSTLRLAIKFKALLQTLIFSLARAISRAKTTA--------FQ 68

Query: 65  STAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAPVTWE-----GDSGDELSGYL 124
           S  +A  K+ K KLL+GSFRLHYNWCS SS   SHVTPA +T++     G  GD+L GYL
Sbjct: 69  SANIA-LKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTFDHGIGCGAGGDQLGGYL 128

Query: 125 QWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQL 165
           QWLEERD  K+                VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +
Sbjct: 129 QWLEERDVNKKSNNNNNNNNVEDREEGVNEIDKLAEIFIARCHEKFKLEKQESYRRFQDM 181

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match: A0A1S3B8H1 (uncharacterized protein LOC103487158 OS=Cucumis melo OX=3656 GN=LOC103487158 PE=4 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 5.9e-38
Identity = 109/184 (59.24%), Positives = 125/184 (67.93%), Query Frame = 0

Query: 5   SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQ 64
           S S S SSLQV+   + SSSTL   +KFK LLQ+LI SLA+AISRAKTTA        +Q
Sbjct: 9   SSSSSSSSLQVL--PSPSSSTLRLAIKFKALLQTLIFSLARAISRAKTTA--------FQ 68

Query: 65  STAMANWKKKKNKLLFGSFRLHYNWCSSSS---SHVTPAPVTWE-----GDSGDELSGYL 124
           S  +A  K+ K KLL+GSFRLHYNWCS SS   SHVTPA +T++     G  GD+L GYL
Sbjct: 69  SANIA-LKRNKKKLLYGSFRLHYNWCSVSSNKYSHVTPAVLTFDHGIGCGAGGDQLGGYL 128

Query: 125 QWLEERDEKKE----------------VNEIDKLAEIFIARCHEKFRLEKQESYRRFQQL 165
           QWLEERD  K+                VNEIDKLAEIFIARCHEKF+LEKQESYRRFQ +
Sbjct: 129 QWLEERDVNKKSNNNNNNNNVEDREEGVNEIDKLAEIFIARCHEKFKLEKQESYRRFQDM 181

BLAST of Tan0001891 vs. TAIR 10
Match: AT3G57950.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 4 anthesis, C globular stage, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G42180.1); Has 81 Blast hits to 81 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 81; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 104.8 bits (260), Expect = 7.0e-23
Identity = 70/176 (39.77%), Positives = 102/176 (57.95%), Query Frame = 0

Query: 20  NSSSSTLTQLVKFKTLLQSL----ILSLAKAISRAKTTALHIFK-QANYQSTAM-----A 79
           +SSSS+ +  +K KTL+Q+L    +    +A+++AK+  L I K  +N +   +      
Sbjct: 9   SSSSSSSSSSMKLKTLIQNLLTHPLYRFLRALAKAKSIFLEISKHNSNNKKRKLMMLFPT 68

Query: 80  NWKKKKNKLLFGSFRLHYNWCSSSSSHVTPAP---------VTWEGDSGDELSGYLQWLE 139
              K + K+ FGSFRLHYNWC   SSHV P P         +  E +   +LSGYL+WLE
Sbjct: 69  KASKNQRKIFFGSFRLHYNWC---SSHVVPVPQPFPFPSSYINGEEEDDSQLSGYLEWLE 128

Query: 140 ER--DEKKEV---------NEIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
            +  D+ +E+         ++ID LA++FIA CHEKF LEK ESYRRFQ+++   L
Sbjct: 129 HKKFDDVEEIGDVVADGGDDDIDHLADMFIANCHEKFLLEKVESYRRFQEMLERGL 181

BLAST of Tan0001891 vs. TAIR 10
Match: AT5G06790.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: LP.04 four leaves visible, LP.02 two leaves visible, petal differentiation and expansion stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G57950.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 92.4 bits (228), Expect = 3.6e-19
Identity = 81/203 (39.90%), Positives = 110/203 (54.19%), Query Frame = 0

Query: 5   SLSLSPSSLQVIFGSNSSSSTLTQLVKFKTLLQSLILS----LAKAISRAKTTALHIFKQ 64
           S S S SS Q     +SSSS  +  +K K+L+Q+LI+S    L + ISR  +  + + ++
Sbjct: 7   STSSSMSSSQ----CSSSSSPTSSSMKLKSLIQTLIISQVCRLLREISRVSSILVRVLRK 66

Query: 65  ANYQSTAMANW-------KKKKNKLLFGSFRLHYNWCSSSSSHVTP--APV--------- 124
             Y   ++++        KK+KN +LFGSFRLHYN+C   SSHV P  APV         
Sbjct: 67  KQYNFLSVSSLLYPKRVSKKQKNNILFGSFRLHYNFC---SSHVVPVSAPVRLPEELYLA 126

Query: 125 ------TWEG----------DSGD----ELSGYLQWLEER-----DEKKE--VNEIDKLA 159
                 TWE           D  D    +LS YL+ LE++     +E+ E  +NEIDKLA
Sbjct: 127 HLVHESTWESMYSTESMDGRDDDDQEPSQLSSYLRQLEDKVKDGQEEETETMMNEIDKLA 186

BLAST of Tan0001891 vs. TAIR 10
Match: AT2G42180.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G57950.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 73.2 bits (178), Expect = 2.2e-13
Identity = 63/173 (36.42%), Positives = 86/173 (49.71%), Query Frame = 0

Query: 19  SNSSSSTLTQLVKFKTLLQSLILSLAKAISRAKTTALHIFKQANYQSTAMANWKKK---- 78
           S+SSS +      F  L+   +  L +++SRA++  + I K    +   M  +  K    
Sbjct: 6   SSSSSHSTNLKTLFINLITHSLYRLLRSLSRARSVLIEISKHNKKRLFMMMFYTTKSSMN 65

Query: 79  KNKLLFGSFRLHYNWCSSSSSHVT-----PAPVTWEGDSGDE---LSGYLQWLEER-DEK 138
           ++ + FG            SSHV      P P + +G   DE    S YLQWLEER DE 
Sbjct: 66  QHNIFFG------------SSHVVVPVTKPFPFSLDGHVEDEDNLESQYLQWLEERVDEN 125

Query: 139 KEVN-------------EIDKLAEIFIARCHEKFRLEKQESYRRFQQLMATSL 166
             +N             +ID+LA+ FIARCHEKF LEK ESYRRFQ ++A SL
Sbjct: 126 NNINDDQSVGERDVGDDDIDRLADKFIARCHEKFLLEKVESYRRFQDMLARSL 166

The following BLAST results are available for this feature:
BLAST of Tan0001891 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022935250.1  2.1e-45  67.65         uncharacterized protein LOC111442186 [Cucurbita moschata]
XP_022983138.1  2.7e-45  66.47         uncharacterized protein LOC111481779 [Cucurbita maxima]
XP_023526429.1  7.9e-45  66.47         uncharacterized protein LOC111789933 [Cucurbita pepo subsp. pepo]
KAG6580813.1    1.8e-44  66.47         hypothetical protein SDJN03_20815, partial [Cucurbita argyrosperma subsp. sorori...
XP_038906153.1  5.9e-40  60.45         uncharacterized protein LOC120092033 [Benincasa hispida]

BLAST of Tan0001891 vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1FA41      1.0e-45  67.65         uncharacterized protein LOC111442186 OS=Cucurbita moschata OX=3662 GN=LOC1114421...
A0A6J1J6X1      1.3e-45  66.47         uncharacterized protein LOC111481779 OS=Cucurbita maxima OX=3661 GN=LOC111481779...
A0A0A0LBV5      1.2e-38  60.99         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G809400 PE=4 SV=1
A0A5A7TJT8      5.9e-38  59.24         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3B8H1      5.9e-38  59.24         uncharacterized protein LOC103487158 OS=Cucumis melo OX=3656 GN=LOC103487158 PE=...

BLAST of Tan0001891 vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G57950.1     7.0e-23  39.77         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G06790.1     3.6e-19  39.90         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G42180.1     2.2e-13  36.42         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source   Source Term    Source Description      Alignment
IPR008480  Protein of unknown function DUF761, plant  PFAM     PF05553        DUF761                  coord: 129..162; e-value: 5.3E-13; score: 48.3
None       No IPR available                           PANTHER  PTHR33450:SF4  DUF761 DOMAIN PROTEIN   coord: 19..164
None       No IPR available                           PANTHER  PTHR33450      EMB|CAB67623.1-RELATED  coord: 19..164

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0001891.1  Tan0001891.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005488      binding