Tan0009074 (gene) Snake gourd v1

Overview
NameTan0009074
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4050 domain-containing protein
LocationLG11: 7459777 .. 7462116 (-)
RNA-Seq ExpressionTan0009074
SyntenyTan0009074
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATGAATAAAGGAAGCTCCCATTCAAAAGAGAAACTGACTGTGGGTCGTTCTTCATCTTCCAGTGAAGTGAAAAAGCCTGCAGAAAAAGATTTGAGCTCCTCCACATTTGTTAATCAGGGTAATAAGCAAGCATGTAGGAAGTTTGTAATTGCAGTTTCAGATTTGATATTTGATATGCATGAGCCACCTGTTTTCCAGTTTCTTGGAATTAGTATTTCTTTGTGCTTGATTGCGCCTGCTATAATTGATGACCCCATGGAATTTAAGACTCTTAAATAAATGCTCAAAAGAAAAGGAAAAAAAAAAGAAAAAAAAAGTATATCTCGTTATTTGGTTCTATTCATGCAAATATTTTTTAGGTTCTTTATAGCAATTTATGTATTCCATTCAATTTAGAGGCTTCAACTAAATGGTGACTGACAATGCGAAGCAACAAAATTCAAATCCCATCTAGTTTCTTGGATGGGGTTGGAATCTGTCGATGATCTAGAATTTCACTTTCTTAAAGCGTGAATTTAATAATGATAGAAATATTGGATTGAAAGGAAAGTGGACTAGTAGTTATGGTGGCTTGGGAGAGGAGGAGGTTTTCTGAGGGAAGATTGGCACCTTCATGACCTATACATTTTTGTTTCATTGATTGAATATCAGATGATGCATATCATGAAAAAGAATCTACTACGAGTTAAAATTGAACATCTGGTAGAACTAGAAGGACATATCTCCCAGCACAAATAAATATACAAGTTGCTTAATTGATGTGACAACTGTACCTCTTTTGGCCTTTAAATGTTCATGAATTGTCTTAAATAAGCTGTTTGTTTCTTCTAGCTTAGATTTCTTAAAATAGTGACAGTAACTTTTAGTTCATTACGTTTTGTTTGAAATGTACTTTTCATTGTCAATCTGATTGGATGTGTTGTGTAGCTGCAATTCGTTGGCATGAGAGTAGAAAGAAGTGGATTGATAAACAATCTCAGCAACAACAAAGAATGGAGAGGGAATCAATCATAAGGTGATCTCTCGCTCTCCTGAATGTTCTCAGTTGCAGAATAGTTAAATCATTCCATGTCGTGCTTTTGCTTGCAAGTAGTTTAGTTGGTGAACTTGAAAAAAAGTTTATAAATCATAATAAAATGAAGAATTCTGCGCATACGGTGAAAGTTGAACAAGAAATCCTACTATGGTTATGATATTACCCAAAACACTATAATAGTATTAAATTGAATCCCGAAAGATAACAGCAAATTAACCCAAACAGGAGTAGGTAAATAAGTAGACCTTCCAATTTTTGAGGATCAAAATATGCCTGAAGATATTAACTTAAGCCTATAGATTTTAAGCAAAGCTTTTCTAAATGTCAAGGTAGTCATTACTCTGACCAAACCCCACGGGAAGGAAATTAGATAATGATTGAAATAAAAAGTTGTAATTCTACCCACCTGAGCACATCTTAATTGGGTAAGACAGCTATCCTCGATCAAAAGGTCATATGTTCAAATCCTTCCATTTAGCATGTGGTTGAACTTTAGAACCGAAGGATACACACTTGTACAACCTATAGAGCGAATTCTGTCCAAGTAAGGTCGTACATGTTGTTGAGCTAAAAAAAAGTTTTGTAATTCTTGAAATAACTCTTTGGTGGGGGATAACTCTTCCATATCCATAACTAATGAAGCAAGATTATCTATAATCGCTCGATTCAAATGCGCAGTCGCTGACTCACTGGTCCTTGTCATTCTGAATGATGCAAAAGTCTTAGCATAAAGAGAAGAAAAAGGAAAAGCAGAAAGAGGTCATTGTATATCTTGAAGGAACATACCTTTTCCATTTTTTAATAACCAAGTATTTGATTTTCTTCTGTTGCTTTCCTTCACCCCTAGCAAAAAAACATTTGTCGTTACAAACATAATCTGAATGATGAATTTTTCACTTTAAATTTGCAGCTGGTCAACAGCATACGAAGATCTGCTCTCCACCAACGAGCCCTTCACTGAGCCAATACCTCTCACAGTAAGCTTCTGAAAAGTGCATGCAATTTCTATATATGTACCTGCATATATAGTCTCCCTTTGACCGTTAATCTATATATGTACCTGCAATACCTCAATCAGTTTTCTTTTTGTCTTTCTCTCTCGAGTTCAACTATATGTGGGAGTGAAGATTTAAACCTCGACCTTTTCGATCTGAGACCAGGATTCTGTTGTTCTGGTTTCATATTTTCTTGATTCTTTTCAATGAGCATGTTGCGATTGTTGATGCTTGTAATATGACTGGCAGGAGATGGTAGATTTCTTGGTTGATATTTGGCAAGATGAAGGGCTTTTCGATTAG

mRNA sequence

ATGGAAATGAATAAAGGAAGCTCCCATTCAAAAGAGAAACTGACTGTGGGTCGTTCTTCATCTTCCAGTGAAGTGAAAAAGCCTGCAGAAAAAGATTTGAGCTCCTCCACATTTGTTAATCAGGCTGCAATTCGTTGGCATGAGAGTAGAAAGAAGTGGATTGATAAACAATCTCAGCAACAACAAAGAATGGAGAGGGAATCAATCATAAGCTGGTCAACAGCATACGAAGATCTGCTCTCCACCAACGAGCCCTTCACTGAGCCAATACCTCTCACAGAGATGGTAGATTTCTTGGTTGATATTTGGCAAGATGAAGGGCTTTTCGATTAG

Coding sequence (CDS)

ATGGAAATGAATAAAGGAAGCTCCCATTCAAAAGAGAAACTGACTGTGGGTCGTTCTTCATCTTCCAGTGAAGTGAAAAAGCCTGCAGAAAAAGATTTGAGCTCCTCCACATTTGTTAATCAGGCTGCAATTCGTTGGCATGAGAGTAGAAAGAAGTGGATTGATAAACAATCTCAGCAACAACAAAGAATGGAGAGGGAATCAATCATAAGCTGGTCAACAGCATACGAAGATCTGCTCTCCACCAACGAGCCCTTCACTGAGCCAATACCTCTCACAGAGATGGTAGATTTCTTGGTTGATATTTGGCAAGATGAAGGGCTTTTCGATTAG

Protein sequence

MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD
Homology
BLAST of Tan0009074 vs. NCBI nr
Match: XP_038886790.1 (uncharacterized protein LOC120076904 [Benincasa hispida] >XP_038886792.1 uncharacterized protein LOC120076904 [Benincasa hispida] >XP_038886793.1 uncharacterized protein LOC120076904 [Benincasa hispida] >XP_038886794.1 uncharacterized protein LOC120076904 [Benincasa hispida])

HSP 1 Score: 194.1 bits (492), Expect = 6.2e-46
Identity = 99/110 (90.00%), Postives = 103/110 (93.64%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNK SSHSKE LTVGRSSSSSEVKKP EKDLSSSTF+NQAAIRWHE RKKW+DK SQQ
Sbjct: 1   MEMNKKSSHSKEILTVGRSSSSSEVKKPVEKDLSSSTFINQAAIRWHEGRKKWVDKNSQQ 60

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRMERESIISWSTAYEDLLST+EPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 61  QQRMERESIISWSTAYEDLLSTHEPFSEPIPLPEMVDFLVDIWHDEGLFD 110

BLAST of Tan0009074 vs. NCBI nr
Match: XP_022134091.1 (uncharacterized protein LOC111006447 [Momordica charantia])

HSP 1 Score: 185.3 bits (469), Expect = 2.9e-43
Identity = 96/110 (87.27%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           +EM+K SS+SKEK   G  SSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWI+KQSQQ
Sbjct: 19  IEMSKVSSYSKEKPNTGHFSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWINKQSQQ 78

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRMERESIISWSTAYEDLLSTNEPF+EPIPL EMVDFLVDIW D+GLFD
Sbjct: 79  QQRMERESIISWSTAYEDLLSTNEPFSEPIPLPEMVDFLVDIWHDDGLFD 128

BLAST of Tan0009074 vs. NCBI nr
Match: XP_022957458.1 (uncharacterized protein LOC111458848 isoform X1 [Cucurbita moschata] >KAG6602292.1 hypothetical protein SDJN03_07525, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 182.2 bits (461), Expect = 2.5e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 16  MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 75

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 76  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 124

BLAST of Tan0009074 vs. NCBI nr
Match: XP_022957466.1 (uncharacterized protein LOC111458848 isoform X2 [Cucurbita moschata])

HSP 1 Score: 182.2 bits (461), Expect = 2.5e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 15  MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 74

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 75  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 123

BLAST of Tan0009074 vs. NCBI nr
Match: XP_022957473.1 (uncharacterized protein LOC111458848 isoform X3 [Cucurbita moschata] >XP_022957481.1 uncharacterized protein LOC111458848 isoform X3 [Cucurbita moschata] >XP_022957490.1 uncharacterized protein LOC111458848 isoform X3 [Cucurbita moschata] >XP_022990494.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_022990496.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_022990497.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_022990498.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_022990499.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_022990500.1 uncharacterized protein LOC111487341 [Cucurbita maxima] >XP_023528467.1 uncharacterized protein LOC111791378 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023528477.1 uncharacterized protein LOC111791378 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023528483.1 uncharacterized protein LOC111791378 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023528491.1 uncharacterized protein LOC111791378 isoform X2 [Cucurbita pepo subsp. pepo] >KAG7032973.1 hypothetical protein SDJN02_07024, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 182.2 bits (461), Expect = 2.5e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 1   MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 60

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 61  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 109

BLAST of Tan0009074 vs. ExPASy TrEMBL
Match: A0A0A0LK63 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G324430 PE=4 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 2.0e-45
Identity = 98/110 (89.09%), Postives = 103/110 (93.64%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNK SSHSKEK T+GRSSSSSEVKKPAEKDLSS TFVNQAAI WHESRKKW+DK SQQ
Sbjct: 1   MEMNKKSSHSKEKPTMGRSSSSSEVKKPAEKDLSSPTFVNQAAICWHESRKKWVDKNSQQ 60

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRMERES+ISWSTAYEDLLSTN+PF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 61  QQRMERESMISWSTAYEDLLSTNDPFSEPIPLPEMVDFLVDIWHDEGLFD 110

BLAST of Tan0009074 vs. ExPASy TrEMBL
Match: A0A6J1BWZ9 (uncharacterized protein LOC111006447 OS=Momordica charantia OX=3673 GN=LOC111006447 PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.4e-43
Identity = 96/110 (87.27%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           +EM+K SS+SKEK   G  SSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWI+KQSQQ
Sbjct: 19  IEMSKVSSYSKEKPNTGHFSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWINKQSQQ 78

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRMERESIISWSTAYEDLLSTNEPF+EPIPL EMVDFLVDIW D+GLFD
Sbjct: 79  QQRMERESIISWSTAYEDLLSTNEPFSEPIPLPEMVDFLVDIWHDDGLFD 128

BLAST of Tan0009074 vs. ExPASy TrEMBL
Match: A0A6J1JTG3 (uncharacterized protein LOC111487341 OS=Cucurbita maxima OX=3661 GN=LOC111487341 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 1   MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 60

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 61  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 109

BLAST of Tan0009074 vs. ExPASy TrEMBL
Match: A0A6J1H0B4 (uncharacterized protein LOC111458848 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111458848 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 1   MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 60

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 61  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 109

BLAST of Tan0009074 vs. ExPASy TrEMBL
Match: A0A6J1GZA2 (uncharacterized protein LOC111458848 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111458848 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.2e-42
Identity = 95/110 (86.36%), Postives = 102/110 (92.73%), Query Frame = 0

Query: 1   MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQ 60
           MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   
Sbjct: 15  MEMNKGSSHSKEKPSMGRSSFSSEVKKPSEKDLSSSTFVNQAAIHWHESRKKWIDKQF-L 74

Query: 61  QQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLVDIW DEGLFD
Sbjct: 75  QQRMEKESMISWSTAYEDLLSSNEPFSEPIPLPEMVDFLVDIWHDEGLFD 123

BLAST of Tan0009074 vs. TAIR 10
Match: AT3G54880.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 100.1 bits (248), Expect = 1.1e-21
Identity = 55/106 (51.89%), Postives = 71/106 (66.98%), Query Frame = 0

Query: 7   SSHSKEKLTVGR--SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRM 66
           S+ +K  L + +      S VK  +E  L   T VN  A  W E+R+KW+  QS+Q++  
Sbjct: 10  STENKPTLELSKLVKDEKSSVKTNSENTL---TLVNHGAKMWQENREKWVGDQSRQRKNT 69

Query: 67  ERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
            ++ IISWST YEDLLST+EPF+E IPL EMVDFLVDIW DEGL+D
Sbjct: 70  AKDQIISWSTTYEDLLSTHEPFSESIPLPEMVDFLVDIWYDEGLYD 112

BLAST of Tan0009074 vs. TAIR 10
Match: AT5G03440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 1.0e-17
Identity = 44/92 (47.83%), Postives = 60/92 (65.22%), Query Frame = 0

Query: 19  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYED 78
           SS+SS  K+ + +++    FVN A I W E RKKW+   S +   M  E +I ++  YED
Sbjct: 11  SSNSSNDKEKSSEEI----FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYED 70

Query: 79  LLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           LL++N PF +PIPL EMVDFL DIW  +GLF+
Sbjct: 71  LLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98

BLAST of Tan0009074 vs. TAIR 10
Match: AT5G03440.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 1.0e-17
Identity = 44/92 (47.83%), Postives = 60/92 (65.22%), Query Frame = 0

Query: 19  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYED 78
           SS+SS  K+ + +++    FVN A I W E RKKW+   S +   M  E +I ++  YED
Sbjct: 11  SSNSSNDKEKSSEEI----FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYED 70

Query: 79  LLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           LL++N PF +PIPL EMVDFL DIW  +GLF+
Sbjct: 71  LLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98

BLAST of Tan0009074 vs. TAIR 10
Match: AT5G25360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 77.8 bits (190), Expect = 6.1e-15
Identity = 42/113 (37.17%), Postives = 65/113 (57.52%), Query Frame = 0

Query: 2   EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQ 61
           EM+  +  S+  ++       +S+S+    P E       FVN     W+++R++W+   
Sbjct: 64  EMDNSTLQSQRSMSSISFTNNTSTSASTSNPTE-------FVNHGLNLWNQTRQQWLANG 123

Query: 62  SQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           + Q++   RE  ISW+  YE LL  N+ F+ PIPL EMVDFLVD+W+ EGL+D
Sbjct: 124 TSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169

BLAST of Tan0009074 vs. TAIR 10
Match: AT5G25360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1). )

HSP 1 Score: 77.8 bits (190), Expect = 6.1e-15
Identity = 42/113 (37.17%), Postives = 65/113 (57.52%), Query Frame = 0

Query: 2   EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQ 61
           EM+  +  S+  ++       +S+S+    P E       FVN     W+++R++W+   
Sbjct: 64  EMDNSTLQSQRSMSSISFTNNTSTSASTSNPTE-------FVNHGLNLWNQTRQQWLANG 123

Query: 62  SQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD 111
           + Q++   RE  ISW+  YE LL  N+ F+ PIPL EMVDFLVD+W+ EGL+D
Sbjct: 124 TSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLYD 169

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038886790.16.2e-4690.00uncharacterized protein LOC120076904 [Benincasa hispida] >XP_038886792.1 unchara... [more]
XP_022134091.12.9e-4387.27uncharacterized protein LOC111006447 [Momordica charantia][more]
XP_022957458.12.5e-4286.36uncharacterized protein LOC111458848 isoform X1 [Cucurbita moschata] >KAG6602292... [more]
XP_022957466.12.5e-4286.36uncharacterized protein LOC111458848 isoform X2 [Cucurbita moschata][more]
XP_022957473.12.5e-4286.36uncharacterized protein LOC111458848 isoform X3 [Cucurbita moschata] >XP_0229574... [more]
Match NameE-valueIdentityDescription
A0A0A0LK632.0e-4589.09Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G324430 PE=4 SV=1[more]
A0A6J1BWZ91.4e-4387.27uncharacterized protein LOC111006447 OS=Momordica charantia OX=3673 GN=LOC111006... [more]
A0A6J1JTG31.2e-4286.36uncharacterized protein LOC111487341 OS=Cucurbita maxima OX=3661 GN=LOC111487341... [more]
A0A6J1H0B41.2e-4286.36uncharacterized protein LOC111458848 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1GZA21.2e-4286.36uncharacterized protein LOC111458848 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT3G54880.11.1e-2151.89unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G03440.11.0e-1747.83unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G03440.21.0e-1747.83unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G25360.16.1e-1537.17unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G25360.26.1e-1537.17unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025124Domain of unknown function DUF4050PFAMPF13259DUF4050coord: 70..110
e-value: 5.1E-11
score: 43.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..36
NoneNo IPR availablePANTHERPTHR33373OS07G0479600 PROTEINcoord: 1..109
NoneNo IPR availablePANTHERPTHR33373:SF28OS07G0479600 PROTEINcoord: 1..109

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009074.1Tan0009074.1mRNA