Tan0015169 (gene) Snake gourd v1

Overview
NameTan0015169
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHistone-lysine N-methyltransferase EZA1 isoform X3
LocationLG04: 81170503 .. 81172180 (-)
RNA-Seq ExpressionTan0015169
SyntenyTan0015169
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTCTAAAACCTTCATTTTTGTTTCATCTCTTCGCCATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCTTTCCTCCTCCTCGCTCGCCTCTCCTCCGCCCTATATCTCGCCGCCCCGTCACCGGTACCGTCCTTGTTGCTTTCTCTCATTCTCTATGTAAACTCTCCTCTTCTTTACCTTCTTGTTGCTTTCGTTATCGTCTCCACCCTCCTTCATTCCCTCACCGGAAAATCCGCTCTCCTGACCAAGCTCTCCGGTCCGGTCTCCCAACCGCGTCTCTACATTGCTTGGATCTTCCTTTGTACACTTCAGGTATGTTAAAATAAACACTTAACCTAAAAAAGTAAGTTGATACAAGTTAATAGGATTTGAATTTGAATTTCTCTAATCTTTAGGAGAGGTTTATAGTGTCTTGAACAATTGAATTTAATCAGGTTACAATGTGAACTTTGAAACTTGAATGTTGGTGGGTTTGTTACTAGCTATAAAGTGTTATTATAGTTTGATGACATGATTTTGATGAGTTGCTTTTATATGATTTGTAAATTAATTAGTTTCCGGAGATTTAAATTATTTTGGTTACTCCAACAAATAACCTAGTTTGAATTTTAAGTTTCGAAATTGTTGGAAAAGAATAAGTATTACATTAATCGATTCAAGCAAAACTATGCTTGAGTAGGATTTTAAATTCTGTTTTTATGAGGTATTTAATTTTGTAGCACTTACAAAAATGGAAAGTTTAAACTCGTTTGCTATTTGATTTTTTTGTTTTTGGTTTCTGAAAATTAAACCTATATATAAATACTACTGTCACTTATAAATTTATTTGCTTTACTATCTACTTTCTACCTATGTTTTAAAAAACCAAACTGAGTTTTGAAAACTTAAAAAAATAGTTTTCAAAACCTTATTTTTACTTTGGTTAAGAATTTAAATGTTTACTTAAGAAAAATGGAATTTATTATAGAGAAATTGAGAAAATCAACTTAATTTTTAAAAACAAAAAACCAAAGTCCAAATGGTTATCAAACGGGATGTTAAAATTGATATGGGTTTGCGAAAAAATTAAACTTTTATTAATATTTCTTACCCTCAATATAAAAAAAATATTGTAGGGCTTAATTTAAAAGGATGATTCTGCTTTGACATCATGTTAAATTATAACCTAATGGTAATTTGTTTATATTGTTAACAATAGGTATGTGTCGGTGTCGGGATCGAGGGAAGCCTATCAACCGGTCTCAACGACGTGGCGGCCGGCCGCGTCGAGGGCAGGCTGTGGGGCAGGCTGTTGTTCTTCTTGGGACTTCATGAGGCAGTGGTGCACTGGACGACGACAGTGGTGAAGCCGGTGGTGGACGACACCGTAATTGGGGAATCTCGAAAAGAGAGGTGGTTTGAAACGGCGGCGACGGCGGTGAGCTTCGGCGGCCTGTGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGCAGTTGTGGCGGAAAGAAAGTGGTTGACGGCGGCGGAATTGGGTCCGGCGGACTTTTCCGGTTGGTGCTTGTATTATATCACCGTCGCCATTGGAATCGCTAAGATTGTTAAATCCGTTGCTTGGTTTGGTCGGATTTTTGTCTCCAAAAAACAATCTAAAAGCTCCGACGAGGTCGTGGTTGTTCAGGACAATGTTTGA

mRNA sequence

ATGGAAGTTCTAAAACCTTCATTTTTGTTTCATCTCTTCGCCATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCTTTCCTCCTCCTCGCTCGCCTCTCCTCCGCCCTATATCTCGCCGCCCCGTCACCGGTACCGTCCTTGTTGCTTTCTCTCATTCTCTATGTAAACTCTCCTCTTCTTTACCTTCTTGTTGCTTTCGTTATCGTCTCCACCCTCCTTCATTCCCTCACCGGAAAATCCGCTCTCCTGACCAAGCTCTCCGGTCCGGTCTCCCAACCGCGTCTCTACATTGCTTGGATCTTCCTTTGTACACTTCAGGTATGTGTCGGTGTCGGGATCGAGGGAAGCCTATCAACCGGTCTCAACGACGTGGCGGCCGGCCGCGTCGAGGGCAGGCTGTGGGGCAGGCTGTTGTTCTTCTTGGGACTTCATGAGGCAGTGGTGCACTGGACGACGACAGTGGTGAAGCCGGTGGTGGACGACACCGTAATTGGGGAATCTCGAAAAGAGAGGTGGTTTGAAACGGCGGCGACGGCGGTGAGCTTCGGCGGCCTGTGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGCAGTTGTGGCGGAAAGAAAGTGGTTGACGGCGGCGGAATTGGGTCCGGCGGACTTTTCCGGTTGGTGCTTGTATTATATCACCGTCGCCATTGGAATCGCTAAGATTGTTAAATCCGTTGCTTGGTTTGGTCGGATTTTTGTCTCCAAAAAACAATCTAAAAGCTCCGACGAGGTCGTGGTTGTTCAGGACAATGTTTGA

Coding sequence (CDS)

ATGGAAGTTCTAAAACCTTCATTTTTGTTTCATCTCTTCGCCATCAACCTTCTTGGCCTCTTGCTTCCCCTCTCTTTCCTCCTCCTCGCTCGCCTCTCCTCCGCCCTATATCTCGCCGCCCCGTCACCGGTACCGTCCTTGTTGCTTTCTCTCATTCTCTATGTAAACTCTCCTCTTCTTTACCTTCTTGTTGCTTTCGTTATCGTCTCCACCCTCCTTCATTCCCTCACCGGAAAATCCGCTCTCCTGACCAAGCTCTCCGGTCCGGTCTCCCAACCGCGTCTCTACATTGCTTGGATCTTCCTTTGTACACTTCAGGTATGTGTCGGTGTCGGGATCGAGGGAAGCCTATCAACCGGTCTCAACGACGTGGCGGCCGGCCGCGTCGAGGGCAGGCTGTGGGGCAGGCTGTTGTTCTTCTTGGGACTTCATGAGGCAGTGGTGCACTGGACGACGACAGTGGTGAAGCCGGTGGTGGACGACACCGTAATTGGGGAATCTCGAAAAGAGAGGTGGTTTGAAACGGCGGCGACGGCGGTGAGCTTCGGCGGCCTGTGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGCAGTTGTGGCGGAAAGAAAGTGGTTGACGGCGGCGGAATTGGGTCCGGCGGACTTTTCCGGTTGGTGCTTGTATTATATCACCGTCGCCATTGGAATCGCTAAGATTGTTAAATCCGTTGCTTGGTTTGGTCGGATTTTTGTCTCCAAAAAACAATCTAAAAGCTCCGACGAGGTCGTGGTTGTTCAGGACAATGTTTGA

Protein sequence

MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWFGRIFVSKKQSKSSDEVVVVQDNV
Homology
BLAST of Tan0015169 vs. NCBI nr
Match: XP_038904328.1 (uncharacterized protein LOC120090682 [Benincasa hispida])

HSP 1 Score: 407.9 bits (1047), Expect = 6.6e-110
Identity = 217/259 (83.78%), Postives = 226/259 (87.26%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPS-LLLSLILYVNSPL 60
           ME+L PSFLFHLFAINLLGLLLPLSFLLLARLSS LYL    P+ S LLLSLILYVNSPL
Sbjct: 1   MEILSPSFLFHLFAINLLGLLLPLSFLLLARLSSVLYLIGLLPLSSPLLLSLILYVNSPL 60

Query: 61  LYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLST 120
           L+LLV+FVIVSTL HSLTGKSAL TKL GPVSQPRLY AWIFLCTLQVCVGVGIEGSLS+
Sbjct: 61  LFLLVSFVIVSTLFHSLTGKSALPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATA 180
           GLN  AAG +EG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKE+WFETAATA
Sbjct: 121 GLNHAAAGHIEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKEKWFETAATA 180

Query: 181 VSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAW 240
           VS GGLWWWRLRDEAEAL VVAE KWLT+AELGPAD SGWCLYYITVAIGIAKIV+ VAW
Sbjct: 181 VSLGGLWWWRLRDEAEALVVVAESKWLTSAELGPADISGWCLYYITVAIGIAKIVRFVAW 240

Query: 241 FGRIFVSKKQSKSSDEVVV 259
           FG IFVS K SK   EV V
Sbjct: 241 FGGIFVSTKHSKKPHEVGV 259

BLAST of Tan0015169 vs. NCBI nr
Match: XP_008454283.1 (PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo])

HSP 1 Score: 390.2 bits (1001), Expect = 1.4e-104
Identity = 212/266 (79.70%), Postives = 229/266 (86.09%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPL 60
           ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLST 120
           L+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AWIFLCTLQVCVGVGIEGSLS+
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAAT 180
           GLND+ + G VEG +W RLLFF GLHEAVVHWT  VVKPVVDDT+ GESRKE+WFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SV 240
           AVS GGLWWWRLRDEAE L VVAE KWLT+ ELG AD SGWCLYYITV IGIAKIVK  +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGRIFVSKKQSKSSDEVVVVQDNV 264
            WFG IFVS+K SK+S+ +V V+DNV
Sbjct: 241 GWFGGIFVSRKHSKTSN-LVGVEDNV 265

BLAST of Tan0015169 vs. NCBI nr
Match: XP_022983396.1 (uncharacterized protein LOC111482002 [Cucurbita maxima])

HSP 1 Score: 389.4 bits (999), Expect = 2.4e-104
Identity = 206/261 (78.93%), Postives = 220/261 (84.29%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME+L+P FLFHL A+NLL LLLPLS LLLARLSSALYLA    +P LLLSLILYV SPLL
Sbjct: 1   MEILRPWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAGLPLLPPLLLSLILYVTSPLL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
            LLV+FV+VS LLHSLTGKSAL TKL  P+SQPRLY  WIFLCTLQVCVGVGIEGSLS+G
Sbjct: 61  ILLVSFVVVSALLHSLTGKSALPTKLPAPLSQPRLYTTWIFLCTLQVCVGVGIEGSLSSG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           LN  AAG VEG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAV
Sbjct: 121 LNSAAAGHVEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATAV 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF 240
           S GG+WWWRLRDEA+AL VVAE KWLT+ ELGPA+ + WCLYYI VAIGIAKIV SVAW 
Sbjct: 181 SLGGVWWWRLRDEADALVVVAEIKWLTSTELGPAEVANWCLYYIIVAIGIAKIVNSVAWL 240

Query: 241 GRIFVSKKQSKSSDEVVVVQD 262
            RI V KK SK SDEVVVV +
Sbjct: 241 VRILVPKKHSKCSDEVVVVNN 261

BLAST of Tan0015169 vs. NCBI nr
Match: XP_022934389.1 (uncharacterized protein LOC111441578 [Cucurbita moschata])

HSP 1 Score: 387.5 bits (994), Expect = 9.2e-104
Identity = 203/261 (77.78%), Postives = 221/261 (84.67%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SPLL
Sbjct: 1   MEILRAWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAGPPLLPPLFLSLILYLTSPLL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
            LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WIFLCTLQVCVGVGIEGSLS+G
Sbjct: 61  ILLVSFVVLSALLHSLTGKSALPTKLPAPVSQPRLYTTWIFLCTLQVCVGVGIEGSLSSG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           LN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAV
Sbjct: 121 LNNPAAGHIEGGLWRRLLFFVGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATAV 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF 240
           S GG+WWWRLRDEA+AL VVAE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW 
Sbjct: 181 SLGGVWWWRLRDEADALVVVAEIKWLTSAELGPAEVANWCLYYIIVGIGIGKIVNSVAWL 240

Query: 241 GRIFVSKKQSKSSDEVVVVQD 262
            RI VSKK SK SDEVVVV +
Sbjct: 241 VRILVSKKHSKCSDEVVVVNN 261

BLAST of Tan0015169 vs. NCBI nr
Match: KAG6581302.1 (hypothetical protein SDJN03_21304, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018022.1 hypothetical protein SDJN02_19888, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 386.7 bits (992), Expect = 1.6e-103
Identity = 202/261 (77.39%), Postives = 221/261 (84.67%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SP+L
Sbjct: 1   MEILRAWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAGPPLLPPLFLSLILYLTSPIL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
            LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WIFLCTLQVCVGVGIEGSLS+G
Sbjct: 61  ILLVSFVVLSALLHSLTGKSALPTKLPAPVSQPRLYTTWIFLCTLQVCVGVGIEGSLSSG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           LN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAV
Sbjct: 121 LNNPAAGHIEGGLWRRLLFFVGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATAV 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF 240
           S GG+WWWRLRDEA+AL VVAE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW 
Sbjct: 181 SLGGVWWWRLRDEADALVVVAEIKWLTSAELGPAEVANWCLYYIIVGIGIGKIVNSVAWL 240

Query: 241 GRIFVSKKQSKSSDEVVVVQD 262
            RI VSKK SK SDEVVVV +
Sbjct: 241 VRILVSKKHSKCSDEVVVVNN 261

BLAST of Tan0015169 vs. ExPASy TrEMBL
Match: A0A1S3BZH0 (uncharacterized protein LOC103494729 OS=Cucumis melo OX=3656 GN=LOC103494729 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 6.9e-105
Identity = 212/266 (79.70%), Postives = 229/266 (86.09%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPL 60
           ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLST 120
           L+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AWIFLCTLQVCVGVGIEGSLS+
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAAT 180
           GLND+ + G VEG +W RLLFF GLHEAVVHWT  VVKPVVDDT+ GESRKE+WFETAAT
Sbjct: 121 GLNDLTSTGHVEGGMWRRLLFFFGLHEAVVHWTRVVVKPVVDDTIYGESRKEKWFETAAT 180

Query: 181 AVSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SV 240
           AVS GGLWWWRLRDEAE L VVAE KWLT+ ELG AD SGWCLYYITV IGIAKIVK  +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSTELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGRIFVSKKQSKSSDEVVVVQDNV 264
            WFG IFVS+K SK+S+ +V V+DNV
Sbjct: 241 GWFGGIFVSRKHSKTSN-LVGVEDNV 265

BLAST of Tan0015169 vs. ExPASy TrEMBL
Match: A0A6J1J235 (uncharacterized protein LOC111482002 OS=Cucurbita maxima OX=3661 GN=LOC111482002 PE=4 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 1.2e-104
Identity = 206/261 (78.93%), Postives = 220/261 (84.29%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME+L+P FLFHL A+NLL LLLPLS LLLARLSSALYLA    +P LLLSLILYV SPLL
Sbjct: 1   MEILRPWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAGLPLLPPLLLSLILYVTSPLL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
            LLV+FV+VS LLHSLTGKSAL TKL  P+SQPRLY  WIFLCTLQVCVGVGIEGSLS+G
Sbjct: 61  ILLVSFVVVSALLHSLTGKSALPTKLPAPLSQPRLYTTWIFLCTLQVCVGVGIEGSLSSG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           LN  AAG VEG LW RLLFF GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAV
Sbjct: 121 LNSAAAGHVEGGLWRRLLFFFGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATAV 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF 240
           S GG+WWWRLRDEA+AL VVAE KWLT+ ELGPA+ + WCLYYI VAIGIAKIV SVAW 
Sbjct: 181 SLGGVWWWRLRDEADALVVVAEIKWLTSTELGPAEVANWCLYYIIVAIGIAKIVNSVAWL 240

Query: 241 GRIFVSKKQSKSSDEVVVVQD 262
            RI V KK SK SDEVVVV +
Sbjct: 241 VRILVPKKHSKCSDEVVVVNN 261

BLAST of Tan0015169 vs. ExPASy TrEMBL
Match: A0A6J1F2F5 (uncharacterized protein LOC111441578 OS=Cucurbita moschata OX=3662 GN=LOC111441578 PE=4 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 4.5e-104
Identity = 203/261 (77.78%), Postives = 221/261 (84.67%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME+L+  FLFHL A+NLL LLLPLS LLLARLSSALYLA P  +P L LSLILY+ SPLL
Sbjct: 1   MEILRAWFLFHLLAVNLLALLLPLSLLLLARLSSALYLAGPPLLPPLFLSLILYLTSPLL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
            LLV+FV++S LLHSLTGKSAL TKL  PVSQPRLY  WIFLCTLQVCVGVGIEGSLS+G
Sbjct: 61  ILLVSFVVLSALLHSLTGKSALPTKLPAPVSQPRLYTTWIFLCTLQVCVGVGIEGSLSSG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           LN+ AAG +EG LW RLLFF+GLHEAVVHWT  VVKPVVDDTV GESRKERWFETAATAV
Sbjct: 121 LNNPAAGHIEGGLWRRLLFFVGLHEAVVHWTRAVVKPVVDDTVFGESRKERWFETAATAV 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF 240
           S GG+WWWRLRDEA+AL VVAE KWLT+AELGPA+ + WCLYYI V IGI KIV SVAW 
Sbjct: 181 SLGGVWWWRLRDEADALVVVAEIKWLTSAELGPAEVANWCLYYIIVGIGIGKIVNSVAWL 240

Query: 241 GRIFVSKKQSKSSDEVVVVQD 262
            RI VSKK SK SDEVVVV +
Sbjct: 241 VRILVSKKHSKCSDEVVVVNN 261

BLAST of Tan0015169 vs. ExPASy TrEMBL
Match: A0A0A0KWP5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003705 PE=4 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 1.3e-103
Identity = 212/266 (79.70%), Postives = 226/266 (84.96%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSP-VPSLLLSLILYVNSPL 60
           ME+L  SFLFHLFAINLLGLLLPLS LLLARLSSALYL A  P  PS LLSLILYVNSPL
Sbjct: 1   MEILNTSFLFHLFAINLLGLLLPLSLLLLARLSSALYLLALLPWPPSFLLSLILYVNSPL 60

Query: 61  LYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLST 120
           L+LLV+FVI+STLLHSLTGKS L TKL GPVSQPRLY AWIFLCTLQVCVGVGIEGSLS+
Sbjct: 61  LFLLVSFVILSTLLHSLTGKSTLPTKLPGPVSQPRLYTAWIFLCTLQVCVGVGIEGSLSS 120

Query: 121 GLNDV-AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAAT 180
           GLND+ + G VEG LW RLLFFLGLHEAVVHWT  VVKPVVDDT+ GE R E+WFETAAT
Sbjct: 121 GLNDLTSTGHVEGGLWRRLLFFLGLHEAVVHWTRAVVKPVVDDTIYGEPRTEKWFETAAT 180

Query: 181 AVSFGGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVK-SV 240
           AVS GGLWWWRLRDEAE L VVAE KWLT+AELG AD SGWCLYYITV IGIAKIVK  +
Sbjct: 181 AVSLGGLWWWRLRDEAEVLVVVAESKWLTSAELGWADISGWCLYYITVVIGIAKIVKYCI 240

Query: 241 AWFGRIFVSKKQSKSSDEVVVVQDNV 264
            WFG  FVSK  SK+S  +V V+DNV
Sbjct: 241 GWFGGSFVSKTHSKTS-HLVGVEDNV 265

BLAST of Tan0015169 vs. ExPASy TrEMBL
Match: A0A6J1GAY8 (uncharacterized protein LOC111452502 OS=Cucurbita moschata OX=3662 GN=LOC111452502 PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 3.0e-100
Identity = 203/265 (76.60%), Postives = 218/265 (82.26%), Query Frame = 0

Query: 1   MEVLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLL 60
           ME  KPSFLFHLFA+NLLGLLLPLSFLLL RLSSALYL AP   P L LSLILY+NSPLL
Sbjct: 1   MEAPKPSFLFHLFAVNLLGLLLPLSFLLLLRLSSALYLPAP---PPLFLSLILYLNSPLL 60

Query: 61  YLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTG 120
           +LLV FVIVS LLHSLTGKS L T   G VSQPRLY AWI LCT QVCVGVGIEGSLS G
Sbjct: 61  FLLVTFVIVSALLHSLTGKSILHTNFPGHVSQPRLYAAWILLCTFQVCVGVGIEGSLSIG 120

Query: 121 LNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAV 180
           L D   GRVEG LW R+LFFLGLHE+VVHWT TVVKPVVDDTV GESRKERWFET AT +
Sbjct: 121 LTDAIVGRVEGGLWSRILFFLGLHESVVHWTRTVVKPVVDDTVFGESRKERWFETTATTL 180

Query: 181 SFGGLWWWRLRDEAEALAVVAERKWLTAA-ELGPADFSGWCLYYITVAIGIAKIVKSVAW 240
           SF GLWWWRLRDEAEAL VVAERKWL AA ELGP D  GWCLYY+ VA+GIAK+VKSVA 
Sbjct: 181 SFSGLWWWRLRDEAEALVVVAERKWLMAAEELGPVDILGWCLYYVNVAVGIAKVVKSVAQ 240

Query: 241 F-GRIFVSKKQSKSSDEVVVVQDNV 264
           F  ++  S+KQSKS +  VV++DNV
Sbjct: 241 FLDKVLFSRKQSKSYE--VVLEDNV 260

BLAST of Tan0015169 vs. TAIR 10
Match: AT1G02570.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02575.1); Has 108 Blast hits to 55 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 166.0 bits (419), Expect = 4.1e-41
Identity = 110/260 (42.31%), Postives = 153/260 (58.85%), Query Frame = 0

Query: 10  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVP-SLLLSLILYVNSPLLYLLVAFVI 69
           F + +I+LL LL+PLSFL L+RLS +   ++ +PV  S + SL+   +  +LY +++ +I
Sbjct: 20  FQMISISLLSLLVPLSFLFLSRLSVS---SSSAPVTVSGVFSLLHQADVGILYTILSLII 79

Query: 70  VSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTGLN---DVA 129
           VSTL+H L+GK          V    LYI WI L  +Q CV  GIEG++ST ++   D +
Sbjct: 80  VSTLIHILSGKPEC------SVLHSHLYICWIVLFIVQACVAFGIEGTMSTTISIDTDKS 139

Query: 130 AGRVEGRLW--GRLLFFLGLHEAVVHWTTTVVKPVVDDTVIG-ESRKERWFETAATAVSF 189
                   W   R++FFLGLHE ++ W   VVKPV+DDTV G    +ERW E A  AV+F
Sbjct: 140 FSLAAQERWVLVRVMFFLGLHEVMLMWFRVVVKPVIDDTVFGVYVEEERWSERAVVAVTF 199

Query: 190 GGLWWWRLRDEAEALAVVAERKWLTAAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-G 249
           G +WWWRLRDE E+L VVAE K      L   DF  W +YYI V IG+ KI K   +F  
Sbjct: 200 GLMWWWRLRDEVESLVVVAEVKRNLQIRLEGLDFVNWWMYYICVGIGLVKIFKGFLYFVN 259

Query: 250 RIFVSKKQSKSSDEVVVVQD 262
            + ++  +S+   E  +V D
Sbjct: 260 MLILTINRSRKCCESCLVDD 270

BLAST of Tan0015169 vs. TAIR 10
Match: AT2G47360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02570.1); Has 58 Blast hits to 55 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 58; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 152.5 bits (384), Expect = 4.6e-37
Identity = 108/271 (39.85%), Postives = 149/271 (54.98%), Query Frame = 0

Query: 3   VLKPSFLFHLFAINLLGLLLPLSFLLLARLSSALYL-----AAPSPVPS-LLLSLILYVN 62
           V+KP   F L    LL LLLPLSFLLL+RLSSA +L     + P    S  + SL L  N
Sbjct: 16  VVKP---FRLVTTTLLSLLLPLSFLLLSRLSSASFLFSLTKSQPQTESSFFVFSLFLRAN 75

Query: 63  SPLLYLLVAFVIVSTLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGS 122
             ++Y +V+ + V TL+  LT K             P + IAW+ L  +Q+ VG+G+E +
Sbjct: 76  PAIVYAVVSSISVYTLVLGLTTKITATDPKHSIAFYPHVSIAWLTLFLVQISVGIGLETT 135

Query: 123 LSTGLNDVAAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVI----GESRKERW 182
           +S GL     G  E     RL+FF GLHE ++ W   +V+PVVD+T++    G+ R+E  
Sbjct: 136 ISNGL---IIGS-ERNFLSRLVFFFGLHEVMLLWYRVIVRPVVDNTLLGGEDGQRREETV 195

Query: 183 FETAATAVSFGGLWWWRLRDEAEALAVVAERKWL------------TAAELGPADFSGWC 242
            E  A AVS G LWWW+LRDE EAL  VAE K               + ++G  DF  W 
Sbjct: 196 VERVALAVSCGTLWWWKLRDEVEALVGVAEAKRALLLLLPIDGNVNVSFDVGTVDFVNWW 255

Query: 243 LYYITVAIGIAKIVKSVAWFGRIFVSKKQSK 252
           LYY+ V IG+ +I+K   WFG I + ++ S+
Sbjct: 256 LYYMVVTIGMVRIIKGSLWFGMILLFEQGSR 279

BLAST of Tan0015169 vs. TAIR 10
Match: AT1G02575.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02570.1). )

HSP 1 Score: 149.4 bits (376), Expect = 3.9e-36
Identity = 106/259 (40.93%), Postives = 145/259 (55.98%), Query Frame = 0

Query: 10  FHLFAINLLGLLLPLSFLLLARLSSALYLAAPSPVPSLLLSLILYVNSPLLYLLVAFVIV 69
           F + +I+ L LLLPLSFL L+RLS  LY ++     S + S+I   +  +LY ++  +IV
Sbjct: 20  FQMISISFLSLLLPLSFLFLSRLS--LYTSSTPVTVSGVSSVIHQADVGVLYTILFLIIV 79

Query: 70  STLLHSLTGKSALLTKLSGPVSQPRLYIAWIFLCTLQVCVGVGIEGSLSTGLN-----DV 129
            TL+HSL+GK          V    LYI WI L   Q C   GI+ ++ST ++     ++
Sbjct: 80  FTLIHSLSGKPEC------SVLHSHLYICWIVLFIAQAC-AFGIKRTMSTTMSINPDKNL 139

Query: 130 AAGRVEGRLWGRLLFFLGLHEAVVHWTTTVVKPVVDDTVIGESRKERWFETAATAVSFGG 189
                E  +  R+LFFLGLHE ++ W   VVKPVVD+T+ G   +ERW E A  AV+FG 
Sbjct: 140 FLATHERWMLVRVLFFLGLHEVMLMWFRVVVKPVVDNTIYGVYVEERWSERAVVAVTFGI 199

Query: 190 LWWWRLRDEAEALAVVAERKWLT-AAELGPADFSGWCLYYITVAIGIAKIVKSVAWF-GR 249
           +WWWRLRDE E+L VV     L     L   +F  WC+YYI V IG+ KI K    F   
Sbjct: 200 MWWWRLRDEVESLVVVVTADRLNLPIRLEGLNFVNWCMYYICVGIGLMKIFKGFLDFVNT 259

Query: 250 IFVSKKQSKSSDEVVVVQD 262
           + +S K+S+   E  V  D
Sbjct: 260 LTLSIKRSRKGCESCVFDD 269

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038904328.16.6e-11083.78uncharacterized protein LOC120090682 [Benincasa hispida][more]
XP_008454283.11.4e-10479.70PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo][more]
XP_022983396.12.4e-10478.93uncharacterized protein LOC111482002 [Cucurbita maxima][more]
XP_022934389.19.2e-10477.78uncharacterized protein LOC111441578 [Cucurbita moschata][more]
KAG6581302.11.6e-10377.39hypothetical protein SDJN03_21304, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
A0A1S3BZH06.9e-10579.70uncharacterized protein LOC103494729 OS=Cucumis melo OX=3656 GN=LOC103494729 PE=... [more]
A0A6J1J2351.2e-10478.93uncharacterized protein LOC111482002 OS=Cucurbita maxima OX=3661 GN=LOC111482002... [more]
A0A6J1F2F54.5e-10477.78uncharacterized protein LOC111441578 OS=Cucurbita moschata OX=3662 GN=LOC1114415... [more]
A0A0A0KWP51.3e-10379.70Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G003705 PE=4 SV=1[more]
A0A6J1GAY83.0e-10076.60uncharacterized protein LOC111452502 OS=Cucurbita moschata OX=3662 GN=LOC1114525... [more]
Match NameE-valueIdentityDescription
AT1G02570.14.1e-4142.31unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G47360.14.6e-3739.85unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G02575.13.9e-3640.93unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR37172TRANSMEMBRANE PROTEINcoord: 7..263

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0015169.1Tan0015169.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032259 methylation
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0031519 PcG protein complex
molecular_function GO:0008168 methyltransferase activity