Tan0017559 (gene) Snake gourd v1

Overview
NameTan0017559
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTyrosine sulfotransferase-like protein
LocationLG01: 990081 .. 995466 (+)
RNA-Seq ExpressionTan0017559
SyntenyTan0017559
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCGGATTTTCCCGTAACGCCCTGAGTCCTGAAGATTGAAGAATCGAAGATCCGCTCGTTGCCCGTCGCTCGTCTGCTCGTAGCTGCGCCGATTGTTGCCCGTCTGCTCGTCGGTCGTCTCTCATCTGCTCTTCGGTCGCCGCCCATCAGCTCGCCGAGCAGCGCTCGTTTCCTTCGGACGTTCGCTCGCCTCTTCCAGTGTTTTCACTCGTCTTCTTCCGGCGTTTTCGCTCGTCTCGTTCCGACGTGTGCTCCTTCATCTCCGTCCACACGATTCACCTCATTTGCTCGACTATCCTCTGCCCAACAACTGTAAGTTAAGAAAGTATTTGTTTTTTAGGCTTCCACTATGCCTTAGATTTCGTTCACTTATCTCTCAACATCTCCCCTTTCTTTCTCTACTTCCATTTGTTTTCAGTTTTGTTATTTTTAGATATGGGAGTAACAACTGTAAGTTCAGATGTGTGTTTATTTTCAGTTTCTCTACTTCCATTTATTTTCAGTTTGTTCTCTGGGATTGATTTCGTGATTTCTTATCACAGAGTGCACTGGCAGAGGAAAATGATTTTAGGAAGACAATTTTGTTCTCATTTCATTTTCACACTCAGTAATACAAGAGGGATACCCTTTCTTATAGAGAACCCGAAAGAAAGAGGGAGAGGAGATCAGACACAACTCCTTGTATACCAATTCTTTTTTCTTTGCAAACTGTAGCATTGGTTTGAACTCTCCACCACCATCAAACCTTATAGTTTTAATGGATGTCCCTTGCTGATTTTCCACAAGATTCTTGAATTGTTGGAACAAATTAGCTTTACTCGTCTGTTTTAATGGGTACAGCCAAGTATATCTGCTATGGTCATCAGGGAAAGCTATGTAGTATGTAGAAGTTGTGGCCATTTATGGATCGCACTGGGGCTGGTCCCCATAAGTCCATGTGAGTAAGATCTAGAGGTTGGAAGCTTATGAACTTTGCCAAACTGTCATGCCTCACAAAACGTAAGCTTATCGTTCATCTTAGTAGAAGCATTACAATCATCCAAGATTTGACTCAAATTTTTTTGTAGAAGGATGGCCTAGTCTTTGATGCCATATATTTTTACTCCAAGACACAAAAAGCATAATCAACTAAGAGGTCAAATCTTTCAATCCTTTGGTCCATAGAGCTTTCTCAGCAGGTGTTTCTACTTTGACAGCTCTTTTCATAGCTTCTCTATTCTCGGAAAGTCTTGATGTGGCTTTAGCACTAGCATCTTCTAGTTGGTAGAGTCCATTGTCAAGCTTCCCTATTAGTCACACTTTCCTGGACTTCTTTTCCTTGATAACACAAAACAGATCATGAAATTCAGCAACCACTTCATTGTCTTTTGTCAACTTAGATATCAAATCAGATTTTTTTTTTATGAGTGGAACATAAAGAACATCCTTTAAAACAACGTTTAGTCCTTTTGTAGTTAAGTAGGCAGTACCAATACTAGTTATAGGCAATTTAGTTCCATCTCCAACCATAATAGATTTCGTACTTGAGTTCTTACCCTTTAGACAAAGTTTTTCAAGCTCAGAGGTGACATGTTTACTTGCTCCACTATCAGGTTAAATCTTGTAAAATTTCAGGGTAGGCGATAAGAGCAGCTAGAGTGGTGGTAGAAGTGGTGTTCTAGTGAGTAATTATATTTTCTGTGGACTGTTGCTTATTCTTTTCATATCGACGATGGCAGATTGAGGCTATGTGCCCAACTTTTCCACATATTTGACACGTGGGCCTGCTGTTCTGAGAATAAGAGGAATATCTCTCTCCTCCCCTGAGCTTACCACCTCTGCTTGAAGAAGAATAGCCTCCTCGATTGTTATGATAATTGTTGTTTGGTTGAGACTTGAGTCTTAGGTTTGTCATAGTTTACGAAAGCTTGGTTGATGGCCACACTTCCTTTCAACTTTTGTAGCTTGTCATGACGATTTTCAAAGGACAGTAGTTCACATTGAATCTTTTCCCAGGTCATAACTTGATTTCTGATTACACACACAATTGATGTGTACTCTTCATCTAAACCAGCTAGACCATAAATCCATAGGAAAACCAGAAAAATTCAGATTATCATAGTACCTTTTTATAACTGTCAAACAATCTTCCATTTTCATACTCCCTTTTCGTGTTTGTTGAAGCATCTGTCTGTTGTGATCGTGATTGCATTCCCTAGTATTCTTATATTGCCTCTCAGAGGGAGACTGAGCATTGTCATGTCCCATGACCTGAGCGACCATTTAAGGTGCCATGGAATTATAAAGCTAACCAACAGGCAACTGGTCTGCGGTGTCCCAAATACTGCACTCAGGATTTGGAAGCTTCAGTCCCAATGGTTCATCTTCAGAAGGCGGTAGGATTATGGAGTGTTTTGGACATGGTGTCTTGCCTGTTAGATGTCCATCCATTTTGTAGCTTTTTAGTATTGACAAGACAATGCTTTGCAACAGTTGATAGTTTCCTCGATCAAGCTTAATGGAAGTCACTTGATTCAAAATATTAGTGGAGGGACTTCCCACTAGGCAATCTATGGATCTTACCACATATTTTTCTTGGTTTGGTGATGCTTCAATATACGCCTTCGGGAGACTTGGTGAAGGAGCTACACGGGTTCTCTTTGAGCCTTCTTCAATAGAAATTCAGTTTCTCCTTTTGAAGGAGAAGGATCAACCATTATGCTATGATACCAGGATTGATTTTGTGATTTCTTATCACAGAGTGCTCAGGTAGAGCAAAATGATTTTGGGAAGACAATCCTTTTCTCCGAAAGAGAGAGAGGAGGCTAGACACAACTGTAGCGTTAGGTAGAATCTTAAGCTAATTGCATTACCTGTTAGACTATGTTGATATAATTAAATTTACCCCAAACTATCAGCTTAAGCTTTTAGTTTGATTGGTGATTTAAGATGGTATCAGAGTAGGTGGTCTAGGAGTTCCTGTGTTGAAACCCCTGCAAAGTCAGTTTTCTCCCCATTTAATATTGATTCCACTTGTTTGGCTTTTCATCAAATTTCCGAGCCCATAAGTGAGGGGGAGTGTTAGACTATGTTGATATAATTAAATTTACCCCAAACTATCAGGTTAAGCTTTTAGTTTGATTGTAACAACACAACACAACATGTACTGTGGCCTAATACACACATTGGCCAACACAACACGTACAACTTAAAACAAGAAAGCTCACACAGTGAGATCGTACAAAGAGGCAGAAAAGAGTAGAGGAAACGGTCAACTCTTGACACTCTCACATTCCAAAATTTATATTGATGACAATTTTGGGTATCAACTGTAACGTTGAGGTGTGTGTTTATAAAATTATAATTTTTTATTTTTTATTTTTTTTGCTGAATGTGATTGTGAATTGCGTAATGTGTGAATCTCCTTTTACCCGTTTTTGGTGATATGAAACTCCAAGGGACAGCTATTTGAAAACATACTTGCTTTCCGGTAATCTAGAATCTTCCTTTTCTTTTTGATTCCCAAAACCTTCTATCCTCTTGATCTTTTGTTACCGCATACTGTTCCCTCTTCCTCTCTCTGTCAATCCGTTTTTTCTCTCTTTTTTTCTGTGTTTTTAGAGAGAGATTCTTTCATTTTTTTCCCCTTCTGTTCTATTGCCTTCATAGCCAAAGGACTTTGATGCCTGAGTTTTTATTTATTTTTTTATTATTATTATTTTTTAAGTTTTTATGGGATCTTTCGTTTTTCAAAGGGGTTTTTTAGGAATGAAATATAGTGTTGGATTTAGCGATGAATTGTGCGTTTGTTTTTGTCGTGTTGTGTTTTTTGGAGTGAAGAAAAAAGTAGTAGAATTTAGCTCTCAAACATACACTCATTAGAAGACCTTTGGTTTCCCAAGCAGTTAACCTTAACTACACGATTGATTCCAGTATTCTTCTTTGTTCTCTTAGGTTAGTTAGTTGAATCCAATAACTTTCTCTTCCATGCCCTTCCAGTTGGCTAGACAACTTACTAGACCAAAAGAAACTATCAACATTGCATTCGATTGTGTCTGAAATACACCGGATGACGGACATCATTGTGTATTATTTCATTAGATTGGTCCATTTATTTTGGCTTTCAGGATTGAGATTAAAAATGAAAACCCCAAAACATCTTCCCTTAACTTGTGGCCAAAAACCCACGATCAGAATCACCATCTTTTCAACTTGCATCCCCAGTTCATCTTTTGTCTATCTATAGCCGTAGATTATCTGCTTTTTTGTTCTTAAAAAATTAACTATCTGGACTCTGAAATTCTCAACTTCAGACTATATTAAGGACTTGTTTAACCCAAATCATCTAGTCTAAGTAGCGGTTTCCTTTGACTACCTATGTGTATCAGTGTGAAGTGACAAATTTAATTCTGATTCTGTCTTAAGGGTTTATAGTGCCCCTTCTAATAGCCCTATTGGGTAATTTTATTGTACATGGAAATAGTTAATGCTGTGTACGTAACAACTTTAAGCATGATAAAGTTATTTGTCTGAAATAACTATTAGGAATCTTGATGCCTACTCTTGGCCCAAATGTTGAAAAGATATAATCTGTTTTTCTTCAACTAATTTTTCTACATTGTAATTACTTTTGTTCTCTATACGTTTTTTCTTCTGCTGGATAAATTGGTTACTTTATTCTTCATATGGTCTCTTACAAGCAGAAACATTCCCAAGAGAAAATGGCAGGCAAGGCTGCCAAATCTGTGGCGATGGCCATTGGCGAGTATCGGTATCCATTGCAGGAGAAGTTGGCGAAATACAAGAATGAGCTATCCAAAGGTGTTTGGGGTTACTGGGAGCTAGGGGCATGGAAGCCACTGGGCATAAGTGCACGTCATCGTGCCAGACTTCGCAAGGAAGTACTTCTTGCTGGGCAGGATTGGCCATATGATCCGGAGAGGGGAGAGATGAGAACCAAGAGGAAAGGGCACAAATGTGATCGGATAGCGGCAGAGAAACGAGAAAACACTGCCAGGTTGATGGAGAAAATGCCAGAAATGTTGCTTGAATACAAGAAGCGCAGGTGGGAGAAGAAGATGAAGGAAGAAGAGAAGAATAAACAACAATGAACTGATATGGTTGGTTTCTTGTTAGCCAATTCCTCATCCATGGAGATATTGCTTTTGTTATCTAAAGTTGGATTAAATATCACTTTGTATTCGTTATCTTTGATTTCATATTTTTGGTTATCCTGAAATCCTTTTACATCATAGTGATATGAATCAAATTGATGTTTGTATTGAAATCAGGCAAAGATAACAAAATGAGGAAGAATTTAGAGAGCCCTCTCCTAGCTCCTATTTCATAGATTGAAACTTGGATCAGAAACATTCGTATCAGTTATCACTTATCACAAGTGTTCAGAG

mRNA sequence

TCGGATTTTCCCGTAACGCCCTGAGTCCTGAAGATTGAAGAATCGAAGATCCGCTCGTTGCCCGTCGCTCGTCTGCTCGTAGCTGCGCCGATTGTTGCCCGTCTGCTCGTCGGTCGTCTCTCATCTGCTCTTCGGTCGCCGCCCATCAGCTCGCCGAGCAGCGCTCGTTTCCTTCGGACGTTCGCTCGCCTCTTCCAGTGTTTTCACTCGTCTTCTTCCGGCGTTTTCGCTCGTCTCGTTCCGACGTGTGCTCCTTCATCTCCGTCCACACGATTCACCTCATTTGCTCGACTATCCTCTGCCCAACAACTAAACATTCCCAAGAGAAAATGGCAGGCAAGGCTGCCAAATCTGTGGCGATGGCCATTGGCGAGTATCGGTATCCATTGCAGGAGAAGTTGGCGAAATACAAGAATGAGCTATCCAAAGGTGTTTGGGGTTACTGGGAGCTAGGGGCATGGAAGCCACTGGGCATAAGTGCACGTCATCGTGCCAGACTTCGCAAGGAAGTACTTCTTGCTGGGCAGGATTGGCCATATGATCCGGAGAGGGGAGAGATGAGAACCAAGAGGAAAGGGCACAAATGTGATCGGATAGCGGCAGAGAAACGAGAAAACACTGCCAGGTTGATGGAGAAAATGCCAGAAATGTTGCTTGAATACAAGAAGCGCAGGTGGGAGAAGAAGATGAAGGAAGAAGAGAAGAATAAACAACAATGAACTGATATGGTTGGTTTCTTGTTAGCCAATTCCTCATCCATGGAGATATTGCTTTTGTTATCTAAAGTTGGATTAAATATCACTTTGTATTCGTTATCTTTGATTTCATATTTTTGGTTATCCTGAAATCCTTTTACATCATAGTGATATGAATCAAATTGATGTTTGTATTGAAATCAGGCAAAGATAACAAAATGAGGAAGAATTTAGAGAGCCCTCTCCTAGCTCCTATTTCATAGATTGAAACTTGGATCAGAAACATTCGTATCAGTTATCACTTATCACAAGTGTTCAGAG

Coding sequence (CDS)

ATGGCAGGCAAGGCTGCCAAATCTGTGGCGATGGCCATTGGCGAGTATCGGTATCCATTGCAGGAGAAGTTGGCGAAATACAAGAATGAGCTATCCAAAGGTGTTTGGGGTTACTGGGAGCTAGGGGCATGGAAGCCACTGGGCATAAGTGCACGTCATCGTGCCAGACTTCGCAAGGAAGTACTTCTTGCTGGGCAGGATTGGCCATATGATCCGGAGAGGGGAGAGATGAGAACCAAGAGGAAAGGGCACAAATGTGATCGGATAGCGGCAGAGAAACGAGAAAACACTGCCAGGTTGATGGAGAAAATGCCAGAAATGTTGCTTGAATACAAGAAGCGCAGGTGGGAGAAGAAGATGAAGGAAGAAGAGAAGAATAAACAACAATGA

Protein sequence

MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKEVLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKMKEEEKNKQQ
Homology
BLAST of Tan0017559 vs. NCBI nr
Match: XP_022157240.1 (uncharacterized protein LOC111023997 [Momordica charantia])

HSP 1 Score: 239.2 bits (609), Expect = 2.0e-59
Identity = 118/129 (91.47%), Postives = 125/129 (96.90%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAK++A AIGEY+YP QEKLAKYKNELSKG+WGYWELGAWKPLGISARHRARLRKE
Sbjct: 1   MAGKAAKAMAKAIGEYQYPWQEKLAKYKNELSKGIWGYWELGAWKPLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VL+AGQDWP+DPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLLEYKKRRWEKKM
Sbjct: 61  VLVAGQDWPFDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLEYKKRRWEKKM 120

Query: 121 KEEEKNKQQ 130
           KEEEK KQQ
Sbjct: 121 KEEEKKKQQ 129

BLAST of Tan0017559 vs. NCBI nr
Match: XP_038880142.1 (uncharacterized protein LOC120071824 [Benincasa hispida] >XP_038880151.1 uncharacterized protein LOC120071824 [Benincasa hispida])

HSP 1 Score: 238.4 bits (607), Expect = 3.4e-59
Identity = 120/129 (93.02%), Postives = 122/129 (94.57%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSV  AI EY+YP QEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE
Sbjct: 1   MAGKAAKSVVKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLAYKKRRWEKKM 120

Query: 121 KEEEKNKQQ 130
           KEEEK KQQ
Sbjct: 121 KEEEKKKQQ 129

BLAST of Tan0017559 vs. NCBI nr
Match: XP_022984631.1 (uncharacterized protein LOC111482859 [Cucurbita maxima] >XP_022984632.1 uncharacterized protein LOC111482859 [Cucurbita maxima])

HSP 1 Score: 237.3 bits (604), Expect = 7.5e-59
Identity = 119/129 (92.25%), Postives = 122/129 (94.57%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKA KSVA AIGEY+YP QEKL KYKNELSKGVWGYWELGAWKPLGISARHRARLRKE
Sbjct: 1   MAGKAVKSVAKAIGEYQYPWQEKLVKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDW YDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWAYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQQ 130
           KEEEK KQQ
Sbjct: 121 KEEEKKKQQ 129

BLAST of Tan0017559 vs. NCBI nr
Match: XP_004146224.1 (uncharacterized protein LOC101216182 [Cucumis sativus] >KGN57669.1 hypothetical protein Csa_010563 [Cucumis sativus])

HSP 1 Score: 236.9 bits (603), Expect = 9.8e-59
Identity = 119/128 (92.97%), Postives = 122/128 (95.31%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSVA AI EY+YP QEKLAKYKNELSKGVWGYWELGAWK LGISARHRARLRKE
Sbjct: 1   MAGKAAKSVAKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKSLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK KQ
Sbjct: 121 KEEEKKKQ 128

BLAST of Tan0017559 vs. NCBI nr
Match: XP_008466828.1 (PREDICTED: uncharacterized protein LOC103504139 [Cucumis melo] >ADN33743.1 hypothetical protein [Cucumis melo subsp. melo] >KAA0050509.1 uncharacterized protein E6C27_scaffold175G001510 [Cucumis melo var. makuwa] >TYK29186.1 uncharacterized protein E5676_scaffold120G003760 [Cucumis melo var. makuwa])

HSP 1 Score: 236.5 bits (602), Expect = 1.3e-58
Identity = 119/128 (92.97%), Postives = 122/128 (95.31%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSVA AI EY+YP QEKLAKYKNELSKGVWGYWELGAWK LGISARHRARLRKE
Sbjct: 1   MAGKAAKSVAKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKHLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK KQ
Sbjct: 121 KEEEKKKQ 128

BLAST of Tan0017559 vs. ExPASy TrEMBL
Match: A0A6J1DSK0 (uncharacterized protein LOC111023997 OS=Momordica charantia OX=3673 GN=LOC111023997 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 9.6e-60
Identity = 118/129 (91.47%), Postives = 125/129 (96.90%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAK++A AIGEY+YP QEKLAKYKNELSKG+WGYWELGAWKPLGISARHRARLRKE
Sbjct: 1   MAGKAAKAMAKAIGEYQYPWQEKLAKYKNELSKGIWGYWELGAWKPLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VL+AGQDWP+DPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLLEYKKRRWEKKM
Sbjct: 61  VLVAGQDWPFDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLEYKKRRWEKKM 120

Query: 121 KEEEKNKQQ 130
           KEEEK KQQ
Sbjct: 121 KEEEKKKQQ 129

BLAST of Tan0017559 vs. ExPASy TrEMBL
Match: A0A6J1J2P5 (uncharacterized protein LOC111482859 OS=Cucurbita maxima OX=3661 GN=LOC111482859 PE=4 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 3.6e-59
Identity = 119/129 (92.25%), Postives = 122/129 (94.57%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKA KSVA AIGEY+YP QEKL KYKNELSKGVWGYWELGAWKPLGISARHRARLRKE
Sbjct: 1   MAGKAVKSVAKAIGEYQYPWQEKLVKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDW YDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWAYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQQ 130
           KEEEK KQQ
Sbjct: 121 KEEEKKKQQ 129

BLAST of Tan0017559 vs. ExPASy TrEMBL
Match: A0A0A0L9K9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G239330 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 4.8e-59
Identity = 119/128 (92.97%), Postives = 122/128 (95.31%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSVA AI EY+YP QEKLAKYKNELSKGVWGYWELGAWK LGISARHRARLRKE
Sbjct: 1   MAGKAAKSVAKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKSLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK KQ
Sbjct: 121 KEEEKKKQ 128

BLAST of Tan0017559 vs. ExPASy TrEMBL
Match: A0A5A7UAJ5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold120G003760 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 6.2e-59
Identity = 119/128 (92.97%), Postives = 122/128 (95.31%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSVA AI EY+YP QEKLAKYKNELSKGVWGYWELGAWK LGISARHRARLRKE
Sbjct: 1   MAGKAAKSVAKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKHLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK KQ
Sbjct: 121 KEEEKKKQ 128

BLAST of Tan0017559 vs. ExPASy TrEMBL
Match: E5GBA1 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 6.2e-59
Identity = 119/128 (92.97%), Postives = 122/128 (95.31%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAAKSVA AI EY+YP QEKLAKYKNELSKGVWGYWELGAWK LGISARHRARLRKE
Sbjct: 1   MAGKAAKSVAKAISEYQYPWQEKLAKYKNELSKGVWGYWELGAWKHLGISARHRARLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VLLAGQDWPYDPER EMRTKRKGHKCDRIAAEKRENTARLMEKMP+MLL+YKKRRWEKKM
Sbjct: 61  VLLAGQDWPYDPERKEMRTKRKGHKCDRIAAEKRENTARLMEKMPDMLLQYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK KQ
Sbjct: 121 KEEEKKKQ 128

BLAST of Tan0017559 vs. TAIR 10
Match: AT4G22000.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 196.4 bits (498), Expect = 1.4e-50
Identity = 96/128 (75.00%), Postives = 112/128 (87.50%), Query Frame = 0

Query: 1   MAGKAAKSVAMAIGEYRYPLQEKLAKYKNELSKGVWGYWELGAWKPLGISARHRARLRKE 60
           MAGKAA++VA  +  +++P + KL KY+ EL+KGVWGYWE+GAWKPLGISAR RA LRKE
Sbjct: 1   MAGKAAEAVAKTVTGFQHPWRAKLDKYRTELTKGVWGYWEMGAWKPLGISARRRAMLRKE 60

Query: 61  VLLAGQDWPYDPERGEMRTKRKGHKCDRIAAEKRENTARLMEKMPEMLLEYKKRRWEKKM 120
           VL  G+DWPYDPER  MRTKRKGHKCDRI+AEKRENTA+LM KMP+MLL+YKKRRWEKKM
Sbjct: 61  VLTNGEDWPYDPERKAMRTKRKGHKCDRISAEKRENTAKLMLKMPQMLLDYKKRRWEKKM 120

Query: 121 KEEEKNKQ 129
           KEEEK K+
Sbjct: 121 KEEEKAKE 128

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022157240.12.0e-5991.47uncharacterized protein LOC111023997 [Momordica charantia][more]
XP_038880142.13.4e-5993.02uncharacterized protein LOC120071824 [Benincasa hispida] >XP_038880151.1 unchara... [more]
XP_022984631.17.5e-5992.25uncharacterized protein LOC111482859 [Cucurbita maxima] >XP_022984632.1 uncharac... [more]
XP_004146224.19.8e-5992.97uncharacterized protein LOC101216182 [Cucumis sativus] >KGN57669.1 hypothetical ... [more]
XP_008466828.11.3e-5892.97PREDICTED: uncharacterized protein LOC103504139 [Cucumis melo] >ADN33743.1 hypot... [more]
Match NameE-valueIdentityDescription
A0A6J1DSK09.6e-6091.47uncharacterized protein LOC111023997 OS=Momordica charantia OX=3673 GN=LOC111023... [more]
A0A6J1J2P53.6e-5992.25uncharacterized protein LOC111482859 OS=Cucurbita maxima OX=3661 GN=LOC111482859... [more]
A0A0A0L9K94.8e-5992.97Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G239330 PE=4 SV=1[more]
A0A5A7UAJ56.2e-5992.97Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
E5GBA16.2e-5992.97Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G22000.11.4e-5075.00unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 113..129
NoneNo IPR availablePANTHERPTHR36781OS05G0114600 PROTEINcoord: 1..127

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0017559.1Tan0017559.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016740 transferase activity