Tan0007595 (gene) Snake gourd v1

Overview
NameTan0007595
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGlcNAc kinase
LocationLG01: 9556708 .. 9564443 (-)
RNA-Seq ExpressionTan0007595
SyntenyTan0007595
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAATCGTTGAATTGTACGATTATCCAGTGGACAATCAGCAGCAAAAAATTCTGCTCGTCTTCCCCGTCGCTTACCATCGATCTTCTTCTTCTTCTTCTTTCTCTCCAAATTTCCAAGTATTTCTGCAGAACGGAAGAGAGTACTGGTTCGTTTTTGTGGCGGTTTGAGTGTAATTGAAGGAAACAGTAAGGCAGAAGGATCAAGGTTATTGTACCGGATTCTGATTCCTGAGTTTCCAGATGACGAAGAAACATCGGAATGGCGAAATCTGGGAAATTGAGCGAGAAATGTCCGACGGAGCAGCAGCAGGAGGAGGAGTTGGAGACGTGATTCTTGGAATTGATGGAGGAACAACCTCCACAATCTGCGTTTGCGTTCCGTTTTTACCACCTCAGTCTCTTCAGCTTCCTGATCCTATTCCTCTTCTTGCTCGAGTTGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGGTACTCTTATTGAATCTTGACTGGTCTTTCATTTTTCTTACTGATATTCTTCGGCATATGTTTATCATTGTTCTTCCTGGACTGGTGGCTTTATTTTGCAAGATTATTATTGAATGGGCAGTTGTAGGATAGGTCCTTTCGTTGATCAATTTACCTTTTAGTTCTGGTTTCATTTATTCAAGATAAATGCAAGTATCCTCAGCAATTTTATTTGCAAATCATTTGGTATTTAAAGAGAAGGTTAACCAAAGTGCATGTTAGCATAGTTTAAAAAAGTTAAATGAGTCTAAGTCGAATTTTCATACCAATTAATAACACTCACTAATTATAAGGACGATTGGTAGGAATTAGCCTTATTTTGTCGTTCTTTGGCTGTTAACCATCCTCTCATTGGGCAAAAAATCATCAAGTCGTCCATCAAATGGTTTGGAGCTTTTCTTTTCTTTTTTTTTTTATCTAGGTCAACTTCTTTTTTGAAAGAACATTGCTTGTTTGATAGAGATTTGTGACATATCACGTGGAATAACCAACTTTAACTACTCTCGGTCACACTTTTAACAATCCTCTTATAGAACAAAGGACCAACAAGTCTCCCGTCAACAGTCTCTCCATTAGAGGATAGTACCTACCTAATAGAACCCTACATATTGCTGCAAGTAACTAGGTATCACTACTTCTTTGTGGTATTCAGGAATTCCTTTCATACGACCAAGTCATTCAGGCTCTTTCCATTTTGTTTAGCATACGCGGATTTTATCCCCGTGACTAAGCTTGGAGAAAGACCCTGATACTATTTATTAAAATACTTGGTTTTTCTACATCTCTATGATGGTTTAAAGGAATTTGCTTGGAGCATAAATTTGTATGGTTTTGCCTTTTGAATTTAATCCAAAAGGCCTCATATTCATGGAAATATCATCCTTAATTATAGAGTTATGATATCCTTTATGTATAGTTGATCTGGGACTTTGGTTGTATCTTATAACACCCGTACAGACCAATATCATGGAATGAAGCATTGGTTATATTAAGATTCCCATGCACGTAGACTATGAGTAATGGTCAGCATCGGTCAAATATAACTTCTTGAACTTTCGGGACCAAATTAGATATAGGGGTTAAAACTGTATTTTGACCTTAATTTTTTCATACAACTAAAGACTATGGTATGACAACTCTCAAGGCTCTGTCTTTTCCGGATATTAGTAGTAATGATGGATGGTTGAACAGTGTCAGAAGTCCTTTAAATTCAATTACCATTTGGAAATTATCTACCAAAGTTAATTTATTGTTGATTTTTTTTATTTTTTATTATTATAATAAGATATTGAGAGTTGAGAGTTGCATTGGAGGCGTAAACTGCAACACTTCAAACCTAATCATCTTCTTTTAAATCAGAGGCAAACTATGAGTTGTTTTGCAACTGGCTTTACTGGTTTGATGTTGTGCCATTTGGTATATTTTCGAAAACTAAGGGCCTGTTTGGTAGGCAATCCGAAAACAGAAACTTGAAAACAAGGGATTCAATGAAAACAAAGTTGTATTTCATGTTTTCAGATATGTGTTTGGTTGCAGATTTAGAAATTCAATTTCATTTTAAACAGTAGTTTAAAGTGTGTTCTATAATATATTTATAGATCATAAAATTAGTTATAGTTGCAAAATTAAATATTAAGTTGAATATAAGACATTATTAATTAATTTATGAACATGTTTTTTTAATATAAATTTTATATAATTATTATTTTATAATATAATATTTTATAATTTATATTATTTTATATTATACATATTTTAATTTAACAAAAGTAAATTAAGTTTATAAACATGAAATACTATATTTATTAAGATTTTATCTTTGTTTAATACAATTCTGCATTTTAAATTTTCAAATTCAGAATCTGTATACAATGAAAACATAAAAATGTTGTTTTCAGAATTTCTACTTTTTGGATCACATAATCCGAAAACAGTTTTCGAAAACACGTCTGCCAAACACGTATTCACTGAATTCAGTGAATCCGAAAACATAAAACAAAATCTGGATTGCATACCAAACAGGCCCTACGATATTGACAGGGGCAATAGACTCTTATTTTGAATTGGGATGGAGAAAAAAGGCTGGTTGGTTTCTAAGTATTTAATATGAGATTTTAGTATTGATAGGTCCTTGGCATAAAATTTGCTTCTTCGTTTCAGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCCCTTTCAAAATCATGTTCAAATCGGTCTGCAGTTCGAGCTGTTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGCTCAGGTATGGGCCAAGAAAGGATTCTTAGCTGTTTCTATGAAACTATGTTCTTCTGTATTTGGGTTTACGTGATGGAGCATGTTATTTGCTCATTGTTTTATATGAAATCTTAGGATGGCCATTGTCCTTTCTACTCAGTTGACTAATTATAGCTATTATTACAATATCCCTTAGTGGTGTGCTTATAGTTATAGCTGAACCCTTTTATTCAATGAACTTCACGATCTGAGTTAGGTCTCAAATCAATTCCCAGGCTCCAAGCTATTATTCCATTCACTAGCTATTTTGTTATATTACACATGAAATCTGAATCTCATGTCTCTGGATTAATTATCCTTTTATACACTTCTGATATGTTTGTTTTAATTCGCACAGGGATATATTTCCTAGCAATGTCAAACTCTATGTTCGAAATGATGCTGTCGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTCACGGTTGTGTTCTAATTGCTGGCACTGGGTCTATTGCTTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGATTGGGGAAGGTCTCTTTTTCTCACTCCCTCACATGGGCGCACACTTCTAAATTCGTTTATGATTGATTGCATGTGGATCATCTATTTGTTTCCCAAATACTTGTACCATTATTCGTTGTTGGTGAATGATTGAGTGGTGAATTTATTTTTTACTCTCATAGCATCGTTCTCAATATTTTATATATTTAGTCATGAAGACCATGCCTATCAATGTTTAAGCAAGTTCCTTGGAAATCTCTAATTGGCTTCTTTTGCTTCCGTAGTATGTTGTCAATTAGATGCTCAAATTTTGTTGAAAGAAATTTTGTTAAAAAACTTCCAGACTAATCTTTCCGGTCAATGTCTTTTATCTTTGATTTTTTTTTTGAGCCAATTGGAGCCTCATATTGTAAATTTCTTATTGAACAACGCTCGTGGAAAATATTTTAGCTTGAGTTTGTGTATCGCTTGAGGTAAATAATTATGTATCATCTTGTTGGTTATATCTGCACTAGCCATATTACTTGATCAACTGGGACTTGCTGAGTTATCTAGTGGATGAAATGGTTTGTTCTTTTGCTTTTATTGTGGTTAATGCTTCAAGGAAATTTTTTTTTGGAGAGTTCAACGTTCAACATCTCAAGACTTTTACAAAACTACGACACTAGGTTTTCTCCCATGGGCTCCTCTTATTACATTTATCAATGAAATTTGTTTCTCTTCTCAAAAAACTAGGTTTCCTCCCTTGGGCAGTTTTCATTATAATATTATATCGCTACTTAATCCAAAAGCTTAAATTGATAGCCATGGGTTACGGTATATTTAATTATTATTCTTATTCTTAACACTCTTCATTTAAGGGAATGAAAATTAACATACGGCTCAATAAATGACATTCATATTAATTAGAGAAGAAATGACATTGAAGAGTTCAAACACAGAACCTGCTACTCTGATACTGTGATACCCAAAAAATTAAGTTGATAGATTAGAGTATATTTGAATAAATGTTGATAGATTAGGGTATATTTAATCATTTATTCATATTTTTTTTACTATAAGAAACATTTTACTGTGATCTTTGCACTCTGCACGTTTGTATGTTTGTTTTGCAAATTTGTTTGTGTTTTAAAATTATATTACTGCATTGCATGCTGATCTATATTGGTAAATTTCTGAGAGTATTGTGCATTACATACAAATTTTTAGTTTCTTTTGCCACTTTTCATTGAAGGTGATACTTTTTGTAATTTCAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACGAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTATGTACACTCATTACCGGTCACAATTTTCACTATCTCATCTTTTATATTCTTTGACGCAGTTTCCGGACAATATAAATTAAACATAGTAGAAGGAATAAAATATCTTCCAATTATTAGAAGGAAATAATTCTTTTTAAATGATGTATAGTCAGTCTTTCTGTCTCTCTCTTTTATTTTTTTTTTATATATATTGAAGTCATGCTGAGGCTTGCTTTTATCCATTTTAATACCTTTTTTTTTTATTATTATTATGAATGTATTAATCTGTATTATTGTGAAGGCTGCCCTGATTTACGTGCAGCAGCTGTACAATTTTTTGAAAGATAATGGAAAGAGAATCAACATTTATCCAAGTATTTCTTGAAATTTCGAGGAGAAAGTAGTTGTTTCCTGTTCAGTTTCTGACATTTGTACCCCAATTCATATGTTGAAATGGTAGGAAAAGTACAGTATCATGCATGATTGATTAGGAATGGATCACATCTGTTTTTGGTTATCATTGAAAATTAGGTGGACCTACGCAGATCCATCTTGGGCTCGCATTGCTGCACTTGTTCCTCTTGTTGTATCATGTGCAGAAGCAGGGGATGAAGTTGCAAACAACATCTTGCAAGATTCAGTTAAGGAATTGGCTTTAAGCGTGAATGCTGTTGTTCAAAGACTCGGATTGTGTGGTTCAGGTACTGCAATAGTGGAAATCTTTGGTCGTAGTTAAGGGCCCGTTTGATAACGTTCCCGTTTCCTGTTTCTTGTTTCTATTTCTCATTTTTTAAGAAACAGACTTGTTTGATAATCCATCCTGTTTCTTGTTCCCAAAATTTGAGAAACGTTTCTAAAATTTGGACTAAATTTTAGAAACAACATAAAGTAGTTTCTTTTTCTGTTCCGTTTCTTTTTATTTTTTAAATGTTCCCAACTTTTCTTAATTGACTCTCCAGCTTTGGTAAAGTGTTTGGTAGTTTTTATCTCCTTCCCTAACCAATCTTGAGTTCGATCCCTAAAGAGTACAAATCTTTTTTTTTTTAATGAATTTCGGATTTTGCAATATATATAGTTTTAGTTTTTATAAATAATAATAATATAATATATATAAATTTATTTATTTTTATTTTATTTCTCAATTTTACCATATGTATATTTATATTATTTTAATTTAAAATTTAAAATTTTATATATATATATATATTATTTGAATTTTGAATTTTAATTTTGCAATTTTTATATTTATATATTATTGTAATTTTGCAATAAATAAATTTTGTTCACTAAAATATGGCTTTAGCCTCATTTTAATATTTTACAACCACTACATTTAACAATATAATTGAGTTTGATCTGAGTCGACATTATTTAAGACGAATGACTCGAGTAAGAGATGATATTACAATTCAAATTTTAACAACATTTAAAGGATAAAACAATTTTATTTATTTACTTTTATTATTAGTTAATTTTGTATCAAGACTTGTTGGTATTTGATAATATAAATTTGTTATAATTAATTATATATTTATGTATACAAGAAACAAGAAACAGTTATCAAACAAGTTTATGTTTCTATTTCTTATTTTCAAAGAAACAAGAAACAGAAATAGTTATCAAACATATTCTTGTTTCTTATTCTTAAAAAATGAGAAACAAGAAACAAGAGACAAGGAACAAGAAACAGGAAACAGAAAACAAGAACGTTATCAAACGGGGCCTAATAAACTTGATTCTCCTGGCCCCCGCTTCTGAAACTAACCTCAAAGTTGGAAGCACTTTAGCTATTAAACCTCCCTCAACGTCTTTCCTTCTTTGGGAGAATGCGTTTTACCCCATGTTCTGACTTTGAGATGAAGTACGTCTCCTCAATTTATTTCTCTTTGGATCTGCACCTGAATGTGCAACAAAATTGATCGTGAATAATCTCAATATATGCGCTTATAAAAAGTTTGGAGCCGTTCCATGTCTATTCGTTGGAATGCTACTGATCATGCAAAGATATTCGTTGGAATTAGGATTTCTAGAAATTTCTTGTAGATCCAAAGTTTTGCCTCAGTAATAGGATAGTCTCCTTTTTGATTTTTTTAACCAAAGTTTTGCCTCACCTCTCATTGTAAGCATCATTAACATTGCAGATGGGAAGGATTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTTGAAGGAAATAAGGGATGGGGTATAGCGCAAGAAGTTATTAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTAAGCATATTGTTCAATATCCTCTCTTTTCTGCATGTGTTTTTACATTATGATAACTGTGGTGCTGTTTGGGAGCTGTTGTGAAAAATATTCAATGAAAACATGTTCATTTATTTTTCTTAAATATGTACATGGACAGACTTGATTCTTAAAACAACAAAAATTCTCTCTTTCAAAACCACAAGTTTATGAACAGCGATTTCAAGTGTACGAAACAACCCCACATCTGTTTCTGCCTTTTGCTATGCTTTCACTAAAATGCATGTTCTTCACCCCGCAATAATCATCCATGCATTTTCTTGAACTAAAACCAGGTGGAGCCTGCTATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAAGGCTTTAAAAAAAAGTGACTGACAAATATTCTTAAGTTCTCATGAGTTGTACAGAAGCAAAGTTGAAAACAGTGAGCCCAGAATTCCAAAGGGGAAGAAGGAAAAAAAAGGTCCTGTGAGTGAGCTATTTCAGTTCTCAGTATAAATATAGGCAATTTACTAACCAAGATTCTGTCATTTTTTTAAAATAATAGTTATCGGGTGATTGAAAAAAGAAATGCTGTGTCAACATTAATTAATGCATGATTTTGGTGCATTTGCTTCAATGTCCTGTGTTACTGTCAACTTGAAAGCAATTCTTTGGCTTTTGAGCCCTTTATTTTTGTGCTTTGTTCCATTAATCTGTCATTTCCAATTTGCATGGG

mRNA sequence

TAATCGTTGAATTGTACGATTATCCAGTGGACAATCAGCAGCAAAAAATTCTGCTCGTCTTCCCCGTCGCTTACCATCGATCTTCTTCTTCTTCTTCTTTCTCTCCAAATTTCCAAGTATTTCTGCAGAACGGAAGAGAGTACTGGTTCGTTTTTGTGGCGGTTTGAGTGTAATTGAAGGAAACAGTAAGGCAGAAGGATCAAGGTTATTGTACCGGATTCTGATTCCTGAGTTTCCAGATGACGAAGAAACATCGGAATGGCGAAATCTGGGAAATTGAGCGAGAAATGTCCGACGGAGCAGCAGCAGGAGGAGGAGTTGGAGACGTGATTCTTGGAATTGATGGAGGAACAACCTCCACAATCTGCGTTTGCGTTCCGTTTTTACCACCTCAGTCTCTTCAGCTTCCTGATCCTATTCCTCTTCTTGCTCGAGTTGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCCCTTTCAAAATCATGTTCAAATCGGTCTGCAGTTCGAGCTGTTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGCTCAGGGATATATTTCCTAGCAATGTCAAACTCTATGTTCGAAATGATGCTGTCGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTCACGGTTGTGTTCTAATTGCTGGCACTGGGTCTATTGCTTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGATTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACGAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTGGACCTACGCAGATCCATCTTGGGCTCGCATTGCTGCACTTGTTCCTCTTGTTGTATCATGTGCAGAAGCAGGGGATGAAGTTGCAAACAACATCTTGCAAGATTCAGTTAAGGAATTGGCTTTAAGCGTGAATGCTGTTGTTCAAAGACTCGGATTGTGTGGTTCAGATGGGAAGGATTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTTGAAGGAAATAAGGGATGGGGTATAGCGCAAGAAGTTATTAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCTATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAAGGCTTTAAAAAAAAGTGACTGACAAATATTCTTAAGTTCTCATGAGTTGTACAGAAGCAAAGTTGAAAACAGTGAGCCCAGAATTCCAAAGGGGAAGAAGGAAAAAAAAGGTCCTGTGAGTGAGCTATTTCAGTTCTCAGTATAAATATAGGCAATTTACTAACCAAGATTCTGTCATTTTTTTAAAATAATAGTTATCGGGTGATTGAAAAAAGAAATGCTGTGTCAACATTAATTAATGCATGATTTTGGTGCATTTGCTTCAATGTCCTGTGTTACTGTCAACTTGAAAGCAATTCTTTGGCTTTTGAGCCCTTTATTTTTGTGCTTTGTTCCATTAATCTGTCATTTCCAATTTGCATGGG

Coding sequence (CDS)

ATGACGAAGAAACATCGGAATGGCGAAATCTGGGAAATTGAGCGAGAAATGTCCGACGGAGCAGCAGCAGGAGGAGGAGTTGGAGACGTGATTCTTGGAATTGATGGAGGAACAACCTCCACAATCTGCGTTTGCGTTCCGTTTTTACCACCTCAGTCTCTTCAGCTTCCTGATCCTATTCCTCTTCTTGCTCGAGTTGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCCCTTTCAAAATCATGTTCAAATCGGTCTGCAGTTCGAGCTGTTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGATCAGCAAAGGATTTTGAATTGGCTCAGGGATATATTTCCTAGCAATGTCAAACTCTATGTTCGAAATGATGCTGTCGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTCACGGTTGTGTTCTAATTGCTGGCACTGGGTCTATTGCTTATGGATTTACAGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGATTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACGAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTGGACCTACGCAGATCCATCTTGGGCTCGCATTGCTGCACTTGTTCCTCTTGTTGTATCATGTGCAGAAGCAGGGGATGAAGTTGCAAACAACATCTTGCAAGATTCAGTTAAGGAATTGGCTTTAAGCGTGAATGCTGTTGTTCAAAGACTCGGATTGTGTGGTTCAGATGGGAAGGATTCTTTTCCCCTTGTCATGGTTGGTGGAGTTCTTGAAGGAAATAAGGGATGGGGTATAGCGCAAGAAGTTATTAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCTATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAA

Protein sequence

MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADPSWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE
Homology
BLAST of Tan0007595 vs. ExPASy Swiss-Prot
Match: Q54PM7 (N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum OX=44689 GN=nagk PE=3 SV=2)

HSP 1 Score: 200.7 bits (509), Expect = 2.8e-50
Identity = 120/333 (36.04%), Postives = 188/333 (56.46%), Query Frame = 0

Query: 29  DVILGIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQ- 88
           ++ +GIDGG T T  V V     +          LAR  + CSN++SVGE  A+  + + 
Sbjct: 4   EIFIGIDGGGTKTSTVAVDSNGQE----------LARHTSPCSNYHSVGEDLAKAAINEG 63

Query: 89  ------VMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFPSNVKLYVRND 148
                  + E ++   +    V ++CL +SGV+   D+  + +W+ ++   ++   + ND
Sbjct: 64  IKYVIRKVKETITDDDNKEVTVGSICLGMSGVDREKDKLLVKSWVTELLGESINYSIHND 123

Query: 149 AVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALT 208
           A+ AL+SGT G+L G V+I GTG I+ GF  +G   R+ G GP+LGD+GSGY I    L 
Sbjct: 124 AIVALSSGTQGKLFGVVIICGTGCISLGFNREGVSGRSGGWGPLLGDYGSGYQIGYDILR 183

Query: 209 AIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP---SWARIAALVPLVVSCAEA 268
            +++A D  GP+T LT  +L+ L L+  ++LI W Y DP   SW + A L PL    A+ 
Sbjct: 184 HVLKAKDQVGPKTSLTQVLLEKLQLTKEEDLISWAY-DPKTQSWQKFAQLSPLAFEQAQL 243

Query: 269 GDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQEVI 328
           GDE++N IL D+   L   +N+V+++LGL   D ++ FPLV  GG +E     GI  +++
Sbjct: 244 GDEISNLILVDAANALYDLINSVIKKLGL---DKEEKFPLVYTGGNIERK---GILSDLL 303

Query: 329 N-CISKDYPGVVPIWPKVEPAIGAALLAWNFLK 351
           +  I ++YP    +    +P++GAALLA N  K
Sbjct: 304 SKKIMENYPNAEILNTTCDPSMGAALLALNSKK 319

BLAST of Tan0007595 vs. ExPASy Swiss-Prot
Match: Q97ML3 (N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) OX=272562 GN=murK PE=1 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 6.5e-31
Identity = 106/316 (33.54%), Postives = 161/316 (50.95%), Query Frame = 0

Query: 31  ILGIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMA 90
           ++GIDGG + T            +   D   +L  V  G SN NS  +   +  L++++ 
Sbjct: 4   VIGIDGGGSKT---------HMKISTLD-YKVLLEVFKGPSNINSSTKEEVKRVLQELIM 63

Query: 91  EALSKSCSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFPSNVKLYVRNDAVAALASGT 150
           E L K   +     A+C+  +G +   D+  I + +R +     K+ V NDA  ALA G 
Sbjct: 64  EGLGKLGQSLEECSAICIGTAGADRTEDKSIIEDMIRSLGYMG-KIIVVNDAEIALAGGI 123

Query: 151 MGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGR 210
             R  G ++I+GTGSI YG   +GR AR+ G G I+GD GSGY I  +A+ A +++ D R
Sbjct: 124 EKR-EGIIVISGTGSICYGRNKEGRSARSGGWGHIIGDEGSGYDIGIKAIKAALKSFDKR 183

Query: 211 GPQTKLTNSILQTLGLSSADELIGWTY-ADPSWARIAALVPLVVSCAEAGDEVANNILQD 270
           G +T L   IL  L L S ++LI + Y +  +   IA+L  +V S    GD V+  IL++
Sbjct: 184 GEKTILEGDILDFLKLKSHEDLINYIYRSGVTKKEIASLTRVVNSAYIKGDLVSKRILKE 243

Query: 271 SVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQEVINCISKDYPGVV 330
           + +EL LSV AVV+ L +          L   GGV+  N    +  E    ++ +YP V 
Sbjct: 244 AARELFLSVKAVVEVLSM----QNKKVVLTTAGGVI--NNINYLYDEFRKFLNLNYPKVK 301

Query: 331 PIWPKVEPAIGAALLA 346
            I  K + A GA ++A
Sbjct: 304 IISMKNDSAFGAVIIA 301

BLAST of Tan0007595 vs. ExPASy Swiss-Prot
Match: Q3SZM9 (N-acetyl-D-glucosamine kinase OS=Bos taurus OX=9913 GN=NAGK PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 9.8e-11
Identity = 73/290 (25.17%), Postives = 126/290 (43.45%), Query Frame = 0

Query: 33  GIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEA 92
           G++GG T +          + L L +   +LA  +   +NH  +G     E + +++  A
Sbjct: 7   GVEGGGTRS----------KVLLLSEDGQILAEADGLSTNHWLIGTDKCVERINEMVNRA 66

Query: 93  LSKS-CSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFPSNVKLY-VRNDAVAALASGT 152
             K+       +R + LS+SG +     + ++  LRD FP   + Y +  DA  ++A+ T
Sbjct: 67  KRKAGVDPLVPLRGLGLSLSGGDQEDAVRMLMEELRDRFPYLSESYLITTDAAGSIATAT 126

Query: 153 MGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGR 212
                G VLI+GTGS       DG E+   G G ++GD GS Y I+ QA+  +  + D  
Sbjct: 127 PD--GGVVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNL 186

Query: 213 GPQTKLTNSILQTL----GLSSADELIGWTYADPSWARIAALVPLVVSCAEAGDEVANNI 272
                    + Q +     +     ++   Y D   +R A     V   A+ GD ++  I
Sbjct: 187 EAAPHDIGYVKQAMFNYFQVPDRLGILTHLYRDFDKSRFAGFCRKVAEGAQQGDPLSRCI 246

Query: 273 LQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQE 317
            + + + L   V AV+  +      G+   P++ VG V    K W + +E
Sbjct: 247 FRKAGEMLGRHVVAVLPEIDPVLFQGEMGLPILCVGSVW---KSWELLKE 281

BLAST of Tan0007595 vs. ExPASy Swiss-Prot
Match: Q9UJ70 (N-acetyl-D-glucosamine kinase OS=Homo sapiens OX=9606 GN=NAGK PE=1 SV=4)

HSP 1 Score: 68.9 bits (167), Expect = 1.3e-10
Identity = 71/290 (24.48%), Postives = 126/290 (43.45%), Query Frame = 0

Query: 33  GIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEA 92
           G++GG T +          + L + +   +LA  +   +NH  +G     E + +++  A
Sbjct: 7   GVEGGGTRS----------EVLLVSEDGKILAEADGLSTNHWLIGTDKCVERINEMVNRA 66

Query: 93  LSKS-CSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFPSNVKLY-VRNDAVAALASGT 152
             K+       +R++ LS+SG +     + ++  LRD FP   + Y +  DA  ++A+ T
Sbjct: 67  KRKAGVDPLVPLRSLGLSLSGGDQEDAGRILIEELRDRFPYLSESYLITTDAAGSIATAT 126

Query: 153 MGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGR 212
                G VLI+GTGS       DG E+   G G ++GD GS Y I+ QA+  +  + D  
Sbjct: 127 PD--GGVVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNL 186

Query: 213 GPQTKLTNSILQTL----GLSSADELIGWTYADPSWARIAALVPLVVSCAEAGDEVANNI 272
                    + Q +     +     ++   Y D    R A     +   A+ GD ++  I
Sbjct: 187 EAAPHDIGYVKQAMFHYFQVPDRLGILTHLYRDFDKCRFAGFCRKIAEGAQQGDPLSRYI 246

Query: 273 LQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQE 317
            + + + L   + AV+  +      GK   P++ VG V    K W + +E
Sbjct: 247 FRKAGEMLGRHIVAVLPEIDPVLFQGKIGLPILCVGSVW---KSWELLKE 281

BLAST of Tan0007595 vs. ExPASy Swiss-Prot
Match: P81799 (N-acetyl-D-glucosamine kinase OS=Rattus norvegicus OX=10116 GN=Nagk PE=1 SV=4)

HSP 1 Score: 67.8 bits (164), Expect = 2.9e-10
Identity = 70/290 (24.14%), Postives = 126/290 (43.45%), Query Frame = 0

Query: 33  GIDGGTTSTICVCVPFLPPQSLQLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEA 92
           G++GG T +          + L L +   +LA  +   +NH  +G     E + +++  A
Sbjct: 7   GVEGGGTRS----------KVLLLSEDGQILAEADGLSTNHWLIGTGTCVERINEMVDRA 66

Query: 93  LSKS-CSNRSAVRAVCLSVSGVNHPTDQQRILNWLRDIFP-SNVKLYVRNDAVAALASGT 152
             K+       +R++ LS+SG       + ++  LRD FP  +   ++  DA  ++A+ T
Sbjct: 67  KRKAGVDPLVPLRSLGLSLSGGEQEDAVRLLMEELRDRFPYLSESYFITTDAAGSIATAT 126

Query: 153 MGRLHGCVLIAGTGSIAYGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGR 212
                G VLI+GTGS       DG E+   G G ++GD GS Y I+ QA+  +  + D  
Sbjct: 127 PD--GGIVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNL 186

Query: 213 GPQTKLTNSILQTL----GLSSADELIGWTYADPSWARIAALVPLVVSCAEAGDEVANNI 272
                    + Q +     +     ++   Y D   ++ A     +   A+ GD ++  I
Sbjct: 187 EAAPHDIGHVKQAMFNYFQVPDRLGILTHLYRDFDKSKFAGFCQKIAEGAQQGDPLSRFI 246

Query: 273 LQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVMVGGVLEGNKGWGIAQE 317
            + + + L   V AV+  +      G+   P++ VG V    K W + +E
Sbjct: 247 FRKAGEMLGRHVVAVLPEIDPVLFQGELGLPILCVGSVW---KSWELLKE 281

BLAST of Tan0007595 vs. NCBI nr
Match: XP_023526102.1 (N-acetyl-D-glucosamine kinase-like [Cucurbita pepo subsp. pepo] >XP_023526173.1 N-acetyl-D-glucosamine kinase-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 676.0 bits (1743), Expect = 1.7e-190
Identity = 336/355 (94.65%), Postives = 344/355 (96.90%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTA+IRAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAVIRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNILQ SVKELALSV  VVQRL LCGSDGKDSFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNILQHSVKELALSVTGVVQRLRLCGSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLEGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. NCBI nr
Match: XP_022981650.1 (N-acetyl-D-glucosamine kinase-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 675.2 bits (1741), Expect = 3.0e-190
Identity = 336/355 (94.65%), Postives = 345/355 (97.18%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDA+AALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDALAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNILQ SVKELALSV AVVQRL LCGSDGKDSFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNILQHSVKELALSVTAVVQRLRLCGSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVL+GNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLKGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. NCBI nr
Match: KAG6600454.1 (N-acetyl-D-glucosamine kinase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 674.5 bits (1739), Expect = 5.1e-190
Identity = 336/355 (94.65%), Postives = 344/355 (96.90%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RIL WLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILIWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTA+IRAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAVIRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNILQ SVKELALSV AVVQRLGLC SDGKDSFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNILQHSVKELALSVTAVVQRLGLCDSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLEGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. NCBI nr
Match: XP_022941933.1 (N-acetyl-D-glucosamine kinase-like [Cucurbita moschata])

HSP 1 Score: 669.1 bits (1725), Expect = 2.1e-188
Identity = 333/355 (93.80%), Postives = 343/355 (96.62%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
            LLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  SLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTA++RAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAVMRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNI+Q SVKELALSV AVVQRL LCGSDGK SFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNIVQHSVKELALSVTAVVQRLRLCGSDGKASFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLEGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. NCBI nr
Match: XP_022136560.1 (N-acetyl-D-glucosamine kinase-like [Momordica charantia])

HSP 1 Score: 657.1 bits (1694), Expect = 8.4e-185
Identity = 325/355 (91.55%), Postives = 337/355 (94.93%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMSDG   G G GDVILGIDGGTTSTICVCVPFL P SL L DP+
Sbjct: 1   MTKKHRNGEIWEFEREMSDGTGGGAGAGDVILGIDGGTTSTICVCVPFLQPHSLHLADPL 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           P+LARVEAGCSNHNSVGETAA+ETLEQVMAEALSK+ SNRSAVRAVCLSVSGVNHPTDQQ
Sbjct: 61  PVLARVEAGCSNHNSVGETAAKETLEQVMAEALSKAGSNRSAVRAVCLSVSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGP+ GDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQ LGLSS DELIGWTYAD 
Sbjct: 181 GAGPVFGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQILGLSSPDELIGWTYADT 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           SWARIAALVP+VVSCAEAGDEVANNILQDSVKELALSVNAVV+RL LCGSDGKDSFPLVM
Sbjct: 241 SWARIAALVPIVVSCAEAGDEVANNILQDSVKELALSVNAVVRRLELCGSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIA+EVINCISKDYPGVVP+WPKVEPAIGAALLAWNFLKDS +E
Sbjct: 301 VGGVLEGNKGWGIAEEVINCISKDYPGVVPVWPKVEPAIGAALLAWNFLKDSRRE 355

BLAST of Tan0007595 vs. ExPASy TrEMBL
Match: A0A6J1J041 (GlcNAc kinase OS=Cucurbita maxima OX=3661 GN=LOC111480704 PE=3 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 1.4e-190
Identity = 336/355 (94.65%), Postives = 345/355 (97.18%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDA+AALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDALAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNILQ SVKELALSV AVVQRL LCGSDGKDSFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNILQHSVKELALSVTAVVQRLRLCGSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVL+GNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLKGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. ExPASy TrEMBL
Match: A0A6J1FTG3 (GlcNAc kinase OS=Cucurbita moschata OX=3662 GN=LOC111447149 PE=3 SV=1)

HSP 1 Score: 669.1 bits (1725), Expect = 1.0e-188
Identity = 333/355 (93.80%), Postives = 343/355 (96.62%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMS G A GGGVGDVILGIDGGTTSTICVCVPFLPPQSLQ PDPI
Sbjct: 1   MTKKHRNGEIWEFEREMS-GGAEGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQFPDPI 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
            LLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLS+SGVNHPTDQQ
Sbjct: 61  SLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSLSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGPILGDWGSGYGISAQALTA++RAHDGRGPQTKLTNSIL TLGLSSADELIGWTYADP
Sbjct: 181 GAGPILGDWGSGYGISAQALTAVMRAHDGRGPQTKLTNSILHTLGLSSADELIGWTYADP 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           +WARIAALVP+VVSCAEAGDEVANNI+Q SVKELALSV AVVQRL LCGSDGK SFPLVM
Sbjct: 241 TWARIAALVPVVVSCAEAGDEVANNIVQHSVKELALSVTAVVQRLRLCGSDGKASFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIAQEV+NCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSH+E
Sbjct: 301 VGGVLEGNKGWGIAQEVVNCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHKE 354

BLAST of Tan0007595 vs. ExPASy TrEMBL
Match: A0A6J1C494 (GlcNAc kinase OS=Momordica charantia OX=3673 GN=LOC111008237 PE=3 SV=1)

HSP 1 Score: 657.1 bits (1694), Expect = 4.1e-185
Identity = 325/355 (91.55%), Postives = 337/355 (94.93%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSLQLPDPI 60
           MTKKHRNGEIWE EREMSDG   G G GDVILGIDGGTTSTICVCVPFL P SL L DP+
Sbjct: 1   MTKKHRNGEIWEFEREMSDGTGGGAGAGDVILGIDGGTTSTICVCVPFLQPHSLHLADPL 60

Query: 61  PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQQ 120
           P+LARVEAGCSNHNSVGETAA+ETLEQVMAEALSK+ SNRSAVRAVCLSVSGVNHPTDQQ
Sbjct: 61  PVLARVEAGCSNHNSVGETAAKETLEQVMAEALSKAGSNRSAVRAVCLSVSGVNHPTDQQ 120

Query: 121 RILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARAA 180
           RILNWLRDIFPS+VKLYVRNDAVAALASGTMGRL GCVLIAGTG+IAYGFTDDGREARAA
Sbjct: 121 RILNWLRDIFPSHVKLYVRNDAVAALASGTMGRLRGCVLIAGTGTIAYGFTDDGREARAA 180

Query: 181 GAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYADP 240
           GAGP+ GDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQ LGLSS DELIGWTYAD 
Sbjct: 181 GAGPVFGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQILGLSSPDELIGWTYADT 240

Query: 241 SWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLVM 300
           SWARIAALVP+VVSCAEAGDEVANNILQDSVKELALSVNAVV+RL LCGSDGKDSFPLVM
Sbjct: 241 SWARIAALVPIVVSCAEAGDEVANNILQDSVKELALSVNAVVRRLELCGSDGKDSFPLVM 300

Query: 301 VGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           VGGVLEGNKGWGIA+EVINCISKDYPGVVP+WPKVEPAIGAALLAWNFLKDS +E
Sbjct: 301 VGGVLEGNKGWGIAEEVINCISKDYPGVVPVWPKVEPAIGAALLAWNFLKDSRRE 355

BLAST of Tan0007595 vs. ExPASy TrEMBL
Match: A0A6J1JC39 (GlcNAc kinase OS=Cucurbita maxima OX=3661 GN=LOC111483072 PE=3 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 1.9e-182
Identity = 326/361 (90.30%), Postives = 337/361 (93.35%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMS------DGAAAGGGVGDVILGIDGGTTSTICVCVPFLPPQSL 60
           MTKK+RNGEIWE EREMS       G   GGGVG VILGIDGGTTST+CVCVP LP QSL
Sbjct: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGGGGVGGVILGIDGGTTSTVCVCVPLLPLQSL 60

Query: 61  QLPDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVN 120
            LPDP+PLLARVEAGCSNHNSVGETAARETLEQVMAEALSKS S+RSAV+A+CLSVSGVN
Sbjct: 61  HLPDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSGSDRSAVQAICLSVSGVN 120

Query: 121 HPTDQQRILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDG 180
           HPTDQQRILNWLRD+FPS+VKLYVRNDA AALASGTMGRL GCVLIAGTGSIA+GFTDDG
Sbjct: 121 HPTDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDG 180

Query: 181 REARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIG 240
           REARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQT LTNSILQTLGLSSADELIG
Sbjct: 181 REARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTNLTNSILQTLGLSSADELIG 240

Query: 241 WTYADPSWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKD 300
           WTYAD SWARIAALVP VVSCAEAGDEVANNILQD+VKELALSVNAVVQRLG  GSDGK 
Sbjct: 241 WTYADSSWARIAALVPAVVSCAEAGDEVANNILQDAVKELALSVNAVVQRLGFSGSDGKG 300

Query: 301 SFPLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQ 356
           SFPLVMVGGV+EGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQ
Sbjct: 301 SFPLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQ 360

BLAST of Tan0007595 vs. ExPASy TrEMBL
Match: A0A6J1EY19 (GlcNAc kinase OS=Cucurbita moschata OX=3662 GN=LOC111437512 PE=3 SV=1)

HSP 1 Score: 647.1 bits (1668), Expect = 4.2e-182
Identity = 327/359 (91.09%), Postives = 339/359 (94.43%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMS---DGAAAGGGVGDVILGIDGGTTSTICVCVPFLP-PQSLQL 60
           MTKK+RNGEIWE EREMS    G   GGGVG VILGIDGGTTSTICVCVP LP  QSL L
Sbjct: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHL 60

Query: 61  PDPIPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHP 120
           PDP+PLLARVEAGCSNHNSVGETAARETLEQVMAEALS+S S+RSAV+A+CLSVSGVNHP
Sbjct: 61  PDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHP 120

Query: 121 TDQQRILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGRE 180
           TDQQRILNWLRD+FPS+VKLYVRNDA AALASGTMGRL GCVLIAGTGSIA+GFTDDGRE
Sbjct: 121 TDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGRE 180

Query: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240
           ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT
Sbjct: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240

Query: 241 YADPSWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSF 300
           YAD SWARIAALVP VVSCAE+GDEVANNILQD+VKELALSVNAVVQRLGL GSDGK SF
Sbjct: 241 YADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSF 300

Query: 301 PLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 356
           PLVMVGGV+EGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE
Sbjct: 301 PLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 359

BLAST of Tan0007595 vs. TAIR 10
Match: AT1G30540.1 (Actin-like ATPase superfamily protein )

HSP 1 Score: 473.0 bits (1216), Expect = 2.1e-133
Identity = 236/350 (67.43%), Postives = 284/350 (81.14%), Query Frame = 0

Query: 1   MTKKHRNGEIWEIEREMSDGAAAGGG-VGDVILGIDGGTTSTICVCVPFLPPQSLQLPDP 60
           M   H NG + ++E +    A    G V  VILG+DGG TST+CVCVPF      + PDP
Sbjct: 1   MRNPHSNGNLRKLEADGGGEATEENGFVNGVILGLDGGATSTVCVCVPFF-SFGERFPDP 60

Query: 61  IPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSCSNRSAVRAVCLSVSGVNHPTDQ 120
           +P+L R  AGC+N NSVGETAAR++LEQV++EAL +S  ++S VR VCL VSGVNHP+DQ
Sbjct: 61  LPILGRAVAGCTNRNSVGETAARDSLEQVISEALVQSGFDKSDVRGVCLGVSGVNHPSDQ 120

Query: 121 QRILNWLRDIFPSNVKLYVRNDAVAALASGTMGRLHGCVLIAGTGSIAYGFTDDGREARA 180
           ++I NW+RD+FPS+VK+YV+NDA+ ALASGTMG+LHGCVLIAGTG IAYGF +DG+EARA
Sbjct: 121 EKIENWIRDMFPSHVKVYVQNDAIVALASGTMGKLHGCVLIAGTGCIAYGFDEDGKEARA 180

Query: 181 AGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYAD 240
           +G GPILGDWGSGYGI+AQALTA+IRAHDGRGPQT LT++IL+ LGLSS DELIGWTYAD
Sbjct: 181 SGGGPILGDWGSGYGIAAQALTAVIRAHDGRGPQTMLTSTILKALGLSSPDELIGWTYAD 240

Query: 241 PSWARIAALVPLVVSCAEAGDEVANNILQDSVKELALSVNAVVQRLGLCGSDGKDSFPLV 300
           PSWARIAALVP VVSCAEAGDE+++ IL D+ ++LALSV AVVQRLGLCG DG  SFP+V
Sbjct: 241 PSWARIAALVPQVVSCAEAGDEISDKILVDAAEDLALSVKAVVQRLGLCGKDGTASFPVV 300

Query: 301 MVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL 350
           MVGGVL  N+ W I +EV   I++ +PG   I PKVEPA+GAALLA NFL
Sbjct: 301 MVGGVLNANQKWDIGKEVSKRINRYFPGAQTIIPKVEPAVGAALLAMNFL 349

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q54PM72.8e-5036.04N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum OX=44689 GN=nagk PE=3 ... [more]
Q97ML36.5e-3133.54N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (s... [more]
Q3SZM99.8e-1125.17N-acetyl-D-glucosamine kinase OS=Bos taurus OX=9913 GN=NAGK PE=2 SV=1[more]
Q9UJ701.3e-1024.48N-acetyl-D-glucosamine kinase OS=Homo sapiens OX=9606 GN=NAGK PE=1 SV=4[more]
P817992.9e-1024.14N-acetyl-D-glucosamine kinase OS=Rattus norvegicus OX=10116 GN=Nagk PE=1 SV=4[more]
Match NameE-valueIdentityDescription
XP_023526102.11.7e-19094.65N-acetyl-D-glucosamine kinase-like [Cucurbita pepo subsp. pepo] >XP_023526173.1 ... [more]
XP_022981650.13.0e-19094.65N-acetyl-D-glucosamine kinase-like isoform X1 [Cucurbita maxima][more]
KAG6600454.15.1e-19094.65N-acetyl-D-glucosamine kinase, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022941933.12.1e-18893.80N-acetyl-D-glucosamine kinase-like [Cucurbita moschata][more]
XP_022136560.18.4e-18591.55N-acetyl-D-glucosamine kinase-like [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1J0411.4e-19094.65GlcNAc kinase OS=Cucurbita maxima OX=3661 GN=LOC111480704 PE=3 SV=1[more]
A0A6J1FTG31.0e-18893.80GlcNAc kinase OS=Cucurbita moschata OX=3662 GN=LOC111447149 PE=3 SV=1[more]
A0A6J1C4944.1e-18591.55GlcNAc kinase OS=Momordica charantia OX=3673 GN=LOC111008237 PE=3 SV=1[more]
A0A6J1JC391.9e-18290.30GlcNAc kinase OS=Cucurbita maxima OX=3661 GN=LOC111483072 PE=3 SV=1[more]
A0A6J1EY194.2e-18291.09GlcNAc kinase OS=Cucurbita moschata OX=3662 GN=LOC111437512 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G30540.12.1e-13367.43Actin-like ATPase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002731ATPase, BadF/BadG/BcrA/BcrD typePFAMPF01869BcrAD_BadFGcoord: 32..221
e-value: 1.1E-31
score: 110.4
NoneNo IPR availableGENE3D3.30.420.40coord: 29..168
e-value: 3.7E-36
score: 126.4
NoneNo IPR availablePANTHERPTHR43190N-ACETYL-D-GLUCOSAMINE KINASEcoord: 1..238
NoneNo IPR availablePANTHERPTHR43190:SF1ACTIN-LIKE ATPASE SUPERFAMILY PROTEINcoord: 1..238
NoneNo IPR availableCDDcd00012NBD_sugar-kinase_HSP70_actincoord: 32..175
e-value: 5.17098E-4
score: 37.9536
IPR043129ATPase, nucleotide binding domainSUPERFAMILY53067Actin-like ATPase domaincoord: 155..248
IPR043129ATPase, nucleotide binding domainSUPERFAMILY53067Actin-like ATPase domaincoord: 30..150

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007595.1Tan0007595.1mRNA
Tan0007595.2Tan0007595.2mRNA
Tan0007595.3Tan0007595.3mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0046835 carbohydrate phosphorylation
molecular_function GO:0045127 N-acetylglucosamine kinase activity