CmoCh16G005200 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G005200
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: N-acetyl-D-glucosamine kinase
Location: Cmo_Chr16 : 2517063 .. 2521339 (-)
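The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state explicitly); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its components.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr16 : 2517063 .. 2521339 (-)")
print(loc["seqid"], loc["length"], loc["strand"])  # Cmo_Chr16 4277 -
```

Under that assumption the gene spans 4277 bases on the minus strand of Cmo_Chr16.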
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGAAGAAATATCGGAATGGGGAAATCTGGGAATTCGAGCGAGAAATGTCCGGCAGAACAGGCGGAGGAGGAGGAGGTGGAGGAGTTGGAGGTGTGATTCTTGGAATTGATGGAGGAACAACCTCCACCATCTGCGTTTGTGTTCCCCTTTTACCACTTCAGCAGTCTCTTCATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGGTACTTTCTCTCCCTTTGTGCCTATACTTTTCTTCAATTTTGACTGGTCTTTCATTTTCTTCTGCACATGTTTCTCACTATTCTTCCTGGATTGGTGGCTGTATTTTGCTAGATTTTTTAATGGGAAGTTGTAGGGATAGGTTCAACCTTCTGGTTTCATTTATTCTACAGAAATGCGAGTAGCCTCAATTTTATCTGATTTTTCGTTTGGCATAACAAAAGTACATGTAGCATGGCTATAATCTGTGAATTTATTTAGTTTTCATGAAAGAACTTTAAGTTTGAAGGGATTAAATAATTTCAAGTAGCATGGCTTTATGAAGTTAATGAGACTATTGGTAGGAATTAGCCAAAGGGTGGAGATGAAACTCATAAAAGTAAGGAGACCATACTAACACAATTACAAACATAACGATGAAATGCATAATCTAACCAAAAAGACCAATGAAAATTCAAAGTTGCAATGCAGCGTAGACTAACTGCAACTTTTAAAACCTAGTCATCTTTGTGTCAATCGAAGGCAACCTATGAGTTGTTTTTTTGACTGGCTTTACTGGTTTGATGTTGTGCCATTTGATATGTTTTCAAAAACTAAGATGCTGAGAGTTAAGACAGTAAACTCTTCGTTTGAATTAGTTAGGAGAAAAATGGCTGGTTGGTTTCTAAGTTCTTTATATGAGATTTTAGTATTGTAGGTCCTTGGAGTAAATTTGCTTCTCCCTTTCAGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCTCTTTCAAGATCAGGTTCAGATCGGTCTGCAGTTCAAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGACCAGCAAAGGATTTTGAATTGGCTCAGGTATGGGCCATGGAAAGATTCTTAGATGTTTCGATGAAACTATGTTCCTTCTGTCATTGGATTTATCGTGATGGAGCATGTTATATACTCATTTTTTAATCTAAAATCTTAGGATGACCATTGTTCTTTCTACTTAGTTCATAGAGTGTTGCGGACTAATTTTAGCTGTTTATACTATACATATCCCTTAGTGGTGTACTTATACCCAAATATTCATTAACGATCTGAGTTAGGTCTCAAATCAATTCCCAGGATCCAAGCTGCTATTCCATTCACAAGCTTTTTTGTTCTATTTCATATGATATCTGAATCTCATGTTTAGGGATCAAGTATGTTTTTATACGCTTCTGATTATGTGCAGAGATATGTTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCCGGCACTGGTAGTATTGCTTTTGGATTTACTGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGACTGGGGAAGGTCTCTTTCTCTCACTCCCTCACATGGACGCACACTTTTAAATTGGGTTATGATTGATCGTATGGGGATCATCTTATTTGTTTCCTAATACTTGTCTCATTATTCCTTGTTGATCATGTGGTGAATTTTAATTTTTACTCCCATAGTATCATTCTCAATATCGAGCACGTGTGTTCATAATCAGTCGAGGTTTTAGAGTAAAGTAACTATATGAAATACTATATAACTCATCGGAGTTGTTAAGATAATTTTGTTTCATTTTTGTTTGTCTGTTTCAATTATGTTTGTGTTTTAAAATTATGTTGCTGCACTCCATGCTGATCTATAT
TGATACATCTCTGAGAGTGTTGTGCATTACGTGCACATTTTTATTTTCTTTTGCCACTTTTCATTGAAGGTGATACTTTTTGTAATTTCAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTATGTACACTTGTCAATTGGCATTACCTATCACAAACTTCACTGTCTCCTCTTTTATATTCTTTGGTGCTGTTTTCGGACAATGTAAATTAAACACAGTAGAAGGAATAAAATATGGTATGAAAAGTACTGTATCGTGCATGATTGATTAGGAATGGATCACATATATTTTTGGTTATCATTGAAAATTAGGTGGACCTACGCAGATTCATCTTGGGCTCGAATTGCTGCCCTTGTTCCTGCTGTTGTGTCATGTGCAGAATCAGGGGACGAAGTTGCAAACAACATCTTGCAAGATGCAGTTAAGGAATTGGCTTTAAGCGTGAATGCCGTTGTTCAAAGACTCGGATTGTCTGGTTCAGGTACTGCAATAGTGGAAATCTTCGGTCGTAGTTAATGGACTTGATTCTCCTGTCCCCAAAGCTGGAAGTGCTTTTTTGCTATTAAACCCCCTTTGACACCTGTAACAGCCCACGCCCACTGCTACTAGATATTATTTTTTTTTTTGGGTTTTTCCTTTCGGGTTTCCCCTCAAGGTTTTTAAAACGCGTCTGCTAGGAAGAGGTTTCTACACCATTATAAAGAACACATTTGCTAGGGAGAGGTTTCCACACCATTATAAAGAACGCATTTGCTAGGGAGAGGTTTCCACACCATTATAAAGAACGCATTTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGCATCGTTCACCTGTCCAACCAATGTAGGATCTCACAATCCACCCCCTTTGGGGCCCACTGTCACCCTTATAAAGAATGCTTCGTTCTCCTCTCCAACCGACGTGGATCTCACAATCTACCCCCTTTCGGGGCCCAGCGTCCTCGTTGGCACTTGTTTCTCTCTTCAATCGACGTGGGATCTCACAATCCACTCCCATCGGGGCCTAGCGTCCTCGCTAGCACACCGCTCAATGTCTGGCTTTGATACCATTTGTAACAATCCAAGCCTATCGCTAGCAGATATTGTCCTCTTTGGGCTTCCCCTCAAGGTTTTTAAAACGCGTCTACTAGGGAGAGGTTTCCACACTCTTATAAAGCATGTTTTGTTCGCCTCTCCAACCGATGTGAGATCTCACAATCCACCTTCCTTCAGGACCCAACGTCCTTACTGGCACTCGTTCCTCTCTCCAATCAACGTGGGATCTCACAATCCACCTCCCTTCGAGGCCCAGCATCCTTGGTGGCACTCGTTTCTCTCTCCAATTGATGTGGGATCTCATATTCTTTGCGATAAAATTCGTCTCAACTTATTTCTCTTTGGATATGTACTTAAATGTGCAACAAAATTGATGGTGAATAATCTCAGTCTATGTGCCTTTAAAAGCTTTGGAGATGTTCTCTGTCTATTTATTATTTGTGGGTTTGTGTAAAATGCTAATGATCGTGGATATTCTTTGGAATTAGGATTCCTAGAAACTTCTTGTAGATTCAGTTAAGAAAAGAATGAAGGGAACCATGCAGGCAAATGAGGGTTCAATAACAGGATAATACTTTCCTTTTTGAATTTTTTAACCAAAGCTTTACCCGCAACTCTCATTGTTAGCATTAACATTGCAGATGGAAAGGGCTCTTTCCCCCTTGTCATGGTTGGTGGAGTTATTGAAGGAAATAAAGGATGGGGTATAGCGCAAGAAGTTATAAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTAATCATATTGTTTATTACCCTCTCTTATCTGCCAGTGTTTTTACAAATATTCAATGAAAACAT
GTTCAGTTATTGTTCTTAAATCTTTCCATGGACAGATTTGATTCTTAGAACAATAAAACATTGTTCTTTCTTCAAATCACAAGTATATAAACAGTGATTCCAGATGTGCAAAATAGCCCTGCACGTATCTGTTCTTTGTTCCCATGCTTTCCTTTATATGCATATGAGGCCTTGCAATAATCATCCATACATTTTCTTGAACTAAAACCAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAA

mRNA sequence

ATGACGAAGAAATATCGGAATGGGGAAATCTGGGAATTCGAGCGAGAAATGTCCGGCAGAACAGGCGGAGGAGGAGGAGGTGGAGGAGTTGGAGGTGTGATTCTTGGAATTGATGGAGGAACAACCTCCACCATCTGCGTTTGTGTTCCCCTTTTACCACTTCAGCAGTCTCTTCATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCTCTTTCAAGATCAGGTTCAGATCGGTCTGCAGTTCAAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGACCAGCAAAGGATTTTGAATTGGCTCAGAGATATGTTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCCGGCACTGGTAGTATTGCTTTTGGATTTACTGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTGGACCTACGCAGATTCATCTTGGGCTCGAATTGCTGCCCTTGTTCCTGCTGTTGTGTCATGTGCAGAATCAGGGGACGAAGTTGCAAACAACATCTTGCAAGATGCAGTTAAGGAATTGGCTTTAAGCGTGAATGCCGTTGTTCAAAGACTCGGATTGTCTGGTTCAGATGGAAAGGGCTCTTTCCCCCTTGTCATGGTTGGTGGAGTTATTGAAGGAAATAAAGGATGGGGTATAGCGCAAGAAGTTATAAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAA

Coding sequence (CDS)

ATGACGAAGAAATATCGGAATGGGGAAATCTGGGAATTCGAGCGAGAAATGTCCGGCAGAACAGGCGGAGGAGGAGGAGGTGGAGGAGTTGGAGGTGTGATTCTTGGAATTGATGGAGGAACAACCTCCACCATCTGCGTTTGTGTTCCCCTTTTACCACTTCAGCAGTCTCTTCATCTTCCTGATCCTCTTCCTCTTCTCGCCCGAGTCGAAGCCGGCTGCTCCAACCATAACAGCGTTGGCGAAACTGCTGCAAGGGAAACGTTGGAGCAAGTTATGGCTGAGGCTCTTTCAAGATCAGGTTCAGATCGGTCTGCAGTTCAAGCTATTTGTTTATCTGTTTCTGGTGTAAACCATCCAACGGACCAGCAAAGGATTTTGAATTGGCTCAGAGATATGTTTCCTAGCCATGTCAAACTCTATGTTCGAAATGATGCTGCGGCTGCTCTTGCAAGTGGTACCATGGGAAGGCTTAGTGGTTGTGTTCTAATTGCCGGCACTGGTAGTATTGCTTTTGGATTTACTGATGATGGAAGAGAAGCTCGAGCAGCTGGTGCAGGACCAATCTTAGGCGACTGGGGAAGTGGGTATGGCATATCAGCGCAGGCTTTAACTGCAATTATAAGAGCGCATGATGGTCGTGGTCCTCAAACAAAGCTCACAAATAGCATTCTTCAGACTCTTGGTCTTTCTTCTGCTGATGAACTCATTGGGTGGACCTACGCAGATTCATCTTGGGCTCGAATTGCTGCCCTTGTTCCTGCTGTTGTGTCATGTGCAGAATCAGGGGACGAAGTTGCAAACAACATCTTGCAAGATGCAGTTAAGGAATTGGCTTTAAGCGTGAATGCCGTTGTTCAAAGACTCGGATTGTCTGGTTCAGATGGAAAGGGCTCTTTCCCCCTTGTCATGGTTGGTGGAGTTATTGAAGGAAATAAAGGATGGGGTATAGCGCAAGAAGTTATAAACTGCATTTCCAAGGACTACCCTGGGGTAGTTCCCATTTGGCCTAAGGTGGAGCCTGCAATTGGGGCTGCATTGTTAGCCTGGAATTTCTTGAAAGATTCCCATCAAGAATAA
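The coding sequence can be checked against the protein shown in the BLAST alignments by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and, for brevity, translates only the first 30 and last 18 bases of the CDS above; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGACGAAGAAATATCGGAATGGGGAAATC"  # first 30 bases of the CDS
cds_end = "GATTCCCATCAAGAATAA"                # last 18 bases of the CDS

print(translate(cds_start))  # MTKKYRNGEI
print(translate(cds_end))    # DSHQE
```

Both fragments agree with the query sequence in the alignments, which begins MTKKYRNGEI and ends DSHQE at position 360.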
BLAST of CmoCh16G005200 vs. Swiss-Prot
Match: NAGK_DICDI (N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum GN=nagk PE=3 SV=2)

HSP 1 Score: 189.9 bits (481), Expect = 4.9e-47
Identity = 120/332 (36.14%), Positives = 181/332 (54.52%), Query Frame = 1

Query: 33  VILGIDGGTTSTICVCVPLLPLQQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQ- 92
           + +GIDGG T T  V V     +           LAR  + CSN++SVGE  A+  + + 
Sbjct: 5   IFIGIDGGGTKTSTVAVDSNGQE-----------LARHTSPCSNYHSVGEDLAKAAINEG 64

Query: 93  ------VMAEALSRSGSDRSAVQAICLSVSGVNHPTDQQRILNWLRDMFPSHVKLYVRND 152
                  + E ++   +    V +ICL +SGV+   D+  + +W+ ++    +   + ND
Sbjct: 65  IKYVIRKVKETITDDDNKEVTVGSICLGMSGVDREKDKLLVKSWVTELLGESINYSIHND 124

Query: 153 AAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREARAAGAGPILGDWGSGYGISAQALT 212
           A  AL+SGT G+L G V+I GTG I+ GF  +G   R+ G GP+LGD+GSGY I    L 
Sbjct: 125 AIVALSSGTQGKLFGVVIICGTGCISLGFNREGVSGRSGGWGPLLGDYGSGYQIGYDILR 184

Query: 213 AIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTY--ADSSWARIAALVPAVVSCAESG 272
            +++A D  GP+T LT  +L+ L L+  ++LI W Y     SW + A L P     A+ G
Sbjct: 185 HVLKAKDQVGPKTSLTQVLLEKLQLTKEEDLISWAYDPKTQSWQKFAQLSPLAFEQAQLG 244

Query: 273 DEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPLVMVGGVIEGNKGWGIAQEVIN 332
           DE++N IL DA   L   +N+V+++LGL   D +  FPLV  GG IE     GI  ++++
Sbjct: 245 DEISNLILVDAANALYDLINSVIKKLGL---DKEEKFPLVYTGGNIERK---GILSDLLS 304

Query: 333 -CISKDYPGVVPIWPKVEPAIGAALLAWNFLK 355
             I ++YP    +    +P++GAALLA N  K
Sbjct: 305 KKIMENYPNAEILNTTCDPSMGAALLALNSKK 319

BLAST of CmoCh16G005200 vs. Swiss-Prot
Match: MURK_CLOAB (N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) GN=murK PE=1 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 8.3e-31
Identity = 108/317 (34.07%), Positives = 158/317 (49.84%), Query Frame = 1

Query: 34  ILGIDGGTTSTICVCVPLLPLQQSLHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVM 93
           ++GIDGG + T      L              +L  V  G SN NS  +   +  L++++
Sbjct: 4   VIGIDGGGSKTHMKISTL-----------DYKVLLEVFKGPSNINSSTKEEVKRVLQELI 63

Query: 94  AEALSRSGSDRSAVQAICLSVSGVNHPTDQQRILNWLRDMFPSHVKLYVRNDAAAALASG 153
            E L + G       AIC+  +G +   D+  I + +R +     K+ V NDA  ALA G
Sbjct: 64  MEGLGKLGQSLEECSAICIGTAGADRTEDKSIIEDMIRSLGYMG-KIIVVNDAEIALAGG 123

Query: 154 TMGRLSGCVLIAGTGSIAFGFTDDGREARAAGAGPILGDWGSGYGISAQALTAIIRAHDG 213
              R  G ++I+GTGSI +G   +GR AR+ G G I+GD GSGY I  +A+ A +++ D 
Sbjct: 124 IEKR-EGIIVISGTGSICYGRNKEGRSARSGGWGHIIGDEGSGYDIGIKAIKAALKSFDK 183

Query: 214 RGPQTKLTNSILQTLGLSSADELIGWTYADS-SWARIAALVPAVVSCAESGDEVANNILQ 273
           RG +T L   IL  L L S ++LI + Y    +   IA+L   V S    GD V+  IL+
Sbjct: 184 RGEKTILEGDILDFLKLKSHEDLINYIYRSGVTKKEIASLTRVVNSAYIKGDLVSKRILK 243

Query: 274 DAVKELALSVNAVVQRLGLSGSDGKGSFPLVMVGGVIEGNKGWGIAQEVINCISKDYPGV 333
           +A +EL LSV AVV+ L +          L   GGVI  N    +  E    ++ +YP V
Sbjct: 244 EAARELFLSVKAVVEVLSMQNK----KVVLTTAGGVI--NNINYLYDEFRKFLNLNYPKV 301

Query: 334 VPIWPKVEPAIGAALLA 350
             I  K + A GA ++A
Sbjct: 304 KIISMKNDSAFGAVIIA 301

BLAST of CmoCh16G005200 vs. TrEMBL
Match: A0A0A0L083_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G062910 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 1.2e-174
Identity = 320/359 (89.14%), Positives = 333/359 (92.76%), Query Frame = 1

Query: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHL 60
           MTKK+RNGEI EF+RE+S    G  GG  VG VILGIDGGTTST CVC+P L    SLHL
Sbjct: 1   MTKKHRNGEISEFDRELSA---GTAGGRAVGDVILGIDGGTTSTTCVCLPFLH-PHSLHL 60

Query: 61  PDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHP 120
           PD LPLLARVEAGCSNHNSVGETAARETLEQVMAEALS+SG D SAV+AICLS+SGVNHP
Sbjct: 61  PDSLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSGLDLSAVRAICLSISGVNHP 120

Query: 121 TDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGRE 180
           TDQQRILNW RD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTGSIA+GFTDDGRE
Sbjct: 121 TDQQRILNWFRDKFPSHVKLYVRNDAAAALASGTMGKLSGCVLIAGTGSIAYGFTDDGRE 180

Query: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240
           ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT
Sbjct: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240

Query: 241 YADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSF 300
           YAD SWARIAALVPAVV+CAE+GDEVANNILQD+VKELALSV AVVQRLGL GSDGKGSF
Sbjct: 241 YADQSWARIAALVPAVVACAEAGDEVANNILQDSVKELALSVTAVVQRLGLCGSDGKGSF 300

Query: 301 PLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 360
           PLVMVGGV+EGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD  QE
Sbjct: 301 PLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDCEQE 355

BLAST of CmoCh16G005200 vs. TrEMBL
Match: E0CTZ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g01190 PE=4 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 1.8e-149
Identity = 270/357 (75.63%), Positives = 308/357 (86.27%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE EM     G         V+LG+DGGTTST+CVC+P  PL     LPD
Sbjct: 2   KRYRNGEIWDFEDEMPVSPDGSE-------VVLGLDGGTTSTVCVCMPFFPLSDR-PLPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           P+P+LAR  AGCSNHNSVGETAARETLEQVMA+ALS+SGS+RSAV+A+CL+VSGVNHPTD
Sbjct: 62  PVPVLARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTD 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
           QQRIL+WLRD+F SHVKLYV+NDA AALASGTMG L GCVLIAGTG+IA+GFT+DGREAR
Sbjct: 122 QQRILSWLRDIFSSHVKLYVQNDAVAALASGTMGELHGCVLIAGTGTIAYGFTEDGREAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGPILGDWGSGYGI+AQALTA++RAHDGRGPQT LT SIL+ L LSS DELIGWTYA
Sbjct: 182 AAGAGPILGDWGSGYGIAAQALTAVVRAHDGRGPQTALTYSILRALSLSSPDELIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCA++GDEVAN IL ++V+ELA SV AVVQRLGL G DGKGSFPL
Sbjct: 242 DPSWARIAALVPVVVSCADAGDEVANKILLESVEELASSVKAVVQRLGLCGEDGKGSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWN-FLKDSHQ 359
           VMVGGV+E NK W I +EV+NCI KDYPG +PI PKVEPA+GAALLAWN F+K++H+
Sbjct: 302 VMVGGVLEANKTWDIGKEVVNCIYKDYPGTLPIRPKVEPAVGAALLAWNFFMKEAHK 350

BLAST of CmoCh16G005200 vs. TrEMBL
Match: A0A0D2V1F7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G039600 PE=4 SV=1)

HSP 1 Score: 529.6 bits (1363), Expect = 2.9e-147
Identity = 269/358 (75.14%), Positives = 311/358 (86.87%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE E++  T           VILG+DGGTTST+C+C+P++P   +  LPD
Sbjct: 2   KRYRNGEIWDFEHEVAVATNRP--------VILGLDGGTTSTVCICMPIMPFSDA--LPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           PLP+LAR  AGCSNHNSVGETAARETLEQVMA+ALS+SGS+RSAV+A+CL+VSGVNHPTD
Sbjct: 62  PLPVLARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTD 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
           QQRIL WLRD+FP+ VKLYVRNDA AALASGTMG+L GCVLIAGTG+IA+GFT+DGREAR
Sbjct: 122 QQRILTWLRDIFPTQVKLYVRNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGP+LGDWGSGYGI+A ALTA+IRAHDGRGP T LT++ILQTLGLSSADELIGWTYA
Sbjct: 182 AAGAGPVLGDWGSGYGIAALALTAVIRAHDGRGPHTMLTSTILQTLGLSSADELIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCAE+GDEVAN IL++AV+ELALSV AVVQRLGL G+D K SFPL
Sbjct: 242 DPSWARIAALVPVVVSCAEAGDEVANKILKEAVQELALSVKAVVQRLGLCGADRKNSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWN-FLKDSHQE 360
           VMVGGV+E N+ W I +EV++ ISKDYPG  PI PKVEPA+GAALLA N F+K+  QE
Sbjct: 302 VMVGGVLEANQRWDIGREVMDFISKDYPGAHPIRPKVEPAVGAALLALNEFMKECVQE 349

BLAST of CmoCh16G005200 vs. TrEMBL
Match: A0A0B0P7N8_GOSAR (N-acetyl-D-glucosamine kinase OS=Gossypium arboreum GN=F383_02207 PE=4 SV=1)

HSP 1 Score: 528.1 bits (1359), Expect = 8.4e-147
Identity = 268/358 (74.86%), Positives = 311/358 (86.87%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE E++  T           VILG+DGGTTST+C+C+P++P   +  LPD
Sbjct: 2   KRYRNGEIWDFEHEVAVATNRP--------VILGLDGGTTSTVCICMPIMPFSDA--LPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           PLP+LAR  AGCSNHNSVGETAARETLEQVMA+ALS+SGS+RSAV+A+CL+VSGVNHPTD
Sbjct: 62  PLPVLARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTD 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
           QQRIL WLRD+FP+ VKLYV+NDA AALASGTMG+L GCVLIAGTG+IA+GFT+DGREAR
Sbjct: 122 QQRILTWLRDIFPTQVKLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGP+LGDWGSGYGI+A ALTA+IRAHDGRGP T LT++ILQTLGLSSADELIGWTYA
Sbjct: 182 AAGAGPVLGDWGSGYGIAALALTAVIRAHDGRGPHTMLTSTILQTLGLSSADELIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCAE+GDEVAN IL++AV+ELALSV AVVQRLGL G+D K SFPL
Sbjct: 242 DPSWARIAALVPVVVSCAEAGDEVANKILKEAVQELALSVKAVVQRLGLCGADRKNSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWN-FLKDSHQE 360
           VMVGGV+E N+ W I +EV++ ISKDYPG  PI PKVEPA+GAALLA N F+K+  QE
Sbjct: 302 VMVGGVLEANQRWDIGREVMDFISKDYPGAHPIRPKVEPAVGAALLALNEFMKECVQE 349

BLAST of CmoCh16G005200 vs. TrEMBL
Match: B9IA59_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14040g PE=4 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 3.2e-146
Identity = 264/359 (73.54%), Positives = 306/359 (85.24%), Query Frame = 1

Query: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHL 60
           M K+YRNGEIW+FE E+        G  G   VILG+DGGTTST+C+C+P+ P       
Sbjct: 1   MKKRYRNGEIWDFEHEI--------GELGNREVILGLDGGTTSTVCICMPIFPFSDPF-- 60

Query: 61  PDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHP 120
           PDPLP+LAR  AGCSNHNSVGETAARETLEQVMA+AL +SGS+RSAV+A+CLSVSGVNH 
Sbjct: 61  PDPLPVLARAVAGCSNHNSVGETAARETLEQVMADALLKSGSNRSAVRAVCLSVSGVNHS 120

Query: 121 TDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGRE 180
           TD+ R+LNWLR++FP+HVKLYV+NDA AAL+SGTMG+L GCVLIAGTG+IAFGFT+DGR+
Sbjct: 121 TDELRVLNWLREIFPTHVKLYVQNDAVAALSSGTMGKLHGCVLIAGTGTIAFGFTEDGRQ 180

Query: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240
           ARAAGAGP+LGDWGSGYGI+AQALTAI+RA+DGRGP T L+++ILQTLGLSS DELIGWT
Sbjct: 181 ARAAGAGPVLGDWGSGYGIAAQALTAIVRAYDGRGPVTILSSNILQTLGLSSPDELIGWT 240

Query: 241 YADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSF 300
           YAD SWARIAALVP VVSCAE+GD VA+ ILQD+V+ELALSV AVVQRLGL G DGK SF
Sbjct: 241 YADPSWARIAALVPVVVSCAEAGDRVAHEILQDSVEELALSVKAVVQRLGLCGEDGKASF 300

Query: 301 PLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 360
           PLVMVGGV+E NK W I +EV+N ISK YPGV+PI PKVEPA+GAALL WNFL    Q+
Sbjct: 301 PLVMVGGVLEANKRWDIGKEVVNHISKSYPGVLPIHPKVEPAVGAALLGWNFLMTESQK 349

BLAST of CmoCh16G005200 vs. TAIR10
Match: AT1G30540.1 (AT1G30540.1 Actin-like ATPase superfamily protein)

HSP 1 Score: 457.2 bits (1175), Expect = 9.2e-129
Identity = 235/356 (66.01%), Positives = 282/356 (79.21%), Query Frame = 1

Query: 1   MTKKYRNGEIWEFEREMSGRTGGGGG---GGGVGGVILGIDGGTTSTICVCVPLLPLQQS 60
           M   + NG + + E +     GGG      G V GVILG+DGG TST+CVCVP     + 
Sbjct: 1   MRNPHSNGNLRKLEAD-----GGGEATEENGFVNGVILGLDGGATSTVCVCVPFFSFGE- 60

Query: 61  LHLPDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGV 120
              PDPLP+L R  AGC+N NSVGETAAR++LEQV++EAL +SG D+S V+ +CL VSGV
Sbjct: 61  -RFPDPLPILGRAVAGCTNRNSVGETAARDSLEQVISEALVQSGFDKSDVRGVCLGVSGV 120

Query: 121 NHPTDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDD 180
           NHP+DQ++I NW+RDMFPSHVK+YV+NDA  ALASGTMG+L GCVLIAGTG IA+GF +D
Sbjct: 121 NHPSDQEKIENWIRDMFPSHVKVYVQNDAIVALASGTMGKLHGCVLIAGTGCIAYGFDED 180

Query: 181 GREARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELI 240
           G+EARA+G GPILGDWGSGYGI+AQALTA+IRAHDGRGPQT LT++IL+ LGLSS DELI
Sbjct: 181 GKEARASGGGPILGDWGSGYGIAAQALTAVIRAHDGRGPQTMLTSTILKALGLSSPDELI 240

Query: 241 GWTYADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGK 300
           GWTYAD SWARIAALVP VVSCAE+GDE+++ IL DA ++LALSV AVVQRLGL G DG 
Sbjct: 241 GWTYADPSWARIAALVPQVVSCAEAGDEISDKILVDAAEDLALSVKAVVQRLGLCGKDGT 300

Query: 301 GSFPLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL 354
            SFP+VMVGGV+  N+ W I +EV   I++ +PG   I PKVEPA+GAALLA NFL
Sbjct: 301 ASFPVVMVGGVLNANQKWDIGKEVSKRINRYFPGAQTIIPKVEPAVGAALLAMNFL 349

BLAST of CmoCh16G005200 vs. NCBI nr
Match: gi|659101891|ref|XP_008451846.1| (PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo])

HSP 1 Score: 632.1 bits (1629), Expect = 5.9e-178
Identity = 327/359 (91.09%), Positives = 337/359 (93.87%), Query Frame = 1

Query: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHL 60
           MTKK+RNGEI EFERE+SG   G GGG  VG VILGIDGGTTST+CVCVP L    SLHL
Sbjct: 1   MTKKHRNGEISEFERELSG---GTGGGRAVGDVILGIDGGTTSTVCVCVPFLH-PHSLHL 60

Query: 61  PDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHP 120
           PD  PLLARVEAGCSNHNSVGETAARETLEQVMAEALS+SGSD SAV+AICLSVSGVNHP
Sbjct: 61  PDSPPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSGSDLSAVRAICLSVSGVNHP 120

Query: 121 TDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGRE 180
           TDQQRILNW RD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTGSIA+GFTDDGRE
Sbjct: 121 TDQQRILNWFRDKFPSHVKLYVRNDAAAALASGTMGKLSGCVLIAGTGSIAYGFTDDGRE 180

Query: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240
           ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT
Sbjct: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240

Query: 241 YADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSF 300
           YAD SWARIAALVPAVVSCAE+GDEVANNILQD+VKELALSV AVVQRLGL GSDGKGSF
Sbjct: 241 YADQSWARIAALVPAVVSCAEAGDEVANNILQDSVKELALSVTAVVQRLGLCGSDGKGSF 300

Query: 301 PLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 360
           PLVMVGGV+EGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDS QE
Sbjct: 301 PLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSQQE 355

BLAST of CmoCh16G005200 vs. NCBI nr
Match: gi|449460020|ref|XP_004147744.1| (PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis sativus])

HSP 1 Score: 620.5 bits (1599), Expect = 1.8e-174
Identity = 320/359 (89.14%), Positives = 333/359 (92.76%), Query Frame = 1

Query: 1   MTKKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHL 60
           MTKK+RNGEI EF+RE+S    G  GG  VG VILGIDGGTTST CVC+P L    SLHL
Sbjct: 1   MTKKHRNGEISEFDRELSA---GTAGGRAVGDVILGIDGGTTSTTCVCLPFLH-PHSLHL 60

Query: 61  PDPLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHP 120
           PD LPLLARVEAGCSNHNSVGETAARETLEQVMAEALS+SG D SAV+AICLS+SGVNHP
Sbjct: 61  PDSLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSKSGLDLSAVRAICLSISGVNHP 120

Query: 121 TDQQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGRE 180
           TDQQRILNW RD FPSHVKLYVRNDAAAALASGTMG+LSGCVLIAGTGSIA+GFTDDGRE
Sbjct: 121 TDQQRILNWFRDKFPSHVKLYVRNDAAAALASGTMGKLSGCVLIAGTGSIAYGFTDDGRE 180

Query: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240
           ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT
Sbjct: 181 ARAAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWT 240

Query: 241 YADSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSF 300
           YAD SWARIAALVPAVV+CAE+GDEVANNILQD+VKELALSV AVVQRLGL GSDGKGSF
Sbjct: 241 YADQSWARIAALVPAVVACAEAGDEVANNILQDSVKELALSVTAVVQRLGLCGSDGKGSF 300

Query: 301 PLVMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDSHQE 360
           PLVMVGGV+EGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKD  QE
Sbjct: 301 PLVMVGGVLEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFLKDCEQE 355

BLAST of CmoCh16G005200 vs. NCBI nr
Match: gi|1000960325|ref|XP_015576283.1| (PREDICTED: N-acetyl-D-glucosamine kinase [Ricinus communis])

HSP 1 Score: 538.9 bits (1387), Expect = 6.8e-150
Identity = 271/358 (75.70%), Positives = 311/358 (86.87%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE E+           G   VILG+DGGTTST+C+C+P+LP   S  LPD
Sbjct: 2   KRYRNGEIWDFEHEIPV--------SGNNPVILGLDGGTTSTVCICMPILPF--STPLPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           PLP+LAR  AGCSNHNSVGETAARETLE+VMA+AL +SGS+RSAVQA+CL+VSGVNHP D
Sbjct: 62  PLPVLARAVAGCSNHNSVGETAARETLEEVMADALLKSGSNRSAVQAVCLAVSGVNHPND 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
            QRILNWLRD+FP+HVKLYV+NDA AALASGTMG+L GCVLIAGTG+IA+GFT+DG+EAR
Sbjct: 122 VQRILNWLRDIFPNHVKLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGKEAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGPILGDWGSGYGI+AQALTA++RA+DGRGPQT LT+SILQTLGLSS DELIGWTYA
Sbjct: 182 AAGAGPILGDWGSGYGIAAQALTAVVRAYDGRGPQTILTSSILQTLGLSSPDELIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCAE+GDEVAN ILQ +V+ELALSV AVVQRLGL G DG  SFPL
Sbjct: 242 DPSWARIAALVPVVVSCAEAGDEVANKILQVSVEELALSVKAVVQRLGLCGEDGNSSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNF-LKDSHQE 360
           VMVGGV+E NK W I +EV+NCI +DYPG +PI PKVEPA+GAALLAWNF +K+ H+E
Sbjct: 302 VMVGGVLEANKRWDIGKEVVNCIYRDYPGALPIRPKVEPAVGAALLAWNFSMKEIHKE 349

BLAST of CmoCh16G005200 vs. NCBI nr
Match: gi|359485331|ref|XP_002278295.2| (PREDICTED: N-acetyl-D-glucosamine kinase [Vitis vinifera])

HSP 1 Score: 537.0 bits (1382), Expect = 2.6e-149
Identity = 270/357 (75.63%), Positives = 308/357 (86.27%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE EM     G         V+LG+DGGTTST+CVC+P  PL     LPD
Sbjct: 2   KRYRNGEIWDFEDEMPVSPDGSE-------VVLGLDGGTTSTVCVCMPFFPLSDR-PLPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           P+P+LAR  AGCSNHNSVGETAARETLEQVMA+ALS+SGS+RSAV+A+CL+VSGVNHPTD
Sbjct: 62  PVPVLARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTD 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
           QQRIL+WLRD+F SHVKLYV+NDA AALASGTMG L GCVLIAGTG+IA+GFT+DGREAR
Sbjct: 122 QQRILSWLRDIFSSHVKLYVQNDAVAALASGTMGELHGCVLIAGTGTIAYGFTEDGREAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGPILGDWGSGYGI+AQALTA++RAHDGRGPQT LT SIL+ L LSS DELIGWTYA
Sbjct: 182 AAGAGPILGDWGSGYGIAAQALTAVVRAHDGRGPQTALTYSILRALSLSSPDELIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCA++GDEVAN IL ++V+ELA SV AVVQRLGL G DGKGSFPL
Sbjct: 242 DPSWARIAALVPVVVSCADAGDEVANKILLESVEELASSVKAVVQRLGLCGEDGKGSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWN-FLKDSHQ 359
           VMVGGV+E NK W I +EV+NCI KDYPG +PI PKVEPA+GAALLAWN F+K++H+
Sbjct: 302 VMVGGVLEANKTWDIGKEVVNCIYKDYPGTLPIRPKVEPAVGAALLAWNFFMKEAHK 350

BLAST of CmoCh16G005200 vs. NCBI nr
Match: gi|769809989|ref|XP_006845997.2| (PREDICTED: N-acetyl-D-glucosamine kinase [Amborella trichopoda])

HSP 1 Score: 533.1 bits (1372), Expect = 3.7e-148
Identity = 266/351 (75.78%), Positives = 306/351 (87.18%), Query Frame = 1

Query: 3   KKYRNGEIWEFEREMSGRTGGGGGGGGVGGVILGIDGGTTSTICVCVPLLPLQQSLHLPD 62
           K+YRNGEIW+FE EM     G         VILG+DGGTTST+C+C+ +    +   +PD
Sbjct: 2   KRYRNGEIWDFEVEMPLTPDGSE-------VILGLDGGTTSTVCICMAMPYFNE---VPD 61

Query: 63  PLPLLARVEAGCSNHNSVGETAARETLEQVMAEALSRSGSDRSAVQAICLSVSGVNHPTD 122
           PLP+LAR  AGCSNHNSVGETAARETLE+VMAEAL +SGS+RSAVQA+CL+VSGVNHPTD
Sbjct: 62  PLPVLARAVAGCSNHNSVGETAARETLEKVMAEALFKSGSNRSAVQAVCLAVSGVNHPTD 121

Query: 123 QQRILNWLRDMFPSHVKLYVRNDAAAALASGTMGRLSGCVLIAGTGSIAFGFTDDGREAR 182
           +QRILNWLRD+FPSHVKLYV+NDA AALASGTMG+L GCVLIAGTG+IA+GFT+DGR+AR
Sbjct: 122 EQRILNWLRDIFPSHVKLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGRQAR 181

Query: 183 AAGAGPILGDWGSGYGISAQALTAIIRAHDGRGPQTKLTNSILQTLGLSSADELIGWTYA 242
           AAGAGP+LGDWGSGYGI+AQALTA+++AHDGRGPQT LT+SI+Q LGLSS DE+IGWTYA
Sbjct: 182 AAGAGPVLGDWGSGYGIAAQALTAVVKAHDGRGPQTMLTSSIIQKLGLSSPDEVIGWTYA 241

Query: 243 DSSWARIAALVPAVVSCAESGDEVANNILQDAVKELALSVNAVVQRLGLSGSDGKGSFPL 302
           D SWARIAALVP VVSCAE+GDEVAN ILQD+V+ELA SV AVVQRLGLS  DG+ SFPL
Sbjct: 242 DPSWARIAALVPVVVSCAEAGDEVANRILQDSVQELAASVKAVVQRLGLSSEDGRDSFPL 301

Query: 303 VMVGGVIEGNKGWGIAQEVINCISKDYPGVVPIWPKVEPAIGAALLAWNFL 354
           VMVGGV+E NKGW I +EVINCISKD+PG  PI PKVEPA+GAA+LAWNFL
Sbjct: 302 VMVGGVLEANKGWDIGKEVINCISKDFPGARPIRPKVEPAVGAAMLAWNFL 342

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name  E-value  Identity (%)  Description
NAGK_DICDI  4.9e-47  36.14         N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum GN=nagk PE=3 SV=2
MURK_CLOAB  8.3e-31  34.07         N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (s...

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L083_CUCSA  1.2e-174  89.14         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G062910 PE=4 SV=1
E0CTZ3_VITVI      1.8e-149  75.63         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g01190 PE=4 SV=...
A0A0D2V1F7_GOSRA  2.9e-147  75.14         Uncharacterized protein OS=Gossypium raimondii GN=B456_012G039600 PE=4 SV=1
A0A0B0P7N8_GOSAR  8.4e-147  74.86         N-acetyl-D-glucosamine kinase OS=Gossypium arboreum GN=F383_02207 PE=4 SV=1
B9IA59_POPTR      3.2e-146  73.54         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14040g PE=4 SV=1

vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT1G30540.1  9.2e-129  66.01         Actin-like ATPase superfamily protein

vs. NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659101891|ref|XP_008451846.1|   5.9e-178  91.09         PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo]
gi|449460020|ref|XP_004147744.1|   1.8e-174  89.14         PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis sativus]
gi|1000960325|ref|XP_015576283.1|  6.8e-150  75.70         PREDICTED: N-acetyl-D-glucosamine kinase [Ricinus communis]
gi|359485331|ref|XP_002278295.2|   2.6e-149  75.63         PREDICTED: N-acetyl-D-glucosamine kinase [Vitis vinifera]
gi|769809989|ref|XP_006845997.2|   3.7e-148  75.78         PREDICTED: N-acetyl-D-glucosamine kinase [Amborella trichopoda]
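Summary rows like these are straightforward to rank or filter once the E-value and identity are read as numbers. The sketch below uses the five NCBI nr hits listed above; the 80% identity cutoff is an arbitrary illustrative threshold, not a value taken from this record.

```python
# Rows copied from the NCBI nr hits: (match name, E-value, identity %).
rows = [
    ("gi|659101891|ref|XP_008451846.1|", "5.9e-178", "91.09"),
    ("gi|449460020|ref|XP_004147744.1|", "1.8e-174", "89.14"),
    ("gi|1000960325|ref|XP_015576283.1|", "6.8e-150", "75.70"),
    ("gi|359485331|ref|XP_002278295.2|", "2.6e-149", "75.63"),
    ("gi|769809989|ref|XP_006845997.2|", "3.7e-148", "75.78"),
]

# Convert the text fields to floats so they compare numerically.
hits = [(name, float(evalue), float(identity)) for name, evalue, identity in rows]

# Lower E-value means a more significant hit, so sort ascending.
hits.sort(key=lambda hit: hit[1])

# Illustrative cutoff (assumption): keep hits with at least 80% identity.
MIN_IDENTITY = 80.0
close_homologs = [name for name, _, identity in hits if identity >= MIN_IDENTITY]
print(close_homologs)
```

With this cutoff only the two Cucumis entries (C. melo and C. sativus) remain, consistent with their being the closest relatives among the hits.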
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002731  ATPase_BadF
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016310      phosphorylation
biological_process  GO:0009813      flavonoid biosynthetic process
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016301      kinase activity
molecular_function  GO:0047893      flavonol 3-O-glucosyltransferase activity
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G005200.1  CmoCh16G005200.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                   Source   Source Term    Source Description                          Alignment
IPR002731  ATPase, BadF/BadG/BcrA/BcrD type  PFAM     PF01869        BcrAD_BadFG                                 coord: 35..349, score: 5.5
None       No IPR available                  unknown  Coil           Coil                                        coord: 354..359, scor
None       No IPR available                  PANTHER  PTHR12862      BADF TYPE ATPASE DOMAIN-CONTAINING PROTEIN  coord: 1..353, score: 2.8E
None       No IPR available                  PANTHER  PTHR12862:SF6  ACTIN-LIKE ATPASE SUPERFAMILY PROTEIN       coord: 1..353, score: 2.8E
None       No IPR available                  unknown  SSF53067       Actin-like ATPase domain                    coord: 33..154, score: 8.31E-18; coord: 159..354, score: 2.59