Cp4.1LG06g05630.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG06g05630.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: N-acetylglucosamine kinase, putative
Location: Cp4.1LG06 : 3411571 .. 3415618 (+)
Sequence length: 1366
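
The location field above packs the sequence ID, start, end and strand into one string. The sketch below shows one way to split it and compute the genomic span. The helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG06 : 3411571 .. 3415618 (+)")

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, strand, span)  # Cp4.1LG06 + 4048
```

Under that assumption the feature spans 4048 bases of the linkage group, which is longer than the listed sequence length of 1366 because the genomic span includes introns.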
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGTCTGTCCTCTAGAAACTTTAGCAGATGGCTCATTTGCCGGCCGAATCAGAGCCGAACAAAAATTTGTCCGTGATCATCCTCGATTCTTTAGCAAACAAATTCAAGAAGAAATCAAATTAACTATGAACAGCTTCAATTCAAGTTCGAATTCTGTCTCCTCCTCATTGTCGTTTCGTGTTCGTCCTTGCCTTTGCTCTCGCTATCTCTTACTGTAATTTATTTGTAATTTCTTTGTATATATGATCATGAACTCTAGGTGTTTTGACTGCGAGAGTCAATTTTGGTTGTGTTTTTTTAACTTTTGATCCAATTTGTCACTGTGTTGCTCTTGATCGAGGCGGCTGGAGCAGAGATGAAGAGGTGTAGAAATGGCGAACTCTGGGATTTTGAGCACGAGATTCTTGGCGGAGATGATATTATACTTGGAATCGACGGCGGCACCACTTCCACTGTTTGTGTTTGTATTGCTCTTTCCGATCCTCGAGCTATTTCTCCTTCAATGGCTTGTCCTATGCTCGCTCGTGTTGTTGGTGGCTGCTCAAACCATAATAGCGTTGGCGGTACTTCACTATCTTTGTCACTTCTCCCTGATTTAGCTATGTTTTCCGCCATTGCAGCATCAAATTGGACCTTGAATTGACTTTACTTCTTTAAAGAGCCATCGCTTGTATGATCTCGTCCCAAATGGCACAAACGAGTAAACACTTCAGACGATTGAAGGGTTGATTAGAACTCCTTATCGCGTTTTGTTAAGTATCAAATTTAGTTGTCTAGTGTCCGTGTTGATCTCTATTGTAAGCATATCTGATTCCTTCATCATGAATTTGAATGCTAAAAATATGATAACCTTCTGCCATCTCTAGAAACTGCTGCGAGGGAAACACTGGAGCAAGTTATGGCGGAGGCACTTTCAAAGTCTTGTTCAATTCGATCTTCAGTTCGAGCTGTGTGCCTAGCTGTTTCTGGAGTCAACCATCCAACGGATCAAGAAAGAATTCTGAATTGGCTAAGGTATTCATTCTTCTATTAACTGTGAATCCGTAGAGTTTTAAATTTTATTCTTTCTTTTTGTTCGTCATTGGTTAGCCGCTTTCTATTTATACTCTTACCATCCCCTCTAATGGGATTAGGTAATGCGTAGGAATAAAACATGAGCTCAACGAGACTTGATATCATGAACTGCAAATGAACAGGGATATCTTTCCGTGCCATGTAAACCTGTATGTTCAAAATGATGCTGTGGCGGCTCTTGCAAGTGGCACTATGGGAAAGCTCCATGGATGCGTTTTAATTGCTGGAACTGGAACAATTGCATATGGATTCACAGAGGATGGTCGAGAAGCTCGAGCTGCTGGTGCAGGACCAATCTTAGGTGATTGGGGAAGGTGGATTTTTCATTTTGTTCACTAATCCTCTCTCTCTCTCTCTCTCTCACACACACACACTGTTAAAATTCTGGCATAGGTTGGTGTAAACAAAATTGAGGTATAATTATTTAGCCTGATCTAGGAGACCCTTCTGCGCTTTCTAAATTTTCTGATGAACATTTGGTACCAAGTGTAATAGCCCAAGCCTAACCCCACCGGTCACAGACATGGTGCATTTTCTTTGAGTTTTCTCTTTTGGGCTTTTCTCTCAAGATTTTTAAAACGCGTGGAGAGGTTTCCACACCTTTATAAAGGATGTGGGACCTCACAATCCACTCACTTCGGGGTCGAGCGTCCTCGCTAGCACTTGTTCCCCTCTCCAATCGATGTGGGACCCCCCAATCCACCCCTTTGGGGCCTAGCGTCCTCGTGGCACACCGCCTGGTGTCCACCCACTTCGGAGTTTAGACTCCTCACTGGCACATCGATTAGTGTTTGGCTCCGAGTCTCTTATATCATTTGTAACTGCTGAAGCCCACTGGTAACAGATATTGTCTTCTTTGGGTTTTTCTCAAAGTTTACCATTTGTAACAGCTCAATCCCACCACTAACAGATATTATCCTCTTTGG
ATTTTCCTTATGATTTTTAAAACGTGTCTGCTAGGGAGAGGTTTTCACGCACTTATAAAGAATTCTTCATTCCTCTCTCCAACTAACGTGGGATGTCGAGTAAGTCTGAAAAGTAGTTTAAACATGCCCCTTCCTCTAGAGTCTGGCTAAAGGTCGATGCTCTTGAACAATCTGTTTCCGGTGCTTAATCATTTTATTTTTTTATCTTTCATTCTTTTTTGTAAAAAAGAGTAGTAGTGGGATGAACAGACCCAATATATGTTAATTTTTAACTCTTGATCTATCGCACCACTGTTTGAAAACTGATGTTTATGTTCTTGCAGTGGATATGGGATAGCTGCACAGGCGTTAACCGCAATAATTAGGGCTCATGATGGACGTGGTCCTCATACAATGCTCACTTACAGCATTTTGAAGACACTTGATCTTTCTTCACCAGATGAACTAATAGGGTATGTATTTAAAATACTTAAGATTGCAGTTATATTAAGTCTCATGTTATCTGGTCATTATTTTACCATCAATATTAATATTAGGTGGACATATGCGGATCCATCCTGGGCTCGAATTGCTGCTCTCGTTCCAGTTATTGTAGCATGTGCTGAGGCAGGCGATGAGGTTGCTGACAACATCCTCCTCGACTCAGTTGAAGAATTGGCTTCAAGTGTGAAGGCTGTTATTCAAAGACTCGGCCTGGCTGGTGAAGGTATGAAATCAAACAGTATTCATTATTAGCACTATGTTTATCCCTTGAGATAATTACACTTTGTTTTGCAATCTTCTTCATCTGTTTGAGTTATAATCCCTTGTGCAGATGGACAAGAGGCTTTTCCGCTTGTTATGGTTGGTGGTGTACTCGAAGCAAAAAGAAGGTGGGACATAGCAAAAAAAGTCATAAATTCAATATCCAAAGAATACCCCGGAGTTCTTCCTGTTTGGCCCAAGGTAGTCTTATTTTCCTCTCTTTACTTTGCCTTATTGCAATTGTGGCGTGAGATCCTACGTCGGTTGGAGAGGGGAACGAAACATTCTTTATAAGAGTGTGGAAACCTCTCTCTAGAGACGCGTTTTAAAAACCTTGAGGGGAAAACCCGAAAGGGAAAGCCCAAAAAGAACAATATTTGCTACTGGTAGGCTTGGACTATTACAAATGGTATCAGAGTCAGACACCAGGCGGTGTGCCAACGAGGACGCCAGACTCCCAAAGGGGGTGGATTGTGAGATCCCATATCGACTGGAGAGAGGAACGAGTGCCAGAGAGGACACCGTCCCTAAAAGGGGGGTAAATTGTGAGATCCCCACATTGATTGGAGAGGGAAATGAGTGCCGAAGAGAATGCTAGGCTCCGAAATGAAACATTCTTTAGAAGGGTGTGGAAACCTCTCTCTATCATTTTTAAAAAAAATCTTTGAGAAAAAGTTCAAAGTGGACAATATTTGCTAGCGGTGGGCTTGGATTTCATTCATTTACTCATTAATTTAGAGAATGAGAACCATCTTCTAGATACCCTCGTGTTCGTGAGCTCACAATTGAAGAGAAGCAACCGGTGCCTAAGTGGTTGAGTGTATAGTTATCCCACATCATTTGCTTAAAATGGGAATCAGAAACGAAAGTGATCCCTTTGTTTCTTACGGTTCAAAGTCCTCATTGTTCTTAACAGGTGGAACCGGCACTTGGTGCAGCATTACTAGCCTGGAATTTTTTGAGCAAGGATTATCAGCAGGAAGGAATATAGAGGGTGTTCTTTCTTTAGCTTCACGAGAAAGGGCTGATGAACATGAACTGTACAGTTTGATTGTCAGACCAAACCTAATTAGCGATGTTGGAAAGTTAAGGTAGAAACTTGAAGTAGTCTTAGCTTGTGTTAGTGTCCCAAGGTCTTCGTTGATTTTCATCGTTATTTTAGCTCTTCATGTTAAGATAGTATATATTTTTGAGTGATCCAATCTCTCGTCTGTATGGTAAACTGGATATGGAGGGAAAATCCAATTGAATGAAAC
CTTCTTTGATCCTCAGAACATTTTAAAATGGTTGGTGTATAAAATATC

mRNA sequence

ATGTCTGCGGCTGGAGCAGAGATGAAGAGGTGTAGAAATGGCGAACTCTGGGATTTTGAGCACGAGATTCTTGGCGGAGATGATATTATACTTGGAATCGACGGCGGCACCACTTCCACTGTTTGTGTTTGTATTGCTCTTTCCGATCCTCGAGCTATTTCTCCTTCAATGGCTTGTCCTATGCTCGCTCGTGTTGTTGGTGGCTGCTCAAACCATAATAGCGTTGGCGAAACTGCTGCGAGGGAAACACTGGAGCAAGTTATGGCGGAGGCACTTTCAAAGTCTTGTTCAATTCGATCTTCAGTTCGAGCTGTGTGCCTAGCTGTTTCTGGAGTCAACCATCCAACGGATCAAGAAAGAATTCTGAATTGGCTAAGGGATATCTTTCCGTGCCATGTAAACCTGTATGTTCAAAATGATGCTGTGGCGGCTCTTGCAAGTGGCACTATGGGAAAGCTCCATGGATGCGTTTTAATTGCTGGAACTGGAACAATTGCATATGGATTCACAGAGGATGGTCGAGAAGCTCGAGCTGCTGGTGCAGGACCAATCTTAGGTGATTGGGGAAGTGGATATGGGATAGCTGCACAGGCGTTAACCGCAATAATTAGGGCTCATGATGGACGTGGTCCTCATACAATGCTCACTTACAGCATTTTGAAGACACTTGATCTTTCTTCACCAGATGAACTAATAGGGTGGACATATGCGGATCCATCCTGGGCTCGAATTGCTGCTCTCGTTCCAGTTATTGTAGCATGTGCTGAGGCAGGCGATGAGGTTGCTGACAACATCCTCCTCGACTCAGTTGAAGAATTGGCTTCAAGTGTGAAGGCTGTTATTCAAAGACTCGGCCTGGCTGGTGAAGATGGACAAGAGGCTTTTCCGCTTGTTATGGTTGGTGGTGTACTCGAAGCAAAAAGAAGGTGGGACATAGCAAAAAAAGTCATAAATTCAATATCCAAAGAATACCCCGGAGTGGAACCGGCACTTGGTGCAGCATTACTAGCCTGGAATTTTTTGAGCAAGGATTATCAGCAGGAAGGAATATAGAGGGTGTTCTTTCTTTAGCTTCACGAGAAAGGGCTGATGAACATGAACTGTACAGTTTGATTGTCAGACCAAACCTAATTAGCGATGTTGGAAAGTTAAGGTAGAAACTTGAAGTAGTCTTAGCTTGTGTTAGTGTCCCAAGGTCTTCGTTGATTTTCATCGTTATTTTAGCTCTTCATGTTAAGATAGTATATATTTTTGAGTGATCCAATCTCTCGTCTGTATGGTAAACTGGATATGGAGGGAAAATCCAATTGAATGAAACCTTCTTTGATCCTCAGAACATTTTAAAATGGTTGGTGTATAAAATATC

Coding sequence (CDS)

ATGTCTGCGGCTGGAGCAGAGATGAAGAGGTGTAGAAATGGCGAACTCTGGGATTTTGAGCACGAGATTCTTGGCGGAGATGATATTATACTTGGAATCGACGGCGGCACCACTTCCACTGTTTGTGTTTGTATTGCTCTTTCCGATCCTCGAGCTATTTCTCCTTCAATGGCTTGTCCTATGCTCGCTCGTGTTGTTGGTGGCTGCTCAAACCATAATAGCGTTGGCGAAACTGCTGCGAGGGAAACACTGGAGCAAGTTATGGCGGAGGCACTTTCAAAGTCTTGTTCAATTCGATCTTCAGTTCGAGCTGTGTGCCTAGCTGTTTCTGGAGTCAACCATCCAACGGATCAAGAAAGAATTCTGAATTGGCTAAGGGATATCTTTCCGTGCCATGTAAACCTGTATGTTCAAAATGATGCTGTGGCGGCTCTTGCAAGTGGCACTATGGGAAAGCTCCATGGATGCGTTTTAATTGCTGGAACTGGAACAATTGCATATGGATTCACAGAGGATGGTCGAGAAGCTCGAGCTGCTGGTGCAGGACCAATCTTAGGTGATTGGGGAAGTGGATATGGGATAGCTGCACAGGCGTTAACCGCAATAATTAGGGCTCATGATGGACGTGGTCCTCATACAATGCTCACTTACAGCATTTTGAAGACACTTGATCTTTCTTCACCAGATGAACTAATAGGGTGGACATATGCGGATCCATCCTGGGCTCGAATTGCTGCTCTCGTTCCAGTTATTGTAGCATGTGCTGAGGCAGGCGATGAGGTTGCTGACAACATCCTCCTCGACTCAGTTGAAGAATTGGCTTCAAGTGTGAAGGCTGTTATTCAAAGACTCGGCCTGGCTGGTGAAGATGGACAAGAGGCTTTTCCGCTTGTTATGGTTGGTGGTGTACTCGAAGCAAAAAGAAGGTGGGACATAGCAAAAAAAGTCATAAATTCAATATCCAAAGAATACCCCGGAGTGGAACCGGCACTTGGTGCAGCATTACTAGCCTGGAATTTTTTGAGCAAGGATTATCAGCAGGAAGGAATATAG

Protein sequence

MSAAGAEMKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIAALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPGVEPALGAALLAWNFLSKDYQQEGI
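
As a rough illustration of how the protein sequence relates to the coding sequence, the sketch below translates codons with the standard genetic code. It assumes the standard nuclear code applies to this transcript; `translate` and `CODON_TABLE` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGTCTGCGGCTGGAGCAGAGATGAAGAGG"))  # MSAAGAEMKR
# Last 12 bases of the CDS, ending in the TAG stop codon.
print(translate("GAAGGAATATAG"))  # EGI
```

Both fragments agree with the start (MSAAGAEMKR) and end (EGI) of the protein sequence shown above.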
BLAST of Cp4.1LG06g05630.1 vs. Swiss-Prot
Match: NAGK_DICDI (N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum GN=nagk PE=3 SV=2)

HSP 1 Score: 197.2 bits (500), Expect = 3.0e-49
Identity = 120/329 (36.47%), Positives = 183/329 (55.62%), Query Frame = 1

Query: 28  DIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVGGCSNHNSVGETAARETLEQ- 87
           +I +GIDGG T T  V +  +             LAR    CSN++SVGE  A+  + + 
Sbjct: 4   EIFIGIDGGGTKTSTVAVDSNGQE----------LARHTSPCSNYHSVGEDLAKAAINEG 63

Query: 88  ------VMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRDIFPCHVNLYVQND 147
                  + E ++   +   +V ++CL +SGV+   D+  + +W+ ++    +N  + ND
Sbjct: 64  IKYVIRKVKETITDDDNKEVTVGSICLGMSGVDREKDKLLVKSWVTELLGESINYSIHND 123

Query: 148 AVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALT 207
           A+ AL+SGT GKL G V+I GTG I+ GF  +G   R+ G GP+LGD+GSGY I    L 
Sbjct: 124 AIVALSSGTQGKLFGVVIICGTGCISLGFNREGVSGRSGGWGPLLGDYGSGYQIGYDILR 183

Query: 208 AIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADP---SWARIAALVPVIVACAEA 267
            +++A D  GP T LT  +L+ L L+  ++LI W Y DP   SW + A L P+    A+ 
Sbjct: 184 HVLKAKDQVGPKTSLTQVLLEKLQLTKEEDLISWAY-DPKTQSWQKFAQLSPLAFEQAQL 243

Query: 268 GDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVI 327
           GDE+++ IL+D+   L   + +VI++LGL   D +E FPLV  GG +E  R+  ++  + 
Sbjct: 244 GDEISNLILVDAANALYDLINSVIKKLGL---DKEEKFPLVYTGGNIE--RKGILSDLLS 303

Query: 328 NSISKEYPGVE-------PALGAALLAWN 340
             I + YP  E       P++GAALLA N
Sbjct: 304 KKIMENYPNAEILNTTCDPSMGAALLALN 316

BLAST of Cp4.1LG06g05630.1 vs. Swiss-Prot
Match: MURK_CLOAB (N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) GN=murK PE=1 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 3.2e-27
Identity = 101/317 (31.86%), Positives = 157/317 (49.53%), Query Frame = 1

Query: 30  ILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVGGCSNHNSVGETAARETLEQVMA 89
           ++GIDGG + T      L             +L  V  G SN NS  +   +  L++++ 
Sbjct: 4   VIGIDGGGSKTHMKISTLD----------YKVLLEVFKGPSNINSSTKEEVKRVLQELIM 63

Query: 90  EALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRDIFPCHVNLYVQNDAVAALASGT 149
           E L K         A+C+  +G +   D+  I + +R +      + V NDA  ALA G 
Sbjct: 64  EGLGKLGQSLEECSAICIGTAGADRTEDKSIIEDMIRSLGYMG-KIIVVNDAEIALAGG- 123

Query: 150 MGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDGR 209
           + K  G ++I+GTG+I YG  ++GR AR+ G G I+GD GSGY I  +A+ A +++ D R
Sbjct: 124 IEKREGIIVISGTGSICYGRNKEGRSARSGGWGHIIGDEGSGYDIGIKAIKAALKSFDKR 183

Query: 210 GPHTMLTYSILKTLDLSSPDELIGWTY-ADPSWARIAALVPVIVACAEAGDEVADNILLD 269
           G  T+L   IL  L L S ++LI + Y +  +   IA+L  V+ +    GD V+  IL +
Sbjct: 184 GEKTILEGDILDFLKLKSHEDLINYIYRSGVTKKEIASLTRVVNSAYIKGDLVSKRILKE 243

Query: 270 SVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVL-EAKRRWDIAKKVINSISKEYPGV 329
           +  EL  SVKAV++ L +      +   L   GGV+      +D  +K +N     YP V
Sbjct: 244 AARELFLSVKAVVEVLSMQ----NKKVVLTTAGGVINNINYLYDEFRKFLN---LNYPKV 301

Query: 330 -------EPALGAALLA 338
                  + A GA ++A
Sbjct: 304 KIISMKNDSAFGAVIIA 301

BLAST of Cp4.1LG06g05630.1 vs. TrEMBL
Match: E0CTZ3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g01190 PE=4 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 1.6e-147
Identity = 271/350 (77.43%), Positives = 301/350 (86.00%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEIL---GGDDIILGIDGGTTSTVCVCIA---LSDPRAISPSMACPM 67
           MKR RNGE+WDFE E+     G +++LG+DGGTTSTVCVC+    LSD     P    P+
Sbjct: 1   MKRYRNGEIWDFEDEMPVSPDGSEVVLGLDGGTTSTVCVCMPFFPLSDRPLPDP---VPV 60

Query: 68  LARVVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERI 127
           LAR V GCSNHNSVGETAARETLEQVMA+ALSKS S RS+VRAVCLAVSGVNHPTDQ+RI
Sbjct: 61  LARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTDQQRI 120

Query: 128 LNWLRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGA 187
           L+WLRDIF  HV LYVQNDAVAALASGTMG+LHGCVLIAGTGTIAYGFTEDGREARAAGA
Sbjct: 121 LSWLRDIFSSHVKLYVQNDAVAALASGTMGELHGCVLIAGTGTIAYGFTEDGREARAAGA 180

Query: 188 GPILGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSW 247
           GPILGDWGSGYGIAAQALTA++RAHDGRGP T LTYSIL+ L LSSPDELIGWTYADPSW
Sbjct: 181 GPILGDWGSGYGIAAQALTAVVRAHDGRGPQTALTYSILRALSLSSPDELIGWTYADPSW 240

Query: 248 ARIAALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVG 307
           ARIAALVPV+V+CA+AGDEVA+ ILL+SVEELASSVKAV+QRLGL GEDG+ +FPLVMVG
Sbjct: 241 ARIAALVPVVVSCADAGDEVANKILLESVEELASSVKAVVQRLGLCGEDGKGSFPLVMVG 300

Query: 308 GVLEAKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKD 345
           GVLEA + WDI K+V+N I K+YPG       VEPA+GAALLAWNF  K+
Sbjct: 301 GVLEANKTWDIGKEVVNCIYKDYPGTLPIRPKVEPAVGAALLAWNFFMKE 347

BLAST of Cp4.1LG06g05630.1 vs. TrEMBL
Match: W9SHX8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012708 PE=4 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 1.6e-147
Identity = 271/346 (78.32%), Positives = 299/346 (86.42%), Query Frame = 1

Query: 9   KRCRNGELWDFEHE---ILGGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARV 68
           KR RNGE+WDFEHE   + G  D+ILG+DGGTTSTVC+C+ +  P + SPS   P+LAR 
Sbjct: 3   KRNRNGEIWDFEHEMPVVAGAGDVILGLDGGTTSTVCICMPII-PFSDSPSDPPPVLARA 62

Query: 69  VGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWL 128
           V GCSNHNSVGE AARETLE+VMA+AL KS S RS+VRAVCLAVSGVNHPTDQ+RILNWL
Sbjct: 63  VAGCSNHNSVGEAAARETLEKVMADALLKSGSNRSAVRAVCLAVSGVNHPTDQQRILNWL 122

Query: 129 RDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 188
           R IFP HV LYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL
Sbjct: 123 RYIFPSHVGLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 182

Query: 189 GDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIA 248
           GDWGSGYGIAAQALTA+I+AHDGRG  TMLT SIL+TL LSSPDELIGWTYADPSWARIA
Sbjct: 183 GDWGSGYGIAAQALTAVIKAHDGRGLETMLTSSILETLGLSSPDELIGWTYADPSWARIA 242

Query: 249 ALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLE 308
           ALVPV+V+CAEAGD+VA+ IL DSV +LASSVKAV+QRLGL GEDG+ +FPLVMVGGVLE
Sbjct: 243 ALVPVVVSCAEAGDDVANRILYDSVHDLASSVKAVVQRLGLCGEDGKHSFPLVMVGGVLE 302

Query: 309 AKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKD 345
           A +RWDI K+VI  ISK YPG       VEPA+GAALLAWNF  K+
Sbjct: 303 ANKRWDIGKEVIRCISKYYPGTIAIRPKVEPAVGAALLAWNFFMKE 347

BLAST of Cp4.1LG06g05630.1 vs. TrEMBL
Match: A0A067KRZ1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05213 PE=4 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 1.3e-144
Identity = 267/351 (76.07%), Positives = 294/351 (83.76%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEIL--GGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARV 67
           MKR RNGE+WDFEHEI   G   +ILG+DGGTTSTVC+C+ +       P    P+LAR 
Sbjct: 1   MKRNRNGEIWDFEHEIAVAGNRQVILGVDGGTTSTVCICMPILPFSNPLPD-PLPVLARA 60

Query: 68  VGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWL 127
           V GCSNHNSVGETAARETLEQVMA+ALSKS   RS+V+AVCLAVSGVNHPTD++RIL+WL
Sbjct: 61  VAGCSNHNSVGETAARETLEQVMADALSKSGFNRSAVQAVCLAVSGVNHPTDEQRILDWL 120

Query: 128 RDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 187
           RDIFP HV LYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL
Sbjct: 121 RDIFPTHVKLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 180

Query: 188 GDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIA 247
           GDWGSGYGIAAQAL A+IRAHDGRGP T+LT SIL  L L SPDELIGWTYADPSWARIA
Sbjct: 181 GDWGSGYGIAAQALAAVIRAHDGRGPQTLLTSSILHALGLCSPDELIGWTYADPSWARIA 240

Query: 248 ALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLE 307
           ALVPVIV+CAEAGDE A+ IL  SVEELA SVKAV+QRLGL G DG  +FPLVMVGGVLE
Sbjct: 241 ALVPVIVSCAEAGDEEANRILQYSVEELALSVKAVVQRLGLCGIDGNASFPLVMVGGVLE 300

Query: 308 AKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKDYQQEG 350
           A +RWDI K+V+N IS++YPG       VEPA+GAAL  WNFL K+  +EG
Sbjct: 301 ANKRWDIGKEVVNCISRDYPGALLIRPKVEPAVGAALSGWNFLMKETNREG 350

BLAST of Cp4.1LG06g05630.1 vs. TrEMBL
Match: A0A059DD75_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00888 PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 3.8e-144
Identity = 264/351 (75.21%), Positives = 304/351 (86.61%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHE--ILGG-DDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLAR 67
           MKR RNGE+WDFEHE  ++GG D+++LG+DGGTTSTVC+C+ L       P    P+LAR
Sbjct: 1   MKRYRNGEIWDFEHEMPVVGGNDEVVLGLDGGTTSTVCICMPLLRVADPFPD-PLPVLAR 60

Query: 68  VVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNW 127
            V GCSNHNSVGE AARETLEQVMA+AL+KS S RS+VRAVCLAVSGVNHPTDQ+RI+NW
Sbjct: 61  AVAGCSNHNSVGEAAARETLEQVMADALAKSGSNRSAVRAVCLAVSGVNHPTDQQRIVNW 120

Query: 128 LRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPI 187
           LR++FP +V LYVQNDAVAALASGT+GKLHGCVLIAGTGTIAYGFTEDGREARAAGAGP 
Sbjct: 121 LREMFPSYVKLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPT 180

Query: 188 LGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARI 247
           LGDWGSGYGIAAQALTA+IRA+DGRGP T LT SIL+ + LSSPDELIGWTYADPSWARI
Sbjct: 181 LGDWGSGYGIAAQALTAVIRAYDGRGPETNLTSSILEKIGLSSPDELIGWTYADPSWARI 240

Query: 248 AALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVL 307
           AALVPV+V+CAEAGDEVA+ IL +SV+ELA SVKAV++RL L GEDG+++FPLVMVGGVL
Sbjct: 241 AALVPVVVSCAEAGDEVANRILFESVQELALSVKAVVERLRLCGEDGKDSFPLVMVGGVL 300

Query: 308 EAKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKDYQQE 349
           EAK+RWDI K+VIN ISKE+PG       VEPA+GAALLA NF  K++ +E
Sbjct: 301 EAKKRWDIGKEVINCISKEFPGVFPIRPKVEPAVGAALLARNFYMKEFCKE 350

BLAST of Cp4.1LG06g05630.1 vs. TrEMBL
Match: B9IA59_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14040g PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 3.8e-144
Identity = 266/352 (75.57%), Positives = 301/352 (85.51%), Query Frame = 1

Query: 9   KRCRNGELWDFEHEI--LGGDDIILGIDGGTTSTVCVCIAL---SDPRAISPSMACPMLA 68
           KR RNGE+WDFEHEI  LG  ++ILG+DGGTTSTVC+C+ +   SDP    P    P+LA
Sbjct: 3   KRYRNGEIWDFEHEIGELGNREVILGLDGGTTSTVCICMPIFPFSDP---FPD-PLPVLA 62

Query: 69  RVVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILN 128
           R V GCSNHNSVGETAARETLEQVMA+AL KS S RS+VRAVCL+VSGVNH TD+ R+LN
Sbjct: 63  RAVAGCSNHNSVGETAARETLEQVMADALLKSGSNRSAVRAVCLSVSGVNHSTDELRVLN 122

Query: 129 WLRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGP 188
           WLR+IFP HV LYVQNDAVAAL+SGTMGKLHGCVLIAGTGTIA+GFTEDGR+ARAAGAGP
Sbjct: 123 WLREIFPTHVKLYVQNDAVAALSSGTMGKLHGCVLIAGTGTIAFGFTEDGRQARAAGAGP 182

Query: 189 ILGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWAR 248
           +LGDWGSGYGIAAQALTAI+RA+DGRGP T+L+ +IL+TL LSSPDELIGWTYADPSWAR
Sbjct: 183 VLGDWGSGYGIAAQALTAIVRAYDGRGPVTILSSNILQTLGLSSPDELIGWTYADPSWAR 242

Query: 249 IAALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGV 308
           IAALVPV+V+CAEAGD VA  IL DSVEELA SVKAV+QRLGL GEDG+ +FPLVMVGGV
Sbjct: 243 IAALVPVVVSCAEAGDRVAHEILQDSVEELALSVKAVVQRLGLCGEDGKASFPLVMVGGV 302

Query: 309 LEAKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKDYQQE 349
           LEA +RWDI K+V+N ISK YPG       VEPA+GAALL WNFL  + Q+E
Sbjct: 303 LEANKRWDIGKEVVNHISKSYPGVLPIHPKVEPAVGAALLGWNFLMTESQKE 350

BLAST of Cp4.1LG06g05630.1 vs. TAIR10
Match: AT1G30540.1 (AT1G30540.1 Actin-like ATPase superfamily protein)

HSP 1 Score: 447.6 bits (1150), Expect = 7.1e-126
Identity = 224/321 (69.78%), Positives = 263/321 (81.93%), Query Frame = 1

Query: 29  IILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVGGCSNHNSVGETAARETLEQVM 88
           +ILG+DGG TSTVCVC+         P    P+L R V GC+N NSVGETAAR++LEQV+
Sbjct: 31  VILGLDGGATSTVCVCVPFFSFGERFPD-PLPILGRAVAGCTNRNSVGETAARDSLEQVI 90

Query: 89  AEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRDIFPCHVNLYVQNDAVAALASG 148
           +EAL +S   +S VR VCL VSGVNHP+DQE+I NW+RD+FP HV +YVQNDA+ ALASG
Sbjct: 91  SEALVQSGFDKSDVRGVCLGVSGVNHPSDQEKIENWIRDMFPSHVKVYVQNDAIVALASG 150

Query: 149 TMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGDWGSGYGIAAQALTAIIRAHDG 208
           TMGKLHGCVLIAGTG IAYGF EDG+EARA+G GPILGDWGSGYGIAAQALTA+IRAHDG
Sbjct: 151 TMGKLHGCVLIAGTGCIAYGFDEDGKEARASGGGPILGDWGSGYGIAAQALTAVIRAHDG 210

Query: 209 RGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIAALVPVIVACAEAGDEVADNILLD 268
           RGP TMLT +ILK L LSSPDELIGWTYADPSWARIAALVP +V+CAEAGDE++D IL+D
Sbjct: 211 RGPQTMLTSTILKALGLSSPDELIGWTYADPSWARIAALVPQVVSCAEAGDEISDKILVD 270

Query: 269 SVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAKRRWDIAKKVINSISKEYPG-- 328
           + E+LA SVKAV+QRLGL G+DG  +FP+VMVGGVL A ++WDI K+V   I++ +PG  
Sbjct: 271 AAEDLALSVKAVVQRLGLCGKDGTASFPVVMVGGVLNANQKWDIGKEVSKRINRYFPGAQ 330

Query: 329 -----VEPALGAALLAWNFLS 343
                VEPA+GAALLA NFLS
Sbjct: 331 TIIPKVEPAVGAALLAMNFLS 350

BLAST of Cp4.1LG06g05630.1 vs. NCBI nr
Match: gi|659129372|ref|XP_008464652.1| (PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo])

HSP 1 Score: 659.1 bits (1699), Expect = 4.4e-186
Identity = 331/350 (94.57%), Positives = 338/350 (96.57%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVG 67
           MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPR +SPS++CPMLARVVG
Sbjct: 1   MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRVVSPSISCPMLARVVG 60

Query: 68  GCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRD 127
           GCSNHNSVGETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RI +WLRD
Sbjct: 61  GCSNHNSVGETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRISDWLRD 120

Query: 128 IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD 187
           IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD
Sbjct: 121 IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD 180

Query: 188 WGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIAAL 247
           WGSGYGIAAQALTAIIRAHDGRGPHT LTYSILKTLDLSSPDELIGWTYADPSWARIAAL
Sbjct: 181 WGSGYGIAAQALTAIIRAHDGRGPHTKLTYSILKTLDLSSPDELIGWTYADPSWARIAAL 240

Query: 248 VPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAK 307
           VPV+VACAEAGDEVA+NILLDSVEELA SVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAK
Sbjct: 241 VPVVVACAEAGDEVANNILLDSVEELALSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAK 300

Query: 308 RRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKDYQQEGI 351
           RRWDIAKKVINSISKEYPG       VEPALGAALLAWNFLSKDYQQEGI
Sbjct: 301 RRWDIAKKVINSISKEYPGILPVWPKVEPALGAALLAWNFLSKDYQQEGI 350

BLAST of Cp4.1LG06g05630.1 vs. NCBI nr
Match: gi|449463605|ref|XP_004149522.1| (PREDICTED: LOW QUALITY PROTEIN: N-acetyl-D-glucosamine kinase-like [Cucumis sativus])

HSP 1 Score: 655.2 bits (1689), Expect = 6.4e-185
Identity = 329/350 (94.00%), Positives = 336/350 (96.00%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEILGGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARVVG 67
           MKRCRN ELWDFEHEILGGDDIILGIDGGTTSTVCVCI LSDPR +SPSM+CPMLARVVG
Sbjct: 1   MKRCRNDELWDFEHEILGGDDIILGIDGGTTSTVCVCIGLSDPRVVSPSMSCPMLARVVG 60

Query: 68  GCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWLRD 127
           GCSNHNSVGETAARETLEQVMAEALSKS SIRSSVRAVCLAVSGVNHPTDQ+RIL+WLRD
Sbjct: 61  GCSNHNSVGETAARETLEQVMAEALSKSGSIRSSVRAVCLAVSGVNHPTDQQRILDWLRD 120

Query: 128 IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD 187
           IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD
Sbjct: 121 IFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPILGD 180

Query: 188 WGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIAAL 247
           WGSGYGIAAQALTAIIRAHDG GPHT LTYSILKTLDLSSPDELIGWTYADPSWARIAAL
Sbjct: 181 WGSGYGIAAQALTAIIRAHDGXGPHTKLTYSILKTLDLSSPDELIGWTYADPSWARIAAL 240

Query: 248 VPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLEAK 307
           VPV+VACAEAGDEVA+NILLDSVEELA SV+AVIQRLGLAGEDGQEAFPLVMVGGVLEAK
Sbjct: 241 VPVVVACAEAGDEVANNILLDSVEELALSVRAVIQRLGLAGEDGQEAFPLVMVGGVLEAK 300

Query: 308 RRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKDYQQEGI 351
           RRWDIAKKVINSISKEYPG       VEPALGAALLAWNFLSKDYQQEGI
Sbjct: 301 RRWDIAKKVINSISKEYPGILPVWPKVEPALGAALLAWNFLSKDYQQEGI 350

BLAST of Cp4.1LG06g05630.1 vs. NCBI nr
Match: gi|1009127369|ref|XP_015880663.1| (PREDICTED: N-acetyl-D-glucosamine kinase-like isoform X1 [Ziziphus jujuba])

HSP 1 Score: 532.7 bits (1371), Expect = 4.8e-148
Identity = 274/351 (78.06%), Positives = 299/351 (85.19%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEI----LGGDDIILGIDGGTTSTVCVC---IALSDPRAISPSMACP 67
           MKR RNGE+WDFEHE+    +G  D+ILG+DGGTTSTVC+C   I  SDP    P    P
Sbjct: 1   MKRYRNGEIWDFEHEMPVVAVGSRDVILGLDGGTTSTVCICMPMIPFSDPLPEPP----P 60

Query: 68  MLARVVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQER 127
           +LAR V GCSNHNSVGE AARETLE VMA+ALSKS S RS+VRAVCLAVSGVNHPTDQ+R
Sbjct: 61  VLARAVAGCSNHNSVGEAAARETLELVMADALSKSGSNRSAVRAVCLAVSGVNHPTDQQR 120

Query: 128 ILNWLRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAG 187
           ILNWLRDIFP HV LYVQNDAVAALASGT+GKLHGCVLIAGTGTIAYGFTEDGREARAAG
Sbjct: 121 ILNWLRDIFPGHVGLYVQNDAVAALASGTLGKLHGCVLIAGTGTIAYGFTEDGREARAAG 180

Query: 188 AGPILGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPS 247
           AGP+LGDWGSGYGIAAQALTA+I+A+DGRGP T+LT SILK L LSSPDELIGWTYADPS
Sbjct: 181 AGPVLGDWGSGYGIAAQALTAVIKAYDGRGPLTLLTSSILKRLGLSSPDELIGWTYADPS 240

Query: 248 WARIAALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMV 307
           WARIAALVPV+++CAEAGDEVA+ IL DSV ELASSVKAV+QRLGL GEDG   FPLVMV
Sbjct: 241 WARIAALVPVVISCAEAGDEVANRILYDSVLELASSVKAVVQRLGLCGEDGNGKFPLVMV 300

Query: 308 GGVLEAKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKD 345
           GGVLEA RRWDI K+VI  ISK+YPG       VEPA+GAALLAWNF  K+
Sbjct: 301 GGVLEANRRWDIGKEVIKCISKDYPGALAIRPKVEPAVGAALLAWNFFMKE 347

BLAST of Cp4.1LG06g05630.1 vs. NCBI nr
Match: gi|703136470|ref|XP_010106164.1| (hypothetical protein L484_012708 [Morus notabilis])

HSP 1 Score: 530.4 bits (1365), Expect = 2.4e-147
Identity = 271/346 (78.32%), Positives = 299/346 (86.42%), Query Frame = 1

Query: 9   KRCRNGELWDFEHE---ILGGDDIILGIDGGTTSTVCVCIALSDPRAISPSMACPMLARV 68
           KR RNGE+WDFEHE   + G  D+ILG+DGGTTSTVC+C+ +  P + SPS   P+LAR 
Sbjct: 3   KRNRNGEIWDFEHEMPVVAGAGDVILGLDGGTTSTVCICMPII-PFSDSPSDPPPVLARA 62

Query: 69  VGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERILNWL 128
           V GCSNHNSVGE AARETLE+VMA+AL KS S RS+VRAVCLAVSGVNHPTDQ+RILNWL
Sbjct: 63  VAGCSNHNSVGEAAARETLEKVMADALLKSGSNRSAVRAVCLAVSGVNHPTDQQRILNWL 122

Query: 129 RDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 188
           R IFP HV LYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL
Sbjct: 123 RYIFPSHVGLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGAGPIL 182

Query: 189 GDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSWARIA 248
           GDWGSGYGIAAQALTA+I+AHDGRG  TMLT SIL+TL LSSPDELIGWTYADPSWARIA
Sbjct: 183 GDWGSGYGIAAQALTAVIKAHDGRGLETMLTSSILETLGLSSPDELIGWTYADPSWARIA 242

Query: 249 ALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVGGVLE 308
           ALVPV+V+CAEAGD+VA+ IL DSV +LASSVKAV+QRLGL GEDG+ +FPLVMVGGVLE
Sbjct: 243 ALVPVVVSCAEAGDDVANRILYDSVHDLASSVKAVVQRLGLCGEDGKHSFPLVMVGGVLE 302

Query: 309 AKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKD 345
           A +RWDI K+VI  ISK YPG       VEPA+GAALLAWNF  K+
Sbjct: 303 ANKRWDIGKEVIRCISKYYPGTIAIRPKVEPAVGAALLAWNFFMKE 347

BLAST of Cp4.1LG06g05630.1 vs. NCBI nr
Match: gi|359485331|ref|XP_002278295.2| (PREDICTED: N-acetyl-D-glucosamine kinase [Vitis vinifera])

HSP 1 Score: 530.4 bits (1365), Expect = 2.4e-147
Identity = 271/350 (77.43%), Positives = 301/350 (86.00%), Query Frame = 1

Query: 8   MKRCRNGELWDFEHEIL---GGDDIILGIDGGTTSTVCVCIA---LSDPRAISPSMACPM 67
           MKR RNGE+WDFE E+     G +++LG+DGGTTSTVCVC+    LSD     P    P+
Sbjct: 1   MKRYRNGEIWDFEDEMPVSPDGSEVVLGLDGGTTSTVCVCMPFFPLSDRPLPDP---VPV 60

Query: 68  LARVVGGCSNHNSVGETAARETLEQVMAEALSKSCSIRSSVRAVCLAVSGVNHPTDQERI 127
           LAR V GCSNHNSVGETAARETLEQVMA+ALSKS S RS+VRAVCLAVSGVNHPTDQ+RI
Sbjct: 61  LARAVAGCSNHNSVGETAARETLEQVMADALSKSGSNRSAVRAVCLAVSGVNHPTDQQRI 120

Query: 128 LNWLRDIFPCHVNLYVQNDAVAALASGTMGKLHGCVLIAGTGTIAYGFTEDGREARAAGA 187
           L+WLRDIF  HV LYVQNDAVAALASGTMG+LHGCVLIAGTGTIAYGFTEDGREARAAGA
Sbjct: 121 LSWLRDIFSSHVKLYVQNDAVAALASGTMGELHGCVLIAGTGTIAYGFTEDGREARAAGA 180

Query: 188 GPILGDWGSGYGIAAQALTAIIRAHDGRGPHTMLTYSILKTLDLSSPDELIGWTYADPSW 247
           GPILGDWGSGYGIAAQALTA++RAHDGRGP T LTYSIL+ L LSSPDELIGWTYADPSW
Sbjct: 181 GPILGDWGSGYGIAAQALTAVVRAHDGRGPQTALTYSILRALSLSSPDELIGWTYADPSW 240

Query: 248 ARIAALVPVIVACAEAGDEVADNILLDSVEELASSVKAVIQRLGLAGEDGQEAFPLVMVG 307
           ARIAALVPV+V+CA+AGDEVA+ ILL+SVEELASSVKAV+QRLGL GEDG+ +FPLVMVG
Sbjct: 241 ARIAALVPVVVSCADAGDEVANKILLESVEELASSVKAVVQRLGLCGEDGKGSFPLVMVG 300

Query: 308 GVLEAKRRWDIAKKVINSISKEYPG-------VEPALGAALLAWNFLSKD 345
           GVLEA + WDI K+V+N I K+YPG       VEPA+GAALLAWNF  K+
Sbjct: 301 GVLEANKTWDIGKEVVNCIYKDYPGTLPIRPKVEPAVGAALLAWNFFMKE 347
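
The percentages on each HSP statistics line are the identical and positive-scoring column counts divided by the alignment length. A minimal sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper, and its pattern also accepts the label spelled "Postives" in case a report renders it that way.

```python
import re

# Matches e.g. "Identity = 120/329 (36.47%), Positives = 183/329 (55.62%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 120/329 (36.47%), Positives = 183/329 (55.62%), Query Frame = 1"
print(hsp_percentages(line))  # (36.47, 55.62)
```

Recomputing the values this way is a quick consistency check against the percentages printed in the report.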

The following BLAST results are available for this feature:
Swiss-Prot:
Match Name    E-value    Identity (%)    Description
NAGK_DICDI    3.0e-49    36.47           N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum GN=nagk PE=3 SV=2
MURK_CLOAB    3.2e-27    31.86           N-acetylmuramic acid/N-acetylglucosamine kinase OS=Clostridium acetobutylicum (s...

TrEMBL:
Match Name          E-value     Identity (%)    Description
E0CTZ3_VITVI        1.6e-147    77.43           Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g01190 PE=4 SV=...
W9SHX8_9ROSA        1.6e-147    78.32           Uncharacterized protein OS=Morus notabilis GN=L484_012708 PE=4 SV=1
A0A067KRZ1_JATCU    1.3e-144    76.07           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05213 PE=4 SV=1
A0A059DD75_EUCGR    3.8e-144    75.21           Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00888 PE=4 SV=1
B9IA59_POPTR        3.8e-144    75.57           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s14040g PE=4 SV=1

TAIR10:
Match Name     E-value     Identity (%)    Description
AT1G30540.1    7.1e-126    69.78           Actin-like ATPase superfamily protein

NCBI nr:
Match Name                           E-value     Identity (%)    Description
gi|659129372|ref|XP_008464652.1|     4.4e-186    94.57           PREDICTED: N-acetyl-D-glucosamine kinase-like [Cucumis melo]
gi|449463605|ref|XP_004149522.1|     6.4e-185    94.00           PREDICTED: LOW QUALITY PROTEIN: N-acetyl-D-glucosamine kinase-like [Cucumis sati...
gi|1009127369|ref|XP_015880663.1|    4.8e-148    78.06           PREDICTED: N-acetyl-D-glucosamine kinase-like isoform X1 [Ziziphus jujuba]
gi|703136470|ref|XP_010106164.1|     2.4e-147    78.32           hypothetical protein L484_012708 [Morus notabilis]
gi|359485331|ref|XP_002278295.2|     2.4e-147    77.43           PREDICTED: N-acetyl-D-glucosamine kinase [Vitis vinifera]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term         Definition
IPR002731    ATPase_BadF
GO Assignments
This mRNA is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0016310        phosphorylation
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0016301        kinase activity

This mRNA is a part of the following gene feature(s):

Feature Name       Unique Name        Type
Cp4.1LG06g05630    Cp4.1LG06g05630    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name         Unique Name                  Type
Cp4.1LG06g05630.1    Cp4.1LG06g05630.1-protein    polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
Cp4.1LG06g05630.1:cds:001    Cp4.1LG06g05630.1:cds:001    CDS
Cp4.1LG06g05630.1:cds:002    Cp4.1LG06g05630.1:cds:002    CDS
Cp4.1LG06g05630.1:cds:003    Cp4.1LG06g05630.1:cds:003    CDS
Cp4.1LG06g05630.1:cds:004    Cp4.1LG06g05630.1:cds:004    CDS
Cp4.1LG06g05630.1:cds:005    Cp4.1LG06g05630.1:cds:005    CDS
Cp4.1LG06g05630.1:cds:006    Cp4.1LG06g05630.1:cds:006    CDS
Cp4.1LG06g05630.1:cds:007    Cp4.1LG06g05630.1:cds:007    CDS
Cp4.1LG06g05630.1:cds:008    Cp4.1LG06g05630.1:cds:008    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                             Unique Name                              Type
Cp4.1LG06g05630.1:three_prime_utr:001    Cp4.1LG06g05630.1:three_prime_utr:001    three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                     Source     Source Term      Source Description                            Alignment
IPR002731    ATPase, BadF/BadG/BcrA/BcrD type    PFAM       PF01869          BcrAD_BadFG                                   coord: 31..337, score: 5.3
None         No IPR available                    PANTHER    PTHR12862        BADF TYPE ATPASE DOMAIN-CONTAINING PROTEIN    coord: 1..348, score: 9.6E
None         No IPR available                    PANTHER    PTHR12862:SF6    ACTIN-LIKE ATPASE SUPERFAMILY PROTEIN         coord: 1..348, score: 9.6E
None         No IPR available                    unknown    SSF53067         Actin-like ATPase domain                      coord: 154..344, score: 1.41E-47
                                                                                                                           coord: 29..149, score: 3.8