Cla006823 (gene) Watermelon (97103) v1

Name: Cla006823
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Oxidoreductase aldo/keto reductase family protein (AHRD V1 **-- F3MCJ9_9BACL); contains Interpro domain(s) IPR001395 Aldo/keto reductase
Location: Chr2 : 10035270 .. 10039718 (-)
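
As a minimal sketch, the location string can be split into chromosome, coordinates, and strand, and the genomic span computed from it. This assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chr2 : 10035270 .. 10039718 (-)' into its parts (hypothetical helper)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr2 : 10035270 .. 10039718 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 4449 bp on the minus strand of Chr2.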
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCTTCTTATCCCAAGCTTCAGTTACGGGACCTGGGAAACACCGGCCTCAAAACCAGCAGTGTCGGCTTTGGCGCCTCCCCTCTCGGCAGCGTCTTCAGCCCCGTTTCTGAGGAGGATGCCGTCGCCGCCGTTCGTGAAGCCTTTCGTCTTGGCATCAACTTCTTCGACACCTCCCCGTATGTAAATTCATTCTGTTTCTCTGAGAATGAGGGTGGATATCTACGATGATTCTGTTTGTTTACTCGAAGTTTTGTATTGCGTTTTGTTTTGTTTTGTTTTTTAATGTTGAGATTGGTTTAGGTAAATGGGAGATGAAACAATGGTCAAAATTGCAACGTTTGTTCAAGTTTGATGTTTAATGGGATGTTGTTGAAAGTTTAGGTATTTGGCTAACCTGCCAAAGGCTTTGGGAAATTATATAGAGGTTTTTGTACGATGACCAATCTCAGATGCTTTAACGCGAAATGACCCAATCCTAAAAAAATGAATGAAAACTAATCTTGAACCCATGCCACAGACTACTATCGCAATAGCCCTCTTTCCCATCGCCAAATAATTGGCGCTTAATGCAATATTCATTTCAATCGTGATTTTCTTTTCTTTTCTTTTTTTTTTTTTTTTTTTAATCAACTGTCGTTTTCTTTAAATCACAGGTGATTAAAAATGAAAGTTGTGTTAGGGCTCTGACCATTTTGCGATGGTTGAAAAGTTCCTAACGCGATAATAATTGGTGAAATGTTCTATAACGTAAGTAGGAGGTTAGTTTTTATTGATTTTTTAGAATTGGGTTATTCAACGTTAACGTCGCAGTAATTAGCTCATCCGTACAAAAATCTTAAATTTTATATATCATTTCTTTTTATATGTCTAATTCATCGGCAATGATATGGAATGGTTTGCTTGTTTAGTTATCTTCTCTAGTACTTAATTTGGGGTTGCCTCGTTTACTACATGGATGAAGGTACTATGGTAGGACCTTATCAGAAAAGATGCTTGGTAAGGGACTGAAAGCTCTAGGAGTTCCAAGGAGTGAGTATATTGTGTCAACAAAGTGTGGGAGATATGGTGACGGTTTTGATTTCAGTGCTGAGAGGGTGACAAGGAGCATTGATGAGAGTTTGGCTAGACTACAACTAGATTATGTCGATATACTGCATTGCCATGATATTGAATTTGGATCTCTTGATCAGGTGTGTAACTTGTCTGATGTTTGTTACTTTTCTCAAGAACTTACTCACCATTGACTCTTAGTTTTTAATATGGTCATGTGACCTTTTGTTTTATTGGTTACAACTTTATCCTTGAGTTCTAAATAGTTATTAACATGGTTATTGTCATTATAATATAAAAGAAACAATGGATCTTCAGCCGCTTAAGTTTATTTCAGGTTGTGAATGAGACGGTTCCTGCACTTCAAAAGCTAAAGGAAGCTGGGAAGACTCGTTTCATTGGTATTACAGGACTTCCATTGGAAATATTTACATATGTGCTCGATCGAGTACCACCTGGCACCATTGATGTGATTCTTTCATATTGTCACTACAGTATTAACGATTCAACACTGTTAGATTTGCTACCTTACTTGAAGAGTAAGGGAGTTGGAATAATCAGTGCGTCTCCTCTAGCAATGGGACTTCTAACCGATCATGGTCCCCCAGAATGGCACCCGGCTTCACCAGAATTGAAGGTATAGTTCCTCACCCCTTTTCTTTCATTGGCTTGGAAGTTGAATTGTGAAACTTAAAAGCTTGAGCAGGGAAAGCTCTTTCAGTTGATTAATGTCATGTAAAGAAGACACATTTGTCTGGAGGAGTACATTTCTCGTCGGCTACTCTGATGAGGTTTAGGTCCCATTGTGGAAAAATTTTATCTCGTGAAATGAAAATAATTTATAGGTATTGATTATACTATGCACTAATGACTAGTTCGTTCAACCATACCAGCTAATCCAAAACCATTTGTTCTAGTGCTGTGATTTTTTCTTTTATGTGTCC
AACTCTGTTTTGTTTGAGCTTTGGCTTTACATAATTTGAATGATATTGTTCTCTTATTTAAATTCCAACAGTCTGCATGTCAAGCTGCAGCTGCTCATTGTAAAAAGAAGGGAAAAAACATTTCAAAGTTAGCCCTCCAATACAGTCTGGCAAACAAAGATATTTCAACAGTGCTGGTTGGCATGAACTCTGTCAGACAGGTATCTCTCAATAATCTCATTGTTTTTTTTTTTTGGGGGGGGGGAAGCGAATCTCATTGTTTGGTATTCTCTTTCAATATTAATTTCAATATCTTTGAGTCCAATCTCCAGTTTTGAAGACCGTGAATTGTCCCTGTGCCATCGTACATATAGTCATTATGTATCCATTAGTTGTACTTGTACTTCTGTGGCCACTTTTTTGTTAAAAAAACTTGTCTCCACACCAGGTAGGAGGAAGTTCTTATTTTTTGATATGGTTTATCTGGATGCACTTATTAAATGTAATGAAAATATACCAATACAAGTAGGCCACACAAGTTGTAAGAGGTGAGTTTTGTTTTACTTGTGAGAGTTAAGAAAAAGTAAAAAAAGAAAAAAGGAAAAAAAGAATGAAAGACTTCATTGATTCTATGTTTTTAAATGGCAAAGTACTTAATTAACCCTATTTAATGGATGAGCTTTAGGCAAGGACAACAGATTTGAAAATACTGACTTGAAAGAAAGATAGACAAAAAAGAACATTTGTGTTACGAACTCGGCAATTAAAGAATACAAAAGAACGCAAAATATAGGGAAGATCGACACACAAATTTATGTCATTCACTAACAATGTGTTAGTTACGTCCACATAACAGAGGGAGAACAATTTTATTAGAGAGAATGGTACAAGATTATAGAATATCACAAAATGAAAAATGTCTATAGGGTTAGAGAATTTATATAATGTATTTCTCTAAACCCTAGGTGGGAGAAGGAAAATATTTAAATAATTAAATATATCAAATACCAATGCTAATTAGATTTAGGATTCAAGACATTTCAACAAATCTCCACCTTGACTTGAATTCTCTCATAGATAATAGATGTACTAGATGCAACATAAAACACATTGCCAGATGTACCCAGACTTCTTCCACATGCCTTAGATTGTCTCGGAAGAGACCACTAATAATTCAAAATATTTAGCAAGTCCAAGCAATGTTGAAACTTGCTATGTGGAACCGATTGGTGAACATATCAACAAGATTATCTCATCCCCCATAGTAGCAAGCCACTGAGCAAACTTACTACACGAAATAGGTTCTTTATAGGTGGATGACTCATGAGGATCCACCTTTTCAGCAATTTGTAGTGCATAATTCACCATATCTTCAAAACCATACCTCTTTGGAGGCCAGCTCCAACTCTTCTAACTCTATCACGAGCTATACTTGGTTGATCTATTTGTTGACTAGAATTCTCAGGCATAACATTTTTTGACATTTCAGTCACCTCAGTTTCTGGTAGCACAATAGTTTATGTAGTGTTAGTGTTGATCTTCTCCACCTTGGTGTTGTAATTCATTCACAGTTGAAGTGAATTGCATCTCCACTCGTTTATCAACACTACTACTCTCTCCTACACCAGTAAACATCATAACAGACTTTACATTTGAGTTAAGCATGAAATTCTCATCAAAGATCACATTCCTACTGAGTATAACTCTCTAGAAGGTAACAAAACTCTATATCCCTTAACTCCATCCCCAAAGCCAACAAATATACCTTTTTTAGCTCTTGGTTCTAATTTACCTTCATTAACATGATAGTAAACAGTACAACAAAAAACCTTTAATAAATAACTCCACGTAATTCTTTTCTCAAATTACCTCTTGTGTATTTCCTTTCTTGCCCCTTCTCTTATAAGTGTGAATGAACGGACGTCCAATATGATCAAGACTGTAGTAGAATTGCAAAAATGATTAGAAAGAGAGAATCAAGGAGAGATTAATTGGAGTTTCCTGTTGATACGGATCGTCTC
TTCCTATGTGTCAAATTCCTTTTCATTTTTCTAGAAAAGTTGGGAGTTTCTTGTTGCTAGTTTGAATACGACTAGTAAGCTTGTTTTTAGAAGTATAATTTCCATCCTATATTCCTATCTAGAATGTTAAAATCTTATGAACTTTTATACGACCAAATATTACAAAGTTGGAAATCAGTAAGATTAGATGAGGTGCGTCTAAGTTGGTCCAAATGCTCATGAAAATTATTCTCTTACAATGTTAGTTTCCAAACTAGTTCCTAAGTCGGCTGACTTAAATATAACATTCTAACCAAGTTCTAAAAATTTGGAAGCAGGTGGAGGAAAACGTAGCTGCTGCTGAAGAACTTGCCACATTTGGGAGGGATGAGGAAACTCTGTCGGAGGTTGAAGCTATCCTTATCCCCGTCAAGAATCAGACATGGCCAAGTGGAATCCAAAAGAGCTGA

mRNA sequence

ATGGCGGCTTCTTATCCCAAGCTTCAGTTACGGGACCTGGGAAACACCGGCCTCAAAACCAGCAGTGTCGGCTTTGGCGCCTCCCCTCTCGGCAGCGTCTTCAGCCCCGTTTCTGAGGAGGATGCCGTCGCCGCCGTTCGTGAAGCCTTTCGTCTTGGCATCAACTTCTTCGACACCTCCCCGTACTATGGTAGGACCTTATCAGAAAAGATGCTTGGTAAGGGACTGAAAGCTCTAGGAGTTCCAAGGAGTGAGTATATTGTGTCAACAAAGTGTGGGAGATATGGTGACGGTTTTGATTTCAGTGCTGAGAGGGTGACAAGGAGCATTGATGAGAGTTTGGCTAGACTACAACTAGATTATGTCGATATACTGCATTGCCATGATATTGAATTTGGATCTCTTGATCAGGTTGTGAATGAGACGGTTCCTGCACTTCAAAAGCTAAAGGAAGCTGGGAAGACTCGTTTCATTGGTATTACAGGACTTCCATTGGAAATATTTACATATGTGCTCGATCGAGTACCACCTGGCACCATTGATGTGATTCTTTCATATTGTCACTACAGTATTAACGATTCAACACTGTTAGATTTGCTACCTTACTTGAAGAGTAAGGGAGTTGGAATAATCAGTGCGTCTCCTCTAGCAATGGGACTTCTAACCGATCATGGTCCCCCAGAATGGCACCCGGCTTCACCAGAATTGAAGTCTGCATGTCAAGCTGCAGCTGCTCATTGTAAAAAGAAGGGAAAAAACATTTCAAAGTTAGCCCTCCAATACAGTCTGGCAAACAAAGATATTTCAACAGTGCTGGTTGGCATGAACTCTGTCAGACAGGTGGAGGAAAACGTAGCTGCTGCTGAAGAACTTGCCACATTTGGGAGGGATGAGGAAACTCTGTCGGAGGTTGAAGCTATCCTTATCCCCGTCAAGAATCAGACATGGCCAAGTGGAATCCAAAAGAGCTGA

Coding sequence (CDS)

ATGGCGGCTTCTTATCCCAAGCTTCAGTTACGGGACCTGGGAAACACCGGCCTCAAAACCAGCAGTGTCGGCTTTGGCGCCTCCCCTCTCGGCAGCGTCTTCAGCCCCGTTTCTGAGGAGGATGCCGTCGCCGCCGTTCGTGAAGCCTTTCGTCTTGGCATCAACTTCTTCGACACCTCCCCGTACTATGGTAGGACCTTATCAGAAAAGATGCTTGGTAAGGGACTGAAAGCTCTAGGAGTTCCAAGGAGTGAGTATATTGTGTCAACAAAGTGTGGGAGATATGGTGACGGTTTTGATTTCAGTGCTGAGAGGGTGACAAGGAGCATTGATGAGAGTTTGGCTAGACTACAACTAGATTATGTCGATATACTGCATTGCCATGATATTGAATTTGGATCTCTTGATCAGGTTGTGAATGAGACGGTTCCTGCACTTCAAAAGCTAAAGGAAGCTGGGAAGACTCGTTTCATTGGTATTACAGGACTTCCATTGGAAATATTTACATATGTGCTCGATCGAGTACCACCTGGCACCATTGATGTGATTCTTTCATATTGTCACTACAGTATTAACGATTCAACACTGTTAGATTTGCTACCTTACTTGAAGAGTAAGGGAGTTGGAATAATCAGTGCGTCTCCTCTAGCAATGGGACTTCTAACCGATCATGGTCCCCCAGAATGGCACCCGGCTTCACCAGAATTGAAGTCTGCATGTCAAGCTGCAGCTGCTCATTGTAAAAAGAAGGGAAAAAACATTTCAAAGTTAGCCCTCCAATACAGTCTGGCAAACAAAGATATTTCAACAGTGCTGGTTGGCATGAACTCTGTCAGACAGGTGGAGGAAAACGTAGCTGCTGCTGAAGAACTTGCCACATTTGGGAGGGATGAGGAAACTCTGTCGGAGGTTGAAGCTATCCTTATCCCCGTCAAGAATCAGACATGGCCAAGTGGAATCCAAAAGAGCTGA

Protein sequence

MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVEAILIPVKNQTWPSGIQKS
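
The CDS and protein sequences above can be cross-checked by translating a few codons. The sketch below assumes the standard genetic code; `translate` is an illustrative helper, and only the first 30 and last 18 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative translation table applies to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCGGCTTCTTATCCCAAGCTTCAGTTA"  # first 30 nt of the CDS
cds_end = "GGAATCCAAAAGAGCTGA"                # last 18 nt of the CDS

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The two fragments translate to `MAASYPKLQL` and `GIQKS*`, matching the first and last residues of the protein sequence, with the final `TGA` read as the stop codon.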
BLAST of Cla006823 vs. Swiss-Prot
Match: GALDH_ARATH (L-galactose dehydrogenase OS=Arabidopsis thaliana GN=LGALDH PE=1 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 2.9e-152
Identity = 265/317 (83.60%), Positives = 290/317 (91.48%), Query Frame = 1

Query: 7   KLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGRT 66
           K++LR LGNTGLK S+VGFGASPLGSVF PV+E+DAVA VREAFRLGINFFDTSPYYG T
Sbjct: 3   KIELRALGNTGLKVSAVGFGASPLGSVFGPVAEDDAVATVREAFRLGINFFDTSPYYGGT 62

Query: 67  LSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDILH 126
           LSEKMLGKGLKAL VPRS+YIV+TKCGRY +GFDFSAERV +SIDESL RLQLDYVDILH
Sbjct: 63  LSEKMLGKGLKALQVPRSDYIVATKCGRYKEGFDFSAERVRKSIDESLERLQLDYVDILH 122

Query: 127 CHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILSY 186
           CHDIEFGSLDQ+V+ET+PALQKLK+ GKTRFIGITGLPL+IFTYVLDRVPPGT+DVILSY
Sbjct: 123 CHDIEFGSLDQIVSETIPALQKLKQEGKTRFIGITGLPLDIFTYVLDRVPPGTVDVILSY 182

Query: 187 CHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAAH 246
           CHY +NDSTLLDLLPYLKSKGVG+ISASPLAMGLLT+ GPPEWHPASPELKSA +AA AH
Sbjct: 183 CHYGVNDSTLLDLLPYLKSKGVGVISASPLAMGLLTEQGPPEWHPASPELKSASKAAVAH 242

Query: 247 CKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVEA 306
           CK KGK I+KLALQYSLANK+IS+VLVGM+SV QVEENVAA  EL + G D+ETLSEVEA
Sbjct: 243 CKSKGKKITKLALQYSLANKEISSVLVGMSSVSQVEENVAAVTELESLGMDQETLSEVEA 302

Query: 307 ILIPVKNQTWPSGIQKS 324
           IL PVKN TWPSGI ++
Sbjct: 303 ILEPVKNLTWPSGIHQN 319
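
Each HSP header in these BLAST blocks carries a bit score, raw score, E-value, and identity/positive counts. A rough sketch of pulling those numbers out of the two header lines follows; `parse_hsp` and the regular expressions are illustrative assumptions tuned to the layout shown on this page, not an official BLAST parser.

```python
import re

# Patterns written against the header layout on this page (assumption).
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# 'Posi?tives' also accepts the misspelling 'Postives' seen in some exports.
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp(score_line, identity_line):
    """Return the HSP statistics as a dict (hypothetical helper)."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError(f"unrecognised score line: {score_line!r}")
    stats = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, matched, aligned in FRACTION_RE.findall(identity_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        # Percentage = matched columns / alignment length.
        stats[key] = round(100 * int(matched) / int(aligned), 2)
    return stats

stats = parse_hsp(
    "HSP 1 Score: 539.3 bits (1388), Expect = 2.9e-152",
    "Identity = 265/317 (83.60%), Positives = 290/317 (91.48%), Query Frame = 1",
)
print(stats)
```

The percentages are recomputed from the counts (265/317 and 290/317) rather than read from the parentheses, which gives a quick consistency check on the reported values.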

BLAST of Cla006823 vs. Swiss-Prot
Match: ARA2_YEAST (D-arabinose 1-dehydrogenase OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=ARA2 PE=1 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.0e-28
Identity = 86/292 (29.45%), Positives = 155/292 (53.08%), Query Frame = 1

Query: 46  VREAFRLGINFFDTSPYYGRTLSEKMLGKGLKAL--GVPRSEYIVSTKCGRYG-DGFDFS 105
           ++ AF  GIN  DTSPYYG   SE + G+ L  L    PR  Y + TK GR G + F++S
Sbjct: 41  IKYAFSHGINAIDTSPYYGP--SEVLYGRALSNLRNEFPRDTYFICTKVGRIGAEEFNYS 100

Query: 106 AERVTRSIDESLARLQLDYVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITG 165
            + V  S+  S  RL   Y+D+++ HD+EF     ++ E +  L+ LK  G  +  GI+G
Sbjct: 101 RDFVRFSVHRSCERLHTTYLDLVYLHDVEFVKFPDIL-EALKELRTLKNKGVIKNFGISG 160

Query: 166 LPLEIFTYVLDRVPP-----GTIDVILSYCHYSINDSTLLDLLPYL--KSKGVGIISASP 225
            P++  T++ +         G++D +LSYC+ ++ ++ LL+    L   +K   + +AS 
Sbjct: 161 YPIDFITWLAEYCSTEESDIGSLDAVLSYCNLNLQNNKLLNFRERLLRNAKLKMVCNASI 220

Query: 226 LAMGLLTDHGPPEWHPASPELKSACQAAAAHCKKKGKNISKLALQYSLAN-KDISTVLVG 285
           L+M LL      ++HP S EL+     AA +C+++  +++ LA +Y+++       V++G
Sbjct: 221 LSMSLLRSQETRQFHPCSHELRECASQAAKYCQEQNVDLADLATRYAISEWVGKGPVVLG 280

Query: 286 MNSVRQVEENVAAAEELATFG-----RDEETLSEVEAILIPVK-NQTWPSGI 321
           ++S+ +++  +   E + + G     +D + +  ++  +     N+ W SGI
Sbjct: 281 VSSMEELKLALDNYEIVKSNGNRLSSKDGQLVEYIQKNIFKEHFNEEWSSGI 329

BLAST of Cla006823 vs. Swiss-Prot
Match: FCDH_PSESP (D-threo-aldose 1-dehydrogenase OS=Pseudomonas sp. GN=fdh PE=1 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 6.6e-27
Identity = 89/302 (29.47%), Positives = 142/302 (47.02%), Query Frame = 1

Query: 17  GLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGRTLSEKMLGKGL 76
           GL   ++G+GA+ +G++F  +S+++A A +  A+  GI ++DT+P+YG  LSEK LG  L
Sbjct: 12  GLAIPALGYGAANVGNLFRALSDDEAWAVLEAAWDAGIRYYDTAPHYGLGLSEKRLGAFL 71

Query: 77  KALGVPRSEYIVSTKCGR------------------------YGDGFDFSAERVTRSIDE 136
           +    PR E++VSTK GR                            +DF+ + +  SI E
Sbjct: 72  QT--KPRDEFVVSTKAGRLLRPNPERRPSGLDTDNDFHVPDDLRREWDFTEQGIRASIAE 131

Query: 137 SLARLQLDYVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVL 196
           S  RL LD +D+L+ HD E   LD  +    PAL+K++  G  + IGI  +  +  T   
Sbjct: 132 SQERLGLDRIDLLYLHDPERHDLDLALASAFPALEKVRAEGVVKAIGIGSMVSDALTRA- 191

Query: 197 DRVPPGTIDVILSYCHYS-INDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPP---- 256
             V    +D+I+    Y+ +      ++LP       GI++AS    GLL    P     
Sbjct: 192 --VREADLDLIMVAGRYTLLEQPAATEVLPACAENATGIVAASVFNSGLLAQSEPKRDGR 251

Query: 257 -EWHPASPELKSACQAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVA 289
            E+     EL       AA C+     +   A+Q+ L +  + +V+VG +   Q+ +N  
Sbjct: 252 YEYGQLPDELWDRLVRIAAICRNHDVPLPAAAIQFPLQSALVRSVVVGGSRPAQLTQNAE 308

BLAST of Cla006823 vs. Swiss-Prot
Match: YQKF_BACSU (Uncharacterized oxidoreductase YqkF OS=Bacillus subtilis (strain 168) GN=yqkF PE=3 SV=1)

HSP 1 Score: 110.2 bits (274), Expect = 4.4e-23
Identity = 84/299 (28.09%), Positives = 152/299 (50.84%), Query Frame = 1

Query: 8   LQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGRTL 67
           ++ R LG + L  S VG G   LG+      +  A++ + EA  LGIN+ DT+  Y R  
Sbjct: 1   MRKRKLGTSDLDISEVGLGCMSLGT-----EKNKALSILDEAIELGINYLDTADLYDRGR 60

Query: 68  SEKMLGKGLKALGVPRSEYIVSTKCG-RYGDG-----FDFSAERVTRSIDESLARLQLDY 127
           +E+++G    A+   R + I++TK G R+ DG     +D S   +  ++ +SL RL+ DY
Sbjct: 61  NEEIVG---DAIQNRRHDIILATKAGNRWDDGSEGWYWDPSKAYIKEAVKKSLTRLKTDY 120

Query: 128 VDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGL-PLEIFTYVLDRVPPGTI 187
           +D+   H    G+++  ++ET+ A ++LK+ G  R+ GI+ + P  I  YV         
Sbjct: 121 IDLYQLHG---GTIEDNIDETIEAFEELKQEGVIRYYGISSIRPNVIKEYVKKS------ 180

Query: 188 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELK--- 247
           +++     +S+ D    + LP L+   + +++  P+A GLLT+    +   AS  +K   
Sbjct: 181 NIVSIMMQFSLFDRRPEEWLPLLEEHQISVVARGPVAKGLLTEKPLDQ---ASESMKQNG 240

Query: 248 --SACQAAAAHCKKKGKNI------SKLALQYSLANKDISTVLVGMNSVRQVEENVAAA 289
             S       + +K  + +      ++ +LQY LA   +++V+ G + + Q+ EN+ AA
Sbjct: 241 YLSYSFEELTNARKAMEEVAPDLSMTEKSLQYLLAQPAVASVITGASKIEQLRENIQAA 279

BLAST of Cla006823 vs. Swiss-Prot
Match: AKR1_SOYBN (Probable aldo-keto reductase 1 OS=Glycine max GN=AKR1 PE=2 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 2.2e-22
Identity = 76/256 (29.69%), Positives = 122/256 (47.66%), Query Frame = 1

Query: 7   KLQLRDLGNTGLKTSSVGFGASPL-GSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGR 66
           ++Q   LG  G + S +GFG   L G+   P+ E+D ++ ++ AF  GI FFDT+  YG 
Sbjct: 5   QIQPVKLGTQGFEVSKLGFGCMGLTGAYNDPLQEQDGISVIKYAFSKGITFFDTADVYGA 64

Query: 67  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGF-----DFSAERVTRSIDESLARLQLD 126
             +E ++GK LK L  PR +  ++TK G    GF     + S E V    +  L RL ++
Sbjct: 65  NANELLVGKALKQL--PREKIQIATKFGIASRGFPDMKIEGSPEYVRSCCETGLKRLDVE 124

Query: 127 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 186
           Y+D+ + H ++       + ETV  L+KL E GK ++IG++    +         P   +
Sbjct: 125 YIDLYYQHRVD---TSVPIEETVGELKKLVEEGKVKYIGLSEASPDTIRRAHAIHPITAV 184

Query: 187 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 246
            +  S     I +    +++P  +  G+GI+  SPL  G     G  E  P +  LK   
Sbjct: 185 QIEWSLWTRDIEE----EIVPLCRELGIGIVPYSPLGRGFFGGKGVVENVPTNSSLK--- 244

Query: 247 QAAAAHCKKKGKNISK 257
               AH + + +N+ K
Sbjct: 245 ----AHPRFQAENLDK 244

BLAST of Cla006823 vs. TrEMBL
Match: E5GCR2_CUCME (L-galactose dehydrogenase OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 1.4e-172
Identity = 306/323 (94.74%), Positives = 315/323 (97.52%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA SYPKL LR+LGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS
Sbjct: 1   MAVSYPKLPLRELGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYGR LSEKMLGKGLKALGVPRSEYIVSTKCGRYG+GFDFSAERVTRSIDESLARLQLD
Sbjct: 61  PYYGRNLSEKMLGKGLKALGVPRSEYIVSTKCGRYGEGFDFSAERVTRSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDILHCHDIEFGSLDQ+VNET+PALQKLKEAGKTRF+GITGLPLEIFTYVLDRVPPGTI
Sbjct: 121 YVDILHCHDIEFGSLDQIVNETIPALQKLKEAGKTRFLGITGLPLEIFTYVLDRVPPGTI 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTD GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDQGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           QAAA HC+KKG+NI+KLA+QYSL NKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET
Sbjct: 241 QAAADHCRKKGRNITKLAIQYSLVNKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAIL PVKNQTWPSGIQ S
Sbjct: 301 LSEVEAILNPVKNQTWPSGIQNS 323

BLAST of Cla006823 vs. TrEMBL
Match: B9HN60_POPTR (L-galactose dehydrogenase family protein OS=Populus trichocarpa GN=POPTR_0009s08490g PE=4 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 1.1e-158
Identity = 279/323 (86.38%), Positives = 306/323 (94.74%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA  +P L+LR LGNTGLK S VGFGASPLGSVF PVSE DA+++VREAFRLGINFFDTS
Sbjct: 1   MAYPHPNLELRPLGNTGLKLSCVGFGASPLGSVFGPVSEHDAISSVREAFRLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYG TLSEKMLG+GLKALGVPR+EYIVSTKCGRY +GFDFSAERVT+SIDESLARLQLD
Sbjct: 61  PYYGGTLSEKMLGQGLKALGVPRNEYIVSTKCGRYVEGFDFSAERVTKSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDIL CHDIEFGSLDQ+VNET+PAL+KLKEAGK RFIGITGLPL +FTYVLDRVPPGT+
Sbjct: 121 YVDILQCHDIEFGSLDQIVNETIPALRKLKEAGKIRFIGITGLPLGVFTYVLDRVPPGTV 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYC YSINDSTL DLLPYLKSKGVG+ISASPLAMGLLT++GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCRYSINDSTLADLLPYLKSKGVGVISASPLAMGLLTENGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           +AAAA CK+KGKNISK+A+QYSL+NKDIS+VLVGMNSVRQVEENV+AA ELATFG+D+ET
Sbjct: 241 EAAAAFCKEKGKNISKIAMQYSLSNKDISSVLVGMNSVRQVEENVSAATELATFGKDQET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAILIPVKNQTWPSGIQ+S
Sbjct: 301 LSEVEAILIPVKNQTWPSGIQQS 323

BLAST of Cla006823 vs. TrEMBL
Match: A0A061DX80_THECC (NAD(P)-linked oxidoreductase superfamily protein OS=Theobroma cacao GN=TCM_006358 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 5.6e-158
Identity = 280/318 (88.05%), Positives = 303/318 (95.28%), Query Frame = 1

Query: 6   PKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGR 65
           PKL+LR LGNTGLK SSVGFGASPLGSVF PVSE DAVA+VREAFRLGINFFDTSPYYG 
Sbjct: 24  PKLELRPLGNTGLKLSSVGFGASPLGSVFGPVSESDAVASVREAFRLGINFFDTSPYYGG 83

Query: 66  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDIL 125
           TLSEKMLGKGLKALGVPR+EYIVSTKCGRY +GFDFSAERVT+SIDESL RLQLDYVDIL
Sbjct: 84  TLSEKMLGKGLKALGVPRNEYIVSTKCGRYREGFDFSAERVTKSIDESLERLQLDYVDIL 143

Query: 126 HCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILS 185
            CHDIEFGSLDQVVNET+PALQKLKEAGK RFIGITGLPLEIFT+VLDRVPPGT+DVILS
Sbjct: 144 QCHDIEFGSLDQVVNETIPALQKLKEAGKIRFIGITGLPLEIFTFVLDRVPPGTVDVILS 203

Query: 186 YCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAA 245
           YCHYSINDSTL DLLPYLK+KGVG+ISASPLAMGLLT+ GPP+WHPASPELKSACQAAAA
Sbjct: 204 YCHYSINDSTLEDLLPYLKNKGVGVISASPLAMGLLTELGPPDWHPASPELKSACQAAAA 263

Query: 246 HCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVE 305
           +CK+KGKNISKLA+QYSL+N+DIS+VLVGMNSV+QVEENVAAA E+A FG+D ETLSE+E
Sbjct: 264 YCKEKGKNISKLAMQYSLSNEDISSVLVGMNSVKQVEENVAAATEVALFGKDLETLSEIE 323

Query: 306 AILIPVKNQTWPSGIQKS 324
           AIL PVKNQTWPSGIQ+S
Sbjct: 324 AILKPVKNQTWPSGIQRS 341

BLAST of Cla006823 vs. TrEMBL
Match: A0A0D2TWW3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G042900 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 5.6e-158
Identity = 281/318 (88.36%), Positives = 299/318 (94.03%), Query Frame = 1

Query: 6   PKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGR 65
           PKL++R LGNTGLK SSVGFGASPLGSVF  VSE DAVA+V EAFRLGINFFDTSPYYG 
Sbjct: 4   PKLEMRPLGNTGLKLSSVGFGASPLGSVFGSVSESDAVASVLEAFRLGINFFDTSPYYGA 63

Query: 66  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDIL 125
           TLSEKMLGKGLKALGVPRSEYIVSTKCGRY +GFDFSAERVT+SIDESL RLQLDYVDI 
Sbjct: 64  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYREGFDFSAERVTKSIDESLERLQLDYVDIF 123

Query: 126 HCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILS 185
            CHDIEFGSLDQVVNET+PALQKLKEAGK RFIGITGLPLEIFTYVLDRVPPGT+DVILS
Sbjct: 124 QCHDIEFGSLDQVVNETIPALQKLKEAGKIRFIGITGLPLEIFTYVLDRVPPGTVDVILS 183

Query: 186 YCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAA 245
           YCHYSINDSTL DLLPYLK+KGVG+ISASPLAMGLLT+ GPPEWHPASPELKSACQAAA 
Sbjct: 184 YCHYSINDSTLEDLLPYLKTKGVGVISASPLAMGLLTEFGPPEWHPASPELKSACQAAAV 243

Query: 246 HCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVE 305
           +CK+KGKNISKLA+QYSL+NKDISTVLVGMNSV+QVEENVAAA ELA FG+D ETL+EVE
Sbjct: 244 YCKEKGKNISKLAMQYSLSNKDISTVLVGMNSVKQVEENVAAATELALFGKDHETLAEVE 303

Query: 306 AILIPVKNQTWPSGIQKS 324
           AIL PVKNQTWPSGIQ+S
Sbjct: 304 AILKPVKNQTWPSGIQRS 321

BLAST of Cla006823 vs. TrEMBL
Match: A0A0B0NLB1_GOSAR (D-arabinose 1-dehydrogenase OS=Gossypium arboreum GN=F383_18998 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 5.6e-158
Identity = 282/318 (88.68%), Positives = 299/318 (94.03%), Query Frame = 1

Query: 6   PKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGR 65
           PKL++R LGNTGLK SSVGFGASPLGSVF  VSE DAVA+V EAFRLGINFFDTSPYYG 
Sbjct: 4   PKLEMRPLGNTGLKLSSVGFGASPLGSVFGSVSESDAVASVLEAFRLGINFFDTSPYYGA 63

Query: 66  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDIL 125
           TLSEKMLGKGLKALGVPRSEYIVSTKCGRY +GFDFSAERVT+SIDESL RLQLDYVDI 
Sbjct: 64  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYREGFDFSAERVTKSIDESLERLQLDYVDIF 123

Query: 126 HCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILS 185
            CHDIEFGSLDQVVNET+PALQKLKEAGK RFIGITGLPLEIFTYVLDRVPPGT+DVILS
Sbjct: 124 QCHDIEFGSLDQVVNETIPALQKLKEAGKIRFIGITGLPLEIFTYVLDRVPPGTVDVILS 183

Query: 186 YCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAA 245
           YCHYSINDSTL DLLPYLK+KGVGIISASPLAMGLLT+ GPPEWHPASPELKSACQAAA 
Sbjct: 184 YCHYSINDSTLEDLLPYLKTKGVGIISASPLAMGLLTEFGPPEWHPASPELKSACQAAAV 243

Query: 246 HCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVE 305
           +CK+KGKNISKLA+QYSL+NKDISTVLVGMNSV+QVEENVAAA ELA FG+D ETL+EVE
Sbjct: 244 YCKEKGKNISKLAMQYSLSNKDISTVLVGMNSVKQVEENVAAATELALFGKDHETLAEVE 303

Query: 306 AILIPVKNQTWPSGIQKS 324
           AIL PVKNQTWPSGIQ+S
Sbjct: 304 AILKPVKNQTWPSGIQQS 321

BLAST of Cla006823 vs. NCBI nr
Match: gi|778724639|ref|XP_004136903.2| (PREDICTED: L-galactose dehydrogenase [Cucumis sativus])

HSP 1 Score: 615.5 bits (1586), Expect = 5.2e-173
Identity = 308/323 (95.36%), Positives = 315/323 (97.52%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA SYPKL LR+LGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAF LGINFFDTS
Sbjct: 1   MAVSYPKLPLRELGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFCLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYGR LSEKMLGKGLKALGVPRSEYIVSTKCGRYG+GFDFSAERVTRSIDESLARLQLD
Sbjct: 61  PYYGRNLSEKMLGKGLKALGVPRSEYIVSTKCGRYGEGFDFSAERVTRSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDILHCHDIEFGSLDQ+VNET+PALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGT+
Sbjct: 121 YVDILHCHDIEFGSLDQIVNETIPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTV 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTD GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDQGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           QAAAAHC+KKGKNISKLA+QYSL NKDISTVLVGMNSV QVEENVAAAEELATFGRDEET
Sbjct: 241 QAAAAHCRKKGKNISKLAIQYSLVNKDISTVLVGMNSVGQVEENVAAAEELATFGRDEET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAILIPVKNQTWPSGIQ S
Sbjct: 301 LSEVEAILIPVKNQTWPSGIQNS 323

BLAST of Cla006823 vs. NCBI nr
Match: gi|659110219|ref|XP_008455112.1| (PREDICTED: L-galactose dehydrogenase [Cucumis melo])

HSP 1 Score: 613.6 bits (1581), Expect = 2.0e-172
Identity = 306/323 (94.74%), Positives = 315/323 (97.52%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA SYPKL LR+LGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS
Sbjct: 1   MAVSYPKLPLRELGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYGR LSEKMLGKGLKALGVPRSEYIVSTKCGRYG+GFDFSAERVTRSIDESLARLQLD
Sbjct: 61  PYYGRNLSEKMLGKGLKALGVPRSEYIVSTKCGRYGEGFDFSAERVTRSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDILHCHDIEFGSLDQ+VNET+PALQKLKEAGKTRF+GITGLPLEIFTYVLDRVPPGTI
Sbjct: 121 YVDILHCHDIEFGSLDQIVNETIPALQKLKEAGKTRFLGITGLPLEIFTYVLDRVPPGTI 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTD GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDQGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           QAAA HC+KKG+NI+KLA+QYSL NKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET
Sbjct: 241 QAAADHCRKKGRNITKLAIQYSLVNKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAIL PVKNQTWPSGIQ S
Sbjct: 301 LSEVEAILNPVKNQTWPSGIQNS 323

BLAST of Cla006823 vs. NCBI nr
Match: gi|224103819|ref|XP_002313206.1| (L-galactose dehydrogenase family protein [Populus trichocarpa])

HSP 1 Score: 567.4 bits (1461), Expect = 1.6e-158
Identity = 279/323 (86.38%), Positives = 306/323 (94.74%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA  +P L+LR LGNTGLK S VGFGASPLGSVF PVSE DA+++VREAFRLGINFFDTS
Sbjct: 1   MAYPHPNLELRPLGNTGLKLSCVGFGASPLGSVFGPVSEHDAISSVREAFRLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYG TLSEKMLG+GLKALGVPR+EYIVSTKCGRY +GFDFSAERVT+SIDESLARLQLD
Sbjct: 61  PYYGGTLSEKMLGQGLKALGVPRNEYIVSTKCGRYVEGFDFSAERVTKSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDIL CHDIEFGSLDQ+VNET+PAL+KLKEAGK RFIGITGLPL +FTYVLDRVPPGT+
Sbjct: 121 YVDILQCHDIEFGSLDQIVNETIPALRKLKEAGKIRFIGITGLPLGVFTYVLDRVPPGTV 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYC YSINDSTL DLLPYLKSKGVG+ISASPLAMGLLT++GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCRYSINDSTLADLLPYLKSKGVGVISASPLAMGLLTENGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           +AAAA CK+KGKNISK+A+QYSL+NKDIS+VLVGMNSVRQVEENV+AA ELATFG+D+ET
Sbjct: 241 EAAAAFCKEKGKNISKIAMQYSLSNKDISSVLVGMNSVRQVEENVSAATELATFGKDQET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAILIPVKNQTWPSGIQ+S
Sbjct: 301 LSEVEAILIPVKNQTWPSGIQQS 323

BLAST of Cla006823 vs. NCBI nr
Match: gi|743795568|ref|XP_011002821.1| (PREDICTED: L-galactose dehydrogenase-like [Populus euphratica])

HSP 1 Score: 566.2 bits (1458), Expect = 3.6e-158
Identity = 279/323 (86.38%), Positives = 305/323 (94.43%), Query Frame = 1

Query: 1   MAASYPKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTS 60
           MA  +P L+LR LGNTGLK S VGFGASPLGSVF PVSE DA+++VREAF LGINFFDTS
Sbjct: 1   MAYPHPNLELRPLGNTGLKLSCVGFGASPLGSVFGPVSEHDAISSVREAFLLGINFFDTS 60

Query: 61  PYYGRTLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLD 120
           PYYG TLSEKMLGKGLKALGVPR+EYIVSTKCGRY +GFDFSAERVT+SIDESLARLQLD
Sbjct: 61  PYYGGTLSEKMLGKGLKALGVPRNEYIVSTKCGRYVEGFDFSAERVTKSIDESLARLQLD 120

Query: 121 YVDILHCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTI 180
           YVDIL CHDIEFGSLDQ+VNET+PAL+KLKEAGK RFIGITGLPL +FTYVLDRVPPGT+
Sbjct: 121 YVDILQCHDIEFGSLDQIVNETIPALRKLKEAGKIRFIGITGLPLGVFTYVLDRVPPGTV 180

Query: 181 DVILSYCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSAC 240
           DVILSYC YSINDSTL DLLPYLKSKGVG+ISASPLAMGLLT++GPPEWHPASPELKSAC
Sbjct: 181 DVILSYCRYSINDSTLADLLPYLKSKGVGVISASPLAMGLLTENGPPEWHPASPELKSAC 240

Query: 241 QAAAAHCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEET 300
           +AAAA CK+KGKNISK+A+QYSL+NKDIS+VLVGMNSVRQVEENV+AA ELATFG+D+ET
Sbjct: 241 EAAAAFCKEKGKNISKIAMQYSLSNKDISSVLVGMNSVRQVEENVSAATELATFGKDQET 300

Query: 301 LSEVEAILIPVKNQTWPSGIQKS 324
           LSEVEAILIPVKNQTWPSGIQ+S
Sbjct: 301 LSEVEAILIPVKNQTWPSGIQQS 323

BLAST of Cla006823 vs. NCBI nr
Match: gi|590682881|ref|XP_007041459.1| (NAD(P)-linked oxidoreductase superfamily protein [Theobroma cacao])

HSP 1 Score: 565.1 bits (1455), Expect = 8.0e-158
Identity = 280/318 (88.05%), Positives = 303/318 (95.28%), Query Frame = 1

Query: 6   PKLQLRDLGNTGLKTSSVGFGASPLGSVFSPVSEEDAVAAVREAFRLGINFFDTSPYYGR 65
           PKL+LR LGNTGLK SSVGFGASPLGSVF PVSE DAVA+VREAFRLGINFFDTSPYYG 
Sbjct: 24  PKLELRPLGNTGLKLSSVGFGASPLGSVFGPVSESDAVASVREAFRLGINFFDTSPYYGG 83

Query: 66  TLSEKMLGKGLKALGVPRSEYIVSTKCGRYGDGFDFSAERVTRSIDESLARLQLDYVDIL 125
           TLSEKMLGKGLKALGVPR+EYIVSTKCGRY +GFDFSAERVT+SIDESL RLQLDYVDIL
Sbjct: 84  TLSEKMLGKGLKALGVPRNEYIVSTKCGRYREGFDFSAERVTKSIDESLERLQLDYVDIL 143

Query: 126 HCHDIEFGSLDQVVNETVPALQKLKEAGKTRFIGITGLPLEIFTYVLDRVPPGTIDVILS 185
            CHDIEFGSLDQVVNET+PALQKLKEAGK RFIGITGLPLEIFT+VLDRVPPGT+DVILS
Sbjct: 144 QCHDIEFGSLDQVVNETIPALQKLKEAGKIRFIGITGLPLEIFTFVLDRVPPGTVDVILS 203

Query: 186 YCHYSINDSTLLDLLPYLKSKGVGIISASPLAMGLLTDHGPPEWHPASPELKSACQAAAA 245
           YCHYSINDSTL DLLPYLK+KGVG+ISASPLAMGLLT+ GPP+WHPASPELKSACQAAAA
Sbjct: 204 YCHYSINDSTLEDLLPYLKNKGVGVISASPLAMGLLTELGPPDWHPASPELKSACQAAAA 263

Query: 246 HCKKKGKNISKLALQYSLANKDISTVLVGMNSVRQVEENVAAAEELATFGRDEETLSEVE 305
           +CK+KGKNISKLA+QYSL+N+DIS+VLVGMNSV+QVEENVAAA E+A FG+D ETLSE+E
Sbjct: 264 YCKEKGKNISKLAMQYSLSNEDISSVLVGMNSVKQVEENVAAATEVALFGKDLETLSEIE 323

Query: 306 AILIPVKNQTWPSGIQKS 324
           AIL PVKNQTWPSGIQ+S
Sbjct: 324 AILKPVKNQTWPSGIQRS 341

The following BLAST results are available for this feature:
Swiss-Prot:
Match Name    E-value   Identity (%)  Description
GALDH_ARATH   2.9e-152  83.60         L-galactose dehydrogenase OS=Arabidopsis thaliana GN=LGALDH PE=1 SV=1
ARA2_YEAST    2.0e-28   29.45         D-arabinose 1-dehydrogenase OS=Saccharomyces cerevisiae (strain ATCC 204508 / S2...
FCDH_PSESP    6.6e-27   29.47         D-threo-aldose 1-dehydrogenase OS=Pseudomonas sp. GN=fdh PE=1 SV=1
YQKF_BACSU    4.4e-23   28.09         Uncharacterized oxidoreductase YqkF OS=Bacillus subtilis (strain 168) GN=yqkF PE...
AKR1_SOYBN    2.2e-22   29.69         Probable aldo-keto reductase 1 OS=Glycine max GN=AKR1 PE=2 SV=1

TrEMBL:
Match Name        E-value   Identity (%)  Description
E5GCR2_CUCME      1.4e-172  94.74         L-galactose dehydrogenase OS=Cucumis melo subsp. melo PE=4 SV=1
B9HN60_POPTR      1.1e-158  86.38         L-galactose dehydrogenase family protein OS=Populus trichocarpa GN=POPTR_0009s08...
A0A061DX80_THECC  5.6e-158  88.05         NAD(P)-linked oxidoreductase superfamily protein OS=Theobroma cacao GN=TCM_00635...
A0A0D2TWW3_GOSRA  5.6e-158  88.36         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G042900 PE=4 SV=1
A0A0B0NLB1_GOSAR  5.6e-158  88.68         D-arabinose 1-dehydrogenase OS=Gossypium arboreum GN=F383_18998 PE=4 SV=1

NCBI nr:
Match Name                        E-value   Identity (%)  Description
gi|778724639|ref|XP_004136903.2|  5.2e-173  95.36         PREDICTED: L-galactose dehydrogenase [Cucumis sativus]
gi|659110219|ref|XP_008455112.1|  2.0e-172  94.74         PREDICTED: L-galactose dehydrogenase [Cucumis melo]
gi|224103819|ref|XP_002313206.1|  1.6e-158  86.38         L-galactose dehydrogenase family protein [Populus trichocarpa]
gi|743795568|ref|XP_011002821.1|  3.6e-158  86.38         PREDICTED: L-galactose dehydrogenase-like [Populus euphratica]
gi|590682881|ref|XP_007041459.1|  8.0e-158  88.05         NAD(P)-linked oxidoreductase superfamily protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001395  Aldo/keto reductase/potassium channel subunit beta
IPR023210  NADP_OxRdtase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006012 galactose metabolic process
biological_process GO:0019853 L-ascorbic acid biosynthetic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0042816 vitamin B6 metabolic process
cellular_component GO:0005829 cytosol
molecular_function GO:0019151 galactose 1-dehydrogenase activity
molecular_function GO:0010349 L-galactose dehydrogenase activity
molecular_function GO:0050235 pyridoxal 4-dehydrogenase activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU28406      watermelon EST collection version 2.0  transcribed_cluster
WMU63893      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla006823     Cla006823.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU28406      WMU28406     transcribed_cluster
WMU63893      WMU63893     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                                     Source   Source Term        Source Description            Alignment
IPR001395  Aldo/keto reductase/potassium channel subunit beta  PANTHER  PTHR11732          ALDO/KETO REDUCTASE           coord: 8..322; score: 6.4E
IPR023210  NADP-dependent oxidoreductase domain                GENE3D   G3DSA:3.20.20.100  -                             coord: 8..294; score: 4.2
IPR023210  NADP-dependent oxidoreductase domain                PFAM     PF00248            Aldo_ket_red                  coord: 23..289; score: 2.2
IPR023210  NADP-dependent oxidoreductase domain                unknown  SSF51430           NAD(P)-linked oxidoreductase  coord: 9..308; score: 2.36
None       No IPR available                                    PANTHER  PTHR11732:SF14     HYPERKINETIC, ISOFORM M       coord: 8..322; score: 6.4E

The following gene(s) are orthologous to this gene:
Gene       Orthologue       Organism                      Block
Cla006823  Cla97C02G034220  Watermelon (97103) v2         wmwmbB326
Cla006823  ClCG02G007540    Watermelon (Charleston Gray)  wcgwmB208
The following gene(s) are paralogous to this gene:

None