HG10010404 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10010404
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionaldehyde dehydrogenase 22A1-like
LocationChr06: 21918133 .. 21926248 (-)
RNA-Seq ExpressionHG10010404
SyntenyHG10010404
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACGACGGGAATCAGACGCAGGAGAATAGCTACATCTACGTAAGTATTTCCGGAGATTCTACAATTGATTATAGCCATAATTAGCTTTTATGGTACATTGTTGCTTCATTTCCTGCGTTAGTTTTTGGAATTAGTCGTCAATCTCATACTTTGATGTAAAACTGAACTGTTTGGTATCGAGGTATGTGAATATGAGTAGTTTCTTGAAATTCGCATTCATGAACTATTGTCCCAGGCTGTGATTCTTTAAACTACACTCTCGGTATTTTCTCACTTCGATTTAAAGTACACTGTTAAGTAAAATATGCGCAAGTTCTTGAAATTTCTATTTCGGAACTATCGTTTTATTTATGTAAACTATTGTTTCTTTTCGTACCAAGTCTGCATTCTCTCAATTCGATTTAAATTACACTGTTTAGTGTCCGAGCTATATGAATTTGAGTACGTTCTTAAAATCTCTATGCTGGAACCACCATCTCAGGCTTTGGTTTTTTAGACTACTGAGCTAAACAGATTACATGATTGATTTAACGACAATAGTTTTGATGATACATTGCTTCATTTCTAGCATTAAGTTTTTGTTATATGTCGATATTCTCTTACTTTGATGTAATCTACACGGTTTAGTCTATATGAATATGAGTAAACTATTAAAATTTCTATTCGGGAATTATATCTCAGGCTATTATTTTATTCTGTTATGGAACTATACTCTTTGAGATGATGTTCTTGGACCTGGTTAAAACAAACTACTAGCTTACAACTTTGAAAATCGGAAATCCTTTTAAGTTCTATTCTGAAAATTTTATTGACTCAGGCTAACATATTTATTGGCTAACACAAACCATTCGTTCTCCCATATCGAAGTTTGCAGAAGGCAAGGACTTTTGTTTCAAATTTTCTAAGGCAAAACTATCTGATTTCATAATTTTGGTTGGTGATTGCTGAACAGATTCCACCTAGGGTAAAGACACAACAGCAAGATAAGAGAGTCCAGTGTTATGAGCCTGCAACTATGAAATACCTGGGCTATTTTCCAGCATTGTCACGAGACGAGGTTAGAACTCATTGCTGGCTGGCACTAGCTTGATTGGTGAACTAGTTAAAAATTAGCATTTGTCTTGTTATTATTTCTGTTGAATTTGGATGGATGGGAGATTTTTCATTGTGTTTCATGCTGTAGTTTGTATCTTTCATGACAATCCTACCCTGTGATTCTTTTTTTCTCGATCTTTTGATAATTGATGGTAGGGGAAGGACTAGCAATTCTCTACTTCAAAGGAACATTACCTCATGTTTCCCTCTTTCATTCATCTAGTTGTCATCTCTATATAATATTTCTCATATTGATGGCATTAGTTATTAGTATTGCTAAATGAGAGGGAGAGAGAGAAACTGAAGTGGTTGATTACACTTATTCTGTTAAATTCTATGCTCAGGTCAAGGAGCGTGTTGCTTCAGCTAGGAAAGCGCAGAAAGAATGGGCCAAAAGTAGCTTCAAACAAAGGCGTTTGCTTTTACGGATACTTTTGAAATACATTATTGAACATCAAGAGCTCATTTGCGAGTAAGATCCAGTTATTTTAATATACTATAGTTTTTTGTTATTTTTATTTTCTCCAAGAGTTAAAATTAAAAAAATAATAATTATATATATTAAAAAAAAAAAAACAAACAAACGGGACAAGCAAAGCAGCCATTCTGTCATTCTAATTATTTATCATGTGGGATAGGATCTCCTCCCGTGAAACTGGAAAAACAATAGTGGATGCTAATATGGGAGAAGTAATGGCGACATGTGAAAAGATTACTTGGCTTCTTTCAGAGGGTGAGAAATGGCTGAAGCCTGAGAGCCGGTATCTCTCCTTTCTCTTACTACTTTTCCCCTCTTGAAAATATGGTGTTGCAGCTGATAACATTTCTTACTCATGTTGAAAACAGGTTAAATTACAATTTCTGTATCTTTCCTTCAGCAGTTAATTATTTCTCTTCCCTGTGACCAACATTGAAATTACAAATTTTTTTTGGTTTGGAGTGGGTTGTGTGTGTGTGTGTGTGTGTGTGTGTGGGGGGGGGGAGGCTTCCAAGTTGTTGCAAATGGCTCAGAAAGAGTAACCTGAAAAAGACTTTGATAAGGAACATCAAAGAGAAGCCTTAATCATGATGTTCTTGAAGCAATCCAACCAAGAGGAAAAATTATCTTAAAAAATTCTGATTTCTTTTGATCCAAATCGAAAGAATGGCTTTTAAAGCATTAATTCATAATAATTTGTCTTTGATAGAAAGAGCTGGAGCACAAAGAAGTTGATAAACATTTTAGTGGATATCATTGCTAAAAAATTGAAAATCCAGCTCATATTGAAGATAGAGAAGAAATAGGACCAACAAAACCTTGAGAAAGAGCACAGAAAAAAATAGATGAAAGTGGACTTCAGAAGCCGCCATACATTTCTCTGTTGCTTTTATAGTAACCTAGCATCTTCGAGGCATTCCAGCTGGACATGCAACTTGTCACTGTGAAAATGATATTGTCCCAATGTCCCTAGTTATGTTGGATTTGGATTATATTTTGATTAAGAAAGACTGAAAGAGAGCACTTCAAATTCTGGATGGTTAAATTCGTTATGCATTTGATCTCTTTAACTTATCCCAATTTGCAGATCATGTGGAAGGGCAACACTTCACAAGAAAGCCCGAGTAGAATTTCATCCCCTTGGAGTTATCGGGGCGATAGTCCCATGGAACTATCCATTCCATAACATTTTCAATCCAGTGCTTGCAGCAGTCTTTGCAGGAAACGGCATTGTAGTTAAGGTATTAAAGTTGAAAGGTTTCTTTTAATACTTTTGAGATATATCATCTTTTTTTATGCAATGCTATGTCAGAAACTACTTACTGCAGCAAATTTCATGTCTTACCATCCCAACCAATCATTGCATGTGCCCTACCCTGCTCTTAAGATGTCATGCATCATTAGGATACATGGATGCATATCCTCAAGGAATTTCAATATTTAGGACTGTTTTTCATTTATCTTATTGAAACCTCTGTTAAAAAAATAATAATAATATTAATAAATATATAAATTAGAGTTTTCTTTATGAAAAGAAAAACTTCAAATAGCTTTTGTTTGTAGTTTGGTTTGGTGTATTTGGTCCTCATTTTAGTTGTTTTTTAAGGCTCTTGTATTTTTGGTGTGTTGTTTCTTTGGGTGTTTTGTTTTTTATTCTCTCCCTTTGGGAATTGTATCCTTGAACATCTTTCTTCCTCTTCATTTATCAGTGAAACGTTTGTTTCTTTTTTTTGTTTAAAAAAAACAAAAAACAAAAAACAAAACTTCAACTAAGGACTTTGCAAAGGTTCAGTATCATGATCCTTTTCTTTAATGAAACAAATTTGTTTCGTACTGGTTTTGTTTTAATGTCATGCAGTTGATTGAAATTGTAGGTTTCAGAACATGCAAGTTGGTCAGGATGCTTTTATGTCAGGATTATCCAAGCTGCACTTGCTGCTGTTGGAGCTCCTGAGAGTTTAGTTGATGTGATAACAGGGTAGACATTTTAACTTTTCATTTGAACTATCTTTACGAGGCTGTTCATTTGTCCACTTTAATATGATCTTTTATCAGGTTTGCTGAAACAGGTGAAGCGCTAGTATCTTCAGTAGACAAAATGATATTTGTTGGATCTACTGGTGTGGGCAGGATGGTATGAACTATAATTGGTCTGGAATATTATAATTTTTTACTCCCATATTTTTGGTTCTTTTCATTTATTGATGCTGTTGTTTAGATGTCTTCCAAATTTATAATCATCAAAACTCTGAAGCCGTGTGGGGTCTGTTGTGGCAGAATAGATATCCATTGGGTTCAAAGTTGTGGGGATGAGAGCTAAATTTTCTATTATTGCGTTGCTACGATTATTTTGCAATCATGCTCAAAGTGAAGAAGATTGCCGATGATATTGTTTACAAAGAATGAGAATTAAACCTGAATTTGTTGGAATAAATAAATAGAAAAGCAAACAAAGAATGATAATTAAACCTGAATTTGTTGGAATAAATAAATAGAAAAGCAAACAAGATAGTGCTGGGACTCATTATACATGTTCTACCGAAAAACGTGTGGGGTCTGTAGGGATTCTTTCTCCATTCTATTTCTATTTTATAAACAATGGAAAGAAGGCTAGTAGGAAATGATTTCCATTCTGTATGAGTATTGTAGACTGATTAATTTGCTTATTTCTAGCAAATAAAGTAACACCCATTCTGCTTGGAAGTTCATTTGTTGTTTCCCATTGAATGTTAGTTATTATAGTGGGGCTTTTATTTATGGTCTTTTTACAATAAAAGAGATGGAAAAAGTGACTTGGTTTCTAGCACCCACCTCTAAGCTGTGTGTTACATATTTGGCATGACTTTTTGCAGATAATGAAAACTGCTGCTGAGACACTTATTCCAGTTACTCTTGAGTTGGGTGGAAAGGACGCATTTATCGTGTGTGAGGATGTAGATCTGGATCACGTATGAACACTCTACAGTTTTACTATGTACAAGTAGCTCTACTAGTTTAATATGCTTTCATGTCATAGAGAACTGATGGGAGTTTCTTTACTTAATATAGGTTGTAAACGTTGCTCTCAGGGCTTCTATTACATCAAGTGGACATAATTGTACTGGAGCTGAGAGATTTTATGTCCACAAGAACATTTATTCTTCGTTTGTGGATAAAATATCAGAACGTGTAAAGGCTATTACAGTTGTAAGTATGAGTTCATATTATATATACGTATTAAAATTCTGTCCATCCTTCTGCTGGGGTATAGGGTTCATGTGGTTTTTCAATTTCAAATTTCTACTTTACATTCTGCACATACGGGTCATTTGGAAGGTATGTGGCGTCTTGCAAAATGGACATTAGGAGTGGCAACTGAAAAAGCTTCCACGGGGATTAAAATTTCAAGGTTCTTAGTAGTTCTTTTTGCTTCATAGGGTGTGGGAAGTGTTTTTTGTTCTCACCACACTCTGTATTTCCAGTGGAAAGTTTTCAATTCAACGTTAAGAAACATCCAAAGAAAAATAGGAATTTCACGACTCATGTAAATAATAAAAACTGTAATTAAACAAAAGAATGTTACATGAAAGATTACTCTTTGAAGCCAGTAAGAAGATTTACAAGTAAGGTATCTGCTAGAATTGTTGTACCATATTCTTCTATCAACAATAAGAGATAGGTTTAAGTCTTGAATAAGAGCCGTTTGAGATGAAGGTCCAATGAGCTCAAAGATCATTGAGAATCAAAGTATTTTGGTTTTAAGAAACTCTATATAGACGAAGTAACAAAACTTTTAATGAGTTAGAATACTAAGCCAAATGACTTCTGGAATTGGGTTTTGGAACCATCTTTCCATATGAATTTTACACAAGGAATGTACGAATTAAAACCTTGAGGGATAAATTTCCAAGGACAAAAATGGGAGGTCATCAACTCAAGAATCTCAACCATTGATAGAAAGAACCTATTCACTTTTAGCGGAAGCTGGAAAATCTCAACAGCAGAAATTCCACAAGTCTCTACTTTGATGGTAATAATCGCTGAATGCAATATGGGCACGTAAATTCGTCTAAATGTTTCTGTGGTTGAATAATCATGCACCATCTATTTTGGTTGCCATATTGTATCTGTTCTATGTGAGGATCTTATTAGGACTAAGTCTTGATTACCTGTGGAACAAATAATTTTGATCAAATTCTGAATCTTATTTTATAATTGAAAGATGAGACTTTCTTCTCAAAAATGTATTTCAGGGTCCACCATTAGCTGGGAAATATGATATGGGTGCTATCTGTACGCAAGAGCAGTCTGAGAAACTTCAAAGCCTTGTTAATGATGCCTTAGACAGAGGGGCAAAAATTGTTGCTCGTGGAACTTTTGGACATCTACCTGAAGGTGCCGTTGATCAATATTTCCCGCCCACCGTTATTGTTGATGTCAACCATACGATGAAGTTAATGCAAGAGGAGGTAGCAATACTTGACATGCTTTTTTTATGCAATAGATTTTTTGGTACTCCCTATCCCTTTTAGTCGGCCTATACTATATTTTGTTAGAATAATTCTTCAAATAGATCTTGGAAACCTACTTAATACTTACTCTGACAACGTGATGGTGCTGTTTGTATTTGAAAGTCTTTAGAATTAAAAACAAAAAAATAGTTATTATAAATTATATATATAGAACAACTAAGACCCCTTTTGATAGTAATATAGAACAACTAAGACCCCTTTTGATAGTTATTATAAATATTAGTTTCTTTGATTTATTTTTTAATCTTTGAAAATATTTTTAAAATACAAACAATTTTGAAAGAAAATGTAGTTTTTTCTCTTTGGGAATTTGGCTAAAAATTTAAATCTTTCTTTAAGAAAGATAAAAAGCATAGTAAACAAATTGAAAACGAGCACAATTTTAGAAAACAAAAGACAAACAAACTTGTTATTAAATGGTCTTTTTTTTTCCTTTTTTTTTAAAAAAAATTAAATTTATATTTTATGAAATTGGTCTCTAGAAAAATCAGTGTATAACTGGAGTGGCCATCTTATCTTTTATATTAAGTTTGCAGCGGAATTGATCTCTAGAAAAATATCTTCTGGCTCCCAAACTCTCAAAGTTAAGCAAATGTGTTAACTTAGTGGTGGCATACAGGCATTTGGACCAATCCTGCCTATAATGAAATTCAGCACAGATGATGAGGCTGTAAAGCTTGCAAATGATTCGAGGTTTGGGCTTGGCTGTGCTGTCTTTTCTGGCAGTCAGCGGCGTGCCAAGAATATTGCTTCGCAGATACATTCTGGAAGTGTTGCAATTAATGACTTTGCCACAAATTATTTGTGTCAGGTAATATTCATTCTGAAGCATGGATGCAATGCTTTAGTTATGTTTACTAATCATCTTTGACACGGTTGTACACTTATTGGTACAGTTATCTTATCAAGCACATGTTCATACTGACTCAGGGTCACTTGTTAGACATTTATTGCGCATTTGTTGGTTTTACCTATTTGTATTTGTTTGTTTGAGAAACTAATGGATGGCAAGAATTATTAGATAATTTGTATATTTTAGTTAATATTTAGTTTAGTTTTAATTTAAATTAATTAGTGTCTAATTAGTTTTTAGAATTAATTAGCATATTGTTTTATTTCCTTGTAAGGTTATAAATAGCCTCTTTAGGTTGAAATAGGAAGCTTTTGGAATATCATTTTGAATATAGAACGTTATTCTTAGAGAAAGTTTTCTCAACTTAGGGATATATTTCCCTTGTTTAGCCACAAATTTGTCCTAATGCTTTGACCCGCTGCCTCACTAATTGTCATAGAAAGAAATCATTTCATGTGTGCACTATATAACTTGTAACGGATCCATTTTTTGAGATATTGAATTTATAGTTTAGTTGCCCATGTATCAGGAAGAATTTGTTAGTGTTATAGGGCAATTACATGTATTGAATTGTATAATGCATGCTTGACTAGAAAAACTCAAAAACTGATTCAAGTTATAGTCTTATATCAATGTTTGGATCTTCGATCTTTGTTGCGTTAGTTTATTTTACTTAGCAATAGCGTATCACCTCAATCAGTTACCTAGATCTTTACTATTTATGGTTTGTGCCACATTTATACTAAGGTCTTTTGATTTTCAGTCCTTGCCATTTGGTGGCGTAAAAGAGAGTGGATTCGGCCGATTTGCGGGCGTTGAAGGATTACGAGCTTGTTGTCTTGTTAAAGCAGTAGTTGAGGATAGATGGTGGCCATACATTCAGACTAAGCATCCTAAGCCTCTTACGGTGAGCCATTATAAGAATTTTGACAAAGTACTTATTTTTGCTTTTTATTTCATACGTATATATTGAAACGGTTCGTATTCTTTTTCGATTAGTATCCTGTTGCAGACAACGCCTTTGAGTTTCAGATGTCACTGGTTGAAGCAACGTACGGGCTGAACATATGGGATCGATTAAGAGCATTGGTTAATGTTCTGAAGATGCTGTCCGAGCAGAACACTCCGGCTAAAGATAACTCTACCGTTGATGATGGAAGTAAGAGAGCTAATTGA

mRNA sequence

ATGGACGACGGGAATCAGACGCAGGAGAATAGCTACATCTACATTCCACCTAGGGTAAAGACACAACAGCAAGATAAGAGAGTCCAGTGTTATGAGCCTGCAACTATGAAATACCTGGGCTATTTTCCAGCATTGTCACGAGACGAGGTCAAGGAGCGTGTTGCTTCAGCTAGGAAAGCGCAGAAAGAATGGGCCAAAAGTAGCTTCAAACAAAGGCGTTTGCTTTTACGGATACTTTTGAAATACATTATTGAACATCAAGAGCTCATTTGCGAGATCTCCTCCCGTGAAACTGGAAAAACAATAGTGGATGCTAATATGGGAGAAGTAATGGCGACATGTGAAAAGATTACTTGGCTTCTTTCAGAGGGTGAGAAATGGCTGAAGCCTGAGAGCCGATCATGTGGAAGGGCAACACTTCACAAGAAAGCCCGAGTAGAATTTCATCCCCTTGGAGTTATCGGGGCGATAGTCCCATGGAACTATCCATTCCATAACATTTTCAATCCAGTGCTTGCAGCAGTCTTTGCAGGAAACGGCATTGTAGTTAAGGTTTCAGAACATGCAAGTTGGTCAGGATGCTTTTATGTCAGGATTATCCAAGCTGCACTTGCTGCTGTTGGAGCTCCTGAGAGTTTAGTTGATGTGATAACAGGGTTTGCTGAAACAGGTGAAGCGCTAGTATCTTCAGTAGACAAAATGATATTTGTTGGATCTACTGGTGTGGGCAGGATGATAATGAAAACTGCTGCTGAGACACTTATTCCAGTTACTCTTGAGTTGGGTGGAAAGGACGCATTTATCGTGTGTGAGGATGTAGATCTGGATCACGTTGTAAACGTTGCTCTCAGGGCTTCTATTACATCAAGTGGACATAATTGTACTGGAGCTGAGAGATTTTATGTCCACAAGAACATTTATTCTTCGTTTGTGGATAAAATATCAGAACGTGTAAAGGCTATTACAGTTGGTCCACCATTAGCTGGGAAATATGATATGGGTGCTATCTGTACGCAAGAGCAGTCTGAGAAACTTCAAAGCCTTGTTAATGATGCCTTAGACAGAGGGGCAAAAATTGTTGCTCGTGGAACTTTTGGACATCTACCTGAAGGTGCCGTTGATCAATATTTCCCGCCCACCGTTATTGTTGATGTCAACCATACGATGAAGTTAATGCAAGAGGAGGCATTTGGACCAATCCTGCCTATAATGAAATTCAGCACAGATGATGAGGCTGTAAAGCTTGCAAATGATTCGAGGTTTGGGCTTGGCTGTGCTGTCTTTTCTGGCAGTCAGCGGCGTGCCAAGAATATTGCTTCGCAGATACATTCTGGAAGTGTTGCAATTAATGACTTTGCCACAAATTATTTGTGTCAGTCCTTGCCATTTGGTGGCGTAAAAGAGAGTGGATTCGGCCGATTTGCGGGCGTTGAAGGATTACGAGCTTGTTGTCTTGTTAAAGCAGTAGTTGAGGATAGATGGTGGCCATACATTCAGACTAAGCATCCTAAGCCTCTTACGTATCCTGTTGCAGACAACGCCTTTGAGTTTCAGATGTCACTGGTTGAAGCAACGTACGGGCTGAACATATGGGATCGATTAAGAGCATTGGTTAATGTTCTGAAGATGCTGTCCGAGCAGAACACTCCGGCTAAAGATAACTCTACCGTTGATGATGGAAGTAAGAGAGCTAATTGA

Coding sequence (CDS)

ATGGACGACGGGAATCAGACGCAGGAGAATAGCTACATCTACATTCCACCTAGGGTAAAGACACAACAGCAAGATAAGAGAGTCCAGTGTTATGAGCCTGCAACTATGAAATACCTGGGCTATTTTCCAGCATTGTCACGAGACGAGGTCAAGGAGCGTGTTGCTTCAGCTAGGAAAGCGCAGAAAGAATGGGCCAAAAGTAGCTTCAAACAAAGGCGTTTGCTTTTACGGATACTTTTGAAATACATTATTGAACATCAAGAGCTCATTTGCGAGATCTCCTCCCGTGAAACTGGAAAAACAATAGTGGATGCTAATATGGGAGAAGTAATGGCGACATGTGAAAAGATTACTTGGCTTCTTTCAGAGGGTGAGAAATGGCTGAAGCCTGAGAGCCGATCATGTGGAAGGGCAACACTTCACAAGAAAGCCCGAGTAGAATTTCATCCCCTTGGAGTTATCGGGGCGATAGTCCCATGGAACTATCCATTCCATAACATTTTCAATCCAGTGCTTGCAGCAGTCTTTGCAGGAAACGGCATTGTAGTTAAGGTTTCAGAACATGCAAGTTGGTCAGGATGCTTTTATGTCAGGATTATCCAAGCTGCACTTGCTGCTGTTGGAGCTCCTGAGAGTTTAGTTGATGTGATAACAGGGTTTGCTGAAACAGGTGAAGCGCTAGTATCTTCAGTAGACAAAATGATATTTGTTGGATCTACTGGTGTGGGCAGGATGATAATGAAAACTGCTGCTGAGACACTTATTCCAGTTACTCTTGAGTTGGGTGGAAAGGACGCATTTATCGTGTGTGAGGATGTAGATCTGGATCACGTTGTAAACGTTGCTCTCAGGGCTTCTATTACATCAAGTGGACATAATTGTACTGGAGCTGAGAGATTTTATGTCCACAAGAACATTTATTCTTCGTTTGTGGATAAAATATCAGAACGTGTAAAGGCTATTACAGTTGGTCCACCATTAGCTGGGAAATATGATATGGGTGCTATCTGTACGCAAGAGCAGTCTGAGAAACTTCAAAGCCTTGTTAATGATGCCTTAGACAGAGGGGCAAAAATTGTTGCTCGTGGAACTTTTGGACATCTACCTGAAGGTGCCGTTGATCAATATTTCCCGCCCACCGTTATTGTTGATGTCAACCATACGATGAAGTTAATGCAAGAGGAGGCATTTGGACCAATCCTGCCTATAATGAAATTCAGCACAGATGATGAGGCTGTAAAGCTTGCAAATGATTCGAGGTTTGGGCTTGGCTGTGCTGTCTTTTCTGGCAGTCAGCGGCGTGCCAAGAATATTGCTTCGCAGATACATTCTGGAAGTGTTGCAATTAATGACTTTGCCACAAATTATTTGTGTCAGTCCTTGCCATTTGGTGGCGTAAAAGAGAGTGGATTCGGCCGATTTGCGGGCGTTGAAGGATTACGAGCTTGTTGTCTTGTTAAAGCAGTAGTTGAGGATAGATGGTGGCCATACATTCAGACTAAGCATCCTAAGCCTCTTACGTATCCTGTTGCAGACAACGCCTTTGAGTTTCAGATGTCACTGGTTGAAGCAACGTACGGGCTGAACATATGGGATCGATTAAGAGCATTGGTTAATGTTCTGAAGATGCTGTCCGAGCAGAACACTCCGGCTAAAGATAACTCTACCGTTGATGATGGAAGTAAGAGAGCTAATTGA

Protein sequence

MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWLLSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNVLKMLSEQNTPAKDNSTVDDGSKRAN
Homology
BLAST of HG10010404 vs. NCBI nr
Match: XP_038875847.1 (aldehyde dehydrogenase 22A1 [Benincasa hispida])

HSP 1 Score: 1122.5 bits (2902), Expect = 0.0e+00
Identity = 558/565 (98.76%), Postives = 563/565 (99.65%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRVKTQQQDK+VQCYEPATMKYLGYFPALSRDEVKERVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGYFPALSRDEVKERVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVD+ISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDRISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTM LMQEEAFGPILPIMKFSTDDEAVKLANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMTLMQEEAFGPILPIMKFSTDDEAVKLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWP+IQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPFIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNT AKDNST+DDGSKRA+
Sbjct: 579 LKMLSEQNTAAKDNSTIDDGSKRAD 603

BLAST of HG10010404 vs. NCBI nr
Match: XP_022959514.1 (aldehyde dehydrogenase 22A1-like [Cucurbita moschata])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 555/565 (98.23%), Postives = 562/565 (99.47%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSRDEV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSRDEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNTPAKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTPAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. NCBI nr
Match: XP_023006368.1 (aldehyde dehydrogenase 22A1 [Cucurbita maxima])

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 553/565 (97.88%), Postives = 562/565 (99.47%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSRDEV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSRDEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGC+VFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAG+EGL
Sbjct: 459 FGLGCSVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGIEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNTPAKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTPAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. NCBI nr
Match: KAG6575111.1 (Aldehyde dehydrogenase 22A1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7013680.1 Aldehyde dehydrogenase 22A1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1115.9 bits (2885), Expect = 0.0e+00
Identity = 553/565 (97.88%), Postives = 562/565 (99.47%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSR+EV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSREEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRA+ITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRATITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNTPAKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTPAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. NCBI nr
Match: XP_023548872.1 (aldehyde dehydrogenase 22A1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 554/565 (98.05%), Postives = 561/565 (99.29%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSRDEV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSRDEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNT AKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTLAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. ExPASy Swiss-Prot
Match: Q0WSF1 (Aldehyde dehydrogenase 22A1 OS=Arabidopsis thaliana OX=3702 GN=ALDH22A1 PE=2 SV=2)

HSP 1 Score: 890.2 bits (2299), Expect = 1.2e-257
Identity = 421/546 (77.11%), Postives = 491/546 (89.93%), Query Frame = 0

Query: 4   GNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKAQKE 63
           G  T+ENS+IYIPPR ++QQ DK+VQCYEPATMKYLGYFPALS  EV+ERV  +RKAQK 
Sbjct: 42  GKDTEENSFIYIPPRGRSQQSDKKVQCYEPATMKYLGYFPALSPTEVEERVTLSRKAQKT 101

Query: 64  WAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWLLSE 123
           WA+SSFK RR  LRILLKYIIEHQELICE+SSR+TGKT+VDA++GE+M TCEKITWLLSE
Sbjct: 102 WAQSSFKLRRQFLRILLKYIIEHQELICEVSSRDTGKTMVDASLGEIMTTCEKITWLLSE 161

Query: 124 GEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVV 183
           GE+WLKPESRS GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+
Sbjct: 162 GERWLKPESRSSGRAMLHKVSRVEFHPLGVIGAIVPWNYPFHNIFNPMLAAVFSGNGIVI 221

Query: 184 KVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGSTGVG 243
           KVSEHASWSGCFY RIIQAALAAVGAPE+LVDVITGFAETGEALVSSVDKMIFVGST VG
Sbjct: 222 KVSEHASWSGCFYFRIIQAALAAVGAPENLVDVITGFAETGEALVSSVDKMIFVGSTAVG 281

Query: 244 RMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVH 303
           +MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R ++ SSG NC GAERFYVH
Sbjct: 282 KMIMRNAAETLTPVTLELGGKDAFIICEDADVSHVAQVAVRGTLQSSGQNCAGAERFYVH 341

Query: 304 KNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARG 363
           K+IY++F+ ++++ VK+++ GPPL G+YDMGAIC QE SE LQSLVNDALD+GA+I  RG
Sbjct: 342 KDIYTAFIGQVTKIVKSVSAGPPLTGRYDMGAICLQEHSEHLQSLVNDALDKGAEIAVRG 401

Query: 364 TFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGL 423
           +FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ L
Sbjct: 402 SFGHLGEDAVDQYFPPTVLINVNHNMKIMKEEAFGPIMPIMQFSTDEEVIKLANDSRYAL 461

Query: 424 GCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRAC 483
           GCAVFSGS+ RAK IASQI  G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRAC
Sbjct: 462 GCAVFSGSKHRAKQIASQIQCGVAAINDFASNYMCQSLPFGGVKDSGFGRFAGIEGLRAC 521

Query: 484 CLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNVLKM 543
           CLVK+VVEDR+WP I+TK PKP+ YPVA+NAFEFQ +LVE  YGLNIWDRLR+L++VLK 
Sbjct: 522 CLVKSVVEDRFWPLIKTKIPKPIQYPVAENAFEFQEALVETLYGLNIWDRLRSLIDVLKF 581

Query: 544 LSEQNT 550
           L++Q++
Sbjct: 582 LTDQSS 587

BLAST of HG10010404 vs. ExPASy Swiss-Prot
Match: P38694 (Putative aldehyde dehydrogenase-like protein YHR039C OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=MSC7 PE=1 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 6.8e-107
Identity = 216/539 (40.07%), Postives = 318/539 (59.00%), Query Frame = 0

Query: 28  VQCYEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQ 87
           +QC+ PAT +YLG FP+ +  ++ E V+ A KAQ  W  S F +R  +L  L  YI+ +Q
Sbjct: 112 IQCHCPATGQYLGSFPSKTEADIDEMVSKAGKAQSTWGNSDFSRRLRVLASLHDYILNNQ 171

Query: 88  ELICEISSRETGKTIVDANMGEVMATCEKITWLLSEGEKWLKPESRSCGRATL----HKK 147
           +LI  ++ R++GKT++DA+MGE++ T EKI W +  G++ L+P SR  G        +K 
Sbjct: 172 DLIARVACRDSGKTMLDASMGEILVTLEKIQWTIKHGQRALQP-SRRPGPTNFFMKWYKG 231

Query: 148 ARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQAA 207
           A + + PLGVI +IV WNYPFHN+  P++AA+F GN IVVK SE   WS  F+V +I+  
Sbjct: 232 AEIRYEPLGVISSIVSWNYPFHNLLGPIIAALFTGNAIVVKCSEQVVWSSEFFVELIRKC 291

Query: 208 LAAVGAPESLVDVI-----TGFAETGEALVS--SVDKMIFVGSTGVGRMIMKTAAETLIP 267
           L A      LV +      T   ++     S      + F+GS  V   I+K AA++L P
Sbjct: 292 LEACDEDPDLVQLCYCLPPTENDDSANYFTSHPGFKHITFIGSQPVAHYILKCAAKSLTP 351

Query: 268 VTLELGGKDAFIVCEDV-DLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKIS 327
           V +ELGGKDAFIV +   +LD + ++ +R +  SSG NC G ER  V K  Y   V  ++
Sbjct: 352 VVVELGGKDAFIVLDSAKNLDALSSIIMRGTFQSSGQNCIGIERVIVSKENYDDLVKILN 411

Query: 328 ERVKAITVGPPLAG-------KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHL 387
           +R   +T  P   G         DMGA+ +  + ++L++LV DA+ +GA+++  G+    
Sbjct: 412 DR---MTANPLRQGSDIDHLENVDMGAMISDNRFDELEALVKDAVAKGARLLQGGSRFKH 471

Query: 388 PEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVF 447
           P+     YF PT++VDV   MK+ Q E FGPIL +MK    D  V+LAN + FGLG +VF
Sbjct: 472 PKYPQGHYFQPTLLVDVTPEMKIAQNEVFGPILVMMKAKNTDHCVQLANSAPFGLGGSVF 531

Query: 448 SGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKA 507
               +    +A+ + +G+VAINDFAT Y+CQ LPFGG+  SG+G+F G EGL   C  K+
Sbjct: 532 GADIKECNYVANSLQTGNVAINDFATFYVCQ-LPFGGINGSGYGKFGGEEGLLGLCNAKS 591

Query: 508 VVEDRWWPYIQTKHPKPLTYPVADN--AFEFQMSLVEATYGLNIWDRLRALVNVLKMLS 546
           V  D   P++ T+ PKPL YP+ +N  A+ F  S +   Y  + W R+++L ++ K  S
Sbjct: 592 VCFDT-LPFVSTQIPKPLDYPIRNNAKAWNFVKSFIVGAYTNSTWQRIKSLFSLAKEAS 644

BLAST of HG10010404 vs. ExPASy Swiss-Prot
Match: Q9P7K9 (Putative aldehyde dehydrogenase-like protein C21C3 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC21C3.15c PE=3 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.8e-100
Identity = 188/502 (37.45%), Postives = 294/502 (58.57%), Query Frame = 0

Query: 30  CYEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQEL 89
           CY P     LG     ++ ++ + +  A +AQKEW  +SF +RR  L+ L + II +Q+ 
Sbjct: 7   CYCPGDGSLLGEVKLFNKSDIDQSIILAEEAQKEWKSTSFAERRNFLKALKENIIRNQDK 66

Query: 90  ICEISSRETGKTIVDANMGEVMATCEKITWLLSEGEKWLKPESRSCGRATLHKKARVEFH 149
             EI+ ++TGKT+VDA  GE++ T EKI W L+ GE+ L+P  R     T +K   V++ 
Sbjct: 67  YAEIACKDTGKTLVDAAFGEILVTLEKINWTLANGEQSLRPTKRPNSLLTSYKGGYVKYE 126

Query: 150 PLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQAALAAVGA 209
           PLGVI A+V WNYP HN   P+++A+FAGN IVVK SE  +WS   Y  ++++ L ++G 
Sbjct: 127 PLGVIAALVSWNYPLHNALGPIISALFAGNAIVVKGSELTAWSTHQYCEMVRSLLQSMGH 186

Query: 210 PESLVDVITGFAETGEALV--SSVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAF 269
              LV  IT   +  + L   S +  + F+GS  + +++  +AA+ L P+ LELGGKD  
Sbjct: 187 SPELVQCITCLPDVADHLTSHSGIKHITFIGSQPIAKLVAASAAKQLTPLCLELGGKDPC 246

Query: 270 IVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKISERVKAITVGPPL 329
           I+ +D  L+ ++++ +R    S+G NC G ER      +Y + + K+  R+  + +G   
Sbjct: 247 ILTDDHRLEEILSIVMRGVFQSAGQNCIGIERIIALDGVYDTIITKLYNRISTMRLGMYT 306

Query: 330 AGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNH 389
               DMGA+ +  + + L+SL+ DA+ +GA++V  G     P+     YF PT++VD  +
Sbjct: 307 QNDVDMGAMVSNNRFDHLESLIQDAVSKGARLVYGGHRFQHPKYPKGNYFLPTLLVDATN 366

Query: 390 TMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRRAKNIASQIHSGSV 449
            MK+ QEE F PI  + +  + + A+++AN + FGLG +VF   ++  +     + +G V
Sbjct: 367 EMKIAQEECFAPIALVFRAKSPEHALEIANGTEFGLGASVFGRDKQLCQYFTDNLETGMV 426

Query: 450 AINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLT 509
           A+NDF   YL Q +PFGG K+SG+GRFAG EGLR  C  KA+  DR +  I T  P  + 
Sbjct: 427 AVNDFGAFYLLQ-MPFGGCKKSGYGRFAGYEGLRGICNSKAIAYDR-FSAIHTGIPPAVD 486

Query: 510 YPVADN--AFEFQMSLVEATYG 528
           YP+ D+  A++F   L+   YG
Sbjct: 487 YPIPDSQKAWQFVRGLMGTVYG 506

BLAST of HG10010404 vs. ExPASy Swiss-Prot
Match: Q4VKV0 (4,4'-diapolycopene aldehyde oxidase OS=Methylomonas sp. OX=418 GN=ald PE=1 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 4.8e-68
Identity = 160/455 (35.16%), Postives = 243/455 (53.41%), Query Frame = 0

Query: 28  VQCYEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQ 87
           +    P   + LG+FP      +++++  +R+A   W +    +R   L  L K ++++ 
Sbjct: 4   IAAVSPLDGRLLGHFPVSKPALIQQQLTKSRRAALLWRELPVTERVKRLSPLKKQLLDNL 63

Query: 88  ELICEISSRETGKTIVDANMGEVMATCEKITWLLSEGEKWLKPESRSCGR-ATLHKKARV 147
           + +CE     TGK   +A +GE+    + + +      + L+  + S    A     AR+
Sbjct: 64  DRLCETIRLSTGKVRTEALLGEIYPVLDLLAYYQKRAPRILRTRAVSTSPFAFPAATARI 123

Query: 148 EFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQAALAA 207
           E  P GV+  I PWNYPFH    P+L A+ AGN +++K SE     G    ++I    A 
Sbjct: 124 ERRPYGVVAVISPWNYPFHLSVAPLLTALLAGNAVILKPSELCLPVG----QLIVDLFAT 183

Query: 208 VGAPESLVDVITGFAETGEALVSS-VDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKD 267
           +  P+ LV  + G  +TG  L+ +  D + F G    GR +M+ AA   IPV LELGGKD
Sbjct: 184 LDLPDGLVQWVIGDGQTGAELIDARPDLVFFTGGLQTGRAVMQRAARHPIPVMLELGGKD 243

Query: 268 AFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKISERVKAITVGP 327
             +V  D DL      AL  +  +SG  C   ER YV +  ++ F+  + + +  + VG 
Sbjct: 244 TMLVLADADLKRASAAALYGAFCNSGQVCVSVERLYVQQACFAEFLAMLLKGLSKLKVGH 303

Query: 328 PLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDV 387
              G  D+G + +  Q + +Q+   DA+ +GAK  A G    L +G V Q   P V+ DV
Sbjct: 304 DPHG--DVGVMTSARQIDIVQAHYEDAIAQGAK--ASGPL--LRDGNVVQ---PVVLWDV 363

Query: 388 NHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRRAKNIASQIHSG 447
           +H MK+M+EE FGP+LP+M FS + EA+KLANDS  GL  +++S    +A+ +A Q+  G
Sbjct: 364 HHGMKVMREETFGPLLPVMPFSDEAEAIKLANDSDLGLNASIWSQDIIKAERLAGQLDVG 423

Query: 448 SVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 481
           + AIND   N     LPFGGVK+SGFGR+ G EGL
Sbjct: 424 NWAINDVLKNVGHSGLPFGGVKQSGFGRYHGAEGL 445

BLAST of HG10010404 vs. ExPASy Swiss-Prot
Match: P94428 (Succinate-semialdehyde dehydrogenase [NADP(+)] OS=Bacillus subtilis (strain 168) OX=224308 GN=gabD PE=1 SV=1)

HSP 1 Score: 259.6 bits (662), Expect = 8.1e-68
Identity = 160/475 (33.68%), Postives = 253/475 (53.26%), Query Frame = 0

Query: 31  YEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELI 90
           Y PAT + +   P  S  EV+E +  + +A K W+K+S  +R  LL+   + I+EH+E +
Sbjct: 8   YNPATGEEIKTIPQQSATEVEEAIERSHQAFKTWSKTSANERTSLLKKWYELIVEHKEEL 67

Query: 91  CEISSRETGKTIVDANMGEVMATCEKITWLLSEGEKWLKPESRSCGRATLHKKARVEFHP 150
            ++ ++E GK   +A +GEV+     I W   E +   +   R+    T  K+  V   P
Sbjct: 68  ADLITKENGKPYQEA-VGEVLYGAGYIEWFAEEAK---RVYGRTVPAPTTGKRIVVTRQP 127

Query: 151 LGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQAALAAVGAP 210
           +G + AI PWN+P   I      A+ AG   ++K +     S     R+   A    G P
Sbjct: 128 VGPVAAITPWNFPNAMITRKAAPALAAGCTFIIKPAPDTPLSAYELARLAYEA----GIP 187

Query: 211 ESLVDVITGFA-ETGEALVSS--VDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAF 270
           + ++ V+ G   E G    SS  + K+ F GST VG+++MK +A+T+  V++ELGG    
Sbjct: 188 KDVLQVVIGDGEEIGNVFTSSPKIRKITFTGSTPVGKILMKNSADTVKHVSMELGGHAPL 247

Query: 271 IVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKISERVKAITVGPPL 330
           IV ED D+D  V  A+ +   ++G  C  A R  VH++I   F  K+SE+V  + VG  L
Sbjct: 248 IVDEDADIDLAVEQAMASKYRNAGQTCVCANRLIVHESIKDEFAAKLSEQVSKLKVGNGL 307

Query: 331 AGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNH 390
               ++G I  +   EK+ S ++DA+++GAK++A GT+    +     +  PTV+ DV+ 
Sbjct: 308 EEGVNVGPIINKRGFEKIVSQIDDAVEKGAKVIAGGTYDRNDDKGC-YFVNPTVLTDVDT 367

Query: 391 TMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRRAKNIASQIHSGSV 450
           +M +M EE FGP+ PI+ FS  DEA++LAND+ +GL    F+ + RR   I+  +  G +
Sbjct: 368 SMNIMHEETFGPVAPIVTFSDIDEAIQLANDTPYGLAAYFFTENYRRGIYISENLEYGII 427

Query: 451 AINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKH 503
             ND   + +    PFGG+KESG GR  G EG+               PY++TK+
Sbjct: 428 GWNDGGPSAV--QAPFGGMKESGIGREGGSEGIE--------------PYLETKY 457

BLAST of HG10010404 vs. ExPASy TrEMBL
Match: A0A6J1H899 (aldehyde dehydrogenase 22A1-like OS=Cucurbita moschata OX=3662 GN=LOC111460471 PE=3 SV=1)

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 555/565 (98.23%), Postives = 562/565 (99.47%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSRDEV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSRDEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNTPAKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTPAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. ExPASy TrEMBL
Match: A0A6J1L4Q4 (aldehyde dehydrogenase 22A1 OS=Cucurbita maxima OX=3661 GN=LOC111499119 PE=3 SV=1)

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 553/565 (97.88%), Postives = 562/565 (99.47%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPRV+TQQQDKRVQCYEPATMKYLGYFPALSRDEV +RVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRVRTQQQDKRVQCYEPATMKYLGYFPALSRDEVNDRVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSR+TGKTIVDANMGE+MATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDANMGEIMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAV+LANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVRLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGC+VFSGSQRRAKNIASQI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAG+EGL
Sbjct: 459 FGLGCSVFSGSQRRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGIEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LKMLSEQNTPAKDNS VDDGSKRA+
Sbjct: 579 LKMLSEQNTPAKDNSAVDDGSKRAD 603

BLAST of HG10010404 vs. ExPASy TrEMBL
Match: A0A6J1CA89 (aldehyde dehydrogenase 22A1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111009819 PE=3 SV=1)

HSP 1 Score: 1085.5 bits (2806), Expect = 0.0e+00
Identity = 538/565 (95.22%), Postives = 552/565 (97.70%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENS+IYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASAR A
Sbjct: 39  MDDGNQTQENSFIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARTA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWA+SSFKQRRLLLRILLKYI+EHQELICEISSRETGKTIVDANMGEVMATCEKITWL
Sbjct: 99  QKEWAQSSFKQRRLLLRILLKYIVEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPE RSCGRAT+HKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPEMRSCGRATIHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHK IYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALD GAKIV
Sbjct: 339 YVHKTIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDGGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPI+PIMKFSTDDE VKLANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPIMPIMKFSTDDEVVKLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSG+Q+RAKNIA QI SGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGNQQRAKNIALQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWPYIQTK+PKPLTYPVADNAFEFQMS+VEATYGLNIWDRLRALVNV
Sbjct: 519 RACCLVKAVVEDRWWPYIQTKYPKPLTYPVADNAFEFQMSMVEATYGLNIWDRLRALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           LK+LS+QNTP KD+S     +KRA+
Sbjct: 579 LKILSDQNTPPKDDSPTAGATKRAD 603

BLAST of HG10010404 vs. ExPASy TrEMBL
Match: A0A0A0KCW3 (Aldedh domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G307390 PE=3 SV=1)

HSP 1 Score: 1067.4 bits (2759), Expect = 2.1e-308
Identity = 530/563 (94.14%), Postives = 544/563 (96.63%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           MDDGNQTQENSYIYIPPR KTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA
Sbjct: 39  MDDGNQTQENSYIYIPPRTKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           QKEWAKSSFKQRRLLLRILLKYIIE+QELICEISSR+TGKTIVDANMGEVMATCEKITWL
Sbjct: 99  QKEWAKSSFKQRRLLLRILLKYIIENQELICEISSRDTGKTIVDANMGEVMATCEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGV+GAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVVGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVKVSEHASWSGCFYVRII AALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST
Sbjct: 219 IVVKVSEHASWSGCFYVRIIHAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMK+AAETLIPVTLELGGKDAFIVCED+DLDHVV++ALRASITSSGHNCTGAERF
Sbjct: 279 GVGRMIMKSAAETLIPVTLELGGKDAFIVCEDIDLDHVVDIALRASITSSGHNCTGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVK ITVGPP AGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKDITVGPPSAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTD EAVKLANDSR
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDGEAVKLANDSR 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGCAVFSGSQ RA+NIA QIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL
Sbjct: 459 FGLGCAVFSGSQDRARNIAWQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACCLVKAVVEDRWWP++ TKHPK LTYPVADNAFEFQMSLVEATYGLNIW RL ALVNV
Sbjct: 519 RACCLVKAVVEDRWWPFLYTKHPKLLTYPVADNAFEFQMSLVEATYGLNIWHRLTALVNV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKR 564
           LKMLSE NT  ++N  ++   KR
Sbjct: 579 LKMLSEHNTLTRNNPNINGRRKR 601

BLAST of HG10010404 vs. ExPASy TrEMBL
Match: A0A6J1EWW0 (aldehyde dehydrogenase 22A1-like OS=Cucurbita moschata OX=3662 GN=LOC111436875 PE=3 SV=1)

HSP 1 Score: 1063.1 bits (2748), Expect = 3.9e-307
Identity = 519/565 (91.86%), Postives = 546/565 (96.64%), Query Frame = 0

Query: 1   MDDGNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKA 60
           +DDGNQ  ENSYIYIPPRVKTQQQDK+VQCYEPA MKYLG+FPAL+RDEVKERVASARKA
Sbjct: 39  VDDGNQAPENSYIYIPPRVKTQQQDKKVQCYEPANMKYLGFFPALTRDEVKERVASARKA 98

Query: 61  QKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWL 120
           Q EWAKSSFKQRRLLLRI+LKYIIEHQ+LICEISSR+TGKTIVDA++GEVMA CEKITWL
Sbjct: 99  QNEWAKSSFKQRRLLLRIILKYIIEHQKLICEISSRDTGKTIVDASIGEVMAICEKITWL 158

Query: 121 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 180
           LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG
Sbjct: 159 LSEGEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNG 218

Query: 181 IVVKVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGST 240
           IVVK+SEHASWSGCFY+RIIQAALAAVGAPESLVDVITGF+ETGEALVSSVDKMIF+GST
Sbjct: 219 IVVKISEHASWSGCFYIRIIQAALAAVGAPESLVDVITGFSETGEALVSSVDKMIFIGST 278

Query: 241 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERF 300
           GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNC GAERF
Sbjct: 279 GVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCNGAERF 338

Query: 301 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 360
           YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV
Sbjct: 339 YVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIV 398

Query: 361 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSR 420
           ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDE VKLANDS+
Sbjct: 399 ARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEVVKLANDSK 458

Query: 421 FGLGCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGL 480
           FGLGC VFSGSQ+RA+ IA QI  GSVAINDFATNY CQSLPFGGVKESGFGRF+GVEGL
Sbjct: 459 FGLGCGVFSGSQQRARTIALQIRCGSVAINDFATNYFCQSLPFGGVKESGFGRFSGVEGL 518

Query: 481 RACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNV 540
           RACC+VKAVVEDRWWPYIQTK+PKPLTYPVADNAFEFQMS+VEATYGLNIWDRLRAL+NV
Sbjct: 519 RACCIVKAVVEDRWWPYIQTKYPKPLTYPVADNAFEFQMSVVEATYGLNIWDRLRALINV 578

Query: 541 LKMLSEQNTPAKDNSTVDDGSKRAN 566
           +KMLS++NT  K NS + DG K+A+
Sbjct: 579 MKMLSQENTSGKHNSVIGDGDKKAD 603

BLAST of HG10010404 vs. TAIR 10
Match: AT3G66658.2 (aldehyde dehydrogenase 22A1 )

HSP 1 Score: 890.2 bits (2299), Expect = 8.7e-259
Identity = 421/546 (77.11%), Postives = 491/546 (89.93%), Query Frame = 0

Query: 4   GNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKAQKE 63
           G  T+ENS+IYIPPR ++QQ DK+VQCYEPATMKYLGYFPALS  EV+ERV  +RKAQK 
Sbjct: 42  GKDTEENSFIYIPPRGRSQQSDKKVQCYEPATMKYLGYFPALSPTEVEERVTLSRKAQKT 101

Query: 64  WAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWLLSE 123
           WA+SSFK RR  LRILLKYIIEHQELICE+SSR+TGKT+VDA++GE+M TCEKITWLLSE
Sbjct: 102 WAQSSFKLRRQFLRILLKYIIEHQELICEVSSRDTGKTMVDASLGEIMTTCEKITWLLSE 161

Query: 124 GEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVV 183
           GE+WLKPESRS GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+
Sbjct: 162 GERWLKPESRSSGRAMLHKVSRVEFHPLGVIGAIVPWNYPFHNIFNPMLAAVFSGNGIVI 221

Query: 184 KVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGSTGVG 243
           KVSEHASWSGCFY RIIQAALAAVGAPE+LVDVITGFAETGEALVSSVDKMIFVGST VG
Sbjct: 222 KVSEHASWSGCFYFRIIQAALAAVGAPENLVDVITGFAETGEALVSSVDKMIFVGSTAVG 281

Query: 244 RMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVH 303
           +MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R ++ SSG NC GAERFYVH
Sbjct: 282 KMIMRNAAETLTPVTLELGGKDAFIICEDADVSHVAQVAVRGTLQSSGQNCAGAERFYVH 341

Query: 304 KNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARG 363
           K+IY++F+ ++++ VK+++ GPPL G+YDMGAIC QE SE LQSLVNDALD+GA+I  RG
Sbjct: 342 KDIYTAFIGQVTKIVKSVSAGPPLTGRYDMGAICLQEHSEHLQSLVNDALDKGAEIAVRG 401

Query: 364 TFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGL 423
           +FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ L
Sbjct: 402 SFGHLGEDAVDQYFPPTVLINVNHNMKIMKEEAFGPIMPIMQFSTDEEVIKLANDSRYAL 461

Query: 424 GCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRAC 483
           GCAVFSGS+ RAK IASQI  G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRAC
Sbjct: 462 GCAVFSGSKHRAKQIASQIQCGVAAINDFASNYMCQSLPFGGVKDSGFGRFAGIEGLRAC 521

Query: 484 CLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLRALVNVLKM 543
           CLVK+VVEDR+WP I+TK PKP+ YPVA+NAFEFQ +LVE  YGLNIWDRLR+L++VLK 
Sbjct: 522 CLVKSVVEDRFWPLIKTKIPKPIQYPVAENAFEFQEALVETLYGLNIWDRLRSLIDVLKF 581

Query: 544 LSEQNT 550
           L++Q++
Sbjct: 582 LTDQSS 587

BLAST of HG10010404 vs. TAIR 10
Match: AT3G66658.1 (aldehyde dehydrogenase 22A1 )

HSP 1 Score: 827.8 bits (2137), Expect = 5.3e-240
Identity = 392/503 (77.93%), Postives = 453/503 (90.06%), Query Frame = 0

Query: 4   GNQTQENSYIYIPPRVKTQQQDKRVQCYEPATMKYLGYFPALSRDEVKERVASARKAQKE 63
           G  T+ENS+IYIPPR ++QQ DK+VQCYEPATMKYLGYFPALS  EV+ERV  +RKAQK 
Sbjct: 42  GKDTEENSFIYIPPRGRSQQSDKKVQCYEPATMKYLGYFPALSPTEVEERVTLSRKAQKT 101

Query: 64  WAKSSFKQRRLLLRILLKYIIEHQELICEISSRETGKTIVDANMGEVMATCEKITWLLSE 123
           WA+SSFK RR  LRILLKYIIEHQELICE+SSR+TGKT+VDA++GE+M TCEKITWLLSE
Sbjct: 102 WAQSSFKLRRQFLRILLKYIIEHQELICEVSSRDTGKTMVDASLGEIMTTCEKITWLLSE 161

Query: 124 GEKWLKPESRSCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVV 183
           GE+WLKPESRS GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+
Sbjct: 162 GERWLKPESRSSGRAMLHKVSRVEFHPLGVIGAIVPWNYPFHNIFNPMLAAVFSGNGIVI 221

Query: 184 KVSEHASWSGCFYVRIIQAALAAVGAPESLVDVITGFAETGEALVSSVDKMIFVGSTGVG 243
           KVSEHASWSGCFY RIIQAALAAVGAPE+LVDVITGFAETGEALVSSVDKMIFVGST VG
Sbjct: 222 KVSEHASWSGCFYFRIIQAALAAVGAPENLVDVITGFAETGEALVSSVDKMIFVGSTAVG 281

Query: 244 RMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVH 303
           +MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R ++ SSG NC GAERFYVH
Sbjct: 282 KMIMRNAAETLTPVTLELGGKDAFIICEDADVSHVAQVAVRGTLQSSGQNCAGAERFYVH 341

Query: 304 KNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARG 363
           K+IY++F+ ++++ VK+++ GPPL G+YDMGAIC QE SE LQSLVNDALD+GA+I  RG
Sbjct: 342 KDIYTAFIGQVTKIVKSVSAGPPLTGRYDMGAICLQEHSEHLQSLVNDALDKGAEIAVRG 401

Query: 364 TFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGL 423
           +FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ L
Sbjct: 402 SFGHLGEDAVDQYFPPTVLINVNHNMKIMKEEAFGPIMPIMQFSTDEEVIKLANDSRYAL 461

Query: 424 GCAVFSGSQRRAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRAC 483
           GCAVFSGS+ RAK IASQI  G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRAC
Sbjct: 462 GCAVFSGSKHRAKQIASQIQCGVAAINDFASNYMCQSLPFGGVKDSGFGRFAGIEGLRAC 521

Query: 484 CLVKAVVEDRWWPYIQTKHPKPL 507
           CLVK+VVEDR+WP I+TK PKP+
Sbjct: 522 CLVKSVVEDRFWPLIKTKIPKPI 544

BLAST of HG10010404 vs. TAIR 10
Match: AT1G74920.1 (aldehyde dehydrogenase 10A8 )

HSP 1 Score: 235.7 bits (600), Expect = 8.9e-62
Identity = 168/488 (34.43%), Postives = 255/488 (52.25%), Query Frame = 0

Query: 26  KRVQCYEPATMKYLGYFPALSRDEVKERVASARKA-----QKEWAKSSFKQRRLLLRILL 85
           KR+    PAT + +G  PA + ++V   V +AR+A      K+WAK+    R   LR + 
Sbjct: 23  KRIPIVNPATEEVIGDIPAATTEDVDVAVNAARRALSRNKGKDWAKAPGAVRAKYLRAIA 82

Query: 86  KYIIEHQELICEISSRETGKTIVDA--NMGEVMATCEKITWLLSEGEKWLKPESRSCGRA 145
             + E +  + ++ + + GK + +A  +M +V A C +    L+EG   L  + ++    
Sbjct: 83  AKVNERKTDLAKLEALDCGKPLDEAVWDMDDV-AGCFEFYADLAEG---LDAKQKAPVSL 142

Query: 146 TLHK-KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYV 205
            +   K+ V   PLGV+G I PWNYP       V  ++ AG   ++K SE AS + C  +
Sbjct: 143 PMESFKSYVLKQPLGVVGLITPWNYPLLMAVWKVAPSLAAGCTAILKPSELASVT-CLEL 202

Query: 206 RIIQAALAAVGAPESLVDVITGF-AETGEALVS--SVDKMIFVGSTGVGRMIMKTAAETL 265
             I      VG P  +++V+TGF +E G  L S   VDK+ F GS   G  +M  AA+ +
Sbjct: 203 ADI---CREVGLPPGVLNVLTGFGSEAGAPLASHPGVDKIAFTGSFATGSKVMTAAAQLV 262

Query: 266 IPVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKI 325
            PV++ELGGK   IV +DVDLD     AL     ++G  C+   R  VH++I S F++K+
Sbjct: 263 KPVSMELGGKSPLIVFDDVDLDKAAEWALFGCFWTNGQICSATSRLLVHESIASEFIEKL 322

Query: 326 SERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAV 385
            +  K I +  P+     +G + ++ Q EK+   ++ A   GA I+  G+   HL +G  
Sbjct: 323 VKWSKNIKISDPMEEGCRLGPVVSKGQYEKILKFISTAKSEGATILHGGSRPEHLEKGF- 382

Query: 386 DQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQR 445
             +  PT+I DV  +M++ +EE FGP+L +  F+++DEA++LANDS +GLG AV S    
Sbjct: 383 --FIEPTIITDVTTSMQIWREEVFGPVLCVKTFASEDEAIELANDSHYGLGAAVISNDTE 442

Query: 446 RAKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV--- 498
           R   I+    +G V IN   +       P+GGVK SGFGR  G  GL     VK V    
Sbjct: 443 RCDRISEAFEAGIVWIN--CSQPCFTQAPWGGVKRSGFGRELGEWGLDNYLSVKQVTLYT 497

BLAST of HG10010404 vs. TAIR 10
Match: AT3G48170.1 (aldehyde dehydrogenase 10A9 )

HSP 1 Score: 231.9 bits (590), Expect = 1.3e-60
Identity = 161/477 (33.75%), Postives = 249/477 (52.20%), Query Frame = 0

Query: 26  KRVQCYEPATMKYLGYFPALSRDEVKERVASARKA-----QKEWAKSSFKQRRLLLRILL 85
           K +    PAT   +GY PA + ++V+  V +ARKA      K+WA+++   R   LR + 
Sbjct: 23  KTLPVVNPATEDIIGYIPAATSEDVELAVEAARKAFTRNNGKDWARATGAVRAKYLRAIA 82

Query: 86  KYIIEHQELICEISSRETGKTIVDA--NMGEVMATCEKITWLLSEGEKWLKPESRSCGRA 145
             +IE +  +  + + + GK + +A  +M +V A C +    L+EG    +    S    
Sbjct: 83  AKVIERKSELANLEAIDCGKPLDEAAWDMDDV-AGCFEYYADLAEGLDAKQKTPLSLPMD 142

Query: 146 TLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVR 205
           T   K  +   P+GV+G I PWNYP       V  ++ AG   ++K SE AS + C  + 
Sbjct: 143 TF--KGYILKEPIGVVGMITPWNYPLLMAVWKVAPSLAAGCTAILKPSELASLT-CLELA 202

Query: 206 IIQAALAAVGAPESLVDVITGF-AETGEALVS--SVDKMIFVGSTGVGRMIMKTAAETLI 265
            I      VG P  +++++TG   E G  L S   VDK++F GST  G  IM +AA+ + 
Sbjct: 203 DI---CREVGLPPGVLNILTGLGTEAGAPLASHPHVDKIVFTGSTTTGSSIMTSAAKLVK 262

Query: 266 PVTLELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKIS 325
           PV+LELGGK   IV +DVD+D  V   +     ++G  C+   R  VH+ I   F+DK+ 
Sbjct: 263 PVSLELGGKSPIIVFDDVDIDKAVEWTMFGCFWTNGQICSATSRLLVHERIADEFLDKLV 322

Query: 326 ERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQ 385
           +  K I +  P      +G + ++ Q E++   V++A + GA ++  G     PE     
Sbjct: 323 KWTKNIKISDPFEEGCRLGPVVSKGQYERVLKFVSNARNEGATVLCGGV---RPEHLKKG 382

Query: 386 YF-PPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRR 445
           YF  P ++ +V  +M++ +EE FGP L +  FST+DEA++LANDS++GL  AV S    R
Sbjct: 383 YFVEPAIVSNVTTSMEIWREEVFGPALCVKTFSTEDEAIQLANDSQYGLAGAVLSNDLER 442

Query: 446 AKNIASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVE 492
              ++    +G V +N  +    CQ+ P+GG K SGFGR  G  GL     VK V +
Sbjct: 443 CDRVSKAFQAGIVWVN-CSQPCFCQA-PWGGTKRSGFGRELGEWGLENYLSVKQVTQ 487

BLAST of HG10010404 vs. TAIR 10
Match: AT1G74920.2 (aldehyde dehydrogenase 10A8 )

HSP 1 Score: 224.9 bits (572), Expect = 1.6e-58
Identity = 163/483 (33.75%), Postives = 247/483 (51.14%), Query Frame = 0

Query: 26  KRVQCYEPATMKYLGYFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIE 85
           KR+    PAT + +     +       R A +R   K+WAK+    R   LR +   + E
Sbjct: 23  KRIPIVNPATEEVIATTEDVDVAVNAARRALSRNKGKDWAKAPGAVRAKYLRAIAAKVNE 82

Query: 86  HQELICEISSRETGKTIVDA--NMGEVMATCEKITWLLSEGEKWLKPESRSCGRATLHK- 145
            +  + ++ + + GK + +A  +M +V A C +    L+EG   L  + ++     +   
Sbjct: 83  RKTDLAKLEALDCGKPLDEAVWDMDDV-AGCFEFYADLAEG---LDAKQKAPVSLPMESF 142

Query: 146 KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYVRIIQA 205
           K+ V   PLGV+G I PWNYP       V  ++ AG   ++K SE AS + C  +  I  
Sbjct: 143 KSYVLKQPLGVVGLITPWNYPLLMAVWKVAPSLAAGCTAILKPSELASVT-CLELADI-- 202

Query: 206 ALAAVGAPESLVDVITGF-AETGEALVS--SVDKMIFVGSTGVGRMIMKTAAETLIPVTL 265
               VG P  +++V+TGF +E G  L S   VDK+ F GS   G  +M  AA+ + PV++
Sbjct: 203 -CREVGLPPGVLNVLTGFGSEAGAPLASHPGVDKIAFTGSFATGSKVMTAAAQLVKPVSM 262

Query: 266 ELGGKDAFIVCEDVDLDHVVNVALRASITSSGHNCTGAERFYVHKNIYSSFVDKISERVK 325
           ELGGK   IV +DVDLD     AL     ++G  C+   R  VH++I S F++K+ +  K
Sbjct: 263 ELGGKSPLIVFDDVDLDKAAEWALFGCFWTNGQICSATSRLLVHESIASEFIEKLVKWSK 322

Query: 326 AITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAVDQYFP 385
            I +  P+     +G + ++ Q EK+   ++ A   GA I+  G+   HL +G    +  
Sbjct: 323 NIKISDPMEEGCRLGPVVSKGQYEKILKFISTAKSEGATILHGGSRPEHLEKGF---FIE 382

Query: 386 PTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSQRRAKNI 445
           PT+I DV  +M++ +EE FGP+L +  F+++DEA++LANDS +GLG AV S    R   I
Sbjct: 383 PTIITDVTTSMQIWREEVFGPVLCVKTFASEDEAIELANDSHYGLGAAVISNDTERCDRI 442

Query: 446 ASQIHSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV----EDRW 498
           +    +G V IN   +       P+GGVK SGFGR  G  GL     VK V      D W
Sbjct: 443 SEAFEAGIVWIN--CSQPCFTQAPWGGVKRSGFGRELGEWGLDNYLSVKQVTLYTSNDPW 492

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038875847.10.0e+0098.76aldehyde dehydrogenase 22A1 [Benincasa hispida][more]
XP_022959514.10.0e+0098.23aldehyde dehydrogenase 22A1-like [Cucurbita moschata][more]
XP_023006368.10.0e+0097.88aldehyde dehydrogenase 22A1 [Cucurbita maxima][more]
KAG6575111.10.0e+0097.88Aldehyde dehydrogenase 22A1, partial [Cucurbita argyrosperma subsp. sororia] >KA... [more]
XP_023548872.10.0e+0098.05aldehyde dehydrogenase 22A1-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q0WSF11.2e-25777.11Aldehyde dehydrogenase 22A1 OS=Arabidopsis thaliana OX=3702 GN=ALDH22A1 PE=2 SV=... [more]
P386946.8e-10740.07Putative aldehyde dehydrogenase-like protein YHR039C OS=Saccharomyces cerevisiae... [more]
Q9P7K92.8e-10037.45Putative aldehyde dehydrogenase-like protein C21C3 OS=Schizosaccharomyces pombe ... [more]
Q4VKV04.8e-6835.164,4'-diapolycopene aldehyde oxidase OS=Methylomonas sp. OX=418 GN=ald PE=1 SV=1[more]
P944288.1e-6833.68Succinate-semialdehyde dehydrogenase [NADP(+)] OS=Bacillus subtilis (strain 168)... [more]
Match NameE-valueIdentityDescription
A0A6J1H8990.0e+0098.23aldehyde dehydrogenase 22A1-like OS=Cucurbita moschata OX=3662 GN=LOC111460471 P... [more]
A0A6J1L4Q40.0e+0097.88aldehyde dehydrogenase 22A1 OS=Cucurbita maxima OX=3661 GN=LOC111499119 PE=3 SV=... [more]
A0A6J1CA890.0e+0095.22aldehyde dehydrogenase 22A1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC1110... [more]
A0A0A0KCW32.1e-30894.14Aldedh domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G307390 PE=3... [more]
A0A6J1EWW03.9e-30791.86aldehyde dehydrogenase 22A1-like OS=Cucurbita moschata OX=3662 GN=LOC111436875 P... [more]
Match NameE-valueIdentityDescription
AT3G66658.28.7e-25977.11aldehyde dehydrogenase 22A1 [more]
AT3G66658.15.3e-24077.93aldehyde dehydrogenase 22A1 [more]
AT1G74920.18.9e-6234.43aldehyde dehydrogenase 10A8 [more]
AT3G48170.11.3e-6033.75aldehyde dehydrogenase 10A9 [more]
AT1G74920.21.6e-5833.75aldehyde dehydrogenase 10A8 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR016163Aldehyde dehydrogenase, C-terminalGENE3D3.40.309.10Aldehyde Dehydrogenase; Chain A, domain 2coord: 267..462
e-value: 3.8E-142
score: 476.0
IPR016162Aldehyde dehydrogenase, N-terminalGENE3D3.40.605.10Aldehyde Dehydrogenase; Chain A, domain 1coord: 27..487
e-value: 3.8E-142
score: 476.0
IPR015590Aldehyde dehydrogenase domainPFAMPF00171Aldedhcoord: 23..489
e-value: 3.3E-132
score: 441.3
NoneNo IPR availablePIRSRPIRSR036492-1PIRSR036492-1coord: 33..493
e-value: 1.2E-102
score: 341.8
NoneNo IPR availablePANTHERPTHR11699:SF25ALDEHYDE DEHYDROGENASE-LIKE PROTEIN YHR039C-RELATEDcoord: 11..524
NoneNo IPR availablePANTHERPTHR11699ALDEHYDE DEHYDROGENASE-RELATEDcoord: 11..524
NoneNo IPR availableCDDcd07098ALDH_F15-22coord: 31..493
e-value: 0.0
score: 749.513
IPR029510Aldehyde dehydrogenase, glutamic acid active sitePROSITEPS00687ALDEHYDE_DEHYDR_GLUcoord: 259..266
IPR016161Aldehyde/histidinol dehydrogenaseSUPERFAMILY53720ALDH-likecoord: 13..491

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10010404.1HG10010404.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004029 aldehyde dehydrogenase (NAD+) activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor