HG10001980 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001980
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Aldehyde dehydrogenase
Location: Chr11: 2295711 .. 2298001 (-)
RNA-Seq Expression: HG10001980
Synteny: HG10001980
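
The location field packs chromosome, start, end and strand into one string. A minimal sketch of how it could be parsed is given here; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'Chr11: 2295711 .. 2298001 (-)' into its parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr11: 2295711 .. 2298001 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the gene spans 2291 bp on the minus strand of Chr11.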
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCCAATTTGGAGGTGCTGAGGGAAAGCTTCAGAAATGGAAGAACAAGGAGTTTTGAATGGAGGAAAACCCAGCTGAGTTCGCTGATTCAATTCATCCATGAGAAAGAAAACGCCATTTTTGAAGCCCTCTATCAAGATCTTGGCAAGCATCCTGCGGAAAGTTTTCGAGATGAGGTGATTTTTTAATCTATTACACCAATTTGGAAATTAGATAACAGCTCCCTTTCTATGGAAACAGAGCCTAATTTTGCTACTGCTTTTAAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTCTGCTCTTTACACAAATGGATGGCCCCTAAAAAGGTTGCTTTTCTTCTTTTTCCTCTGCTTTTTTTGTTCTTTCTTTCTTTCTTCTTCTTCTTCTCTCTTATATGCTTGTTTTGTTGTGAGGTAGAAATCTGTGCCATTACTGTTCTTCCCAGCAAAAGGAGAAGTTTTGTCTGAGCCATTTGGTCTAGTCCTCATAATTTCATCTTGGAATTTCCCCCTTTGTGAGTATATTTCATCTTTAGCTCAAGTAATATTTTTTAACCAAAATGGTCCCATCATCTGTTTTTGAGATTTCAGATCAATCAAAGTCATACTGAACTATTTTATAAGTTTTAAAGTAGGGCATTACAACCTTGGTGATTGAATTTTCCATTAAAAAATTGAAGCATTCTGATGCTTCTTCTACTTGTCTTTGCAGCTTTAGCATTGGATCCGTTAATTGGAGCGAAATCTGCAGGCAATACAGCGGTTATAAAACCGTCGGAATATGCTCCAGTTTTCTCCTCTTTTCTTTCTGCAACAATCCCTCTTTACCTTGACAATAAAGCCATCAAAGTTGTGGAGGGTGGAGCTGATGTTAGTGAACAACTTCTACAGTACAAGTGGGATAAGATCTTCTTCACTGGTATGAACTTAGCTTGTTTAACAAGGGCGATCGTTCGATATGTTCGTGTTCTATGATTTGATCGTTATGAGTGATCTTTTCAGGGAGCCCAAGAGTAGCTAGGATTGTGATGTCAGCAGCTGCAAAGCATCTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGTCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAACATGAAGGTTTTTTAATTTAAACGTAACGTCCTTAACGGATCAAAGCTCGTTTTGTAATTGTTTCTCACCTTGAATTCATAGGTAGCAGCCAAGAGAATCGTTGGTGGAAAATGGGGGCCGTGCGCCGGGCAGGCATGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGGTAAAAACAAGTTCACTCTAATAACTTTGTTCTTATTTGCTGCAGGCCTACTGATTTTCTAGCTGATCTTTGTGACCTTCCTCTTTCAGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGCATAGCCCGTATTGTTAATGAGAAAAATGTTGAAAGAATAAGCAATCTTCTCAAAGACCCTAAGGTTGCTGCTTCCATTGTCCATGGTGGCTCTATTGACAAAGAAAAACTGTAAGCTAAGCTATCCAATGAGGAATTTTTTTGCTGCTATTCACTGTTCTATTGAATGCTTTGCATTGATCTTGTAGCTTCATTGAGCCAACAATATTGTTGAATCCTCCAATCGACGCCGATATCATGACCGAAGAAATCTTCGGTCCCCTGCTACCGATAATCACAGTAAGAATTTAATGTATAACCTCTCAATAATTTCACAGAACCATAAAATCTCTGATCATCAGTGTTCTTGAAACAGTTGAACAAAATTGAAGAAAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTCTATGCCTTCACGGGAGACGAAACTCTCAAGAAACGGATTTTATTCGAAACGTCATCGGGAAGTGTCACATTCAATGATAGCGTGGTTCAGGTATTGTTGTTCTACCCACCTACTC
TCTCAAAGAAAGCTTTGCACTAAGCAGAAGAAATGAAATGGATTGTTCTCTTTCCAGTTTGTGTGTGATTCACTACCATTCGGCGGTGTTGGTCAGAGTGGTTTCGGGAGTTACCATGGCAAGTATTCATTTGATACGTTCAGCCATGAAAAAGCAGTAATGCAGAGAAGCTTTTTCATGGAAATCGAGTCACGATATCCACCATGGAACGATTTCAAGCTCAAGTTCTTTAGATTGGGGTATCGATTCGACTATTTCGGGCTGGTACTGCTCCTTTTGGGGTTGAAGTAG

mRNA sequence

ATGGAAGCCAATTTGGAGGTGCTGAGGGAAAGCTTCAGAAATGGAAGAACAAGGAGTTTTGAATGGAGGAAAACCCAGCTGAGTTCGCTGATTCAATTCATCCATGAGAAAGAAAACGCCATTTTTGAAGCCCTCTATCAAGATCTTGGCAAGCATCCTGCGGAAAGTTTTCGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTCTGCTCTTTACACAAATGGATGGCCCCTAAAAAGAAATCTGTGCCATTACTGTTCTTCCCAGCAAAAGGAGAAGTTTTGTCTGAGCCATTTGGTCTAGTCCTCATAATTTCATCTTGGAATTTCCCCCTTTCTTTAGCATTGGATCCGTTAATTGGAGCGAAATCTGCAGGCAATACAGCGGTTATAAAACCGTCGGAATATGCTCCAGTTTTCTCCTCTTTTCTTTCTGCAACAATCCCTCTTTACCTTGACAATAAAGCCATCAAAGTTGTGGAGGGTGGAGCTGATGTTAGTGAACAACTTCTACAGTACAAGTGGGATAAGATCTTCTTCACTGGGAGCCCAAGAGTAGCTAGGATTGTGATGTCAGCAGCTGCAAAGCATCTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGTCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAACATGAAGGTAGCAGCCAAGAGAATCGTTGGTGGAAAATGGGGGCCGTGCGCCGGGCAGGCATGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGCATAGCCCGTATTGTTAATGAGAAAAATGTTGAAAGAATAAGCAATCTTCTCAAAGACCCTAAGGTTGCTGCTTCCATTGTCCATGGTGGCTCTATTGACAAAGAAAAACTCTTCATTGAGCCAACAATATTGTTGAATCCTCCAATCGACGCCGATATCATGACCGAAGAAATCTTCGGTCCCCTGCTACCGATAATCACATTGAACAAAATTGAAGAAAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTCTATGCCTTCACGGGAGACGAAACTCTCAAGAAACGGATTTTATTCGAAACGTCATCGGGAAGTGTCACATTCAATGATAGCGTGGTTCAGTTTGTGTGTGATTCACTACCATTCGGCGGTGTTGGTCAGAGTGGTTTCGGGAGTTACCATGGCAAGTATTCATTTGATACGTTCAGCCATGAAAAAGCAGTAATGCAGAGAAGCTTTTTCATGGAAATCGAGTCACGATATCCACCATGGAACGATTTCAAGCTCAAGTTCTTTAGATTGGGGTATCGATTCGACTATTTCGGGCTGGTACTGCTCCTTTTGGGGTTGAAGTAG

Coding sequence (CDS)

ATGGAAGCCAATTTGGAGGTGCTGAGGGAAAGCTTCAGAAATGGAAGAACAAGGAGTTTTGAATGGAGGAAAACCCAGCTGAGTTCGCTGATTCAATTCATCCATGAGAAAGAAAACGCCATTTTTGAAGCCCTCTATCAAGATCTTGGCAAGCATCCTGCGGAAAGTTTTCGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTCTGCTCTTTACACAAATGGATGGCCCCTAAAAAGAAATCTGTGCCATTACTGTTCTTCCCAGCAAAAGGAGAAGTTTTGTCTGAGCCATTTGGTCTAGTCCTCATAATTTCATCTTGGAATTTCCCCCTTTCTTTAGCATTGGATCCGTTAATTGGAGCGAAATCTGCAGGCAATACAGCGGTTATAAAACCGTCGGAATATGCTCCAGTTTTCTCCTCTTTTCTTTCTGCAACAATCCCTCTTTACCTTGACAATAAAGCCATCAAAGTTGTGGAGGGTGGAGCTGATGTTAGTGAACAACTTCTACAGTACAAGTGGGATAAGATCTTCTTCACTGGGAGCCCAAGAGTAGCTAGGATTGTGATGTCAGCAGCTGCAAAGCATCTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGTCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAACATGAAGGTAGCAGCCAAGAGAATCGTTGGTGGAAAATGGGGGCCGTGCGCCGGGCAGGCATGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGCATAGCCCGTATTGTTAATGAGAAAAATGTTGAAAGAATAAGCAATCTTCTCAAAGACCCTAAGGTTGCTGCTTCCATTGTCCATGGTGGCTCTATTGACAAAGAAAAACTCTTCATTGAGCCAACAATATTGTTGAATCCTCCAATCGACGCCGATATCATGACCGAAGAAATCTTCGGTCCCCTGCTACCGATAATCACATTGAACAAAATTGAAGAAAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTCTATGCCTTCACGGGAGACGAAACTCTCAAGAAACGGATTTTATTCGAAACGTCATCGGGAAGTGTCACATTCAATGATAGCGTGGTTCAGTTTGTGTGTGATTCACTACCATTCGGCGGTGTTGGTCAGAGTGGTTTCGGGAGTTACCATGGCAAGTATTCATTTGATACGTTCAGCCATGAAAAAGCAGTAATGCAGAGAAGCTTTTTCATGGAAATCGAGTCACGATATCCACCATGGAACGATTTCAAGCTCAAGTTCTTTAGATTGGGGTATCGATTCGACTATTTCGGGCTGGTACTGCTCCTTTTGGGGTTGAAGTAG

Protein sequence

MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK
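
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` is illustrative. Only the first and last few codons of the CDS are used, so the example stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First ten codons and last four codons of the CDS listed above.
cds_start = "ATGGAAGCCAATTTGGAGGTGCTGAGGGAA"
cds_end = "GGGTTGAAGTAG"
print(translate(cds_start))  # MEANLEVLRE
print(translate(cds_end))    # GLK* (the asterisk marks the stop codon)
```

Both fragments agree with the start (`MEANLEVLRE`) and end (`GLK`) of the protein sequence shown above.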
Homology
BLAST of HG10001980 vs. NCBI nr
Match: XP_008453718.1 (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo] >TYK16698.1 aldehyde dehydrogenase family 3 member F1 [Cucumis melo var. makuwa])

HSP 1 Score: 886.7 bits (2290), Expect = 8.7e-254
Identity = 440/476 (92.44%), Positives = 458/476 (96.22%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLEVLRESF+NGRTRS+EWRK QLSSLIQ IH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSAN+AL SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLDNKAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPP+D DIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFT DETLKKRIL++TSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 476
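
The percentages in each HSP summary line are the ratio of matching columns to alignment length. The following sketch recomputes them from the summary line of the hit above; the helper name `hsp_fractions` is hypothetical, and the pattern also tolerates the `Postives` spelling that some exports of this page carry.

```python
import re

# Matches "Identity = 440/476" and "Positives = 458/476" (or the "Postives" variant).
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_fractions(line):
    """Return {'identity': (matches, length, percent), 'positives': (...)}."""
    out = {}
    for label, num, den in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        out[key] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return out

line = "Identity = 440/476 (92.44%), Positives = 458/476 (96.22%), Query Frame = 0"
stats = hsp_fractions(line)
print(stats["identity"])   # (440, 476, 92.44)
print(stats["positives"])  # (458, 476, 96.22)
```

The recomputed values match the percentages reported for this hit.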

BLAST of HG10001980 vs. NCBI nr
Match: XP_004146483.1 (aldehyde dehydrogenase family 3 member F1 [Cucumis sativus] >KGN53271.1 hypothetical protein Csa_015039 [Cucumis sativus])

HSP 1 Score: 884.0 bits (2283), Expect = 5.7e-253
Identity = 441/476 (92.65%), Positives = 459/476 (96.43%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEA+LEVLRESF+NGRTRS+EWR  QLSSLIQFIH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSANNAL SLHKWMAPKKK +PLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLD+KAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIV SAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVN+KNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPP+ ADIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSMDKEKLFIEPTILLNPPLYADIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFTGDETLKKRIL+ETSSGSVTFND++VQFVCDSLPFGGVGQSG GSYHG
Sbjct: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 476

BLAST of HG10001980 vs. NCBI nr
Match: KAA0044766.1 (aldehyde dehydrogenase family 3 member F1 [Cucumis melo var. makuwa])

HSP 1 Score: 883.6 bits (2282), Expect = 7.4e-253
Identity = 441/477 (92.45%), Positives = 459/477 (96.23%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLEVLRESF+NGRTRS+EWRK QLSSLIQ IH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSAN+AL SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLDNKAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMK-VAAKRIVGGKWGP 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMK VAAKRIVGGKWGP
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVVAAKRIVGGKWGP 240

Query: 241 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL 300
           CAGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL
Sbjct: 241 CAGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL 300

Query: 301 KDPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEF 360
           KDPKVAASIVHGGS+DKEKLFIEPTILLNPP+DADIMTEEIFGPLLPIITLNKIEESIEF
Sbjct: 301 KDPKVAASIVHGGSVDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 360

Query: 361 INARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYH 420
           INARPKPLALYAFT DETLKKRIL++TSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYH
Sbjct: 361 INARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYH 420

Query: 421 GKYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           GKYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 GKYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 477

BLAST of HG10001980 vs. NCBI nr
Match: XP_038876780.1 (aldehyde dehydrogenase family 3 member F1 [Benincasa hispida])

HSP 1 Score: 882.1 bits (2278), Expect = 2.2e-252
Identity = 440/476 (92.44%), Positives = 455/476 (95.59%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLE+LRESFRNGRTRS EWRK QLSSLIQF+H+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLELLRESFRNGRTRSLEWRKNQLSSLIQFVHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSANNA+ SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANNAIRSLHKWMAPKKKYVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAVIK  EYAPVFSSFL+AT+PLYLDNKAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVIKLPEYAPVFSSFLAATLPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
            GQACIGIDYVLVED+FASELIESLKRILKKFY EN+KNSTSIARIVNEKNVER+SN LK
Sbjct: 241 NGQACIGIDYVLVEDRFASELIESLKRILKKFYSENTKNSTSIARIVNEKNVERLSNFLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIM EEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMNEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFTGD+TLKKRIL ETSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTGDQTLKKRILSETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFD FSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YRFDYFGL LLLLGLK
Sbjct: 421 KYSFDAFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFGLALLLLGLK 476

BLAST of HG10001980 vs. NCBI nr
Match: XP_022136446.1 (aldehyde dehydrogenase family 3 member F1 [Momordica charantia])

HSP 1 Score: 839.3 bits (2167), Expect = 1.6e-239
Identity = 414/477 (86.79%), Positives = 449/477 (94.13%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           ME NLE LRESFR+GRTRS EWRK QL SLIQFIH+KE++IFEA+YQDLGKHP E +RDE
Sbjct: 1   MERNLEELRESFRSGRTRSAEWRKNQLISLIQFIHDKESSIFEAMYQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VG+VLKSA +ALC L KWMAP+KK VPLLFFPAKGEVLSEPFGLVLIISSWNFP+SL+LD
Sbjct: 61  VGVVLKSAKDALCCLQKWMAPQKKYVPLLFFPAKGEVLSEPFGLVLIISSWNFPISLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAP  SS L++T+PLYLD+KAIKV+EGGADVSEQLL +KWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLASTLPLYLDSKAIKVMEGGADVSEQLLLHKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVA KRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAVKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           +GQACIGIDYVLVE+KFASELIESLKRI+KKFYGENSKNSTSIARIVNE  VERISNLLK
Sbjct: 241 SGQACIGIDYVLVEEKFASELIESLKRIMKKFYGENSKNSTSIARIVNEHQVERISNLLK 300

Query: 301 DPKVAASIVH-GGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEF 360
           DPKVAASIVH GGSIDK+KLFIEPTILLNPP+DADIMTEEIFGPLLPIITLNKIEESIEF
Sbjct: 301 DPKVAASIVHGGGSIDKQKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 360

Query: 361 INARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYH 420
           IN+RPKPLA+YAFT DETLKKRILFETSSG+VTFND++VQF+CDSLPFGGVGQSGFG YH
Sbjct: 361 INSRPKPLAIYAFTRDETLKKRILFETSSGNVTFNDTMVQFLCDSLPFGGVGQSGFGRYH 420

Query: 421 GKYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           GKYSFDTFSHEKAV+QRSF +E+E RYPPWNDFKLKF RL Y FDYFGL+LLLLG+K
Sbjct: 421 GKYSFDTFSHEKAVLQRSFLLELEPRYPPWNDFKLKFIRLAYAFDYFGLLLLLLGIK 477

BLAST of HG10001980 vs. ExPASy Swiss-Prot
Match: Q70E96 (Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana OX=3702 GN=ALDH3F1 PE=2 SV=2)

HSP 1 Score: 634.8 bits (1636), Expect = 7.8e-181
Identity = 295/476 (61.97%), Positives = 384/476 (80.67%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           +E +L  +RE+F +GRTRS +WRK Q+ ++ + + + E+ I  AL+QDLGKH  E+FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           +G+VL++A  A+  L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SL+LD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA +AGNT ++K SE +P  S+FL+ TIP YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+M+AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N+ +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP+V ASIV+GGSID++KL++EPTILL+PP+D++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           N +PKPLA+YAFT DE LK RIL ETSSGSVTFND ++Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFD FSHEKA+M+ S  M++E+RYPPWN+FKL F RL +R  YF L+LL+LGLK
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLK 483

BLAST of HG10001980 vs. ExPASy Swiss-Prot
Match: Q8W033 (Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ALDH3I1 PE=1 SV=2)

HSP 1 Score: 462.6 bits (1189), Expect = 5.3e-129
Identity = 231/470 (49.15%), Positives = 323/470 (68.72%), Query Frame = 0

Query: 5   LEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIV 64
           ++ LR +F +GRT+S+EWR +QL ++ + I EKE  I EALYQDL K   E+F  E+   
Sbjct: 79  VDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLAEISNT 138

Query: 65  LKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIG 124
             S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  L+++P+IG
Sbjct: 139 KSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSVEPVIG 198

Query: 125 AKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFT 184
           A +AGN  V+KPSE AP  SS L+     YLDN  I+V+EGG   +  LL  KWDKIFFT
Sbjct: 199 AIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWDKIFFT 258

Query: 185 GSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQA 244
           G  RVARI+M+AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW   +GQA
Sbjct: 259 GGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWACNSGQA 318

Query: 245 CIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKV 304
           CIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++LK+  V
Sbjct: 319 CIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESMLKENGV 378

Query: 305 AASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARP 364
           A  IVHGG I ++KL I PTILL+ P  + +M EEIFGPLLPIIT+ KIE+  + I ++P
Sbjct: 379 ANKIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQVIRSKP 438

Query: 365 KPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSF 424
           KPLA Y FT ++ L+K+ + + S+G +T ND+V+      LPFGGVG+SG G+YHGK+S+
Sbjct: 439 KPLAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYHGKFSY 498

Query: 425 DTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLG 475
           +TFSH+K V+ RSF  + + RYPP+   K    +     + F  +L   G
Sbjct: 499 ETFSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFG 545

BLAST of HG10001980 vs. ExPASy Swiss-Prot
Match: Q70DU8 (Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana OX=3702 GN=ALDH3H1 PE=1 SV=2)

HSP 1 Score: 453.8 bits (1166), Expect = 2.5e-126
Identity = 232/468 (49.57%), Positives = 313/468 (66.88%), Query Frame = 0

Query: 8   LRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIVLKS 67
           LR SF +G TR +EWR TQL  L+      E  I  AL  DLGK   ES   EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIGAKS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  L++DP+IGA S
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  V+KPSE AP  S+ L+  +  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++M+AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPL 367
           IV+GG  D+E L I PTILL+ P+D+ IM+EEIFGPLLPI+TLN +EES + I +RPKPL
Sbjct: 319 IVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPL 378

Query: 368 ALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSFDTF 427
           A Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFD F
Sbjct: 379 AAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAF 438

Query: 428 SHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGL 476
           SH+KAV+ RS F +   RYPP++  KL+  +     + F L  +LLGL
Sbjct: 439 SHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of HG10001980 vs. ExPASy Swiss-Prot
Match: Q8VXQ2 (Aldehyde dehydrogenase OS=Craterostigma plantagineum OX=4153 GN=ALDH PE=1 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 5.2e-124
Identity = 220/473 (46.51%), Positives = 320/473 (67.65%), Query Frame = 0

Query: 2   EANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEV 61
           E  ++ LR ++ +G+T+S+EWR +QL +L++     +  + EAL  DL K   E++  E+
Sbjct: 7   EGVVDGLRRTYISGKTKSYEWRVSQLKALLKITTHHDKEVVEALRADLKKPEHEAYVHEI 66

Query: 62  GIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDP 121
            +V  +  +AL  LH+WM P+K    L  +P+  E++SEP G+VL+I++WN+P  LALDP
Sbjct: 67  FMVSNACKSALKELHQWMKPQKVKTSLATYPSSAEIVSEPLGVVLVITAWNYPFLLALDP 126

Query: 122 LIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKI 181
           +IGA +AGN  V+KPSE AP  S+ L+  +  Y+D  AI+VVEG     + LL  +WDKI
Sbjct: 127 MIGAIAAGNCVVLKPSEIAPATSALLAKLLNQYVDTSAIRVVEGAVPEMQALLDQRWDKI 186

Query: 182 FFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCA 241
           F+TGS +V +IV+S+AAKHLTPV LELGGKCP + D    + ++KVAA+RI+  KW   +
Sbjct: 187 FYTGSSKVGQIVLSSAAKHLTPVVLELGGKCPTVVD---ANIDLKVAARRIISWKWSGNS 246

Query: 242 GQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKD 301
           GQ CI  DY++  ++ A +L++++K  L+ FYG++   S  ++ I+NE+  ER++ LL D
Sbjct: 247 GQTCISPDYIITTEENAPKLVDAIKCELESFYGKDPLKSQDMSSIINERQFERMTGLLDD 306

Query: 302 PKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFIN 361
            KV+  IV+GG  DK  L I PTILL+   D+ +M+EEIFGPLLPIIT+ KIEE  + I 
Sbjct: 307 KKVSDKIVYGGQSDKSNLKIAPTILLDVSEDSSVMSEEIFGPLLPIITVGKIEECYKIIA 366

Query: 362 ARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGK 421
           ++PKPLA Y FT D+   +  +   S+G +T ND  + F+   LPFGGVG+SG GSYHGK
Sbjct: 367 SKPKPLAAYLFTNDKKRTEEFVSNVSAGGITINDIALHFLEPRLPFGGVGESGMGSYHGK 426

Query: 422 YSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLG 475
           +SFD FSH+K+V++RSF  E+ +RYPP+  +KL F     + D FGL+   LG
Sbjct: 427 FSFDAFSHKKSVLKRSFGGEVAARYPPYAPWKLHFMEAILQGDIFGLLKAWLG 476

BLAST of HG10001980 vs. ExPASy Swiss-Prot
Match: Q2FWX9 (4,4'-diaponeurosporen-aldehyde dehydrogenase OS=Staphylococcus aureus (strain NCTC 8325 / PS 47) OX=93061 GN=aldH1 PE=1 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 2.8e-101
Identity = 196/451 (43.46%), Positives = 282/451 (62.53%), Query Frame = 0

Query: 12  FRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIVLKSANNA 71
           F   +T+   +RK QL  L + I   E+ I EALY DLGK+  E++  E+GI LKS   A
Sbjct: 15  FNTQQTKDISFRKEQLKKLSKAIKSYESDILEALYTDLGKNKVEAYATEIGITLKSIKIA 74

Query: 72  LCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIGAKSAGNT 131
              L  W   K    PL  FP K  +  EP+G VLII+ +N+P  L  +PLIGA +AGNT
Sbjct: 75  RKELKNWTKTKNVDTPLYLFPTKSYIKKEPYGTVLIIAPFNYPFQLVFEPLIGAIAAGNT 134

Query: 132 AVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFTGSPRVAR 191
           A+IKPSE  P  +  +   I    D   I+V+EGG + ++ L+   +D +FFTGS  V +
Sbjct: 135 AIIKPSELTPNVARVIKRLINETFDANYIEVIEGGIEETQTLIHLPFDYVFFTGSENVGK 194

Query: 192 IVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIGIDYV 251
           IV  AA+++L PVTLE+GGK P I D +   +N+KVA++RI  GK+   AGQ C+  DY+
Sbjct: 195 IVYQAASENLVPVTLEMGGKSPVIVDET---ANIKVASERICFGKF-TNAGQTCVAPDYI 254

Query: 252 LVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKVAASIVHG 311
           LV +    +LI +L + L++FYG+N + S    RIVN K+  R+++LL   ++  +IV G
Sbjct: 255 LVHESVKDDLITALSKTLREFYGQNIQQSPDYGRIVNLKHYHRLTSLLNSAQM--NIVFG 314

Query: 312 GSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPLALYA 371
           G  D+++ +IEPT+L +   D+ IM EEIFGP+LPI+T   ++E+I FI+ RPKPL+LY 
Sbjct: 315 GHSDEDERYIEPTLLDHVTSDSAIMQEEIFGPILPILTYQSLDEAIAFIHQRPKPLSLYL 374

Query: 372 FTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSFDTFSHEK 431
           F+ DE   +R++ E S G    ND+++      LPFGGVG SG G YHGKYSFDTF+HEK
Sbjct: 375 FSEDENATQRVINELSFGGGAINDTLMHLANPKLPFGGVGASGMGRYHGKYSFDTFTHEK 434

Query: 432 AVMQRSFFMEIESRYPPWNDFKLKFFRLGYR 463
           + + +S  +E     PP+   K K+ +  ++
Sbjct: 435 SYIFKSTRLESGVHLPPYKG-KFKYIKAFFK 458

BLAST of HG10001980 vs. ExPASy TrEMBL
Match: A0A5D3CZI1 (Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G005310 PE=3 SV=1)

HSP 1 Score: 886.7 bits (2290), Expect = 4.2e-254
Identity = 440/476 (92.44%), Positives = 458/476 (96.22%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLEVLRESF+NGRTRS+EWRK QLSSLIQ IH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSAN+AL SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLDNKAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPP+D DIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFT DETLKKRIL++TSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 476

BLAST of HG10001980 vs. ExPASy TrEMBL
Match: A0A1S3BWE5 (Aldehyde dehydrogenase OS=Cucumis melo OX=3656 GN=LOC103494364 PE=3 SV=1)

HSP 1 Score: 886.7 bits (2290), Expect = 4.2e-254
Identity = 440/476 (92.44%), Positives = 458/476 (96.22%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLEVLRESF+NGRTRS+EWRK QLSSLIQ IH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSAN+AL SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLDNKAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPP+D DIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFT DETLKKRIL++TSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 476

BLAST of HG10001980 vs. ExPASy TrEMBL
Match: A0A0A0KZF5 (Aldehyde dehydrogenase OS=Cucumis sativus OX=3659 GN=Csa_4G043870 PE=3 SV=1)

HSP 1 Score: 884.0 bits (2283), Expect = 2.7e-253
Identity = 441/476 (92.65%), Positives = 459/476 (96.43%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEA+LEVLRESF+NGRTRS+EWR  QLSSLIQFIH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSANNAL SLHKWMAPKKK +PLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLD+KAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIV SAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVN+KNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPP+ ADIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSMDKEKLFIEPTILLNPPLYADIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLALYAFTGDETLKKRIL+ETSSGSVTFND++VQFVCDSLPFGGVGQSG GSYHG
Sbjct: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 476
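
The identity and positives figures in an HSP header can be checked against the match line (the unlabeled middle row of each alignment block). In that row a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch, not part of the original record; the helper names `hsp_stats` and `percent` are hypothetical, and the match line must keep its internal spaces intact.

```python
def hsp_stats(midline):
    """Count identities and positives in a BLAST match (middle) line.

    Letters are identical residues; '+' marks a conservative substitution;
    spaces are mismatches or gap columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


def percent(count, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)


# Columns 1-60 of the A0A0A0KZF5 alignment, copied from the record.
midline = "MEA+LEVLRESF+NGRTRS+EWR  QLSSLIQFIH+KEN IFEALYQDLGKHP E FRDE"
identities, positives, length = hsp_stats(midline)
print(identities, positives, length)

# Whole-HSP counts taken from the A0A0A0KZF5 header line.
print(percent(441, 476), percent(459, 476))
```

Summing the per-row counts over all rows of one HSP should reproduce the numerators in its header; the denominator is the alignment length, which includes gap columns.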

BLAST of HG10001980 vs. ExPASy TrEMBL
Match: A0A5A7TRK4 (Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G00550 PE=3 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 3.6e-253
Identity = 441/477 (92.45%), Positives = 459/477 (96.23%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           MEANLEVLRESF+NGRTRS+EWRK QLSSLIQ IH+KEN IFEALYQDLGKHP E FRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VGIVLKSAN+AL SLHKWMAPKKK VPLLFFPAKGEVLSEPFGLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAPVFSSFL AT+PLYLDNKAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMK-VAAKRIVGGKWGP 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMK VAAKRIVGGKWGP
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVVAAKRIVGGKWGP 240

Query: 241 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL 300
           CAGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL
Sbjct: 241 CAGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLL 300

Query: 301 KDPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEF 360
           KDPKVAASIVHGGS+DKEKLFIEPTILLNPP+DADIMTEEIFGPLLPIITLNKIEESIEF
Sbjct: 301 KDPKVAASIVHGGSVDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 360

Query: 361 INARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYH 420
           INARPKPLALYAFT DETLKKRIL++TSSGSVTFND++VQFVCDSLPFGGVGQSGFGSYH
Sbjct: 361 INARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYH 420

Query: 421 GKYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           GKYSFDTFSHEKAVMQRSF +E+E RYPPWNDFKLKF RL YR+DYFGL LLLLGLK
Sbjct: 421 GKYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLK 477

BLAST of HG10001980 vs. ExPASy TrEMBL
Match: A0A6J1C3I9 (Aldehyde dehydrogenase OS=Momordica charantia OX=3673 GN=LOC111008156 PE=3 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 7.7e-240
Identity = 414/477 (86.79%), Positives = 449/477 (94.13%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           ME NLE LRESFR+GRTRS EWRK QL SLIQFIH+KE++IFEA+YQDLGKHP E +RDE
Sbjct: 1   MERNLEELRESFRSGRTRSAEWRKNQLISLIQFIHDKESSIFEAMYQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           VG+VLKSA +ALC L KWMAP+KK VPLLFFPAKGEVLSEPFGLVLIISSWNFP+SL+LD
Sbjct: 61  VGVVLKSAKDALCCLQKWMAPQKKYVPLLFFPAKGEVLSEPFGLVLIISSWNFPISLSLD 120

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA SAGNTAV+KPSEYAP  SS L++T+PLYLD+KAIKV+EGGADVSEQLL +KWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLASTLPLYLDSKAIKVMEGGADVSEQLLLHKWDK 180

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVA KRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAVKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
           +GQACIGIDYVLVE+KFASELIESLKRI+KKFYGENSKNSTSIARIVNE  VERISNLLK
Sbjct: 241 SGQACIGIDYVLVEEKFASELIESLKRIMKKFYGENSKNSTSIARIVNEHQVERISNLLK 300

Query: 301 DPKVAASIVH-GGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEF 360
           DPKVAASIVH GGSIDK+KLFIEPTILLNPP+DADIMTEEIFGPLLPIITLNKIEESIEF
Sbjct: 301 DPKVAASIVHGGGSIDKQKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 360

Query: 361 INARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYH 420
           IN+RPKPLA+YAFT DETLKKRILFETSSG+VTFND++VQF+CDSLPFGGVGQSGFG YH
Sbjct: 361 INSRPKPLAIYAFTRDETLKKRILFETSSGNVTFNDTMVQFLCDSLPFGGVGQSGFGRYH 420

Query: 421 GKYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           GKYSFDTFSHEKAV+QRSF +E+E RYPPWNDFKLKF RL Y FDYFGL+LLLLG+K
Sbjct: 421 GKYSFDTFSHEKAVLQRSFLLELEPRYPPWNDFKLKFIRLAYAFDYFGLLLLLLGIK 477

BLAST of HG10001980 vs. TAIR 10
Match: AT4G36250.1 (aldehyde dehydrogenase 3F1 )

HSP 1 Score: 634.8 bits (1636), Expect = 5.6e-182
Identity = 295/476 (61.97%), Positives = 384/476 (80.67%), Query Frame = 0

Query: 1   MEANLEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDE 60
           +E +L  +RE+F +GRTRS +WRK Q+ ++ + + + E+ I  AL+QDLGKH  E+FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           +G+VL++A  A+  L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SL+LD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGA +AGNT ++K SE +P  S+FL+ TIP YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+M+AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N+ +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP+V ASIV+GGSID++KL++EPTILL+PP+D++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
           N +PKPLA+YAFT DE LK RIL ETSSGSVTFND ++Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGLK 477
           KYSFD FSHEKA+M+ S  M++E+RYPPWN+FKL F RL +R  YF L+LL+LGLK
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLK 483

BLAST of HG10001980 vs. TAIR 10
Match: AT4G34240.1 (aldehyde dehydrogenase 3I1 )

HSP 1 Score: 462.6 bits (1189), Expect = 3.8e-130
Identity = 231/470 (49.15%), Positives = 323/470 (68.72%), Query Frame = 0

Query: 5   LEVLRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIV 64
           ++ LR +F +GRT+S+EWR +QL ++ + I EKE  I EALYQDL K   E+F  E+   
Sbjct: 79  VDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLAEISNT 138

Query: 65  LKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIG 124
             S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  L+++P+IG
Sbjct: 139 KSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSVEPVIG 198

Query: 125 AKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFT 184
           A +AGN  V+KPSE AP  SS L+     YLDN  I+V+EGG   +  LL  KWDKIFFT
Sbjct: 199 AIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWDKIFFT 258

Query: 185 GSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQA 244
           G  RVARI+M+AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW   +GQA
Sbjct: 259 GGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWACNSGQA 318

Query: 245 CIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKV 304
           CIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++LK+  V
Sbjct: 319 CIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESMLKENGV 378

Query: 305 AASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARP 364
           A  IVHGG I ++KL I PTILL+ P  + +M EEIFGPLLPIIT+ KIE+  + I ++P
Sbjct: 379 ANKIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQVIRSKP 438

Query: 365 KPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSF 424
           KPLA Y FT ++ L+K+ + + S+G +T ND+V+      LPFGGVG+SG G+YHGK+S+
Sbjct: 439 KPLAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYHGKFSY 498

Query: 425 DTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLG 475
           +TFSH+K V+ RSF  + + RYPP+   K    +     + F  +L   G
Sbjct: 499 ETFSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFG 545

BLAST of HG10001980 vs. TAIR 10
Match: AT1G44170.1 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 453.8 bits (1166), Expect = 1.8e-127
Identity = 232/468 (49.57%), Positives = 313/468 (66.88%), Query Frame = 0

Query: 8   LRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIVLKS 67
           LR SF +G TR +EWR TQL  L+      E  I  AL  DLGK   ES   EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIGAKS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  L++DP+IGA S
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  V+KPSE AP  S+ L+  +  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++M+AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPL 367
           IV+GG  D+E L I PTILL+ P+D+ IM+EEIFGPLLPI+TLN +EES + I +RPKPL
Sbjct: 319 IVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPL 378

Query: 368 ALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSFDTF 427
           A Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFD F
Sbjct: 379 AAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAF 438

Query: 428 SHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGL 476
           SH+KAV+ RS F +   RYPP++  KL+  +     + F L  +LLGL
Sbjct: 439 SHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of HG10001980 vs. TAIR 10
Match: AT1G44170.2 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 453.8 bits (1166), Expect = 1.8e-127
Identity = 232/468 (49.57%), Positives = 313/468 (66.88%), Query Frame = 0

Query: 8   LRESFRNGRTRSFEWRKTQLSSLIQFIHEKENAIFEALYQDLGKHPAESFRDEVGIVLKS 67
           LR SF +G TR +EWR TQL  L+      E  I  AL  DLGK   ES   EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALDPLIGAKS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  L++DP+IGA S
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  V+KPSE AP  S+ L+  +  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++M+AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPL 367
           IV+GG  D+E L I PTILL+ P+D+ IM+EEIFGPLLPI+TLN +EES + I +RPKPL
Sbjct: 319 IVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPL 378

Query: 368 ALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHGKYSFDTF 427
           A Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFD F
Sbjct: 379 AAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAF 438

Query: 428 SHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGL 476
           SH+KAV+ RS F +   RYPP++  KL+  +     + F L  +LLGL
Sbjct: 439 SHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of HG10001980 vs. TAIR 10
Match: AT1G44170.3 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 416.0 bits (1068), Expect = 4.1e-116
Identity = 207/415 (49.88%), Positives = 285/415 (68.67%), Query Frame = 0

Query: 61  VGIVLKSANNALCSLHKWMAPKKKSVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLALD 120
           V ++  S   AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  L++D
Sbjct: 9   VSLLRNSIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSID 68

Query: 121 PLIGAKSAGNTAVIKPSEYAPVFSSFLSATIPLYLDNKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGA SAGN  V+KPSE AP  S+ L+  +  YLD  A++VVEG    +  LL+ KWDK
Sbjct: 69  PVIGAISAGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDK 128

Query: 181 IFFTGSPRVARIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IF+TGS ++ R++M+AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG  
Sbjct: 129 IFYTGSSKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCN 188

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300
            GQAC+  DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL 
Sbjct: 189 NGQACVSPDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLD 248

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPIDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           + +V+  IV+GG  D+E L I PTILL+ P+D+ IM+EEIFGPLLPI+TLN +EES + I
Sbjct: 249 EKEVSDKIVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVI 308

Query: 361 NARPKPLALYAFTGDETLKKRILFETSSGSVTFNDSVVQFVCDSLPFGGVGQSGFGSYHG 420
            +RPKPLA Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHG
Sbjct: 309 RSRPKPLAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHG 368

Query: 421 KYSFDTFSHEKAVMQRSFFMEIESRYPPWNDFKLKFFRLGYRFDYFGLVLLLLGL 476
           K+SFD FSH+KAV+ RS F +   RYPP++  KL+  +     + F L  +LLGL
Sbjct: 369 KFSFDAFSHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 420

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008453718.1  8.7e-254  92.44         PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo] >TYK16698.1 ...
XP_004146483.1  5.7e-253  92.65         aldehyde dehydrogenase family 3 member F1 [Cucumis sativus] >KGN53271.1 hypothet...
KAA0044766.1    7.4e-253  92.45         aldehyde dehydrogenase family 3 member F1 [Cucumis melo var. makuwa]
XP_038876780.1  2.2e-252  92.44         aldehyde dehydrogenase family 3 member F1 [Benincasa hispida]
XP_022136446.1  1.6e-239  86.79         aldehyde dehydrogenase family 3 member F1 [Momordica charantia]

Match Name  E-value   Identity (%)  Description
Q70E96      7.8e-181  61.97         Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana OX=3702 GN=ALD...
Q8W033      5.3e-129  49.15         Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana...
Q70DU8      2.5e-126  49.57         Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana OX=3702 GN=ALD...
Q8VXQ2      5.2e-124  46.51         Aldehyde dehydrogenase OS=Craterostigma plantagineum OX=4153 GN=ALDH PE=1 SV=1
Q2FWX9      2.8e-101  43.46         4,4'-diaponeurosporen-aldehyde dehydrogenase OS=Staphylococcus aureus (strain NC...

Match Name  E-value   Identity (%)  Description
A0A5D3CZI1  4.2e-254  92.44         Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2...
A0A1S3BWE5  4.2e-254  92.44         Aldehyde dehydrogenase OS=Cucumis melo OX=3656 GN=LOC103494364 PE=3 SV=1
A0A0A0KZF5  2.7e-253  92.65         Aldehyde dehydrogenase OS=Cucumis sativus OX=3659 GN=Csa_4G043870 PE=3 SV=1
A0A5A7TRK4  3.6e-253  92.45         Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold7...
A0A6J1C3I9  7.7e-240  86.79         Aldehyde dehydrogenase OS=Momordica charantia OX=3673 GN=LOC111008156 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT4G36250.1  5.6e-182  61.97         aldehyde dehydrogenase 3F1
AT4G34240.1  3.8e-130  49.15         aldehyde dehydrogenase 3I1
AT1G44170.1  1.8e-127  49.57         aldehyde dehydrogenase 3H1
AT1G44170.2  1.8e-127  49.57         aldehyde dehydrogenase 3H1
AT1G44170.3  4.1e-116  49.88         aldehyde dehydrogenase 3H1

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source       Source Term     Source Description                         Alignment
IPR016163  Aldehyde dehydrogenase, C-terminal       GENE3D       3.40.309.10     Aldehyde Dehydrogenase; Chain A, domain 2  coord: 210..413; e-value: 6.6E-151; score: 504.8
IPR015590  Aldehyde dehydrogenase domain            PFAM         PF00171         Aldedh                                     coord: 9..433; e-value: 6.5E-74; score: 249.1
IPR016162  Aldehyde dehydrogenase, N-terminal       GENE3D       3.40.605.10     Aldehyde Dehydrogenase; Chain A, domain 1  coord: 9..431; e-value: 6.6E-151; score: 504.8
IPR012394  Aldehyde dehydrogenase NAD(P)-dependent  PIRSF        PIRSF036492     ALDH                                       coord: 1..468; e-value: 1.1E-169; score: 563.1
IPR012394  Aldehyde dehydrogenase NAD(P)-dependent  PANTHER      PTHR43570       ALDEHYDE DEHYDROGENASE                     coord: 4..476
None       No IPR available                         PANTHER      PTHR43570:SF17  ALDEHYDE DEHYDROGENASE FAMILY 3 MEMBER F1  coord: 4..476
IPR016161  Aldehyde/histidinol dehydrogenase        SUPERFAMILY  53720           ALDH-like                                  coord: 2..448
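
The `coord: start..end` values in the Alignment column are inclusive residue positions on the protein, so each hit's span can be computed directly. The sketch below is illustrative only and not part of the original record; the function names are hypothetical, and the protein length of 476 residues is an assumption taken from the largest coordinate reported above.

```python
import re

PROTEIN_LENGTH = 476  # assumption: largest end coordinate in the table


def parse_coord(text):
    """Extract (start, end) from a string containing 'coord: start..end'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Number of residues covered; both ends are inclusive."""
    return end - start + 1


def coverage(start, end, protein_length=PROTEIN_LENGTH):
    """Percentage of the protein covered by the hit, to two decimals."""
    return round(100.0 * span_length(start, end) / protein_length, 2)


# Two hits copied from the InterPro table.
for label, cell in [("GENE3D 3.40.309.10", "coord: 210..413"),
                    ("PFAM PF00171", "coord: 9..433")]:
    start, end = parse_coord(cell)
    print(label, span_length(start, end), coverage(start, end))
```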

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10001980.1  HG10001980.1  mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006081      cellular aldehyde metabolic process
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0004029      aldehyde dehydrogenase (NAD+) activity
molecular_function  GO:0016491      oxidoreductase activity
molecular_function  GO:0016620      oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor