Cp4.1LG14g01640 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG14g01640
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionAldehyde dehydrogenase
LocationCp4.1LG14: 3505085 .. 3507561 (+)
RNA-Seq ExpressionCp4.1LG14g01640
SyntenyCp4.1LG14g01640
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTAGTTGAAGGTGGCTTTTATGAAGGGCATGGAACATTAATTACTAACCCCAACTATTTCCGTAACATCATAAGCTTTGTCCACTTGGTAAAGCAATGTCATGACTCCAAAGTCATCCCAAAGATCAGATCAAAGCAAGTTCTTGAGTTGAAAATGGATACCCATTTGAAGGAGCTAAGGCAAAGCTTCAGAAATGGAAGAACAAGGAGCTTAGAATGGAGGAAAGAACAGCTGATTTCACTAATTCACTTCATTCAAGACAAAGAAAACGCCATTTTTGAAGCTCTCTTCCAAGATCTTGGGAAGCATCCTGTCGAAATTTATCGAGATGAGGTGAATTTTCAATCCAAATCACCAATTTGGCAATGTCTTTATATGGAAACAGAGCCTAATTTTACTTGTTCTTTAAGGTTGGAATCGTTCTAAAATCTGCAACCAACGCTCTGTCCTGTTTACACAAATGGATCGCTCCTAAAAAGGTTCGTTTTTTTCTTGTTCTTGTTCGTGTTCTTCGTGTTCTTGTTCGTGTTCTTATATGGTTGATTTATTGGGAGGTAGAAATACGTGCCATTACTGTTCTTCCCAGCAAAAGGGGAAGTTTTGCCTGAGCCATATGGTCTAGTCCTCATAATTTCATCATGGAATTTCCCCCTTTGTGAGTATATTCCTTCTTTACCTCAAATTAAACCGACCCATCATTTGTTCTTCTAATGCTTCTACTTGTCTTTGCAGCTTTGGCATTGGATCCGTTAATCGGAGCGATATCAGCAGGCAACACGGCGGTTCTAAAACCGTCGGAGTACGCTCCGGCGTGCTCGTCTCTTCTCGCTGCGACGCTTCCTCTTTACCTCGACGATAAAGCCATCAAGGTCGTGGAGGGCGGAGCTGATATCAGTGAACAGCTTCTGCAGCAGAAGTGGGATAAGATCTTCTTCACTGGTAGTCCAAGTGTAGCTAAGATTGTGATGTCTGAAGCTGCAAAGCATCTAACTCCTGTAACATTGGAGCTTGGGGGAAAGTGTCCTGCAATCTTTGATTACTCCTCTGTTCTTTCCAATATGAAGGTTGTTTAATTTGGGAGCTGAAATGTGATGTCTTTAATGTATTGAAGTTCATGATGTTTTGTTTTTCGGTCTCGAATTCATAGGTAGCTGCTAAGAGAATCGTCGGAGGAAAATGGGGACCGTGTTCGGGTCAGGCGTGTATAGGGATCGATTACTTGCTTGTTGAGGATAAGTTTGCTTCAGAATTGGTAAGAACAAGTTCACTCACGGGCTCTTACTGTTTTTCTAGCTCATGTTTGTAATCTTTCTCTTTCAGATCGAGTCGTTAAAGCGAGTGCTGAAGAAGTTTTACGGTGAAAACTCGAGAAATTCGACGAGTATAGCTCGGATTGTTAACGAGAAACATGTTGAAAGGATCAGTAATATGCTTAATGACCCTAAGGTTGCTGGTTGTATTGTCCATGGTGGTTTAATAGACAAACAAAAATGGTAAGCTAAGTTCTCGAGTGAGGATTTTTTTTTTTTTTTTTTNACTGACCGTTTCAACGGTAGGATAGTCCGAACCTTTTGTCTGTTTCTTTGAGCTTGAAATCCGTAAAAGAAAGGATTTTTTTTTTTTTTTTTTTGGTTGACCGTTCTATTGAATACTTGTTATTGATTTGTAGCTTCATTGAGCCGACAATATTGTTGAATCCTCCGATCGATGCTGATATCATGACCGAAGAAATCTTCGGTCCCGTGCTACCGATAATAACGGTAAGCAAAGCAATAGTATGCAACTTTCAATGATTTCACAGAACCATAAATTTTTCTGATCGTCTATGTTCTTGAAATAGCTGGATAGAATCGAAGAAAGCATTGAGTTTATCAATGCAAGACCGAAACCTCTAGCTATCTACGCCTTCACGGAAGACGAAACGCTAAAGAAACGGATTATATCCGAAACATCATCAGGAAGTGTCACGTTCAATGATACCATGGTTCAGGTACTGTTGTTCTACCTACTCTCTCAAAGCAAGCTTTGCACTAAGCTGAAGAAATGAGATGCATTTTCTCTTGCAGTTTGTGTGCGATTCGCTACCATTTGGTGGTGTCGGCCAGAGCGGTTTCGGGAGTTACCATGGCAAGTATTCTTTTGATGCATTCAGCCATGACAAAGCAGTGTTGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCGCCATGGAACGATTTCAAGCTCAAGTTCATTAGATTGGCATATCGATTCGACTATTTCAGGCTAGTACTACTCCTTTTGGGGTTAAAGAAGTAGAAATTTTAGATGTACTGTAATCGATATCTACTCATAAACTTGATATCATATCATGAACTGTAATCGATATCTACTCGTAAACTTGATATCATATCATGAACTGTAATCGATATCATACTCGTAAACTTGATATCATATCATGAACAATAATAAATATCGGTTGATTTTATATCC

mRNA sequence

GTAGTTGAAGGTGGCTTTTATGAAGGGCATGGAACATTAATTACTAACCCCAACTATTTCCGTAACATCATAAGCTTTGTCCACTTGGTAAAGCAATGTCATGACTCCAAAGTCATCCCAAAGATCAGATCAAAGCAAGTTCTTGAGTTGAAAATGGATACCCATTTGAAGGAGCTAAGGCAAAGCTTCAGAAATGGAAGAACAAGGAGCTTAGAATGGAGGAAAGAACAGCTGATTTCACTAATTCACTTCATTCAAGACAAAGAAAACGCCATTTTTGAAGCTCTCTTCCAAGATCTTGGGAAGCATCCTGTCGAAATTTATCGAGATGAGGTTGGAATCGTTCTAAAATCTGCAACCAACGCTCTGTCCTGTTTACACAAATGGATCGCTCCTAAAAAGAAATACGTGCCATTACTGTTCTTCCCAGCAAAAGGGGAAGTTTTGCCTGAGCCATATGGTCTAGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGGCATTGGATCCGTTAATCGGAGCGATATCAGCAGGCAACACGGCGGTTCTAAAACCGTCGGAGTACGCTCCGGCGTGCTCGTCTCTTCTCGCTGCGACGCTTCCTCTTTACCTCGACGATAAAGCCATCAAGGTCGTGGAGGGCGGAGCTGATATCAGTGAACAGCTTCTGCAGCAGAAGTGGGATAAGATCTTCTTCACTGGTAGTCCAAGTGTAGCTAAGATTGTGATGTCTGAAGCTGCAAAGCATCTAACTCCTGTAACATTGGAGCTTGGGGGAAAGTGTCCTGCAATCTTTGATTACTCCTCTGTTCTTTCCAATATGAAGGTAGCTGCTAAGAGAATCGTCGGAGGAAAATGGGGACCGTGTTCGGGTCAGGCGTGTATAGGGATCGATTACTTGCTTGTTGAGGATAAGTTTGCTTCAGAATTGATCGAGTCGTTAAAGCGAGTGCTGAAGAAGTTTTACGGTGAAAACTCGAGAAATTCGACGAGTATAGCTCGGATTGTTAACGAGAAACATGTTGAAAGGATCAGTAATATGCTTAATGACCCTAAGGTTGCTGGTTGTATTGTCCATGGTGGTTTAATAGACAAACAAAAATGCTTCATTGAGCCGACAATATTGTTGAATCCTCCGATCGATGCTGATATCATGACCGAAGAAATCTTCGGTCCCGTGCTACCGATAATAACGCTGGATAGAATCGAAGAAAGCATTGAGTTTATCAATGCAAGACCGAAACCTCTAGCTATCTACGCCTTCACGGAAGACGAAACGCTAAAGAAACGGATTATATCCGAAACATCATCAGGAAGTGTCACGTTCAATGATACCATGGTTCAGTTTGTGTGCGATTCGCTACCATTTGGTGGTGTCGGCCAGAGCGGTTTCGGGAGTTACCATGGCAAGTATTCTTTTGATGCATTCAGCCATGACAAAGCAGTGTTGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCGCCATGGAACGATTTCAAGCTCAAGTTCATTAGATTGGCATATCGATTCGACTATTTCAGGCTAGTACTACTCCTTTTGGGGTTAAAGAAGTAGAAATTTTAGATGTACTGTAATCGATATCTACTCATAAACTTGATATCATATCATGAACTGTAATCGATATCTACTCGTAAACTTGATATCATATCATGAACTGTAATCGATATCATACTCGTAAACTTGATATCATATCATGAACAATAATAAATATCGGTTGATTTTATATCC

Coding sequence (CDS)

ATGGATACCCATTTGAAGGAGCTAAGGCAAAGCTTCAGAAATGGAAGAACAAGGAGCTTAGAATGGAGGAAAGAACAGCTGATTTCACTAATTCACTTCATTCAAGACAAAGAAAACGCCATTTTTGAAGCTCTCTTCCAAGATCTTGGGAAGCATCCTGTCGAAATTTATCGAGATGAGGTTGGAATCGTTCTAAAATCTGCAACCAACGCTCTGTCCTGTTTACACAAATGGATCGCTCCTAAAAAGAAATACGTGCCATTACTGTTCTTCCCAGCAAAAGGGGAAGTTTTGCCTGAGCCATATGGTCTAGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGGCATTGGATCCGTTAATCGGAGCGATATCAGCAGGCAACACGGCGGTTCTAAAACCGTCGGAGTACGCTCCGGCGTGCTCGTCTCTTCTCGCTGCGACGCTTCCTCTTTACCTCGACGATAAAGCCATCAAGGTCGTGGAGGGCGGAGCTGATATCAGTGAACAGCTTCTGCAGCAGAAGTGGGATAAGATCTTCTTCACTGGTAGTCCAAGTGTAGCTAAGATTGTGATGTCTGAAGCTGCAAAGCATCTAACTCCTGTAACATTGGAGCTTGGGGGAAAGTGTCCTGCAATCTTTGATTACTCCTCTGTTCTTTCCAATATGAAGGTAGCTGCTAAGAGAATCGTCGGAGGAAAATGGGGACCGTGTTCGGGTCAGGCGTGTATAGGGATCGATTACTTGCTTGTTGAGGATAAGTTTGCTTCAGAATTGATCGAGTCGTTAAAGCGAGTGCTGAAGAAGTTTTACGGTGAAAACTCGAGAAATTCGACGAGTATAGCTCGGATTGTTAACGAGAAACATGTTGAAAGGATCAGTAATATGCTTAATGACCCTAAGGTTGCTGGTTGTATTGTCCATGGTGGTTTAATAGACAAACAAAAATGCTTCATTGAGCCGACAATATTGTTGAATCCTCCGATCGATGCTGATATCATGACCGAAGAAATCTTCGGTCCCGTGCTACCGATAATAACGCTGGATAGAATCGAAGAAAGCATTGAGTTTATCAATGCAAGACCGAAACCTCTAGCTATCTACGCCTTCACGGAAGACGAAACGCTAAAGAAACGGATTATATCCGAAACATCATCAGGAAGTGTCACGTTCAATGATACCATGGTTCAGTTTGTGTGCGATTCGCTACCATTTGGTGGTGTCGGCCAGAGCGGTTTCGGGAGTTACCATGGCAAGTATTCTTTTGATGCATTCAGCCATGACAAAGCAGTGTTGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCGCCATGGAACGATTTCAAGCTCAAGTTCATTAGATTGGCATATCGATTCGACTATTTCAGGCTAGTACTACTCCTTTTGGGGTTAAAGAAGTAG

Protein sequence

MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Homology
BLAST of Cp4.1LG14g01640 vs. ExPASy Swiss-Prot
Match: Q70E96 (Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana OX=3702 GN=ALDH3F1 PE=2 SV=2)

HSP 1 Score: 637.1 bits (1642), Expect = 1.6e-181
Identity = 292/477 (61.22%), Postives = 385/477 (80.71%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           ++  L+E+R++F +GRTRSL+WRK Q+ ++   ++D E+ I  ALFQDLGKH  E +RDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           +G+VL++AT A++CL KW  PK   +PLLF+PAKG+V+ EPYG VL++SSWNFP+SL+LD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAI+AGNT +LK SE +P  S+ LA T+P YLD KAIKV+EGG D++  LLQ +WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSP + +I+M+ AA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           +GQACI +DY+L+E  FA  LI+ LK  +K F+GEN + S  ++RI N+ HV+R+S +L+
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DP+V   IV+GG ID+ K ++EPTILL+PP+D++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           N +PKPLAIYAFT DE LK RI+SETSSGSVTFND M+Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 478
           KYSFD FSH+KA+++ S  ++LE RYPPWN+FKL FIRLA+R  YF+L+LL+LGLK+
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLKR 484

BLAST of Cp4.1LG14g01640 vs. ExPASy Swiss-Prot
Match: Q8W033 (Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ALDH3I1 PE=1 SV=2)

HSP 1 Score: 455.3 bits (1170), Expect = 8.5e-127
Identity = 228/471 (48.41%), Postives = 317/471 (67.30%), Query Frame = 0

Query: 7   ELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLK 66
           ELR +F +GRT+S EWR  QL ++   I +KE  I EAL+QDL K  +E +  E+     
Sbjct: 81  ELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLAEISNTKS 140

Query: 67  SATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAI 126
           S   A+  L  W+AP+     +  FP+  +++ EP G+VL+IS+WNFP  L+++P+IGAI
Sbjct: 141 SCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSVEPVIGAI 200

Query: 127 SAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGS 186
           +AGN  VLKPSE APA SSLLA     YLD+  I+V+EGG   +  LL QKWDKIFFTG 
Sbjct: 201 AAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWDKIFFTGG 260

Query: 187 PSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACI 246
             VA+I+M+ AA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW   SGQACI
Sbjct: 261 ARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWACNSGQACI 320

Query: 247 GIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAG 306
           G+DY++    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  H +R+ +ML +  VA 
Sbjct: 321 GVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESMLKENGVAN 380

Query: 307 CIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKP 366
            IVHGG I + K  I PTILL+ P  + +M EEIFGP+LPIIT+ +IE+  + I ++PKP
Sbjct: 381 KIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQVIRSKPKP 440

Query: 367 LAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDA 426
           LA Y FT ++ L+K+ + + S+G +T NDT++      LPFGGVG+SG G+YHGK+S++ 
Sbjct: 441 LAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYHGKFSYET 500

Query: 427 FSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 478
           FSH K VL RSF  + + RYPP+   K   ++     + F  +L   G  K
Sbjct: 501 FSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFGFSK 548

BLAST of Cp4.1LG14g01640 vs. ExPASy Swiss-Prot
Match: Q70DU8 (Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana OX=3702 GN=ALDH3H1 PE=1 SV=2)

HSP 1 Score: 449.5 bits (1155), Expect = 4.7e-125
Identity = 230/469 (49.04%), Postives = 316/469 (67.38%), Query Frame = 0

Query: 7   ELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLK 66
           ELR+SF +G TR  EWR  QL  L+    + E  I  AL  DLGK  +E    EV ++  
Sbjct: 18  ELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRN 77

Query: 67  SATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAI 126
           S   AL  L  W+AP+K    L  FPA  E++ EP G+VL+IS+WN+P  L++DP+IGAI
Sbjct: 78  SIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAI 137

Query: 127 SAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGS 186
           SAGN  VLKPSE APA S+LL   L  YLD  A++VVEG    +  LL+QKWDKIF+TGS
Sbjct: 138 SAGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGS 197

Query: 187 PSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACI 246
             + +++M+ AAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG  +GQAC+
Sbjct: 198 SKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACV 257

Query: 247 GIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAG 306
             DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  H +R+S +L++ +V+ 
Sbjct: 258 SPDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSD 317

Query: 307 CIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKP 366
            IV+GG  D++   I PTILL+ P+D+ IM+EEIFGP+LPI+TL+ +EES + I +RPKP
Sbjct: 318 KIVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKP 377

Query: 367 LAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDA 426
           LA Y FT ++ LK+R  +  S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFDA
Sbjct: 378 LAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDA 437

Query: 427 FSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 476
           FSH KAVL RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 438 FSHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of Cp4.1LG14g01640 vs. ExPASy Swiss-Prot
Match: Q8VXQ2 (Aldehyde dehydrogenase OS=Craterostigma plantagineum OX=4153 GN=ALDH PE=1 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.7e-122
Identity = 224/467 (47.97%), Postives = 315/467 (67.45%), Query Frame = 0

Query: 8   LRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLKS 67
           LR+++ +G+T+S EWR  QL +L+      +  + EAL  DL K   E Y  E+ +V  +
Sbjct: 13  LRRTYISGKTKSYEWRVSQLKALLKITTHHDKEVVEALRADLKKPEHEAYVHEIFMVSNA 72

Query: 68  ATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAIS 127
             +AL  LH+W+ P+K    L  +P+  E++ EP G+VL+I++WN+P  LALDP+IGAI+
Sbjct: 73  CKSALKELHQWMKPQKVKTSLATYPSSAEIVSEPLGVVLVITAWNYPFLLALDPMIGAIA 132

Query: 128 AGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGSP 187
           AGN  VLKPSE APA S+LLA  L  Y+D  AI+VVEG     + LL Q+WDKIF+TGS 
Sbjct: 133 AGNCVVLKPSEIAPATSALLAKLLNQYVDTSAIRVVEGAVPEMQALLDQRWDKIFYTGSS 192

Query: 188 SVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACIG 247
            V +IV+S AAKHLTPV LELGGKCP + D +    ++KVAA+RI+  KW   SGQ CI 
Sbjct: 193 KVGQIVLSSAAKHLTPVVLELGGKCPTVVDAN---IDLKVAARRIISWKWSGNSGQTCIS 252

Query: 248 IDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAGC 307
            DY++  ++ A +L++++K  L+ FYG++   S  ++ I+NE+  ER++ +L+D KV+  
Sbjct: 253 PDYIITTEENAPKLVDAIKCELESFYGKDPLKSQDMSSIINERQFERMTGLLDDKKVSDK 312

Query: 308 IVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKPL 367
           IV+GG  DK    I PTILL+   D+ +M+EEIFGP+LPIIT+ +IEE  + I ++PKPL
Sbjct: 313 IVYGGQSDKSNLKIAPTILLDVSEDSSVMSEEIFGPLLPIITVGKIEECYKIIASKPKPL 372

Query: 368 AIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDAF 427
           A Y FT D+   +  +S  S+G +T ND  + F+   LPFGGVG+SG GSYHGK+SFDAF
Sbjct: 373 AAYLFTNDKKRTEEFVSNVSAGGITINDIALHFLEPRLPFGGVGESGMGSYHGKFSFDAF 432

Query: 428 SHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLG 475
           SH K+VL+RSF  E+  RYPP+  +KL F+    + D F L+   LG
Sbjct: 433 SHKKSVLKRSFGGEVAARYPPYAPWKLHFMEAILQGDIFGLLKAWLG 476

BLAST of Cp4.1LG14g01640 vs. ExPASy Swiss-Prot
Match: Q2FWX9 (4,4'-diaponeurosporen-aldehyde dehydrogenase OS=Staphylococcus aureus (strain NCTC 8325 / PS 47) OX=93061 GN=aldH1 PE=1 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 3.5e-104
Identity = 199/451 (44.12%), Postives = 287/451 (63.64%), Query Frame = 0

Query: 12  FRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLKSATNA 71
           F   +T+ + +RKEQL  L   I+  E+ I EAL+ DLGK+ VE Y  E+GI LKS   A
Sbjct: 15  FNTQQTKDISFRKEQLKKLSKAIKSYESDILEALYTDLGKNKVEAYATEIGITLKSIKIA 74

Query: 72  LSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAISAGNT 131
              L  W   K    PL  FP K  +  EPYG VLII+ +N+P  L  +PLIGAI+AGNT
Sbjct: 75  RKELKNWTKTKNVDTPLYLFPTKSYIKKEPYGTVLIIAPFNYPFQLVFEPLIGAIAAGNT 134

Query: 132 AVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGSPSVAK 191
           A++KPSE  P  + ++   +    D   I+V+EGG + ++ L+   +D +FFTGS +V K
Sbjct: 135 AIIKPSELTPNVARVIKRLINETFDANYIEVIEGGIEETQTLIHLPFDYVFFTGSENVGK 194

Query: 192 IVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACIGIDYL 251
           IV   A+++L PVTLE+GGK P I D +   +N+KVA++RI  GK+   +GQ C+  DY+
Sbjct: 195 IVYQAASENLVPVTLEMGGKSPVIVDET---ANIKVASERICFGKF-TNAGQTCVAPDYI 254

Query: 252 LVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAGCIVHG 311
           LV +    +LI +L + L++FYG+N + S    RIVN KH  R++++LN  ++   IV G
Sbjct: 255 LVHESVKDDLITALSKTLREFYGQNIQQSPDYGRIVNLKHYHRLTSLLNSAQMN--IVFG 314

Query: 312 GLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKPLAIYA 371
           G  D+ + +IEPT+L +   D+ IM EEIFGP+LPI+T   ++E+I FI+ RPKPL++Y 
Sbjct: 315 GHSDEDERYIEPTLLDHVTSDSAIMQEEIFGPILPILTYQSLDEAIAFIHQRPKPLSLYL 374

Query: 372 FTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDAFSHDK 431
           F+EDE   +R+I+E S G    NDT++      LPFGGVG SG G YHGKYSFD F+H+K
Sbjct: 375 FSEDENATQRVINELSFGGGAINDTLMHLANPKLPFGGVGASGMGRYHGKYSFDTFTHEK 434

Query: 432 AVLQRSFLIELEPRYPPWNDFKLKFIRLAYR 463
           + + +S  +E     PP+   K K+I+  ++
Sbjct: 435 SYIFKSTRLESGVHLPPYKG-KFKYIKAFFK 458

BLAST of Cp4.1LG14g01640 vs. NCBI nr
Match: XP_023552079.1 (aldehyde dehydrogenase family 3 member F1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 950 bits (2456), Expect = 0.0
Identity = 477/477 (100.00%), Postives = 477/477 (100.00%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 21  MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 80

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 81  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 140

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 141 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 200

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC
Sbjct: 201 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 260

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN
Sbjct: 261 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 320

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI
Sbjct: 321 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 380

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 381 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 440

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 441 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 497

BLAST of Cp4.1LG14g01640 vs. NCBI nr
Match: KAG6577244.1 (Aldehyde dehydrogenase family 3 member F1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 935 bits (2416), Expect = 0.0
Identity = 470/477 (98.53%), Postives = 475/477 (99.58%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 1   MDTHLEELRQSFRNGRTRSLEWRKKQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERIS+MLN
Sbjct: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISDMLN 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDA+IMTEEIFGPVLPIITLD+IEESIEFI
Sbjct: 301 DPKVAGCIVHGGLIDKQKRFIEPTILLNPPIDANIMTEEIFGPVLPIITLDKIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. NCBI nr
Match: XP_022929292.1 (aldehyde dehydrogenase family 3 member F1 [Cucurbita moschata] >XP_022929293.1 aldehyde dehydrogenase family 3 member F1 [Cucurbita moschata])

HSP 1 Score: 930 bits (2404), Expect = 0.0
Identity = 468/477 (98.11%), Postives = 472/477 (98.95%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLI FIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 1   MDTHLEELRQSFRNGRTRSLEWRKKQLISLIQFIQDKENAIFEALFQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA NALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 61  VGIVLKSANNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN
Sbjct: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDA+IMTEEIFGPVLPIITLD+IEESIEFI
Sbjct: 301 DPKVAGCIVHGGLIDKQKRFIEPTILLNPPIDANIMTEEIFGPVLPIITLDKIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSF IELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 421 KYSFDAFSHDKAVLQRSFFIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. NCBI nr
Match: XP_022984451.1 (aldehyde dehydrogenase family 3 member F1 [Cucurbita maxima] >XP_022984452.1 aldehyde dehydrogenase family 3 member F1 [Cucurbita maxima])

HSP 1 Score: 930 bits (2403), Expect = 0.0
Identity = 468/477 (98.11%), Postives = 472/477 (98.95%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLI FIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 1   MDTHLEELRQSFRNGRTRSLEWRKKQLISLIQFIQDKENAIFEALFQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA NALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 61  VGIVLKSANNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV S+MKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSSMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNML+
Sbjct: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLH 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDADIMTEEIFGPVLPIITLD IEESIEFI
Sbjct: 301 DPKVAGCIVHGGLIDKQKLFIEPTILLNPPIDADIMTEEIFGPVLPIITLDNIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. NCBI nr
Match: KAG7015334.1 (Aldehyde dehydrogenase family 3 member F1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 923 bits (2386), Expect = 0.0
Identity = 467/482 (96.89%), Postives = 473/482 (98.13%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLI FIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 43  MDTHLEELRQSFRNGRTRSLEWRKKQLISLIQFIQDKENAIFEALFQDLGKHPVEIYRDE 102

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA NALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 103 VGIVLKSANNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 162

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 163 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 222

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 223 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 282

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERIS+MLN
Sbjct: 283 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISDMLN 342

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIIT-----LDRIEE 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDA+IMTEEIFGPVLPIIT     LD+IEE
Sbjct: 343 DPKVAGCIVHGGLIDKQKRFIEPTILLNPPIDANIMTEEIFGPVLPIITVSEAVLDKIEE 402

Query: 361 SIEFINARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGF 420
           SIEFINARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTM+QFVCDSLPFGGVGQSGF
Sbjct: 403 SIEFINARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMIQFVCDSLPFGGVGQSGF 462

Query: 421 GSYHGKYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 477
           GSYHGKYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL
Sbjct: 463 GSYHGKYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 522

BLAST of Cp4.1LG14g01640 vs. ExPASy TrEMBL
Match: A0A6J1ERN8 (Aldehyde dehydrogenase OS=Cucurbita moschata OX=3662 GN=LOC111435918 PE=3 SV=1)

HSP 1 Score: 930 bits (2404), Expect = 0.0
Identity = 468/477 (98.11%), Postives = 472/477 (98.95%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLI FIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 1   MDTHLEELRQSFRNGRTRSLEWRKKQLISLIQFIQDKENAIFEALFQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA NALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 61  VGIVLKSANNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN
Sbjct: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDA+IMTEEIFGPVLPIITLD+IEESIEFI
Sbjct: 301 DPKVAGCIVHGGLIDKQKRFIEPTILLNPPIDANIMTEEIFGPVLPIITLDKIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSF IELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 421 KYSFDAFSHDKAVLQRSFFIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. ExPASy TrEMBL
Match: A0A6J1J280 (Aldehyde dehydrogenase OS=Cucurbita maxima OX=3661 GN=LOC111482755 PE=3 SV=1)

HSP 1 Score: 930 bits (2403), Expect = 0.0
Identity = 468/477 (98.11%), Postives = 472/477 (98.95%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           MDTHL+ELRQSFRNGRTRSLEWRK+QLISLI FIQDKENAIFEALFQDLGKHPVEIYRDE
Sbjct: 1   MDTHLEELRQSFRNGRTRSLEWRKKQLISLIQFIQDKENAIFEALFQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA NALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD
Sbjct: 61  VGIVLKSANNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSV S+MKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVHSSMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNML+
Sbjct: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLH 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVAGCIVHGGLIDKQK FIEPTILLNPPIDADIMTEEIFGPVLPIITLD IEESIEFI
Sbjct: 301 DPKVAGCIVHGGLIDKQKLFIEPTILLNPPIDADIMTEEIFGPVLPIITLDNIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK
Sbjct: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. ExPASy TrEMBL
Match: A0A5D3CZI1 (Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G005310 PE=3 SV=1)

HSP 1 Score: 831 bits (2146), Expect = 5.11e-302
Identity = 411/477 (86.16%), Postives = 445/477 (93.29%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           M+ +L+ LR+SF+NGRTRS EWRK+QL SLI  I DKEN IFEAL+QDLGKHPVEI+RDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA +ALS LHKW+APKKK VPLLFFPAKGEVL EP+GLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAP  SS L ATLPLYLD+KAIKVVEGGAD+ EQLLQ KWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSP V +IVMS AAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           +GQACIGIDY+LVEDKFASELI+SLKR+LKKFYGENS+NSTSIARIVNEK+VERISN+L 
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVA  IVHGG +DK+K FIEPTILLNPP+D DIMTEEIFGP+LPIITL++IEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLA+YAFTEDETLKKRI+ +TSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFD FSH+KAV+QRSFLIELEPRYPPWNDFKLKFIRLAYR+DYF L LLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. ExPASy TrEMBL
Match: A0A1S3BWE5 (Aldehyde dehydrogenase OS=Cucumis melo OX=3656 GN=LOC103494364 PE=3 SV=1)

HSP 1 Score: 831 bits (2146), Expect = 5.11e-302
Identity = 411/477 (86.16%), Postives = 445/477 (93.29%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           M+ +L+ LR+SF+NGRTRS EWRK+QL SLI  I DKEN IFEAL+QDLGKHPVEI+RDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VGIVLKSA +ALS LHKW+APKKK VPLLFFPAKGEVL EP+GLVLIISSWNFPLSL+LD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAP  SS L ATLPLYLD+KAIKVVEGGAD+ EQLLQ KWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSP V +IVMS AAKHLTPVTLELGGKCPAIFDYSSV SNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           +GQACIGIDY+LVEDKFASELI+SLKR+LKKFYGENS+NSTSIARIVNEK+VERISN+L 
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DPKVA  IVHGG +DK+K FIEPTILLNPP+D DIMTEEIFGP+LPIITL++IEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           NARPKPLA+YAFTEDETLKKRI+ +TSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           KYSFD FSH+KAV+QRSFLIELEPRYPPWNDFKLKFIRLAYR+DYF L LLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 477

BLAST of Cp4.1LG14g01640 vs. ExPASy TrEMBL
Match: A0A6J1C3I9 (Aldehyde dehydrogenase OS=Momordica charantia OX=3673 GN=LOC111008156 PE=3 SV=1)

HSP 1 Score: 829 bits (2141), Expect = 3.07e-301
Identity = 409/478 (85.56%), Postives = 447/478 (93.51%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           M+ +L+ELR+SFR+GRTRS EWRK QLISLI FI DKE++IFEA++QDLGKHPVEIYRDE
Sbjct: 1   MERNLEELRESFRSGRTRSAEWRKNQLISLIQFIHDKESSIFEAMYQDLGKHPVEIYRDE 60

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           VG+VLKSA +AL CL KW+AP+KKYVPLLFFPAKGEVL EP+GLVLIISSWNFP+SL+LD
Sbjct: 61  VGVVLKSAKDALCCLQKWMAPQKKYVPLLFFPAKGEVLSEPFGLVLIISSWNFPISLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAISAGNTAVLKPSEYAPACSSLLA+TLPLYLD KAIKV+EGGAD+SEQLL  KWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPACSSLLASTLPLYLDSKAIKVMEGGADVSEQLLLHKWDK 180

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSP V +IVMS AAKHLTPVTLELGGKCPAIFDYSSV SNMKVA KRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAVKRIVGGKWGPC 240

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           SGQACIGIDY+LVE+KFASELIESLKR++KKFYGENS+NSTSIARIVNE  VERISN+L 
Sbjct: 241 SGQACIGIDYVLVEEKFASELIESLKRIMKKFYGENSKNSTSIARIVNEHQVERISNLLK 300

Query: 301 DPKVAGCIVHGG-LIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEF 360
           DPKVA  IVHGG  IDKQK FIEPTILLNPP+DADIMTEEIFGP+LPIITL++IEESIEF
Sbjct: 301 DPKVAASIVHGGGSIDKQKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 360

Query: 361 INARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYH 420
           IN+RPKPLAIYAFT DETLKKRI+ ETSSG+VTFNDTMVQF+CDSLPFGGVGQSGFG YH
Sbjct: 361 INSRPKPLAIYAFTRDETLKKRILFETSSGNVTFNDTMVQFLCDSLPFGGVGQSGFGRYH 420

Query: 421 GKYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 477
           GKYSFD FSH+KAVLQRSFL+ELEPRYPPWNDFKLKFIRLAY FDYF L+LLLLG+KK
Sbjct: 421 GKYSFDTFSHEKAVLQRSFLLELEPRYPPWNDFKLKFIRLAYAFDYFGLLLLLLGIKK 478

BLAST of Cp4.1LG14g01640 vs. TAIR 10
Match: AT4G36250.1 (aldehyde dehydrogenase 3F1 )

HSP 1 Score: 637.1 bits (1642), Expect = 1.1e-182
Identity = 292/477 (61.22%), Postives = 385/477 (80.71%), Query Frame = 0

Query: 1   MDTHLKELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDE 60
           ++  L+E+R++F +GRTRSL+WRK Q+ ++   ++D E+ I  ALFQDLGKH  E +RDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           +G+VL++AT A++CL KW  PK   +PLLF+PAKG+V+ EPYG VL++SSWNFP+SL+LD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           PLIGAI+AGNT +LK SE +P  S+ LA T+P YLD KAIKV+EGG D++  LLQ +WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IFFTGSP + +I+M+ AA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           +GQACI +DY+L+E  FA  LI+ LK  +K F+GEN + S  ++RI N+ HV+R+S +L+
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           DP+V   IV+GG ID+ K ++EPTILL+PP+D++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
           N +PKPLAIYAFT DE LK RI+SETSSGSVTFND M+Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 478
           KYSFD FSH+KA+++ S  ++LE RYPPWN+FKL FIRLA+R  YF+L+LL+LGLK+
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLKR 484

BLAST of Cp4.1LG14g01640 vs. TAIR 10
Match: AT4G34240.1 (aldehyde dehydrogenase 3I1 )

HSP 1 Score: 455.3 bits (1170), Expect = 6.1e-128
Identity = 228/471 (48.41%), Postives = 317/471 (67.30%), Query Frame = 0

Query: 7   ELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLK 66
           ELR +F +GRT+S EWR  QL ++   I +KE  I EAL+QDL K  +E +  E+     
Sbjct: 81  ELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLAEISNTKS 140

Query: 67  SATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAI 126
           S   A+  L  W+AP+     +  FP+  +++ EP G+VL+IS+WNFP  L+++P+IGAI
Sbjct: 141 SCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSVEPVIGAI 200

Query: 127 SAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGS 186
           +AGN  VLKPSE APA SSLLA     YLD+  I+V+EGG   +  LL QKWDKIFFTG 
Sbjct: 201 AAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWDKIFFTGG 260

Query: 187 PSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACI 246
             VA+I+M+ AA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW   SGQACI
Sbjct: 261 ARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWACNSGQACI 320

Query: 247 GIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAG 306
           G+DY++    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  H +R+ +ML +  VA 
Sbjct: 321 GVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESMLKENGVAN 380

Query: 307 CIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKP 366
            IVHGG I + K  I PTILL+ P  + +M EEIFGP+LPIIT+ +IE+  + I ++PKP
Sbjct: 381 KIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQVIRSKPKP 440

Query: 367 LAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDA 426
           LA Y FT ++ L+K+ + + S+G +T NDT++      LPFGGVG+SG G+YHGK+S++ 
Sbjct: 441 LAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYHGKFSYET 500

Query: 427 FSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGLKK 478
           FSH K VL RSF  + + RYPP+   K   ++     + F  +L   G  K
Sbjct: 501 FSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFGFSK 548

BLAST of Cp4.1LG14g01640 vs. TAIR 10
Match: AT1G44170.1 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 449.5 bits (1155), Expect = 3.3e-126
Identity = 230/469 (49.04%), Postives = 316/469 (67.38%), Query Frame = 0

Query: 7   ELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLK 66
           ELR+SF +G TR  EWR  QL  L+    + E  I  AL  DLGK  +E    EV ++  
Sbjct: 18  ELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRN 77

Query: 67  SATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAI 126
           S   AL  L  W+AP+K    L  FPA  E++ EP G+VL+IS+WN+P  L++DP+IGAI
Sbjct: 78  SIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAI 137

Query: 127 SAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGS 186
           SAGN  VLKPSE APA S+LL   L  YLD  A++VVEG    +  LL+QKWDKIF+TGS
Sbjct: 138 SAGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGS 197

Query: 187 PSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACI 246
             + +++M+ AAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG  +GQAC+
Sbjct: 198 SKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACV 257

Query: 247 GIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAG 306
             DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  H +R+S +L++ +V+ 
Sbjct: 258 SPDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSD 317

Query: 307 CIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKP 366
            IV+GG  D++   I PTILL+ P+D+ IM+EEIFGP+LPI+TL+ +EES + I +RPKP
Sbjct: 318 KIVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKP 377

Query: 367 LAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDA 426
           LA Y FT ++ LK+R  +  S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFDA
Sbjct: 378 LAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDA 437

Query: 427 FSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 476
           FSH KAVL RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 438 FSHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of Cp4.1LG14g01640 vs. TAIR 10
Match: AT1G44170.2 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 449.5 bits (1155), Expect = 3.3e-126
Identity = 230/469 (49.04%), Postives = 316/469 (67.38%), Query Frame = 0

Query: 7   ELRQSFRNGRTRSLEWRKEQLISLIHFIQDKENAIFEALFQDLGKHPVEIYRDEVGIVLK 66
           ELR+SF +G TR  EWR  QL  L+    + E  I  AL  DLGK  +E    EV ++  
Sbjct: 18  ELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRN 77

Query: 67  SATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALDPLIGAI 126
           S   AL  L  W+AP+K    L  FPA  E++ EP G+VL+IS+WN+P  L++DP+IGAI
Sbjct: 78  SIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAI 137

Query: 127 SAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDKIFFTGS 186
           SAGN  VLKPSE APA S+LL   L  YLD  A++VVEG    +  LL+QKWDKIF+TGS
Sbjct: 138 SAGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGS 197

Query: 187 PSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPCSGQACI 246
             + +++M+ AAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG  +GQAC+
Sbjct: 198 SKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACV 257

Query: 247 GIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLNDPKVAG 306
             DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  H +R+S +L++ +V+ 
Sbjct: 258 SPDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSD 317

Query: 307 CIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFINARPKP 366
            IV+GG  D++   I PTILL+ P+D+ IM+EEIFGP+LPI+TL+ +EES + I +RPKP
Sbjct: 318 KIVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKP 377

Query: 367 LAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHGKYSFDA 426
           LA Y FT ++ LK+R  +  S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFDA
Sbjct: 378 LAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDA 437

Query: 427 FSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 476
           FSH KAVL RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 438 FSHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of Cp4.1LG14g01640 vs. TAIR 10
Match: AT1G44170.3 (aldehyde dehydrogenase 3H1 )

HSP 1 Score: 415.2 bits (1066), Expect = 7.0e-116
Identity = 206/415 (49.64%), Postives = 287/415 (69.16%), Query Frame = 0

Query: 61  VGIVLKSATNALSCLHKWIAPKKKYVPLLFFPAKGEVLPEPYGLVLIISSWNFPLSLALD 120
           V ++  S   AL  L  W+AP+K    L  FPA  E++ EP G+VL+IS+WN+P  L++D
Sbjct: 9   VSLLRNSIKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSID 68

Query: 121 PLIGAISAGNTAVLKPSEYAPACSSLLAATLPLYLDDKAIKVVEGGADISEQLLQQKWDK 180
           P+IGAISAGN  VLKPSE APA S+LL   L  YLD  A++VVEG    +  LL+QKWDK
Sbjct: 69  PVIGAISAGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDK 128

Query: 181 IFFTGSPSVAKIVMSEAAKHLTPVTLELGGKCPAIFDYSSVLSNMKVAAKRIVGGKWGPC 240
           IF+TGS  + +++M+ AAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG  
Sbjct: 129 IFYTGSSKIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCN 188

Query: 241 SGQACIGIDYLLVEDKFASELIESLKRVLKKFYGENSRNSTSIARIVNEKHVERISNMLN 300
           +GQAC+  DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  H +R+S +L+
Sbjct: 189 NGQACVSPDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLD 248

Query: 301 DPKVAGCIVHGGLIDKQKCFIEPTILLNPPIDADIMTEEIFGPVLPIITLDRIEESIEFI 360
           + +V+  IV+GG  D++   I PTILL+ P+D+ IM+EEIFGP+LPI+TL+ +EES + I
Sbjct: 249 EKEVSDKIVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVI 308

Query: 361 NARPKPLAIYAFTEDETLKKRIISETSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420
            +RPKPLA Y FT ++ LK+R  +  S+G +  ND  V     +LPFGGVG+SG G+YHG
Sbjct: 309 RSRPKPLAAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHG 368

Query: 421 KYSFDAFSHDKAVLQRSFLIELEPRYPPWNDFKLKFIRLAYRFDYFRLVLLLLGL 476
           K+SFDAFSH KAVL RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 369 KFSFDAFSHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q70E961.6e-18161.22Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana OX=3702 GN=ALD... [more]
Q8W0338.5e-12748.41Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana... [more]
Q70DU84.7e-12549.04Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana OX=3702 GN=ALD... [more]
Q8VXQ21.7e-12247.97Aldehyde dehydrogenase OS=Craterostigma plantagineum OX=4153 GN=ALDH PE=1 SV=1[more]
Q2FWX93.5e-10444.124,4'-diaponeurosporen-aldehyde dehydrogenase OS=Staphylococcus aureus (strain NC... [more]
Match NameE-valueIdentityDescription
XP_023552079.10.0100.00aldehyde dehydrogenase family 3 member F1 [Cucurbita pepo subsp. pepo][more]
KAG6577244.10.098.53Aldehyde dehydrogenase family 3 member F1, partial [Cucurbita argyrosperma subsp... [more]
XP_022929292.10.098.11aldehyde dehydrogenase family 3 member F1 [Cucurbita moschata] >XP_022929293.1 a... [more]
XP_022984451.10.098.11aldehyde dehydrogenase family 3 member F1 [Cucurbita maxima] >XP_022984452.1 ald... [more]
KAG7015334.10.096.89Aldehyde dehydrogenase family 3 member F1, partial [Cucurbita argyrosperma subsp... [more]
Match NameE-valueIdentityDescription
A0A6J1ERN80.098.11Aldehyde dehydrogenase OS=Cucurbita moschata OX=3662 GN=LOC111435918 PE=3 SV=1[more]
A0A6J1J2800.098.11Aldehyde dehydrogenase OS=Cucurbita maxima OX=3661 GN=LOC111482755 PE=3 SV=1[more]
A0A5D3CZI15.11e-30286.16Aldehyde dehydrogenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2... [more]
A0A1S3BWE55.11e-30286.16Aldehyde dehydrogenase OS=Cucumis melo OX=3656 GN=LOC103494364 PE=3 SV=1[more]
A0A6J1C3I93.07e-30185.56Aldehyde dehydrogenase OS=Momordica charantia OX=3673 GN=LOC111008156 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G36250.11.1e-18261.22aldehyde dehydrogenase 3F1 [more]
AT4G34240.16.1e-12848.41aldehyde dehydrogenase 3I1 [more]
AT1G44170.13.3e-12649.04aldehyde dehydrogenase 3H1 [more]
AT1G44170.23.3e-12649.04aldehyde dehydrogenase 3H1 [more]
AT1G44170.37.0e-11649.64aldehyde dehydrogenase 3H1 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR016162Aldehyde dehydrogenase, N-terminalGENE3D3.40.605.10Aldehyde Dehydrogenase; Chain A, domain 1coord: 9..431
e-value: 6.5E-153
score: 511.4
IPR015590Aldehyde dehydrogenase domainPFAMPF00171Aldedhcoord: 9..433
e-value: 2.4E-77
score: 260.4
IPR012394Aldehyde dehydrogenase NAD(P)-dependentPIRSFPIRSF036492ALDHcoord: 1..470
e-value: 3.3E-172
score: 571.5
IPR012394Aldehyde dehydrogenase NAD(P)-dependentPANTHERPTHR43570ALDEHYDE DEHYDROGENASEcoord: 4..477
IPR016163Aldehyde dehydrogenase, C-terminalGENE3D3.40.309.10Aldehyde Dehydrogenase; Chain A, domain 2coord: 210..413
e-value: 6.5E-153
score: 511.4
NoneNo IPR availablePANTHERPTHR43570:SF17ALDEHYDE DEHYDROGENASE FAMILY 3 MEMBER F1coord: 4..477
IPR016161Aldehyde/histidinol dehydrogenaseSUPERFAMILY53720ALDH-likecoord: 2..449

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g01640.1Cp4.1LG14g01640.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006081 cellular aldehyde metabolic process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004029 aldehyde dehydrogenase (NAD+) activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor