CSPI04G05130.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI04G05130.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Aldehyde dehydrogenase
Location: Chr4: 3460754..3463587 (-)
Sequence length: 1434
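
The coordinates and length in the summary can be cross-checked with a little arithmetic. The sketch below assumes that the location is an inclusive coordinate range and that the 1434 nt length refers to the spliced coding sequence ending in a stop codon; the variable names are illustrative only and do not come from the database.

```python
# Values copied from the feature summary above.
START, END = 3460754, 3463587   # Chr4 coordinates, assumed inclusive
CDS_LENGTH = 1434               # "Sequence length" field, assumed to be the CDS

genomic_span = END - START + 1             # length of the locus on the chromosome
codons, remainder = divmod(CDS_LENGTH, 3)  # a complete CDS is a multiple of 3
protein_length = codons - 1                # assumes the final codon is a stop

print(genomic_span, codons, remainder, protein_length)
# 2834 478 0 477
```

Under these assumptions the predicted protein has 477 residues, which agrees with the 477-residue alignment lengths reported in the BLAST hits on this page.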
The following sequences are available for this feature:

Gene sequence (with intron)

GTGTTTTTAGCATCGTAATCATTGATTAATAGTTAAACGTGGCTTTCATAAACGAAGGGAGGACAAGGAACATTAAATTCAATGCCTTAAACCATTTTCCATAAAATCTGAAAATCTGTCCACTTGGGGAAAGTAATGACATGACTTCCAAAATCTTAGAACAAACATAAAATGCAAAAGCAAGAGAAAAGAAGAAACCCCACACCATAAAAATCAGATCAAACACAGCTTCTTGACAGAGCCTTCTTTGTGGTGGGGTTTAGGAAATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTAAAATTTGGAAAATTAGATTATAGTTTTTCTTTGTATGGAAAGTGGAGCTAATTTTGCTAACGCTTTTAAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGGTTGGGTTTTTAAAGTTTTCCTTCTGTTTCTTTAGTTTCTTGTTCTTCTTCTTTCGTATATGCTTGTTTTGTTGTGTTGTAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTGTGAGTATTTTCCTTCTTTAGCTCAAGTAATTTTTTAACCAAAACGGACCCATCATCTGTTTATTTATTTTATTTTGAGATTTTGAGAGTTATACAACATTGGTGATTGATTTTTCCATTAAAAATTGAAGAATTCTGATGCTTCTTCTACTTATCTTTGCAGCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGTATGAACTTGGCTCGTTTAACAAGGGCGATCCTTCAGTTTGTTCGTGCTCTACGATTTGATCGTTACGAGTGATTGTTTCAGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTTGTTTAACTTGGTAGCTGAAATGTAATGTCGTTAATGGATCCAAGTTCGTTTTGTTATTGTTTCTCATTTTGAATTCGTAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGGTAAAAACAAGTTTCGCTTCGAACTATTTACTTATTTTCTTGCTTATCTCTGTGATCTTCCTCTTTCAGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTGTAAGCTAACTTCTCCATGAGGAAGTTTATGTTTCTGATTGTTTCTATTGAATGCTTTGCATTGATCTTGTAGCTTCATTGAACCAACAATATTGTTGAATCCTCCACTCGACGCGGATATCATGACCGAAGAAATCTTCGGTCCCCTGTTACCGATAATCACAGTAAGAATTACCAAATAACCTTTCAATAATTTCACACAACCATACTGATCACTAGTGTTAT
TGAAACAGTTGAACAAAATTGAAGAGAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTATACGCCTTCACGGGAGACGAAACTCTCAAGAAACGAATTTTGTACGAAACATCATCAGGAAGTGTCACATTCAATGATACCATGGTTCAGGTATGAGCTTGAATTCCTTGAATCTGCATCTGACATCCTGAATGATACAGCACATGTCTTGCTATCACTGCTTGTACTTTCTCCCTAACAATAAAACTGGTTTCTGTTTATGTGTGAACGTAAGTAACACACTGTTAATAAATTATTACTATTTATTACTTTTCATGTATCAAGTGGAATTTGTTTTTGTCTTGCAGTTTGTGTGTGATTCGTTACCGTTCGGCGGTGTTGGTCAGAGCGGTTCCGGGAGTTACCACGGCAAGTATTCATTTGATACATTCAGCCATGAAAAGGCAGTGATGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCACCATGGAATGATTTCAAGCTAAAGTTCATTAGATTGGCATACCGATACGACTACTTTGGGCTGGCACTGCTGCTTTTGGGGTTAAAGAAGTAGAAACTATAGCAGTTCAAAAATCAAAGAGCTAAGTGATCCTTGAGATAATAAACTGCTTTCCAAATCTCTTATTATGATCTCCTCATAAACTTTGTATCAAATCATGAACAAATAATATACCAGTTGTGCTTTTTGAAGGATTATTATATATATGTGGAGATTTATAATGATCCCATTTTAAATATTAGATTTTCAAACGTGTTTGCTCTTGCTGAGTGATTATAGAAAAGTTAATGGGAATA

mRNA sequence

ATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTCTTCATTGAACCAACAATATTGTTGAATCCTCCACTCGACGCGGATATCATGACCGAAGAAATCTTCGGTCCCCTGTTACCGATAATCACATTGAACAAAATTGAAGAGAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTATACGCCTTCACGGGAGACGAAACTCTCAAGAAACGAATTTTGTACGAAACATCATCAGGAAGTGTCACATTCAATGATACCATGGTTCAGTTTGTGTGTGATTCGTTACCGTTCGGCGGTGTTGGTCAGAGCGGTTCCGGGAGTTACCACGGCAAGTATTCATTTGATACATTCAGCCATGAAAAGGCAGTGATGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCACCATGGAATGATTTCAAGCTAAAGTTCATTAGATTGGCATACCGATACGACTACTTTGGGCTGGCACTGCTGCTTTTGGGGTTAAAGAAGTAG

Coding sequence (CDS)

ATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTCTTCATTGAACCAACAATATTGTTGAATCCTCCACTCGACGCGGATATCATGACCGAAGAAATCTTCGGTCCCCTGTTACCGATAATCACATTGAACAAAATTGAAGAGAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTATACGCCTTCACGGGAGACGAAACTCTCAAGAAACGAATTTTGTACGAAACATCATCAGGAAGTGTCACATTCAATGATACCATGGTTCAGTTTGTGTGTGATTCGTTACCGTTCGGCGGTGTTGGTCAGAGCGGTTCCGGGAGTTACCACGGCAAGTATTCATTTGATACATTCAGCCATGAAAAGGCAGTGATGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCACCATGGAATGATTTCAAGCTAAAGTTCATTAGATTGGCATACCGATACGACTACTTTGGGCTGGCACTGCTGCTTTTGGGGTTAAAGAAGTAG
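
As a quick sanity check, the start of the coding sequence can be translated and compared with the query line of the BLAST alignments. This is a minimal sketch using the standard genetic code; the `translate` helper and the `CDS_START` constant are hypothetical names introduced here, and only the first 60 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt (20 codons) of the CDS listed above.
CDS_START = "ATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTAT"
print(translate(CDS_START))
# MEASLEVLRESFKNGRTRSY
```

The translated residues match the beginning of the query sequence shown in the alignments (MEASLEVLRESFKNGRTRSY...).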
BLAST of CSPI04G05130.1 vs. Swiss-Prot
Match: AL3F1_ARATH (Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana GN=ALDH3F1 PE=2 SV=2)

HSP 1 Score: 639.8 bits (1649), Expect = 2.4e-182
Identity = 301/477 (63.10%), Positives = 382/477 (80.08%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E SL  +RE+F +GRTRS +WR  Q+ ++ + + D E+ I  AL+QDLGKH  E FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+VL++A  A++ L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SLSLD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAI+AGNT +LK SE +P  S+FL  T+P YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+ +AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N  +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP+V ASIV+GGSID++KL++EPTILL+PPLD++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N +PKPLA+YAFT DE LK RIL ETSSGSVTFND M+Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFD FSHEKA+M+ S  ++LE RYPPWN+FKL FIRLA+R  YF L LL+LGLK+
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLKR 484

BLAST of CSPI04G05130.1 vs. Swiss-Prot
Match: AL3I1_ARATH (Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana GN=ALDH3I1 PE=1 SV=2)

HSP 1 Score: 470.7 bits (1210), Expect = 1.9e-131
Identity = 238/478 (49.79%), Positives = 326/478 (68.20%), Query Frame = 1

Query: 2   EASLEV--LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRD 61
           EA+L V  LR +F +GRT+SYEWRI QL ++ + I +KE  I EALYQDL K  +E F  
Sbjct: 74  EAALLVDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLA 133

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+     S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  LS+
Sbjct: 134 EISNTKSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSV 193

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           +P+IGAI+AGN  VLKPSE AP  SS L      YLD+  I+V+EGG   +  LL  KWD
Sbjct: 194 EPVIGAIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWD 253

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIFFTG  RVARI+ +AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW  
Sbjct: 254 KIFFTGGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWAC 313

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQACIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++L
Sbjct: 314 NSGQACIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESML 373

Query: 302 KDPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 361
           K+  VA  IVHGG I ++KL I PTILL+ P  + +M EEIFGPLLPIIT+ KIE+  + 
Sbjct: 374 KENGVANKIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQV 433

Query: 362 INARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYH 421
           I ++PKPLA Y FT ++ L+K+ + + S+G +T NDT++      LPFGGVG+SG G+YH
Sbjct: 434 IRSKPKPLAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYH 493

Query: 422 GKYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           GK+S++TFSH+K V+ RSF  + + RYPP+   K   ++     + F   L   G  K
Sbjct: 494 GKFSYETFSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFGFSK 548

BLAST of CSPI04G05130.1 vs. Swiss-Prot
Match: AL3H1_ARATH (Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana GN=ALDH3H1 PE=1 SV=2)

HSP 1 Score: 459.9 bits (1182), Expect = 3.4e-128
Identity = 234/468 (50.00%), Positives = 313/468 (66.88%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR SF +G TR YEWR+ QL  L+    + E  I  AL  DLGK  +E    EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  LS+DP+IGAIS
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  VLKPSE AP  S+ L   L  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++ +AAAKHLTPV LELGGK P + D  +   ++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSDT---DLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPL 367
           IV+GG  D+E L I PTILL+ PLD+ IM+EEIFGPLLPI+TLN +EES + I +RPKPL
Sbjct: 319 IVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPL 378

Query: 368 ALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHGKYSFDTF 427
           A Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFD F
Sbjct: 379 AAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAF 438

Query: 428 SHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGL 476
           SH+KAV+ RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 439 SHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of CSPI04G05130.1 vs. Swiss-Prot
Match: ALDH_CRAPL (Aldehyde dehydrogenase OS=Craterostigma plantagineum GN=ALDH PE=1 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 5.9e-125
Identity = 226/475 (47.58%), Positives = 320/475 (67.37%), Query Frame = 1

Query: 2   EASLEVLRESFKNGRTRSYEWRIKQLSSLIQFI--HDKENTIFEALYQDLGKHPVEIFRD 61
           E  ++ LR ++ +G+T+SYEWR+ QL +L++    HDKE  + EAL  DL K   E +  
Sbjct: 7   EGVVDGLRRTYISGKTKSYEWRVSQLKALLKITTHHDKE--VVEALRADLKKPEHEAYVH 66

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+ +V  +  +AL  LH+WM P+K    L  +P+  E++SEP G+VL+I++WN+P  L+L
Sbjct: 67  EIFMVSNACKSALKELHQWMKPQKVKTSLATYPSSAEIVSEPLGVVLVITAWNYPFLLAL 126

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           DP+IGAI+AGN  VLKPSE AP  S+ L   L  Y+D  AI+VVEG     + LL  +WD
Sbjct: 127 DPMIGAIAAGNCVVLKPSEIAPATSALLAKLLNQYVDTSAIRVVEGAVPEMQALLDQRWD 186

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIF+TGS +V +IV S+AAKHLTPV LELGGKCP + D    + ++KVAA+RI+  KW  
Sbjct: 187 KIFYTGSSKVGQIVLSSAAKHLTPVVLELGGKCPTVVD---ANIDLKVAARRIISWKWSG 246

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQ CI  DY++  ++ A +L++++K  L+ FYG++   S  ++ I+N++  ER++ LL
Sbjct: 247 NSGQTCISPDYIITTEENAPKLVDAIKCELESFYGKDPLKSQDMSSIINERQFERMTGLL 306

Query: 302 KDPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 361
            D KV+  IV+GG  DK  L I PTILL+   D+ +M+EEIFGPLLPIIT+ KIEE  + 
Sbjct: 307 DDKKVSDKIVYGGQSDKSNLKIAPTILLDVSEDSSVMSEEIFGPLLPIITVGKIEECYKI 366

Query: 362 INARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYH 421
           I ++PKPLA Y FT D+   +  +   S+G +T ND  + F+   LPFGGVG+SG GSYH
Sbjct: 367 IASKPKPLAAYLFTNDKKRTEEFVSNVSAGGITINDIALHFLEPRLPFGGVGESGMGSYH 426

Query: 422 GKYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLG 475
           GK+SFD FSH+K+V++RSF  E+  RYPP+  +KL F+    + D FGL    LG
Sbjct: 427 GKFSFDAFSHKKSVLKRSFGGEVAARYPPYAPWKLHFMEAILQGDIFGLLKAWLG 476

BLAST of CSPI04G05130.1 vs. Swiss-Prot
Match: ALDH3_DICDI (Aldehyde dehydrogenase family 3 comG OS=Dictyostelium discoideum GN=comG PE=3 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 6.4e-103
Identity = 198/450 (44.00%), Positives = 285/450 (63.33%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR+ F + +TR  +WR  QL ++ + + + ++ I  A+ +DLGKH  EI + E+ ++   
Sbjct: 18  LRKVFLSQKTRKIDWRYSQLKAIKKMMSENKDNITAAVKKDLGKHEFEIHQTEIVMIQTE 77

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
            +  +S L  W   +K   PL F PA   +L EP G+VLI+S WN+P++L+L PLIGAI+
Sbjct: 78  LDETISHLESWNKTEKVYSPLHFKPASSYILKEPLGVVLIMSPWNYPVNLALIPLIGAIA 137

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKV-VEGGADVSEQLLQYKWDKIFFTGS 187
            GN A+LK S ++   S  L   L  YLD +  +   EGGA    +LL+YKWD IFFTGS
Sbjct: 138 GGNCALLKLSRHSYNISKLLHGLLTKYLDPECFEFDCEGGAPYITELLEYKWDHIFFTGS 197

Query: 188 PRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC--AGQA 247
            +V +IV  AAAK LTPVTLELGGK P I D  +   ++K+ A+R++   WG C  AGQ 
Sbjct: 198 VKVGKIVYQAAAKFLTPVTLELGGKNPCIVDKDT---DIKLTARRLI---WGKCWNAGQT 257

Query: 248 CIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKV 307
           CIG+DY++V       LIE  K +LK+F+GE+ K STS ARI++    ER+  L    KV
Sbjct: 258 CIGLDYLIVHKSILEPLIEEFKVVLKEFFGEDIKKSTSFARIISSAAAERLQQLFSMGKV 317

Query: 308 AASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFINARP 367
               V GG  D  + +I PT++++P LD+ +M +EIFGP+LPI+T   I+E +EFI  RP
Sbjct: 318 ----VIGGEADIAERYIAPTVIVDPDLDSPLMQDEIFGPVLPIVTYENIDECLEFIQNRP 377

Query: 368 KPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHGKYSF 427
             L LY F+ D+ ++ ++L  T SGS+  NDT++ F   +LPFGG+G SG GSYHGK +F
Sbjct: 378 HALTLYLFSRDQAIQDKVLDGTQSGSLMINDTLLHFTNPNLPFGGIGDSGIGSYHGKGTF 437

Query: 428 DTFSHEKAVMQRSF--LIELEPRYPPWNDF 453
           D F H++ ++Q +    ++L  RYPP+  F
Sbjct: 438 DIFVHKRGLVQSTTKKFLDLPLRYPPYTPF 457

BLAST of CSPI04G05130.1 vs. TrEMBL
Match: A0A0A0KZF5_CUCSA (Aldehyde dehydrogenase OS=Cucumis sativus GN=Csa_4G043870 PE=3 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 1.1e-271
Identity = 474/477 (99.37%), Positives = 476/477 (99.79%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSANNALSSLHKWMAPKKKP+PLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPPL ADIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSMDKEKLFIEPTILLNPPLYADIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG
Sbjct: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 477

BLAST of CSPI04G05130.1 vs. TrEMBL
Match: A0A067KCB6_JATCU (Aldehyde dehydrogenase OS=Jatropha curcas GN=JCGZ_13661 PE=3 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 5.8e-204
Identity = 342/477 (71.70%), Positives = 410/477 (85.95%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +EASLE LR++F++G+TR+ EWR  QL +LIQF +D E  IF+AL QDLGKHPVE +RDE
Sbjct: 6   IEASLEELRKTFRSGKTRTVEWRKTQLRALIQFFNDNEENIFQALNQDLGKHPVESYRDE 65

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VG+VLKSANN+LS + KWMAPKK  +PLL FPA G+V+ EPFG+VLI  SWNFP++++LD
Sbjct: 66  VGVVLKSANNSLSCIEKWMAPKKSHIPLLMFPASGQVIPEPFGVVLIFGSWNFPITMALD 125

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNT +LKPS+ +P  SSFL  TLP YLD +AIKV+EGG +V EQ+LQ KWDK
Sbjct: 126 PLIGAISAGNTVLLKPSDLSPKCSSFLANTLPKYLDSEAIKVIEGGINVCEQILQQKWDK 185

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGS RV R++ + AAKHLTPVTLELGGKCP + D ++V S+MK+ AKRIVGGKWGPC
Sbjct: 186 IFFTGSQRVGRVIMTEAAKHLTPVTLELGGKCPLVLDTATVSSDMKIVAKRIVGGKWGPC 245

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACI +DYVLVE+KFAS LI+SL RI++KFYGEN+K S S++RI N K  +R+S+++K
Sbjct: 246 SGQACISVDYVLVEEKFASYLIDSLSRIIRKFYGENTKESKSLSRIANIKAFDRLSSVIK 305

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP VAASIVHGGS D+EKLFIEPTILLNPPLD++IMTEEIFGPLLPIIT+N I+ESI+FI
Sbjct: 306 DPLVAASIVHGGSTDEEKLFIEPTILLNPPLDSEIMTEEIFGPLLPIITVNNIQESIQFI 365

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           ++RPKPL +YAFT DET K++IL +TSSGSV FNDTMVQFVCD LPFGGVG SG G YHG
Sbjct: 366 SSRPKPLVIYAFTKDETFKRQILTQTSSGSVVFNDTMVQFVCDELPFGGVGHSGFGRYHG 425

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAVMQR F  ELEPRYPPWN+FKL+FI+LAY ++Y GL LLLLGLKK
Sbjct: 426 KYSFDTFSHEKAVMQRGFFPELEPRYPPWNNFKLEFIKLAYSFNYLGLLLLLLGLKK 482

BLAST of CSPI04G05130.1 vs. TrEMBL
Match: D7SP43_VITVI (Aldehyde dehydrogenase OS=Vitis vinifera GN=VIT_04s0023g02810 PE=3 SV=1)

HSP 1 Score: 716.1 bits (1847), Expect = 2.9e-203
Identity = 348/477 (72.96%), Positives = 401/477 (84.07%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 17  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 76

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 77  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 136

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 137 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 196

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 197 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 256

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 257 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 316

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           +P VAASIVHGG ID+EKLFIEPTILL+PPLDA+IMTEEIFGPLLPIITL  IEESIEFI
Sbjct: 317 EPLVAASIVHGGLIDEEKLFIEPTILLDPPLDAEIMTEEIFGPLLPIITLKNIEESIEFI 376

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N+RPKPLALYAFT DE  K+RIL ETSSGSVTFND ++QFVCD+LPFGGVGQSG G YHG
Sbjct: 377 NSRPKPLALYAFTNDEAFKRRILSETSSGSVTFNDIIIQFVCDTLPFGGVGQSGFGRYHG 436

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAV++RSF +ELEPR+PPWNDFKLKFIRL Y +DY GL LLLLGLK+
Sbjct: 437 KYSFDTFSHEKAVLRRSFFLELEPRFPPWNDFKLKFIRLVYSFDYLGLILLLLGLKR 493

BLAST of CSPI04G05130.1 vs. TrEMBL
Match: A0A061DFH0_THECC (Aldehyde dehydrogenase OS=Theobroma cacao GN=TCM_000268 PE=3 SV=1)

HSP 1 Score: 701.8 bits (1810), Expect = 5.6e-199
Identity = 339/477 (71.07%), Positives = 401/477 (84.07%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           ME S+  LRE+FK+GRTRS  WR  QL ++I  I++ E TI++ L+QDLGK P E +RDE
Sbjct: 1   MEGSIAGLRETFKSGRTRSVAWRKNQLKAVIDLINENEQTIYKVLHQDLGKDPAESYRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G++LKSAN ALS L KW+APKK  +PL+FFPAKGEVL EP G+VLI SSWNFP++L+LD
Sbjct: 61  MGVILKSANYALSCLDKWVAPKKAELPLVFFPAKGEVLPEPVGVVLIFSSWNFPITLALD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGN  VLKPSE AP  SSF + T+PLYLD+KA+KV+ GGADV E+LL+ KWDK
Sbjct: 121 PLIGAISAGNAVVLKPSELAPACSSFFIETIPLYLDNKAVKVIGGGADVGERLLELKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V R+V +AAA+HLTPVTLELGGKCPA+ D  S HS  KV AKRI GGKWG C
Sbjct: 181 IFFTGSPQVGRLVMTAAARHLTPVTLELGGKCPAVVDAFSSHSKTKVIAKRIAGGKWGLC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACI +DY+LVE+KFAS LIE LK+ +K+F+G N  +   ++RIVN  + ERI +LLK
Sbjct: 241 SGQACIAVDYLLVEEKFASTLIELLKKNIKRFFGGNLGDLKCVSRIVNKHHFERIYHLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP VA+SIVHGGS+D+E+L IEPTILL+PPLD++IMTEEIFGPLLPIITL  IEESI+FI
Sbjct: 301 DPHVASSIVHGGSVDEERLVIEPTILLDPPLDSEIMTEEIFGPLLPIITLKNIEESIDFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N+RPKPL +YAFT D T KKRIL ETSSG+VTFND MVQFVCDSLPFGG GQSG G YHG
Sbjct: 361 NSRPKPLVIYAFTEDGTFKKRILSETSSGTVTFNDVMVQFVCDSLPFGGAGQSGFGRYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAV+ R+F  ELEPRYPPWNDFKL+FI+LAYR+DYFGL LLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVLHRAFFPELEPRYPPWNDFKLRFIKLAYRFDYFGLILLLLGLKK 477

BLAST of CSPI04G05130.1 vs. TrEMBL
Match: M5VPN0_PRUPE (Aldehyde dehydrogenase OS=Prunus persica GN=PRUPE_ppa004971mg PE=3 SV=1)

HSP 1 Score: 699.9 bits (1805), Expect = 2.1e-198
Identity = 333/477 (69.81%), Positives = 400/477 (83.86%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E +L  LR++FK+GRTRS  WR  Q+S+L+Q IHD+E+ IF+ALY+DLGKHPVE++RDE
Sbjct: 7   VEETLSELRQTFKSGRTRSVAWRKNQVSALLQLIHDQEDEIFKALYEDLGKHPVEVYRDE 66

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +GIV K+ N  LS+L KW+APKK  +PLL FP  GEVL EP G+VLI +SWNFP++L LD
Sbjct: 67  IGIVKKTINYTLSNLEKWVAPKKSRLPLLLFPTSGEVLPEPLGVVLIFASWNFPIALGLD 126

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGNT VLKPSE AP  SSFL  T+P Y+D KA++V+EGGA++SE LLQ KWDK
Sbjct: 127 PVIGAISAGNTVVLKPSEQAPACSSFLANTIPQYMDSKAVRVIEGGAEISELLLQQKWDK 186

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIV SAAAK+LTPVTLELGGKCP I D  S  S++KVA KRIVGGKWGPC
Sbjct: 187 IFFTGSPQVGRIVMSAAAKNLTPVTLELGGKCPTILDSFSNPSDLKVAIKRIVGGKWGPC 246

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DY+L+E+K AS LIE LK+ +K+FY ++ K+S  IAR++N  + ER+ NLLK
Sbjct: 247 NGQACIGVDYILIEEKLASTLIELLKKTVKRFYSDSPKDSKCIARVINRGHFERLRNLLK 306

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP VAASIVHGGS+D+E LFIEPTILL+PPLDA IMTEEIFGPLLPIITL  I+ESIEFI
Sbjct: 307 DPLVAASIVHGGSLDEENLFIEPTILLDPPLDAAIMTEEIFGPLLPIITLKSIQESIEFI 366

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N+RPKPLA+YAFT DE  ++RIL ETSSGSV FND ++QF+CD+LPFGGVGQSG G YHG
Sbjct: 367 NSRPKPLAIYAFTKDEEFRQRILLETSSGSVIFNDVLIQFICDALPFGGVGQSGFGRYHG 426

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEK VM+ +F++ELEPRYPPWNDFK+ F RLAY  DY GL LL LGLK+
Sbjct: 427 KYSFDTFSHEKVVMRGNFIVELEPRYPPWNDFKMNFFRLAYNLDYLGLLLLFLGLKR 483

BLAST of CSPI04G05130.1 vs. TAIR10
Match: AT4G36250.1 (AT4G36250.1 aldehyde dehydrogenase 3F1)

HSP 1 Score: 639.8 bits (1649), Expect = 1.3e-183
Identity = 301/477 (63.10%), Positives = 382/477 (80.08%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E SL  +RE+F +GRTRS +WR  Q+ ++ + + D E+ I  AL+QDLGKH  E FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+VL++A  A++ L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SLSLD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAI+AGNT +LK SE +P  S+FL  T+P YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+ +AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N  +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP+V ASIV+GGSID++KL++EPTILL+PPLD++IM EEIFGP+LPIIT+  I+ESI  I
Sbjct: 308 DPRVQASIVYGGSIDEDKLYVEPTILLDPPLDSEIMNEEIFGPILPIITVRDIQESIGII 367

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N +PKPLA+YAFT DE LK RIL ETSSGSVTFND M+Q++CD+LPFGGVG+SG G YHG
Sbjct: 368 NTKPKPLAIYAFTNDENLKTRILSETSSGSVTFNDVMIQYMCDALPFGGVGESGIGRYHG 427

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFD FSHEKA+M+ S  ++LE RYPPWN+FKL FIRLA+R  YF L LL+LGLK+
Sbjct: 428 KYSFDCFSHEKAIMEGSLGMDLEARYPPWNNFKLTFIRLAFREAYFKLILLMLGLKR 484

BLAST of CSPI04G05130.1 vs. TAIR10
Match: AT4G34240.1 (AT4G34240.1 aldehyde dehydrogenase 3I1)

HSP 1 Score: 470.7 bits (1210), Expect = 1.1e-132
Identity = 238/478 (49.79%), Positives = 326/478 (68.20%), Query Frame = 1

Query: 2   EASLEV--LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRD 61
           EA+L V  LR +F +GRT+SYEWRI QL ++ + I +KE  I EALYQDL K  +E F  
Sbjct: 74  EAALLVDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLA 133

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+     S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  LS+
Sbjct: 134 EISNTKSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSV 193

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           +P+IGAI+AGN  VLKPSE AP  SS L      YLD+  I+V+EGG   +  LL  KWD
Sbjct: 194 EPVIGAIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWD 253

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIFFTG  RVARI+ +AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW  
Sbjct: 254 KIFFTGGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWAC 313

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQACIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++L
Sbjct: 314 NSGQACIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESML 373

Query: 302 KDPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEF 361
           K+  VA  IVHGG I ++KL I PTILL+ P  + +M EEIFGPLLPIIT+ KIE+  + 
Sbjct: 374 KENGVANKIVHGGRITEDKLKISPTILLDVPEASSMMQEEIFGPLLPIITVQKIEDGFQV 433

Query: 362 INARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYH 421
           I ++PKPLA Y FT ++ L+K+ + + S+G +T NDT++      LPFGGVG+SG G+YH
Sbjct: 434 IRSKPKPLAAYLFTNNKELEKQFVQDVSAGGITINDTVLHVTVKDLPFGGVGESGIGAYH 493

Query: 422 GKYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           GK+S++TFSH+K V+ RSF  + + RYPP+   K   ++     + F   L   G  K
Sbjct: 494 GKFSYETFSHKKGVLYRSFSGDADLRYPPYTPKKKMVLKALLSSNIFAAILAFFGFSK 548

BLAST of CSPI04G05130.1 vs. TAIR10
Match: AT1G44170.1 (AT1G44170.1 aldehyde dehydrogenase 3H1)

HSP 1 Score: 459.9 bits (1182), Expect = 1.9e-129
Identity = 234/468 (50.00%), Positives = 313/468 (66.88%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR SF +G TR YEWR+ QL  L+    + E  I  AL  DLGK  +E    EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  LS+DP+IGAIS
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  VLKPSE AP  S+ L   L  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++ +AAAKHLTPV LELGGK P + D  +   ++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSDT---DLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPL 367
           IV+GG  D+E L I PTILL+ PLD+ IM+EEIFGPLLPI+TLN +EES + I +RPKPL
Sbjct: 319 IVYGGEKDRENLKIAPTILLDVPLDSLIMSEEIFGPLLPILTLNNLEESFDVIRSRPKPL 378

Query: 368 ALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHGKYSFDTF 427
           A Y FT ++ LK+R     S+G +  ND  V     +LPFGGVG+SG G+YHGK+SFD F
Sbjct: 379 AAYLFTHNKKLKERFAATVSAGGIVVNDIAVHLALHTLPFGGVGESGMGAYHGKFSFDAF 438

Query: 428 SHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGL 476
           SH+KAV+ RS   +   RYPP++  KL+ ++     + F L  +LLGL
Sbjct: 439 SHKKAVLYRSLFGDSAVRYPPYSRGKLRLLKALVDSNIFDLFKVLLGL 483

BLAST of CSPI04G05130.1 vs. TAIR10
Match: AT3G66658.2 (AT3G66658.2 aldehyde dehydrogenase 22A1)

HSP 1 Score: 159.8 bits (403), Expect = 4.0e-39
Identity = 124/461 (26.90%), Positives = 205/461 (44.47%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E  + + R++ K     S++ R + L  L+++I + +  I E   +D GK  V+    E
Sbjct: 88  VEERVTLSRKAQKTWAQSSFKLRRQFLRILLKYIIEHQELICEVSSRDTGKTMVDASLGE 147

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +    +     LS   +W+ P+ +            V   P G++  I  WN+P     +
Sbjct: 148 IMTTCEKITWLLSEGERWLKPESRSSGRAMLHKVSRVEFHPLGVIGAIVPWNYPFHNIFN 207

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYL-----DDKAIKVVEGGADVSEQLLQ 180
           P++ A+ +GN  V+K SE+A     F    +   L      +  + V+ G A+  E L+ 
Sbjct: 208 PMLAAVFSGNGIVIKVSEHASWSGCFYFRIIQAALAAVGAPENLVDVITGFAETGEALVS 267

Query: 181 YKWDKIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIF-DYSSVHSNMKVAAKRIVG 240
              DK+ F GS  V +++   AA+ LTPVTLELGGK   I  + + V    +VA +  + 
Sbjct: 268 -SVDKMIFVGSTAVGKMIMRNAAETLTPVTLELGGKDAFIICEDADVSHVAQVAVRGTLQ 327

Query: 241 GKWGPCAGQACIGIDYVLVEDKFASELIESLKRILKKFY-GENSKNSTSIARIVNDKNVE 300
                 +GQ C G +   V     +  I  + +I+K    G        +  I   ++ E
Sbjct: 328 S-----SGQNCAGAERFYVHKDIYTAFIGQVTKIVKSVSAGPPLTGRYDMGAICLQEHSE 387

Query: 301 RISNLLKDP-------KVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLP 360
            + +L+ D         V  S  H G  D    +  PT+L+N   +  IM EE FGP++P
Sbjct: 388 HLQSLVNDALDKGAEIAVRGSFGHLGE-DAVDQYFPPTVLINVNHNMKIMKEEAFGPIMP 447

Query: 361 IITLNKIEESIEFINARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLP 420
           I+  +  EE I+  N     L    F+G +   K+I  +   G    ND    ++C SLP
Sbjct: 448 IMQFSTDEEVIKLANDSRYALGCAVFSGSKHRAKQIASQIQCGVAAINDFASNYMCQSLP 507

Query: 421 FGGVGQSGSGSYHGKYSFDTFSHEKAVMQRSFLIELEPRYP 448
           FGGV  SG G + G          K+V++  F   ++ + P
Sbjct: 508 FGGVKDSGFGRFAGIEGLRACCLVKSVVEDRFWPLIKTKIP 541

BLAST of CSPI04G05130.1 vs. TAIR10
Match: AT1G79440.1 (AT1G79440.1 aldehyde dehydrogenase 5F1)

HSP 1 Score: 146.4 bits (368), Expect = 4.6e-35
Identity = 111/345 (32.17%), Positives = 176/345 (51.01%), Query Frame = 1

Query: 97  VLSEPFGLVLIISSWNFPLSLSLDPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLY-- 156
           VL +P G+V  I+ WNFPL++    +  A+++G T V+KPSE  P+ ++   A L L   
Sbjct: 184 VLKQPVGVVGAITPWNFPLAMITRKVGPALASGCTVVVKPSELTPL-TALAAAELALQAG 243

Query: 157 LDDKAIKVVEGGA-DVSEQLL-QYKWDKIFFTGSPRVARIVSSAAAKHLTPVTLELGGKC 216
           +   A+ VV G A ++ + LL   +  KI FTGS  V + + +AAA  +  V+LELGG  
Sbjct: 244 VPPGALNVVMGNAPEIGDALLTSPQVRKITFTGSTAVGKKLMAAAAPTVKKVSLELGGNA 303

Query: 217 PAI-FDYSSVHSNMKVAAKRIVGGKWGPCAGQACIGIDYVLVEDKFASELIESLKRILKK 276
           P+I FD     +++ VA K  +  K+   +GQ C+  + VLV+D    +  E+    ++K
Sbjct: 304 PSIVFD----DADLDVAVKGTLAAKFRN-SGQTCVCANRVLVQDGIYDKFAEAFSEAVQK 363

Query: 277 F-YGENSKNSTSIARIVNDKNVERISNLLKD--PKVAASIVHGGSIDKEKLFIEPTILLN 336
              G+  ++ T+   ++ND  V+++   ++D   K A  I+ G        F EPT++ +
Sbjct: 364 LEVGDGFRDGTTQGPLINDAAVQKVETFVQDAVSKGAKIIIGGKRHSLGMTFYEPTVIRD 423

Query: 337 PPLDADIMTEEIFGPLLPIITLNKIEESIEFINARPKPLALYAFTGDETLKKRILYETSS 396
              +  +  EEIFGP+ P+I     E++I   N     LA Y FT       R+      
Sbjct: 424 VSDNMIMSKEEIFGPVAPLIRFKTEEDAIRIANDTIAGLAAYIFTNSVQRSWRVFEALEY 483

Query: 397 GSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHGKYSFDTFSHEKAV 434
           G V  N+ ++    +  PFGGV QSG G    KY  D +   K V
Sbjct: 484 GLVGVNEGLIS--TEVAPFGGVKQSGLGREGSKYGMDEYLEIKYV 520

BLAST of CSPI04G05130.1 vs. NCBI nr
Match: gi|449457494|ref|XP_004146483.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis sativus])

HSP 1 Score: 943.3 bits (2437), Expect = 1.6e-271
Identity = 474/477 (99.37%), Positives = 476/477 (99.79%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSANNALSSLHKWMAPKKKP+PLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPPL ADIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSMDKEKLFIEPTILLNPPLYADIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG
Sbjct: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 477

BLAST of CSPI04G05130.1 vs. NCBI nr
Match: gi|659107520|ref|XP_008453718.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo])

HSP 1 Score: 921.0 bits (2379), Expect = 8.5e-265
Identity = 461/477 (96.65%), Positives = 469/477 (98.32%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEA+LEVLRESFKNGRTRSYEWR KQLSSLIQ IHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSAN+ALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLD+KAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIV SAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVN+KNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DPKVAASIVHGGS+DKEKLFIEPTILLNPPLD DIMTEEIFGPLLPIITLNKIEESIEFI
Sbjct: 301 DPKVAASIVHGGSVDKEKLFIEPTILLNPPLDTDIMTEEIFGPLLPIITLNKIEESIEFI 360

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           NARPKPLALYAFT DETLKKRILY+TSSGSVTFNDTMVQFVCDSLPFGGVGQSG GSYHG
Sbjct: 361 NARPKPLALYAFTEDETLKKRILYKTSSGSVTFNDTMVQFVCDSLPFGGVGQSGFGSYHG 420

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK
Sbjct: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 477

BLAST of CSPI04G05130.1 vs. NCBI nr
Match: gi|802638905|ref|XP_012078389.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Jatropha curcas])

HSP 1 Score: 718.4 bits (1853), Expect = 8.4e-204
Identity = 342/477 (71.70%), Positives = 410/477 (85.95%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +EASLE LR++F++G+TR+ EWR  QL +LIQF +D E  IF+AL QDLGKHPVE +RDE
Sbjct: 6   IEASLEELRKTFRSGKTRTVEWRKTQLRALIQFFNDNEENIFQALNQDLGKHPVESYRDE 65

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VG+VLKSANN+LS + KWMAPKK  +PLL FPA G+V+ EPFG+VLI  SWNFP++++LD
Sbjct: 66  VGVVLKSANNSLSCIEKWMAPKKSHIPLLMFPASGQVIPEPFGVVLIFGSWNFPITMALD 125

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNT +LKPS+ +P  SSFL  TLP YLD +AIKV+EGG +V EQ+LQ KWDK
Sbjct: 126 PLIGAISAGNTVLLKPSDLSPKCSSFLANTLPKYLDSEAIKVIEGGINVCEQILQQKWDK 185

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGS RV R++ + AAKHLTPVTLELGGKCP + D ++V S+MK+ AKRIVGGKWGPC
Sbjct: 186 IFFTGSQRVGRVIMTEAAKHLTPVTLELGGKCPLVLDTATVSSDMKIVAKRIVGGKWGPC 245

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACI +DYVLVE+KFAS LI+SL RI++KFYGEN+K S S++RI N K  +R+S+++K
Sbjct: 246 SGQACISVDYVLVEEKFASYLIDSLSRIIRKFYGENTKESKSLSRIANIKAFDRLSSVIK 305

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           DP VAASIVHGGS D+EKLFIEPTILLNPPLD++IMTEEIFGPLLPIIT+N I+ESI+FI
Sbjct: 306 DPLVAASIVHGGSTDEEKLFIEPTILLNPPLDSEIMTEEIFGPLLPIITVNNIQESIQFI 365

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           ++RPKPL +YAFT DET K++IL +TSSGSV FNDTMVQFVCD LPFGGVG SG G YHG
Sbjct: 366 SSRPKPLVIYAFTKDETFKRQILTQTSSGSVVFNDTMVQFVCDELPFGGVGHSGFGRYHG 425

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAVMQR F  ELEPRYPPWN+FKL+FI+LAY ++Y GL LLLLGLKK
Sbjct: 426 KYSFDTFSHEKAVMQRGFFPELEPRYPPWNNFKLEFIKLAYSFNYLGLLLLLLGLKK 482

BLAST of CSPI04G05130.1 vs. NCBI nr
Match: gi|297735060|emb|CBI17422.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 716.1 bits (1847), Expect = 4.2e-203
Identity = 348/477 (72.96%), Positives = 401/477 (84.07%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 17  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 76

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 77  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 136

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 137 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 196

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 197 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 256

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 257 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 316

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           +P VAASIVHGG ID+EKLFIEPTILL+PPLDA+IMTEEIFGPLLPIITL  IEESIEFI
Sbjct: 317 EPLVAASIVHGGLIDEEKLFIEPTILLDPPLDAEIMTEEIFGPLLPIITLKNIEESIEFI 376

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N+RPKPLALYAFT DE  K+RIL ETSSGSVTFND ++QFVCD+LPFGGVGQSG G YHG
Sbjct: 377 NSRPKPLALYAFTNDEAFKRRILSETSSGSVTFNDIIIQFVCDTLPFGGVGQSGFGRYHG 436

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAV++RSF +ELEPR+PPWNDFKLKFIRL Y +DY GL LLLLGLK+
Sbjct: 437 KYSFDTFSHEKAVLRRSFFLELEPRFPPWNDFKLKFIRLVYSFDYLGLILLLLGLKR 493

BLAST of CSPI04G05130.1 vs. NCBI nr
Match: gi|731386947|ref|XP_002273358.2| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Vitis vinifera])

HSP 1 Score: 716.1 bits (1847), Expect = 4.2e-203
Identity = 348/477 (72.96%), Positives = 401/477 (84.07%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 19  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 78

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 79  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 138

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 139 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 198

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 199 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 258

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 259 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 318

Query: 301 DPKVAASIVHGGSIDKEKLFIEPTILLNPPLDADIMTEEIFGPLLPIITLNKIEESIEFI 360
           +P VAASIVHGG ID+EKLFIEPTILL+PPLDA+IMTEEIFGPLLPIITL  IEESIEFI
Sbjct: 319 EPLVAASIVHGGLIDEEKLFIEPTILLDPPLDAEIMTEEIFGPLLPIITLKNIEESIEFI 378

Query: 361 NARPKPLALYAFTGDETLKKRILYETSSGSVTFNDTMVQFVCDSLPFGGVGQSGSGSYHG 420
           N+RPKPLALYAFT DE  K+RIL ETSSGSVTFND ++QFVCD+LPFGGVGQSG G YHG
Sbjct: 379 NSRPKPLALYAFTNDEAFKRRILSETSSGSVTFNDIIIQFVCDTLPFGGVGQSGFGRYHG 438

Query: 421 KYSFDTFSHEKAVMQRSFLIELEPRYPPWNDFKLKFIRLAYRYDYFGLALLLLGLKK 478
           KYSFDTFSHEKAV++RSF +ELEPR+PPWNDFKLKFIRL Y +DY GL LLLLGLK+
Sbjct: 439 KYSFDTFSHEKAVLRRSFFLELEPRFPPWNDFKLKFIRLVYSFDYLGLILLLLGLKR 495

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
AL3F1_ARATH   2.4e-182  63.10         Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana GN=ALDH3F1 PE=...
AL3I1_ARATH   1.9e-131  49.79         Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana...
AL3H1_ARATH   3.4e-128  50.00         Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana GN=ALDH3H1 PE=...
ALDH_CRAPL    5.9e-125  47.58         Aldehyde dehydrogenase OS=Craterostigma plantagineum GN=ALDH PE=1 SV=1
ALDH3_DICDI   6.4e-103  44.00         Aldehyde dehydrogenase family 3 comG OS=Dictyostelium discoideum GN=comG PE=3 SV...
Match Name        E-value   Identity (%)  Description
A0A0A0KZF5_CUCSA  1.1e-271  99.37         Aldehyde dehydrogenase OS=Cucumis sativus GN=Csa_4G043870 PE=3 SV=1
A0A067KCB6_JATCU  5.8e-204  71.70         Aldehyde dehydrogenase OS=Jatropha curcas GN=JCGZ_13661 PE=3 SV=1
D7SP43_VITVI      2.9e-203  72.96         Aldehyde dehydrogenase OS=Vitis vinifera GN=VIT_04s0023g02810 PE=3 SV=1
A0A061DFH0_THECC  5.6e-199  71.07         Aldehyde dehydrogenase OS=Theobroma cacao GN=TCM_000268 PE=3 SV=1
M5VPN0_PRUPE      2.1e-198  69.81         Aldehyde dehydrogenase OS=Prunus persica GN=PRUPE_ppa004971mg PE=3 SV=1
Match Name   E-value   Identity (%)  Description
AT4G36250.1  1.3e-183  63.10         aldehyde dehydrogenase 3F1
AT4G34240.1  1.1e-132  49.79         aldehyde dehydrogenase 3I1
AT1G44170.1  1.9e-129  50.00         aldehyde dehydrogenase 3H1
AT3G66658.2  4.0e-39   26.90         aldehyde dehydrogenase 22A1
AT1G79440.1  4.6e-35   32.17         aldehyde dehydrogenase 5F1
Match Name                        E-value   Identity (%)  Description
gi|449457494|ref|XP_004146483.1|  1.6e-271  99.37         PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis sativus]
gi|659107520|ref|XP_008453718.1|  8.5e-265  96.65         PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo]
gi|802638905|ref|XP_012078389.1|  8.4e-204  71.70         PREDICTED: aldehyde dehydrogenase family 3 member F1 [Jatropha curcas]
gi|297735060|emb|CBI17422.3|      4.2e-203  72.96         unnamed protein product [Vitis vinifera]
gi|731386947|ref|XP_002273358.2|  4.2e-203  72.96         PREDICTED: aldehyde dehydrogenase family 3 member F1 [Vitis vinifera]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR012394  Aldehyde_DH_NAD(P)
IPR015590  Aldehyde_DH_dom
IPR016161  Ald_DH/histidinol_DH
IPR016162  Ald_DH_N
IPR016163  Ald_DH_C
Vocabulary: Molecular Function
Term        Definition
GO:0004030  aldehyde dehydrogenase [NAD(P)+] activity
GO:0016491  oxidoreductase activity
GO:0016620  oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
Vocabulary: Biological Process
Term        Definition
GO:0006081  cellular aldehyde metabolic process
GO:0055114  oxidation-reduction process
GO:0008152  metabolic process
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006081 cellular aldehyde metabolic process
biological_process GO:0008152 metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019722 calcium-mediated signaling
biological_process GO:0042631 cellular response to water deprivation
biological_process GO:0006094 gluconeogenesis
biological_process GO:0006096 glycolytic process
biological_process GO:0006547 histidine metabolic process
biological_process GO:0006558 L-phenylalanine metabolic process
biological_process GO:0009612 response to mechanical stimulus
biological_process GO:0006570 tyrosine metabolic process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016020 membrane
molecular_function GO:0004030 aldehyde dehydrogenase [NAD(P)+] activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI04G05130  CSPI04G05130  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
CSPI04G05130.1  CSPI04G05130.1-protein  polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI04G05130.1.utr3p1  CSPI04G05130.1.utr3p1  three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
CSPI04G05130.1.cds10  CSPI04G05130.1.cds10  CDS
CSPI04G05130.1.cds9   CSPI04G05130.1.cds9   CDS
CSPI04G05130.1.cds8   CSPI04G05130.1.cds8   CDS
CSPI04G05130.1.cds7   CSPI04G05130.1.cds7   CDS
CSPI04G05130.1.cds6   CSPI04G05130.1.cds6   CDS
CSPI04G05130.1.cds5   CSPI04G05130.1.cds5   CDS
CSPI04G05130.1.cds4   CSPI04G05130.1.cds4   CDS
CSPI04G05130.1.cds3   CSPI04G05130.1.cds3   CDS
CSPI04G05130.1.cds2   CSPI04G05130.1.cds2   CDS
CSPI04G05130.1.cds1   CSPI04G05130.1.cds1   CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI04G05130.1.utr5p1  CSPI04G05130.1.utr5p1  five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                          Source   Source Term        Source Description                         Alignment
IPR012394  Aldehyde dehydrogenase NAD(P)-dependent  PIR      PIRSF036492        ALDH                                       coord: 1..469; score: 2.7E
IPR015590  Aldehyde dehydrogenase domain            PFAM     PF00171            Aldedh                                     coord: 9..433; score: 8.0
IPR016161  Aldehyde/histidinol dehydrogenase        unknown  SSF53720           ALDH-like                                  coord: 2..449; score: 1.31E
IPR016162  Aldehyde dehydrogenase N-terminal domain GENE3D   G3DSA:3.40.605.10  -                                          coord: 4..209; score: 4.3
IPR016163  Aldehyde dehydrogenase, C-terminal       GENE3D   G3DSA:3.40.309.10  -                                          coord: 213..413; score: 1.9
None       No IPR available                         PANTHER  PTHR11699          ALDEHYDE DEHYDROGENASE-RELATED             coord: 1..463; score:
None       No IPR available                         PANTHER  PTHR11699:SF117    ALDEHYDE DEHYDROGENASE FAMILY 3 MEMBER F1  coord: 1..463; score:
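
The `coord: start..end` entries in the InterPro annotations above give the residue range each signature covers on the protein. A short Python sketch for turning such an entry into a span length follows. It assumes the coordinates are 1-based and inclusive (the usual convention for InterProScan output, but not stated on this page), and the helper name `domain_span` is hypothetical.

```python
import re

# Hypothetical helper: matches "coord: <start>..<end>" as shown in the table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def domain_span(text):
    """Return (start, end, length), assuming 1-based inclusive coordinates."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no 'coord: start..end' entry in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1

if __name__ == "__main__":
    # Coordinates copied from the two GENE3D rows of the table.
    for label, entry in [("N-terminal domain", "coord: 4..209"),
                         ("C-terminal domain", "coord: 213..413")]:
        start, end, length = domain_span(entry)
        print(label, start, end, length)
```

Under that assumption, the N-terminal domain covers 206 residues and the C-terminal domain 201 residues.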