CSPI04G05130.3 (mRNA) Wild cucumber (PI 183967)

Name: CSPI04G05130.3
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Aldehyde dehydrogenase
Location: Chr4 : 3460754 .. 3463587 (-)
Sequence length: 960
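
The location field can be split into chromosome, start, end and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Matches strings such as "Chr4 : 3460754 .. 3463587 (-)"; whitespace is optional.
LOCATION = re.compile(r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr4 : 3460754 .. 3463587 (-)")
print(chrom, strand, span_length(start, end))  # Chr4 - 2834
```

Under that assumption the genomic span covers 2834 bases, while the reported sequence length of 960 corresponds to the spliced coding sequence.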
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GTGTTTTTAGCATCGTAATCATTGATTAATAGTTAAACGTGGCTTTCATAAACGAAGGGAGGACAAGGAACATTAAATTCAATGCCTTAAACCATTTTCCATAAAATCTGAAAATCTGTCCACTTGGGGAAAGTAATGACATGACTTCCAAAATCTTAGAACAAACATAAAATGCAAAAGCAAGAGAAAAGAAGAAACCCCACACCATAAAAATCAGATCAAACACAGCTTCTTGACAGAGCCTTCTTTGTGGTGGGGTTTAGGAAATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTAAAATTTGGAAAATTAGATTATAGTTTTTCTTTGTATGGAAAGTGGAGCTAATTTTGCTAACGCTTTTAAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGGTTGGGTTTTTAAAGTTTTCCTTCTGTTTCTTTAGTTTCTTGTTCTTCTTCTTTCGTATATGCTTGTTTTGTTGTGTTGTAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTGTGAGTATTTTCCTTCTTTAGCTCAAGTAATTTTTTAACCAAAACGGACCCATCATCTGTTTATTTATTTTATTTTGAGATTTTGAGAGTTATACAACATTGGTGATTGATTTTTCCATTAAAAATTGAAGAATTCTGATGCTTCTTCTACTTATCTTTGCAGCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGTATGAACTTGGCTCGTTTAACAAGGGCGATCCTTCAGTTTGTTCGTGCTCTACGATTTGATCGTTACGAGTGATTGTTTCAGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTTGTTTAACTTGGTAGCTGAAATGTAATGTCGTTAATGGATCCAAGTTCGTTTTGTTATTGTTTCTCATTTTGAATTCGTAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGGTAAAAACAAGTTTCGCTTCGAACTATTTACTTATTTTCTTGCTTATCTCTGTGATCTTCCTCTTTCAGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTGTAAGCTAACTTCTCCATGAGGAAGTTTATGTTTCTGATTGTTTCTATTGAATGCTTTGCATTGATCTTGTAGCTTCATTGAACCAACAATATTGTTGAATCCTCCACTCGACGCGGATATCATGACCGAAGAAATCTTCGGTCCCCTGTTACCGATAATCACAGTAAGAATTACCAAATAACCTTTCAATAATTTCACACAACCATACTGATCACTAGTGTTAT
TGAAACAGTTGAACAAAATTGAAGAGAGCATTGAGTTCATCAATGCAAGACCGAAACCTCTCGCTCTATACGCCTTCACGGGAGACGAAACTCTCAAGAAACGAATTTTGTACGAAACATCATCAGGAAGTGTCACATTCAATGATACCATGGTTCAGGTATGAGCTTGAATTCCTTGAATCTGCATCTGACATCCTGAATGATACAGCACATGTCTTGCTATCACTGCTTGTACTTTCTCCCTAACAATAAAACTGGTTTCTGTTTATGTGTGAACGTAAGTAACACACTGTTAATAAATTATTACTATTTATTACTTTTCATGTATCAAGTGGAATTTGTTTTTGTCTTGCAGTTTGTGTGTGATTCGTTACCGTTCGGCGGTGTTGGTCAGAGCGGTTCCGGGAGTTACCACGGCAAGTATTCATTTGATACATTCAGCCATGAAAAGGCAGTGATGCAGAGAAGCTTTTTGATAGAACTCGAGCCACGATATCCACCATGGAATGATTTCAAGCTAAAGTTCATTAGATTGGCATACCGATACGACTACTTTGGGCTGGCACTGCTGCTTTTGGGGTTAAAGAAGTAGAAACTATAGCAGTTCAAAAATCAAAGAGCTAAGTGATCCTTGAGATAATAAACTGCTTTCCAAATCTCTTATTATGATCTCCTCATAAACTTTGTATCAAATCATGAACAAATAATATACCAGTTGTGCTTTTTGAAGGATTATTATATATATGTGGAGATTTATAATGATCCCATTTTAAATATTAGATTTTCAAACGTGTTTGCTCTTGCTGAGTGATTATAGAAAAGTTAATGGGAATA

mRNA sequence

ATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTGTAA

Coding sequence (CDS)

ATGGAAGCCAGTTTGGAAGTGCTAAGAGAAAGCTTCAAAAATGGAAGAACAAGAAGTTATGAATGGAGGATAAAGCAGCTGAGTTCATTGATTCAGTTCATCCATGACAAAGAAAACACCATTTTTGAAGCCCTTTATCAAGATCTTGGCAAACATCCTGTCGAAATTTTTAGAGATGAGGTTGGAATTGTTCTGAAATCTGCAAACAATGCTCTTTCTTCTTTACACAAATGGATGGCTCCTAAAAAGAAACCTGTGCCATTACTCTTCTTCCCAGCAAAAGGAGAGGTTTTGTCTGAACCATTTGGTTTGGTCCTCATAATTTCATCATGGAATTTCCCCCTTTCTTTGTCATTGGATCCGTTAATCGGAGCGATATCGGCAGGCAATACGGCGGTTTTAAAACCGTCGGAATATGCTCCGGTTTTCTCCTCTTTTCTTGTTGCAACACTCCCTCTTTACCTTGACGATAAAGCCATCAAGGTTGTGGAGGGTGGAGCTGATGTTTCTGAACAACTTTTACAGTATAAGTGGGATAAGATCTTCTTCACTGGGAGTCCAAGGGTAGCTAGGATTGTGTCGTCTGCAGCCGCAAAGCATTTAACTCCTGTTACTTTAGAGCTTGGGGGAAAATGCCCTGCAATCTTTGATTACTCCTCTGTCCATTCCAATATGAAGGTAGCGGCCAAGAGAATCGTTGGAGGAAAATGGGGGCCGTGCGCCGGACAGGCGTGCATAGGGATAGATTATGTGCTTGTGGAGGATAAGTTTGCTTCAGAATTGATCGAGTCATTAAAGCGAATACTCAAGAAGTTTTATGGTGAAAACTCGAAAAACTCAACGAGTATAGCTCGAATTGTTAATGATAAAAATGTTGAAAGAATAAGCAATCTTCTTAAAGACCCAAAAGTTGCTGCTTCCATCGTCCATGGTGGTTCTATTGACAAAGAGAAACTGTAA
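
The coding sequence can be translated to check it against the protein used as the BLAST query. The sketch below assumes the standard genetic code; `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used as input here. The full 960-nucleotide CDS holds 320 codons, and it ends in the stop codon TAA.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGAAGCCAGTTTGGAAGTGCTAAGAGAA"  # first 10 codons of the CDS above
print(translate(cds_start))  # MEASLEVLRE
```

The result matches the first ten residues of the query shown in the alignments.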
BLAST of CSPI04G05130.3 vs. Swiss-Prot
Match: AL3F1_ARATH (Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana GN=ALDH3F1 PE=2 SV=2)

HSP 1 Score: 405.2 bits (1040), Expect = 6.6e-112
Identity = 189/319 (59.25%), Positives = 247/319 (77.43%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E SL  +RE+F +GRTRS +WR  Q+ ++ + + D E+ I  AL+QDLGKH  E FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+VL++A  A++ L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SLSLD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAI+AGNT +LK SE +P  S+FL  T+P YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+ +AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N  +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP+V ASIV+GGSID++KL
Sbjct: 308 DPRVQASIVYGGSIDEDKL 326
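
The percentages on each HSP statistics line are the reported counts divided by the alignment length. A minimal sketch for recomputing them from such a line follows; `hsp_percentages` is a hypothetical helper, and the pattern assumes the line layout used in this record.

```python
import re

# Captures the identity and positives fractions from an HSP statistics line.
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("no identity/positives fields found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 189/319 (59.25%), Positives = 247/319 (77.43%), Query Frame = 1"
print(hsp_percentages(line))  # (59.25, 77.43)
```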

BLAST of CSPI04G05130.3 vs. Swiss-Prot
Match: AL3I1_ARATH (Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana GN=ALDH3I1 PE=1 SV=2)

HSP 1 Score: 322.4 bits (825), Expect = 5.6e-87
Identity = 165/320 (51.56%), Positives = 222/320 (69.38%), Query Frame = 1

Query: 2   EASLEV--LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRD 61
           EA+L V  LR +F +GRT+SYEWRI QL ++ + I +KE  I EALYQDL K  +E F  
Sbjct: 74  EAALLVDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLA 133

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+     S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  LS+
Sbjct: 134 EISNTKSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSV 193

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           +P+IGAI+AGN  VLKPSE AP  SS L      YLD+  I+V+EGG   +  LL  KWD
Sbjct: 194 EPVIGAIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWD 253

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIFFTG  RVARI+ +AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW  
Sbjct: 254 KIFFTGGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWAC 313

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQACIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++L
Sbjct: 314 NSGQACIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESML 373

Query: 302 KDPKVAASIVHGGSIDKEKL 320
           K+  VA  IVHGG I ++KL
Sbjct: 374 KENGVANKIVHGGRITEDKL 390
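
In each alignment block, the middle line shows a letter where query and subject residues are identical and `+` where a substitution scores positively. The identity count can be recomputed directly from the two aligned strings. This is a sketch with a hypothetical helper name, applied to the last block of the hit above.

```python
def count_identities(query, subject):
    """Count aligned columns where both rows carry the same residue (gaps never match)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

query_row = "KDPKVAASIVHGGSIDKEKL"
sbjct_row = "KENGVANKIVHGGRITEDKL"
print(count_identities(query_row, sbjct_row), "of", len(query_row))  # 11 of 20
```

Summing this count over all blocks of an HSP should reproduce the numerator of its Identity field.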

BLAST of CSPI04G05130.3 vs. Swiss-Prot
Match: AL3H1_ARATH (Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana GN=ALDH3H1 PE=1 SV=2)

HSP 1 Score: 302.0 bits (772), Expect = 7.8e-81
Identity = 152/312 (48.72%), Positives = 205/312 (65.71%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR SF +G TR YEWR+ QL  L+    + E  I  AL  DLGK  +E    EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  LS+DP+IGAIS
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  VLKPSE AP  S+ L   L  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++ +AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKL 320
           IV+GG  D+E L
Sbjct: 319 IVYGGEKDRENL 327

BLAST of CSPI04G05130.3 vs. Swiss-Prot
Match: ALDH_CRAPL (Aldehyde dehydrogenase OS=Craterostigma plantagineum GN=ALDH PE=1 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 1.6e-78
Identity = 146/320 (45.62%), Positives = 214/320 (66.88%), Query Frame = 1

Query: 2   EASLEVLRESFKNGRTRSYEWRIKQLSSLIQFI--HDKENTIFEALYQDLGKHPVEIFRD 61
           E  ++ LR ++ +G+T+SYEWR+ QL +L++    HDKE  + EAL  DL K   E +  
Sbjct: 7   EGVVDGLRRTYISGKTKSYEWRVSQLKALLKITTHHDKE--VVEALRADLKKPEHEAYVH 66

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+ +V  +  +AL  LH+WM P+K    L  +P+  E++SEP G+VL+I++WN+P  L+L
Sbjct: 67  EIFMVSNACKSALKELHQWMKPQKVKTSLATYPSSAEIVSEPLGVVLVITAWNYPFLLAL 126

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           DP+IGAI+AGN  VLKPSE AP  S+ L   L  Y+D  AI+VVEG     + LL  +WD
Sbjct: 127 DPMIGAIAAGNCVVLKPSEIAPATSALLAKLLNQYVDTSAIRVVEGAVPEMQALLDQRWD 186

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIF+TGS +V +IV S+AAKHLTPV LELGGKCP + D    + ++KVAA+RI+  KW  
Sbjct: 187 KIFYTGSSKVGQIVLSSAAKHLTPVVLELGGKCPTVVD---ANIDLKVAARRIISWKWSG 246

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQ CI  DY++  ++ A +L++++K  L+ FYG++   S  ++ I+N++  ER++ LL
Sbjct: 247 NSGQTCISPDYIITTEENAPKLVDAIKCELESFYGKDPLKSQDMSSIINERQFERMTGLL 306

Query: 302 KDPKVAASIVHGGSIDKEKL 320
            D KV+  IV+GG  DK  L
Sbjct: 307 DDKKVSDKIVYGGQSDKSNL 321

BLAST of CSPI04G05130.3 vs. Swiss-Prot
Match: ALDH3_DICDI (Aldehyde dehydrogenase family 3 comG OS=Dictyostelium discoideum GN=comG PE=3 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 1.5e-63
Identity = 136/311 (43.73%), Positives = 190/311 (61.09%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR+ F + +TR  +WR  QL ++ + + + ++ I  A+ +DLGKH  EI + E+ ++   
Sbjct: 18  LRKVFLSQKTRKIDWRYSQLKAIKKMMSENKDNITAAVKKDLGKHEFEIHQTEIVMIQTE 77

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
            +  +S L  W   +K   PL F PA   +L EP G+VLI+S WN+P++L+L PLIGAI+
Sbjct: 78  LDETISHLESWNKTEKVYSPLHFKPASSYILKEPLGVVLIMSPWNYPVNLALIPLIGAIA 137

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKV-VEGGADVSEQLLQYKWDKIFFTGS 187
            GN A+LK S ++   S  L   L  YLD +  +   EGGA    +LL+YKWD IFFTGS
Sbjct: 138 GGNCALLKLSRHSYNISKLLHGLLTKYLDPECFEFDCEGGAPYITELLEYKWDHIFFTGS 197

Query: 188 PRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC--AGQA 247
            +V +IV  AAAK LTPVTLELGGK P I D     +++K+ A+R++   WG C  AGQ 
Sbjct: 198 VKVGKIVYQAAAKFLTPVTLELGGKNPCIVDKD---TDIKLTARRLI---WGKCWNAGQT 257

Query: 248 CIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKV 307
           CIG+DY++V       LIE  K +LK+F+GE+ K STS ARI++    ER+  L    KV
Sbjct: 258 CIGLDYLIVHKSILEPLIEEFKVVLKEFFGEDIKKSTSFARIISSAAAERLQQLFSMGKV 317

Query: 308 AASIVHGGSID 316
               V GG  D
Sbjct: 318 ----VIGGEAD 318

BLAST of CSPI04G05130.3 vs. TrEMBL
Match: A0A0A0KZF5_CUCSA (Aldehyde dehydrogenase OS=Cucumis sativus GN=Csa_4G043870 PE=3 SV=1)

HSP 1 Score: 630.2 bits (1624), Expect = 1.4e-177
Identity = 317/319 (99.37%), Positives = 319/319 (100.00%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSANNALSSLHKWMAPKKKP+PLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DPKVAASIVHGGS+DKEKL
Sbjct: 301 DPKVAASIVHGGSMDKEKL 319

BLAST of CSPI04G05130.3 vs. TrEMBL
Match: A0A067KCB6_JATCU (Aldehyde dehydrogenase OS=Jatropha curcas GN=JCGZ_13661 PE=3 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.6e-125
Identity = 216/319 (67.71%), Positives = 268/319 (84.01%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +EASLE LR++F++G+TR+ EWR  QL +LIQF +D E  IF+AL QDLGKHPVE +RDE
Sbjct: 6   IEASLEELRKTFRSGKTRTVEWRKTQLRALIQFFNDNEENIFQALNQDLGKHPVESYRDE 65

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VG+VLKSANN+LS + KWMAPKK  +PLL FPA G+V+ EPFG+VLI  SWNFP++++LD
Sbjct: 66  VGVVLKSANNSLSCIEKWMAPKKSHIPLLMFPASGQVIPEPFGVVLIFGSWNFPITMALD 125

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNT +LKPS+ +P  SSFL  TLP YLD +AIKV+EGG +V EQ+LQ KWDK
Sbjct: 126 PLIGAISAGNTVLLKPSDLSPKCSSFLANTLPKYLDSEAIKVIEGGINVCEQILQQKWDK 185

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGS RV R++ + AAKHLTPVTLELGGKCP + D ++V S+MK+ AKRIVGGKWGPC
Sbjct: 186 IFFTGSQRVGRVIMTEAAKHLTPVTLELGGKCPLVLDTATVSSDMKIVAKRIVGGKWGPC 245

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACI +DYVLVE+KFAS LI+SL RI++KFYGEN+K S S++RI N K  +R+S+++K
Sbjct: 246 SGQACISVDYVLVEEKFASYLIDSLSRIIRKFYGENTKESKSLSRIANIKAFDRLSSVIK 305

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP VAASIVHGGS D+EKL
Sbjct: 306 DPLVAASIVHGGSTDEEKL 324

BLAST of CSPI04G05130.3 vs. TrEMBL
Match: D7SP43_VITVI (Aldehyde dehydrogenase OS=Vitis vinifera GN=VIT_04s0023g02810 PE=3 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 4.9e-122
Identity = 217/319 (68.03%), Positives = 257/319 (80.56%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 17  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 76

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 77  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 136

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 137 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 196

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 197 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 256

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 257 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 316

Query: 301 DPKVAASIVHGGSIDKEKL 320
           +P VAASIVHGG ID+EKL
Sbjct: 317 EPLVAASIVHGGLIDEEKL 335

BLAST of CSPI04G05130.3 vs. TrEMBL
Match: M5VPN0_PRUPE (Aldehyde dehydrogenase OS=Prunus persica GN=PRUPE_ppa004971mg PE=3 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 1.4e-121
Identity = 211/319 (66.14%), Positives = 262/319 (82.13%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E +L  LR++FK+GRTRS  WR  Q+S+L+Q IHD+E+ IF+ALY+DLGKHPVE++RDE
Sbjct: 7   VEETLSELRQTFKSGRTRSVAWRKNQVSALLQLIHDQEDEIFKALYEDLGKHPVEVYRDE 66

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +GIV K+ N  LS+L KW+APKK  +PLL FP  GEVL EP G+VLI +SWNFP++L LD
Sbjct: 67  IGIVKKTINYTLSNLEKWVAPKKSRLPLLLFPTSGEVLPEPLGVVLIFASWNFPIALGLD 126

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGNT VLKPSE AP  SSFL  T+P Y+D KA++V+EGGA++SE LLQ KWDK
Sbjct: 127 PVIGAISAGNTVVLKPSEQAPACSSFLANTIPQYMDSKAVRVIEGGAEISELLLQQKWDK 186

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIV SAAAK+LTPVTLELGGKCP I D  S  S++KVA KRIVGGKWGPC
Sbjct: 187 IFFTGSPQVGRIVMSAAAKNLTPVTLELGGKCPTILDSFSNPSDLKVAIKRIVGGKWGPC 246

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DY+L+E+K AS LIE LK+ +K+FY ++ K+S  IAR++N  + ER+ NLLK
Sbjct: 247 NGQACIGVDYILIEEKLASTLIELLKKTVKRFYSDSPKDSKCIARVINRGHFERLRNLLK 306

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP VAASIVHGGS+D+E L
Sbjct: 307 DPLVAASIVHGGSLDEENL 325

BLAST of CSPI04G05130.3 vs. TrEMBL
Match: I1JCK5_SOYBN (Aldehyde dehydrogenase OS=Glycine max GN=GLYMA_02G051500 PE=3 SV=2)

HSP 1 Score: 438.0 bits (1125), Expect = 1.0e-119
Identity = 212/319 (66.46%), Positives = 257/319 (80.56%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E  +  LR+ FK G+T+S  WR  QL+SLI  +H+ E+ IF+AL++DLGKHPVE +RDE
Sbjct: 7   VEEPVRELRQYFKTGKTKSVTWRKNQLTSLIDLVHENEDAIFKALHKDLGKHPVEAYRDE 66

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VG V KSA+ ALS + KWMAPKK  +P LFFPAKGEVLSEP G+VLIISSWNFP+ L+LD
Sbjct: 67  VGGVEKSASKALSCVEKWMAPKKSDIPFLFFPAKGEVLSEPLGVVLIISSWNFPIILALD 126

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN  V+KPSE AP  SSFL  T+P YLD  AIKV+EGG DV EQLL+ KWDK
Sbjct: 127 PIIGAISAGNVVVIKPSEQAPACSSFLANTIPRYLDSNAIKVIEGGEDVCEQLLRQKWDK 186

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVA +V SAAAK+LTPVTLELGGKCPAI D     S  ++A KRIVGGKWGPC
Sbjct: 187 IFFTGSPRVASVVMSAAAKNLTPVTLELGGKCPAILDSLPNPSEFELAVKRIVGGKWGPC 246

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACIGIDY+LVE+KF+S +I+ LK+ +++FYGEN   S  I+RI+N ++ ER+ NLLK
Sbjct: 247 SGQACIGIDYLLVEEKFSSAVIKLLKKFIRRFYGENPVESKVISRIINKQHFERLCNLLK 306

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP VAASIVHGGS+D+E L
Sbjct: 307 DPLVAASIVHGGSVDEENL 325

BLAST of CSPI04G05130.3 vs. TAIR10
Match: AT4G36250.1 (AT4G36250.1 aldehyde dehydrogenase 3F1)

HSP 1 Score: 405.2 bits (1040), Expect = 3.7e-113
Identity = 189/319 (59.25%), Positives = 247/319 (77.43%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E SL  +RE+F +GRTRS +WR  Q+ ++ + + D E+ I  AL+QDLGKH  E FRDE
Sbjct: 8   VEESLREMRETFASGRTRSLKWRKAQIGAIYEMVKDNEDKICNALFQDLGKHSTEAFRDE 67

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+VL++A  A++ L KW  PK   +PLLF+PAKG+V+SEP+G VL++SSWNFP+SLSLD
Sbjct: 68  LGVVLRTATVAINCLDKWAVPKHSKLPLLFYPAKGKVISEPYGTVLVLSSWNFPISLSLD 127

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAI+AGNT +LK SE +P  S+FL  T+P YLD KAIKV+EGG DV+  LLQ++WDK
Sbjct: 128 PLIGAIAAGNTVLLKSSELSPNASAFLAKTIPAYLDTKAIKVIEGGPDVATILLQHQWDK 187

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP++ RI+ +AAA+HLTPVTLELGGKCP I D+ ++  N+K   KRI GGKWG C
Sbjct: 188 IFFTGSPKIGRIIMAAAAQHLTPVTLELGGKCPTIVDHHTISKNIKSVVKRIAGGKWGSC 247

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACI +DYVL+E  FA  LI+ LK  +K F+GEN K S  ++RI N  +V+R+S LL 
Sbjct: 248 NGQACISVDYVLIEKSFAPTLIDMLKPTIKSFFGENPKESGCLSRIANKHHVQRLSRLLS 307

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP+V ASIV+GGSID++KL
Sbjct: 308 DPRVQASIVYGGSIDEDKL 326

BLAST of CSPI04G05130.3 vs. TAIR10
Match: AT4G34240.1 (AT4G34240.1 aldehyde dehydrogenase 3I1)

HSP 1 Score: 322.4 bits (825), Expect = 3.2e-88
Identity = 165/320 (51.56%), Positives = 222/320 (69.38%), Query Frame = 1

Query: 2   EASLEV--LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRD 61
           EA+L V  LR +F +GRT+SYEWRI QL ++ + I +KE  I EALYQDL K  +E F  
Sbjct: 74  EAALLVDELRSNFNSGRTKSYEWRISQLQNIARMIDEKEKCITEALYQDLSKPELEAFLA 133

Query: 62  EVGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSL 121
           E+     S   A+  L  WMAP+     +  FP+  +++SEP G+VL+IS+WNFP  LS+
Sbjct: 134 EISNTKSSCMLAIKELKNWMAPETVKTSVTTFPSSAQIVSEPLGVVLVISAWNFPFLLSV 193

Query: 122 DPLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWD 181
           +P+IGAI+AGN  VLKPSE AP  SS L      YLD+  I+V+EGG   +  LL  KWD
Sbjct: 194 EPVIGAIAAGNAVVLKPSEIAPAASSLLAKLFSEYLDNTTIRVIEGGVPETTALLDQKWD 253

Query: 182 KIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGP 241
           KIFFTG  RVARI+ +AAA++LTPV LELGGKCPA+ D S V  N++VAA+RI+ GKW  
Sbjct: 254 KIFFTGGARVARIIMAAAARNLTPVVLELGGKCPALVD-SDV--NLQVAARRIIAGKWAC 313

Query: 242 CAGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLL 301
            +GQACIG+DYV+    FAS+LI++LK  L+ F+G+N+  S  ++RIVN  + +R+ ++L
Sbjct: 314 NSGQACIGVDYVITTKDFASKLIDALKTELETFFGQNALESKDLSRIVNSFHFKRLESML 373

Query: 302 KDPKVAASIVHGGSIDKEKL 320
           K+  VA  IVHGG I ++KL
Sbjct: 374 KENGVANKIVHGGRITEDKL 390

BLAST of CSPI04G05130.3 vs. TAIR10
Match: AT1G44170.1 (AT1G44170.1 aldehyde dehydrogenase 3H1)

HSP 1 Score: 302.0 bits (772), Expect = 4.4e-82
Identity = 152/312 (48.72%), Positives = 205/312 (65.71%), Query Frame = 1

Query: 8   LRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDEVGIVLKS 67
           LR SF +G TR YEWR+ QL  L+    + E  I  AL  DLGK  +E    EV ++  S
Sbjct: 19  LRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDLGKPELESSVYEVSLLRNS 78

Query: 68  ANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAIS 127
              AL  L  WMAP+K    L  FPA  E++SEP G+VL+IS+WN+P  LS+DP+IGAIS
Sbjct: 79  IKLALKQLKNWMAPEKAKTSLTTFPASAEIVSEPLGVVLVISAWNYPFLLSIDPVIGAIS 138

Query: 128 AGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDKIFFTGSP 187
           AGN  VLKPSE AP  S+ L   L  YLD  A++VVEG    +  LL+ KWDKIF+TGS 
Sbjct: 139 AGNAVVLKPSELAPASSALLTKLLEQYLDPSAVRVVEGAVTETSALLEQKWDKIFYTGSS 198

Query: 188 RVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPCAGQACIG 247
           ++ R++ +AAAKHLTPV LELGGK P + D     +++KV  +RI+ GKWG   GQAC+ 
Sbjct: 199 KIGRVIMAAAAKHLTPVVLELGGKSPVVVDSD---TDLKVTVRRIIVGKWGCNNGQACVS 258

Query: 248 IDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLKDPKVAAS 307
            DY+L   ++A +LI+++K  L+KFYG+N   S  ++RIVN  + +R+S LL + +V+  
Sbjct: 259 PDYILTTKEYAPKLIDAMKLELEKFYGKNPIESKDMSRIVNSNHFDRLSKLLDEKEVSDK 318

Query: 308 IVHGGSIDKEKL 320
           IV+GG  D+E L
Sbjct: 319 IVYGGEKDRENL 327

BLAST of CSPI04G05130.3 vs. TAIR10
Match: AT1G74920.1 (AT1G74920.1 aldehyde dehydrogenase 10A8)

HSP 1 Score: 98.6 bits (244), Expect = 7.4e-21
Identity = 81/240 (33.75%), Positives = 120/240 (50.00%), Query Frame = 1

Query: 80  APKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLDPLIGAISAGNTAVLKPSEY 139
           A +K PV L     K  VL +P G+V +I+ WN+PL +++  +  +++AG TA+LKPSE 
Sbjct: 130 AKQKAPVSLPMESFKSYVLKQPLGVVGLITPWNYPLLMAVWKVAPSLAAGCTAILKPSEL 189

Query: 140 APVFSSFLV-ATLPLYLDDKAIKVVEG-GADVSEQLLQYKW-DKIFFTGSPRVARIVSSA 199
           A V    L      + L    + V+ G G++    L  +   DKI FTGS      V +A
Sbjct: 190 ASVTCLELADICREVGLPPGVLNVLTGFGSEAGAPLASHPGVDKIAFTGSFATGSKVMTA 249

Query: 200 AAKHLTPVTLELGGKCPAI-FDYSSVHSNMKVAAKRIVGGKWGPCAGQACIGIDYVLVED 259
           AA+ + PV++ELGGK P I FD   +    K A   + G  W    GQ C     +LV +
Sbjct: 250 AAQLVKPVSMELGGKSPLIVFDDVDLD---KAAEWALFGCFW--TNGQICSATSRLLVHE 309

Query: 260 KFASELIESLKRILKKF-YGENSKNSTSIARIVNDKNVERISNLLKDPK-VAASIVHGGS 314
             ASE IE L +  K     +  +    +  +V+    E+I   +   K   A+I+HGGS
Sbjct: 310 SIASEFIEKLVKWSKNIKISDPMEEGCRLGPVVSKGQYEKILKFISTAKSEGATILHGGS 364

BLAST of CSPI04G05130.3 vs. TAIR10
Match: AT3G66658.2 (AT3G66658.2 aldehyde dehydrogenase 22A1)

HSP 1 Score: 97.8 bits (242), Expect = 1.3e-20
Identity = 78/308 (25.32%), Positives = 137/308 (44.48%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E  + + R++ K     S++ R + L  L+++I + +  I E   +D GK  V+    E
Sbjct: 88  VEERVTLSRKAQKTWAQSSFKLRRQFLRILLKYIIEHQELICEVSSRDTGKTMVDASLGE 147

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +    +     LS   +W+ P+ +            V   P G++  I  WN+P     +
Sbjct: 148 IMTTCEKITWLLSEGERWLKPESRSSGRAMLHKVSRVEFHPLGVIGAIVPWNYPFHNIFN 207

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYL-----DDKAIKVVEGGADVSEQLLQ 180
           P++ A+ +GN  V+K SE+A     F    +   L      +  + V+ G A+  E L+ 
Sbjct: 208 PMLAAVFSGNGIVIKVSEHASWSGCFYFRIIQAALAAVGAPENLVDVITGFAETGEALVS 267

Query: 181 YKWDKIFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIF-DYSSVHSNMKVAAKRIVG 240
              DK+ F GS  V +++   AA+ LTPVTLELGGK   I  + + V    +VA +  + 
Sbjct: 268 -SVDKMIFVGSTAVGKMIMRNAAETLTPVTLELGGKDAFIICEDADVSHVAQVAVRGTL- 327

Query: 241 GKWGPCAGQACIGIDYVLVEDKFASELIESLKRILKKF-YGENSKNSTSIARIVNDKNVE 300
                 +GQ C G +   V     +  I  + +I+K    G        +  I   ++ E
Sbjct: 328 ----QSSGQNCAGAERFYVHKDIYTAFIGQVTKIVKSVSAGPPLTGRYDMGAICLQEHSE 387

Query: 301 RISNLLKD 302
            + +L+ D
Sbjct: 388 HLQSLVND 389

BLAST of CSPI04G05130.3 vs. NCBI nr
Match: gi|449457494|ref|XP_004146483.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis sativus])

HSP 1 Score: 630.2 bits (1624), Expect = 2.0e-177
Identity = 317/319 (99.37%), Positives = 319/319 (100.00%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSANNALSSLHKWMAPKKKP+PLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANNALSSLHKWMAPKKKPLPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DPKVAASIVHGGS+DKEKL
Sbjct: 301 DPKVAASIVHGGSMDKEKL 319

BLAST of CSPI04G05130.3 vs. NCBI nr
Match: gi|659107520|ref|XP_008453718.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo])

HSP 1 Score: 612.8 bits (1579), Expect = 3.3e-172
Identity = 307/319 (96.24%), Positives = 314/319 (98.43%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           MEA+LEVLRESFKNGRTRSYEWR KQLSSLIQ IHDKENTIFEALYQDLGKHPVEIFRDE
Sbjct: 1   MEANLEVLRESFKNGRTRSYEWRKKQLSSLIQLIHDKENTIFEALYQDLGKHPVEIFRDE 60

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VGIVLKSAN+ALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD
Sbjct: 61  VGIVLKSANDALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLD+KAIKVVEGGADV EQLLQYKWDK
Sbjct: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDNKAIKVVEGGADVCEQLLQYKWDK 180

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP+V RIV SAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC
Sbjct: 181 IFFTGSPKVGRIVMSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           AGQACIGIDYVLVEDKFASELI+SLKRILKKFYGENSKNSTSIARIVN+KNVERISNLLK
Sbjct: 241 AGQACIGIDYVLVEDKFASELIDSLKRILKKFYGENSKNSTSIARIVNEKNVERISNLLK 300

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DPKVAASIVHGGS+DKEKL
Sbjct: 301 DPKVAASIVHGGSVDKEKL 319

BLAST of CSPI04G05130.3 vs. NCBI nr
Match: gi|802638905|ref|XP_012078389.1| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Jatropha curcas])

HSP 1 Score: 457.2 bits (1175), Expect = 2.3e-125
Identity = 216/319 (67.71%), Positives = 268/319 (84.01%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +EASLE LR++F++G+TR+ EWR  QL +LIQF +D E  IF+AL QDLGKHPVE +RDE
Sbjct: 6   IEASLEELRKTFRSGKTRTVEWRKTQLRALIQFFNDNEENIFQALNQDLGKHPVESYRDE 65

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           VG+VLKSANN+LS + KWMAPKK  +PLL FPA G+V+ EPFG+VLI  SWNFP++++LD
Sbjct: 66  VGVVLKSANNSLSCIEKWMAPKKSHIPLLMFPASGQVIPEPFGVVLIFGSWNFPITMALD 125

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           PLIGAISAGNT +LKPS+ +P  SSFL  TLP YLD +AIKV+EGG +V EQ+LQ KWDK
Sbjct: 126 PLIGAISAGNTVLLKPSDLSPKCSSFLANTLPKYLDSEAIKVIEGGINVCEQILQQKWDK 185

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGS RV R++ + AAKHLTPVTLELGGKCP + D ++V S+MK+ AKRIVGGKWGPC
Sbjct: 186 IFFTGSQRVGRVIMTEAAKHLTPVTLELGGKCPLVLDTATVSSDMKIVAKRIVGGKWGPC 245

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
           +GQACI +DYVLVE+KFAS LI+SL RI++KFYGEN+K S S++RI N K  +R+S+++K
Sbjct: 246 SGQACISVDYVLVEEKFASYLIDSLSRIIRKFYGENTKESKSLSRIANIKAFDRLSSVIK 305

Query: 301 DPKVAASIVHGGSIDKEKL 320
           DP VAASIVHGGS D+EKL
Sbjct: 306 DPLVAASIVHGGSTDEEKL 324

BLAST of CSPI04G05130.3 vs. NCBI nr
Match: gi|297735060|emb|CBI17422.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 445.7 bits (1145), Expect = 7.0e-122
Identity = 217/319 (68.03%), Positives = 257/319 (80.56%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 17  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 76

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 77  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 136

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 137 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 196

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 197 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 256

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 257 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 316

Query: 301 DPKVAASIVHGGSIDKEKL 320
           +P VAASIVHGG ID+EKL
Sbjct: 317 EPLVAASIVHGGLIDEEKL 335

BLAST of CSPI04G05130.3 vs. NCBI nr
Match: gi|731386947|ref|XP_002273358.2| (PREDICTED: aldehyde dehydrogenase family 3 member F1 [Vitis vinifera])

HSP 1 Score: 445.7 bits (1145), Expect = 7.0e-122
Identity = 217/319 (68.03%), Positives = 257/319 (80.56%), Query Frame = 1

Query: 1   MEASLEVLRESFKNGRTRSYEWRIKQLSSLIQFIHDKENTIFEALYQDLGKHPVEIFRDE 60
           +E S+  LR +F++G TRS  WR  QL +L+Q + D EN IFEAL QDLGKHPVE +RDE
Sbjct: 19  VEESIGELRRTFRSGETRSAAWRKAQLKALLQLLRDNENKIFEALKQDLGKHPVESYRDE 78

Query: 61  VGIVLKSANNALSSLHKWMAPKKKPVPLLFFPAKGEVLSEPFGLVLIISSWNFPLSLSLD 120
           +G+V KS   +LS + +WMAPKK  +PL+FFP KG+VL EP GLVLI SSWNFP+SL+LD
Sbjct: 79  LGVVEKSVKYSLSHVDEWMAPKKSSLPLIFFPGKGQVLPEPLGLVLIFSSWNFPISLALD 138

Query: 121 PLIGAISAGNTAVLKPSEYAPVFSSFLVATLPLYLDDKAIKVVEGGADVSEQLLQYKWDK 180
           P+IGAISAGN+ VLKPSE AP  SSFL  T+PLYLD KAIKV+EGGA +S+QLLQ KWDK
Sbjct: 139 PVIGAISAGNSVVLKPSEQAPACSSFLANTIPLYLDSKAIKVIEGGAAISQQLLQQKWDK 198

Query: 181 IFFTGSPRVARIVSSAAAKHLTPVTLELGGKCPAIFDYSSVHSNMKVAAKRIVGGKWGPC 240
           IFFTGSP VARIV SAA KHLTPVT+ELGGKCP IFD  S  S+ +VA KR+VGGKWGPC
Sbjct: 199 IFFTGSPSVARIVMSAAVKHLTPVTIELGGKCPTIFDNLSSPSDTEVAVKRVVGGKWGPC 258

Query: 241 AGQACIGIDYVLVEDKFASELIESLKRILKKFYGENSKNSTSIARIVNDKNVERISNLLK 300
            GQACIG+DYVLVE+KFAS LIE LK+ +KKFYGEN K    I++IVN  + +R+ NLLK
Sbjct: 259 NGQACIGVDYVLVEEKFASHLIEMLKKTIKKFYGENPKELKDISKIVNKHHFQRLHNLLK 318

Query: 301 DPKVAASIVHGGSIDKEKL 320
           +P VAASIVHGG ID+EKL
Sbjct: 319 EPLVAASIVHGGLIDEEKL 337

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
AL3F1_ARATH   6.6e-112   59.25     Aldehyde dehydrogenase family 3 member F1 OS=Arabidopsis thaliana GN=ALDH3F1 PE=...
AL3I1_ARATH   5.6e-87    51.56     Aldehyde dehydrogenase family 3 member I1, chloroplastic OS=Arabidopsis thaliana...
AL3H1_ARATH   7.8e-81    48.72     Aldehyde dehydrogenase family 3 member H1 OS=Arabidopsis thaliana GN=ALDH3H1 PE=...
ALDH_CRAPL    1.6e-78    45.63     Aldehyde dehydrogenase OS=Craterostigma plantagineum GN=ALDH PE=1 SV=1
ALDH3_DICDI   1.5e-63    43.73     Aldehyde dehydrogenase family 3 comG OS=Dictyostelium discoideum GN=comG PE=3 SV...

Match Name         E-value    Identity  Description
A0A0A0KZF5_CUCSA   1.4e-177   99.37     Aldehyde dehydrogenase OS=Cucumis sativus GN=Csa_4G043870 PE=3 SV=1
A0A067KCB6_JATCU   1.6e-125   67.71     Aldehyde dehydrogenase OS=Jatropha curcas GN=JCGZ_13661 PE=3 SV=1
D7SP43_VITVI       4.9e-122   68.03     Aldehyde dehydrogenase OS=Vitis vinifera GN=VIT_04s0023g02810 PE=3 SV=1
M5VPN0_PRUPE       1.4e-121   66.14     Aldehyde dehydrogenase OS=Prunus persica GN=PRUPE_ppa004971mg PE=3 SV=1
I1JCK5_SOYBN       1.0e-119   66.46     Aldehyde dehydrogenase OS=Glycine max GN=GLYMA_02G051500 PE=3 SV=2

Match Name    E-value    Identity  Description
AT4G36250.1   3.7e-113   59.25     aldehyde dehydrogenase 3F1
AT4G34240.1   3.2e-88    51.56     aldehyde dehydrogenase 3I1
AT1G44170.1   4.4e-82    48.72     aldehyde dehydrogenase 3H1
AT1G74920.1   7.4e-21    33.75     aldehyde dehydrogenase 10A8
AT3G66658.2   1.3e-20    25.32     aldehyde dehydrogenase 22A1

Match Name                         E-value    Identity  Description
gi|449457494|ref|XP_004146483.1|   2.0e-177   99.37     PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis sativus]
gi|659107520|ref|XP_008453718.1|   3.3e-172   96.24     PREDICTED: aldehyde dehydrogenase family 3 member F1 [Cucumis melo]
gi|802638905|ref|XP_012078389.1|   2.3e-125   67.71     PREDICTED: aldehyde dehydrogenase family 3 member F1 [Jatropha curcas]
gi|297735060|emb|CBI17422.3|       7.0e-122   68.03     unnamed protein product [Vitis vinifera]
gi|731386947|ref|XP_002273358.2|   7.0e-122   68.03     PREDICTED: aldehyde dehydrogenase family 3 member F1 [Vitis vinifera]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR015590   Aldehyde_DH_dom
IPR016161   Ald_DH/histidinol_DH
IPR016162   Ald_DH_N
IPR016163   Ald_DH_C

Vocabulary: Biological Process
Term         Definition
GO:0008152   metabolic process
GO:0055114   oxidation-reduction process

Vocabulary: Molecular Function
Term         Definition
GO:0016491   oxidoreductase activity
GO:0016620   oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor

GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019722 calcium-mediated signaling
biological_process GO:0006081 cellular aldehyde metabolic process
biological_process GO:0042631 cellular response to water deprivation
biological_process GO:0006094 gluconeogenesis
biological_process GO:0006096 glycolytic process
biological_process GO:0006547 histidine metabolic process
biological_process GO:0006558 L-phenylalanine metabolic process
biological_process GO:0009612 response to mechanical stimulus
biological_process GO:0006570 tyrosine metabolic process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016020 membrane
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
molecular_function GO:0004030 aldehyde dehydrogenase [NAD(P)+] activity
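
Because each row of the GO table is a category, an accession, and a free-text term name separated by single spaces, the rows can be grouped by category with very little code. This is a minimal Python sketch under that layout assumption. The function name `group_go_terms` is hypothetical, and only a few rows from the table are copied in for illustration.

```python
from collections import defaultdict

# A few rows copied from the table above; each row is
# "<category> <accession> <term name>" separated by single spaces.
ROWS = """\
biological_process GO:0008152 metabolic process
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016020 membrane
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0004030 aldehyde dehydrogenase [NAD(P)+] activity
"""

def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two spaces only, so term names keep their spaces.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)

terms = group_go_terms(ROWS)
for category, entries in terms.items():
    print(category, len(entries))
```

Limiting the split to two separators is the key design choice here: term names such as "aldehyde dehydrogenase [NAD(P)+] activity" contain spaces and would otherwise be broken apart.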

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name    Type
CSPI04G05130   CSPI04G05130   gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name     Unique Name              Type
CSPI04G05130.3   CSPI04G05130.3-protein   polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CSPI04G05130.3.utr3p3   CSPI04G05130.3.utr3p3   three_prime_UTR
CSPI04G05130.3.utr3p2   CSPI04G05130.3.utr3p2   three_prime_UTR
CSPI04G05130.3.utr3p1   CSPI04G05130.3.utr3p1   three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
CSPI04G05130.3.cds7   CSPI04G05130.3.cds7   CDS
CSPI04G05130.3.cds6   CSPI04G05130.3.cds6   CDS
CSPI04G05130.3.cds5   CSPI04G05130.3.cds5   CDS
CSPI04G05130.3.cds4   CSPI04G05130.3.cds4   CDS
CSPI04G05130.3.cds3   CSPI04G05130.3.cds3   CDS
CSPI04G05130.3.cds2   CSPI04G05130.3.cds2   CDS
CSPI04G05130.3.cds1   CSPI04G05130.3.cds1   CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CSPI04G05130.3.utr5p1   CSPI04G05130.3.utr5p1   five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                           Source   Source Term        Source Description                         Alignment
IPR015590  Aldehyde dehydrogenase domain             PFAM     PF00171            Aldedh                                     coord: 9..314; score: 4.8
IPR016161  Aldehyde/histidinol dehydrogenase         unknown  SSF53720           ALDH-like                                  coord: 2..316; score: 2.23
IPR016162  Aldehyde dehydrogenase N-terminal domain  GENE3D   G3DSA:3.40.605.10                                             coord: 4..209; score: 1.5
IPR016163  Aldehyde dehydrogenase, C-terminal        GENE3D   G3DSA:3.40.309.10                                             coord: 210..316; score: 1.1
None       No IPR available                          PANTHER  PTHR11699          ALDEHYDE DEHYDROGENASE-RELATED             coord: 1..319; score: 1.9E
None       No IPR available                          PANTHER  PTHR11699:SF117    ALDEHYDE DEHYDROGENASE FAMILY 3 MEMBER F1  coord: 1..319; score: 1.9E
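
The `coord` fields in the InterPro table give the region of the protein covered by each signature. This is a minimal Python sketch for turning such a field into a span length, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention). The helper name `span_length` is hypothetical.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(field):
    """Return (start, end, length) for a 'coord: a..b' field.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    m = COORD_RE.search(field)
    if m is None:
        raise ValueError("no coord field found")
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1

# The two Gene3D domains listed in the table above.
n_term = span_length("coord: 4..209")
c_term = span_length("coord: 210..316")
print(n_term[2], c_term[2])

# The domains are adjacent when one starts right after the other ends.
print(c_term[0] == n_term[1] + 1)
```

Under that assumption, the N-terminal and C-terminal Gene3D domains are adjacent, non-overlapping segments of the protein.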