CmaCh04G010350 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G010350
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionAspartate-semialdehyde dehydrogenase
LocationCma_Chr04 : 5368161 .. 5372311 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTAATATATATATATAAATATTTTAAAATTTGGAAATATAAATGACTAAGAAGAAAAAAAAGTAGCAGTTTTGTCTTCTATTCAGTGGGTAACCGGCCGCCTCAGCTCCATTGAAGCTTTTCTTCTTTCTCTCTCTCTACAATGGCGGCTCTAGCTTTTTCAGGAACTTCCTTCTTCTTCTCCACTGCTTCTCCTCAACCTAAATCAAGGAGCATCTCTAGATTTGTTCGTATGGCGTATCAGGAGAACGGGCCTTCTCTGGCCGTCGTCGGCGTCACTGGCGCCGTCGGCCAGGAATTTCTCTCTGTCCTTTCCGACCGTGGATTCCCTTATCGCTCCATTAAGATGCTAGCCTCGAAGCGCTCCGCTGGCAAGTCCGTTCGCTTTCATGACGAAGAGCATATTGTTGAGGAGCTCACTGCGGATAGCTTCGATGGTGTGGACATTGCTCTCTTCAGCGCTGGAGGATCCATCAGCAAGAAGTTTGGACCGCTCGCCGCTGAGAAGGGGACTATTGTGGTCGATAATAGCTCCGCATTTCGGATGGATGAGAATGTGCCTCTTGTTATTCCGGAGGTTAATCCCGAAGCCATGAAGGGGATTAAGGTTGGGACTGGCAAGGGCGCTTTGATTGCCAATCCTAATTGCTCAACCATCATCTGCTTGATGGCTGTCACTCCTCTGCATCGTCACGCTAAGGTTTTGCGATTGATTCTTTCATCTTCTTCTTATTTTTTTCCCTTTGTTCTGTTTACAATTTTTGGGAATGACACGCGAGCTGTGTTTGAGTTTGAGATTTTACTCTCAGCATTTGCTTTTTCTTTCAAAAGTAGCCTGCCTTTTGATGTTTCCTTAATTGTGCTTTTAAGTGCTTCTCCTATAAGCATTTAAGTACTTCCAAAATGCTATAAAAATTTAATAAGTATTTTCAATGTGCTAAATCACTTTTTACTCTCCTAAATTTCATTTCAAGGTCACCCTTAATGGTAGATATATACCGGTATTGAAAATGACAGGTTTTACGCATGGTTGTTAGTACATATCAAGCAGCTAGTGGTGCTGGTGCTGCAGCAATGGAAGAGCTTGTGCAACAGACTCGCGAGGTTTTTTCCGTTTATTGTATCTCATAATCTGGATTGCTTTTTCTTTGACGTCTTATCTTTTGGATGGTTTATTAGTTCATTTCAATGGCTATTCTAAACTGTTTTGCTTCTCTTCCTTTTTTGAAGGTCCTGGAAGGAAAACCACCAACTTGTAATATATTTAGGGATCAGGTCAGTCTCTTTTACAATGACACTTATCTTTATGTGTCCTTTTTTTAGGGTCAAATTTCTATTTCCATTTTCTCAGTTTCTTCTCCTTGGTGCAGTATGCTTTCAATTTGTTCTCCCACAATGCATCTGTTCTTTCAAATGGATACAACGAAGAGGAAATGAAATTAGTAAAAGAAACTCGGAAAATATGGGTGAGTGTTGGTTGTGAATTATGGATACCACTTACGCCCTAAACTTTATTTGCACAACGTTTGTGAATTGTTTTTCTTTGGAAATTGGTAAATGGAATTGCGTTGTACTCTAACCGACTTGCTTTATGCCCCTCAAGAGTGATGCTAATGTTAAAGTTACTGCCACTTGCATACGAGTTCCTGTCATGCGTGCACATGCTGAGAGTGTGAATCTTCAATTTGAGAATCCCCTTGATGAGGTAAGATTTTATTTTTATATTGTTTTGATCAGTGCTATCGTGTATTCTTCTTGAGATATTTGTTGATTGCTGGCATATAGATTCTTAGTTTTCCTCTGCTTATTTTAGGTTGAGAATGAAATGATTTTGAAGTGCCTTTTCCCAGACTTTTGTATGGGAAATGGAAATCCACCCTAGTTGTTAAAATATCATGTCAGGACCCAAGGGGTAGGCCTTGAGGGTAATGGCTTGGGATGCTCGGAGATAACCTCCAAGGTCTTGGATTTAAGCCCCTAAGTTGAAAACTTTATAATTTGGCCTTGGGGTGGGTGAGGCCTCCTCAAAGATTAGTGGATGGGCTATAAGCATAATGTAAAGAGTGAACTTGTTTTTCTTAATTTTAACTTTTCTTTTGAATGAGGACATTTCCGCCTATCTTTTTCTTTTGTAATGCACAAAAGAATGGTGCTGAATCTAAGAGAATGTGCTTTCAACTTTTGAGGCCATGTGGTGAAGCCTTTTACAGCCTATTTGGACTTCTGCCAATCGGCCTCCAAATATATTGTCATGGTACATTTGCGTTCCATTTGAATGCTTCTTCATAGTGCTCTAAGAACTTCTTTGATTGAGCACATTGTAAAATTCGCTGTATAAGAACTTCTTTGATTGATTACTATATAATGGATGCCAATTGTTAGTTTCTTTAGAATGGTCCTGAACAATTACTCATCGATTATTTCCCATTTTTTCCTTCTTGTAGGAAATAAGTACTCATGGATTGTTCTCGTTTCTTTCTTCTTATAGGAAGCTAAAAATTCATAAATTCCCAAGAACTGAATTATAGAGCATTAGATTATTTAGCAAATAAAACCTACCACCGTGATGTTCATGGCGAAGTTCTTAAAGTCTGATATTTGCGGTTCTCTTTTCCTTTGAATTTGTTTTCTCGTTATGGGCTCTCCAAACCCACACTCGCAGTTGAATCTGAGATCATCATCAGTCCAGCATGAAATGTATCTGATTCATGTTTTGTGTCACATGACTAAAAACGTCCGACTTCATTTATCAATAGATCACCTCATACCTCTCTGTGTAGATTAATTCTTCTATGTCGAAGTTGTAATTGCATTAGTAAAATCTTTAACCATTCGATATGTTTGTGATTAATGATTCTAGTCGCAAAATTATAGTTCCACGTGTTGGATGATGAAAGTCCTACATCGGCTAATTTAGGGAATGATCATGGGTTTATAAGTGAAGAATACTAACTCCATTGGCATGAGGCTTTTTGGGAAGCCAAAGCCATGAGAGCTTATGCTAAAAGTGGACAATATCATACCATTGTGGAGAGTCGAGTTCGTCTAACATGGTATCAAAGTCATGCCCTAAACTTAGCCGTGCCAATAGATTGACAAATTCTCGAATGTCGAACAAAGGACTCCAAAAAGAAAAAAGGAGTCGAGCCTCCTCGAAGGTACGAAGGCAGTGAAAAAATGACAAACTCCAAAGGAGTCGAGCCTCGATTAAGGGGAAGCGTGCTTTGTTCGAGGGGAGGTGTTGGATGATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCATGAGTTTATAAGTGAGGAATATAACTCCATTGGCATGAGGCCTTTTGGGAAGCCCAAAGCAAAGCCATGAGAGTTTATGCTCAAAGTGGATAATATCATACCAGAATCGAGTTCTAAGTTATTATTTCTGTTTTATGCAGGACACAGCCAGAGAGATTCTGAAAAATGCTCCTGGAGTCGTCATTATAGACGACCGGGCTGCCAACCAGTTTCCTACACCATTGAAAGTATCAAACAAAGATGATGTCGCTGTAGGAAGGATTCGACAGGATGTTTCACTTGATGGAAATAAAGGGTATAAACATCTACCTCGTAGAGTTATCTGTTTGTTTCCTGATTGAAAAATGAACACAAAACTAATGGGATTCTCTCATGGGTGCAGGTTGGATATCTTCGTCTGTGGTGATCAAATTCGCAAGGGTGCAGCGCTTAATGCTGTTCAGATCGCCGAGTTGTTACTGTAGTCCCGACGATGACCGAAAATTGGTACATTGCTAGACAAAGTTCTTGTTGTTGCTACATATGATATCACAGAATGTCTCTCAAGGTAAAAAAAATCATCTCTCGGTCTTTGTATCCATAACTTTTAGCAAATTTTGCAGGACGGTCAATTAAGATTATGAGTTTTCATTTAGATCTCTGGTTTGAAGGAAAAATTTGGAATTGATGTTAAGTTTTTTTTTACAATAAACCACTTTGATGCTTTGTTTATGAATTGCTTGCATTTAGATGTTTTAGATATTAACATTTCATTTTCTCACCCATTCGGGTTGACTATCATTTGTTCGAGTTCAAGTTGAAGAAATTTTCAACCAAAACAATTTGGATCGAGTTTAAAAGGTACCTCACTGAACCCGCGAATCCTAC

mRNA sequence

TTAATATATATATATAAATATTTTAAAATTTGGAAATATAAATGACTAAGAAGAAAAAAAAGTAGCAGTTTTGTCTTCTATTCAGTGGGTAACCGGCCGCCTCAGCTCCATTGAAGCTTTTCTTCTTTCTCTCTCTCTACAATGGCGGCTCTAGCTTTTTCAGGAACTTCCTTCTTCTTCTCCACTGCTTCTCCTCAACCTAAATCAAGGAGCATCTCTAGATTTGTTCGTATGGCGTATCAGGAGAACGGGCCTTCTCTGGCCGTCGTCGGCGTCACTGGCGCCGTCGGCCAGGAATTTCTCTCTGTCCTTTCCGACCGTGGATTCCCTTATCGCTCCATTAAGATGCTAGCCTCGAAGCGCTCCGCTGGCAAGTCCGTTCGCTTTCATGACGAAGAGCATATTGTTGAGGAGCTCACTGCGGATAGCTTCGATGGTGTGGACATTGCTCTCTTCAGCGCTGGAGGATCCATCAGCAAGAAGTTTGGACCGCTCGCCGCTGAGAAGGGGACTATTGTGGTCGATAATAGCTCCGCATTTCGGATGGATGAGAATGTGCCTCTTGTTATTCCGGAGGTTAATCCCGAAGCCATGAAGGGGATTAAGGTTGGGACTGGCAAGGGCGCTTTGATTGCCAATCCTAATTGCTCAACCATCATCTGCTTGATGGCTGTCACTCCTCTGCATCGTCACGCTAAGGTTTTACGCATGGTTGTTAGTACATATCAAGCAGCTAGTGGTGCTGGTGCTGCAGCAATGGAAGAGCTTGTGCAACAGACTCGCGAGGTCCTGGAAGGAAAACCACCAACTTGTAATATATTTAGGGATCAGTATGCTTTCAATTTGTTCTCCCACAATGCATCTGTTCTTTCAAATGGATACAACGAAGAGGAAATGAAATTAGTAAAAGAAACTCGGAAAATATGGAGTGATGCTAATGTTAAAGTTACTGCCACTTGCATACGAGTTCCTGTCATGCGTGCACATGCTGAGAGTGTGAATCTTCAATTTGAGAATCCCCTTGATGAGGACACAGCCAGAGAGATTCTGAAAAATGCTCCTGGAGTCGTCATTATAGACGACCGGGCTGCCAACCAGTTTCCTACACCATTGAAAGTATCAAACAAAGATGATGTCGCTGTAGGAAGGATTCGACAGGATGTTTCACTTGATGGAAATAAAGGGTTGGATATCTTCGTCTGTGGTGATCAAATTCGCAAGGGTGCAGCGCTTAATGCTGTTCAGATCGCCGAGTTGTTACTGTAGTCCCGACGATGACCGAAAATTGGTACATTGCTAGACAAAGTTCTTGTTGTTGCTACATATGATATCACAGAATGTCTCTCAAGGTAAAAAAAATCATCTCTCGGTCTTTGTATCCATAACTTTTAGCAAATTTTGCAGGACGGTCAATTAAGATTATGAGTTTTCATTTAGATCTCTGGTTTGAAGGAAAAATTTGGAATTGATGTTAAGTTTTTTTTTACAATAAACCACTTTGATGCTTTGTTTATGAATTGCTTGCATTTAGATGTTTTAGATATTAACATTTCATTTTCTCACCCATTCGGGTTGACTATCATTTGTTCGAGTTCAAGTTGAAGAAATTTTCAACCAAAACAATTTGGATCGAGTTTAAAAGGTACCTCACTGAACCCGCGAATCCTAC

Coding sequence (CDS)

ATGGCGGCTCTAGCTTTTTCAGGAACTTCCTTCTTCTTCTCCACTGCTTCTCCTCAACCTAAATCAAGGAGCATCTCTAGATTTGTTCGTATGGCGTATCAGGAGAACGGGCCTTCTCTGGCCGTCGTCGGCGTCACTGGCGCCGTCGGCCAGGAATTTCTCTCTGTCCTTTCCGACCGTGGATTCCCTTATCGCTCCATTAAGATGCTAGCCTCGAAGCGCTCCGCTGGCAAGTCCGTTCGCTTTCATGACGAAGAGCATATTGTTGAGGAGCTCACTGCGGATAGCTTCGATGGTGTGGACATTGCTCTCTTCAGCGCTGGAGGATCCATCAGCAAGAAGTTTGGACCGCTCGCCGCTGAGAAGGGGACTATTGTGGTCGATAATAGCTCCGCATTTCGGATGGATGAGAATGTGCCTCTTGTTATTCCGGAGGTTAATCCCGAAGCCATGAAGGGGATTAAGGTTGGGACTGGCAAGGGCGCTTTGATTGCCAATCCTAATTGCTCAACCATCATCTGCTTGATGGCTGTCACTCCTCTGCATCGTCACGCTAAGGTTTTACGCATGGTTGTTAGTACATATCAAGCAGCTAGTGGTGCTGGTGCTGCAGCAATGGAAGAGCTTGTGCAACAGACTCGCGAGGTCCTGGAAGGAAAACCACCAACTTGTAATATATTTAGGGATCAGTATGCTTTCAATTTGTTCTCCCACAATGCATCTGTTCTTTCAAATGGATACAACGAAGAGGAAATGAAATTAGTAAAAGAAACTCGGAAAATATGGAGTGATGCTAATGTTAAAGTTACTGCCACTTGCATACGAGTTCCTGTCATGCGTGCACATGCTGAGAGTGTGAATCTTCAATTTGAGAATCCCCTTGATGAGGACACAGCCAGAGAGATTCTGAAAAATGCTCCTGGAGTCGTCATTATAGACGACCGGGCTGCCAACCAGTTTCCTACACCATTGAAAGTATCAAACAAAGATGATGTCGCTGTAGGAAGGATTCGACAGGATGTTTCACTTGATGGAAATAAAGGGTTGGATATCTTCGTCTGTGGTGATCAAATTCGCAAGGGTGCAGCGCTTAATGCTGTTCAGATCGCCGAGTTGTTACTGTAG

Protein sequence

MAALAFSGTSFFFSTASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL
BLAST of CmaCh04G010350 vs. Swiss-Prot
Match: DHAS_PROMA (Aspartate-semialdehyde dehydrogenase OS=Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) GN=asd PE=3 SV=2)

HSP 1 Score: 342.8 bits (878), Expect = 4.7e-93
Identity = 175/336 (52.08%), Postives = 245/336 (72.92%), Query Frame = 1

Query: 39  SLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFD 98
           +LAV+G +GAVG E L +L +R FP R +++LAS+RSAG+   F  E+ +V++++ + F+
Sbjct: 13  TLAVLGSSGAVGAEILKILEERSFPIRELRLLASERSAGQVQFFKGEDLVVKKVSPEGFE 72

Query: 99  GVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGT 158
            VD+ L SAGGSIS+K+  +    G ++VDNS+A+RM+ +VPLV+PEVNP      +V T
Sbjct: 73  DVDLVLASAGGSISRKWRKVINSAGAVIVDNSNAYRMEPDVPLVVPEVNPS-----QVFT 132

Query: 159 GKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLE 218
            KG LIANPNC+TI+  + + PL     + R+VVSTYQ+ASGAGA AM EL Q +++VL 
Sbjct: 133 HKG-LIANPNCTTILLALVLAPLSAQLPIKRVVVSTYQSASGAGARAMNELKQLSQDVLN 192

Query: 219 GKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPV 278
           G  P   I     AFNLF HN+ + SN Y EEEMK++ ETRKI + + + +TATC+RVPV
Sbjct: 193 GNIPKSEILPYSLAFNLFLHNSPLQSNNYCEEEMKMINETRKILNQSELAITATCVRVPV 252

Query: 279 MRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIR 338
           +RAH+ES+N++F  P   + AR+IL NA G+ +++D   N+FP P+ V+ KDD+AVGRIR
Sbjct: 253 LRAHSESINIEFAEPFPVEEARKILSNASGIKLLEDIQMNRFPMPIDVTGKDDIAVGRIR 312

Query: 339 QDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           QD+S    K L++++CGDQIRKGAALNA+QIAELLL
Sbjct: 313 QDLS--NPKALELWLCGDQIRKGAALNAIQIAELLL 340

BLAST of CmaCh04G010350 vs. Swiss-Prot
Match: DHAS_SYNY3 (Aspartate-semialdehyde dehydrogenase OS=Synechocystis sp. (strain PCC 6803 / Kazusa) GN=asd PE=3 SV=2)

HSP 1 Score: 340.9 bits (873), Expect = 1.8e-92
Identity = 172/335 (51.34%), Postives = 240/335 (71.64%), Query Frame = 1

Query: 40  LAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDG 99
           +A++G TGAVG E L +L+ R FP   +K+LAS RSAGK++ F  E+  ++ +   +F G
Sbjct: 7   VAILGATGAVGTELLELLASRNFPLAELKLLASPRSAGKTLEFQGEKLPIQAVDGSAFKG 66

Query: 100 VDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTG 159
            D+ L SAGGS SK++     + G ++VDNSSAFRM   VPLV+PE+NPEA +  +    
Sbjct: 67  CDLVLASAGGSTSKRWAEEITKAGAVMVDNSSAFRMVPEVPLVVPEINPEAAQNHQ---- 126

Query: 160 KGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEG 219
              +IANPNC+TI+  +A+ PLH+   + R+VV+TYQ+ASGAGA AMEE+  Q+R++LEG
Sbjct: 127 --GIIANPNCTTILMGVAIYPLHQLQPIKRIVVATYQSASGAGAMAMEEVKHQSRDILEG 186

Query: 220 KPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVM 279
           K P   I     AFNLF HN+ + +N Y EEEMK+V+ETRKI++  ++++TATC+RVPV+
Sbjct: 187 KIPQAEILPYPLAFNLFPHNSPITANHYCEEEMKMVQETRKIFAAEDIRITATCVRVPVL 246

Query: 280 RAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQ 339
           RAH+E+VNL+F  P   + A+  +  APGV +++D   N FP P+  + +DDV VGRIRQ
Sbjct: 247 RAHSEAVNLEFATPFPVELAKTAIAKAPGVKLVEDWQKNYFPMPMDATGQDDVLVGRIRQ 306

Query: 340 DVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           D+S     GLD+++CGDQIRKGAALNAVQIAELL+
Sbjct: 307 DIS--HPNGLDLWLCGDQIRKGAALNAVQIAELLV 333

BLAST of CmaCh04G010350 vs. Swiss-Prot
Match: DHAS_AQUAE (Aspartate-semialdehyde dehydrogenase OS=Aquifex aeolicus (strain VF5) GN=asd PE=3 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 1.9e-86
Identity = 171/340 (50.29%), Postives = 233/340 (68.53%), Query Frame = 1

Query: 37  GPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTAD- 96
           G  +A+VG TG VG+ FL VL +R FP   + + AS+RS GK + F  +E+ V+ L  + 
Sbjct: 2   GYRVAIVGATGEVGRTFLKVLEERNFPVDELVLYASERSEGKVLTFKGKEYTVKALNKEN 61

Query: 97  SFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIK 156
           SF G+DIALFSAGGS SK++ P  A+ G +V+DNSSA+RMD +VPLV+PEVNPE +K  K
Sbjct: 62  SFKGIDIALFSAGGSTSKEWAPKFAKDGVVVIDNSSAWRMDPDVPLVVPEVNPEDVKDFK 121

Query: 157 VGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTRE 216
               K  +IANPNCSTI  ++A+ P++  A + R+VVSTYQA SGAGA A+E+L  QT+ 
Sbjct: 122 ----KKGIIANPNCSTIQMVVALKPIYDKAGIKRVVVSTYQAVSGAGAKAIEDLKNQTKA 181

Query: 217 VLEGKP-PTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCI 276
             EGK  P    F  Q AFN   H      +GY +EE K++ ETRKI  D N+KV+ATC+
Sbjct: 182 WCEGKEMPKAQKFPHQIAFNALPHIDVFFEDGYTKEENKMLYETRKIMHDENIKVSATCV 241

Query: 277 RVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAV 336
           R+PV   H+ES++++ E  +  + ARE+LKNAPGV++ID+   N++P P+    +D+V V
Sbjct: 242 RIPVFYGHSESISMETEKEISPEEAREVLKNAPGVIVIDNPQNNEYPMPIMAEGRDEVFV 301

Query: 337 GRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           GRIR+D   +   GL ++V  D IRKGAA NAVQIAELL+
Sbjct: 302 GRIRKDRVFE--PGLSMWVVADNIRKGAATNAVQIAELLV 335

BLAST of CmaCh04G010350 vs. Swiss-Prot
Match: DHAS_LEGPN (Aspartate-semialdehyde dehydrogenase OS=Legionella pneumophila GN=asd PE=3 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 2.8e-77
Identity = 154/336 (45.83%), Postives = 218/336 (64.88%), Query Frame = 1

Query: 39  SLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFD 98
           ++A+VG TGAVG+ FL+VL +R FP +S+  LAS RS GK+V F D+E  V +L    F 
Sbjct: 6   NVAIVGATGAVGETFLTVLEERNFPIKSLYPLASSRSVGKTVTFRDQELDVLDLAEFDFS 65

Query: 99  GVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGT 158
            VD+ALFSAGG++SK++ P A   G +VVDN+S FR ++++PLV+P     + +      
Sbjct: 66  KVDLALFSAGGAVSKEYAPKAVAAGCVVVDNTSCFRYEDDIPLVVPGSESSSNRDYT--- 125

Query: 159 GKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLE 218
            K  +IANPNCSTI  ++A+ P++    + R+ V+TYQ+ SG G  A+ ELV Q  ++L 
Sbjct: 126 -KRGIIANPNCSTIQMVVALKPIYDAVGISRINVATYQSVSGTGKKAISELVAQVGDLLN 185

Query: 219 GKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPV 278
           G+P    ++  Q AFN   H      NGY  EEMK+V ETRKI  D ++ V  T +RVPV
Sbjct: 186 GRPANVQVYPQQIAFNALPHIDQFEDNGYTREEMKMVWETRKIMEDDSIMVNPTAVRVPV 245

Query: 279 MRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLK-VSNKDDVAVGRI 338
           +  H+E+V+L+ + PL  D AR +L  APGV ++D+ +   +PT +K     DDV VGRI
Sbjct: 246 IYGHSEAVHLELKKPLTADDARALLAKAPGVTVVDNLSKASYPTAIKNAVGHDDVFVGRI 305

Query: 339 RQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELL 374
           RQD+S     GL++++  D IRKGAA NAVQIAE+L
Sbjct: 306 RQDIS--HPCGLNLWIVADNIRKGAATNAVQIAEIL 335

BLAST of CmaCh04G010350 vs. Swiss-Prot
Match: DHAS2_VIBCH (Aspartate-semialdehyde dehydrogenase 2 OS=Vibrio cholerae serotype O1 (strain ATCC 39315 / El Tor Inaba N16961) GN=asd2 PE=1 SV=2)

HSP 1 Score: 277.7 bits (709), Expect = 1.8e-73
Identity = 150/337 (44.51%), Postives = 220/337 (65.28%), Query Frame = 1

Query: 39  SLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFD 98
           ++A+ G TGAVG+  L VL +R FP   + +LAS+RS GK+ RF+ +   V+ +    + 
Sbjct: 6   NVAIFGATGAVGETMLEVLQEREFPVDELFLLASERSEGKTYRFNGKTVRVQNVEEFDWS 65

Query: 99  GVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGT 158
            V IALFSAGG +S K+ P+AAE G +V+DN+S FR D ++PLV+PEVNPEA+   +   
Sbjct: 66  QVHIALFSAGGELSAKWAPIAAEAGVVVIDNTSHFRYDYDIPLVVPEVNPEAIAEFR--- 125

Query: 159 GKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLE 218
               +IANPNCSTI  L+A+ P++    + R+ V+TYQ+ SGAG A ++EL  QT ++L 
Sbjct: 126 -NRNIIANPNCSTIQMLVALKPIYDAVGIERINVTTYQSVSGAGKAGIDELAGQTAKLLN 185

Query: 219 GKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPV 278
           G P   N F  Q AFN        + NGY +EEMK+V ET+KI++D ++ V  TC+RVPV
Sbjct: 186 GYPAETNTFSQQIAFNCIPQIDQFMDNGYTKEEMKMVWETQKIFNDPSIMVNPTCVRVPV 245

Query: 279 MRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLK-VSNKDDVAVGRI 338
              HAE+V+++   P+D +   ++L+   G+ +   R A+ FPT ++    KD V VGR+
Sbjct: 246 FYGHAEAVHVETRAPIDAEQVMDMLEQTDGIELF--RGAD-FPTQVRDAGGKDHVLVGRV 305

Query: 339 RQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           R D+S   + G++++V  D +RKGAA NAVQIAELL+
Sbjct: 306 RNDIS--HHSGINLWVVADNVRKGAATNAVQIAELLV 333

BLAST of CmaCh04G010350 vs. TrEMBL
Match: A0A0A0KIR3_CUCSA (Aspartate-semialdehyde dehydrogenase OS=Cucumis sativus GN=Csa_5G021870 PE=3 SV=1)

HSP 1 Score: 671.8 bits (1732), Expect = 4.9e-190
Identity = 346/376 (92.02%), Postives = 355/376 (94.41%), Query Frame = 1

Query: 1   MAALAFSGT--SFFFSTASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLS 60
           MAAL+ S +  SFFFST+SP PK    S  VRMAYQEN PSLAVVGVTGAVGQEFLSVLS
Sbjct: 1   MAALSSSSSPSSFFFSTSSPHPKPTRFSTLVRMAYQENAPSLAVVGVTGAVGQEFLSVLS 60

Query: 61  DRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPL 120
           DR FPYRSIKMLASKRSAGK VRFH E+H+VEELTADSFDGVDIALFSAGGSISK FGPL
Sbjct: 61  DRDFPYRSIKMLASKRSAGKHVRFHGEDHVVEELTADSFDGVDIALFSAGGSISKHFGPL 120

Query: 121 AAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAV 180
           A  KGTIVVDNSSAFRMD NVPLVIPEVNPEAMKGIKVG GKGALIANPNCSTIICLMAV
Sbjct: 121 AVHKGTIVVDNSSAFRMDGNVPLVIPEVNPEAMKGIKVGNGKGALIANPNCSTIICLMAV 180

Query: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH 240
           TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH
Sbjct: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH 240

Query: 241 NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDT 300
           NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAES+NLQFENPLDE+T
Sbjct: 241 NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESINLQFENPLDENT 300

Query: 301 AREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQI 360
           AREILKNAPGVVIIDDR ANQFPTPLKVSNKDD+AVGRIRQDVSLDGNKGLDIF+CGDQI
Sbjct: 301 AREILKNAPGVVIIDDRKANQFPTPLKVSNKDDIAVGRIRQDVSLDGNKGLDIFICGDQI 360

Query: 361 RKGAALNAVQIAELLL 375
           RKGAALNAVQIAELLL
Sbjct: 361 RKGAALNAVQIAELLL 376

BLAST of CmaCh04G010350 vs. TrEMBL
Match: G7LBH4_MEDTR (Aspartate-semialdehyde dehydrogenase OS=Medicago truncatula GN=MTR_8g105860 PE=3 SV=1)

HSP 1 Score: 602.4 bits (1552), Expect = 3.6e-169
Identity = 304/360 (84.44%), Postives = 330/360 (91.67%), Query Frame = 1

Query: 15  TASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKR 74
           T  P+PK  S    VRM+ QEN P++AVVGVTGAVGQEFLSVLSDR FPY SIKMLASKR
Sbjct: 20  TTRPKPKYASAPGRVRMSLQENAPTIAVVGVTGAVGQEFLSVLSDRDFPYSSIKMLASKR 79

Query: 75  SAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFR 134
           SAG+ + F D+E++VEELTA+SFDGVDIALFSAGGSISK+FGP+A  +GTIVVDNSSAFR
Sbjct: 80  SAGRRMTFEDKEYVVEELTAESFDGVDIALFSAGGSISKEFGPIAVNRGTIVVDNSSAFR 139

Query: 135 MDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVST 194
           MDENVPLVIPEVNPEAM+ IKVG GKGALIANPNCSTIICLMA TPLHRHAKVLRMVVST
Sbjct: 140 MDENVPLVIPEVNPEAMEKIKVGMGKGALIANPNCSTIICLMAATPLHRHAKVLRMVVST 199

Query: 195 YQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKL 254
           YQAASGAGAAAMEEL  QTREVLEGKPPTC IF  QYAFNLFSHNASVLSNGYNEEEMKL
Sbjct: 200 YQAASGAGAAAMEELELQTREVLEGKPPTCKIFNRQYAFNLFSHNASVLSNGYNEEEMKL 259

Query: 255 VKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDD 314
           VKETRKIW+D +VKVTATCIRVPVMRAHAESVNLQFE PLDEDTAR+ILKN+PGVV+IDD
Sbjct: 260 VKETRKIWNDKDVKVTATCIRVPVMRAHAESVNLQFERPLDEDTARDILKNSPGVVVIDD 319

Query: 315 RAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 374
           R +N FPTPL+VSNKDDVAVGRIR+D+S DGN+GLDIF+CGDQIRKGAALNA+QIAELLL
Sbjct: 320 RESNNFPTPLEVSNKDDVAVGRIRRDLSQDGNQGLDIFICGDQIRKGAALNAIQIAELLL 379

BLAST of CmaCh04G010350 vs. TrEMBL
Match: K4AS92_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1)

HSP 1 Score: 602.1 bits (1551), Expect = 4.8e-169
Identity = 302/355 (85.07%), Postives = 331/355 (93.24%), Query Frame = 1

Query: 20  PKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKS 79
           P+S S    VRM+ QENGPS+AVVGVTGAVGQEFLSVLSDR FPYRS+K+LASKRSAGKS
Sbjct: 27  PRSSSSPFRVRMSLQENGPSVAVVGVTGAVGQEFLSVLSDRNFPYRSLKLLASKRSAGKS 86

Query: 80  VRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENV 139
           ++F + ++ VEELT DSFDG+DIALFSAGGSISKKFGPLAA+KGTIVVDNSSAFRMDENV
Sbjct: 87  MKFEERDYTVEELTEDSFDGIDIALFSAGGSISKKFGPLAAQKGTIVVDNSSAFRMDENV 146

Query: 140 PLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAAS 199
           PLVIPEVNPEAM  +K+G+GKGALIANPNCSTIICLMAVTPLHR AKV RMVVSTYQAAS
Sbjct: 147 PLVIPEVNPEAMAHVKLGSGKGALIANPNCSTIICLMAVTPLHRRAKVKRMVVSTYQAAS 206

Query: 200 GAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETR 259
           GAGAAAMEELVQQTREVLEGK PTCNIF  QYAFNLFSHNA V  NGYNEEEMKLVKETR
Sbjct: 207 GAGAAAMEELVQQTREVLEGKEPTCNIFNQQYAFNLFSHNAPVQPNGYNEEEMKLVKETR 266

Query: 260 KIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQ 319
           KIWSD +VKVTATCIRVPVMRAHAESVNLQFENPLDE+TAR+ILKNAPG+V++DDRA+N+
Sbjct: 267 KIWSDKDVKVTATCIRVPVMRAHAESVNLQFENPLDENTARDILKNAPGIVVVDDRASNR 326

Query: 320 FPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           FPTPL+VSNKDDVAVGR+R+DVS DG+ GLDIFVCGDQIRKGAALNA+QIAE+LL
Sbjct: 327 FPTPLEVSNKDDVAVGRVRRDVSQDGDYGLDIFVCGDQIRKGAALNAIQIAEMLL 381

BLAST of CmaCh04G010350 vs. TrEMBL
Match: I1KQD8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G047800 PE=3 SV=2)

HSP 1 Score: 602.1 bits (1551), Expect = 4.8e-169
Identity = 307/377 (81.43%), Postives = 338/377 (89.66%), Query Frame = 1

Query: 1   MAALAFSGTSFFFS---TASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVL 60
           M++L+ S  +  FS    A P+PK    S  +RM+ QENGPS+AVVGVTGAVGQEFLSVL
Sbjct: 3   MSSLSVSRHNHLFSGPLPARPKPKPSFSSSRIRMSLQENGPSIAVVGVTGAVGQEFLSVL 62

Query: 61  SDRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGP 120
           SDR FPY SIKMLASKRSAG+ + F D +++VEELTA+SFDGVDIALFSAGGSISK FGP
Sbjct: 63  SDRDFPYSSIKMLASKRSAGRRITFEDRDYVVEELTAESFDGVDIALFSAGGSISKYFGP 122

Query: 121 LAAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMA 180
           +A ++GT+VVDNSSAFRMDENVPLVIPEVNPEAM+ IK GTGKGALIANPNCSTIICLMA
Sbjct: 123 IAVDRGTVVVDNSSAFRMDENVPLVIPEVNPEAMQNIKAGTGKGALIANPNCSTIICLMA 182

Query: 181 VTPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFS 240
            TPLHR AKVLRMVVSTYQAASGAGAAAMEEL  QTREVLEGKPPTC IF  QYAFNLFS
Sbjct: 183 ATPLHRRAKVLRMVVSTYQAASGAGAAAMEELQLQTREVLEGKPPTCKIFNRQYAFNLFS 242

Query: 241 HNASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDED 300
           HNASVLSNGYNEEEMK+VKETRKIW++ +VKVTATCIRVPVMRAHAESVNLQFE PLDED
Sbjct: 243 HNASVLSNGYNEEEMKMVKETRKIWNNKDVKVTATCIRVPVMRAHAESVNLQFETPLDED 302

Query: 301 TAREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQ 360
           TAR+ILKNAPGVV+IDDR +N FPTPL+VSNKDDVAVGRIRQD+S DGN+GLDIFVCGDQ
Sbjct: 303 TARDILKNAPGVVVIDDRESNHFPTPLEVSNKDDVAVGRIRQDLSQDGNQGLDIFVCGDQ 362

Query: 361 IRKGAALNAVQIAELLL 375
           IRKGAALNA+QIAE+LL
Sbjct: 363 IRKGAALNAIQIAEMLL 379

BLAST of CmaCh04G010350 vs. TrEMBL
Match: A0A059D9F0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03437 PE=3 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 6.2e-169
Identity = 303/346 (87.57%), Postives = 326/346 (94.22%), Query Frame = 1

Query: 29  VRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKRSAGKSVRFHDEEHI 88
           +RM+ QENGPSLAVVGVTGAVGQEFLSVLSDR FPYRSIKMLASKRSAGK + F D +++
Sbjct: 32  IRMSLQENGPSLAVVGVTGAVGQEFLSVLSDRDFPYRSIKMLASKRSAGKRMTFEDRDYV 91

Query: 89  VEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFRMDENVPLVIPEVNP 148
           VEELTADSF+GVDIALFSAGGSISK+ GP+A EKGTIVVDNSSAFRM E VPLVIPEVNP
Sbjct: 92  VEELTADSFEGVDIALFSAGGSISKELGPIAVEKGTIVVDNSSAFRMVEGVPLVIPEVNP 151

Query: 149 EAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVSTYQAASGAGAAAMEE 208
           EAM+G+++GTGKGALIANPNCSTIICLMA TPLHR AKV+RMVVSTYQAASGAGAAAMEE
Sbjct: 152 EAMEGVRLGTGKGALIANPNCSTIICLMAATPLHRRAKVVRMVVSTYQAASGAGAAAMEE 211

Query: 209 LVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKLVKETRKIWSDANVK 268
           L  QTREVLEGKPPTCNIF+ QYAFNLFSHNA VLSNGYNEEEMKLVKETRKIW+D NVK
Sbjct: 212 LELQTREVLEGKPPTCNIFKLQYAFNLFSHNAPVLSNGYNEEEMKLVKETRKIWNDTNVK 271

Query: 269 VTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDDRAANQFPTPLKVSN 328
           VTATCIRVPVMRAHAESVNLQFENPLDE+TA++ILKNAPGVV+IDDRAAN FPTPL+VSN
Sbjct: 272 VTATCIRVPVMRAHAESVNLQFENPLDEETAKDILKNAPGVVVIDDRAANHFPTPLEVSN 331

Query: 329 KDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 375
           KDDVAVGRIR+DVS DGN GLDIFVCGDQIRKGAALNAVQIAE+LL
Sbjct: 332 KDDVAVGRIRRDVSQDGNYGLDIFVCGDQIRKGAALNAVQIAEMLL 377

BLAST of CmaCh04G010350 vs. TAIR10
Match: AT1G14810.1 (AT1G14810.1 semialdehyde dehydrogenase family protein)

HSP 1 Score: 598.2 bits (1541), Expect = 3.5e-171
Identity = 311/363 (85.67%), Postives = 325/363 (89.53%), Query Frame = 1

Query: 12  FFSTASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLA 71
           F S    + K R  S  V+M+ QE+ PSLAVVGVTGAVGQEFLSVLSDR FPY SIKMLA
Sbjct: 13  FLSRLPLRAKPRHFSARVKMSLQESAPSLAVVGVTGAVGQEFLSVLSDRDFPYSSIKMLA 72

Query: 72  SKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSS 131
           SKRSAGK V F   E+ VEELTADSF+GVDIALFSAGGSISK+FGPLAAEKGTIVVDNSS
Sbjct: 73  SKRSAGKRVAFDGHEYTVEELTADSFNGVDIALFSAGGSISKEFGPLAAEKGTIVVDNSS 132

Query: 132 AFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMV 191
           AFRM + VPLVIPEVNPEAMKGIKVG GKGALIANPNCSTIICLMAVTPLH HAKV RMV
Sbjct: 133 AFRMVDGVPLVIPEVNPEAMKGIKVGMGKGALIANPNCSTIICLMAVTPLHHHAKVKRMV 192

Query: 192 VSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEE 251
           VSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIF  QYAFNLFSHNA +L NGYNEEE
Sbjct: 193 VSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFGQQYAFNLFSHNAPILDNGYNEEE 252

Query: 252 MKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVI 311
           MKLVKETRKIW+D  VKVTATCIRVPVMRAHAESVNLQFENPLDE+TAREILK APGV I
Sbjct: 253 MKLVKETRKIWNDTEVKVTATCIRVPVMRAHAESVNLQFENPLDENTAREILKKAPGVYI 312

Query: 312 IDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAE 371
           IDDRA+N FPTPL VSNKDDVAVGRIR+DVS DGN GLDIFVCGDQIRKGAALNAVQIAE
Sbjct: 313 IDDRASNTFPTPLDVSNKDDVAVGRIRRDVSQDGNFGLDIFVCGDQIRKGAALNAVQIAE 372

Query: 372 LLL 375
           +LL
Sbjct: 373 MLL 375

BLAST of CmaCh04G010350 vs. NCBI nr
Match: gi|449466857|ref|XP_004151142.1| (PREDICTED: aspartate-semialdehyde dehydrogenase [Cucumis sativus])

HSP 1 Score: 671.8 bits (1732), Expect = 7.0e-190
Identity = 346/376 (92.02%), Postives = 355/376 (94.41%), Query Frame = 1

Query: 1   MAALAFSGT--SFFFSTASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLS 60
           MAAL+ S +  SFFFST+SP PK    S  VRMAYQEN PSLAVVGVTGAVGQEFLSVLS
Sbjct: 1   MAALSSSSSPSSFFFSTSSPHPKPTRFSTLVRMAYQENAPSLAVVGVTGAVGQEFLSVLS 60

Query: 61  DRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPL 120
           DR FPYRSIKMLASKRSAGK VRFH E+H+VEELTADSFDGVDIALFSAGGSISK FGPL
Sbjct: 61  DRDFPYRSIKMLASKRSAGKHVRFHGEDHVVEELTADSFDGVDIALFSAGGSISKHFGPL 120

Query: 121 AAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAV 180
           A  KGTIVVDNSSAFRMD NVPLVIPEVNPEAMKGIKVG GKGALIANPNCSTIICLMAV
Sbjct: 121 AVHKGTIVVDNSSAFRMDGNVPLVIPEVNPEAMKGIKVGNGKGALIANPNCSTIICLMAV 180

Query: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH 240
           TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH
Sbjct: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH 240

Query: 241 NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDT 300
           NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAES+NLQFENPLDE+T
Sbjct: 241 NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESINLQFENPLDENT 300

Query: 301 AREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQI 360
           AREILKNAPGVVIIDDR ANQFPTPLKVSNKDD+AVGRIRQDVSLDGNKGLDIF+CGDQI
Sbjct: 301 AREILKNAPGVVIIDDRKANQFPTPLKVSNKDDIAVGRIRQDVSLDGNKGLDIFICGDQI 360

Query: 361 RKGAALNAVQIAELLL 375
           RKGAALNAVQIAELLL
Sbjct: 361 RKGAALNAVQIAELLL 376

BLAST of CmaCh04G010350 vs. NCBI nr
Match: gi|659086788|ref|XP_008444110.1| (PREDICTED: aspartate-semialdehyde dehydrogenase [Cucumis melo])

HSP 1 Score: 671.8 bits (1732), Expect = 7.0e-190
Identity = 347/375 (92.53%), Postives = 355/375 (94.67%), Query Frame = 1

Query: 1   MAALAFSGT-SFFFSTASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSD 60
           MAAL+ S + S FFSTASP PK    SR VRMAYQEN PSLAVVGVTGAVGQEFLSVLSD
Sbjct: 1   MAALSSSSSPSSFFSTASPHPKPTRFSRLVRMAYQENAPSLAVVGVTGAVGQEFLSVLSD 60

Query: 61  RGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLA 120
           R FPYRSIKMLASKRSAGKSVRF  EEH+VEELTADSFDGVDIALFSAGGSISK FGPLA
Sbjct: 61  RDFPYRSIKMLASKRSAGKSVRFQGEEHVVEELTADSFDGVDIALFSAGGSISKHFGPLA 120

Query: 121 AEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVT 180
            EKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVG GKGALIANPNCSTIICLMAVT
Sbjct: 121 VEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGNGKGALIANPNCSTIICLMAVT 180

Query: 181 PLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHN 240
           PLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHN
Sbjct: 181 PLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHN 240

Query: 241 ASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTA 300
           ASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAES+NLQFENPLDE+ A
Sbjct: 241 ASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESINLQFENPLDENAA 300

Query: 301 REILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIR 360
           REILKN+PGVVIIDDR ANQFPTPLKVS KDD+AVGRIRQDVSLDGNKGLDIF+CGDQIR
Sbjct: 301 REILKNSPGVVIIDDRKANQFPTPLKVSKKDDIAVGRIRQDVSLDGNKGLDIFICGDQIR 360

Query: 361 KGAALNAVQIAELLL 375
           KGAALNAVQIAELLL
Sbjct: 361 KGAALNAVQIAELLL 375

BLAST of CmaCh04G010350 vs. NCBI nr
Match: gi|697107009|ref|XP_009607335.1| (PREDICTED: aspartate-semialdehyde dehydrogenase [Nicotiana tomentosiformis])

HSP 1 Score: 606.3 bits (1562), Expect = 3.6e-170
Identity = 309/364 (84.89%), Postives = 337/364 (92.58%), Query Frame = 1

Query: 14  STASPQPKSRSISRF---VRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKML 73
           +TA PQP+S +IS     VRM+ QENGPS+AVVGVTGAVGQEFLSVLSDR FPYRS+K+L
Sbjct: 18  ATAIPQPRSITISTSPFRVRMSLQENGPSVAVVGVTGAVGQEFLSVLSDRNFPYRSLKLL 77

Query: 74  ASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNS 133
           ASKRSAGKS++F D ++ VEELT DSF+G+DIALFSAGGSISKKFGPLAA+KGTIVVDNS
Sbjct: 78  ASKRSAGKSMKFEDRDYTVEELTEDSFEGIDIALFSAGGSISKKFGPLAAQKGTIVVDNS 137

Query: 134 SAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRM 193
           SAFRMDENVPLVIPEVNPEAM  +K+G+GKGALIANPNCSTIICLMAVTPLHR AKV RM
Sbjct: 138 SAFRMDENVPLVIPEVNPEAMAHLKLGSGKGALIANPNCSTIICLMAVTPLHRRAKVKRM 197

Query: 194 VVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEE 253
           VVSTYQAASGAGAAAMEELVQQTREVLEGK PTCNIF  QYAFNLFSHNA V  NGYNEE
Sbjct: 198 VVSTYQAASGAGAAAMEELVQQTREVLEGKEPTCNIFNQQYAFNLFSHNAPVQPNGYNEE 257

Query: 254 EMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVV 313
           EMKLVKETRKIW+D +VKVTATCIRVPVMRAHAESVNLQFE PLDEDTAR+ILKNAPG+V
Sbjct: 258 EMKLVKETRKIWNDMDVKVTATCIRVPVMRAHAESVNLQFEKPLDEDTARDILKNAPGIV 317

Query: 314 IIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIA 373
           IIDDRA+N+FPTPL+VSNKDDVAVGR+R+DVS DG+ GLDIFVCGDQIRKGAALNAVQIA
Sbjct: 318 IIDDRASNRFPTPLEVSNKDDVAVGRVRRDVSQDGDYGLDIFVCGDQIRKGAALNAVQIA 377

Query: 374 ELLL 375
           E+LL
Sbjct: 378 EMLL 381

BLAST of CmaCh04G010350 vs. NCBI nr
Match: gi|1021495624|ref|XP_016190828.1| (PREDICTED: aspartate-semialdehyde dehydrogenase [Arachis ipaensis])

HSP 1 Score: 603.2 bits (1554), Expect = 3.1e-169
Identity = 312/376 (82.98%), Postives = 340/376 (90.43%), Query Frame = 1

Query: 1   MAALAFSGTSFFFSTASP-QPKSRSISRF-VRMAYQENGPSLAVVGVTGAVGQEFLSVLS 60
           MA+L     + FFS + P +P  R  +   VRM+  E+GPS+AVVGVTGAVGQEFLSVLS
Sbjct: 1   MASLTLPRHTGFFSGSLPTRPTLRYAAPVKVRMSLSESGPSIAVVGVTGAVGQEFLSVLS 60

Query: 61  DRGFPYRSIKMLASKRSAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPL 120
           DR FPYRSIKMLASKRSAG+ + F   E++VEELTA+SF GVDIALFSAGGSISK+FGP+
Sbjct: 61  DRDFPYRSIKMLASKRSAGRRLTFEGNEYVVEELTAESFGGVDIALFSAGGSISKEFGPI 120

Query: 121 AAEKGTIVVDNSSAFRMDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAV 180
           AAEKGTIVVDNSSAFRMDE VPLVIPEVNPEAM GIK+GTGKGALIANPNCSTIICLMA 
Sbjct: 121 AAEKGTIVVDNSSAFRMDEKVPLVIPEVNPEAMHGIKLGTGKGALIANPNCSTIICLMAA 180

Query: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSH 240
           TPLHRHAKVLRMVVSTYQAASGAGAAAMEEL QQTREVLEG+PPTC IF+ QYAFN+FSH
Sbjct: 181 TPLHRHAKVLRMVVSTYQAASGAGAAAMEELEQQTREVLEGRPPTCKIFKQQYAFNIFSH 240

Query: 241 NASVLSNGYNEEEMKLVKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDT 300
           NASVLSNGYNEEEMK+VKETRKIW+D +VKVTATCIRVPVMRAHAESVNLQFE+PL EDT
Sbjct: 241 NASVLSNGYNEEEMKMVKETRKIWNDKDVKVTATCIRVPVMRAHAESVNLQFESPLHEDT 300

Query: 301 AREILKNAPGVVIIDDRAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQI 360
           AR+ILKNAPGVV+IDDR AN FPTPL+VSNKDDVAVGRIRQD+S DGNKGLDIFVCGDQI
Sbjct: 301 ARDILKNAPGVVVIDDRKANHFPTPLEVSNKDDVAVGRIRQDLSQDGNKGLDIFVCGDQI 360

Query: 361 RKGAALNAVQIAELLL 375
           RKGAALNAVQIAE+LL
Sbjct: 361 RKGAALNAVQIAEMLL 376

BLAST of CmaCh04G010350 vs. NCBI nr
Match: gi|357521393|ref|XP_003630985.1| (aspartate-semialdehyde dehydrogenase [Medicago truncatula])

HSP 1 Score: 602.4 bits (1552), Expect = 5.2e-169
Identity = 304/360 (84.44%), Postives = 330/360 (91.67%), Query Frame = 1

Query: 15  TASPQPKSRSISRFVRMAYQENGPSLAVVGVTGAVGQEFLSVLSDRGFPYRSIKMLASKR 74
           T  P+PK  S    VRM+ QEN P++AVVGVTGAVGQEFLSVLSDR FPY SIKMLASKR
Sbjct: 20  TTRPKPKYASAPGRVRMSLQENAPTIAVVGVTGAVGQEFLSVLSDRDFPYSSIKMLASKR 79

Query: 75  SAGKSVRFHDEEHIVEELTADSFDGVDIALFSAGGSISKKFGPLAAEKGTIVVDNSSAFR 134
           SAG+ + F D+E++VEELTA+SFDGVDIALFSAGGSISK+FGP+A  +GTIVVDNSSAFR
Sbjct: 80  SAGRRMTFEDKEYVVEELTAESFDGVDIALFSAGGSISKEFGPIAVNRGTIVVDNSSAFR 139

Query: 135 MDENVPLVIPEVNPEAMKGIKVGTGKGALIANPNCSTIICLMAVTPLHRHAKVLRMVVST 194
           MDENVPLVIPEVNPEAM+ IKVG GKGALIANPNCSTIICLMA TPLHRHAKVLRMVVST
Sbjct: 140 MDENVPLVIPEVNPEAMEKIKVGMGKGALIANPNCSTIICLMAATPLHRHAKVLRMVVST 199

Query: 195 YQAASGAGAAAMEELVQQTREVLEGKPPTCNIFRDQYAFNLFSHNASVLSNGYNEEEMKL 254
           YQAASGAGAAAMEEL  QTREVLEGKPPTC IF  QYAFNLFSHNASVLSNGYNEEEMKL
Sbjct: 200 YQAASGAGAAAMEELELQTREVLEGKPPTCKIFNRQYAFNLFSHNASVLSNGYNEEEMKL 259

Query: 255 VKETRKIWSDANVKVTATCIRVPVMRAHAESVNLQFENPLDEDTAREILKNAPGVVIIDD 314
           VKETRKIW+D +VKVTATCIRVPVMRAHAESVNLQFE PLDEDTAR+ILKN+PGVV+IDD
Sbjct: 260 VKETRKIWNDKDVKVTATCIRVPVMRAHAESVNLQFERPLDEDTARDILKNSPGVVVIDD 319

Query: 315 RAANQFPTPLKVSNKDDVAVGRIRQDVSLDGNKGLDIFVCGDQIRKGAALNAVQIAELLL 374
           R +N FPTPL+VSNKDDVAVGRIR+D+S DGN+GLDIF+CGDQIRKGAALNA+QIAELLL
Sbjct: 320 RESNNFPTPLEVSNKDDVAVGRIRRDLSQDGNQGLDIFICGDQIRKGAALNAIQIAELLL 379

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DHAS_PROMA4.7e-9352.08Aspartate-semialdehyde dehydrogenase OS=Prochlorococcus marinus (strain SARG / C... [more]
DHAS_SYNY31.8e-9251.34Aspartate-semialdehyde dehydrogenase OS=Synechocystis sp. (strain PCC 6803 / Kaz... [more]
DHAS_AQUAE1.9e-8650.29Aspartate-semialdehyde dehydrogenase OS=Aquifex aeolicus (strain VF5) GN=asd PE=... [more]
DHAS_LEGPN2.8e-7745.83Aspartate-semialdehyde dehydrogenase OS=Legionella pneumophila GN=asd PE=3 SV=1[more]
DHAS2_VIBCH1.8e-7344.51Aspartate-semialdehyde dehydrogenase 2 OS=Vibrio cholerae serotype O1 (strain AT... [more]
Match NameE-valueIdentityDescription
A0A0A0KIR3_CUCSA4.9e-19092.02Aspartate-semialdehyde dehydrogenase OS=Cucumis sativus GN=Csa_5G021870 PE=3 SV=... [more]
G7LBH4_MEDTR3.6e-16984.44Aspartate-semialdehyde dehydrogenase OS=Medicago truncatula GN=MTR_8g105860 PE=3... [more]
K4AS92_SOLLC4.8e-16985.07Uncharacterized protein OS=Solanum lycopersicum PE=3 SV=1[more]
I1KQD8_SOYBN4.8e-16981.43Uncharacterized protein OS=Glycine max GN=GLYMA_08G047800 PE=3 SV=2[more]
A0A059D9F0_EUCGR6.2e-16987.57Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03437 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G14810.13.5e-17185.67 semialdehyde dehydrogenase family protein[more]
Match NameE-valueIdentityDescription
gi|449466857|ref|XP_004151142.1|7.0e-19092.02PREDICTED: aspartate-semialdehyde dehydrogenase [Cucumis sativus][more]
gi|659086788|ref|XP_008444110.1|7.0e-19092.53PREDICTED: aspartate-semialdehyde dehydrogenase [Cucumis melo][more]
gi|697107009|ref|XP_009607335.1|3.6e-17084.89PREDICTED: aspartate-semialdehyde dehydrogenase [Nicotiana tomentosiformis][more]
gi|1021495624|ref|XP_016190828.1|3.1e-16982.98PREDICTED: aspartate-semialdehyde dehydrogenase [Arachis ipaensis][more]
gi|357521393|ref|XP_003630985.1|5.2e-16984.44aspartate-semialdehyde dehydrogenase [Medicago truncatula][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000534Semialdehyde_DH_NAD-bd
IPR005986Asp_semialdehyde_DH_beta
IPR012080Asp_semialdehyde_DH
IPR012280Semialdhyde_DH_dimer_dom
IPR016040NAD(P)-bd_dom
Vocabulary: Molecular Function
TermDefinition
GO:0016620oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:0051287NAD binding
GO:0004073aspartate-semialdehyde dehydrogenase activity
GO:0050661NADP binding
GO:0003942N-acetyl-gamma-glutamyl-phosphate reductase activity
GO:0046983protein dimerization activity
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
GO:0009086methionine biosynthetic process
GO:0009088threonine biosynthetic process
GO:0009089lysine biosynthetic process via diaminopimelate
GO:0009097isoleucine biosynthetic process
GO:0008652cellular amino acid biosynthetic process
Vocabulary: Cellular Component
TermDefinition
GO:0005737cytoplasm
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006544 glycine metabolic process
biological_process GO:0008652 cellular amino acid biosynthetic process
biological_process GO:0006563 L-serine metabolic process
biological_process GO:0009089 lysine biosynthetic process via diaminopimelate
biological_process GO:0009086 methionine biosynthetic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0006164 purine nucleotide biosynthetic process
biological_process GO:0009088 threonine biosynthetic process
biological_process GO:0009097 isoleucine biosynthetic process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009570 chloroplast stroma
molecular_function GO:0003942 N-acetyl-gamma-glutamyl-phosphate reductase activity
molecular_function GO:0051287 NAD binding
molecular_function GO:0050661 NADP binding
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0004073 aspartate-semialdehyde dehydrogenase activity
molecular_function GO:0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G010350.1CmaCh04G010350.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000534Semialdehyde dehydrogenase, NAD-bindingPFAMPF01118Semialdhyde_dhcoord: 40..153
score: 7.6
IPR000534Semialdehyde dehydrogenase, NAD-bindingSMARTSM00859Semialdhyde_dh_3coord: 39..154
score: 1.0
IPR005986Aspartate-semialdehyde dehydrogenase, beta-typeTIGRFAMsTIGR01296TIGR01296coord: 40..373
score: 3.7E
IPR012080Aspartate-semialdehyde dehydrogenaseHAMAPMF_02121ASADHcoord: 37..374
score: 34
IPR012080Aspartate-semialdehyde dehydrogenasePIRPIRSF000148ASA_dhcoord: 37..374
score: 4.5E
IPR012280Semialdehyde dehydrogenase, dimerisation domainPFAMPF02774Semialdhyde_dhCcoord: 179..360
score: 1.0
IPR016040NAD(P)-binding domainGENE3DG3DSA:3.40.50.720coord: 38..181
score: 7.0
IPR016040NAD(P)-binding domainunknownSSF51735NAD(P)-binding Rossmann-fold domainscoord: 40..196
score: 3.02
NoneNo IPR availableGENE3DG3DSA:3.30.360.10coord: 182..356
score: 3.3
NoneNo IPR availablePANTHERPTHR10174RETINALDEHYDE BINDING PROTEIN-RELATEDcoord: 81..374
score: 1.6
NoneNo IPR availablePANTHERPTHR10174:SF2MAJOR SPERM PROTEINcoord: 81..374
score: 1.6
NoneNo IPR availableunknownSSF55347Glyceraldehyde-3-phosphate dehydrogenase-like, C-terminal domaincoord: 168..361
score: 1.07