Cp4.1LG08g07080 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG08g07080
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPhospho-2-dehydro-3-deoxyheptonate aldolase
LocationCp4.1LG08 : 5525446 .. 5528694 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGTCCCCAACAAATTTGATACACAGACGTAAAGCGAGCCCATTATCCAGCCAGCAACAGAACCGACTTCAGAGTTTACCGGCGACTGGAGATTCACAAACAAGGTAAAAAATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCTGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCCTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACGCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTTTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCTATTATAAAGGTTGATTTCTGTTGGATTCTTGTCTTTTCCCTTCGGTTTCTGGTCAATTGCCTGGAACTTTGAATTCTATAGGATTATCTCTGATTATTTTATTCATATTTTACTTCACTGTTGATGACTTTTTGTGATATGCATTATGTGATCTCAATCCATGATTGTTTACACCTACCCTGCATTTGTTGGTATCTCGTTTTTTTAATTCATATTCACATTGGGAAGATATTGCAAATACTCTATCTAGATCTGTTCACTATCTTATCCAAATTCAGTCAGTCCATTGTTTTCAGATTCATTGAATTAAATGAATTCTGGGAATTTTTATTTGGTGTATCAATTTCTAAAAAGCTGTATTTGGATTATATTTAGATTCTACAACCCTGGAAGTGAAGATTTTGTATACAACATTAGCACGTTTTATGCTAAATTTGCTACCAAACACATATTCAAGAACCGAGTATACTAGTGTGTCGTCTATTAAATCCTGTCAAAGAGAACCGAACCGAACGCACCCTATATTTTTAGAATTTGGAATCTTTTTCTCTAAAACCATAGTCAAGCTTTGGTTGTAATGTAAAAGATGAGAAATGTTTGAGTTTGGAATTTCTTTGATATAAATTTGATCAACCAACTATGTTCACAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCGGATTCGTTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGACGCTTTCGACGAAAAATCTCGAACCCCAGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCCGTAGGCACGTTGAATCTTCTTAGAGCATTTGCCACTGGAGGATACGCTGCAATGCAGAGAGTTTCTCAATGGAATCTCGACTTCGTTCAACACAGTGAGCAAGGAGACAGGTGACTGTCATTTTGCAATTTATGCTCTTCAATAATTGCCTTACTTGAACAGATCTGTCCTGTAGACTATAGGTTGTACGGCTGTGTCGTTGAGTTCTAAATTTATGCTCTTCAGTATGAACTTAGCTCTCCATCTTCTCAAAAGGAGAAAGAAAAAGCTTTTTCCCCATTTCAAAATGAGTATCCAATTTGTTGGTCTTTTAGCCTCTTAATGTACTGAATGACCCTGAGGAGTGGCTTGGTGATAAGCACTTGCAAGGTTTTGAGATGCTTCTTAATGTCTTCGTTCTGATATTTCTATCTTCTAGAGTTGGCCTTTGGGTTAGACAAAACCGAGTCTAGTGATCGGTGTCCCATGAGCACCTAGTGTAATAGCTAGACCATGGACCATCCTATGATGAAGATTATCTTATTGGGTGCAGGTATAATTTACTAAATGAAGATTTACCTATTGGTTACAGGTATAAGGAACTTGCTCAGAGAGTTGATGAAGCGCTGGGGTTTATGACTGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATATGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTACCCAACAACAGCTGTTATTACTGGGATAGATTCACTGTTTGCATCTAGATACGAAGTGCTTGATCAATTCATAATCTGGGAGATACTTTGGTAGAATGTGTTTGATCAAGTCAGTGGCTTTGTTATGTAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCGGGGCTTATCGTCACATGGGTTAGCGATCCTATGCACGGCAATACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGTAAAAGCCTGTCTACCAACAAGAATCCACATTGCATACTCACTGTAAATCTTGATAGATATTGATGTTATCTTTTCATTTGACTTAGGCCGAGTTGAGAGCTTTCTTCGACGTTCATGAACAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCCTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGAATGCTTGTAGAAAATGGGTCTGTTGCTTAAGAATCTTCCATAATGCGTTATTTGCTCTATAATAACAACTTGCAACTCCAATATCTTTTCTGTTGAGGTAATAAATTTTACATACGGAGACTGAATGTTTGTCTAATCTGAATGATTTCTTGAAAGCTTTTGTTTGTGAGAATTCCTTGAAAGCTGTGTTAGGAACTTGGGAGATGGTTAAGTGCTTTCTGCCATCGCAAAATAAATCAAAATTCAGTACTTTATTAGTGTTTGTTTCACA

mRNA sequence

TGGTCCCCAACAAATTTGATACACAGACGTAAAGCGAGCCCATTATCCAGCCAGCAACAGAACCGACTTCAGAGTTTACCGGCGACTGGAGATTCACAAACAAGGTAAAAAATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCTGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCCTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACGCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTTTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCTATTATAAAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCGGATTCGTTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGACGCTTTCGACGAAAAATCTCGAACCCCAGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCCGTAGGCACGTATAAGGAACTTGCTCAGAGAGTTGATGAAGCGCTGGGGTTTATGACTGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATATGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCGGGGCTTATCGTCACATGGGTTAGCGATCCTATGCACGGCAATACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGCCGAGTTGAGAGCTTTCTTCGACGTTCATGAACAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCCTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGAATGCTTGTAGAAAATGGGTCTGTTGCTTAAGAATCTTCCATAATGCGTTATTTGCTCTATAATAACAACTTGCAACTCCAATATCTTTTCTGTTGAGGTAATAAATTTTACATACGGAGACTGAATGTTTGTCTAATCTGAATGATTTCTTGAAAGCTTTTGTTTGTGAGAATTCCTTGAAAGCTGTGTTAGGAACTTGGGAGATGGTTAAGTGCTTTCTGCCATCGCAAAATAAATCAAAATTCAGTACTTTATTAGTGTTTGTTTCACA

Coding sequence (CDS)

ATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCTGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCCTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACGCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTTTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCTATTATAAAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCGGATTCGTTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGACGCTTTCGACGAAAAATCTCGAACCCCAGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCCGTAGGCACGTATAAGGAACTTGCTCAGAGAGTTGATGAAGCGCTGGGGTTTATGACTGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATATGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCGGGGCTTATCGTCACATGGGTTAGCGATCCTATGCACGGCAATACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGCCGAGTTGAGAGCTTTCTTCGACGTTCATGAACAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCCTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGAATGCTTGTAGAAAATGGGTCTGTTGCTTAA

Protein sequence

MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTYKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA
BLAST of Cp4.1LG08g07080 vs. Swiss-Prot
Match: AROF_ORYSJ (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Oryza sativa subsp. japonica GN=DAHPS1 PE=2 SV=2)

HSP 1 Score: 701.0 bits (1808), Expect = 8.9e-201
Identity = 341/454 (75.11%), Postives = 383/454 (84.36%), Query Frame = 1

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
           + WA DSW++KKALQLPEYP+A ELE+VL+ +E+FPPIVFAGEAR LEE LA AA+G AF
Sbjct: 93  ARWAVDSWRTKKALQLPEYPNAAELEAVLKTIEAFPPIVFAGEARHLEERLADAAMGRAF 152

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKEFNGNNIRDTFRVLLQM  VLT+G QMP+IKVGRMAGQFAKPRS++FE
Sbjct: 153 LLQGGDCAESFKEFNGNNIRDTFRVLLQMSAVLTFGGQMPVIKVGRMAGQFAKPRSEAFE 212

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------ 240
            +DGVKLPSYRGDNIN DAF+EKSR PDPQR+VRAY QS  T                  
Sbjct: 213 ERDGVKLPSYRGDNINGDAFNEKSRIPDPQRMVRAYAQSAATLNLLRAFATGGYAAMQRV 272

Query: 241 ----------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYE 300
                           Y+ELA RVDEALGFM+AAG+T+DHP+M + DFWTSHECL LPYE
Sbjct: 273 TQWNLDFTQHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPLMTSTDFWTSHECLLLPYE 332

Query: 301 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 360
           Q+LTR+DSTTG +YDCSAHMLWVGERTRQLDGAHVEFLRGV+NPLGIKVSDKM+P ELV+
Sbjct: 333 QSLTRQDSTTGHFYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMNPTELVK 392

Query: 361 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420
           L EILNP NKPGR+TIITRMGA+NMRVKLPHLIRAVR AG IVTW++DPMHGNTIKAPCG
Sbjct: 393 LIEILNPSNKPGRITIITRMGAENMRVKLPHLIRAVRHAGQIVTWITDPMHGNTIKAPCG 452

Query: 421 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 480
           LKTR FDSI AE+RAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGS+ VTFDDL  RYHT
Sbjct: 453 LKTRPFDSI-AEVRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSRTVTFDDLGDRYHT 512

BLAST of Cp4.1LG08g07080 vs. Swiss-Prot
Match: AROG_ARATH (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thaliana GN=DHS2 PE=2 SV=2)

HSP 1 Score: 698.0 bits (1800), Expect = 7.5e-200
Identity = 350/491 (71.28%), Postives = 394/491 (80.24%), Query Frame = 1

Query: 22  TTKSFL---FKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPE 81
           TTKSFL     P  PI  +  F      +     ST   S S  W+ +SWKSKKALQLP+
Sbjct: 11  TTKSFLPYRHAPRRPISFSPVFA---VHSTDPKKSTQSASASVKWSLESWKSKKALQLPD 70

Query: 82  YPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNN 141
           YPD  +++SVL+ L SFPPIVFAGEARKLE+ L +AA+G+AF+LQGGDCAESFKEFN NN
Sbjct: 71  YPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAFMLQGGDCAESFKEFNANN 130

Query: 142 IRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINAD 201
           IRDTFRVLLQMG+VL +G Q+P+IKVGRMAGQFAKPRSD FE KDGVKLPSYRGDNIN D
Sbjct: 131 IRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFEEKDGVKLPSYRGDNINGD 190

Query: 202 AFDEKSRTPDPQRLVRAYLQSVGT----------------------------------YK 261
           AFDEKSR PDP R+VRAY QSV T                                  Y+
Sbjct: 191 AFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRVSQWNLDFTQHSEQGDRYR 250

Query: 262 ELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSA 321
           ELA RVDEALGFM AAG+T  HPIM T +FWTSHECL LPYEQALTREDST+GLYYDCSA
Sbjct: 251 ELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYEQALTREDSTSGLYYDCSA 310

Query: 322 HMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIIT 381
           HMLWVGERTRQLDGAHVEFLRG++NPLGIKVSDKM P+ELV+L EILNP NKPGR+T+I 
Sbjct: 311 HMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVKLIEILNPQNKPGRITVIV 370

Query: 382 RMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFD 441
           RMGA+NMRVKLP+LIRAVR AG IVTWVSDPMHGNTI AP GLKTRSFD+IRAELRAFFD
Sbjct: 371 RMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGGLKTRSFDAIRAELRAFFD 430

Query: 442 VHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAI 476
           VH+QEGS+PGGVHLEMTGQNVTECVGGS+ +T++DL+SRYHTHCDPRLNASQSLELAF I
Sbjct: 431 VHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFII 490

BLAST of Cp4.1LG08g07080 vs. Swiss-Prot
Match: AROG_SOLTU (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum tuberosum GN=SHKB PE=2 SV=1)

HSP 1 Score: 681.0 bits (1756), Expect = 9.5e-195
Identity = 339/509 (66.60%), Postives = 396/509 (77.80%), Query Frame = 1

Query: 3   ISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIP-SAVSFSSSTHFTSGSS 62
           ++ S  L   +S ++  +    + L +P F +    +  R  P SAV  +  +       
Sbjct: 1   MALSNTLSLSSSKSLVQSHLLHNPLPQPRFSLFPTTQHGRRHPISAVHAAEPSKTAVKQG 60

Query: 63  NWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFL 122
            W+ DSWK+KKALQLPEYPD  ELESVL+ LE  PP+VFAGEAR LEE L +AA+G+AFL
Sbjct: 61  KWSLDSWKTKKALQLPEYPDEKELESVLKTLEMNPPLVFAGEARSLEEKLGEAALGKAFL 120

Query: 123 LQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEV 182
           LQGGDCAESFKEFN NNIRDTFR+LLQM +VL +G Q+P+IKVGRMAGQFAKPRSD  E 
Sbjct: 121 LQGGDCAESFKEFNANNIRDTFRILLQMSVVLMFGGQVPVIKVGRMAGQFAKPRSDPLEE 180

Query: 183 KDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------- 242
            +GVKLPSY+GDNIN D FDEKSR PDP RL+RAY+QS  T                   
Sbjct: 181 INGVKLPSYKGDNINGDTFDEKSRIPDPHRLIRAYMQSAATLNLLRAFATGGYAAMQRVT 240

Query: 243 ---------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQ 302
                          Y+ELA RVDEALGFM AAG+T+DHPIM+T DFWTSHECL LPYEQ
Sbjct: 241 EWNLDFVENCEQGDRYQELAHRVDEALGFMAAAGLTVDHPIMSTTDFWTSHECLLLPYEQ 300

Query: 303 ALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQL 362
           ALTREDST+GL+YDCSAHM+WVGERTRQLDGAHVEFLRGV+NPLGIKVS KMDP EL++L
Sbjct: 301 ALTREDSTSGLFYDCSAHMVWVGERTRQLDGAHVEFLRGVANPLGIKVSQKMDPNELIKL 360

Query: 363 CEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGL 422
            +ILNP NKPGR+T+I RMGA+NMRVKL HL+RAVR AG IVTWV DPMHGNTIKAPCGL
Sbjct: 361 IDILNPANKPGRITVIVRMGAENMRVKLSHLVRAVRGAGQIVTWVCDPMHGNTIKAPCGL 420

Query: 423 KTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTH 477
           KTR+FDSI AE+RAFFDVHEQEGS+PGG+HLEMTGQNVTEC+GGS+ VT+DDL SRYHTH
Sbjct: 421 KTRAFDSILAEVRAFFDVHEQEGSHPGGIHLEMTGQNVTECIGGSRTVTYDDLGSRYHTH 480

BLAST of Cp4.1LG08g07080 vs. Swiss-Prot
Match: AROF_ARATH (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Arabidopsis thaliana GN=DHS1 PE=2 SV=2)

HSP 1 Score: 556.2 bits (1432), Expect = 3.5e-157
Identity = 295/503 (58.65%), Postives = 351/503 (69.78%), Query Frame = 1

Query: 16  AVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQ 75
           AV T P + + +   H   P  A+   ++  +V+ SSS     G+  W P+SWK KKALQ
Sbjct: 35  AVNTKPKSVNLVTAVHAAEP--ARNAVSVKESVASSSS-----GALKWTPESWKLKKALQ 94

Query: 76  LPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFN 135
           LP+YP+ANELESVL+ +E+FPPIVFAGEAR LEE LA AAVG+AFLLQGGDCAESFKEFN
Sbjct: 95  LPDYPNANELESVLKTIEAFPPIVFAGEARNLEERLADAAVGKAFLLQGGDCAESFKEFN 154

Query: 136 GNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNI 195
             NIRDTFRVLLQM +VLT+G Q+P+IKVGRMAGQFAKPRSD+FE KDGVKLPSY+GDNI
Sbjct: 155 ATNIRDTFRVLLQMSIVLTFGGQVPVIKVGRMAGQFAKPRSDAFEEKDGVKLPSYKGDNI 214

Query: 196 NADAFDEKSRTPDPQRLVRAYLQSVGTYKELAQRVDEALGFMTAAGITMDHPIMNTIDFW 255
           N D FDEKSR PDP R++RAY QS  T   L         F T     +       +DF 
Sbjct: 215 NGDTFDEKSRIPDPNRMIRAYTQSAATLNLLR-------AFATGGYAAIQRVTQWNLDFV 274

Query: 256 TSHECLHLPYEQALTRED------------------STTGLYYDCSAHMLWVGERTRQLD 315
              E     Y++   R D                  +TT  Y      +L   +   +LD
Sbjct: 275 EQSEQAD-RYQELANRVDEALGFMSACGLGTDHPLMTTTDFYTSHECLLLPYEQSLTRLD 334

Query: 316 GA------------------------HVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 375
                                     HVEFLRG++NPLGIKVS+KMDP ELV+L EILNP
Sbjct: 335 STSGLYYDCSAHMVWCGERTRQLDGAHVEFLRGIANPLGIKVSNKMDPFELVKLVEILNP 394

Query: 376 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 435
           +NKPGR+T+I RMGA+NMRVKLPHLIRAVR++G IVTWV DPMHGNTIKAPCGLKTR+FD
Sbjct: 395 NNKPGRITVIVRMGAENMRVKLPHLIRAVRRSGQIVTWVCDPMHGNTIKAPCGLKTRAFD 454

Query: 436 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 477
           SI AE+RAF DVHEQEGS+ GG+HLEMTGQNVTEC+GGS+ VT+DDL+SRYHTHCDPRLN
Sbjct: 455 SILAEVRAFLDVHEQEGSHAGGIHLEMTGQNVTECIGGSRTVTYDDLSSRYHTHCDPRLN 514

BLAST of Cp4.1LG08g07080 vs. Swiss-Prot
Match: AROF_CATRO (Probable phospho-2-dehydro-3-deoxyheptonate aldolase, chloroplastic OS=Catharanthus roseus GN=DHS1 PE=2 SV=2)

HSP 1 Score: 547.7 bits (1410), Expect = 1.3e-154
Identity = 293/487 (60.16%), Postives = 347/487 (71.25%), Query Frame = 1

Query: 1   MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGS 60
           ++ S   + P LT  +  TT   K    +   PI  A    +   + +S +++T      
Sbjct: 13  LLPSCKPHQPTLTFFSPSTTCQKKP---RSSRPISAAVHVTQPPKTPISSATATKRRLSL 72

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
            N   +SWKSKKALQLPEYPD  +L+ VL+ +E+FPP+VFAGEAR LEE LA+AA+G AF
Sbjct: 73  LNGVWESWKSKKALQLPEYPDEGKLDGVLKTIEAFPPLVFAGEARSLEEKLAQAAMGNAF 132

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKE      R  F+        LT+G Q P+IKVGRMAGQFAKPR D FE
Sbjct: 133 LLQGGDCAESFKELMPLYSR-YFQNTASDECRLTFGGQCPVIKVGRMAGQFAKPRLDPFE 192

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRL---VRAYLQSV-------------GTYK 240
            KDG+ L    G  +  +A+ +  +    + L   V  Y +S                Y+
Sbjct: 193 EKDGLWLSGANGWPVAWEAYCKLQQLSPSRALLLVVCCYAESHPMDLDFVEHSEQGDRYQ 252

Query: 241 ELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSA 300
           ELA RVDEALGFM A G+T+DHPIM T +FWTSHECL LPYEQALTREDST+GL+YDCSA
Sbjct: 253 ELAHRVDEALGFMDACGLTVDHPIMATTEFWTSHECLLLPYEQALTREDSTSGLFYDCSA 312

Query: 301 HMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIIT 360
           HMLWVGERTRQLDGAHVEFLRGV+NPLGIKVS KMDP ELV L EILNP NKPGR+T+I 
Sbjct: 313 HMLWVGERTRQLDGAHVEFLRGVANPLGIKVSQKMDPNELVNLIEILNPTNKPGRITVIV 372

Query: 361 RMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFD 420
           RMGA+NMRVKLPHLIRAVR AG IVTWV DPMHGNTIKAPCGLKTR+FD+I AE+RAF+D
Sbjct: 373 RMGAENMRVKLPHLIRAVRGAGQIVTWVCDPMHGNTIKAPCGLKTRAFDAILAEVRAFYD 432

Query: 421 VHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAI 472
           VHEQEG+ PG           TECVGGS+ +T+DD  +RYHTHCDPRLNASQSLELAF I
Sbjct: 433 VHEQEGTLPG-----------TECVGGSRTITYDDRQTRYHTHCDPRLNASQSLELAFII 484

BLAST of Cp4.1LG08g07080 vs. TrEMBL
Match: A0A0A0L679_CUCSA (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_3G073840 PE=3 SV=1)

HSP 1 Score: 891.3 bits (2302), Expect = 5.2e-256
Identity = 449/517 (86.85%), Postives = 459/517 (88.78%), Query Frame = 1

Query: 8   NLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDS 67
           NL P +SSA  TT   K FLFKPHF  PNAA FCRTIPSAVS SSSTHF SGSSNW P+S
Sbjct: 5   NLLPPSSSAAPTTSIAKCFLFKPHFFTPNAANFCRTIPSAVSSSSSTHFISGSSNWTPES 64

Query: 68  WKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 127
           WKSKKALQLP+YPD NEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC
Sbjct: 65  WKSKKALQLPQYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 124

Query: 128 AESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKL 187
           AESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGV+L
Sbjct: 125 AESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVEL 184

Query: 188 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------------- 247
           PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT                         
Sbjct: 185 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 244

Query: 248 ---------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRED 307
                    YKELAQRVDEALGFM AAGIT DHPIMNTIDFWTSHECLHLPYEQALTRED
Sbjct: 245 VQHSEQGDRYKELAQRVDEALGFMAAAGITTDHPIMNTIDFWTSHECLHLPYEQALTRED 304

Query: 308 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 367
           STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILNP
Sbjct: 305 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILNP 364

Query: 368 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 427
            N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD
Sbjct: 365 RNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 424

Query: 428 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 487
           SIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN
Sbjct: 425 SIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 484

Query: 488 ASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 491
           ASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 ASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 521

BLAST of Cp4.1LG08g07080 vs. TrEMBL
Match: B9SZ06_RICCO (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Ricinus communis GN=RCOM_0121060 PE=3 SV=1)

HSP 1 Score: 762.7 bits (1968), Expect = 2.8e-217
Identity = 387/504 (76.79%), Postives = 417/504 (82.74%), Query Frame = 1

Query: 6   STNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAP 65
           +T  PP +++A   T   K  +F  H       K   T  ++VS +SST     S +W+ 
Sbjct: 7   TTPKPPFSTAAAAAT---KPQIFSFHVGTIKRPK--PTFIASVS-TSSTSNNIASPDWSL 66

Query: 66  DSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGG 125
           DSWKSK A QLPEYPD  ELE+VL+ L +FPPIVFAGEARKLEE LA AAVG AFLLQGG
Sbjct: 67  DSWKSKPAKQLPEYPDQQELETVLQSLNNFPPIVFAGEARKLEERLASAAVGNAFLLQGG 126

Query: 126 DCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGV 185
           DCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQFAKPRSD FE+KDGV
Sbjct: 127 DCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQFAKPRSDPFEIKDGV 186

Query: 186 KLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT----------------------- 245
           KLPSYRGDNINADAFDEKSR PDPQRL+RAYLQSVGT                       
Sbjct: 187 KLPSYRGDNINADAFDEKSRRPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNL 246

Query: 246 -----------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTR 305
                      Y ELA+RVDEALGFM AAG+T+DHP+MNT +FWTSHECLHLPYEQALTR
Sbjct: 247 DFVLHSEQGDRYMELARRVDEALGFMAAAGLTVDHPVMNTTEFWTSHECLHLPYEQALTR 306

Query: 306 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEIL 365
           EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP ELV+LCEIL
Sbjct: 307 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPKELVKLCEIL 366

Query: 366 NPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRS 425
           NPHN+PGRLTII RMGADN+R+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTR 
Sbjct: 367 NPHNRPGRLTIIARMGADNLRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRP 426

Query: 426 FDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPR 476
           FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK VTFDDLNSRYHTHCDPR
Sbjct: 427 FDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTVTFDDLNSRYHTHCDPR 486

BLAST of Cp4.1LG08g07080 vs. TrEMBL
Match: D0VBC1_VITVI (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VIT_00s1217g00010 PE=2 SV=2)

HSP 1 Score: 758.1 bits (1956), Expect = 6.8e-216
Identity = 385/509 (75.64%), Postives = 413/509 (81.14%), Query Frame = 1

Query: 1   MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGS 60
           M ++ + NL   T  ++C             FP P         P  +S S S+     S
Sbjct: 1   MAVTGTANLAAPTPPSLCRL-----------FPNPRYLPTHTLKPRPISASLSS-IDIRS 60

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
            NW P SWKSKKA QLPEYPD  ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AF
Sbjct: 61  PNWTPGSWKSKKAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAF 120

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FE
Sbjct: 121 LLQGGDCAESFKEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFE 180

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------ 240
           VKDGVKLPSYRGDNIN+D FDEKSRTPDPQRL+RAYLQSVGT                  
Sbjct: 181 VKDGVKLPSYRGDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRV 240

Query: 241 ----------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYE 300
                           Y ELAQRVDEALGFM AAG+T DHPIMNTI+FWTSHECLHL YE
Sbjct: 241 SQWNLDFVQHSEQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYE 300

Query: 301 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 360
           QALTR+DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+
Sbjct: 301 QALTRQDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVK 360

Query: 361 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420
           LCEILNP NKPGRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG
Sbjct: 361 LCEILNPRNKPGRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420

Query: 421 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 476
           LKTRSFDSIR+ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHT
Sbjct: 421 LKTRSFDSIRSELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHT 480

BLAST of Cp4.1LG08g07080 vs. TrEMBL
Match: A5C138_VITVI (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VITISV_002672 PE=3 SV=1)

HSP 1 Score: 758.1 bits (1956), Expect = 6.8e-216
Identity = 385/509 (75.64%), Postives = 413/509 (81.14%), Query Frame = 1

Query: 1   MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGS 60
           M ++ + NL   T  ++C             FP P         P  +S S S+     S
Sbjct: 1   MAVTGTANLAAPTPPSLCRL-----------FPNPRYLPTHXLKPRPISASLSS-IDIRS 60

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
            NW P SWKSKKA QLPEYPD  ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AF
Sbjct: 61  PNWTPGSWKSKKAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAF 120

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FE
Sbjct: 121 LLQGGDCAESFKEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFE 180

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------ 240
           VKDGVKLPSYRGDNIN+D FDEKSRTPDPQRL+RAYLQSVGT                  
Sbjct: 181 VKDGVKLPSYRGDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRV 240

Query: 241 ----------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYE 300
                           Y ELAQRVDEALGFM AAG+T DHPIMNTI+FWTSHECLHL YE
Sbjct: 241 SQWNLDFVQHSEQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYE 300

Query: 301 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 360
           QALTR+DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+
Sbjct: 301 QALTRQDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVK 360

Query: 361 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420
           LCEILNP NKPGRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG
Sbjct: 361 LCEILNPRNKPGRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420

Query: 421 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 476
           LKTRSFDSIR+ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHT
Sbjct: 421 LKTRSFDSIRSELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHT 480

BLAST of Cp4.1LG08g07080 vs. TrEMBL
Match: A0A067KUZ2_JATCU (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Jatropha curcas GN=JCGZ_03503 PE=3 SV=1)

HSP 1 Score: 757.7 bits (1955), Expect = 8.9e-216
Identity = 384/512 (75.00%), Postives = 419/512 (81.84%), Query Frame = 1

Query: 1   MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGS 60
           M +SAS  L    ++A  ++P  K  +F+ H       K   T  ++VS SSS      S
Sbjct: 1   MALSASAKLTTAAAAASTSSPL-KPQIFQFH-ATTRTRKAKPTFVASVSTSSSASSNLIS 60

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
            NWA DSWKSK A QLPEYPD  ELE VL+ L +FPPIVFAGEARKLEE +A AAVG AF
Sbjct: 61  PNWALDSWKSKPAQQLPEYPDQQELELVLQTLNNFPPIVFAGEARKLEERIASAAVGNAF 120

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKEFN NNIRDTFRVLLQMG+ LT+GAQMPIIKVGRMAGQFAKPRS+ FE
Sbjct: 121 LLQGGDCAESFKEFNANNIRDTFRVLLQMGVALTFGAQMPIIKVGRMAGQFAKPRSEPFE 180

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------ 240
           +KDGVKLPSYRGDNINADAFDEKSR PDPQRL+RAYLQSVGT                  
Sbjct: 181 IKDGVKLPSYRGDNINADAFDEKSRIPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRV 240

Query: 241 ----------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYE 300
                           Y ELA+RVDEALGFM AAG+T+DHPIMNT +F+TSHECLHLPYE
Sbjct: 241 SQWNLDFVLHSEQGDRYMELARRVDEALGFMAAAGLTVDHPIMNTTEFYTSHECLHLPYE 300

Query: 301 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 360
           QALTREDST+GL+YDCSAHMLWVGERTRQL GAHVEFLRGVSNPLGIKVSDKMDP ELV+
Sbjct: 301 QALTREDSTSGLFYDCSAHMLWVGERTRQLGGAHVEFLRGVSNPLGIKVSDKMDPTELVK 360

Query: 361 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420
           LCEILNPHN+PGRLTIITRMGADN+R+KLPHLIRA+RQAGLIVTWVSDPMHGNTIKAPCG
Sbjct: 361 LCEILNPHNRPGRLTIITRMGADNLRIKLPHLIRAIRQAGLIVTWVSDPMHGNTIKAPCG 420

Query: 421 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 479
           LKTR FD+IRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK VTFDDLNSRYHT
Sbjct: 421 LKTRPFDNIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKAVTFDDLNSRYHT 480

BLAST of Cp4.1LG08g07080 vs. TAIR10
Match: AT4G33510.1 (AT4G33510.1 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase)

HSP 1 Score: 698.0 bits (1800), Expect = 4.2e-201
Identity = 350/491 (71.28%), Postives = 394/491 (80.24%), Query Frame = 1

Query: 22  TTKSFL---FKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPE 81
           TTKSFL     P  PI  +  F      +     ST   S S  W+ +SWKSKKALQLP+
Sbjct: 11  TTKSFLPYRHAPRRPISFSPVFA---VHSTDPKKSTQSASASVKWSLESWKSKKALQLPD 70

Query: 82  YPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNN 141
           YPD  +++SVL+ L SFPPIVFAGEARKLE+ L +AA+G+AF+LQGGDCAESFKEFN NN
Sbjct: 71  YPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAFMLQGGDCAESFKEFNANN 130

Query: 142 IRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINAD 201
           IRDTFRVLLQMG+VL +G Q+P+IKVGRMAGQFAKPRSD FE KDGVKLPSYRGDNIN D
Sbjct: 131 IRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFEEKDGVKLPSYRGDNINGD 190

Query: 202 AFDEKSRTPDPQRLVRAYLQSVGT----------------------------------YK 261
           AFDEKSR PDP R+VRAY QSV T                                  Y+
Sbjct: 191 AFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRVSQWNLDFTQHSEQGDRYR 250

Query: 262 ELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSA 321
           ELA RVDEALGFM AAG+T  HPIM T +FWTSHECL LPYEQALTREDST+GLYYDCSA
Sbjct: 251 ELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYEQALTREDSTSGLYYDCSA 310

Query: 322 HMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIIT 381
           HMLWVGERTRQLDGAHVEFLRG++NPLGIKVSDKM P+ELV+L EILNP NKPGR+T+I 
Sbjct: 311 HMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVKLIEILNPQNKPGRITVIV 370

Query: 382 RMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFD 441
           RMGA+NMRVKLP+LIRAVR AG IVTWVSDPMHGNTI AP GLKTRSFD+IRAELRAFFD
Sbjct: 371 RMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGGLKTRSFDAIRAELRAFFD 430

Query: 442 VHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAI 476
           VH+QEGS+PGGVHLEMTGQNVTECVGGS+ +T++DL+SRYHTHCDPRLNASQSLELAF I
Sbjct: 431 VHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFII 490

BLAST of Cp4.1LG08g07080 vs. TAIR10
Match: AT4G39980.1 (AT4G39980.1 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1)

HSP 1 Score: 556.2 bits (1432), Expect = 2.0e-158
Identity = 295/503 (58.65%), Postives = 351/503 (69.78%), Query Frame = 1

Query: 16  AVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQ 75
           AV T P + + +   H   P  A+   ++  +V+ SSS     G+  W P+SWK KKALQ
Sbjct: 35  AVNTKPKSVNLVTAVHAAEP--ARNAVSVKESVASSSS-----GALKWTPESWKLKKALQ 94

Query: 76  LPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFN 135
           LP+YP+ANELESVL+ +E+FPPIVFAGEAR LEE LA AAVG+AFLLQGGDCAESFKEFN
Sbjct: 95  LPDYPNANELESVLKTIEAFPPIVFAGEARNLEERLADAAVGKAFLLQGGDCAESFKEFN 154

Query: 136 GNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNI 195
             NIRDTFRVLLQM +VLT+G Q+P+IKVGRMAGQFAKPRSD+FE KDGVKLPSY+GDNI
Sbjct: 155 ATNIRDTFRVLLQMSIVLTFGGQVPVIKVGRMAGQFAKPRSDAFEEKDGVKLPSYKGDNI 214

Query: 196 NADAFDEKSRTPDPQRLVRAYLQSVGTYKELAQRVDEALGFMTAAGITMDHPIMNTIDFW 255
           N D FDEKSR PDP R++RAY QS  T   L         F T     +       +DF 
Sbjct: 215 NGDTFDEKSRIPDPNRMIRAYTQSAATLNLLR-------AFATGGYAAIQRVTQWNLDFV 274

Query: 256 TSHECLHLPYEQALTRED------------------STTGLYYDCSAHMLWVGERTRQLD 315
              E     Y++   R D                  +TT  Y      +L   +   +LD
Sbjct: 275 EQSEQAD-RYQELANRVDEALGFMSACGLGTDHPLMTTTDFYTSHECLLLPYEQSLTRLD 334

Query: 316 GA------------------------HVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 375
                                     HVEFLRG++NPLGIKVS+KMDP ELV+L EILNP
Sbjct: 335 STSGLYYDCSAHMVWCGERTRQLDGAHVEFLRGIANPLGIKVSNKMDPFELVKLVEILNP 394

Query: 376 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 435
           +NKPGR+T+I RMGA+NMRVKLPHLIRAVR++G IVTWV DPMHGNTIKAPCGLKTR+FD
Sbjct: 395 NNKPGRITVIVRMGAENMRVKLPHLIRAVRRSGQIVTWVCDPMHGNTIKAPCGLKTRAFD 454

Query: 436 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 477
           SI AE+RAF DVHEQEGS+ GG+HLEMTGQNVTEC+GGS+ VT+DDL+SRYHTHCDPRLN
Sbjct: 455 SILAEVRAFLDVHEQEGSHAGGIHLEMTGQNVTECIGGSRTVTYDDLSSRYHTHCDPRLN 514

BLAST of Cp4.1LG08g07080 vs. TAIR10
Match: AT1G22410.1 (AT1G22410.1 Class-II DAHP synthetase family protein)

HSP 1 Score: 553.9 bits (1426), Expect = 9.9e-158
Identity = 292/488 (59.84%), Postives = 340/488 (69.67%), Query Frame = 1

Query: 30  PHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPEYPDANELESVL 89
           P  P  ++A    T P+ ++     +   G   WAP+SW++KKALQ P+YPD   LE+VL
Sbjct: 44  PKTPAASSASAATTTPATLTKPVGVNV--GKGKWAPESWRTKKALQQPDYPDLAALEAVL 103

Query: 90  RVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQM 149
             +E+FPPIVFAGEAR LEE L +AA+GEAFLLQGGDCAESFKEFN NNIRDTFR+LLQM
Sbjct: 104 ETIEAFPPIVFAGEARLLEERLGQAAMGEAFLLQGGDCAESFKEFNANNIRDTFRILLQM 163

Query: 150 GMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDP 209
           G VL +G Q+P++KVGRMAGQFAKPRSDSFE KDGVKLPSYRGDNIN DAFD KSR PDP
Sbjct: 164 GAVLMFGGQVPVVKVGRMAGQFAKPRSDSFEEKDGVKLPSYRGDNINGDAFDSKSRIPDP 223

Query: 210 QRLVRAYLQSVGTYKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTS------------ 269
           QR++RAY QS  T   L         F T     M       +DF               
Sbjct: 224 QRMIRAYCQSAATLNLLR-------AFATGGYAAMQRVTQWNLDFTERSEQGDRYRELAN 283

Query: 270 --HECLHLPYEQALTRED---STTGLYYDCSAHMLWVGERTRQLD---------GAHV-- 329
              E L   +   LT +     TT  +      +L   +   +LD          AH+  
Sbjct: 284 RVDEALGFMHAAGLTLDHPIMQTTDFWTSHECLLLPYEQSLTRLDSTSGLYYDCSAHMIW 343

Query: 330 -------------EFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIITRMGA 389
                        EFLRGV+NPLGIKVSDKMDP ELV+L EILN  NKPGR+TIITRMGA
Sbjct: 344 VGERTRQLDGAHVEFLRGVANPLGIKVSDKMDPKELVKLIEILNADNKPGRITIITRMGA 403

Query: 390 DNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQ 449
           +NMRVKLPHLIR VR+AG IVTWVSDPMHGNTIKAPCGLKTR FD+I AE+RAFFDVHEQ
Sbjct: 404 ENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGLKTRPFDAILAEVRAFFDVHEQ 463

Query: 450 EGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAISQRL 477
           EGS+PGG+HLEMTGQNVTEC+GGS+ VTFDDL SRYHTHCDPRLNASQSLEL+F I++RL
Sbjct: 464 EGSHPGGIHLEMTGQNVTECIGGSRTVTFDDLGSRYHTHCDPRLNASQSLELSFIIAERL 522

BLAST of Cp4.1LG08g07080 vs. NCBI nr
Match: gi|659096501|ref|XP_008449130.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis melo])

HSP 1 Score: 892.1 bits (2304), Expect = 4.3e-256
Identity = 451/518 (87.07%), Postives = 459/518 (88.61%), Query Frame = 1

Query: 8   NLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSS-THFTSGSSNWAPD 67
           NL P T S   TT   K FLFKPHF  PNAAKFCRTIPSAVS SSS THF SGSSNW P+
Sbjct: 5   NLLPPTPSTAPTTSIAKCFLFKPHFSTPNAAKFCRTIPSAVSSSSSSTHFVSGSSNWTPE 64

Query: 68  SWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD 127
           SWKSK+ALQLPEYPD NEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD
Sbjct: 65  SWKSKRALQLPEYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD 124

Query: 128 CAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVK 187
           CAESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGVK
Sbjct: 125 CAESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVK 184

Query: 188 LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------------ 247
           LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT                        
Sbjct: 185 LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLD 244

Query: 248 ----------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE 307
                     YKELAQRVDEALGFM AAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE
Sbjct: 245 FVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE 304

Query: 308 DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILN 367
           DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILN
Sbjct: 305 DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILN 364

Query: 368 PHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF 427
           P N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF
Sbjct: 365 PRNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF 424

Query: 428 DSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL 487
           DSIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL
Sbjct: 425 DSIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL 484

Query: 488 NASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 491
           NASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 NASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 522

BLAST of Cp4.1LG08g07080 vs. NCBI nr
Match: gi|449463236|ref|XP_004149340.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 891.3 bits (2302), Expect = 7.4e-256
Identity = 449/517 (86.85%), Postives = 459/517 (88.78%), Query Frame = 1

Query: 8   NLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDS 67
           NL P +SSA  TT   K FLFKPHF  PNAA FCRTIPSAVS SSSTHF SGSSNW P+S
Sbjct: 5   NLLPPSSSAAPTTSIAKCFLFKPHFFTPNAANFCRTIPSAVSSSSSTHFISGSSNWTPES 64

Query: 68  WKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 127
           WKSKKALQLP+YPD NEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC
Sbjct: 65  WKSKKALQLPQYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 124

Query: 128 AESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKL 187
           AESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGV+L
Sbjct: 125 AESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVEL 184

Query: 188 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------------- 247
           PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT                         
Sbjct: 185 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 244

Query: 248 ---------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRED 307
                    YKELAQRVDEALGFM AAGIT DHPIMNTIDFWTSHECLHLPYEQALTRED
Sbjct: 245 VQHSEQGDRYKELAQRVDEALGFMAAAGITTDHPIMNTIDFWTSHECLHLPYEQALTRED 304

Query: 308 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 367
           STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILNP
Sbjct: 305 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILNP 364

Query: 368 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 427
            N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD
Sbjct: 365 RNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 424

Query: 428 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 487
           SIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN
Sbjct: 425 SIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 484

Query: 488 ASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 491
           ASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 ASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 521

BLAST of Cp4.1LG08g07080 vs. NCBI nr
Match: gi|255580809|ref|XP_002531225.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic [Ricinus communis])

HSP 1 Score: 762.7 bits (1968), Expect = 4.0e-217
Identity = 387/504 (76.79%), Postives = 417/504 (82.74%), Query Frame = 1

Query: 6   STNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAP 65
           +T  PP +++A   T   K  +F  H       K   T  ++VS +SST     S +W+ 
Sbjct: 7   TTPKPPFSTAAAAAT---KPQIFSFHVGTIKRPK--PTFIASVS-TSSTSNNIASPDWSL 66

Query: 66  DSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGG 125
           DSWKSK A QLPEYPD  ELE+VL+ L +FPPIVFAGEARKLEE LA AAVG AFLLQGG
Sbjct: 67  DSWKSKPAKQLPEYPDQQELETVLQSLNNFPPIVFAGEARKLEERLASAAVGNAFLLQGG 126

Query: 126 DCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGV 185
           DCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQFAKPRSD FE+KDGV
Sbjct: 127 DCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQFAKPRSDPFEIKDGV 186

Query: 186 KLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT----------------------- 245
           KLPSYRGDNINADAFDEKSR PDPQRL+RAYLQSVGT                       
Sbjct: 187 KLPSYRGDNINADAFDEKSRRPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNL 246

Query: 246 -----------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYEQALTR 305
                      Y ELA+RVDEALGFM AAG+T+DHP+MNT +FWTSHECLHLPYEQALTR
Sbjct: 247 DFVLHSEQGDRYMELARRVDEALGFMAAAGLTVDHPVMNTTEFWTSHECLHLPYEQALTR 306

Query: 306 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEIL 365
           EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP ELV+LCEIL
Sbjct: 307 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPKELVKLCEIL 366

Query: 366 NPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRS 425
           NPHN+PGRLTII RMGADN+R+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTR 
Sbjct: 367 NPHNRPGRLTIIARMGADNLRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRP 426

Query: 426 FDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPR 476
           FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK VTFDDLNSRYHTHCDPR
Sbjct: 427 FDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTVTFDDLNSRYHTHCDPR 486

BLAST of Cp4.1LG08g07080 vs. NCBI nr
Match: gi|743908215|ref|XP_011047555.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Populus euphratica])

HSP 1 Score: 759.2 bits (1959), Expect = 4.4e-216
Identity = 375/462 (81.17%), Postives = 399/462 (86.36%), Query Frame = 1

Query: 51  SSSTHFTSGSSNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEES 110
           S+ T   + SSNWA DSWKSK A QLP+YPD  EL SVL+ L +FPPIVFAGEARKLEE 
Sbjct: 44  STPTSKPTISSNWALDSWKSKPARQLPDYPDPVELHSVLQTLTNFPPIVFAGEARKLEER 103

Query: 111 LAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQ 170
           +A AAVG AFLLQGGDCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQ
Sbjct: 104 IASAAVGGAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQ 163

Query: 171 FAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVG--------- 230
           FAKPRSD FE KDGVKLPSYRGDNINADAFDEKSRTPDPQRL+RAYLQSVG         
Sbjct: 164 FAKPRSDPFEEKDGVKLPSYRGDNINADAFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFA 223

Query: 231 -------------------------TYKELAQRVDEALGFMTAAGITMDHPIMNTIDFWT 290
                                     + ELA+RVDEALGFM AAG+T+DHP+MNT +FWT
Sbjct: 224 TGGYAAMQRVSQWNLDFVEHSEQGDRFMELARRVDEALGFMAAAGLTIDHPVMNTTEFWT 283

Query: 291 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 350
           SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS
Sbjct: 284 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 343

Query: 351 DKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPM 410
           DKMDP ELV+LCEILNPHN+PGRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPM
Sbjct: 344 DKMDPKELVKLCEILNPHNRPGRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPM 403

Query: 411 HGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVT 470
           HGNTIKAPCGLKTR FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK +T
Sbjct: 404 HGNTIKAPCGLKTRPFDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTIT 463

Query: 471 FDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSKPG 479
           FDDLNSRYHTHCDPRLNASQSLELAFAIS+RLR+KR+ +  G
Sbjct: 464 FDDLNSRYHTHCDPRLNASQSLELAFAISERLRKKRLRAGDG 505

BLAST of Cp4.1LG08g07080 vs. NCBI nr
Match: gi|147853875|emb|CAN79559.1| (hypothetical protein VITISV_002672 [Vitis vinifera])

HSP 1 Score: 758.1 bits (1956), Expect = 9.8e-216
Identity = 385/509 (75.64%), Postives = 413/509 (81.14%), Query Frame = 1

Query: 1   MIISASTNLPPLTSSAVCTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGS 60
           M ++ + NL   T  ++C             FP P         P  +S S S+     S
Sbjct: 1   MAVTGTANLAAPTPPSLCRL-----------FPNPRYLPTHXLKPRPISASLSS-IDIRS 60

Query: 61  SNWAPDSWKSKKALQLPEYPDANELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAF 120
            NW P SWKSKKA QLPEYPD  ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AF
Sbjct: 61  PNWTPGSWKSKKAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAF 120

Query: 121 LLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFE 180
           LLQGGDCAESFKEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FE
Sbjct: 121 LLQGGDCAESFKEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFE 180

Query: 181 VKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGT------------------ 240
           VKDGVKLPSYRGDNIN+D FDEKSRTPDPQRL+RAYLQSVGT                  
Sbjct: 181 VKDGVKLPSYRGDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRV 240

Query: 241 ----------------YKELAQRVDEALGFMTAAGITMDHPIMNTIDFWTSHECLHLPYE 300
                           Y ELAQRVDEALGFM AAG+T DHPIMNTI+FWTSHECLHL YE
Sbjct: 241 SQWNLDFVQHSEQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYE 300

Query: 301 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 360
           QALTR+DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+
Sbjct: 301 QALTRQDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVK 360

Query: 361 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420
           LCEILNP NKPGRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG
Sbjct: 361 LCEILNPRNKPGRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 420

Query: 421 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 476
           LKTRSFDSIR+ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHT
Sbjct: 421 LKTRSFDSIRSELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHT 480

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AROF_ORYSJ8.9e-20175.11Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Oryza sativa sub... [more]
AROG_ARATH7.5e-20071.28Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thal... [more]
AROG_SOLTU9.5e-19566.60Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum tuberosu... [more]
AROF_ARATH3.5e-15758.65Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Arabidopsis thal... [more]
AROF_CATRO1.3e-15460.16Probable phospho-2-dehydro-3-deoxyheptonate aldolase, chloroplastic OS=Catharant... [more]
Match NameE-valueIdentityDescription
A0A0A0L679_CUCSA5.2e-25686.85Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_3G073840 P... [more]
B9SZ06_RICCO2.8e-21776.79Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Ricinus communis GN=RCOM_0121060 ... [more]
D0VBC1_VITVI6.8e-21675.64Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VIT_00s1217g000... [more]
A5C138_VITVI6.8e-21675.64Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VITISV_002672 P... [more]
A0A067KUZ2_JATCU8.9e-21675.00Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Jatropha curcas GN=JCGZ_03503 PE=... [more]
Match NameE-valueIdentityDescription
AT4G33510.14.2e-20171.28 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase[more]
AT4G39980.12.0e-15858.65 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1[more]
AT1G22410.19.9e-15859.84 Class-II DAHP synthetase family protein[more]
Match NameE-valueIdentityDescription
gi|659096501|ref|XP_008449130.1|4.3e-25687.07PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu... [more]
gi|449463236|ref|XP_004149340.1|7.4e-25686.85PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu... [more]
gi|255580809|ref|XP_002531225.1|4.0e-21776.79PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic [Ricinus... [more]
gi|743908215|ref|XP_011047555.1|4.4e-21681.17PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Po... [more]
gi|147853875|emb|CAN79559.1|9.8e-21675.64hypothetical protein VITISV_002672 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0009073aromatic amino acid family biosynthetic process
Vocabulary: Molecular Function
TermDefinition
GO:00038493-deoxy-7-phosphoheptulonate synthase activity
Vocabulary: INTERPRO
TermDefinition
IPR002480DAHP_synth_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009094 L-phenylalanine biosynthetic process
biological_process GO:0000162 tryptophan biosynthetic process
biological_process GO:0006571 tyrosine biosynthetic process
biological_process GO:0009073 aromatic amino acid family biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003849 3-deoxy-7-phosphoheptulonate synthase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG08g07080.1Cp4.1LG08g07080.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002480DAHP synthetase, class IIPANTHERPTHR21337PHOSPHO-2-DEHYDRO-3-DEOXYHEPTONATE ALDOLASE 1, 2coord: 1..475
score:
IPR002480DAHP synthetase, class IIPFAMPF01474DAHP_synth_2coord: 222..465
score: 9.0E
NoneNo IPR availablePANTHERPTHR21337:SF3SUBFAMILY NOT NAMEDcoord: 1..475
score:
NoneNo IPR availableunknownSSF51569Aldolasecoord: 52..469
score: 5.05E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG08g07080CmaCh16G011030Cucurbita maxima (Rimu)cmacpeB364
Cp4.1LG08g07080CmaCh06G008180Cucurbita maxima (Rimu)cmacpeB852
Cp4.1LG08g07080CmoCh06G008400Cucurbita moschata (Rifu)cmocpeB797
Cp4.1LG08g07080Carg12049Silver-seed gourdcarcpeB0808
The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG08g07080Cp4.1LG14g09090Cucurbita pepo (Zucchini)cpecpeB250