CmoCh06G008400.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh06G008400.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPhospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic
LocationCmo_Chr06 : 4669308 .. 4673217 (-)
Sequence length1844
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCCCCAACAAATTTGATACACAGACGTAAAGCGAGCCCATTATCCAGCCAGCAACAGAACCGACTTCAGAGTTTACCGGCGACTGGAGACTCACAAACAAGGTAAAAAATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCCGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCTTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACCCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTCTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCCATTATAAAGGTTGATTTCTGTCGGATTCTTGAATTTTATACGATTATCTCCGATTATTTTATTCATATGTTACTTCACTGTTGATGACTTATTGTGATATGCATGATGTGATCTCAATCCATGATTGTTTACACGTACCCTGCATTTGTTGGTATCTCGTTTTTTGAATTCATATTCACATTGGGAAGATATTGCAAATACTCTATCTAGATCTGTTCACTATCTTATCCAAATTCAGTCAGTCCATTGTTTTCAGATTCATTGAATTAAACGAATTCTGGGAATTTTTATTTGGTATATCAATTTCTAAAAGCTGTATTTGGATTATATTTAGATTCTACAACCCTGGAAGTGAAGATTTTGAAAACAACATTAGCATGTTTTATGCTAAATTTTCTACCAAACACATATTCAATATCCGAGTATACCAGTCTGTCGTCTATTAAATCCTTTCAAAGATGAGAAATGTTTGAGTTTGGAATTTCTTTGATATAAATTTGATCAACCAACTATGTTCACAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCAGATTCATTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGATGCTTTCGACGAGAAATCTCGAACCCCGGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCTGTAGGCACTTTGAATCTTCTTAGAGCATTTGCCACTGGAGGATACGCTGCAATGCAGAGAGTTTCTCAATGGAATCTCGACTTCGTTCAACACAGTGAGCAAGGAGACAGGTGACTGTCATTTTACAATTTATGCTCTTCAATAATTGCCTTACTTGAACAGATCTGTCCTGTAGACTATAGGTTGTACGGCTGCGTCCTTGAGATCTAAATTTATGATCTTCATTATGAACTCAGCCTCTCCATCTTCTCAAAATGAGAAAGAAAAAGCCTTTTCCCTGTTTCAAAAGGAGTGTCCAATTTGTTGGTCTTTTAGCCTCATGTACTGAATGACCCTGAGGAGTGGCTTGGTGATAAGTGACACAATCGCACTTTCGAAGGCAGCGGACTAGCGATTGTGCGGCACTCACTTTCCAGCGACAAGTGAGTTAAGCCAATTTGTTCTTGCGTCGCCTACGGCCAAGCGACGTACGGTCGAGCCTCTAGCCTCGGTTTTAAAAGAAGGTTTTAGAAAACGAGGAGAAAGCTACCCTGTGGCACATACGTGTGTCACAGTGTGGGCGAGCGTGCCGAGACGAGTGTCTTGGGCGACTTCCTTTGAAATGGGTTTTTGAAGGACAGTACCGGACGGCACGGGTGTGCCAGTATTGCGCCCTAGCCCGCGTGTTGGAGGTTGCTACATACACCCGCTATCCGGTACGGTATCCGTAAATGACTTGGCATGGGCACGGGAGGGCCTATGCCTCGATAGAACCCGGATGGATGGATGTTCGGGAAGCCCTGATGAGATGAACGACACGCGGGCCCATTCTCGATGGCACGACCGAACGAGTTATGATGGCCGACGACATCCCGAGACTCGGGATGGCGTCAAGACATATGGACCTCGTCGATGGGGATCGAGATGGCAAGACGAGATGCGACGTTGTATTGAGAGACCTTGGCCCGAATAGAGGCAAGGTCGAACCGAATGAGTTGGCCTAGCATAGAGGTAAGTCGGATGTGTCGCAGACGATTAAACATACACGCACAGGCGGCGTGTGTGGTTGGACAAGCTTGGATTCCGCACAAGCGATGTTCGATCGGCGAAGACAAACCTCGCCTAAGGTCCGACGAAGCCACAAAGCCGGGGCGAAAAATATATATAGGGCTTAATTCCCCTTAAGGCTAGTCGTGAGCGTGTATAGATCACGACGCCACACGGTCAAGAATGTAGGACCGTGACAATAAGCACTTGCAAGGTTTTGAGATGCTTCTTAATGTCTTCCGTTCCGATATTTCTATCTTCTAGAGTTGGCCTTTGGGTTAAACAAAACCGAGTCTAGTGATCGGTGTCCCATGAGCACCTAGTGTAATAGCTAGACTATGGACTATCCTATGATGAAGATTATCTTATTGGGTGCAGGTATAATGAACTAAATGAAGATTTACGTATTGGTTACAGGTATAAAGAACTTGCTCAGAGAGTTGATGAAGCGCTTGGATTCATGGCAGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATACGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTACCCAACAACAGCTGTTATTACTGGGATAGATTCACTGTTTGCATCTAGATACGAAGTGCTTGATCAATTCAAAATCTGGGAGGTACTTCGCTATAATGTGTTTGATCAAGTCAGTGGCTTTGTTATGTAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCCGGGCTTATTGTCACATGGGTTAGCGATCCCATGCACGGCAACACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGTAAAAGCCTGTCTACCAACAAGAATCCACATTGCATACTCACTGTAAATCTTGCTAGATATTGATGTTATCTTTTCATTTGACTTAGGCCGAGTTGAGAGCTTTCTTCGATGTTCATGAGCAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCTTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGGATGCTTGTAGAAAATGGGTCTGTTGCTTAAGAATCTTCCATAATGCGTTATTTGCTCTATAATAACAACTTGCAACTCCAATATCTTTTCTGTTGAGGTAATAAATTTTACATACGGAGACTGAATGTTTGTCTAATCTGAATGATTTCTTGAAAGCTTTTGTTTGTGAGAATTCCTTGAAATAAATCAAA

mRNA sequence

TCCCCAACAAATTTGATACACAGACGTAAAGCGAGCCCATTATCCAGCCAGCAACAGAACCGACTTCAGAGTTTACCGGCGACTGGAGACTCACAAACAAGGTAAAAAATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCCGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCTTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACCCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTCTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCCATTATAAAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCAGATTCATTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGATGCTTTCGACGAGAAATCTCGAACCCCGGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCTGTAGGCACTTTGAATCTTCTTAGAGCATTTGCCACTGGAGGATACGCTGCAATGCAGAGAGTTTCTCAATGGAATCTCGACTTCGTTCAACACAGTGAGCAAGGAGACAGGTATAAAGAACTTGCTCAGAGAGTTGATGAAGCGCTTGGATTCATGGCAGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATACGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCCGGGCTTATTGTCACATGGGTTAGCGATCCCATGCACGGCAACACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGCCGAGTTGAGAGCTTTCTTCGATGTTCATGAGCAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCTTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGGATGCTTGTAGAAAATGGGTCTGTTGCTTAAGAATCTTCCATAATGCGTTATTTGCTCTATAATAACAACTTGCAACTCCAATATCTTTTCTGTTGAGGTAATAAATTTTACATACGGAGACTGAATGTTTGTCTAATCTGAATGATTTCTTGAAAGCTTTTGTTTGTGAGAATTCCTTGAAATAAATCAAA

Coding sequence (CDS)

ATGATCATCTCCGCCTCCACAAATCTCCCTCCTCTTACTTCCTCCGCCGTCCGTACAACTCCCACCACCAAATCTTTCCTCTTCAAACCTCATTTTCCCATCCCCAACGCCGCCAAATTCTGCCGGACCATTCCCTCCGCCGTTTCTTTTTCTTCTTCCACGCATTTCACTTCTGGATCCTCCAATTGGGCGCCTGATTCTTGGAAATCCAAGAAGGCCCTTCAGCTCCCTGAATATCCCGACCCTAATGAGCTCGAGTCCGTTCTTCGCGTTCTCGAGTCCTTCCCGCCCATCGTCTTCGCTGGCGAGGCTCGCAAGCTCGAAGAGAGCCTCGCGAAGGCCGCCGTTGGCGAAGCGTTTCTGCTTCAAGGTGGGGATTGCGCTGAGAGCTTCAAGGAGTTTAATGGGAACAATATTAGGGATACCTTCAGGGTTTTGCTCCAGATGGGTATGGTTCTTACGTATGGCGCCCAAATGCCCATTATAAAGGTAGGAAGAATGGCAGGACAATTTGCTAAACCCAGGTCAGATTCATTTGAGGTTAAAGATGGTGTAAAGCTCCCAAGTTATCGTGGAGATAACATCAATGCAGATGCTTTCGACGAGAAATCTCGAACCCCGGATCCTCAGAGATTGGTTAGAGCATATCTCCAATCTGTAGGCACTTTGAATCTTCTTAGAGCATTTGCCACTGGAGGATACGCTGCAATGCAGAGAGTTTCTCAATGGAATCTCGACTTCGTTCAACACAGTGAGCAAGGAGACAGGTATAAAGAACTTGCTCAGAGAGTTGATGAAGCGCTTGGATTCATGGCAGCTGCTGGAATCACTATGGACCATCCTATAATGAACACAATTGATTTCTGGACCTCTCATGAGTGCCTTCACTTACCATACGAGCAAGCCTTGACGAGGGAGGACTCAACGACAGGCCTCTATTACGATTGTTCTGCTCACATGCTTTGGGTAGGTGAGAGGACTCGACAGTTGGATGGTGCTCATGTTGAATTCCTGCGGGGCGTGTCGAATCCTCTCGGCATTAAGGTAAGTGACAAGATGGATCCAGCAGAGCTTGTTCAGTTATGTGAGATTTTGAATCCTCACAACAAACCTGGACGTCTTACGATAATCACCCGAATGGGAGCCGATAACATGCGAGTCAAGTTACCTCATCTCATTAGAGCCGTGCGTCAAGCCGGGCTTATTGTCACATGGGTTAGCGATCCCATGCACGGCAACACAATAAAGGCACCTTGTGGTCTCAAGACTCGTTCATTTGATTCAATAAGGGCCGAGTTGAGAGCTTTCTTCGATGTTCATGAGCAAGAAGGGAGCTACCCTGGAGGAGTTCATCTAGAAATGACTGGACAAAATGTGACAGAGTGCGTCGGAGGGTCGAAGGAAGTGACTTTCGACGACCTGAATTCTCGCTACCATACCCACTGCGATCCGAGACTGAATGCTTCGCAGTCGCTGGAGTTGGCCTTTGCAATATCCCAAAGGTTGCGGAGGAAAAGGATGCATTCTAAGCCTGGCTCTAATGGGATGCTTGTAGAAAATGGGTCTGTTGCTTAA
BLAST of CmoCh06G008400.1 vs. Swiss-Prot
Match: AROG_SOLLC (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum lycopersicum PE=2 SV=1)

HSP 1 Score: 789.3 bits (2037), Expect = 2.6e-227
Identity = 374/469 (79.74%), Postives = 420/469 (89.55%), Query Frame = 1

Query: 42  RTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFA 101
           ++ P A + +++       + WA DSWKSKKALQLPEYPD  EL SVL+ ++ FPPIVFA
Sbjct: 68  KSSPPAATATTAPAPAVTKTEWAVDSWKSKKALQLPEYPDQEELRSVLKTIDEFPPIVFA 127

Query: 102 GEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPI 161
           GEAR LEE L +AA+G AFLLQGGDCAESFKEFN NNIRDTFR+LLQMG VL +G QMP+
Sbjct: 128 GEARSLEERLGEAAMGRAFLLQGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPV 187

Query: 162 IKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVG 221
           IKVGRMAGQFAKPRSDSFE KDGVKLPSYRGDN+N DAFD KSRTPDPQRL+RAY QS  
Sbjct: 188 IKVGRMAGQFAKPRSDSFEEKDGVKLPSYRGDNVNGDAFDVKSRTPDPQRLIRAYCQSAA 247

Query: 222 TLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHP 281
           TLNLLRAFATGGYAAMQR++QWNLDF +HSEQGDRY+ELA RVDEALGFM AAG+TMDHP
Sbjct: 248 TLNLLRAFATGGYAAMQRINQWNLDFTEHSEQGDRYRELASRVDEALGFMTAAGLTMDHP 307

Query: 282 IMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGV 341
           IM T +FWTSHECL LPYEQ+LTR DST+GL+YDCSAH LWVGERTRQLDGAHVEFLRG+
Sbjct: 308 IMKTTEFWTSHECLLLPYEQSLTRRDSTSGLHYDCSAHFLWVGERTRQLDGAHVEFLRGI 367

Query: 342 SNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGL 401
           +NPLGIKVSDKMDP+ LV+L EILNP NK GR+TIITRMGA+NMRVKLPHLIRAVR+AG 
Sbjct: 368 ANPLGIKVSDKMDPSALVKLIEILNPQNKAGRITIITRMGAENMRVKLPHLIRAVRRAGQ 427

Query: 402 IVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTE 461
           IVTWVSDPMHGNTIKAPCGLKTR FDSIRAE+RAFFDVH+QEGS+PGGVHLEMTGQNVTE
Sbjct: 428 IVTWVSDPMHGNTIKAPCGLKTRPFDSIRAEVRAFFDVHDQEGSHPGGVHLEMTGQNVTE 487

Query: 462 CVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSK 511
           C+GGS+ VTFDDL+SRYHTHCDPRLNASQSLEL+F I++RLR++R+ S+
Sbjct: 488 CIGGSRTVTFDDLSSRYHTHCDPRLNASQSLELSFIIAERLRKRRLGSQ 536

BLAST of CmoCh06G008400.1 vs. Swiss-Prot
Match: AROF_SOLTU (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Solanum tuberosum GN=SHKA PE=1 SV=2)

HSP 1 Score: 789.3 bits (2037), Expect = 2.6e-227
Identity = 374/469 (79.74%), Postives = 420/469 (89.55%), Query Frame = 1

Query: 42  RTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFA 101
           ++ P A + +++       + WA DSWKSKKALQLPEYP+  EL SVL+ ++ FPPIVFA
Sbjct: 68  KSSPPAATATTAPAPAVTKTEWAVDSWKSKKALQLPEYPNQEELRSVLKTIDEFPPIVFA 127

Query: 102 GEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPI 161
           GEAR LEE L +AA+G AFLLQGGDCAESFKEFN NNIRDTFR+LLQMG VL +G QMP+
Sbjct: 128 GEARSLEERLGEAAMGRAFLLQGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPV 187

Query: 162 IKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVG 221
           IKVGRMAGQFAKPRSDSFE KDGVKLPSYRGDN+N DAFD KSRTPDPQRL+RAY QS  
Sbjct: 188 IKVGRMAGQFAKPRSDSFEEKDGVKLPSYRGDNVNGDAFDVKSRTPDPQRLIRAYCQSAA 247

Query: 222 TLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHP 281
           TLNLLRAFATGGYAAMQR++QWNLDF +HSEQGDRY+ELA RVDEALGFM AAG+TMDHP
Sbjct: 248 TLNLLRAFATGGYAAMQRINQWNLDFTEHSEQGDRYRELASRVDEALGFMTAAGLTMDHP 307

Query: 282 IMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGV 341
           IM T +FWTSHECL LPYEQ+LTR DST+GLYYDCSAH LWVGERTRQLDGAHVEFLRG+
Sbjct: 308 IMKTTEFWTSHECLLLPYEQSLTRRDSTSGLYYDCSAHFLWVGERTRQLDGAHVEFLRGI 367

Query: 342 SNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGL 401
           +NPLGIKVSDKMDP+ LV+L EILNP NK GR+TIITRMGA+NMRVKLPHLIRAVR+AG 
Sbjct: 368 ANPLGIKVSDKMDPSALVKLIEILNPQNKAGRITIITRMGAENMRVKLPHLIRAVRRAGQ 427

Query: 402 IVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTE 461
           IVTWVSDPMHGNTIKAPCGLKTR FDSIRAE+RAFFDVH+QEGS+PGGVHLEMTGQNVTE
Sbjct: 428 IVTWVSDPMHGNTIKAPCGLKTRPFDSIRAEVRAFFDVHDQEGSHPGGVHLEMTGQNVTE 487

Query: 462 CVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSK 511
           C+GGS+ VTFDDL+SRYHTHCDPRLNASQSLEL+F I++RLR++R+ S+
Sbjct: 488 CIGGSRTVTFDDLSSRYHTHCDPRLNASQSLELSFIIAERLRKRRLGSQ 536

BLAST of CmoCh06G008400.1 vs. Swiss-Prot
Match: AROF_TOBAC (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Nicotiana tabacum GN=DHAPS-1 PE=2 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 1.3e-226
Identity = 375/466 (80.47%), Postives = 421/466 (90.34%), Query Frame = 1

Query: 45  PSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEA 104
           P+A   +++T  T   + W  +SWKSKKALQLPEYP+  EL+SVL+ +E FPPIVFAGEA
Sbjct: 74  PAATVTAAATTVTK--TEWTVESWKSKKALQLPEYPNQEELQSVLKTIEEFPPIVFAGEA 133

Query: 105 RKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKV 164
           R LEE L +AA+G AFLLQGGDCAESFKEFN NNIRDTFR+LLQMG VL +G QMP+IKV
Sbjct: 134 RSLEERLGEAAMGRAFLLQGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPVIKV 193

Query: 165 GRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLN 224
           GRMAGQFAKPRSD+FE K+GVKLPSYRGDN+N DAFD KSRTPDPQRL+RAY QS  TLN
Sbjct: 194 GRMAGQFAKPRSDNFEEKNGVKLPSYRGDNVNGDAFDAKSRTPDPQRLIRAYCQSAATLN 253

Query: 225 LLRAFATGGYAAMQRVSQWNLDFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMN 284
           LLRAFATGGYAAMQR++QWNLDF +HSEQGDRY+ELA RVDEALGFMAAAG+T+DHPIM 
Sbjct: 254 LLRAFATGGYAAMQRINQWNLDFTEHSEQGDRYRELANRVDEALGFMAAAGLTVDHPIMK 313

Query: 285 TIDFWTSHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNP 344
           T +FWTSHECL LPYEQ+LTR DST+GLYYDCSAH +WVGERTRQLDGAHVEFLRGV+NP
Sbjct: 314 TTEFWTSHECLLLPYEQSLTRLDSTSGLYYDCSAHFIWVGERTRQLDGAHVEFLRGVANP 373

Query: 345 LGIKVSDKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVT 404
           LGIKVSDKMDP+ LV+L EILNP NK GR+TIITRMGA+NMRVKLPHLIRAVR+AG IVT
Sbjct: 374 LGIKVSDKMDPSALVKLIEILNPDNKAGRITIITRMGAENMRVKLPHLIRAVRRAGQIVT 433

Query: 405 WVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVG 464
           WVSDPMHGNTIKAPCGLKTR FDSIRAE+RAFFDVHEQEGS+PGGVHLEMTGQNVTEC+G
Sbjct: 434 WVSDPMHGNTIKAPCGLKTRPFDSIRAEVRAFFDVHEQEGSHPGGVHLEMTGQNVTECIG 493

Query: 465 GSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSK 511
           GS+ VTFDDL+SRYHTHCDPRLNASQSLELAF I++RLR++R+ S+
Sbjct: 494 GSRTVTFDDLSSRYHTHCDPRLNASQSLELAFIIAERLRKRRLGSQ 537

BLAST of CmoCh06G008400.1 vs. Swiss-Prot
Match: AROG_ARATH (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thaliana GN=DHS2 PE=2 SV=2)

HSP 1 Score: 783.1 bits (2021), Expect = 1.9e-225
Identity = 383/491 (78.00%), Postives = 427/491 (86.97%), Query Frame = 1

Query: 22  TTKSFL---FKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPE 81
           TTKSFL     P  PI  +  F      +     ST   S S  W+ +SWKSKKALQLP+
Sbjct: 11  TTKSFLPYRHAPRRPISFSPVFA---VHSTDPKKSTQSASASVKWSLESWKSKKALQLPD 70

Query: 82  YPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNN 141
           YPD  +++SVL+ L SFPPIVFAGEARKLE+ L +AA+G+AF+LQGGDCAESFKEFN NN
Sbjct: 71  YPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAFMLQGGDCAESFKEFNANN 130

Query: 142 IRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINAD 201
           IRDTFRVLLQMG+VL +G Q+P+IKVGRMAGQFAKPRSD FE KDGVKLPSYRGDNIN D
Sbjct: 131 IRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFEEKDGVKLPSYRGDNINGD 190

Query: 202 AFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQGDRYK 261
           AFDEKSR PDP R+VRAY QSV TLNLLRAFATGGYAAMQRVSQWNLDF QHSEQGDRY+
Sbjct: 191 AFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRVSQWNLDFTQHSEQGDRYR 250

Query: 262 ELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSA 321
           ELA RVDEALGFM AAG+T  HPIM T +FWTSHECL LPYEQALTREDST+GLYYDCSA
Sbjct: 251 ELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYEQALTREDSTSGLYYDCSA 310

Query: 322 HMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIIT 381
           HMLWVGERTRQLDGAHVEFLRG++NPLGIKVSDKM P+ELV+L EILNP NKPGR+T+I 
Sbjct: 311 HMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVKLIEILNPQNKPGRITVIV 370

Query: 382 RMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFD 441
           RMGA+NMRVKLP+LIRAVR AG IVTWVSDPMHGNTI AP GLKTRSFD+IRAELRAFFD
Sbjct: 371 RMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGGLKTRSFDAIRAELRAFFD 430

Query: 442 VHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAI 501
           VH+QEGS+PGGVHLEMTGQNVTECVGGS+ +T++DL+SRYHTHCDPRLNASQSLELAF I
Sbjct: 431 VHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFII 490

Query: 502 SQRLRRKRMHS 510
           ++RLR++R+ S
Sbjct: 491 AERLRKRRLGS 498

BLAST of CmoCh06G008400.1 vs. Swiss-Prot
Match: AROG_ORYSJ (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Oryza sativa subsp. japonica GN=DAHPS2 PE=2 SV=1)

HSP 1 Score: 779.6 bits (2012), Expect = 2.1e-224
Identity = 388/514 (75.49%), Postives = 435/514 (84.63%), Query Frame = 1

Query: 5   ASTNLPPL--TSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSN 64
           A+T LP    T SAV      KS     + P+  AAK   + PS V+         G   
Sbjct: 27  AATFLPMRRRTVSAVHAADPAKS-----NGPVQAAAK--ASSPSTVAAPEKKPV--GLGK 86

Query: 65  WAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLL 124
           W  DSWK+KKALQLPEYP   EL+SVL+ +E+FPP+VFAGEAR LEE LA AA+G AF+L
Sbjct: 87  WTVDSWKAKKALQLPEYPSQEELDSVLKTIETFPPVVFAGEARHLEERLADAAMGRAFVL 146

Query: 125 QGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKV-GRMAGQFAKPRSDSFEV 184
           QGGDCAESFKEFN NNIRDTFR+LLQMG VL +G QMP++KV GRMAGQFAKPRSDSFE 
Sbjct: 147 QGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPVVKVVGRMAGQFAKPRSDSFEE 206

Query: 185 KDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVS 244
           +DGVKLPSYRGDNIN D FDEKSR PDPQR++RAY QSV TLNLLRAFATGGYAAMQRV+
Sbjct: 207 RDGVKLPSYRGDNINGDTFDEKSRVPDPQRMIRAYAQSVATLNLLRAFATGGYAAMQRVT 266

Query: 245 QWNLDFVQHSEQGDR-YKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYE 304
           QWNLDF+ HSEQGDR Y+ELA RVDEALGFM AAG+T+DHPIM T DFWTSHECL LPYE
Sbjct: 267 QWNLDFMDHSEQGDRRYRELAHRVDEALGFMTAAGLTVDHPIMTTTDFWTSHECLLLPYE 326

Query: 305 QALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQ 364
           Q+LTREDST+GL+YDCSAHMLWVGERTRQLDGAHVEFLRGV+NPLGIKVSDKM+P +LV+
Sbjct: 327 QSLTREDSTSGLFYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMNPRDLVK 386

Query: 365 LCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCG 424
           L EILNP NKPGR+TIITRMGA+NMRVKLPHLIRAVR +G IVTW++DPMHGNTIKAPCG
Sbjct: 387 LIEILNPSNKPGRITIITRMGAENMRVKLPHLIRAVRNSGQIVTWITDPMHGNTIKAPCG 446

Query: 425 LKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHT 484
           LKTR FDSI AE+RAFFDVH+QEGS+PGG+HLEMTGQNVTEC+GGS+ VTFDDL+ RYHT
Sbjct: 447 LKTRPFDSILAEVRAFFDVHDQEGSHPGGIHLEMTGQNVTECIGGSRTVTFDDLSDRYHT 506

Query: 485 HCDPRLNASQSLELAFAISQRLRRKRMHSKPGSN 515
           HCDPRLNASQSLELAF I++RLRR+RM S   SN
Sbjct: 507 HCDPRLNASQSLELAFIIAERLRRRRMRSGVNSN 531

BLAST of CmoCh06G008400.1 vs. TrEMBL
Match: A0A0A0L679_CUCSA (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_3G073840 PE=3 SV=1)

HSP 1 Score: 982.6 bits (2539), Expect = 1.8e-283
Identity = 485/517 (93.81%), Postives = 495/517 (95.74%), Query Frame = 1

Query: 8   NLPPLTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDS 67
           NL P +SSA  TT   K FLFKPHF  PNAA FCRTIPSAVS SSSTHF SGSSNW P+S
Sbjct: 5   NLLPPSSSAAPTTSIAKCFLFKPHFFTPNAANFCRTIPSAVSSSSSTHFISGSSNWTPES 64

Query: 68  WKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 127
           WKSKKALQLP+YPDPNEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC
Sbjct: 65  WKSKKALQLPQYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 124

Query: 128 AESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKL 187
           AESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGV+L
Sbjct: 125 AESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVEL 184

Query: 188 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 247
           PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF
Sbjct: 185 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 244

Query: 248 VQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRED 307
           VQHSEQGDRYKELAQRVDEALGFMAAAGIT DHPIMNTIDFWTSHECLHLPYEQALTRED
Sbjct: 245 VQHSEQGDRYKELAQRVDEALGFMAAAGITTDHPIMNTIDFWTSHECLHLPYEQALTRED 304

Query: 308 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 367
           STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILNP
Sbjct: 305 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILNP 364

Query: 368 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 427
            N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD
Sbjct: 365 RNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 424

Query: 428 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 487
           SIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN
Sbjct: 425 SIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 484

Query: 488 ASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 525
           ASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 ASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 521

BLAST of CmoCh06G008400.1 vs. TrEMBL
Match: A5C138_VITVI (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VITISV_002672 PE=3 SV=1)

HSP 1 Score: 849.4 bits (2193), Expect = 2.4e-243
Identity = 421/498 (84.54%), Postives = 446/498 (89.56%), Query Frame = 1

Query: 12  LTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSK 71
           +T +A    PT  S      FP P         P  +S S S+     S NW P SWKSK
Sbjct: 3   VTGTANLAAPTPPSLCRL--FPNPRYLPTHXLKPRPISASLSS-IDIRSPNWTPGSWKSK 62

Query: 72  KALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESF 131
           KA QLPEYPDP ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AFLLQGGDCAESF
Sbjct: 63  KAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAFLLQGGDCAESF 122

Query: 132 KEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYR 191
           KEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FEVKDGVKLPSYR
Sbjct: 123 KEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFEVKDGVKLPSYR 182

Query: 192 GDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 251
           GDNIN+D FDEKSRTPDPQRL+RAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS
Sbjct: 183 GDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 242

Query: 252 EQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTG 311
           EQGDRY ELAQRVDEALGFMAAAG+T DHPIMNTI+FWTSHECLHL YEQALTR+DSTTG
Sbjct: 243 EQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYEQALTRQDSTTG 302

Query: 312 LYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKP 371
           LYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+LCEILNP NKP
Sbjct: 303 LYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVKLCEILNPRNKP 362

Query: 372 GRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRA 431
           GRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIR+
Sbjct: 363 GRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRS 422

Query: 432 ELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQS 491
           ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHTHCDPRLNASQS
Sbjct: 423 ELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHTHCDPRLNASQS 482

Query: 492 LELAFAISQRLRRKRMHS 510
           LELAFAI++RLRRKRM S
Sbjct: 483 LELAFAIAERLRRKRMRS 497

BLAST of CmoCh06G008400.1 vs. TrEMBL
Match: D0VBC1_VITVI (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VIT_00s1217g00010 PE=2 SV=2)

HSP 1 Score: 849.4 bits (2193), Expect = 2.4e-243
Identity = 421/498 (84.54%), Postives = 446/498 (89.56%), Query Frame = 1

Query: 12  LTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSK 71
           +T +A    PT  S      FP P         P  +S S S+     S NW P SWKSK
Sbjct: 3   VTGTANLAAPTPPSLCRL--FPNPRYLPTHTLKPRPISASLSS-IDIRSPNWTPGSWKSK 62

Query: 72  KALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESF 131
           KA QLPEYPDP ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AFLLQGGDCAESF
Sbjct: 63  KAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAFLLQGGDCAESF 122

Query: 132 KEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYR 191
           KEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FEVKDGVKLPSYR
Sbjct: 123 KEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFEVKDGVKLPSYR 182

Query: 192 GDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 251
           GDNIN+D FDEKSRTPDPQRL+RAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS
Sbjct: 183 GDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 242

Query: 252 EQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTG 311
           EQGDRY ELAQRVDEALGFMAAAG+T DHPIMNTI+FWTSHECLHL YEQALTR+DSTTG
Sbjct: 243 EQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYEQALTRQDSTTG 302

Query: 312 LYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKP 371
           LYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+LCEILNP NKP
Sbjct: 303 LYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVKLCEILNPRNKP 362

Query: 372 GRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRA 431
           GRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIR+
Sbjct: 363 GRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRS 422

Query: 432 ELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQS 491
           ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHTHCDPRLNASQS
Sbjct: 423 ELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHTHCDPRLNASQS 482

Query: 492 LELAFAISQRLRRKRMHS 510
           LELAFAI++RLRRKRM S
Sbjct: 483 LELAFAIAERLRRKRMRS 497

BLAST of CmoCh06G008400.1 vs. TrEMBL
Match: B9MUW0_POPTR (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Populus trichocarpa GN=POPTR_0001s15050g PE=3 SV=1)

HSP 1 Score: 847.8 bits (2189), Expect = 7.0e-243
Identity = 409/462 (88.53%), Postives = 435/462 (94.16%), Query Frame = 1

Query: 51  SSSTHFTSGSSNWAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEES 110
           S+ T   + SSNWA DSWKSK A QLP+YPDP EL SVL+ L +FPPIVFAGEARKLEE 
Sbjct: 44  STPTSKPTISSNWALDSWKSKPARQLPDYPDPVELHSVLQTLTNFPPIVFAGEARKLEER 103

Query: 111 LAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQ 170
           +A AAVG AFLLQGGDCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQ
Sbjct: 104 IASAAVGGAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQ 163

Query: 171 FAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFA 230
           FAKPRSD FE KDGVKLPSYRGDNINADAFD+KSRTPDPQRL+RAYLQSVGTLNLLRAFA
Sbjct: 164 FAKPRSDPFEEKDGVKLPSYRGDNINADAFDKKSRTPDPQRLIRAYLQSVGTLNLLRAFA 223

Query: 231 TGGYAAMQRVSQWNLDFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWT 290
           TGGYAAMQRVSQWNLDFV+HSEQGDR+ ELA+RVDEALGFMAAAG+T+DHP+MNT +FWT
Sbjct: 224 TGGYAAMQRVSQWNLDFVEHSEQGDRFMELARRVDEALGFMAAAGLTIDHPVMNTTEFWT 283

Query: 291 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 350
           SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS
Sbjct: 284 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 343

Query: 351 DKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPM 410
           DKMDP ELV+LCEILNPHN+PGRLTIITRMGADNMR+KLPHLIRAVR AGLIVTWVSDPM
Sbjct: 344 DKMDPKELVKLCEILNPHNRPGRLTIITRMGADNMRIKLPHLIRAVRHAGLIVTWVSDPM 403

Query: 411 HGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVT 470
           HGNTIKAPCGLKTR FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK +T
Sbjct: 404 HGNTIKAPCGLKTRPFDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTIT 463

Query: 471 FDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSKPG 513
           FDDLNSRYHTHCDPRLNASQSLELAFAIS+RLR+KR+ +  G
Sbjct: 464 FDDLNSRYHTHCDPRLNASQSLELAFAISERLRKKRLRAGDG 505

BLAST of CmoCh06G008400.1 vs. TrEMBL
Match: B9SZ06_RICCO (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Ricinus communis GN=RCOM_0121060 PE=3 SV=1)

HSP 1 Score: 847.4 bits (2188), Expect = 9.1e-243
Identity = 421/504 (83.53%), Postives = 451/504 (89.48%), Query Frame = 1

Query: 6   STNLPPLTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAP 65
           +T  PP +++A   T   K  +F  H       K   T  ++VS +SST     S +W+ 
Sbjct: 7   TTPKPPFSTAAAAAT---KPQIFSFHVGTIKRPK--PTFIASVS-TSSTSNNIASPDWSL 66

Query: 66  DSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGG 125
           DSWKSK A QLPEYPD  ELE+VL+ L +FPPIVFAGEARKLEE LA AAVG AFLLQGG
Sbjct: 67  DSWKSKPAKQLPEYPDQQELETVLQSLNNFPPIVFAGEARKLEERLASAAVGNAFLLQGG 126

Query: 126 DCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGV 185
           DCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQFAKPRSD FE+KDGV
Sbjct: 127 DCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQFAKPRSDPFEIKDGV 186

Query: 186 KLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNL 245
           KLPSYRGDNINADAFDEKSR PDPQRL+RAYLQSVGTLNLLRAFATGGYAAMQRVSQWNL
Sbjct: 187 KLPSYRGDNINADAFDEKSRRPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNL 246

Query: 246 DFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTR 305
           DFV HSEQGDRY ELA+RVDEALGFMAAAG+T+DHP+MNT +FWTSHECLHLPYEQALTR
Sbjct: 247 DFVLHSEQGDRYMELARRVDEALGFMAAAGLTVDHPVMNTTEFWTSHECLHLPYEQALTR 306

Query: 306 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEIL 365
           EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP ELV+LCEIL
Sbjct: 307 EDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPKELVKLCEIL 366

Query: 366 NPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRS 425
           NPHN+PGRLTII RMGADN+R+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTR 
Sbjct: 367 NPHNRPGRLTIIARMGADNLRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRP 426

Query: 426 FDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPR 485
           FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK VTFDDLNSRYHTHCDPR
Sbjct: 427 FDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTVTFDDLNSRYHTHCDPR 486

Query: 486 LNASQSLELAFAISQRLRRKRMHS 510
           LNASQSLELAFAI++RLRRKR+ S
Sbjct: 487 LNASQSLELAFAIAERLRRKRLRS 504

BLAST of CmoCh06G008400.1 vs. TAIR10
Match: AT4G33510.1 (AT4G33510.1 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase)

HSP 1 Score: 783.1 bits (2021), Expect = 1.1e-226
Identity = 383/491 (78.00%), Postives = 427/491 (86.97%), Query Frame = 1

Query: 22  TTKSFL---FKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQLPE 81
           TTKSFL     P  PI  +  F      +     ST   S S  W+ +SWKSKKALQLP+
Sbjct: 11  TTKSFLPYRHAPRRPISFSPVFA---VHSTDPKKSTQSASASVKWSLESWKSKKALQLPD 70

Query: 82  YPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFNGNN 141
           YPD  +++SVL+ L SFPPIVFAGEARKLE+ L +AA+G+AF+LQGGDCAESFKEFN NN
Sbjct: 71  YPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAFMLQGGDCAESFKEFNANN 130

Query: 142 IRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNINAD 201
           IRDTFRVLLQMG+VL +G Q+P+IKVGRMAGQFAKPRSD FE KDGVKLPSYRGDNIN D
Sbjct: 131 IRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFEEKDGVKLPSYRGDNINGD 190

Query: 202 AFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQGDRYK 261
           AFDEKSR PDP R+VRAY QSV TLNLLRAFATGGYAAMQRVSQWNLDF QHSEQGDRY+
Sbjct: 191 AFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRVSQWNLDFTQHSEQGDRYR 250

Query: 262 ELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYDCSA 321
           ELA RVDEALGFM AAG+T  HPIM T +FWTSHECL LPYEQALTREDST+GLYYDCSA
Sbjct: 251 ELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYEQALTREDSTSGLYYDCSA 310

Query: 322 HMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLTIIT 381
           HMLWVGERTRQLDGAHVEFLRG++NPLGIKVSDKM P+ELV+L EILNP NKPGR+T+I 
Sbjct: 311 HMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVKLIEILNPQNKPGRITVIV 370

Query: 382 RMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRAFFD 441
           RMGA+NMRVKLP+LIRAVR AG IVTWVSDPMHGNTI AP GLKTRSFD+IRAELRAFFD
Sbjct: 371 RMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGGLKTRSFDAIRAELRAFFD 430

Query: 442 VHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELAFAI 501
           VH+QEGS+PGGVHLEMTGQNVTECVGGS+ +T++DL+SRYHTHCDPRLNASQSLELAF I
Sbjct: 431 VHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFII 490

Query: 502 SQRLRRKRMHS 510
           ++RLR++R+ S
Sbjct: 491 AERLRKRRLGS 498

BLAST of CmoCh06G008400.1 vs. TAIR10
Match: AT1G22410.1 (AT1G22410.1 Class-II DAHP synthetase family protein)

HSP 1 Score: 779.6 bits (2012), Expect = 1.2e-225
Identity = 377/496 (76.01%), Postives = 426/496 (85.89%), Query Frame = 1

Query: 15  SAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKAL 74
           SAV+T P T         P  ++A    T P+ ++     +   G   WAP+SW++KKAL
Sbjct: 38  SAVQTDPKT---------PAASSASAATTTPATLTKPVGVNV--GKGKWAPESWRTKKAL 97

Query: 75  QLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEF 134
           Q P+YPD   LE+VL  +E+FPPIVFAGEAR LEE L +AA+GEAFLLQGGDCAESFKEF
Sbjct: 98  QQPDYPDLAALEAVLETIEAFPPIVFAGEARLLEERLGQAAMGEAFLLQGGDCAESFKEF 157

Query: 135 NGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDN 194
           N NNIRDTFR+LLQMG VL +G Q+P++KVGRMAGQFAKPRSDSFE KDGVKLPSYRGDN
Sbjct: 158 NANNIRDTFRILLQMGAVLMFGGQVPVVKVGRMAGQFAKPRSDSFEEKDGVKLPSYRGDN 217

Query: 195 INADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQG 254
           IN DAFD KSR PDPQR++RAY QS  TLNLLRAFATGGYAAMQRV+QWNLDF + SEQG
Sbjct: 218 INGDAFDSKSRIPDPQRMIRAYCQSAATLNLLRAFATGGYAAMQRVTQWNLDFTERSEQG 277

Query: 255 DRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYY 314
           DRY+ELA RVDEALGFM AAG+T+DHPIM T DFWTSHECL LPYEQ+LTR DST+GLYY
Sbjct: 278 DRYRELANRVDEALGFMHAAGLTLDHPIMQTTDFWTSHECLLLPYEQSLTRLDSTSGLYY 337

Query: 315 DCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRL 374
           DCSAHM+WVGERTRQLDGAHVEFLRGV+NPLGIKVSDKMDP ELV+L EILN  NKPGR+
Sbjct: 338 DCSAHMIWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMDPKELVKLIEILNADNKPGRI 397

Query: 375 TIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELR 434
           TIITRMGA+NMRVKLPHLIR VR+AG IVTWVSDPMHGNTIKAPCGLKTR FD+I AE+R
Sbjct: 398 TIITRMGAENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGLKTRPFDAILAEVR 457

Query: 435 AFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLEL 494
           AFFDVHEQEGS+PGG+HLEMTGQNVTEC+GGS+ VTFDDL SRYHTHCDPRLNASQSLEL
Sbjct: 458 AFFDVHEQEGSHPGGIHLEMTGQNVTECIGGSRTVTFDDLGSRYHTHCDPRLNASQSLEL 517

Query: 495 AFAISQRLRRKRMHSK 511
           +F I++RLR++R+ S+
Sbjct: 518 SFIIAERLRKRRIKSQ 522

BLAST of CmoCh06G008400.1 vs. TAIR10
Match: AT4G39980.1 (AT4G39980.1 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1)

HSP 1 Score: 763.1 bits (1969), Expect = 1.1e-220
Identity = 368/495 (74.34%), Postives = 427/495 (86.26%), Query Frame = 1

Query: 16  AVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSKKALQ 75
           AV T P + + +   H   P  A+   ++  +V+ SSS     G+  W P+SWK KKALQ
Sbjct: 35  AVNTKPKSVNLVTAVHAAEP--ARNAVSVKESVASSSS-----GALKWTPESWKLKKALQ 94

Query: 76  LPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESFKEFN 135
           LP+YP+ NELESVL+ +E+FPPIVFAGEAR LEE LA AAVG+AFLLQGGDCAESFKEFN
Sbjct: 95  LPDYPNANELESVLKTIEAFPPIVFAGEARNLEERLADAAVGKAFLLQGGDCAESFKEFN 154

Query: 136 GNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYRGDNI 195
             NIRDTFRVLLQM +VLT+G Q+P+IKVGRMAGQFAKPRSD+FE KDGVKLPSY+GDNI
Sbjct: 155 ATNIRDTFRVLLQMSIVLTFGGQVPVIKVGRMAGQFAKPRSDAFEEKDGVKLPSYKGDNI 214

Query: 196 NADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHSEQGD 255
           N D FDEKSR PDP R++RAY QS  TLNLLRAFATGGYAA+QRV+QWNLDFV+ SEQ D
Sbjct: 215 NGDTFDEKSRIPDPNRMIRAYTQSAATLNLLRAFATGGYAAIQRVTQWNLDFVEQSEQAD 274

Query: 256 RYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTGLYYD 315
           RY+ELA RVDEALGFM+A G+  DHP+M T DF+TSHECL LPYEQ+LTR DST+GLYYD
Sbjct: 275 RYQELANRVDEALGFMSACGLGTDHPLMTTTDFYTSHECLLLPYEQSLTRLDSTSGLYYD 334

Query: 316 CSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKPGRLT 375
           CSAHM+W GERTRQLDGAHVEFLRG++NPLGIKVS+KMDP ELV+L EILNP+NKPGR+T
Sbjct: 335 CSAHMVWCGERTRQLDGAHVEFLRGIANPLGIKVSNKMDPFELVKLVEILNPNNKPGRIT 394

Query: 376 IITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRAELRA 435
           +I RMGA+NMRVKLPHLIRAVR++G IVTWV DPMHGNTIKAPCGLKTR+FDSI AE+RA
Sbjct: 395 VIVRMGAENMRVKLPHLIRAVRRSGQIVTWVCDPMHGNTIKAPCGLKTRAFDSILAEVRA 454

Query: 436 FFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQSLELA 495
           F DVHEQEGS+ GG+HLEMTGQNVTEC+GGS+ VT+DDL+SRYHTHCDPRLNASQSLELA
Sbjct: 455 FLDVHEQEGSHAGGIHLEMTGQNVTECIGGSRTVTYDDLSSRYHTHCDPRLNASQSLELA 514

Query: 496 FAISQRLRRKRMHSK 511
           F +++RLR++R  S+
Sbjct: 515 FIVAERLRKRRTGSQ 522

BLAST of CmoCh06G008400.1 vs. NCBI nr
Match: gi|659096501|ref|XP_008449130.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis melo])

HSP 1 Score: 983.4 bits (2541), Expect = 1.5e-283
Identity = 487/518 (94.02%), Postives = 495/518 (95.56%), Query Frame = 1

Query: 8   NLPPLTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSS-STHFTSGSSNWAPD 67
           NL P T S   TT   K FLFKPHF  PNAAKFCRTIPSAVS SS STHF SGSSNW P+
Sbjct: 5   NLLPPTPSTAPTTSIAKCFLFKPHFSTPNAAKFCRTIPSAVSSSSSSTHFVSGSSNWTPE 64

Query: 68  SWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD 127
           SWKSK+ALQLPEYPDPNEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD
Sbjct: 65  SWKSKRALQLPEYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGD 124

Query: 128 CAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVK 187
           CAESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGVK
Sbjct: 125 CAESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVK 184

Query: 188 LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLD 247
           LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLD
Sbjct: 185 LPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLD 244

Query: 248 FVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE 307
           FVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE
Sbjct: 245 FVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRE 304

Query: 308 DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILN 367
           DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILN
Sbjct: 305 DSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILN 364

Query: 368 PHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF 427
           P N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF
Sbjct: 365 PRNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSF 424

Query: 428 DSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL 487
           DSIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL
Sbjct: 425 DSIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRL 484

Query: 488 NASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 525
           NASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 NASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 522

BLAST of CmoCh06G008400.1 vs. NCBI nr
Match: gi|449463236|ref|XP_004149340.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 982.6 bits (2539), Expect = 2.6e-283
Identity = 485/517 (93.81%), Postives = 495/517 (95.74%), Query Frame = 1

Query: 8   NLPPLTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDS 67
           NL P +SSA  TT   K FLFKPHF  PNAA FCRTIPSAVS SSSTHF SGSSNW P+S
Sbjct: 5   NLLPPSSSAAPTTSIAKCFLFKPHFFTPNAANFCRTIPSAVSSSSSTHFISGSSNWTPES 64

Query: 68  WKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 127
           WKSKKALQLP+YPDPNEL+SVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC
Sbjct: 65  WKSKKALQLPQYPDPNELDSVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDC 124

Query: 128 AESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKL 187
           AESFKEFNGNNIRDTFRVLLQMG+VLTYGAQMPIIKVGRMAGQFAKPRSD FEVKDGV+L
Sbjct: 125 AESFKEFNGNNIRDTFRVLLQMGIVLTYGAQMPIIKVGRMAGQFAKPRSDPFEVKDGVEL 184

Query: 188 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 247
           PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF
Sbjct: 185 PSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDF 244

Query: 248 VQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTRED 307
           VQHSEQGDRYKELAQRVDEALGFMAAAGIT DHPIMNTIDFWTSHECLHLPYEQALTRED
Sbjct: 245 VQHSEQGDRYKELAQRVDEALGFMAAAGITTDHPIMNTIDFWTSHECLHLPYEQALTRED 304

Query: 308 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNP 367
           STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDP+ELVQLCEILNP
Sbjct: 305 STTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPSELVQLCEILNP 364

Query: 368 HNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 427
            N+PGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD
Sbjct: 365 RNRPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFD 424

Query: 428 SIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 487
           SIRAELRAFFDVHEQEGS+PGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN
Sbjct: 425 SIRAELRAFFDVHEQEGSHPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLN 484

Query: 488 ASQSLELAFAISQRLRRKRMHSKPGSNGMLVENGSVA 525
           ASQSLELAFAISQRLR KRM SK G NG+LVENG VA
Sbjct: 485 ASQSLELAFAISQRLRSKRMRSKAGLNGLLVENGFVA 521

BLAST of CmoCh06G008400.1 vs. NCBI nr
Match: gi|743908215|ref|XP_011047555.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Populus euphratica])

HSP 1 Score: 851.3 bits (2198), Expect = 9.1e-244
Identity = 411/462 (88.96%), Postives = 436/462 (94.37%), Query Frame = 1

Query: 51  SSSTHFTSGSSNWAPDSWKSKKALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEES 110
           S+ T   + SSNWA DSWKSK A QLP+YPDP EL SVL+ L +FPPIVFAGEARKLEE 
Sbjct: 44  STPTSKPTISSNWALDSWKSKPARQLPDYPDPVELHSVLQTLTNFPPIVFAGEARKLEER 103

Query: 111 LAKAAVGEAFLLQGGDCAESFKEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQ 170
           +A AAVG AFLLQGGDCAESFKEFN NNIRDTFRVLLQMG+VLT+GAQMPIIKVGRMAGQ
Sbjct: 104 IASAAVGGAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLTFGAQMPIIKVGRMAGQ 163

Query: 171 FAKPRSDSFEVKDGVKLPSYRGDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFA 230
           FAKPRSD FE KDGVKLPSYRGDNINADAFDEKSRTPDPQRL+RAYLQSVGTLNLLRAFA
Sbjct: 164 FAKPRSDPFEEKDGVKLPSYRGDNINADAFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFA 223

Query: 231 TGGYAAMQRVSQWNLDFVQHSEQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWT 290
           TGGYAAMQRVSQWNLDFV+HSEQGDR+ ELA+RVDEALGFMAAAG+T+DHP+MNT +FWT
Sbjct: 224 TGGYAAMQRVSQWNLDFVEHSEQGDRFMELARRVDEALGFMAAAGLTIDHPVMNTTEFWT 283

Query: 291 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 350
           SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS
Sbjct: 284 SHECLHLPYEQALTREDSTTGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVS 343

Query: 351 DKMDPAELVQLCEILNPHNKPGRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPM 410
           DKMDP ELV+LCEILNPHN+PGRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPM
Sbjct: 344 DKMDPKELVKLCEILNPHNRPGRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPM 403

Query: 411 HGNTIKAPCGLKTRSFDSIRAELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVT 470
           HGNTIKAPCGLKTR FDSIRAELRAFFDVH+QEGSYPGGVHLEMTGQNVTECVGGSK +T
Sbjct: 404 HGNTIKAPCGLKTRPFDSIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSKTIT 463

Query: 471 FDDLNSRYHTHCDPRLNASQSLELAFAISQRLRRKRMHSKPG 513
           FDDLNSRYHTHCDPRLNASQSLELAFAIS+RLR+KR+ +  G
Sbjct: 464 FDDLNSRYHTHCDPRLNASQSLELAFAISERLRKKRLRAGDG 505

BLAST of CmoCh06G008400.1 vs. NCBI nr
Match: gi|147853875|emb|CAN79559.1| (hypothetical protein VITISV_002672 [Vitis vinifera])

HSP 1 Score: 849.4 bits (2193), Expect = 3.4e-243
Identity = 421/498 (84.54%), Postives = 446/498 (89.56%), Query Frame = 1

Query: 12  LTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSK 71
           +T +A    PT  S      FP P         P  +S S S+     S NW P SWKSK
Sbjct: 3   VTGTANLAAPTPPSLCRL--FPNPRYLPTHXLKPRPISASLSS-IDIRSPNWTPGSWKSK 62

Query: 72  KALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESF 131
           KA QLPEYPDP ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AFLLQGGDCAESF
Sbjct: 63  KAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAFLLQGGDCAESF 122

Query: 132 KEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYR 191
           KEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FEVKDGVKLPSYR
Sbjct: 123 KEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFEVKDGVKLPSYR 182

Query: 192 GDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 251
           GDNIN+D FDEKSRTPDPQRL+RAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS
Sbjct: 183 GDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 242

Query: 252 EQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTG 311
           EQGDRY ELAQRVDEALGFMAAAG+T DHPIMNTI+FWTSHECLHL YEQALTR+DSTTG
Sbjct: 243 EQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYEQALTRQDSTTG 302

Query: 312 LYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKP 371
           LYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+LCEILNP NKP
Sbjct: 303 LYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVKLCEILNPRNKP 362

Query: 372 GRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRA 431
           GRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIR+
Sbjct: 363 GRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRS 422

Query: 432 ELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQS 491
           ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHTHCDPRLNASQS
Sbjct: 423 ELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHTHCDPRLNASQS 482

Query: 492 LELAFAISQRLRRKRMHS 510
           LELAFAI++RLRRKRM S
Sbjct: 483 LELAFAIAERLRRKRMRS 497

BLAST of CmoCh06G008400.1 vs. NCBI nr
Match: gi|526117351|ref|NP_001268000.1| (uncharacterized protein LOC100251487 [Vitis vinifera])

HSP 1 Score: 849.4 bits (2193), Expect = 3.4e-243
Identity = 421/498 (84.54%), Postives = 446/498 (89.56%), Query Frame = 1

Query: 12  LTSSAVRTTPTTKSFLFKPHFPIPNAAKFCRTIPSAVSFSSSTHFTSGSSNWAPDSWKSK 71
           +T +A    PT  S      FP P         P  +S S S+     S NW P SWKSK
Sbjct: 3   VTGTANLAAPTPPSLCRL--FPNPRYLPTHTLKPRPISASLSS-IDIRSPNWTPGSWKSK 62

Query: 72  KALQLPEYPDPNELESVLRVLESFPPIVFAGEARKLEESLAKAAVGEAFLLQGGDCAESF 131
           KA QLPEYPDP ELESVL+ LESFPP+VFAGEAR LEE LA AAVG+AFLLQGGDCAESF
Sbjct: 63  KAQQLPEYPDPVELESVLKTLESFPPMVFAGEARNLEERLADAAVGKAFLLQGGDCAESF 122

Query: 132 KEFNGNNIRDTFRVLLQMGMVLTYGAQMPIIKVGRMAGQFAKPRSDSFEVKDGVKLPSYR 191
           KEF G NIRDTFRVLLQMG+VLT+GAQ+P+IKVGRMAGQFAKPRSD FEVKDGVKLPSYR
Sbjct: 123 KEFGGTNIRDTFRVLLQMGIVLTFGAQLPVIKVGRMAGQFAKPRSDPFEVKDGVKLPSYR 182

Query: 192 GDNINADAFDEKSRTPDPQRLVRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 251
           GDNIN+D FDEKSRTPDPQRL+RAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS
Sbjct: 183 GDNINSDDFDEKSRTPDPQRLIRAYLQSVGTLNLLRAFATGGYAAMQRVSQWNLDFVQHS 242

Query: 252 EQGDRYKELAQRVDEALGFMAAAGITMDHPIMNTIDFWTSHECLHLPYEQALTREDSTTG 311
           EQGDRY ELAQRVDEALGFMAAAG+T DHPIMNTI+FWTSHECLHL YEQALTR+DSTTG
Sbjct: 243 EQGDRYTELAQRVDEALGFMAAAGLTTDHPIMNTIEFWTSHECLHLLYEQALTRQDSTTG 302

Query: 312 LYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMDPAELVQLCEILNPHNKP 371
           LYYDCSAHMLWVGERTRQLDGAHVEFLRG+SNPLGIKVSDKMDP ELV+LCEILNP NKP
Sbjct: 303 LYYDCSAHMLWVGERTRQLDGAHVEFLRGISNPLGIKVSDKMDPKELVKLCEILNPRNKP 362

Query: 372 GRLTIITRMGADNMRVKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRA 431
           GRLTIITRMGADNMR+KLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIR+
Sbjct: 363 GRLTIITRMGADNMRIKLPHLIRAVRQAGLIVTWVSDPMHGNTIKAPCGLKTRSFDSIRS 422

Query: 432 ELRAFFDVHEQEGSYPGGVHLEMTGQNVTECVGGSKEVTFDDLNSRYHTHCDPRLNASQS 491
           ELRAFFDVH+QEGS+PGGVHLEMTGQNVTEC+GGSK VTFDDLNSRYHTHCDPRLNASQS
Sbjct: 423 ELRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSKTVTFDDLNSRYHTHCDPRLNASQS 482

Query: 492 LELAFAISQRLRRKRMHS 510
           LELAFAI++RLRRKRM S
Sbjct: 483 LELAFAIAERLRRKRMRS 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AROG_SOLLC2.6e-22779.74Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum lycopers... [more]
AROF_SOLTU2.6e-22779.74Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Solanum tuberosu... [more]
AROF_TOBAC1.3e-22680.47Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Nicotiana tabacu... [more]
AROG_ARATH1.9e-22578.00Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thal... [more]
AROG_ORYSJ2.1e-22475.49Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Oryza sativa sub... [more]
Match NameE-valueIdentityDescription
A0A0A0L679_CUCSA1.8e-28393.81Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_3G073840 P... [more]
A5C138_VITVI2.4e-24384.54Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VITISV_002672 P... [more]
D0VBC1_VITVI2.4e-24384.54Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Vitis vinifera GN=VIT_00s1217g000... [more]
B9MUW0_POPTR7.0e-24388.53Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Populus trichocarpa GN=POPTR_0001... [more]
B9SZ06_RICCO9.1e-24383.53Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Ricinus communis GN=RCOM_0121060 ... [more]
Match NameE-valueIdentityDescription
AT4G33510.11.1e-22678.00 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase[more]
AT1G22410.11.2e-22576.01 Class-II DAHP synthetase family protein[more]
AT4G39980.11.1e-22074.34 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1[more]
Match NameE-valueIdentityDescription
gi|659096501|ref|XP_008449130.1|1.5e-28394.02PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu... [more]
gi|449463236|ref|XP_004149340.1|2.6e-28393.81PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu... [more]
gi|743908215|ref|XP_011047555.1|9.1e-24488.96PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Po... [more]
gi|147853875|emb|CAN79559.1|3.4e-24384.54hypothetical protein VITISV_002672 [Vitis vinifera][more]
gi|526117351|ref|NP_001268000.1|3.4e-24384.54uncharacterized protein LOC100251487 [Vitis vinifera][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002480DAHP_synth_2
Vocabulary: Molecular Function
TermDefinition
GO:00038493-deoxy-7-phosphoheptulonate synthase activity
Vocabulary: Biological Process
TermDefinition
GO:0009073aromatic amino acid family biosynthetic process
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009073 aromatic amino acid family biosynthetic process
biological_process GO:0009094 L-phenylalanine biosynthetic process
biological_process GO:0000162 tryptophan biosynthetic process
biological_process GO:0006571 tyrosine biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003849 3-deoxy-7-phosphoheptulonate synthase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh06G008400CmoCh06G008400gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh06G008400.1CmoCh06G008400.1-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh06G008400.1.three_prime_UTR.1CmoCh06G008400.1.three_prime_UTR.1three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh06G008400.1.CDS.5CmoCh06G008400.1.CDS.5CDS
CmoCh06G008400.1.CDS.4CmoCh06G008400.1.CDS.4CDS
CmoCh06G008400.1.CDS.3CmoCh06G008400.1.CDS.3CDS
CmoCh06G008400.1.CDS.2CmoCh06G008400.1.CDS.2CDS
CmoCh06G008400.1.CDS.1CmoCh06G008400.1.CDS.1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh06G008400.1.five_prime_UTR.1CmoCh06G008400.1.five_prime_UTR.1five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh06G008400.1.exon.5CmoCh06G008400.1.exon.5exon
CmoCh06G008400.1.exon.4CmoCh06G008400.1.exon.4exon
CmoCh06G008400.1.exon.3CmoCh06G008400.1.exon.3exon
CmoCh06G008400.1.exon.2CmoCh06G008400.1.exon.2exon
CmoCh06G008400.1.exon.1CmoCh06G008400.1.exon.1exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002480DAHP synthetase, class IIPANTHERPTHR21337PHOSPHO-2-DEHYDRO-3-DEOXYHEPTONATE ALDOLASE 1, 2coord: 1..509
score:
IPR002480DAHP synthetase, class IIPFAMPF01474DAHP_synth_2coord: 63..499
score: 2.8E
IPR002480DAHP synthetase, class IITIGRFAMsTIGR01358TIGR01358coord: 63..505
score: 9.1E
NoneNo IPR availablePANTHERPTHR21337:SF3SUBFAMILY NOT NAMEDcoord: 1..509
score:
NoneNo IPR availableunknownSSF51569Aldolasecoord: 52..503
score: 3.88E