CmoCh14G020470 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G020470
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: 4-hydroxy-tetrahydrodipicolinate synthase
Location: Cmo_Chr14 : 14959091 .. 14962167 (+)
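
The span covered by the location above can be checked with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state); the function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates.

    Assumption: both ends are included, as in GFF-style annotation.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
print(feature_length(14959091, 14962167))  # 3077
```

Under that assumption the gene spans 3077 bp on the plus strand of Cmo_Chr14.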
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACTTCTCCGTCTGCTCCCTCCGACGTTCTGTCACCGGAACTATTTCGACCACAACGACGACCACCGCAGCAATCGCAAACATCGCCATCATCATTCTGTAATCTTCTGCGCCCTTCCACATATTCTCCTCGGCGGTTTTCCTTTTCCGGAAACGATTCCAGTAGCTTGTTGGTCCGTGAAGAAGGTTCATCATCTGCGAAAGTTTTGTTCTCGGTTGGGTGTTCCAAATGACCAGTATTCAAGGCTTTGGCGCGTGCTTGCAGGAGCATGCTCTTCAGTTTCCGCGTCCCAGTTACAACGACAGCTGCAGGAGGTATACATACCACTTTCAGTTGAATTATTAAACAACGATTTAAGGTTTTTGGATTTACTTGAAAGCATTCAAATACGCCATTTTAGTAGCTGCCTTGACATATTATTTTATTTTAAGAAATGAATAATTTATTCGTACTGATCGATGCTGGTCATTAAGCATCCAGCTGACTTTGATCAAAGTTGCTCCATCGATGATTATTGCAACCAGTCGTGGCTGTGTGTTTCTACTTCTTAATCGAATTACTTCTATTTCGCATCTCCCTGCAATTTTTCGTTACAAACATACTGGCAATACATATAATTCAATGTATATTCTGTCTGCTTGTTCGTTTTTTTCACTTCAAGTATGTAAGAAAACGAAAGGAAATAATCTTAAGTCATGTCGCCAATGATACTGCCGACGAACGTTAGGTTCTTCCTATTTTGTTCAATAGATTAATTGTTTCCTGAAGTGTAAATGAAATGTAACGTAGAATATCAATTGGGTTCACCGTGAATGAATGGTTCAATTTGAAATGGAGGGCTGACTGAACACTACGTATCAAATTATGTACTTACCAATGTTACTTCCTTCATTACTGTCAGACGCAAGAGGACTGCCTTATGGAGATCTCCACAGGCTGCCATCCTACCCAATGCACACCTCCCAATGCGCAGTTTGGAAGTCAAGAACAGGTGAGCTACTTAATTTTTTGGATTCGTTCTTATGGAGGTTAAGTTGGGAAGGCCTCTCGCCTCTTAGTGAACTGGTCAATGTCTGTAAATAGGATTTGTAATTGCATTTTTCATGTTGGCTGTTTGCATTATGTAGCGTTTAGGACATTAGTCATTGCTGTCTGTTTACTTCAGTTCATAATAATAGTTGTTCTGTTAAAATTATTTTCCATCACAGAGCTCAAGTTTCCTCAGTTGGTGGATATAAGGTAACAGAAGATGTGGGCTTTTTTTTTTTTTTTTTTTGCATGCTTTCTCTCACTGATTAGACACAATGTAATAATTGACTGTTCTTCATTCACCATTACGGTCCGAGGAAGAATTAAACGACCAACTCGCTCACACCCCCCACCCCAAATTTTTTTGGGAGATACTGGCCGGGAATTTTGATATGGTGTGAACTCTTATAGTAAACATTCTGAACTCAACAACTGCTGTTACACGTTACTAGCAATTAACACTGTGATTGACTTGTGTAATCAGTGTCTGAATGTTTTTTGGCTCTACATTATTTATGATTATGAATCTGTGCTGGTGTATCCGCCGAGTCATAAAGTATTCCCTATTAGTCATTGCTGTCTGTTTATTTCAGTTCATAACAATCATAGACTGGCTCAATTTCTCCAGCTTTTGTTTCCAGGACATCCGCAGATGATATCAAGTCTCTCAGACTTGTAACTGCCATTAAAACACCTTATCTTCCGGATGGTAGATTTGATCTTGAAGCATATGATGCTTTGGTGAATCGGCAGATTGAAAATGGAGCTGAAGCTGTGATTGTGGGCGGTACGACTGGTGAAGGTCAGTTGATGAGCTGGGATGAGCACATAATGCTTATTGGTCACACTGTCAACTGTTTTGGTGGATCAATCAAGGTCATAGGCAACACTGGAAGTAACTCTACAAGAGAAGCAATTCATGCTTCGGAGCAGGGATTTGCTGTTGGAATGCACGCTGCCCTTCATAT
AAATCCTTATTATGGCAAAACTTCCATCGAGGGAATGATCTCACACTTCAATTGTGTGCTTTCTATGGGCCCAACTATTATATATAACGTACCACCTCGAACTGGTCAAGATATCCCCCCACATGTCATTCAAACTGTTGCTCAAAGTCCCAATCTGGCAGGTGTGAAGGAGTGCGTTGGAAACAATCGTGTTGAGCAGTACTCGAACCAAGGAATTGTGGTCTGGAGTGGAAACGACGATCAGTGTCATGATGCCAGGTGGAATCATGGAGCAACTGGAGTTATTTCTGTTACTAGCAACTTGGTGCCCGGTTTAATGAGAGAGCTCATGTTTGGAGGGAAGAACCCTTCCCTAAATGCAAAACTGATGCCTTTGATGGACTGGTTGTTCTTCGAACCCAACCCAATAGGCCTCAACACGGCGCTTGCTCAACTCGGGGTCGTGAGGCCTGTGTTTAGGTTACCATACATACCTCTTCCAAAAGCAAAGAGGGAGGAGTTTGTGAATTTAGTCAAGCAAATTGGGCGGGAAAACTTCGTCGGTGAAAAAGACGTACAAGTTCTCGACGACGACGATTTCATTTTGGTTAGTCGGTATTAACAGCACGGTTCCATGCTGCCTTACTGCTTCAGATTTTTCAGGTCTGTACTCTATTCCCCATGCCCTTAATCTTAAGATTTCTTGTTCGTTCTCTGTAGTGTAAGGCAGGCTGCTATGAGTTTTGTGGTGTTGAATTTGTAGTAATAATAAAATATGAATTAATGGACGTGAAACCCAAGTCTTCAACTCGTTTCTTGGCGTCTAAATGGATGCCTGTTGCTGTTGGTTGGGGCCTCATTCTGTTGAGATATCCAATTCCAAATTTTTAAAAATTATATTAAAGGTTTCCCCCTTACGTTACGTTCTTGTTTGTTGATTATATTCTCGTCTCTATAAGTACATTGAGAATATATTAAAGGGGGGGTTTTTTATTTTAATATTTTCTAGAAAATTCTATTTATTGTTCTTATTCTCCTCCCACTTTATTGACTGTTTTGTAACTATTTCATTATTGTTTTTTTAATAGAAAAAACCCC

mRNA sequence

ATGGACTTCTCCGTCTGCTCCCTCCGACGTTCTGTCACCGGAACTATTTCGACCACAACGACGACCACCGCAGCAATCGCAAACATCGCCATCATCATTCTGTTCATCATCTGCGAAAGTTTTGTTCTCGGTTGGGTGTTCCAAATGACCAGTATTCAAGGCTTTGGCGCGTGCTTGCAGGAGCATGCTCTTCAGTTTCCGCGTCCCAGTTACAACGACAGCTGCAGGAGACGCAAGAGGACTGCCTTATGGAGATCTCCACAGGCTGCCATCCTACCCAATGCACACCTCCCAATGCGCAGTTTGGAAGTCAAGAACAGGACATCCGCAGATGATATCAAGTCTCTCAGACTTGTAACTGCCATTAAAACACCTTATCTTCCGGATGGTAGATTTGATCTTGAAGCATATGATGCTTTGGTGAATCGGCAGATTGAAAATGGAGCTGAAGCTGTGATTGTGGGCGGTACGACTGGTGAAGGTCAGTTGATGAGCTGGGATGAGCACATAATGCTTATTGGTCACACTGTCAACTGTTTTGGTGGATCAATCAAGGTCATAGGCAACACTGGAAGTAACTCTACAAGAGAAGCAATTCATGCTTCGGAGCAGGGATTTGCTGTTGGAATGCACGCTGCCCTTCATATAAATCCTTATTATGGCAAAACTTCCATCGAGGGAATGATCTCACACTTCAATTGTGTGCTTTCTATGGGCCCAACTATTATATATAACGTACCACCTCGAACTGGTCAAGATATCCCCCCACATGTCATTCAAACTGTTGCTCAAAGTCCCAATCTGGCAGGTGTGAAGGAGTGCGTTGGAAACAATCGTGTTGAGCAGTACTCGAACCAAGGAATTGTGGTCTGGAGTGGAAACGACGATCAGTGTCATGATGCCAGGTGGAATCATGGAGCAACTGGAGTTATTTCTGTTACTAGCAACTTGGTGCCCGGTTTAATGAGAGAGCTCATGTTTGGAGGGAAGAACCCTTCCCTAAATGCAAAACTGATGCCTTTGATGGACTGGTTGTTCTTCGAACCCAACCCAATAGGCCTCAACACGGCGCTTGCTCAACTCGGGGTCGTGAGGCCTGTGTTTAGGTTACCATACATACCTCTTCCAAAAGCAAAGAGGGAGGAGTTTGTGAATTTAGTCAAGCAAATTGGGCGGGAAAACTTCGTCGGTGAAAAAGACGTACAAGTTCTCGACGACGACGATTTCATTTTGGTTAGTCGGTATTAACAGCACGGTTCCATGCTGCCTTACTGCTTCAGATTTTTCAGGTCTGTACTCTATTCCCCATGCCCTTAATCTTAAGATTTCTTGTTCGTTCTCTGTAGTGTAAGGCAGGCTGCTATGAGTTTTGTGGTGTTGAATTTGTAGTAATAATAAAATATGAATTAATGGACGTGAAACCCAAGTCTTCAACTCGTTTCTTGGCGTCTAAATGGATGCCTGTTGCTGTTGGTTGGGGCCTCATTCTGTTGAGATATCCAATTCCAAATTTTTAAAAATTATATTAAAGGTTTCCCCCTTACGTTACGTTCTTGTTTGTTGATTATATTCTCGTCTCTATAAGTACATTGAGAATATATTAAAGGGGGGGTTTTTTATTTTAATATTTTCTAGAAAATTCTATTTATTGTTCTTATTCTCCTCCCACTTTATTGACTGTTTTGTAACTATTTCATTATTGTTTTTTTAATAGAAAAAACCCC

Coding sequence (CDS)

ATGGACTTCTCCGTCTGCTCCCTCCGACGTTCTGTCACCGGAACTATTTCGACCACAACGACGACCACCGCAGCAATCGCAAACATCGCCATCATCATTCTGTTCATCATCTGCGAAAGTTTTGTTCTCGGTTGGGTGTTCCAAATGACCAGTATTCAAGGCTTTGGCGCGTGCTTGCAGGAGCATGCTCTTCAGTTTCCGCGTCCCAGTTACAACGACAGCTGCAGGAGACGCAAGAGGACTGCCTTATGGAGATCTCCACAGGCTGCCATCCTACCCAATGCACACCTCCCAATGCGCAGTTTGGAAGTCAAGAACAGGACATCCGCAGATGATATCAAGTCTCTCAGACTTGTAACTGCCATTAAAACACCTTATCTTCCGGATGGTAGATTTGATCTTGAAGCATATGATGCTTTGGTGAATCGGCAGATTGAAAATGGAGCTGAAGCTGTGATTGTGGGCGGTACGACTGGTGAAGGTCAGTTGATGAGCTGGGATGAGCACATAATGCTTATTGGTCACACTGTCAACTGTTTTGGTGGATCAATCAAGGTCATAGGCAACACTGGAAGTAACTCTACAAGAGAAGCAATTCATGCTTCGGAGCAGGGATTTGCTGTTGGAATGCACGCTGCCCTTCATATAAATCCTTATTATGGCAAAACTTCCATCGAGGGAATGATCTCACACTTCAATTGTGTGCTTTCTATGGGCCCAACTATTATATATAACGTACCACCTCGAACTGGTCAAGATATCCCCCCACATGTCATTCAAACTGTTGCTCAAAGTCCCAATCTGGCAGGTGTGAAGGAGTGCGTTGGAAACAATCGTGTTGAGCAGTACTCGAACCAAGGAATTGTGGTCTGGAGTGGAAACGACGATCAGTGTCATGATGCCAGGTGGAATCATGGAGCAACTGGAGTTATTTCTGTTACTAGCAACTTGGTGCCCGGTTTAATGAGAGAGCTCATGTTTGGAGGGAAGAACCCTTCCCTAAATGCAAAACTGATGCCTTTGATGGACTGGTTGTTCTTCGAACCCAACCCAATAGGCCTCAACACGGCGCTTGCTCAACTCGGGGTCGTGAGGCCTGTGTTTAGGTTACCATACATACCTCTTCCAAAAGCAAAGAGGGAGGAGTTTGTGAATTTAGTCAAGCAAATTGGGCGGGAAAACTTCGTCGGTGAAAAAGACGTACAAGTTCTCGACGACGACGATTTCATTTTGGTTAGTCGGTATTAA
BLAST of CmoCh14G020470 vs. Swiss-Prot
Match: DAPA_TOBAC (4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic OS=Nicotiana tabacum GN=DHPS1 PE=2 SV=1)

HSP 1 Score: 608.2 bits (1567), Expect = 6.6e-173
Identity = 287/338 (84.91%), Positives = 318/338 (94.08%), Query Frame = 1

Query: 78  RKRTALWRSPQAAILPNAHLPMRSLEVKNRTSADDIKSLRLVTAIKTPYLPDGRFDLEAY 137
           ++RT  WRSP+AA++P+ HLPMRS EVKNRT ADDIK+LRL+TAIKTPYLPDGRFDLEAY
Sbjct: 22  KRRTTRWRSPRAAVIPSFHLPMRSNEVKNRTFADDIKALRLITAIKTPYLPDGRFDLEAY 81

Query: 138 DALVNRQIENGAEAVIVGGTTGEGQLMSWDEHIMLIGHTVNCFGGSIKVIGNTGSNSTRE 197
           D LVN QIENGAE VIVGGTTGEGQLMSWDEHIMLIGHTVNCFGGSIKVIGNTGSNSTRE
Sbjct: 82  DTLVNLQIENGAEGVIVGGTTGEGQLMSWDEHIMLIGHTVNCFGGSIKVIGNTGSNSTRE 141

Query: 198 AIHASEQGFAVGMHAALHINPYYGKTSIEGMISHFNCVLSMGPTIIYNVPPRTGQDIPPH 257
           AIHA+EQGFAVGMHAALHINPYYGKTS+EG+ISHF  VL MGPTIIYNVP RTGQDIPP 
Sbjct: 142 AIHATEQGFAVGMHAALHINPYYGKTSLEGLISHFESVLPMGPTIIYNVPSRTGQDIPPR 201

Query: 258 VIQTVAQSPNLAGVKECVGNNRVEQYSNQGIVVWSGNDDQCHDARWNHGATGVISVTSNL 317
           VIQT+A+SPNLAGVKECVGN+RVEQY++ G+VVWSGNDD+CH +RW++GATGVISVTSNL
Sbjct: 202 VIQTMAKSPNLAGVKECVGNDRVEQYTSDGVVVWSGNDDECHVSRWDYGATGVISVTSNL 261

Query: 318 VPGLMRELMFGGKNPSLNAKLMPLMDWLFFEPNPIGLNTALAQLGVVRPVFRLPYIPLPK 377
           VPGLMRELMFGGKNP+LN+KLMPLM+WLF EPNPI LNTALAQLGVVRPVFRLPY+PL K
Sbjct: 262 VPGLMRELMFGGKNPALNSKLMPLMEWLFHEPNPIALNTALAQLGVVRPVFRLPYVPLTK 321

Query: 378 AKREEFVNLVKQIGRENFVGEKDVQVLDDDDFILVSRY 416
           AKREEFV +VK+IGRENF+GE+DVQ+LDD+DFILV RY
Sbjct: 322 AKREEFVKIVKEIGRENFIGERDVQILDDNDFILVGRY 359
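
The percentages in the HSP statistics line are the matched counts divided by the alignment length. The sketch below recomputes them from the text of such a line; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the counts in a line such as
# "Identity = 287/338 (84.91%), Positives = 318/338 (94.08%), Query Frame = 1".
# "Posi?tives" also accepts the misspelling "Postives" seen in some reports.
_STATS = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = _STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))


line = "Identity = 287/338 (84.91%), Positives = 318/338 (94.08%), Query Frame = 1"
print(hsp_percentages(line))  # (84.91, 94.08)
```

For the DAPA_TOBAC hit this reproduces the reported values: 287 of 338 aligned positions are identical and 318 of 338 are identical or conservative substitutions.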

BLAST of CmoCh14G020470 vs. Swiss-Prot
Match: DAPA2_ARATH (4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic OS=Arabidopsis thaliana GN=DHDPS2 PE=1 SV=2)

HSP 1 Score: 603.6 bits (1555), Expect = 1.6e-171
Identity = 287/367 (78.20%), Positives = 327/367 (89.10%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++G+G C  + ALQFP P   +S +RR  ++ W SP+AA++PN HLPMRSLEVKNRT
Sbjct: 1   MAALKGYGLCSMDSALQFPCPKLFNSYKRR--SSKWVSPKAAVVPNFHLPMRSLEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           + DDIK+LR++TAIKTPYLPDGRFDLEAYD LVN QI+NGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  NTDDIKALRVITAIKTPYLPDGRFDLEAYDDLVNIQIQNGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTSIEG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSIEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           I+HF  VL MGPTIIYNVP RTGQDIPP  I  ++Q+PNLAGVKECVGN RVE+Y+  G+
Sbjct: 181 IAHFQSVLHMGPTIIYNVPGRTGQDIPPRAIFKLSQNPNLAGVKECVGNKRVEEYTENGV 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDD+CHD+RW++GATGVISVTSNLVPGLMR+LMF G+N SLN+KL+PLM WLF E
Sbjct: 241 VVWSGNDDECHDSRWDYGATGVISVTSNLVPGLMRKLMFEGRNSSLNSKLLPLMAWLFHE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIG+NTALAQLGV RPVFRLPY+PLP +KR EFV LVK+IGRE+FVGEKDVQ LDDDD
Sbjct: 301 PNPIGINTALAQLGVSRPVFRLPYVPLPLSKRLEFVKLVKEIGREHFVGEKDVQALDDDD 360

Query: 409 FILVSRY 416
           FIL+ RY
Sbjct: 361 FILIGRY 365

BLAST of CmoCh14G020470 vs. Swiss-Prot
Match: DAPA1_ARATH (4-hydroxy-tetrahydrodipicolinate synthase 1, chloroplastic OS=Arabidopsis thaliana GN=DHDPS1 PE=2 SV=2)

HSP 1 Score: 591.3 bits (1523), Expect = 8.4e-168
Identity = 284/367 (77.38%), Positives = 323/367 (88.01%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M++++ +G    + AL FPR +   S +RR   A W SP AA++PN HLPMRSLE KNRT
Sbjct: 1   MSALKNYGLISIDSALHFPRSNQLQSYKRRN--AKWVSPIAAVVPNFHLPMRSLEDKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           + DDI+SLR++TAIKTPYLPDGRFDL+AYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  NTDDIRSLRVITAIKTPYLPDGRFDLQAYDDLVNTQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGG IKVIGNTGSNSTREAIHA+EQGFA+GMH ALHINPYYGKTSIEGM
Sbjct: 121 HIMLIGHTVNCFGGRIKVIGNTGSNSTREAIHATEQGFAMGMHGALHINPYYGKTSIEGM 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
            +HF  VL MGPTIIYNVP RT QDIPP VI  ++Q+PN+AGVKECVGNNRVE+Y+ +GI
Sbjct: 181 NAHFQTVLHMGPTIIYNVPGRTCQDIPPQVIFKLSQNPNMAGVKECVGNNRVEEYTEKGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHD+RW+HGATGVISVTSNLVPGLMR+LMF G+N +LNAKL+PLMDWLF E
Sbjct: 241 VVWSGNDDQCHDSRWDHGATGVISVTSNLVPGLMRKLMFEGRNSALNAKLLPLMDWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIG+NTALAQLGV RPVFRLPY+PLP +KR EFV LVK+IGRE+FVG++DVQVLDDDD
Sbjct: 301 PNPIGVNTALAQLGVARPVFRLPYVPLPLSKRIEFVKLVKEIGREHFVGDRDVQVLDDDD 360

Query: 409 FILVSRY 416
           FIL+ RY
Sbjct: 361 FILIGRY 365

BLAST of CmoCh14G020470 vs. Swiss-Prot
Match: DAPA_SOYBN (4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic OS=Glycine max GN=DHPS1 PE=2 SV=1)

HSP 1 Score: 568.2 bits (1463), Expect = 7.6e-161
Identity = 268/327 (81.96%), Positives = 299/327 (91.44%), Query Frame = 1

Query: 89  AAILPNAHLPMRSLEVKNRTSADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENG 148
           AA+ PN HLPMRS E+KNRTS +DIK+LRL+TAIKTPYLPDGRFDLEAYD LVN QI  G
Sbjct: 6   AAVKPNFHLPMRSFELKNRTSPEDIKALRLITAIKTPYLPDGRFDLEAYDDLVNMQIGQG 65

Query: 149 AEAVIVGGTTGEGQLMSWDEHIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAV 208
           AE VIVGGTTGEGQLMSW+EHI+LI HTVNCFGG IKVIGNTGSNSTREAIHA+EQGFAV
Sbjct: 66  AEGVIVGGTTGEGQLMSWEEHIILIAHTVNCFGGKIKVIGNTGSNSTREAIHATEQGFAV 125

Query: 209 GMHAALHINPYYGKTSIEGMISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNL 268
           GMHAALHINPYYGKTS++GM++HF  VLSMGPTIIYNVP RTGQDIPPHVIQT+A+S NL
Sbjct: 126 GMHAALHINPYYGKTSLDGMVAHFRSVLSMGPTIIYNVPARTGQDIPPHVIQTLAESVNL 185

Query: 269 AGVKECVGNNRVEQYSNQGIVVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFG 328
           AGVKECVGN+R++QY++ GIVVWSGNDDQCHDARW +GATGV+SV SNLVPGLMRELMFG
Sbjct: 186 AGVKECVGNDRIKQYTDDGIVVWSGNDDQCHDARWGYGATGVVSVASNLVPGLMRELMFG 245

Query: 329 GKNPSLNAKLMPLMDWLFFEPNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVK 388
           G NP+LN+KL+PL+DWLF  PNPIGLNTALAQLGV+RPVFRLP++PLP  KR EF NLVK
Sbjct: 246 GVNPTLNSKLLPLIDWLFHMPNPIGLNTALAQLGVIRPVFRLPFVPLPVDKRIEFANLVK 305

Query: 389 QIGRENFVGEKDVQVLDDDDFILVSRY 416
           +IGRE+FVG K V+VLDDDDF LVSRY
Sbjct: 306 EIGREHFVGNKVVEVLDDDDFFLVSRY 332

BLAST of CmoCh14G020470 vs. Swiss-Prot
Match: DAPA2_WHEAT (4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic OS=Triticum aestivum PE=1 SV=1)

HSP 1 Score: 535.8 bits (1379), Expect = 4.2e-151
Identity = 255/327 (77.98%), Positives = 288/327 (88.07%), Query Frame = 1

Query: 89  AAILPNAHLPMRSLEVKNRTSADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENG 148
           AAI  + +LPMRS EVKNRTS D IKSLRL+TA+KTPYLPDGRFDLEAYD+L+N QI  G
Sbjct: 51  AAITTDDYLPMRSTEVKNRTSVDGIKSLRLITAVKTPYLPDGRFDLEAYDSLINTQINGG 110

Query: 149 AEAVIVGGTTGEGQLMSWDEHIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAV 208
           AE VIVGGTTGEG LMSWDEHIMLIGHTVNCFG +IKVIGNTGSNSTREAIHASEQGFAV
Sbjct: 111 AEGVIVGGTTGEGHLMSWDEHIMLIGHTVNCFGTNIKVIGNTGSNSTREAIHASEQGFAV 170

Query: 209 GMHAALHINPYYGKTSIEGMISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNL 268
           GMHAALH+NPYYGKTS  G+ISHF+ VL MGPTIIYNVP RTGQDIPP VI+ ++  PN+
Sbjct: 171 GMHAALHVNPYYGKTSTAGLISHFDEVLPMGPTIIYNVPSRTGQDIPPAVIEALSTYPNM 230

Query: 269 AGVKECVGNNRVEQYSNQGIVVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFG 328
           AGVKECVG+ RV+ Y+++GI +WSGNDD+CHD+RW +GATGVISVTSNLVPGLMR LMF 
Sbjct: 231 AGVKECVGHERVKCYTDKGITIWSGNDDECHDSRWKYGATGVISVTSNLVPGLMRSLMFE 290

Query: 329 GKNPSLNAKLMPLMDWLFFEPNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVK 388
           G+N +LN KL+PLM WLF EPNPIGLNTALAQLGVVRPVFR PY PL   KR EFV +V+
Sbjct: 291 GENAALNEKLLPLMKWLFSEPNPIGLNTALAQLGVVRPVFRRPYAPLSLEKRTEFVRIVE 350

Query: 389 QIGRENFVGEKDVQVLDDDDFILVSRY 416
            IGRENFVG+K+V+VLDDDDF+L+SRY
Sbjct: 351 AIGRENFVGQKEVRVLDDDDFVLISRY 377

BLAST of CmoCh14G020470 vs. TrEMBL
Match: A0A061E1S6_THECC (Dihydrodipicolinate synthase isoform 1 OS=Theobroma cacao GN=TCM_005490 PE=4 SV=1)

HSP 1 Score: 637.5 bits (1643), Expect = 1.1e-179
Identity = 305/367 (83.11%), Positives = 335/367 (91.28%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++ +G  L E   QFP P+  D+ +RR   A WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MATLKSYGVRLGESTHQFPLPNRGDNYKRRN--AKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           S++DIKSLRL+TAIKTPYLPDGRFDLEAYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  SSEDIKSLRLITAIKTPYLPDGRFDLEAYDGLVNMQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+ VL MGPTIIYNVP RTGQDIPP VI TVAQSPNLAGVKECVGN+R+EQY++ GI
Sbjct: 181 VSHFDSVLPMGPTIIYNVPSRTGQDIPPRVINTVAQSPNLAGVKECVGNDRIEQYTDNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNL+PGLMRELMFGGKNPSLN KL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLIPGLMRELMFGGKNPSLNVKLLPLIEWLFEE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIGLNTALAQLGVVRPVFRLPY+PLP AKR EFVNLV+QIGR+NFVGEKDVQVLD+DD
Sbjct: 301 PNPIGLNTALAQLGVVRPVFRLPYVPLPLAKRVEFVNLVRQIGRQNFVGEKDVQVLDNDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 365

BLAST of CmoCh14G020470 vs. TrEMBL
Match: A0A061DTV2_THECC (Dihydrodipicolinate synthase isoform 2 OS=Theobroma cacao GN=TCM_005490 PE=4 SV=1)

HSP 1 Score: 637.5 bits (1643), Expect = 1.1e-179
Identity = 305/367 (83.11%), Positives = 335/367 (91.28%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++ +G  L E   QFP P+  D+ + R R A WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MATLKSYGVRLGESTHQFPLPNRGDNYKSR-RNAKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           S++DIKSLRL+TAIKTPYLPDGRFDLEAYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  SSEDIKSLRLITAIKTPYLPDGRFDLEAYDGLVNMQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+ VL MGPTIIYNVP RTGQDIPP VI TVAQSPNLAGVKECVGN+R+EQY++ GI
Sbjct: 181 VSHFDSVLPMGPTIIYNVPSRTGQDIPPRVINTVAQSPNLAGVKECVGNDRIEQYTDNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNL+PGLMRELMFGGKNPSLN KL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLIPGLMRELMFGGKNPSLNVKLLPLIEWLFEE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIGLNTALAQLGVVRPVFRLPY+PLP AKR EFVNLV+QIGR+NFVGEKDVQVLD+DD
Sbjct: 301 PNPIGLNTALAQLGVVRPVFRLPYVPLPLAKRVEFVNLVRQIGRQNFVGEKDVQVLDNDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 366

BLAST of CmoCh14G020470 vs. TrEMBL
Match: A0A0D2R5Q5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G160600 PE=4 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 9.6e-179
Identity = 304/367 (82.83%), Positives = 333/367 (90.74%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M  ++ +G  L+E   QFPRP+  D+ +RR     WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MAILKSYGVRLRESTPQFPRPNLCDNYKRRN--VKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           SADDIKSLRL+TAIKTPYLPDGRFDLEAYD L++ QIENGAEAVIVGGTTGEGQLMSWDE
Sbjct: 61  SADDIKSLRLITAIKTPYLPDGRFDLEAYDDLMHMQIENGAEAVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS++G+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLDGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           ISHF+ VL MGPTIIYNVP RTGQDIPPHVI  VAQSPNLAG+KECVGN+R+EQY+  GI
Sbjct: 181 ISHFDSVLPMGPTIIYNVPSRTGQDIPPHVINNVAQSPNLAGIKECVGNDRIEQYTGNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNLVPGLMRELMFGGKNPSLNAKL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLLPLIEWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNP+GLNTALAQLGVVRPVFRLPY+PLP  KR  FVNLVK+IGRENFVG+ DVQVLDDDD
Sbjct: 301 PNPVGLNTALAQLGVVRPVFRLPYVPLPLEKRVGFVNLVKEIGRENFVGKNDVQVLDDDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 365

BLAST of CmoCh14G020470 vs. TrEMBL
Match: B9GPI1_POPTR (Dihydrodipicolinate synthase family protein OS=Populus trichocarpa GN=POPTR_0002s15050g PE=4 SV=1)

HSP 1 Score: 633.6 bits (1633), Expect = 1.6e-178
Identity = 303/367 (82.56%), Positives = 333/367 (90.74%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M ++  +  CL+E  LQFPRP+  D+ +RR     WRSPQAA +P+ HLPMRS EVKNRT
Sbjct: 1   MAAMMSYSVCLRESTLQFPRPNCGDNYKRRG--GKWRSPQAAAIPDLHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           SA+DIKSLRL+TAIKTPYLPDGRFDLEAYDALVN QI NGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  SAEDIKSLRLITAIKTPYLPDGRFDLEAYDALVNMQIVNGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFG S+KVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EGM
Sbjct: 121 HIMLIGHTVNCFGSSVKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSVEGM 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+CVL MGPTIIYNVP RTGQDIPP VI T+AQSPNLAGVKECVGN+RVEQY+++GI
Sbjct: 181 VSHFDCVLPMGPTIIYNVPSRTGQDIPPRVIHTIAQSPNLAGVKECVGNDRVEQYTDKGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARWNHGATGVISVTSNL+PGLMR+LMF GKN  LN+KL+PL+DWLF E
Sbjct: 241 VVWSGNDDQCHDARWNHGATGVISVTSNLLPGLMRKLMFEGKNSELNSKLLPLIDWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPI LNTALAQLGVVRPVFRLPY+PLP AKR EFVNLVK+IGRENFVGE +VQVLDDDD
Sbjct: 301 PNPIALNTALAQLGVVRPVFRLPYMPLPLAKRIEFVNLVKKIGRENFVGENNVQVLDDDD 360

Query: 409 FILVSRY 416
           FIL+SRY
Sbjct: 361 FILISRY 365

BLAST of CmoCh14G020470 vs. TrEMBL
Match: A0A0B0MRT2_GOSAR (Dihydrodipicolinate synthase 2, chloroplastic-like protein OS=Gossypium arboreum GN=F383_21399 PE=4 SV=1)

HSP 1 Score: 633.3 bits (1632), Expect = 2.1e-178
Identity = 302/367 (82.29%), Positives = 333/367 (90.74%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M  ++ +G  L+E   QFPRP+  D+ +RR     WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MAILKSYGVRLRESTPQFPRPNLCDNYKRRN--VKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           SADDIKSLRL+TAIKTPYLPDGRFDLEAYD L++ QIENGAEAVIVGGTTGEGQLMSWDE
Sbjct: 61  SADDIKSLRLITAIKTPYLPDGRFDLEAYDDLMHMQIENGAEAVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS++G+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLDGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           ISHF+ VL MGPTIIYNVP RTGQDIPP VI  VAQSPNLAG+KEC+GN+R+EQY+  GI
Sbjct: 181 ISHFDSVLPMGPTIIYNVPSRTGQDIPPRVINNVAQSPNLAGIKECIGNDRIEQYTGNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNLVPGLMRELMFGGKNPSLNAKL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLLPLIEWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNP+GLNTALAQLGVVRPVFRLPY+PLP  KR EFVNLVK+IGRENFVG+ D+QVLDDDD
Sbjct: 301 PNPVGLNTALAQLGVVRPVFRLPYVPLPLEKRVEFVNLVKEIGRENFVGKNDIQVLDDDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 365

BLAST of CmoCh14G020470 vs. TAIR10
Match: AT2G45440.1 (AT2G45440.1 dihydrodipicolinate synthase)

HSP 1 Score: 603.6 bits (1555), Expect = 9.2e-173
Identity = 287/367 (78.20%), Positives = 327/367 (89.10%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++G+G C  + ALQFP P   +S +RR  ++ W SP+AA++PN HLPMRSLEVKNRT
Sbjct: 1   MAALKGYGLCSMDSALQFPCPKLFNSYKRR--SSKWVSPKAAVVPNFHLPMRSLEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           + DDIK+LR++TAIKTPYLPDGRFDLEAYD LVN QI+NGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  NTDDIKALRVITAIKTPYLPDGRFDLEAYDDLVNIQIQNGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTSIEG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSIEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           I+HF  VL MGPTIIYNVP RTGQDIPP  I  ++Q+PNLAGVKECVGN RVE+Y+  G+
Sbjct: 181 IAHFQSVLHMGPTIIYNVPGRTGQDIPPRAIFKLSQNPNLAGVKECVGNKRVEEYTENGV 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDD+CHD+RW++GATGVISVTSNLVPGLMR+LMF G+N SLN+KL+PLM WLF E
Sbjct: 241 VVWSGNDDECHDSRWDYGATGVISVTSNLVPGLMRKLMFEGRNSSLNSKLLPLMAWLFHE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIG+NTALAQLGV RPVFRLPY+PLP +KR EFV LVK+IGRE+FVGEKDVQ LDDDD
Sbjct: 301 PNPIGINTALAQLGVSRPVFRLPYVPLPLSKRLEFVKLVKEIGREHFVGEKDVQALDDDD 360

Query: 409 FILVSRY 416
           FIL+ RY
Sbjct: 361 FILIGRY 365

BLAST of CmoCh14G020470 vs. TAIR10
Match: AT3G60880.2 (AT3G60880.2 dihydrodipicolinate synthase 1)

HSP 1 Score: 591.3 bits (1523), Expect = 4.7e-169
Identity = 284/367 (77.38%), Positives = 323/367 (88.01%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M++++ +G    + AL FPR +   S +RR   A W SP AA++PN HLPMRSLE KNRT
Sbjct: 1   MSALKNYGLISIDSALHFPRSNQLQSYKRRN--AKWVSPIAAVVPNFHLPMRSLEDKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           + DDI+SLR++TAIKTPYLPDGRFDL+AYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  NTDDIRSLRVITAIKTPYLPDGRFDLQAYDDLVNTQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGG IKVIGNTGSNSTREAIHA+EQGFA+GMH ALHINPYYGKTSIEGM
Sbjct: 121 HIMLIGHTVNCFGGRIKVIGNTGSNSTREAIHATEQGFAMGMHGALHINPYYGKTSIEGM 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
            +HF  VL MGPTIIYNVP RT QDIPP VI  ++Q+PN+AGVKECVGNNRVE+Y+ +GI
Sbjct: 181 NAHFQTVLHMGPTIIYNVPGRTCQDIPPQVIFKLSQNPNMAGVKECVGNNRVEEYTEKGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHD+RW+HGATGVISVTSNLVPGLMR+LMF G+N +LNAKL+PLMDWLF E
Sbjct: 241 VVWSGNDDQCHDSRWDHGATGVISVTSNLVPGLMRKLMFEGRNSALNAKLLPLMDWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIG+NTALAQLGV RPVFRLPY+PLP +KR EFV LVK+IGRE+FVG++DVQVLDDDD
Sbjct: 301 PNPIGVNTALAQLGVARPVFRLPYVPLPLSKRIEFVKLVKEIGREHFVGDRDVQVLDDDD 360

Query: 409 FILVSRY 416
           FIL+ RY
Sbjct: 361 FILIGRY 365

BLAST of CmoCh14G020470 vs. NCBI nr
Match: gi|659075443|ref|XP_008438146.1| (PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic-like [Cucumis melo])

HSP 1 Score: 688.7 bits (1776), Expect = 6.2e-195
Identity = 335/369 (90.79%), Positives = 352/369 (95.39%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRR--KRTALWRSPQAAILPNAHLPMRSLEVKN 108
           M S++G+GACL+EHALQFPRPS NDS +R+  KRT  WRSPQAAILPN HLPMRSLEVKN
Sbjct: 1   MASVKGYGACLREHALQFPRPSCNDSYKRQRTKRTVGWRSPQAAILPNLHLPMRSLEVKN 60

Query: 109 RTSADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSW 168
           RT ADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGA+ VIVGGTTGEGQLMSW
Sbjct: 61  RTIADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGADGVIVGGTTGEGQLMSW 120

Query: 169 DEHIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIE 228
           DEHIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIE
Sbjct: 121 DEHIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIE 180

Query: 229 GMISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQ 288
           G+ISHFNCVLSMGPTIIYNVP RTGQDIPP+VIQTVA+S NLAGVKECVGN+RVEQY+ Q
Sbjct: 181 GLISHFNCVLSMGPTIIYNVPGRTGQDIPPYVIQTVAESANLAGVKECVGNDRVEQYTKQ 240

Query: 289 GIVVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLF 348
           GIV+WSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMF GKNPSLNAKL+PLMDWLF
Sbjct: 241 GIVIWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFEGKNPSLNAKLLPLMDWLF 300

Query: 349 FEPNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDD 408
            EPNPIGLNTALAQLGVVRPVFRLPY+PLPKAKREEFV LV+QIGRE+FVG KDVQVLDD
Sbjct: 301 CEPNPIGLNTALAQLGVVRPVFRLPYVPLPKAKREEFVKLVEQIGREHFVGVKDVQVLDD 360

Query: 409 DDFILVSRY 416
           DDFILVSRY
Sbjct: 361 DDFILVSRY 369

BLAST of CmoCh14G020470 vs. NCBI nr
Match: gi|449432235|ref|XP_004133905.1| (PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic [Cucumis sativus])

HSP 1 Score: 685.6 bits (1768), Expect = 5.2e-194
Identity = 331/367 (90.19%), Positives = 348/367 (94.82%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M S++G+GACL+EHALQFPRPS N   +R KRT  WRSPQAAILPN HLPMRSLEVKNRT
Sbjct: 1   MASVKGYGACLREHALQFPRPSCNAKRQRTKRTVGWRSPQAAILPNLHLPMRSLEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
            ADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGA+ VIVGGTTGEGQLMSWDE
Sbjct: 61  IADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGADGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           ISHFNCVLSMGPTIIYNVP RTGQDIPP+VIQTVA+S NLAGVKECVGN+R+EQY+ QGI
Sbjct: 181 ISHFNCVLSMGPTIIYNVPGRTGQDIPPYVIQTVAESANLAGVKECVGNDRIEQYTKQGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           V+WSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMF GKNPSLNAKL+PLMDWLF E
Sbjct: 241 VIWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFEGKNPSLNAKLLPLMDWLFCE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIGLNTALAQLGVVRPVFRLPY+PLPK KREEFV LV+QIGRE+FVG KDVQVLDDDD
Sbjct: 301 PNPIGLNTALAQLGVVRPVFRLPYVPLPKTKREEFVKLVEQIGREHFVGVKDVQVLDDDD 360

Query: 409 FILVSRY 416
           FILVSRY
Sbjct: 361 FILVSRY 367

BLAST of CmoCh14G020470 vs. NCBI nr
Match: gi|590722897|ref|XP_007052027.1| (Dihydrodipicolinate synthase isoform 2 [Theobroma cacao])

HSP 1 Score: 637.5 bits (1643), Expect = 1.6e-179
Identity = 305/367 (83.11%), Positives = 335/367 (91.28%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++ +G  L E   QFP P+  D+ + R R A WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MATLKSYGVRLGESTHQFPLPNRGDNYKSR-RNAKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           S++DIKSLRL+TAIKTPYLPDGRFDLEAYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  SSEDIKSLRLITAIKTPYLPDGRFDLEAYDGLVNMQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+ VL MGPTIIYNVP RTGQDIPP VI TVAQSPNLAGVKECVGN+R+EQY++ GI
Sbjct: 181 VSHFDSVLPMGPTIIYNVPSRTGQDIPPRVINTVAQSPNLAGVKECVGNDRIEQYTDNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNL+PGLMRELMFGGKNPSLN KL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLIPGLMRELMFGGKNPSLNVKLLPLIEWLFEE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIGLNTALAQLGVVRPVFRLPY+PLP AKR EFVNLV+QIGR+NFVGEKDVQVLD+DD
Sbjct: 301 PNPIGLNTALAQLGVVRPVFRLPYVPLPLAKRVEFVNLVRQIGRQNFVGEKDVQVLDNDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 366

BLAST of CmoCh14G020470 vs. NCBI nr
Match: gi|590722894|ref|XP_007052026.1| (Dihydrodipicolinate synthase isoform 1 [Theobroma cacao])

HSP 1 Score: 637.5 bits (1643), Expect = 1.6e-179
Identity = 305/367 (83.11%), Positives = 335/367 (91.28%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++ +G  L E   QFP P+  D+ +RR   A WRSPQAA++PN HLPMRS EVKNRT
Sbjct: 1   MATLKSYGVRLGESTHQFPLPNRGDNYKRRN--AKWRSPQAAVIPNFHLPMRSFEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           S++DIKSLRL+TAIKTPYLPDGRFDLEAYD LVN QIENGAE VIVGGTTGEGQLMSWDE
Sbjct: 61  SSEDIKSLRLITAIKTPYLPDGRFDLEAYDGLVNMQIENGAEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EG+
Sbjct: 121 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSLEGL 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+ VL MGPTIIYNVP RTGQDIPP VI TVAQSPNLAGVKECVGN+R+EQY++ GI
Sbjct: 181 VSHFDSVLPMGPTIIYNVPSRTGQDIPPRVINTVAQSPNLAGVKECVGNDRIEQYTDNGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARW+HGATGVISVTSNL+PGLMRELMFGGKNPSLN KL+PL++WLF E
Sbjct: 241 VVWSGNDDQCHDARWSHGATGVISVTSNLIPGLMRELMFGGKNPSLNVKLLPLIEWLFEE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPIGLNTALAQLGVVRPVFRLPY+PLP AKR EFVNLV+QIGR+NFVGEKDVQVLD+DD
Sbjct: 301 PNPIGLNTALAQLGVVRPVFRLPYVPLPLAKRVEFVNLVRQIGRQNFVGEKDVQVLDNDD 360

Query: 409 FILVSRY 416
           FILV RY
Sbjct: 361 FILVGRY 365

BLAST of CmoCh14G020470 vs. NCBI nr
Match: gi|743804981|ref|XP_011017505.1| (PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic-like [Populus euphratica])

HSP 1 Score: 637.1 bits (1642), Expect = 2.1e-179
Identity = 304/367 (82.83%), Positives = 335/367 (91.28%), Query Frame = 1

Query: 49  MTSIQGFGACLQEHALQFPRPSYNDSCRRRKRTALWRSPQAAILPNAHLPMRSLEVKNRT 108
           M +++ +  CL+E  LQFPRP   D+ +RR     WRSPQAA++P+ HLPMRSLEVKNRT
Sbjct: 1   MAAMKSYSVCLRESTLQFPRPYCGDNYKRRG--GKWRSPQAAVIPDVHLPMRSLEVKNRT 60

Query: 109 SADDIKSLRLVTAIKTPYLPDGRFDLEAYDALVNRQIENGAEAVIVGGTTGEGQLMSWDE 168
           SA+DIKSLRL+TAIKTPYLPDGRFDLEAYDALVN QI NG+E VIVGGTTGEGQLMSWDE
Sbjct: 61  SAEDIKSLRLITAIKTPYLPDGRFDLEAYDALVNMQIVNGSEGVIVGGTTGEGQLMSWDE 120

Query: 169 HIMLIGHTVNCFGGSIKVIGNTGSNSTREAIHASEQGFAVGMHAALHINPYYGKTSIEGM 228
           HIMLIGHTVNCFG S+KVIGNTGSNSTREAIHA+EQGFAVGMHAALHINPYYGKTS+EGM
Sbjct: 121 HIMLIGHTVNCFGSSVKVIGNTGSNSTREAIHATEQGFAVGMHAALHINPYYGKTSVEGM 180

Query: 229 ISHFNCVLSMGPTIIYNVPPRTGQDIPPHVIQTVAQSPNLAGVKECVGNNRVEQYSNQGI 288
           +SHF+CVL MGPTIIYNVP RTGQDIPP VI T+AQSPNLAGVKECVGN+RVEQY+++GI
Sbjct: 181 VSHFDCVLPMGPTIIYNVPSRTGQDIPPRVIHTIAQSPNLAGVKECVGNDRVEQYTDKGI 240

Query: 289 VVWSGNDDQCHDARWNHGATGVISVTSNLVPGLMRELMFGGKNPSLNAKLMPLMDWLFFE 348
           VVWSGNDDQCHDARWNHGATGVISVTSNL+PGLMR+LMF GKN  LN+KL+PL+DWLF E
Sbjct: 241 VVWSGNDDQCHDARWNHGATGVISVTSNLLPGLMRKLMFEGKNTELNSKLLPLIDWLFQE 300

Query: 349 PNPIGLNTALAQLGVVRPVFRLPYIPLPKAKREEFVNLVKQIGRENFVGEKDVQVLDDDD 408
           PNPI LNTALAQLGVVRPVFRLPY+PLP AKR EFVNLVK+IGRENFVGEK VQVLDDDD
Sbjct: 301 PNPIALNTALAQLGVVRPVFRLPYVPLPLAKRIEFVNLVKKIGRENFVGEKKVQVLDDDD 360

Query: 409 FILVSRY 416
           FIL+SRY
Sbjct: 361 FILISRY 365
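
The Identity and Positives figures in each HSP header can be recomputed from the alignment itself: a letter on the middle line marks an identical pair, and a "+" marks a conservative substitution. The sketch below is a minimal illustration of that counting, assuming the three rows of an alignment block have already been stripped of their "Query:"/"Sbjct:" labels and coordinates. The function names `midline_stats` and `percent` are hypothetical helpers, not part of any BLAST tool.

```python
from typing import Tuple


def midline_stats(query: str, midline: str, sbjct: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one aligned block.

    Assumes `query` and `sbjct` are gapped rows of equal length and that
    `midline` uses letters for identities and '+' for positive matches.
    """
    if len(query) != len(sbjct):
        raise ValueError("query and sbjct rows must have the same length")
    columns = len(query)
    # Trailing blanks are often trimmed from the middle line; pad it back.
    mid = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in mid if ch.isalpha())
    positives = identities + mid.count("+")
    return identities, positives, columns


def percent(part: int, whole: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * part / whole, 2)


# First eight columns of the Theobroma cacao isoform 1 alignment above.
ident, pos, cols = midline_stats("MTSIQGFG", "M +++ +G", "MATLKSYG")
print(ident, pos, cols)

# Header counts reported for that hit: 305 identities, 335 positives, 367 columns.
print(percent(305, 367), percent(335, 367))
```

Summing the per-block counts over all blocks of a hit should reproduce the header totals, provided the rows are copied without the coordinate columns.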

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
DAPA_TOBAC   6.6e-173  84.91     4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic OS=Nicotiana tabacum GN...
DAPA2_ARATH  1.6e-171  78.20     4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic OS=Arabidopsis thalia...
DAPA1_ARATH  8.4e-168  77.38     4-hydroxy-tetrahydrodipicolinate synthase 1, chloroplastic OS=Arabidopsis thalia...
DAPA_SOYBN   7.6e-161  81.96     4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic OS=Glycine max GN=DHPS1...
DAPA2_WHEAT  4.2e-151  77.98     4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic OS=Triticum aestivum ...

Match Name        E-value   Identity  Description
A0A061E1S6_THECC  1.1e-179  83.11     Dihydrodipicolinate synthase isoform 1 OS=Theobroma cacao GN=TCM_005490 PE=4 SV=...
A0A061DTV2_THECC  1.1e-179  83.11     Dihydrodipicolinate synthase isoform 2 OS=Theobroma cacao GN=TCM_005490 PE=4 SV=...
A0A0D2R5Q5_GOSRA  9.6e-179  82.83     Uncharacterized protein OS=Gossypium raimondii GN=B456_004G160600 PE=4 SV=1
B9GPI1_POPTR      1.6e-178  82.56     Dihydrodipicolinate synthase family protein OS=Populus trichocarpa GN=POPTR_0002...
A0A0B0MRT2_GOSAR  2.1e-178  82.29     Dihydrodipicolinate synthase 2, chloroplastic-like protein OS=Gossypium arboreum...

Match Name   E-value   Identity  Description
AT2G45440.1  9.2e-173  78.20     dihydrodipicolinate synthase
AT3G60880.2  4.7e-169  77.38     dihydrodipicolinate synthase 1

Match Name                        E-value   Identity  Description
gi|659075443|ref|XP_008438146.1|  6.2e-195  90.79     PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic-like [Cucumi...
gi|449432235|ref|XP_004133905.1|  5.2e-194  90.19     PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase, chloroplastic [Cucumis sat...
gi|590722897|ref|XP_007052027.1|  1.6e-179  83.11     Dihydrodipicolinate synthase isoform 2 [Theobroma cacao]
gi|590722894|ref|XP_007052026.1|  1.6e-179  83.11     Dihydrodipicolinate synthase isoform 1 [Theobroma cacao]
gi|743804981|ref|XP_011017505.1|  2.1e-179  82.83     PREDICTED: 4-hydroxy-tetrahydrodipicolinate synthase 2, chloroplastic-like [Popu...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002220  DapA-like
IPR005263  DapA
IPR013785  Aldolase_TIM
IPR020624  Schiff_base-form_aldolases_CS
IPR020625  Schiff_base-form_aldolases_AS
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO:0009089  lysine biosynthetic process via diaminopimelate
Vocabulary: Molecular Function
Term        Definition
GO:0016829  lyase activity
GO:0008840  4-hydroxy-tetrahydrodipicolinate synthase
GO:0003824  catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009089 lysine biosynthetic process via diaminopimelate
biological_process GO:0019877 diaminopimelate biosynthetic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
molecular_function GO:0008840 4-hydroxy-tetrahydrodipicolinate synthase
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016829 lyase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G020470.1  CmoCh14G020470.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                                  Source    Source Term       Source Description                                                  Alignment
IPR002220  DapA-like                                        PRINTS    PR00146           DHPICSNTHASE                                                        coord: 239..256, score: 5.1E-36; coord: 216..232, score: 5.1E-36; coord: 148..169, score: 5.1E-36; coord: 184..202, score: 5.1
IPR002220  DapA-like                                        PANTHER   PTHR12128         DIHYDRODIPICOLINATE SYNTHASE                                        coord: 100..405, score: 6.1E
IPR002220  DapA-like                                        PFAM      PF00701           DHDPS                                                               coord: 116..389, score: 2.0
IPR002220  DapA-like                                        SMART     SM01130           DHDPS_2                                                             coord: 115..391, score: 1.5
IPR005263  4-hydroxy-tetrahydrodipicolinate synthase, DapA  TIGRFAMs  TIGR00674         TIGR00674                                                           coord: 118..388, score: 6.0
IPR013785  Aldolase-type TIM barrel                         GENE3D    G3DSA:3.20.20.70                                                                      coord: 118..391, score: 3.2
IPR020624  Schiff base-forming aldolase, conserved site     PROSITE   PS00665           DHDPS_1                                                             coord: 151..168, scor
IPR020625  Schiff base-forming aldolase, active site        PROSITE   PS00666           DHDPS_2                                                             coord: 244..274, scor
None       No IPR available                                 PANTHER   PTHR12128:SF15    4-HYDROXY-TETRAHYDRODIPICOLINATE SYNTHASE 1, CHLOROPLASTIC-RELATED  coord: 100..405, score: 6.1E
None       No IPR available                                 unknown   SSF51569          Aldolase                                                            coord: 119..390, score: 2.09

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh14G020470  CmoCh06G011530  Cucurbita moschata (Rifu)  cmocmoB224