Cp4.1LG04g07100 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g07100
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Phospho-2-dehydro-3-deoxyheptonate aldolase
Location: Cp4.1LG04 : 4390632 .. 4401435 (+)
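
The location field can be split into a sequence ID, start, end, and strand. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location like 'Cp4.1LG04 : 4390632 .. 4401435 (+)' into parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the record).
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG04 : 4390632 .. 4401435 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under that assumption the feature spans 10804 positions on the plus strand of Cp4.1LG04.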
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGCAAGTAAATGGAAGACTCTTGCTTAAACCCAGAGCAAATAGAAGCACAAAAAGCATAATTACTGCAGCCTTCACCACCTTGGAACCTCTTTCTGTGCCAAGCCTTACCTGCATAATAAGTTATCCAACACAGACAGATACAACATTAATTTCTCCAAATGTTTTTGACAAGCTGAAGCTGAGCAAAGACATGTTATACCAAAGGGGACATTTTTCCAACAGCCCTGTCTTCCTCAACCTATGCACAACAATGTCGGTGTTGGTCAAACATGATCGTTAATGTAAGACAGCAGAAGAGTTAAGGATACGTGTTTTTGTTTTGTTTTTGTCTCGACGAGGGGGGAAAGTACCTGATGGAAATGGCTGCAGAAGAGAATTAGAGTTGTCGTCGTTCCGACAAGGATAGAGGCAGAAAGGATCGTATTGGTTATGGGAAGATATCTCATCTCCCTGTGGATCAGACAGACAGTCAATATAGAAAAAAAAACCAGACAACAACTGCTAATCCAAACCAAATGATTGTAACCTTGCGTTGCTCTGTAATAAGTAAAATGCTGTTGTAGCAAATGGACCGAAAGCAGCAAAGCACAAGGGTTCTCCCAATCCTTGGTAGCTTAACCGGAACGGTGGACACTGCAACCGAGTCAATTAACAGAGCTAAAAGAATAAATAGCTTGTTATAGTGTTCGTGTATAGAATAAAATGGGTTGGTCGTGTTAGGAATCACGAATCTCGAAAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCGTGGCTTTGTTTTTTGTTTCCCCAAAAGGCCTCGTACCATTGGAGATGTACTCCTTACTTATAAACCCACGATCATCCCCTTAATTAGTCGATGTGGGACTCCTCTCCCAACAATCCTCAACAATTCACCCATCGAACAAAGTACACCATAGAGCCTCTCCTGAGGCCTATGGAGCCCTAGAACAGCCTCCCCTTAATCGAGGTTTACTCATTTGGAGCTCTCGAACAAAGTAAATCATTTGTTCGACACTTTAGTCACTTTTGACTACACCTTCGAGACTCACAACTTCTTTGTTCGACCTTTGAGGATTTAATTGACATGGCTAATTTAAGGGTATGACTCTAATATCATGTTAGAAATCACGACCCTCCACAATGGTATATTGTACACTTTGAGCATAAGCTCTCATGGTTTTGCTTTTGGTTTCCCAAAAGGCCTTGTATCAATGGAGATGTACTCCTTACTTATAAACCCATGATCAACCCCTTAATTAGTCGATGTCGGACTCCTCTCCCAACAATCCTCAACGATTCTCCCCTCAAACAAAGTACACTATAGAGCCTCCCTTGAGGCCTATGGAACCCTCGAACAGCCTCTCCTTAATCAAGGCTCGACTCCTTTCGAGCCCTCGAACAAAGTACACCATTTGTTCGATACTTTAGTCACTTTTGACTACACCTTCGAGACTCACAACTTCTTTGTTTGACATCTGAGGATTCAATTGACATGGCTAAGTTAAGGGTATGGCTCTAATACCATGTTAGAAATCACGACCCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTTAGTTCTCCGGAAGGTCTCATACCAATGAAGATGTATTCCTTACTTATAAACCCATCATCAACCCCATAATTAGCCGATGTGAGAATCCTCTCCGAACAATCCTCAACACTGTCAACAGCTCAATCAATTACTTCTTTCGAATGCTAAAAGAAGGATCAGTGGCGTCTCGATTATTTCATTTCAAATTCATTGGAATCAACATGAATTATGAAGCAAAGAAATAGATCCCAGATGTTACCTGATATATGTAGCCACAAGTAATTGAAAAGGCTAGAAATAATATGGCACGCAGGCTCCCAGCCTCCAGAGATGCCCATGTCAGCCCCAAGAAGCCAAGAGCAAGAAAAGTGTAAGCAGCAACAAGTGTCCCTATAGGGCTTATTGCACCACA
ATAACACAAACAAGTCAATAAGTTTTGTATAGAAACAAGAAAAATGACAACGATTACGAGATTCGAAGGGTCCTTAGACGAGAAAGTAGTGTGAACGTTACCTGCCAACAAGTTTTACAACCGACTCCTTTTTGTTTTTATCTGCTCCGGTACTGAAGTCGTACGCGTCGTTACTGCAAATGGAGCGTTTTAATTCGTTAGCTTCAAATGGGGGAAGGAGCTCATAGTTCATACTTTGTCCTGAACTCGTGTTAGAAATGAGAAAGCAAAGGCTTCCATGGACAGTAGTGTGAACTCTAGACTAGACAAGATGGCAATAGATAGAATTTATGTAACAACTCAAGTCCACCGCTAGCAGATACAAGGAAAATATTTTATTAAAACCCATTCCCGCATGGTAAGAAGATAGCCTAGTGTCCTATCGAAGTGCTAAGAGAATTCCCTAAGAGACCCAGAACTCACGGAAGGAAGCAAGCATCTGACTTTCTCGCCCACGTATCAAAGGCAGTAGAAAAGAAATGCCTTTATCGATAGGATTTGTCTTTCCCGTTCTTTTGTCCAAAGGGACGCCCTCTTTGGGCTTTTCCTTGCCGGCGTGCCCTTAAGGTTTTTAAAACGCGTCTACTAGGGAATAGACTGGCAGATATTATCCTCTTTCGGCTTTCCCTAGGGCGAGATTTCCACACCCTTATAAAGAATGCTTCATTCTCTCACAATCCACCCCCTTCGAGGCTCAGCGTCCTCACTTGCCCTCGTTCCTCTCTCCATTCAATGTGGAATCTCACAATCCACCCCCTTCGGAGCCCAACGTCCTCGCTAGCGCACCACCTAGTGTCTGGCTCTGATACCATTTGTAACAGTCTAAGCCCACCGCTAGCAAATATTGTCCTCTTTAGGCTTCCCCTCAAGGTTTTTAAAACGCGTCAGCTAGGGAGAGCTTTCCACACCATTATAAAGAATGCTTCGCTTTCCTCCCCAACCGATATGAGATCTCACAATTCACCATGAAACAAACGTCTACGCAACCATTAAGCTATCAAAGATCTAGGTTGAGTGTAAAAACAGACAGATGTAAGTCATACAAGAACTAGTTCTTCACCTTAAATTAAGCCAACTGATGACAAGAACCGAAGAAGCCAGGATTACAAAAAAAAGCCGAGTAGAAAAATTCCCAGACTCAAAGTAAGCAGCTGCACTTCCTACCTGTTTGTAATCAAAGAGAAAAATCTTCAGAAATGGAATAGAACACACTCAAAGATCAAGAACAATCAAGAACAATCAAGTTCATCACAATTGATGCAACACATTTCTCAAAAGATTGATATCAAATCTTGCACAACTAGCTGCCATAGAATGCAACGAGTATATATATACATACCATTAAAGGAACCAAAGCAACAGAGTAAATTGGTAACTTAATAGCCCTCCAAATCAGAGTCCCCATGGCAACACCCTCTTCTCCATTCTCTTGGGGCATCGAATTAAACTCCATTGCTTTACATTTCGACACAAAACCTCTAAAACGGACCTTTCCGCACACACCCAATTGCTTTCCACGACTCCCGCAAGCAAATTTCCGAGCATCGGTTGATAAACTGGACAACCTGAAACAACCCAGATAAGAGCTGTGATAGCAGAAGAAAATAGTACCAATATGAAGATGGGCAAGAACATGTAGTGGCTATTTGCAGTGTCGTGAGCTTGCCTTGGAGTGAGGGTGCTGTGGTGGGAGAGATACTCATCGAGATTCTTCTTTAGGGCGCCGAGATGGGTGGCTATGCAGAGCGTGGAGGCCATTGGCATCTCATAACGGAAGGCGAAATGCGGAGTGAATTGGTTCCGGGAGCAGCCCGTTTCTTCGAGGTGCGCGCGCTTTGTAGTGTAGATAGAGGATGAGATTAAGTGGGGGAATGTTTTGAGGGATCTTATCATCTTAAAACTCTAATTAGCCGTCTCTAGCGTGTTCTTTGGGTTCACGTCGATCATTAAAATGTTAAACG
TTTTTTCTATAATTAATTTATATATAAAAAAAAATGATTCTGATTTTGATTGATTGTATTAAGAATAACAGAACAAACACTCACCAAAGCCATCGGCCGCATCGATCTTCACTCTGTTTCTCTTTCTCAAACTTACCCCAAAATGGCTCTCCCAGCGACTTCCCCTCTCTCAAAATCCCTTCCAAAATCCCACTTTCCGATCACCCGTCCGCCTCCCTTCACCACCGCCACCGCCACCGCCCGATCGCTCAAACCCATCTCCGCAATTCATGCCGCCGACCCATCAAAGTCGTCAAAGCCATCGATTCTCGTACCGACGAAATGGACCCTCGATAGCTGGAAGTCGAAGAAGGCTCTGCAACTACCCGAATATCCTGATCAGGCGGTGCTTGAATCCGTCCTCAAGACCCTCGAATCCTTCCCTCCGATCGTGTTTGCTGGAGAAGCTAGGTCCCTTGAAGAGCGGCTGGCGCAGGCCGCTGTGGGGAAGGCGTTTCTTCTTCAGGGTGGGGATTGTGCTGAGAGCTTTAAGGAGTTTAATGCCAATAATATTCGTGATACCTTCCGTGTTCTTCTCCAGATGAGCGTTGTTCTTATGTTTGGAGGTCAAATGCCTGTTATCAAGGTTGATTTCTTGCATTGTGGTATGGTTTTGATGAATGATTTTGTCATGTTTTTGGGTTTTGCTTCTGTAAGATCATCGTTTGGTAAGCATGTTCATAATTGATTTGGCTTATTGGTTGATAGCGATTGAGAAATTGATATTACAGTGATGTCTTTTGTTTGTTGCATCCTAAAAGTCCTTTTGATTCAAAATCATCATGTTTATAAGTGATTTATGAGTGATCATAAAAGTTCAAAAGCATTTTAAGCGCTTTACAAGAAAGTATTTGTAGTGTTTTTCACTCAAGTGATTGTTAGGGAAGCTTTAAAAAGTTGAGCCAATCATTGAAGTGATTAGCCATTAAACACTTTTCTGGACTTATAATAAGTGTTCTTGTTGAAGTTACTTATTTTAATTGCACTTTTAAGCACACTCATGCATGGTTATGGTTTTTAATGTAACAACTCTAAGTCCACCGCTAGTAGATATTGTCTTTTTTGAGTTTTAATGTAACACATAGCCCACCGCGAGCAGATATTGTCTTCTTTGGGCTTTCCCTCAAGTTTTTTTAAAACGCGTTTGCTAGAAAGAGGTTTCCAAACCCTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGTGGGATCTCACAATCCACCCCCTTTTGGGACCCAGCGTTCTTGTTAGCACTCGTTCCCTTCTCGTGGGACCCCCCAATCCACTTTCCTTTGGGGCCCAACGTCCTTGCTGGTACACCACCTCGTGTCCACCCCCTTTGGGGCTCAACTTCCTCCCTAGCACATTGCCCGAAGTTAATTCGGACATAGCTCATGAGACAAGGTACCAATTCCCATCACAATGGTCAAACTGGGGGGTCCTACATCGATTGGAGAAGGGAACGAGTGCCAGCGAGGACGTTGGGCCTCAAAGGGGTAGATTGTGAGATCCCACATTAGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAAACGTATTTTAAAAACCTTGAGAGGAAGCCTGAAAGGAAAAACCCAAAGAGNGGGGATTGTGCTGAGAGCTTTAAGGAGTTTAATGCCAATAATATTCGTGATACCTTCCGTGTTCTTCTCCAGATGAGCGTTGTTCTTATGTTTGGAGGTCAAATGCCTGTTATCAAGGTTGATTTCTTGCATTGTGGTATGGTTTTGATGAATGATTTTGTCATGTTTTTGGGTTTTGCTTCTGTAAGATCATCGTTTGGTAAGCATGTTCATAATTGATTTGGCTTATTGGTTGATAGCGATTGAGAAATTGATATTACAGTGATGTCTTTTGTTTGTTGCATCCTAAAAGTCCTTTTGATTCAAAATCATCATGTTTATAAGTGATTTATGA
GTGATCATAAAAGTTCAAAAGCATTTTAAGCGCTTTACAAGAAAGTATTTGTAGTGTTTTTCACTCAAGTGATTGTTAGGGAAGCTTTAAAAAGTTGAGCCAATCATTGAAGTGATTAGCCATTAAACACTTTTCTGGACTTATAATAAGTGTTCTTGTTGAAGTTACTTATTTTAATTGCACTTTTAAGCACACTCATGCATGGTTATGGTTTTTAATGTAACAACTCTAAGTCCACCGCTAGTAGATATTGTCTTTTTTGAGTTTTAATGTAACACATAGCCCACCGCGAGCAGATATTGTCTTCTTTGGGCTTTCCCTCAAGTTTTTTTAAAACGCGTTTGCTAGAAAGAGGTTTCCAAACCCTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGTGGGATCTCACAATCCACCCCCTTTTGGGACCCAGCGTTCTTGTTAGCACTCGTTCCCTTCTCGTGGGACCCCCCAATCCACTTTCCTTTGGGGCCCAACGTCCTTGCTGGTACACCACCTCGTGTCCACCCCCTTTGGGGCTCAACTTCCTCCCTAGCACATTGCCCGAAGTTAATTCGGACATAGCTCATGAGACAAGGTACCAATTCCCATCACAATGGTCAAACTGGGGGGTCCTACATCGATTGGAGAAGGGAACGAGTGCCAGCGAGGACGTTGGGCCTCAAAGGGGTAGATTGTGAGATCCCACATTAGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAAACGTATTTTAAAAACCTTGAGAGGAAGCCTGAAAGGAAAAACCCAAAGAGGACAATATCTGTTAGGGGTAGGCTTGGACGGTTACCTTTAATCTTCTAAACATCTTCTAAACATTTTCTTAACACGCATCCCTGCAATTTTCATTAGAGCTTAATGACTTAATGGAATGGTCACTGGAATTCCCTGTTATGTGGCCACTATGAAGTTTTCCCTTTCTTTTACTTATTTATTTTGTAGACCGAGTTTCTCAACGCTGGAGATTGGAGAAAAAGTTATATAACTCTTTCCTTGGAGACTGCTGAATGATTCCAGGGTTATTGTATGGATGTTCGTTTGGCGAGTATGTAATCGGACTTACGTATGAATGTAGGTCGGGAGAATGGCAGGTCAGTTTGCAAAGCCAAGATCGGACCCTTACGAGGAGAAGGATGGAGTGAAACTCCCAAGCTATAGAGGAGACAATATAAATGGAGATTCATTTGATGAGAAATCTAGAGTTCCCGACCCCGATCGAATGAATAGAGCCTACTGCCAGTCGGTCGCAACATTGAACCTCCTGAGGGCATTTGCCACAGGAGGTTATGCTGCAATGCAGAGAGTTACACAGTGGAATCTTGATTTCACCGAGCATAGCGAGCAGGGCGACAGGTTTGACTCTCGAGGACTTTGTTCTTTCATTTGCCTTTTGCATTGATATTAACTTTTCATTGTAGTTGCATAGTCTGGTTATATATTTTTCCACCACGTGCCAACAGATACCGAGAACTAGCTCATCGAGTTGACGAGGCCCTTGGATTCATGTCTGCTGCTGGACTCACTGTCGATCACCCTATCATGACCTCAACCGAGTTTTGGACGTCTCATGAGTGTTTGCTCCTCCCTTATGAGCAAGCACTTACTAGGGAGGATTCCACTTCTGGGATATACTACGACTGCTCGGCTCACATGCTTTGGGTCGGAGAACGAACTCGCCAACTCGATGGTGCACACGTTGAGTTTTTGAGAGGTGTTGCTAATCCTCTTGGCATTAAGGTACTTCCATCTTTCCTCTTTTCCCATTTTGTCTTGAAACTACACTTTGGTTTTATCTTTGTTGACACTTTGGTAATACAATCTCACTTGGAGACAGGATCTTTCATATAGGTTATACGACTTCGAGTTTGAACTTTTTATGGATGTTCTTATAACTTTTTATTTATCAAGTTTGAACTCGATATTCTTTTGCTA
CTTGTGAGATCTCACATCGGTTGGAGAGGGGAACGAAGTATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGCGTTTTAAAACCGTGAGACTGATGGCGATATGTAACGGGCTAAGGTGGACAATATTTGCTAGCGGTGGTCTTGGGCTGTTACAATGGTATCAAAGCTAGACACTGGGCGGTGTGCCAGCGAGGACGCTGGGCACCCAAGGAGGGTGGATTGTGAGATCCCACATCGGTTGAAGAGGGAACGAAGCATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGCAGATGCGTTTTAAAACCGTGAGACTGATGGCGATATGTAACGGGCTAAGGTGGACAATATTTGCTAGCGGTGGTCTTGGGCTGTTACAATAGTATCAAAGCTAGACACTGGGCGGTGTGCCAGCGAGGACGCTGGGCCTCCAAGGAGGGTGGTTTGTGAGATCCTACGTTGGTTGAAGAGGGAAACGAAGCATTCCTTATAAGGGTATGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCATGAGGCTGACGGCGATATGTAACGAGTTAAGGTGGATAATATCTGCTAGCAGTGGTCTTGGGCTGTTACAAATGGTATCAGAGCCAGACACTGGGCAGTGTGCCAGCGAGGACGCTTGGCCCCTAAGGAAGGTGGTTTGTGAGATCCCACATCGGTTGGAGAGGGGAACGAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCGTGAGGTTGACGGTGATACGCAACGGGCTACAGCGGACAATATCTGCGGGCGGTTACCTGCTAAAGATTTATGGTTCTTGAATTATTCCTCTGGTGGATGAAGAAATTTGATGACTGATAAATTATAGCATTTTGATTACAGGCAAGTGATAAGATGGATCCCAAAGAATTGGTGAAGCTTATTGACATCCTAAATCCCACGAACAAGCCTGGAAGAATTACAGTGATAGTAAGAATGGGAGCTGAGAACATGAGAGTGAAACTTCCACATCTAATCAGGGAAGTTCGCCAAGCTGGTCAAATCGTCACTTGGGTTAGCGATCCGATGCACGGGAATACAATCAAAGCTCCCTGTGGACTAAAAACGCGTTCGTTCGATGCAATAAGGGTATGATCATTCTTTCTGATTTATATATATTCGATTTGGTTATTAGAATAGCTTGCTTATTACATTCATTTTCGGTTTAACTAGTAGTTTTCCACCCTAAGTGCTTGGGATGGAGAGGGAAATGGGTGTAGAAACCTCTCCCTAGCAGACGTGTTTTAAAACCTTGAGAGGAAGCCCAAAATGGACAATATCTGCTAGCGGTGGGTTTGGGATGTTATAAATGGTATCAAAGCCAGACACTGGACGGTGTTTCAGTGAGGACGCTCACCCCCAAGGGGGTGGATTATGAGATCCCACATCGTTTGGAGAGGGGAACGGGTGTGGACACCTCTCCCTAGTAGACGTCTTTTAAAACCTTGAGGGAAAGCCCAAATAGGACAATATCTGCTAGCGGTGGGTTTGGGATGTTACAAAGGGTATTAGAGCCAAACACTGAGCAGTGTGCCAATGAGGACGCTCGCCCCCCAAGGAAGGTGGATTGTGAGATCCAAGATCGGTTGGATAGGGAATGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACCATGAGGGGAAGCTTAAAGAGAACAATATCTGCTAGCGGTGGGTTTGGGATGTTACAATTAGAGCCAAACACCGAACAGTGTGCCAGTGAGGACGCTCGCCCTCAAGGGGGGTGAATTGTGAGATCCCCCACGTCGGTTGGAGAGGGGAACGGGTGTGGAAACCTCCACGTTTTAAAACCTTGAGGGGAAGCCTAAAGAGGACAATATCTGCTAGTGGTGGGTTTAGGATGTTAGAAATTGTATCAAAGCCAAACACCGAGCGGTGTGCCAGTGAGGGTGCTTGCCCCCATAGGGGTG
GATTGTGAGATCCCACATCGGTTGGAGAGGGGAACGGGTGTAGAAACCTCTACCTAGCATACGCGTTTTAAAACTTTGAGGAGAAACCTAGAAGGAAAAGCTCAAAGAGAACAATATCGGCTAGCAGTGGGTTTGAGATGTTACATTTTGACCTTCTTGGAGTGCGTAAACCTAAATATTATTACTTCCTTGCTAGATCTTCATTTCATTTAGAACAATATTTGTCTGCATTTAGTTAAGTTTACTAACAAAATTAGTCAGATACCTTTAGTAGTTTACTTCCGCATCATTTAGAGACCTAAATATCAAAACGTGACGATTTCAGGCCGAGTTGAGAGCATTTTTCGACATACACGATCAAGAGGGAAGCTACCCAGGAGGAGTTCATTTGGAGATGACAGGACAGAACGTAACCGAATGTGTTGGAGGCTCACGAACTATAACATATAACGATCTAAGCTCACGGTACCATACGCACTGCGACCCCCGACTCAATGCATCTCAGTCTCTAGAGCTTGCCTTCATCATCGCCGAGCGGTTGCGAAGAAGAAGACTCAGAGCAGGACAAGCTCCGGGAGGTTTCTAACAACAACCGATAACAGTAGCCTTCGTTAAATCGTTAAATCGTACGTAGCTAAAATGCATTCGGAACATATCTATATACGTTTTGGTTAGTGTTACACATTACATCTTAAAATGGTGTTTTGAGTGAACATAATAAAATGCAATAAATAACAATTTAGTCGTACATATTTTGTCCATTGTAAGAATACATTTTTCGATATATAATTTTTCATTTATTTT

mRNA sequence

ATGCAAGGCGCCGAGATGGGTGGCTATGCAGAGCGTGGAGGCCATTGGCATCTCATAACGGAAGGCGAAATGCGGAGTGAATTGGTTCCGGGAGCAGCCCAACAAACACTCACCAAAGCCATCGGCCGCATCGATCTTCACTCTGTTTCTCTTTCTCAAACTTACCCCAAAATGGCTCTCCCAGCGACTTCCCCTCTCTCAAAATCCCTTCCAAAATCCCACTTTCCGATCACCCGTCCGCCTCCCTTCACCACCGCCACCGCCACCGCCCGATCGCTCAAACCCATCTCCGCAATTCATGCCGCCGACCCATCAAAGTCGTCAAAGCCATCGATTCTCGTACCGACGAAATGGACCCTCGATAGCTGGAAGTCGAAGAAGGCTCTGCAACTACCCGAATATCCTGATCAGGCGGTGCTTGAATCCGTCCTCAAGACCCTCGAATCCTTCCCTCCGATCGTGTTTGCTGGAGAAGCTAGGTCCCTTGAAGAGCGGCTGGCGCAGGCCGCTGTGGGGAAGGCGTTTCTTCTTCAGGGTGGGGATTGTGCTGAGAGCTTTAAGGAGTTTAATGCCAATAATATTCGTGATACCTTCCGTGTTCTTCTCCAGATGAGCGTTGTTCTTATGTTTGGAGGTCAAATGCCTGTTATCAAGGTCGGGAGAATGGCAGGTCAGTTTGCAAAGCCAAGATCGGACCCTTACGAGGAGAAGGATGGAGTGAAACTCCCAAGCTATAGAGGAGACAATATAAATGGAGATTCATTTGATGAGAAATCTAGAGTTCCCGACCCCGATCGAATGAATAGAGCCTACTGCCAGTCGGTCGCAACATTGAACCTCCTGAGGGCATTTGCCACAGGAGGTTATGCTGCAATGCAGAGAGTTACACAGTGGAATCTTGATTTCACCGAGCATAGCGAGCAGGGCGACAGATACCGAGAACTAGCTCATCGAGTTGACGAGGCCCTTGGATTCATGTCTGCTGCTGGACTCACTGTCGATCACCCTATCATGACCTCAACCGAGTTTTGGACGTCTCATGAGTGTTTGCTCCTCCCTTATGAGCAAGCACTTACTAGGGAGGATTCCACTTCTGGGATATACTACGACTGCTCGGCTCACATGCTTTGGGTCGGAGAACGAACTCGCCAACTCGATGGTGCACACGTTGAGTTTTTGAGAGGTGTTGCTAATCCTCTTGGCATTAAGGCAAGTGATAAGATGGATCCCAAAGAATTGGTGAAGCTTATTGACATCCTAAATCCCACGAACAAGCCTGGAAGAATTACAGTGATAGTAAGAATGGGAGCTGAGAACATGAGAGTGAAACTTCCACATCTAATCAGGGAAGTTCGCCAAGCTGGTCAAATCGTCACTTGGGTTAGCGATCCGATGCACGGGAATACAATCAAAGCTCCCTGTGGACTAAAAACGCGTTCGTTCGATGCAATAAGGGCCGAGTTGAGAGCATTTTTCGACATACACGATCAAGAGGGAAGCTACCCAGGAGGAGTTCATTTGGAGATGACAGGACAGAACGTAACCGAATGTGTTGGAGGCTCACGAACTATAACATATAACGATCTAAGCTCACGGTACCATACGCACTGCGACCCCCGACTCAATGCATCTCAGTCTCTAGAGCTTGCCTTCATCATCGCCGAGCGGTTGCGAAGAAGAAGACTCAGAGCAGGACAAGCTCCGGGAGGTTTCTAACAACAACCGATAACAGTAGCCTTCGTTAAATCGTTAAATCGTACGTAGCTAAAATGCATTCGGAACATATCTATATACGTTTTGGTTAGTGTTACACATTACATCTTAAAATGGTGTTTTGAGTGAACATAATAAAATGCAATAAATAACAATTTAGTCGTACATATTTTGTCCATTGTAAGAATACATTTTTCGATATATAATTTTTCATTTATTTT

Coding sequence (CDS)

ATGCAAGGCGCCGAGATGGGTGGCTATGCAGAGCGTGGAGGCCATTGGCATCTCATAACGGAAGGCGAAATGCGGAGTGAATTGGTTCCGGGAGCAGCCCAACAAACACTCACCAAAGCCATCGGCCGCATCGATCTTCACTCTGTTTCTCTTTCTCAAACTTACCCCAAAATGGCTCTCCCAGCGACTTCCCCTCTCTCAAAATCCCTTCCAAAATCCCACTTTCCGATCACCCGTCCGCCTCCCTTCACCACCGCCACCGCCACCGCCCGATCGCTCAAACCCATCTCCGCAATTCATGCCGCCGACCCATCAAAGTCGTCAAAGCCATCGATTCTCGTACCGACGAAATGGACCCTCGATAGCTGGAAGTCGAAGAAGGCTCTGCAACTACCCGAATATCCTGATCAGGCGGTGCTTGAATCCGTCCTCAAGACCCTCGAATCCTTCCCTCCGATCGTGTTTGCTGGAGAAGCTAGGTCCCTTGAAGAGCGGCTGGCGCAGGCCGCTGTGGGGAAGGCGTTTCTTCTTCAGGGTGGGGATTGTGCTGAGAGCTTTAAGGAGTTTAATGCCAATAATATTCGTGATACCTTCCGTGTTCTTCTCCAGATGAGCGTTGTTCTTATGTTTGGAGGTCAAATGCCTGTTATCAAGGTCGGGAGAATGGCAGGTCAGTTTGCAAAGCCAAGATCGGACCCTTACGAGGAGAAGGATGGAGTGAAACTCCCAAGCTATAGAGGAGACAATATAAATGGAGATTCATTTGATGAGAAATCTAGAGTTCCCGACCCCGATCGAATGAATAGAGCCTACTGCCAGTCGGTCGCAACATTGAACCTCCTGAGGGCATTTGCCACAGGAGGTTATGCTGCAATGCAGAGAGTTACACAGTGGAATCTTGATTTCACCGAGCATAGCGAGCAGGGCGACAGATACCGAGAACTAGCTCATCGAGTTGACGAGGCCCTTGGATTCATGTCTGCTGCTGGACTCACTGTCGATCACCCTATCATGACCTCAACCGAGTTTTGGACGTCTCATGAGTGTTTGCTCCTCCCTTATGAGCAAGCACTTACTAGGGAGGATTCCACTTCTGGGATATACTACGACTGCTCGGCTCACATGCTTTGGGTCGGAGAACGAACTCGCCAACTCGATGGTGCACACGTTGAGTTTTTGAGAGGTGTTGCTAATCCTCTTGGCATTAAGGCAAGTGATAAGATGGATCCCAAAGAATTGGTGAAGCTTATTGACATCCTAAATCCCACGAACAAGCCTGGAAGAATTACAGTGATAGTAAGAATGGGAGCTGAGAACATGAGAGTGAAACTTCCACATCTAATCAGGGAAGTTCGCCAAGCTGGTCAAATCGTCACTTGGGTTAGCGATCCGATGCACGGGAATACAATCAAAGCTCCCTGTGGACTAAAAACGCGTTCGTTCGATGCAATAAGGGCCGAGTTGAGAGCATTTTTCGACATACACGATCAAGAGGGAAGCTACCCAGGAGGAGTTCATTTGGAGATGACAGGACAGAACGTAACCGAATGTGTTGGAGGCTCACGAACTATAACATATAACGATCTAAGCTCACGGTACCATACGCACTGCGACCCCCGACTCAATGCATCTCAGTCTCTAGAGCTTGCCTTCATCATCGCCGAGCGGTTGCGAAGAAGAAGACTCAGAGCAGGACAAGCTCCGGGAGGTTTCTAA

Protein sequence

MQGAEMGGYAERGGHWHLITEGEMRSELVPGAAQQTLTKAIGRIDLHSVSLSQTYPKMALPATSPLSKSLPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRLRAGQAPGGF
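
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch of that relationship, the following translates a DNA string with the standard genetic code, which is assumed here to be the code used for this nuclear gene. The function name `translate` and the short test fragments are illustrative; the fragments are copied from the start and end of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (e.g. N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 12 bases of the CDS shown above.
print(translate("ATGCAAGGCGCCGAGATGGGTGGC"))
print(translate("GGAGGTTTCTAA"))
```

Running the full CDS through `translate` and comparing the result with the listed protein is a quick consistency check on a record like this one.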
BLAST of Cp4.1LG04g07100 vs. Swiss-Prot
Match: AROG_ARATH (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thaliana GN=DHS2 PE=2 SV=2)

HSP 1 Score: 841.6 bits (2173), Expect = 4.9e-243
Identity = 422/513 (82.26%), Positives = 458/513 (89.28%), Query Frame = 1

Query: 58  MALPATSPLSKS--LPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSILVP 117
           + L A+SPL+    LP  H P  RP  F+          P+ A+H+ DP KS++ S    
Sbjct: 2   VTLNASSPLTTKSFLPYRHAP-RRPISFS----------PVFAVHSTDPKKSTQ-SASAS 61

Query: 118 TKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAF 177
            KW+L+SWKSKKALQLP+YPDQ  ++SVL+TL SFPPIVFAGEAR LE++L QAA+G+AF
Sbjct: 62  VKWSLESWKSKKALQLPDYPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAF 121

Query: 178 LLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYE 237
           +LQGGDCAESFKEFNANNIRDTFRVLLQM VVLMFGGQ+PVIKVGRMAGQFAKPRSDP+E
Sbjct: 122 MLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFE 181

Query: 238 EKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRV 297
           EKDGVKLPSYRGDNINGD+FDEKSR+PDP RM RAY QSVATLNLLRAFATGGYAAMQRV
Sbjct: 182 EKDGVKLPSYRGDNINGDAFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRV 241

Query: 298 TQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYE 357
           +QWNLDFT+HSEQGDRYRELA+RVDEALGFM AAGLT  HPIMT+TEFWTSHECLLLPYE
Sbjct: 242 SQWNLDFTQHSEQGDRYRELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYE 301

Query: 358 QALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVK 417
           QALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRG+ANPLGIK SDKM P ELVK
Sbjct: 302 QALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVK 361

Query: 418 LIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCG 477
           LI+ILNP NKPGRITVIVRMGAENMRVKLP+LIR VR AGQIVTWVSDPMHGNTI AP G
Sbjct: 362 LIEILNPQNKPGRITVIVRMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGG 421

Query: 478 LKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT 537
           LKTRSFDAIRAELRAFFD+HDQEGS+PGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT
Sbjct: 422 LKTRSFDAIRAELRAFFDVHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT 481

Query: 538 HCDPRLNASQSLELAFIIAERLRRRRLRAGQAP 569
           HCDPRLNASQSLELAFIIAERLR+RRL +G  P
Sbjct: 482 HCDPRLNASQSLELAFIIAERLRKRRLGSGNLP 502
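
The percentages in each HSP summary line are the ratio of matching (or conservatively substituted) positions to the alignment length. The sketch below recomputes them from the counts for the AROG_ARATH hit; `hsp_percentages` is a hypothetical helper, and the pattern accepts the positives label spelled either "Positives" or "Postives".

```python
import re

# Matches e.g. "Identity = 422/513" or "Positives = 458/513".
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return identity/positives percentages recomputed from the raw counts."""
    result = {}
    for label, matched, length in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result

summary = "Identity = 422/513 (82.26%), Positives = 458/513 (89.28%), Query Frame = 1"
print(hsp_percentages(summary))
```

The recomputed values agree with the percentages printed in the report (82.26 and 89.28).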

BLAST of Cp4.1LG04g07100 vs. Swiss-Prot
Match: AROF_SOLTU (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Solanum tuberosum GN=SHKA PE=1 SV=2)

HSP 1 Score: 828.6 bits (2139), Expect = 4.3e-239
Identity = 412/533 (77.30%), Positives = 455/533 (85.37%), Query Frame = 1

Query: 58  MALPATSPLSKSLPKSHFPITRP---PPFTTA------TATARSLKPISAIHAAD----- 117
           MAL +TS  +  LP       +P    P   A      T T R ++PISA+H++D     
Sbjct: 1   MALSSTSTTNSLLPNRSLVQNQPLLPSPLKNAFFSNNSTKTVRFVQPISAVHSSDSNKIP 60

Query: 118 -----PSKSSKPSI---------LVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLES 177
                PSKSS P+          +  T+W +DSWKSKKALQLPEYP+Q  L SVLKT++ 
Sbjct: 61  IVSDKPSKSSPPAATATTAPAPAVTKTEWAVDSWKSKKALQLPEYPNQEELRSVLKTIDE 120

Query: 178 FPPIVFAGEARSLEERLAQAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLM 237
           FPPIVFAGEARSLEERL +AA+G+AFLLQGGDCAESFKEFNANNIRDTFR+LLQM  VLM
Sbjct: 121 FPPIVFAGEARSLEERLGEAAMGRAFLLQGGDCAESFKEFNANNIRDTFRILLQMGAVLM 180

Query: 238 FGGQMPVIKVGRMAGQFAKPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNR 297
           FGGQMPVIKVGRMAGQFAKPRSD +EEKDGVKLPSYRGDN+NGD+FD KSR PDP R+ R
Sbjct: 181 FGGQMPVIKVGRMAGQFAKPRSDSFEEKDGVKLPSYRGDNVNGDAFDVKSRTPDPQRLIR 240

Query: 298 AYCQSVATLNLLRAFATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAA 357
           AYCQS ATLNLLRAFATGGYAAMQR+ QWNLDFTEHSEQGDRYRELA RVDEALGFM+AA
Sbjct: 241 AYCQSAATLNLLRAFATGGYAAMQRINQWNLDFTEHSEQGDRYRELASRVDEALGFMTAA 300

Query: 358 GLTVDHPIMTSTEFWTSHECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAH 417
           GLT+DHPIM +TEFWTSHECLLLPYEQ+LTR DSTSG+YYDCSAH LWVGERTRQLDGAH
Sbjct: 301 GLTMDHPIMKTTEFWTSHECLLLPYEQSLTRRDSTSGLYYDCSAHFLWVGERTRQLDGAH 360

Query: 418 VEFLRGVANPLGIKASDKMDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIR 477
           VEFLRG+ANPLGIK SDKMDP  LVKLI+ILNP NK GRIT+I RMGAENMRVKLPHLIR
Sbjct: 361 VEFLRGIANPLGIKVSDKMDPSALVKLIEILNPQNKAGRITIITRMGAENMRVKLPHLIR 420

Query: 478 EVRQAGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEM 537
            VR+AGQIVTWVSDPMHGNTIKAPCGLKTR FD+IRAE+RAFFD+HDQEGS+PGGVHLEM
Sbjct: 421 AVRRAGQIVTWVSDPMHGNTIKAPCGLKTRPFDSIRAEVRAFFDVHDQEGSHPGGVHLEM 480

Query: 538 TGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRL 563
           TGQNVTEC+GGSRT+T++DLSSRYHTHCDPRLNASQSLEL+FIIAERLR+RRL
Sbjct: 481 TGQNVTECIGGSRTVTFDDLSSRYHTHCDPRLNASQSLELSFIIAERLRKRRL 533

BLAST of Cp4.1LG04g07100 vs. Swiss-Prot
Match: AROG_ORYSJ (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Oryza sativa subsp. japonica GN=DAHPS2 PE=2 SV=1)

HSP 1 Score: 828.6 bits (2139), Expect = 4.3e-239
Identity = 404/506 (79.84%), Positives = 452/506 (89.33%), Query Frame = 1

Query: 80  PPPFTTATATARSLKPISAIHAADPSKSSKP-----------SILVPTK-------WTLD 139
           P P   AT      + +SA+HAADP+KS+ P           ++  P K       WT+D
Sbjct: 22  PQPRLAATFLPMRRRTVSAVHAADPAKSNGPVQAAAKASSPSTVAAPEKKPVGLGKWTVD 81

Query: 140 SWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGGD 199
           SWK+KKALQLPEYP Q  L+SVLKT+E+FPP+VFAGEAR LEERLA AA+G+AF+LQGGD
Sbjct: 82  SWKAKKALQLPEYPSQEELDSVLKTIETFPPVVFAGEARHLEERLADAAMGRAFVLQGGD 141

Query: 200 CAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKV-GRMAGQFAKPRSDPYEEKDGV 259
           CAESFKEFNANNIRDTFR+LLQM  VLMFGGQMPV+KV GRMAGQFAKPRSD +EE+DGV
Sbjct: 142 CAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPVVKVVGRMAGQFAKPRSDSFEERDGV 201

Query: 260 KLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNL 319
           KLPSYRGDNINGD+FDEKSRVPDP RM RAY QSVATLNLLRAFATGGYAAMQRVTQWNL
Sbjct: 202 KLPSYRGDNINGDTFDEKSRVPDPQRMIRAYAQSVATLNLLRAFATGGYAAMQRVTQWNL 261

Query: 320 DFTEHSEQGDR-YRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALT 379
           DF +HSEQGDR YRELAHRVDEALGFM+AAGLTVDHPIMT+T+FWTSHECLLLPYEQ+LT
Sbjct: 262 DFMDHSEQGDRRYRELAHRVDEALGFMTAAGLTVDHPIMTTTDFWTSHECLLLPYEQSLT 321

Query: 380 REDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDI 439
           REDSTSG++YDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIK SDKM+P++LVKLI+I
Sbjct: 322 REDSTSGLFYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMNPRDLVKLIEI 381

Query: 440 LNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTR 499
           LNP+NKPGRIT+I RMGAENMRVKLPHLIR VR +GQIVTW++DPMHGNTIKAPCGLKTR
Sbjct: 382 LNPSNKPGRITIITRMGAENMRVKLPHLIRAVRNSGQIVTWITDPMHGNTIKAPCGLKTR 441

Query: 500 SFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDP 559
            FD+I AE+RAFFD+HDQEGS+PGG+HLEMTGQNVTEC+GGSRT+T++DLS RYHTHCDP
Sbjct: 442 PFDSILAEVRAFFDVHDQEGSHPGGIHLEMTGQNVTECIGGSRTVTFDDLSDRYHTHCDP 501

Query: 560 RLNASQSLELAFIIAERLRRRRLRAG 566
           RLNASQSLELAFIIAERLRRRR+R+G
Sbjct: 502 RLNASQSLELAFIIAERLRRRRMRSG 527

BLAST of Cp4.1LG04g07100 vs. Swiss-Prot
Match: AROF_TOBAC (Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Nicotiana tabacum GN=DHAPS-1 PE=2 SV=1)

HSP 1 Score: 827.4 bits (2136), Expect = 9.6e-239
Identity = 410/522 (78.54%), Positives = 454/522 (86.97%), Query Frame = 1

Query: 60  LPATSPLSKSLPKSHFPITRPPPFTTATATARSLKPISAIHAADPSK--------SSKPS 119
           LP  S L ++      P+      T +T   R ++PISAIH++D SK        SSKPS
Sbjct: 13  LPNKSQLVQNQSLLPSPLKNVSFTTNSTKPVRFVQPISAIHSSDSSKNPIVSDKPSSKPS 72

Query: 120 -----------ILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEAR 179
                       +  T+WT++SWKSKKALQLPEYP+Q  L+SVLKT+E FPPIVFAGEAR
Sbjct: 73  PPAATVTAAATTVTKTEWTVESWKSKKALQLPEYPNQEELQSVLKTIEEFPPIVFAGEAR 132

Query: 180 SLEERLAQAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVG 239
           SLEERL +AA+G+AFLLQGGDCAESFKEFNANNIRDTFR+LLQM  VLMFGGQMPVIKVG
Sbjct: 133 SLEERLGEAAMGRAFLLQGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQMPVIKVG 192

Query: 240 RMAGQFAKPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNL 299
           RMAGQFAKPRSD +EEK+GVKLPSYRGDN+NGD+FD KSR PDP R+ RAYCQS ATLNL
Sbjct: 193 RMAGQFAKPRSDNFEEKNGVKLPSYRGDNVNGDAFDAKSRTPDPQRLIRAYCQSAATLNL 252

Query: 300 LRAFATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTS 359
           LRAFATGGYAAMQR+ QWNLDFTEHSEQGDRYRELA+RVDEALGFM+AAGLTVDHPIM +
Sbjct: 253 LRAFATGGYAAMQRINQWNLDFTEHSEQGDRYRELANRVDEALGFMAAAGLTVDHPIMKT 312

Query: 360 TEFWTSHECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPL 419
           TEFWTSHECLLLPYEQ+LTR DSTSG+YYDCSAH +WVGERTRQLDGAHVEFLRGVANPL
Sbjct: 313 TEFWTSHECLLLPYEQSLTRLDSTSGLYYDCSAHFIWVGERTRQLDGAHVEFLRGVANPL 372

Query: 420 GIKASDKMDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTW 479
           GIK SDKMDP  LVKLI+ILNP NK GRIT+I RMGAENMRVKLPHLIR VR+AGQIVTW
Sbjct: 373 GIKVSDKMDPSALVKLIEILNPDNKAGRITIITRMGAENMRVKLPHLIRAVRRAGQIVTW 432

Query: 480 VSDPMHGNTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGG 539
           VSDPMHGNTIKAPCGLKTR FD+IRAE+RAFFD+H+QEGS+PGGVHLEMTGQNVTEC+GG
Sbjct: 433 VSDPMHGNTIKAPCGLKTRPFDSIRAEVRAFFDVHEQEGSHPGGVHLEMTGQNVTECIGG 492

Query: 540 SRTITYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRL 563
           SRT+T++DLSSRYHTHCDPRLNASQSLELAFIIAERLR+RRL
Sbjct: 493 SRTVTFDDLSSRYHTHCDPRLNASQSLELAFIIAERLRKRRL 534

BLAST of Cp4.1LG04g07100 vs. Swiss-Prot
Match: AROG_SOLLC (Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum lycopersicum PE=2 SV=1)

HSP 1 Score: 827.0 bits (2135), Expect = 1.2e-238
Identity = 400/496 (80.65%), Positives = 443/496 (89.31%), Query Frame = 1

Query: 86  ATATARSLKPISAIHAAD----------PSKSSKPSI---------LVPTKWTLDSWKSK 145
           +T T R ++PI+A+H++D          P+KSS P+          +  T+W +DSWKSK
Sbjct: 38  STKTVRFVQPIAAVHSSDSNKNPIVSDKPTKSSPPAATATTAPAPAVTKTEWAVDSWKSK 97

Query: 146 KALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGGDCAESF 205
           KALQLPEYPDQ  L SVLKT++ FPPIVFAGEARSLEERL +AA+G+AFLLQGGDCAESF
Sbjct: 98  KALQLPEYPDQEELRSVLKTIDEFPPIVFAGEARSLEERLGEAAMGRAFLLQGGDCAESF 157

Query: 206 KEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEKDGVKLPSYR 265
           KEFNANNIRDTFR+LLQM  VLMFGGQMPVIKVGRMAGQFAKPRSD +EEKDGVKLPSYR
Sbjct: 158 KEFNANNIRDTFRILLQMGAVLMFGGQMPVIKVGRMAGQFAKPRSDSFEEKDGVKLPSYR 217

Query: 266 GDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNLDFTEHS 325
           GDN+NGD+FD KSR PDP R+ RAYCQS ATLNLLRAFATGGYAAMQR+ QWNLDFTEHS
Sbjct: 218 GDNVNGDAFDVKSRTPDPQRLIRAYCQSAATLNLLRAFATGGYAAMQRINQWNLDFTEHS 277

Query: 326 EQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALTREDSTSG 385
           EQGDRYRELA RVDEALGFM+AAGLT+DHPIM +TEFWTSHECLLLPYEQ+LTR DSTSG
Sbjct: 278 EQGDRYRELASRVDEALGFMTAAGLTMDHPIMKTTEFWTSHECLLLPYEQSLTRRDSTSG 337

Query: 386 IYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDILNPTNKP 445
           ++YDCSAH LWVGERTRQLDGAHVEFLRG+ANPLGIK SDKMDP  LVKLI+ILNP NK 
Sbjct: 338 LHYDCSAHFLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMDPSALVKLIEILNPQNKA 397

Query: 446 GRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRA 505
           GRIT+I RMGAENMRVKLPHLIR VR+AGQIVTWVSDPMHGNTIKAPCGLKTR FD+IRA
Sbjct: 398 GRITIITRMGAENMRVKLPHLIRAVRRAGQIVTWVSDPMHGNTIKAPCGLKTRPFDSIRA 457

Query: 506 ELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLNASQS 563
           E+RAFFD+HDQEGS+PGGVHLEMTGQNVTEC+GGSRT+T++DLSSRYHTHCDPRLNASQS
Sbjct: 458 EVRAFFDVHDQEGSHPGGVHLEMTGQNVTECIGGSRTVTFDDLSSRYHTHCDPRLNASQS 517

BLAST of Cp4.1LG04g07100 vs. TrEMBL
Match: A0A0A0K233_CUCSA (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_7G064020 PE=3 SV=1)

HSP 1 Score: 949.5 bits (2453), Expect = 1.9e-273
Identity = 472/513 (92.01%), Positives = 490/513 (95.52%), Query Frame = 1

Query: 58  MALPATSPLSKSLPKSHFPIT-RPPPFTTATATARSLKPISAIHAADPSKSSKPSILVPT 117
           MALP T+PL KSLP   FPIT + P     + +AR  KPISAIHAADPS+SSK SI VP 
Sbjct: 1   MALPTTTPLPKSLPHPLFPITSKTPHHRRLSPSARFSKPISAIHAADPSRSSKSSIQVPM 60

Query: 118 KWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFL 177
           KWTLDSWKSK+ALQLPEYPDQA LESVL+TLESFPPIVFAGEARSLE+RLAQAAVGKAFL
Sbjct: 61  KWTLDSWKSKRALQLPEYPDQAALESVLRTLESFPPIVFAGEARSLEDRLAQAAVGKAFL 120

Query: 178 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 237
           LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE
Sbjct: 121 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 180

Query: 238 KDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 297
           KDGVKLPSYRGDNINGDSFD++SR+PDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT
Sbjct: 181 KDGVKLPSYRGDNINGDSFDKQSRIPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 240

Query: 298 QWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQ 357
           QWNLDFTEHSEQGDRYRELAHRVDEALGFM+A+GLTVDHPIMTSTEFWTSHECLLLPYEQ
Sbjct: 241 QWNLDFTEHSEQGDRYRELAHRVDEALGFMAASGLTVDHPIMTSTEFWTSHECLLLPYEQ 300

Query: 358 ALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL 417
           ALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL
Sbjct: 301 ALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL 360

Query: 418 IDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGL 477
           I+ILNPTNKPGRI VIVRMGAENMRVKLPHLIREVR+AGQIVTWVSDPMHGNTIKAPCGL
Sbjct: 361 IEILNPTNKPGRIVVIVRMGAENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGL 420

Query: 478 KTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 537
           KTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH
Sbjct: 421 KTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 480

Query: 538 CDPRLNASQSLELAFIIAERLRRRRLRAGQAPG 570
           CDPRLNASQSLELAFIIAERLRRRRL AGQ  G
Sbjct: 481 CDPRLNASQSLELAFIIAERLRRRRLVAGQTLG 513

BLAST of Cp4.1LG04g07100 vs. TrEMBL
Match: A0A061E2A9_THECC (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Theobroma cacao GN=TCM_007435 PE=3 SV=1)

HSP 1 Score: 865.5 bits (2235), Expect = 3.5e-248
Identity = 430/502 (85.66%), Positives = 457/502 (91.04%), Query Frame = 1

Query: 67  SKSLPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSIL------VPTKWTL 126
           S SL  S  P+     +   +     LKPI+AIH+ADP+KS+K +        +P KW+L
Sbjct: 5   SPSLLSSKSPLLPRHHYYCHSLLPAKLKPITAIHSADPAKSTKSTAASTTSPSIPIKWSL 64

Query: 127 DSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGG 186
           DSWKSKKALQLPEYPDQ  L SVL+TL SFPPIVFAGEARSLEE+L QAA G AFLLQGG
Sbjct: 65  DSWKSKKALQLPEYPDQNDLVSVLQTLSSFPPIVFAGEARSLEEKLGQAAFGNAFLLQGG 124

Query: 187 DCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEKDGV 246
           DCAESFKEFNANNIRDTFRVLLQM VVLMFGGQMPVIKVGRMAGQFAKPRSDP+EEK+GV
Sbjct: 125 DCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQMPVIKVGRMAGQFAKPRSDPFEEKNGV 184

Query: 247 KLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNL 306
           KLPSYRGDNINGDSFDEK+RVPDP RM RAYCQSVATLNLLRAFATGGYAAMQRV+QWNL
Sbjct: 185 KLPSYRGDNINGDSFDEKARVPDPHRMIRAYCQSVATLNLLRAFATGGYAAMQRVSQWNL 244

Query: 307 DFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALTR 366
           DFTE+SEQGDRYRELAHRVDEA+GFM+AAGLTV HPIMT+TEFWTSHECLLLPYEQALTR
Sbjct: 245 DFTENSEQGDRYRELAHRVDEAMGFMAAAGLTVGHPIMTTTEFWTSHECLLLPYEQALTR 304

Query: 367 EDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDIL 426
           EDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIK SDKMDP ELV+LI+IL
Sbjct: 305 EDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMDPNELVRLIEIL 364

Query: 427 NPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTRS 486
           NP NKPGRITVIVRMGAENMRVKLPHLIR VR+AGQIVTWVSDPMHGNT KAPCGLKTRS
Sbjct: 365 NPQNKPGRITVIVRMGAENMRVKLPHLIRAVRRAGQIVTWVSDPMHGNTTKAPCGLKTRS 424

Query: 487 FDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR 546
           FDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR
Sbjct: 425 FDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR 484

Query: 547 LNASQSLELAFIIAERLRRRRL 563
           LNASQSLELAFII ERLR+RRL
Sbjct: 485 LNASQSLELAFIIGERLRKRRL 506

BLAST of Cp4.1LG04g07100 vs. TrEMBL
Match: A0A059BBF4_EUCGR (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Eucalyptus grandis GN=EUGRSUZ_G00854 PE=3 SV=1)

HSP 1 Score: 859.4 bits (2219), Expect = 2.5e-246
Identity = 419/469 (89.34%), Positives = 440/469 (93.82%), Query Frame = 1

Query: 94  KPISAIHAADPSKSSKPSILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPI 153
           +PISA+HA DP   S  S     KW ++SWK+K ALQLPEYPD   L SVL+TLESFPPI
Sbjct: 33  RPISAVHATDPPPLSSSSAAT-AKWNIESWKNKNALQLPEYPDPDALGSVLRTLESFPPI 92

Query: 154 VFAGEARSLEERLAQAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQ 213
           VFAGEARSLEERL QAA+G AFLLQGGDCAESFKEFNANNIRDTFRV+LQMSVVLMFGGQ
Sbjct: 93  VFAGEARSLEERLGQAAIGNAFLLQGGDCAESFKEFNANNIRDTFRVILQMSVVLMFGGQ 152

Query: 214 MPVIKVGRMAGQFAKPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQ 273
           MPVIKVGRMAGQFAKPRS+ +EEKDGVKLPSYRGDNINGD+FDE SR+PDP RM RAY Q
Sbjct: 153 MPVIKVGRMAGQFAKPRSESFEEKDGVKLPSYRGDNINGDAFDEASRIPDPQRMIRAYTQ 212

Query: 274 SVATLNLLRAFATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTV 333
           SVATLNLLRAFATGGYAAMQRVT WNLDFTEHSEQGDRYRELAHRVDEALGFM+ AGLTV
Sbjct: 213 SVATLNLLRAFATGGYAAMQRVTHWNLDFTEHSEQGDRYRELAHRVDEALGFMATAGLTV 272

Query: 334 DHPIMTSTEFWTSHECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFL 393
           DHPIMT+TEFWTSHECLLLPYEQALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFL
Sbjct: 273 DHPIMTTTEFWTSHECLLLPYEQALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFL 332

Query: 394 RGVANPLGIKASDKMDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQ 453
           RGV+NPLGIKASDKMDP ELV+LIDILNP NKPGRITVIVRMGAENMRVKLPHLIR VRQ
Sbjct: 333 RGVSNPLGIKASDKMDPNELVRLIDILNPRNKPGRITVIVRMGAENMRVKLPHLIRAVRQ 392

Query: 454 AGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQN 513
           AGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQN
Sbjct: 393 AGQIVTWVSDPMHGNTIKAPCGLKTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQN 452

Query: 514 VTECVGGSRTITYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRL 563
           VTECVGGSRT+TYNDLSSRYHTHCDPRLNASQSLELAFIIAERLR+RRL
Sbjct: 453 VTECVGGSRTVTYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRKRRL 500

BLAST of Cp4.1LG04g07100 vs. TrEMBL
Match: A0A0B0PUE5_GOSAR (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Gossypium arboreum GN=F383_15354 PE=3 SV=1)

HSP 1 Score: 858.2 bits (2216), Expect = 5.6e-246
Identity = 426/518 (82.24%), Positives = 462/518 (89.19%), Query Frame = 1

Query: 58  MALPATSPLSKS---LPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSIL- 117
           MAL + S LS     LP+ H     PP          SLKPI+A+H+ADP+KS+K +   
Sbjct: 1   MALTSPSFLSSKSPLLPRRHL---LPP----------SLKPITAVHSADPTKSTKSAAAA 60

Query: 118 ---------VPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEE 177
                    +PT+WTL+SWKSKKALQLPEYPDQ  L SVL+TL +FPP+VFAGEARSLEE
Sbjct: 61  AAASTSSPSIPTQWTLESWKSKKALQLPEYPDQNDLVSVLETLSTFPPVVFAGEARSLEE 120

Query: 178 RLAQAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAG 237
           +L  AA+G AFLLQGGDCAESFKEF+ANNIRDTFRVLLQM VVLMFGGQMP+IKVGRMAG
Sbjct: 121 KLGHAALGNAFLLQGGDCAESFKEFDANNIRDTFRVLLQMGVVLMFGGQMPIIKVGRMAG 180

Query: 238 QFAKPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAF 297
           QFAKPRSDP+EEKDGVKLPSYRGDNINGDSFDEK RVPDP RM RAYCQSVATLNLLRAF
Sbjct: 181 QFAKPRSDPFEEKDGVKLPSYRGDNINGDSFDEKERVPDPHRMIRAYCQSVATLNLLRAF 240

Query: 298 ATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFW 357
           ATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEA+GFM+AAGL+V HP+MT+T+FW
Sbjct: 241 ATGGYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEAMGFMAAAGLSVGHPVMTTTDFW 300

Query: 358 TSHECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKA 417
           TSHECLLLPYEQALTREDST+G+YYDCSAHMLWVG+RTRQLDGAHVEFLRGVANPLGIK 
Sbjct: 301 TSHECLLLPYEQALTREDSTTGLYYDCSAHMLWVGDRTRQLDGAHVEFLRGVANPLGIKV 360

Query: 418 SDKMDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDP 477
           SDKMDP ELV+LI+ILNP NKPGRITVIVRMGAENMRVKLPHLIR VR AGQ+VTWVSDP
Sbjct: 361 SDKMDPSELVRLIEILNPRNKPGRITVIVRMGAENMRVKLPHLIRAVRGAGQVVTWVSDP 420

Query: 478 MHGNTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTI 537
           MHGNTIKAPCGLKTRSFDAIRAE+RAFFD+HD EGS+PGG+HLEMTGQNVTEC+GGSRTI
Sbjct: 421 MHGNTIKAPCGLKTRSFDAIRAEVRAFFDVHDDEGSHPGGIHLEMTGQNVTECLGGSRTI 480

Query: 538 TYNDLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRL 563
           TYNDL SRYHTHCDPRLNASQSLELAFIIAERLRRRRL
Sbjct: 481 TYNDLGSRYHTHCDPRLNASQSLELAFIIAERLRRRRL 505

BLAST of Cp4.1LG04g07100 vs. TrEMBL
Match: M5VX27_PRUPE (Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Prunus persica GN=PRUPE_ppa004348mg PE=3 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 1.6e-245
Identity = 432/519 (83.24%), Positives = 460/519 (88.63%), Query Frame = 1

Query: 58  MALPATSPLS-KSL----PKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSS---K 117
           MA  A S LS KSL    P  H P     P         + KPISA+HAADPSK S   +
Sbjct: 1   MAFTAYSALSFKSLLHPEPIKHHPSLPQLP---------THKPISAVHAADPSKPSGTPQ 60

Query: 118 PSILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQA 177
           P+     KW+L SWK+KKALQLPEYPDQ  L SVL TLE+FPPIVFAGEARSLEE+L QA
Sbjct: 61  PTFPTQPKWSLGSWKAKKALQLPEYPDQEELSSVLGTLETFPPIVFAGEARSLEEKLGQA 120

Query: 178 AVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKP 237
           A+G+AFLLQGGDCAESFKEFNANNIRDTFRVLLQM VVLMFGGQMPVIKVGRMAGQFAKP
Sbjct: 121 AMGQAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQMPVIKVGRMAGQFAKP 180

Query: 238 RSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGY 297
           RSDP+EEK+GVKLPSYRGDN+NGD+FDEK R+PDP RM  AYCQSVATLNLLRAF+TGGY
Sbjct: 181 RSDPFEEKNGVKLPSYRGDNVNGDAFDEKERLPDPHRMVSAYCQSVATLNLLRAFSTGGY 240

Query: 298 AAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHEC 357
           AAMQRVTQWNLDFTEHSEQGDRYRELA+RVDE LGFM+  GLTVDHPIMT+TEFWTSHEC
Sbjct: 241 AAMQRVTQWNLDFTEHSEQGDRYRELANRVDEVLGFMAVTGLTVDHPIMTTTEFWTSHEC 300

Query: 358 LLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMD 417
           LLLPYEQALTREDSTSG+Y+DCSAHMLWVGERTRQLDGAHVEFLRGV+NPLGIK SDKMD
Sbjct: 301 LLLPYEQALTREDSTSGLYFDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDKMD 360

Query: 418 PKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNT 477
           P ELV+LI+ILNP NKPGR+TVIVRMGAENMRVKLPHLIR VR AGQIVTWVSDPMHGNT
Sbjct: 361 PNELVRLIEILNPKNKPGRVTVIVRMGAENMRVKLPHLIRAVRGAGQIVTWVSDPMHGNT 420

Query: 478 IKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDL 537
           IKAPCGLKTRSFDAIRAELRAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDL
Sbjct: 421 IKAPCGLKTRSFDAIRAELRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDL 480

Query: 538 SSRYHTHCDPRLNASQSLELAFIIAERLRRRRLRAGQAP 569
           SSRYHTHCDPRLNASQSLELAFIIAERLR+RRL +   P
Sbjct: 481 SSRYHTHCDPRLNASQSLELAFIIAERLRKRRLGSRHFP 510

BLAST of Cp4.1LG04g07100 vs. TAIR10
Match: AT4G33510.1 (AT4G33510.1 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase)

HSP 1 Score: 841.6 bits (2173), Expect = 2.8e-244
Identity = 422/513 (82.26%), Positives = 458/513 (89.28%), Query Frame = 1

Query: 58  MALPATSPLSKS--LPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSILVP 117
           + L A+SPL+    LP  H P  RP  F+          P+ A+H+ DP KS++ S    
Sbjct: 2   VTLNASSPLTTKSFLPYRHAP-RRPISFS----------PVFAVHSTDPKKSTQ-SASAS 61

Query: 118 TKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAF 177
            KW+L+SWKSKKALQLP+YPDQ  ++SVL+TL SFPPIVFAGEAR LE++L QAA+G+AF
Sbjct: 62  VKWSLESWKSKKALQLPDYPDQKDVDSVLQTLSSFPPIVFAGEARKLEDKLGQAAMGQAF 121

Query: 178 LLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYE 237
           +LQGGDCAESFKEFNANNIRDTFRVLLQM VVLMFGGQ+PVIKVGRMAGQFAKPRSDP+E
Sbjct: 122 MLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQLPVIKVGRMAGQFAKPRSDPFE 181

Query: 238 EKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRV 297
           EKDGVKLPSYRGDNINGD+FDEKSR+PDP RM RAY QSVATLNLLRAFATGGYAAMQRV
Sbjct: 182 EKDGVKLPSYRGDNINGDAFDEKSRIPDPHRMVRAYTQSVATLNLLRAFATGGYAAMQRV 241

Query: 298 TQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYE 357
           +QWNLDFT+HSEQGDRYRELA+RVDEALGFM AAGLT  HPIMT+TEFWTSHECLLLPYE
Sbjct: 242 SQWNLDFTQHSEQGDRYRELANRVDEALGFMGAAGLTSAHPIMTTTEFWTSHECLLLPYE 301

Query: 358 QALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVK 417
           QALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRG+ANPLGIK SDKM P ELVK
Sbjct: 302 QALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGIANPLGIKVSDKMVPSELVK 361

Query: 418 LIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCG 477
           LI+ILNP NKPGRITVIVRMGAENMRVKLP+LIR VR AGQIVTWVSDPMHGNTI AP G
Sbjct: 362 LIEILNPQNKPGRITVIVRMGAENMRVKLPNLIRAVRGAGQIVTWVSDPMHGNTIMAPGG 421

Query: 478 LKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT 537
           LKTRSFDAIRAELRAFFD+HDQEGS+PGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT
Sbjct: 422 LKTRSFDAIRAELRAFFDVHDQEGSFPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHT 481

Query: 538 HCDPRLNASQSLELAFIIAERLRRRRLRAGQAP 569
           HCDPRLNASQSLELAFIIAERLR+RRL +G  P
Sbjct: 482 HCDPRLNASQSLELAFIIAERLRKRRLGSGNLP 502

BLAST of Cp4.1LG04g07100 vs. TAIR10
Match: AT1G22410.1 (AT1G22410.1 Class-II DAHP synthetase family protein)

HSP 1 Score: 805.8 bits (2080), Expect = 1.7e-233
Identity = 394/510 (77.25%), Positives = 447/510 (87.65%), Query Frame = 1

Query: 63  TSPLSKSLPKSHFPITRPPPFTTATATARSLKPISAIHAA---DPSKSSKPS--ILVPTK 122
           +S ++   P     ++RP  F  +        P ++  +A    P+  +KP    +   K
Sbjct: 15  SSMINHRQPNFSSAVSRPTSFRISAVQTDPKTPAASSASAATTTPATLTKPVGVNVGKGK 74

Query: 123 WTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLL 182
           W  +SW++KKALQ P+YPD A LE+VL+T+E+FPPIVFAGEAR LEERL QAA+G+AFLL
Sbjct: 75  WAPESWRTKKALQQPDYPDLAALEAVLETIEAFPPIVFAGEARLLEERLGQAAMGEAFLL 134

Query: 183 QGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEK 242
           QGGDCAESFKEFNANNIRDTFR+LLQM  VLMFGGQ+PV+KVGRMAGQFAKPRSD +EEK
Sbjct: 135 QGGDCAESFKEFNANNIRDTFRILLQMGAVLMFGGQVPVVKVGRMAGQFAKPRSDSFEEK 194

Query: 243 DGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQ 302
           DGVKLPSYRGDNINGD+FD KSR+PDP RM RAYCQS ATLNLLRAFATGGYAAMQRVTQ
Sbjct: 195 DGVKLPSYRGDNINGDAFDSKSRIPDPQRMIRAYCQSAATLNLLRAFATGGYAAMQRVTQ 254

Query: 303 WNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQA 362
           WNLDFTE SEQGDRYRELA+RVDEALGFM AAGLT+DHPIM +T+FWTSHECLLLPYEQ+
Sbjct: 255 WNLDFTERSEQGDRYRELANRVDEALGFMHAAGLTLDHPIMQTTDFWTSHECLLLPYEQS 314

Query: 363 LTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLI 422
           LTR DSTSG+YYDCSAHM+WVGERTRQLDGAHVEFLRGVANPLGIK SDKMDPKELVKLI
Sbjct: 315 LTRLDSTSGLYYDCSAHMIWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMDPKELVKLI 374

Query: 423 DILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLK 482
           +ILN  NKPGRIT+I RMGAENMRVKLPHLIREVR+AGQIVTWVSDPMHGNTIKAPCGLK
Sbjct: 375 EILNADNKPGRITIITRMGAENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGLK 434

Query: 483 TRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHC 542
           TR FDAI AE+RAFFD+H+QEGS+PGG+HLEMTGQNVTEC+GGSRT+T++DL SRYHTHC
Sbjct: 435 TRPFDAILAEVRAFFDVHEQEGSHPGGIHLEMTGQNVTECIGGSRTVTFDDLGSRYHTHC 494

Query: 543 DPRLNASQSLELAFIIAERLRRRRLRAGQA 568
           DPRLNASQSLEL+FIIAERLR+RR+++ +A
Sbjct: 495 DPRLNASQSLELSFIIAERLRKRRIKSQKA 524

BLAST of Cp4.1LG04g07100 vs. TAIR10
Match: AT4G39980.1 (AT4G39980.1 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1)

HSP 1 Score: 796.2 bits (2055), Expect = 1.3e-230
Identity = 391/499 (78.36%), Positives = 444/499 (88.98%), Query Frame = 1

Query: 73  SHFPITRPPPFTTATAT---ARSLKPISAIHAADPSKSS---KPSILVPT----KWTLDS 132
           SH P  R   FT   A     +S+  ++A+HAA+P++++   K S+   +    KWT +S
Sbjct: 20  SHRPSNRQSSFTFHPAVNTKPKSVNLVTAVHAAEPARNAVSVKESVASSSSGALKWTPES 79

Query: 133 WKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGGDC 192
           WK KKALQLP+YP+   LESVLKT+E+FPPIVFAGEAR+LEERLA AAVGKAFLLQGGDC
Sbjct: 80  WKLKKALQLPDYPNANELESVLKTIEAFPPIVFAGEARNLEERLADAAVGKAFLLQGGDC 139

Query: 193 AESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEKDGVKL 252
           AESFKEFNA NIRDTFRVLLQMS+VL FGGQ+PVIKVGRMAGQFAKPRSD +EEKDGVKL
Sbjct: 140 AESFKEFNATNIRDTFRVLLQMSIVLTFGGQVPVIKVGRMAGQFAKPRSDAFEEKDGVKL 199

Query: 253 PSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNLDF 312
           PSY+GDNINGD+FDEKSR+PDP+RM RAY QS ATLNLLRAFATGGYAA+QRVTQWNLDF
Sbjct: 200 PSYKGDNINGDTFDEKSRIPDPNRMIRAYTQSAATLNLLRAFATGGYAAIQRVTQWNLDF 259

Query: 313 TEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALTRED 372
            E SEQ DRY+ELA+RVDEALGFMSA GL  DHP+MT+T+F+TSHECLLLPYEQ+LTR D
Sbjct: 260 VEQSEQADRYQELANRVDEALGFMSACGLGTDHPLMTTTDFYTSHECLLLPYEQSLTRLD 319

Query: 373 STSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDILNP 432
           STSG+YYDCSAHM+W GERTRQLDGAHVEFLRG+ANPLGIK S+KMDP ELVKL++ILNP
Sbjct: 320 STSGLYYDCSAHMVWCGERTRQLDGAHVEFLRGIANPLGIKVSNKMDPFELVKLVEILNP 379

Query: 433 TNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTRSFD 492
            NKPGRITVIVRMGAENMRVKLPHLIR VR++GQIVTWV DPMHGNTIKAPCGLKTR+FD
Sbjct: 380 NNKPGRITVIVRMGAENMRVKLPHLIRAVRRSGQIVTWVCDPMHGNTIKAPCGLKTRAFD 439

Query: 493 AIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPRLN 552
           +I AE+RAF D+H+QEGS+ GG+HLEMTGQNVTEC+GGSRT+TY+DLSSRYHTHCDPRLN
Sbjct: 440 SILAEVRAFLDVHEQEGSHAGGIHLEMTGQNVTECIGGSRTVTYDDLSSRYHTHCDPRLN 499

Query: 553 ASQSLELAFIIAERLRRRR 562
           ASQSLELAFI+AERLR+RR
Sbjct: 500 ASQSLELAFIVAERLRKRR 518

BLAST of Cp4.1LG04g07100 vs. NCBI nr
Match: gi|449438211|ref|XP_004136883.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 949.5 bits (2453), Expect = 2.7e-273
Identity = 472/513 (92.01%), Positives = 490/513 (95.52%), Query Frame = 1

Query: 58  MALPATSPLSKSLPKSHFPIT-RPPPFTTATATARSLKPISAIHAADPSKSSKPSILVPT 117
           MALP T+PL KSLP   FPIT + P     + +AR  KPISAIHAADPS+SSK SI VP 
Sbjct: 1   MALPTTTPLPKSLPHPLFPITSKTPHHRRLSPSARFSKPISAIHAADPSRSSKSSIQVPM 60

Query: 118 KWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFL 177
           KWTLDSWKSK+ALQLPEYPDQA LESVL+TLESFPPIVFAGEARSLE+RLAQAAVGKAFL
Sbjct: 61  KWTLDSWKSKRALQLPEYPDQAALESVLRTLESFPPIVFAGEARSLEDRLAQAAVGKAFL 120

Query: 178 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 237
           LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE
Sbjct: 121 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 180

Query: 238 KDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 297
           KDGVKLPSYRGDNINGDSFD++SR+PDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT
Sbjct: 181 KDGVKLPSYRGDNINGDSFDKQSRIPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 240

Query: 298 QWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQ 357
           QWNLDFTEHSEQGDRYRELAHRVDEALGFM+A+GLTVDHPIMTSTEFWTSHECLLLPYEQ
Sbjct: 241 QWNLDFTEHSEQGDRYRELAHRVDEALGFMAASGLTVDHPIMTSTEFWTSHECLLLPYEQ 300

Query: 358 ALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL 417
           ALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL
Sbjct: 301 ALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL 360

Query: 418 IDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGL 477
           I+ILNPTNKPGRI VIVRMGAENMRVKLPHLIREVR+AGQIVTWVSDPMHGNTIKAPCGL
Sbjct: 361 IEILNPTNKPGRIVVIVRMGAENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGL 420

Query: 478 KTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 537
           KTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH
Sbjct: 421 KTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 480

Query: 538 CDPRLNASQSLELAFIIAERLRRRRLRAGQAPG 570
           CDPRLNASQSLELAFIIAERLRRRRL AGQ  G
Sbjct: 481 CDPRLNASQSLELAFIIAERLRRRRLVAGQTLG 513

BLAST of Cp4.1LG04g07100 vs. NCBI nr
Match: gi|659110332|ref|XP_008455172.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cucumis melo])

HSP 1 Score: 949.5 bits (2453), Expect = 2.7e-273
Identity = 472/511 (92.37%), Positives = 490/511 (95.89%), Query Frame = 1

Query: 58  MALPATSPLSKSLPKSHFPIT-RPPPFTTATATARSLKPISAIHAADPSKSSKPSILVPT 117
           MALP T+PL KSLP   FPIT + P     + +ARS KPISAIHAADPS+SSK SI VPT
Sbjct: 1   MALPTTTPLPKSLPHPLFPITSKTPHHRRLSPSARSSKPISAIHAADPSRSSKSSIQVPT 60

Query: 118 KWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFL 177
           KWTLDSWKS +ALQLPEYPDQA LESVL+TLESFPPIVFAGEARSLE+RLAQAAVGKAFL
Sbjct: 61  KWTLDSWKSMRALQLPEYPDQAALESVLRTLESFPPIVFAGEARSLEDRLAQAAVGKAFL 120

Query: 178 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 237
           LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE
Sbjct: 121 LQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEE 180

Query: 238 KDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 297
           KDGVKLPSYRGDNINGDSFD++SR+PDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT
Sbjct: 181 KDGVKLPSYRGDNINGDSFDKQSRIPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVT 240

Query: 298 QWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQ 357
           QWNLDFTEHSEQGDRYRELAHRVDEALGFM+A+GLTVDHPIMTSTEFWTSHECLLLPYEQ
Sbjct: 241 QWNLDFTEHSEQGDRYRELAHRVDEALGFMAASGLTVDHPIMTSTEFWTSHECLLLPYEQ 300

Query: 358 ALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKL 417
           ALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDK+DPKELVKL
Sbjct: 301 ALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKIDPKELVKL 360

Query: 418 IDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGL 477
           IDILNP NKPGRI VIVRMGAENMRVKLPHLIREVR+AGQIVTWVSDPMHGNTIKAPCGL
Sbjct: 361 IDILNPRNKPGRIVVIVRMGAENMRVKLPHLIREVRRAGQIVTWVSDPMHGNTIKAPCGL 420

Query: 478 KTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 537
           KTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH
Sbjct: 421 KTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTH 480

Query: 538 CDPRLNASQSLELAFIIAERLRRRRLRAGQA 568
           CDPRLNASQSLELAFIIAERLRRRRL AGQA
Sbjct: 481 CDPRLNASQSLELAFIIAERLRRRRLVAGQA 511

BLAST of Cp4.1LG04g07100 vs. NCBI nr
Match: gi|1009174791|ref|XP_015868528.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 896.3 bits (2315), Expect = 2.7e-257
Identity = 448/521 (85.99%), Positives = 474/521 (90.98%), Query Frame = 1

Query: 58  MALPATSPLS-KSL--PKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPS--- 117
           MA  AT+P S KSL  PK  F  ++P         +   KPISA+HAADPSKSS  S   
Sbjct: 1   MAFAATTPFSSKSLLNPKPIFSTSKPHHNPPRRCHSLPHKPISAVHAADPSKSSNSSPPP 60

Query: 118 ----ILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLA 177
                 +P KWTL+SWK+KKALQLPEYPD+  L+SVLKTLE+FPPIVFAGEARSLEE+L 
Sbjct: 61  PPSTASIPLKWTLESWKTKKALQLPEYPDKVALDSVLKTLETFPPIVFAGEARSLEEKLG 120

Query: 178 QAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFA 237
           QAA+GKAFLLQGGDCAESFKEFNANNIRDTFRVLLQM VVLMFGGQMPVIKVGRMAGQFA
Sbjct: 121 QAAMGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQMPVIKVGRMAGQFA 180

Query: 238 KPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATG 297
           KPRSDP+EEKDGVKLPSYRGDNINGD+FDEKSRVPDPDRM RAYCQ+VATLNLLRAFATG
Sbjct: 181 KPRSDPFEEKDGVKLPSYRGDNINGDAFDEKSRVPDPDRMVRAYCQAVATLNLLRAFATG 240

Query: 298 GYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH 357
           GYAAMQRVT WNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH
Sbjct: 241 GYAAMQRVTHWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH 300

Query: 358 ECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDK 417
           ECLLLPYEQALTREDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGV+NPLGIK SDK
Sbjct: 301 ECLLLPYEQALTREDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDK 360

Query: 418 MDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHG 477
           MDP ELV+LIDILNP NKPGR+TVIVRMGAENMRVKLPHLIR VR AGQIVTWVSDPMHG
Sbjct: 361 MDPSELVRLIDILNPKNKPGRVTVIVRMGAENMRVKLPHLIRSVRGAGQIVTWVSDPMHG 420

Query: 478 NTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN 537
           NTIKAPCGLKTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN
Sbjct: 421 NTIKAPCGLKTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN 480

Query: 538 DLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRLRAGQAP 569
           DL SRYHTHCDPRLNASQSLELAFIIAERLR+RRL +G++P
Sbjct: 481 DLGSRYHTHCDPRLNASQSLELAFIIAERLRKRRLASGKSP 521

BLAST of Cp4.1LG04g07100 vs. NCBI nr
Match: gi|1009172401|ref|XP_015867251.1| (PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 893.6 bits (2308), Expect = 1.7e-256
Identity = 447/521 (85.80%), Positives = 473/521 (90.79%), Query Frame = 1

Query: 58  MALPATSPLS-KSL--PKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPS--- 117
           MA  AT+P S KSL  PK  F  ++P         +   KPISA+HAADPSKSS  S   
Sbjct: 1   MAFAATTPFSSKSLLNPKPIFSTSKPHHNPPRRCHSLPHKPISAVHAADPSKSSNSSPPP 60

Query: 118 ----ILVPTKWTLDSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLA 177
                 +P KWTL+SWK+KKALQLPEYPD+  L+SVLKTLE+FPPIVFAGEARSLEE+L 
Sbjct: 61  PPSTASIPLKWTLESWKTKKALQLPEYPDKVALDSVLKTLETFPPIVFAGEARSLEEKLG 120

Query: 178 QAAVGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFA 237
           QAA+GKAFLLQGGDCAESFKEFNANNIRDTFRVLLQM VVLMFGGQMPVIKVGRMAGQFA
Sbjct: 121 QAAMGKAFLLQGGDCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQMPVIKVGRMAGQFA 180

Query: 238 KPRSDPYEEKDGVKLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATG 297
           KPRSDP+EEKDGVKLPSYRGDNINGD+FDEKSRVPDPDRM RAYCQ+VATLNLLRAFATG
Sbjct: 181 KPRSDPFEEKDGVKLPSYRGDNINGDAFDEKSRVPDPDRMVRAYCQAVATLNLLRAFATG 240

Query: 298 GYAAMQRVTQWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH 357
           GYAAMQRVT WNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH
Sbjct: 241 GYAAMQRVTHWNLDFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSH 300

Query: 358 ECLLLPYEQALTREDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDK 417
           ECLLLPYEQALTREDS SG+YYDCSAHMLWVGERTRQLDGAHVEFLRGV+NPLGIK SDK
Sbjct: 301 ECLLLPYEQALTREDSISGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVSNPLGIKVSDK 360

Query: 418 MDPKELVKLIDILNPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHG 477
           MDP ELV+LIDILNP NKPGR+TVIVRMGAENMRVKLPHLIR VR AGQIVTWVSDPMHG
Sbjct: 361 MDPSELVRLIDILNPKNKPGRVTVIVRMGAENMRVKLPHLIRAVRGAGQIVTWVSDPMHG 420

Query: 478 NTIKAPCGLKTRSFDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN 537
           NTIKAPCGLKTRSFDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN
Sbjct: 421 NTIKAPCGLKTRSFDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYN 480

Query: 538 DLSSRYHTHCDPRLNASQSLELAFIIAERLRRRRLRAGQAP 569
           DL SRYHTHCDPRLNASQSLELAFIIAERLR+RRL +G++P
Sbjct: 481 DLGSRYHTHCDPRLNASQSLELAFIIAERLRKRRLASGKSP 521

BLAST of Cp4.1LG04g07100 vs. NCBI nr
Match: gi|590688310|ref|XP_007042911.1| (3-deoxy-d-arabino-heptulosonate 7-phosphate synthase isoform 1 [Theobroma cacao])

HSP 1 Score: 865.5 bits (2235), Expect = 5.1e-248
Identity = 430/502 (85.66%), Positives = 457/502 (91.04%), Query Frame = 1

Query: 67  SKSLPKSHFPITRPPPFTTATATARSLKPISAIHAADPSKSSKPSIL------VPTKWTL 126
           S SL  S  P+     +   +     LKPI+AIH+ADP+KS+K +        +P KW+L
Sbjct: 5   SPSLLSSKSPLLPRHHYYCHSLLPAKLKPITAIHSADPAKSTKSTAASTTSPSIPIKWSL 64

Query: 127 DSWKSKKALQLPEYPDQAVLESVLKTLESFPPIVFAGEARSLEERLAQAAVGKAFLLQGG 186
           DSWKSKKALQLPEYPDQ  L SVL+TL SFPPIVFAGEARSLEE+L QAA G AFLLQGG
Sbjct: 65  DSWKSKKALQLPEYPDQNDLVSVLQTLSSFPPIVFAGEARSLEEKLGQAAFGNAFLLQGG 124

Query: 187 DCAESFKEFNANNIRDTFRVLLQMSVVLMFGGQMPVIKVGRMAGQFAKPRSDPYEEKDGV 246
           DCAESFKEFNANNIRDTFRVLLQM VVLMFGGQMPVIKVGRMAGQFAKPRSDP+EEK+GV
Sbjct: 125 DCAESFKEFNANNIRDTFRVLLQMGVVLMFGGQMPVIKVGRMAGQFAKPRSDPFEEKNGV 184

Query: 247 KLPSYRGDNINGDSFDEKSRVPDPDRMNRAYCQSVATLNLLRAFATGGYAAMQRVTQWNL 306
           KLPSYRGDNINGDSFDEK+RVPDP RM RAYCQSVATLNLLRAFATGGYAAMQRV+QWNL
Sbjct: 185 KLPSYRGDNINGDSFDEKARVPDPHRMIRAYCQSVATLNLLRAFATGGYAAMQRVSQWNL 244

Query: 307 DFTEHSEQGDRYRELAHRVDEALGFMSAAGLTVDHPIMTSTEFWTSHECLLLPYEQALTR 366
           DFTE+SEQGDRYRELAHRVDEA+GFM+AAGLTV HPIMT+TEFWTSHECLLLPYEQALTR
Sbjct: 245 DFTENSEQGDRYRELAHRVDEAMGFMAAAGLTVGHPIMTTTEFWTSHECLLLPYEQALTR 304

Query: 367 EDSTSGIYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKASDKMDPKELVKLIDIL 426
           EDSTSG+YYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIK SDKMDP ELV+LI+IL
Sbjct: 305 EDSTSGLYYDCSAHMLWVGERTRQLDGAHVEFLRGVANPLGIKVSDKMDPNELVRLIEIL 364

Query: 427 NPTNKPGRITVIVRMGAENMRVKLPHLIREVRQAGQIVTWVSDPMHGNTIKAPCGLKTRS 486
           NP NKPGRITVIVRMGAENMRVKLPHLIR VR+AGQIVTWVSDPMHGNT KAPCGLKTRS
Sbjct: 365 NPQNKPGRITVIVRMGAENMRVKLPHLIRAVRRAGQIVTWVSDPMHGNTTKAPCGLKTRS 424

Query: 487 FDAIRAELRAFFDIHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR 546
           FDAIRAE+RAFFD+HDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR
Sbjct: 425 FDAIRAEVRAFFDVHDQEGSYPGGVHLEMTGQNVTECVGGSRTITYNDLSSRYHTHCDPR 484

Query: 547 LNASQSLELAFIIAERLRRRRL 563
           LNASQSLELAFII ERLR+RRL
Sbjct: 485 LNASQSLELAFIIGERLRKRRL 506

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
AROG_ARATH  4.9e-243  82.26         Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Arabidopsis thal...
AROF_SOLTU  4.3e-239  77.30         Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Solanum tuberosu...
AROG_ORYSJ  4.3e-239  79.84         Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Oryza sativa sub...
AROF_TOBAC  9.6e-239  78.54         Phospho-2-dehydro-3-deoxyheptonate aldolase 1, chloroplastic OS=Nicotiana tabacu...
AROG_SOLLC  1.2e-238  80.65         Phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic OS=Solanum lycopers...

Match Name        E-value   Identity (%)  Description
A0A0A0K233_CUCSA  1.9e-273  92.01         Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Cucumis sativus GN=Csa_7G064020 P...
A0A061E2A9_THECC  3.5e-248  85.66         Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Theobroma cacao GN=TCM_007435 PE=...
A0A059BBF4_EUCGR  2.5e-246  89.34         Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Eucalyptus grandis GN=EUGRSUZ_G00...
A0A0B0PUE5_GOSAR  5.6e-246  82.24         Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Gossypium arboreum GN=F383_15354 ...
M5VX27_PRUPE      1.6e-245  83.24         Phospho-2-dehydro-3-deoxyheptonate aldolase OS=Prunus persica GN=PRUPE_ppa004348...

Match Name   E-value   Identity (%)  Description
AT4G33510.1  2.8e-244  82.26         3-deoxy-d-arabino-heptulosonate 7-phosphate synthase
AT1G22410.1  1.7e-233  77.25         Class-II DAHP synthetase family protein
AT4G39980.1  1.3e-230  78.36         3-deoxy-D-arabino-heptulosonate 7-phosphate synthase 1

Match Name                         E-value   Identity (%)  Description
gi|449438211|ref|XP_004136883.1|   2.7e-273  92.01         PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu...
gi|659110332|ref|XP_008455172.1|   2.7e-273  92.37         PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Cu...
gi|1009174791|ref|XP_015868528.1|  2.7e-257  85.99         PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Zi...
gi|1009172401|ref|XP_015867251.1|  1.7e-256  85.80         PREDICTED: phospho-2-dehydro-3-deoxyheptonate aldolase 2, chloroplastic-like [Zi...
gi|590688310|ref|XP_007042911.1|   5.1e-248  85.66         3-deoxy-d-arabino-heptulosonate 7-phosphate synthase isoform 1 [Theobroma cacao]
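
The identity percentages in these summaries match the counts printed in each HSP header of the alignments above (for example, 419 identical positions out of 469 aligned gives 89.34%). A minimal Python sketch of that arithmetic follows. The helper name `parse_hsp_stats` and its returned keys are hypothetical, chosen for illustration only, and are not part of any BLAST toolkit or of this database.

```python
import re

# Hypothetical pattern for the "Identity = a/n (x%), Positives = b/n (y%)"
# header lines on this page. It accepts "Positives" as well as the
# variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed on the page, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 419/469 (89.34%), Positives = 440/469 (93.82%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported percentages is a quick consistency check when copying values from the alignments into a summary table.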
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0009073  aromatic amino acid family biosynthetic process
Vocabulary: Molecular Function
Term        Definition
GO:0003849  3-deoxy-7-phosphoheptulonate synthase activity
Vocabulary: INTERPRO
Term        Definition
IPR002480   DAHP_synth_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009094 L-phenylalanine biosynthetic process
biological_process GO:0000162 tryptophan biosynthetic process
biological_process GO:0006571 tyrosine biosynthetic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0009073 aromatic amino acid family biosynthetic process
biological_process GO:0009423 chorismate biosynthetic process
cellular_component GO:0009534 chloroplast thylakoid
cellular_component GO:0005575 cellular_component
molecular_function GO:0003849 3-deoxy-7-phosphoheptulonate synthase activity
molecular_function GO:0016829 lyase activity
molecular_function GO:0018580 nitronate monooxygenase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g07100.1  Cp4.1LG04g07100.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description            Source    Source Term    Source Description                                Alignment
IPR002480  DAHP synthetase, class II  PANTHER   PTHR21337      PHOSPHO-2-DEHYDRO-3-DEOXYHEPTONATE ALDOLASE 1, 2  coord: 60..563; score:
IPR002480  DAHP synthetase, class II  PFAM      PF01474        DAHP_synth_2                                      coord: 118..554; score: 2.9E
IPR002480  DAHP synthetase, class II  TIGRFAMs  TIGR01358      TIGR01358                                         coord: 118..560; score: 1.8E
None       No IPR available           PANTHER   PTHR21337:SF6  CLASS-II DAHP SYNTHETASE FAMILY PROTEIN-RELATED   coord: 60..563; score:
None       No IPR available           unknown   SSF51569       Aldolase                                          coord: 109..558; score: 6.44E