Cp4.1LG01g09840 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g09840
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Description2-dehydro-3-deoxyphosphogluconate aldolase
LocationCp4.1LG01 : 7008055 .. 7018078 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTTCATGTTGTAATCGGAATGAAGGGAAGAGCGTTTCAGAAGAACCCTTTTCTTTATTGAGTTTGAATTTTAAGTGGCTTATCGAGATAAGAGAATGTCAATGAGAAAAGGGTAATTTGTAGATTTATATTTGTGTTTGTGGAAGTGAAATTCTGGAGTGTTTCTTTGGAAGCTTATAAGTGGTTTCTTTTTTTCGTGATGTGTTGGTGAAAATTGAGGTGCTGAGTTTTTGTATTTTAAGTGTTTGATGAAATGCCTGAGTGGTACAATAATGTTCTATCTGTTTATTTAGGCTGATAGGTTCATTGACCTCATGAATCTCACGACAATTCATTTTCGATTTCTTGCCAAACGGAATTTGGTTTTGTACCCAAGTAAGTATGCTTTTGGTTCCCAACTTAGATTCTGGAGGTCGGGGGCAGAAGGCGATATCGTGTCTTTTAGGACAGAAGATTTTCGTCATGACTATCTATTTGGATCAAACGTGATTTCCACGCGTGGCCACCTTGGGCAGGCGCTTTCACTGTTCTACTCTAGACAGCCTCATTCCCTCCAGACCTATGCCTATCTGTTCCATGCTTGTGCACGCCTCCGCTGCCTCCGGGAAGGTGTGGAACTACACCGTTACATGATGTCCCTGGATCCCATGGGCTCATTTGATCTCTTTGTTACCAATCACCTTATCAACATGTACTGTAAATGTGGTCATTTAGACTATGCCTACCAATTATTCAATGAGATGCCTAGGAGAAACCTTGTCTCTTGGACTGTGCTTATCTCAGGACTTTCTCAGTATGGCCATGTGGATGAGTGCTTCCTTATATTTTCGAGAATGTTGGTAGATCACAGGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCGTTTGGTGACCATGATGGTGAGCGTGGGCGGCAGGTACATGGGTTTGCCTTGAAAAGGTCACTAGATGCTTTCGTTTATGTTGCAAATGCTCTTATTACCATGTATAGCAAGAATTACTTTAAAGGTGGTGCTTTTAATGACGGTAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAACTCGATGATTGCAGGGTTTTGTTTCCGAAAACATGGAAATCGAGCTGTCCATTTATTTATGCAAATGAATCATAAAGGAATTGGATTTGATCGTGCAACACTTTTAAGCACTTTGTCTTCCATAAGTCTCTGCAATTGGGATGAACTTGATCTCGGTTTGGGCTTTTGTCGTGAATTACACTGTCAAGCATTAAAAACTGCTTTCACATCAGAAGTTGAAATAATTACTGCATTGGTAAAAACTTATGCTGAACTTGGAGGGGACATTACGGATAGTTATAGGCTTTTTATTGAAGCAGGATATAATCGGGATATAGTTTTATGGACCAGCATCATGACAGCTTTTGTCGACCATGACCCTGGGAAAACACTTTCCCTTTTTCGTCAGTTCCGACAAGAAGGCTTAACTCCAGATGGACACACTTTTTCGATTGTATTAAAGGCTTGTGCTGGATTCCTAACAGAGAAGCATGCTTCAACATATCATTCACTGCTAATTAAATCTATGTCTGAGGATGACACTGTCCTTAACAATGCCTTGATTCATGCTTATGGGAGGTGTGGTTCAATTACTTCATCCAAGAAAGTATTCAATCAAATGAAACATCACGATTTGGTTTCTTGGAACACAATGATGAAGGTCTATGCTGTCCATGGCCAAGCTGAGATTGCTTTGCAGCTTTTTTCAAAGATGACTGTGCCACCTGATTCTACTACATTTGTCTCTCTTCTTTCAGCATGCAGCCATGCAGGGCTCGTGGAAGAAGGGACCAACCTTTTTAATTCAATTGCAAATTATGGACTTGTTTGCCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGATCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTATAGAACCTGATTATGTTATTTGGAGTTCATTCCTGGGATCATGTAAAAAGCATGGCGCAACACAATTGGCCAAATTAGCATCTGATAAATTGAAGGAGTTAGATCCTAGCAATTCCTTAGCTTATGTGCAAATGTCAAATCTATATTGCTTAAGTGGTAGCTTTTATGAAGCAGACTTAATTAGAATGGAAATGAAAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTACATGAGTTTGCATCTGGAGGTCGCCATCATCCAGAGAGGGAGGTAATATGCAATGAACTCGAAGAGCTCATTGGGAGGTTAAAGGAAATTGGTTATGTGCCCGAGACAAGCTTAGCGATCCATGATGTGGAACAAGAGCAAAAAGAGGAGCAGCTATATCATCATAGCGAGAAGCTGGCTTTGGTTTTTTCTGTAATGAATGATAACAACTTGGGTTGTGTCAGTACACCTATTAGGATTATGAAAAACATCCGAATTTGTGTAGATTGTCATAATTTCATGAAGTTAGCTTCAAGGCTACTTCAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCATGGCTGGTTTATGCTCGTGCAATGATTACTGGTAGTCAATAGGCGTCAAACATTCAAATACCCAAGTATCCATCTGCATTCATTCATAATTTTTTACCTCCAATATGAATAGGATAAAGTTGTGTTCTACATTGACACTTGTATCAATAATCTCTCAGTTCCTTCAACAACTGAGCAGCAAACTTTAGGAGGTGAAGATGTGGTCGGTCCTGGTTCCTACTTCTTGTTGAAGTATGGTACCACAGTTTCTCAAGTACATACAAAAATTTGGTTCCATATGAAGGTTGTAGCCACAGTACCGAGTTGAACTGTAAGGGATACTTTAAAGCAGGTAATGGGTAGACTATTTGTTATCTGCAGGATAACATGCCTTTCTTTAGGTCAACCATGATGATTTACTCCTCCATGAAGGATAAGCAGTCATTTTCTTATTCAAGTATTGATCCTTGTCGAACATCAGAAAAGAGAAAAAGTCAAACCCCTGCAGCTTGTTATCATTTTTTTACGAAGAAAAAAGAGAAAAGAAAGAATAGAAAAGAACAAAAAAAACCAAGACAAGATGGATACAGGAAAGACTACCAATCATTATGCCTTTACAAATCTGCAACGAGAAGCTTCTCCATTTGAGTGAGTATAAATGACGAGACTACCAATCATTATGCACCTTCTAATTCATATTGTTTTTGTAATCATTGTCTTTATATTAACCAACTAGATTTCATCAGACCCTCGTTCTTTTGTCTTTAAGCAGACCTCCTCAAGTTTAGTTACTAATGTTTCTCGAATTCATAACATTTATGTAATTCCTTGAATTTCTTTTTGTTGTTGGCATTATTATTTTCTCCTGTGGAAAAAAAAAAAGATATATATGAATAAAATACTAGAAAAGAAGAAACGAAGGCAGTGTACTGCCAAATCTATCAGGCAGTGAAGCTATAGAAGCGTTGCCCCAATTGGTCTTCATTGGCTTGAAAAAATTTCAGTACAGTTATCTAAGAAACATGATTCAGAGGTGAGAAATGAGTAAGATATACTGAACCTTGAGCCTCTCAACTTTCTTTTCAAAGATTCAATCGTTGTGAACAATCCAAGTCCTCACAAGAGGCAACGTAATAGTGGCTTACTTACCAGACTCCCCTCATCTTCTGTCATGGAAAATAAGATACATGAACCTGTGGAAATATTGAAGAACATCTGGAAAAGAAGCCAAGATCATTGAACAAAAATCTCCACTTGAAGGGGGCAACAGAAATTTAGAACAACTTTTTGCTGAAAATTGTTTGCAAATTTCTGTAAGTTTGACATGATGTTAGATAAGGTCATGTACATATTTTTGCATAATTAGTTGTCCATAATCATTTTCACATGGCCCTTTGAATTTTGTACTTTATTTGCATTGTTTATTGTCTGATCGCTTTTTTCTGAAACAGTATTTTCTCTATTGTAATCCTCTCGTGCTACTTCACATTATTCTCATGGGCTGCATTCCTTGTGCCTAAATTCTTTGCTGGTTTCCATACGAGCATCTTTAGTTCAATCAATAAAAATGGATTGCAGAATTTTGTCCTCTGGTCCATTTCTCTTGACTTATAATTGTCATATCAGCTTTCATCTTACAAAAAACTTATTTGCTGTTAGTATTTCGTGTATCTTATTGTTCATCACCTTTCTTTTCCATTAACATTATAAATTTTTTCACTATGGTAGATATTCCCTTGAATCCACGACTGCTGAAGTGACATCCATCATCATCAGTGTATCTGATTGGCTGGAATAAAATACTTCTCTAACAACCAGAAGACTATATTTGAAGTACATGTTGGTTTCTAGGAGAATCAGCATGTGGTCTTTGCCTGCTATTGGGTGTTGGAACCCTTTCGCATTTGCTCGCTTTAGAGTTTCTTGTGCTTCTAGTCAGCTACCAGTTTCTCCCAAAGACAAGACCCTGAGAACAATTTACAATTCTGGAGTCATTGCTTGCCTTCGCGCTAGCAGGTAACCCATTTTTATGTAACATTTTCTGCATTTTAGAATATCGTGGTGCTGGATTCTCTTAAAACTCTCGGCTTCACAAGATCGAACTATTGGTTCTTGATGTCCAGCAGGTAAAGCCAATGTCCTCACTACTGTCAACGTATGTTTTACTGATTAGCTAATAAATGATAGCAAGGTCTGTTATCTATATAAACCATTTCCGTGATTATATTACTAAAAAATGTTAGATAGCACCAACCTTATGGTAACATACATTTTCATTGTTTTTTTAGATGTTGAGTTTCAATAGTTTGCAAAAGTAGATGCTCTCCTCAATTTTGGAAATCATTGCAGATGTTTTTTTTTAAATAGTAAAGAAACGCATTGAAATTGAAATGTGTAGGCTTAAATGAAGGAGCTCCCAACCTAGCCATTTCGTATAAAGGATAAGTTACAAAAGTGTTAATGATGTGGATCCAAAGAAAGGCATCAAAGTTGGCCAAACTTCCTGCGTCCTTCAGAGACCCCTCAATCATGCCTTCATGTTTTTTTCTCTAATATAAAAAAGTCTAGGCTGTCGAAGTTTATTAGTGTCAAGAAATTCAATATCATTTGTCTTGAGTTATCTAGAGCCTTGATGTGCACCTTAATTCTTTTTTAAATTCACTATAGTGCGGAGCTGGCAATGAGTGCTGCTTGTGCTGCGCTAAATGGTGGAGTATCAGTTGTGAGTGCTCTTAATTTGAAGTTTTCTTTGTTCATAATTAATCTGTAAATGAAAACTGTATTTGAATTGGCGATATGAGTAATATTCACGGATAGATTCAAATATTCTTTTGATGTATGATTTCTTATTGCTGGTCCTTTCTAAATGATCAATAGAATTTAAAAAGGTTCAAGAAGTTGCCCTTAAAATGCTTTTCTAAGTGGCAAGTGCATGAAAAATTATAACCAAAGTGACTAGAGCATCAACTTTATTCAACGAAACAGTTTTTTTCCCTTTCCTTCAAATGCATTGATTGATGGTTTCTAACTTATTTCCTCTTCTGTCGCTGTGCAGCTCGAGATTGTTATGTCGACACCAGGTGTGCTTGAGGTTGTTGTAGAGAATATTTTCCTTGACATTGTAGTTGTTATCATTGTAAGTATTTGTCATACTTTTATTTATTTTTTCTGTGTTATAAAATTTGAAACAGGTTCTACAACAGTTGCTGCAAGACTATCCTACAAGAACACTGGGAGTAATTATTTTCTTTGCTGTTTGTCCGTGTGATGATTTCTTAATGTTTGTAATATTTCCTTGCCAGACTTGGGTCAAAATTTGTGAAGCCTTTTCCCTTGAACCTTTGATGAGTTAAGATGAACTATATCTTTAATCTAAATGCTTATTAAAATGCAACTTGTCAGCCTCAAAAAAGATGGATGGTTACCTGTTGAACTCTTCTGAAGATTCCCAAATATTGTGCAGGTTGGGACCGTTCTTAATGTTAAGGATGCAAAGAATGCTGTCGAAGCTGGAGCCAAGTTTCTAATGAGTCCCACTATGGTGAAGGTAAGCATACATTTACTTTTGCTTATATGCCGCAGCGACCTTTACATGCATATGTTGAAACCTCTTTATCCCTTTTTCATGTTTTCTTTTTCTATTGTGATTTTTCGTTTTTGACTCCAACTAAAAGAGACTATGAAGATATTTCTCCTTTCAGATTATTTTTACTGATTCACAGGGTATCATGGATGATCTTGAAGGGGAATTCTTGTATATACCTGGTGTGATGACCCCGACAGAAGTAATTGAATAACACTTTTGGCCCCTTTCTAATTCTCAAGGATTCCTCCTTTTTACATATGAAATCATTGCTTCTGCCACAAAGAGTTTTTGAATTTTTTAAATTACCTTTGTAAGTAGTTATGTGTATACTTTACAGGTACTGACTGCATATGAAGCTGGCGCTCAGATTGTTAAAGTAAGTAAATTTCTTGTCTGATTTAGGGAGATCATTGATTTCTTGGAGTTCGTGGTTTATCGCAAGTGAAACAGGATTAAAATTATACTCATATACATTAAAAGAAAAAAACACAAATGTCCTGCATAAACATACTTTTCATTTCTTGATGCTTTTCTGTACAACTAATTCCCATGGAAAAATCTGAATGGGAATTACTGGATTGGTATACAATTATGAAGAATAAGAACAGAAAACTTATGCAAGTAATTTATATTGCTGTGTTCTTGAGTGCTTACAATTTTTCTTTTGCTTGCAAGGTTTTGACATCAAAATATGGTTCTACCCTTATCATTTTTTATGGATTCATTTTGATTTTTAATCTTGAAAAGTAGGTTTATCCAGTTTCTGCATTAGGTGGTATCAAATATATATCAGCCCTCAAGAAGCCATTTCCTCATATCTCAATGGTTGCTTCTCAAGGCATAACTATTGGTCAGTACTCAACTTTAACTCTTTGCTCCTAAGCTCCAACTCATACTTGTTTTTGTATTAGCATCTAGTTAACTCTACCGATATGCATTCTTTATCTTTTTAGATTCATTGAATCAAATGGATTAAGCAAATATCTGATTGGTAGATTTATTTTTTAAAACTGTTTTTGAATTCTGTGATCCAAATTCTTGTGAAAAATAACATTTTTGTATTTTTAATGTATTCTAATTTTGAATTTGAAATTTAAAAGAATGAAATTAAATTGTGAAACGAAAAAACATAATAAATATAGTATTTCATGCTATAAACTCATTTCTTTTGTTAAATTAAAATATTTATAATATAATATTTTATAAATGATATGATATTGAATAATAATTATTCTAAATTTATAACATGAAATATGTTCTTAATAATAGTTGAAATTCAACTTAATGTTTAGTTGGTAACTACAACTATTTTGATGGTTTATAAATATTTCTTTAAGCATATCTTGAGTAATTAATTTAATTGTTATACTTTACCTAATTCTACTACCAAACATATATAAAAAACACTACAAACATGAAATAAAATTTTGATGGAATCTCTTGTTTTCAATTTTTTTTTTTTAAAAATTACGAACCATACGCACCCTTATGTTCAAACACTCTGACTAAGAAGATTGCAAAAGTAGCATAAGATGGCTGTTATATTATAACTTGTGTTCTTTCTTAAATTGGTTGAGGTTATCTTGAAACAAAGTAATTCAGAGCTTGTGTTCGTGTGTCGCGTTTTTCATTTTTCAGTTCACTGTTTCTGTTTTCTGGGCTAGTCAGTTCTTAGAAATTATTTTTACCAGTTTGATATAACTTGAGCTGATTCTGAGATTCATTTGTTACTTCTAAATTTGTACAGAATCTACTGGGGACTACATTAGAGAAGGAGCATCTTCGGTAGTTTTATCTGATGCAATATTTAACAAGGAGTTTATGAAGCAAAAGAACTTTGATGGAATATCTCAACTTTCTAAGTTGGCTGCTTCCCGGGCGATGGAAGCTTTAGAATGGTGAGGACTTCTAGTATCCATGCCTACTACGAAAAATATTTCAGATTAGTGCTTGTATCTGATGGACAGTTTGCCAATCGCCTCTTATTTGTCCAATTCTGTCTGTAGTCTTTCTTTTTTATAGCTTCTTCGGTTCATATTTGTTTGCTGGATGCGTTAGTTGTGAGATCCCACATTGGTTGGGGAGTAAAATAAAACATTCTTTATAAGAGTGTGGAAACCTTCCCCTAGCAGACGCGTTTTAAAAACCTTGAGGGAAAGCCCAAAGAAGACAATATTTGCTAGCGGTGGGCTTGGGCGGTTACAAATGGTATAGGAGCCAAACACTGGGCGATGTACCAGCGAGGAGGATGAGCCTCGAAGAGGAGGTGAACATGAGGCGGTGTGTCAGCAAGGACGTTGGGTCCTGAAGGGCGGTGGATTGAGGGGTCCTGAAGGGCGGTGGATTGAGGGGTCCCACATCGATTAGAGAAGATAACGAGTGCCGGCGAGAACATTGGGCCCTGAAAGGAGGTGGATTGTGAGATCCCACGTTGGTTGGGGAGGAGAACGCAACATTCTTTATAAGGGTGTGGAAACCTTCCCCTAGCAGACGCGTTTTGGAAAGCCCGAAAGAAAATTCCAAAGAGGACAATATCTGCTAGGCGTAGGTTTAGGCCGCTCCATTAGTAACCTCAATTTTAATATCTTCTTATAATTGAAGCATGGAGGCCTACTACTTGCATTATGCTCATACCCACCCCATTCATCCCGGGAGAACAATGGTTTTAAATTATTAAGCTTTCGTTGGGAATTCGGAACCAAAGACTTAAGAAAGCATCCTCCAATATCTCAAACCCTTCATCAAATCAGAACCCCTTGGATGCCTCTCACTTCAAATCAGTAAATAATTAGCAAAGTGGCCTTTATTTGCTTCATATCCTTCAACTGTTTTATGTCATTACAGGAAGCAGCAGTCATCGTTGAATGAATCATGTCACTAGTAGCCAATTGCCCTTTCTTTGTGCTTGGAGCTAGTCTCAGATGTAGTACAGAAGCTTTCAACTCTTGTAAAGGTGAAAAATCAAGGTAAAAACCTATGCACTTGTCTTCAGTTCATGATAAAAAAAATTGATTTGTATGCGGCCACTGCTTAGAAATGAAATACTTCATGTAGTGAATCGTCCAGTATTTCGTGTAACTATTTTCTTGTTGAAAATTAAGTTTGGTTCCTCGTACTTGTGCATAGAAAAACATGAAAAATGGCCTGGTTTGTTGAGCGCTTATGTAACATGTTATGACGTGGTGATGTTTTTTTCCTATGATTTAACAAGAAAAAGAATATTGTTTACTCTAACGGAACAAATCTAGTTCAATCTTTAAATTCATCCCATTAAATCTCAACCTTATAGTAGTAATGAAAATACCTCCATGGACTGATCGAGTTACCAATTACTTACAAATTTTTGACTCGGAAACATTTCTGTTAAGTTATACTATTTTTTTGGTGCTAGTTTTACTGTTGCTACTTTAGTTGTATTACTGCTATGACTTTATACTGCAATGACTTTTTGAATATCTTGTTGATTAAATACGGTTTTGATGAGCCATTAATCTTGCTACCTATTAACGAAAGTTGTTCGTTAATACCGTAGGGTTCAACTTGATGACGTTAAAAGCTTAAGGTCGTTGAACGGATGATACAACTTGCAACTCTAGCAAGGAGCATCAAAGTAGCTAGAGGTTAGTTCGGCCTATTCAATAGCGTTCTTAAAAATAATTAAGGTCTCGACTCGTATAAGATTATAGATGATTTCTTTTTCTATATTTAATACTGTATTTTCCCTTCTCTCGTCTTGAATAATTTGGATGCCTCTTCTAATAGAGGTTAATTTGTTTAAATCTTGAGCCAATAGAGTTATGCATATAGGGGAAAAAAATATCAATTTCATATCCATTAATTACATATTTTCTTTATGTGTTGGCA

mRNA sequence

ATGTTTTCATGTTGTAATCGGAATGAAGGGAAGAGCGTTTCAGAAGAACCCTTTTCTTTATTGAGTTTGAATTTTAAGTGGCTTATCGAGATAAGAGAATGTCAATGAGAAAAGGGTAATTTGTAGATTTATATTTGTGTTTGTGGAAGTGAAATTCTGGAGTGTTTCTTTGGAAGCTTATAAGTGGTTTCTTTTTTTCGTGATGTGTTGGTGAAAATTGAGGTGCTGAGTTTTTGTATTTTAAGTGTTTGATGAAATGCCTGAGTGGTACAATAATGTTCTATCTGTTTATTTAGGCTGATAGGTTCATTGACCTCATGAATCTCACGACAATTCATTTTCGATTTCTTGCCAAACGGAATTTGGTTTTGTACCCAAGTAAGTATGCTTTTGGTTCCCAACTTAGATTCTGGAGGTCGGGGGCAGAAGGCGATATCGTGTCTTTTAGGACAGAAGATTTTCGTCATGACTATCTATTTGGATCAAACGTGATTTCCACGCGTGGCCACCTTGGGCAGGCGCTTTCACTGTTCTACTCTAGACAGCCTCATTCCCTCCAGACCTATGCCTATCTGTTCCATGCTTGTGCACGCCTCCGCTGCCTCCGGGAAGGTGTGGAACTACACCGTTACATGATGTCCCTGGATCCCATGGGCTCATTTGATCTCTTTGTTACCAATCACCTTATCAACATGTACTGTAAATGTGGTCATTTAGACTATGCCTACCAATTATTCAATGAGATGCCTAGGAGAAACCTTGTCTCTTGGACTGTGCTTATCTCAGGACTTTCTCAGTATGGCCATGTGGATGAGTGCTTCCTTATATTTTCGAGAATGTTGGTAGATCACAGGCCAAATGAGTTTACAGTTGCAAGTTTGCTTACCTCGTTTGGTGACCATGATGGTGAGCGTGGGCGGCAGGTACATGGGTTTGCCTTGAAAAGGTCACTAGATGCTTTCGTTTATGTTGCAAATGCTCTTATTACCATGTATAGCAAGAATTACTTTAAAGGTGGTGCTTTTAATGACGGTAAAGATGATGCTTGGACTATGTTCAAGAGCATAGAAAATCCCAGCCTTATAACATGGAACTCGATGATTGCAGGGTTTTGTTTCCGAAAACATGGAAATCGAGCTGTCCATTTATTTATGCAAATGAATCATAAAGGAATTGGATTTGATCGTGCAACACTTTTAAGCACTTTGTCTTCCATAAGTCTCTGCAATTGGGATGAACTTGATCTCGGTTTGGGCTTTTGTCGTGAATTACACTGTCAAGCATTAAAAACTGCTTTCACATCAGAAGTTGAAATAATTACTGCATTGGTAAAAACTTATGCTGAACTTGGAGGGGACATTACGGATAGTTATAGGCTTTTTATTGAAGCAGGATATAATCGGGATATAGTTTTATGGACCAGCATCATGACAGCTTTTGTCGACCATGACCCTGGGAAAACACTTTCCCTTTTTCGTCAGTTCCGACAAGAAGGCTTAACTCCAGATGGACACACTTTTTCGATTGTATTAAAGGCTTGTGCTGGATTCCTAACAGAGAAGCATGCTTCAACATATCATTCACTGCTAATTAAATCTATGTCTGAGGATGACACTGTCCTTAACAATGCCTTGATTCATGCTTATGGGAGGTGTGGTTCAATTACTTCATCCAAGAAAGTATTCAATCAAATGAAACATCACGATTTGGTTTCTTGGAACACAATGATGAAGGTCTATGCTGTCCATGGCCAAGCTGAGATTGCTTTGCAGCTTTTTTCAAAGATGACTGTGCCACCTGATTCTACTACATTTGTCTCTCTTCTTTCAGCATGCAGCCATGCAGGGCTCGTGGAAGAAGGGACCAACCTTTTTAATTCAATTGCAAATTATGGACTTGTTTGCCAACTAGATCACTATGCTTGCATGGTTGACATTTTGGGGAGATCTGGTCGGATTCAAGAGGCTGAAGATTTTATAAGTAAAATGCCTATAGAACCTGATTATGTTATTTGGAGTTCATTCCTGGGATCATGTAAAAAGCATGGCGCAACACAATTGGCCAAATTAGCATCTGATAAATTGAAGGAGTTAGATCCTAGCAATTCCTTAGCTTATGTGCAAATGTCAAATCTATATTGCTTAAGTGGTAGCTTTTATGAAGCAGACTTAATTAGAATGGAAATGAAAGGGTCTAGAGTGAGAAAGGAACCTGGATTAAGTTGGGTAGAAATAGAAAATCAAGTACATGAGTTTGCATCTGGAGGTCGCCATCATCCAGAGAGGGAGGTAATATGCAATGAACTCGAAGAGCTCATTGGGAGGTTAAAGGAAATTGGTTATGTGCCCGAGACAAGCTTAGCGATCCATGATGTGGAACAAGAGCAAAAAGAGGAGCAGCTATATCATCATAGCGAGAAGCTGGCTTTGGTTTTTTCTGTAATGAATGATAACAACTTGGGTTGTGTCAGTACACCTATTAGGATTATGAAAAACATCCGAATTTGTGTAGATTGTCATAATTTCATGAAGTTAGCTTCAAGGCTACTTCAGAAGGAGATTGTCATTAGAGACTCTAATCGTTTTCATCATTTCATGGCTGGTTTATGCTCGTGCAATGATTACTGGTAGTCAATAGGCGTCAAACATTCAAATACCCAAGTATCCATCTGCATTCATTCATAATTTTTTACCTCCAATATGAATAGGATAAAGTTGTGTTCTACATTGACACTTGTATCAATAATCTCTCAGTTCCTTCAACAACTGAGCAGCAAACTTTAGGAGGTGAAGATGTGGTCGGTCCTGGTTCCTACTTCTTGTTGAAGTATGGTACCACAGTTTCTCAAGTACATACAAAAATTTGGTTCCATATGAAGGTTGTAGCCACAGTACCGAGTTGAACTGTAAGGGATACTTTAAAGCAGGTAATGGGTAGACTATTTGTTATCTGCAGGATAACATGCCTTTCTTTAGGTCAACCATGATGATTTACTCCTCCATGAAGGATAAGCAGTCATTTTCTTATTCAAGTATTGATCCTTGTCGAACATCAGAAAAGAGAAAAAGTCAAACCCCTGCAGCTTGTTATCATTTTTTTACGAAGAAAAAAGAGAAAAGAAAGAATAGAAAAGAACAAAAAAAACCAAGACAAGATGGATACAGGAAAGACTACCAATCATTATGCCTTTACAAATCTGCAACGAGAAGCTTCTCCATTTGAATATTCCCTTGAATCCACGACTGCTGAAGTGACATCCATCATCATCAGTGTATCTGATTGGCTGGAATAAAATACTTCTCTAACAACCAGAAGACTATATTTGAAGTACATGTTGGTTTCTAGGAGAATCAGCATGTGGTCTTTGCCTGCTATTGGGTGTTGGAACCCTTTCGCATTTGCTCGCTTTAGAGTTTCTTGTGCTTCTAGTCAGCTACCAGTTTCTCCCAAAGACAAGACCCTGAGAACAATTTACAATTCTGGAGTCATTGCTTGCCTTCGCGCTAGCAGTGCGGAGCTGGCAATGAGTGCTGCTTGTGCTGCGCTAAATGGTGGAGTATCAGTTCTCGAGATTGTTATGTCGACACCAGGTGTGCTTGAGGTTCTACAACAGTTGCTGCAAGACTATCCTACAAGAACACTGGGAGTTGGGACCGTTCTTAATGTTAAGGATGCAAAGAATGCTGTCGAAGCTGGAGCCAAGTTTCTAATGAGTCCCACTATGGTGAAGGGTATCATGGATGATCTTGAAGGGGAATTCTTGTATATACCTGGTGTGATGACCCCGACAGAAGTACTGACTGCATATGAAGCTGGCGCTCAGATTGTTAAAGTTTATCCAGTTTCTGCATTAGGTGGTATCAAATATATATCAGCCCTCAAGAAGCCATTTCCTCATATCTCAATGGTTGCTTCTCAAGGCATAACTATTGAATCTACTGGGGACTACATTAGAGAAGGAGCATCTTCGGTAGTTTTATCTGATGCAATATTTAACAAGGAGTTTATGAAGCAAAAGAACTTTGATGGAATATCTCAACTTTCTAAGTTGGCTGCTTCCCGGGCGATGGAAGCTTTAGAATGGGTTCAACTTGATGACGTTAAAAGCTTAAGGTCGTTGAACGGATGATACAACTTGCAACTCTAGCAAGGAGCATCAAAGTAGCTAGAGGTTAGTTCGGCCTATTCAATAGCGTTCTTAAAAATAATTAAGGTCTCGACTCGTATAAGATTATAGATGATTTCTTTTTCTATATTTAATACTGTATTTTCCCTTCTCTCGTCTTGAATAATTTGGATGCCTCTTCTAATAGAGGTTAATTTGTTTAAATCTTGAGCCAATAGAGTTATGCATATAGGGGAAAAAAATATCAATTTCATATCCATTAATTACATATTTTCTTTATGTGTTGGCA

Coding sequence (CDS)

ATGTTGGTTTCTAGGAGAATCAGCATGTGGTCTTTGCCTGCTATTGGGTGTTGGAACCCTTTCGCATTTGCTCGCTTTAGAGTTTCTTGTGCTTCTAGTCAGCTACCAGTTTCTCCCAAAGACAAGACCCTGAGAACAATTTACAATTCTGGAGTCATTGCTTGCCTTCGCGCTAGCAGTGCGGAGCTGGCAATGAGTGCTGCTTGTGCTGCGCTAAATGGTGGAGTATCAGTTCTCGAGATTGTTATGTCGACACCAGGTGTGCTTGAGGTTCTACAACAGTTGCTGCAAGACTATCCTACAAGAACACTGGGAGTTGGGACCGTTCTTAATGTTAAGGATGCAAAGAATGCTGTCGAAGCTGGAGCCAAGTTTCTAATGAGTCCCACTATGGTGAAGGGTATCATGGATGATCTTGAAGGGGAATTCTTGTATATACCTGGTGTGATGACCCCGACAGAAGTACTGACTGCATATGAAGCTGGCGCTCAGATTGTTAAAGTTTATCCAGTTTCTGCATTAGGTGGTATCAAATATATATCAGCCCTCAAGAAGCCATTTCCTCATATCTCAATGGTTGCTTCTCAAGGCATAACTATTGAATCTACTGGGGACTACATTAGAGAAGGAGCATCTTCGGTAGTTTTATCTGATGCAATATTTAACAAGGAGTTTATGAAGCAAAAGAACTTTGATGGAATATCTCAACTTTCTAAGTTGGCTGCTTCCCGGGCGATGGAAGCTTTAGAATGGGTTCAACTTGATGACGTTAAAAGCTTAAGGTCGTTGAACGGATGA

Protein sequence

MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSKLAASRAMEALEWVQLDDVKSLRSLNG
BLAST of Cp4.1LG01g09840 vs. Swiss-Prot
Match: ALKH_DICD3 (KHG/KDPG aldolase OS=Dickeya dadantii (strain 3937) GN=eda PE=3 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 1.6e-15
Identity = 54/195 (27.69%), Postives = 101/195 (51.79%), Query Frame = 1

Query: 52  VIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLN 111
           V+  +  +  E A+  A A + GGV VLE+ + T   +E ++ + Q+ P   +G GTV N
Sbjct: 17  VVPVIVINKLEHAVPMAKALVAGGVRVLELTLRTECAVEAIRLIAQEVPDAIVGAGTVTN 76

Query: 112 VKDAKNAVEAGAKFLMSPTMVKGIMD-DLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYP 171
            +       AGA+F +SP + + ++    EG    IPG+ T +E++   + G +  K +P
Sbjct: 77  PQQLAEVTAAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLREFKFFP 136

Query: 172 VSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKN 231
             A GG+K + A+  PF  I    + GI++++  DY+   +   V    +   + ++  +
Sbjct: 137 AEANGGVKALQAIAGPFGKIRFCPTGGISLKNYRDYLALKSVLCVGGSWLVPADALESGD 196

Query: 232 FDGISQLSKLAASRA 246
           +D I+ L++ A + A
Sbjct: 197 YDRITALAREAVAGA 211

BLAST of Cp4.1LG01g09840 vs. Swiss-Prot
Match: ALKH_ECO57 (KHG/KDPG aldolase OS=Escherichia coli O157:H7 GN=eda PE=3 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-15
Identity = 52/195 (26.67%), Postives = 100/195 (51.28%), Query Frame = 1

Query: 52  VIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLN 111
           V+  +     E A+  A A + GGV VLE+ + T   ++ ++ + ++ P   +G GTVLN
Sbjct: 17  VVPVIVVKKLEHAVPMAKALVAGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTVLN 76

Query: 112 VKDAKNAVEAGAKFLMSPTMVKGIMD-DLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYP 171
            +      EAGA+F +SP + + ++    EG    IPG+ T +E++   + G +  K +P
Sbjct: 77  PQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFKFFP 136

Query: 172 VSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKN 231
             A GG+K + A+  PF  +    + GI+  +  DY+   +   +    +   + ++  +
Sbjct: 137 AEANGGVKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGD 196

Query: 232 FDGISQLSKLAASRA 246
           +D I++L++ A   A
Sbjct: 197 YDRITKLAREAVEGA 211

BLAST of Cp4.1LG01g09840 vs. Swiss-Prot
Match: ALKH_ECOL6 (KHG/KDPG aldolase OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) GN=eda PE=3 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-15
Identity = 52/195 (26.67%), Postives = 100/195 (51.28%), Query Frame = 1

Query: 52  VIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLN 111
           V+  +     E A+  A A + GGV VLE+ + T   ++ ++ + ++ P   +G GTVLN
Sbjct: 17  VVPVIVVKKLEHAVPMAKALVAGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTVLN 76

Query: 112 VKDAKNAVEAGAKFLMSPTMVKGIMD-DLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYP 171
            +      EAGA+F +SP + + ++    EG    IPG+ T +E++   + G +  K +P
Sbjct: 77  PQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFKFFP 136

Query: 172 VSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKN 231
             A GG+K + A+  PF  +    + GI+  +  DY+   +   +    +   + ++  +
Sbjct: 137 AEANGGVKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGD 196

Query: 232 FDGISQLSKLAASRA 246
           +D I++L++ A   A
Sbjct: 197 YDRITKLAREAVEGA 211

BLAST of Cp4.1LG01g09840 vs. Swiss-Prot
Match: ALKH_ECOLI (KHG/KDPG aldolase OS=Escherichia coli (strain K12) GN=eda PE=1 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-15
Identity = 52/195 (26.67%), Postives = 100/195 (51.28%), Query Frame = 1

Query: 52  VIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLN 111
           V+  +     E A+  A A + GGV VLE+ + T   ++ ++ + ++ P   +G GTVLN
Sbjct: 17  VVPVIVVKKLEHAVPMAKALVAGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTVLN 76

Query: 112 VKDAKNAVEAGAKFLMSPTMVKGIMD-DLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYP 171
            +      EAGA+F +SP + + ++    EG    IPG+ T +E++   + G +  K +P
Sbjct: 77  PQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFKFFP 136

Query: 172 VSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKN 231
             A GG+K + A+  PF  +    + GI+  +  DY+   +   +    +   + ++  +
Sbjct: 137 AEANGGVKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGD 196

Query: 232 FDGISQLSKLAASRA 246
           +D I++L++ A   A
Sbjct: 197 YDRITKLAREAVEGA 211

BLAST of Cp4.1LG01g09840 vs. Swiss-Prot
Match: ALKH_SHIFL (KHG/KDPG aldolase OS=Shigella flexneri GN=eda PE=3 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-15
Identity = 52/195 (26.67%), Postives = 100/195 (51.28%), Query Frame = 1

Query: 52  VIACLRASSAELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLN 111
           V+  +     E A+  A A + GGV VLE+ + T   ++ ++ + ++ P   +G GTVLN
Sbjct: 17  VVPVIVVKKLEHAVPMAKALVAGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTVLN 76

Query: 112 VKDAKNAVEAGAKFLMSPTMVKGIMD-DLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYP 171
            +      EAGA+F +SP + + ++    EG    IPG+ T +E++   + G +  K +P
Sbjct: 77  PQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFKFFP 136

Query: 172 VSALGGIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKN 231
             A GG+K + A+  PF  +    + GI+  +  DY+   +   +    +   + ++  +
Sbjct: 137 AEANGGVKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLCIGGSWLVPADALEAGD 196

Query: 232 FDGISQLSKLAASRA 246
           +D I++L++ A   A
Sbjct: 197 YDRITKLAREAVEGA 211

BLAST of Cp4.1LG01g09840 vs. TrEMBL
Match: A0A0A0KPQ0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G583300 PE=4 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 2.6e-121
Identity = 229/252 (90.87%), Postives = 243/252 (96.43%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASS 60
           MLVSR ISMWSLPAIGCW P AFARFRV CASSQLP+ PKD+TLRTI+NSGVIACLRASS
Sbjct: 1   MLVSRSISMWSLPAIGCWYPCAFARFRVFCASSQLPLPPKDETLRTIHNSGVIACLRASS 60

Query: 61  AELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVE 120
           AELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+
Sbjct: 61  AELAMSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVK 120

Query: 121 AGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKY 180
           AGAKFLMSPTMVKG IM D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKY
Sbjct: 121 AGAKFLMSPTMVKGIIMGDIEGEFLYIPGVMTPTEVLTAYESGSEIVKVYPVSALGGIKY 180

Query: 181 ISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSK 240
           ISALKKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM +KNFDGI QLSK
Sbjct: 181 ISALKKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDKKNFDGIFQLSK 240

Query: 241 LAASRAMEALEW 252
           LAAS+AMEALEW
Sbjct: 241 LAASQAMEALEW 252

BLAST of Cp4.1LG01g09840 vs. TrEMBL
Match: B9GLV9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s07070g PE=4 SV=2)

HSP 1 Score: 285.4 bits (729), Expect = 7.0e-74
Identity = 165/255 (64.71%), Postives = 196/255 (76.86%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRV--SCASSQLPVSPK-DKTLRTIYNSGVIACLR 60
           M  S R++  SLP +          FRV  S +SS L +SP   KT   I NSGVIACLR
Sbjct: 1   MATSWRLTPQSLPLLSP----KVKSFRVYSSSSSSSLSLSPIIQKTSSLIQNSGVIACLR 60

Query: 61  ASSAELAMSAACAALNGGVSVLEIVMSTPGVLEV-LQQLLQDYPTRTLGVGTVLNVKDAK 120
           A+SAELA  AA AALNGG+SVLEIVMSTPGV +V L+QL++DYPT  LGVGT LN +DA+
Sbjct: 61  ANSAELAYEAATAALNGGISVLEIVMSTPGVFQVVLRQLVKDYPTLALGVGTALNAEDAR 120

Query: 121 NAVEAGAKFLMSPTMVKGIMDDL-EGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALG 180
           NA+ AG+KF MSP  VK IMDD+ + E LYIPGVMTPTE+L+AY+AGA++VKVYPVSALG
Sbjct: 121 NAMNAGSKFFMSPATVKDIMDDVVKDEILYIPGVMTPTEILSAYDAGAKMVKVYPVSALG 180

Query: 181 GIKYISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGIS 240
           G++YISALKKPFPHI MVASQGITI+S G+YI  GASSVVLSDAIF+K  M Q+NF+ I 
Sbjct: 181 GVQYISALKKPFPHIPMVASQGITIDSIGEYISSGASSVVLSDAIFDKGAMTQRNFNVIH 240

Query: 241 QLSKLAASRAMEALE 251
           QL+ LAA    EA+E
Sbjct: 241 QLASLAALEGKEAVE 251

BLAST of Cp4.1LG01g09840 vs. TrEMBL
Match: A0A067L007_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24577 PE=4 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 1.2e-73
Identity = 161/230 (70.00%), Postives = 185/230 (80.43%), Query Frame = 1

Query: 25  RFRVSCASSQLPVS---PKDKTLRTIYNSGVIACLRASSAELAMSAACAALNGGVSVLEI 84
           RFRV   SS  P+S    +++TL  I+NSGVIACLRA+SAELA  AA AAL  G+SVLEI
Sbjct: 24  RFRVY--SSLPPISFSSTRERTLSLIHNSGVIACLRANSAELAFEAASAALRAGISVLEI 83

Query: 85  VMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIMDD-LE 144
           VMSTPGV +VLQQL++D+PT  LGVGTVLN +DA NA  AGAKFLMSP  V GIMD  L+
Sbjct: 84  VMSTPGVFQVLQQLVKDHPTVALGVGTVLNAEDAINAKRAGAKFLMSPATVMGIMDVVLD 143

Query: 145 GEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITI 204
           GE LYIPG MTPTE+L+AY+AGA+I+KVYPVSALGG +YISALKKPF HISMVASQGI I
Sbjct: 144 GEVLYIPGAMTPTEILSAYDAGAKIIKVYPVSALGGTQYISALKKPFAHISMVASQGIMI 203

Query: 205 ESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSKLAASRAMEALE 251
           +S GDYI  GASSVVLSDAIFNKE M QKNF+ I QL+ LA  +  EA+E
Sbjct: 204 DSVGDYISCGASSVVLSDAIFNKEAMAQKNFNEIYQLACLANLQGQEAVE 251

BLAST of Cp4.1LG01g09840 vs. TrEMBL
Match: A0A0S3RS39_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G053400 PE=4 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 4.7e-70
Identity = 147/238 (61.76%), Postives = 184/238 (77.31%), Query Frame = 1

Query: 7   ISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASSAELAMS 66
           +S+   P      P      RVSC    L  S  DKTL  I NSG+IACLRA+SAE+A+ 
Sbjct: 32  LSLCQWPIFSSSPPSLRMHVRVSCGIPHLSASAVDKTLSQINNSGIIACLRANSAEVALK 91

Query: 67  AACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFL 126
           AA AA+ GGVSVLEIV+STPGV EVLQQL++++PT  +GVGTVL ++DAKNA+ AGAKFL
Sbjct: 92  AANAAIAGGVSVLEIVVSTPGVFEVLQQLVKEHPTMAIGVGTVLKIEDAKNAINAGAKFL 151

Query: 127 MSPTMVKGIM--DDLE-GEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISAL 186
           +SP  VK IM  D ++ GE LYIPG MTPTE+L+A++AGA++VK+YP SALGG +YISAL
Sbjct: 152 LSPATVKDIMVMDYVQSGEVLYIPGTMTPTEILSAWDAGAKMVKIYPASALGGFQYISAL 211

Query: 187 KKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSKLA 242
           KK FPH+SMVASQGITI++ G+YI  GASSVVLSDAIF+KE ++Q NFD I +L++ A
Sbjct: 212 KKTFPHVSMVASQGITIDAIGEYILRGASSVVLSDAIFDKEAIEQLNFDKIHKLARSA 269

BLAST of Cp4.1LG01g09840 vs. TrEMBL
Match: A0A0L9U278_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g026200 PE=4 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 1.0e-69
Identity = 144/218 (66.06%), Postives = 179/218 (82.11%), Query Frame = 1

Query: 27  RVSCASSQLPVSPKDKTLRTIYNSGVIACLRASSAELAMSAACAALNGGVSVLEIVMSTP 86
           RVSC    L  S  DKTL  I NSG+IACLRA+SAE+A+ AA AA+ GGVSVLEIV+STP
Sbjct: 4   RVSCGIPHLSASAVDKTLSQINNSGIIACLRANSAEVALKAANAAIAGGVSVLEIVVSTP 63

Query: 87  GVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAKFLMSPTMVKGIM--DDLE-GEF 146
           GV EVLQQL++++PT  +GVGTVL ++DAKNA+ AGAKFL+SP  VK IM  D ++ GE 
Sbjct: 64  GVFEVLQQLVKEHPTMAIGVGTVLKIEDAKNAINAGAKFLLSPATVKDIMVMDYVQSGEV 123

Query: 147 LYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISALKKPFPHISMVASQGITIEST 206
           LYIPG MTPTE+L+A++AGA++VK+YP SALGG +YISALKK FPH+SMVASQGITI++ 
Sbjct: 124 LYIPGTMTPTEILSAWDAGAKMVKIYPASALGGFQYISALKKTFPHVSMVASQGITIDAI 183

Query: 207 GDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSKLA 242
           G+YI  GASSVVLSDAIF+KE ++Q NFD I +L++ A
Sbjct: 184 GEYILRGASSVVLSDAIFDKEAIEQLNFDKIHKLARSA 221

BLAST of Cp4.1LG01g09840 vs. NCBI nr
Match: gi|778704362|ref|XP_011655527.1| (PREDICTED: uncharacterized protein LOC101218794 isoform X1 [Cucumis sativus])

HSP 1 Score: 444.1 bits (1141), Expect = 1.7e-121
Identity = 230/254 (90.55%), Postives = 244/254 (96.06%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASS 60
           MLVSR ISMWSLPAIGCW P AFARFRV CASSQLP+ PKD+TLRTI+NSGVIACLRASS
Sbjct: 1   MLVSRSISMWSLPAIGCWYPCAFARFRVFCASSQLPLPPKDETLRTIHNSGVIACLRASS 60

Query: 61  AELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVE 120
           AELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+
Sbjct: 61  AELAMSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVK 120

Query: 121 AGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKY 180
           AGAKFLMSPTMVKG IM D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKY
Sbjct: 121 AGAKFLMSPTMVKGIIMGDIEGEFLYIPGVMTPTEVLTAYESGSEIVKVYPVSALGGIKY 180

Query: 181 ISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSK 240
           ISALKKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM +KNFDGI QLSK
Sbjct: 181 ISALKKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDKKNFDGIFQLSK 240

Query: 241 LAASRAMEALEWVQ 254
           LAAS+AMEALEW Q
Sbjct: 241 LAASQAMEALEWKQ 254

BLAST of Cp4.1LG01g09840 vs. NCBI nr
Match: gi|659090293|ref|XP_008445938.1| (PREDICTED: uncharacterized protein LOC103488815 [Cucumis melo])

HSP 1 Score: 443.7 bits (1140), Expect = 2.2e-121
Identity = 231/254 (90.94%), Postives = 241/254 (94.88%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASS 60
           MLVSR ISMWSLP IGCWNP AFARFRV CASSQLPV  KDKTLRTI+NSGVIACLRASS
Sbjct: 1   MLVSRSISMWSLPTIGCWNPCAFARFRVFCASSQLPVPSKDKTLRTIHNSGVIACLRASS 60

Query: 61  AELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVE 120
           AELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+
Sbjct: 61  AELAMSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVK 120

Query: 121 AGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKY 180
           AGAKFLMSPTMVKG IM D+E EFLYIPGVMTPTEVLTAYE+G +IVKVYPVSALGGIKY
Sbjct: 121 AGAKFLMSPTMVKGIIMGDIEDEFLYIPGVMTPTEVLTAYESGCEIVKVYPVSALGGIKY 180

Query: 181 ISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSK 240
           ISALKKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM QKNFDGI QLSK
Sbjct: 181 ISALKKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDQKNFDGIFQLSK 240

Query: 241 LAASRAMEALEWVQ 254
           LAAS+AMEALEW Q
Sbjct: 241 LAASQAMEALEWKQ 254

BLAST of Cp4.1LG01g09840 vs. NCBI nr
Match: gi|700196423|gb|KGN51600.1| (hypothetical protein Csa_5G583300 [Cucumis sativus])

HSP 1 Score: 443.0 bits (1138), Expect = 3.8e-121
Identity = 229/252 (90.87%), Postives = 243/252 (96.43%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASS 60
           MLVSR ISMWSLPAIGCW P AFARFRV CASSQLP+ PKD+TLRTI+NSGVIACLRASS
Sbjct: 1   MLVSRSISMWSLPAIGCWYPCAFARFRVFCASSQLPLPPKDETLRTIHNSGVIACLRASS 60

Query: 61  AELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVE 120
           AELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+
Sbjct: 61  AELAMSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVK 120

Query: 121 AGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKY 180
           AGAKFLMSPTMVKG IM D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKY
Sbjct: 121 AGAKFLMSPTMVKGIIMGDIEGEFLYIPGVMTPTEVLTAYESGSEIVKVYPVSALGGIKY 180

Query: 181 ISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSK 240
           ISALKKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM +KNFDGI QLSK
Sbjct: 181 ISALKKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDKKNFDGIFQLSK 240

Query: 241 LAASRAMEALEW 252
           LAAS+AMEALEW
Sbjct: 241 LAASQAMEALEW 252

BLAST of Cp4.1LG01g09840 vs. NCBI nr
Match: gi|778704369|ref|XP_011655529.1| (PREDICTED: uncharacterized protein LOC101218794 isoform X2 [Cucumis sativus])

HSP 1 Score: 438.7 bits (1127), Expect = 7.1e-120
Identity = 228/251 (90.84%), Postives = 242/251 (96.41%), Query Frame = 1

Query: 1   MLVSRRISMWSLPAIGCWNPFAFARFRVSCASSQLPVSPKDKTLRTIYNSGVIACLRASS 60
           MLVSR ISMWSLPAIGCW P AFARFRV CASSQLP+ PKD+TLRTI+NSGVIACLRASS
Sbjct: 1   MLVSRSISMWSLPAIGCWYPCAFARFRVFCASSQLPLPPKDETLRTIHNSGVIACLRASS 60

Query: 61  AELAMSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVE 120
           AELAMSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+
Sbjct: 61  AELAMSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVK 120

Query: 121 AGAKFLMSPTMVKG-IMDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKY 180
           AGAKFLMSPTMVKG IM D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKY
Sbjct: 121 AGAKFLMSPTMVKGIIMGDIEGEFLYIPGVMTPTEVLTAYESGSEIVKVYPVSALGGIKY 180

Query: 181 ISALKKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSK 240
           ISALKKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM +KNFDGI QLSK
Sbjct: 181 ISALKKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDKKNFDGIFQLSK 240

Query: 241 LAASRAMEALE 251
           LAAS+AMEALE
Sbjct: 241 LAASQAMEALE 251

BLAST of Cp4.1LG01g09840 vs. NCBI nr
Match: gi|778704372|ref|XP_011655530.1| (PREDICTED: uncharacterized protein LOC101218794 isoform X3 [Cucumis sativus])

HSP 1 Score: 335.1 bits (858), Expect = 1.1e-88
Identity = 174/190 (91.58%), Postives = 185/190 (97.37%), Query Frame = 1

Query: 65  MSAACAALNGGVSVLEIVMSTPGVLEVLQQLLQDYPTRTLGVGTVLNVKDAKNAVEAGAK 124
           MSAACAALNGG+SVLEIVMSTPGVLEVLQQLLQDYPT+TLGVGTVLN+KDAKNAV+AGAK
Sbjct: 1   MSAACAALNGGISVLEIVMSTPGVLEVLQQLLQDYPTKTLGVGTVLNIKDAKNAVKAGAK 60

Query: 125 FLMSPTMVKGI-MDDLEGEFLYIPGVMTPTEVLTAYEAGAQIVKVYPVSALGGIKYISAL 184
           FLMSPTMVKGI M D+EGEFLYIPGVMTPTEVLTAYE+G++IVKVYPVSALGGIKYISAL
Sbjct: 61  FLMSPTMVKGIIMGDIEGEFLYIPGVMTPTEVLTAYESGSEIVKVYPVSALGGIKYISAL 120

Query: 185 KKPFPHISMVASQGITIESTGDYIREGASSVVLSDAIFNKEFMKQKNFDGISQLSKLAAS 244
           KKPFPHISMVASQGITIESTGDYIR+GASSVVLSDAIFNKEFM +KNFDGI QLSKLAAS
Sbjct: 121 KKPFPHISMVASQGITIESTGDYIRQGASSVVLSDAIFNKEFMDKKNFDGIFQLSKLAAS 180

Query: 245 RAMEALEWVQ 254
           +AMEALEW Q
Sbjct: 181 QAMEALEWKQ 190

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ALKH_DICD31.6e-1527.69KHG/KDPG aldolase OS=Dickeya dadantii (strain 3937) GN=eda PE=3 SV=1[more]
ALKH_ECO572.1e-1526.67KHG/KDPG aldolase OS=Escherichia coli O157:H7 GN=eda PE=3 SV=1[more]
ALKH_ECOL62.1e-1526.67KHG/KDPG aldolase OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC)... [more]
ALKH_ECOLI2.1e-1526.67KHG/KDPG aldolase OS=Escherichia coli (strain K12) GN=eda PE=1 SV=1[more]
ALKH_SHIFL2.1e-1526.67KHG/KDPG aldolase OS=Shigella flexneri GN=eda PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KPQ0_CUCSA2.6e-12190.87Uncharacterized protein OS=Cucumis sativus GN=Csa_5G583300 PE=4 SV=1[more]
B9GLV9_POPTR7.0e-7464.71Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s07070g PE=4 SV=2[more]
A0A067L007_JATCU1.2e-7370.00Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24577 PE=4 SV=1[more]
A0A0S3RS39_PHAAN4.7e-7061.76Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G053400 PE=... [more]
A0A0L9U278_PHAAN1.0e-6966.06Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g026200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
gi|778704362|ref|XP_011655527.1|1.7e-12190.55PREDICTED: uncharacterized protein LOC101218794 isoform X1 [Cucumis sativus][more]
gi|659090293|ref|XP_008445938.1|2.2e-12190.94PREDICTED: uncharacterized protein LOC103488815 [Cucumis melo][more]
gi|700196423|gb|KGN51600.1|3.8e-12190.87hypothetical protein Csa_5G583300 [Cucumis sativus][more]
gi|778704369|ref|XP_011655529.1|7.1e-12090.84PREDICTED: uncharacterized protein LOC101218794 isoform X2 [Cucumis sativus][more]
gi|778704372|ref|XP_011655530.1|1.1e-8891.58PREDICTED: uncharacterized protein LOC101218794 isoform X3 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003824catalytic activity
GO:0016829lyase activity
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: INTERPRO
TermDefinition
IPR013785Aldolase_TIM
IPR000887Aldlse_KDPG_KHG
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0006525 arginine metabolic process
biological_process GO:0006098 pentose-phosphate shunt
biological_process GO:0006560 proline metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016829 lyase activity
molecular_function GO:0008675 2-dehydro-3-deoxy-phosphogluconate aldolase activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g09840.1Cp4.1LG01g09840.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000887KDPG/KHG aldolasePFAMPF01081Aldolasecoord: 50..237
score: 2.7
IPR013785Aldolase-type TIM barrelGENE3DG3DSA:3.20.20.70coord: 40..243
score: 2.1
NoneNo IPR availablePANTHERPTHR302462-KETO-3-DEOXY-6-PHOSPHOGLUCONATE ALDOLASEcoord: 34..251
score: 1.4E
NoneNo IPR availablePANTHERPTHR30246:SF12-DEHYDRO-3-DEOXY-6-PHOSPHOGALACTONATE ALDOLASEcoord: 34..251
score: 1.4E
NoneNo IPR availableunknownSSF51569Aldolasecoord: 42..241
score: 1.39