CmaCh11G010340 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh11G010340
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description3-dehydroquinate synthase homolog
LocationCma_Chr11: 5665514 .. 5677073 (-)
RNA-Seq ExpressionCmaCh11G010340
SyntenyCmaCh11G010340
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCTCTCTTTTTTTGCGATCCGAGTTGCAGAATCGGCAATTGATTATCAAAGAACTGATGAAGATGATGATCCCAATCGAATCGATCTGAATCTGATGGTGTCTTCAGACAGGTCCGAATAATATGCGAAATTACTGAATTGTCCTTTTCTTTTTTTCGCATTGAGTGGGCCTGCTGAAAACATGGGCTGGACTTTTGATTTTGGGCTTTGGATTTCAGTAAATGGGCTCTCTTGGCTGGTAAGCTCTGATAAGATCGTTCAGAACGTTTCTTTTTCTTTTATACTTTTTTTTAATATATATAATATTTTAATCATAAATTTTGATTTTCTTATTCTTTATTTGTAAGAAAATTTCTCAACATTTATTAAAAAAAATCTACTTTCTTGGTTATTATTTTTAGTAATTTAATTAAAAATGTTAGATAAATAAATTTCTCGAACATTAAAATTGTTATTTTTAATAGTTTAACAACTATGAATAAAATAATATTATTAATCGCCACGTGGACTTACGTGCGTTAATCTCTTGAGACAAAGTATTTTTTTTTTTTTTTTATAAATCTCTCGACTTTTAAGATAGATCGATAAACAAAATTGGTTAAAATTTATTGATGTGTCATTATATTATTTTTTATTAAATAAATTTCTTTTTTTTTTATATATAAATTTGAAAAAATTGGTTAGATAGTTAAAAATTTGACTAAAAAAAATGTAAGGATCAATAATAAGTTATAATATATATAAATAAAATAAAATTTGAAGAACCTTAATTGAGTTTTTTTTTTTTTTTTCTAAGAATAAATATTAATTGATTTTTTAAAATACTAAATTGTGAATTTTAGAAACCGGTACGTGGAGTCAAATTTGAAAGCCTTTTTTTTCTTTTACTAAAGTCAAATTTAAAAGAAAATTGTATTTCCACGTAAATAAAATAAATATGACATCATTTAATAGTTTTCGTTTTTCTAAAAAAATGCGAAATTCGTAATTTCTTCGGTGGGCCCACTCCTCCACCTCATGGGCATGACAAGGACTTATTTTCCTTTTTATTTAAAAATATATTTTTAAGGGTTTTTTTTTTTTAATTAAAAATTTATGTATTCTTAATATGTCTTTTGAAATTTTAGGATCATATAACTTATATAACCCGTGTTAGCATTGATTTCGTCTAACTATCTTAACTCAACTTTATTCTATTTTAAAAAAAAATTAAAAATATTTAACATTTGTAACCAAGATTTTAATTATTAATTAACTCGACAACCCAACTCAGCTAAAAAAGAGAAAGTTGGATGGGACTGTGAACTCTATTCGAGTCACCAACTCAACTAACTTAATAGGTTTAAAGTATTTCTGTTATTCAATTCAACCGGCATACACCATTTACCTCGTAAATGTCAATTTTTTTTTCTTTCTTTCTAAAAATAAAGTATTTATCTTAAGATTCAACCCTTTCTAAAAAAACATGTTTCCATAAGTCCCATATTTCAAGCAAGAAAAAGCTTTCAAAGATGAGTCACTTTCACTAAAAGTTAAATGTGAATATTAACAAAAACGCCAATCTCTCCTCACATTCATAAAATTAAAAAAAGATTCTTATGGTATCTTTGTCGCAACACGACACTATAATATTATCTTATTTAAACATAAGTTTTAATGACATTGGATTTAAAATTATTTACAAATTTCAAACCAATGAACTTTGGTTTTACGACTGTATCTTTTAAAAATAATTTCATTACCTGATATGAAAATATTTTTTTTACGTTGAAAAAAAAAACTTTAATGGTTAAATATGGGGAATATAATTTCTAAAGTTTAAAAATTAGTAAAATATAACCATTATTCCAAATATTATAAAATATGAAAAATTATGAAAACAAAAAAACATTATGAATAAACATTTTACATTTTTTTTAACAAGTTAAAATGTTTGTTATATGTTGATACATTTTTTTATATTTTTTATAAAATAATAAGAATTTTGATTCATCTCAATATTGATATCAAACTGAAACAAGACACGTGTAAAAATGGGAAGCTAACACTTTAACAAAAAAAGGAAAAGAAAATAATAATGAAGACGTGGAAGAAGCCAAGTCATGTGCAAGGCACCTGGCATCCACGTGAGGGCATGCTTCGACACGTTACAGCCGGCAAGGCACAAAACACACACTAGGTGGGGCCCACAAAATTCGGATTCCATTGGGGGAATAATATAATAACGGGACAGTGTTCTTTTGACTGGTACGAGGAAACTTGCTATTGGTTAAGTGTTTGATGACGTGTCCTCTATTTTTGGTGCTATTAAAATATCTCTACCTTTTCCCCTCACGTTTGTTTGACTTTTTTTTTTCTTTTTTCTTTTTCTTTTTTTGATTGTGATCTCCTAACGCATAATTGATATTGAAGTGCCTTTTCTTTCCTATTATTATTTATTTTGTTAAAAAAAAAAATCTTTTAAATTTAAAAAATATTTGTTTCTAGTTTCAAAATTAAATTAATATTCAAAAGGACGAAGTTCTAATTTTATTTTTTTAAACATTTTAAAAACGTTGAAAAGATTAATTTAATAAATCATGTGGTTTAGTTAATTTTTTAACAAAAAAAAAATATTTAATGAAAATAATTCTTTCTCTTTTTCAGATTTAAATTTAGTATTTATTAGAATATTTAAATTTCAAAATTATAGAGAATTTGTTCTGCAAAAATAATACTATGAAGATTTAATATGCTAAAATGAACCAGCAAGAAGCCATAACGTGACAAGATACAACTTAAGTGGGTCGAGTATTTAATTTTAATTAAAACTCATTGTATAATAAAAATATATTATATTTATTATTATTTTCTATTATCTACAGATACATATTAGGTATCATATTAATATGTGTCTATATCTATAGAGAAAAATAAATAATAGATATTTAGTTATTTTATTCGTTTTTTTTTTTTTTGAATAAAAAGAAGTAAATGTATTGAAGGTCATGTTCCTTATTTTCTTGTTCAAATCATCAAACTCGATCTGAAAATCTATGTTTACAAATAATGAATATAATACCATTTTGTTAACCATATCTTATAAATTTTTTTTGTCATTTTGTTAATCACGTCGTCTCCAACCTATTCCTTCTAGAATATATCAACCCAACTCAACTGAAATATCATGTCGATCAAGCTGTCCTATGCTACAAGAAGGTTAGGCTCAAAACTCGACGACAAAAAACATCTAAAAAATTAAAATTAAAAAGGATAGATCCATTGAATTTCCTAATTTTACTTTTGCATTCCCTTACGACATAGTCATGTTACATACTCTTTATTTATAAGAGTAAAAATTATCCCCACAGACAAACATATGTCCTTCTAGAATGTTTTGTTCTCGATTGGACGGCAAATGAAGATGCATTTTATTGGTTTAGGTAGTGATTTTTAGCTATTTTAAGTCTTTCCTCGTATTCAAATATAACATTGATTTATTCACGTACCCTTTCTTTAGTTAGATGTCACAAAAACACTTCAAATATAATCGGAAATATATACAAGAAATTAGGTAAAAAAATTAGAAAATGATTAAAAATAAATAAAAATAATTAAAATATAATTTAATTTTCCTAAGTATATTAGGAAGAAAAAAAAAGTGTTTATTTTTATTTAAAATTAAAATATTTATTAAAAAAAAAACAAAATCCGCTGTTTGGAACAGCCCACTGGAATGGTTGGGAATGGCCCCTGGTTTTCTCTTCTTCAAACCCATCTCTCTCTCTCTCCGCTAAAACCTCTTTTATATCCCTTCTTTCTCCTTTCTCCGTTCTCGTCTACAGATATCCTCATTTTTTCATCTCCCTTCAATCCTTACCCTGTTTCTTCTTGCAATTCTATCCAGATCTCCGTTTTTCACCAATTGCTCCACTTTTCGCCTTCAATTTTCCACCGTTTTTTTTGAAATTTTAATGGGGTTTGAGAGGTCTTTGGCTGATCGCAAGGGGAAACAGTTTTTTGAATTGCCCAAAGAAACTACTAAGAACAATAAATCCAGGAAGAATCGGCGGCGCTCTAAGAAACCCTCCTCGCCGATTCAGAAACTTTACGAGACATGTAAGCAAGTATTTGCCTCTACTGAAACTGGAATTGTTCCCTCTCTTGAGGATATCCAACGCCTGCAATCTGTTCTCGGTATGTCTTTACTTTCGTTTTTCATTTTTCTTGTAAAATTTCTGATCGGACCAACCCTACATTGTTTTGAATTGAATTCCCAATTCTGATTGATAATGTGGTTTGATGTTATGATTATTCGCTTTGTTTGTGGGGTTTTAAAATTGATGATCCGTACTATCTGGGAAAGGTTTTGTGAATTTTTCTTGGGTTTCCTTCTTATTTGTTGTTTCTGTACTTTTCATTTTGGATCGTGGGTTTTAAGATATATATATATATATATATTTTTTAGCTTGGTCTCTTCGGCTGTTGCTGGGAACTTGTACTGCTCCTGCACGTGCTGTTCTAGATATTGGTGCTTTGAGTTATTTTAGTTGGATAATGATTGTCTTCAACAATGGATTTGGTTTTTCTATTCAATGATTGAATTTGAGCTTTTATCTGGAAATTACAAACTGTTTCAAGTTCCATGGTTATAGAATAAGTTTGTAAGTTTTTTTATGTTCTAACGCTGATTAGAACCTGTTGGCAAGTTTTATAAATCTTAATTCATCGAAGTCCTGGAAACTGGCTGACCTGCATTTTGTGTTACTAATGTTTCTTGTATATAGATAAAATGAAGGCAGTAGATGTCGGGTTGTCGCCGGAGATGCCGTATTTTCGTACCACAGCTGATGAAGGGACTCCTCCTATAACGTATTTGCACCTCTATGAGAACAACAAATTCTCCGTATGCATTTTCTGTTGCTTTATTAGGAAGCTGTCTAGCTGGGGAAGCTATACTTTTAAGTGATGTACTTTTTGACCATTTTGAGTTTTGGTTGTGTTGCAGATGGGGATATTTTGCCTGCCTCCTTCAGGTGTCATTCCACTTCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAAGCGTATGATTGGGCAGAGGTCGGTGCCGAGAATGGCATATTAGCGAGTGTCGATGCTTCAAACGGCACAGCTCCCTCAAGTGAGTCTCAATACAATATTGTTCTGTACTCTCTCAGCAGTTCGTTGATATGGATGGAAATGTCTATTATAGTTTGATGTTTGATGGGTAGTTGTCTGGTGTGAGAACATAAGCTGGATGACATGTCCTGTTACAAAACCAAAATTGATAGGCCAACCCGAGCAACTAGTCAAGGCATTGTAGCATTGACCTAAAAGTTAGAAGTTCAAAGGGTTTGCTAATGATCATATATCAGTATATGTGTAACGGCCCAAGCCCACCACTAGTAGATATGGTTCTCTTTAGGCTTTCCCTCAAGGTTTTTAAAACGTGTCTGCTAGTGAGAGGATTCCACACACTTATAAAGAATATTTCGTTATCCTTCCCAACCAATGTGGGATCTCACAATCCACCTCCCTCTAGGGCCTAGCGTCCTTGCTAGCACTTGTTTCCTTCTCCAATTGATGTGGGACCACCAATCTACCCTCATTTGGGGCCCAACGTCCTTGCTGGCACACTACCTCGTGTCCACCCTGTTCGGGACTTGGCCTCCTCGTTGGCACATCACCCAGTGTCTGGCTTTAGTACTATTTGTAATGGCTCAAGCCCATTGCTAGTAGATATTATCTTCTTCGGGCTTTCCCTTTACTCAAGATTTTTAAAACGCATTTGCTAGGGAGAGGTTTCCACACCCTTACAAAGAAGGTTCCGTTCGATGTGGGATTTTACAGTATGAACTTTTTTTGTGTGTTTAATTGTTGAGATTAGTTGGGATGCAAATGATCCTGATACATTTATTGAAAAGAATCTCAAATCTTCTGGCCTTTGTCTTTATGGTTCGTAAAACTTAACGAACATATCACAATTTAGAGAAGATAAACTAATTTTGAGTTATAATAAACAGAGGGAAGAGTGATGGTTTTAAGCAAGTAAGATTCCAATCTATCAGCATTTACATCTCCAGGTATTAGGTTGGCCAAAGTTAAGGTAGATGCCAACTTCACAGCACCATGCGACTCGACCATTCTCTACCCTGCAGATGGTGGAAACATGCATTGCTTCACAGCCGTGACAGCATGTGCAGTGTTAGATGTACTTGGCCCACCTTACTCTGATCACGACGGTCGTCATTGCTCATATTACCTCGACTTTCCCTTCACCAAATTTTCAGGTACGTACTGTTGCAACTTTGTCGTAGTGTTGTTTTACCTCCAAAGCAGCAGCAGATGTTGACATGATAATTTTAATCGAAAAACAGCAGTGGATGGGAAATCGATCCCAGAAGCAGAGAGGGAAAGCTATGCTTGGCTTGAAGAAAGAGAGCAACCTGAAGACTTAGCAGCCGTTGGAGCAGAGTACATAGGGCCTAAGATAGTAGAGAGTAGGTGAATTCTGCAAACAAATCTTGTTCATTTGATACAAAATATGCTGTTCATGGATTGGGACTGCCACCTCTTCTGTGCAGCATATAATCTTTGTTTTAGTTTTGATCTTCCTTGTATATAGTTAAGTTAGATAGTGATAAGCAAAAAGAAGTATGGATTTGTTATGTTCTTCCAATAACTATTCCAGTTTGTTCATGTTTTAACAGATGCACTTCAAAAAAATGAGAAAAAAACACAAAATGAAAGGCTTCTTTCCAGATTTTCCTCTCTTTCTTGTCCAAATTCATTAGTAGACGTGTTTCTGTTTATCGACAATATCTTTAGCAACGAGAAACCTTCAAACTTGTAAACTCTCAATATCCAAACAAAAACTATAATTCCAAGAACAGAACATCCAAGATCAGAAGTTACAGAACATGATAGCTTGGAGATAAAAACAAAGAAATCCCGTTAATCCAACAAGGGAGACTTTTCTATTCCATTAGATTCACTCACTGATGCAGAAAAGTACACAATAACATATACATTGTACATGTTAATCAAAACAAAACAAAACAAAACGCCACTATGGGAAAACGTGCCTTAATAAAAAACTACAAAAACGCACCAATTACATATCACTATTTAGAATCTATTAAACGTCTAATCCTTTCTATGCCTAATTCTCTAAACTAGGGGAACATGTGAGTCGGGTTGATAATTTTTTTTGGATCATTTTGCAAATGTATTGAAAATAATTCTCTCACAATCGTGTCAATAACATTGCTAAATTGGGTGGCAATATCACACAACGGCTCCCCCCAACTTATCGATAATCTTTATTAGAAGTCTATCGTAGATATTATCATATTTGTCCACCTCAATATTCTTTGAGTATACCTCCATAGGTAACCTATAGTGACTCGTCATTTAACCTATAGGGACTCATTGTCCGGTGAGTAAGGATGTTTGTTTAACCAACCCATAAGAACCCGTCTCGAAAAATTATGAGTTGAAGGCTACTTACTTTGACGATGGAAAAATGTGTTCCAATTTACGCCTAAACCAATACGATTGATGTGTTGAAATCTTTTGTGAAAGGCCATGCTGATCGCTGTCTTTTTGCTCGTGAACTTGTGGTTTCAGATCATAGCGGCCCACCTTCTTCATCACCGGGCCTTTGGGCTTCTCAGCGGCGAGCGAAAGAGCGTGGAAGAAAACGATGGCGGCCATGGCCTTGCTCTCTTGGTCGCTTGTTTCTCCATTTTTTCCCAAACAGCGTATCATCGCCCACGAAGCACCAGGTTCTCAATTCGCAGTTCGAACCACATTTCGCCTCTCTTTTTGGTTGGTTTTACATCGTTACTGTACTGATTTGCTCCTTTGGCATTTAATCTTTGGTGGGATTCTCAGATAATTTGAAACTTCGCGCTCTGATTTCAAGGGGTTTTGGCGGAGCCATTGGAGGTGAATGTAAATCTTTGGAGATAAATCGTTTACTGTGTTCTTGCGCTTCGTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTGTGGATTTGGAGTGTGGATCGGCAGGTTATGACGGCGGCGGTTGAGAGGGGATGGAGTACCTTCATCTTCTCGCCTCATAATAAGGAGCTTGCTGATGAATGGTCCTGTAAGCTTTTCGTTTCACTTTCTGTTTAACTGATGAGATTTCGATTCGTTATTGCACTTCTTAAAACGTAAACCTTATACCTTTCCATGTCATTTTGCTGGCGCATGACACTCATCTTTTATGTCAAGAATGCTTTCAAAAAGTGTGTCTAAGAATGGATACTTGGGTTTGGCTCAACTCATCTTTAATGGTCTTGAATGTCTTCCCAAGATGGGTTATATAATTCATTTCGGCTGGAGAATAATTCCATGATCTTGTCTTACAAGAGGGGATGCTCTTTTTAGCTATTTGATGTTGAAACAAAACAAAAAAGCAAGTTTTAAATAGAAACTTATAGAAAATAATGGATAAAATCGACACAATTTCTTTTCTCGTAGTGTCACGAAATTTCTTAAGATTATAATTGCCCTGCAATGTTGAAAGTGTTAGAGAGTATGAGTCGAAGAGATTTTTTTCCGCATATTAGACATGTCATGCATGGAGTGACAACCTTATTTTTAATAGGAAAATAGGGTTGTTAATGTTGGTATCGGAGATGCACAACATAAGTGTGGTGGTCATGTGACACTTGAGTAAAGGAGTACATGAATGAACTAAGGTTACATCTGAATGAGAGTGATCTTGAGGATAGGATGACTATGAAAGGCTTAAAAGAATTGAAAGTCATTACCTGTACTAATAGGGTGTACCTTCCTTTTCAGTAGTTCAATCATAGAAACTTCATGGTTAAGTATGCTTTGCTTGGAGTAATTATATGTTGAGTGATCTCTTTGAAATTTTCCTAGGATGCACGTGAGTGTGGACAATACGTGCTGGAAGGATTTGTGTTGGTTTGTACGGATAGTCTTCACTCTTATAAGCGAGAAGTAAGTAATGTTGTGATCATGTCAGAGGGGTGTGCTAGCAAGATGTAAGTTGTGAGAGATGGGCAAGTTTAGGGTTTAAGTTTTAAAAAGGGTAATAATGACGTTTCATCCCTTGTAACTTTAGCAAAGACTTATTAGTTATTATTCTCTATTCAATGTCTCAAAAGTCTATCACTTAAAATTTGTCACATTTAAATAACATTAGCTCACAGAAAACAATATATACTTTAAACGATAAATCACCGGTCCATTGTTTAATATTAACGGTGTTATTGAGTAGATATCAGCTTTATAATTTTGATATCTTGAGTACAAAAGCATATTTGGTTTTCTGATGAGTTTATCGATTAGATATATTTGGTCCGAAAGTGTCATGATCACTGAATTCTTTTGAGGATTCTAATTTGAAGTTTGGTGTTCGATTGCCCAGCAATTGCACTAATAAGCCCACTTTTTATCAACGAGGACGGAGTTTTCGATGGAGAGGGTAGACTAATTGCCACAGTTATCGAGGTTTCTAACCCCCAGCAGTTGGAGCAGCTTCAGCCATCAAATGCATCAGTAGACAATGTTATTGTGGATTTACAAGATTGGCAGGTTCGTAGTTGCTTGCTTACAAGTCTGTTGGTTGAATCATCCTTTAGCTGGCTATGCATGTATAATTTAGAGAAGGGACTGCAAGAATGATTTTGAATTGTCTGGATGAGTGTGTAAACACTGGTGATAAATCTGTTCTATTTAGGGGTTAAAACAACCTTGCTGTGAATATTGATAATTTTTAAGTGATTCTTTTTATGTTCAGATAATACCTGCGGAGAATATTGTTGCAGCGTTTCAGGGGAGTCGAAAAACTGTATTTGCAGTCTCGAAAACTCCTATCGAAGCTCAAATCTTCCTCGAGGTGCATATTTTTGAGGCTGTATCAGTGTGGAAAGATGGACTAAATAATATTATTGTACTATATGTCATTTTCGTAGCATCATAAGATATGATTTGAATTCGTTTTCGTTTCTAGGCACTTGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGTGGTTTATCATTTTGCTTGATTTGTTTCAGTGCAAAGTCTTGCTTCTTATATTGTTTTAGTATCATTAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTCTTAGCTTGACTAAAGCTACTATAACTCATATTCATGTCGCTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGCAGTCTCATGAGACCTGGTGAAGGACTTCTAGTAATCCTCCATTCTCTTCTCTTAATATTGCTCTCCTAGCTTCTAGTTTAGCATCTAAACGGATGAAATTCTTCTGTAGGTTGGGTCCTACGCCAGAGGACTATTTTTAGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCAAGCCGACCTTTTCGAGTCAATGCTGTGAGTTCGGCATAATATTTTGGCTCTATTAGTCTCAGCACGCCAGAAGAATTGTACTCCTTTATTCTTTTGTAGCCTTTTCATCATCTTTGATTTGTGAATGTTCAGGGACCAGTCCATGCCTATGTAGCTGTCCCAGGAGATAAAACGAGCTACCTTTCTGAGTTACGAGCAGGCAAAGAGGTAATCGTAGTTGATCAAGAAGGCAGGCAACGAACCGCTATTGTTGGGCGTGTAAAGATAGAGACTAGGCAGCTGGTACTCATCCAGGCAAAGGTACACTTGGTGTTTTTATCCGAATCTTGCGTAGTAGTTCGATCTATTGATGTTTCTATGAATCGAGAACTAAAAAGGATTCTTCCATAGTATCTCTGCTGCTGCTTACTTTGAATCTTATGATCCCAGCTGAATTTTGTATGTCTGTTTATTTGGGTGTGATGCTAGAAATGACTTGCTAGAATAATTTACTATTGGCAGAGAGATTCAGATGAGCAAACTCTGTACAGCATCCTCCTGCAGAACGCAGAAACGGTTGCCTTAGTCTGCCCCGGTCGAGGTCGATGATTCGTATCTCAAATTAGAGTTTTGTATTGCTTTGCTTTGCTTTCTTCTCCAAAATTTGGCTTCAATGTTGTGCAGGAAATGAGAAGAAAGCCATCCCTGTTACCTCACTTAAAGTTGGCGATGAAGTGTTCTTGAGACTGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAGGAGTTTATTGTAGAGAAATGATTGTTAAGCTATTTTGAGTGAAATACTGAAACCAAATGGTATAATTTTTTTTTATGCTAATCTCTTCCAAAGTTATAGGTACATGAAAATGACTAAAATATCATCGTACTATCTCCGTGTCTCTCATCGTTGTAATCTTACCACTAACGATCACACCAACGACTACTTCCGTGAGCATCGATGATGAATATGGTCTTAGTGTTTATTCATTGTCGTTTATAGTAATTTAATATTAGAAGCATCTTGAGAGTAAAACAAATCATTGTGTTTACTCAAATTAGTGTTGGC

mRNA sequence

ATGAAGCTCTCTTTTTTTGCGATCCGAGTTGCAGAATCGGCAATTGATTATCAAAGAACTGATGAAGATGATGATCCCAATCGAATCGATCTGAATCTGATGGTGTCTTCAGACAGATCTCCGTTTTTCACCAATTGCTCCACTTTTCGCCTTCAATTTTCCACCGTTTTTTTTGAAATTTTAATGGGGTTTGAGAGGTCTTTGGCTGATCGCAAGGGGAAACAGTTTTTTGAATTGCCCAAAGAAACTACTAAGAACAATAAATCCAGGAAGAATCGGCGGCGCTCTAAGAAACCCTCCTCGCCGATTCAGAAACTTTACGAGACATGTAAGCAAGTATTTGCCTCTACTGAAACTGGAATTGTTCCCTCTCTTGAGGATATCCAACGCCTGCAATCTGTTCTCGATAAAATGAAGGCAGTAGATGTCGGGTTGTCGCCGGAGATGCCGTATTTTCGTACCACAGCTGATGAAGGGACTCCTCCTATAACGTATTTGCACCTCTATGAGAACAACAAATTCTCCATGGGGATATTTTGCCTGCCTCCTTCAGGTGTCATTCCACTTCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAAGCGTATGATTGGGCAGAGGTCGGTGCCGAGAATGGCATATTAGCGAGTGTCGATGCTTCAAACGGCACAGCTCCCTCAAGTATTAGGTTGGCCAAAGTTAAGGTAGATGCCAACTTCACAGCACCATGCGACTCGACCATTCTCTACCCTGCAGATGGTGGAAACATGCATTGCTTCACAGCCGTGACAGCATGTGCAGTGTTAGATGTACTTGGCCCACCTTACTCTGATCACGACGGTCGTCATTGCTCATATTACCTCGACTTTCCCTTCACCAAATTTTCAGCAGTGGATGGGAAATCGATCCCAGAAGCAGAGAGGGAAAGCTATGCTTGGCTTGAAGAAAGAGAGCAACCTGAAGACTTAGCAGCCGTTGGAGCAGAGTACATAGGGCCTAAGATAGTAGAGAGCCATGCTGATCGCTGTCTTTTTGCTCGTGAACTTGTGGTTTCAGATCATAGCGGCCCACCTTCTTCATCACCGGGCCTTTGGGCTTCTCAGCGGCGAGCGAAAGAGCGTGGAAGAAAACGATGGCGGCCATGGCCTTGCTCTCTTGGTCGCTTGTTTCTCCATTTTTTCCCAAACAGCGTATCATCGCCCACGAAGCACCAGGTTCTCAATTCGCAGTTCGAACCACATTTCGCCTCTCTTTTTGATAATTTGAAACTTCGCGCTCTGATTTCAAGGGGTTTTGGCGGAGCCATTGGAGGTGAATGTAAATCTTTGGAGATAAATCGTTTACTGTGTTCTTGCGCTTCGTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTGTGGATTTGGAGTGTGGATCGGCAGGTTATGACGGCGGCGGTTGAGAGGGGATGGAGTACCTTCATCTTCTCGCCTCATAATAAGGAGCTTGCTGATGAATGGTCCTCAATTGCACTAATAAGCCCACTTTTTATCAACGAGGACGGAGTTTTCGATGGAGAGGGTAGACTAATTGCCACAGTTATCGAGGTTTCTAACCCCCAGCAGTTGGAGCAGCTTCAGCCATCAAATGCATCAGTAGACAATGTTATTGTGGATTTACAAGATTGGCAGATAATACCTGCGGAGAATATTGTTGCAGCGTTTCAGGGGAGTCGAAAAACTGTATTTGCAGTCTCGAAAACTCCTATCGAAGCTCAAATCTTCCTCGAGGCACTTGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTCTTAGCTTGACTAAAGCTACTATAACTCATATTCATGTCGCTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGCAGTCTCATGAGACCTGGTGAAGGACTTCTAGTTGGGTCCTACGCCAGAGGACTATTTTTAGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCAAGCCGACCTTTTCGAGTCAATGCTGGACCAGTCCATGCCTATGTAGCTGTCCCAGGAGATAAAACGAGCTACCTTTCTGAGTTACGAGCAGGCAAAGAGGTAATCGTAGTTGATCAAGAAGGCAGGCAACGAACCGCTATTGTTGGGCGTGTAAAGATAGAGACTAGGCAGCTGGTACTCATCCAGGCAAAGAGAGATTCAGATGAGCAAACTCTGTACAGCATCCTCCTGCAGAACGCAGAAACGGTTGCCTTAGTCTGCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCTGTTACCTCACTTAAAGTTGGCGATGAAGTGTTCTTGAGACTGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAGGAGTTTATTGTAGAGAAATGATTGTTAAGCTATTTTGAGTGAAATACTGAAACCAAATGGTATAATTTTTTTTTATGCTAATCTCTTCCAAAGTTATAGGTACATGAAAATGACTAAAATATCATCGTACTATCTCCGTGTCTCTCATCGTTGTAATCTTACCACTAACGATCACACCAACGACTACTTCCGTGAGCATCGATGATGAATATGGTCTTAGTGTTTATTCATTGTCGTTTATAGTAATTTAATATTAGAAGCATCTTGAGAGTAAAACAAATCATTGTGTTTACTCAAATTAGTGTTGGC

Coding sequence (CDS)

ATGAAGCTCTCTTTTTTTGCGATCCGAGTTGCAGAATCGGCAATTGATTATCAAAGAACTGATGAAGATGATGATCCCAATCGAATCGATCTGAATCTGATGGTGTCTTCAGACAGATCTCCGTTTTTCACCAATTGCTCCACTTTTCGCCTTCAATTTTCCACCGTTTTTTTTGAAATTTTAATGGGGTTTGAGAGGTCTTTGGCTGATCGCAAGGGGAAACAGTTTTTTGAATTGCCCAAAGAAACTACTAAGAACAATAAATCCAGGAAGAATCGGCGGCGCTCTAAGAAACCCTCCTCGCCGATTCAGAAACTTTACGAGACATGTAAGCAAGTATTTGCCTCTACTGAAACTGGAATTGTTCCCTCTCTTGAGGATATCCAACGCCTGCAATCTGTTCTCGATAAAATGAAGGCAGTAGATGTCGGGTTGTCGCCGGAGATGCCGTATTTTCGTACCACAGCTGATGAAGGGACTCCTCCTATAACGTATTTGCACCTCTATGAGAACAACAAATTCTCCATGGGGATATTTTGCCTGCCTCCTTCAGGTGTCATTCCACTTCACAACCATCCTGGAATGACAGTCTTCAGCAAGCTTCTCTTTGGGACTATGCACATCAAAGCGTATGATTGGGCAGAGGTCGGTGCCGAGAATGGCATATTAGCGAGTGTCGATGCTTCAAACGGCACAGCTCCCTCAAGTATTAGGTTGGCCAAAGTTAAGGTAGATGCCAACTTCACAGCACCATGCGACTCGACCATTCTCTACCCTGCAGATGGTGGAAACATGCATTGCTTCACAGCCGTGACAGCATGTGCAGTGTTAGATGTACTTGGCCCACCTTACTCTGATCACGACGGTCGTCATTGCTCATATTACCTCGACTTTCCCTTCACCAAATTTTCAGCAGTGGATGGGAAATCGATCCCAGAAGCAGAGAGGGAAAGCTATGCTTGGCTTGAAGAAAGAGAGCAACCTGAAGACTTAGCAGCCGTTGGAGCAGAGTACATAGGGCCTAAGATAGTAGAGAGCCATGCTGATCGCTGTCTTTTTGCTCGTGAACTTGTGGTTTCAGATCATAGCGGCCCACCTTCTTCATCACCGGGCCTTTGGGCTTCTCAGCGGCGAGCGAAAGAGCGTGGAAGAAAACGATGGCGGCCATGGCCTTGCTCTCTTGGTCGCTTGTTTCTCCATTTTTTCCCAAACAGCGTATCATCGCCCACGAAGCACCAGGTTCTCAATTCGCAGTTCGAACCACATTTCGCCTCTCTTTTTGATAATTTGAAACTTCGCGCTCTGATTTCAAGGGGTTTTGGCGGAGCCATTGGAGGTGAATGTAAATCTTTGGAGATAAATCGTTTACTGTGTTCTTGCGCTTCGTCGTCCTCTTCAATGTCTCCGATTGAGGCGTCGAAGGGGGTGTGGATTTGGAGTGTGGATCGGCAGGTTATGACGGCGGCGGTTGAGAGGGGATGGAGTACCTTCATCTTCTCGCCTCATAATAAGGAGCTTGCTGATGAATGGTCCTCAATTGCACTAATAAGCCCACTTTTTATCAACGAGGACGGAGTTTTCGATGGAGAGGGTAGACTAATTGCCACAGTTATCGAGGTTTCTAACCCCCAGCAGTTGGAGCAGCTTCAGCCATCAAATGCATCAGTAGACAATGTTATTGTGGATTTACAAGATTGGCAGATAATACCTGCGGAGAATATTGTTGCAGCGTTTCAGGGGAGTCGAAAAACTGTATTTGCAGTCTCGAAAACTCCTATCGAAGCTCAAATCTTCCTCGAGGCACTTGAACACGGTCTGGGTGGAGTTATTTTGAAAGTTGAAGATCCTGAAGCTGTTTTTCAGCTAAAGGACTATTTTGACAGAAGAAATGAAGCTAGTAATCTTCTTAGCTTGACTAAAGCTACTATAACTCATATTCATGTCGCTGGAATGGGAGATCGAGTTTGTGTCGATCTCTGCAGTCTCATGAGACCTGGTGAAGGACTTCTAGTTGGGTCCTACGCCAGAGGACTATTTTTAGTTCACTCGGAATGCTTAGAGTCAAATTACATTGCAAGCCGACCTTTTCGAGTCAATGCTGGACCAGTCCATGCCTATGTAGCTGTCCCAGGAGATAAAACGAGCTACCTTTCTGAGTTACGAGCAGGCAAAGAGGTAATCGTAGTTGATCAAGAAGGCAGGCAACGAACCGCTATTGTTGGGCGTGTAAAGATAGAGACTAGGCAGCTGGTACTCATCCAGGCAAAGAGAGATTCAGATGAGCAAACTCTGTACAGCATCCTCCTGCAGAACGCAGAAACGGTTGCCTTAGTCTGCCCCGGTCGAGGAAATGAGAAGAAAGCCATCCCTGTTACCTCACTTAAAGTTGGCGATGAAGTGTTCTTGAGACTGCAAGGAGAAGCAAGGCATACAGGTATTGAAATCCAGGAGTTTATTGTAGAGAAATGA

Protein sequence

MKLSFFAIRVAESAIDYQRTDEDDDPNRIDLNLMVSSDRSPFFTNCSTFRLQFSTVFFEILMGFERSLADRKGKQFFELPKETTKNNKSRKNRRRSKKPSSPIQKLYETCKQVFASTETGIVPSLEDIQRLQSVLDKMKAVDVGLSPEMPYFRTTADEGTPPITYLHLYENNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGILASVDASNGTAPSSIRLAKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFPFTKFSAVDGKSIPEAERESYAWLEEREQPEDLAAVGAEYIGPKIVESHADRCLFARELVVSDHSGPPSSSPGLWASQRRAKERGRKRWRPWPCSLGRLFLHFFPNSVSSPTKHQVLNSQFEPHFASLFDNLKLRALISRGFGGAIGGECKSLEINRLLCSCASSSSSMSPIEASKGVWIWSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIALISPLFINEDGVFDGEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATITHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPVHAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLIQAKRDSDEQTLYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK
Homology
BLAST of CmaCh11G010340 vs. ExPASy Swiss-Prot
Match: Q8LGJ5 (Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 4.1e-84
Identity = 158/264 (59.85%), Postives = 198/264 (75.00%), Query Frame = 0

Query: 85  KNNKSRKNRRRSKKP-SSPIQKLYETCKQVFASTETGIVPSLEDIQRLQSVLDKMKAVDV 144
           ++N  +K +RRSKK    P+QKL++TCK+VFA  ++G VPS E+I+ L++VLD++K  DV
Sbjct: 28  RSNSRKKIQRRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDV 87

Query: 145 GLSPEMPYFRTTADEGTPPITYLHLYENNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLF 204
           G++P+M YFR+T    +P +TYLH+Y  ++FS+ IFCLPPSGVIPLHNHP MTVFSKLLF
Sbjct: 88  GVNPKMSYFRSTVTGRSPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLF 147

Query: 205 GTMHIKAYDWAEVGAENGILASVDASNGTAPSS-IRLAKVKVDANFTAPCDSTILYPADG 264
           GTMHIK+YDW                +   PSS  RLAKVKVD++FTAPCD++ILYPADG
Sbjct: 148 GTMHIKSYDW--------------VPDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADG 207

Query: 265 GNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFPFTKFSAVDGKSIPEAERESYAWL 324
           GNMHCFTA TACAVLDV+GPPYSD  GRHC+YY D+PF+ FS VDG  + E E+E YAWL
Sbjct: 208 GNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFS-VDGVVVAEEEKEGYAWL 267

Query: 325 EEREQ-PEDLAAVGAEYIGPKIVE 346
           +ERE+ PEDL      Y GP I E
Sbjct: 268 KEREEKPEDLTVTALMYSGPTIKE 276

BLAST of CmaCh11G010340 vs. ExPASy Swiss-Prot
Match: Q9LXG9 (Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 2.2e-77
Identity = 153/280 (54.64%), Postives = 195/280 (69.64%), Query Frame = 0

Query: 80  PKETTKNNKSRKNRR----RSKKPSSP------IQKLYETCKQVFASTETGIVPSLEDIQ 139
           P    K NK++  +     R KK  SP      +++L+ TCK+VF++   G++PS + IQ
Sbjct: 25  PNSVKKKNKNKNKKMMMTWRRKKIDSPADGITAVRRLFNTCKEVFSNGGPGVIPSEDKIQ 84

Query: 140 RLQSVLDKMKAVDVGLSPEMPYFRTTAD---EGTPPITYLHLYENNKFSMGIFCLPPSGV 199
           +L+ +LD MK  DVGL+P MPYFR  +      +PPITYLHL++ ++FS+GIFCLPPSGV
Sbjct: 85  QLREILDDMKPEDVGLTPTMPYFRPNSGVEARSSPPITYLHLHQCDQFSIGIFCLPPSGV 144

Query: 200 IPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGILASVDASNGTAPSSIRLAKVKVDA 259
           IPLHNHPGMTVFSKLLFGTMHIK+YDW            VDA      S  RLAK+KVD+
Sbjct: 145 IPLHNHPGMTVFSKLLFGTMHIKSYDWV-----------VDAP--MRDSKTRLAKLKVDS 204

Query: 260 NFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFPFTKFSAV 319
            FTAPC+++ILYP DGGNMH FTA+TACAVLDVLGPPY + +GRHC+Y+L+FP  K S+ 
Sbjct: 205 TFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSSE 264

Query: 320 DGKSI-PEAERESYAWLEER-EQPED-LAAVGAEYIGPKI 344
           D   +  E E+E YAWL+ER + PED    VGA Y GPK+
Sbjct: 265 DDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKV 291

BLAST of CmaCh11G010340 vs. ExPASy Swiss-Prot
Match: O26680 (3-dehydroquinate synthase OS=Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) OX=187420 GN=aroB' PE=3 SV=2)

HSP 1 Score: 252.7 bits (644), Expect = 1.4e-65
Identity = 156/373 (41.82%), Postives = 210/373 (56.30%), Query Frame = 0

Query: 479 WSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIALISP------LFINEDGVFDG--- 538
           W   +  +TAA+E G    + +  +  +  +  ++ LISP      + +  DG  DG   
Sbjct: 13  WDEKKTFITAALESGIDHIVDTADSGRI-KKLGNLTLISPDEDADIVLVGRDGEGDGTLE 72

Query: 539 -------------------EGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIP 598
                               GR +A  +E+ +    E  +     VD +I+  +DW+IIP
Sbjct: 73  LPETLEYSRDIEMASELSESGRQVAAYVEIRSKAHEELARRLGRVVDYLILVGEDWKIIP 132

Query: 599 AENIVAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFD-RR 658
            ENI+A  Q     + A      EA++ LE LEHG  GV++   +P  + Q+KD      
Sbjct: 133 LENIIADLQEEDVKLIAAVADVDEARVALETLEHGTDGVLI---EPADISQIKDIAALLE 192

Query: 659 NEASNLLSLTKATITHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESN 718
           N  S    L  ATIT I   G GDRVCVD CS+M  GEG+LVGSY++GLFLVHSE LES 
Sbjct: 193 NIESETYELKPATITRIEPIGSGDRVCVDTCSIMGIGEGMLVGSYSQGLFLVHSESLESE 252

Query: 719 YIASRPFRVNAGPVHAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQ 778
           Y+ASRPFRVNAGPV AYV VPG +T YLSEL  G EVI+VD++GR R+AIVGRVKIE R 
Sbjct: 253 YVASRPFRVNAGPVQAYVMVPGGRTRYLSELETGDEVIIVDRDGRSRSAIVGRVKIEKRP 312

Query: 779 LVLIQAKRDSDEQTLYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEAR 823
           L+L++A+    E      LLQNAET+ LV     ++ + + V+ L  GD V +     AR
Sbjct: 313 LMLVEAEY---EGMKVRTLLQNAETIRLV----NDKGEPVSVSELGEGDRVLVYFDESAR 372

BLAST of CmaCh11G010340 vs. ExPASy Swiss-Prot
Match: Q58646 (3-dehydroquinate synthase OS=Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) OX=243232 GN=aroB' PE=1 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 7.2e-65
Identity = 147/360 (40.83%), Postives = 215/360 (59.72%), Query Frame = 0

Query: 479 WSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIALISP------LFINEDGVFD---- 538
           W   ++++T A+E      +  P + E   E  +I + S       + +N++   +    
Sbjct: 13  WEEKKKIVTTALESSIPVVVAEPEDIEKIKELGNIKVASHSLDADIVLVNKNDNIEFLKE 72

Query: 539 ----GEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVA-AFQGSRK 598
               G+   I   IE    ++          VDN+I++ +DW IIP EN++A  F    K
Sbjct: 73  AKNLGKETAIYIPIESKEDEEFASEVARFGFVDNIILEGRDWTIIPLENLIADLFHRDVK 132

Query: 599 TVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKATI 658
            V +V+    EA++  E LE G  GV+L  ++ E + +L    +  N+    ++L  AT+
Sbjct: 133 IVASVNSVD-EAKVAYEILEKGTDGVLLNPKNLEDIKELSKLIEEMNKEK--VALDVATV 192

Query: 659 THIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGPV 718
           T +   G GDRVC+D CSLM+ GEG+L+GSY+R LFLVHSE +E+ Y+A+RPFRVNAGPV
Sbjct: 193 TKVEPIGSGDRVCIDTCSLMKIGEGMLIGSYSRALFLVHSETVENPYVATRPFRVNAGPV 252

Query: 719 HAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLIQAKRDSDEQT 778
           HAY+  PG+KT YLSEL+AG +V++VD++G  R AIVGRVKIE R LVLI+A+   D   
Sbjct: 253 HAYILCPGNKTKYLSELKAGDKVLIVDKDGNTREAIVGRVKIERRPLVLIEAEYKGD--- 312

Query: 779 LYSILLQNAETVALVCPGRGNEK-KAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK 823
           +   +LQNAET+ LV     NEK + I V  LK GD+V ++ +  ARH G+ I+E I+EK
Sbjct: 313 IIRTILQNAETIRLV-----NEKGEPISVVDLKPGDKVLIKPEEYARHFGMAIKETIIEK 361

BLAST of CmaCh11G010340 vs. ExPASy Swiss-Prot
Match: A0B6K6 (3-dehydroquinate synthase OS=Methanothrix thermoacetophila (strain DSM 6194 / JCM 14653 / NBRC 101360 / PT) OX=349307 GN=aroB' PE=3 SV=2)

HSP 1 Score: 246.1 bits (627), Expect = 1.4e-63
Identity = 144/360 (40.00%), Postives = 209/360 (58.06%), Query Frame = 0

Query: 479 WSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIALI--------SPLFINEDGV---- 538
           W   + ++T A+E G+   + S  + EL  E  SI +           L I +  V    
Sbjct: 13  WDDVKPLITTALESGFDCVVVSRDHIELVRELGSIRIACFGRERGSEDLLIMDTSVPREN 72

Query: 539 ----FDGEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENIVAAFQGSR 598
                +  GR I   +E+ + +           VD ++V   DW++IP EN++AA QG  
Sbjct: 73  QIKSVEKIGRPIGGYVEIRSKEDELFATELGKHVDYLLVVGTDWKVIPLENMIAALQGYD 132

Query: 599 KTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASNLLSLTKAT 658
             + +  ++  EA++ L  LEHG  GV+L   DP  + +++   +R     + + L  AT
Sbjct: 133 CKIISCVRSSEEAEVALSTLEHGADGVLLDTRDPSEIKRVQAAAERL--GMSRIDLKTAT 192

Query: 659 ITHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASRPFRVNAGP 718
           +  +   GMGDRVCVD CSLMR GEG+LVGS +R  FLV SE  ES Y+A+RPFRVNAG 
Sbjct: 193 VVAVKPVGMGDRVCVDTCSLMRRGEGMLVGSQSRAFFLVQSEAEESPYVAARPFRVNAGA 252

Query: 719 VHAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLIQAKRDSDEQ 778
           VHAY+ V GDKT YLSEL++G EV +VD++G  R+A+VGRVKIE R ++L++A+ D +  
Sbjct: 253 VHAYIRV-GDKTRYLSELKSGDEVTIVDKDGMTRSAVVGRVKIERRPMILVEAEVDGER- 312

Query: 779 TLYSILLQNAETVALVCPGRGNEKKAIPVTSLKVGDEVFLRLQGEARHTGIEIQEFIVEK 823
              S LLQNAET+ LV     ++   I V  LK GD+V + ++  ARH G+ I+E I+E+
Sbjct: 313 --VSTLLQNAETIKLV----SHDGTPISVAELKPGDKVLVHVETSARHFGMSIEETIIER 362

BLAST of CmaCh11G010340 vs. TAIR 10
Match: AT3G28760.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 390 Blast hits to 390 proteins in 131 species: Archae - 144; Bacteria - 105; Metazoa - 0; Fungi - 0; Plants - 54; Viruses - 0; Other Eukaryotes - 87 (source: NCBI BLink). )

HSP 1 Score: 495.4 bits (1274), Expect = 9.1e-140
Identity = 246/370 (66.49%), Postives = 305/370 (82.43%), Query Frame = 0

Query: 455 RLLCSCASSSSSMSPIEASKGVWIWSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIA 514
           R++   ++S+  M+ +  +K VWIW++ ++VMT AVERGW+TFIFS  N++L++EWSSIA
Sbjct: 53  RIVVKMSASTLPMN-LGKAKKVWIWTMCKEVMTVAVERGWNTFIFSSDNRKLSNEWSSIA 112

Query: 515 LISPLFINEDGVFDGEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENI 574
           L+  LFI E  V DG G ++A+V EVS P++L  L   N  ++N+++D  DW+ IPAEN+
Sbjct: 113 LMDTLFIEEKKVIDGTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWKSIPAENL 172

Query: 575 VAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASN 634
           VAA QGS KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+
Sbjct: 173 VAALQGSEKTVFAVSNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFDKRNEESD 232

Query: 635 LLSLTKATITHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASR 694
            LSLT+ATIT + + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SR
Sbjct: 233 TLSLTEATITRVQMVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIESR 292

Query: 695 PFRVNAGPVHAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLIQ 754
           PFRVNAGPVHAYVAVPG KT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R L++++
Sbjct: 293 PFRVNAGPVHAYVAVPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEKRPLIVVE 352

Query: 755 AKRDS-DEQTLYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHT 814
           AK  + +E+T+YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHT
Sbjct: 353 AKLSTKEEETVYSIILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRLQGGARHT 412

Query: 815 GIEIQEFIVE 822
           GIEIQEFIVE
Sbjct: 413 GIEIQEFIVE 421

BLAST of CmaCh11G010340 vs. TAIR 10
Match: AT3G28760.2 (CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPro:IPR002812); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 495.4 bits (1274), Expect = 9.1e-140
Identity = 246/370 (66.49%), Postives = 305/370 (82.43%), Query Frame = 0

Query: 455 RLLCSCASSSSSMSPIEASKGVWIWSVDRQVMTAAVERGWSTFIFSPHNKELADEWSSIA 514
           R++   ++S+  M+ +  +K VWIW++ ++VMT AVERGW+TFIFS  N++L++EWSSIA
Sbjct: 75  RIVVKMSASTLPMN-LGKAKKVWIWTMCKEVMTVAVERGWNTFIFSSDNRKLSNEWSSIA 134

Query: 515 LISPLFINEDGVFDGEGRLIATVIEVSNPQQLEQLQPSNASVDNVIVDLQDWQIIPAENI 574
           L+  LFI E  V DG G ++A+V EVS P++L  L   N  ++N+++D  DW+ IPAEN+
Sbjct: 135 LMDTLFIEEKKVIDGTGNVVASVFEVSTPEELRSLNIENEQIENIVLDFLDWKSIPAENL 194

Query: 575 VAAFQGSRKTVFAVSKTPIEAQIFLEALEHGLGGVILKVEDPEAVFQLKDYFDRRNEASN 634
           VAA QGS KTVFAVS TP EA++FLEALEHGLGG+ILK ED +AV  LK+YFD+RNE S+
Sbjct: 195 VAALQGSEKTVFAVSNTPSEAKLFLEALEHGLGGIILKSEDVKAVLDLKEYFDKRNEESD 254

Query: 635 LLSLTKATITHIHVAGMGDRVCVDLCSLMRPGEGLLVGSYARGLFLVHSECLESNYIASR 694
            LSLT+ATIT + + GMGDRVCVDLCSLMRPGEGLLVGS+ARGLFLVHSECLESNYI SR
Sbjct: 255 TLSLTEATITRVQMVGMGDRVCVDLCSLMRPGEGLLVGSFARGLFLVHSECLESNYIESR 314

Query: 695 PFRVNAGPVHAYVAVPGDKTSYLSELRAGKEVIVVDQEGRQRTAIVGRVKIETRQLVLIQ 754
           PFRVNAGPVHAYVAVPG KT YLSELR G+EVIVVDQ+G+QRTA+VGRVKIE R L++++
Sbjct: 315 PFRVNAGPVHAYVAVPGGKTCYLSELRTGREVIVVDQKGKQRTAVVGRVKIEKRPLIVVE 374

Query: 755 AKRDS-DEQTLYSILLQNAETVALVCPGRGNE--KKAIPVTSLKVGDEVFLRLQGEARHT 814
           AK  + +E+T+YSI+LQNAETVALV P + N   + A+PVTSLK GD+V +RLQG ARHT
Sbjct: 375 AKLSTKEEETVYSIILQNAETVALVTPHQVNSSGRTAVPVTSLKPGDQVLIRLQGGARHT 434

Query: 815 GIEIQEFIVE 822
           GIEIQEFIVE
Sbjct: 435 GIEIQEFIVE 443

BLAST of CmaCh11G010340 vs. TAIR 10
Match: AT5G39890.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 314.3 bits (804), Expect = 2.9e-85
Identity = 158/264 (59.85%), Postives = 198/264 (75.00%), Query Frame = 0

Query: 85  KNNKSRKNRRRSKKP-SSPIQKLYETCKQVFASTETGIVPSLEDIQRLQSVLDKMKAVDV 144
           ++N  +K +RRSKK    P+QKL++TCK+VFA  ++G VPS E+I+ L++VLD++K  DV
Sbjct: 28  RSNSRKKIQRRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDV 87

Query: 145 GLSPEMPYFRTTADEGTPPITYLHLYENNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLF 204
           G++P+M YFR+T    +P +TYLH+Y  ++FS+ IFCLPPSGVIPLHNHP MTVFSKLLF
Sbjct: 88  GVNPKMSYFRSTVTGRSPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLF 147

Query: 205 GTMHIKAYDWAEVGAENGILASVDASNGTAPSS-IRLAKVKVDANFTAPCDSTILYPADG 264
           GTMHIK+YDW                +   PSS  RLAKVKVD++FTAPCD++ILYPADG
Sbjct: 148 GTMHIKSYDW--------------VPDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADG 207

Query: 265 GNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFPFTKFSAVDGKSIPEAERESYAWL 324
           GNMHCFTA TACAVLDV+GPPYSD  GRHC+YY D+PF+ FS VDG  + E E+E YAWL
Sbjct: 208 GNMHCFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFS-VDGVVVAEEEKEGYAWL 267

Query: 325 EEREQ-PEDLAAVGAEYIGPKIVE 346
           +ERE+ PEDL      Y GP I E
Sbjct: 268 KEREEKPEDLTVTALMYSGPTIKE 276

BLAST of CmaCh11G010340 vs. TAIR 10
Match: AT5G15120.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 292.0 bits (746), Expect = 1.5e-78
Identity = 153/280 (54.64%), Postives = 195/280 (69.64%), Query Frame = 0

Query: 80  PKETTKNNKSRKNRR----RSKKPSSP------IQKLYETCKQVFASTETGIVPSLEDIQ 139
           P    K NK++  +     R KK  SP      +++L+ TCK+VF++   G++PS + IQ
Sbjct: 25  PNSVKKKNKNKNKKMMMTWRRKKIDSPADGITAVRRLFNTCKEVFSNGGPGVIPSEDKIQ 84

Query: 140 RLQSVLDKMKAVDVGLSPEMPYFRTTAD---EGTPPITYLHLYENNKFSMGIFCLPPSGV 199
           +L+ +LD MK  DVGL+P MPYFR  +      +PPITYLHL++ ++FS+GIFCLPPSGV
Sbjct: 85  QLREILDDMKPEDVGLTPTMPYFRPNSGVEARSSPPITYLHLHQCDQFSIGIFCLPPSGV 144

Query: 200 IPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGILASVDASNGTAPSSIRLAKVKVDA 259
           IPLHNHPGMTVFSKLLFGTMHIK+YDW            VDA      S  RLAK+KVD+
Sbjct: 145 IPLHNHPGMTVFSKLLFGTMHIKSYDWV-----------VDAP--MRDSKTRLAKLKVDS 204

Query: 260 NFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSYYLDFPFTKFSAV 319
            FTAPC+++ILYP DGGNMH FTA+TACAVLDVLGPPY + +GRHC+Y+L+FP  K S+ 
Sbjct: 205 TFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSSE 264

Query: 320 DGKSI-PEAERESYAWLEER-EQPED-LAAVGAEYIGPKI 344
           D   +  E E+E YAWL+ER + PED    VGA Y GPK+
Sbjct: 265 DDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKV 291

BLAST of CmaCh11G010340 vs. TAIR 10
Match: AT1G18490.1 (Protein of unknown function (DUF1637) )

HSP 1 Score: 198.0 bits (502), Expect = 3.0e-50
Identity = 116/290 (40.00%), Postives = 162/290 (55.86%), Query Frame = 0

Query: 75  QFFELPKETTKNNKSRKN---RRRSKKPSSPIQKLYETCKQVFASTETGIVPSLED--IQ 134
           + F+  ++   N  S+K+     R+++ S  +Q+LY+ CK+ F    TG  PS     IQ
Sbjct: 4   RLFKAGEKVLSNLVSKKDIYMASRNQEKSPKVQELYDLCKETF----TGKAPSPASMAIQ 63

Query: 135 RLQSVLDKMKAVDVGLSPEMPYFRTTADEG---------------TPPITYLHLYENNKF 194
           +L SVLD +   DVGL           D G                 PIT+L ++E + F
Sbjct: 64  KLCSVLDSVSPADVGLEE----VSQDDDRGYGVSGVSRFNRVGRWAQPITFLDIHECDTF 123

Query: 195 SMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEVGAENGILASVDASNGTAP 254
           +M IFC P S VIPLH+HP M VFSK+L+G++H+KAYDW E      I+       G+ P
Sbjct: 124 TMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDWVEPPC---IITQDKGVPGSLP 183

Query: 255 SSIRLAKVKVDANFTAPCDSTILYPADGGNMHCFTAVTACAVLDVLGPPYSDHDGRHCSY 314
           +  RLAK+  D   T   +   LYP  GGN+HCFTA+T CAVLD+L PPY +  GR CSY
Sbjct: 184 A--RLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLDILSPPYKESVGRSCSY 243

Query: 315 YLDFPFTKFSAVDG-KSIPEAERESYAWLEEREQPEDLAAVGAEYIGPKI 344
           Y+D+PF+ F+  +G K + E + + YAWL + + P+DL      Y GP I
Sbjct: 244 YMDYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGSYTGPTI 280

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8LGJ54.1e-8459.85Plant cysteine oxidase 2 OS=Arabidopsis thaliana OX=3702 GN=PCO2 PE=1 SV=1[more]
Q9LXG92.2e-7754.64Plant cysteine oxidase 1 OS=Arabidopsis thaliana OX=3702 GN=PCO1 PE=1 SV=1[more]
O266801.4e-6541.823-dehydroquinate synthase OS=Methanothermobacter thermautotrophicus (strain ATCC... [more]
Q586467.2e-6540.833-dehydroquinate synthase OS=Methanocaldococcus jannaschii (strain ATCC 43067 / ... [more]
A0B6K61.4e-6340.003-dehydroquinate synthase OS=Methanothrix thermoacetophila (strain DSM 6194 / JC... [more]
Match NameE-valueIdentityDescription
AT3G28760.19.1e-14066.49FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
AT3G28760.29.1e-14066.49CONTAINS InterPro DOMAIN/s: 3-dehydroquinate synthase, prokaryotic-type (InterPr... [more]
AT5G39890.12.9e-8559.85Protein of unknown function (DUF1637) [more]
AT5G15120.11.5e-7854.64Protein of unknown function (DUF1637) [more]
AT1G18490.13.0e-5040.00Protein of unknown function (DUF1637) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR0028123-dehydroquinate synthasePFAMPF01959DHQScoord: 475..822
e-value: 2.7E-132
score: 440.9
IPR0028123-dehydroquinate synthasePANTHERPTHR33563FAMILY NOT NAMEDcoord: 447..822
IPR014710RmlC-like jelly roll foldGENE3D2.60.120.10Jelly Rollscoord: 96..310
e-value: 7.5E-11
score: 43.5
IPR012864Cysteine oxygenase/2-aminoethanethiol dioxygenasePFAMPF07847PCO_ADOcoord: 134..343
e-value: 3.1E-70
score: 235.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 80..102
NoneNo IPR availablePANTHERPTHR33563:SF13-DEHYDROQUINATE SYNTHASEcoord: 447..822
NoneNo IPR availableCDDcd20289cupin_ADOcoord: 172..285
e-value: 4.41663E-40
score: 141.149
IPR011051RmlC-like cupin domain superfamilySUPERFAMILY51182RmlC-like cupinscoord: 100..295

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh11G010340.1CmaCh11G010340.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009073 aromatic amino acid family biosynthetic process
biological_process GO:0008652 cellular amino acid biosynthetic process
molecular_function GO:0003856 3-dehydroquinate synthase activity
molecular_function GO:0017172 cysteine dioxygenase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen