Cp4.1LG00g00410 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG00g00410
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Terpene cyclase/mutase family member
Location: Cp4.1LG00 : 758092 .. 768841 (-)
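
The Location field packs a sequence ID, a start, an end, and a strand into one string. The following is a minimal Python sketch of how it might be unpacked and how the gene span could be computed. It assumes the coordinates are 1-based and inclusive, which is common for annotation records but is not stated on this page; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

# Pattern for strings such as "Cp4.1LG00 : 758092 .. 768841 (-)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG00 : 758092 .. 768841 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 10750 bases on the minus strand of Cp4.1LG00.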
The following sequences are available for this feature:

Gene sequence (with intron)

TTACCCTTTTCTTTTCAGGTATTTTCGTTGTATATTATGGGACATCTCCGCATCATATTCTCAGAACATCACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAAAATCAAGATGGTGGATGGGGATTGGACATTGTGGGACAAAGTTGTATGTTTTGTACTGTTTTTAACTATATTCAACTTCGTTTGTTGGGGGAAGAACCCGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCACGGAGGTGCTATTTATACACCTTCCTGGGGAAAGATTTGGCTCTCGGTCAGTACTGTATATATCTCTACCTTATTTAATTTGATTTAAGTTACTAAACTTATAATTTAACTATGTAGATTCTGGGAGTTTACGAGTGGGAGGGAGCAAACCCCATGCCTCCAGAATTTTGGTTATTAGGGAAATTACTTCCTTTCATTCCACGAAGTTTGTTATGCTATTCTAGATTGACACTTCTTCCTATGTCATATTTATTTGGGAAGCGATTTGTTGGACCACTCACCCCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTACAATCACATTAAATGGAGTCCAACTCGTCATTATTGTGCAAAGGTATGCTCCTAAACCATTCAACCCTCTCTACAACAACAATGCACGAGTAAAGAACAAATCAATATTCAACCTTTCAAAAGGTTTGAGATTTGTATGGTATGAGGCTTTGATTACTTTGATTGAATTGGACATGATGATGTAGGAAGACAAGTGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGAATGCTCTTCAATACTTTGGAGAACCCATTCTTAATAGTTGGGCTTTTAAAACAATAAGAAATAGAGCCCTTCAAATAGCTCAACGTCGTATTGATTATGAAGATCATAACAGTCATTACATTACAATTGGATGCATTGAAAAGGTTAATTTTATACTTCTTTTTTCTTTTATTGTTATAATTATAGATAATAATATTATATTAATAAATATAATTTAGCTAATATGGTGGAGGGTTATTGAACATGCAGCCATTGTTTACACTTGTTTGTTGGGTTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATTGGTGAAGATGGAATGAAGATGCAAGTATGTTTTACCCATCATTAAATATTTGACTAATTTATAAATAAATAAATATTATCTTTTGATTTTGTCATACATACAAACATGAGCAGAGCAGAGTTATGGTAGTCAATCATGGGATGTTGCTTTTTCCATTCAAACTGTTCTTGCAACAAATCTTCACCACGAATTTTCAGAGACACTTAAAAAAGGACATGACTTCATTAAACAATCACAGGTACTAAAATTATTTAATTTGAATAAATATTTGTTTATATTATTTTAATTTATATTCATATAAAGGTCAGAGAGAATCCTTCAAGTGATTTTCGAAATATGTATCGTTACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAATTTCTGACTGTACTGCAGAAAACTTACTGGTATTTTATTTTATCCATAATAATATTAAATTTTAAATTAATTTACTGATCTCTTGTATATATAAATGACGTGTCAGTGTTGTTTAATATTTTCGACCATGTCTTCCAACATAGTAGGAGATCCAATGGAACCACAATGGTTTTATGAAGCTGTGAATATCATATTATCCCTTCAAGTAAGGATTTTAATTTGAATTTGCCTTACAATTATTTTTTTCTTGTGAACTTAATGATGAATTGGTGAATTTTGTTGTGAAATAAATAGGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGGTGTACCCTCATGGTTTGAGGTACCTAATGCCTTATTTGAGAGTTATTCTTGATAAATCGTAACAACTAATTAATAGGATTATTTTGTTGTATCTTAAACTCTTCAGCTATTGAATCCAGTG
GAATTCTTAGAATACACGATATTGGAGCTCGAGTAAGTAGACAATCTAATAATATTACACGAAGATACATTGCATTGGACTGATTATTATTATTTGTTTTGTTATGTTGTGATGAACCATGGAATGGGTAAAATTAGATATGTAGAATGCACGTCATCGTCGATACAGGCACTTGTTCTGTTTAGGAAGCTATTTCCGAATCATAGAAAGAAAGAGATAAAAACGTTTTTAAGTAAAGGAGTGAAGTATTTGGAAGAAACTCAGAAAGAGGATGGATCATGGCATGGATATTGGGGAATTTGTTACACTTACGCAACATACTTCGCTATAAAAGGATTAGTGGCAACCGGAAACACTTACAATAATTCCTCAACACTAAGAAGAGGTGTAGAGTTTCTTCTTAAAATTCAATGTCCGGACGGTGGATGGGGAGAGAGTTACATTTCATGTATGCAAAAGAAATACATTCCGCTTCCGGGAAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCAACAGGTTTGTTTTACTCTTTAAATTCAATTAGTATGTAAAGCTTATATATATAATAAAGAAATGACGTATGTATGTAGGAAAAGAGAGATCCAACTCCTCTTCACCGTGCAGCAAAGCTATTAATTAATTCTCAACTGAAGAATGGTGATTACTCTCAGCAGGTTGGTTATCGTTATGTGTGGTTTGGTTAAACTATAACTTTTGTGATATTAAAAAAGAGAGTTATTTGTATGACAGAAGATATCTGGAGCATTTATGAACACATGCACGCTACACTATGGACTATATAGGAACGTATTCCCATTGTGGGCACTTGCAGAGTATTATAATAAGGTTTCATTGCCATAGTCTCTAAGCTTTTGTTTCATTCCATCTCCATATATATATATATATATATATATATATATAAAAAAGACATTTCAACTTTTGCTTCTCTTCTTCTCATCGTTTCCCATAGTGATTTGAATTATATTCGCATACTACAATTTAAATTAATATTTAAACATCAAAATGAATATTCATTTGATTATGATTTGAATTAATAATTTCAATCGGTCATATTATCTATCATAAAGAGTTACAAAAACAATTACGAATGTATTTTTTTTTTCTTTTTTTTTTTAAGAACTAATTCTTCTATCTCAACCTACCAACCCTTTCCCACATTCATTCAGTGGCCCAACTGACTCAATCTGATTCATTTAACGTCAAATCCCAATCGTCATAATAAATTTCCTCCCAAAATGTCAAAGTTGATTAGTTGCCTTTGCTAAACGAACAATACCCTAGGGCCATATAAGTGCATACTCATGCTCTAAAGTTATGACACTTTAAGTTTGTTAGTACTATGATACGTACTTTAAGTGCTCATGACCTTTCCACACGTAGTGTGTTATGAAGCCCTGATTTCTCTTCTAGGTGAAATGTCCTATGTCACAATCTACAATGTAAGTCATCGAAGATAATAACTATTTTATGTGGGGAGAGTGTCACACCATCATTAAGAAAATAGTAGGTTGTGACCCTCACGTACATACTAATACTAACAAGGTAGGGTCTTGCCCTCTCAAGTGCAACCCTTCCACCCACATGAGGGCTGTAACAACATAGAAAAAAATTGTGTGGTCGCCGAAATTCTCGCACCCACCTCGGTCTCACACCCACCCGATGCTCCTGCTTCGCTAAAGCAATCCCCCGAACATCAGAACGACATTTCAGAGGAGAGACAAGGTCGCCAGAGAAAAATTGCTTGGTGAGCAAGACGACGTCTCGGGTCCAATACGCTACCTGCAGCCCAACTTACAACAAACCGAAAAATCGACCCACTTTACCTTCAACACAAACCATCTGCAGCCAGCCCGTTAGTAGGCCTGTAGCCCGTGATCACCATACTCCCGAGGCCCAAACTGCAGCCAGACCAAACACCAGCAGCCACAACCCGCGATCCGCGCCTCGACCTGAAA
ACGACCTGCACACGCAGCTGCTCCCTACTACACCTCGTGTCACCATCTGCAGCGTGCACCCCTTCAGCTCAATAAACTTAAACGGCTCGACTCGACTCCACGACTCAGTTTCTTGGATATTCGGCTCGATATTTACTTTTCAGCTTGATACCTAATATGTTTACCTTCAAGTGATTGATTGTGTGTTCTCCCTATGCAGCGTGTTTATCTGCACAGAGCTAGAATTAGAATACTAGAGCCCAGGCTGTCTAGACGATGACCTAAGTTAGGACACCTCAATAGAGTGAGAACTTTAGGATTGTGAGCAGCTATTAGAGGGTCTAGACCCTAGTGCTCCGAACTAAGAAAGACTCAGGAGTCATAAAAAAGTACACTAGTGTTCCGTGTTAAACCACCTTAGCTGCTTTATGAACCTAGTAAGCCTCTGTGGCAATTTATGATGACCAAGTTAGTGTTTAGATTGTGTTTTCCATGCTTTAAGTGAGATAGGAATATCTCCCAACCAAAGCATGGATGAGTGGACCAAGTTAAGATTAGGGCATTAGAGCTAGAGTGGTTACACATTAACTTAGGTCAATACGTTTTCAGTAGACCGAGTACTCTAGAATAGAAAGTAGCGATTAGAGGTTTAGACCTCGGTGCATCCAAAGATCAAATTAGCCCAAAGAAGTCAATCCCCAACCTCCTGAAAATGCGAGATGAGACTTAAACCGCCTAGATGCCCCTAAAGACCTAATAAATCTCAATGGCAAGGACGATTAAGAAAGCCTACAAGGCCTAGAAGTAGTGTGACGATGTCATCCAAGTCTTTGCCTCCTACCTCCTCTAGTATACCTTTAGCACCAACCGTGTGGAAAAGTTGTCAAACAAGCTAGTTATCATATTATCCTAACATATTTTATGAGATAATGTCTCAACTCAGGATATGGGTCATTCCACCGCTATTGCTACGCGAGCTAACTAAGTTTAAGAACAAGAAAGTAATTCATCCTAAATATGAGATTTGGCACAGTAGATTAACGGCACCAGAAATGAGTAAACCTCCTCCAAACTTAAAGTAGGTTTGTGAAAAGCATAAGATCTGCTCTGAACTTCGAGGTCTCCTTAGAAACTCAAATAGCGGTAGCACTAGACCTGAACTTCCTAGAGCCCATAAAAAGTCTATGGTAACTTACTAGGATAAGAATGCATCGAGTCTATAAATAGGTTGCTTATTCAGTGTTTTTTATACTCACAATTTTTCCTTTTTTTTTTTATCCATAGGCTACTAGTCAAGGTGGGGGAGCATTTGAATAAAGGGTGTCATCCTAGAGTGTCAGTCTTGTCTTTGTATTTTTGGATCTTTTGTATTTTATTTTTAATTATATCCCAACCTCATTCCCGAATTGTATTAAACACGAACAACACCGACTTGTAGTTGCATTTATTTTTTTAGTTTAACTATGGTGTTGCTTGTTTTAATTTGATTTAAATTTTACTATATTTTTATCTTTTATTTTATTTGCTTTTATTTTTGTCTTTTTTTTTGTCACGGTCTAAGATTTGAAGCTTATCTTTTATTTTATATGCTTTTGTCTCATGTCACTATTGCATGCATGACATACCTAATATACGTAGAAAATCCTATAAACAGTTTACTAAAACGTTTCTATTGATACAGTTCATAAAGTAAAATCATTTAAACTAACAATAGAGTCAACAAAATAGCAAAATTTAAATAATAAGAATTTAAAAGAAAAGCAACGCCCTCATTTAACCTTAGGGCTACACTATATAATCAAAATGCGATTAAGCGGACTTTATTTTCATGGTCTCCGTAGGGACATCATCATCTGTATATGAGTACCTTACCTTCACCTAAAAAAAGTAGTCATGACTTAAGTATGTAAAAATACTCAGTAAGTAGTCATACTATTGGTGTCGTAAACTGCACTTACATGTAATCATGATGGGTCCTATCTTTCCAAACAACTTTTTCTTGGTCTCAGGTAAGGGTTGAA
TGTGTATTACTTCTCTCCACATTCTACACACGCGCACATGAATCAATTAGGCAGCCTCAGTACATACCCCAAAAGTCACAAAAGAAAAGGAGCGCATATAATATCATGGTATATTCGTCCATACTAACGAGAAAGTATCCCGATCCAAATTATTTTTCGAAAGTAATGAAAGCATGGAATAGTATGTGGACTAATAATTGAGGGCCATAGTCTAGTACCGTCCATTCAATTAAAATGATGGATGTGTTGGGCATTGAATTTATTAGGTGGTGGGTGGTTGGTATAATTAAACCTCCCACTTGAATTTTTTTTTAAAAAATAATAATAATAAAGTGGACTTAATTGTTTGATACAAAAGGAACAACCTAGAATATTAGAAATGTTGTAAGGTGAATGTAGGTAGTAAATGATATGAGATTACAAATTTTATTACACATTCCACATAAAATTATGATGTTTATATAAATATTAGAGAATAAGTTAATATTATAAACATATTACCCTTTTAAATAAGTTTAAATTATTCATCGGATTTTTCTTTTTGGAAGTATCGGACTTTTTTCACTTGATTACGTTGACAAGACCGATGAACAATAAAGTAATACCCTCGTCTATATATACAAAAAAGACATTTGTGTGCAAAAGGCACAATTGAGAGAGTAAAAATGTGGAGACTTAAGTTGGGAGAAGGAGGACACGACCCGTACTTTTTCAGCTCTAACAATTTCGTGGGACGACAAACGTGGGACTTCGAGCCTGATGATGGCACTCCAGAAGAACGAGCTGAAGTGGAAGAAGCCCGCCACAATTATTATCAAAATCGCTTCAAAGTCCAATGCAGCAGTGATCTGTTTTGGAAATTTCAGGTTTGCTCCAATTAATTAGGATATAAATTTTAAGTTTTTAACTTTTTCCTAAATCTAAATTTGTTTTGTGTTTAGTTTTTTCTTTTTCTGATATAAAAAATAGATAGATAAATTTGAAAGATATTAAACATAGTAAGAATTTAAAATGATTTTAATATTATATCTAAAAAAATATGGATGAAAGTTTAAAAATAGTATTAATACCCTCCAATCATCTTTAAATTTTTAGTATTTTTAAAAAGATGTTCTTAAAGTTTTAAAAAATGTTTAAAGGTGATTTTAAAAGTTATTCCATCTAAAATTTAATTAAAAATATTATTTTGAGATTTTTTTTTAGATAAATTGTAAAGATACTTTTATTAATTTAACCCAAATTTAAAACTTTCAGTAAATTTAGTTAACTTCATCATTAAATGTAAAATTTGGAAGGAACTAGACACATATTTATTTGGTCTACTTTATCAATAAATGCGTATTTAAATAAGTGATTGAATACATCCTCCATGCAGTTTCTAAGAGAGCGAAATTTCAAGCAAACAATTCAAATAGTGAGAGTGGAGGACGTAATAGATAAGGAAACAGCGAGCATTGCACTGAGAAGAGCCACCAAGTTCTTTGCAGCCTTACAAAGCCCTCATGGCCATTGGCCTGCTGAAAACGCTGGCCCTATGTTTTATTTCCCTCCATTGGTACTCTTCCTCTTTCTATTCATTTTATCTTATATAATTCTACCACAATATTACCCTTTTCTTTCAGGTATTTTCGTTGTATATTACCGGACATCTCCACATCGTATTCTCAGAAGAACACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAGAATGAAGATGGTGGATGGGGATTGCACATTGTGGGACAAAGTTGTATGCTTTGCACTGTTTTTAACTATATTCAACTCCGTTTGTTAGGGGAAGAACTTGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCATGGAGGTGCTATTTATATACCCTCCTGGGAAAGATTTGGCTCTCGGTTAGTACTATATATATATATATATACGTCTATACTATTTTATTTGAAATGGCGTACCTCAAATTTAACGAACTAAACTTATGATATAACTTTGTAGATT
CTGGGAGTTTATGAGTGGGAGGGAGTGAATCCTATGCCTCCAGAATTTTGGATGTTCGGAAAATTACTTCCTTTTATTCCTAGAAATTTGTTTTGCCATTCTAGATTGACACTTCTTCCTATGACATATTTATTTGGGAAGAGGTTCGTTGGACCCCTCACACCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTATAACCACATTAAATGGAGTTCAACGCGCCATTATTGTGCAAAGGTATGCTCCTAAACCATTCAACCCTCCCCATAACAATACATGAGTAAAGAAATTATAGAATTTCTACCTACAAATCAATATTCATCCTTTAAAAGGTATAAAAATTGATAATTTCCATTTCATGTTAGGTATGAGACTTTGATTACTTTGTTTGAATTGGATATGATGATGCAGGAAGACAAATGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGGATGCTCTTCAATACTGTGGAGAACCCATCCTTAATAGTTGGACCTTTAAAACGATAAGAAATAGAGCCCTTCAAATAGCTAAATGTTATATTGATTATGAAGATCATCATAGTCATTACATTACAATTGGATGCGTTGAAAAGGTTAATTTCATACTTGTTTTTTCTTTTATAGTTAATAACATTATAGGAATAAATGTAATTGAGCTAATATGGTGGAGGGGTATTGAACATGCAGCCATTGTTTACACTTGCTTCTTGGATTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATGGTGAAGATGGAATGAAGATGCAAGTATGTTTTACCAATTATTAAATATTTTACTCATTTATAAATAAATAAATATTATCTTTCATCTTTACAATATATACACACACGAGCAGAGTTATGGTAGTCAATCATGGGATGTTGCTTTTGCCATTCAAGCAATGCTTGCCACAAATCTCCACCATGAAAATTCCGAGACACTTAAAAAAGGGCATGATTTCATTAAACAATCCCAGGTACTAAAATTTATTAATTTGAATAAATATTTGTTATATTATTTTAATGTATATTCATATAAAGGTCAGAGAGAATCCTTCCGGGGATTTTCGAAGTATGTATCGTCACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAGTTTCTGATTGTACCGCAGAAAATTTACTGGTATTTTATTTTGTTTTCCTAGTTTCAAAAAAAAAAAATTCAATTACTAATTTCTCATATATATATATATATATATATATATATATAAATGACGTGTCAGTGTTGTTTGAGATTTTCGACGATGCCTTCCAACATCGTAGGAGATCCAATGGAACCACAATGGTTTTTCGAAGCGGTCAATTTCATACTATCCCTCCAAGTAAGGATTGTAATTCGAATTTGCCTTATAATTATCTTTTTCTTGTTAACTTAACAATGAATTGGTGAATTTTGTTGTGAAATAAATAGGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGCTGTACCATCGTGGTTTGAGGTAGCTTATTCCATATATAACTATTTAGTGAAGCAATACTTAAATACATAATAGATCATAACTAATTAATATAATTATTTTTCAGCTATTGAATCCAGTGGAATTCTTAGAATACACAGTATTGGAGCGGGAGTAAGTAGACAATTTAATAATATTACTTAAAAAAATATTACATACGAATAAATGATTATTATTTATTTTGTTTTGGTAAAATTAGATATGTAGAATGCACATCATCGTCCATACAAGCACTTGTTCTGTTTACGAAATTATTTCCAAATCACAGAAAGAAAGAGATAGAAACATTTTTAAGTAAAGCAATAAAGTATTTGGAAGAAACGCAGAAAGAGGATGGATCTTGGTTTGGTAATTGGGGAGTTTGTCACATTTATGCAACATACTTCGCTATAAAAGGACTGGTGGCGACTGGAAACACA
TACCATAACTCCTCGACCATACGAAAAGGTGTGGAGTTTCTTCTTAAAATTCAATGTCCAGACGGTGGATGGGGAGAGAGTCACGTTTCATGTATGCAAAAGAAATACATTCCTCTTCCTAGGAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCACCAGGTTTGTTATACTCTTTTACTTCAATTATTATAAGTACGTAAAGCTTTTATATATATATATATATATATATCATAAAGAAATGGCGTATGTATGTAGGAAAAGAGAGACCCAACTCCTCTCCACCATGGGGCAAAGCTGTTGATTAATTCTCAACTGGAGAATGGTGATTACCCTCAACAGGTAGGTTATTGTAATGTGTGTGTTTTAGTTAAACTATAACTTTTGTGATATTAAAAAGAGAGTTATTTGTATGGCAGGAGATAAGTGGAGTATTTATGAACACGTGCATGCTACACTATGGACTATATAGGAACGTATTCCCTATATAGGAACGTATTCCCTTTGTGGGCACTTGCAGAGTATTGTAATATGGTTTCATTGTCATAGTCCCGACCCAAATTTATTTAACATGCTTTCCGCCTTTGTTTCATTCCATCTCCATATATTTATATATTCTTTTAAATTGGACATACACGTTTCAACTTTTGTTTCTAATCGTTTACAATAGTGATCTATTGATATTCACATACAGTAATTTTAAATAATATTTAAACATCATATAATTTC

mRNA sequence

TTACCCTTTTCTTTTCAGGTATTTTCGTTGTATATTATGGGACATCTCCGCATCATATTCTCAGAACATCACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAAAATCAAGATGGTGGATGGGGATTGGACATTGTGGGACAAAGTTGTATGTTTTGTACTGTTTTTAACTATATTCAACTTCGTTTGTTGGGGGAAGAACCCGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCACGGAGGTGCTATTTATACACCTTCCTGGGGAAAGATTTGGCTCTCGATTCTGGGAGTTTACGAGTGGGAGGGAGCAAACCCCATGCCTCCAGAATTTTGGTTATTAGGGAAATTACTTCCTTTCATTCCACGAAGTTTGTTATGCTATTCTAGATTGACACTTCTTCCTATGTCATATTTATTTGGGAAGCGATTTGTTGGACCACTCACCCCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTACAATCACATTAAATGGAGTCCAACTCGTCATTATTGTGCAAAGGAAGACAAGTGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGAATGCTCTTCAATACTTTGGAGAACCCATTCTTAATAGTTGGGCTTTTAAAACAATAAGAAATAGAGCCCTTCAAATAGCTCAACGTCGTATTGATTATGAAGATCATAACAGTCATTACATTACAATTGGATGCATTGAAAAGCCATTGTTTACACTTGTTTGTTGGGTTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATTGGTGAAGATGGAATGAAGATGCAAAGTTATGGTAGTCAATCATGGGATGTTGCTTTTTCCATTCAAACTGTTCTTGCAACAAATCTTCACCACGAATTTTCAGAGACACTTAAAAAAGGACATGACTTCATTAAACAATCACAGGTCAGAGAGAATCCTTCAAGTGATTTTCGAAATATGTATCGTTACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAATTTCTGACTGTACTGCAGAAAACTTACTGTGTTGTTTAATATTTTCGACCATGTCTTCCAACATAGTAGGAGATCCAATGGAACCACAATGGTTTTATGAAGCTGTGAATATCATATTATCCCTTCAAGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGGTGTACCCTCATGGTTTGAGCTATTGAATCCAGTGGAATTCTTAGAATACACGATATTGGAGCTCGAATATGTAGAATGCACGTCATCGTCGATACAGGCACTTGTTCTGTTTAGGAAGCTATTTCCGAATCATAGAAAGAAAGAGATAAAAACGTTTTTAAGTAAAGGAGTGAAGTATTTGGAAGAAACTCAGAAAGAGGATGGATCATGGCATGGATATTGGGGAATTTGTTACACTTACGCAACATACTTCGCTATAAAAGGATTAGTGGCAACCGGAAACACTTACAATAATTCCTCAACACTAAGAAGAGGTGTAGAGTTTCTTCTTAAAATTCAATGTCCGGACGGTGGATGGGGAGAGAGTTACATTTCATGTATGCAAAAGAAATACATTCCGCTTCCGGGAAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCAACAGGAAAAGAGAGATCCAACTCCTCTTCACCGTGCAGCAAAGCTATTAATTAATTCTCAACTGAAGAATGGTGATTACTCTCAGCAGAAGATATCTGGAGCATTTATGAACACATGCACGCTACACTATGGACTATATAGGAACGTATTCCCATTGTGGGCACTTGCAGAGTATTATAATAAGTTGGGAGAAGGAGGACACGACCCGTACTTTTTCAGCTCTAACAATTTCGTGGGACGACAAACGTGGGACTTCGAGCCTGATGATGGCACTCCAGAAGAACGAGCTGAAGTGGAAGA
AGCCCGCCACAATTATTATCAAAATCGCTTCAAAGTCCAATGCAGCAGTGATCTGTTTTGGAAATTTCAGTTTCTAAGAGAGCGAAATTTCAAGCAAACAATTCAAATAGTGAGAGTGGAGGACGTAATAGATAAGGAAACAGCGAGCATTGCACTGAGAAGAGCCACCAAGTTCTTTGCAGCCTTACAAAGCCCTCATGGCCATTGGCCTGCTGAAAACGCTGGCCCTATGTTTTATTTCCCTCCATTGGTATTTTCGTTGTATATTACCGGACATCTCCACATCGTATTCTCAGAAGAACACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAGAATGAAGATGGTGGATGGGGATTGCACATTGTGGGACAAAGTTGTATGCTTTGCACTGTTTTTAACTATATTCAACTCCGTTTGTTAGGGGAAGAACTTGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCATGGAGGTGCTATTTATATACCCTCCTGGGAAAGATTTGGCTCTCGATTGACACTTCTTCCTATGACATATTTATTTGGGAAGAGGTTCGTTGGACCCCTCACACCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTATAACCACATTAAATGGAGTTCAACGCGCCATTATTGTGCAAAGGAAGACAAATGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGGATGCTCTTCAATACTGTGGAGAACCCATCCTTAATAGTTGGACCTTTAAAACGATAAGAAATAGAGCCCTTCAAATAGCTAAATGTTATATTGATTATGAAGATCATCATAGTCATTACATTACAATTGGATGCGTTGAAAAGCCATTGTTTACACTTGCTTCTTGGATTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATGAGTTATGGTAGTCAATCATGGGATGTTGCTTTTGCCATTCAAGCAATGCTTGCCACAAATCTCCACCATGAAAATTCCGAGACACTTAAAAAAGGGCATGATTTCATTAAACAATCCCAGGTCAGAGAGAATCCTTCCGGGGATTTTCGAAGTATGTATCGTCACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAGTTTCTGATTGTACCGCAGAAAATTTACTGTGTTGTTTGAGATTTTCGACGATGCCTTCCAACATCGTAGGAGATCCAATGGAACCACAATGGTTTTTCGAAGCGGTCAATTTCATACTATCCCTCCAAGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGCTGTACCATCGTGGTTTGAGCTATTGAATCCAGTGGAATTCTTAGAATACACAGTATTGGAGCGGGAATATGTAGAATGCACATCATCGTCCATACAAGCACTTGTTCTGTTTACGAAATTATTTCCAAATCACAGAAAGAAAGAGATAGAAACATTTTTAAGTAAAGCAATAAAGTATTTGGAAGAAACGCAGAAAGAGGATGGATCTTGGTTTGGTAATTGGGGAGTTTGTCACATTTATGCAACATACTTCGCTATAAAAGGACTGGTGGCGACTGGAAACACATACCATAACTCCTCGACCATACGAAAAGGTGTGGAGTTTCTTCTTAAAATTCAATGTCCAGACGGTGGATGGGGAGAGAGTCACGTTTCATGTATGCAAAAGAAATACATTCCTCTTCCTAGGAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCACCAGGAAAAGAGAGACCCAACTCCTCTCCACCATGGGGCAAAGCTGTTGATTAATTCTCAACTGGAGAATGGTGATTACCCTCAACAGGAGATAAGTGGAGTATTTATGAACACGTGCATGCTACACTATGGACTATATAGGAACGTATTCCCTATATAGGAACGTATTCCCTTTGTGGGCACTTGCAGAGTATTGTAAT
ATGGTTTCATTGTCATAGTCCCGACCCAAATTTATTTAACATGCTTTCCGCCTTTGTTTCATTCCATCTCCATATATTTATATATTCTTTTAAATTGGACATACACGTTTCAACTTTTGTTTCTAATCGTTTACAATAGTGATCTATTGATATTCACATACAGTAATTTTAAATAATATTTAAACATCATATAATTTC

Coding sequence (CDS)

TTACCCTTTTCTTTTCAGGTATTTTCGTTGTATATTATGGGACATCTCCGCATCATATTCTCAGAACATCACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAAAATCAAGATGGTGGATGGGGATTGGACATTGTGGGACAAAGTTGTATGTTTTGTACTGTTTTTAACTATATTCAACTTCGTTTGTTGGGGGAAGAACCCGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCACGGAGGTGCTATTTATACACCTTCCTGGGGAAAGATTTGGCTCTCGATTCTGGGAGTTTACGAGTGGGAGGGAGCAAACCCCATGCCTCCAGAATTTTGGTTATTAGGGAAATTACTTCCTTTCATTCCACGAAGTTTGTTATGCTATTCTAGATTGACACTTCTTCCTATGTCATATTTATTTGGGAAGCGATTTGTTGGACCACTCACCCCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTACAATCACATTAAATGGAGTCCAACTCGTCATTATTGTGCAAAGGAAGACAAGTGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGAATGCTCTTCAATACTTTGGAGAACCCATTCTTAATAGTTGGGCTTTTAAAACAATAAGAAATAGAGCCCTTCAAATAGCTCAACGTCGTATTGATTATGAAGATCATAACAGTCATTACATTACAATTGGATGCATTGAAAAGCCATTGTTTACACTTGTTTGTTGGGTTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATTGGTGAAGATGGAATGAAGATGCAAAGTTATGGTAGTCAATCATGGGATGTTGCTTTTTCCATTCAAACTGTTCTTGCAACAAATCTTCACCACGAATTTTCAGAGACACTTAAAAAAGGACATGACTTCATTAAACAATCACAGGTCAGAGAGAATCCTTCAAGTGATTTTCGAAATATGTATCGTTACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAATTTCTGACTGTACTGCAGAAAACTTACTGTGTTGTTTAATATTTTCGACCATGTCTTCCAACATAGTAGGAGATCCAATGGAACCACAATGGTTTTATGAAGCTGTGAATATCATATTATCCCTTCAAGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGGTGTACCCTCATGGTTTGAGCTATTGAATCCAGTGGAATTCTTAGAATACACGATATTGGAGCTCGAATATGTAGAATGCACGTCATCGTCGATACAGGCACTTGTTCTGTTTAGGAAGCTATTTCCGAATCATAGAAAGAAAGAGATAAAAACGTTTTTAAGTAAAGGAGTGAAGTATTTGGAAGAAACTCAGAAAGAGGATGGATCATGGCATGGATATTGGGGAATTTGTTACACTTACGCAACATACTTCGCTATAAAAGGATTAGTGGCAACCGGAAACACTTACAATAATTCCTCAACACTAAGAAGAGGTGTAGAGTTTCTTCTTAAAATTCAATGTCCGGACGGTGGATGGGGAGAGAGTTACATTTCATGTATGCAAAAGAAATACATTCCGCTTCCGGGAAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCAACAGGAAAAGAGAGATCCAACTCCTCTTCACCGTGCAGCAAAGCTATTAATTAATTCTCAACTGAAGAATGGTGATTACTCTCAGCAGAAGATATCTGGAGCATTTATGAACACATGCACGCTACACTATGGACTATATAGGAACGTATTCCCATTGTGGGCACTTGCAGAGTATTATAATAAGTTGGGAGAAGGAGGACACGACCCGTACTTTTTCAGCTCTAACAATTTCGTGGGACGACAAACGTGGGACTTCGAGCCTGATGATGGCACTCCAGAAGAACGAGCTGAAGTGGAAGA
AGCCCGCCACAATTATTATCAAAATCGCTTCAAAGTCCAATGCAGCAGTGATCTGTTTTGGAAATTTCAGTTTCTAAGAGAGCGAAATTTCAAGCAAACAATTCAAATAGTGAGAGTGGAGGACGTAATAGATAAGGAAACAGCGAGCATTGCACTGAGAAGAGCCACCAAGTTCTTTGCAGCCTTACAAAGCCCTCATGGCCATTGGCCTGCTGAAAACGCTGGCCCTATGTTTTATTTCCCTCCATTGGTATTTTCGTTGTATATTACCGGACATCTCCACATCGTATTCTCAGAAGAACACCGAAAGGAAATCCTTCGCTATGCATATTGTCATCAGAATGAAGATGGTGGATGGGGATTGCACATTGTGGGACAAAGTTGTATGCTTTGCACTGTTTTTAACTATATTCAACTCCGTTTGTTAGGGGAAGAACTTGACAAGGATGAGTGTTTTAGAGCTCGAAAATGGATTCTAGATCATGGAGGTGCTATTTATATACCCTCCTGGGAAAGATTTGGCTCTCGATTGACACTTCTTCCTATGACATATTTATTTGGGAAGAGGTTCGTTGGACCCCTCACACCTCTCATTCTTCAATTACGTCAAGAAATCTATACTCAACCTTATAACCACATTAAATGGAGTTCAACGCGCCATTATTGTGCAAAGGAAGACAAATGCTTTGAACGTTCTTTATTTCAGAAGCTTGCATGGGATGCTCTTCAATACTGTGGAGAACCCATCCTTAATAGTTGGACCTTTAAAACGATAAGAAATAGAGCCCTTCAAATAGCTAAATGTTATATTGATTATGAAGATCATCATAGTCATTACATTACAATTGGATGCGTTGAAAAGCCATTGTTTACACTTGCTTCTTGGATTGATGATCCTCATGGGGAAGCTTATAAGAAACATGTTGCTAGAATCAAAGATTACTTATGGATGAGTTATGGTAGTCAATCATGGGATGTTGCTTTTGCCATTCAAGCAATGCTTGCCACAAATCTCCACCATGAAAATTCCGAGACACTTAAAAAAGGGCATGATTTCATTAAACAATCCCAGGTCAGAGAGAATCCTTCCGGGGATTTTCGAAGTATGTATCGTCACATATCAAAAGGAAGTTGGACATTCTCTGATCGAGATCATGGATGGCAAGTTTCTGATTGTACCGCAGAAAATTTACTGTGTTGTTTGAGATTTTCGACGATGCCTTCCAACATCGTAGGAGATCCAATGGAACCACAATGGTTTTTCGAAGCGGTCAATTTCATACTATCCCTCCAAGCAAAAAATGGTGGAGTCTCAGCTTGGGAGCCTACCGGAGCTGTACCATCGTGGTTTGAGCTATTGAATCCAGTGGAATTCTTAGAATACACAGTATTGGAGCGGGAATATGTAGAATGCACATCATCGTCCATACAAGCACTTGTTCTGTTTACGAAATTATTTCCAAATCACAGAAAGAAAGAGATAGAAACATTTTTAAGTAAAGCAATAAAGTATTTGGAAGAAACGCAGAAAGAGGATGGATCTTGGTTTGGTAATTGGGGAGTTTGTCACATTTATGCAACATACTTCGCTATAAAAGGACTGGTGGCGACTGGAAACACATACCATAACTCCTCGACCATACGAAAAGGTGTGGAGTTTCTTCTTAAAATTCAATGTCCAGACGGTGGATGGGGAGAGAGTCACGTTTCATGTATGCAAAAGAAATACATTCCTCTTCCTAGGAATTCTTCCAATCTTGTTCAAACTTCCTTTGCTTTAATGGCTTTGATCCATTCTCACCAGGAAAAGAGAGACCCAACTCCTCTCCACCATGGGGCAAAGCTGTTGATTAATTCTCAACTGGAGAATGGTGATTACCCTCAACAGGAGATAAGTGGAGTATTTATGAACACGTGCATGCTACACTATGGACTATATAGGAACGTATTCCCTATATAG

Protein sequence

LPFSFQVFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLGEEPDKDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLLPFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAKEDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGCIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLATNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAENLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPVEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWHGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKYIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFMNTCTLHYGLYRNVFPLWALAEYYNKLGEGGHDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFKQTIQIVRVEDVIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEELDKDECFRARKWILDHGGAIYIPSWERFGSRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSSTRHYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHHSHYITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWMSYGSQSWDVAFAIQAMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQVSDCTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPSWFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEETQKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESHVSCMQKKYIPLPRNSSNLVQTSFALMALIHSHQEKRDPTPLHHGAKLLINSQLENGDYPQQEISGVFMNTCMLHYGLYRNVFPI
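
The protein sequence appears to be the frame-1 translation of the coding sequence: the first codons TTA CCC TTT TCT TTT CAG read L P F S F Q. The sketch below shows how that relationship could be checked in Python. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly, and `translate` is a hypothetical helper. Because the CDS here does not begin with ATG, the sketch translates from the first base without searching for a start codon.

```python
# Standard genetic code (NCBI table 1), with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA string in frame 1; '*' marks stop codons.

    Whitespace is removed first, so a sequence wrapped over several
    lines can be passed in as-is. Codons containing characters other
    than A/C/G/T become 'X'; a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 18 bases of the coding sequence shown above.
print(translate("TTACCCTTTTCTTTTCAG"))  # LPFSFQ
```

Joining the wrapped CDS lines and passing them to `translate` would allow the full comparison against the protein sequence.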
BLAST of Cp4.1LG00g00410 vs. Swiss-Prot
Match: BAMS1_PANGI (Beta-Amyrin Synthase 1 OS=Panax ginseng GN=OSCPNY1 PE=1 SV=1)

HSP 1 Score: 875.9 bits (2262), Expect = 5.4e-253
Identity = 391/626 (62.46%), Positives = 476/626 (76.04%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           V  +YI GHL  +F   HRKEILRY YCHQN+DGGWGL I G S MFCT  +YI +R+LG
Sbjct: 132 VMCVYITGHLDTVFPAEHRKEILRYIYCHQNEDGGWGLHIEGHSTMFCTTLSYICMRILG 191

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E PD    + C R RKWILDHG     PSWGK WLSILGVYEW G+NPMPPEFW+L   L
Sbjct: 192 EGPDGGVNNACARGRKWILDHGSVTAIPSWGKTWLSILGVYEWIGSNPMPPEFWILPSFL 251

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY R+  +PMSYL+GKRFVGP+TPLILQLR+E+Y QPYN I W  TR  CAK
Sbjct: 252 PMHPAKMWCYCRMVYMPMSYLYGKRFVGPITPLILQLREELYGQPYNEINWRKTRRVCAK 311

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
           ED  +   L Q L W++L    EP+L  W F  +R +ALQ   + I YED NS YITIGC
Sbjct: 312 EDIYYPHPLIQDLLWDSLYVLTEPLLTRWPFNKLREKALQTTMKHIHYEDENSRYITIGC 371

Query: 247 IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
           +EK L  LVCWV+DP+G+ ++KH+ARI DY+W+ EDGMKMQS+GSQ WD  FSIQ +L +
Sbjct: 372 VEKVLCMLVCWVEDPNGDYFRKHLARIPDYIWVAEDGMKMQSFGSQEWDTGFSIQALLDS 431

Query: 307 NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
           +L HE   TL KGHDFIK+SQV++NPS DF++MYR+ISKGSWTFSD+DHGWQ+SDCTAE 
Sbjct: 432 DLTHEIGPTLMKGHDFIKKSQVKDNPSGDFKSMYRHISKGSWTFSDQDHGWQVSDCTAEG 491

Query: 367 LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
           L CCLIFSTM   IVG  ++P+  Y++VN++LSLQ KNGG+SAWEP  G   W ELLNP 
Sbjct: 492 LKCCLIFSTMPEEIVGKKIKPERLYDSVNVLLSLQRKNGGLSAWEP-AGAQEWLELLNPT 551

Query: 427 EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
           EF    ++E EYVECTSS+IQALVLF+KL+P HRKKEI  F++  V+YLE+TQ  DGSW+
Sbjct: 552 EFFADIVIEHEYVECTSSAIQALVLFKKLYPGHRKKEIDNFITNAVRYLEDTQMPDGSWY 611

Query: 487 GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
           G WG+C+TY ++FA+ GL A G TY N + +R+ VEFLLK Q  DGGWGESY+SC +K Y
Sbjct: 612 GNWGVCFTYGSWFALGGLAAAGKTYYNCAAVRKAVEFLLKSQMDDGGWGESYLSCPKKVY 671

Query: 547 IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
           +PL GN SNLV T +ALM LIHS+Q +RDPTPLHRAAKLLINSQ+++GD+ QQ+ISG FM
Sbjct: 672 VPLEGNRSNLVHTGWALMGLIHSEQAERDPTPLHRAAKLLINSQMEDGDFPQQEISGVFM 731

Query: 607 NTCTLHYGLYRNVFPLWALAEYYNKL 630
             C LHY  YRN++PLWALAEY  ++
Sbjct: 732 KNCMLHYAAYRNIYPLWALAEYRRRV 756
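
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a small sanity check, the following Python sketch recomputes them from a header line. The regular expression and the helper name `hsp_percentages` are illustrative assumptions tailored to the header layout shown on this page; this is not a general BLAST parser.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some reports.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, id_len, positive, pos_len = map(int, m.groups())
    return (
        round(100.0 * identical / id_len, 2),
        round(100.0 * positive / pos_len, 2),
    )


header = "Identity = 391/626 (62.46%), Postives = 476/626 (76.04%), Query Frame = 1"
print(hsp_percentages(header))  # (62.46, 76.04)
```

For the BAMS1_PANGI hit, 391 identical and 476 positive positions over 626 aligned columns reproduce the reported 62.46% and 76.04%.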

BLAST of Cp4.1LG00g00410 vs. Swiss-Prot
Match: LUPS_RICCO (Lupeol synthase OS=Ricinus communis PE=1 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 1.5e-250
Identity = 384/622 (61.74%), Positives = 475/622 (76.37%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF++YI GHL  +FS  HRKEILRY YCHQN+DGGWG+ I G S MFCTV NYI +R+LG
Sbjct: 130 VFAVYITGHLNTVFSPEHRKEILRYIYCHQNEDGGWGIHIEGHSTMFCTVLNYICMRILG 189

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E  D   ++ C R RKWILDHGGA    SWGK WLSILGVYEW+G NPMPPEFW      
Sbjct: 190 EARDGGIENACERGRKWILDHGGATGISSWGKTWLSILGVYEWDGTNPMPPEFWAFPSSF 249

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY R+T +PMSYL+GKRFVGP+TPLILQ+R+EIY +PYN IKW+  RH CAK
Sbjct: 250 PLHPAKMFCYCRITYMPMSYLYGKRFVGPITPLILQIREEIYNEPYNKIKWNSVRHLCAK 309

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
           ED  F     QKL W+AL  F EP+ + W F  +R +AL+I    I YEDHNS YITIGC
Sbjct: 310 EDNYFPHPTIQKLLWDALYTFSEPLFSRWPFNKLREKALKITMDHIHYEDHNSRYITIGC 369

Query: 247 IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
           +EKPL  L CW++DPHGEA+KKH+ARI DY+W+GEDG+KMQS+GSQ+WD + ++Q ++A+
Sbjct: 370 VEKPLCMLACWIEDPHGEAFKKHLARIADYIWVGEDGIKMQSFGSQTWDTSLALQALIAS 429

Query: 307 NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
           +L HE   TLK+GH F K SQ  ENPS DFR M+R+ISKG+WTFSD+D GWQ+SDCTAE+
Sbjct: 430 DLSHEIGPTLKQGHVFTKNSQATENPSGDFRKMFRHISKGAWTFSDKDQGWQVSDCTAES 489

Query: 367 LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
           L CCL+FS M   IVG+ MEP+  Y++VN+ILSLQ++NGG +AWEP     SW E LNPV
Sbjct: 490 LKCCLLFSMMPPEIVGEKMEPEKVYDSVNVILSLQSQNGGFTAWEP-ARAGSWMEWLNPV 549

Query: 427 EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
           EF+E  ++E EYVECTSS+IQALVLF+KL+P HR KEI+  +    +++E  Q+ DGSW+
Sbjct: 550 EFMEDLVVEHEYVECTSSAIQALVLFKKLYPRHRNKEIENCIINAAQFIENIQEPDGSWY 609

Query: 487 GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
           G WGIC++Y T+FA+KGL A G TY N S +R+GV+FLLK Q  DGGW ESY+SC +K Y
Sbjct: 610 GNWGICFSYGTWFALKGLAAAGRTYENCSAIRKGVDFLLKSQRDDGGWAESYLSCPKKVY 669

Query: 547 IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
           +P  GN SNLVQT++A+M LI+  Q KRDP PLHRAAKLLINSQ   GD+ QQ+++GAFM
Sbjct: 670 VPFEGNRSNLVQTAWAMMGLIYGGQAKRDPMPLHRAAKLLINSQTDLGDFPQQELTGAFM 729

Query: 607 NTCTLHYGLYRNVFPLWALAEY 626
             C LHY L+RN FP+WALAEY
Sbjct: 730 RNCMLHYALFRNTFPIWALAEY 750

BLAST of Cp4.1LG00g00410 vs. Swiss-Prot
Match: BAMS_PEA (Beta-amyrin synthase OS=Pisum sativum GN=OSCPSY PE=2 SV=1)

HSP 1 Score: 857.8 bits (2215), Expect = 1.5e-247
Identity = 381/627 (60.77%), Positives = 477/627 (76.08%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF +YI GHL  +F   HRKEILRY YCHQN+DGGWGL I G S MFCT  NYI +R+LG
Sbjct: 130 VFCVYITGHLDSVFPPEHRKEILRYIYCHQNEDGGWGLHIEGHSTMFCTALNYICMRILG 189

Query: 67  EEPDKDE---CFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E PD  E   C RAR WI  HGG  + PSWGK WLSILGV++W G+NPMPPEFW+L   L
Sbjct: 190 EGPDGGEDNACVRARNWIRQHGGVTHIPSWGKTWLSILGVFDWLGSNPMPPEFWILPSFL 249

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY RL  +PMSYL+GKRFVGP+TPLILQLR+E++T+PY  I W+ TRH CAK
Sbjct: 250 PMHPAKMWCYCRLVYMPMSYLYGKRFVGPITPLILQLREELHTEPYEKINWTKTRHLCAK 309

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           ED  +   L Q L W++L  F EP+L  W F K +R RAL++  + I YED NS Y+TIG
Sbjct: 310 EDIYYPHPLIQDLIWDSLYIFTEPLLTRWPFNKLVRKRALEVTMKHIHYEDENSRYLTIG 369

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CWV+DP+G+A+KKH+AR+ DYLWI EDGM MQS+GSQ WD  F++Q +LA
Sbjct: 370 CVEKVLCMLACWVEDPNGDAFKKHIARVPDYLWISEDGMTMQSFGSQEWDAGFAVQALLA 429

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           TNL  E    L KGHDFIK+SQV ENPS DF++M+R+ISKGSWTFSD+DHGWQ+SDCTAE
Sbjct: 430 TNLIEEIKPALAKGHDFIKKSQVTENPSGDFKSMHRHISKGSWTFSDQDHGWQVSDCTAE 489

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
            L CCL+ S +   IVG+ MEP+  +++VN++LSLQ+K GG++AWEP G    W ELLNP
Sbjct: 490 GLKCCLLLSLLPPEIVGEKMEPERLFDSVNLLLSLQSKKGGLAAWEPAGA-QEWLELLNP 549

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            EF    ++E EYVECT S+IQALVLF+KL+P HRKKEI+ F+   V++LE+TQ EDGSW
Sbjct: 550 TEFFADIVVEHEYVECTGSAIQALVLFKKLYPGHRKKEIENFIFNAVRFLEDTQTEDGSW 609

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WG+C+TY ++FA+ GL A G TY N + +R+GV+FLL  Q  DGGWGESY+S  +K 
Sbjct: 610 YGNWGVCFTYGSWFALGGLAAAGKTYTNCAAIRKGVKFLLTTQREDGGWGESYLSSPKKI 669

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           Y+PL GN SN+V T++ALM LIH+ Q +RDPTPLHRAAKLLINSQL+ GD+ QQ+I+G F
Sbjct: 670 YVPLEGNRSNVVHTAWALMGLIHAGQSERDPTPLHRAAKLLINSQLEQGDWPQQEITGVF 729

Query: 607 MNTCTLHYGLYRNVFPLWALAEYYNKL 630
           M  C LHY +YR+++PLWALAEY  ++
Sbjct: 730 MKNCMLHYPMYRDIYPLWALAEYRRRV 755

BLAST of Cp4.1LG00g00410 vs. Swiss-Prot
Match: BAMS_SOLLC (Beta-amyrin synthase OS=Solanum lycopersicum GN=TTS1 PE=1 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 3.4e-247
Identity = 383/622 (61.58%), Positives = 468/622 (75.24%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           V  +YI GHL  +F   HRKEILRY YCHQN+DGGWGL I G S MFCT  +YI +R+LG
Sbjct: 130 VMCMYITGHLNTVFPAEHRKEILRYIYCHQNEDGGWGLHIEGHSTMFCTALSYICMRILG 189

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E PD    + C RARKWILDHG     PSWGK WLSILGV+EW G NPMPPEFW+L   L
Sbjct: 190 EGPDGGVNNACARARKWILDHGSVTAIPSWGKTWLSILGVFEWIGTNPMPPEFWILPSFL 249

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY R+  +PMSYL+GKRFVGP+TPLILQLR+E+Y +PY+ I W   RH CAK
Sbjct: 250 PVHPAKMWCYCRMVYMPMSYLYGKRFVGPITPLILQLREELYDRPYDEINWKKVRHVCAK 309

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
           ED  +   L Q L W++L    EP+L  W F  +RN+AL++  + I YED NS YITIGC
Sbjct: 310 EDLYYPHPLVQDLMWDSLYICTEPLLTRWPFNKLRNKALEVTMKHIHYEDENSRYITIGC 369

Query: 247 IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
           +EK L  L CWV+DP+G+ +KKH+ARI DYLW+ EDGMKMQS+GSQ WD  F+IQ +LA+
Sbjct: 370 VEKVLCMLACWVEDPNGDYFKKHLARIPDYLWVAEDGMKMQSFGSQEWDTGFAIQALLAS 429

Query: 307 NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
            ++ E ++TL+KGHDFIKQSQV  NPS DF+ MYR+ISKGSWTFSD+DHGWQ+SDCTAE 
Sbjct: 430 EMNDEIADTLRKGHDFIKQSQVTNNPSGDFKGMYRHISKGSWTFSDQDHGWQVSDCTAEA 489

Query: 367 LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
           L CCL+ STM   +VG  MEP   Y++VN++LSLQ+KNGG++AWEP  G   + ELLNP 
Sbjct: 490 LKCCLLLSTMPRELVGQAMEPGRLYDSVNVVLSLQSKNGGLAAWEP-AGASEYLELLNPT 549

Query: 427 EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
           EF    ++E EYVECT+SSIQALVLF+KL+P HR KEI  F+   VKYLE+ Q  DGSW+
Sbjct: 550 EFFADIVIEHEYVECTASSIQALVLFKKLYPGHRTKEINIFIDNAVKYLEDVQMPDGSWY 609

Query: 487 GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
           G WG+C+TY ++FA+ GLVA G +YNNS+ +R+GVEFLL+ Q  DGGWGESY SC  K Y
Sbjct: 610 GNWGVCFTYGSWFALGGLVAAGKSYNNSAAVRKGVEFLLRTQRSDGGWGESYRSCPDKVY 669

Query: 547 IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
             L  N SNLVQT++ALM LIHS Q  RDP PLHRAAKLLINSQ+++GD+ QQ+I+G FM
Sbjct: 670 RELETNDSNLVQTAWALMGLIHSGQADRDPKPLHRAAKLLINSQMEDGDFPQQEITGVFM 729

Query: 607 NTCTLHYGLYRNVFPLWALAEY 626
             C LHY  YRN++PLW LAEY
Sbjct: 730 KNCMLHYAAYRNIYPLWGLAEY 750

BLAST of Cp4.1LG00g00410 vs. Swiss-Prot
Match: BAS_BRUGY (Beta-amyrin synthase OS=Bruguiera gymnorhiza GN=BAS PE=1 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 2.9e-246
Identity = 379/627 (60.45%), Positives = 472/627 (75.28%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           V  +YI GHL  +F   HRKEILRY Y HQN+DGGWGL I G S MFCT  NYI +R++G
Sbjct: 130 VMCVYITGHLDAVFPAEHRKEILRYIYYHQNEDGGWGLHIEGHSTMFCTALNYICMRIIG 189

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E P+    D C RARKWI DHG     PSWGK WLSILGVY+W G+NPMPPEFW+L   L
Sbjct: 190 EGPNGGQDDACARARKWIHDHGSVTNIPSWGKTWLSILGVYDWSGSNPMPPEFWMLPSFL 249

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY R+  +PMSYL+GKRFVGP+TPLI QLR+E++TQPY+ I W  TRH CA 
Sbjct: 250 PMHPAKMWCYCRMVYMPMSYLYGKRFVGPITPLIQQLREELFTQPYDQINWKKTRHQCAP 309

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           ED  +     Q L W+ L  F EP+L  W   + IR +AL++  + I YED +S YITIG
Sbjct: 310 EDLYYPHPFVQDLIWDCLYIFTEPLLTRWPLNEIIRKKALEVTMKHIHYEDESSRYITIG 369

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CWV+DP+G+ +KKH+ARI DY+W+ EDGMKMQS+GSQ WD  F+IQ +LA
Sbjct: 370 CVEKVLCMLACWVEDPNGDYFKKHLARIPDYIWVAEDGMKMQSFGSQEWDTGFAIQALLA 429

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           TNL  E  + L++GHDFIK+SQVR+NPS DF++MYR+ISKGSWTFSD+DHGWQ+SDCTAE
Sbjct: 430 TNLTDEIGDVLRRGHDFIKKSQVRDNPSGDFKSMYRHISKGSWTFSDQDHGWQVSDCTAE 489

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
            L CCL+FS M   IVG+ M P+  Y++VN++LSLQ+KNGG+SAWEP G    W ELLNP
Sbjct: 490 GLKCCLLFSMMPPEIVGEHMVPERLYDSVNVLLSLQSKNGGLSAWEPAGA-QEWLELLNP 549

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            EF    ++E EYVECTSS+I ALVLF+KL+P HRKKEI  F+   V+YLE  Q  DG W
Sbjct: 550 TEFFADIVIEHEYVECTSSAIHALVLFKKLYPGHRKKEIDNFIVNAVRYLESIQTSDGGW 609

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WG+C+TY T+FA+ GL A G TYNN   +R+ V+FLL+IQ  +GGWGESY+SC +K+
Sbjct: 610 YGNWGVCFTYGTWFALGGLAAAGKTYNNCLAMRKAVDFLLRIQRDNGGWGESYLSCPEKR 669

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           Y+PL GN SNLV T++ALMALIH+ Q  RDPTPLHRAA+L+INSQL++GD+ QQ+I+G F
Sbjct: 670 YVPLEGNRSNLVHTAWALMALIHAGQMDRDPTPLHRAARLMINSQLEDGDFPQQEITGVF 729

Query: 607 MNTCTLHYGLYRNVFPLWALAEYYNKL 630
           M  C LHY  YRN++PLWALAEY  ++
Sbjct: 730 MKNCMLHYAAYRNIYPLWALAEYRRRV 755
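
The percentages in each HSP summary line are the identical (or positive-scoring) column counts divided by the alignment length, e.g. 379/627 = 60.45% for the BAS_BRUGY hit above. A minimal Python sketch for reading such a line and re-deriving the percentages follows; the names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration, not part of any BLAST tool, and the pattern assumes the summary line layout shown on this page.

```python
import re

# Matches the summary line as printed on this page. The label is accepted
# both as "Positives" and as the misspelled "Postives" (assumption: no
# other variants occur).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


line = "Identity = 379/627 (60.45%), Postives = 472/627 (75.28%), Query Frame = 1"
stats = parse_hsp_stats(line)
# Recompute the percentages from the raw counts and compare with the report.
print(percent(stats["identity"], stats["length"]), stats["identity_pct"])
print(percent(stats["positives"], stats["length"]), stats["positives_pct"])
```

Recomputing the values this way is a quick consistency check when the summary lines are copied out of the page by hand.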

BLAST of Cp4.1LG00g00410 vs. TrEMBL
Match: A0A103XQS9_CYNCS (Prenyltransferase/squalene oxidase OS=Cynara cardunculus var. scolymus GN=Ccrd_002745 PE=3 SV=1)

HSP 1 Score: 1323.9 bits (3425), Expect = 0.0e+00
Identity = 650/1267 (51.30%), Positives = 806/1267 (63.61%), Query Frame = 1

Query: 7    VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
            V  L+I GHL  IF   HRKEILRY YCHQN+DGGWG  I G S MF +  +YI +RLLG
Sbjct: 813  VICLFITGHLNDIFPSEHRKEILRYLYCHQNEDGGWGFHIEGHSTMFGSTLSYICMRLLG 872

Query: 67   EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
            E PD      C +ARKWILDHGGA   P+WGK WLSILGV EW G NPMPPEFW+L   L
Sbjct: 873  EGPDGGLNGACTKARKWILDHGGATAIPAWGKTWLSILGVCEWAGNNPMPPEFWILPSFL 932

Query: 127  PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
            P  P  + CY RL  +PMSYL+GKRFVGP+TPL+LQLR E+Y QPYN I     RH CAK
Sbjct: 933  PMHPAKMWCYCRLVYMPMSYLYGKRFVGPITPLVLQLRDELYAQPYNKINRKSIRHLCAK 992

Query: 187  EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
            ED     S  Q+L W++L  F EP+L  W F  +R +ALQ   + I YED NS YITIG 
Sbjct: 993  EDLYHPHSSLQELLWDSLYIFTEPLLTHWPFNKLREKALQTTMKHIHYEDENSRYITIGA 1052

Query: 247  IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
            +EK L  L CWV+DP+G  +KKH+ARI DY+W+ EDGMKMQ   SQ WD +  +Q +LAT
Sbjct: 1053 VEKALCMLSCWVEDPNGVCFKKHLARIPDYIWVAEDGMKMQGTNSQVWDASLVVQALLAT 1112

Query: 307  NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
            +L HE   TLKKGHDFI  SQV++NPS DF +M+R+ISKGSWTF+D+DHGWQ+SDCTAE 
Sbjct: 1113 DLPHEIGPTLKKGHDFINASQVKDNPSGDFESMHRHISKGSWTFADQDHGWQVSDCTAEG 1172

Query: 367  LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
            L CCL+ S M   IVG  M P+    AV+++LSLQ+KNGG+  WEP G    W E+LNP 
Sbjct: 1173 LKCCLLLSMMPPEIVGKKMAPEQLNNAVDVLLSLQSKNGGLPGWEPAGS-SKWLEILNPT 1232

Query: 427  EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
            EF    ++E EY ECTSS+IQALVLF+K +P HR KEI +FL+   +YLE+ Q  DGSW+
Sbjct: 1233 EFFVDIVIEHEYTECTSSAIQALVLFKKSYPEHRSKEIDSFLTVAGEYLEKMQMSDGSWY 1292

Query: 487  GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
            G WG+C+TYAT+FA+ GL A G TY N   + + V FLLK Q  DGGWGESY SC +K  
Sbjct: 1293 GNWGVCFTYATWFALGGLAAIGKTYENCPAIGKAVNFLLKTQREDGGWGESYQSCTKK-- 1352

Query: 547  IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
                                      KRDPTPLH+AAKLLINSQ +NGD+SQQ+ SG F 
Sbjct: 1353 -------------------------AKRDPTPLHKAAKLLINSQTRNGDFSQQETSGVFK 1412

Query: 607  NTCTLHYGLYRNVFPLWALAEY-------------YNKLGEGGHDPYFFSSNNFVGRQTW 666
              C LHY LYR++FP+WALA Y               K+  G ++PY +S+NNFVGRQTW
Sbjct: 1413 QNCLLHYALYRDIFPMWALAAYSVVAISNLDEKMWRLKIANGVNNPYLYSTNNFVGRQTW 1472

Query: 667  DFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFKQTIQIVRVED-- 726
            +F+P+ GTPEER E+E+AR +++ +R +V+ SSD+ W+ QFLRE+ FKQTI  V++ED  
Sbjct: 1473 EFDPNYGTPEERNEIEKARLHFWDHRHEVKPSSDVLWRMQFLREKQFKQTIAQVKIEDGE 1532

Query: 727  VIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYITGHLHIVFSEEH 786
             I+ E  +  LRR+   FAALQ+  GHWPAENAGPM++  PL                  
Sbjct: 1533 DINYEKVTTTLRRSVHLFAALQAEDGHWPAENAGPMYFIQPL------------------ 1592

Query: 787  RKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEELD---KDECFRARKWI 846
                        NEDGGWG HI G S M  T  +YI +RLLGE  D      C +ARKWI
Sbjct: 1593 ------------NEDGGWGFHIEGHSTMFGTTLSYICMRLLGEGPDGGLNGACTKARKWI 1652

Query: 847  LDHGGAIYIPSWERFGSRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPY--NHIKWS 906
            LDHG A  IPSW                GK +          L +++Y   +   H+ W 
Sbjct: 1653 LDHGSATTIPSW----------------GKTW----------LSEDLYYPXHLLQHLMWD 1712

Query: 907  STRHYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHH 966
            S                        L    EP+L  W F  +R +AL+    +I YED +
Sbjct: 1713 S------------------------LYIFTEPLLTHWPFNKLREKALETTMKHIHYEDEN 1772

Query: 967  SHYITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAF 1026
            S YITIG VEK L  LA W++DP+G  +KKH+ARI DY+W+        ++GSQ+WD +F
Sbjct: 1773 SRYITIGSVEKALCMLACWVEDPNGVCFKKHLARIPDYIWLAEDGMKMQTFGSQAWDASF 1832

Query: 1027 AIQAMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQ 1086
            AIQA+LA++L +E   TLKKGHDFIK SQV++NPSGDF+SM+RHISKGSWTFSD+DHGWQ
Sbjct: 1833 AIQALLASDLINEIGPTLKKGHDFIKDSQVKDNPSGDFKSMHRHISKGSWTFSDQDHGWQ 1892

Query: 1087 VSDCTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPS 1146
            VSD TAE L+CCL  S MP   VG  MEP+    AVN ILS+                  
Sbjct: 1893 VSDSTAEGLMCCLLLSMMPPEFVGKKMEPEQLNNAVNVILSM------------------ 1951

Query: 1147 WFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEET 1206
               +LNP EF    V+E EY+ECTSS IQAL LF   +P HR KEI++ L+KA +Y+E+ 
Sbjct: 1953 --PILNPTEFFADIVIEHEYIECTSSVIQALALFKNSYPEHRSKEIDSLLTKAGEYIEKM 1951

Query: 1207 QKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESH 1243
            Q  DGSW+GNWG+C  YAT+FA+ GL A G TY N   + K V FLLK Q  DGGWGES+
Sbjct: 2013 QMSDGSWYGNWGICFTYATWFALGGLAAIGKTYENCQAVGKAVNFLLKTQLKDGGWGESY 1951

BLAST of Cp4.1LG00g00410 vs. TrEMBL
Match: D7KVW8_ARALL (Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_316793 PE=3 SV=1)

HSP 1 Score: 1112.8 bits (2877), Expect = 0.0e+00
Identity = 516/864 (59.72%), Positives = 630/864 (72.92%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF L++ GHL  IF++ HR+EILRY YCHQN+DGGWGL I G S MFCT  NYI +R+LG
Sbjct: 131 VFCLFVTGHLHEIFTQEHRREILRYIYCHQNEDGGWGLHIEGDSTMFCTTLNYICMRILG 190

Query: 67  EEP---DKDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E P     + C RAR WILDHGGA Y PSWGK WLSILGV++W G+NPMPPEFW+L   L
Sbjct: 191 ESPFGGPGNACRRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMPPEFWILPSFL 250

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY RL  +PMSYL+GKRFVGP++PLILQLR+EIY QPY  I W+  RH CAK
Sbjct: 251 PIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKINWNRARHLCAK 310

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           ED        Q + W+ L  F EP L  W F K +R +AL +A + I YED NS YITIG
Sbjct: 311 EDAYCPHPQIQDVIWDCLYIFTEPFLTCWPFNKLLREKALGVAMKHIHYEDENSRYITIG 370

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CWV+DP+G  +KKH+ RI DYLWI EDGMKMQS+GSQ WD  F++Q ++A
Sbjct: 371 CVEKALCMLACWVEDPNGSHFKKHLLRISDYLWIAEDGMKMQSFGSQLWDSGFALQALVA 430

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           ++L +E  + L++G+DF+K SQVRENPS DF NM+R+ISKGSWTFSDRDHGWQ SDCTAE
Sbjct: 431 SDLANEIPDVLRRGYDFLKNSQVRENPSGDFTNMFRHISKGSWTFSDRDHGWQASDCTAE 490

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
              CCL+ S M  +IVG  M+P+  YEAV I+LSLQ+KNGGV+AWEP  G   W ELLNP
Sbjct: 491 GFKCCLLLSMMPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG-QEWLELLNP 550

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            E     ++E EY ECTSS+IQAL+LF++L+PNHR  EI T + K V+Y+E  Q  DGSW
Sbjct: 551 TEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTAEINTSIKKAVQYIESIQMHDGSW 610

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WG+C+TY+T+F + GL A G TYNN   +R+GV FLL  Q  +GGWGESY+SC +K+
Sbjct: 611 YGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGESYLSCPKKR 670

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           YIP  G+ SNLVQTS+A+M L+H+ Q +RDP PLHRAAKLLINSQL+NGD+ QQ+I+GAF
Sbjct: 671 YIPSEGDRSNLVQTSWAMMGLLHAGQAERDPAPLHRAAKLLINSQLENGDFPQQEITGAF 730

Query: 607 MNTCTLHYGLYRNVFPLWALAEYYN-----------------------KLGEG-GHDPYF 666
           M  C LHY  YRN+FP+WALAEY                         K+GEG G DPY 
Sbjct: 731 MKNCLLHYAAYRNIFPVWALAEYRRRVPLPYENLEQREELCSFVMWRLKIGEGSGDDPYL 790

Query: 667 FSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFK 726
           F++NNFVGRQTW+F+PD G+PEER  V EAR ++Y NRF V+ SSDL W+ QFL+E+ F+
Sbjct: 791 FTTNNFVGRQTWEFDPDAGSPEERYAVVEARQSFYDNRFHVKASSDLLWRMQFLKEKKFE 850

Query: 727 QTIQIVRVE--DVIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYI 786
           Q I  V+VE  + +  ETA+ ALRR   FF+ALQ+  GHWPAENAGP+F+ PPLVF LYI
Sbjct: 851 QVIAPVKVEGSEKVTFETATNALRRGVHFFSALQASDGHWPAENAGPLFFLPPLVFCLYI 910

Query: 787 TGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEEL--- 838
           TGHL  VF+ EHRKEILRY YCHQ EDGGWGLHI G S M CT  NYI +R+LGE     
Sbjct: 911 TGHLDEVFTLEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTALNYICMRILGESPVGG 970

BLAST of Cp4.1LG00g00410 vs. TrEMBL
Match: A0A0D9YM41_9ORYZ (Uncharacterized protein OS=Oryza glumipatula PE=3 SV=1)

HSP 1 Score: 951.0 bits (2457), Expect = 1.5e-273
Identity = 482/1134 (42.50%), Positives = 654/1134 (57.67%), Query Frame = 1

Query: 7    VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
            + +LY+ G L    S  H+KEI RY Y HQN+DGGWGL I G S MF +   Y+ LRLLG
Sbjct: 130  IITLYVSGALNTALSSEHQKEIRRYLYNHQNEDGGWGLHIEGHSTMFGSALTYVSLRLLG 189

Query: 67   EEPDKDE--CFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLLP 126
            E PD  +    + RKWILDHGGA Y  SWGK WLS+LGV++W G NP+PPE WLL   LP
Sbjct: 190  EGPDSGDGAMEKGRKWILDHGGATYITSWGKFWLSVLGVFDWSGNNPVPPEIWLLPYFLP 249

Query: 127  FIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAKE 186
              P  + C+ R+  LPM Y++GKRFVGP+TP+IL+LR+E+Y  PYN + W   R+ CAKE
Sbjct: 250  IHPGRMWCHCRMVYLPMCYIYGKRFVGPVTPIILELRKELYEVPYNEVDWDKARNLCAKE 309

Query: 187  DKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGCI 246
            D  +     Q + W  L  F EP +  W    +R +AL    + I YED N+ YI IG +
Sbjct: 310  DLYYPHPFVQDVLWATLHKFVEPAMLRWPGNKLREKALDTVMQHIHYEDENTRYICIGPV 369

Query: 247  EKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSY-GSQSWDVAFSIQTVLAT 306
             K L  L CW++DP+ EA+K H+ R+ DYLWI EDGMKMQ Y GSQ WD AF++Q ++AT
Sbjct: 370  NKVLNMLACWIEDPNSEAFKLHIPRVHDYLWIAEDGMKMQGYNGSQLWDTAFTVQAIVAT 429

Query: 307  NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
             L  EF  TLK  H +IK++QV ++   D    YR+ISKG+W FS  DHGW ISDCTAE 
Sbjct: 430  GLIEEFGPTLKLAHGYIKKTQVIDDCPGDLSQWYRHISKGAWPFSTADHGWPISDCTAEG 489

Query: 367  LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
            L   L+ S +S +IVG+ +E    Y++VN ++S    NGG + +E T    +W EL+NP 
Sbjct: 490  LKAALLLSKISPDIVGEAVEVNRLYDSVNCLMSYMNDNGGFATYELTRSY-AWLELINPA 549

Query: 427  EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
            E     +++  YVECTS++IQAL  F+KL+  HRK EI   +SK   ++E  QK DGSW+
Sbjct: 550  ETFGDIVIDYPYVECTSAAIQALTAFKKLYLGHRKSEIDNCISKAASFIEGIQKSDGSWY 609

Query: 487  GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
            G W +C+TY T+F +KGLVA G T+ NS  +R+  +FLL  + P GGWGESY+S   + Y
Sbjct: 610  GSWAVCFTYGTWFGVKGLVAAGRTFKNSPAIRKACDFLLSKELPSGGWGESYLSSQDQVY 669

Query: 547  IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
              L G   + V T +A++ALI + Q +RDP PLHRAAK+LIN Q ++G++ QQ+I G F 
Sbjct: 670  TNLEGKRPHAVNTGWAMLALIDAGQAERDPIPLHRAAKVLINLQSEDGEFPQQEIIGVFN 729

Query: 607  NTCTLHYGLYRNVFPLWALAEYYNKL----------GEGGHDPYFFSSNNFVGRQTWDFE 666
              C + Y  YRN+FP+WAL EY  ++          G GG + +  S+N  VGRQ W+F+
Sbjct: 730  KNCMISYSEYRNIFPIWALGEYRRRVLAADKVAEEAGAGG-EGWLSSTNAHVGRQVWEFD 789

Query: 667  P----DDGTPEERAEVEEARHNYYQNR-----------------FKVQCSSDLFWKF--- 726
                 DD       EVE AR  Y + R                      S+ L       
Sbjct: 790  AAAADDDDAAAAAEEVEAARREYIRRRRATTGGGGMAAAPPPRRLGALASAGLLHGIDAQ 849

Query: 727  --QFLRERNFKQTIQIVRV--EDVIDKETASIALRRATKFFAALQSPHGHWPAENAGPMF 786
              +F R    K  I  +++  ++ + +E    +L+RA + ++ LQ+  GHWP + AGPMF
Sbjct: 850  LRRFTRSNPSKLEIPGIKLGEDEDVTEEAVLTSLKRAIRRYSTLQAHDGHWPGDYAGPMF 909

Query: 787  YFPPLVFSLYITG----HLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVF 846
              P L +  Y T       +   +   + +         NEDGGWGLHI G S M CTV 
Sbjct: 910  LLPGLFYPGYSTACDWCTKYCAINRTSKGD-------SPNEDGGWGLHIEGTSTMFCTVL 969

Query: 847  NYIQLRLLGEELDKDECFRARKWILDHGGAIYIPSWERFGSRLTL---LPMTY------- 906
             Y+ LRLLG+E D  +           G    +P        L     LP  Y       
Sbjct: 970  TYVTLRLLGDESDGGDGAMVLGVFDWSGNNPLLPELWMLPYFLPFHPGLPSPYSVSSSYN 1029

Query: 907  ------------LFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSSTRHYCAKEDKCFERS 966
                        +  ++FVGP+TP++L LR+E+Y  PY+ I W   R+ CAKED  +   
Sbjct: 1030 NRENYRRYWLECMLVQKFVGPITPIVLTLRKELYNIPYDDINWDKARNQCAKEDLYYRHP 1089

Query: 967  LFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHHSHYITIGCVEKPLFTL 1026
            L Q + W  L    EP+L+ W    +R +AL+ A  +I YED ++ YI  G V+K L  L
Sbjct: 1090 LGQDILWATLYKFVEPVLSHWPGSKLREKALKNAMQHIHYEDENTRYICSGAVQKVLNML 1149

Query: 1027 ASWIDDPHGEAYKKHVARIKDYLWMS---------YGSQSWDVAFAIQAMLATNLHHENS 1065
            + WI++P+ EA++ H+ R+ DYLW++          GSQ WD AF +QA+LATNL  +  
Sbjct: 1150 SCWIENPNSEAFRFHIPRVHDYLWVAEDGMKMQGYNGSQLWDTAFTVQAILATNLIEDFG 1209

BLAST of Cp4.1LG00g00410 vs. TrEMBL
Match: F6GYJ7_VITVI (Terpene cyclase/mutase family member OS=Vitis vinifera GN=VIT_09s0054g01220 PE=3 SV=1)

HSP 1 Score: 922.5 bits (2383), Expect = 5.6e-265
Identity = 433/741 (58.43%), Positives = 531/741 (71.66%), Query Frame = 1

Query: 628  KLGEGGHDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFW 687
            K+ +GG+DPY +S+NNFVGRQ W+F+PD GTPEERAEVE AR N+++NR++V+ S DL W
Sbjct: 5    KVADGGNDPYIYSTNNFVGRQIWEFDPDYGTPEERAEVEAARENFWKNRYQVKPSGDLLW 64

Query: 688  KFQFLRERNFKQTIQIVRVED--VIDKETASIALRRATKFFAALQSPHGHWPAENAGPMF 747
            + QFLRE+NFKQTI  V+V D   I  ETA+ A+RR   FF+ALQ+  GHWPAENAGP++
Sbjct: 65   RMQFLREKNFKQTIPQVKVGDGEEITYETATAAVRRGAHFFSALQASDGHWPAENAGPLY 124

Query: 748  YFPPLVFSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQ 807
            + PPLV  LYITGHL  VF  E+RKEILRY YCHQNEDGGWG HI G S M CT  +YI 
Sbjct: 125  FLPPLVMCLYITGHLDTVFPGEYRKEILRYLYCHQNEDGGWGFHIEGHSTMFCTTLSYIC 184

Query: 808  LRLLGEELD---KDECFRARKWILDHGGAIYIPSWER----------------------- 867
            +R+LGE  D   ++ C R RKWILD GG   IPSW +                       
Sbjct: 185  MRILGEGPDGGRENACARGRKWILDRGGVTSIPSWGKTWLSIFGLFDWSGSNPMPPEFWL 244

Query: 868  FGSRLTL-------------LPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSSTR 927
            F SRL +             +PM+YL+GKRFVGP+TPL+L+LR+E++ QPYN I W   R
Sbjct: 245  FPSRLPMHPAKMWCYCRMVYMPMSYLYGKRFVGPITPLVLELREELFLQPYNEINWRKVR 304

Query: 928  HYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHHSHY 987
            H CAKED  +   L Q L WD L  C EP+L  W F  +R +AL++   +I YED +S Y
Sbjct: 305  HLCAKEDLYYPHPLIQDLMWDGLYICTEPLLTRWPFNKLRQKALEVTMKHIHYEDENSRY 364

Query: 988  ITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAFAIQ 1047
            ITIGCVEK L  L+ W++DP+G  +KKH+ARI DY+W+        S+GSQ WD  FA+Q
Sbjct: 365  ITIGCVEKVLCMLSCWVEDPNGNYFKKHLARIPDYIWVGEDGIKMQSFGSQEWDTGFALQ 424

Query: 1048 AMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQVSD 1107
            A+LA N+  E   TLKKGH+F+K+SQV++NPSGDF+SMYRHISKGSWTFSD+DHGWQVSD
Sbjct: 425  AVLACNMTDEIGPTLKKGHEFVKESQVKDNPSGDFKSMYRHISKGSWTFSDQDHGWQVSD 484

Query: 1108 CTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPSWFE 1167
            CTAE L CCL FS MP  IVG  MEP+  F++VN +LSLQ+KNGG++AWEP GA   W E
Sbjct: 485  CTAEGLKCCLLFSMMPPEIVGVKMEPERLFDSVNILLSLQSKNGGLAAWEPAGA-SEWLE 544

Query: 1168 LLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEETQKE 1227
            LLNP EF    V+E EYVECT+S+IQALVLF KL+P HRKKEI+ F++ A KY+E+ Q  
Sbjct: 545  LLNPTEFFADIVIEHEYVECTASAIQALVLFKKLYPGHRKKEIDNFITYAAKYIEDIQMP 604

Query: 1228 DGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESHVSC 1287
            DGSW+GNWGVC  Y ++FA+ GL A G TYHN   IR+ VEFLL  Q  DGGWGES+VSC
Sbjct: 605  DGSWYGNWGVCFTYGSWFALGGLAAAGKTYHNCHAIRRAVEFLLNSQRDDGGWGESYVSC 664

Query: 1288 MQKKYIPLPRNSSNLVQTSFALMALIHSHQEKRDPTPLHHGAKLLINSQLENGDYPQQEI 1320
              KKY PL  N SNLV T +ALM LI S Q +RDPTPLH  AKLLIN Q+E+GD+PQQEI
Sbjct: 665  PDKKYTPLEGNRSNLVHTGWALMGLISSGQAERDPTPLHRAAKLLINFQMEDGDFPQQEI 724

BLAST of Cp4.1LG00g00410 vs. TrEMBL
Match: F6GYJ6_VITVI (Terpene cyclase/mutase family member OS=Vitis vinifera GN=VIT_09s0054g01230 PE=3 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 1.6e-259
Identity = 422/741 (56.95%), Positives = 532/741 (71.79%), Query Frame = 1

Query: 628  KLGEGGHDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFW 687
            K+ +GG+DPY +S+NNFVGRQ W+F+PD GTPEERAEVE AR N+++NRF+V+ SSDL W
Sbjct: 5    KVADGGNDPYIYSTNNFVGRQIWEFDPDYGTPEERAEVEAARENFWKNRFQVKPSSDLLW 64

Query: 688  KFQFLRERNFKQTIQIVRVED--VIDKETASIALRRATKFFAALQSPHGHWPAENAGPMF 747
            + QFLRE+NFKQTI  V+V D   I +ETA+ A+RR   FF+ALQ+  GHWPAE++GP+F
Sbjct: 65   RMQFLREKNFKQTIPQVKVGDGEEIAEETATTAVRRGAHFFSALQASDGHWPAEHSGPLF 124

Query: 748  YFPPLVFSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQ 807
            + PPLV  LYITGHL  VF  E+RKEILRY YCHQNEDGGWGLHI   S M CT  +Y+ 
Sbjct: 125  FLPPLVMCLYITGHLDTVFPGEYRKEILRYLYCHQNEDGGWGLHIEEHSIMFCTTLSYVC 184

Query: 808  LRLLGEELD---KDECFRARKWILDHGGAIYIPSWER-----FG---------------- 867
            +R+LGE  D    + C R RKWIL+ GG   IP+W +     FG                
Sbjct: 185  MRILGEGRDGGRDNACARGRKWILNRGGVTSIPTWGKIWLSIFGLFDWSGSNPMPPEFSL 244

Query: 868  ---------------SRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSSTR 927
                           SRL  +PM+YL+GKRFVGP+TPL+L+LR+E++ QPYN I W   R
Sbjct: 245  FPSHLPMNPAKIWCYSRLIFMPMSYLYGKRFVGPITPLVLELREELFLQPYNEINWKKVR 304

Query: 928  HYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHHSHY 987
            H CAKED  +   L Q + WD+L  C EP+L  W F  +R +AL++   +I YED +S Y
Sbjct: 305  HLCAKEDLYYPHPLIQDMMWDSLYICTEPLLTRWPFNKLRKKALEVTMKHIHYEDENSRY 364

Query: 988  ITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAFAIQ 1047
            ITIGCVEK L  L+ W++DP+G+ +KKH+ARI DY+W+        ++GSQ WD + A+Q
Sbjct: 365  ITIGCVEKVLCMLSCWVEDPNGDYFKKHLARIPDYIWVGEDGIKMQTFGSQEWDTSLALQ 424

Query: 1048 AMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQVSD 1107
            A+LA N+ +E   TLKKGH+F+K+SQV++NPSGDF+SMYRHISKGSWTFSD+DHGWQVSD
Sbjct: 425  ALLACNMTNEIGPTLKKGHEFLKESQVKDNPSGDFKSMYRHISKGSWTFSDKDHGWQVSD 484

Query: 1108 CTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPSWFE 1167
            CTAE L CCL FS M   IVG  +EP   F++VN +LSLQ++NGG+  WEP GA   W E
Sbjct: 485  CTAEGLKCCLLFSMMAPEIVGMKIEPGRLFDSVNILLSLQSENGGIVGWEPAGA-SEWLE 544

Query: 1168 LLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEETQKE 1227
            LLNP E  E  V+E EYVECT+S+IQALVLF KL+P HR +EI+ F++ A KY+E+ Q  
Sbjct: 545  LLNPSEMFEDLVIEHEYVECTASAIQALVLFKKLYPEHRTEEIDNFITNATKYIEDQQMP 604

Query: 1228 DGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESHVSC 1287
            DGSW+G WGVC IY++++A+ GL A G TYHN   I + VEFLLK Q  DGGWGES++SC
Sbjct: 605  DGSWYGKWGVCFIYSSWWALGGLAAAGKTYHNCLAIGRAVEFLLKSQRDDGGWGESYISC 664

Query: 1288 MQKKYIPLPRNSSNLVQTSFALMALIHSHQEKRDPTPLHHGAKLLINSQLENGDYPQQEI 1320
              KKY PL  N SNLVQT +ALM L+ S Q +RDPTPLH  AKLLINSQ+E+GD+PQQEI
Sbjct: 665  RDKKYTPLEGNKSNLVQTGWALMGLLSSGQAERDPTPLHRAAKLLINSQMEDGDFPQQEI 724

BLAST of Cp4.1LG00g00410 vs. TAIR10
Match: AT1G78955.1 (AT1G78955.1 camelliol C synthase 1)

HSP 1 Score: 847.4 bits (2188), Expect = 1.2e-245
Identity = 382/627 (60.93%), Positives = 472/627 (75.28%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF LY+ GHL  IF++ HR+E+LRY YCHQN+DGGWGL I G S MFCT  NYI +R+LG
Sbjct: 131 VFCLYVTGHLHEIFTQDHRREVLRYIYCHQNEDGGWGLHIEGNSTMFCTTLNYICMRILG 190

Query: 67  EEPDK---DECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E P+    + C RAR WILDHGGA Y PSWGK WLSILGV++W G+NPMPPEFW+L   L
Sbjct: 191 EGPNGGPGNACKRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMPPEFWILPSFL 250

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY RL  +PMSYL+GKRFVGP++PLILQLR+EIY QPY  I W+  RH CAK
Sbjct: 251 PIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKINWNRARHLCAK 310

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           ED        Q + WN L  F EP L  W F K +R +AL +A + I YED NS YITIG
Sbjct: 311 EDAYCPHPQIQDVIWNCLYIFTEPFLACWPFNKLLREKALGVAMKHIHYEDENSRYITIG 370

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CWV+DP+G  +KKH+ RI DYLWI EDGMKMQS+GSQ WD  F++Q ++A
Sbjct: 371 CVEKALCMLACWVEDPNGIHFKKHLLRISDYLWIAEDGMKMQSFGSQLWDSGFALQALVA 430

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           +NL +E  + L++G+DF+K SQVRENPS DF NMYR+ISKGSWTFSDRDHGWQ SDCTAE
Sbjct: 431 SNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSDRDHGWQASDCTAE 490

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
           +  CCL+ S +  +IVG  M+P+  YEAV I+LSLQ+KNGGV+AWEP  G   W ELLNP
Sbjct: 491 SFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG-QEWLELLNP 550

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            E     ++E EY ECTSS+IQAL+LF++L+PNHR +EI T + K V+Y+E  Q  DGSW
Sbjct: 551 TEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQYIESIQMLDGSW 610

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WG+C+TY+T+F + GL A G TYNN   +R+GV FLL  Q  +GGWGESY+SC +K+
Sbjct: 611 YGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGESYLSCPKKR 670

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           YIP  G  SNLVQTS+A+M L+H+ Q +RDP+PLHRAAKLLINSQL+NGD+ QQ+I+GAF
Sbjct: 671 YIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLENGDFPQQEITGAF 730

Query: 607 MNTCTLHYGLYRNVFPLWALAEYYNKL 630
           M  C LHY  YRN+FP+WALAEY  ++
Sbjct: 731 MKNCLLHYAAYRNIFPVWALAEYRRRV 756

BLAST of Cp4.1LG00g00410 vs. TAIR10
Match: AT1G78950.1 (AT1G78950.1 Terpenoid cyclases family protein)

HSP 1 Score: 841.3 bits (2172), Expect = 8.3e-244
Identity = 409/744 (54.97%), Positives = 507/744 (68.15%), Query Frame = 1

Query: 628  KLGEG-GHDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLF 687
            K+GEG G DPY F++NNF GRQTW+F+PD G+PEER  V EAR  +Y NRF V+ SSDL 
Sbjct: 5    KIGEGNGDDPYLFTTNNFAGRQTWEFDPDGGSPEERHSVVEARRIFYDNRFHVKASSDLL 64

Query: 688  WKFQFLRERNFKQTIQIVRVEDV--IDKETASIALRRATKFFAALQSPHGHWPAENAGPM 747
            W+ QFLRE+ F+Q I  V+VED   +  ETA+ ALRR   FF+ALQ+  GHWPAENAGP+
Sbjct: 65   WRMQFLREKKFEQRIAPVKVEDSEKVTFETATSALRRGIHFFSALQASDGHWPAENAGPL 124

Query: 748  FYFPPLVFSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYI 807
            F+ PPLVF LYITGHL  VF+ EHRKEILRY YCHQ EDGGWGLHI G S M CT  NYI
Sbjct: 125  FFLPPLVFCLYITGHLDEVFTSEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTTLNYI 184

Query: 808  QLR---------------------------------------LLGE-ELDKDECFRARKW 867
             +R                                       +LG  +           W
Sbjct: 185  CMRILGESPDGGHDNACGRAREWILSHGGVTYIPSWGKTWLSILGVFDWSGSNPMPPEFW 244

Query: 868  ILDHGGAIYIPSWERFGSRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSS 927
            IL     ++ P+      R+  LPM+YL+GKRFVGP+T LILQLR+E+Y QPY  I W  
Sbjct: 245  ILPSFFPVH-PAKMWSYCRMVYLPMSYLYGKRFVGPITSLILQLRKELYLQPYEEINWMK 304

Query: 928  TRHYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTF-KTIRNRALQIAKCYIDYEDHH 987
             RH CAKED  + R L Q+L WD+L    EP L  W F K +R +ALQ+A  +I YED +
Sbjct: 305  VRHLCAKEDTYYPRPLVQELVWDSLYIFAEPFLARWPFNKLLREKALQLAMKHIHYEDEN 364

Query: 988  SHYITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAF 1047
            S YITIGCVEK L  LA W++DP+G+ +KKH++RI DYLWM        S+GSQ WD  F
Sbjct: 365  SRYITIGCVEKVLCMLACWVEDPNGDYFKKHLSRISDYLWMAEDGMKMQSFGSQLWDTGF 424

Query: 1048 AIQAMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQ 1107
            A+QA+LA+NL  E S+ L++GH+FIK SQV ENPSGD++SMYRHISKG+WTFSDRDHGWQ
Sbjct: 425  AMQALLASNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSDRDHGWQ 484

Query: 1108 VSDCTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPS 1167
            VSDCTA  L CCL FS +  +IVG   +P+   ++VN +LSLQ+KNGG++AWEP GA P 
Sbjct: 485  VSDCTAHGLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQSKNGGMTAWEPAGA-PK 544

Query: 1168 WFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEET 1227
            W ELLNP E     V+E EY ECTSS+IQAL LF +L+P+HR  EI  F+ KA +YLE  
Sbjct: 545  WLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLYPDHRTTEITAFIKKAAEYLENM 604

Query: 1228 QKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESH 1287
            Q  DGSW+GNWG+C  Y T+FA+ GL A G T+++   IRKGV+FLL  Q  +GGWGES+
Sbjct: 605  QTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEAIRKGVQFLLAAQKDNGGWGESY 664

Query: 1288 VSCMQKKYIPLPRNSSNLVQTSFALMALIHSHQEKRDPTPLHHGAKLLINSQLENGDYPQ 1320
            +SC +K YI      SN+VQT++ALM LIHS Q +RDP PLH  AKL+INSQLE+GD+PQ
Sbjct: 665  LSCSKKIYIAQVGEISNVVQTAWALMGLIHSGQAERDPIPLHRAAKLIINSQLESGDFPQ 724

BLAST of Cp4.1LG00g00410 vs. TAIR10
Match: AT1G78970.2 (AT1G78970.2 lupeol synthase 1)

HSP 1 Score: 839.3 bits (2167), Expect = 3.2e-243
Identity = 378/620 (60.97%), Positives = 472/620 (76.13%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           +F LYI GHL  +F   HRKE+LR+ YCHQN+DGGWGL I  +S MFCTV NYI LR+LG
Sbjct: 131 IFCLYITGHLEEVFDAEHRKEMLRHIYCHQNEDGGWGLHIESKSVMFCTVLNYICLRMLG 190

Query: 67  EEPDKDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLLPFI 126
           E P++D C RAR+WILD GG I+ PSWGK WLSILGVY+W G NP PPE  +L   LP  
Sbjct: 191 ENPEQDACKRARQWILDRGGVIFIPSWGKFWLSILGVYDWSGTNPTPPELLMLPSFLPIH 250

Query: 127 PRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAKEDK 186
           P  +LCYSR+  +PMSYL+GKRFVGP+TPLIL LR+E+Y +PY  I W  +R   AKED 
Sbjct: 251 PGKILCYSRMVSIPMSYLYGKRFVGPITPLILLLREELYLEPYEEINWKKSRRLYAKEDM 310

Query: 187 CFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIGCIE 246
            +   L Q L  + LQ F EP+L  W   K +R +ALQ+  + I YED NSHYITIGC+E
Sbjct: 311 YYAHPLVQDLLSDTLQNFVEPLLTRWPLNKLVREKALQLTMKHIHYEDENSHYITIGCVE 370

Query: 247 KPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLATNL 306
           K L  L CWV++P+G+ +KKH+ARI DY+W+ EDGMKMQS+G Q WD  F+IQ +LA+NL
Sbjct: 371 KVLCMLACWVENPNGDYFKKHLARIPDYMWVAEDGMKMQSFGCQLWDTGFAIQALLASNL 430

Query: 307 HHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAENLL 366
             E  + LK+GH++IK SQVRENPS DFR+MYR+ISKG+WTFSDRDHGWQ+SDCTAE L 
Sbjct: 431 PDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSDRDHGWQVSDCTAEALK 490

Query: 367 CCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPVEF 426
           CCL+ S MS++IVG  ++ +  Y++VN++LSLQ+ NGGV+AWEP+     W ELLNP EF
Sbjct: 491 CCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEPSRAY-KWLELLNPTEF 550

Query: 427 LEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWHGY 486
           +  T++E E+VECTSS IQAL LFRKL+P+HRKKEI   + K V+++++ Q  DGSW+G 
Sbjct: 551 MANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQFIQDNQTPDGSWYGN 610

Query: 487 WGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKYIP 546
           WG+C+ YAT+FA+ GL A G TYN+   +R GV FLL  Q  DGGWGESY+SC +++YIP
Sbjct: 611 WGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGGWGESYLSCSEQRYIP 670

Query: 547 LPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFMNT 606
             G  SNLVQTS+A+MALIH+ Q +RD  PLHRAAKL+INSQL+NGD+ QQ+I GAFMNT
Sbjct: 671 SEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLENGDFPQQEIVGAFMNT 730

Query: 607 CTLHYGLYRNVFPLWALAEY 626
           C LHY  YRN FPLWALAEY
Sbjct: 731 CMLHYATYRNTFPLWALAEY 749

BLAST of Cp4.1LG00g00410 vs. TAIR10
Match: AT1G78960.1 (AT1G78960.1 lupeol synthase 2)

HSP 1 Score: 837.4 bits (2162), Expect = 1.2e-242
Identity = 373/623 (59.87%), Positives = 470/623 (75.44%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF  YI GHL  IF   HRKE+LR+ YCHQN+DGGWGL I G+S MFCTV NYI LR+LG
Sbjct: 131 VFCFYITGHLEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTVLNYICLRMLG 190

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E P+    + C RAR+WILDHGG  Y PSWGKIWLSILG+Y+W G NPMPPE WLL    
Sbjct: 191 EGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPSFF 250

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P      LCY+R+  +PMSYL+GKRFVGPLTPLI+ LR+E++ QPY  I W+  R  CAK
Sbjct: 251 PIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEINWNKARRLCAK 310

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAFKT-IRNRALQIAQRRIDYEDHNSHYITIG 246
           ED  +   L Q L W+ L  F EPIL +W  K  +R +AL++A   I YED NSHYITIG
Sbjct: 311 EDMIYPHPLVQDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYEDENSHYITIG 370

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CW+++P+G+ +KKH+ARI D++W+ EDG+KMQS+GSQ WD  F+IQ +LA
Sbjct: 371 CVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAEDGLKMQSFGSQLWDTVFAIQALLA 430

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
            +L  E  + L+KGH FIK+SQVRENPS DF++MYR+ISKG+WT SDRDHGWQ+SDCTAE
Sbjct: 431 CDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDHGWQVSDCTAE 490

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
            L CC++ S M + +VG  ++P+  Y++VN++LSLQ + GG++AWEP      W ELLNP
Sbjct: 491 ALKCCMLLSMMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRA-QEWLELLNP 550

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            +F    + E EYVECTS+ IQALVLF++L+P+HR KEI   + KGV+++E  Q  DGSW
Sbjct: 551 TDFFTCVMAEREYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIESKQTPDGSW 610

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           HG WGIC+ YAT+FA+ GL A G TY +   +R+GV+FLL IQ  DGGWGES++SC +++
Sbjct: 611 HGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDGGWGESHLSCPEQR 670

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           YIPL GN SNLVQT++A+M LIH+ Q +RDPTPLHRAAKL+I SQL+NGD+ QQ+I G F
Sbjct: 671 YIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHRAAKLIITSQLENGDFPQQEILGVF 730

Query: 607 MNTCTLHYGLYRNVFPLWALAEY 626
           MNTC LHY  YRN+FPLWALAEY
Sbjct: 731 MNTCMLHYATYRNIFPLWALAEY 752

BLAST of Cp4.1LG00g00410 vs. TAIR10
Match: AT1G66960.1 (AT1G66960.1 Terpenoid cyclases family protein)

HSP 1 Score: 789.6 bits (2038), Expect = 2.9e-228
Identity = 351/623 (56.34%), Positives = 456/623 (73.19%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF LYI GHL  +F   HRKE+LRY YCHQN+DGGWG  I  +S MF T  NYI LR+LG
Sbjct: 131 VFCLYITGHLEEVFDAEHRKEMLRYIYCHQNEDGGWGFHIESKSIMFTTTLNYICLRILG 190

Query: 67  EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
             PD   ++ C RAR+WIL HGG IY P WGK+WLS+LG+Y+W G NPMPPE WLL   L
Sbjct: 191 VGPDGGLENACKRARQWILSHGGVIYIPCWGKVWLSVLGIYDWSGVNPMPPEIWLLPYFL 250

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P        Y+R+T +P+SYL+GK+FVG +TPLI+QLR+E++ QPY  I W+  RH CAK
Sbjct: 251 PIHLGKAFSYTRITYMPISYLYGKKFVGQITPLIMQLREELHLQPYEEINWNKARHLCAK 310

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           EDK +   L Q L W+AL  F EP+L SW   K +R +ALQ+A + I YED NSHYITIG
Sbjct: 311 EDKYYPHPLVQDLIWDALHTFVEPLLASWPINKLVRKKALQVAMKHIHYEDENSHYITIG 370

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           CIEK L  L CW+D+P G  +KKH++RI D +W+ EDGMKMQ +GSQ W   F++Q +LA
Sbjct: 371 CIEKNLCMLACWIDNPDGNHFKKHLSRIPDMMWVAEDGMKMQCFGSQLWMTGFAVQALLA 430

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           ++   E  + L++ HD+IK+SQVR+NPS DF++MYR+ISKG WT SDRDHGWQ+SDCTAE
Sbjct: 431 SDPRDETYDVLRRAHDYIKKSQVRDNPSGDFKSMYRHISKGGWTLSDRDHGWQVSDCTAE 490

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
              CC++ STM ++I G+ +  +  Y++VN++LSLQ++NGG +AWEP      W EL+NP
Sbjct: 491 AAKCCMLLSTMPTDITGEKINLEQLYDSVNLMLSLQSENGGFTAWEPVRAY-KWMELMNP 550

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            +     + E EY ECTS+ +QALV+F +L+P+HR KEI   + K V+++E  Q  DGSW
Sbjct: 551 TDLFANAMTEREYTECTSAVLQALVIFNQLYPDHRTKEITKSIEKAVQFIESKQLRDGSW 610

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WGIC+TY T+FA+ GL A G TYNN  ++R GV FLL IQ  DGGWGESY+SC +++
Sbjct: 611 YGSWGICFTYGTWFALCGLAAIGKTYNNCLSMRDGVHFLLNIQNEDGGWGESYMSCPEQR 670

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           YIPL GN SN+VQT++A+MALIH+ Q KRD  PLH AAK +I SQL+NGD+ QQ++ GA 
Sbjct: 671 YIPLEGNRSNVVQTAWAMMALIHAGQAKRDLIPLHSAAKFIITSQLENGDFPQQELLGAS 730

Query: 607 MNTCTLHYGLYRNVFPLWALAEY 626
           M+TC LHY  Y+++FP WALAEY
Sbjct: 731 MSTCMLHYSTYKDIFPPWALAEY 752
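
The percentages in each HSP summary line follow directly from the counts: identical (or positive) positions divided by the alignment length. The following is a minimal sketch, not part of the original record, that parses such a line and recomputes both percentages. The names `HSP_STATS`, `parse_hsp_stats` and `pct` are hypothetical helpers introduced for illustration, and the pattern assumes the line layout shown in this record, accepting both the "Postives" and "Positives" spellings.

```python
import re

# Assumed layout of the HSP summary line as printed in this record.
# "Posi?tives" accepts both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Summary line of the AT1G66960.1 hit above.
line = "Identity = 351/623 (56.34%), Postives = 456/623 (73.19%), Query Frame = 1"
stats = parse_hsp_stats(line)
recomputed_identity = pct(stats["identities"], stats["length"])
recomputed_positive = pct(stats["positives"], stats["length"])
```

For the AT1G66960.1 hit, the recomputed values agree with the reported 56.34% identity and 73.19% positives.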

BLAST of Cp4.1LG00g00410 vs. NCBI nr
Match: gi|3152599|gb|AAC17080.1| (Strong similarity to lupeol synthase gb|U49919 and cycloartenol synthase gb|U02555 from A. thaliana (the third gene with similar homology) [Arabidopsis thaliana])

HSP 1 Score: 1620.9 bits (4196), Expect = 0.0e+00
Identity = 784/1426 (54.98%), Positives = 969/1426 (67.95%), Query Frame = 1

Query: 7    VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
            VF LY+ GHL  IF++ HR+E+LRY YCHQN+DGGWGL I G S MFCT  NYI +R+LG
Sbjct: 131  VFCLYVTGHLHEIFTQDHRREVLRYIYCHQNEDGGWGLHIEGNSTMFCTTLNYICMRILG 190

Query: 67   EEPDK---DECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
            E P+    + C RAR WILDHGGA Y PSWGK WLSILGV++W G+NPMPPEFW+L   L
Sbjct: 191  EGPNGGPGNACKRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMPPEFWILPSFL 250

Query: 127  PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
            P  P  + CY RL  +PMSYL+GKRFVGP++PLILQLR+EIY QPY  I W+  RH CAK
Sbjct: 251  PIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKINWNRARHLCAK 310

Query: 187  EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
            ED        Q + WN L  F EP L  W F K +R +AL +A + I YED NS YITIG
Sbjct: 311  EDAYCPHPQIQDVIWNCLYIFTEPFLACWPFNKLLREKALGVAMKHIHYEDENSRYITIG 370

Query: 247  CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
            C+EK L  L CWV+DP+G  +KKH+ RI DYLWI EDGMKMQS+GSQ WD  F++Q ++A
Sbjct: 371  CVEKALCMLACWVEDPNGIHFKKHLLRISDYLWIAEDGMKMQSFGSQLWDSGFALQALVA 430

Query: 307  TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
            +NL +E  + L++G+DF+K SQVRENPS DF NMYR+ISKGSWTFSDRDHGWQ SDCTAE
Sbjct: 431  SNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSDRDHGWQASDCTAE 490

Query: 367  NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
            +  CCL+ S +  +IVG  M+P+  YEAV I+LSLQ+KNGGV+AWEP  G   W ELLNP
Sbjct: 491  SFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG-QEWLELLNP 550

Query: 427  VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
             E     ++E EY ECTSS+IQAL+LF++L+PNHR +EI T + K V+Y+E  Q  DGSW
Sbjct: 551  TEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQYIESIQMLDGSW 610

Query: 487  HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
            +G WG+C+TY+T+F + GL A G TYNN   +R+GV FLL  Q  +GGWGESY+SC +K+
Sbjct: 611  YGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGESYLSCPKKR 670

Query: 547  YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
            YIP  G  SNLVQTS+A+M L+H+ Q +RDP+PLHRAAKLLINSQL+NGD+ QQ+I+GAF
Sbjct: 671  YIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLENGDFPQQEITGAF 730

Query: 607  MNTCTLHYGLYRNVFPLWALAEY-------YNK---------------------LGEG-G 666
            M  C LHY  YRN+FP+WALAEY       Y K                     +GEG G
Sbjct: 731  MKNCLLHYAAYRNIFPVWALAEYRRRVPLPYEKPSTERRKMRYVAHIFVMWRLKIGEGNG 790

Query: 667  HDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLR 726
             DPY F++NNF GRQTW+F+PD G+PEER  V EAR  +Y NRF V+ SSDL W+ QFLR
Sbjct: 791  DDPYLFTTNNFAGRQTWEFDPDGGSPEERHSVVEARRIFYDNRFHVKASSDLLWRMQFLR 850

Query: 727  ERNFKQTIQIVRVEDV--IDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLV 786
            E+ F+Q I  V+VED   +  ETA+ ALRR   FF+ALQ+  GHWPAENAGP+F+ PPLV
Sbjct: 851  EKKFEQRIAPVKVEDSEKVTFETATSALRRGIHFFSALQASDGHWPAENAGPLFFLPPLV 910

Query: 787  FSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLR---- 846
            F LYITGHL  VF+ EHRKEILRY YCHQ EDGGWGLHI G S M CT  NYI +R    
Sbjct: 911  FCLYITGHLDEVFTSEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTTLNYICMRILGE 970

Query: 847  -----------------------------------LLGE-ELDKDECFRARKWILDHGGA 906
                                               +LG  +           WIL     
Sbjct: 971  SPDGGHDNACGRAREWILSHGGVTYIPSWGKTWLSILGVFDWSGSNPMPPEFWILPSFFP 1030

Query: 907  IYIPSWERFGSRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSSTRHYCAK 966
            ++ P+      R+  LPM+YL+GKRFVGP+T LILQLR+E+Y QPY  I W   RH CAK
Sbjct: 1031 VH-PAKMWSYCRMVYLPMSYLYGKRFVGPITSLILQLRKELYLQPYEEINWMKVRHLCAK 1090

Query: 967  EDKCFERSLFQKLAWDALQYCGEPILNSWTF-KTIRNRALQIAKCYIDYEDHHSHYITIG 1026
            ED  + R L Q+L WD+L    EP L  W F K +R +ALQ+A  +I YED +S YITIG
Sbjct: 1091 EDTYYPRPLVQELVWDSLYIFAEPFLARWPFNKLLREKALQLAMKHIHYEDENSRYITIG 1150

Query: 1027 CVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAFAIQAMLA 1086
            CVEK L  LA W++DP+G+ +KKH++RI DYLWM        S+GSQ WD  FA+QA+LA
Sbjct: 1151 CVEKVLCMLACWVEDPNGDYFKKHLSRISDYLWMAEDGMKMQSFGSQLWDTGFAMQALLA 1210

Query: 1087 TNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQVSDCT-- 1146
            +NL  E S+ L++GH+FIK SQV ENPSGD++SMYRHISKG+WTFSDRDHGWQVSDCT  
Sbjct: 1211 SNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSDRDHGWQVSDCTAH 1270

Query: 1147 ---------------------------AENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNF 1206
                                       + N+L  L+ S +   I     +   F    +F
Sbjct: 1271 GLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQVSIIRHRIEPTSSKKHPFSGRDSF 1330

Query: 1207 ILSLQAKNGGVSAWEPTGAVPSWFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLF 1266
               LQ+KNGG++AWEP GA P W ELLNP E     V+E EY ECTSS+IQAL LF +L+
Sbjct: 1331 TC-LQSKNGGMTAWEPAGA-PKWLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLY 1390

Query: 1267 PNHRKKEIETFLSKAIKYLEETQKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSST 1320
            P+HR  EI  F+ KA +YLE  Q  DGSW+GNWG+C  Y T+FA+ GL A G T+++   
Sbjct: 1391 PDHRTTEITAFIKKAAEYLENMQTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEA 1450

BLAST of Cp4.1LG00g00410 vs. NCBI nr
Match: gi|802601529|ref|XP_012073072.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105634775 [Jatropha curcas])

HSP 1 Score: 1588.2 bits (4111), Expect = 0.0e+00
Identity = 759/1381 (54.96%), Positives = 958/1381 (69.37%), Query Frame = 1

Query: 7    VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
            VF LYI G+L  I +E HRK+IL Y YCHQN+DGGWGL I G S MFC+V NYI +R+L 
Sbjct: 1585 VFVLYITGYLNTIITEEHRKQILNYIYCHQNEDGGWGLHIEGHSTMFCSVLNYICMRMLA 1644

Query: 67   EEPD--KDE-CFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
            E PD  KD  C RA+KWILDH  AI  PSWGKIWLSILG+Y+W G NP+PPEFW +    
Sbjct: 1645 EGPDGGKDNSCERAKKWILDHDSAIAIPSWGKIWLSILGLYDWYGTNPVPPEFWAIPSFF 1704

Query: 127  PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
            P  P  +LCY R+T LPMSY++GKRFV P+TPLILQLR+EI+ QPY  I W   RH CAK
Sbjct: 1705 PIHPAKMLCYCRMTYLPMSYIYGKRFVAPVTPLILQLREEIFNQPYEKIDWRSVRHLCAK 1764

Query: 187  EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
            ED  + R+  Q L W+ L  F EP+LNSW F  +R +AL      + YED  S YITIG 
Sbjct: 1765 EDLYYPRTFVQTLLWDVLYNFAEPLLNSWPFNKLREKALNQTMLHLHYEDEVSRYITIGS 1824

Query: 247  IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
            IEKPLF L CW++D +G+ +KKH+AR  D+LW+ EDGM  QS+GSQ WD +F++Q ++A 
Sbjct: 1825 IEKPLFMLACWIEDANGDYFKKHLARFSDFLWVAEDGMTAQSFGSQGWDTSFALQALIAC 1884

Query: 307  NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
            NL +E   TL++GH FIK SQV ENP  DFR M+R+ISKG WTFSDRDHGWQ+SD TAE 
Sbjct: 1885 NLTNEIGPTLREGHSFIKNSQVSENPPGDFRRMFRHISKGGWTFSDRDHGWQVSDTTAEG 1944

Query: 367  LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
            L CCL+FS M    VG+ +EP+  ++AVN+ILSLQ+KNGG+ AWEP      W E LNP 
Sbjct: 1945 LTCCLLFSMMPPETVGEKLEPEKLFDAVNVILSLQSKNGGLPAWEPAPPT-FWMEWLNPT 2004

Query: 427  EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
            E  E  ++E E+VECTSSSI AL LF+KL+P+HRKKEI+ F++  V+++++ Q+ DGSW+
Sbjct: 2005 ELFEDAVIEHEHVECTSSSIYALALFKKLYPDHRKKEIEDFIANAVQFIQQIQRPDGSWY 2064

Query: 487  GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
            G WGIC+TY T+FA++GL A G TY N S +RRGV FL       G    SYI  +Q+ Y
Sbjct: 2065 GNWGICFTYGTWFALRGLAAAGKTYFNCSAVRRGVNFLHX----KGEHXASYIFWVQE-Y 2124

Query: 547  IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
            +PL G  SNLVQT++ALM LIH  Q + DP PLH AAKLLINSQ + GD+ QQ+I G +M
Sbjct: 2125 VPLEGERSNLVQTAWALMGLIHGGQGEIDPAPLHLAAKLLINSQTELGDFPQQEILGVYM 2184

Query: 607  NTCTLHYGLYRNVFPLWALAEYYN-------------KLGEGGHDPYFFSSNNFVGRQTW 666
              C +HY  YRN+FPLWALAEY               K+ EGGHDPY +S+NNF+GRQ W
Sbjct: 2185 KNCMVHYAAYRNIFPLWALAEYRQHFSFSSKSNMWRLKIAEGGHDPYIYSTNNFLGRQIW 2244

Query: 667  DFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFKQTIQIVRVE--D 726
            +F+P+ GTPEERAEVEEA  N+++NRF+V+ ++DL W+ QFLRE+NFKQ I IV+V+  +
Sbjct: 2245 EFDPNAGTPEERAEVEEACQNFWKNRFQVKPNADLLWQKQFLREKNFKQEIPIVKVKGFE 2304

Query: 727  VIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYITGHLHIVFSEEH 786
             +  ET + ALRR+   F+ALQ+  GHWPA N+G + + PPLVF LYITG+L+ + +EEH
Sbjct: 2305 EMTHETVTAALRRSVHVFSALQARDGHWPATNSGSLSFLPPLVFVLYITGYLNTIITEEH 2364

Query: 787  RKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEELD---KDECFRARKWI 846
            RK+IL Y YCHQNEDGGWGLHI G S M CTV NYI +R+LGE  D    + C RARKWI
Sbjct: 2365 RKQILNYIYCHQNEDGGWGLHIEGHSIMFCTVLNYICMRMLGEGPDGGKDNSCERARKWI 2424

Query: 847  LDHGGAIYIPSWERFGSRLTLLPMTYLFGKRFVG----------PLTPLILQLRQEIYTQ 906
            LDHG AI IPSW +    L++L +   +G   V           P+ P  +     +   
Sbjct: 2425 LDHGSAIAIPSWGKIW--LSVLGLYDWYGTNPVPPEFWAIPSYFPIHPAKMLCYCRMTYL 2484

Query: 907  PYNHIKWSSTR----------------------------HYCAKEDKCFERSLFQKLAWD 966
            P ++I                                  H CAKED  + R+  Q L WD
Sbjct: 2485 PISYIYGKRFVAPVTPLILQLREEIFNQPYEKIDWRSVRHLCAKEDLYYPRTFVQTLLWD 2544

Query: 967  ALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHHSHYITIGCVEKPLFTLASWIDDPH 1026
             L    EP+LNSW F  +R +AL     ++ YED  S YITIG +EKPLF LA WI+D +
Sbjct: 2545 VLYNFAEPLLNSWPFNKLREKALNQTMLHLHYEDEVSRYITIGSIEKPLFMLACWIEDAN 2604

Query: 1027 GEAYKKHVARIKDYLWM--------SYGSQSWDVAFAIQAMLATNLHHENSETLKKGHDF 1086
            G+ +KKH+AR  D+LW+        S+GSQ WD +FA+QA++A +L +E   TL++GH F
Sbjct: 2605 GDYFKKHLARFSDFLWVAEDGMTAQSFGSQGWDTSFALQALVACSLTNEIGPTLREGHSF 2664

Query: 1087 IKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQVSDCTAENLLCCLRFSTMPSNIVG 1146
            IK SQV ENP GDFR M+RHISKG WTFSDRDHGWQVSD TAE L CCL FS MP   VG
Sbjct: 2665 IKNSQVSENPPGDFRRMFRHISKGGWTFSDRDHGWQVSDTTAEGLTCCLLFSMMPPETVG 2724

Query: 1147 DPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPS-WFELLNPVEFLEYTVLEREYVEC 1206
            + +EP+  F+AVN ILSLQ+KNGG++AWEP  A P+ W E LNP E  E  V+E E+VEC
Sbjct: 2725 EKLEPEKLFDAVNVILSLQSKNGGLAAWEP--APPTFWMEWLNPTELFEDAVIEHEHVEC 2784

Query: 1207 TSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEETQKEDGSWFGNWGVCHIYATYFAI 1266
            TSSSI AL LF KL+P+HRKKEIE F++ A++++++ Q+ DGSW+GNWG+C  Y T+FA+
Sbjct: 2785 TSSSIYALALFKKLYPDHRKKEIEDFIANAVQFIQQIQRPDGSWYGNWGICFTYGTWFAL 2844

Query: 1267 KGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESHVSCMQKKYIPLPRNSSNLVQTSF 1320
            +GL A G TY+N S +R+GV FLLK+Q  DGGWGES++SC  K+Y+PL    SNLVQT++
Sbjct: 2845 RGLAAAGKTYYNCSAVRRGVNFLLKLQKEDGGWGESYLSCPNKEYVPLEGERSNLVQTAW 2904

BLAST of Cp4.1LG00g00410 vs. NCBI nr
Match: gi|976907938|gb|KVH95175.1| (Prenyltransferase/squalene oxidase [Cynara cardunculus var. scolymus])

HSP 1 Score: 1323.1 bits (3423), Expect = 0.0e+00
Identity = 649/1267 (51.22%), Positives = 806/1267 (63.61%), Query Frame = 1

Query: 7    VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
            V  L+I GHL  IF   HRKEILRY YCHQN+DGGWG  I G S MF +  +YI +RLLG
Sbjct: 813  VICLFITGHLNDIFPSEHRKEILRYLYCHQNEDGGWGFHIEGHSTMFGSTLSYICMRLLG 872

Query: 67   EEPD---KDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
            E PD      C +ARKWILDHGGA   P+WGK WLSILGV EW G NPMPPEFW+L   L
Sbjct: 873  EGPDGGLNGACTKARKWILDHGGATAIPAWGKTWLSILGVCEWAGNNPMPPEFWILPSFL 932

Query: 127  PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
            P  P  + CY RL  +PMSYL+GKRFVGP+TPL+LQLR E+Y QPYN I     RH CAK
Sbjct: 933  PMHPAKMWCYCRLVYMPMSYLYGKRFVGPITPLVLQLRDELYAQPYNKINRKSIRHLCAK 992

Query: 187  EDKCFERSLFQKLAWNALQYFGEPILNSWAFKTIRNRALQIAQRRIDYEDHNSHYITIGC 246
            ED     S  Q+L W++L  F EP+L  W F  +R +ALQ   + I YED NS YITIG 
Sbjct: 993  EDLYHPHSSLQELLWDSLYIFTEPLLTHWPFNKLREKALQTTMKHIHYEDENSRYITIGA 1052

Query: 247  IEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLAT 306
            +EK L  L CWV+DP+G  +KKH+ARI DY+W+ EDGMKMQ   SQ WD +  +Q +LAT
Sbjct: 1053 VEKALCMLSCWVEDPNGVCFKKHLARIPDYIWVAEDGMKMQGTNSQVWDASLVVQALLAT 1112

Query: 307  NLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAEN 366
            +L HE   TLKKGHDFI  SQV++NPS DF +M+R+ISKGSWTF+D+DHGWQ+SDCTAE 
Sbjct: 1113 DLPHEIGPTLKKGHDFINASQVKDNPSGDFESMHRHISKGSWTFADQDHGWQVSDCTAEG 1172

Query: 367  LLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNPV 426
            L CCL+ S M   IVG  M P+    AV+++LSLQ+KNGG+  WEP G    W E+LNP 
Sbjct: 1173 LKCCLLLSMMPPEIVGKKMAPEQLNNAVDVLLSLQSKNGGLPGWEPAGS-SKWLEILNPT 1232

Query: 427  EFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSWH 486
            EF    ++E EY ECTSS+IQALVLF+K +P HR KEI +FL+   +YLE+ Q  DGSW+
Sbjct: 1233 EFFVDIVIEHEYTECTSSAIQALVLFKKSYPEHRSKEIDSFLTVAGEYLEKMQMSDGSWY 1292

Query: 487  GYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKKY 546
            G WG+C+TYAT+FA+ GL A G TY N   + + V FLLK Q  DGGWGESY SC +K  
Sbjct: 1293 GNWGVCFTYATWFALGGLAAIGKTYENCPAIGKAVNFLLKTQREDGGWGESYQSCTKK-- 1352

Query: 547  IPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAFM 606
                                      KRDPTPLH+AAKLLINSQ +NGD+SQQ+ SG F 
Sbjct: 1353 -------------------------AKRDPTPLHKAAKLLINSQTRNGDFSQQETSGVFK 1412

Query: 607  NTCTLHYGLYRNVFPLWALAEY-------------YNKLGEGGHDPYFFSSNNFVGRQTW 666
              C LHY LYR++FP+WALA Y               K+  G ++PY +S+NNFVGRQTW
Sbjct: 1413 QNCLLHYALYRDIFPMWALAAYSVVAISNLDEKMWRLKIANGVNNPYLYSTNNFVGRQTW 1472

Query: 667  DFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFKQTIQIVRVED-- 726
            +F+P+ GTPEER E+E+AR +++ +R +V+ SSD+ W+ QFLRE+ FKQTI  V++ED  
Sbjct: 1473 EFDPNYGTPEERNEIEKARLHFWDHRHEVKPSSDVLWRMQFLREKQFKQTIAQVKIEDGE 1532

Query: 727  VIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYITGHLHIVFSEEH 786
             I+ E  +  LRR+   FAALQ+  GHWPAENAGPM++  PL                  
Sbjct: 1533 DINYEKVTTTLRRSVHLFAALQAEDGHWPAENAGPMYFIQPL------------------ 1592

Query: 787  RKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEELD---KDECFRARKWI 846
                        NEDGGWG HI G S M  T  +YI +RLLGE  +      C +ARKWI
Sbjct: 1593 ------------NEDGGWGFHIEGHSTMFGTTLSYICMRLLGEGPBGGLNGACTKARKWI 1652

Query: 847  LDHGGAIYIPSWERFGSRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPY--NHIKWS 906
            LDHG A  IPSW                GK +          L +++Y   +   H+ W 
Sbjct: 1653 LDHGSATTIPSW----------------GKTW----------LSEDLYYPXHLLQHLMWD 1712

Query: 907  STRHYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHH 966
            S                        L    EP+L  W F  +R +AL+    +I YED +
Sbjct: 1713 S------------------------LYIFTEPLLTHWPFNKLREKALETTMKHIHYEDEN 1772

Query: 967  SHYITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAF 1026
            S YITIG VEK L  LA W++DP+G  +KKH+ARI DY+W+        ++GSQ+WD +F
Sbjct: 1773 SRYITIGSVEKALCMLACWVEDPNGVCFKKHLARIPDYIWLAEDGMKMQTFGSQAWDASF 1832

Query: 1027 AIQAMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQ 1086
            AIQA+LA++L +E   TLKKGHDFIK SQV++NPSGDF+SM+RHISKGSWTFSD+DHGWQ
Sbjct: 1833 AIQALLASDLINEIGPTLKKGHDFIKDSQVKDNPSGDFKSMHRHISKGSWTFSDQDHGWQ 1892

Query: 1087 VSDCTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPS 1146
            VSD TAE L+CCL  S MP   VG  MEP+    AVN ILS+                  
Sbjct: 1893 VSDSTAEGLMCCLLLSMMPPEFVGKKMEPEQLNNAVNVILSM------------------ 1951

Query: 1147 WFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEET 1206
               +LNP EF    V+E EY+ECTSS IQAL LF   +P HR KEI++ L+KA +Y+E+ 
Sbjct: 1953 --PILNPTEFFADIVIEHEYIECTSSVIQALALFKNSYPEHRSKEIDSLLTKAGEYIEKM 1951

Query: 1207 QKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESH 1243
            Q  DGSW+GNWG+C  YAT+FA+ GL A G TY N   + K V FLLK Q  DGGWGES+
Sbjct: 2013 QMSDGSWYGNWGICFTYATWFALGGLAAIGKTYENCQAVGKAVNFLLKTQLKDGGWGESY 1951

BLAST of Cp4.1LG00g00410 vs. NCBI nr
Match: gi|778708541|ref|XP_011656229.1| (PREDICTED: lupeol synthase-like [Cucumis sativus])

HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 523/744 (70.30%), Positives = 603/744 (81.05%), Query Frame = 1

Query: 628  KLGEGGHDPYFFSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFW 687
            K+G+G ++ Y FSSNNFVGRQTW F+ ++GT +E+A++EEAR +YYQNR  V CSSD  W
Sbjct: 5    KMGKGENESYLFSSNNFVGRQTWVFKANEGTHQEQAQIEEARLSYYQNRLNVPCSSDFLW 64

Query: 688  KFQFLRERNFKQTIQIVRVEDVID--------KETASIALRRATKFFAALQSPHGHWPAE 747
            +FQFLRE+ F+QTI  VRV +  D        KETAS A+RRAT  FAALQS HGHWPAE
Sbjct: 65   QFQFLREKKFRQTIPKVRVNEGRDGDEEIRITKETASNAMRRATNLFAALQSDHGHWPAE 124

Query: 748  NAGPMFYFPPLVFSLYITGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCT 807
            N+GP+FYFPPLVF+LYITGHL I+F+EE+RKEILRYAYCHQNEDGGWGL+IVG+SCMLCT
Sbjct: 125  NSGPLFYFPPLVFALYITGHLGIIFTEEYRKEILRYAYCHQNEDGGWGLNIVGESCMLCT 184

Query: 808  VFNYIQLRLLGEELDKDECFRARKWILDHGGAIYIPSWER-------------------- 867
            V NYI+LRLLGEE DK+ C R RKWILDHGGA+Y PSW +                    
Sbjct: 185  VLNYIELRLLGEEADKEACDRGRKWILDHGGALYTPSWGKIWLCILGVYEWEGTNPMPPE 244

Query: 868  ---FG-------------SRLTLLPMTYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWS 927
               FG             +RLT LPM+YL+ KRF GPLTPLILQLR EIY QPYN IKW+
Sbjct: 245  IWMFGKILPLNLGGFLCYTRLTFLPMSYLYAKRFAGPLTPLILQLRHEIYIQPYNDIKWN 304

Query: 928  STRHYCAKEDKCFERSLFQKLAWDALQYCGEPILNSWTFKTIRNRALQIAKCYIDYEDHH 987
              R++CAKEDKCFERS+ QK  WD  QY GEPI NSW F  +R+R+LQI K YI YED +
Sbjct: 305  PARNFCAKEDKCFERSILQKAVWDVFQYIGEPIFNSWPFNRLRDRSLQIVKGYIHYEDQN 364

Query: 988  SHYITIGCVEKPLFTLASWIDDPHGEAYKKHVARIKDYLWM--------SYGSQSWDVAF 1047
            SH+ITIGCVEKPLFTL SWIDDP+GE YKKH+ARIKDYLW+        S+GSQSWD AF
Sbjct: 365  SHFITIGCVEKPLFTLISWIDDPNGETYKKHLARIKDYLWVGEDGMKMQSFGSQSWDAAF 424

Query: 1048 AIQAMLATNLHHENSETLKKGHDFIKQSQVRENPSGDFRSMYRHISKGSWTFSDRDHGWQ 1107
            A+QA++ATNLHHE S+TLKKGHDFIKQSQ+RENP GDF SMYRH+SKGSWTFSDRDHGW 
Sbjct: 425  AMQAIIATNLHHEFSDTLKKGHDFIKQSQIRENPGGDFPSMYRHMSKGSWTFSDRDHGWG 484

Query: 1108 VSDCTAENLLCCLRFSTMPSNIVGDPMEPQWFFEAVNFILSLQAKNGGVSAWEPTGAVPS 1167
            VSDCTAENLLCCL+ STMPS++VG+ MEPQ FFEAVNFILSLQAKNGGVSAWEP+G +PS
Sbjct: 485  VSDCTAENLLCCLKLSTMPSHVVGEAMEPQCFFEAVNFILSLQAKNGGVSAWEPSGILPS 544

Query: 1168 WFELLNPVEFLEYTVLEREYVECTSSSIQALVLFTKLFPNHRKKEIETFLSKAIKYLEET 1227
            W E LNPVEF EYT+LEREYVECTSS+IQALVLF KLFP+HRKKEIE F+ KA  ++++ 
Sbjct: 545  WLEELNPVEFFEYTLLEREYVECTSSAIQALVLFKKLFPSHRKKEIENFIEKAENFIKQL 604

Query: 1228 QKEDGSWFGNWGVCHIYATYFAIKGLVATGNTYHNSSTIRKGVEFLLKIQCPDGGWGESH 1287
            QKEDGSW+GNWG+CHIYAT+FAIKGLVA GNTY+N   I K VEFLLKIQC DGGWGESH
Sbjct: 605  QKEDGSWYGNWGICHIYATFFAIKGLVAAGNTYNNCLEISKAVEFLLKIQCEDGGWGESH 664

Query: 1288 VSCMQKKYIPLPRNSSNLVQTSFALMALIHSHQEKRDPTPLHHGAKLLINSQLENGDYPQ 1320
            +SC ++ +  LP N+SNLVQTSFALMALIHS Q KRDPTPLH  AKLLINSQL++GDYPQ
Sbjct: 665  ISCFKRVHTHLPNNASNLVQTSFALMALIHSQQGKRDPTPLHRAAKLLINSQLDDGDYPQ 724

BLAST of Cp4.1LG00g00410 vs. NCBI nr
Match: gi|297842681|ref|XP_002889222.1| (hypothetical protein ARALYDRAFT_316793 [Arabidopsis lyrata subsp. lyrata])

HSP 1 Score: 1112.8 bits (2877), Expect = 0.0e+00
Identity = 516/864 (59.72%), Positives = 630/864 (72.92%), Query Frame = 1

Query: 7   VFSLYIMGHLRIIFSEHHRKEILRYAYCHQNQDGGWGLDIVGQSCMFCTVFNYIQLRLLG 66
           VF L++ GHL  IF++ HR+EILRY YCHQN+DGGWGL I G S MFCT  NYI +R+LG
Sbjct: 131 VFCLFVTGHLHEIFTQEHRREILRYIYCHQNEDGGWGLHIEGDSTMFCTTLNYICMRILG 190

Query: 67  EEP---DKDECFRARKWILDHGGAIYTPSWGKIWLSILGVYEWEGANPMPPEFWLLGKLL 126
           E P     + C RAR WILDHGGA Y PSWGK WLSILGV++W G+NPMPPEFW+L   L
Sbjct: 191 ESPFGGPGNACRRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMPPEFWILPSFL 250

Query: 127 PFIPRSLLCYSRLTLLPMSYLFGKRFVGPLTPLILQLRQEIYTQPYNHIKWSPTRHYCAK 186
           P  P  + CY RL  +PMSYL+GKRFVGP++PLILQLR+EIY QPY  I W+  RH CAK
Sbjct: 251 PIHPAKMWCYCRLVYMPMSYLYGKRFVGPISPLILQLREEIYLQPYAKINWNRARHLCAK 310

Query: 187 EDKCFERSLFQKLAWNALQYFGEPILNSWAF-KTIRNRALQIAQRRIDYEDHNSHYITIG 246
           ED        Q + W+ L  F EP L  W F K +R +AL +A + I YED NS YITIG
Sbjct: 311 EDAYCPHPQIQDVIWDCLYIFTEPFLTCWPFNKLLREKALGVAMKHIHYEDENSRYITIG 370

Query: 247 CIEKPLFTLVCWVDDPHGEAYKKHVARIKDYLWIGEDGMKMQSYGSQSWDVAFSIQTVLA 306
           C+EK L  L CWV+DP+G  +KKH+ RI DYLWI EDGMKMQS+GSQ WD  F++Q ++A
Sbjct: 371 CVEKALCMLACWVEDPNGSHFKKHLLRISDYLWIAEDGMKMQSFGSQLWDSGFALQALVA 430

Query: 307 TNLHHEFSETLKKGHDFIKQSQVRENPSSDFRNMYRYISKGSWTFSDRDHGWQISDCTAE 366
           ++L +E  + L++G+DF+K SQVRENPS DF NM+R+ISKGSWTFSDRDHGWQ SDCTAE
Sbjct: 431 SDLANEIPDVLRRGYDFLKNSQVRENPSGDFTNMFRHISKGSWTFSDRDHGWQASDCTAE 490

Query: 367 NLLCCLIFSTMSSNIVGDPMEPQWFYEAVNIILSLQAKNGGVSAWEPTGGVPSWFELLNP 426
              CCL+ S M  +IVG  M+P+  YEAV I+LSLQ+KNGGV+AWEP  G   W ELLNP
Sbjct: 491 GFKCCLLLSMMPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEPARG-QEWLELLNP 550

Query: 427 VEFLEYTILELEYVECTSSSIQALVLFRKLFPNHRKKEIKTFLSKGVKYLEETQKEDGSW 486
            E     ++E EY ECTSS+IQAL+LF++L+PNHR  EI T + K V+Y+E  Q  DGSW
Sbjct: 551 TEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTAEINTSIKKAVQYIESIQMHDGSW 610

Query: 487 HGYWGICYTYATYFAIKGLVATGNTYNNSSTLRRGVEFLLKIQCPDGGWGESYISCMQKK 546
           +G WG+C+TY+T+F + GL A G TYNN   +R+GV FLL  Q  +GGWGESY+SC +K+
Sbjct: 611 YGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGGWGESYLSCPKKR 670

Query: 547 YIPLPGNSSNLVQTSFALMALIHSQQEKRDPTPLHRAAKLLINSQLKNGDYSQQKISGAF 606
           YIP  G+ SNLVQTS+A+M L+H+ Q +RDP PLHRAAKLLINSQL+NGD+ QQ+I+GAF
Sbjct: 671 YIPSEGDRSNLVQTSWAMMGLLHAGQAERDPAPLHRAAKLLINSQLENGDFPQQEITGAF 730

Query: 607 MNTCTLHYGLYRNVFPLWALAEYYN-----------------------KLGEG-GHDPYF 666
           M  C LHY  YRN+FP+WALAEY                         K+GEG G DPY 
Sbjct: 731 MKNCLLHYAAYRNIFPVWALAEYRRRVPLPYENLEQREELCSFVMWRLKIGEGSGDDPYL 790

Query: 667 FSSNNFVGRQTWDFEPDDGTPEERAEVEEARHNYYQNRFKVQCSSDLFWKFQFLRERNFK 726
           F++NNFVGRQTW+F+PD G+PEER  V EAR ++Y NRF V+ SSDL W+ QFL+E+ F+
Sbjct: 791 FTTNNFVGRQTWEFDPDAGSPEERYAVVEARQSFYDNRFHVKASSDLLWRMQFLKEKKFE 850

Query: 727 QTIQIVRVE--DVIDKETASIALRRATKFFAALQSPHGHWPAENAGPMFYFPPLVFSLYI 786
           Q I  V+VE  + +  ETA+ ALRR   FF+ALQ+  GHWPAENAGP+F+ PPLVF LYI
Sbjct: 851 QVIAPVKVEGSEKVTFETATNALRRGVHFFSALQASDGHWPAENAGPLFFLPPLVFCLYI 910

Query: 787 TGHLHIVFSEEHRKEILRYAYCHQNEDGGWGLHIVGQSCMLCTVFNYIQLRLLGEEL--- 838
           TGHL  VF+ EHRKEILRY YCHQ EDGGWGLHI G S M CT  NYI +R+LGE     
Sbjct: 911 TGHLDEVFTLEHRKEILRYIYCHQKEDGGWGLHIEGHSTMFCTALNYICMRILGESPVGG 970

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BAMS1_PANGI5.4e-25362.46Beta-Amyrin Synthase 1 OS=Panax ginseng GN=OSCPNY1 PE=1 SV=1[more]
LUPS_RICCO1.5e-25061.74Lupeol synthase OS=Ricinus communis PE=1 SV=1[more]
BAMS_PEA1.5e-24760.77Beta-amyrin synthase OS=Pisum sativum GN=OSCPSY PE=2 SV=1[more]
BAMS_SOLLC3.4e-24761.58Beta-amyrin synthase OS=Solanum lycopersicum GN=TTS1 PE=1 SV=1[more]
BAS_BRUGY2.9e-24660.45Beta-amyrin synthase OS=Bruguiera gymnorhiza GN=BAS PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A103XQS9_CYNCS0.0e+0051.30Prenyltransferase/squalene oxidase OS=Cynara cardunculus var. scolymus GN=Ccrd_0... [more]
D7KVW8_ARALL0.0e+0059.72Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA... [more]
A0A0D9YM41_9ORYZ1.5e-27342.50Uncharacterized protein OS=Oryza glumipatula PE=3 SV=1[more]
F6GYJ7_VITVI5.6e-26558.43Terpene cyclase/mutase family member OS=Vitis vinifera GN=VIT_09s0054g01220 PE=3... [more]
F6GYJ6_VITVI1.6e-25956.95Terpene cyclase/mutase family member OS=Vitis vinifera GN=VIT_09s0054g01230 PE=3... [more]
Match NameE-valueIdentityDescription
AT1G78955.11.2e-24560.93 camelliol C synthase 1[more]
AT1G78950.18.3e-24454.97 Terpenoid cyclases family protein[more]
AT1G78970.23.2e-24360.97 lupeol synthase 1[more]
AT1G78960.11.2e-24259.87 lupeol synthase 2[more]
AT1G66960.12.9e-22856.34 Terpenoid cyclases family protein[more]
Match NameE-valueIdentityDescription
gi|3152599|gb|AAC17080.1|0.0e+0054.98Strong similarity to lupeol synthase gb|U49919 and cycloartenol synthase gb|U025... [more]
gi|802601529|ref|XP_012073072.1|0.0e+0054.96PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105634775 [Jatropha c... [more]
gi|976907938|gb|KVH95175.1|0.0e+0051.22Prenyltransferase/squalene oxidase [Cynara cardunculus var. scolymus][more]
gi|778708541|ref|XP_011656229.1|0.0e+0070.30PREDICTED: lupeol synthase-like [Cucumis sativus][more]
gi|297842681|ref|XP_002889222.1|0.0e+0059.72hypothetical protein ARALYDRAFT_316793 [Arabidopsis lyrata subsp. lyrata][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0016866	intramolecular transferase activity
Vocabulary: INTERPRO
Term	Definition
IPR018333	Squalene_cyclase
IPR008930	Terpenoid_cyclase/PrenylTrfase
IPR002365	Terpene_synthase_CS
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0019745 pentacyclic triterpenoid biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016866 intramolecular transferase activity
molecular_function GO:0042300 beta-amyrin synthase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG00g00410.1	Cp4.1LG00g00410.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002365	Terpene synthase, conserved site	PROSITE	PS01074	TERPENE_SYNTHASES	coord: 479..493, scor
IPR008930	Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid	GENE3D	G3DSA:1.50.10.20		coord: 742..977, score: 1.4E-77; coord: 7..274, score: 7.9E-108; coord: 275..629, score: 7.5E-132; coord: 978..1319, score: 2.4E
IPR008930	Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid	unknown	SSF48239	Terpenoid cyclases/Protein prenyltransferases	coord: 5..283, score: 1.63E-88; coord: 226..631, score: 7.22E-99; coord: 742..985, score: 6.28E-66; coord: 934..1319, score: 2.83
IPR018333	Squalene cyclase	TIGRFAMs	TIGR01787	TIGR01787	coord: 843..1319, score: 2.0E-128; coord: 7..626, score: 1.1E
None	No IPR available	PANTHER	PTHR11764	LANOSTEROL SYNTHASE	coord: 628..1319, score:
None	No IPR available	PANTHER	PTHR11764:SF18	AMYRIN SYNTHASE LUP2-RELATED	coord: 628..1319, score:
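
The `coord` ranges listed for one source can be combined to see which stretch of the protein they cover. The sketch below is an illustration only, not part of the original record: it merges the four GENE3D ranges listed above for IPR008930 and counts the covered residues. The function names `merge_intervals` and `covered_residues` are hypothetical, and treating ranges that end and start on consecutive residues (for example 7..274 and 275..629) as one contiguous block is an assumption of this sketch.

```python
def merge_intervals(intervals):
    """Merge inclusive (start, end) residue ranges.

    Ranges that overlap, or that touch on consecutive residues,
    are fused into a single range (an assumption of this sketch).
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous block if this range reaches further.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


def covered_residues(intervals):
    """Number of residues covered by at least one range."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# GENE3D coordinates for IPR008930, in the order listed in the table above.
gene3d = [(742, 977), (7, 274), (275, 629), (978, 1319)]

blocks = merge_intervals(gene3d)
total = covered_residues(gene3d)
```

With these coordinates the ranges collapse into two blocks, 7..629 and 742..1319, covering 1201 residues and leaving residues 630..741 outside any GENE3D match.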

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG00g00410	CmaCh05G009030	Cucurbita maxima (Rimu)	cmacpeB761
Cp4.1LG00g00410	Csa5G478110	Cucumber (Chinese Long) v2	cpecuB015
Cp4.1LG00g00410	ClCG09G022130	Watermelon (Charleston Gray)	cpewcgB000
Cp4.1LG00g00410	Lsi02G017490	Bottle gourd (USVL1VR-Ls)	cpelsiB009
Cp4.1LG00g00410	MELO3C002943.2	Melon (DHL92) v3.6.1	cpemedB003
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None