CcUC01G003870 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC01G003870
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Cytochrome P450
Location: CicolChr01: 3918170 .. 3928125 (-)
RNA-Seq Expression: CcUC01G003870
Synteny: CcUC01G003870
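
The coordinate string `CicolChr01: 3918170 .. 3928125 (-)` can be split into sequence ID, start, end, and strand for downstream use. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re

# Pattern for strings shaped like "SeqID: start .. end (strand)".
LOC_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    m = LOC_RE.search(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    """Span length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("CicolChr01: 3918170 .. 3928125 (-)")
print(seqid, strand, span_length(start, end))  # CicolChr01 - 9956
```

The `(-)` strand means the gene is read from the reverse complement of the reference sequence at these coordinates.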
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGTTTTTTTTCTTTTTCTTTTTTCTTTTTTCTTTTTTCTTCCTTAAAAAAATGATTGATCCTTTTATAAGGTGAAGGCAGGGCCATAGATTCCTCATATGTCGTCTCGCCTCCACCAGATCTTGTTTCTGCCGACCTTCGCCCAACATCAACATTACAAAACTTTTCACCAAACTCTCCTTTCTTTCTCCACTCCCCCATGGAACAACCCACCTCCAATTCTCCTCTACTTCTCATAAATTCCCTCAATTCCTCAGTTGACCAATCTCCTTTCCTCTTCACCTTCTTCGCCGCTCTTCTAATTTTCCTCTATGTAAAACTCACGCGCCTGCGCGTGCCTCTTCCACCTGGTCCCTGGGGCGTTCCACTACTCGGAAACCTTCCATTCCTTGACCCCGATCTTCACACGTACTTTGCCGAATTAGGCCGAAAATATGGCCCAATAGTAAAGCTCCAACTCGGCAATAGAATTGGTATAATCGTAAATTCATCGTCCGTAGCACGTGAAGTGTTGAAAGACCACGATGTTACATTTGCGAATCGCGATGTTCCTCAAGCCGGGAGAGCCGCCTCGTACGGCGGTTCTGACATAGTTTGGACACCGTACGGACCCGAGTGGCGAATGTTGAGAAAAGTTTGTGTGTTAAAGATGCTGAGCAACGCCACATTGGATAGCGTTTACGAGCTCCGCCGTAGAGAGGTGAGAAACACGGTGGCTCACTTGTACCGGCAATGTGGGTCAGCAGTGAACGTGGGAGAGCAGGGGTTTTTGACGGTTTTCAACGTGGTTACAAGCATGTTGTGGGGTGGTTCAGTGGAAGGAGAGCAGAGGGATGGTCTTGCAGCGGAGTTTAGAGAGACAATTTCGGAAATAACAGAGCTATTGGGAAAACCTAATATTTCGGACTTTTTTCCAAGCTTGGCCCGTTTTGATCTCCAAGGGATTGAGAAGAAGATGCGTGAGATTGCTCCGAGATTTGATAACATTTTCGACAAGATGATAGATGAAAGATTGAAAATTGGTGGTGGAGATGACGACGGGAGTGTCAAGAAAAATGATTTCTTGCAGTTCTTACTTGAGGTTAATGATGAAGGAGAATCCAAGACTCCTCTCACCATGACCCACCTTAAAGCCTTGCTCATGGTAACTCATTTTAACCCACTATTTATTTATTTATTTATTATTATTATTATTATGTTATCTTACAAGTTTATTCCATTAATTTCCTAGGGTTAAAATGGTTAAAATATTATTTTGGTGTTTATACTTTAGATTTTGTTATATTTTGGTCTTTTCAAAGGTTTATTTTTTAAAAAATATTAGTCCATAAAATAGTTTATTGTTTCTCTGTATTTGTTTACTATAGTTTTCATCTTTGAATTATATTATTATTATTATTATTATAATATAACTGTTTTAATTTATAAAACTTGACTTGAAATTTTTTGAATAGTACTCTAAAAGAGTAGTTACAAAACAAAGAGCGGTTGATGATGATGATATGTGGGTTCTTGATTCTGGGGGTATCTATAGATAGATGGAATGAATGTTGGTATAGCTTACATTAGTGCTATGTGCAAGATAGTTGAAATGGACTCAGTGGCTTGCTGGTATGGGAATGAGATGGAAGAAATCTCACCTTTAGAGGGTAATAAATATAAAGTTCGAGTTGTTGCTAGAGGGTTCACACAAAGAGAAGAAGTTGATTGTAATGAGATATTCTCACCAATAGTCAGACATACTTCTATCAACATGTTACTAGCTATGGTAGCACATCAAAATTTAGAACTTGAGCAACTAGATGAAATGAGATTCATATGACCAACCCAAATGGGTTTCAATGTCTCAGCAAAAAAGATTATATTTTGCAAGCTGAAGAAGTTTTTGTATGGGCTAAAGTAGTCTTTAAGATATACAGTGGTACAAACGGTTTGATAGCTATATGGTTGGACTTCGGTATACCAAAAGTTTGTATGATTGTTGTGTATATCACAAC
AAAGCTGATGATGGTTCAATGGATTATCTACCTGTATATATAGGTAATATGCTCATAGTTGCAAAGTTCAAGTCTGATATTTAGAAGTTAAAAGATCTTTTCCGTGTTAGTTATGTACAGAGGCCCGTACTTGGGTACGTATGGCAACGTTTTTGTGTTCGTCTAGTAAATATGTTATTTGTAAGAGAATTCAAGGCAATGTGGAGATTTGTTGGAATGCCTTGAATTTGTTGTATGACACTTACCAAGTATAGGAATCCCACTGACTGGGTTTAGAGAAGTGAACTATATAGAATTTATATTTGACAATTATTTATGGTTATTTACAATTATGATGCTAGGGTTTAGAGGAGTGAACTATATATACTTTCTAACCTTAAAGTGTCACTTATTGTGTAATTAGTATTAGTTAATCATGTAAATCTACGCGTTGATTCTCTTCTATTTTATGTTTTATGTATTCTTTATTTGTCGATTTCATAACAAAAACACTTAGAAAGTAATTTGCAATGAAGATAAGGAAATTCTTTGTGTGGGGGCAGGATATGGTGGTCGGAGGGACAGACACATCGTCAAATACCATAGAATTTGCATTGGCAGAAATGATAAAGAACCCAAAAACTCTAAAGAAAGCACAAGAAGAACTCACCGCCGTGGTCGGAGAAGACAACATCGTCGAAGAGTCCCACATTCACAGTTTCCCATATCTGAAGGCCGTAATGAAAGAAACTTTACGTTTACACCCAATTCTACCTCTGCTCGTGCCGCACTGTCCAAGCGAGACCACCATCGTCTCCAATTACGCAATTCCAAAGGGCTCTCGAGTGTTCATCAATGCTTGGGCCATTCAGCGAGACCCCAATCATTGGGAAAATCCATTGGTGTTTGATCCAGAGAGGTTTCTCAAGACCGAGAAGTGGGATTTTGGTGGGAGTGATTTTCGTTACTTCCCATTCGGGTCCGGCAGAAGAAACTGTGCAGGGATAGCCATGGCGGAAAGAATGGTAATGTATTTACTTGCTACGCTTTTGCATTCTTTTGACTGGAAATTGGAAGATGGTAAGAAGATGGAAGTTGAGGAGAAATTTGGGATTGTTTTGAAGATGAAGTCGCCTCTTGTTCTCATTCCAACGCCAAGGCTATCTAATCCAACTTTGTATCAATAAGCTGATTGGTGACTTCCCCTTCTGCCATTGTTGTTCCTTCAAAGCTAGCAAAGTGTATACATATATATGCTCATACTATAATACTAGCTAAAACCTGCACTTTCATTCTTGTTGCCATTCCCAGTTCTATGATCGGCTGAACTTGGTGACCTTTTTTGGTTTGAATTGGACGTGTTAGGAAGGGATGGACTTATGCATAATATGTTCAAATTGATACATTCACATTGGAAGTGGAAGTGACAACATGTCAGTCAAGTTCGTTGTAAAGGTGTTCCCGGATAATATATATCATCAAAAATATCAATCATATATCAATGAGATATTAATGGGATATCAAATCATAGAATCAACCAAATAGAAATCCTAAAACATTTAATATGTGACGATAATCACATGACAATCCGATAGAATTTATATAACAAGCACACAGTAATCAAATAAAAATCACAAAGTATTTAATATGTAATGAAAATCATATGACAATCTGATTGTGGTTTAATTGGTCTATATAATTGACAATTTTTAATCATAATTTGTATAAAAGTAAGATAATATTTAGATATATATTGTAAATATTTGTGTAAATGTCGATGCAAAAGTATAAATTTAAAAAATAAATTAAAAAAATAATATCAAATAAATATATATTATTTTTCTCAATTTTGGTTTCTTGAAAATGAAAGATTTAGATGGTAAAAGAAAGGCAAATTACAATTACCACCATACTTCAAGAGTGATTTTTGAGATGGAATCTCATACTATCCATCACTTAAAAAGGTAGAAGAGTTATTTCTTTTATTCTTAATTGGTGAAACCTCATACTATCAATATTTT
TTCATGTAATCAAGACAGAGATCATAGAACTGAAAAAGACTATAATTAAGCTAGCATATTAGATGATTTTAAGTCTTCCATGAATAGAAAAAGACAAAAGATAATGAAGAAAAATATAATAGAACTAGAAGAGGAAAAAGAAAGAAAAAGGCAAAAATGACTAAAACAGTAAAATAAAATAAAATAAAAAGTAGAACATATAAAGCACCTAAAGTATCTTTTTTATTATTTTCTAATTTTTCACCAGTTCAGTAACTATTAGAATATTGTATTTACCGTAGGTTAATTTGTTATTTTCTTTTTCTTTTTTTCTTTTTGTCCTTCTCCAACATTTACAAAACTTCACCAAACTCACCTTCTTTCTCCACTCTGCCATGGAACAACCCACCTCCAATTCTCTTCTACATCTCATAGATTCCCTCAATTCCTTCTTCCCATGGCGCTCCATTGACCAATCCGGCTTCCTCTTCATCTTCTTCGCTACTCTTCTAATTTTCCTCTATGTAAAACTTACCCGGGGGCGTGTGCCACTTCCACAGGGTCCTCCAGGTGTTCCAACCGACACGTGAGGTATTAAAGGACCACGATGTTACATTGGCCAATCGCGACGTTCCAAAAGCTGGGAGAGTCGCCTCGTATGGGGGTTGCAACATAGTTTGGACCCGAATGGCGGATGTTGACGAAAGTTTGTACAATCAAGATGTTTAGCAATGCTAGTTTGGACTCTGTTTATGAGCTCCGTCGTACAGAGGTTAGAAATACTGTTAGCTCACTTGTACCAGCAAGCTGGGTCGCCGGTGAATGTGGGTGAGCATGGATTTTTGATGATTTTCAAGGTGGTTACAAGTATGTTGTGGGGTGGATCAGTGGAAGGGGAACAGAGATATAGTGTTGCAGCAGAGTTTAGAGAGACAGTTTTAGAAATAACAGAGGTATTGGGCAAGCCTAATGTTTCAAACTTTTTTCTGATCTTGGCCAGGTTTGATCTCCAAGGGATTGAGAAGCAGATGCGTGAGCTTGCTTCGAGATTTGATAATATTTTCGAGAAGATGATTGATCAAGGATTGAAAATTGATGATGGAGAAGAGGACGAGAGTGAAACATCAAGTTTACGCAGCAGTTTAGTTCTATCTAATAAGTTTGTATATTTATACTCCTAAGTGAAACATCAAGCAATCATTTTCTAAAGAAAGCACAGGAAGGACTGGTGGCCATAGTGGGAGAAGACGATGTCGTAGAAGAGTGTCACATTCATAGTTTGCCATATAATTTAAAAGCCATAATGAAAGAAACTTAGCGTTTGCACCCAGCTATACCACTGCTAGTGCCGCACTGCCCAAGCGAGACCAGCTATCCGTTGGTGTTTGATCCAGAGAGGTTTCTGAAGAGAGAAAAATGGAATTTTGGTCGGTGAAGGGATAGCCACGGTAGAAAGAATTGTAGTGTATTTGCTTGCTACGCTTTTGCATTATTTCGATCGGAAATTGGAAGATGATCGTGGGAAGATGGAAGTTGAGGAGAAATTTGGGATTGTTTTGAAGATGAAGTCGCCTCTTGTTCTCATCCCACTCGGCCTCTACTTCTGCCATTGCTGTTCCTTTCAAATCTAGCAAAGTATATATATATATATGCTCACACTGTCAACTAATACGTGCACTTTCATTCTTTGATGATCGGCTTCAAATTTTCCCGAGTCCAGAGAAATAAACATTCTCATTTTCAAAAACCACCACTAAACTATGGTGTTGGTTAAAATTACACTCAGTAATTTTAAATTTGATCAATTATCCCCTTAATTTGTAAGTGTTGCAATCAACTCCTATTAAAATTAATTTATATTTGTGGATATTTATGGGTTTTCCTCCCTATCTGGTTCTCTCTTCCACTTCTCACAAGCAAACTCTTCACCAATGGTGAATTTGGAAACACAGAGTCGGCAACACACACCAATCTCCAAAGACAGAAATCTCCAGGCTCTCAACATCATAATCCAGCTCCAT
TTCGAGAAGAACCTTGAGAAGAAGCTCGCCATAAGATCTCCAAAAGAAAGAGCTCCACAAGCTTTTTCAGCTCTTCATCTTTCTGGGCCACTTTCCTCGCGCCTCAAATACCGCCATTGTTGGATCCCCATTGCTCTTCTCTCCATTTCCGATCTCAGTTTCTATGTCTCTGTAGCTCAGACCGTTCGCTACATCAATGGATTCAAGTATCAGCGGCGTTGCCATTAGCATTTTGTTCGTTGATGTTGTAACTCATAAACCCACAAATTTCCCTCACTTTCGATGCGAAAACCCGCTTCATTTACCTTCGCAACAGTGTGTCCCTTCTTTTCAACGAGCTCCAGCGTCTTCCATCAGCATTTTTGACCTTGGGTTCGACTTCCAGCTGCTGCCCACACTTCAATTTGGGCTTAAATCATATTACTCATAGAGTTTTAAGGTTCAGATCTGCAAGTTCTGATAATTTCAAACACTAGCCAACAAGGTTCATAGTTCTTCTTGGTAAGATTCTGTGAGTATCATTACCTTTTTGGTTTTTCAAATTGGTTTAGGTGTTTGTATTAGATTACAAGTCCTTGGAATTCTTCCTTTGCTATGTTTGGGTCAATTTCGGAGCACTCAGCTAAGGTTCTATAGGTGTTAAACATGCTTTGAGTTATTTCTGACTTTGACCAAAATGAGATTTAATCACATACAAAAAGTAAAATGTAAAATTTTGAAAAAGACGCCAGAAGAGTTTGAGCGCTATTTAGATTTACGAGAGAAAAAAAATATTAGAAAGAGTTATTTTTTAAAAATCTTTTGATTAAAAAAAAAAAATCTATTTAAAATACACTTTAAAAGTATTTTGAAAGTTATTTTAAGTGGTTGTCATACTCAAATTTTTTTCAAAATCACTCACTTTCAAAATTAAACACTTGAAAAGTTAAACTATATATAACTTTAATCTTACAAATTTTTTATGGAACTGTCTGTAAAATATCGATGTTGATAGATATTTTAGAAAAAAATTATAAAACAAAAAAATTCAAAAAAAAATTTAAAAAGTAAATATTTTTACCATTTTCAAAAATTAAAATATTTATTGTTTGTAGTATATTCATAATAGTAAAATTATGTTAACACTCCACTATTTTTTATGAAATTTTTTAGGTTATAATGGAGGTGTTTGATTCACTCCATTTAAACACGACGACATGTTTGCTGCTTTTCTTTTTTTCTTTTTTTTTTTTTTTCAATAGTAGAATTCTATAAAGATTATTAATATTCAAAATGGTTTGAAGATATTTTCGAGCTTGAATTATTCATTTCTGAACCTCCCATTTGGTATTGGTATGTCAGTCTCAGTTCCCTTCCTCTTCTCATTTTGGTCCGTAGATTTTTCCAACGAAGACAAACAAATTTCAACCGCTCTTCTTTTTATCTTCGTCTTCGTTGTTGCTCTGTTATGGCTCCGACCCGAATTCCGCCGTCCTTCTTTGCCTCCAGGCCCCTGTGGCCTGCCGCTGGTCGGATACCTTCCATTTTTATCTGGCAATCTCCACCAGACGTTCGCCGATTTGGCCCAAATCTACGGCCCAATTTTCAAGCTCCGCCTTGGAACCAAGCTCTGCATCGTTCTCAGCTCCCCTTCCTCCATCAACGTAGCCCTCCGCCACCAAGAAACCGTTCCACCAAGAAACCGTCTTCGCCAACCGAGATTCCACTGTCAGCTCTCTTCTCGCCACCTACGGTGGAGCCGATATTGTGTTCAGTCAGGACGACGGCGATTGGAAGAAGCTGAGAAAAATCTTCTCCCGCAAAATGCTTAGCAAATCAAATCTGGATGAGTCCTATCCTCTGCGAAGACAAGAAGTGAGAAAAGTGATTAAAGGCGTGTTGGAATCGGCCGGAACCCCAATTGATATCGGCAAATTGGGTTTCTTCGCTGCTGCTAAATCGGTTATGGCAATGACGTGGGGCGGCTCCGGGGGGATGATCGGAGTGGATGGAGCGGAGTT
GGAAGATAAGTTCAGGGAAGTGGTGGATGAAATGATGGTGTTGATTGCATCTCCGAATTTGTCGGATCTTTTTCCGGTGTTGGGTCGGTTTGATTTGCAGGGAATTGCGAGGAAGATGAAGAAGGTGATGAATGTTTGTGATGAGATTCTAAATTCCGCCATTGAAGAACAGAGGAAGATGGGAGGAAATGGGGTGGAAAGGAGAGGGTTTTTGCAATTTCTGTTGAAGGTTAGGGACGGCGAGGATAGATCAGAGTCCATTACAGATAACCAACTCAAGGCCTTGCTAATGGTATGCTTTCTTAATCAATCACCATTTCATTCCCATGGAAATTCTTCGCTTACAAGTTGAACATTCTACTCTTCTTAAATGATTAGTAAAAGCTTTTTAAATGGCAACCACTCAATTTAAAGGATTTTCTTTTTCCTTTCTGCAACTAATTTAAAGGATTAGTTAACCATAAATGGTAATATTAAAGTTTTTAACAGGGGCAAAAGTGCTCGAAATATTTTTAGATATACCAAAATATTATCAGGCTACTACTATTGATAGACACTTGTAACAGTGATAATATAACATTTCACTATATTTATAAATATTTGAACTCTGAAAACACTTTCTTTTTTTTAATAACTAAATGTTTTTTTTTAATCCTAAAAATTATACTTATTTTCTTACCCCTCATAATCATGGTTTTCTAACTATATTTATTACACATTGTCTATGATAATTTCTAAATTTTTGCCTGATGGAACCTCCATTGTGACAATTTTTAATTACTAATACTTTCAAAGAAGGACTATTTTTAAATGTAGCAAAATAAGCTAAAATATTTACAAATATAACAAAATGTCATATTTATCAATAGTAGACACAAATAAAATTATCATCTATCACTGTCTATTAATGATAAACATTGATAGATGTCTATCAGAATCTATCGTTGTCTATCACTTCTAGACCGTAACATGTTATTATATTAATATTTTTTTTTTGAATAAATTAAAACAATCAAGTCCTTAAGCAATTAGTGTCTAGAGTTCCAAAATAAAAAATAAAAATAAAATAATAATAATATAAAAAAAAGGTGTTGAGATAAAAATAGATGGAGAGGAGGTTTTTTCCTTCCCATTGCATTTTACATTTTGGATTGAATTTATAATTTATACAGTGGGTCTTTATACCTAAAAACATTTCTCCTTTATATTATTTTTTGTGTGATTTTGAAAATATATAGGACATCATCATTGGAGGAACAGACTCAACATCAACTACGATCGAATGGGCAATAACAGAGTTAATACAACAACCAAACATAATGATGAAAGTCATGGAAGAATTAACAAAAGTTGTGGGATTAAACCAAATGGTTGAAGAATTTCACTTATCTAAATTATTTTATTTAGATGCAGTGATTAAAGAGAAACTTCGTTTACATCCACCCTTAACTCTTTTAGTACCACGTAAGTCTACCCAAACAAGCATTCTCGGAGGGTACACTATCCCAAAGGGCTCAACTATCTACTTTAATATGTGGGCAATCCAAAGAGACCCTAAAGTTTGGGATAACCCCTTAAACTTTATGCCTGAGAGATTCTTGAATGAAAGTAATGGAGAAGTATATGATTTCACTGGCAATAGCATAGAGTTTTGTCCATTTGGGTCTGGTAAAAAATTATGTGCAGGGATCCCTCTAGCTGAGAGGTTGTTGGTTTTAATATTAGCATCATTGTTGCATGCTTTTGAGTGGGAATTGCTTGAGGGTTCAAAGCTTGATCTGGAAGAGAAGTTTGGAATTGTCACCAAGAAGTTCAACCCCTTGGTTGCTATACTTATGCCAAGGATTTCCAATTTGGAGCTCTATAATATTATGTAGATTTGTACTAAATAATTGCATTTTACTACTCAAATCAAGAATTCCTATGTTCGTAATATAATTACAAGGGAAATTAGAG

mRNA sequence

ATGGGGGCCATAGATTCCTCATATGTCGTCTCGCCTCCACCAGATCTTGTTTCTGCCGACCTTCGCCCAACATCAACATTACAAAACTTTTCACCAAACTCTCCTTTCTTTCTCCACTCCCCCATGGAACAACCCACCTCCAATTCTCCTCTACTTCTCATAAATTCCCTCAATTCCTCAGTTGACCAATCTCCTTTCCTCTTCACCTTCTTCGCCGCTCTTCTAATTTTCCTCTATGTAAAACTCACGCGCCTGCGCGTGCCTCTTCCACCTGGTCCCTGGGGCGTTCCACTACTCGGAAACCTTCCATTCCTTGACCCCGATCTTCACACGTACTTTGCCGAATTAGGCCGAAAATATGGCCCAATAGTAAAGCTCCAACTCGGCAATAGAATTGGTATAATCGTAAATTCATCGTCCGTAGCACGTGAAGTGTTGAAAGACCACGATGTTACATTTGCGAATCGCGATGTTCCTCAAGCCGGGAGAGCCGCCTCGTACGGCGGTTCTGACATAGTTTGGACACCGTACGGACCCGAGTGGCGAATGTTGAGAAAAGTTTGTGTGTTAAAGATGCTGAGCAACGCCACATTGGATAGCGTTTACGAGCTCCGCCGTAGAGAGGTGAGAAACACGGTGGCTCACTTGTACCGGCAATGTGGGTCAGCAGTGAACGTGGGAGAGCAGGGGTTTTTGACGGTTTTCAACGTGGTTACAAGCATGTTGTGGGGTGGTTCAGTGGAAGGAGAGCAGAGGGATGGTCTTGCAGCGGAGTTTAGAGAGACAATTTCGGAAATAACAGAGCTATTGGGAAAACCTAATATTTCGGACTTTTTTCCAAGCTTGGCCCGTTTTGATCTCCAAGGGATTGAGAAGAAGATGCGTGAGATTGCTCCGAGATTTGATAACATTTTCGACAAGATGATAGATGAAAGATTGAAAATTGGTGGTGGAGATGACGACGGGAGTGTCAAGAAAAATGATTTCTTGCAGTTCTTACTTGAGGTTAATGATGAAGGAGAATCCAAGACTCCTCTCACCATGACCCACCTTAAAGCCTTGCTCATGGATATGGTGGTCGGAGGGACAGACACATCGTCAAATACCATAGAATTTGCATTGGCAGAAATGATAAAGAACCCAAAAACTCTAAAGAAAGCACAAGAAGAACTCACCGCCGTGGTCGGAGAAGACAACATCGTCGAAGAGTCCCACATTCACAGTTTCCCATATCTGAAGGCCGTAATGAAAGAAACTTTACGTTTACACCCAATTCTACCTCTGCTCGTGCCGCACTGTCCAAGCGAGACCACCATCGTCTCCAATTACGCAATTCCAAAGGGCTCTCGAGTGTTCATCAATGCTTGGGCCATTCAGCGAGACCCCAATCATTGGGAAAATCCATTGGTGTTTGATCCAGAGAGGTTTCTCAAGACCGAGAAGTGGGATTTTGGTGGGAGTGATTTTCGTTACTTCCCATTCGGGTCCGGCAGAAGAAACTGTGCAGGGATAGCCATGGCGGAAAGAATGGTAATGTATTTACTTGCTACGCTTTTGCATTCTTTTGACTGGAAATTGGAAGATGGTAAGAAGATGGAAGTTGAGGAGAAATTTGGGATTGTTTTGAAGATGAAGTCGCCTCTTGTTAATTTGTTATTTTCTTTTTCTTTTTTTCTTTTTGTCCTTCTCCAACATTTACAAAACTTCACCAAACTCACCTTCTTTCTCCACTCTGCCATGGAACAACCCACCTCCAATTCTCTTCTACATCTCATAGATTCCCTCAATTCCTTCTTCCCATGGCGCTCCATTGACCAATCCGGCTTCCTCTTCATCTTCTTCGCTACTCTTCTAATTTTCCTCTATGTGTTCCAACCGACACGTGAGGTATTAAAGGACCACGATGTTACATTGGCCAATCGCGACGTTCCAAAAGCTGGGAGAGTCGCCTCGTATGGGGGTTGCAACATAGTTTGGACCCGAATGGCGGATGTTGACGA
AACAATGCTAGTTTGGACTCTGTTTATGAGCTCCGTCGTACAGAGGTTAGAAATACTGTTAGCTCACTTGTACCAGCAAGCTGGGTCGCCGGTGAATGTGGGTGAGCATGGATTTTTGATGATTTTCAAGGTGGTTACAAGTATGTTGTGGGGTGGATCAGTGGAAGGGGAACAGAGATATAGTGTTGCAGCAGAGTTTGATCTCCAAGGGATTGAGAAGCAGATGCGTGAGCTTGCTTCGAGATTTGATAATATTTTCGAGAAGATGATTGATCAAGGATTGAAAATTGATGATGGAGAAGAGGACGAGAGTGAAACATCAAGTTTACGCAGCAGTTTAGTTCTATCTAATAATCGCCTCTTGTTCTCATCCCACTCGGCCTCTACTTCTGCCATTGCTGTTCCTTTCAAATCTAGCAAAAGTCGGCAACACACACCAATCTCCAAAGACAGAAATCTCCAGGCTCTCAACATCATAATCCAGCTCCATTTCGAGAAGAACCTTGAGAAGAAGCTCGCCATAAGATCTCCAAAAGAAAGAGCTCCACAAGCTTTTTCAGCTCTTCATCTTTCTGGGCCACTTTCCTCGCGCCTCAAATACCGCCATTGTTGGATCCCCATTGCTCTTCTCTCCATTTCCGATCTCAGTTTCTATGTCTCTGTAGCTCAGACCGTTCGCTACATCAATGGATTCAAGTATCAGCGGCATTTTTCCAACGAAGACAAACAAATTTCAACCGCTCTTCTTTTTATCTTCGTCTTCGTTGTTGCTCTGTTATGGCTCCGACCCGAATTCCGCCGTCCTTCTTTGCCTCCAGGCCCCTGTGGCCTGCCGCTGGTCGGATACCTTCCATTTTTATCTGGCAATCTCCACCAGACGTTCGCCGATTTGGCCCAAATCTACGGCCCAATTTTCAAGCTCCGCCTTGGAACCAAGCTCTGCATCGTTCTCAGCTCCCCTTCCTCCATCAACCCCTCCGCCACCAAGAAACCGTTCCACCAAGAAACCGTCTTCGCCAACCGAGATTCCACTGTCAGCTCTCTTCTCGCCACCTACGGTGGAGCCGATATTGTGTTCAGTCAGGACGACGGCGATTGGAAGAAGCTGAGAAAAATCTTCTCCCGCAAAATGCTTAGCAAATCAAATCTGGATGAGTCCTATCCTCTGCGAAGACAAGAAGTGAGAAAAGTGATTAAAGGCGTGTTGGAATCGGCCGGAACCCCAATTGATATCGGCAAATTGGGTTTCTTCGCTGCTGCTAAATCGGTTATGGCAATGACGTGGGGCGGCTCCGGGGGGATGATCGGAGTGGATGGAGCGGAGTTGGAAGATAAGTTCAGGGAAGTGGTGGATGAAATGATGGTGTTGATTGCATCTCCGAATTTGTCGGATCTTTTTCCGGTGTTGGGTCGGTTTGATTTGCAGGGAATTGCGAGGAAGATGAAGAAGGTGATGAATGTTTGTGATGAGATTCTAAATTCCGCCATTGAAGAACAGAGGAAGATGGGAGGAAATGGGGTGGAAAGGAGAGGGTTTTTGCAATTTCTGTTGAAGGTTAGGGACGGCGAGGATAGATCAGAGTCCATTACAGATAACCAACTCAAGGCCTTGCTAATGGACATCATCATTGGAGGAACAGACTCAACATCAACTACGATCGAATGGGCAATAACAGAGTTAATACAACAACCAAACATAATGATGAAAGTCATGGAAGAATTAACAAAAGTTGTGGGATTAAACCAAATGGTTGAAGAATTTCACTTATCTAAATTATTTTATTTAGATGCAGTGATTAAAGAGAAACTTCGTTTACATCCACCCTTAACTCTTTTAGTACCACGTAAGTCTACCCAAACAAGCATTCTCGGAGGGTACACTATCCCAAAGGGCTCAACTATCTACTTTAATATGTGGGCAATCCAAAGAGACCCTAAAGTTTGGGATAACCCCTTAAACTTTATGCCTGAGAGATTCTTGAATGAAAGTAATGGAGAAG
TATATGATTTCACTGGCAATAGCATAGAGTTTTGTCCATTTGGGTCTGGTAAAAAATTATGTGCAGGGATCCCTCTAGCTGAGAGGTTGTTGGTTTTAATATTAGCATCATTGTTGCATGCTTTTGAGTGGGAATTGCTTGAGGGTTCAAAGCTTGATCTGGAAGAGAAGTTTGGAATTGTCACCAAGAAGTTCAACCCCTTGGTTGCTATACTTATGCCAAGGATTTCCAATTTGGAGCTCTATAATATTATGTAGATTTGTACTAAATAATTGCATTTTACTACTCAAATCAAGAATTCCTATGTTCGTAATATAATTACAAGGGAAATTAGAG

Coding sequence (CDS)

ATGGGGGCCATAGATTCCTCATATGTCGTCTCGCCTCCACCAGATCTTGTTTCTGCCGACCTTCGCCCAACATCAACATTACAAAACTTTTCACCAAACTCTCCTTTCTTTCTCCACTCCCCCATGGAACAACCCACCTCCAATTCTCCTCTACTTCTCATAAATTCCCTCAATTCCTCAGTTGACCAATCTCCTTTCCTCTTCACCTTCTTCGCCGCTCTTCTAATTTTCCTCTATGTAAAACTCACGCGCCTGCGCGTGCCTCTTCCACCTGGTCCCTGGGGCGTTCCACTACTCGGAAACCTTCCATTCCTTGACCCCGATCTTCACACGTACTTTGCCGAATTAGGCCGAAAATATGGCCCAATAGTAAAGCTCCAACTCGGCAATAGAATTGGTATAATCGTAAATTCATCGTCCGTAGCACGTGAAGTGTTGAAAGACCACGATGTTACATTTGCGAATCGCGATGTTCCTCAAGCCGGGAGAGCCGCCTCGTACGGCGGTTCTGACATAGTTTGGACACCGTACGGACCCGAGTGGCGAATGTTGAGAAAAGTTTGTGTGTTAAAGATGCTGAGCAACGCCACATTGGATAGCGTTTACGAGCTCCGCCGTAGAGAGGTGAGAAACACGGTGGCTCACTTGTACCGGCAATGTGGGTCAGCAGTGAACGTGGGAGAGCAGGGGTTTTTGACGGTTTTCAACGTGGTTACAAGCATGTTGTGGGGTGGTTCAGTGGAAGGAGAGCAGAGGGATGGTCTTGCAGCGGAGTTTAGAGAGACAATTTCGGAAATAACAGAGCTATTGGGAAAACCTAATATTTCGGACTTTTTTCCAAGCTTGGCCCGTTTTGATCTCCAAGGGATTGAGAAGAAGATGCGTGAGATTGCTCCGAGATTTGATAACATTTTCGACAAGATGATAGATGAAAGATTGAAAATTGGTGGTGGAGATGACGACGGGAGTGTCAAGAAAAATGATTTCTTGCAGTTCTTACTTGAGGTTAATGATGAAGGAGAATCCAAGACTCCTCTCACCATGACCCACCTTAAAGCCTTGCTCATGGATATGGTGGTCGGAGGGACAGACACATCGTCAAATACCATAGAATTTGCATTGGCAGAAATGATAAAGAACCCAAAAACTCTAAAGAAAGCACAAGAAGAACTCACCGCCGTGGTCGGAGAAGACAACATCGTCGAAGAGTCCCACATTCACAGTTTCCCATATCTGAAGGCCGTAATGAAAGAAACTTTACGTTTACACCCAATTCTACCTCTGCTCGTGCCGCACTGTCCAAGCGAGACCACCATCGTCTCCAATTACGCAATTCCAAAGGGCTCTCGAGTGTTCATCAATGCTTGGGCCATTCAGCGAGACCCCAATCATTGGGAAAATCCATTGGTGTTTGATCCAGAGAGGTTTCTCAAGACCGAGAAGTGGGATTTTGGTGGGAGTGATTTTCGTTACTTCCCATTCGGGTCCGGCAGAAGAAACTGTGCAGGGATAGCCATGGCGGAAAGAATGGTAATGTATTTACTTGCTACGCTTTTGCATTCTTTTGACTGGAAATTGGAAGATGGTAAGAAGATGGAAGTTGAGGAGAAATTTGGGATTGTTTTGAAGATGAAGTCGCCTCTTGTTAATTTGTTATTTTCTTTTTCTTTTTTTCTTTTTGTCCTTCTCCAACATTTACAAAACTTCACCAAACTCACCTTCTTTCTCCACTCTGCCATGGAACAACCCACCTCCAATTCTCTTCTACATCTCATAGATTCCCTCAATTCCTTCTTCCCATGGCGCTCCATTGACCAATCCGGCTTCCTCTTCATCTTCTTCGCTACTCTTCTAATTTTCCTCTATGTGTTCCAACCGACACGTGAGGTATTAAAGGACCACGATGTTACATTGGCCAATCGCGACGTTCCAAAAGCTGGGAGAGTCGCCTCGTATGGGGGTTGCAACATAGTTTGGACCCGAATGGCGGATGTTGACGA
AACAATGCTAGTTTGGACTCTGTTTATGAGCTCCGTCGTACAGAGGTTAGAAATACTGTTAGCTCACTTGTACCAGCAAGCTGGGTCGCCGGTGAATGTGGGTGAGCATGGATTTTTGATGATTTTCAAGGTGGTTACAAGTATGTTGTGGGGTGGATCAGTGGAAGGGGAACAGAGATATAGTGTTGCAGCAGAGTTTGATCTCCAAGGGATTGAGAAGCAGATGCGTGAGCTTGCTTCGAGATTTGATAATATTTTCGAGAAGATGATTGATCAAGGATTGAAAATTGATGATGGAGAAGAGGACGAGAGTGAAACATCAAGTTTACGCAGCAGTTTAGTTCTATCTAATAATCGCCTCTTGTTCTCATCCCACTCGGCCTCTACTTCTGCCATTGCTGTTCCTTTCAAATCTAGCAAAAGTCGGCAACACACACCAATCTCCAAAGACAGAAATCTCCAGGCTCTCAACATCATAATCCAGCTCCATTTCGAGAAGAACCTTGAGAAGAAGCTCGCCATAAGATCTCCAAAAGAAAGAGCTCCACAAGCTTTTTCAGCTCTTCATCTTTCTGGGCCACTTTCCTCGCGCCTCAAATACCGCCATTGTTGGATCCCCATTGCTCTTCTCTCCATTTCCGATCTCAGTTTCTATGTCTCTGTAGCTCAGACCGTTCGCTACATCAATGGATTCAAGTATCAGCGGCATTTTTCCAACGAAGACAAACAAATTTCAACCGCTCTTCTTTTTATCTTCGTCTTCGTTGTTGCTCTGTTATGGCTCCGACCCGAATTCCGCCGTCCTTCTTTGCCTCCAGGCCCCTGTGGCCTGCCGCTGGTCGGATACCTTCCATTTTTATCTGGCAATCTCCACCAGACGTTCGCCGATTTGGCCCAAATCTACGGCCCAATTTTCAAGCTCCGCCTTGGAACCAAGCTCTGCATCGTTCTCAGCTCCCCTTCCTCCATCAACCCCTCCGCCACCAAGAAACCGTTCCACCAAGAAACCGTCTTCGCCAACCGAGATTCCACTGTCAGCTCTCTTCTCGCCACCTACGGTGGAGCCGATATTGTGTTCAGTCAGGACGACGGCGATTGGAAGAAGCTGAGAAAAATCTTCTCCCGCAAAATGCTTAGCAAATCAAATCTGGATGAGTCCTATCCTCTGCGAAGACAAGAAGTGAGAAAAGTGATTAAAGGCGTGTTGGAATCGGCCGGAACCCCAATTGATATCGGCAAATTGGGTTTCTTCGCTGCTGCTAAATCGGTTATGGCAATGACGTGGGGCGGCTCCGGGGGGATGATCGGAGTGGATGGAGCGGAGTTGGAAGATAAGTTCAGGGAAGTGGTGGATGAAATGATGGTGTTGATTGCATCTCCGAATTTGTCGGATCTTTTTCCGGTGTTGGGTCGGTTTGATTTGCAGGGAATTGCGAGGAAGATGAAGAAGGTGATGAATGTTTGTGATGAGATTCTAAATTCCGCCATTGAAGAACAGAGGAAGATGGGAGGAAATGGGGTGGAAAGGAGAGGGTTTTTGCAATTTCTGTTGAAGGTTAGGGACGGCGAGGATAGATCAGAGTCCATTACAGATAACCAACTCAAGGCCTTGCTAATGGACATCATCATTGGAGGAACAGACTCAACATCAACTACGATCGAATGGGCAATAACAGAGTTAATACAACAACCAAACATAATGATGAAAGTCATGGAAGAATTAACAAAAGTTGTGGGATTAAACCAAATGGTTGAAGAATTTCACTTATCTAAATTATTTTATTTAGATGCAGTGATTAAAGAGAAACTTCGTTTACATCCACCCTTAACTCTTTTAGTACCACGTAAGTCTACCCAAACAAGCATTCTCGGAGGGTACACTATCCCAAAGGGCTCAACTATCTACTTTAATATGTGGGCAATCCAAAGAGACCCTAAAGTTTGGGATAACCCCTTAAACTTTATGCCTGAGAGATTCTTGAATGAAAGTAATGGAGAAG
TATATGATTTCACTGGCAATAGCATAGAGTTTTGTCCATTTGGGTCTGGTAAAAAATTATGTGCAGGGATCCCTCTAGCTGAGAGGTTGTTGGTTTTAATATTAGCATCATTGTTGCATGCTTTTGAGTGGGAATTGCTTGAGGGTTCAAAGCTTGATCTGGAAGAGAAGTTTGGAATTGTCACCAAGAAGTTCAACCCCTTGGTTGCTATACTTATGCCAAGGATTTCCAATTTGGAGCTCTATAATATTATGTAG

Protein sequence

MGAIDSSYVVSPPPDLVSADLRPTSTLQNFSPNSPFFLHSPMEQPTSNSPLLLINSLNSSVDQSPFLFTFFAALLIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWRSIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRMADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVEGEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVLSNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIRSPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQRHFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFADLAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYGGADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGKLGFFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFDLQGIARKMKKVMNVCDEILNSAIEEQRKMGGNGVERRGFLQFLLKVRDGEDRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQMVEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQRDPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILASLLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELYNIM
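
The protein sequence appears to be the conceptual translation of the CDS under the standard genetic code: the first CDS codons `ATG GGG GCC ATA GAT` correspond to the leading residues `MGAID`. A minimal sketch of that translation follows. The function name `translate` and the `to_stop` flag are illustrative choices, not part of the source page, and the sketch assumes the standard code (NCBI translation table 1).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGGGCCATAGAT"))  # MGAID (first five codons of the CDS)
print(translate("AATATTATGTAG"))     # NIM   (last four codons, ending at TAG)
```

Passing the full CDS string to `translate` should let a reader check the listed protein residue by residue.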
Homology
BLAST of CcUC01G003870 vs. NCBI nr
Match: KAG5592995.1 (hypothetical protein H5410_043509 [Solanum commersonii])

HSP 1 Score: 997.7 bits (2578), Expect = 1.0e-286
Identity = 656/1762 (37.23%), Positives = 896/1762 (50.85%), Query Frame = 0

Query: 59   SSVDQSPFLFTF---FAALLIFLYVKLTRLR---VPLPPGPWGVPLLGNLPFLDPDLHTY 118
            S+++Q    +++     ALL FL++ + + +    PLPPGP  +PLLGNL  LDP+LHTY
Sbjct: 20   SNIEQKGVFYSYVLGIVALLWFLWLFINKSKKGLPPLPPGPKALPLLGNLHSLDPELHTY 79

Query: 119  FAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDI 178
            FA L + YG I +L LG ++GII+ S ++AREVLKD D  FANRDVP AGR A+YGG+DI
Sbjct: 80   FASLSQTYGSICRLWLGKKLGIIITSPALAREVLKDQDTIFANRDVPAAGREATYGGTDI 139

Query: 179  VWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFL 238
            VWTPYGP+WRMLRKVCV +MLS +TLDSVY LRRRE+R T+ + Y Q G  VN+GEQ FL
Sbjct: 140  VWTPYGPKWRMLRKVCVREMLSGSTLDSVYALRRRELRQTINYFYSQAGLPVNIGEQMFL 199

Query: 239  TVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEK 298
            TVFNV+TSMLWGG+V+GE+R  L AEFR  ++++TELLG PN+SDF+P LARFDLQG+ K
Sbjct: 200  TVFNVITSMLWGGTVKGEERASLGAEFRHVVTKMTELLGTPNLSDFYPGLARFDLQGVAK 259

Query: 299  KMREIAPRFDNIFDKMIDERLKI---GGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMT 358
            KM+ +A RFD IF+ MID+R +I   GG +     +  DFLQ LL++ D+  +K PLTM 
Sbjct: 260  KMKVLAKRFDKIFESMIDKRHEIDTNGGMETSVGQENKDFLQVLLKLKDDEAAKMPLTMP 319

Query: 359  HLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSF 418
             LKALLMDMVVGGTDT+SNT+EFA+AE++  P  L+K Q+E+  VVGEDNIVEESHI   
Sbjct: 320  ELKALLMDMVVGGTDTTSNTVEFAMAEIMNKPDVLRKLQQEIDTVVGEDNIVEESHIQHL 379

Query: 419  PYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPL 478
            PYL AVMKE LRLHP LPLLVPHCPSET+ V  Y +PKGSRVFIN WAIQRDP+ WENP 
Sbjct: 380  PYLYAVMKEVLRLHPALPLLVPHCPSETSTVGGYIVPKGSRVFINVWAIQRDPSIWENPT 439

Query: 479  VFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG 538
             F PERF    KWD+ G+D  YFPFGSGRR CAGIAMAERM MY LA+L+HSFDWKL +G
Sbjct: 440  EFHPERF-SGNKWDYSGNDLNYFPFGSGRRICAGIAMAERMFMYSLASLIHSFDWKLPEG 499

Query: 539  KKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLL 598
            + +++ EKFGIVLK K PLV +                  +  T + +S    P     L
Sbjct: 500  ETLDLTEKFGIVLKKKMPLVAI-------------PTPRLSNPTLYDNSNKGLPPGPKPL 559

Query: 599  HLIDSLNSFFP-----WRSIDQS-GFLFIFFATLLIFLYVFQP--TREVLKDHDVTLANR 658
             LI +L S  P     + S+ Q+ G +   +    + + +  P    EVLKD D   ANR
Sbjct: 560  PLIGNLLSLDPQLHTYFASLSQTYGPICRLWLGKKLGIIITSPALASEVLKDQDTIFANR 619

Query: 659  DVPKAGRVASYGGCNIVWTRMADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVG 718
            DV  AGR  +YGG +IVWT           W + M                +AGSPVN+G
Sbjct: 620  DVSVAGREVTYGGTDIVWTPYGP------KWRMLM----------------KAGSPVNIG 679

Query: 719  EHGFLMIFKVVTSMLWGGSVEGEQRYSVAAE--------------------------FDL 778
            E  FL    V+TSMLWGG+V+GE+R S+ AE                          FDL
Sbjct: 680  EQMFLTALNVITSMLWGGTVKGEERASLGAEVRDVVTKMNELLVTPNLSDFYPGLAWFDL 739

Query: 779  QGIEKQMRELASRFDNIFEKMIDQG-------------------LKIDD----------- 838
            QG++K+M+ LA R+DNIF  MIDQ                    LK+ D           
Sbjct: 740  QGVKKKMKVLAKRYDNIFGSMIDQRQQMNRNGVGQESKDFLQVLLKLKDEADPKMPLTMT 799

Query: 839  ----------------------------------------------GEEDESETSSLRSS 898
                                                          G+++  E S ++  
Sbjct: 800  EIKALLTDMVVGGTETSTNTVEFAMTEIMNKPDVLRKLQQELDTIVGKDNIMEVSHIQHE 859

Query: 899  LV--------------------------------------------LSNNRL-------- 958
             V                                             S N L        
Sbjct: 860  TVTVGGYTVPKGSSVFINVWAIHRDPSIWENPTAFHPERFMENKWDFSGNDLTYFPFGSG 919

Query: 959  ------------------------LFSSHSASTSAIAVP--------------------- 1018
                                    +FSS S       +P                     
Sbjct: 920  RRICVGLAMAERMFMTDKLYYGFFVFSSTSQKKGQPPLPPGPKALPLLGNLHSLDPQLHT 979

Query: 1019 FKSSKSRQHTPISKDRNLQALNIIIQLHFE-----KNLEKKLAIR--------------- 1078
            + +S S+ + PI +    + L +++ LH       KN +   A R               
Sbjct: 980  YFASLSQTYGPICRLWLGKKLGLLL-LHLPACKVLKNQDTIFANRDVPAACKESSYGGKD 1039

Query: 1079 ------------SPKERAPQAF-SALH----------LSGPLSSRL--KYRHCWIPIA-L 1138
                        SP     Q F +AL+          + G   SRL  ++RH    +  L
Sbjct: 1040 IVWTPYGPNGAGSPVNIGEQMFLTALNVITSMLWGGTVKGGERSRLGAEFRHVVADMTEL 1099

Query: 1139 LSISDLS-FYVSVAQ-----TVRYINGFKY-------QRHFSNEDKQISTAL-------- 1198
            L   ++S FY  +A+       + +N  +        QR   + + ++ T +        
Sbjct: 1100 LGTPNISDFYPGLARFDLQGVTKKMNVLEKRFESMIDQRQKIDRNAEMGTVMKEALRIHP 1159

Query: 1199 --------------------------LFIFVFVV---ALLWLRPEFRRPS---------- 1258
                                      +FI V+ +     +W  P    P           
Sbjct: 1160 TLPLLVPHCPSETCTVGGYTVPKGSRVFINVWTIQRDPSIWKNPTEFHPERFLDSKWDYS 1219

Query: 1259 ------------------------------------------------------------ 1318
                                                                        
Sbjct: 1220 GNDFNYFPFGSGRRTCAGRVMAERMFMYSLASLIYSFDWKLSEGKTLDLTEKFGIVLKKK 1279

Query: 1319 ---------------------LPPGPCGLPLVGYLPFLSGNLHQTFADLAQIYGPIFKLR 1378
                                 LPPGP   PL+G L  L   LH  FA L+Q YGPI +L 
Sbjct: 1280 IPLVAIPTPRLSNPKLNSNKRLPPGPKAFPLIGNLHTLDPELHTYFASLSQTYGPICRLW 1339

Query: 1379 LGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYGGADIVFSQDDGDWKK 1416
            LG KL I+++SP+        K    +T+FANRD   +    +YGG +++++     W+ 
Sbjct: 1340 LGKKLGIIITSPALAREVLKDK----DTIFANRDVPAAGSEFSYGGNNVLWTPYGPKWRM 1399
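
The percentages in an HSP header such as `Identity = 656/1762 (37.23%)` are the match count divided by the alignment length. The sketch below recomputes them from the raw counts as a sanity check. The helper name `hsp_percentages` is hypothetical, and the regular expression is written only for the header layout shown on this page.

```python
import re

# Matches "Identity = a/b ... Positives = c/d"; "Posi?tives" also
# tolerates a dropped "i" in the label.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(header):
    """Return (identity %, positives %) rounded to two decimals."""
    m = HSP_RE.search(header)
    if m is None:
        raise ValueError("no Identity/Positives counts found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


header = "Identity = 656/1762 (37.23%), Positives = 896/1762 (50.85%), Query Frame = 0"
print(hsp_percentages(header))  # (37.23, 50.85)
```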

BLAST of CcUC01G003870 vs. NCBI nr
Match: KAF4350078.1 (hypothetical protein F8388_019722 [Cannabis sativa])

HSP 1 Score: 993.0 bits (2566), Expect = 2.6e-285
Identity = 584/1363 (42.85%), Positives = 748/1363 (54.88%), Query Frame = 0

Query: 68   FTFFAA---LLIFLYVKL--TRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGP 127
            FT  AA   LL+F+Y+KL   R   PLPPGP G+PLLGNL  LDP+LHTYF  L + +GP
Sbjct: 23   FTLSAAAVVLLLFIYLKLRSNRASPPLPPGPRGLPLLGNLLSLDPELHTYFTNLAQTHGP 82

Query: 128  IVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWR 187
            I+KLQLGN++GI++ S S+AREVLKD+DV FANRDVP AGR  +YGG DI WTPYGPEWR
Sbjct: 83   ILKLQLGNKVGIVITSPSLAREVLKDNDVVFANRDVPVAGRLITYGGYDITWTPYGPEWR 142

Query: 188  MLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSML 247
            MLRKVCVLKMLSN TLDSVYELRRREVR TV + Y + GS V VGEQ FLT+ NV+T+ML
Sbjct: 143  MLRKVCVLKMLSNTTLDSVYELRRREVRKTVGYFYSRVGSPVGVGEQMFLTILNVITNML 202

Query: 248  WGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFD 307
            WGGSVEGE+RD L +EFR+ ISEITELLGKPN+SDF+P LARFDLQG+ K+M ++A RFD
Sbjct: 203  WGGSVEGEERDKLGSEFRQIISEITELLGKPNVSDFYPGLARFDLQGVGKQMTKLALRFD 262

Query: 308  NIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGG 367
             +F+ +I +RLK       G  ++ DFLQ+LLE+ +E +SKTPLT+  +KALL DM+VGG
Sbjct: 263  MMFENLIAQRLK-------GVSERRDFLQYLLELKEEVDSKTPLTLIQIKALLTDMIVGG 322

Query: 368  TDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRL 427
            +DTSSNTIEFA+AE++  P+ LKKAQ+EL AV+G+DNIVEES IH  PYL+AVMKETLRL
Sbjct: 323  SDTSSNTIEFAMAEIMNQPEILKKAQKELEAVIGKDNIVEESDIHKLPYLQAVMKETLRL 382

Query: 428  HPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKW 487
            HP+LPLLVPHCPSE+  V  Y IPKGSR+FIN WA QRDP+ WENPL FDP+RFL   KW
Sbjct: 383  HPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQRDPSIWENPLKFDPDRFLNDSKW 442

Query: 488  DFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVL 547
            DF GSDF Y PFGSGRR CAGIAMAER V+Y LATLLHSF+W+L  G+KM++EEKFGIVL
Sbjct: 443  DFSGSDFNYIPFGSGRRICAGIAMAERTVVYSLATLLHSFNWELPRGEKMDLEEKFGIVL 502

Query: 548  KMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWR 607
            K K PLV +                               PT                 R
Sbjct: 503  KKKIPLVAI-------------------------------PTP----------------R 562

Query: 608  SIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRM 667
              D S    ++  TLL FL                                         
Sbjct: 563  FSDPS----LYDTTLLNFL----------------------------------------K 622

Query: 668  ADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVE 727
             +VD T + +TL  ++VV                        FL++              
Sbjct: 623  HNVDLTRVCFTLSAAAVV------------------------FLLLI------------- 682

Query: 728  GEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVL 787
                                                                        
Sbjct: 683  ------------------------------------------------------------ 742

Query: 788  SNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIR 847
                                                                        
Sbjct: 743  ------------------------------------------------------------ 802

Query: 848  SPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQR 907
                                                                        
Sbjct: 803  ------------------------------------------------------------ 862

Query: 908  HFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFA 967
            H     K++S                      P LPPGP GLPL+G L  L   LH  F 
Sbjct: 863  HLKLRGKRVS----------------------PPLPPGPRGLPLLGNLLSLDPELHSYFR 922

Query: 968  DLAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYGGA 1027
            DLAQ +GPI KL+LG K+ IV++SPS     A +     + VFANRD  V+  +ATYGG 
Sbjct: 923  DLAQTHGPILKLQLGNKVGIVITSPS----LAREVLRENDVVFANRDVPVAGRIATYGGY 982

Query: 1028 DIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGKLG 1087
            DIV++    +W+ LRK+   KMLS + LD  Y LRR+EVR+ +       G+P+++G+  
Sbjct: 983  DIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSRVGSPVNVGEQM 1038

Query: 1088 FFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFDLQ 1147
            F      +  M WGG+  + G +   L  +FR++V EM  L+  PNLSD +P L RFDLQ
Sbjct: 1043 FLTILNVITNMLWGGA--VEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFYPGLARFDLQ 1038

Query: 1148 GIARKMKKVMNVCDEILNS----------AIEEQRKMGGNGVERRGFLQFLLKVRDGEDR 1207
            G+ ++M K+    D +              I ++R   G G + R FLQ+LL++++  D 
Sbjct: 1103 GVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGG-QNRDFLQYLLELKEEVDS 1038

Query: 1208 SESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQMVE 1267
               +T   +KALLMD+++GG+D++S TIE+A+ E++ QP I+ +  +EL  V+G + +VE
Sbjct: 1163 KTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAVIGKDNIVE 1038

Query: 1268 EFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQRDP 1327
            E  + KL YL AV+KE LRLHP L LLVP   +++  +GGYTIPKGS I+ N+WA QRDP
Sbjct: 1223 ESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQRDP 1038

Query: 1328 KVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILASLL 1387
             +W+NPL F PERFLN S    +DF+G+   + PFGSG+++CAGI +AER+++  LA+LL
Sbjct: 1283 SIWENPLKFDPERFLNNSK---WDFSGSDFNYIPFGSGRRICAGIAMAERMVMYSLATLL 1038

Query: 1388 HAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
            H+F WEL  G K+DL EKFGIV KK  PLVAI  PR+ +  LY
Sbjct: 1343 HSFNWELPRGEKMDLAEKFGIVLKKKIPLVAIPTPRLLDRSLY 1038

BLAST of CcUC01G003870 vs. NCBI nr
Match: KAF4360438.1 (hypothetical protein F8388_001909 [Cannabis sativa])

HSP 1 Score: 986.9 bits (2550), Expect = 1.8e-283
Identity = 586/1365 (42.93%), Positives = 750/1365 (54.95%), Query Frame = 0

Query: 68   FTFFAA---LLIFLYVKL--TRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGP 127
            FT  AA   LL+F+Y+KL   R   PLPPGP G+PLLGNL  LDP+LHTYF  L + +GP
Sbjct: 23   FTLSAAAVVLLLFIYLKLRSNRASPPLPPGPRGLPLLGNLLSLDPELHTYFTNLAQTHGP 82

Query: 128  IVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWR 187
            I+KLQLGN++GI++ S S+AREVLKD+DV FANRDVP AGR  +YGG DI WTPYGPEWR
Sbjct: 83   ILKLQLGNKVGIVITSPSLAREVLKDNDVVFANRDVPVAGRLITYGGYDITWTPYGPEWR 142

Query: 188  MLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSML 247
            MLRKVCVLKMLSN TLDSVYELRRREVR TV + Y + GS V VGEQ FLT+ NV+T+ML
Sbjct: 143  MLRKVCVLKMLSNTTLDSVYELRRREVRKTVGYFYSRVGSPVGVGEQMFLTILNVITNML 202

Query: 248  WGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFD 307
            WGGSVEGE+RD L +EFR+ ISEITELLGKPN+SDF+P LARFDLQG+ K+M ++A RFD
Sbjct: 203  WGGSVEGEERDKLGSEFRQIISEITELLGKPNVSDFYPGLARFDLQGVGKQMTKLALRFD 262

Query: 308  NIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGG 367
             +F+ +I +RLK       G  ++ DFLQ+LLE+ +E +SKTPLT+  +KALL DM+VGG
Sbjct: 263  MMFENLIAQRLK-------GVSERRDFLQYLLELKEEVDSKTPLTIIQIKALLTDMIVGG 322

Query: 368  TDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRL 427
            +DTSSNTIEFA+AE++  P+ LKKAQ+EL AV+G+DNIVEES I   PYL+AVMKETLRL
Sbjct: 323  SDTSSNTIEFAMAEIMNQPEILKKAQKELEAVIGKDNIVEESDIQKLPYLQAVMKETLRL 382

Query: 428  HPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKW 487
            HP+LPLLVPHCPSE+ IV  Y IPKGSR+FIN WA QRDP+ WENPL FDP+RFL   KW
Sbjct: 383  HPVLPLLVPHCPSESCIVGGYTIPKGSRIFINVWATQRDPSIWENPLKFDPDRFLNDSKW 442

Query: 488  DFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVL 547
            DF GSDF Y PFGSGRR CAGIAMAER V+Y LATLLHSF+W+L  G+KM++EEKFGIVL
Sbjct: 443  DFSGSDFNYIPFGSGRRICAGIAMAERTVVYSLATLLHSFNWELPHGEKMDLEEKFGIVL 502

Query: 548  KMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWR 607
            K K PLV +                               PT                 R
Sbjct: 503  KKKIPLVAI-------------------------------PTP----------------R 562

Query: 608  SIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRM 667
              D S    ++  TLL F           KD                             
Sbjct: 563  FSDPS----LYDTTLLNF----------FKD----------------------------- 622

Query: 668  ADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVE 727
             +VD T + +TL  ++VV                        FL++              
Sbjct: 623  -NVDLTRVCFTLSAAAVV------------------------FLLL-------------- 682

Query: 728  GEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVL 787
                                                                        
Sbjct: 683  ------------------------------------------------------------ 742

Query: 788  SNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIR 847
                                                                        
Sbjct: 743  ------------------------------------------------------------ 802

Query: 848  SPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQR 907
                        +HL                                             
Sbjct: 803  ------------IHLK-------------------------------------------- 862

Query: 908  HFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFA 967
                                     LR +   P LPPGP GLPL+G L  L   LH  F 
Sbjct: 863  -------------------------LRGKRASPPLPPGPRGLPLLGNLLSLDPELHSYFR 922

Query: 968  DLA--QIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYG 1027
            DLA  Q +GPI KL+LG K+ IV++SPS     A +     + VFANRD  V+  +ATYG
Sbjct: 923  DLAQSQTHGPILKLQLGNKVGIVITSPS----LAREVLRENDVVFANRDVPVAGRIATYG 982

Query: 1028 GADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGK 1087
            G DIV++    +W+ LRK+   KMLS + LD  Y LRR+EVR+ +       G+P+++G+
Sbjct: 983  GYDIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSRVGSPVNVGE 1040

Query: 1088 LGFFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFD 1147
              F      +  M WGG+  + G +   L  +FR++V EM  L+  PNLSD +P L RFD
Sbjct: 1043 QMFLTILNVITNMLWGGA--VEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFYPGLARFD 1040

Query: 1148 LQGIARKMKKVMNVCDEILNS----------AIEEQRKMGGNGVERRGFLQFLLKVRDGE 1207
            LQG+ ++M K+    D +              I ++R   G G + R FLQ+LL++++  
Sbjct: 1103 LQGVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGG-QNRDFLQYLLELKEEV 1040

Query: 1208 DRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQM 1267
            D    +T   +KALLMD+++GG+D++S TIE+A+ E++ QP I+ +  +EL  V+G + +
Sbjct: 1163 DSKTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAVIGKDNI 1040

Query: 1268 VEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQR 1327
            VEE  + KL YL AV+KE LRLHP L LLVP   +++  +GGYTIPKGS I+ N+WA QR
Sbjct: 1223 VEESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQR 1040

Query: 1328 DPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILAS 1387
            DP +W+NPL F PERFLN S    +DF+G+   + PFGSG+++CAGI +AER+++  LA+
Sbjct: 1283 DPSIWENPLKFDPERFLNNSK---WDFSGSDFNYIPFGSGRRICAGIAMAERMVMYSLAT 1040

Query: 1388 LLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
            LLH+F WEL  G K+DL EKFGIV KK  PLVAI  PR+ +  LY
Sbjct: 1343 LLHSFNWELPRGEKMDLAEKFGIVLKKKIPLVAIPTPRLLDRSLY 1040

BLAST of CcUC01G003870 vs. NCBI nr
Match: KAF4404053.1 (hypothetical protein G4B88_014509 [Cannabis sativa])

HSP 1 Score: 975.7 bits (2521), Expect = 4.3e-280
Identity = 580/1387 (41.82%), Positives = 768/1387 (55.37%), Query Frame = 0

Query: 45   PTSNSPLLLINSLNSSVDQSPFLFTFFAALLIFLYVKLTRLR-----VPLPPGPWGVPLL 104
            PT  S  LL N L  +VD +   FT  AA ++FL +   +LR      PLPPGP G+PLL
Sbjct: 3    PTITSTTLL-NFLKDNVDLTRVCFTLSAAAVVFLLLIHLKLRGKRASPPLPPGPRGLPLL 62

Query: 105  GNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVP 164
            GNL  LDP+LH+YF +L + +GPI+KLQLGN++GI++ S S+AREVL+++DV FANRDVP
Sbjct: 63   GNLLSLDPELHSYFRDLAQTHGPILKLQLGNKVGIVITSPSLAREVLRENDVVFANRDVP 122

Query: 165  QAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQ 224
             AGR A+YGG DIVWTPYGPEWRMLRKVCVLKMLSN TLDSVYELRRREVR TV + Y +
Sbjct: 123  VAGRIATYGGYDIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSR 182

Query: 225  CGSAVNVGEQGFLTVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFF 284
             GS VNVGEQ FLT+ NV+T+MLWGG+VEG++RD L AEFR+ +SE+T+LLGKPN+SDF+
Sbjct: 183  VGSPVNVGEQMFLTILNVITNMLWGGAVEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFY 242

Query: 285  PSLARFDLQGIEKKMREIAPRFDNIFDKMIDER----LKI-GGGDDDGSVKKNDFLQFLL 344
            P LARFDLQG+ K+M ++A RFD +F+K+I +R    LKI    D  G  +  DFLQ+LL
Sbjct: 243  PGLARFDLQGVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGGQNRDFLQYLL 302

Query: 345  EVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAV 404
            E+ +E +SKTPLT+TH+KALLMDMVVGG+DTSSNTIEFA+AE++  P+ LK+AQ+EL AV
Sbjct: 303  ELKEEVDSKTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAV 362

Query: 405  VGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFIN 464
            +G+DNIVEES IH  PYL+AVMKETLRLHP+LPLLVPHCPSE+  V  Y IPKGSR+FIN
Sbjct: 363  IGKDNIVEESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFIN 422

Query: 465  AWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYL 524
             WA QRDP+ WENPL FDPERFL   KWDF GSDF Y PFGSGRR CAGIAMAERMVMY 
Sbjct: 423  VWATQRDPSIWENPLKFDPERFLNNSKWDFSGSDFNYIPFGSGRRICAGIAMAERMVMYS 482

Query: 525  LATLLHSFDWKLEDGKKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTF 584
            LATLLHSF+W+L  G+KM++ EKFGIVLK K PLV +                       
Sbjct: 483  LATLLHSFNWELPRGEKMDLAEKFGIVLKKKIPLVAI----------------------- 542

Query: 585  FLHSAMEQPTSNSLLHLIDSLNSFFPWRSIDQSGFLFIFFATLLIFLYVFQPTREVLKDH 644
                                                               PT  +   H
Sbjct: 543  ---------------------------------------------------PTPSIYTSH 602

Query: 645  DVTLANRDVPKAGRVASYGGCNIVWTRMADVDETMLVWTLFMSSVVQRLEILLAHLYQQA 704
                                                                        
Sbjct: 603  ------------------------------------------------------------ 662

Query: 705  GSPVNVGEHGFLMIFKVVTSMLWGGSVEGEQRYSVAAEFDLQGIEKQMRELASRFDNIFE 764
               + + +H       ++T++                                       
Sbjct: 663  ---LRIPKH------TMITTL--------------------------------------- 722

Query: 765  KMIDQGLKIDDGEEDESETSSLRSSLVLSNNRLLFSSHSASTSAIAVPFKSSKSRQHTPI 824
                            +ETS+L S         LF + SA                    
Sbjct: 723  ----------------AETSNLTS---------LFLTLSA-------------------- 782

Query: 825  SKDRNLQALNIIIQLHFEKNLEKKLAIRSPKERAPQAFSALHLSGPLSSRLKYRHCWIPI 884
                      III L                                             
Sbjct: 783  ----------IIITL--------------------------------------------- 842

Query: 885  ALLSISDLSFYVSVAQTVRYINGFKYQRHFSNEDKQISTALLFIFVFVVALLWLRPEFRR 944
                             + YI   K  ++ S                             
Sbjct: 843  ----------------IIWYIKSKKLNQYSS----------------------------- 902

Query: 945  PSLPPGPCGLPLVGYLPFLSGNLHQTFADLAQIYGPIFKLRLGTKLCIVLSSPSSINPSA 1004
              LPPGP G P+VG L  L  +LH  F  LA  YGPI KLRLG+KL I+++SP+  +   
Sbjct: 903  -PLPPGPRGFPVVGSLLTLKPDLHSYFKSLAHTYGPILKLRLGSKLKIIITSPTLAHQVL 962

Query: 1005 TKKPFHQETVFANRDSTVSSLLATYGGADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESY 1064
             +     + VFANRD    +  + YG  +IV++    +W+ LRK+   KMLS ++LD   
Sbjct: 963  KE----NDIVFANRDVLSITRSSVYGITNIVWTPYGPEWRMLRKVCVLKMLSNASLDSVC 1022

Query: 1065 PLRRQEVRKVIKGVLESA---GTPIDIGKLGFFAAAKSVMAMTWGGSGGMIGVDGAELED 1124
             +RR+ VR+ +  +        +P+++G+  FF     +M M WGG+  +   +GA +  
Sbjct: 1023 SVRRRVVRQGVNQLYNRVVLDQSPVEVGEHVFFTILNLIMNMLWGGTVDV--EEGASVGA 1050

Query: 1125 KFREVVDEMMVLIASPNLSDLFPVLGRFDLQGIARKMKKVMNVCDEILNSAIEEQRKM-- 1184
            + RE+++ +  L++ PN+SD FPVL  FDLQGI++K ++++   D I+N  I  + K+  
Sbjct: 1083 EIREIINGISELMSKPNVSDFFPVLAWFDLQGISKKNRQLIQSFDRIVNRMISTRMKIEE 1050

Query: 1185 -GGNGVERRGFLQFLLKVRDGEDRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELI 1244
             GGNG     FLQFLL+++D ED    +T   +K+LL+D++ GGTD+++ TIE+A+ E+I
Sbjct: 1143 NGGNG--NNDFLQFLLRLKDEEDSKTPLTLTHVKSLLLDMVAGGTDTSTNTIEFAMAEII 1050

Query: 1245 QQPNIMMKVMEELTKVVGLNQMVEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTS 1304
              P++M K  +EL  V+G + +VEE H+SKL YL AV+KE LRLHP L LL P   ++T 
Sbjct: 1203 NAPDVMNKAQKELEDVIGKDNIVEESHISKLPYLQAVMKETLRLHPSLPLLAPHSPSETC 1050

Query: 1305 ILGGYTIPKGSTIYFNMWAIQRDPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFG 1364
            ++GGYT+PKG  +  N+WAI RDP  W++PL F PERFL +S    +DFTG   ++ PFG
Sbjct: 1263 VVGGYTVPKGCGVIINVWAIHRDPFNWEDPLKFDPERFLKDS--PKWDFTGTDFKYFPFG 1050

Query: 1365 SGKKLCAGIPLAERLLVLILASLLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPR 1416
            SG++ CAGI +AER+++  LA+LLH+F+W++  G +LDL EKFG+V K   PLVAI  PR
Sbjct: 1323 SGRRNCAGIAMAERMVMYSLATLLHSFDWKIPHGQQLDLSEKFGLVMKMKIPLVAIPTPR 1050

BLAST of CcUC01G003870 vs. NCBI nr
Match: KAG9145073.1 (hypothetical protein Leryth_018361 [Lithospermum erythrorhizon])

HSP 1 Score: 954.5 bits (2466), Expect = 1.0e-273
Identity = 557/1369 (40.69%), Positives = 760/1369 (55.51%), Query Frame = 0

Query: 56   SLNSSVDQSPFLFT----FFAAL--LIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDL 115
            S NS+    PF  T     FA +  L  L++K  + + PLPPGP G+PL+GNL  LDP+L
Sbjct: 19   SSNSNDQLGPFHVTLAIGLFAVIWTLWVLFIKSNKSQPPLPPGPRGLPLVGNLLSLDPEL 78

Query: 116  HTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGG 175
            H YFA+L + YGPI+ L+LG ++GI+++S ++AREVLKD D  FANRDVP AG+ A+YGG
Sbjct: 79   HVYFAQLSQSYGPILTLRLGGKLGIVISSPAIAREVLKDQDTIFANRDVPVAGKEATYGG 138

Query: 176  SDIVWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQ 235
            +DIVWTPYGPEWRMLRKVCV +ML N +LDSVY+LRR+EVR T+ + Y Q GS VN+GEQ
Sbjct: 139  TDIVWTPYGPEWRMLRKVCVREMLCNTSLDSVYDLRRKEVRQTIKYFYSQAGSPVNIGEQ 198

Query: 236  GFLTVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQG 295
             FLTV NV+T+MLWGG+++GE+R  L AEFRE ++E+T LLG PN+SDF+PSLA FDLQG
Sbjct: 199  MFLTVLNVITNMLWGGTMKGEERSNLGAEFREVVAEMTALLGFPNLSDFYPSLAMFDLQG 258

Query: 296  IEKKMREIAPRFDNIFDKMIDERLKIGGGDDDGSVKKN--DFLQFLLEVNDEGESKTPLT 355
            ++KKM+++  RFD IF+ MI++RLK+ G D  G+  K   DFLQFLL++ D+ ++K PLT
Sbjct: 259  VKKKMKKLVHRFDVIFETMIEQRLKMDGVDGKGNDGKESIDFLQFLLKLKDQQDAKIPLT 318

Query: 356  MTHLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIH 415
            M H+KALLMDMVVGGTDT+SN++EF +AEM+  P  L+K Q EL  VVG+ NIVEE HIH
Sbjct: 319  MIHVKALLMDMVVGGTDTTSNSMEFVMAEMMNKPDILRKVQFELETVVGKANIVEEFHIH 378

Query: 416  SFPYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWEN 475
              PYL AVMKE LRLHP+LPLLVPHCPSETT V+ Y IPKG+R+F N WAI RDP  WEN
Sbjct: 379  KLPYLYAVMKEILRLHPVLPLLVPHCPSETTTVAGYTIPKGARIFFNVWAIHRDPTIWEN 438

Query: 476  PLVFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLE 535
            PL FDP+RFL + K D+ G+DF YFPFGSGRR CAG AMAERM M+ +A+L+HSF+W L 
Sbjct: 439  PLEFDPDRFLNS-KCDYSGNDFAYFPFGSGRRICAGTAMAERMFMFSVASLIHSFEWSLP 498

Query: 536  DGKKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNS 595
            +   +++ EKFGIVLK + PL+                              +  PT   
Sbjct: 499  ERDTLDLSEKFGIVLKKRIPLI------------------------VIPTPRLSSPTMYE 558

Query: 596  LLHLIDSLNSFFPWRSIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAG 655
               L+    SF P                                   V + NR +    
Sbjct: 559  FKALVGMFPSFVP-----------------------------------VRMHNRTI---- 618

Query: 656  RVASYGGCNIVWTRMADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLM 715
                                                     H Y              + 
Sbjct: 619  ----------------------------------------QHKYH-------------IQ 678

Query: 716  IFKVVTSMLWGGSVEGEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGE 775
              K   S+ W                                                  
Sbjct: 679  NPKAKISIFWS------------------------------------------------- 738

Query: 776  EDESETSSLRSSLVLSNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIII 835
                                                K   +   TP+SK           
Sbjct: 739  -----------------------------------TKECTNYATTPLSK----------- 798

Query: 836  QLHFEKNLEKKLAIRSPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVS 895
                                   A+S    +   SSR +Y   WI               
Sbjct: 799  -----------------------AWSWWWQTS--SSRDEYA-LWI--------------- 858

Query: 896  VAQTVRYINGFKYQRHFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLV 955
                   +N               ST   F F +   +   + +  +P LPPGP GLPL+
Sbjct: 859  -------LN---------------STICFFFFAWFAKININKSKKIKPLLPPGPKGLPLI 918

Query: 956  GYLPFLSGNLHQTFADLAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFAN 1015
            G L  L  +LH+ F  ++QIYGPIF LRLG K  +V++SP+     A +    Q+  FAN
Sbjct: 919  GNLLSLEPDLHKYFFKMSQIYGPIFSLRLGKKFGVVITSPA----LARQVLKDQDATFAN 978

Query: 1016 RDSTVSSLLATYGGADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKG 1075
             D   ++   TYGG DIV++    +W+ LR++ + +ML+ + LDE Y  RR+E+R+ +  
Sbjct: 979  HDVPEAAKEVTYGGMDIVWTPYGPEWRMLRRVCASQMLNSAVLDEVYKHRRKEIRRTVNY 1038

Query: 1076 VLESAGTPIDIGKLGFFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASP 1135
                AG+P++ G+  F      +  M WGG   + G +   L  +FR++V E+  L+  P
Sbjct: 1039 FYNQAGSPVNFGEQMFLTVFNVITNMLWGGI--IEGGERESLATEFRQLVAEITALLGMP 1098

Query: 1136 NLSDLFPVLGRFDLQGIARKMKKVMNVCDEILNSAIEEQRKMGGNGVE-RRGFLQFLLKV 1195
            N+SD +P L RFDLQGI +KM+++    D I  + I E+ ++   G +  + FLQ LL++
Sbjct: 1099 NISDFYPSLARFDLQGIQKKMRRLALKFDGIFENLINERLQINRQGGKVSKDFLQVLLEL 1103

Query: 1196 RDGEDRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVG 1255
            ++ E+     T + LK LLMD+++GGTD+TS T+E+A+ E++ +P+I+ K+ +EL  VVG
Sbjct: 1159 KNEENSKVPFTMSHLKGLLMDMVVGGTDTTSNTMEFAMAEVMNKPDILRKLQQELELVVG 1103

Query: 1256 LNQMVEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMW 1315
               +VEE H++KL YL AV+KE LRLHP + LLVP + TQTSI+GGY IP+GS ++FN+W
Sbjct: 1219 KGNIVEESHVTKLPYLYAVMKETLRLHPVVPLLVPHRPTQTSIVGGYVIPEGSRVFFNVW 1103

Query: 1316 AIQRDPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVL 1375
            AIQRD K WD PL F PERFLN    ++  ++GN   + PFGSG+++CAG+ +AE++ + 
Sbjct: 1279 AIQRDEKSWDKPLEFRPERFLNL---KIDHYSGNEFNYFPFGSGRRVCAGVAMAEKMFMF 1103

Query: 1376 ILASLLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
            ++ASL+H+FEW+L +   +DL EKFGIV KK  PLVAI +PR S L  Y
Sbjct: 1339 LMASLIHSFEWKLPQEGVIDLTEKFGIVLKKKVPLVAIPIPRSSVLAFY 1103

BLAST of CcUC01G003870 vs. ExPASy Swiss-Prot
Match: A0A4D6Q415 (Flavonoid 3'-monooxygenase CYP75B137 OS=Crocosmia x crocosmiiflora OX=1053288 GN=CYP75B137 PE=1 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 1.1e-97
Identity = 195/501 (38.92%), Positives = 294/501 (58.68%), Query Frame = 0

Query: 66  FLFTFFAALLI------FLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRK 125
           F F + + LL+       LY + +    PLPPGP G P+LGNLP L    H     L ++
Sbjct: 4   FFFLWISTLLLSSFIVYLLYRRRSAQCPPLPPGPNGWPILGNLPQLGAKPHQTLDALSKQ 63

Query: 126 YGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGP 185
           YGP+ +L+LG+   ++ +SS+VA + L+ HDV F+NR         +Y   D+V+ PYGP
Sbjct: 64  YGPLFRLRLGSVNVVVASSSAVAAQFLRTHDVNFSNRPPNSGAEHVAYNYQDLVFAPYGP 123

Query: 186 EWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQC--GSAVNVGEQGFLTVFNV 245
            WRMLRK+C + + S   LD +  +R+ EV   V +L R    G  VN+G+   +   N 
Sbjct: 124 RWRMLRKLCSVHLFSLKALDDLRPVRQGEVACLVRNLRRHADTGVLVNLGKALNVCATNA 183

Query: 246 VTSMLWGGSVEGEQRDGLAA--EFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMR 305
           +   + G  V  ++   LA   EF+E + E+  L G  N+ DF P L   DLQG+  KM+
Sbjct: 184 LARAMLGRRVFADEDAQLAEADEFKEMVVELMRLAGVFNVGDFVPGLGWLDLQGVVGKMK 243

Query: 306 EIAPRFDNIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVND-EGESKTPLTMTHLKAL 365
            +  R+D   D++I+E        +  + K  D L  L+ + + + E +  L  T +KAL
Sbjct: 244 RLHRRYDAFLDRVIEE--------NQANAKSGDLLSVLIRLKEADAEGEIKLNNTDIKAL 303

Query: 366 LMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKA 425
           L+++   GTDTSS+T+E+ LAE+I++P  L+K Q EL +V+G D +V ES + + PYL+A
Sbjct: 304 LLNLFTAGTDTSSSTVEWVLAELIRHPDILQKTQHELDSVIGRDRLVAESDLPNLPYLQA 363

Query: 426 VMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPE 485
           V+KET RLHP  PL +P   SE  IV+ Y IPK + + +N W+I RD   W +PL F P 
Sbjct: 364 VVKETFRLHPSTPLSLPRMASEECIVNGYKIPKHATLLVNVWSIGRDAAVWNDPLEFRPS 423

Query: 486 RFL---KTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG-- 545
           RFL   + E  D  G+DF   PFG+GRR CAG+++  RMV ++ AT++H++DW L  G  
Sbjct: 424 RFLPGGEREHVDVKGNDFEVIPFGAGRRICAGLSLGLRMVQFMTATIVHAYDWSLPKGQE 483

Query: 546 -KKMEVEEKFGIVLKMKSPLV 550
            +K+++EE +G+ L+   PL+
Sbjct: 484 CQKLDMEEAYGLTLQRAVPLM 496

BLAST of CcUC01G003870 vs. ExPASy Swiss-Prot
Match: Q8VWZ7 (Geraniol 8-hydroxylase OS=Catharanthus roseus OX=4058 GN=CYP76B6 PE=1 SV=1)

HSP 1 Score: 358.6 bits (919), Expect = 3.2e-97
Identity = 196/487 (40.25%), Positives = 290/487 (59.55%), Query Frame = 0

Query: 67  LFTFFAALLIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKL 126
           L   FA  L   +  L+R    LPPGP  +P +G+L  L    H   A+L +K+GPI+ L
Sbjct: 8   LTLLFALTLYEAFSYLSRRTKNLPPGPSPLPFIGSLHLLGDQPHKSLAKLSKKHGPIMSL 67

Query: 127 QLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLRK 186
           +LG    I+++SS++A+EVL+  D+ F++R VP A  A +     +VW P    WR LRK
Sbjct: 68  KLGQITTIVISSSTMAKEVLQKQDLAFSSRSVPNALHAHNQFKFSVVWLPVASRWRSLRK 127

Query: 187 VCVLKMLSNATLDSVYELRRREVRNTVAHLYR--QCGSAVNVGEQGFLTVFNVVTSMLWG 246
           V    + S   LD+   LR R+V+  +A+  +  Q G AV+VG   F T  N+++++++ 
Sbjct: 128 VLNSNIFSGNRLDANQHLRTRKVQELIAYCRKNSQSGEAVDVGRAAFRTSLNLLSNLIFS 187

Query: 247 GSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNI 306
             +     D  A EF++ +  I    GKPN+ DFFP L + D QGI  +M         +
Sbjct: 188 KDLTDPYSDS-AKEFKDLVWNIMVEAGKPNLVDFFPLLEKVDPQGIRHRMTIHFGEVLKL 247

Query: 307 FDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGTD 366
           F  +++ERL+          +KND L  LL  +   ES   +  TH++ + +D+ V GTD
Sbjct: 248 FGGLVNERLE----QRRSKGEKNDVLDVLLTTSQ--ESPEEIDRTHIERMCLDLFVAGTD 307

Query: 367 TSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHP 426
           T+S+T+E+A++EM+KNP  +KK Q+EL  V+G    +EES I+  PYL+ VMKETLR+HP
Sbjct: 308 TTSSTLEWAMSEMLKNPDKMKKTQDELAQVIGRGKTIEESDINRLPYLRCVMKETLRIHP 367

Query: 427 ILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWDF 486
            +P L+P    ++  V  Y +PKGS+V +NAWAI RD   W++ L F PERF+++E  D 
Sbjct: 368 PVPFLIPRKVEQSVEVCGYNVPKGSQVLVNAWAIGRDETVWDDALAFKPERFMESE-LDI 427

Query: 487 GGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG---KKMEVEEKFGIV 546
            G DF   PFG+GRR C G+ +A R V  +L +LL+SF+WKLE G   K +++EEKFGI 
Sbjct: 428 RGRDFELIPFGAGRRICPGLPLALRTVPLMLGSLLNSFNWKLEGGMAPKDLDMEEKFGIT 486

Query: 547 LKMKSPL 549
           L+   PL
Sbjct: 488 LQKAHPL 486

BLAST of CcUC01G003870 vs. ExPASy Swiss-Prot
Match: D1MI46 (Geraniol 8-hydroxylase OS=Swertia mussotii OX=137888 GN=CYP76B10 PE=1 SV=1)

HSP 1 Score: 355.9 bits (912), Expect = 2.1e-96
Identity = 195/488 (39.96%), Positives = 286/488 (58.61%), Query Frame = 0

Query: 66  FLFTFFAALLIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVK 125
           F  T + AL  F     +R    LPPGP  +PL+GNL  L    H   A+L +K+GPI+ 
Sbjct: 14  FTITLYQALNFF-----SRKSKNLPPGPSPLPLIGNLHLLGDQPHKSLAKLAKKHGPIMG 73

Query: 126 LQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLR 185
           LQLG    I+V SS +A+EVL+  D+ F++R +P A  A       ++W P    WR LR
Sbjct: 74  LQLGQVTTIVVTSSGMAKEVLQKQDLAFSSRSIPNAIHAHDQYKYSVIWLPVASRWRGLR 133

Query: 186 KVCVLKMLSNATLDSVYELRRREVRNTVAHLYR--QCGSAVNVGEQGFLTVFNVVTSMLW 245
           K     M S   LD+   LR R+V+  +A+  +  Q G A++VG   F T  N++++ ++
Sbjct: 134 KALNSNMFSGNRLDANQHLRSRKVQELIAYCRKSSQTGDAIDVGRAAFRTSLNLLSNTMF 193

Query: 246 GGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDN 305
              +     D  A EF++ +  +    GKPN+ D+FP L + D QGI K+M     +   
Sbjct: 194 SKDLTDPYSDS-AKEFKDLVWNVMVEAGKPNLVDYFPLLDKVDPQGIRKRMTIHFGKILE 253

Query: 306 IFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGT 365
           +F  +IDERL+            +D L  LL  ++  ES   +  TH++ + +D+ V GT
Sbjct: 254 LFGGLIDERLQ----QKKAKGVNDDVLDVLLTTSE--ESPEEIDRTHIQRMCLDLFVAGT 313

Query: 366 DTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLH 425
           DT+S+T+E+A++EM+KNP+ +K AQ EL  V+G+   VEE+ +   PYL+  +KETLR+H
Sbjct: 314 DTTSSTLEWAMSEMLKNPEKMKAAQAELAQVIGKGKAVEEADLARLPYLRCAIKETLRIH 373

Query: 426 PILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWD 485
           P +PLL+P    +   V  Y +PK S+V +N WAI RD   W++PL F PERFL++E  +
Sbjct: 374 PPVPLLIPRRTEQEVEVCGYTVPKNSQVLVNVWAISRDDAIWKDPLSFKPERFLESE-LE 433

Query: 486 FGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG---KKMEVEEKFGI 545
             G DF   PFG+GRR C G+ +A RMV  +L +LL+SFDWKLE G   K +++EEKFGI
Sbjct: 434 MRGKDFELIPFGAGRRICPGLPLAVRMVPVMLGSLLNSFDWKLEGGIAPKDLDMEEKFGI 488

Query: 546 VLKMKSPL 549
            L+   PL
Sbjct: 494 TLQKAHPL 488

BLAST of CcUC01G003870 vs. ExPASy Swiss-Prot
Match: Q7G602 (Flavonoid 3'-monooxygenase CYP75B3 OS=Oryza sativa subsp. japonica OX=39947 GN=CYP75B3 PE=2 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 2.6e-94
Identity = 193/481 (40.12%), Positives = 274/481 (56.96%), Query Frame = 0

Query: 84  RLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAR 143
           R R PLPPGP G P+LGNLP L    H     L R+YGP+ +L+ G    ++  S+ VA 
Sbjct: 36  RKRRPLPPGPRGWPVLGNLPQLGDKPHHTMCALARQYGPLFRLRFGCAEVVVAASAPVAA 95

Query: 144 EVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSNATLDSVYE 203
           + L+ HD  F+NR         +Y   D+V+ PYG  WR LRK+C L + S   LD +  
Sbjct: 96  QFLRGHDANFSNRPPNSGAEHVAYNYQDLVFAPYGARWRALRKLCALHLFSAKALDDLRA 155

Query: 204 LRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSMLWGGSVEGEQRDGLAAEFRETI 263
           +R  EV   V +L RQ  ++V +G++  +   N +     G  V        A EF+E +
Sbjct: 156 VREGEVALMVRNLARQQAASVALGQEANVCATNTLARATIGHRVFAVDGGEGAREFKEMV 215

Query: 264 SEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIFDKMIDERLKIGGGDDDGS 323
            E+ +L G  N+ DF P+L   D QG+  KM+ +  R+DN+ +  I+ER K G   D  +
Sbjct: 216 VELMQLAGVFNVGDFVPALRWLDPQGVVAKMKRLHRRYDNMMNGFINER-KAGAQPDGVA 275

Query: 324 VKK--NDFLQFLL-------EVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFAL 383
             +  ND L  LL       +++ +GE    +T T +KALL+++   GTDT+S+T+E+AL
Sbjct: 276 AGEHGNDLLSVLLARMQEEQKLDGDGEK---ITETDIKALLLNLFTAGTDTTSSTVEWAL 335

Query: 384 AEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHCP 443
           AE+I++P  LK+AQ EL  VVG   +V ES +   PYL AV+KET RLHP  PL +P   
Sbjct: 336 AELIRHPDVLKEAQHELDTVVGRGRLVSESDLPRLPYLTAVIKETFRLHPSTPLSLPREA 395

Query: 444 SETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKW---DFGGSDFRY 503
           +E   V  Y IPKG+ + +N WAI RDP  W +PL + P RFL        D  G+DF  
Sbjct: 396 AEECEVDGYRIPKGATLLVNVWAIARDPTQWPDPLQYQPSRFLPGRMHADVDVKGADFGL 455

Query: 504 FPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG---KKMEVEEKFGIVLKMKSPL 550
            PFG+GRR CAG++   RMV  + ATL+H FDW L +G    K+ +EE +G+ L+   PL
Sbjct: 456 IPFGAGRRICAGLSWGLRMVTLMTATLVHGFDWTLANGATPDKLNMEEAYGLTLQRAVPL 512

BLAST of CcUC01G003870 vs. ExPASy Swiss-Prot
Match: A0A1D8QMG4 (Carnosic acid synthase OS=Rosmarinus officinalis OX=39367 GN=CYP76AK8 PE=1 SV=1)

HSP 1 Score: 348.2 bits (892), Expect = 4.4e-94
Identity = 186/470 (39.57%), Positives = 285/470 (60.64%), Query Frame = 0

Query: 89  LPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKD 148
           LPPGP  +P++GN+  L  D H   A+L + YGP++ L+LGN+  ++V+S  +ARE+L+ 
Sbjct: 35  LPPGPPRLPIIGNILQLGRDPHKSLAQLAKTYGPLMSLKLGNQFAVVVSSPEMAREILQK 94

Query: 149 HDVTFANRDVPQAGRAASYGGSDIVWTPYGPE-WRMLRKVCVLKMLSNATLDSVYELRRR 208
             + F+    P A R   +    +   P   + W+ LR+V   ++ SN  L +  ++R+ 
Sbjct: 95  QGLIFSKPFTPSAVRVLGHNDISMNMLPASSDRWKKLRRVAREQLFSNPALQATQDIRQE 154

Query: 209 EVRNTVAHLYRQC--GSAVNVGEQGFLTVFNVVTSMLWGGSVE----GEQRDGLAAEFRE 268
            +R    +  R C  G A+NVGE  F T+ N++ + L+  SVE    G    G   +F+E
Sbjct: 155 RLRQLTDYASRCCAQGRAMNVGEATFTTMTNLMFATLF--SVELTQYGATDTGSDKKFKE 214

Query: 269 TISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIFDKMIDERLKIGGGDDD 328
            ++ +T  +G PN++DFFP LA  D QG+ +K+         +   +I +RL+    +D 
Sbjct: 215 HVNALTRYMGVPNVADFFPFLAPLDPQGMRRKLTYHLGSLLELVQSLIQQRLQ--ARNDS 274

Query: 329 GSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFALAEMIKNP 388
              KKNDFL  LL++++  E    L++  +K + +D+++ G+DTS+ T E+A+ E++ +P
Sbjct: 275 TYQKKNDFLDTLLDLSEGNE--YDLSIKEIKHMFVDLIIAGSDTSAATTEWAMVELLLHP 334

Query: 389 KTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHCPSETTIVS 448
             + K + EL +V+GE +IVEES I   PYL A +KE LR HP  PLL PH   E T VS
Sbjct: 335 DKMAKLKAELKSVLGEKSIVEESDISRLPYLLATVKEVLRYHPAAPLLAPHAAEEETQVS 394

Query: 449 NYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFRYFPFGSGRRNC 508
            Y IPK +++FIN W+I RDP+ W+NP  F+PERFL +E  DFGG  F   PFGSGRR C
Sbjct: 395 GYIIPKNTKMFINVWSITRDPSIWKNPESFEPERFLDSE-IDFGGQHFELIPFGSGRRIC 454

Query: 509 AGIAMAERMVMYLLATLLHSFDWKLEDG---KKMEVEEKFGIVLKMKSPL 549
            G+ +A RM+  ++ATL H+FDW+LE G   K+++ E+ FG+ L+ K PL
Sbjct: 455 PGMPLASRMLQCMVATLCHNFDWELEKGAESKQLQREDVFGLALQKKIPL 497

BLAST of CcUC01G003870 vs. ExPASy TrEMBL
Match: M1A0V4 (Cytochrome P450 OS=Solanum tuberosum OX=4113 PE=3 SV=1)

HSP 1 Score: 1085.5 bits (2806), Expect = 0.0e+00
Identity = 636/1445 (44.01%), Positives = 864/1445 (59.79%), Query Frame = 0

Query: 59   SSVDQSPFLFTF---FAALLIFLYV---KLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTY 118
            S++ Q    +++     ALL FL+    K  + + PLPPGP  +PL+GNL  LDP+LHTY
Sbjct: 20   SNIGQKGVFYSYVLGIVALLWFLWFFINKSKKGQPPLPPGPKALPLIGNLHSLDPELHTY 79

Query: 119  FAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDI 178
            FA L + YGPI +L LG ++GII+ S ++AREVLKD D  FANRDVP AGR A+YGG+DI
Sbjct: 80   FASLSQTYGPICRLWLGKKLGIIITSPALAREVLKDQDTIFANRDVPAAGREATYGGTDI 139

Query: 179  VWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFL 238
            VWTPYGP+WRMLRKVCV +MLS +TLDSVY LRRRE+R T+ + Y Q G  VN+GEQ FL
Sbjct: 140  VWTPYGPKWRMLRKVCVREMLSGSTLDSVYALRRRELRQTINYFYSQAGLPVNIGEQMFL 199

Query: 239  TVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEK 298
            TV NV+TSMLWGG+V+GE+R  L AEFR  ++++TELLG PN+SDF+P LARFDLQG+ K
Sbjct: 200  TVLNVITSMLWGGTVKGEERASLGAEFRHVVTKMTELLGTPNLSDFYPGLARFDLQGVTK 259

Query: 299  KMREIAPRFDNIFDKMIDERLKI--GGGDDDGSVKKN-DFLQFLLEVNDEGESKTPLTMT 358
            KM+ +A RFD IF+ MID+R +I   GG + G  ++N DFLQ LL++ D+  +K PLTM 
Sbjct: 260  KMKVLAKRFDKIFESMIDKRQEIDRNGGMETGVGQENKDFLQVLLKLKDDEAAKMPLTMP 319

Query: 359  HLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSF 418
             LKALLMDMVVGGTDT+SNT+EFA+AE++  P  L+K Q+E+  VVG+DNIVEESHI   
Sbjct: 320  ELKALLMDMVVGGTDTTSNTVEFAMAEIMNKPDVLRKLQQEIDTVVGKDNIVEESHIQHL 379

Query: 419  PYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPL 478
            PY  AVMKE LRLHP LPLLVPHCPSET+ V  Y +PKGSRVFIN WAIQRDP+ WENP 
Sbjct: 380  PYFYAVMKEVLRLHPALPLLVPHCPSETSTVGGYTVPKGSRVFINVWAIQRDPSIWENPT 439

Query: 479  VFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDG 538
             F PERF +  KWD+ G+D  YFPFGSGRR CAGIAMAERM MY LA+L+HSFDWK  +G
Sbjct: 440  EFHPERFFE-NKWDYSGNDLNYFPFGSGRRICAGIAMAERMFMYSLASLIHSFDWKFPEG 499

Query: 539  KKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLL 598
            + +++ EKFGIVLK K PLV           +    L N T     L+S    P     L
Sbjct: 500  ETLDLTEKFGIVLKKKMPLV----------AIPTPRLSNPT-----LNSNKGLPPGPKPL 559

Query: 599  HLIDSLNSFFP-----WRSIDQS-GFLFIFFATLLIFLYVFQP--TREVLKDHDVTLANR 658
             LI +L S  P     + S+ Q+ G +   +    + + +  P   RE+LKD D   ANR
Sbjct: 560  PLIGNLLSLDPQLHTYFASLSQTYGPICRLWLGKKLGIIITSPALAREILKDQDTIFANR 619

Query: 659  DVPKAGRVASYGGCNIVWT---------RMADVDETMLVWTLFMSSVVQRLEI--LLAHL 718
            DV  AGR  +YGG +IVWT         R   V E +   TL     ++R E+   + + 
Sbjct: 620  DVSVAGREVTYGGTDIVWTPYGPKWRMLRKVCVQEMLSASTLDSLYALRRRELRQSINYF 679

Query: 719  YQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVEGEQRYSVAAE------------------ 778
            Y QAGSPVN+GE  FL    V+TSMLWGG+V+GE+R S+ AE                  
Sbjct: 680  YNQAGSPVNIGEQMFLTALNVITSMLWGGTVKGEERASLGAEVRDVVTKMNELLVTPNLS 739

Query: 779  --------FDLQGIEKQMRELASRFDNIFEKMIDQG-------------------LKIDD 838
                    FDLQG++K+M+ LA R+DNIFE MIDQ                    LK+ D
Sbjct: 740  NFYPGLAWFDLQGVKKKMKVLAKRYDNIFESMIDQRQQMNRNGVGQESKDFLQVLLKLKD 799

Query: 839  GEEDESETSSLRSSLVLSNNRLLFSSHSASTSAIAVP--------FKSSKSRQHTPISKD 898
              + +   + +    +L++  +  +  SA+T   A+          +  +    T + KD
Sbjct: 800  EADPKMPLTMIEIKALLTDMVVGGTETSANTVEFAMTEILNKPDVLRKLQQEVDTIVGKD 859

Query: 899  RNLQALNIIIQLHFEKNLEKKLAIRSPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALL 958
                  NI+ + H ++ L    A+     R       LH + PL          +P    
Sbjct: 860  ------NIVEESHIQQ-LPYFYAVMKEVFR-------LHPALPL---------LVPHCPS 919

Query: 959  SISDLSFYVSVAQTVRYINGFKYQRHFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSL 1018
              S +  Y     +  ++N +  QR  S                    +W  P    P  
Sbjct: 920  ETSTVGGYTVPKGSSVFVNVWAIQRDPS--------------------IWENPTEFHPE- 979

Query: 1019 PPGPCGLPLVGYLPFLSGNLHQTFADLAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKK 1078
                          F+      +  DL       F    G ++C+           A + 
Sbjct: 980  -------------RFMEKEWDFSGNDLT-----YFPFGSGRRICV-----------ACEV 1039

Query: 1079 PFHQETVFANRDSTVSSLLATYGGADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLR 1138
              +Q+T+FANRD   +   ++YGG DIV++     W+ LRK+  R+MLS S LD  Y  R
Sbjct: 1040 LKNQDTIFANRDVPAACKESSYGGKDIVWTPYGPKWRMLRKVCVREMLSGSTLDSVYAPR 1099

Query: 1139 RQEVRKVIKGVLESAGTPIDIGKLGFFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVV 1198
            ++E+R+ I      AG+P++IG+  F      + +M WGG+  + G + + L  +FR VV
Sbjct: 1100 KRELRQSINYFYNQAGSPVNIGEQMFLTVLNVITSMLWGGT--VKGEERSRLGAEFRHVV 1159

Query: 1199 DEMMVLIASPNLSDLFPVLGRFDLQGIARKMKKVMNVCDEILNSAIEEQRKMGGN----- 1258
             +M  L+ +PN+SD +P L RFDLQG+ +KM  +    D+I  S I++++K+  N     
Sbjct: 1160 ADMTELLGTPNISDFYPGLARFDLQGVTKKMNVLEKRFDKIFESMIDQRQKIDRNAEMGT 1219

Query: 1259 --GVERRGFLQFLLKVRDGEDRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQ 1318
              G E + FLQ LLK++D  D    +T  +LKALLMD+++GGTD+TS  +E+A+ E++ +
Sbjct: 1220 GVGQESKDFLQVLLKLKDESDAKMPLTMIELKALLMDMVVGGTDTTSNAVEFAMAEIMNK 1279

Query: 1319 PNIMMKVMEELTKVVGLNQMVEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSIL 1378
            P ++ K+ +EL  VVG + +VEE+H+ +L YL AV+KE LR+HP L LLVP   ++T  +
Sbjct: 1280 PYVLRKLQQELETVVGKDNIVEEYHIQQLPYLYAVMKEALRIHPTLPLLVPHCPSETCTV 1339

Query: 1379 GGYTIPKGSTIYFNMWAIQRDPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSG 1416
            GGYT+PKGS ++ N+WAIQRDP +W NP  F PERFL+      +D++GN   + PFGSG
Sbjct: 1340 GGYTVPKGSRVFINVWAIQRDPSIWKNPTEFHPERFLDSK----WDYSGNDFNYFPFGSG 1369

BLAST of CcUC01G003870 vs. ExPASy TrEMBL
Match: A0A7J6DVG2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_019722 PE=3 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 1.2e-285
Identity = 584/1363 (42.85%), Positives = 748/1363 (54.88%), Query Frame = 0

Query: 68   FTFFAA---LLIFLYVKL--TRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGP 127
            FT  AA   LL+F+Y+KL   R   PLPPGP G+PLLGNL  LDP+LHTYF  L + +GP
Sbjct: 23   FTLSAAAVVLLLFIYLKLRSNRASPPLPPGPRGLPLLGNLLSLDPELHTYFTNLAQTHGP 82

Query: 128  IVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWR 187
            I+KLQLGN++GI++ S S+AREVLKD+DV FANRDVP AGR  +YGG DI WTPYGPEWR
Sbjct: 83   ILKLQLGNKVGIVITSPSLAREVLKDNDVVFANRDVPVAGRLITYGGYDITWTPYGPEWR 142

Query: 188  MLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSML 247
            MLRKVCVLKMLSN TLDSVYELRRREVR TV + Y + GS V VGEQ FLT+ NV+T+ML
Sbjct: 143  MLRKVCVLKMLSNTTLDSVYELRRREVRKTVGYFYSRVGSPVGVGEQMFLTILNVITNML 202

Query: 248  WGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFD 307
            WGGSVEGE+RD L +EFR+ ISEITELLGKPN+SDF+P LARFDLQG+ K+M ++A RFD
Sbjct: 203  WGGSVEGEERDKLGSEFRQIISEITELLGKPNVSDFYPGLARFDLQGVGKQMTKLALRFD 262

Query: 308  NIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGG 367
             +F+ +I +RLK       G  ++ DFLQ+LLE+ +E +SKTPLT+  +KALL DM+VGG
Sbjct: 263  MMFENLIAQRLK-------GVSERRDFLQYLLELKEEVDSKTPLTLIQIKALLTDMIVGG 322

Query: 368  TDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRL 427
            +DTSSNTIEFA+AE++  P+ LKKAQ+EL AV+G+DNIVEES IH  PYL+AVMKETLRL
Sbjct: 323  SDTSSNTIEFAMAEIMNQPEILKKAQKELEAVIGKDNIVEESDIHKLPYLQAVMKETLRL 382

Query: 428  HPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKW 487
            HP+LPLLVPHCPSE+  V  Y IPKGSR+FIN WA QRDP+ WENPL FDP+RFL   KW
Sbjct: 383  HPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQRDPSIWENPLKFDPDRFLNDSKW 442

Query: 488  DFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVL 547
            DF GSDF Y PFGSGRR CAGIAMAER V+Y LATLLHSF+W+L  G+KM++EEKFGIVL
Sbjct: 443  DFSGSDFNYIPFGSGRRICAGIAMAERTVVYSLATLLHSFNWELPRGEKMDLEEKFGIVL 502

Query: 548  KMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWR 607
            K K PLV +                               PT                 R
Sbjct: 503  KKKIPLVAI-------------------------------PTP----------------R 562

Query: 608  SIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRM 667
              D S    ++  TLL FL                                         
Sbjct: 563  FSDPS----LYDTTLLNFL----------------------------------------K 622

Query: 668  ADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVE 727
             +VD T + +TL  ++VV                        FL++              
Sbjct: 623  HNVDLTRVCFTLSAAAVV------------------------FLLLI------------- 682

Query: 728  GEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVL 787
                                                                        
Sbjct: 683  ------------------------------------------------------------ 742

Query: 788  SNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIR 847
                                                                        
Sbjct: 743  ------------------------------------------------------------ 802

Query: 848  SPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQR 907
                                                                        
Sbjct: 803  ------------------------------------------------------------ 862

Query: 908  HFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFA 967
            H     K++S                      P LPPGP GLPL+G L  L   LH  F 
Sbjct: 863  HLKLRGKRVS----------------------PPLPPGPRGLPLLGNLLSLDPELHSYFR 922

Query: 968  DLAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYGGA 1027
            DLAQ +GPI KL+LG K+ IV++SPS     A +     + VFANRD  V+  +ATYGG 
Sbjct: 923  DLAQTHGPILKLQLGNKVGIVITSPS----LAREVLRENDVVFANRDVPVAGRIATYGGY 982

Query: 1028 DIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGKLG 1087
            DIV++    +W+ LRK+   KMLS + LD  Y LRR+EVR+ +       G+P+++G+  
Sbjct: 983  DIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSRVGSPVNVGEQM 1038

Query: 1088 FFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFDLQ 1147
            F      +  M WGG+  + G +   L  +FR++V EM  L+  PNLSD +P L RFDLQ
Sbjct: 1043 FLTILNVITNMLWGGA--VEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFYPGLARFDLQ 1038

Query: 1148 GIARKMKKVMNVCDEILNS----------AIEEQRKMGGNGVERRGFLQFLLKVRDGEDR 1207
            G+ ++M K+    D +              I ++R   G G + R FLQ+LL++++  D 
Sbjct: 1103 GVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGG-QNRDFLQYLLELKEEVDS 1038

Query: 1208 SESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQMVE 1267
               +T   +KALLMD+++GG+D++S TIE+A+ E++ QP I+ +  +EL  V+G + +VE
Sbjct: 1163 KTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAVIGKDNIVE 1038

Query: 1268 EFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQRDP 1327
            E  + KL YL AV+KE LRLHP L LLVP   +++  +GGYTIPKGS I+ N+WA QRDP
Sbjct: 1223 ESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQRDP 1038

Query: 1328 KVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILASLL 1387
             +W+NPL F PERFLN S    +DF+G+   + PFGSG+++CAGI +AER+++  LA+LL
Sbjct: 1283 SIWENPLKFDPERFLNNSK---WDFSGSDFNYIPFGSGRRICAGIAMAERMVMYSLATLL 1038

Query: 1388 HAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
            H+F WEL  G K+DL EKFGIV KK  PLVAI  PR+ +  LY
Sbjct: 1343 HSFNWELPRGEKMDLAEKFGIVLKKKIPLVAIPTPRLLDRSLY 1038

BLAST of CcUC01G003870 vs. ExPASy TrEMBL
Match: A0A7J6EPW8 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_001909 PE=3 SV=1)

HSP 1 Score: 986.9 bits (2550), Expect = 8.9e-284
Identity = 586/1365 (42.93%), Positives = 750/1365 (54.95%), Query Frame = 0

Query: 68   FTFFAA---LLIFLYVKL--TRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGP 127
            FT  AA   LL+F+Y+KL   R   PLPPGP G+PLLGNL  LDP+LHTYF  L + +GP
Sbjct: 23   FTLSAAAVVLLLFIYLKLRSNRASPPLPPGPRGLPLLGNLLSLDPELHTYFTNLAQTHGP 82

Query: 128  IVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWR 187
            I+KLQLGN++GI++ S S+AREVLKD+DV FANRDVP AGR  +YGG DI WTPYGPEWR
Sbjct: 83   ILKLQLGNKVGIVITSPSLAREVLKDNDVVFANRDVPVAGRLITYGGYDITWTPYGPEWR 142

Query: 188  MLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSML 247
            MLRKVCVLKMLSN TLDSVYELRRREVR TV + Y + GS V VGEQ FLT+ NV+T+ML
Sbjct: 143  MLRKVCVLKMLSNTTLDSVYELRRREVRKTVGYFYSRVGSPVGVGEQMFLTILNVITNML 202

Query: 248  WGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFD 307
            WGGSVEGE+RD L +EFR+ ISEITELLGKPN+SDF+P LARFDLQG+ K+M ++A RFD
Sbjct: 203  WGGSVEGEERDKLGSEFRQIISEITELLGKPNVSDFYPGLARFDLQGVGKQMTKLALRFD 262

Query: 308  NIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGG 367
             +F+ +I +RLK       G  ++ DFLQ+LLE+ +E +SKTPLT+  +KALL DM+VGG
Sbjct: 263  MMFENLIAQRLK-------GVSERRDFLQYLLELKEEVDSKTPLTIIQIKALLTDMIVGG 322

Query: 368  TDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRL 427
            +DTSSNTIEFA+AE++  P+ LKKAQ+EL AV+G+DNIVEES I   PYL+AVMKETLRL
Sbjct: 323  SDTSSNTIEFAMAEIMNQPEILKKAQKELEAVIGKDNIVEESDIQKLPYLQAVMKETLRL 382

Query: 428  HPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKW 487
            HP+LPLLVPHCPSE+ IV  Y IPKGSR+FIN WA QRDP+ WENPL FDP+RFL   KW
Sbjct: 383  HPVLPLLVPHCPSESCIVGGYTIPKGSRIFINVWATQRDPSIWENPLKFDPDRFLNDSKW 442

Query: 488  DFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVL 547
            DF GSDF Y PFGSGRR CAGIAMAER V+Y LATLLHSF+W+L  G+KM++EEKFGIVL
Sbjct: 443  DFSGSDFNYIPFGSGRRICAGIAMAERTVVYSLATLLHSFNWELPHGEKMDLEEKFGIVL 502

Query: 548  KMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWR 607
            K K PLV +                               PT                 R
Sbjct: 503  KKKIPLVAI-------------------------------PTP----------------R 562

Query: 608  SIDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRM 667
              D S    ++  TLL F           KD                             
Sbjct: 563  FSDPS----LYDTTLLNF----------FKD----------------------------- 622

Query: 668  ADVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVE 727
             +VD T + +TL  ++VV                        FL++              
Sbjct: 623  -NVDLTRVCFTLSAAAVV------------------------FLLL-------------- 682

Query: 728  GEQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVL 787
                                                                        
Sbjct: 683  ------------------------------------------------------------ 742

Query: 788  SNNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIR 847
                                                                        
Sbjct: 743  ------------------------------------------------------------ 802

Query: 848  SPKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQR 907
                        +HL                                             
Sbjct: 803  ------------IHLK-------------------------------------------- 862

Query: 908  HFSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFA 967
                                     LR +   P LPPGP GLPL+G L  L   LH  F 
Sbjct: 863  -------------------------LRGKRASPPLPPGPRGLPLLGNLLSLDPELHSYFR 922

Query: 968  DLA--QIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYG 1027
            DLA  Q +GPI KL+LG K+ IV++SPS     A +     + VFANRD  V+  +ATYG
Sbjct: 923  DLAQSQTHGPILKLQLGNKVGIVITSPS----LAREVLRENDVVFANRDVPVAGRIATYG 982

Query: 1028 GADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGK 1087
            G DIV++    +W+ LRK+   KMLS + LD  Y LRR+EVR+ +       G+P+++G+
Sbjct: 983  GYDIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSRVGSPVNVGE 1040

Query: 1088 LGFFAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFD 1147
              F      +  M WGG+  + G +   L  +FR++V EM  L+  PNLSD +P L RFD
Sbjct: 1043 QMFLTILNVITNMLWGGA--VEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFYPGLARFD 1040

Query: 1148 LQGIARKMKKVMNVCDEILNS----------AIEEQRKMGGNGVERRGFLQFLLKVRDGE 1207
            LQG+ ++M K+    D +              I ++R   G G + R FLQ+LL++++  
Sbjct: 1103 LQGVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGG-QNRDFLQYLLELKEEV 1040

Query: 1208 DRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQM 1267
            D    +T   +KALLMD+++GG+D++S TIE+A+ E++ QP I+ +  +EL  V+G + +
Sbjct: 1163 DSKTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAVIGKDNI 1040

Query: 1268 VEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQR 1327
            VEE  + KL YL AV+KE LRLHP L LLVP   +++  +GGYTIPKGS I+ N+WA QR
Sbjct: 1223 VEESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFINVWATQR 1040

Query: 1328 DPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILAS 1387
            DP +W+NPL F PERFLN S    +DF+G+   + PFGSG+++CAGI +AER+++  LA+
Sbjct: 1283 DPSIWENPLKFDPERFLNNSK---WDFSGSDFNYIPFGSGRRICAGIAMAERMVMYSLAT 1040

Query: 1388 LLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
            LLH+F WEL  G K+DL EKFGIV KK  PLVAI  PR+ +  LY
Sbjct: 1343 LLHSFNWELPRGEKMDLAEKFGIVLKKKIPLVAIPTPRLLDRSLY 1040

BLAST of CcUC01G003870 vs. ExPASy TrEMBL
Match: A0A7J6I924 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_014509 PE=3 SV=1)

HSP 1 Score: 975.7 bits (2521), Expect = 2.1e-280
Identity = 580/1387 (41.82%), Positives = 768/1387 (55.37%), Query Frame = 0

Query: 45   PTSNSPLLLINSLNSSVDQSPFLFTFFAALLIFLYVKLTRLR-----VPLPPGPWGVPLL 104
            PT  S  LL N L  +VD +   FT  AA ++FL +   +LR      PLPPGP G+PLL
Sbjct: 3    PTITSTTLL-NFLKDNVDLTRVCFTLSAAAVVFLLLIHLKLRGKRASPPLPPGPRGLPLL 62

Query: 105  GNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVP 164
            GNL  LDP+LH+YF +L + +GPI+KLQLGN++GI++ S S+AREVL+++DV FANRDVP
Sbjct: 63   GNLLSLDPELHSYFRDLAQTHGPILKLQLGNKVGIVITSPSLAREVLRENDVVFANRDVP 122

Query: 165  QAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQ 224
             AGR A+YGG DIVWTPYGPEWRMLRKVCVLKMLSN TLDSVYELRRREVR TV + Y +
Sbjct: 123  VAGRIATYGGYDIVWTPYGPEWRMLRKVCVLKMLSNTTLDSVYELRRREVRQTVGYCYSR 182

Query: 225  CGSAVNVGEQGFLTVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFF 284
             GS VNVGEQ FLT+ NV+T+MLWGG+VEG++RD L AEFR+ +SE+T+LLGKPN+SDF+
Sbjct: 183  VGSPVNVGEQMFLTILNVITNMLWGGAVEGDERDSLGAEFRQIVSEMTDLLGKPNLSDFY 242

Query: 285  PSLARFDLQGIEKKMREIAPRFDNIFDKMIDER----LKI-GGGDDDGSVKKNDFLQFLL 344
            P LARFDLQG+ K+M ++A RFD +F+K+I +R    LKI    D  G  +  DFLQ+LL
Sbjct: 243  PGLARFDLQGVGKQMTKLALRFDMMFEKLIAQRSSSSLKIRDERDSSGGGQNRDFLQYLL 302

Query: 345  EVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAV 404
            E+ +E +SKTPLT+TH+KALLMDMVVGG+DTSSNTIEFA+AE++  P+ LK+AQ+EL AV
Sbjct: 303  ELKEEVDSKTPLTITHVKALLMDMVVGGSDTSSNTIEFAMAEIMNQPEILKRAQQELEAV 362

Query: 405  VGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFIN 464
            +G+DNIVEES IH  PYL+AVMKETLRLHP+LPLLVPHCPSE+  V  Y IPKGSR+FIN
Sbjct: 363  IGKDNIVEESDIHKLPYLQAVMKETLRLHPVLPLLVPHCPSESCTVGGYTIPKGSRIFIN 422

Query: 465  AWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYL 524
             WA QRDP+ WENPL FDPERFL   KWDF GSDF Y PFGSGRR CAGIAMAERMVMY 
Sbjct: 423  VWATQRDPSIWENPLKFDPERFLNNSKWDFSGSDFNYIPFGSGRRICAGIAMAERMVMYS 482

Query: 525  LATLLHSFDWKLEDGKKMEVEEKFGIVLKMKSPLVNLLFSFSFFLFVLLQHLQNFTKLTF 584
            LATLLHSF+W+L  G+KM++ EKFGIVLK K PLV +                       
Sbjct: 483  LATLLHSFNWELPRGEKMDLAEKFGIVLKKKIPLVAI----------------------- 542

Query: 585  FLHSAMEQPTSNSLLHLIDSLNSFFPWRSIDQSGFLFIFFATLLIFLYVFQPTREVLKDH 644
                                                               PT  +   H
Sbjct: 543  ---------------------------------------------------PTPSIYTSH 602

Query: 645  DVTLANRDVPKAGRVASYGGCNIVWTRMADVDETMLVWTLFMSSVVQRLEILLAHLYQQA 704
                                                                        
Sbjct: 603  ------------------------------------------------------------ 662

Query: 705  GSPVNVGEHGFLMIFKVVTSMLWGGSVEGEQRYSVAAEFDLQGIEKQMRELASRFDNIFE 764
               + + +H       ++T++                                       
Sbjct: 663  ---LRIPKH------TMITTL--------------------------------------- 722

Query: 765  KMIDQGLKIDDGEEDESETSSLRSSLVLSNNRLLFSSHSASTSAIAVPFKSSKSRQHTPI 824
                            +ETS+L S         LF + SA                    
Sbjct: 723  ----------------AETSNLTS---------LFLTLSA-------------------- 782

Query: 825  SKDRNLQALNIIIQLHFEKNLEKKLAIRSPKERAPQAFSALHLSGPLSSRLKYRHCWIPI 884
                      III L                                             
Sbjct: 783  ----------IIITL--------------------------------------------- 842

Query: 885  ALLSISDLSFYVSVAQTVRYINGFKYQRHFSNEDKQISTALLFIFVFVVALLWLRPEFRR 944
                             + YI   K  ++ S                             
Sbjct: 843  ----------------IIWYIKSKKLNQYSS----------------------------- 902

Query: 945  PSLPPGPCGLPLVGYLPFLSGNLHQTFADLAQIYGPIFKLRLGTKLCIVLSSPSSINPSA 1004
              LPPGP G P+VG L  L  +LH  F  LA  YGPI KLRLG+KL I+++SP+  +   
Sbjct: 903  -PLPPGPRGFPVVGSLLTLKPDLHSYFKSLAHTYGPILKLRLGSKLKIIITSPTLAHQVL 962

Query: 1005 TKKPFHQETVFANRDSTVSSLLATYGGADIVFSQDDGDWKKLRKIFSRKMLSKSNLDESY 1064
             +     + VFANRD    +  + YG  +IV++    +W+ LRK+   KMLS ++LD   
Sbjct: 963  KE----NDIVFANRDVLSITRSSVYGITNIVWTPYGPEWRMLRKVCVLKMLSNASLDSVC 1022

Query: 1065 PLRRQEVRKVIKGVLESA---GTPIDIGKLGFFAAAKSVMAMTWGGSGGMIGVDGAELED 1124
             +RR+ VR+ +  +        +P+++G+  FF     +M M WGG+  +   +GA +  
Sbjct: 1023 SVRRRVVRQGVNQLYNRVVLDQSPVEVGEHVFFTILNLIMNMLWGGTVDV--EEGASVGA 1050

Query: 1125 KFREVVDEMMVLIASPNLSDLFPVLGRFDLQGIARKMKKVMNVCDEILNSAIEEQRKM-- 1184
            + RE+++ +  L++ PN+SD FPVL  FDLQGI++K ++++   D I+N  I  + K+  
Sbjct: 1083 EIREIINGISELMSKPNVSDFFPVLAWFDLQGISKKNRQLIQSFDRIVNRMISTRMKIEE 1050

Query: 1185 -GGNGVERRGFLQFLLKVRDGEDRSESITDNQLKALLMDIIIGGTDSTSTTIEWAITELI 1244
             GGNG     FLQFLL+++D ED    +T   +K+LL+D++ GGTD+++ TIE+A+ E+I
Sbjct: 1143 NGGNG--NNDFLQFLLRLKDEEDSKTPLTLTHVKSLLLDMVAGGTDTSTNTIEFAMAEII 1050

Query: 1245 QQPNIMMKVMEELTKVVGLNQMVEEFHLSKLFYLDAVIKEKLRLHPPLTLLVPRKSTQTS 1304
              P++M K  +EL  V+G + +VEE H+SKL YL AV+KE LRLHP L LL P   ++T 
Sbjct: 1203 NAPDVMNKAQKELEDVIGKDNIVEESHISKLPYLQAVMKETLRLHPSLPLLAPHSPSETC 1050

Query: 1305 ILGGYTIPKGSTIYFNMWAIQRDPKVWDNPLNFMPERFLNESNGEVYDFTGNSIEFCPFG 1364
            ++GGYT+PKG  +  N+WAI RDP  W++PL F PERFL +S    +DFTG   ++ PFG
Sbjct: 1263 VVGGYTVPKGCGVIINVWAIHRDPFNWEDPLKFDPERFLKDS--PKWDFTGTDFKYFPFG 1050

Query: 1365 SGKKLCAGIPLAERLLVLILASLLHAFEWELLEGSKLDLEEKFGIVTKKFNPLVAILMPR 1416
            SG++ CAGI +AER+++  LA+LLH+F+W++  G +LDL EKFG+V K   PLVAI  PR
Sbjct: 1323 SGRRNCAGIAMAERMVMYSLATLLHSFDWKIPHGQQLDLSEKFGLVMKMKIPLVAIPTPR 1050

BLAST of CcUC01G003870 vs. ExPASy TrEMBL
Match: A0A6N2KQQ4 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS124803 PE=3 SV=1)

HSP 1 Score: 952.2 bits (2460), Expect = 2.4e-273
Identity = 563/1353 (41.61%), Positives = 741/1353 (54.77%), Query Frame = 0

Query: 66   FLFTFFAALLIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVK 125
            F  T+FA    +  VK  +    LPPGP G+PL+GNLP L PDLHTYFA L R YGPI+K
Sbjct: 27   FAITWFA----WTRVKSKKGSSSLPPGPRGLPLIGNLPSLGPDLHTYFAGLARTYGPILK 86

Query: 126  LQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLR 185
            LQLG+++ IIV+S  +AREVLKDHDVTFANRDVP   R A+YGG DI W+PYGPEWRMLR
Sbjct: 87   LQLGSKLAIIVSSPDLAREVLKDHDVTFANRDVPAVARIATYGGLDIAWSPYGPEWRMLR 146

Query: 186  KVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQCGSAVNVGEQGFLTVFNVVTSMLWGG 245
            KVCVLKMLSN+TLDSVY LRRREV+NTVA++YR+ G  +NV EQ FLT+ NV+TSMLWGG
Sbjct: 147  KVCVLKMLSNSTLDSVYGLRRREVQNTVAYIYRKAGLPINVEEQTFLTILNVITSMLWGG 206

Query: 246  SVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIF 305
            +V+GE+R  L AEFR  ++++TELLG PNISDFFP+LARFDLQG+ +KM  +APRFD IF
Sbjct: 207  TVQGEERGRLGAEFRRVVADMTELLGAPNISDFFPALARFDLQGLARKMSGLAPRFDQIF 266

Query: 306  DKMIDERLKIGGGDD--DGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGT 365
            D+MI+++L     D+  D S +  DFLQFLL V DEG++KTPLTMTH+KALLMDMVVGGT
Sbjct: 267  DRMIEKQLNF---DELGDSSRECKDFLQFLLRVKDEGDAKTPLTMTHIKALLMDMVVGGT 326

Query: 366  DTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLH 425
            D++SN IEFA+AE++  P+ ++KAQ+EL  VVG+DNIVEESHI+   Y+ A+MKETLRLH
Sbjct: 327  DSTSNAIEFAIAEVMNKPEVMRKAQDELDNVVGKDNIVEESHIYKLQYVHAIMKETLRLH 386

Query: 426  PILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWD 485
            P++P+L+PHCPS T  +  Y++PKGSR+FIN WA+ RDP+ WENP+ F PE      K+D
Sbjct: 387  PVVPMLIPHCPSGTCTIGGYSVPKGSRIFINVWAVHRDPSIWENPMEFKPE-----SKFD 446

Query: 486  FGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVLK 545
            + GS+                                          K+++ EKFGIVLK
Sbjct: 447  YSGSE------------------------------------------KVDLTEKFGIVLK 506

Query: 546  MKSPLVNLLFSFSFFLFVLLQHLQNFTKLTFFLHSAMEQPTSNSLLHLIDSLNSFFPWRS 605
            +K+PLV                                                      
Sbjct: 507  LKNPLV------------------------------------------------------ 566

Query: 606  IDQSGFLFIFFATLLIFLYVFQPTREVLKDHDVTLANRDVPKAGRVASYGGCNIVWTRMA 665
                       AT         PT                P+    A Y           
Sbjct: 567  -----------AT---------PT----------------PRLSNPALY----------- 626

Query: 666  DVDETMLVWTLFMSSVVQRLEILLAHLYQQAGSPVNVGEHGFLMIFKVVTSMLWGGSVEG 725
                T  V T+  S +      LL  L  +    +N    G       + S  W  S E 
Sbjct: 627  ---RTPYVSTIHHSRIPNNNVSLLHFLAPENTEMLNTVVAG-------LWSRWWDASNER 686

Query: 726  EQRYSVAAEFDLQGIEKQMRELASRFDNIFEKMIDQGLKIDDGEEDESETSSLRSSLVLS 785
            E+ +                                                 R+ L++ 
Sbjct: 687  EKLF-------------------------------------------------RTVLIM- 746

Query: 786  NNRLLFSSHSASTSAIAVPFKSSKSRQHTPISKDRNLQALNIIIQLHFEKNLEKKLAIRS 845
                          A+A+                       I +      N++ K A+ +
Sbjct: 747  --------------AVAM-----------------------ITVFWFLWNNIKPKKAVAA 806

Query: 846  PKERAPQAFSALHLSGPLSSRLKYRHCWIPIALLSISDLSFYVSVAQTVRYINGFKYQRH 905
            P                                                           
Sbjct: 807  PS---------------------------------------------------------- 866

Query: 906  FSNEDKQISTALLFIFVFVVALLWLRPEFRRPSLPPGPCGLPLVGYLPFLSGNLHQTFAD 965
                                              PPGP GLPLVGYLPFL  +LH+ F +
Sbjct: 867  ----------------------------------PPGPRGLPLVGYLPFLGYDLHKKFTE 926

Query: 966  LAQIYGPIFKLRLGTKLCIVLSSPSSINPSATKKPFHQETVFANRDSTVSSLLATYGGAD 1025
            LA +YGPI+KLRLG KLC+V+SSP    P A +    ++T+FANRD   S+L+ TYGG D
Sbjct: 927  LAGVYGPIYKLRLGNKLCVVVSSP----PLAKEIVRDKDTIFANRDPPTSALVFTYGGND 986

Query: 1026 IVFSQDDGDWKKLRKIFSRKMLSKSNLDESYPLRRQEVRKVIKGVLESAGTPIDIGKLGF 1085
            I +S     W K+RKIF R+MLS ++LD SY LR+QEV+K I+ V    G+P+D G+L +
Sbjct: 987  IAWSSYGPPWTKMRKIFVREMLSNASLDASYELRKQEVKKAIRDVYNKIGSPVDFGELAY 1030

Query: 1086 FAAAKSVMAMTWGGSGGMIGVDGAELEDKFREVVDEMMVLIASPNLSDLFPVLGRFDLQG 1145
              +  +V+ +  GG G + G    +   +FR    EMMVL+  PN+SDLFPVL RFDLQG
Sbjct: 1047 VTSINAVLRILLGG-GTIQGEKWTDFVAQFRSHAAEMMVLLGKPNVSDLFPVLARFDLQG 1030

Query: 1146 IARKMKKVMNVCDEILNSAIEEQRKMGGNGV-ERRGFLQFLLKVRDGEDRSESITDNQLK 1205
            I +K K++    D+ L  AIE++       + +R+ FLQ LL +   ED + SIT +Q+K
Sbjct: 1107 IEKKAKRLAVTIDQFLQYAIEQRLNEEKTHMDDRKDFLQILLDLSKHEDPATSITMDQVK 1030

Query: 1206 ALLMDIIIGGTDSTSTTIEWAITELIQQPNIMMKVMEELTKVVGLNQMVEEFHLSKLFYL 1265
            A+LMDI +GGTD+T+T IEW +  L+Q   +  KV +EL +VVG N +VEEFHL KL YL
Sbjct: 1167 AILMDIFLGGTDTTTTMIEWTMARLMQHQEVRQKVYQELQEVVGSNNVVEEFHLPKLRYL 1030

Query: 1266 DAVIKEKLRLHPPLTLLVPRKSTQTSILGGYTIPKGSTIYFNMWAIQRDPKVWDNPLNFM 1325
             AVIKE  RLHP L LLVPR S Q+ ++GGY +PKG+T+  N++AI RDP +WDNPL F 
Sbjct: 1227 SAVIKETFRLHPALPLLVPRFSRQSCMVGGYIVPKGTTVLLNVYAIHRDPDLWDNPLEFR 1030

Query: 1326 PERFLNESNGEVYDFTGNSIEFCPFGSGKKLCAGIPLAERLLVLILASLLHAFEWELLEG 1385
            PERFLN      +D++GN+ ++ PFGSG+++CAGIPLAE++L+L+ ASLLH+FEW+L  G
Sbjct: 1287 PERFLNGDTAGSFDYSGNNFQYLPFGSGRRVCAGIPLAEKMLMLLQASLLHSFEWKLPAG 1030

Query: 1386 SKLDLEEKFGIVTKKFNPLVAILMPRISNLELY 1416
              L+L +++GIV KK  PL+ I  PR+ NLELY
Sbjct: 1347 GVLELSDRYGIVIKKMEPLMVIPSPRLCNLELY 1030

BLAST of CcUC01G003870 vs. TAIR 10
Match: AT4G12300.1 (cytochrome P450, family 706, subfamily A, polypeptide 4)

HSP 1 Score: 608.2 bits (1567), Expect = 1.7e-173
Identity = 292/498 (58.63%), Positives = 377/498 (75.70%), Query Frame = 0

Query: 58  NSSVDQSPFLFTFFAALLIFLYVKLTRLRVP-LPPGPWGVPLLGNLPFLDPDLHTYFAEL 117
           +++++ +P+       +   L+    R   P LPPGP G+P++GNLPFLDPDLHTYFA L
Sbjct: 10  DNTINLTPYAIVILTTVFSILWYIFKRSPQPSLPPGPRGLPIVGNLPFLDPDLHTYFANL 69

Query: 118 GRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTP 177
            + +GPI KL LG+++ I+VNS S+ARE+LKD D+ F+NRDVP  GRAA+YGG DIVWTP
Sbjct: 70  AQSHGPIFKLNLGSKLTIVVNSPSLAREILKDQDINFSNRDVPLTGRAATYGGIDIVWTP 129

Query: 178 YGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQ--CGSAVNVGEQGFLTV 237
           YG EWR LRK+CVLK+LS  TLDS YELRR+EVR    +LY Q    S V VG+Q FLT+
Sbjct: 130 YGAEWRQLRKICVLKLLSRKTLDSFYELRRKEVRERTRYLYEQGRKQSPVKVGDQLFLTM 189

Query: 238 FNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKM 297
            N+  +MLWGGSV+ E+ + +  EF+  ISEIT LL +PN+SDFFP LARFDLQG+ K+M
Sbjct: 190 MNLTMNMLWGGSVKAEEMESVGTEFKGVISEITRLLSEPNVSDFFPWLARFDLQGLVKRM 249

Query: 298 REIAPRFDNIFDKMIDERLKIGGGDDDGSVKKNDFLQFLLEVND-EGESKTPLTMTHLKA 357
              A   D + D+ I++   + G DDD   +  DFLQ+L+++ D EG+S+ P+T+ H+KA
Sbjct: 250 GVCARELDAVLDRAIEQMKPLRGRDDD---EVKDFLQYLMKLKDQEGDSEVPITINHVKA 309

Query: 358 LLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLK 417
           LL DMVVGGTDTS+NTIEFA+AE++ NP+ +K+AQEEL  VVG+DNIVEESHI   PY+ 
Sbjct: 310 LLTDMVVGGTDTSTNTIEFAMAELMSNPELIKRAQEELDEVVGKDNIVEESHITRLPYIL 369

Query: 418 AVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDP 477
           A+MKETLRLHP LPLLVPH P+E T+V  Y IPK +++F+N W+IQRDPN WENP  F P
Sbjct: 370 AIMKETLRLHPTLPLLVPHRPAENTVVGGYTIPKDTKIFVNVWSIQRDPNVWENPTEFRP 429

Query: 478 ERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKME 537
           ERFL     DF G+++ YFPFGSGRR CAG+A+AERMV+Y LATLLHSFDWK+ +G  ++
Sbjct: 430 ERFLDNNSCDFTGANYSYFPFGSGRRICAGVALAERMVLYTLATLLHSFDWKIPEGHVLD 489

Query: 538 VEEKFGIVLKMKSPLVNL 552
           ++EKFGIVLK+K PLV L
Sbjct: 490 LKEKFGIVLKLKIPLVAL 504

BLAST of CcUC01G003870 vs. TAIR 10
Match: AT4G12320.1 (cytochrome P450, family 706, subfamily A, polypeptide 6)

HSP 1 Score: 597.4 bits (1539), Expect = 2.9e-170
Identity = 287/498 (57.63%), Positives = 377/498 (75.70%), Query Frame = 0

Query: 59  SSVDQSPFLFTFFAALLIFLYVKLTRLRVP-LPPGPWGVPLLGNLPFLDPDLHTYFAELG 118
           ++++ +P+      A+   L+    R   P LPPGP G+P++GNLPFLDPDLHTYF +L 
Sbjct: 11  NTINLTPYAILILIAIFSILWYLFKRSPQPHLPPGPRGLPIVGNLPFLDPDLHTYFTKLA 70

Query: 119 RKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPY 178
             YGPI KL LG+++ ++VN+ S+ARE+LKD D+ F+N DVP   RA +YGG D+VW PY
Sbjct: 71  ESYGPIFKLNLGSKLTVVVNTPSLAREILKDQDINFSNHDVPLTARAVTYGGLDLVWLPY 130

Query: 179 GPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYR--QCGSAVNVGEQGFLTVF 238
           G EWRMLRKVCVLK+LS+ TL+S YELRR+E+R    +LY+  Q  S VNVGEQ FLT+ 
Sbjct: 131 GAEWRMLRKVCVLKLLSHRTLNSFYELRRKEIRERTRYLYQKGQEESPVNVGEQVFLTMM 190

Query: 239 NVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMR 298
           N+  +MLWGGSV+ E+ + +  EF+E ISEIT LLG+PN+SDFFP LARFDLQG+ KKM 
Sbjct: 191 NLTMNMLWGGSVKAEEMESVGTEFKEVISEITRLLGEPNVSDFFPRLARFDLQGLVKKMH 250

Query: 299 EIAPRFDNIFDKMIDERLKIGGGD-DDGSVKKNDFLQFLLEVND-EGESKTPLTMTHLKA 358
             A   D I D+ I++   +   D DDG  K  DFLQ L+++ D E +S+ P+T+ H+KA
Sbjct: 251 VCARELDAILDRAIEQMQLLRTRDGDDGECK--DFLQHLMKLKDQEADSEVPITVNHVKA 310

Query: 359 LLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLK 418
           +L+D+VVGGTDTS+NTIEFA+AE+I+ P+ +K+AQ+EL  VVG+DNI+EESHI   P++ 
Sbjct: 311 VLVDLVVGGTDTSTNTIEFAMAELIRKPELMKRAQQELDEVVGKDNIIEESHITRLPFIS 370

Query: 419 AVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDP 478
           A+MKETLRL+P +PLLVPH PSET +V  Y IPK +++FIN W+IQRDPN WE P  F P
Sbjct: 371 AIMKETLRLYPTIPLLVPHRPSETALVGGYTIPKNTKIFINVWSIQRDPNVWEYPTEFRP 430

Query: 479 ERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKME 538
           ERFL  +  DF G+D+ Y PFGSGRR CAGIA+AERM++Y LATLLHSFDWK+ +G  ++
Sbjct: 431 ERFLDKKSCDFTGTDYSYLPFGSGRRICAGIALAERMILYTLATLLHSFDWKIPEGHILD 490

Query: 539 VEEKFGIVLKMKSPLVNL 552
           ++EKFGIVLK+KSPLV L
Sbjct: 491 LKEKFGIVLKLKSPLVAL 506

BLAST of CcUC01G003870 vs. TAIR 10
Match: AT4G12310.1 (cytochrome P450, family 706, subfamily A, polypeptide 5)

HSP 1 Score: 586.3 bits (1510), Expect = 6.8e-167
Identity = 285/501 (56.89%), Positives = 370/501 (73.85%), Query Frame = 0

Query: 58  NSSVDQSPFLFTFF---AALLIFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFA 117
           ++++  +P+ +      A   I  Y+     + PLPPGP G+P++GNLPFLDPDLHTYF 
Sbjct: 10  DNAISLTPYAYAVLILTATFSILWYIFKRSPQPPLPPGPRGLPIVGNLPFLDPDLHTYFT 69

Query: 118 ELGRKYGPIVKLQLGNRIGIIVNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVW 177
           +L + +GPI KL LG+++ ++VNS S+A E+LKD D+ F+N DVP   RA +YGG D+VW
Sbjct: 70  KLAQSHGPIFKLNLGSKLTVVVNSPSLASEILKDQDINFSNHDVPLTARAVTYGGLDLVW 129

Query: 178 TPYGPEWRMLRKVCVLKMLSNATLDSVYELRRREVRNTVAHLYRQC--GSAVNVGEQGFL 237
            PYG EWRMLRKVC  K+ S  TLDS YELRR+E+R     LY++    S VNVGEQ FL
Sbjct: 130 LPYGAEWRMLRKVCAAKLFSRKTLDSFYELRRKEIRERTRCLYQKGLEKSPVNVGEQLFL 189

Query: 238 TVFNVVTSMLWGGSVEGEQRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEK 297
           T+ N++ +MLWGGSV+ E  + +  EF+  ISEIT LLG PN+SDFFP LARFDLQG+ K
Sbjct: 190 TMMNLMMNMLWGGSVKAEDMESVGTEFKGVISEITRLLGVPNVSDFFPMLARFDLQGLVK 249

Query: 298 KMREIAPRFDNIFDKMIDERLKIGGGD-DDGSVKKNDFLQFLLEVND-EGESKTPLTMTH 357
           KM   A   D I D+ I++  ++   D DDG  K  DFLQ L+++ D E +S  P+TM H
Sbjct: 250 KMHLYARDLDAILDRAIEQMQRLRSRDGDDGECK--DFLQHLMKLRDQEADSDVPITMNH 309

Query: 358 LKALLMDMVVGGTDTSSNTIEFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFP 417
           +KA+LMDMVVGGT++S+NTIEF +AE+I NP+ +++AQ+EL  VVG+DNIVEESHI S P
Sbjct: 310 VKAVLMDMVVGGTESSTNTIEFVMAELISNPELMRRAQQELDEVVGKDNIVEESHITSLP 369

Query: 418 YLKAVMKETLRLHPILPLLVPHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLV 477
           Y+ AV+KETLRL+P +PLLVPH PSET +V  Y IPK +++FIN W+IQRDPN WE P  
Sbjct: 370 YILAVLKETLRLYPTIPLLVPHRPSETALVGGYTIPKNTKIFINVWSIQRDPNVWEYPTE 429

Query: 478 FDPERFLKTEKWDFGGSDFRYFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGK 537
           F PERFL  +  DF G+D+ Y PFGSGRR CAGIA+AERM++Y LATLLHSFDW + DG 
Sbjct: 430 FRPERFLDKKSCDFTGTDYSYLPFGSGRRICAGIALAERMILYTLATLLHSFDWTIPDGH 489

Query: 538 KMEVEEKFGIVLKMKSPLVNL 552
            +++EEKFGIVLK+K+PLV L
Sbjct: 490 VLDLEEKFGIVLKLKTPLVAL 508

BLAST of CcUC01G003870 vs. TAIR 10
Match: AT5G44620.1 (cytochrome P450, family 706, subfamily A, polypeptide 3)

HSP 1 Score: 579.3 bits (1492), Expect = 8.3e-165
Identity = 279/476 (58.61%), Positives = 361/476 (75.84%), Query Frame = 0

Query: 76  IFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGII 135
           ++LY K  R   PLPPGPWG+P++GNLPFL P+LHTYF  L +K+GPI KL LG ++ I+
Sbjct: 33  LWLYAKCKRRSPPLPPGPWGLPIIGNLPFLQPELHTYFQGLAKKHGPIFKLWLGAKLTIV 92

Query: 136 VNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSN 195
           V SS VA+E+LK +D+ FAN DVP  G   +YGG++I+W+PYGP+WRMLRK+CV ++L N
Sbjct: 93  VTSSEVAQEILKTNDIIFANHDVPAVGPVNTYGGTEIIWSPYGPKWRMLRKLCVNRILRN 152

Query: 196 ATLDSVYELRRREVRNTVAHLYRQC--GSAVNVGEQGFLTVFNVVTSMLWGGSVEGEQRD 255
           A LDS  +LRRRE R TV +L  Q   GS VN+GEQ FL + NVVT MLWG +V+ E+R+
Sbjct: 153 AMLDSSTDLRRRETRQTVRYLADQARVGSPVNLGEQIFLMMLNVVTQMLWGTTVKEEERE 212

Query: 256 GLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIFDKMIDERL 315
            + AEF E I E+ +LL  PNISDFFP L+RFDLQG+ K+MR  A R D +FD++I++RL
Sbjct: 213 VVGAEFLEVIREMNDLLLVPNISDFFPVLSRFDLQGLAKRMRRPAQRMDQMFDRIINQRL 272

Query: 316 KIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTIEFA 375
            +     DG  +  DFL  LL+V DE   KT LTM  +KA+LMDMV+GGTDTS + IEFA
Sbjct: 273 GMDRDSSDG--RAVDFLDVLLKVKDEEAEKTKLTMNDVKAVLMDMVLGGTDTSLHVIEFA 332

Query: 376 LAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLVPHC 435
           +AE++ NP  +K+AQ+E+  VVG++ +VEESHI   PY+ A+MKETLRLH + PLLVP  
Sbjct: 333 MAELLHNPDIMKRAQQEVDKVVGKEKVVEESHISKLPYILAIMKETLRLHTVAPLLVPRR 392

Query: 436 PSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFRYFP 495
           PS+TT+V  + IPK S++FINAWAI R+PN WENPL FDP+RFL    +DF G+DF Y P
Sbjct: 393 PSQTTVVGGFTIPKDSKIFINAWAIHRNPNVWENPLKFDPDRFLDM-SYDFKGNDFNYLP 452

Query: 496 FGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVLKMKSPLV 550
           FGSGRR C G+AM ER+V+Y LAT LHSFDWK+  G+++EVEEKFGIVL++K+PLV
Sbjct: 453 FGSGRRICVGMAMGERVVLYNLATFLHSFDWKIPQGERVEVEEKFGIVLELKNPLV 505

BLAST of CcUC01G003870 vs. TAIR 10
Match: AT4G12330.1 (cytochrome P450, family 706, subfamily A, polypeptide 7)

HSP 1 Score: 575.5 bits (1482), Expect = 1.2e-163
Identity = 281/479 (58.66%), Positives = 367/479 (76.62%), Query Frame = 0

Query: 76  IFLYVKLTRLRVPLPPGPWGVPLLGNLPFLDPDLHTYFAELGRKYGPIVKLQLGNRIGII 135
           I++YVK  RL  PLPPGP G+P++GNLPFL P+LHTYF  L +K+GP+ KL LG ++ I+
Sbjct: 33  IWVYVKSKRLFPPLPPGPRGLPIVGNLPFLHPELHTYFHSLAQKHGPVFKLWLGAKLTIV 92

Query: 136 VNSSSVAREVLKDHDVTFANRDVPQAGRAASYGGSDIVWTPYGPEWRMLRKVCVLKMLSN 195
           + SS   R++L+ +DV FAN DVP AG  ++YGG DIVW+PYGPEW MLRK+C+ KMLSN
Sbjct: 93  ITSSEATRDILRTNDVIFANDDVPVAGSLSTYGGVDIVWSPYGPEWPMLRKICINKMLSN 152

Query: 196 ATLD--SVYELRRREVRNTVAHL--YRQCGSAVNVGEQGFLTVFNVVTSMLWGGSV-EGE 255
           ATLD  S   LRR+E R TV +L    + G AVNVGEQ F+T+ NVVT MLWG +V + E
Sbjct: 153 ATLDSNSFSALRRQETRRTVRYLADRARAGLAVNVGEQIFVTILNVVTQMLWGETVADDE 212

Query: 256 QRDGLAAEFRETISEITELLGKPNISDFFPSLARFDLQGIEKKMREIAPRFDNIFDKMID 315
           +R+ + AEF E I+EI +++GKPN+SDFFP L+RFDLQG+ K++R  A R D +FD++I 
Sbjct: 213 EREKVGAEFLELITEIIDVVGKPNVSDFFPVLSRFDLQGLAKRVRRSAQRMDRMFDRIIS 272

Query: 316 ERLKIGGGDDDGSVKKNDFLQFLLEVNDEGESKTPLTMTHLKALLMDMVVGGTDTSSNTI 375
           +R+   G D        DFL  LL   DE E+   ++M H+KALLMDMV+GGTDTS NTI
Sbjct: 273 QRM---GMDKGSKGNGGDFLMVLLNAKDEDEN---MSMNHVKALLMDMVLGGTDTSLNTI 332

Query: 376 EFALAEMIKNPKTLKKAQEELTAVVGEDNIVEESHIHSFPYLKAVMKETLRLHPILPLLV 435
           EFA+AE+I   + +K+AQ+EL  VVG++NIVEE HI   PY+ ++MKETLRLHP LPLL+
Sbjct: 333 EFAMAELINKLEIMKRAQQELDKVVGKNNIVEEKHITKLPYILSIMKETLRLHPALPLLI 392

Query: 436 PHCPSETTIVSNYAIPKGSRVFINAWAIQRDPNHWENPLVFDPERFLKTEKWDFGGSDFR 495
           P CPSETT++  Y IP  S+VFIN WAI R+PN WENPL F+P+RFL  + +DF G+D+ 
Sbjct: 393 PRCPSETTVIGGYTIPNDSKVFINVWAIHRNPNVWENPLEFNPDRFL-DKGYDFSGNDYS 452

Query: 496 YFPFGSGRRNCAGIAMAERMVMYLLATLLHSFDWKLEDGKKMEVEEKFGIVLKMKSPLV 550
           YFPFGSGRR CAG+AMAE++V+Y LATLLHSFDW++ +G+K+E+EEKFGI+LK+K+PLV
Sbjct: 453 YFPFGSGRRICAGMAMAEKVVLYNLATLLHSFDWRIGEGEKVELEEKFGILLKLKNPLV 504

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG5592995.1 | 1.0e-286 | 37.23 | hypothetical protein H5410_043509 [Solanum commersonii]
KAF4350078.1 | 2.6e-285 | 42.85 | hypothetical protein F8388_019722 [Cannabis sativa]
KAF4360438.1 | 1.8e-283 | 42.93 | hypothetical protein F8388_001909 [Cannabis sativa]
KAF4404053.1 | 4.3e-280 | 41.82 | hypothetical protein G4B88_014509 [Cannabis sativa]
KAG9145073.1 | 1.0e-273 | 40.69 | hypothetical protein Leryth_018361 [Lithospermum erythrorhizon]

Match Name | E-value | Identity (%) | Description
A0A4D6Q415 | 1.1e-97 | 38.92 | Flavonoid 3'-monooxygenase CYP75B137 OS=Crocosmia x crocosmiiflora OX=1053288 GN...
Q8VWZ7 | 3.2e-97 | 40.25 | Geraniol 8-hydroxylase OS=Catharanthus roseus OX=4058 GN=CYP76B6 PE=1 SV=1
D1MI46 | 2.1e-96 | 39.96 | Geraniol 8-hydroxylase OS=Swertia mussotii OX=137888 GN=CYP76B10 PE=1 SV=1
Q7G602 | 2.6e-94 | 40.12 | Flavonoid 3'-monooxygenase CYP75B3 OS=Oryza sativa subsp. japonica OX=39947 GN=C...
A0A1D8QMG4 | 4.4e-94 | 39.57 | Carnosic acid synthase OS=Rosmarinus officinalis OX=39367 GN=CYP76AK8 PE=1 SV=1

Match Name | E-value | Identity (%) | Description
M1A0V4 | 0.0e+00 | 44.01 | Cytochrome P450 OS=Solanum tuberosum OX=4113 PE=3 SV=1
A0A7J6DVG2 | 1.2e-285 | 42.85 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_019722 PE=3 SV=1
A0A7J6EPW8 | 8.9e-284 | 42.93 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_001909 PE=3 SV=1
A0A7J6I924 | 2.1e-280 | 41.82 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_014509 PE=3 SV=1
A0A6N2KQQ4 | 2.4e-273 | 41.61 | Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS124803 PE=3 SV=...

Match Name | E-value | Identity (%) | Description
AT4G12300.1 | 1.7e-173 | 58.63 | cytochrome P450, family 706, subfamily A, polypeptide 4
AT4G12320.1 | 2.9e-170 | 57.63 | cytochrome P450, family 706, subfamily A, polypeptide 6
AT4G12310.1 | 6.8e-167 | 56.89 | cytochrome P450, family 706, subfamily A, polypeptide 5
AT5G44620.1 | 8.3e-165 | 58.61 | cytochrome P450, family 706, subfamily A, polypeptide 3
AT4G12330.1 | 1.2e-163 | 58.66 | cytochrome P450, family 706, subfamily A, polypeptide 7
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002401 | Cytochrome P450, E-class, group I | PRINTS | PR00463 | EP450I | coord: 141..162, score: 25.28; coord: 233..251, score: 21.62; coord: 350..367, score: 31.04; coord: 370..396, score: 32.64; coord: 491..501, score: 52.1; coord: 501..524, score: 29.44; coord: 117..136, score: 27.89; coord: 413..431, score: 40.87; coord: 454..478, score: 41.34
IPR001128 | Cytochrome P450 | PRINTS | PR00385 | P450 | coord: 361..378, score: 30.86; coord: 414..425, score: 48.01; coord: 492..501, score: 57.11
IPR001128 | Cytochrome P450 | PFAM | PF00067 | p450 | coord: 938..1396, e-value: 7.9E-95, score: 318.3; coord: 90..547, e-value: 1.2E-101, score: 340.8
IPR036396 | Cytochrome P450 superfamily | GENE3D | 1.10.630.10 | Cytochrome P450 | coord: 80..550, e-value: 1.5E-123, score: 414.9
IPR036396 | Cytochrome P450 superfamily | GENE3D | 1.10.630.10 | Cytochrome P450 | coord: 932..1413, e-value: 8.6E-108, score: 362.9
IPR036396 | Cytochrome P450 superfamily | SUPERFAMILY | 48264 | Cytochrome P450 | coord: 90..549
IPR036396 | Cytochrome P450 superfamily | SUPERFAMILY | 48264 | Cytochrome P450 | coord: 938..1408
IPR036396 | Cytochrome P450 superfamily | SUPERFAMILY | 48264 | Cytochrome P450 | coord: 595..771
None | No IPR available | PIRSR | PIRSR000047-1 | PIRSR000047-1 | coord: 72..531, e-value: 1.8E-14, score: 51.4; coord: 915..1382, e-value: 1.2E-18, score: 65.2
None | No IPR available | PANTHER | PTHR47951:SF3 | CYTOCHROME P450, FAMILY 706, SUBFAMILY A, POLYPEPTIDE 3-RELATED | coord: 908..1416
None | No IPR available | PANTHER | PTHR47951 | OS08G0547900 PROTEIN | coord: 63..551
None | No IPR available | PANTHER | PTHR47951:SF3 | CYTOCHROME P450, FAMILY 706, SUBFAMILY A, POLYPEPTIDE 3-RELATED | coord: 63..551
None | No IPR available | PANTHER | PTHR47951 | OS08G0547900 PROTEIN | coord: 623..733
None | No IPR available | PANTHER | PTHR47951:SF3 | CYTOCHROME P450, FAMILY 706, SUBFAMILY A, POLYPEPTIDE 3-RELATED | coord: 727..777
None | No IPR available | PANTHER | PTHR47951:SF3 | CYTOCHROME P450, FAMILY 706, SUBFAMILY A, POLYPEPTIDE 3-RELATED | coord: 623..733
None | No IPR available | PANTHER | PTHR47951 | OS08G0547900 PROTEIN | coord: 908..1416; coord: 727..777
IPR017972 | Cytochrome P450, conserved site | PROSITE | PS00086 | CYTOCHROME_P450 | coord: 494..503
IPR017972 | Cytochrome P450, conserved site | PROSITE | PS00086 | CYTOCHROME_P450 | coord: 1347..1356

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CcUC01G003870.1 | CcUC01G003870.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0020037 | heme binding
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0004497 | monooxygenase activity
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen