Sgr024537 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr024537
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: PKS_ER domain-containing protein
Location: tig00001291: 3960015 .. 3980237 (-)
RNA-Seq Expression: Sgr024537
Synteny: Sgr024537
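
The Location field gives a coordinate range on contig tig00001291, with (-) marking the reverse strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the genomic span of the feature can be computed as follows. The function name parse_location is hypothetical and not part of any database tooling.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00001291: 3960015 .. 3980237 (-)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(contig, strand, span)
```

Under that assumption the feature covers 20223 bases of the contig.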
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGCTTCGTTGTATCCAAAGAATTGGATATGCAGGTGGAGGTGGAGGTTCTGCTGGCCTTATTTCCAGCACATTCGCGAGGTATTTCTCGAGGAAGCGAGCAGAGAATCTTAGGAAGATCAATCCCAAGTTGACCCCTCAGGAAGCTTCTTTAGTTGCTCAAGATCTCTATGGTGTCCTCAAGCAGCACGGGCCTCTCACTGTTTCCGACGCTTGGATTAAAGCCAAGGTTTCCCTCCCTCACTTTCTGTTAAATGTCATCAATTATTTTCTTTCATTTCGAATTCTGATGGTGGTTATCAAAACTACATTTCGTTGAAACTGAAATGATATTTACTTAGTACTTCCAAGTTCCAACTATTTGTGCAGAGAGAAAAGGCTTAACACCAACTACGTAAATCAATAAAGAATTTCTAACTGGAAACAGTTGAAACTAGTAAACAATATCCATAGACTTATTAACAGACACCCCATTTGGCATTAACCAGATGCTTCCAGCCCCCCTCAAGCAAAAGGGTTCAGGAATAACCAGAAGTTTGGACTTGAGATGGAAAATAATGAGAAAGCAGTAGCTTTAGTAAGAATATCAGCTGTTTGATCTTGAGTTGGAATAAATCACACTTCTAGCTCTTTACTCAGAATTTTATCATAGAAAAATGCAAGTCTATTACAAATTTTCAATATTGTTGGTGCGAGCATGTTAAGGGTGTTAGCAACTATTGAGGCAGCATTAATATCATCATGTTAAATTATTTGGAGTTCTTAGAATCAAGACATTCAATCAAGAGTTTCTTGATCTAGGTTAGCTCAGCAGCCACATTCGCTAAAGCCGGATATTTACTCTCAGAACTTGATCAAGAGACCACATTTTTTTCTTGGACGACCAAATTAATAAAATTTCACCAACTAATATTGCAAAATCACCGTTGGACTCTCTTTCATCCAAGCAACTAGTCCAAATCTGAATGTGGATATACTTTTAAAGAAAAACGATCACGCTTTTATAAGAATTAGACCTTCATTCATTCATAGTCCCCTTAAGATATCGAAGAATGCTTTCCAAGCCATCCAGCTAGAACATATAAATTGCTTCATTTAAAACGCCATTGTGAAATGCATTACTAATATCTAGTTACCTAATTTTCCAATCATTTTCCAAAGCAATACTAATCTACCAAGCAAATTAAAGGAGTAATCTTGATGAAACTCTTTTTTGCAACCAAACGAGTCTTATACCTTTGAACACTTACCATAACATGTAATTTTATCTTGAACACTCTATGCTATTTTCTAAAGAGGCTCAGGAGGAGCAGTAAGAATTGAAGTGGGTTAGAATGAGTACAAGAAGAAGTTGCATAAACCTAAACAAAGAGGATGACTTCCTAAAGAGGATGGTTGATTTGTAGAGTTTGAAACATCCAAAGATGAAGGTGGAGTAGATGAAGTATTCCTATTAGCTGTATGATGTGAAGCCAACAGTTCAAACAACTTGCTACTATCGGACAAACACCAATCTGGACTACTTGGAGTAAAAGAAAATCCTAAAGACCCAACCTCGCCATGTGTAGATTGTGCATTAGTACTTGAATAGTACTTGAATAAGATAAGGACCCTAAAAAAGGAAAAAATGTTTTTAAACTTTACATTCCTGACAGTATAAATTCTCTCCAACATATGCAAGCGAAAGTGCCCTTTGTAATACTGGAATCGGTAATAAGAACACACAAATCAGATCTGAATTGAAGCTTACTGTTATTGTACCATTGAAGGTGAGGAAGGCAAGCGCATCCAGAAGGGTGAAGAATTGAGTAACTAGGCTGCTTATCAGATAATATTTGCAAAGGAAACAAAATGTATAATTGGCATCTGCGACCAGTTAATAAGAAAAGTAGTTATATTAACAACCATTTGTCCTTTCGCCTAGTGGCTGTGGTGCTTCCTCTTAGAGTTCAATACACTATTGAATAAGATTTTGGTGAATAAGTCCGGGGGCTCTAA
TTGATGGGATGCTATTGTGGTGATTAAAGGCAATAGGAACCCATGGAAAGTGATTTTGGAATGTCTTTCTAGTTTTCCCAAGTTCATTAAAATTTTTGCAAGGGATGGGACCTATGTTTGCTTTTGGTTGGATCATTGGATTGTTAAAGGTACACTTTGTCTGCGTATTCTTGTTTTTACAACCTTTGTTCATTGAAAAATGATGACATTGCTTCTGTATTTTATTCCCCTACGAGCTTTCTCTCTTATAACTTTGATTTCAGGAGAGCTTTGTCTAATAGAGAAGCCAATGACTTGTTTTCTTTGTTGTCTCTTTTGCATGGCGTTTGTTGTCTCTCAAATTCTTTGATGGTTGTATTTGGTCACTTGAGATCTTTAGTGTGTTCTCTTGTAAGGTTTTCTGTTTTTGTTTTCTTTATCCCTCTCACTTGAGTCGTTGTCTCTTTTTCTCATCCCTTTTTCTTTGCCAAGCTTTGAGAGGTTAAAATTTTAAAATTTATCGAACTAACTTTTCATTAGACGTTTCTCTTGGTTGCCAAGAAGAGTATAGCTCAGCGGTTAAGACATCTCTACTTCTTCTAAGAAGTCATAGGTTCGAATCCCCCCCCCCAATTTGCGATATTCTCAAAAAAAAAAAAAAAAAACTTTTCTCTGGGTTAGTTGCTCATGGGATGATTAACAACCTTTGACTTTGTCTAAAGGTCTTCATCTATGGTCTAAGAAGCACGGACACTTCATTATAGGTGGAGAATCTGAGTCAGACATGTGTCGGACACGGAGATGTCCGGACATACCCCAGACACGTGTCGACATGCCAAATCAGTGTCCAATTTTTTTCTTTTTTAAAACTGGATATTAAATCAAAGACAAATCAACAACTACTATCAACAGTAGAAGATCTGTAGCAAACCCTTGGCATCATCAAACGTTTTGAACGCTTATCCAGCCTCAAGATTAATTTAGAAAAATTAGTAATTGCAGGGATTAAGATAAATAAACAAGAAATTGTGAACCAAGCTAAAAGAATTGGTTGTGCTGCTCTTTATGGGCCGCCCTCCTATTTAGGTATGCCTCTTGAGGTTAACGCTCTTAATCATTCCTTTTGGACCGGGGTTGTGAAGATAGTTGATAAAGGAATTGATCATTGGAAGACTGCTTGTTTCTTTGTCTTTTCTAAAGTTTGGCTCTTTACAACTGACTTTTTTTTTTCTTCTTCTTATTATTTTTTTGGTGTGGACATGCCTTCATTGTTGCTTGTATACTGAAATCATTATAATATCAATTGCTATATTTTATTAAAATTGCATCTATGCCTTGTCTGTTGCATTATTTGAATACAAAAGTTATAACTAATAATACAATTTAATGCTATACTTATATAGATTGCAAGTATTTGATTTTTTTATGAAAAAATGAATATAGCACCCAATGATTATTGAATCCTGTTCAATATAGTTCATCGAAGGAAATCAAAATGATTGAATAGATTATTGAAAATCATATTGAATAAATTATTGAATAATGTTTTAACAATTTTTTTAATCTTATTCAATACAATATTGATCATAACTAATAAAATATGGTACAGAGTATGAGAGTAGCTTCTTTCATTAATCAAGTGTGACAGTTAGAAGGAGCATAATCAAATTGATGTCATCCGTTAACTATTTGACATGTATTAAATAATATATTTAATAAATATTTTTTTTCTATTCAACAATTATGAAAAGATATTATTAAGTAAAAATTTATAATGATTGAAAACTATCAAGTGATTTTTTCTATTTGTGTAAATTATGAGCATAGTTGCTGTGTATAAAAAGGTCCAATAAAATTTGTCATTTTTTAAACTGGATCATTTGAATATATATTATAGAAATTGTGCACCATTCATGGAGTCCTCATGAAAACCAGTGGGTTAGCTGAAATTTTATGTCCTTTCTATCTAATTCTCATTTTCATATCAGGAATCTGGCGTGAGTGGATTGAACAGCAAAACA
CACATGAAGCTAATGCTGAAATGGATGAGGGGAAGGAAGATGCTGAAGCTTTTCTGCAACCAAGTTGGTTCCAGCAAGAACTTTCTGCTTTCTACTCTGCCCGATGATCCTCAGGCAGATCAGTTGAAGATTTCTTCAGAAGTAGGGATCCAGAGAGAGAAGCCACCAGTAAGGAGGAGAAAGCCATCAAAATAGACCTTTGCTGACGTCTGTTAGTGTTGCCGTGCAGTGGGCGGGCTGTCCTCCTCTGGTTCATTTGTACTGATGGCGGGCTCACAACGTATGGTGAATTTGTGAGTCTCTGCCATCACTGTTTTTGCCTCTATTTCTGCTCTTTTTGGTGGAGATACAAAAGGTTTTTGGATATGGAATAACACATTGACTTGTTTCCATGCATTTAAATTCATGGTTGATAAAGCATGGATTAGGATGTCTCGTGGATCAAGTAAAAAGAAGCAAGATCTTGAAAGATAAATGATTATTGTCCTGTTGTGACAGCTTAGAGTGAGTTTAGGATGATTTTTAGAAGAAATGTTTGTAGGTGAAATATTTTTTCTATAAGCATTTTTGGAAAGAAAACACTAATTAATTTTAGTGTTTCTTTTAAGTGTATCTAGAAATTTCAGAATCATTTTTTACTGTGTAACCAAACTCCATGCAAATTTTGAAAAATGTTTTTAATTGTCTAAGAGTTAAAAAATCATACTAAATTCACCATTAATTAAAGTGCATTTGGCTACTACTTTGAAGTGTTTAATGATTTTTGATTCTATGACTTTAAAAAGGACAAGCTAAAGATCCAATGACTTTTTCAATTCTTTTAGATGCAGTTTTTGGCCTTATGGGACAAGTTTAGTTTGATGCAGAAGCAGAAGAGGCATTAATAGAAACACAAGAGGACCATGGAGGATTGAATGTTGGTGTTATTTCTTTGTGGATAAAGTATGTTTATGTGAACTTGCAAAGTTAAAAACTAGATTGATAGTCCAATGCTCCCAGCATATGAGGTAGGGTGTACATCAATTGATCGGTGTTGGTTTTAGGTACAAATCGATGCAAAAAATAAACCAATCAGTTTTAATCATGTGGAAACCAACAGTTTTTGGCGACTCATGTTAAACCAATTGATCGATTGGCATTGTGGTCAGTTTGGTTTTGATCAAAACTGATAATTTGACTTTTTTTTATGGTCAGTTGAATCGTTTTTCATCGGTTGGTCGGGTCTATTTGGATGCAAATTGATTGATTTGTGCATCTTTTTTCAACTAGCTAATGAATTCATCTAACAATTTTTTTAATTGGATAATAGTTTATTTTGGCTTACTCTTATTATAATTAGATAAGCGATAGAAATGGTTTAATGTTTAATTATTGGATAATAGCTTATGAGTTGGATCTATTTTCATCAGCCAGTTTGTCAGGAAGGCTTCGGTCGATCGATTGGTCATAATAATATTAAATAACTATTAAAGCTTTTTTAAAATTTTAAGGCTGAATACACCAAATTATCCCTAATCTTTTAAGCAATTCTCAATTTAATCCTTAATCTTTTTGTAAAATTCTCATTTTTACCCTTAAAGTATTAAAAATTCTCAATTAAGTCATTCCATACAAAAATATGTTACATTTTGATCAATAAATTTTTTTTGTCTCATTAAATTATTGTTTTATTAACATATTTGAACATGTAAACATTCAACTTGGACATGCAACTTTGGAATAAATAGATTTTGTTGATGTGCCGGTTGAATTTGGTTGATGCATTGGTTGGGTTTCATGTATCAAGTAGCATCCAAGTGACATTTCTCCGTGTCCAAATATGTTATAAAGCAATAATTTAATAATACATCAAATTTTTATTAACAAATTTTAATAATTTTTTTACGGGAAGACCTAATTGAGAATTATTAATACATTAAGGACCAAATTGAGAATTTTAAAAGATTAAGGACTAAATTGAGAATTGCTTAAAAGATTAGGAACTATTTGGTATATTTAGC
CAAATTTTAAATATCATTGGTCGGATGGTTGAAAAGGAGCCTTTTCACTATTTCAATCGAGACATGTCAGTTTTGTACATTGTAAAACCAACCCATTCGACCAACGTTGATTTGGTATGTTTCAATTAGTTTGGTCGGTTTTTTGGATTTTATTGTTTACCCCTAATATCAAGTAGAGATGCTTTAAATCCAAACATAGTAAGATTAGGAAAGAGGACCAAGACAATAAATTGCATATCTTAAAAATACTTTAGGCGATCATATTTAGACAAATGCAGATGATGGTATGACCATTTTTTTTTTTTTTTTGAGTTCAACAATTGGAGGTGGGAGATCGAACCATCGACCTTTAAGATGGTAATAGGTGCTTTATCTACTGAGCTATGCTCAGATTAGCATTACCACTTTTTTAGAACTATAGGATGCACGTTTGCAATCAACGTTTAACTCTCATTTCTCAAATTGAATTCTCATAGAATGATCAAGCGTGAGAGAGATTGATAAGAAAGACCAACTTTAAATTGTCATTTTACTTTGAGATGTCTTAATCTATAATGAGCTTTAGATCAATATATGAATTTTTTAAAAAAATTTGCTTAAAGTTGTTCCATATTAAAGTTTGTTCAAAATTGATAAAGGATTTATTACAAGATCAATAAAGTAAATATGAATTACTTCTTTTATTGACTTTTGAGCATCAATTCGAACTGACCCACTTAGATCTTATCTCAAAGACACAGTCGTTAGAATTATGTCTTAGTTTGTATCATAGTTAATATTGTAGTTATTGAATTTAAAATAATTTTTACATGGTTAATATTATATTTTCTAAATTTTATAAAATTAAAAAAAAAATTGAGTTATATTGTAACAGTCCTATAGTTTGATGGTTTAATTATAAGAATCATTATAGTTTAGAAGTTGTTTGTGTATTTTGCAATCTATTACTAATGTTGTCAAAATAGTAGTATTTAACTGTAACAATTTCTTGAGTCTGGAGATGATTGATCTAATTGAATATGAAATTGAAAGTTCATTTGTTGGATTGTAACCACTAGATTTGTTTAGTTTTATTTTTATTTATTATTATTATTATTATTATTTTTTGGAGGATTGAATACTTAAACTTAAAAATTAGATTTCGCTGCAAAAGCACAAACTACTTTCTGACCTGTCCCAAAAGATCGCATAGATCCCGTTCTTGAAAGGCATTGTCCAGCATGTCCACGTTTCGACATTTTATATGTTCACGAGTGCATCGACATTTTATATGTTCGTGAATTCGACATTAATATAAAGGGTGAATTGATATTTTTATTATACCGTTAAAAAATCAATAACATATAAAAAAATATTAACAAAATATCACTAATATGAACATAATATTAAAATTTAAATATCATTTTGGTCCCTATACTATCAGCTTTAGTTTATTTTGGTCATTGTACTTTCAAAACGTCTATTTTAGTTCATGTACTTTTAACTTTTGTCCATGTTAGTCTCTGTACTTTTGAAACGTTCGTTTTAGTCCCTGAACTTTTAAAAAGTGACCATTTTGGTCCCTTCATTTTCCTAATTATCTCATTTTTTTCGATCATAATTTTAACACAACACTATACGCAATAATTTTTTTGAAATGTAGCCATAACTTTGTATTAAAGATTTTTTTTTTAGTATATAATTTTTGTTAGAATTTTGCAAATAGGGACCAAAATAGTCACTTTTTTAAAGTTTATGAACCAAAATGAACATTTTAAAAGTATAAGGCTTAAAATTGACAAAAGTTAAAAGTACAAGGACTAGATTAATAATAAGAATTTTTAACGTGTAAAAAAAAAATATATATATGAAATGTCTATTTATTAACTTAAATATTTGTTTGAATTTGTTGATTTTTATGATTTTTCCATAAATATCTTATATAATATTTCCGTCACATTTTCGTAAATTGAGAATTGTTTAAAACCTTGGTATTGACTATTGACTATTGACTATTGAGC
TCCAAGCCACATCATCCTTTCCTCTGATCTTTGCTGCCCTTTCACTTCATCTTCAGGCATTTTATGATTTTAGCTCTCAAGTTTCAATCCTATCCCGTTCCTTTCATTTCTCAGGCCATCGCGTTATCGTGAAAATCAAATTTATCCTTGAGTTATTTGTTTGTGGAGAACAATGAGCTCCCATACTTCCGCTGAAGACTGTCTCGGTTGGGCTGCAAGAGATTCCTCTGGGTTTCTATCTCCTTACAAATTCAGCCGTAGGTATGTGATCTCTCCATCCCTTAACGCGATGAGGCACCTGAAATTCCTTTTAGTTTTATTTTCTGGAGGTAGTATAATGAAAATGAGTTGTTTATCATTCATGAATGTTATGATATTAAATATAAGGGAATATGCAATCAACTAATTTCAAACAACTCATTAGCCAATTAACACTATTAGGTAAGTCCAGCTATACATTTAACTAAACTAAGTACAGTTGTGCAAGATAATGAATCAATCAACTAATACTCTAATTCTCCCCTGAAGGTGGTTGATGAAAATTCACAACCGCCATCTTGTCAAGTGTCACAAGCAGCCGCGGAGCAAAAATTTTGTAGGTTGGAAAAATCTTAAATTCTTACTTTATATGTATAATCTTTTAATTCTAAAAATTTAGGGGAGGGGAATTAGAGGGAAGACTTGAAAATCTTAATTTTTCTATTTAATTTTTTTTGAATATCAGACAACTAAGACAACTAAGGGGGAAATTGCTCTCTTCTTCCTTCGCTCCAGGCTCATTTGGAGATTGCGGGAAGTCATAGACACCAATAGAAGCCTTCCTTGTGCTGTTTTAGCAAATCCACTTGACTAATAAATAAGCCTTCCTTACAAGTTACCCCTTGGAAAAAGATGGAATTTGACCCCAAAATTGGACCCAAGAGAAGGGGAGTAGGGATGAATCTCATGCTCGCCCGCAATCCTCATGCTAGTTGTTTGCTTGATTGGTAGGCAGCCTACCCTCGTGATATGCCTAACCTTATTCATGCTAACCTCCCTAGGTTATTTTGCCATCCACACTCGCATGGTCAACCACTTTGTTAGCTAACCCTCACTAGAGTATCTTCCCCTCTTGTGCCTAAGTGTAGCTCAAGGAAAACTATGTCCATTAGGATCACCGCACCAATGATTTGACACCAAACGGAATCAACAAAGCTAGTAGAGCCTTGGTAAGCATATTAGCAAGCTATAAGAAAGATCAAATTGGTTGAAGTTTAGTAGACCTATCAAGAATTTGGTCCCCAGCGAATTGGAAACCTATTTCAATGTGCTTTGTCCACTTATGTTGATCAGTGACAAGTTAAATAGTAACTTCATTATCACAAATAATAAGGATGGAAATAGATGGAAAACTTTATGATCGATAAGAAGCTGATCAAGCCAAATTAACTTGCTGGTTATGACAGCAAGTGCTTGATATTGGGCCTCAACTGAATATTGAGGAACAATGATTGTTTCTTTGACTTCAATGATACTAAGGAGTCACCCAAAAGTACACAAAACCTACTTTCAAGGCAGGATTTCCAGCAGCATCTGTAAATCCTCTTTGGAATGAAGATGAGGCAACAGGCAACAAACATAATTTCTTGGCTAGAAGTGGCTGGAGATAATGCCACAAATGGTGAGTACTGAGTAGCAACTGTGAAACGTTGAAGGCTGATAAAACGTGACTTATAACACCATATATGACTTGATGCTCTTGAGTTTATTACCAAAGTGGCCAATGGATCAAGTGAGGACACTAAGGTGCATTTAGACACATACTTGCTCTAGCGTAGGCACATCCAGTTGCACAAGCATCATTCTTAGCTGATTTCAAGTTGGATTGGAGCAAGCTCTGTAATTATAGATAGTGCATTGAATTGAGGTTACTAAATAAATTGTTTGTCATTTCACTGTTTCCTTTGTTCATCAATGGAGTTGCAATTTTATCTGTTAAGGTATTCATCTAAGCTGCAACA
ATCAGGGCATTGGAGGCACGTTGATGCATAACCTGGTGAATAATTATGCAACTTCATAGACAAGCATTTGATTTGTTTATGGTGCGTCTAGTGAATTGACGACAATGAGAACAAATCGATCTATTTCTTTGCATCCTATTATCATGATTAGATTGTAGCCTTTGCACTGACTAGCGAAGGTTAATGTAATGAATTGAGCATCACAAGATTGTGTGGATTAATGCTCCTTTGTGTTTCCATTGCACTACAAGAGTGATTTCTTTGTTGATGGGTGGAATTGGCTCCACGAGCAGCCACTGCACACAAATTTGCAATAAGACACATGTACAATTAAGTTCATTTACATGCTCAATTTGGTAGAGATAAAAAAATGCCTTCAGTCCTCCATAGGTGCGCTTTCCACACAAACTAGATGATTTATTCTCATTCAATTCCTATCAGAGTTTTCAGGTTCGAGTAGTAAGCTCCCACAGTGTTGACAAAACCTCCTCTATATTTCTTTGTCCCCCACAGTGAGATCTCCCTAAAACAAATTCATCAATGCTTTTCTGAGTTGGAAGTTTTGAGGACCATTTCACTGTTGGAAGTTGGAACCTCTATTTCAAATCAAGCCAAATGTCATGTGGTGATTTAGATTAGATGATACTTCCTGAGATCTTGTTGAAGACTGAGGTTAAGATCCAAGCAATGAGTCCATGACCATATGGTTATTTCACTGCATAATGGATCTTTTTCAGGCCATGTGTTTGTTTGATCTTCGTTCGATCTAAGAAGCCAATTTTGTTCATCACACTGACGATGTTGTAGTCTGACATACTCCAAGAGGTAAATTTATTCAGTGAGAAAGGAGCACCAGACTAGTGTCATCAAATGATGTAGGAAGTAAGGATTATAGCGGTCGTAGTTCCCTCTGGCTGAGCTTTACTGGTTGCCATGGCCCAGCGTTTCTAGAAGTTAGTGTTCTGATACCATATTTTGAGAACTGATACAGAAATAAAATGCACATGAGTTGTATATCATTTATAAATGCTCTGATATTTAAATATTCAATCAACTGTCAGTAATTAACTGATTAACCTCTTGAGTAAACAAAGTCCATCTGTACAATTAACTAAACTAACCAAATCCTGTTGTATAAGAGGAGAACCCAATTAATTAGTACTCTTAAATGCTGCAAAATGGTTTTCTTTGAAGTATAATTTTTATCCATAAATACTGTTTGAAGTACTTGGACTTCGCTTCAATAGGATTAGGATCCACTAAAGGATTAATAATCTATTTGATTCATCATTATGCAATGTAATTAATCCAGCATTTCTCAATGATTTGAATCGCTGACAGCCATGATGCTGATTCCCATCCATGTACCACATTCTAATGTTGGCCTATGTGATTTCATGAACTTATAAATACCCGATGAAAATTTATTCTTATTGATATTTCAGATATATCTTGATTAAGCCTTAACCAGACTCTAACTTTGTCAGGGTTTCAGGAGATGATGATGTTTCAATTACAATAACGCATTGCGGAATTTGTTATGCAGATGTAATATGGACAAGAAATAAATTTGGAGATGCAAAGTATCCTTTGGTGCCTGGGTAAGTTAATTTTTTTTTAATGTTATTGCAAGTCCTCTCTTTTGGAACTTTTGAAGAGTTTCTCATTAATTGTGCAACAGACATGAGATTGCAGGAATTGTGAAAAATGTTGGCTCAAATGTTCAACGATTCAAAGTTGGAGATCATGTTGGGGTAGGAACTTACGTGAATTCTTGCAGACAATGCGAGTATTGCGAAGATGGTCAAGAAGTTAGTTGCATAAGTGGATCCATTTATACTTTTAATGCAATTGATGTTGATGGTACAATTACAAAAGGAGGATATTCCAACCACATAGTTGTCCATCAGCGGTGAGTGGTAACTCCAATTCTCTTTTATTATTCTGATATCAATTTTATACCGTTTCTAGATGTGTAATGGGGAAATTGTTTAAATATC
TTTTAAAAAAAAAAGTTTAAATATGGTTGATCTCTACATGAGATAAGCATCGACTTATTTAGTATTGTTTTCCCTTCTTTCCATTGAAATTGGTCTAGAGGCTTTCTCAATTATTTTCAATCAAATTTTCTTAGTCACATACTGTGATTTTCAAGTGTCTGGGGTGGAGTATCTCATAGGCTTAAAAGAAACATTTTCAATTATTGTATATTGGTTTTAATAAGGCAGATTGTTTTTTTTTTTAACAAGAAACAAACTTTTCATTGATAGATGGAAAGAATGAAAAAAGTTCAATGATACAAACTCTCAAAGGGAGTGAAAATAACAAGCAACCAACCCAAAGAATTACAATAGCCAAAATAAAACATTGAAGAACAAAGCGCCCAAGGAAGAAACAAATAAGCATAATTATCTCATAAGCAAAACTGCAAAATCAAAGAACCACTGAAGCTAAAAGCATATCCATTGAAATACAAGACTAGTTCCAAACCCATTGCCCAAAAGACAAAACTCTGGAAGAGAAAGTGATGAAGACTCCAGCAAGGGAGGTTGCTCCAACTACTATCAAAGCTACTAAGCACCATGAAGAACATTGAAAACAAACCCCTACTTAGACTAGGGACCTACTGCAATCTCGAAAATTTGATGAAGATCCTCAATAATATCTAAAAAATCTTTGGATCTTCCATAGTGTTGTGCTCATCAATAATGAAAGAGGAAATTCCCCTTGAAGAAAGATGAAAAAGCCTTATTTCACTTAAATCTTTGGAAAGGCGATCTAAATTCTTTTCAAATTTTGAATCGATGCTACTGACACTAACAACCGAATCGTCACTAAAATTCGAAAATTCCTCCTCAAAAGCACCTTTAATAGCATCTTCATTAGTTGAATTAATAATACCATGAGAAAAAGAATGGGTACCCTTTGAGAGCACATTCAAGAACCACCCAACAGATTCTGAAATACTCTCCAATGAACAAATCAACAAAACTAGAGGGAAGGGCAGCCAACTTTGAACCATCTTTGATATTTCTTTGGTCGTAGCCGAAAGGAAAAGGCTCCATTGAGATCATTTAGGAGACAAAACTAAACCTTGAATCAATTTGGATTCTTGAATCCAAAGAATATCTTGACAGAGATGCTTAAGAAACTTCTTGGTAGTAACTCACAAGAAGAACTCCAAACCCCTGGAATTTCAAGAGATGATGTACATGAATTCAAAAAATACCCCTAATAAGGCAGATTGTTACCTGCTTTGTTTCATGATGTAGGCGATGTCAATTCATGATTACTTTAATGCTTTTTTTATGAACCTGTTAATAACATTTTTTCATGAGCTATAAGATTTATTGTTTGGAAAATATTATTTAGGTGATTAACATTACATCGCATTCACACTTTTATAATAATAATGATAAATGGTAATAAATGAATTTGGAAGTAGTTGATATACTACTTGCGGGGCAATTTATTCCAAGCTTGCTTTCTAGTTAATTCAGGTCTAACCTTCATTACGTGAAGCTGAATGTACATATAATACTTTATATATTACACATAGCTACTTACCATATAACCCTTTAGCCTATGGCAAGTTTGTGCAGATTTATATTTGTTTTCTGATTATTGGTCTACAGATACTGCTTCAAAATACCTGACAATTATCCATTAGCTTCAGCTGCTCCTTTGCTTTGTGCTGGAATTACTGTTTATGCTCCGATGGTTCGCAATAAGATGAATCAACCTGGTAAATCTCTTGGAGTGATTGGACTTGGTGGCCTTGGCCACTTAGCAGTGAAGTTCGGGAAGGCTTTTGGACTGAACGTGACGGTTTTTAGCACTAGCATATCCAAGAAGGAAGAAGCCCTGAGCATTCTTGGAGCAGACACATTTGTTCTCTCATCTGACAAAGAGCAAATGGAGGTTATGTAGAACAAGTTCTCTCGTGCATTTCATTTTCTTACATATATCAGAATTATCCTCTTATCTGTCTATTTTATTC
CAATTAATTCCCATATTACCTTTTAACAGGCGTTGTCTAAGTCCCTCGACTTTATTATTGATGCTGCATCTGGTGATCACCCATTTGATCCATACATGTCTACGTTGAAGATTGGTGGCGTTATGGTTCTGGTGGGATTTCCTAATGAAGTCAAGTTTAGTCCTGCGAGCCTGAATTTGGGTATGTTCAAAGGCAACTAGGCTTTTATTAAATGTCATATGTGTTGATATTCTTCCTCATTTCAAAGTATATGAAATTGGCGACTTAATGAATTGAATGGATCGACTTTCTGGCAGGTTTGAGAACCATTTCCGGAAGCATAACTGGTGGCACTAGAGTTACCCAAGAAATGATCAATTTCTGTGCTGCTCATGGAATCTATCCAAACATAGAAGTGATTCCAATTCAGTACTCAAATGAAGCTATTGAAAGGGTAATAAAGAATGATGTTAAGTACCGATTTGTAATTGATATCCAGGGCTCCCTGAAATGATGTTGAAAGAAATGATGTTGAAAGAGAGAGAAAAAAGGTGGACTTGTCATCGATGAAAGTGAAACACCGCCATGTTATTTGTATTTTTTGCGTTAGGATATCAAATAACATTCATGTTTTCGTTTCTGTTACAGAATCCTTTACTGGGTTATGATTTACAGATGCATCCTGCATTTCAGTGTCAACAAAAATCTTTTCGTTCGCTTGCGGTTCAACTGCAACCATATGATCTGCTTTTTTACCCCACAGCACTAAATAGAGACCCAGTATGATAAAGAATGCTCCTACCAGGCTTGAAAAAAACAACAGTTTCAGAAAACAGGGTTAGCAGCTTAGCTACAAATTCAATTCCATACAAACAGACTAGGGAGGGAGTGTCTTACCTGCCTACATGGAGCCGCTCTGAGAAAACAAATGCAGAGAATATGGCCACAAATACAAGCAATAGGGGACTAAACATTGCAGTGAAAACTGGCCCGTTGCGGCTGATGCACCAGGTTTGTAAGAAGTACCCCACGGCCGAAATCACAATGCCCTGTAGCCAAACACGAGAGAGGATTAAGAAAGTGGAATCTAAGAGTTGGGGGTTTGAAACGAATCATTCAGCTATACTCACACTGTAGACGATGGTTAATAGTCGCACATCCCATTGCAGCTTCCATTTGTTTGGACTTCTTGCAAAAATCAGGGCAAGGAAGCCAGATTGGGCAGTTGCAAAGAAGCAAATCAAAACAGTCATGGTCAGTTGTGCTGGGTATTTTCTGGTGACAAATGCCTGCAAGCCATCGAGTTTTGATGAATACATTTTTTTTCAGCTAAATTTGCCCAACCCTTAGTCCTTTCCAAACTCCAAACTCTACATAAGTTTTTGAATGGATGTACCTGTAGAATTAGCCACGCACTCCAAGCAATATCACTAATAAGAATAAAAGCTGAGCCCTTGAGCCAGTTTTTATTGCCAATGCTACTTCCTGAACCTTGATTCTTGTATATGTCTATGAGTGGCTCCTTGAAGATGGCTTTTGTTACGTATGGTCCTTTCCAGAAGGTGAAAACAAGGGAACCACCAATGCAGAGAATTGTGCCCACCACCTTAGCCGTACCTCTTACAGTTCTAATCTTCACTTTCTCCAATCTTTATAACGTGGGAATGGGACACCATAAATAAACAACTTTTGACTGTAATAATCATAGTATTGACAAATTTGCTTGGAAGCAGCTTGAGGGGACAAAAAGAAAAAAGAAGAAAAAAAAAAAAAAAAGTAAAGTAAAAGAAAGAAAGAAAGATCGAAAGCATGTGTATAAGAAACAAGTGAGATAACAGTAATGCCTTGTAATAGTGTTTTCAAGTGAAATATATATATTTTTTTCTATAAGTATTTTTATAAGAAATGGAACAAATATGTTGGTATTTTGATTTCTACGTTGATTTTACAAGAATGTAGGATATTTAGGTATTTCTAGATGTCTAGACATATGTTAATCTGAGCATAGCTTAGTGAATAAA
ACACTTAATTACATTTCTTGAGGTTGAAAGTTTGATCCTCCACCCCACATTTGTTGAACTACAAAAAAAAAAAAGATGTCTAGACATCTCCAAGGAGAGAGAATTTACACCTACAGCCCATAAGTCTGACCACTTATAATCTTATATTTGCATTCAAAAAATTTTGATGCTCTCTTCAACTAATGGAATTTCTAAGATTATCATGATGAAGGTATCTTTTAATGACATTATTGTCCTTCATTTTTTTCTTCATATGTTTTCTTTTTTCAATCTCTTTATTTCATTTTTCATTTCATATTTAAAAATAATTAGAAATTTTTTTATTAATTCACTAATAAATGACTGTATACAATTAGCTTATATTTATATATCACATTGATATATAAAATAAATTTCAAATAAATTGTATAATACCTCAATCTATAAACATATTAAATTAAAGTAAAATTCTTCAAAAATTAAATTTTTTAGATTATATATATACATAAAATATTAATTAGCTTGAATTGTATATCTATTCAATTTATACTATATGAATATATTTTGACTTTACTATATATTAATTAAACATAGAGTTTTATTTTTTTAAAAAAGAATATTATCGAAATTGCATATAGTTATAATACATTCCAGAGGCTTTTACCTAAATAGTATAGTATAAATATGTATTTTTAAATATAGCAAGAAGTTTAAAATTTTACAAAAATAATACAATCAAAATCTAACTTAAATTCAAATGTAATATAACACATAATAATAAAAGTCAAATTTATGCCGCATCCGTAAAAGATGAAAAGTTATAACATTTTTTAAAGGCTAAATATATCAAATGATCCATAATCTTTTAATTAATTTTCAATTTAGTCTTTAATCTTTCAAAATTCTCAATTTTACCTCTAATGTATCAAAAGTTTTCACTTAGGTCATTCCATAAAAAAATTGTTAAAATTTGTTTACAAAAAGTTGATATATCATCAAATTATTGTTTTATTAACATATTGGGACAAGTGAATATGCCACCCGGATGCTACTTGACACATAAAATCCGTCTCGTTCATCAATAAAATCAGTATCATTTATCAACAAAATCTATTGCTCCTAAGTGACATGTCCATGTGTCCAAATATGTTAATAAAACAATAATTTATTGACACATCAACTTTCTATTAACAAATTTTAACAATTTTTTTTTATAGAAAAGACTTAATTGAGAATTTATAACACATTAAGGGTAAAATTGAGAATTTTAAAATATTAAGAATCAAATTAAAAAAAATCTAAAGATTAGAGACCATATGATATATTTAGCATTTTAAAAAATAATGTCATTTAATGAAATTTCTTTACATTTTCAATCCCATTGGCCATGGATATCTAAAATCCTCTCATACACAAATAGCAAACACGTGCTTAGCTGATTTAAAATATTATTTATTTATTTAATCTATAATTATAAAACCATAAATTCTTTGTTACTTTATTTAAATATTTAAAATTGATACTTAAGAAATTAAAAAAAAAATAACGAATATCGAGTATTAAGAAATAGTATCCCAAAGAATAATTTAGTTAACATAGTTTATTTATTAATTTTAATTATTAGTTTATTCATTAAAATTGTTAACAAATTAATTAAATAATTATTGATCGATAATATTTAAATTGATTGTTCAATACTATAATTAATTATACTAAACTACTGTCAGAAAATATCAAATAATATATTGTTTTTCTTACAATTCAAGTATGAGAGTAAGAATTCTTAGGATTTTTAAGAAAGATTACTAATGCCTTGACCATTTGAGCTATAGATCAAATATCAAATTATTATGTTATATTCTTATGACCATATCTAGTAAATCAAACAAATATATTAAATTTTCAGGTATTTAACTCTTCTATATAATTACAAATTGGACCATAGTTTAGTAATTAGACATGTGTATCTTTAATTATGGATCGTACTTTCAAATTTCAATCCCTTACTTCCGACATAATAT
TATTTTTTTGTAAAAAAACAAAACTCTCATACATAATCAAATACACAGTTATCTAATATAGAATTTGAATTCTTAGACATTCAAATTTCAAACCTCCATTACTCAAACAAAACCTTGCAACAAAAGACCAATAATTATTTTTAAAATTGTCAAAAATGTTTTTGGCCATCATGAAAAAATTTTAAAAGTACTTTTAATTAATCAAAATCATTTTTTATTAATCTAAAAGTTATGTCAAACACACTAAATATTCTATAACTACTTTTCCTTTCAAGAAATATATTTGACAAAAAGCAAAGTTCCACATAAGAAAATATAACTATCAACTTTTTAATTAAGTCGAAGAAAGAGAAAATGTTAAAAAAAAATAAAGATACTAGGATAGAATGTCAAGAAAAAAAAATGAGAGTTTGAAAAATAGTCATGCTTATTTAAAAGAACTTTTCTGTTTTTAAAATATATTTATAATTTCCAAGGCATTTTTCAAAATTTAAAAACCGGACAACTGAATGGTTTTTCAAACAAAAATATTTTTTAAAAATAAAAAAACTAAACATAAAATAAAATGGTCATCAAATGCAAACATTTAACATACTCAAATCCAACCACAAAATTAAACCATCATAAAATGCCAAAACTAACACCAAAATGTCTAAATACAACTAAAAATCTCCAACACTATTTGAAGAATCAACACTCACAAATATAATCTTGTCCAACTATAAGAACTTCCATAGTTAGGGTGTGTTTGGGAGTGCTTTCTTAAGGAATGCTTTGATAAAAATTACTTCTATTATAAATACTTTAATAAAAAGTGCTTAATTATAAATCATTTTGGTGTTTTGTTCTAATATTTCAAAAATGCGTTTGGTATAAATGGAATTTTTTTTTGAATAATTAATATTTACTTAACAAATATTTATTGAATTGAAAAATTACTAAAATGAAAGTAATAAAAAATTGTAAAAGTTACAAAAATGAATACAAAAAAATTACCAATATTAAAATAAGTGAGAGAAAATAACTATGGAGAGAGATTAGTGAGAGAAAATGTATGTATCTAAAGAGAAGTTTAGTGAAAGAAATTGTGTTTAAAAAAAAGAAGTTAGCTAGAAAAAGTTTGTAGAGAGTAATTAGGAAAAGAAAGTGGATCATAAGAGATTAGAATAGAGTGTTTGGCCAAAAACACTTGGAAAAATACCTACCAATTGCCTTTTTAGAAAGTACTTCCAAATATTTTTAACTTTTTTATGAAGTGGTTTGTAGTCAAATATAAATATTAAGGTAGGAAGTGTTTTGGATTAAAAGCACTCCACCCCAAACATGCCTTTAATCATAAATCTCTAAAAAAAAAAAAAAAAAAACTCAACTAAAGTAATAATTTTACCAATAATTTAAATTCAAAGATAAATTCGATAAAATTAAAACTATAAACTTAAATTCGACTCAAGCACATAAAATAAAGATACAAAATGAAAATTTCCCCATATTTCAAAACATAACGAGTTAAAAGAAAACTCTGCAAATACTATTCCAAAAAAAAAAAAAATCTCAAAAAACTTTAACACTTGTCAAAACGAGGATCTTGATCTTGACATATAATAAAAACTTCCTAAAAATGTAAAAAAGAATCCAAACTCAAAGCATTTAAAAAAAAAAAATGTAAACTCATAGACAACCAATACTTTCTCTAACGAACCCAAAAGGTTAAAAGGGCAAAAAAAAAAAAAAAAGAAATGAAAAAAAGGTACGGTACCTGAATACAACCGCCATGATAAACGTCAAGCTCGGGATAACGTTACTGAAGGCGGAGGCCACCGTCGGCGAAGTATAGCCAATTCCGGCGTAGAACAGATTCAGGTGAATCGTTGACCCTAATATTGCAAGCAAGAAGATCTTCACCGCCATAACGATCGAGAGAGAAGTCCGTTTGTTTCTGCATTCGAAAAACGGCGAGATAAAAATCGATCATCGGAACACGATCGATCGATCGAGGAAGA
AAATTGAGGCTCAGGAGAATCGAATCGCGTACCTGTCGAGGACATAAGCGAAAGGAGCCAAGAAGATGAAGGCGATCAGATGGCGGTAAACAACGAAGACGAAGGGATTGAGATCCTTCTCGCTGGCGATTTTCATGAGAATGTTGGATCCACCATAGCCTATCTGCACCACCACCATTGCCGCGCAGGAGGCGTAGAGCTTCTTCGTCGTCATCGTGGCTGA

mRNA sequence

ATGTGGCTTCGTTGTATCCAAAGAATTGGATATGCAGGTGGAGGTGGAGGTTCTGCTGGCCTTATTTCCAGCACATTCGCGAGGTATTTCTCGAGGAAGCGAGCAGAGAATCTTAGGAAGATCAATCCCAAGTTGACCCCTCAGGAAGCTTCTTTAGTTGCTCAAGATCTCTATGGTGTCCTCAAGCAGCACGGGCCTCTCACTGTTTCCGACGCTTGGATTAAAGCCAAGGAATCTGGCGTGAGTGGATTGAACAGCAAAACACACATGAAGCTAATGCTGAAATGGATGAGGGGAAGGAAGATGCTGAAGCTTTTCTGCAACCAAGTTGGTTCCAGCAAGAACTTTCTGCTTTCTACTCTGCCCGATGATCCTCAGGCAGATCAGTTGAAGATTTCTTCAGAAGTAGGGATCCAGAGAGAGAAGCCACCATGTTGCCGTGCAGTGGGCGGGCTGTCCTCCTCTGGTTCATTTGTACTGATGGCGGGCTCACAACGAGATGATGATGTTTCAATTACAATAACGCATTGCGGAATTTGTTATGCAGATGTAATATGGACAAGAAATAAATTTGGAGATGCAAAGTATCCTTTGGTGCCTGGACATGAGATTGCAGGAATTGTGAAAAATGTTGGCTCAAATGTTCAACGATTCAAAGTTGGAGATCATGTTGGGGTAGGAACTTACGTGAATTCTTGCAGACAATGCGAGTATTGCGAAGATGGTCAAGAAGTTAGTTGCATAAGTGGATCCATTTATACTTTTAATGCAATTGATGTTGATGGTACAATTACAAAAGGAGGATATTCCAACCACATAGTTGTCCATCAGCGATACTGCTTCAAAATACCTGACAATTATCCATTAGCTTCAGCTGCTCCTTTGCTTTGTGCTGGAATTACTGTTTATGCTCCGATGGTTCGCAATAAGATGAATCAACCTGGTAAATCTCTTGGAGTGATTGGACTTGGTGGCCTTGGCCACTTAGCAGTGAAGTTCGGGAAGGCTTTTGGACTGAACGTGACGGTTTTTAGCACTAGCATATCCAAGAAGGAAGAAGCCCTGAGCATTCTTGGAGCAGACACATTTGTTCTCTCATCTGACAAAGAGCAAATGGAGGCGTTGTCTAAGTCCCTCGACTTTATTATTGATGCTGCATCTGGTGATCACCCATTTGATCCATACATGTCTACGTTGAAGATTGGTGGCGTTATGGTTCTGGTGGGATTTCCTAATGAAGTCAAGTTTAGTCCTGCGAGCCTGAATTTGGGTTTGAGAACCATTTCCGGAAGCATAACTGGTGGCACTAGAGTTACCCAAGAAATGATCAATTTCTGTGCTGCTCATGGAATCTATCCAAACATAGAAGTGATTCCAATTCAGTACTCAAATGAAGCTATTGAAAGGAATCCTTTACTGGGTTATGATTTACAGATGCATCCTGCATTTCAGTGTCAACAAAAATCTTTTCGTTCGCTTGCGGTTCAACTGCAACCATATGATCTGCTTTTTTACCCCACAGCACTAAATAGAGACCCAACTAGGGAGGGAGTGTCTTACCTGCCTACATGGAGCCGCTCTGAGAAAACAAATGCAGAGAATATGGCCACAAATACAAGCAATAGGGGACTAAACATTGCAGTGAAAACTGGCCCGTTGCGGCTGATGCACCAGCTATACTCACACTGTAGACGATGGTTAATAGTCGCACATCCCATTGCAGCTTCCATTTGTTTGGACTTCTTGCAAAAATCAGGGCAAGGAAGCCAGATTGGGCAGTTGCAAAGAAGCAAATCAAAACAGTCATGGCGGAGGCCACCGTCGGCGAAGTATAGCCAATTCCGGCGTAGAACAGATTCAGGAGAATCGAATCGCGTACCTGTCGAGGACATAAGCGAAAGGAGCCAAGAAGATGAAGGCGATCAGATGGCGGTAAACAACGAAGACGAAGGGATTGAGATCCTTCTCGCTGGCGATTTTCATGAGAATGTTGGATCCAC
CATAGCCTATCTGCACCACCACCATTGCCGCGCAGGAGGCGTAGAGCTTCTTCGTCGTCATCGTGGCTGA

Coding sequence (CDS)

ATGTGGCTTCGTTGTATCCAAAGAATTGGATATGCAGGTGGAGGTGGAGGTTCTGCTGGCCTTATTTCCAGCACATTCGCGAGGTATTTCTCGAGGAAGCGAGCAGAGAATCTTAGGAAGATCAATCCCAAGTTGACCCCTCAGGAAGCTTCTTTAGTTGCTCAAGATCTCTATGGTGTCCTCAAGCAGCACGGGCCTCTCACTGTTTCCGACGCTTGGATTAAAGCCAAGGAATCTGGCGTGAGTGGATTGAACAGCAAAACACACATGAAGCTAATGCTGAAATGGATGAGGGGAAGGAAGATGCTGAAGCTTTTCTGCAACCAAGTTGGTTCCAGCAAGAACTTTCTGCTTTCTACTCTGCCCGATGATCCTCAGGCAGATCAGTTGAAGATTTCTTCAGAAGTAGGGATCCAGAGAGAGAAGCCACCATGTTGCCGTGCAGTGGGCGGGCTGTCCTCCTCTGGTTCATTTGTACTGATGGCGGGCTCACAACGAGATGATGATGTTTCAATTACAATAACGCATTGCGGAATTTGTTATGCAGATGTAATATGGACAAGAAATAAATTTGGAGATGCAAAGTATCCTTTGGTGCCTGGACATGAGATTGCAGGAATTGTGAAAAATGTTGGCTCAAATGTTCAACGATTCAAAGTTGGAGATCATGTTGGGGTAGGAACTTACGTGAATTCTTGCAGACAATGCGAGTATTGCGAAGATGGTCAAGAAGTTAGTTGCATAAGTGGATCCATTTATACTTTTAATGCAATTGATGTTGATGGTACAATTACAAAAGGAGGATATTCCAACCACATAGTTGTCCATCAGCGATACTGCTTCAAAATACCTGACAATTATCCATTAGCTTCAGCTGCTCCTTTGCTTTGTGCTGGAATTACTGTTTATGCTCCGATGGTTCGCAATAAGATGAATCAACCTGGTAAATCTCTTGGAGTGATTGGACTTGGTGGCCTTGGCCACTTAGCAGTGAAGTTCGGGAAGGCTTTTGGACTGAACGTGACGGTTTTTAGCACTAGCATATCCAAGAAGGAAGAAGCCCTGAGCATTCTTGGAGCAGACACATTTGTTCTCTCATCTGACAAAGAGCAAATGGAGGCGTTGTCTAAGTCCCTCGACTTTATTATTGATGCTGCATCTGGTGATCACCCATTTGATCCATACATGTCTACGTTGAAGATTGGTGGCGTTATGGTTCTGGTGGGATTTCCTAATGAAGTCAAGTTTAGTCCTGCGAGCCTGAATTTGGGTTTGAGAACCATTTCCGGAAGCATAACTGGTGGCACTAGAGTTACCCAAGAAATGATCAATTTCTGTGCTGCTCATGGAATCTATCCAAACATAGAAGTGATTCCAATTCAGTACTCAAATGAAGCTATTGAAAGGAATCCTTTACTGGGTTATGATTTACAGATGCATCCTGCATTTCAGTGTCAACAAAAATCTTTTCGTTCGCTTGCGGTTCAACTGCAACCATATGATCTGCTTTTTTACCCCACAGCACTAAATAGAGACCCAACTAGGGAGGGAGTGTCTTACCTGCCTACATGGAGCCGCTCTGAGAAAACAAATGCAGAGAATATGGCCACAAATACAAGCAATAGGGGACTAAACATTGCAGTGAAAACTGGCCCGTTGCGGCTGATGCACCAGCTATACTCACACTGTAGACGATGGTTAATAGTCGCACATCCCATTGCAGCTTCCATTTGTTTGGACTTCTTGCAAAAATCAGGGCAAGGAAGCCAGATTGGGCAGTTGCAAAGAAGCAAATCAAAACAGTCATGGCGGAGGCCACCGTCGGCGAAGTATAGCCAATTCCGGCGTAGAACAGATTCAGGAGAATCGAATCGCGTACCTGTCGAGGACATAAGCGAAAGGAGCCAAGAAGATGAAGGCGATCAGATGGCGGTAAACAACGAAGACGAAGGGATTGAGATCCTTCTCGCTGGCGATTTTCATGAGAATGTTGGATCCAC
CATAGCCTATCTGCACCACCACCATTGCCGCGCAGGAGGCGTAGAGCTTCTTCGTCGTCATCGTGGCTGA

Protein sequence

MWLRCIQRIGYAGGGGGSAGLISSTFARYFSRKRAENLRKINPKLTPQEASLVAQDLYGVLKQHGPLTVSDAWIKAKESGVSGLNSKTHMKLMLKWMRGRKMLKLFCNQVGSSKNFLLSTLPDDPQADQLKISSEVGIQREKPPCCRAVGGLSSSGSFVLMAGSQRDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAIERNPLLGYDLQMHPAFQCQQKSFRSLAVQLQPYDLLFYPTALNRDPTREGVSYLPTWSRSEKTNAENMATNTSNRGLNIAVKTGPLRLMHQLYSHCRRWLIVAHPIAASICLDFLQKSGQGSQIGQLQRSKSKQSWRRPPSAKYSQFRRRTDSGESNRVPVEDISERSQEDEGDQMAVNNEDEGIEILLAGDFHENVGSTIAYLHHHHCRAGGVELLRRHRG
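
The protein sequence above should correspond to the listed CDS read codon by codon. As a quick sanity check, here is a minimal sketch that translates the first and last few codons of the CDS, assuming the standard genetic code (the page does not name a translation table). The function name translate and the table construction are illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGTGGCTTCGTTGTATCCAAAGAATTGGA"))
print(translate("CATCGTGGCTGA"))
```

The two fragments translate to MWLRCIQRIG and HRG followed by a stop, which agrees with the start and end of the protein sequence shown.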
Homology
BLAST of Sgr024537 vs. NCBI nr
Match: KAF4353067.1 (hypothetical protein F8388_016912 [Cannabis sativa] >KAF4389528.1 hypothetical protein G4B88_006587 [Cannabis sativa])

HSP 1 Score: 645.2 bits (1663), Expect = 6.4e-181
Identity = 326/473 (68.92%), Positives = 374/473 (79.07%), Query Frame = 0

Query: 1   MWLRCIQRIGYAGGGGGSAGLISSTFARYFSRKRAENLRKINPKLTPQEASLVAQDLYGV 60
           MWL  +          GSAG   S+F RYFSRKRAEN+RKINPK++PQEAS +A++LY V
Sbjct: 1   MWLNSV-----GARFNGSAGGAVSSFVRYFSRKRAENVRKINPKVSPQEASSIARNLYDV 60

Query: 61  LKQHGPLTVSDAWIKAKESGVSGLNSKTHMKLMLKWMRGRKMLKLFCNQVGSSKNFLLST 120
           +K+HGPLT+ +AW+ AK+SGV G+NSKTHMKL+LKWMRGRKMLKL+   VGSSK FLLST
Sbjct: 61  IKEHGPLTIPNAWVHAKDSGVGGINSKTHMKLLLKWMRGRKMLKLWSTGVGSSKKFLLST 120

Query: 121 LPDDPQADQLKISSEVGIQREKPPCCRAV---GGLSSSGSFVLMAGSQRDDDVSITITHC 180
           LP+DP+  Q++   EV IQ +KP    A     G+ S   F   A     DDVSITITHC
Sbjct: 121 LPEDPKVTQIRSDMEVKIQHKKPSSSWAATDPSGIVSPYKFNRRAVG--SDDVSITITHC 180

Query: 181 GICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCE 240
           GICYADV+WTRNK GD+KYP+VPGHEI G+V++VGS V RFKVGDHVGVGTYVNSCR C 
Sbjct: 181 GICYADVVWTRNKHGDSKYPVVPGHEIVGVVQSVGSGVSRFKVGDHVGVGTYVNSCRDCH 240

Query: 241 YCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLC 300
           YC  G E  C  GS+YTFN +D DGTITKGGYS+ IV        IPD+YPLASAAPLLC
Sbjct: 241 YCNQGLENHCQKGSVYTFNHVDADGTITKGGYSSFIV--------IPDSYPLASAAPLLC 300

Query: 301 AGITVYAPMVRNKMN-QPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALS 360
           AGITVYAPMVR+KMN QPGKS GVIGLGGLGH+AVKFGKAFGLNVTVFSTSISKKEEAL+
Sbjct: 301 AGITVYAPMVRHKMNQQPGKSFGVIGLGGLGHMAVKFGKAFGLNVTVFSTSISKKEEALT 360

Query: 361 ILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKF 420
           +LGAD FVLSSD+EQM+ L+ S DFIID ASGDHPFDPYM+ LK  GV VLVGFP+EVK 
Sbjct: 361 LLGADKFVLSSDQEQMKGLANSFDFIIDTASGDHPFDPYMALLKTYGVFVLVGFPSEVKL 420

Query: 421 SPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAIER 470
           SP +LNLG +T+ GS  GGTR  QEM+ FCAA GI PNIEVIPIQY+NEA+ER
Sbjct: 421 SPVNLNLGDKTLCGSTVGGTREIQEMLEFCAAKGIQPNIEVIPIQYANEALER 458
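
The percentages in each HSP summary line are the match counts divided by the alignment length, which includes gap columns. The following sketch recomputes them for the first hit; the function name hsp_fractions is hypothetical and the parsing assumes the "label = n/d (p%)" layout used on this page.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, alignment_length)} for every 'label = n/d' pair."""
    return {name: (int(n), int(d))
            for name, n, d in re.findall(r"(\w+) = (\d+)/(\d+)", line)}

line = "Identity = 326/473 (68.92%), Positives = 374/473 (79.07%), Query Frame = 0"
fractions = hsp_fractions(line)

for name, (n, d) in fractions.items():
    print(f"{name}: {100 * n / d:.2f}%")
```

The alignment length of 473 exceeds the 470 aligned query residues because gap columns in the query are counted as well.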

BLAST of Sgr024537 vs. NCBI nr
Match: XP_022159743.1 (probable cinnamyl alcohol dehydrogenase 1 isoform X1 [Momordica charantia] >XP_022159744.1 probable cinnamyl alcohol dehydrogenase 1 isoform X1 [Momordica charantia])

HSP 1 Score: 577.0 bits (1486), Expect = 2.1e-160
Identity = 284/327 (86.85%), Positives = 300/327 (91.74%), Query Frame = 0

Query: 146 CRAVGGLSSSGSFVLMAGSQR---DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGH 205
           C       SSG       S+R   D DVSITITHCGICYADVIWTRNK  D+KYPLVPGH
Sbjct: 10  CLGWAATDSSGFLSPYKFSRRVLGDYDVSITITHCGICYADVIWTRNKHADSKYPLVPGH 69

Query: 206 EIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDG 265
           EIAGIVKNVGS+V RFKVGDHVGVGTYVNSCRQCEYCEDGQEVSC SGSIYTFNAIDVDG
Sbjct: 70  EIAGIVKNVGSSVHRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCTSGSIYTFNAIDVDG 129

Query: 266 TITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIG 325
           TITKGGYSN+IVVH+RYC++IPDNYPLASAAPLLCAGITVYAPM+R+KMNQPGKSLGVIG
Sbjct: 130 TITKGGYSNYIVVHERYCYRIPDNYPLASAAPLLCAGITVYAPMIRHKMNQPGKSLGVIG 189

Query: 326 LGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFI 385
           LGGLGH+AVKFGKAFGLNVTVFSTSISKKE AL +L AD FV+SSDKEQMEALSKSLDFI
Sbjct: 190 LGGLGHMAVKFGKAFGLNVTVFSTSISKKEHALRLLKADKFVISSDKEQMEALSKSLDFI 249

Query: 386 IDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEM 445
           ID ASGDHPFDPYMSTLKIGGVMVLVGFP+EVKFSPASLNLGLRTISGSITGGT+VTQEM
Sbjct: 250 IDTASGDHPFDPYMSTLKIGGVMVLVGFPSEVKFSPASLNLGLRTISGSITGGTKVTQEM 309

Query: 446 INFCAAHGIYPNIEVIPIQYSNEAIER 470
           I+FCAA+GIYPNIEVIPIQYSNEAIER
Sbjct: 310 IDFCAANGIYPNIEVIPIQYSNEAIER 336

BLAST of Sgr024537 vs. NCBI nr
Match: XP_022958652.1 (probable cinnamyl alcohol dehydrogenase 1 [Cucurbita moschata])

HSP 1 Score: 573.5 bits (1477), Expect = 2.4e-159
Identity = 272/303 (89.77%), Positives = 290/303 (95.71%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           DDDVSITITHCGICYADV+WTRNK GD+KYPLVPGHEIAGIVK VG+NV RFKVGDHVGV
Sbjct: 34  DDDVSITITHCGICYADVLWTRNKLGDSKYPLVPGHEIAGIVKTVGANVHRFKVGDHVGV 93

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDN 286
           GTYVNSCRQCEYCEDGQEV C  GS  TFN ID DGTITKGGYSN+IVVH+RYC++IP+N
Sbjct: 94  GTYVNSCRQCEYCEDGQEVCCTGGSTNTFNGIDFDGTITKGGYSNYIVVHERYCYRIPEN 153

Query: 287 YPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 346
           YPLASAAPLLCAGITVY+PM+R+KMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST
Sbjct: 154 YPLASAAPLLCAGITVYSPMIRHKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 213

Query: 347 SISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMV 406
           SISKKEEALS+LGAD FVLSSD EQMEALSKSLDFIIDAASGDHPFDPYMSTLK+GG MV
Sbjct: 214 SISKKEEALSVLGADKFVLSSDTEQMEALSKSLDFIIDAASGDHPFDPYMSTLKVGGTMV 273

Query: 407 LVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 466
           LVGFP++VKFSPASLNLG+RTISGS+TGGTRVTQEMINFCAAHGIYPNI+VIPIQYSNEA
Sbjct: 274 LVGFPSQVKFSPASLNLGMRTISGSVTGGTRVTQEMINFCAAHGIYPNIDVIPIQYSNEA 333

Query: 467 IER 470
           IER
Sbjct: 334 IER 336

BLAST of Sgr024537 vs. NCBI nr
Match: KAG7035534.1 (putative cinnamyl alcohol dehydrogenase 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 571.6 bits (1472), Expect = 9.0e-159
Identity = 271/303 (89.44%), Positives = 290/303 (95.71%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           DDDVSITITHCGICYADV+WTRNK GD+KYPLVPGHEIAGIVK VG+NV RFKVGDHVGV
Sbjct: 34  DDDVSITITHCGICYADVLWTRNKLGDSKYPLVPGHEIAGIVKTVGANVHRFKVGDHVGV 93

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDN 286
           GTYVNSCRQCEYCEDGQEV C  GS  TFN ID DGTITKGGYSN+IVVH+RYC++IP+N
Sbjct: 94  GTYVNSCRQCEYCEDGQEVCCTGGSTNTFNGIDFDGTITKGGYSNYIVVHERYCYRIPEN 153

Query: 287 YPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 346
           YPLASAAPLLCAGITVY+PM+R+KMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST
Sbjct: 154 YPLASAAPLLCAGITVYSPMIRHKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 213

Query: 347 SISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMV 406
           SISKKEEALS+LGAD FVLSSD EQMEALSKSLDFIIDAASGDHPFDPYMSTLK+GG MV
Sbjct: 214 SISKKEEALSVLGADKFVLSSDTEQMEALSKSLDFIIDAASGDHPFDPYMSTLKVGGTMV 273

Query: 407 LVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 466
           LVGFP++VKFSPASLNLG+RTISGS+TGGTRVTQEMI+FCAAHGIYPNI+VIPIQYSNEA
Sbjct: 274 LVGFPSQVKFSPASLNLGMRTISGSVTGGTRVTQEMIDFCAAHGIYPNIDVIPIQYSNEA 333

Query: 467 IER 470
           IER
Sbjct: 334 IER 336
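
The summary line of each HSP reports identities and positives as counts over the aligned length, with the percentage in parentheses. As a minimal sketch (not part of the database's own tooling), the Python snippet below re-derives those percentages from the counts. The function name `parse_hsp_stats` and the returned keys are hypothetical, and the regular expression assumes the line layout shown in this record, accepting either spelling of the positives label.

```python
import re

# Assumed layout: "Identity = a/b (x%), Positives = c/d (y%), Query Frame = 0".
# "Posi?tives" also tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages (hypothetical helper)."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the record, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 271/303 (89.44%), Positives = 290/303 (95.71%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when alignments are copied between records.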

BLAST of Sgr024537 vs. NCBI nr
Match: XP_008437476.1 (PREDICTED: probable cinnamyl alcohol dehydrogenase 1 [Cucumis melo] >XP_016898872.1 PREDICTED: probable cinnamyl alcohol dehydrogenase 1 [Cucumis melo])

HSP 1 Score: 571.6 bits (1472), Expect = 9.0e-159
Identity = 272/303 (89.77%), Positives = 292/303 (96.37%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           DDDVSITITHCGICYADV+WTRNK GD+KYPLVPGHEIAGIVKNVG+NVQRFKVGDHVGV
Sbjct: 34  DDDVSITITHCGICYADVVWTRNKLGDSKYPLVPGHEIAGIVKNVGANVQRFKVGDHVGV 93

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDN 286
           GTYVNSCRQCEYCED QEVSC SG  +TFN+ID DGTITKGGYSN+IVVH+RYC+KIPDN
Sbjct: 94  GTYVNSCRQCEYCEDCQEVSCTSGCTHTFNSIDFDGTITKGGYSNYIVVHERYCYKIPDN 153

Query: 287 YPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 346
           YPLASAAPLLCAGITVY+PM+R+ MNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST
Sbjct: 154 YPLASAAPLLCAGITVYSPMIRHNMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 213

Query: 347 SISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMV 406
           SISKKEEALS+LGAD FVLSSD +QMEALSKSLDFIIDAASGDHPFDPYMSTLK+GGVMV
Sbjct: 214 SISKKEEALSVLGADKFVLSSDNKQMEALSKSLDFIIDAASGDHPFDPYMSTLKVGGVMV 273

Query: 407 LVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 466
           LVGFP+EVKF PASL LG+RTISGS+TGGT++TQEMI+FCAAHGIYPNIEVIPIQYSNEA
Sbjct: 274 LVGFPSEVKFIPASLILGMRTISGSVTGGTKLTQEMIDFCAAHGIYPNIEVIPIQYSNEA 333

Query: 467 IER 470
           IER
Sbjct: 334 IER 336

BLAST of Sgr024537 vs. ExPASy Swiss-Prot
Match: Q9CAI3 (Probable cinnamyl alcohol dehydrogenase 1 OS=Arabidopsis thaliana OX=3702 GN=CAD1 PE=2 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 2.1e-142
Identity = 234/302 (77.48%), Positives = 275/302 (91.06%), Query Frame = 0

Query: 168 DDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVG 227
           DDVS+TITHCG+CYADVIW+RN+ GD+KYPLVPGHEIAGIV  VG NVQRFKVGDHVGVG
Sbjct: 36  DDVSLTITHCGVCYADVIWSRNQHGDSKYPLVPGHEIAGIVTKVGPNVQRFKVGDHVGVG 95

Query: 228 TYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNY 287
           TYVNSCR+CEYC +GQEV+C  G ++TFN ID DG++TKGGYS+HIVVH+RYC+KIP +Y
Sbjct: 96  TYVNSCRECEYCNEGQEVNCAKG-VFTFNGIDHDGSVTKGGYSSHIVVHERYCYKIPVDY 155

Query: 288 PLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTS 347
           PL SAAPLLCAGITVYAPM+R+ MNQPGKSLGVIGLGGLGH+AVKFGKAFGL+VTVFSTS
Sbjct: 156 PLESAAPLLCAGITVYAPMMRHNMNQPGKSLGVIGLGGLGHMAVKFGKAFGLSVTVFSTS 215

Query: 348 ISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVL 407
           ISKKEEAL++LGA+ FV+SSD +QM+AL KSLDF++D ASGDH FDPYMS LKI G  VL
Sbjct: 216 ISKKEEALNLLGAENFVISSDHDQMKALEKSLDFLVDTASGDHAFDPYMSLLKIAGTYVL 275

Query: 408 VGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAI 467
           VGFP+E+K SPA+LNLG+R ++GS+TGGT++TQ+M++FCAAH IYPNIEVIPIQ  NEA+
Sbjct: 276 VGFPSEIKISPANLNLGMRMLAGSVTGGTKITQQMLDFCAAHKIYPNIEVIPIQKINEAL 335

Query: 468 ER 470
           ER
Sbjct: 336 ER 336

BLAST of Sgr024537 vs. ExPASy Swiss-Prot
Match: Q8H859 (Probable cinnamyl alcohol dehydrogenase 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CAD1 PE=2 SV=1)

HSP 1 Score: 459.5 bits (1181), Expect = 6.5e-128
Identity = 213/304 (70.07%), Positives = 254/304 (83.55%), Query Frame = 0

Query: 166 RDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVG 225
           + DDVS+ ITHCG+CYADV WTRN   ++ YPLVPGHEIAG+V  VG++V+ FKVGDHVG
Sbjct: 33  QSDDVSLRITHCGVCYADVAWTRNILNNSMYPLVPGHEIAGVVTEVGADVKSFKVGDHVG 92

Query: 226 VGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPD 285
           VGTYVNSCR CE C    E  C S  ++TFN +D DGT+TKGGYS HIVVH+RYCFKIPD
Sbjct: 93  VGTYVNSCRDCENCNSSLENYC-SQHVFTFNGVDTDGTVTKGGYSTHIVVHERYCFKIPD 152

Query: 286 NYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFS 345
            YPL  AAPLLCAGITVY+PM+R+ MNQPGKSLGVIGLGGLGH+AVKFGKAFGL VTV S
Sbjct: 153 GYPLEKAAPLLCAGITVYSPMMRHNMNQPGKSLGVIGLGGLGHMAVKFGKAFGLKVTVIS 212

Query: 346 TSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVM 405
           TS SK++EA+ +LGAD FV+SSD+ QME L  SL+FIID ASGDHPFDPY++ LK+GGVM
Sbjct: 213 TSESKRKEAIDLLGADNFVVSSDENQMETLKSSLNFIIDTASGDHPFDPYLTLLKVGGVM 272

Query: 406 VLVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNE 465
            L+ FP+E+K  PA+LNLG R++SGS+TGGT+  QEMINFCAA+ IYP+IE+I I Y NE
Sbjct: 273 ALLSFPSEIKVHPANLNLGGRSLSGSVTGGTKDIQEMINFCAANKIYPDIEMIKIDYINE 332

Query: 466 AIER 470
           A++R
Sbjct: 333 ALQR 335

BLAST of Sgr024537 vs. ExPASy Swiss-Prot
Match: Q2R114 (Putative cinnamyl alcohol dehydrogenase 4 OS=Oryza sativa subsp. japonica OX=39947 GN=CAD4 PE=3 SV=2)

HSP 1 Score: 456.4 bits (1173), Expect = 5.5e-127
Identity = 210/304 (69.08%), Positives = 256/304 (84.21%), Query Frame = 0

Query: 166 RDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVG 225
           + +DVS+ ITHCG+CYADVIWTRN F D+ YPLVPGHEIAG+V  VG++V+ FKVGDHVG
Sbjct: 33  QSEDVSLRITHCGVCYADVIWTRNMFNDSIYPLVPGHEIAGVVTEVGADVKGFKVGDHVG 92

Query: 226 VGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPD 285
           VG YVNSC+ CE C    E  C S  + T+N++D DGT+TKGGYS+HI+VHQRYCFKIP 
Sbjct: 93  VGVYVNSCQDCENCNSSLENHC-SKCVVTYNSVDSDGTVTKGGYSSHILVHQRYCFKIPA 152

Query: 286 NYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFS 345
           +YPL+ AAPLLCAGITVY PM+R+ MNQPGKSLGVIGLGGLGH+AVKFGKAFGL VTVFS
Sbjct: 153 DYPLSKAAPLLCAGITVYTPMIRHNMNQPGKSLGVIGLGGLGHMAVKFGKAFGLKVTVFS 212

Query: 346 TSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVM 405
           TS SK+EEA+++LGAD FV+SSD+ QME+L  SL FIID ASGDH FDPY+S LK+GGVM
Sbjct: 213 TSESKREEAINLLGADNFVISSDENQMESLKSSLHFIIDTASGDHQFDPYLSLLKVGGVM 272

Query: 406 VLVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNE 465
           VL+ FP+E+K  P +LNL  R+++GS+TGGT+  QEMINFCAA+ +YP+IE+I I Y NE
Sbjct: 273 VLLSFPSEIKVHPENLNLAARSLAGSVTGGTKDIQEMINFCAANNVYPDIEMIKIDYVNE 332

Query: 466 AIER 470
           A++R
Sbjct: 333 ALQR 335

BLAST of Sgr024537 vs. ExPASy Swiss-Prot
Match: Q337Y2 (Probable cinnamyl alcohol dehydrogenase 3 OS=Oryza sativa subsp. japonica OX=39947 GN=CAD3 PE=2 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 3.9e-88
Identity = 166/313 (53.04%), Positives = 212/313 (67.73%), Query Frame = 0

Query: 158 FVLMAGSQRDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQR 217
           F +   S  DDDV+I I  CGIC++D+   +N++  + YPLVPGHEIAG+V  VG NV R
Sbjct: 31  FAISRRSTGDDDVAIKILFCGICHSDLHCIKNEWKHSIYPLVPGHEIAGVVTEVGKNVTR 90

Query: 218 FKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQ 277
           FK GD VGVG  VNSCR CE C +G E  C  G ++T+N++D DGT+T GGYS+ +VVH+
Sbjct: 91  FKAGDRVGVGCMVNSCRSCESCNNGFENHCPEG-VFTYNSVDKDGTVTYGGYSSMVVVHE 150

Query: 278 RYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAF 337
           R+    P+  PL   APLLCAGITVY PM  + +N PGK +GV+GLGGLGH+AVKF +AF
Sbjct: 151 RFVVMFPEAMPLDVGAPLLCAGITVYTPMKYHGLNAPGKHVGVLGLGGLGHVAVKFARAF 210

Query: 338 GLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMS 397
           GL VTV S+S  KK EAL  LGAD FV+SS  E+MEA   ++D +I+  S + P  PY++
Sbjct: 211 GLKVTVISSSPGKKREALERLGADAFVVSSSAEEMEAARSTMDGVINTVSANTPMAPYLA 270

Query: 398 TLKIGGVMVLVGFP-NEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIE 457
            LK  G M+LVG P N ++  P SL  G RT++GS  GG   TQEMI   A HG+  +IE
Sbjct: 271 LLKPNGKMILVGLPENPLEVPPFSLVHGNRTLAGSNIGGMADTQEMIELAAKHGVTADIE 330

Query: 458 VIPIQYSNEAIER 470
           VI     N A+ER
Sbjct: 331 VIGADDVNTAMER 342

BLAST of Sgr024537 vs. ExPASy Swiss-Prot
Match: Q43137 (Probable mannitol dehydrogenase 1 OS=Stylosanthes humilis OX=35628 GN=CAD1 PE=2 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 4.9e-83
Identity = 156/303 (51.49%), Positives = 209/303 (68.98%), Query Frame = 0

Query: 168 DDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVG 227
           DDV++ I +CG+C++D+   +N +G   YP+VPGHEIAGIV  VGSNV +FK GD VGVG
Sbjct: 31  DDVTLKILYCGVCHSDLHTVKNDWGFTTYPVVPGHEIAGIVTKVGSNVTKFKEGDRVGVG 90

Query: 228 TYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNY 287
             V+SC++CE C+   E  C    ++T+N+    GT T+GGYS+ +VVHQR+  + PDN 
Sbjct: 91  VIVDSCQECECCQQDLESYC-PKPVFTYNS-PYKGTRTQGGYSDFVVVHQRFVLQFPDNL 150

Query: 288 PLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTS 347
           PL + APLLCAGITVY+PM    M +PGK LGV GLGGLGH+A+KFGKAFGL VTV S+S
Sbjct: 151 PLDAGAPLLCAGITVYSPMKYYGMTEPGKHLGVAGLGGLGHVAIKFGKAFGLKVTVISSS 210

Query: 348 ISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVL 407
            +K+ EA+ +LGAD+F+LSSD E+M+A + ++D+IID  S  H     +  LK+ G +V 
Sbjct: 211 PNKESEAIDVLGADSFLLSSDPEKMKAATGTMDYIIDTISAVHSLVSLLGLLKLNGKLVT 270

Query: 408 VGFPNEVKFSPA-SLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 467
           VG P++    P   L  G + I GS  GG + TQEM++FC  H I  NIE+I +   N A
Sbjct: 271 VGLPSKPLQLPIFPLVAGRKLIGGSNFGGLKETQEMLDFCGKHNIAANIELIKMDEINTA 330

Query: 468 IER 470
           IER
Sbjct: 331 IER 331

BLAST of Sgr024537 vs. ExPASy TrEMBL
Match: A0A803PPG7 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 671.8 bits (1732), Expect = 3.1e-189
Identity = 334/473 (70.61%), Positives = 382/473 (80.76%), Query Frame = 0

Query: 1   MWLRCIQRIGYAGGGGGSAGLISSTFARYFSRKRAENLRKINPKLTPQEASLVAQDLYGV 60
           MWL  +          GSAG   S+F RYFSRKRAEN+RKINPK++PQEAS +A++LY V
Sbjct: 1   MWLNSV-----GARFNGSAGGAVSSFVRYFSRKRAENVRKINPKVSPQEASSIARNLYDV 60

Query: 61  LKQHGPLTVSDAWIKAKESGVSGLNSKTHMKLMLKWMRGRKMLKLFCNQVGSSKNFLLST 120
           +K+HGPLT+ +AW+ AK+SGV G+NSKTHMKL+LKWMRGRKMLKL+   VGSSK FLLST
Sbjct: 61  IKEHGPLTIPNAWVHAKDSGVGGINSKTHMKLLLKWMRGRKMLKLWSTGVGSSKKFLLST 120

Query: 121 LPDDPQADQLKISSEVGIQREKPPCCRAV---GGLSSSGSFVLMAGSQRDDDVSITITHC 180
           LP+DP+  Q++   EV IQ +KP    A     G+ S   F   A     DDVSITITHC
Sbjct: 121 LPEDPKVTQIRSDMEVKIQHKKPSSSWAATDPSGIVSPYKFNRRAVG--SDDVSITITHC 180

Query: 181 GICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCE 240
           GICYADV+WTRNK GD+KYP+VPGHEI G+V++VGS V RFKVGDHVGVGTYVNSCR C 
Sbjct: 181 GICYADVVWTRNKHGDSKYPVVPGHEIVGVVQSVGSGVSRFKVGDHVGVGTYVNSCRDCH 240

Query: 241 YCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLC 300
           YC  G E  C  GS+YTFN +D DGTITKGGYS+ IVVHQRYCFKIPD+YPLASAAPLLC
Sbjct: 241 YCNQGLENHCQKGSVYTFNHVDADGTITKGGYSSFIVVHQRYCFKIPDSYPLASAAPLLC 300

Query: 301 AGITVYAPMVRNKMN-QPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALS 360
           AGITVYAPMVR+KMN QPGKS GVIGLGGLGH+AVKFGKAFGLNVTVFSTSISKKEEAL+
Sbjct: 301 AGITVYAPMVRHKMNQQPGKSFGVIGLGGLGHMAVKFGKAFGLNVTVFSTSISKKEEALT 360

Query: 361 ILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKF 420
           +LGAD FVLSSD+EQM+ L+ S DFIID ASGDHPFDPYM+ LK  GV VLVGFP+EVK 
Sbjct: 361 LLGADKFVLSSDQEQMKGLANSFDFIIDTASGDHPFDPYMALLKTYGVFVLVGFPSEVKL 420

Query: 421 SPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAIER 470
           SP +LNLG +T+ GS  GGTR  QEM+ FCAA GI PNIEVIPIQY+NEA+ER
Sbjct: 421 SPVNLNLGDKTLCGSTVGGTREIQEMLEFCAAKGIQPNIEVIPIQYANEALER 466

BLAST of Sgr024537 vs. ExPASy TrEMBL
Match: A0A7J6H405 (PKS_ER domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_016912 PE=3 SV=1)

HSP 1 Score: 645.2 bits (1663), Expect = 3.1e-181
Identity = 326/473 (68.92%), Positives = 374/473 (79.07%), Query Frame = 0

Query: 1   MWLRCIQRIGYAGGGGGSAGLISSTFARYFSRKRAENLRKINPKLTPQEASLVAQDLYGV 60
           MWL  +          GSAG   S+F RYFSRKRAEN+RKINPK++PQEAS +A++LY V
Sbjct: 1   MWLNSV-----GARFNGSAGGAVSSFVRYFSRKRAENVRKINPKVSPQEASSIARNLYDV 60

Query: 61  LKQHGPLTVSDAWIKAKESGVSGLNSKTHMKLMLKWMRGRKMLKLFCNQVGSSKNFLLST 120
           +K+HGPLT+ +AW+ AK+SGV G+NSKTHMKL+LKWMRGRKMLKL+   VGSSK FLLST
Sbjct: 61  IKEHGPLTIPNAWVHAKDSGVGGINSKTHMKLLLKWMRGRKMLKLWSTGVGSSKKFLLST 120

Query: 121 LPDDPQADQLKISSEVGIQREKPPCCRAV---GGLSSSGSFVLMAGSQRDDDVSITITHC 180
           LP+DP+  Q++   EV IQ +KP    A     G+ S   F   A     DDVSITITHC
Sbjct: 121 LPEDPKVTQIRSDMEVKIQHKKPSSSWAATDPSGIVSPYKFNRRAVG--SDDVSITITHC 180

Query: 181 GICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCE 240
           GICYADV+WTRNK GD+KYP+VPGHEI G+V++VGS V RFKVGDHVGVGTYVNSCR C 
Sbjct: 181 GICYADVVWTRNKHGDSKYPVVPGHEIVGVVQSVGSGVSRFKVGDHVGVGTYVNSCRDCH 240

Query: 241 YCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLC 300
           YC  G E  C  GS+YTFN +D DGTITKGGYS+ IV        IPD+YPLASAAPLLC
Sbjct: 241 YCNQGLENHCQKGSVYTFNHVDADGTITKGGYSSFIV--------IPDSYPLASAAPLLC 300

Query: 301 AGITVYAPMVRNKMN-QPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALS 360
           AGITVYAPMVR+KMN QPGKS GVIGLGGLGH+AVKFGKAFGLNVTVFSTSISKKEEAL+
Sbjct: 301 AGITVYAPMVRHKMNQQPGKSFGVIGLGGLGHMAVKFGKAFGLNVTVFSTSISKKEEALT 360

Query: 361 ILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKF 420
           +LGAD FVLSSD+EQM+ L+ S DFIID ASGDHPFDPYM+ LK  GV VLVGFP+EVK 
Sbjct: 361 LLGADKFVLSSDQEQMKGLANSFDFIIDTASGDHPFDPYMALLKTYGVFVLVGFPSEVKL 420

Query: 421 SPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAIER 470
           SP +LNLG +T+ GS  GGTR  QEM+ FCAA GI PNIEVIPIQY+NEA+ER
Sbjct: 421 SPVNLNLGDKTLCGSTVGGTREIQEMLEFCAAKGIQPNIEVIPIQYANEALER 458

BLAST of Sgr024537 vs. ExPASy TrEMBL
Match: A0A6J1E383 (probable cinnamyl alcohol dehydrogenase 1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111026080 PE=3 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 1.0e-160
Identity = 284/327 (86.85%), Positives = 300/327 (91.74%), Query Frame = 0

Query: 146 CRAVGGLSSSGSFVLMAGSQR---DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGH 205
           C       SSG       S+R   D DVSITITHCGICYADVIWTRNK  D+KYPLVPGH
Sbjct: 10  CLGWAATDSSGFLSPYKFSRRVLGDYDVSITITHCGICYADVIWTRNKHADSKYPLVPGH 69

Query: 206 EIAGIVKNVGSNVQRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDG 265
           EIAGIVKNVGS+V RFKVGDHVGVGTYVNSCRQCEYCEDGQEVSC SGSIYTFNAIDVDG
Sbjct: 70  EIAGIVKNVGSSVHRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCTSGSIYTFNAIDVDG 129

Query: 266 TITKGGYSNHIVVHQRYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIG 325
           TITKGGYSN+IVVH+RYC++IPDNYPLASAAPLLCAGITVYAPM+R+KMNQPGKSLGVIG
Sbjct: 130 TITKGGYSNYIVVHERYCYRIPDNYPLASAAPLLCAGITVYAPMIRHKMNQPGKSLGVIG 189

Query: 326 LGGLGHLAVKFGKAFGLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFI 385
           LGGLGH+AVKFGKAFGLNVTVFSTSISKKE AL +L AD FV+SSDKEQMEALSKSLDFI
Sbjct: 190 LGGLGHMAVKFGKAFGLNVTVFSTSISKKEHALRLLKADKFVISSDKEQMEALSKSLDFI 249

Query: 386 IDAASGDHPFDPYMSTLKIGGVMVLVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEM 445
           ID ASGDHPFDPYMSTLKIGGVMVLVGFP+EVKFSPASLNLGLRTISGSITGGT+VTQEM
Sbjct: 250 IDTASGDHPFDPYMSTLKIGGVMVLVGFPSEVKFSPASLNLGLRTISGSITGGTKVTQEM 309

Query: 446 INFCAAHGIYPNIEVIPIQYSNEAIER 470
           I+FCAA+GIYPNIEVIPIQYSNEAIER
Sbjct: 310 IDFCAANGIYPNIEVIPIQYSNEAIER 336

BLAST of Sgr024537 vs. ExPASy TrEMBL
Match: A0A6J1H2E7 (probable cinnamyl alcohol dehydrogenase 1 OS=Cucurbita moschata OX=3662 GN=LOC111459811 PE=3 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 1.1e-159
Identity = 272/303 (89.77%), Positives = 290/303 (95.71%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           DDDVSITITHCGICYADV+WTRNK GD+KYPLVPGHEIAGIVK VG+NV RFKVGDHVGV
Sbjct: 34  DDDVSITITHCGICYADVLWTRNKLGDSKYPLVPGHEIAGIVKTVGANVHRFKVGDHVGV 93

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDN 286
           GTYVNSCRQCEYCEDGQEV C  GS  TFN ID DGTITKGGYSN+IVVH+RYC++IP+N
Sbjct: 94  GTYVNSCRQCEYCEDGQEVCCTGGSTNTFNGIDFDGTITKGGYSNYIVVHERYCYRIPEN 153

Query: 287 YPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 346
           YPLASAAPLLCAGITVY+PM+R+KMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST
Sbjct: 154 YPLASAAPLLCAGITVYSPMIRHKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 213

Query: 347 SISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMV 406
           SISKKEEALS+LGAD FVLSSD EQMEALSKSLDFIIDAASGDHPFDPYMSTLK+GG MV
Sbjct: 214 SISKKEEALSVLGADKFVLSSDTEQMEALSKSLDFIIDAASGDHPFDPYMSTLKVGGTMV 273

Query: 407 LVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 466
           LVGFP++VKFSPASLNLG+RTISGS+TGGTRVTQEMINFCAAHGIYPNI+VIPIQYSNEA
Sbjct: 274 LVGFPSQVKFSPASLNLGMRTISGSVTGGTRVTQEMINFCAAHGIYPNIDVIPIQYSNEA 333

Query: 467 IER 470
           IER
Sbjct: 334 IER 336

BLAST of Sgr024537 vs. ExPASy TrEMBL
Match: A0A1S4DS95 (probable cinnamyl alcohol dehydrogenase 1 OS=Cucumis melo OX=3656 GN=LOC103482886 PE=3 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 4.4e-159
Identity = 272/303 (89.77%), Positives = 292/303 (96.37%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           DDDVSITITHCGICYADV+WTRNK GD+KYPLVPGHEIAGIVKNVG+NVQRFKVGDHVGV
Sbjct: 34  DDDVSITITHCGICYADVVWTRNKLGDSKYPLVPGHEIAGIVKNVGANVQRFKVGDHVGV 93

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDN 286
           GTYVNSCRQCEYCED QEVSC SG  +TFN+ID DGTITKGGYSN+IVVH+RYC+KIPDN
Sbjct: 94  GTYVNSCRQCEYCEDCQEVSCTSGCTHTFNSIDFDGTITKGGYSNYIVVHERYCYKIPDN 153

Query: 287 YPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 346
           YPLASAAPLLCAGITVY+PM+R+ MNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST
Sbjct: 154 YPLASAAPLLCAGITVYSPMIRHNMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFST 213

Query: 347 SISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMV 406
           SISKKEEALS+LGAD FVLSSD +QMEALSKSLDFIIDAASGDHPFDPYMSTLK+GGVMV
Sbjct: 214 SISKKEEALSVLGADKFVLSSDNKQMEALSKSLDFIIDAASGDHPFDPYMSTLKVGGVMV 273

Query: 407 LVGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEA 466
           LVGFP+EVKF PASL LG+RTISGS+TGGT++TQEMI+FCAAHGIYPNIEVIPIQYSNEA
Sbjct: 274 LVGFPSEVKFIPASLILGMRTISGSVTGGTKLTQEMIDFCAAHGIYPNIEVIPIQYSNEA 333

Query: 467 IER 470
           IER
Sbjct: 334 IER 336

BLAST of Sgr024537 vs. TAIR 10
Match: AT1G72680.1 (cinnamyl-alcohol dehydrogenase)

HSP 1 Score: 507.7 bits (1306), Expect = 1.5e-143
Identity = 234/302 (77.48%), Positives = 275/302 (91.06%), Query Frame = 0

Query: 168 DDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGVG 227
           DDVS+TITHCG+CYADVIW+RN+ GD+KYPLVPGHEIAGIV  VG NVQRFKVGDHVGVG
Sbjct: 36  DDVSLTITHCGVCYADVIWSRNQHGDSKYPLVPGHEIAGIVTKVGPNVQRFKVGDHVGVG 95

Query: 228 TYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQRYCFKIPDNY 287
           TYVNSCR+CEYC +GQEV+C  G ++TFN ID DG++TKGGYS+HIVVH+RYC+KIP +Y
Sbjct: 96  TYVNSCRECEYCNEGQEVNCAKG-VFTFNGIDHDGSVTKGGYSSHIVVHERYCYKIPVDY 155

Query: 288 PLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVFSTS 347
           PL SAAPLLCAGITVYAPM+R+ MNQPGKSLGVIGLGGLGH+AVKFGKAFGL+VTVFSTS
Sbjct: 156 PLESAAPLLCAGITVYAPMMRHNMNQPGKSLGVIGLGGLGHMAVKFGKAFGLSVTVFSTS 215

Query: 348 ISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGVMVL 407
           ISKKEEAL++LGA+ FV+SSD +QM+AL KSLDF++D ASGDH FDPYMS LKI G  VL
Sbjct: 216 ISKKEEALNLLGAENFVISSDHDQMKALEKSLDFLVDTASGDHAFDPYMSLLKIAGTYVL 275

Query: 408 VGFPNEVKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYSNEAI 467
           VGFP+E+K SPA+LNLG+R ++GS+TGGT++TQ+M++FCAAH IYPNIEVIPIQ  NEA+
Sbjct: 276 VGFPSEIKISPANLNLGMRMLAGSVTGGTKITQQMLDFCAAHKIYPNIEVIPIQKINEAL 335

Query: 468 ER 470
           ER
Sbjct: 336 ER 336

BLAST of Sgr024537 vs. TAIR 10
Match: AT4G37970.1 (cinnamyl alcohol dehydrogenase 6)

HSP 1 Score: 295.0 bits (754), Expect = 1.5e-79
Identity = 147/313 (46.96%), Positives = 206/313 (65.81%), Query Frame = 0

Query: 158 FVLMAGSQRDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQR 217
           FV       +++V + + +CGIC++D+   +N++  + YPLVPGHEI G V  +G+ V +
Sbjct: 29  FVFSRRKTGEEEVRVKVLYCGICHSDLHCLKNEWHSSIYPLVPGHEIIGEVSEIGNKVSK 88

Query: 218 FKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYSNHIVVHQ 277
           F +GD VGVG  V+SCR CE C + QE  C + +I T+N +  DGTI  GGYS+HIVV +
Sbjct: 89  FNLGDKVGVGCIVDSCRTCESCREDQENYC-TKAIATYNGVHHDGTINYGGYSDHIVVDE 148

Query: 278 RYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLAVKFGKAF 337
           RY  KIP   PL SAAPLLCAGI++Y+PM    +  P K +G++GLGGLGH+ V+F KAF
Sbjct: 149 RYAVKIPHTLPLVSAAPLLCAGISMYSPMKYFGLTGPDKHVGIVGLGGLGHIGVRFAKAF 208

Query: 338 GLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMS 397
           G  VTV S++  K ++AL  LGAD F++S+D++QM+A   ++D IID  S  H   P + 
Sbjct: 209 GTKVTVVSSTTGKSKDALDTLGADGFLVSTDEDQMKAAMGTMDGIIDTVSASHSISPLIG 268

Query: 398 TLKIGGVMVLVGFPNE-VKFSPASLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIE 457
            LK  G +VL+G   +    S  SL LG ++I+GS  GG + TQEMI+F A HGI   IE
Sbjct: 269 LLKSNGKLVLLGATEKPFDISAFSLILGRKSIAGSGIGGMQETQEMIDFAAEHGIKAEIE 328

Query: 458 VIPIQYSNEAIER 470
           +I + Y N A++R
Sbjct: 329 IISMDYVNTAMDR 340

BLAST of Sgr024537 vs. TAIR 10
Match: AT4G39330.1 (cinnamyl alcohol dehydrogenase 9)

HSP 1 Score: 292.4 bits (747), Expect = 9.8e-79
Identity = 152/324 (46.91%), Positives = 207/324 (63.89%), Query Frame = 0

Query: 150 GGLSSSGSFVLMAGSQRD---DDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAG 209
           G    SG       S+RD   +DV++ I  CG+C+ D+   +N +G + YP+VPGHEI G
Sbjct: 17  GARDKSGVLSPFHFSRRDNGENDVTVKILFCGVCHTDLHTIKNDWGYSYYPVVPGHEIVG 76

Query: 210 IVKNVGSNVQRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITK 269
           I   VG NV +FK GD VGVG    SC+ CE C+   E  C   S +T+NAI  DGT   
Sbjct: 77  IATKVGKNVTKFKEGDRVGVGVISGSCQSCESCDQDLENYCPQMS-FTYNAIGSDGTKNY 136

Query: 270 GGYSNHIVVHQRYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGL 329
           GGYS +IVV QR+  + P+N P  S APLLCAGITVY+PM    M + GK LGV GLGGL
Sbjct: 137 GGYSENIVVDQRFVLRFPENLPSDSGAPLLCAGITVYSPMKYYGMTEAGKHLGVAGLGGL 196

Query: 330 GHLAVKFGKAFGLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAA 389
           GH+AVK GKAFGL VTV S+S +K EEA++ LGAD+F++++D ++M+A   ++D+IID  
Sbjct: 197 GHVAVKIGKAFGLKVTVISSSSTKAEEAINHLGADSFLVTTDPQKMKAAIGTMDYIIDTI 256

Query: 390 SGDHPFDPYMSTLKIGGVMVLVGFPNE-VKFSPASLNLGLRTISGSITGGTRVTQEMINF 449
           S  H   P +  LK+ G ++ +G P + ++     L LG + + GS  GG + TQEM++F
Sbjct: 257 SAVHALYPLLGLLKVNGKLIALGLPEKPLELPMFPLVLGRKMVGGSDVGGMKETQEMLDF 316

Query: 450 CAAHGIYPNIEVIPIQYSNEAIER 470
           CA H I  +IE+I +   N A+ER
Sbjct: 317 CAKHNITADIELIKMDEINTAMER 339

BLAST of Sgr024537 vs. TAIR 10
Match: AT4G37980.1 (elicitor-activated gene 3-1)

HSP 1 Score: 285.4 bits (729), Expect = 1.2e-76
Identity = 148/320 (46.25%), Positives = 200/320 (62.50%), Query Frame = 0

Query: 151 GLSSSGSFVLMAGSQRDDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKN 210
           G+ S  SF   A  ++  DV   +  CGIC+ D+   +N++G   YPLVPGHEI G+V  
Sbjct: 19  GILSPFSFSRRATGEK--DVRFKVLFCGICHTDLSMAKNEWGLTTYPLVPGHEIVGVVTE 78

Query: 211 VGSNVQRFKVGDHVGVGTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTITKGGYS 270
           VG+ V++F  GD VGVG    SCR C+ C DG E  C    I T  A + D T+T GGYS
Sbjct: 79  VGAKVKKFNAGDKVGVGYMAGSCRSCDSCNDGDENYC-PKMILTSGAKNFDDTMTHGGYS 138

Query: 271 NHIVVHQRYCFKIPDNYPLASAAPLLCAGITVYAPMVRNKMNQPGKSLGVIGLGGLGHLA 330
           +H+V  + +  +IPDN PL  AAPLLCAG+TVY+PM  + +++PG  +GV+GLGGLGH+A
Sbjct: 139 DHMVCAEDFIIRIPDNLPLDGAAPLLCAGVTVYSPMKYHGLDKPGMHIGVVGLGGLGHVA 198

Query: 331 VKFGKAFGLNVTVFSTSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDH 390
           VKF KA G  VTV STS  K++EA++ LGAD F++S D +QM+    ++D IID  S  H
Sbjct: 199 VKFAKAMGTKVTVISTSERKRDEAVTRLGADAFLVSRDPKQMKDAMGTMDGIIDTVSATH 258

Query: 391 PFDPYMSTLKIGGVMVLVGFPNEVKFSPA-SLNLGLRTISGSITGGTRVTQEMINFCAAH 450
           P  P +  LK  G +V+VG P E    P   L  G + + GS+ GG + TQEM++    H
Sbjct: 259 PLLPLLGLLKNKGKLVMVGAPAEPLELPVFPLIFGRKMVVGSMVGGIKETQEMVDLAGKH 318

Query: 451 GIYPNIEVIPIQYSNEAIER 470
            I  +IE+I   Y N A+ER
Sbjct: 319 NITADIELISADYVNTAMER 335

BLAST of Sgr024537 vs. TAIR 10
Match: AT2G21730.1 (cinnamyl alcohol dehydrogenase homolog 2)

HSP 1 Score: 279.6 bits (714), Expect = 6.6e-75
Identity = 144/306 (47.06%), Positives = 197/306 (64.38%), Query Frame = 0

Query: 167 DDDVSITITHCGICYADVIWTRNKFGDAKYPLVPGHEIAGIVKNVGSNVQRFKVGDHVGV 226
           ++DV++ I  CG+C++D+   +N +G ++YP++PGHEI GI   VG NV +FK GD VGV
Sbjct: 31  ENDVTVKILFCGVCHSDLHTIKNHWGFSRYPIIPGHEIVGIATKVGKNVTKFKEGDRVGV 90

Query: 227 GTYVNSCRQCEYCEDGQEVSCISGSIYTFNAIDVDGTI-TKGGYSNHIVVHQRYCFKIPD 286
           G  + SC+ CE C    E  C    ++T+N+   DGT   +GGYS+ IVV  R+   IPD
Sbjct: 91  GVIIGSCQSCESCNQDLENYC-PKVVFTYNSRSSDGTSRNQGGYSDVIVVDHRFVLSIPD 150

Query: 287 NYPLASAAPLLCAGITVYAPMVRNKM-NQPGKSLGVIGLGGLGHLAVKFGKAFGLNVTVF 346
             P  S APLLCAGITVY+PM    M  + GK LGV GLGGLGH+AVK GKAFGL VTV 
Sbjct: 151 GLPSDSGAPLLCAGITVYSPMKYYGMTKESGKRLGVNGLGGLGHIAVKIGKAFGLRVTVI 210

Query: 347 STSISKKEEALSILGADTFVLSSDKEQMEALSKSLDFIIDAASGDHPFDPYMSTLKIGGV 406
           S S  K+ EA+  LGAD+F++++D ++M+    ++DFIID  S +H   P  S LK+ G 
Sbjct: 211 SRSSEKEREAIDRLGADSFLVTTDSQKMKEAVGTMDFIIDTVSAEHALLPLFSLLKVNGK 270

Query: 407 MVLVGFPNEVKFSPA-SLNLGLRTISGSITGGTRVTQEMINFCAAHGIYPNIEVIPIQYS 466
           +V +G P +    P  SL LG + + GS  GG + TQEM+ FCA H I  +IE+I +   
Sbjct: 271 LVALGLPEKPLDLPIFSLVLGRKMVGGSQIGGMKETQEMLEFCAKHKIVSDIELIKMSDI 330

Query: 467 NEAIER 470
           N A++R
Sbjct: 331 NSAMDR 335

The following BLAST results are available for this feature:
Database: NCBI nr
Match Name | E-value | Identity (%) | Description
KAF4353067.1 | 6.4e-181 | 68.92 | hypothetical protein F8388_016912 [Cannabis sativa] >KAF4389528.1 hypothetical p...
XP_022159743.1 | 2.1e-160 | 86.85 | probable cinnamyl alcohol dehydrogenase 1 isoform X1 [Momordica charantia] >XP_0...
XP_022958652.1 | 2.4e-159 | 89.77 | probable cinnamyl alcohol dehydrogenase 1 [Cucurbita moschata]
KAG7035534.1 | 9.0e-159 | 89.44 | putative cinnamyl alcohol dehydrogenase 1 [Cucurbita argyrosperma subsp. argyros...
XP_008437476.1 | 9.0e-159 | 89.77 | PREDICTED: probable cinnamyl alcohol dehydrogenase 1 [Cucumis melo] >XP_01689887...

Database: ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9CAI3 | 2.1e-142 | 77.48 | Probable cinnamyl alcohol dehydrogenase 1 OS=Arabidopsis thaliana OX=3702 GN=CAD...
Q8H859 | 6.5e-128 | 70.07 | Probable cinnamyl alcohol dehydrogenase 1 OS=Oryza sativa subsp. japonica OX=399...
Q2R114 | 5.5e-127 | 69.08 | Putative cinnamyl alcohol dehydrogenase 4 OS=Oryza sativa subsp. japonica OX=399...
Q337Y2 | 3.9e-88 | 53.04 | Probable cinnamyl alcohol dehydrogenase 3 OS=Oryza sativa subsp. japonica OX=399...
Q43137 | 4.9e-83 | 51.49 | Probable mannitol dehydrogenase 1 OS=Stylosanthes humilis OX=35628 GN=CAD1 PE=2 ...

Database: ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A803PPG7 | 3.1e-189 | 70.61 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A7J6H405 | 3.1e-181 | 68.92 | PKS_ER domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_016912 PE=3...
A0A6J1E383 | 1.0e-160 | 86.85 | probable cinnamyl alcohol dehydrogenase 1 isoform X1 OS=Momordica charantia OX=3...
A0A6J1H2E7 | 1.1e-159 | 89.77 | probable cinnamyl alcohol dehydrogenase 1 OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A1S4DS95 | 4.4e-159 | 89.77 | probable cinnamyl alcohol dehydrogenase 1 OS=Cucumis melo OX=3656 GN=LOC10348288...

Database: TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G72680.1 | 1.5e-143 | 77.48 | cinnamyl-alcohol dehydrogenase
AT4G37970.1 | 1.5e-79 | 46.96 | cinnamyl alcohol dehydrogenase 6
AT4G39330.1 | 9.8e-79 | 46.91 | cinnamyl alcohol dehydrogenase 9
AT4G37980.1 | 1.2e-76 | 46.25 | elicitor-activated gene 3-1
AT2G21730.1 | 6.6e-75 | 47.06 | cinnamyl alcohol dehydrogenase homolog 2

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR020843 | Polyketide synthase, enoylreductase domain | SMART | SM00829 | PKS_ER_names_mod | coord: 151..495; e-value: 0.0025; score: -133.4
None | No IPR available | GENE3D | 3.90.180.10 | - | coord: 165..469; e-value: 6.6E-126; score: 421.8
None | No IPR available | GENE3D | 3.40.50.720 | - | coord: 298..453; e-value: 6.6E-126; score: 421.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 590..644
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 590..608
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 615..641
None | No IPR available | PANTHER | PTHR42683:SF55 | SUBFAMILY NOT NAMED | coord: 164..469
None | No IPR available | PANTHER | PTHR42683 | ALDEHYDE REDUCTASE | coord: 164..469
None | No IPR available | CDD | cd05283 | CAD1 | coord: 147..469; e-value: 5.06897E-148; score: 431.92
IPR013154 | Alcohol dehydrogenase, N-terminal | PFAM | PF08240 | ADH_N | coord: 168..283; e-value: 1.5E-25; score: 89.2
IPR013149 | Alcohol dehydrogenase, C-terminal | PFAM | PF00107 | ADH_zinc_N | coord: 325..446; e-value: 8.1E-15; score: 55.0
IPR029752 | D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding domain conserved site 1 | PROSITE | PS00065 | D_2_HYDROXYACID_DH_1 | coord: 318..345
IPR002328 | Alcohol dehydrogenase, zinc-type, conserved site | PROSITE | PS00059 | ADH_ZINC | coord: 201..215
IPR011032 | GroES-like superfamily | SUPERFAMILY | 50129 | GroES-like | coord: 166..316
IPR036291 | NAD(P)-binding domain superfamily | SUPERFAMILY | 51735 | NAD(P)-binding Rossmann-fold domains | coord: 288..451

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr024537.1 | Sgr024537.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009809 | lignin biosynthetic process
molecular_function | GO:0016616 | oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
molecular_function | GO:0008270 | zinc ion binding
molecular_function | GO:0016491 | oxidoreductase activity