Cp4.1LG20g03980 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g03980
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: (3S,6E)-nerolidol synthase 1-like
Location: Cp4.1LG20: 2218181 .. 2238456 (-)
RNA-Seq Expression: Cp4.1LG20g03980
Synteny: Cp4.1LG20g03980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGTGCGGTGGCTAAAGTGGAGAGTTTGAAGCATGTTTTGAGAGAATCAGGCGATTGTTTGGAGCGTTTGGACCTAATTGATGCAGCCCAACGCCTAGGCATTGACAACCATCTCCAAGAGGAAATTGAAGCTGTTCTAAAGCGGCAATACTTCTTATTGAATGCGCTTCAATTTGATCCAATGATTGATCTTCACAAGGCTGCTCTCCTTTTTCGACTTTTGACTCAACAGGGCTTCCTTGTCACTCCAAGTTATTATTCCTCCAAACCTTTAACTTCATAGAATTAGGGCTAATTTTAACTCTAATTGTAACATCCCAAGCATCATACATGGGGCCGTGTGCTAGCGATAACGCTGGCTAAGAGGATGGATTGTGAGATTCCACGTTAATTGAAGAGGAGAACGAAGCATTATAAGAGTGTGGAAATCTCTCCCTAACAAACGCGTTTTAAAATCCTGAAGAGAAGCCCAGAAGGGAAAGTCCTCTCCCTAACTGACGCGTTTTAGAACCTTGAGGGAAACCCTCTTCCTAACAGACGTGTTTTAAAACCGAAGCCCATAAGGGAAAACCCTCTCCCTAACAGACGTGTTTTAAAACCTTCATGGGAAGCCCAGAAGGGAAAGTTCTCTCTTTAACATACGCGTTTTAAAACTGAAGCCCATAAGGGAAAGCCCTCTCTCTAACAGACCTGTTTTAAAATCTTAATGGGAAGCCCAGAACCGAAAGTTCTCTCTTTAACAAACGCGTTTTAAAACTGAAGCCCATAAGGGAAAGCCCTCTCTCTAACAGACGTGTTTTAAAACCTTGATGGGAAGCCCAGAAGGGAAAGTTCTCTCTTTAACAAACTATTTTAAAACCGAAGCCCATAAGGGAAAGTCCTCTCTCTAACGGATGCGTTTTAAAACTTTAAGGGGAAGCCCAGAAGGGAAAGTTCTCTCTCTAAGAGACGCATTTTAAAACTGAAGTCCATGAGGGAAAGCCCTCTCTCTAACCGACGTGTTTTAAAATTTTGATGGGAAGCCCAGAAGGGAAAGTTCTCTCTTTAACAGACGCGTTTTAAAACTGAAGCTCATAAGGGAAAGCCCTCTCTCTAACCGACGTGTTTTAAAATCTTGATGGGAAGCCCAGAAGGGAAAGTTCTCTCTTTATCAGACGCGTTTTAAAGCTGAAGCCCATAAGGGAAAGCTCTCTCTCTAACATACGTGTTTTAAAACCTTGATGGGAAGCCCAGAAGGGATAGTTCTCTCTTTAACAAATGTGTTTTAAAACCGAAGCCCATAAGGGAAAGTCCTCTCTCTAACGGATGCGGTTTAAAACTTTAAGGGGAAGCCCAGAAGGGAAAGTCCAAAAAAGACAATATCGGCTCAGGGTGGGTTTGGGCTGAAGCCCAAAAGGAAAAGCCCACAAAAGACAATATTTAGCCGTGGGTTTGGGTTGAAGCACGAAGTAGAGTGAAAAGAGGACAATATCCGCTAGTGGGTTTGGACTATTACACTAATTCTCATTCATGTAGTTGTTATTTGAACAGATTTGTTTAAGATTTTCTTGGACGAGGAGGGGAAGTTCAACAAAGAGCTAAGGCATGATATCAAGGGGCTGACAAGTTTACATGAAGCTTCGCAGCTATGCATGCATGGAGATGATATTCTTGAAGAAGCTCAAAATTTCAGTAGCCATTGGCTAAACGCTTGGGTTGTTGTTCATGTCAACCATCATTCAGCCACTTTCGTTCATAACACTCTACTTCATCCTTATCATAAAACCTTGCCCCAATTCATGTTGCCTAACTATTTTGGCGACAACCAATGGACAAACAAATGGATACACACTTTGCAAGGCGTAGCAAAAGTGAATTTCAATACAACTCAACGCTTACGCCAATACGAACTTAACCAATTCACAAAGTAAGTGTGTTGAATCGATTAATGTGAATTTAATTACCATTTGCTTTCAATTTGAAGGCTTTTTTCCCTTTGTTTAGATGGTGGAAAGAATCAGA
TTTGGGTAGAGAGCTGAATTTTGCAAGAAATCAGCCCATGAAATGGTACATCGCGTCGCTATCGTGTCTGACGGATGTGTGCCATTCTGAACAGAGAATCCAACTCGCCAAAGCCATTGCTTTTGTTTATCTTATTGATGATATCTTTGATGTATTTGGAACGCTGGAAGAGCTCACCATGTTCACAGAAGTTGTTTGTAGGTCGGGAAGCTGATTCATAGAAGCTTTTATTTTTCAAACTTAACATTTTTATTGCTTTAGTTATCAATGTTTTTGTGCAGATGGGATTTGGTTGCTGCTGAAAAGTTACCAAGCTGCATGCGTATATGCTTCAAATCTCTATTTGTAGTTACGAATGAAATAAGTGACCAAATATATCAAAAGCATGGTTGGAATCCAGCTACCTCCCTACAGACAGCGGTTTGTGTTTCTCTATAATTTATGTTAAATTACAAGTTTAACGAATATAATTATCTCTAATCTTGTGAGATCCCACGTTGGTTGGAGAGGAGAACGAAACATTCTTTTTGAGGTAATTTATCGAGTATAATTGATCTCTAATCTTGTGAGATCTCACGTTGGTTGTAGAGAAGAACGGAACATTCTTTATGAGGGTTCTCTCTGCCCATTTGTAACATCTCCCTACTAAACACGTTTTAAAAATCGTGAGGCTGACAGTGATACGTAACGGGCTAAAGCGGACAATATCAACTCGTGGTAGGCTTGGGCTGTTACAAATGGTATCAGAGCCGGATATCCGACCGTGTGCAAGCGAGAACATTGGCCCCTAAGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGGGTAACATACAATTTCTTAGAAGGGTGTGGAAACCTGTAACAGCCCCTCTTTGGGCTTTCCCTTTCGGGCTTCCCCTCAACGCTTTAAAACCCATATGCTAGGGGAAAGTTTCCACACCCTTGTAAATAAATGGTGGTTTGTTCTCCTCCCCAACTAATGTGTGACATCACAAAACCTCTCTCTAGCATAGAGTTTTAAAACTGAGATTGATGACTATACATAACCAGCTGAAGCCGATAATATCTAGTGGGCTTAGTCTGTTACAAATGGTATTAGAGCCAGACAAGAGTTAGTGTGCCAACGAGGATATTGGACCCCAAGGAAGGTGGATTGTGAGATCTCACGTTGGTTGGAGAAGGGTACCAAGTTCTTACATAGATGTGAAAACCTCTCTGCAGTAGAAGCCGCTTTAAAAAAAATAACATTAAGAAGAAGTCCAAAAAAGATAATATCTGCTAATCGTAAGCTTGAACCGACGGAATCTAACCGAATCTGTGTTGCATGCAGTGGGGTAAATTGTGCAAGGCATTTTTAGTTGAGGCAGAATGGTTCAGCTCAGGGCATTGGCCAAGTGCAGAAGAATATTTAAGAAATGGCAGAGTTAGCACGGGCGTCCACGTTACTTTAGTTCATGTTTTCTTTCTTCTTGGACACCAAATAAGGCAGCAAACCGTGGAGCTTTTGGAGGGGGATTTAGATATTGTTTGGTGCAGTGCTACAATATTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGTCATACTCAATCCCCAATCTTACGCATATCTTTAATACAAATGATATGAATGTATTAATGACATTGTCCACAGGATGAGAATCAAGAGGGGCGTGATGGGTCGTACGTAAAATACTACATGAAGGAGCATCCAAGCACGTCCTTTAATGCCACACAAAAGCATATTATGTGGAAGATTTCTGAAGCCTGGAAGAGTTTGAATAGAGAATGGCTGCATTGTAATCCATTGTTTCCTTCCCAATTCACTCAAGCTTCTCTTAACATTGCCAGAGCAGTTCCTTTGTTCTATAGTTATGGCAGAAACCAGTCCTTTCCCACCTTCGAGCAGCATATGGAACGATTGCTCTTTCATAGCCTGGATATTTAGTTCAAGGATCTATTACTTTTATGTGTGTAATTGATACGAAAAGTTAAC
GTGAACTCTTGAACAAGAAGTTTTATATTTTGGTCGAAGATATATATCTTAATTTCTTAAGGTATGTTTCTTTATAAACTTGAACGAAAGTTGATTACCAAATTTGATTTTAACTTTGACCCAGAGGAATGGAAATAGAAAAGAGGATTGTTTTTGTGAGATCCCGCGTTCATTAGTGTAGAAACATTTCTCTAAAAAATGCGTTTTAAAAACGTGAGGCCGACGATTATACATAACAGTCTAAAGCAGACAATATCCGCTAACAGTGGACTTGAGCGGTTACAAATGGTACTAAGTCTCCAAAAGAGATGGATTGTGACATCCCACGTTGGAGAGAAGAGATTCTTGACAACTCATTCCAATCAAACGTAAGAAAAATCTATCTTCGGCATAAACTCCAACTCCAAATGGTCACATCTCACATGGCTCCCCCACTCCAATCCACCAAATTTCTTGTCACAGTGGGAACGGACTATGATTTAGGTGGCAGCTTTCCAATGTGGGCACCCAATTTTTAAGCTTATGAAAATCTAAATTTATATCATTAGGTAGAGATAAATATTAAAATAGGAGACAATAACGGACATGGAAAAGGGTTCTTTATTCCCCTTTTTAATGACTAGAACATTTGAATATGTAATGTCCCATGTCTACGATTCAGATCAAAATTTGGAACTCAAATTCAATTCCTCATGTTCCAGCATTTCCCTGTATCTCTTATAACATCACCAAAAAGAAAGGTGCACATTGTTAGTATAAATAGTAACTTTCATTTACTTTTAAGTCTTTTTTAGTCATTCTATCTTCAAGATTGCTCTCATTCGGATTTAGATTCATATCAAACGATGAAAGGATATTTGCAAATGGACCACAAGTCTCCATTTTATTTTATTATTAAATGGTCCACACTTGTAAAAATTGGCTGCATGTGTGGAAGAGAGAAATTAGAGGAAGAGAAGAGTGTGTTTCATGGCCAAGAGTCAAACTCTGAATAATGCTTTTTCATTTTTAGGTTGTCTGTCTCCTCAGCTTCATACCTACCAGCTTCATACCTACCAGCTTCATACCTACCATATCCAAAGGCTACCATGCAATATTGCATGATTCATGTTCCAATGGTTTGTAACTAATTTAGCAAATGTTACGTACACGCCAACATAGCATGTTTAAGGTTCAACTCAAAATATCTAGTGGGTGAGAAGTTCTTCCCTCCCCCATTGTTTAAGTTTTCGAGTTAATTGCGATAAAACACGGTACAAAAACACCAAGGTTTATATTAGAAGTTCTATAATGAGATCTATCAGTGATAACATAATATGTGAGATCCCACGTTGGTTTGAGAGAAACGAAACATTCAAGTCTGAAACCTCTCTCTAGTAGACGCATTTGAAAATCGCGAAACTAAAACCACGAATGGCTGCTGCAGCTAGGTGTGGAACGATCATGTTCAACCAACTCGTGTTTGTGGTGCATATTGTATAAAGAAAAAAGATACCATAAAATGTGTTTCACAACCCAGAGTCAGCAATTAAATTTGGTACACAACATGAAAGGGTGTAGTATGGCCTACTTTGTCTATTTTGTTTGTGGGTAGAAACTTGTGATCGGGATGGGTAAGATTTTTATGGTTGATTTCATTTAGTTCCCTTATGAACCTCATAGAGTTTCATGATCTTCCATATTTTGCACGATGAATTGATGGTGATGTATATCATAATATTTAATACATTTTTATCTCTTCCCTGTTTCGTTTCGTTGGGTTTCGTTGGGTTGATCTGATCTATGCTAATAAATATTATTCGCTTTAGTCCCTTACGTATTATCGTCAGGTAACGATTTTAAAATGCGTCAGATAGAGAAAAGTTTTCATACATTTACAAAGTCACTCTTACGGTGTTAAAACGCGTCTAACGAGAGAGATTTCCACACTTTTATAAACAATATTTCGTTTTATTCTTCAATCGATGTGGGATCTCACACTATCTAAGTCCTCCTTTCTA
TTGGTTTCTAATTTTTCTAATAAACAAAAATGATCTTTCATATAATTTTGGTTAAATTACTTTAAAAAATCATTTTAAATAGAAACTATTATTATTTTATTTCAAAAAAATAAGTTTAAATTTTAAAAAATATTTAATAAATATCAAGTTTTTTTTGGTACAAGTTTGTAAATATGCATATCTCCTAATAACTATTAGAAATAAAACACGAAACACTTTTACGTATACAAAATTATGAGTCGTACCGTAACTCATATCCCCCAACATCGAAAAGTAATGTATGATATCATCCCTGTGAGTTCCCACATCGATTGGAGAAGAAAACAAAACATTTCTTTTAAAATCGTGAAGCTAAAAGCGATATGTAATGGGCAAAATTGAATAATATCTACTGGCGGTAGGCTTGGACTGTTATAAATGATATCAAAGCTAGATTCCAAGCGGTGTGTCAGCGAGGACGCTAGCACTCCAAAAAGGGTGATTTGTGAGATCCCACATCAATTAACGAAGAAAACAAAACATTTCTTCTAAAACTGTGAGACAGACGATTATACACGTAACCCCAAAATCAAAACCTAATTTAGAAGGATGAACATCAAGAACTTTTTTTTCATTAAAAAGTAAAGAATTAAAGGAAAACGTTTATTGTGTTTTGGAATGAACATATATTGATATGGTTTTGTTCCATACGCTACAAATTCAACCTTTGCTTCCTCTAAATTTGGACTAAATTCATGTGGGGCATTAACCAAATTGCGTATTGACTTTAAATCCAGAGTTGAAGACGCCATACTATATATAAACATCACCACACACCATTCACCACTCACACTCCAACACAACAAAACATGGCTTTTTCTCCACATGCTTCCTTTGCTCCCTTACTTGCATCACTAAACATTCCACAAAGCAATGATTCTCCTGTTAACAAGCGCTTGTTTAACAAACAAACCATTGTTCTTCGTGATCATACATTACTTTCCACTCACTTTAAACCCAATCGTTCCAAGAACCAATCTTTCATTTTAACAGTATGCTACACAGATCTCTCTCACTCTCATACTATGCTGTTTTTTTGTTCTTCGGTTAAATGGATTTTATGTTGCAGGATGATATTGGGCATGAAGCTCCAGTTAAAGGGAGGATTTTGAAGCATGTTTTGAGAGAAATAGGAGACCCTTGGGAATGTTTGAATCTAATTGATGCAACCCAACGCCTTGGCATTGATTATCATTTCCAAGAGGAAATTCAAGCTATTCTGCAAAGGCACTACGTATTGTTTAATGCTGTTCAATTAAATTCAGACACTGATCTCCACATGACTGCCCTCCTTTTTCGACTTTTCAGACAACATGGGTACCTCGTGTCGTCAGGTTGATATTCAATTCTATTATGAAATTTGATTTAAGATATAAAGTTAGCCGTGGAACATATATTTAAGTAATTATGAGTAGTATATTTTATCTTATTTTCAAGATTTCGTTACTAATATATTTTCTACGAGTTAGTTGGTAGCTTATATCTGTTAGGATCACACAACAACGCACACATTCGATCTAGATGAACACAAAGAACAGGATAGAAAAAATGCAATGCAACGCATAGACTCGATCTAGATGAACACAAAGAACAGGATAGAGAAAATACAAAGAACAGGATATAGAAAATACAAGGAGAACTCTAGCTAAAGGATTTATATCGATGACTACAAGTACACAGGACAAGAGAAAATGCAAGGACAACTCTTACTAAAATATTTATATCGATGACTAGGAGCTTTCTCTATACATAGCTATTTAACAATTTTCTTTATATGTCATTTACCAAATATAGCGTATAAAGTTGACTCAGCTCTAAATAAGCATACTCCCTGGCCTAACAATATCTTATCTAAAAGTTGTGAACATTAATTGAAATTCTAATTTTGTTTCTCATTCTTAACAAATTTTCTTACCAGATGTGTTTGAAAGTTTCTTGGACAAGGAGGGGAAGTTCAAGGA
AGGGTTGAAAGATGATATAAAGGGGCTGACGAGTTTATATGAAGCTTCGCAGCTATGCATGCATGGAGATGAGATTCTTGAAGAAGCTGAAAACTTTAGCAGCCATTGGCTAAAGGCTAAGGCTGAAGATGAAGAGGTTGATCATCATTTGGCCAGTTTTGTTCAGCATACTTTAGCTTATCCTCATCATAAAAGTGTGGTGCAATTAATGGCGCCTAACTATTTGAATGATATGCAATGGCCAAACAAATGGATTAGTATCTTTCGAGATGCTGCAAAAATGGAGCTTTACTCAGCTCAACGCTTGAGCCAACACGAACTCGCTCAATTTACGAAGTAAGCACTTATGGGTTTGATTTTTTTTTTTTTTTCTCTCGTATTAAATTGAAAATGTAACCGTTCAAGCTCATCATTAGTAGATATTGTCTTAGAAATAGCTTTACAGCTTTATTTCTCTCTCCAACCGACGTGAAATCTTTTAAAAAATATGTCTAAATTGCACTGGCTTTAGACTCGTGCAAAAGTGCCATGATTTTTACTAATAATGCATGGATTTTGTTCATGATGATGATTGACAGATGGTGGAAAGAAACAGATTTGGCGAAAGATTTGAATTTCTCTAGAGATCAACCCATTAAATGGTACGTTGCTTCACTGATTTGCCTCAGCACAGATTCCTTCTACTCCGAACAAAGAATCCAACTTGCAAAATCCATCTCTTTCATATATCTCATCGACGACATTTTCGACGTTTTCGGAACTCTAGACGAACTCACCATATTCACAGAAGCCGTTTGCAGGTTATAAATAAATTGCGTCCTGAAATCACTCTTATTTGCATTTCTTACCAGGGTGTGAAAATCTCTTAGTAATAGACACATTTTAAAAACTTTGAGGGAAAGCCCACAAAACAGACAATATCTGCTAGCAATGGGTTTGAGCTGTTACAAATGATATCAGAGTCAAACACTAAATGGTGTGCCAGCTAGACGCTGGACCTCCAAAAGGGATGGATTGTGAGATCCCACGTCGGTTGGAGAGGAGAACGAAACATTCCTTATAAAGGTGTGGAAACCTCTCCTAACAGATGCGTTTTAAAATCGTGATAAGTAACAAGCCAAAACAGACAATATTTACTAGCGGTAGACTTGAACGGTTACAGTTACATTATTAATATGTTATTTGCAATGGTTGTTTAGATGGGACATGGCTGCTGCTGAAGGATTACCAGACTGCATGCAAACATGTTTAAGAACCCTTTTTGAAGTTACTAATGAAATAAGCTGCCAGATCTATCAAGCCCATGGATGGAATCCTATTCACTCCTTACACAAAGCAGTATAATTTAAATTTACCCATTTCATTTGATCGAAGAAGCCATGGAATTAAATGTTTAAGTGAGTAAAGAGTTGAGTAACTTTTGCAGTGGGCTAAATTATGCAACGCATTTTTGGTGGAAGCTGAATGGATGAGTTCTGGGCAATCACCAAGCGCCGAAGAGTATCTAAAAAATGGAGTGGTTAGCACAGGCGTCCACGTTACATTAACACATGTCTTCTTTTTGTTAGGCGAGGCAATAAGCAAGGAAACTGTGGAGCTCTTTGATGAGGATTTAGATATCATTTCATCTAGTGCTACAGTTTTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGTTAGCTTAATTAATTATGTACCCATATCGTGCACGTATTCGAATGTTCGATTTTAATTTTTAAACTATGCTCATGTTGACATAGGATGAGAAGCAAGAGGGGCATGATGGGTCGTATTTAGAATATTACATGAAGGAGCATCCGAGTATGTGTTACGAAGAAACGAAGCGACATACTATGACGCAAATTTGTAATGCATGGAAGACTCTGAATACAGAATGCTTATTATCCAATCTATTTCCTGCCAAGTTCAATCAAGCTTGTCTCAATCTCGCAAGAGTGGTTCCTATTGCATATAACTATGGTAGAACCCAATCCATTATG
TCCCTTGAAAACCTTATTAAACAATTTCTCTTTCATCAAATGGAGGTGTAAATACAAATCAATTTTTTTTTCTTTTTTAGCGAAAAATTTTGAATTTTAACATTTTCGTCGATAATATAAATTTTAATCAATTAGATTGTAATTTGGTTGATTACTTTCAAATAAGCATTTTTTTTTCTTTTTTCGATTTTATTATTTCTTAGTCTTGAACTAAACTTGATTCTGGACTAGGTTAGTCATGAACTAAATTTATAATTCACACTTGGGGGCATGTAGGGTGCGCGTGCATAAGCCAATAGCTCATGTGTGCGCATGGAGCCAACTAACGTACTCGCTTGCAAGAGCCAATATTGACGCCCATAGGCACGTCAAGTCTATAGGCCTTAGCATGTACGCAAGAATTGGGTATTTTGACGCGCATTGTACATTTTTTATTCTTTATTTGATTGTCGTAGCTTTAATTCGAGAGGGAGTTTGTGGTGGAATGTTGCCATGGAAAATAGCTGGTCTTTGGATTCTTGAAAATGGACCATTGTTGCATTTATTTACAGAGTTATTTGAGACATTTCAGGCTCATTTTGAGCTCATACCAGTCTCAAATTTTTCTTCAAAATTTTTAAGACACCCACTTGTGAGAGAGAGGTTTCTAGGCCATTATAACTGCAAAAACACTTATAAAAACTTTTTATAACATAGACTATCACAACATACAAAAAACACTATAAAAAGTCTACGATAGCATAAGTTATCATATCCATTTTTTTTTTTTTTTGTAATGCATGTTTGGTTTAACGTAAATTAGGTAGCAACGGTTTGGTTTAACATAAATTAGGTTATTTAATAATTTTGTTTATCACATAAATATTTATGCAATATTAAAATTAAAAAAAAAAGAGAAAAAAAAAAAGAAAGACCACTCATATTGAATTATTGACACACATAATGATTGATATTTAAAACTTATTTAAAAGGTGGGAAATAGGTCCATCAATAATTGTGTGGAGGCATCAATTTTTGGTTTGTTTTTTTCAATTCGTTTCTGGGTTGTTTGTTAGTGCTTGATTGTGTGGTTTTAAATGCTACAAAATTTTACCCTATTTTTAATAGTTTATTTTATTTTATTTTCGTTGAACCCGAATCTCTCGAAGTCTCGAAAATCGAGTCGTGACATGCTTTTAAGATATTTGTCTCAAGTCTATTAACATATTTGATACGATTGTTTGTGTCTGTGAATGAAAGTGATGTACACTGAACATGTTTAAGATGCGTAAAAGTAAATTTTTATTAGAACCCTGAAAGTGATACAGGAAAAGAAAAGAAATGGAAACGAAAGTGTGGGCCTATTTCGGAAACCATAGCCATCAAGCTACAACTGGGTCTCAATTTTCAAACTCGTGTTTACATTCATTCAATAAACGTTCAGACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANTGAAATCTTACAACTAAAACTGAAATTGAAATATTATTTTGGTACCAAAGTAATTGAGATGGAAAAAGTGTTAGTAAATTCAACATGTCATTTAGGCTGCCAAATTCCATTCTATACTTTTGCCTTTTTTTTGTAGTCATTTTCATGTAAATCAATGATATGCTTACCTAAATAGGAATTGGCATTTTTTTTTTTTTTTTTGCTTGTGTATAATAAAAAAGAAAGGTACAATTACGACAATAGCACGTGGTCCCTTACTTTACTTGGTTGTAAGTATTCCTCGAGTATAAATCTAGTTCTAGTAGACACTTAAAATTGTGTATAATAAAAAAGAAAGGTACAATTACGACAATAGCACGTGGTCCCTTACTTTACTTGGTTGTAAGTATTCCTCGAGTATAAATCTAGTTCTAGTAGACACTTAAAATTGTGTATAATAAAAAAGAAAGGTACAATTACGACAATAGCACGTGGTCCCTTACTTTACTTGGTTGTAAGTATTCCTCGAGTATAAATCTAGTTCTAGTAGACACTTAA
AATTGTGTATAATAAAAAAGAAAGGTACAATTACGACAATAGCACGTGGTCCCTTACTTTACTTGGTTGTAAGTATTCCTCGAGTATAAATCTAGTTCTAGTAGACACTTAAAATTGTAAGGCTGACAACGATACGTAATGGATCAAAGTGGACAATATCTACTAGCGGTAGACTTGGGTCATTATAAAAATGGTATTAAAGTCCAACACCAATCGATATGCCAGTAAAGAGCAGTGGATTACTAGATCCCACATTAATTAGAAAGGTAAACAAAACCTTTCTTATAAAAATATAAAAATTGATTGGAATGATGCCTATAAATTGGTGGTATCACCAAATACATCCCACCATCTTACAATATCTTGAGCCATTCAAGGATTTCTTCATCCATCCCATGGAAGCAGCAGCATCAGCAATGCTAATAGTTGGAGTAATTCCTCATCCTCCTCCTCCTCCTCCAAAAGCTTCCATGCACAATATGGTTCCTCACTCTCAAAAATGGACCATTCCTCAACATCACTCCTCCCTTCCAATTTCCCTTCCTGCAGATTCACAAATTTGTCCTTCAATTGTATGCCTCTTTGATTTTATATTTTGTTTTGAAAGTGAAATTATCATGAATTTGTTGATAATTAGTGTAGATTTTGGTGCAATTTACGCAGGAGGAAATGAGCATGAAATATGAACTTGAAATGAAGAATCTGAAGCATTTGTTGAGTGAGACGGCCAAAATTGATTCTTTGGAGAGCTTGAATATGATTGATGCAATACAACGCTTGGGGATTGACCACTGCTTCAAACAGGAGATTAAAGCAATTCTTCAAACACAATACACAATGGAAACTCACAATTTTGATGCTAAATGTGGCCTTCATCACGTTGCTCTTCGCTTTCGACTCTTCAGACAACATGGCTATTTTGTACCACAAGGTGTGATTTCAATAACCTTTCATCATGCCAATAACTAAAACAAGAACCCAACAAAATAATAAAAAAAATTTGCTTTTGGGCAGATGTTTTTGAAGGCTTCATCGATCACGAGGGTCTATTAGACACAAAATTTATTGAAAATATCGAAGGGCTCACGAGTTTATACGAAGCTTCCCAATTATGCTTCCCCGAAGACGAAAAACTCGAGAAAATTGGAAATTTTAGTGCTCGTATCTTGAAGAAACTTGCGTGGAATCGCGACGATAATTTAGGCAAACATGTGAGGAAAGCTGTGGCCAATCCCTTTCATAAGAGCTTGGTGAAATTTGTGGTGATGGATTACTTTGGATCCCAATCTCCAAATAAGTGGATATATGTTTTCCAACCCATGGCAAAATTGGATTTCAATAGACTCCAGAACCTACACCGACTTGAATTGTCTCAATTCATTATGTAAGTTTATGCGTGCACTTATAAGCCACATATTTAGGGAATCTTATCACTTTTTTTTTGTTCCATTTCTCAAACATATTTTTTAAGATTTTATACTTTAAAAATTGTGAAATCTCTAGAGAAAAGAACGAAACATCCTTTACAAGAGTAAAGTGATTTAAAGGGTTGTTGTGAGATCGAACGTCAGTTGAAAAAAAGGAATGAAGCATTCCTTATAAGGGTGAGATGATTTAAAGGGTTGTTGTGAGATCTCACATCAGTTGGAGAGGAGTACTAAACATTCGTTGATGAAGTGATTCAAAAGGTTATTGTGAGATCCCACCTAGGTTGGAAAGGAGAACGAATCATTCCTTACAAGGGTGGGGTTGTTGTGAGATCATAAACATTGTTTATAAAGATATGAAAACTTCTCCACATTTTAAAATCGTGAAGCTGTTGGCGATACATAACCGGTCAAAACAAACAATATGGGCTAGTGATAAACGTAGTTGTAGTTTAAATTTTTGGGTTTGATGGGTTTTTTAATGTATTGAAAAACTATTATTATGAAGATGTTTTAATATTATTGTATGATATAATGGGCACATAGGTGGTGGAAGAAGACAGGATTATGTGA
AGAACTGAAGTTTGCAAGAGATCAGCCACTTAAATGGTACATTTGTTCGATGGCCGTCTTAGCAGATCCAATGTTTTCAGAAGAGAGAGTTGAGCTGACAAAATCTATATCTTTTATCTATCTTATTGACGACATCTACGATGTATACGGTTCACTTGAAGAACTTAGACTATTCACTAAAGCAATCCAAAGGTATGACACTAATTTCTTCGTGTTTTTTAAGGTACATGCATGTTTTGTGGACTGACCTCAGTTCATGCAGATGGGACTTGGCAGCTCTCGACGGCTTACCAAACCCAATGAAATTTTGCATTATCAAACTTCATGAAACAACAAATGAAATATGTCACAAGTTTTATCTAAAGCATGGCTGGAACCCAATTAATTCCATACACAAATCGGTAAATTTTTATATTAAAGCTGCTATGATTGTGATATCCTACATTGGTTGGGGAGGAGAACGAAACATCCTTTGTAAGGGTGGGAAAATCTTCCCCTAAGAGATGAGTTTTAAAGCTCTAAGGGGAAGCCCAAGAGGGAAAGCCCCAAAAGGACGCTATCAGCTAGTAGTGGGCTTGGGCTGTTACAAATGGTATCAAAGTCAACACATGGCGATGTGCTAGTGAGGAGGCTATTCCCTGAAGGGGATAGACACGAGGTGGTGTGCCAGTAAGAATACTGGCCCCAAAGGGGGTGGATTTGGCGGGGTCCCACATCAATTAGAGGAAGGAAAAAGTGTCAGCGCGGACGCTCGACCCTAAAGAATGGTGGATTGTGATATCCCACATTGGTTGGAGAGGAGAACGAAACACTCTTTATAAGAGTGTGAAAACCTTCCCCTAGTCTACTCATTTTAAAGTCTTGAGGAGAATCTCGAAAGTGAAACCCAAAGAATACAATATCTATTAGCGATGAGCTTGGACTGTTTGATTGTTTGATTTTGTAATGAGCTGAATATTCAATTTTTTTTTCAGTGGGTTAGCCTTTGTGAGGCGTTTCTAGTGGAAGCAGAATGGTTTGGTTCTAGCCATTTGCCAAGTGCCGAAGAATACTTGGAGAATGGAGCAGTTAGTTCTGGAGTTCATGTTGTTTTAGCTCACATCTTCTTTCTCTTAGGCCAAGGTGTGAGCAACGAGGCAGTTCTTTTGAGTAGCAATCCAGACATTGTGTCTTCCACTGCGTCAATTCTTCGACTCAGCGATGATTTGGGAAGTGCAAAGGTAATCTAAGAACGTTTATTTCATTATGTTGAAAATTATTGGGATGGAGTCCCACGTTAACTAATTCAGGAAATGATCATGAATTTATAAGTAAATAATACATCTTACGAGACCTTATGTTCAAAGTGAACAATATCATGTCATTGTTTAAGTGATGAATATCGTTTAATTTATTACACAGGACGAGAACCAAGAAGGACATGATGGGTCTTATATAGAATGCTATATGAAAGAAAATCCAGAGATCTCTGTTGACAGCACACGGGAGCGTATTAGCCACATGATATCCGATGCGTGGAAGAGGCTGAACCAAGAAAGCCTGTTTTCTCCAAATCCATATCCACCGACATTTATTCAAGCTTCTCTAAATATTGCTAGATTTGTTCCTCTTTTGTATGGTTATGATGAAAATCAAGACCTTCCAACGCTAGAGAAGCTTGTGAAGTTTGTGCTTTATGAAAGTGTAGGAGTTTAAGAGACGTTTAAGCTCTTCACCGAGCTTAAAAAATATAATTGAGTCACGAAAAAGAAAAAATATACTACTCTTTGTTTGCATTACCCTACAAATATAGGACGTGTTGTGACAACAAAGAGATTTCATGTACATTTAAGGTCTATTGTCATCACAAAATTAAACCTCTATAAACGCTTATAAAATATTTTAACATAACCCAACTCATAAATTTATTTATCTCTCTACCAGTCTAAGGTTCAAATGTAACCGTTTAAGTCTACCGCTAGCAGATGTTGTCTTATTTGGATTTTTTCGGACTTTCCCTC
AAAAATTTTAAAATGTGTTAGTTAAGAGAGGTTTCCACACCCTTACAAAGAATGTGTTTTAGTTAAAAGAGGTTTTCACACCCTTACAAAGAATATTTCGTTCCTCTCTCCAATCGAAGTGGGATCTCACTATCCACCCCCTTGAGGGCCAACATCATCGCTGACACACTGTCTCATGTTTGGCTCTGATACAATTTGTAACAGCTCAAGCACACTCCTAACAGGTCTTTCCCTCTCGAGCGTCTCCTCAAAGTTTTTAAAACGCACCTGTTGGAAGTAAAACTTCGAAGGAAAGCCTAGAAATCAGAATTCAAAGAGAACGATATTCGTGAGTAAGCTGATAATACTTAATAAATTTAAACTAAAAAGTAGAATCAAAATCCAATACTTAATGGAGGCTATTAGTTCATTAAATGCACACTCCCTCCAAAAATATTGGAAACTGGAACACATTTATTTTCACAGTTACCAAAAATGTTTGTATAAATGTCTGGAAAAAAAGAAAAAGAAACTAAATAGAAAGTGGTCACTCCGAGGGTGTTTGTCTTTGTCTCTGTCTTCCACTTCTTTATTAGCTTCAATATTCGTTATCCTAATATTCAATGTTTCAAACAATCTAAAACGACAACTTCATTCATTGAAAATATATGGATCAAAACAATAATTTGAATTCCTCTATCCCTTATATGTTACAACCTGCAAATTAATTGATCTAAGCAAACTTGTTTATAGCTCAATTAGAACACACAGTTTGATTTTTCTTTGCTGCTATGAGCTTTGGATTCATGGCTTTCAGTTCTTCTCTTACCTGCATATTTCCATTAGCAGTGCAGGTAGAAGGCGAGACTCGTACAACTTTACTGATCATACTGCGACAGAACAGATCTATGGGGCTGTATTGCTTGGAAATCAGTGCTCTTCTACAAGGAAATTTTTGGTTTTGTGTCAAATGCTTCAAAGTTTCAGTAGAGAGCTTACTGTAGTTCAGTGAAGAACAGATTTTGATCTTTTCATCTTCACATAATTCAACATGAACCTGTCAATATCAAGACTTTAGTGTTAGTTCCCACTGAAAGGAAGTTGGATTTCCTGTGAAAAATCCCACATTTTTCATTCAGTTTAATGGAAGTTCATTACCTGAAAGTAGATATCAATTGCTTGATAAAGTTTGTCATGAGATTCTCTTGAATTTGACAGTGCCATGGCTAAAGCATTAAACTTGGAGGCCTTCAAATGTGAATCTGGAGCTACTTCCATCAAGTAAAAATCCATCAATCTAGCAACTTTCTCAAATTTACAAAGCTGGAACGGGCAGATTTTGGATTCAACCTGAAAGGACTTTAAAAGCCTCAGAACCAAATTCACATCATACACATAATCCTTTCCACAGGGAGCGGGAATAAGCAAATGGCTCACTGTCAATTGATCCAATTGTGAACCCATCAAAACCTCTAGTTTGTGTTTAGATTTTCTGCTTATCTTCAAACTCAGAGCTGCCTGATATATGTCAAACAAACCTTCACCAGAAAAAGAACTCTGGTGATCAAGCAAACAGAGTAAATCAATAACACGCTCAGTGATTCTCTGCTGCTCATTTAGTTCAGAATGACAAGATCTTGATTTTCTATAGAAAAAGAGAAACCTACAGATTATCTTGTGGTCAAGATTGCGTGAAACCATCATTTTTGTTACTTTCTGGATCATGTCAATATTCAAGATCACCAAATCTTGAAACCACCAAGTTGTTCTAGTGAACTTGCTGGTGGCACTCTCATGGCTTTTAGTATCATAAGAATAACGAAAACTAGAGCTGGTAGAAAAAGATGAACTTGTACTTTCATAACTAGAAAAATCAAGCCTGCCTATCAGGGAATCCAAGAGTTTATGAACCAAAAATGAATCTTTCCTGACTGAAAGACAAACTTGGTACTGTTTCAAGGCCAACAGAAGCTCAGACCAGCTCCAAAATTTCAGTCCTTGGAAGGATTTTTTGGTTTCAG
CTATTAAGTGGGTCTCTGCAGAATCACTAGGACATTCCAGCTCCAAGAAAAGGGCTGCATAGTGAAGAAGGACTATATTAGATGGGGTGATTTCAACTCTTCCACCATTGTAGCAAAACCTTGCAATATGTTCAAAGCCTTCAGGACCTCCAGGAAAGCCATGTAAGACAACTTTCAGACCCCTTGCTACATCTGTGGATGTTCTTATTAACTTTCTTAGCCTACCAGAGAAAGAAAGAATGATTCTCTGCAAATTTCAATTGCAAACGCTGCTGTTAGAAATTTTCTGAAAATGTTTATTAGATAAAACAGACAAATGAATGGAAAGGCAGTATAAGCAAATGGGTAACAGAGGAACATGAAGGGAACAGAGCTTCAGAGTAATCATGGAAACTTTTGAGGAAGCTTTCAGAGGGAAGCTCTTTTGCTTTTTTACTTTATTGGTAATAGAAATGAAGGGAGTGACAAAAAAACATATATATATAAATATTATGAACTGCATATACCTTGTCCACCATAAAAGTTGCTTCTCCATTGACATCTACTTCAAGATCACAACACACTCCCATTCTTCAATATGAAACATATAAACACATATGAGTTCATTCAGCTTCAGAAGATCTGCATTTGGAAAGAGAGGAAGTCTTGACACAAGATTGAAAAAATGAGAATCCTCAAGAGTTGCATGCAAGAAAAGGAAAGTGGAAAGAAAAAAGGGGAAAAGGTGGGAAGAGTTCTCTTTGAATCTTCTACTGTAACTTAACGACAGTTTTTGAGAGTTCTAACTTTTTTTTTTCTTTTTGATGCTTCTTACAACAAAAACCAAATTCAAAAGATAAATTTAAGACGGCAGAACTGAGGCGAACTGACCAAGATTGGTTGAACAACATGGTCTTAATTTCCACGACGATTCTTCTGAGAAATCTTGGGTCTATCAAAGCACTAGTACCAAGTAGTAGAAACAAAAGACAGAGCTTAAAAGTTGGAAATTCAGAACCCAAGAAGCCGACGAGAACTGGTAGTGTCATTCCAAAGTGACGTTTGTTAGCAAATGGGCATAGGCCTAATTATTTTAGTGCGGCTTTTTCCGGTACTCAAGCCACCAATTCGTGGATAGTGGTGGTTAGGCCCTTTGGGTGGATGTTGGATCCCACAGGCTTGAAAGTCGAAACTAACAAGTCCTTTTAGTGGAAGCCAGATGTTGGGAAAAGATGCTAACTCAGTTTCTGGGCAGCAATTTTTGGCACTACGATAAGAGCTTGAAGCCATAAGAGTCGACAAGCTTGGTTTTGTGGCAGGTTCTATCGTAGTGCCACCTGAATTCTAATAATGATCACTTCCTATTTGGTTAATGCAGAAAAAAAATTGCGATGAAAAATGACTTGTTAAACGTAATAACAGACAATTTTCTGCAAAGAAAGTTTTCAGCAAACGATCATTCTGCAATACTACTGAATAAACATTCTCAAGGGAGAGAAAAATTCATAAAGTTTCGCCATGAAAATTAATGAAGCAGCTGCCATGTTTCACTGTCAATACAATCGTTTCCACCACGTAGCAAAGCCAAACAACAAACTAAAAGTTTCAACTTCTTTTTGTTTATTGCGTTTGTTCCCACCCATTTCCTTTCCTTTCTTTTTTTTCTTTGTTCTTTTTCCGACGACCGAGAAGGGTTGCTATTTTAATAACCATTAATTTGAGGTCTAGGGGATGGACCCATTAGGCGCGAAAAAGGCATCTGAGACCGGCCTGGCAGGTTCCTTTGACTGGGAACCCTATCGGGCACGTTAGAATCTGAGACTGATCTAGAGAAGGCTTGAGCCAGCTCAAGGGCAGAAGCTACTTTCCCATAACGCAAACCAACAGCGTTTGAGGCAGGTTTCGGCTGCTGCTGCTCAGCAGGCTTGCGCCAAGTCTCTGGTGAAGGTGGCCGCTCCAGCTGCTGCTGCTGTTGTTTTTCAGTTTCTCTATTCTTCCTTCGGTTATCACGGAAATTCCTC
CTTTGCATCTCTGGATCTGCCCTGTTATCCTTCCTCTCACCCTCCTTACCTATCCTTCGATCAAAAGATGGATTCTCCATTTTTCCAGACTGATGCACAGGAACAGGGGGTCCTTGAACCCTTTCAGTCCTTGAAAAATTTTCGTCAATCCTTCCAACAACAAAACGTAACGCATAAACAGTTATCAATGAAAGCTTCTTCTTTTTCCTTGTCCCCTTTTTTTCTTTTGTAGGGATGGGGGAGGGAGGAGGGCCAGATGGGAGAGAGACAGAGAAA

mRNA sequence

GGTGCGGTGGCTAAAGTGGAGAGTTTGAAGCATGTTTTGAGAGAATCAGGCGATTGTTTGGAGCGTTTGGACCTAATTGATGCAGCCCAACGCCTAGGCATTGACAACCATCTCCAAGAGGAAATTGAAGCTGTTCTAAAGCGGCAATACTTCTTATTGAATGCGCTTCAATTTGATCCAATGATTGATCTTCACAAGGCTGCTCTCCTTTTTCGACTTTTGACTCAACAGGGCTTCCTTGTCACTCCAAAGCTAAGGCATGATATCAAGGGGCTGACAAGTTTACATGAAGCTTCGCAGCTATGCATGCATGGAGATGATATTCTTGAAGAAGCTCAAAATTTCAGTAGCCATTGGCTAAACGCTTGGGTTGTTGTTCATGTCAACCATCATTCAGCCACTTTCGTTCATAACACTCTACTTCATCCTTATCATAAAACCTTGCCCCAATTCATGTTGCCTAACTATTTTGGCGACAACCAATGGACAAACAAATGGATACACACTTTGCAAGGCGTAGCAAAAGTGAATTTCAATACAACTCAACGCTTACGCCAATACGAACTTAACCAATTCACAAAATGGTGGAAAGAATCAGATTTGGGTAGAGAGCTGAATTTTGCAAGAAATCAGCCCATGAAATGGTACATCGCGTCGCTATCGTGTCTGACGGATGTGTGCCATTCTGAACAGAGAATCCAACTCGCCAAAGCCATTGCTTTTGTTTATCTTATTGATGATATCTTTGATGTATTTGGAACGCTGGAAGAGCTCACCATGTTCACAGAAGTTGTTTGTAGATGGGATTTGGTTGCTGCTGAAAAGTTACCAAGCTGCATGCGTATATGCTTCAAATCTCTATTTGTAGTTACGAATGAAATAAGTGACCAAATATATCAAAAGCATGGTTGGAATCCAGCTACCTCCCTACAGACAGCGTGGGGTAAATTGTGCAAGGCATTTTTAGTTGAGGCAGAATGGTTCAGCTCAGGGCATTGGCCAAGTGCAGAAGAATATTTAAGAAATGGCAGAGTTAGCACGGGCGTCCACGTTACTTTAGTTCATGTTTTCTTTCTTCTTGGACACCAAATAAGGCAGCAAACCGTGGAGCTTTTGGAGGGGGATTTAGATATTGTTTGGTGCAGTGCTACAATATTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGATGAGAATCAAGAGGGGCGTGATGGGTCGTACGTAAAATACTACATGAAGGAGCATCCAAGCACGTCCTTTAATGCCACACAAAAGCATATTATGTGGAAGATTTCTGAAGCCTGGAAGAGTTTGAATAGAGAATGGCTGCATTGTAATCCATTGTTTCCTTCCCAATTCACTCAAGCTTCTCTTAACATTGCCAGAGCAGTTCCTTTGTTCTATAGTTATGGCAGAAACCAGTCCTTTCCCACCTTCGAGCAGCATATGGAACGATTGCTCTTTCATAGCCTGGATATTTACAATGATTCTCCTGTTAACAAGCGCTTGTTTAACAAACAAACCATTGTTCTTCGTGATCATACATTACTTTCCACTCACTTTAAACCCAATCGTTCCAAGAACCAATCTTTCATTTTAACAGATGATATTGGGCATGAAGCTCCAGTTAAAGGGAGGATTTTGAAGCATGTTTTGAGAGAAATAGGAGACCCTTGGGAATGTTTGAATCTAATTGATGCAACCCAACGCCTTGGCATTGATTATCATTTCCAAGAGGAAATTCAAGCTATTCTGCAAAGGCACTACGTATTTTTCTTGGACAAGGAGGGGAAGTTCAAGGAAGGGTTGAAAGATGATATAAAGGGGCTGACGAGTTTATATGAAGCTTCGCAGCTATGCATGCATGGAGATGAGATTCTTGAAGAAGCTGAAAACTTTAGCAGCCATTGGCTAAAGGCTAAGGCTGAAGATGAAGAGGTTGATCATCATTTGGCCAGTTTTGTTCAGCATACTTTAGCTTATCCTCATCATAAAAGTGT
GGTGCAATTAATGGCGCCTAACTATTTGAATGATATGCAATGGCCAAACAAATGGATTAATTTGGCGAAAGATTTGAATTTCTCTAGAGATCAACCCATTAAATGGTACGTTGCTTCACTGATTTGCCTCAGCACAGATTCCTTCTACTCCGAACAAAGAATCCAACTTGCAAAATCCATCTCTTTCATATATCTCATCGACGACATTTTCGACTGGGCTAAATTATGCAACGCATTTTTGGTGGAAGCTGAATGGATGAGTTCTGGGCAATCACCAAGCGCCGAAGAGTATCTAAAAAATGGAGTGGTTAGCACAGGCGTCCACGTTACATTAACACATGTCTTCTTTTTGTTAGGCGAGGCAATAAGCAAGGAAACTGTGGAGCTCTTTGATGAGGATTTAGATATCATTTCATCTAGTGCTACAGTTTTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGATGAGAAGCAAGAGGGGCATGATGGGTCGTATTTAGAATATTACATGAAGGAGCATCCGAGTATGTGTTACGAAGAAACGAAGCGACATACTATGACGCAAATTTGTAATGCATGGAAGACTCTGAATACAGAATGCTTATTATCCAATCTATTTCCTGCCAAGTTCAATCAAGCTTGTCTCAATCTCGCAAGAGTGGTTCCTATTGCATATAACTATGCAGCATCAGCAATGCTAATAGTTGGAGTAATTCCTCATCCTCCTCCTCCTCCTCCAAAAGCTTCCATGCACAATATGGTTCCTCACTCTCAAAAATGGACCATTCCTCAACATCACTCCTCCCTTCCAATTTCCCTTCCTGCAGATTCACAAATTTGTCCTTCAATTGAGGAAATGAGCATGAAATATGAACTTGAAATGAAGAATCTGAAGCATTTGTTGAGTGAGACGGCCAAAATTGATTCTTTGGAGAGCTTGAATATGATTGATGCAATACAACGCTTGGGGATTGACCACTGCTTCAAACAGGAGATTAAAGCAATTCTTCAAACACAATACACAATGGAAACTCACAATTTTGATGCTAAATGTGGCCTTCATCACGTTGCTCTTCGCTTTCGACTCTTCAGACAACATGGCTATTTTGTACCACAAGATGTTTTTGAAGGCTTCATCGATCACGAGGGTCTATTAGACACAAAATTTATTGAAAATATCGAAGGGCTCACGAGTTTATACGAAGCTTCCCAATTATGCTTCCCCGAAGACGAAAAACTCGAGAAAATTGGAAATTTTAGTGCTCGTATCTTGAAGAAACTTGCGTGGAATCGCGACGATAATTTAGGCAAACATGTGAGGAAAGCTGTGGCCAATCCCTTTCATAAGAGCTTGGTGAAATTTGTGGTGATGGATTACTTTGGATCCCAATCTCCAAATAAGTGGATATATGTTTTCCAACCCATGGCAAAATTGGATTTCAATAGACTCCAGAACCTACACCGACTTGAATTGTCTCAATTCATTATGTGGTGGAAGAAGACAGGATTATGTGAAGAACTGAAGTTTGCAAGAGATCAGCCACTTAAATGGTACATTTGTTCGATGGCCGTCTTAGCAGATCCAATGTTTTCAGAAGAGAGAGTTGAGCTGACAAAATCTATATCTTTTATCTATCTTATTGACGACATCTACGATGTATACGGTTCACTTGAAGAACTTAGACTATTCACTAAAGCAATCCAAAGGTACATGCATGTTTTGTGGACTGACCTCAGTTCATGCAGATGGGACTTGGCAGCTCTCGACGGCTTACCAAACCCAATGAAATTTTGCATTATCAAACTTCATGAAACAACAAATGAAATATGTCACAACCATTTGCCAAGTGCCGAAGAATACTTGGAGAATGGAGCAGTTAGTTCTGGAGTTCATGTTGTTTTAGCTCACATCTTCTTTCTCTTAGGCCAAGAAAATCCAGAGATCTCTGTTGACAGCACACGGGAGCGTATTAGCCACATGATATCCGATGCGTGGAAGA
GGCTGAACCAAGAAAGCCTGTTTTCTCCAAATCCATATCCACCGACATTTATTCAAGCTTCTCTAAATATTGCTAGATTTGTTCCTCTTTTGTATGGTTATGATGAAAATCAAGACCTTCCAACGCTAGAGAAGCTTGTGAAGTTTGTGCTTTATGAAAGTGGGATGGACCCATTAGGCGCGAAAAAGGCATCTGAGACCGGCCTGGCAGGGATGGGGGAGGGAGGAGGGCCAGATGGGAGAGAGACAGAGAAA

Coding sequence (CDS)

GGTGCGGTGGCTAAAGTGGAGAGTTTGAAGCATGTTTTGAGAGAATCAGGCGATTGTTTGGAGCGTTTGGACCTAATTGATGCAGCCCAACGCCTAGGCATTGACAACCATCTCCAAGAGGAAATTGAAGCTGTTCTAAAGCGGCAATACTTCTTATTGAATGCGCTTCAATTTGATCCAATGATTGATCTTCACAAGGCTGCTCTCCTTTTTCGACTTTTGACTCAACAGGGCTTCCTTGTCACTCCAAAGCTAAGGCATGATATCAAGGGGCTGACAAGTTTACATGAAGCTTCGCAGCTATGCATGCATGGAGATGATATTCTTGAAGAAGCTCAAAATTTCAGTAGCCATTGGCTAAACGCTTGGGTTGTTGTTCATGTCAACCATCATTCAGCCACTTTCGTTCATAACACTCTACTTCATCCTTATCATAAAACCTTGCCCCAATTCATGTTGCCTAACTATTTTGGCGACAACCAATGGACAAACAAATGGATACACACTTTGCAAGGCGTAGCAAAAGTGAATTTCAATACAACTCAACGCTTACGCCAATACGAACTTAACCAATTCACAAAATGGTGGAAAGAATCAGATTTGGGTAGAGAGCTGAATTTTGCAAGAAATCAGCCCATGAAATGGTACATCGCGTCGCTATCGTGTCTGACGGATGTGTGCCATTCTGAACAGAGAATCCAACTCGCCAAAGCCATTGCTTTTGTTTATCTTATTGATGATATCTTTGATGTATTTGGAACGCTGGAAGAGCTCACCATGTTCACAGAAGTTGTTTGTAGATGGGATTTGGTTGCTGCTGAAAAGTTACCAAGCTGCATGCGTATATGCTTCAAATCTCTATTTGTAGTTACGAATGAAATAAGTGACCAAATATATCAAAAGCATGGTTGGAATCCAGCTACCTCCCTACAGACAGCGTGGGGTAAATTGTGCAAGGCATTTTTAGTTGAGGCAGAATGGTTCAGCTCAGGGCATTGGCCAAGTGCAGAAGAATATTTAAGAAATGGCAGAGTTAGCACGGGCGTCCACGTTACTTTAGTTCATGTTTTCTTTCTTCTTGGACACCAAATAAGGCAGCAAACCGTGGAGCTTTTGGAGGGGGATTTAGATATTGTTTGGTGCAGTGCTACAATATTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGATGAGAATCAAGAGGGGCGTGATGGGTCGTACGTAAAATACTACATGAAGGAGCATCCAAGCACGTCCTTTAATGCCACACAAAAGCATATTATGTGGAAGATTTCTGAAGCCTGGAAGAGTTTGAATAGAGAATGGCTGCATTGTAATCCATTGTTTCCTTCCCAATTCACTCAAGCTTCTCTTAACATTGCCAGAGCAGTTCCTTTGTTCTATAGTTATGGCAGAAACCAGTCCTTTCCCACCTTCGAGCAGCATATGGAACGATTGCTCTTTCATAGCCTGGATATTTACAATGATTCTCCTGTTAACAAGCGCTTGTTTAACAAACAAACCATTGTTCTTCGTGATCATACATTACTTTCCACTCACTTTAAACCCAATCGTTCCAAGAACCAATCTTTCATTTTAACAGATGATATTGGGCATGAAGCTCCAGTTAAAGGGAGGATTTTGAAGCATGTTTTGAGAGAAATAGGAGACCCTTGGGAATGTTTGAATCTAATTGATGCAACCCAACGCCTTGGCATTGATTATCATTTCCAAGAGGAAATTCAAGCTATTCTGCAAAGGCACTACGTATTTTTCTTGGACAAGGAGGGGAAGTTCAAGGAAGGGTTGAAAGATGATATAAAGGGGCTGACGAGTTTATATGAAGCTTCGCAGCTATGCATGCATGGAGATGAGATTCTTGAAGAAGCTGAAAACTTTAGCAGCCATTGGCTAAAGGCTAAGGCTGAAGATGAAGAGGTTGATCATCATTTGGCCAGTTTTGTTCAGCATACTTTAGCTTATCCTCATCATAAAAGTGT
GGTGCAATTAATGGCGCCTAACTATTTGAATGATATGCAATGGCCAAACAAATGGATTAATTTGGCGAAAGATTTGAATTTCTCTAGAGATCAACCCATTAAATGGTACGTTGCTTCACTGATTTGCCTCAGCACAGATTCCTTCTACTCCGAACAAAGAATCCAACTTGCAAAATCCATCTCTTTCATATATCTCATCGACGACATTTTCGACTGGGCTAAATTATGCAACGCATTTTTGGTGGAAGCTGAATGGATGAGTTCTGGGCAATCACCAAGCGCCGAAGAGTATCTAAAAAATGGAGTGGTTAGCACAGGCGTCCACGTTACATTAACACATGTCTTCTTTTTGTTAGGCGAGGCAATAAGCAAGGAAACTGTGGAGCTCTTTGATGAGGATTTAGATATCATTTCATCTAGTGCTACAGTTTTGCGGCTTTGGGATGACATGGGAAGTGCCAAGGATGAGAAGCAAGAGGGGCATGATGGGTCGTATTTAGAATATTACATGAAGGAGCATCCGAGTATGTGTTACGAAGAAACGAAGCGACATACTATGACGCAAATTTGTAATGCATGGAAGACTCTGAATACAGAATGCTTATTATCCAATCTATTTCCTGCCAAGTTCAATCAAGCTTGTCTCAATCTCGCAAGAGTGGTTCCTATTGCATATAACTATGCAGCATCAGCAATGCTAATAGTTGGAGTAATTCCTCATCCTCCTCCTCCTCCTCCAAAAGCTTCCATGCACAATATGGTTCCTCACTCTCAAAAATGGACCATTCCTCAACATCACTCCTCCCTTCCAATTTCCCTTCCTGCAGATTCACAAATTTGTCCTTCAATTGAGGAAATGAGCATGAAATATGAACTTGAAATGAAGAATCTGAAGCATTTGTTGAGTGAGACGGCCAAAATTGATTCTTTGGAGAGCTTGAATATGATTGATGCAATACAACGCTTGGGGATTGACCACTGCTTCAAACAGGAGATTAAAGCAATTCTTCAAACACAATACACAATGGAAACTCACAATTTTGATGCTAAATGTGGCCTTCATCACGTTGCTCTTCGCTTTCGACTCTTCAGACAACATGGCTATTTTGTACCACAAGATGTTTTTGAAGGCTTCATCGATCACGAGGGTCTATTAGACACAAAATTTATTGAAAATATCGAAGGGCTCACGAGTTTATACGAAGCTTCCCAATTATGCTTCCCCGAAGACGAAAAACTCGAGAAAATTGGAAATTTTAGTGCTCGTATCTTGAAGAAACTTGCGTGGAATCGCGACGATAATTTAGGCAAACATGTGAGGAAAGCTGTGGCCAATCCCTTTCATAAGAGCTTGGTGAAATTTGTGGTGATGGATTACTTTGGATCCCAATCTCCAAATAAGTGGATATATGTTTTCCAACCCATGGCAAAATTGGATTTCAATAGACTCCAGAACCTACACCGACTTGAATTGTCTCAATTCATTATGTGGTGGAAGAAGACAGGATTATGTGAAGAACTGAAGTTTGCAAGAGATCAGCCACTTAAATGGTACATTTGTTCGATGGCCGTCTTAGCAGATCCAATGTTTTCAGAAGAGAGAGTTGAGCTGACAAAATCTATATCTTTTATCTATCTTATTGACGACATCTACGATGTATACGGTTCACTTGAAGAACTTAGACTATTCACTAAAGCAATCCAAAGGTACATGCATGTTTTGTGGACTGACCTCAGTTCATGCAGATGGGACTTGGCAGCTCTCGACGGCTTACCAAACCCAATGAAATTTTGCATTATCAAACTTCATGAAACAACAAATGAAATATGTCACAACCATTTGCCAAGTGCCGAAGAATACTTGGAGAATGGAGCAGTTAGTTCTGGAGTTCATGTTGTTTTAGCTCACATCTTCTTTCTCTTAGGCCAAGAAAATCCAGAGATCTCTGTTGACAGCACACGGGAGCGTATTAGCCACATGATATCCGATGCGTGGAAGA
GGCTGAACCAAGAAAGCCTGTTTTCTCCAAATCCATATCCACCGACATTTATTCAAGCTTCTCTAAATATTGCTAGATTTGTTCCTCTTTTGTATGGTTATGATGAAAATCAAGACCTTCCAACGCTAGAGAAGCTTGTGAAGTTTGTGCTTTATGAAAGTGGGATGGACCCATTAGGCGCGAAAAAGGCATCTGAGACCGGCCTGGCAGGGATGGGGGAGGGAGGAGGGCCAGATGGGAGAGAGACAGAGAAA

Protein sequence

GAVAKVESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMIDLHKAALLFRLLTQQGFLVTPKLRHDIKGLTSLHEASQLCMHGDDILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKWIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVSTGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLFYSYGRNQSFPTFEQHMERLLFHSLDIYNDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDIGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFFLDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHWLKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWINLAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAKSISFIYLIDDIFDWAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPSMCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNYAASAMLIVGVIPHPPPPPPKASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAKIDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQHGYFVPQDVFEGFIDHEGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARILKKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQNLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELTKSISFIYLIDDIYDVYGSLEELRLFTKAIQRYMHVLWTDLSSCRWDLAALDGLPNPMKFCIIKLHETTNEICHNHLPSAEEYLENGAVSSGVHVVLAHIFFLLGQENPEISVDSTRERISHMISDAWKRLNQESLFSPNPYPPTFIQASLNIARFVPLLYGYDENQDLPTLEKLVKFVLYESGMDPLGAKKASETGLAGMGEGGGPDGRETEK
Homology
BLAST of Cp4.1LG20g03980 vs. ExPASy Swiss-Prot
Match: P0CV94 ((3S,6E)-nerolidol synthase 1 OS=Fragaria ananassa OX=3747 PE=1 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 1.4e-140
Identity = 258/506 (50.99%), Positives = 345/506 (68.18%), Query Frame = 0

Query: 5   KVESLKHVLRESG--DCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMI 64
           K+E LK VLR     D LE L++IDA QRLGID + Q EI+ +L +Q  +++A       
Sbjct: 21  KLELLKTVLRNVAELDALEGLNMIDAVQRLGIDYNFQREIDEILHKQMSIVSARD----- 80

Query: 65  DLHKAALLFRLLTQQGFLVTPK---------------LRHDIKGLTSLHEASQLCMHGDD 124
           DLH+ AL FRLL Q G+ V                  L  DIKGL SL+EASQL   G+D
Sbjct: 81  DLHEVALRFRLLRQHGYFVPEDVFNNFKDSKGTFKQVLGEDIKGLMSLYEASQLGTEGED 140

Query: 125 ILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKWI 184
           IL EA+ FS H L    + H++HH    V NTL +P+HK+L  FM  N+F  +Q TN W+
Sbjct: 141 ILVEAEKFSGHLLKT-SLSHLDHHRVRIVANTLRNPHHKSLAPFMARNFFVTSQATNSWL 200

Query: 185 HTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVC 244
           + L+ VAK +FN  + L Q E+ Q +KWWKE  L +EL FAR+QP+KWYI S++CLTD  
Sbjct: 201 NLLKEVAKTDFNMVRSLHQNEIVQMSKWWKELGLAKELKFARDQPLKWYIWSMACLTDPK 260

Query: 245 HSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSL 304
            SE+R++L K I+FVYLIDDIFDV+GTL++L +FTE V RW++ A + LP  M+ICFK+L
Sbjct: 261 LSEERVELTKPISFVYLIDDIFDVYGTLDDLILFTEAVNRWEITAIDHLPDYMKICFKAL 320

Query: 305 FVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVST 364
           + +TNE S ++Y KHGWNP  SL+ +W  LC AFLVEA+WF+SG  P +EEYL+NG VS+
Sbjct: 321 YDMTNEFSSKVYLKHGWNPLQSLKISWASLCNAFLVEAKWFASGKLPKSEEYLKNGIVSS 380

Query: 365 GVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSY 424
           GV+V LVH+FFLLG  I +++VELL     I+  SA ILRLWDD+GSAKDENQ+G DGSY
Sbjct: 381 GVNVVLVHMFFLLGQNITRKSVELLNETPAIISSSAAILRLWDDLGSAKDENQDGNDGSY 440

Query: 425 VKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLF 484
           V+ Y++EH   S    ++  +  IS+ WK LNRE L  NP FP+ FT ASLN+AR +PL 
Sbjct: 441 VRCYLEEHEGCSIEEAREKTINMISDEWKKLNRELLSPNP-FPASFTLASLNLARMIPLM 500

Query: 485 YSYGRNQSFPTFEQHMERLLFHSLDI 494
           YSY  NQ  P+ +++M+ +L+ ++ +
Sbjct: 501 YSYDGNQCLPSLKEYMKLMLYETVSM 519
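
The percentages in each HSP header follow directly from the two counts and the alignment length (for this hit, 258 identical and 345 positive positions out of 506 aligned columns). A minimal sketch of how such a header line could be parsed and re-checked is shown below; the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Hypothetical pattern for the HSP summary line shown on this page.
# It accepts both "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity/positives percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Percentages recomputed from the raw counts.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 258/506 (50.99%), Positives = 345/506 (68.18%), Query Frame = 0"
)
print(stats)
```

Comparing the recomputed values with the reported ones is a cheap sanity check when these records are scraped in bulk.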

BLAST of Cp4.1LG20g03980 vs. ExPASy Swiss-Prot
Match: P0CV95 ((3S,6E)-nerolidol synthase 2, chloroplastic/mitochondrial OS=Fragaria ananassa OX=3747 PE=1 SV=1)

HSP 1 Score: 497.3 bits (1279), Expect = 5.8e-139
Identity = 255/506 (50.40%), Positives = 342/506 (67.59%), Query Frame = 0

Query: 5   KVESLKHVLRESG--DCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMI 64
           K+E  ++VLR     D LE L++IDA QRLGID H Q EI+ +L +Q   ++A       
Sbjct: 80  KLELFRNVLRNVAELDALEGLNMIDAVQRLGIDFHFQREIDEILHKQMSNVSASD----- 139

Query: 65  DLHKAALLFRLLTQQGFLVTPK---------------LRHDIKGLTSLHEASQLCMHGDD 124
           DLH+ AL FRLL Q G+ V                  L  DIKGL SL+EASQL   G+D
Sbjct: 140 DLHEVALRFRLLRQHGYFVPEDVFNNFKDSKGTFKQVLGEDIKGLMSLYEASQLGTEGED 199

Query: 125 ILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKWI 184
            L EA+ FS H L    + H++HH A  V NTL +P+HK+L  FM  N+F   Q TN W+
Sbjct: 200 TLVEAEKFSGHLLKT-SLSHLDHHHARIVGNTLRNPHHKSLASFMARNFFVTTQATNSWL 259

Query: 185 HTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVC 244
           + L+ VAK +FN  + L Q E+ Q +KWWKE  L +EL FAR+QP KWYI S++CLTD  
Sbjct: 260 NLLKDVAKTDFNMVRSLHQNEIVQISKWWKELGLAKELKFARDQPQKWYIWSMACLTDPK 319

Query: 245 HSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSL 304
            SE+R++L K I+FVYLIDDIFDV+GTL++L +FTE V RW++ A + LP  M+ICFK+L
Sbjct: 320 LSEERVELTKPISFVYLIDDIFDVYGTLDDLILFTEAVNRWEITAIDHLPDYMKICFKAL 379

Query: 305 FVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVST 364
           + +TNEIS ++YQKHGWNP  SL+ +W  LC AFLVEA+WF+SG  P ++EYL+NG VS+
Sbjct: 380 YDMTNEISCKVYQKHGWNPLQSLKISWASLCNAFLVEAKWFASGQLPKSKEYLKNGIVSS 439

Query: 365 GVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSY 424
           GV+V LVH+FF+LG  I  ++VELL     ++  SA ILRLWDD+GSAKDENQ+G DGSY
Sbjct: 440 GVNVVLVHMFFILGQNITTKSVELLNETPAMISSSAAILRLWDDLGSAKDENQDGNDGSY 499

Query: 425 VKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLF 484
           V+ Y++EH   S    ++  +  IS+ WK LNRE L  NP FP+  T ASLN+AR +PL 
Sbjct: 500 VRCYLEEHEGCSIEEAREKTINMISDEWKKLNRELLSPNP-FPATITLASLNLARMIPLM 559

Query: 485 YSYGRNQSFPTFEQHMERLLFHSLDI 494
           YSY  NQ  P+ +++M+ +L+ ++ +
Sbjct: 560 YSYDGNQCLPSLKEYMKLMLYETVSM 578
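
In each alignment block the middle line marks identical residues with the residue letter and conservative substitutions with `+`; a blank marks a mismatch or a gap. As a rough sketch, the per-block counts can be tallied as follows. The helper `block_stats` is a hypothetical name, and it assumes the three strings have already been stripped of their `Query:`/`Sbjct:` labels and coordinates and are of equal length.

```python
def block_stats(query, midline, sbjct):
    """Count identities, positives and gaps in one alignment block.

    Assumes the three strings are column-aligned and equally long.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identical columns: same residue in both rows, ignoring gap characters.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Positives: every non-blank midline column (identities plus '+' marks).
    positives = sum(1 for m in midline if m != " ")
    # Gap columns in either row.
    gaps = query.count("-") + sbjct.count("-")
    return identities, positives, gaps


# Final block of the alignment above (query 485-494).
query = "YSYGRNQSFPTFEQHMERLLFHSLDI"
midline = "YSY  NQ  P+ +++M+ +L+ ++ +"
sbjct = "YSYDGNQCLPSLKEYMKLMLYETVSM"

print(block_stats(query, midline, sbjct))
```

Summing these per-block counts over all blocks of an HSP should reproduce the totals in its header line, provided the text has not lost whitespace during extraction.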

BLAST of Cp4.1LG20g03980 vs. ExPASy Swiss-Prot
Match: P0CV96 ((3S,6E)-nerolidol synthase 1, chloroplastic OS=Fragaria vesca OX=57918 PE=1 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 9.3e-137
Identity = 252/506 (49.80%), Positives = 344/506 (67.98%), Query Frame = 0

Query: 5   KVESLKHVLRESG--DCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMI 64
           K+E  ++VLR +   D LE L++IDA QRLGID H Q EI+ +L +Q  +++A       
Sbjct: 82  KLELFRNVLRNAAELDALEGLNMIDAVQRLGIDYHFQREIDEILHKQMGIVSACD----- 141

Query: 65  DLHKAALLFRLLTQQGFLVTPK---------------LRHDIKGLTSLHEASQLCMHGDD 124
           DL++ AL FRLL Q G+ V                  L  DIKGL SL+EASQL   G+D
Sbjct: 142 DLYEVALRFRLLRQHGYFVPEDVFNNFKDSKGTFKQVLGEDIKGLMSLYEASQLGTEGED 201

Query: 125 ILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKWI 184
            L EA+ FS H L    + H++ H A  V NTL +P+ K+L  FM  N+F  +Q TN W+
Sbjct: 202 TLVEAEKFSGHLLKT-SLSHLDRHRARIVGNTLRNPHRKSLASFMARNFFVTSQATNSWL 261

Query: 185 HTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVC 244
           + L+ VAK +FN  + + Q E+ Q +KWWKE  L +EL FAR+QP+KWY  S++ LTD  
Sbjct: 262 NLLKEVAKTDFNMVRSVHQKEIVQISKWWKELGLVKELKFARDQPLKWYTWSMAGLTDPK 321

Query: 245 HSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSL 304
            SE+R++L K I+FVYLIDDIFDV+GTL++L +FTE V RW++ A + LP  M+ICFK+L
Sbjct: 322 LSEERVELTKPISFVYLIDDIFDVYGTLDDLILFTEAVNRWEITAIDHLPDYMKICFKAL 381

Query: 305 FVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVST 364
           + +TNE S ++YQKHGWNP  SL+ +W  LC AFLVEA+WF+SG  P +EEYL+NG VS+
Sbjct: 382 YDMTNEFSCKVYQKHGWNPLRSLKISWASLCNAFLVEAKWFASGQLPKSEEYLKNGIVSS 441

Query: 365 GVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSY 424
           GV+V LVH+FFLLG  I +++VELL     ++  SA ILRLWDD+GSAKDENQ+G DGSY
Sbjct: 442 GVNVGLVHMFFLLGQNITRKSVELLNETPAMISSSAAILRLWDDLGSAKDENQDGNDGSY 501

Query: 425 VKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLF 484
           V+ Y++EH   S    ++  +  IS+ WK LNRE L  NP FP+ FT ASLN+AR +PL 
Sbjct: 502 VRCYLEEHEGCSIEEAREKTINMISDEWKKLNRELLSPNP-FPATFTSASLNLARMIPLM 561

Query: 485 YSYGRNQSFPTFEQHMERLLFHSLDI 494
           YSY  NQS P+ +++M+ +L+ ++ +
Sbjct: 562 YSYDGNQSLPSLKEYMKLMLYETVSM 580

BLAST of Cp4.1LG20g03980 vs. ExPASy Swiss-Prot
Match: B9RXW4 (Probable terpene synthase 13 OS=Ricinus communis OX=3988 GN=TPS13 PE=3 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 4.9e-130
Identity = 244/508 (48.03%), Positives = 334/508 (65.75%), Query Frame = 0

Query: 5   KVESLKHVLRESGD-CLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMID 64
           +  + KH+LR+ G+   E L +IDA QRLGID+H Q+EI+ +L+RQY + +    +   D
Sbjct: 70  RTRAFKHILRKEGEGSHEVLAMIDAIQRLGIDHHFQDEIDEILQRQYTIPSYYNDN---D 129

Query: 65  LHKAALLFRLLTQQGFLVT---------------PKLRHDIKGLTSLHEASQLCMHGDD- 124
           LH  AL FRLL Q G+ V+                KL  DI+GL  L+EASQL +  +D 
Sbjct: 130 LHGLALRFRLLRQGGYNVSAGVFDKFKDKEGNFDQKLSDDIRGLMELYEASQLSIGAEDH 189

Query: 125 ILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTN--K 184
           IL+EA ++S   L++W +  ++   A  + NTL HP+HK L +F   N+       N   
Sbjct: 190 ILDEAGDYSHQLLSSW-MTRLDDSQARIIKNTLDHPHHKNLARFRATNFNRYFHMANIEG 249

Query: 185 WIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTD 244
           W++ LQ +AK++F   Q   Q E+ Q   WWK+  + +EL F RNQP+KWYI S++ L+D
Sbjct: 250 WMNELQELAKIDFQMVQSQNQQEIFQVAGWWKDLGISKELKFVRNQPLKWYIWSMATLSD 309

Query: 245 VCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFK 304
              S+QRI L K I+F+Y+IDDIFDV G+L+ELT+FTE+V RWD+ A E+LP  MR CFK
Sbjct: 310 PSLSQQRIDLTKPISFIYIIDDIFDVQGSLDELTLFTEIVKRWDVEAVEQLPGYMRACFK 369

Query: 305 SLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRV 364
           +L  VTNEI  ++Y++HGWNP  SL+  W  LCKAFLVEA WF+SGH P+AEEYL+NG V
Sbjct: 370 ALDSVTNEIGYKVYKQHGWNPVHSLRETWASLCKAFLVEARWFASGHLPAAEEYLQNGIV 429

Query: 365 STGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDG 424
           S+GVHV LVH+F+LLGH + ++ V+ +     I+  +ATILRLWDD+G +KDENQ+G DG
Sbjct: 430 SSGVHVVLVHIFYLLGHGVTREGVDFIGNRPAIITSTATILRLWDDLGISKDENQDGHDG 489

Query: 425 SYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVP 484
           SYV+ Y+KEH  +      K +   IS+AWK LN+E LH NP  P+ FT++ LN+AR VP
Sbjct: 490 SYVECYVKEHKGSLVEIATKKVTVMISDAWKQLNQECLHPNPFSPN-FTKSCLNLARMVP 549

Query: 485 LFYSYGRNQSFPTFEQHMERLLFHSLDI 494
           L YSY  N   P  E + + LLF S+ I
Sbjct: 550 LMYSYDDNHRLPVLEYYTKSLLFESVSI 572

BLAST of Cp4.1LG20g03980 vs. ExPASy Swiss-Prot
Match: B9RHX7 (Probable terpene synthase 4 OS=Ricinus communis OX=3988 GN=TPS4 PE=3 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 5.7e-118
Identity = 231/495 (46.67%), Positives = 315/495 (63.64%), Query Frame = 0

Query: 18  DCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMIDLHKAALLFRLLTQQ 77
           D  + L +IDA QRLGID H QEE + VL+ QY  + A        L  AAL FRLL QQ
Sbjct: 73  DHSQNLIMIDALQRLGIDYHFQEETQDVLEGQYNKIFAAH--QQHHLSDAALRFRLLRQQ 132

Query: 78  GFLV----------------TPKLRHDIKGLTSLHEASQLCMHGDDILEEAQNFSSHWLN 137
           G+ V                  +L  DIKGL  L+EASQL +  ++IL+EA  FS+H+LN
Sbjct: 133 GYYVPASDVFSELKNREGKFKQELAADIKGLMELYEASQLSIQEENILDEAGAFSAHFLN 192

Query: 138 AWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNY--FGDNQWTNKWIHTLQGVAKVNFN 197
            W   H+  H    V NTL HP+H+TL +  + N+  F + Q  N++I T   +AK++FN
Sbjct: 193 CW-TPHLCDHQTRIVSNTLKHPFHRTLARSTIKNFLHFYNFQGENEYIQTFTELAKLDFN 252

Query: 198 TTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVCHSEQRIQLAKAI 257
             Q + + E+NQ + WW    L  EL FAR+QP KW +  L  +TD   S QRI+LAK +
Sbjct: 253 MIQSIHRQEINQVSNWWNNLGLASELKFARDQPEKWCMWPLVGVTDPSLSWQRIELAKPV 312

Query: 258 AFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSLFVVTNEISDQIY 317
           + VYLIDDIFD+ GT ++LT+FTE V RW++ A E LP  M+ICF++L+ VTN+I+ ++Y
Sbjct: 313 SLVYLIDDIFDLGGTPDQLTLFTEAVNRWEITATEDLPYHMKICFRALYDVTNQIAYKVY 372

Query: 318 QKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVSTGVHVTLVHVFFL 377
           +KH +NP  SL+ AW +LC AFL EA+WF++G  P A+EYL    V++GVH+ LVH FFL
Sbjct: 373 KKHQYNPIHSLKKAWARLCNAFLEEAKWFAAGKLPKADEYLNTAIVTSGVHLVLVHTFFL 432

Query: 378 LGHQIRQQTVELLEG-DLDIVWCSATILRLWDDMGSAKDENQEGRDGSYVKYYMKEHPST 437
           +G  I  QT+ LL   D  I+   ATILRLWDD+GSA+DENQ+G DGSY++ YMK+ P T
Sbjct: 433 MGDGITDQTINLLNNDDPGIISSVATILRLWDDLGSAQDENQDGYDGSYIECYMKDFPGT 492

Query: 438 SFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLFYSYGRNQSFPT 494
           S    + H++  IS+ WK LN+  L  NP F   F +A+LN AR VPL Y +  N + P 
Sbjct: 493 SVRDARNHVISMISDTWKKLNQHCLSPNP-FSGSFIRATLNGARMVPLMYDFDSNHNLPI 552

BLAST of Cp4.1LG20g03980 vs. NCBI nr
Match: KAG7019658.1 (hypothetical protein SDJN02_18621, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2100 bits (5442), Expect = 0.0
Identity = 1077/1296 (83.10%), Positives = 1098/1296 (84.72%), Query Frame = 0

Query: 101  LCMHGDDILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDN 160
            LCMHGDDILEEAQNFSSHWLNAWVVVHVNHHSAT VHNTLLHPYHK+LPQFMLPNYFGD 
Sbjct: 1    LCMHGDDILEEAQNFSSHWLNAWVVVHVNHHSATLVHNTLLHPYHKSLPQFMLPNYFGDK 60

Query: 161  QWTNKWIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASL 220
            QWTNKWIHTLQGVAKVNFNTTQRLRQYELNQF+KWWKES LGRELNFARNQPMKWYIASL
Sbjct: 61   QWTNKWIHTLQGVAKVNFNTTQRLRQYELNQFSKWWKESYLGRELNFARNQPMKWYIASL 120

Query: 221  SCLTDVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCM 280
            SCLTDVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCM
Sbjct: 121  SCLTDVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCM 180

Query: 281  RICFKSLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYL 340
            RICFKSLF VTNEISDQIYQKH                                      
Sbjct: 181  RICFKSLFEVTNEISDQIYQKH-------------------------------------- 240

Query: 341  RNGRVSTGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQ 400
                VSTGVHVTLVHVFFLLG QI  QTVELLE DLDIV CSATILRLWDDMGSAKDENQ
Sbjct: 241  ----VSTGVHVTLVHVFFLLGQQISNQTVELLEEDLDIVSCSATILRLWDDMGSAKDENQ 300

Query: 401  EGRDGSYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNI 460
            EG DGSYVKYYMKEHPSTSF+ATQKHIMWKISEAWKSLNREWL+CNPLFPSQFTQASLNI
Sbjct: 301  EGHDGSYVKYYMKEHPSTSFSATQKHIMWKISEAWKSLNREWLYCNPLFPSQFTQASLNI 360

Query: 461  ARAVPLFYSYGRNQSFPTFEQHMERLLFHSLDIYNDSPVNKRLFNKQTIVLRDHTLLSTH 520
            ARAVPLFYSYGRNQS PTFEQHMERLLFHSLDIYNDSPVNKRLFNKQTIVLRDHTLLST+
Sbjct: 361  ARAVPLFYSYGRNQSLPTFEQHMERLLFHSLDIYNDSPVNKRLFNKQTIVLRDHTLLSTN 420

Query: 521  FKPNRSKNQSFILTDD-IGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQE 580
            FKPNRSKNQSFILTDD IGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQ+
Sbjct: 421  FKPNRSKNQSFILTDDDIGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQQ 480

Query: 581  EIQAILQRHYVFFLDKEGKFKEGLKDDIKGLTSLYEASQ-----LCMHGDEILEEAENFS 640
            EI+AILQR YV F       +     D+     L+   +     LCMHGDEILEEAENFS
Sbjct: 481  EIEAILQRQYVLF----NAVQLNSDTDLHKTAFLFRLFRQHGYLLCMHGDEILEEAENFS 540

Query: 641  SHWLKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWINLAKDLNF 700
            SHWLKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D+QW  K  +LAKDL+F
Sbjct: 541  SHWLKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLEDVQWW-KETDLAKDLSF 600

Query: 701  SRDQPIKWYVASLICLSTDSFYSEQRIQLAKSISFIYLIDDIFDWAKLCNAFLVEAEWMS 760
            SRDQPIKWYVASLICLSTDSFYSEQRIQLAKSISFIYLIDDIFDWAKLC AFLVEAEWMS
Sbjct: 601  SRDQPIKWYVASLICLSTDSFYSEQRIQLAKSISFIYLIDDIFDWAKLCKAFLVEAEWMS 660

Query: 761  SGQSPSAEEYLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLW 820
            SGQSPSAEEYLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLW
Sbjct: 661  SGQSPSAEEYLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLW 720

Query: 821  DDMGSAKDEKQEGHDGSYLEYYMKEHPSMCYEETKRHTMTQICNAWKTLNTECLLSNLFP 880
            DDMGSAKDEKQEG DGSYLEYYMKEHPSMCYEETKRHTM QICNAWKTLNTECLLSNLFP
Sbjct: 721  DDMGSAKDEKQEGRDGSYLEYYMKEHPSMCYEETKRHTMKQICNAWKTLNTECLLSNLFP 780

Query: 881  AKFNQACLNLARVVPIAYNYA-ASAMLIVGVIPHPPPPPPKASMHNMVPHSQKWTIPQHH 940
            AKFNQACLNLARVVPIAYNYA ASAMLIVGVIP  PPPPPKASMHNMVPHSQKWTIP HH
Sbjct: 781  AKFNQACLNLARVVPIAYNYASASAMLIVGVIP--PPPPPKASMHNMVPHSQKWTIPLHH 840

Query: 941  SSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAKIDSLESLNMIDAIQRLGIDH 1000
            S LPISLPADSQI PSI+E SMKYE EMKNLKHLL ETAKIDSLESLNMIDAIQRLGIDH
Sbjct: 841  SFLPISLPADSQISPSIDETSMKYEFEMKNLKHLLRETAKIDSLESLNMIDAIQRLGIDH 900

Query: 1001 CFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQHGYFVPQDVFEGFI--DHEGL 1060
            CFKQEIK ILQTQYTMETHNFDAKCGLHHVALRFRL RQHGYFVPQDVFEGFI  DHE L
Sbjct: 901  CFKQEIKPILQTQYTMETHNFDAKCGLHHVALRFRLLRQHGYFVPQDVFEGFIHHDHEDL 960

Query: 1061 LDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARILKKLAWNRDDNLGKHVRKAVA 1120
            LDTKF ENIEGLTSLYEASQLC PEDEKLEKIGNFSARILKKL  NRDDNLGKHVRKA+A
Sbjct: 961  LDTKFSENIEGLTSLYEASQLCLPEDEKLEKIGNFSARILKKLVRNRDDNLGKHVRKAMA 1020

Query: 1121 NPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQNLHRLELSQFIMWWKKTGLC 1180
            NPFHKSLVKFVV D+FGSQSPNKWIYVFQ MAKLDFNR+Q LH LELSQFIMWWKKTGLC
Sbjct: 1021 NPFHKSLVKFVVKDHFGSQSPNKWIYVFQHMAKLDFNRVQKLHGLELSQFIMWWKKTGLC 1080

Query: 1181 EELKFARDQPLKWYICSMAVLADPMFSEERVELTKSISFIYLIDDIYDVYGSLEELRLFT 1240
            EELKFARDQPLKWYICSMA LADPMFSEERVELTK ISFIYLIDDIYDVYGSLEELRLFT
Sbjct: 1081 EELKFARDQPLKWYICSMAFLADPMFSEERVELTKCISFIYLIDDIYDVYGSLEELRLFT 1140

Query: 1241 KAIQRYMHVLWTDLSSCRWDLAALDGLPNPMKFCIIKLHETTNEICHNHLPSAEEYLENG 1300
            KAIQ                                                        
Sbjct: 1141 KAIQ-------------------------------------------------------- 1164

Query: 1301 AVSSGVHVVLAHIFFLLGQENPEISVDSTRERISHMISDAWKRLNQESLFSPNPYPPTFI 1360
                                       S RERISHMISDAWKRLNQESLFSPNPYPPTFI
Sbjct: 1201 ---------------------------SARERISHMISDAWKRLNQESLFSPNPYPPTFI 1164

Query: 1361 QASLNIARFVPLLYGYDENQDLPTLEKLVKFVLYES 1387
            QASLNIARFVPLLYGYDENQDLPTLEKLVKFVLYE+
Sbjct: 1261 QASLNIARFVPLLYGYDENQDLPTLEKLVKFVLYEN 1164

BLAST of Cp4.1LG20g03980 vs. NCBI nr
Match: XP_023520214.1 (uncharacterized protein LOC111783517 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1655 bits (4285), Expect = 0.0
Identity = 877/1166 (75.21%), Positives = 879/1166 (75.39%), Query Frame = 0

Query: 495  NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDIGHEAPVKGRILKHVLRE 554
            NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDIGHEAPVKGRILKHVLRE
Sbjct: 22   NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDIGHEAPVKGRILKHVLRE 81

Query: 555  IGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF---------------------- 614
            IGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYV F                      
Sbjct: 82   IGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVLFNAVQLNSDTDLHMTALLFRLFR 141

Query: 615  --------------LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHWL 674
                          LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHWL
Sbjct: 142  QHGYLVSSDVFESFLDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHWL 201

Query: 675  KAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWIN----------- 734
            KAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWI+           
Sbjct: 202  KAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWISIFRDAAKMELY 261

Query: 735  ---------------------LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAKS 794
                                 LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAKS
Sbjct: 262  SAQRLSQHELAQFTKWWKETDLAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAKS 321

Query: 795  ISFIYLIDDIFD------------------------------------------------ 854
            ISFIYLIDDIFD                                                
Sbjct: 322  ISFIYLIDDIFDVFGTLDELTIFTEAVCRWDMAAAEGLPDCMQTCLRTLFEVTNEISCQI 381

Query: 855  ---------------WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVFF 914
                           WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVFF
Sbjct: 382  YQAHGWNPIHSLHKAWAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVFF 441

Query: 915  LLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPSM 974
            LLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPSM
Sbjct: 442  LLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPSM 501

Query: 975  CYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY--------- 1034
            CYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY         
Sbjct: 502  CYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNYGRTQSIMSL 561

Query: 1035 -------------------------------------------------AASAMLIVGVI 1094
                                                             AASAMLIVGVI
Sbjct: 562  ENLIKQFLFHQMEKLIGMMPINWWYHQIHPTILQYLEPFKDFFIHPMEAAASAMLIVGVI 621

Query: 1095 PHPPPPPPKASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLK 1154
            PHPPPPPPKASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLK
Sbjct: 622  PHPPPPPPKASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLK 681

Query: 1155 HLLSETAKIDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVAL 1214
            HLLSETAKIDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVAL
Sbjct: 682  HLLSETAKIDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVAL 741

Query: 1215 RFRLFRQHGYFVPQDVFEGFIDHEGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGN 1274
            RFRLFRQHGYFVPQDVFEGFIDHEGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGN
Sbjct: 742  RFRLFRQHGYFVPQDVFEGFIDHEGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGN 801

Query: 1275 FSARILKKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKL 1334
            FSARILKKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKL
Sbjct: 802  FSARILKKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKL 861

Query: 1335 DFNRLQNLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELT 1387
            DFNRLQNLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELT
Sbjct: 862  DFNRLQNLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELT 921

BLAST of Cp4.1LG20g03980 vs. NCBI nr
Match: XP_022927129.1 (uncharacterized protein LOC111434068 [Cucurbita moschata])

HSP 1 Score: 1535 bits (3973), Expect = 0.0
Identity = 828/1160 (71.38%), Positives = 845/1160 (72.84%), Query Frame = 0

Query: 495  NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD-IGHEAPVKGRILKHVLR 554
            NDSPVNKRLFNKQTIVLRDHTLLST+FKPNRSKNQSFILTDD IGHEAPVKGRILKHVLR
Sbjct: 22   NDSPVNKRLFNKQTIVLRDHTLLSTNFKPNRSKNQSFILTDDDIGHEAPVKGRILKHVLR 81

Query: 555  EIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF--------------------- 614
            EIGDPWECLNLIDATQRLGIDYHFQ+EI+AILQR YV F                     
Sbjct: 82   EIGDPWECLNLIDATQRLGIDYHFQQEIEAILQRQYVLFNAVQLNSDTDLHKTAFLFRLF 141

Query: 615  ---------------LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHW 674
                           LD EGKFKE LKDDIKGL SLYEASQLCMHGDEILEEAENFSSHW
Sbjct: 142  RQHGYLVSSDVFESFLDGEGKFKEELKDDIKGLMSLYEASQLCMHGDEILEEAENFSSHW 201

Query: 675  LKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWIN---------- 734
            LKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D+QWPNKWI+          
Sbjct: 202  LKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLEDVQWPNKWISIFRDAAKMEL 261

Query: 735  ----------------------LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 794
                                  LAKDL+FSRDQPIKWYVASLICLSTDSFYSEQRIQLAK
Sbjct: 262  YSAQRLRQHELAQFTKWWKETDLAKDLSFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 321

Query: 795  SISFIYLIDDIFD----------------------------------------------- 854
            SISFIYLIDDIFD                                               
Sbjct: 322  SISFIYLIDDIFDVFGTLDELTIFTEAVCRWDLAAAEGLPDCMQICLRTLFEVTNEISCQ 381

Query: 855  ----------------WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 914
                            WAKLC AFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF
Sbjct: 382  IYQAHGWNPIHSLHKAWAKLCKAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 441

Query: 915  FLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPS 974
            FLLGE ISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEG DGSYLEYYMKEHPS
Sbjct: 442  FLLGELISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGRDGSYLEYYMKEHPS 501

Query: 975  MCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY-------- 1034
            MCYEETKRHTM QICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY        
Sbjct: 502  MCYEETKRHTMKQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNYGRAQSIMS 561

Query: 1035 ----------------------------AASAMLIVGVIPHPPPPP-------------P 1094
                                        +ASAMLIVGVI  PPPPP             P
Sbjct: 562  LENLIKQFLFHQMELEPIKDFFIHPMAASASAMLIVGVIAPPPPPPSSSSSSSSSFQPYP 621

Query: 1095 KASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAK 1154
            KASMHNMVPHS+KWTIP HHS LPISLPADSQI PSI+E SMKYE EMKNLKHLL ETAK
Sbjct: 622  KASMHNMVPHSKKWTIPLHHSFLPISLPADSQISPSIDETSMKYEFEMKNLKHLLRETAK 681

Query: 1155 IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQH 1214
            IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLH VALRFRL RQH
Sbjct: 682  IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHQVALRFRLLRQH 741

Query: 1215 GYFVPQDVFEGFIDH--EGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARIL 1274
            GYFVPQDVFEGFIDH  EGLLDTKF ENIEGLTSLYEASQLC PEDEKLEKIGNFSARIL
Sbjct: 742  GYFVPQDVFEGFIDHNHEGLLDTKFSENIEGLTSLYEASQLCLPEDEKLEKIGNFSARIL 801

Query: 1275 KKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQ 1334
            KKL  NRDDNL KHVRKA+ANPFHKSLVKFVV DYFGSQSPNKWIYVFQ MAKLDFNR+Q
Sbjct: 802  KKLVRNRDDNLSKHVRKAMANPFHKSLVKFVVKDYFGSQSPNKWIYVFQHMAKLDFNRVQ 861

Query: 1335 NLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELTKSISFI 1387
             LH LELSQFIMWWKKTGLCEELKFARDQPLKWYICSMA LADPMFSEERVELTK ISFI
Sbjct: 862  KLHGLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAFLADPMFSEERVELTKCISFI 921

BLAST of Cp4.1LG20g03980 vs. NCBI nr
Match: XP_023001644.1 (uncharacterized protein LOC111495716 [Cucurbita maxima])

HSP 1 Score: 1504 bits (3894), Expect = 0.0
Identity = 819/1202 (68.14%), Positives = 837/1202 (69.63%), Query Frame = 0

Query: 495  NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD-IGHEAPVKGRILKHVLR 554
            NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD IGHEAPVKGRILKHVLR
Sbjct: 22   NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDDIGHEAPVKGRILKHVLR 81

Query: 555  EIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF--------------------- 614
            EIGDPWECLNLIDATQRLGIDYHFQEEI+AILQR YV F                     
Sbjct: 82   EIGDPWECLNLIDATQRLGIDYHFQEEIEAILQRQYVLFNAVQLNSDTDLHKTALLFRLF 141

Query: 615  ---------------LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHW 674
                           LDKEGKFKE LKDDIKGLTSLYEASQLCMHG+EILEEAENFSSHW
Sbjct: 142  RQHGYLLSSDVFESFLDKEGKFKEELKDDIKGLTSLYEASQLCMHGEEILEEAENFSSHW 201

Query: 675  LKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWIN---------- 734
            LKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D+QWPNKWIN          
Sbjct: 202  LKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLEDVQWPNKWINIFRDAAKMEL 261

Query: 735  ----------------------LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 794
                                  LAKDLNFSRDQPIKWYVASLICLSTDSFYS+QRIQLAK
Sbjct: 262  YSAQRLCQHELALFTKWWKETDLAKDLNFSRDQPIKWYVASLICLSTDSFYSKQRIQLAK 321

Query: 795  SISFIYLIDDIFD----------------------------------------------- 854
            SISF+YLIDDIFD                                               
Sbjct: 322  SISFVYLIDDIFDVFGTLDELTIFTEAVCRWDLAAAEGLPDCMQTCLRTLFEVTNEISCQ 381

Query: 855  ----------------WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 914
                            WAKLC AFLVEAEW+SSGQSPSAEEYLKNGVVSTGVHVTL HVF
Sbjct: 382  IYQAHGWNPIHSLHKAWAKLCKAFLVEAEWLSSGQSPSAEEYLKNGVVSTGVHVTLIHVF 441

Query: 915  FLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPS 974
            FLLGEAISKE+VELFDEDLDIISSSATVLRLWDDMGSAKDE QEGHDGSYLEYYMKEHPS
Sbjct: 442  FLLGEAISKESVELFDEDLDIISSSATVLRLWDDMGSAKDENQEGHDGSYLEYYMKEHPS 501

Query: 975  MCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY-------- 1034
             CYEETK HTM QICNAWKTLNTECLLSNLFP  FNQACLNLAR VPIAYNY        
Sbjct: 502  TCYEETKLHTMKQICNAWKTLNTECLLSNLFPPNFNQACLNLARGVPIAYNYGRTQSIMS 561

Query: 1035 ------------------------------------------------------------ 1094
                                                                        
Sbjct: 562  LENLIKQFLFHKMEVCTNRYGEVFESYTVTEGCRYREVFESIINGRYHPIHPTILQYFEP 621

Query: 1095 -----------AASAMLIVGVIPHPPP--------------PPPKASMHNMVPHSQKWTI 1154
                       +ASAML+VGVIP P                P PKASMHNMVPHSQKWTI
Sbjct: 622  FKDFFIHPMAASASAMLLVGVIPPPSAASSSSSSSSSSSFQPYPKASMHNMVPHSQKWTI 681

Query: 1155 PQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAKIDSLESLNMIDAIQRL 1214
            P HHS LPISLPADSQI PS ++MSMKYE EMKNLKHLLSE AKIDSLESLNMIDAIQRL
Sbjct: 682  PLHHSFLPISLPADSQISPSTDKMSMKYEFEMKNLKHLLSEMAKIDSLESLNMIDAIQRL 741

Query: 1215 GIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQHGYFVPQDVFEGFIDHE 1274
            GIDHCFK+EIKAILQTQYTMETHNFDAKCGLH VALRFRL RQHGYFVPQDVF GFIDHE
Sbjct: 742  GIDHCFKEEIKAILQTQYTMETHNFDAKCGLHQVALRFRLLRQHGYFVPQDVFGGFIDHE 801

Query: 1275 GLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARILKKLAWNRDDNLGKHVRKA 1334
            GLLDTKF ENIEGLTSLYEASQLC PEDEKLEKIGNFSA ILKKLA N DDNLGKHVRKA
Sbjct: 802  GLLDTKFRENIEGLTSLYEASQLCLPEDEKLEKIGNFSACILKKLARNHDDNLGKHVRKA 861

Query: 1335 VANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQNLHRLELSQFIMWWKKTG 1387
            VANPFHK LVKFVV DYFGSQSPNKWIYVFQ MAKLDFNR+Q LH LELSQFIMWWK+TG
Sbjct: 862  VANPFHKRLVKFVVKDYFGSQSPNKWIYVFQHMAKLDFNRVQKLHGLELSQFIMWWKRTG 921

BLAST of Cp4.1LG20g03980 vs. NCBI nr
Match: KAG6584051.1 (hypothetical protein SDJN03_19983, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1154 bits (2985), Expect = 0.0
Identity = 650/1036 (62.74%), Positives = 675/1036 (65.15%), Query Frame = 0

Query: 535  DDIGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF-- 594
            DDIGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQ+EI+AILQR YV F  
Sbjct: 4    DDIGHEAPVKGRILKHVLREIGDPWECLNLIDATQRLGIDYHFQQEIEAILQRQYVLFNA 63

Query: 595  ----------------------------------LDKEGKFKEGLKDDIKGLTSLYEASQ 654
                                              LD EGKFKE LKDDIKGLTSLYEASQ
Sbjct: 64   VQLNSDTDLHKTAFLFRLFRQHGYLVSSDVFESFLDGEGKFKEELKDDIKGLTSLYEASQ 123

Query: 655  LCMHGDEILEEAENFSSHWLKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLND 714
            LCMHGDEILEEAENFSSHWLKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D
Sbjct: 124  LCMHGDEILEEAENFSSHWLKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLED 183

Query: 715  MQWPNKWIN--------------------------------LAKDLNFSRDQPIKWYVAS 774
            +QWPNKWI+                                LAKDL+FSRDQPIKWYVAS
Sbjct: 184  VQWPNKWISIFRDAAKMELYSAQRLRQHELAQFTKWWKETDLAKDLSFSRDQPIKWYVAS 243

Query: 775  LICLSTDSFYSEQRIQLAKSISFIYLIDDIFD---------------------------- 834
            LICLSTDSFYSEQRIQLAKSISFIYLIDDIFD                            
Sbjct: 244  LICLSTDSFYSEQRIQLAKSISFIYLIDDIFDVFGTLDELTIFTEAVCRWDLAAAEGLPD 303

Query: 835  -----------------------------------WAKLCNAFLVEAEWMSSGQSPSAEE 894
                                               WAKLC AFLVEAEWMSSGQSPSAEE
Sbjct: 304  CMQICLRTLFEVTNEISCQIYQAHGWNPIHSLHKAWAKLCKAFLVEAEWMSSGQSPSAEE 363

Query: 895  YLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDE 954
            YLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDE
Sbjct: 364  YLKNGVVSTGVHVTLTHVFFLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDE 423

Query: 955  KQEGHDGSYLEYYMKEHPSMCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLN 1014
            KQEG DGSYLEYYMKEHPSMCYEETKRHTM QICNAWKTLNTECLLSNLFPAKFNQACLN
Sbjct: 424  KQEGRDGSYLEYYMKEHPSMCYEETKRHTMKQICNAWKTLNTECLLSNLFPAKFNQACLN 483

Query: 1015 LARVVPIAYNYAASAMLIVGVIPHPPPPPPKASMHNMVPHSQKWTIPQHHSSLPISLPAD 1074
            LARVVPIAYNY  +  ++              S+ N++   +++   Q            
Sbjct: 484  LARVVPIAYNYGRTQSIM--------------SLENLI---KQFLFHQME---------- 543

Query: 1075 SQICPSIEEMSMKYELEMKNLKHLLSETAKIDSLESLNMIDAIQRLGIDHCFKQEIKAIL 1134
                   +E SMKYE EMKNLKHLL ETAKIDSLESLNMIDAIQRLGIDHCFKQEIK IL
Sbjct: 544  -------DETSMKYEFEMKNLKHLLRETAKIDSLESLNMIDAIQRLGIDHCFKQEIKPIL 603

Query: 1135 QTQYTMETHNFDAKCGLHHVALRFRLFRQHGYFVPQDVFEGFI--DHEGLLDTKFIENIE 1194
            QTQYTMETHNFDAKCGLHHVALRFRL RQHGYFVPQDVFEGFI  DHE LLDTKF ENIE
Sbjct: 604  QTQYTMETHNFDAKCGLHHVALRFRLLRQHGYFVPQDVFEGFIHHDHEDLLDTKFSENIE 663

Query: 1195 GLTSLYEASQLCFPEDEKLEKIGNFSARILKKLAWNRDDNLGKHVRKAVANPFHKSLVKF 1254
            GLTSLYEASQLC PEDEKLEKIGNFSA ILKKL  NRDDNLGKHVRKA+ANPFHKSLVKF
Sbjct: 664  GLTSLYEASQLCLPEDEKLEKIGNFSACILKKLVRNRDDNLGKHVRKAMANPFHKSLVKF 723

Query: 1255 VVMDYFGSQSPNKWIYVFQPMAKLDFNRLQNLHRLELSQFIMWWKKTGLCEELKFARDQP 1314
            VV DYFGSQSPNKWIYVFQ MAKLDFNR+Q LH LELSQFI+      LCE      +  
Sbjct: 724  VVKDYFGSQSPNKWIYVFQHMAKLDFNRVQKLHGLELSQFII------LCEAFLVEAE-- 783

Query: 1315 LKWYICSMAVLADPMFSEERVELTKSISFIYLIDDIYDVYGSLEELRLFTKAIQRYMHVL 1374
              W+                                    GS                  
Sbjct: 784  --WF------------------------------------GS------------------ 843

Query: 1375 WTDLSSCRWDLAALDGLPNPMKFCIIKLHETTNEICHNHLPSAEEYLENGAVSSGVHVVL 1387
                                                 +HLPSA+EYLENG VSSGVHVVL
Sbjct: 844  -------------------------------------SHLPSAKEYLENGEVSSGVHVVL 903

BLAST of Cp4.1LG20g03980 vs. ExPASy TrEMBL
Match: A0A6J1EH51 (uncharacterized protein LOC111434068 OS=Cucurbita moschata OX=3662 GN=LOC111434068 PE=4 SV=1)

HSP 1 Score: 1535 bits (3973), Expect = 0.0
Identity = 828/1160 (71.38%), Positives = 845/1160 (72.84%), Query Frame = 0

Query: 495  NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD-IGHEAPVKGRILKHVLR 554
            NDSPVNKRLFNKQTIVLRDHTLLST+FKPNRSKNQSFILTDD IGHEAPVKGRILKHVLR
Sbjct: 22   NDSPVNKRLFNKQTIVLRDHTLLSTNFKPNRSKNQSFILTDDDIGHEAPVKGRILKHVLR 81

Query: 555  EIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF--------------------- 614
            EIGDPWECLNLIDATQRLGIDYHFQ+EI+AILQR YV F                     
Sbjct: 82   EIGDPWECLNLIDATQRLGIDYHFQQEIEAILQRQYVLFNAVQLNSDTDLHKTAFLFRLF 141

Query: 615  ---------------LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHW 674
                           LD EGKFKE LKDDIKGL SLYEASQLCMHGDEILEEAENFSSHW
Sbjct: 142  RQHGYLVSSDVFESFLDGEGKFKEELKDDIKGLMSLYEASQLCMHGDEILEEAENFSSHW 201

Query: 675  LKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWIN---------- 734
            LKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D+QWPNKWI+          
Sbjct: 202  LKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLEDVQWPNKWISIFRDAAKMEL 261

Query: 735  ----------------------LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 794
                                  LAKDL+FSRDQPIKWYVASLICLSTDSFYSEQRIQLAK
Sbjct: 262  YSAQRLRQHELAQFTKWWKETDLAKDLSFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 321

Query: 795  SISFIYLIDDIFD----------------------------------------------- 854
            SISFIYLIDDIFD                                               
Sbjct: 322  SISFIYLIDDIFDVFGTLDELTIFTEAVCRWDLAAAEGLPDCMQICLRTLFEVTNEISCQ 381

Query: 855  ----------------WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 914
                            WAKLC AFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF
Sbjct: 382  IYQAHGWNPIHSLHKAWAKLCKAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 441

Query: 915  FLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPS 974
            FLLGE ISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEG DGSYLEYYMKEHPS
Sbjct: 442  FLLGELISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGRDGSYLEYYMKEHPS 501

Query: 975  MCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY-------- 1034
            MCYEETKRHTM QICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY        
Sbjct: 502  MCYEETKRHTMKQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNYGRAQSIMS 561

Query: 1035 ----------------------------AASAMLIVGVIPHPPPPP-------------P 1094
                                        +ASAMLIVGVI  PPPPP             P
Sbjct: 562  LENLIKQFLFHQMELEPIKDFFIHPMAASASAMLIVGVIAPPPPPPSSSSSSSSSFQPYP 621

Query: 1095 KASMHNMVPHSQKWTIPQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAK 1154
            KASMHNMVPHS+KWTIP HHS LPISLPADSQI PSI+E SMKYE EMKNLKHLL ETAK
Sbjct: 622  KASMHNMVPHSKKWTIPLHHSFLPISLPADSQISPSIDETSMKYEFEMKNLKHLLRETAK 681

Query: 1155 IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQH 1214
            IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLH VALRFRL RQH
Sbjct: 682  IDSLESLNMIDAIQRLGIDHCFKQEIKAILQTQYTMETHNFDAKCGLHQVALRFRLLRQH 741

Query: 1215 GYFVPQDVFEGFIDH--EGLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARIL 1274
            GYFVPQDVFEGFIDH  EGLLDTKF ENIEGLTSLYEASQLC PEDEKLEKIGNFSARIL
Sbjct: 742  GYFVPQDVFEGFIDHNHEGLLDTKFSENIEGLTSLYEASQLCLPEDEKLEKIGNFSARIL 801

Query: 1275 KKLAWNRDDNLGKHVRKAVANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQ 1334
            KKL  NRDDNL KHVRKA+ANPFHKSLVKFVV DYFGSQSPNKWIYVFQ MAKLDFNR+Q
Sbjct: 802  KKLVRNRDDNLSKHVRKAMANPFHKSLVKFVVKDYFGSQSPNKWIYVFQHMAKLDFNRVQ 861

Query: 1335 NLHRLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAVLADPMFSEERVELTKSISFI 1387
             LH LELSQFIMWWKKTGLCEELKFARDQPLKWYICSMA LADPMFSEERVELTK ISFI
Sbjct: 862  KLHGLELSQFIMWWKKTGLCEELKFARDQPLKWYICSMAFLADPMFSEERVELTKCISFI 921

BLAST of Cp4.1LG20g03980 vs. ExPASy TrEMBL
Match: A0A6J1KNB1 (uncharacterized protein LOC111495716 OS=Cucurbita maxima OX=3661 GN=LOC111495716 PE=4 SV=1)

HSP 1 Score: 1504 bits (3894), Expect = 0.0
Identity = 819/1202 (68.14%), Positives = 837/1202 (69.63%), Query Frame = 0

Query: 495  NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD-IGHEAPVKGRILKHVLR 554
            NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDD IGHEAPVKGRILKHVLR
Sbjct: 22   NDSPVNKRLFNKQTIVLRDHTLLSTHFKPNRSKNQSFILTDDDIGHEAPVKGRILKHVLR 81

Query: 555  EIGDPWECLNLIDATQRLGIDYHFQEEIQAILQRHYVFF--------------------- 614
            EIGDPWECLNLIDATQRLGIDYHFQEEI+AILQR YV F                     
Sbjct: 82   EIGDPWECLNLIDATQRLGIDYHFQEEIEAILQRQYVLFNAVQLNSDTDLHKTALLFRLF 141

Query: 615  ---------------LDKEGKFKEGLKDDIKGLTSLYEASQLCMHGDEILEEAENFSSHW 674
                           LDKEGKFKE LKDDIKGLTSLYEASQLCMHG+EILEEAENFSSHW
Sbjct: 142  RQHGYLLSSDVFESFLDKEGKFKEELKDDIKGLTSLYEASQLCMHGEEILEEAENFSSHW 201

Query: 675  LKAKAEDEEVDHHLASFVQHTLAYPHHKSVVQLMAPNYLNDMQWPNKWIN---------- 734
            LKA+AE E+VDHHLASFVQHTLAYPHHKSVVQLMAPNYL D+QWPNKWIN          
Sbjct: 202  LKARAEAEQVDHHLASFVQHTLAYPHHKSVVQLMAPNYLEDVQWPNKWINIFRDAAKMEL 261

Query: 735  ----------------------LAKDLNFSRDQPIKWYVASLICLSTDSFYSEQRIQLAK 794
                                  LAKDLNFSRDQPIKWYVASLICLSTDSFYS+QRIQLAK
Sbjct: 262  YSAQRLCQHELALFTKWWKETDLAKDLNFSRDQPIKWYVASLICLSTDSFYSKQRIQLAK 321

Query: 795  SISFIYLIDDIFD----------------------------------------------- 854
            SISF+YLIDDIFD                                               
Sbjct: 322  SISFVYLIDDIFDVFGTLDELTIFTEAVCRWDLAAAEGLPDCMQTCLRTLFEVTNEISCQ 381

Query: 855  ----------------WAKLCNAFLVEAEWMSSGQSPSAEEYLKNGVVSTGVHVTLTHVF 914
                            WAKLC AFLVEAEW+SSGQSPSAEEYLKNGVVSTGVHVTL HVF
Sbjct: 382  IYQAHGWNPIHSLHKAWAKLCKAFLVEAEWLSSGQSPSAEEYLKNGVVSTGVHVTLIHVF 441

Query: 915  FLLGEAISKETVELFDEDLDIISSSATVLRLWDDMGSAKDEKQEGHDGSYLEYYMKEHPS 974
            FLLGEAISKE+VELFDEDLDIISSSATVLRLWDDMGSAKDE QEGHDGSYLEYYMKEHPS
Sbjct: 442  FLLGEAISKESVELFDEDLDIISSSATVLRLWDDMGSAKDENQEGHDGSYLEYYMKEHPS 501

Query: 975  MCYEETKRHTMTQICNAWKTLNTECLLSNLFPAKFNQACLNLARVVPIAYNY-------- 1034
             CYEETK HTM QICNAWKTLNTECLLSNLFP  FNQACLNLAR VPIAYNY        
Sbjct: 502  TCYEETKLHTMKQICNAWKTLNTECLLSNLFPPNFNQACLNLARGVPIAYNYGRTQSIMS 561

Query: 1035 ------------------------------------------------------------ 1094
                                                                        
Sbjct: 562  LENLIKQFLFHKMEVCTNRYGEVFESYTVTEGCRYREVFESIINGRYHPIHPTILQYFEP 621

Query: 1095 -----------AASAMLIVGVIPHPPP--------------PPPKASMHNMVPHSQKWTI 1154
                       +ASAML+VGVIP P                P PKASMHNMVPHSQKWTI
Sbjct: 622  FKDFFIHPMAASASAMLLVGVIPPPSAASSSSSSSSSSSFQPYPKASMHNMVPHSQKWTI 681

Query: 1155 PQHHSSLPISLPADSQICPSIEEMSMKYELEMKNLKHLLSETAKIDSLESLNMIDAIQRL 1214
            P HHS LPISLPADSQI PS ++MSMKYE EMKNLKHLLSE AKIDSLESLNMIDAIQRL
Sbjct: 682  PLHHSFLPISLPADSQISPSTDKMSMKYEFEMKNLKHLLSEMAKIDSLESLNMIDAIQRL 741

Query: 1215 GIDHCFKQEIKAILQTQYTMETHNFDAKCGLHHVALRFRLFRQHGYFVPQDVFEGFIDHE 1274
            GIDHCFK+EIKAILQTQYTMETHNFDAKCGLH VALRFRL RQHGYFVPQDVF GFIDHE
Sbjct: 742  GIDHCFKEEIKAILQTQYTMETHNFDAKCGLHQVALRFRLLRQHGYFVPQDVFGGFIDHE 801

Query: 1275 GLLDTKFIENIEGLTSLYEASQLCFPEDEKLEKIGNFSARILKKLAWNRDDNLGKHVRKA 1334
            GLLDTKF ENIEGLTSLYEASQLC PEDEKLEKIGNFSA ILKKLA N DDNLGKHVRKA
Sbjct: 802  GLLDTKFRENIEGLTSLYEASQLCLPEDEKLEKIGNFSACILKKLARNHDDNLGKHVRKA 861

Query: 1335 VANPFHKSLVKFVVMDYFGSQSPNKWIYVFQPMAKLDFNRLQNLHRLELSQFIMWWKKTG 1387
            VANPFHK LVKFVV DYFGSQSPNKWIYVFQ MAKLDFNR+Q LH LELSQFIMWWK+TG
Sbjct: 862  VANPFHKRLVKFVVKDYFGSQSPNKWIYVFQHMAKLDFNRVQKLHGLELSQFIMWWKRTG 921

BLAST of Cp4.1LG20g03980 vs. ExPASy TrEMBL
Match: A0A6J1EK53 ((3S,6E)-nerolidol synthase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111434067 PE=4 SV=1)

HSP 1 Score: 957 bits (2473), Expect = 0.0
Identity = 472/508 (92.91%), Positives = 477/508 (93.90%), Query Frame = 0

Query: 1   GAVAKVESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDP 60
           GAVAKVESLKHVLRESGDCLE LDLIDAAQRLGID+ LQEEIEAVLKRQYFLLNAL+FDP
Sbjct: 58  GAVAKVESLKHVLRESGDCLECLDLIDAAQRLGIDSLLQEEIEAVLKRQYFLLNALEFDP 117

Query: 61  MIDLHKAALLFRLLTQQGFLVTP---------------KLRHDIKGLTSLHEASQLCMHG 120
           MIDLHKAALLFR LTQQGFLVTP               +LRHDIKGLTSLHEASQLCMHG
Sbjct: 118 MIDLHKAALLFRHLTQQGFLVTPNLFKIFLDEEGKFNKELRHDIKGLTSLHEASQLCMHG 177

Query: 121 DDILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNK 180
           DDILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHK LPQFMLPNYFGD QWTNK
Sbjct: 178 DDILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKALPQFMLPNYFGDKQWTNK 237

Query: 181 WIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTD 240
           WIHTLQ VAKVNFNTTQRLRQYEL QFTKWWKESDLGRELNFARNQPMKWYIASLSCLTD
Sbjct: 238 WIHTLQDVAKVNFNTTQRLRQYELKQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTD 297

Query: 241 VCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFK 300
           VCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMF EVVCRWDLVAAEKLPSCMRICFK
Sbjct: 298 VCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFAEVVCRWDLVAAEKLPSCMRICFK 357

Query: 301 SLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRV 360
           SL  VTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRV
Sbjct: 358 SLLEVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRV 417

Query: 361 STGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDG 420
           STGVHVTLVHVFFLLG QI  QTVELLE DLDIV CSATILRLWDDMGSAKDENQEGRDG
Sbjct: 418 STGVHVTLVHVFFLLGQQISNQTVELLEEDLDIVSCSATILRLWDDMGSAKDENQEGRDG 477

Query: 421 SYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVP 480
           SYVKYYMKEHPSTSF+ATQKHIMWKISEAWKSLNREWL+CNPLFPSQFTQASLNIARAVP
Sbjct: 478 SYVKYYMKEHPSTSFSATQKHIMWKISEAWKSLNREWLYCNPLFPSQFTQASLNIARAVP 537

Query: 481 LFYSYGRNQSFPTFEQHMERLLFHSLDI 493
           LFYSYGRNQS PTFEQHMERLLFHSLDI
Sbjct: 538 LFYSYGRNQSLPTFEQHMERLLFHSLDI 565

BLAST of Cp4.1LG20g03980 vs. ExPASy TrEMBL
Match: A0A6J1CE33 ((3S,6E)-nerolidol synthase 1-like OS=Momordica charantia OX=3673 GN=LOC111010706 PE=4 SV=1)

HSP 1 Score: 716 bits (1847), Expect = 1.83e-241
Identity = 354/507 (69.82%), Positives = 417/507 (82.25%), Query Frame = 0

Query: 2   AVAKVESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPM 61
           A AK++ LKHVLRE GD  E L+LIDAAQRLGID H QEEIE+VL +QY L NA+QFDP 
Sbjct: 70  ATAKLKILKHVLREIGDSYECLNLIDAAQRLGIDYHFQEEIESVLHKQYILFNAVQFDPH 129

Query: 62  IDLHKAALLFRLLTQQGFLVTP---------------KLRHDIKGLTSLHEASQLCMHGD 121
            DL KA+LLFRL  QQG+  +                +LRHDIKGLTSL+EASQLCMH +
Sbjct: 130 TDLPKASLLFRLFRQQGYPASADVFKIFLDHVGKFDEELRHDIKGLTSLYEASQLCMHEE 189

Query: 122 DILEEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKW 181
           DILEEA++FSSHWL+AW V HV+HHSA FVHNTL HP HK++ QFM+PNYFGD QWTNKW
Sbjct: 190 DILEEAESFSSHWLSAWAV-HVDHHSAAFVHNTLAHPCHKSVAQFMVPNYFGDIQWTNKW 249

Query: 182 IHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDV 241
           IH LQ VA+++FNTTQRLRQ E+ QF+KWW+E+DL +EL+FARNQP+KWY+ SL CLTD 
Sbjct: 250 IHILQDVARLDFNTTQRLRQNEIAQFSKWWEETDLAKELSFARNQPIKWYMGSLICLTDP 309

Query: 242 CHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKS 301
           C SE+R+ LAK+I F+YLIDDIFDVFG+ +ELT+FT++VCRWDLV  +KLP+ M++  KS
Sbjct: 310 CFSEERVLLAKSITFIYLIDDIFDVFGSRDELTLFTQLVCRWDLVGGKKLPNSMQVVLKS 369

Query: 302 LFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVS 361
           LF VTNEIS +IYQKHGWNPA+SL+ AW KLCKAFLVEAEW S GH PSAEEYLRNG VS
Sbjct: 370 LFEVTNEISYKIYQKHGWNPASSLKKAWAKLCKAFLVEAEWLSCGHSPSAEEYLRNGIVS 429

Query: 362 TGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGS 421
           TGVHV LVHVFFLLG +I +QTVELLE DLDI+  SATILRLWDDMG+AKDENQEGRDGS
Sbjct: 430 TGVHVVLVHVFFLLGERISKQTVELLEDDLDIILSSATILRLWDDMGNAKDENQEGRDGS 489

Query: 422 YVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPL 481
           Y KYYMKEHPS   N T++H+M KISEAWK+LN+E    +   P++F QASLNIARAVPL
Sbjct: 490 YEKYYMKEHPSMCCNITRQHMMRKISEAWKTLNKESQLLSTTLPAKFIQASLNIARAVPL 549

Query: 482 FYSYGRNQSFPTFEQHMERLLFHSLDI 493
            Y+YGRNQS PTFEQ +E+LLFHS++I
Sbjct: 550 TYNYGRNQSLPTFEQLIEQLLFHSVEI 575

BLAST of Cp4.1LG20g03980 vs. ExPASy TrEMBL
Match: A0A6J1CEY1 ((3S,6E)-nerolidol synthase 1-like OS=Momordica charantia OX=3673 GN=LOC111010887 PE=4 SV=1)

HSP 1 Score: 670 bits (1729), Expect = 3.10e-220
Identity = 340/503 (67.59%), Positives = 394/503 (78.33%), Query Frame = 0

Query: 4   AKVESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMID 63
           AK+  LKHVLRE GD  E L+LIDAAQRLGIDNH QEEIEAVL+ QY L NA       D
Sbjct: 72  AKLRILKHVLREVGDPWECLNLIDAAQRLGIDNHFQEEIEAVLQTQYVLFNAASSRNN-D 131

Query: 64  LHKAALLFRLLTQQGFLVTP--------------KLRHDIKGLTSLHEASQLCMHGDDIL 123
           LHKAALLFRL  QQG+ V+P              KLR DI GLTSL+EASQLCMHGD+IL
Sbjct: 132 LHKAALLFRLFRQQGYQVSPVFKNFLDKEGKFHEKLREDINGLTSLYEASQLCMHGDEIL 191

Query: 124 EEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFGDNQWTNKWIHT 183
           EEA+NFS HWLNA V  HV+HH ATFVHNTL HP+HK++ QFM+PNYFGD QW NKWI+ 
Sbjct: 192 EEAENFSRHWLNARVA-HVDHHVATFVHNTLAHPHHKSMAQFMVPNYFGDIQWGNKWINI 251

Query: 184 LQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVCHS 243
           LQ  AK+ F  TQRL Q E+ QF KWWKE+DL +ELNFAR+QP+KWY+ASL CLTD   S
Sbjct: 252 LQDAAKIEFYATQRLHQNEIAQFLKWWKETDLAKELNFARDQPIKWYMASLVCLTDSYFS 311

Query: 244 EQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSLFV 303
           EQR++LAK+I+FVYLIDD+FDVFGT +ELT+ TE V RWDL AA+ LP CMRIC +SLF 
Sbjct: 312 EQRVELAKSISFVYLIDDLFDVFGTHDELTLLTEAVHRWDLAAAKGLPDCMRICLRSLFE 371

Query: 304 VTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVSTGV 363
           VTNEIS +IYQKHGWNP  SL+ AW KLCK FLVEAEW S GH PSAEEYLRNG  STG+
Sbjct: 372 VTNEISYKIYQKHGWNPIGSLKKAWAKLCKTFLVEAEWLSCGHSPSAEEYLRNGIASTGI 431

Query: 364 HVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSYVK 423
            V LVHVFFL+G QI  +TV+LLE DLDIV  SA +LRLWDDMG+A+DE QEGRDGSY++
Sbjct: 432 PVILVHVFFLIGQQISNETVQLLEDDLDIVSSSAAVLRLWDDMGTAQDEKQEGRDGSYLE 491

Query: 424 YYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLFYS 483
            YMKEHPS S  +TQ+H+M KI EAWK+LNRE+L  +  FPS+FTQASLN+ARAVPL Y+
Sbjct: 492 CYMKEHPSVSCESTQQHVMKKICEAWKTLNREYL-LSKTFPSKFTQASLNVARAVPLAYN 551

Query: 484 YGRNQSFPTFEQHMERLLFHSLD 492
           YGRN+S  T E  +E L FH ++
Sbjct: 552 YGRNRSIMTMENIVENLFFHGME 571

BLAST of Cp4.1LG20g03980 vs. TAIR 10
Match: AT1G61680.2 (terpene synthase 14)

HSP 1 Score: 399.1 bits (1024), Expect = 1.5e-110
Identity = 215/495 (43.43%), Positives = 315/495 (63.64%), Query Frame = 0

Query: 6   VESLKHVLRESGDC-LERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMIDL 65
           ++ +K++L  + D   E L++ID  Q LGID H ++EIE  L   Y     LQF+   DL
Sbjct: 76  IKKIKNILSANVDVPSENLEMIDVIQSLGIDLHFRQEIEQTLHMIY--KEGLQFNG--DL 135

Query: 66  HKAALLFRLLTQQGFLVTPK--------LRHDIKGLTSLHEASQLCMHGDDILEEAQNFS 125
           H+ AL FRLL Q+G  V           +++D+KGLT L EAS+L + G++ L+ A+ F+
Sbjct: 136 HEIALRFRLLRQEGHYVQENKKGGFKDVVKNDVKGLTELFEASELRVEGEETLDGAREFT 195

Query: 126 SHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFG----DNQWTNKWIHTLQG 185
              LN       +H     +  +L  P HKT+       +        Q   +W+ +L  
Sbjct: 196 YSRLNELCSGRESHQKQE-IMKSLAQPRHKTVRGLTSKRFTSMIKIAGQEDPEWLQSLLR 255

Query: 186 VAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVCHSEQR 245
           VA+++    + L Q E++Q  KWW E  L +++  AR+QP+KW+  S+  L D   +EQR
Sbjct: 256 VAEIDSIRLKSLTQGEMSQTFKWWTELGLEKDVEKARSQPLKWHTWSMKILQDPTLTEQR 315

Query: 246 IQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSLFVVTN 305
           + L K I+ VY+IDDIFDV+G LEELT+FT VV RWD    + LP  MR+CF++L ++T 
Sbjct: 316 LDLTKPISLVYVIDDIFDVYGELEELTIFTRVVERWDHKGLKTLPKYMRVCFEALDMITT 375

Query: 306 EISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVSTGVHVT 365
           EIS +IY+ HGWNP  +L+ +W  LCKAFLVEA+WF+SG+ P+ EEY++NG VS+GVH+ 
Sbjct: 376 EISMKIYKSHGWNPTYALRQSWASLCKAFLVEAKWFNSGYLPTTEEYMKNGVVSSGVHLV 435

Query: 366 LVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSYVKYYM 425
           ++H + LLG ++ ++ VEL+E +  IV  +ATILRLWDD+GSAKDENQ+G DGSYV+ Y+
Sbjct: 436 MLHAYILLGEELTKEKVELIESNPGIVSSAATILRLWDDLGSAKDENQDGTDGSYVECYL 495

Query: 426 KEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVPLFYSYGR 485
            E+  ++ +  + H+  KIS AWK LNRE L+  P F   F++A LNIAR VPL YSY  
Sbjct: 496 NEYKGSTVDEARTHVAQKISRAWKRLNRECLNPCP-FSRSFSKACLNIARTVPLMYSYDD 555

Query: 486 NQSFPTFEQHMERLL 488
           +Q  P  +++++ L+
Sbjct: 556 DQRLP--DEYLKSLM 562

BLAST of Cp4.1LG20g03980 vs. TAIR 10
Match: AT1G61680.1 (terpene synthase 14)

HSP 1 Score: 395.6 bits (1015), Expect = 1.7e-109
Identity = 215/502 (42.83%), Positives = 315/502 (62.75%), Query Frame = 0

Query: 6   VESLKHVLRESGDC-LERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMIDL 65
           ++ +K++L  + D   E L++ID  Q LGID H ++EIE  L   Y     LQF+   DL
Sbjct: 76  IKKIKNILSANVDVPSENLEMIDVIQSLGIDLHFRQEIEQTLHMIY--KEGLQFNG--DL 135

Query: 66  HKAALLFRLLTQQGFLV---------------TPKLRHDIKGLTSLHEASQLCMHGDDIL 125
           H+ AL FRLL Q+G  V                  +++D+KGLT L EAS+L + G++ L
Sbjct: 136 HEIALRFRLLRQEGHYVQEIIFKNILDKKGGFKDVVKNDVKGLTELFEASELRVEGEETL 195

Query: 126 EEAQNFSSHWLNAWVVVHVNHHSATFVHNTLLHPYHKTLPQFMLPNYFG----DNQWTNK 185
           + A+ F+   LN       +H     +  +L  P HKT+       +        Q   +
Sbjct: 196 DGAREFTYSRLNELCSGRESHQKQE-IMKSLAQPRHKTVRGLTSKRFTSMIKIAGQEDPE 255

Query: 186 WIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTD 245
           W+ +L  VA+++    + L Q E++Q  KWW E  L +++  AR+QP+KW+  S+  L D
Sbjct: 256 WLQSLLRVAEIDSIRLKSLTQGEMSQTFKWWTELGLEKDVEKARSQPLKWHTWSMKILQD 315

Query: 246 VCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFK 305
              +EQR+ L K I+ VY+IDDIFDV+G LEELT+FT VV RWD    + LP  MR+CF+
Sbjct: 316 PTLTEQRLDLTKPISLVYVIDDIFDVYGELEELTIFTRVVERWDHKGLKTLPKYMRVCFE 375

Query: 306 SLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRV 365
           +L ++T EIS +IY+ HGWNP  +L+ +W  LCKAFLVEA+WF+SG+ P+ EEY++NG V
Sbjct: 376 ALDMITTEISMKIYKSHGWNPTYALRQSWASLCKAFLVEAKWFNSGYLPTTEEYMKNGVV 435

Query: 366 STGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDENQEGRDG 425
           S+GVH+ ++H + LLG ++ ++ VEL+E +  IV  +ATILRLWDD+GSAKDENQ+G DG
Sbjct: 436 SSGVHLVMLHAYILLGEELTKEKVELIESNPGIVSSAATILRLWDDLGSAKDENQDGTDG 495

Query: 426 SYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARAVP 485
           SYV+ Y+ E+  ++ +  + H+  KIS AWK LNRE L+  P F   F++A LNIAR VP
Sbjct: 496 SYVECYLNEYKGSTVDEARTHVAQKISRAWKRLNRECLNPCP-FSRSFSKACLNIARTVP 555

Query: 486 LFYSYGRNQSFPTFEQHMERLL 488
           L YSY  +Q  P  +++++ L+
Sbjct: 556 LMYSYDDDQRLP--DEYLKSLM 569

BLAST of Cp4.1LG20g03980 vs. TAIR 10
Match: AT4G16740.1 (terpene synthase 03)

HSP 1 Score: 243.8 bits (621), Expect = 8.2e-64
Identity = 162/513 (31.58%), Positives = 262/513 (51.07%), Query Frame = 0

Query: 7   ESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVL-----KRQYFLLNALQFDPM 66
           + +  +L E+   LE+L+LID  QRLG+  H ++EI+  L     K      N +  +  
Sbjct: 62  QEVSKMLNETEGLLEQLELIDTLQRLGVSYHFEQEIKKTLTNVHVKNVRAHKNRIDRNRW 121

Query: 67  IDLHKAALLFRLLTQQGFLVTPKL----------RHDIKGLTSLHEASQLCMHGDDILEE 126
            DL+  AL FRLL Q GF +   +            DIKG+ SL+EAS L    D  L+E
Sbjct: 122 GDLYATALEFRLLRQHGFSIAQDVFDGNIGVDLDDKDIKGILSLYEASYLSTRIDTKLKE 181

Query: 127 AQNFSSHWLNAWVVVHVNHHSATFVHNTLLH----PYHKTLPQFMLPNY---FGDNQWTN 186
           +  +++  L  +V V+ N   +  +   ++H    PYH+ + +     Y   +G+    N
Sbjct: 182 SIYYTTKRLRKFVEVNKNETKSYTLRRMVIHALEMPYHRRVGRLEARWYIEVYGERHDMN 241

Query: 187 KWIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLT 246
                L  +AK++FN  Q + Q EL   + WW ++ L + L+F R++  + Y +S+  + 
Sbjct: 242 P---ILLELAKLDFNFVQAIHQDELKSLSSWWSKTGLTKHLDFVRDRITEGYFSSVGVMY 301

Query: 247 DVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICF 306
           +   +  R  L K    +  IDDI+D++GTLEEL +FT +V +WD+   E+LP+ M++CF
Sbjct: 302 EPEFAYHRQMLTKVFMLITTIDDIYDIYGTLEELQLFTTIVEKWDVNRLEELPNYMKLCF 361

Query: 307 KSLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGR 366
             L    N+I   + +  G+N    L+ +W  +C  FL EA+W+ SG+ P+ EEY++NG 
Sbjct: 362 LCLVNEINQIGYFVLRDKGFNVIPYLKESWADMCTTFLKEAKWYKSGYKPNFEEYMQNGW 421

Query: 367 VSTGVHVTLVHVFFLLGHQIRQQTVELLEG-DLDIVWCSATILRLWDDMGSAKDENQEGR 426
           +S+ V   L+H+F LL      QT+++L   +  +V  SATILRL +D+ ++ +E   G 
Sbjct: 422 ISSSVPTILLHLFCLLS----DQTLDILGSYNHSVVRSSATILRLANDLATSSEELARGD 481

Query: 427 DGSYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNREWLHCNPLFPSQFTQASLNIARA 486
               V+ +M E    S   ++ +I   I  AW  LN E   C       F +A+ N+ R 
Sbjct: 482 TMKSVQCHMHE-TGASEAESRAYIQGIIGVAWDDLNMEKKSCR--LHQGFLEAAANLGRV 541

Query: 487 VPLFYSYGRNQSFPTFEQ---HMERLLFHSLDI 494
               Y YG     P   +   H+  LL H L +
Sbjct: 542 AQCVYQYGDGHGCPDKAKTVNHVRSLLVHPLPL 564

BLAST of Cp4.1LG20g03980 vs. TAIR 10
Match: AT4G16730.1 (terpene synthase 02)

HSP 1 Score: 231.9 bits (590), Expect = 3.2e-60
Identity = 154/500 (30.80%), Positives = 252/500 (50.40%), Query Frame = 0

Query: 2   AVAKVESLKHVLRESGDCLERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPM 61
           A+   E ++  L E    +E+L++ID+ QRLGI  H + EI  +L++ +     ++ +  
Sbjct: 24  AILFKEEVRKTLNEIEGSIEQLEMIDSLQRLGISYHYKHEIHDILRKIHDQHGEIERETQ 83

Query: 62  IDLHKAALLFRLLTQQGFLVT---------------PKLRHDIKGLTSLHEASQLCMHGD 121
            DLH  +L F LL Q GF V+                 L  DIKGL SL+EAS   M  +
Sbjct: 84  -DLHATSLEFILLRQHGFDVSQDAFDVFISETGEFRKTLHSDIKGLLSLYEASYFSMDSE 143

Query: 122 DILEEAQNFSSHWLNAWVVVH---VNHHSATF----VHNTLLHPYHKTLPQFMLPNYFGD 181
             L+E + +++  L+ +V      +     T+    V   L  PYH ++ +     Y   
Sbjct: 144 FKLKETRIYANKRLSEFVAESSKTICREDETYILEMVKRALETPYHWSIRRLEARWYINV 203

Query: 182 NQWTNKWIHTLQGVAKVNFNTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIAS 241
            +  ++    L   A ++FN  Q   Q EL   + WW  + L ++L+F R++  + Y  +
Sbjct: 204 YEKKHEMNPLLLEFAAIDFNMLQANHQEELKLISSWWNSTGLMKQLDFVRDRITESYFWT 263

Query: 242 LSCLTDVCHSEQRIQLAKAIAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSC 301
           +    +      R  L K    + ++DDI+D++GTLEEL +FT VV +WD+   E+LP+ 
Sbjct: 264 IGIFYEPEFKYCRKILTKIFMLIVIMDDIYDIYGTLEELELFTNVVEKWDVNHVERLPNY 323

Query: 302 MRICFKSLFVVTNEISDQIYQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEY 361
           MR+CF  L+   N+I   + +  G N    L+  W  L K FL E++W+ +GH PS EEY
Sbjct: 324 MRMCFLFLYNEINQIGYDVLRDKGLNVIPYLKQVWTDLFKTFLTESKWYKTGHKPSFEEY 383

Query: 362 LRNGRVSTGVHVTLVHVFFLLGHQIRQQTVELLEGDLDIVWCSATILRLWDDMGSAKDEN 421
           ++NG +S+ V   L+H+F +L   I  QT+     +  +V   ATILRL +D+ ++ +E 
Sbjct: 384 MQNGVISSSVPTILLHLFSVLSDHISDQTLTDDSKNHSVVRSCATILRLANDLATSTEEM 443

Query: 422 QEGRDGSYVKYYMKEHPSTSFNATQKHIMWKISEAWKSLNRE--WLHCNPLFPSQFTQAS 478
             G     V+ YM E  ++   A ++H+   IS++W  +N +    H + L P  F  A+
Sbjct: 444 ARGDSPKSVQCYMYETRASEEEA-RRHMQSMISDSWDIINSDLKTAHTSSL-PRGFLAAA 503

BLAST of Cp4.1LG20g03980 vs. TAIR 10
Match: AT2G24210.1 (terpene synthase 10)

HSP 1 Score: 224.2 bits (570), Expect = 6.8e-58
Identity = 149/491 (30.35%), Positives = 246/491 (50.10%), Query Frame = 0

Query: 20  LERLDLIDAAQRLGIDNHLQEEIEAVLKRQYFLLNALQFDPMIDLHKAALLFRLLTQQGF 79
           L++L+ ID  Q+LG+  H + EI+ +L   Y        +   DLH  AL FRL  Q GF
Sbjct: 96  LDQLEFIDDLQKLGVSYHFEAEIDNILTSSYKKDRTNIQES--DLHATALEFRLFRQHGF 155

Query: 80  LVTPKL------------RHDIKGLTSLHEASQLCMHGDDILE-EAQNFSSHWLNAWVVV 139
            V+  +            R DI GL SL+EAS L    D  L+   + F++  L  +V  
Sbjct: 156 NVSEDVFDVFMENCGKFDRDDIYGLISLYEASYLSTKLDKNLQIFIRPFATQQLRDFVDT 215

Query: 140 HVNHHSAT-----FVHNTLLHPYHKTLPQFMLPNY---FGDNQWTNKWIHTLQGVAKVNF 199
           H N    +      V   L  PY+  + +     Y   +G  Q  N     +   AK++F
Sbjct: 216 HSNEDFGSCDMVEIVVQALDMPYYWQMRRLSTRWYIDVYGKRQ--NYKNLVVVEFAKIDF 275

Query: 200 NTTQRLRQYELNQFTKWWKESDLGRELNFARNQPMKWYIASLSCLTDVCHSEQRIQLAKA 259
           N  Q + Q EL   + WW E+ LG++L FAR++ ++ Y  ++  + +  +   R  + K 
Sbjct: 276 NIVQAIHQEELKNVSSWWMETGLGKQLYFARDRIVENYFWTIGQIQEPQYGYVRQTMTKI 335

Query: 260 IAFVYLIDDIFDVFGTLEELTMFTEVVCRWDLVAAEKLPSCMRICFKSLFVVTNEISDQI 319
            A +  IDDI+D++GTLEEL +FT     WD+   ++LP  MR+CF  ++   N I+ +I
Sbjct: 336 NALLTTIDDIYDIYGTLEELQLFTVAFENWDINRLDELPEYMRLCFLVIYNEVNSIACEI 395

Query: 320 YQKHGWNPATSLQTAWGKLCKAFLVEAEWFSSGHWPSAEEYLRNGRVSTGVHVTLVHVFF 379
            +    N    L+ +W  + KA+LVEA+W+ SGH P+ EEY++N R+S       VH + 
Sbjct: 396 LRTKNINVIPFLKKSWTDVSKAYLVEAKWYKSGHKPNLEEYMQNARISISSPTIFVHFYC 455

Query: 380 LLGHQIRQQTVELL-EGDLDIVWCSATILRLWDDMGSAKDENQEGRDGSYVKYYMKEHPS 439
           +   Q+  Q +E L +   ++V CS+++ RL +D+ ++ DE   G     ++ YM E   
Sbjct: 456 VFSDQLSIQVLETLSQHQQNVVRCSSSVFRLANDLVTSPDELARGDVCKSIQCYMSE-TG 515

Query: 440 TSFNATQKHIMWKISEAWKSLNREWL-HCNPLFPSQFTQASLNIARAVPLFYSYGRNQSF 488
            S +  + H+   I++ W  +N E + H + +    F +  +N+AR     Y YG     
Sbjct: 516 ASEDKARSHVRQMINDLWDEMNYEKMAHSSSILHHDFMETVINLARMSQCMYQYGDGHGS 575

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P0CV94 | 1.4e-140 | 50.99 | (3S,6E)-nerolidol synthase 1 OS=Fragaria ananassa OX=3747 PE=1 SV=1
P0CV95 | 5.8e-139 | 50.40 | (3S,6E)-nerolidol synthase 2, chloroplastic/mitochondrial OS=Fragaria ananassa O...
P0CV96 | 9.3e-137 | 49.80 | (3S,6E)-nerolidol synthase 1, chloroplastic OS=Fragaria vesca OX=57918 PE=1 SV=1
B9RXW4 | 4.9e-130 | 48.03 | Probable terpene synthase 13 OS=Ricinus communis OX=3988 GN=TPS13 PE=3 SV=1
B9RHX7 | 5.7e-118 | 46.67 | Probable terpene synthase 4 OS=Ricinus communis OX=3988 GN=TPS4 PE=3 SV=1

Match Name | E-value | Identity | Description
KAG7019658.1 | 0.0 | 83.10 | hypothetical protein SDJN02_18621, partial [Cucurbita argyrosperma subsp. argyro...
XP_023520214.1 | 0.0 | 75.21 | uncharacterized protein LOC111783517 [Cucurbita pepo subsp. pepo]
XP_022927129.1 | 0.0 | 71.38 | uncharacterized protein LOC111434068 [Cucurbita moschata]
XP_023001644.1 | 0.0 | 68.14 | uncharacterized protein LOC111495716 [Cucurbita maxima]
KAG6584051.1 | 0.0 | 62.74 | hypothetical protein SDJN03_19983, partial [Cucurbita argyrosperma subsp. sorori...

Match Name | E-value | Identity | Description
A0A6J1EH51 | 0.0 | 71.38 | uncharacterized protein LOC111434068 OS=Cucurbita moschata OX=3662 GN=LOC1114340...
A0A6J1KNB1 | 0.0 | 68.14 | uncharacterized protein LOC111495716 OS=Cucurbita maxima OX=3661 GN=LOC111495716...
A0A6J1EK53 | 0.0 | 92.91 | (3S,6E)-nerolidol synthase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111434067 ...
A0A6J1CE33 | 1.83e-241 | 69.82 | (3S,6E)-nerolidol synthase 1-like OS=Momordica charantia OX=3673 GN=LOC111010706...
A0A6J1CEY1 | 3.10e-220 | 67.59 | (3S,6E)-nerolidol synthase 1-like OS=Momordica charantia OX=3673 GN=LOC111010887...

Match Name | E-value | Identity | Description
AT1G61680.2 | 1.5e-110 | 43.43 | terpene synthase 14
AT1G61680.1 | 1.7e-109 | 42.83 | terpene synthase 14
AT4G16740.1 | 8.2e-64 | 31.58 | terpene synthase 03
AT4G16730.1 | 3.2e-60 | 30.80 | terpene synthase 02
AT2G24210.1 | 6.8e-58 | 30.35 | terpene synthase 10
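
The Identity column in these summary tables repeats the percentage printed in each HSP header in the alignments above (for example 819/1202 for A0A6J1KNB1). As a minimal sketch of how those figures can be re-derived, the Python snippet below parses an HSP statistics line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool; the regular expression also tolerates the spelling "Postives" that some exports of this page carry.

```python
import re

# Pattern for the statistics line printed under each "HSP n Score" header.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        # Percentages recomputed from the raw counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages exactly as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 819/1202 (68.14%), Positives = 837/1202 (69.63%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check between an alignment header and the corresponding row of the summary table.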
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PFAM | PF19086 | Terpene_syn_C_2 | coord: 1278..1334 (e-value: 3.0E-5, score: 23.9)
None | No IPR available | SFLD | SFLDG01014 | Terpene_Cyclase_Like_1_N-term | coord: 970..1121 (e-value: 2.5E-32, score: 106.5)
None | No IPR available | SFLD | SFLDS00005 | Isoprenoid_Synthase_Type_I | coord: 173..491 (e-value: 0.0, score: 255.1)
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1394..1418
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 4..486
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 526..593
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 4..486
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 1310..1385
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 1279..1311; coord: 686..738
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 739..900
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 592..681
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 1279..1311
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 1310..1385; coord: 739..900
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 915..1279
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 592..681
None | No IPR available | PANTHER | PTHR31225 | OS04G0344100 PROTEIN-RELATED | coord: 526..593
None | No IPR available | PANTHER | PTHR31225:SF148 | (3S,6E)-NEROLIDOL SYNTHASE 2, CHLOROPLASTIC/MITOCHONDRIAL-LIKE | coord: 686..738; coord: 915..1279
IPR001906 | Terpene synthase, N-terminal domain | PFAM | PF01397 | Terpene_synth | coord: 949..1109 (e-value: 4.9E-40, score: 137.3); coord: 16..127 (e-value: 6.1E-21, score: 75.0); coord: 592..659 (e-value: 6.9E-15, score: 55.3)
IPR008949 | Isoprenoid synthase domain superfamily | GENE3D | 1.10.600.10 | Farnesyl Diphosphate Synthase | coord: 739..900 (e-value: 6.1E-52, score: 178.5); coord: 181..491 (e-value: 1.3E-103, score: 348.3)
IPR008949 | Isoprenoid synthase domain superfamily | GENE3D | 1.10.600.10 | Farnesyl Diphosphate Synthase | coord: 1315..1389 (e-value: 3.1E-11, score: 44.8)
IPR008949 | Isoprenoid synthase domain superfamily | GENE3D | 1.10.600.10 | Farnesyl Diphosphate Synthase | coord: 1150..1279 (e-value: 2.0E-38, score: 134.2)
IPR008949 | Isoprenoid synthase domain superfamily | GENE3D | 1.10.600.10 | Farnesyl Diphosphate Synthase | coord: 1280..1314 (e-value: 1.4E-6, score: 29.6)
IPR008949 | Isoprenoid synthase domain superfamily | SUPERFAMILY | 48576 | Terpenoid synthases | coord: 686..894
IPR008949 | Isoprenoid synthase domain superfamily | SUPERFAMILY | 48576 | Terpenoid synthases | coord: 1141..1368
IPR008949 | Isoprenoid synthase domain superfamily | SUPERFAMILY | 48576 | Terpenoid synthases | coord: 169..478
IPR005630 | Terpene synthase, metal-binding domain | PFAM | PF03936 | Terpene_synth_C | coord: 174..436 (e-value: 1.8E-76, score: 257.0); coord: 1142..1276 (e-value: 5.5E-39, score: 134.1); coord: 739..862 (e-value: 7.0E-30, score: 104.3)
IPR036965 | Terpene synthase, N-terminal domain superfamily | GENE3D | 1.50.10.130 | - | coord: 591..695 (e-value: 8.5E-24, score: 86.0)
IPR036965 | Terpene synthase, N-terminal domain superfamily | GENE3D | 1.50.10.130 | - | coord: 3..180 (e-value: 1.4E-33, score: 118.0); coord: 543..590 (e-value: 1.3E-9, score: 39.8)
IPR036965 | Terpene synthase, N-terminal domain superfamily | GENE3D | 1.50.10.130 | - | coord: 944..1149 (e-value: 4.2E-49, score: 168.9)
IPR034741 | Terpene cyclase-like 1, C-terminal domain | SFLD | SFLDG01019 | Terpene_Cyclase_Like_1_C_Term | coord: 173..491 (e-value: 0.0, score: 255.1)
IPR008930 | Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid | SUPERFAMILY | 48239 | Terpenoid cyclases/Protein prenyltransferases | coord: 552..677
IPR008930 | Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid | SUPERFAMILY | 48239 | Terpenoid cyclases/Protein prenyltransferases | coord: 12..157
IPR008930 | Terpenoid cyclases/protein prenyltransferase alpha-alpha toroid | SUPERFAMILY | 48239 | Terpenoid cyclases/Protein prenyltransferases | coord: 958..1126
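
Each InterPro hit above is located by a `coord: start..end` range on the protein. The sketch below, which assumes those coordinates are 1-based and inclusive (the page does not state this), extracts the ranges from a line of text, computes their lengths, and tests whether two hits overlap. The helper names `parse_coords`, `span_length` and `overlaps` are hypothetical and chosen only for this illustration.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in order of appearance."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    metal_binding = parse_coords("PF03936 Terpene_synth_C coord: 174..436")[0]
    type_one = parse_coords("SFLDS00005 Isoprenoid_Synthase_Type_I coord: 173..491")[0]
    print(span_length(*metal_binding), span_length(*type_one))
    print(overlaps(metal_binding, type_one))
```

Under that assumption, the PF03936 hit at 174..436 lies entirely inside the SFLDS00005 hit at 173..491, which is the kind of relationship `overlaps` is meant to surface.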

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG20g03980.1 | Cp4.1LG20g03980.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0044237 | cellular metabolic process
biological_process | GO:0071704 | organic substance metabolic process
biological_process | GO:0044238 | primary metabolic process
molecular_function | GO:0000287 | magnesium ion binding
molecular_function | GO:0010333 | terpene synthase activity
molecular_function | GO:0016829 | lyase activity