Sgr026500 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr026500
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Descriptioncarotene epsilon-monooxygenase, chloroplastic
Locationtig00153031: 6044629 .. 6059437 (+)
RNA-Seq ExpressionSgr026500
SyntenySgr026500
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTTCCACACTCTTTCTTCCCTCAATCTCTTCCCTCTCTGCTCCCCTCCATAAACGAGGCCCTCTCCGGCGAAGACCCCCACCTCCATATCTCTCCATTAAATCCTCCATAGACGAAGGGAATCCACCCACCCCCACGAAGCTTAAAAGCTCCACCAACAATGCAAAAACTGGTTCCTGGGTCAGCCCGGATTGGCTCACCTCTCTGACTCGCTACATTACCCTAGGCCAGGGCGACGACTCCGGCATTCCCATAGCAAGTGCGAAGCTCGATGACGTTTCTGATCTTCTGGGCGGCGCTCTTTTCCTTCCGCTGTTCAAGTGGATGAATGATTATGGACCGATTTACAGGCTCGCAGCTGGTCCTAGGAATTTCGTGGTGGTTAGTGACCCCGCCATTGCTAAGCATGTTCTGAGGAATTATGGGACGTATGCGAAGGGGCTTATTTCTGAGGTCTCCGAGTTCCTGTTCGGGTCGGGTTTCGCGATTGCAGAAGGCCCGCTTTGGACGGTAATCTTTTCGCTAACTGTAGATTAACGAAGATAAATTAAGAATTCATCATCGTATATTGCATTTTAAATTGCAAATTCGAACTATTAAGTTACGGTTAAGTTATAATTCTGGCATATGAACATTGTCAGTTTTAATGTTTTAATTATTCGAATTTTCATTTGTTTGATTTTAACACTTGTACTTTAATAAATATATCGTTTTCATATTTAAGTTTTGATAGTAACCAAAAATTGACTTCGGCAAACCGTAATCATGAGATATATTCAAATGTCACACTAACTTCTAAAATGTGACAAAAGAATGAACGGCTAAAAGCCACTAACAATAGGGACTATAATTGTAATTGAAGTTTAACTATTTATTATATTATTTTTCAACTATATTTTCCCGCCCGGGCTCATGGGGTTCACATTTCATATTGATCGATGGGTACTGTAACAAAACAGTTGAGAATTTCTGTTTGTAACGTTAAATTATTATCAACCTCAAAAGGTTATTGTATTAAATTATGAGCAAGAGATTGCTTTGCTGTAATTTGAATCAGACAATTCCTGCACGTAGTACACTGTTGCTGGTACTTGTAGTTTTAGAACCTTTTAGTCTTTTGTATGCTTGATCTTCTGCAATCTGTATGAAGTTGCTTCTTTGCTATACATGTTTGCTGTGAAAGATGTTTTATAGTTTCCAGTGGAAGCAGTATATGGAAAATAATCTTTTGAATGAAGTATTGGTGGGTGAGTTTTAGAGCATGAATGCAGAAGGGAAAGTCCTTTGAAAATGTTGGAAATTCTATACCTTTATGCATTCAGACGTGCTTTTCCTACCCCTTTCATTGAATCTTTTTAGAATACTGATTATTGATTGAAATTCAGCTTTATATGGAACTCAGTTTAAAGACAATGAAGTTTTAGATTATTGTACTACTTGAATTATGGGAGAAAGCAAAAAGATTTAAACCTGTGGGATGCACACGTTTAACTGAAAGTGTTCCAATCATTCAAATTTTCTCCATGAGTCTCACTGGACCTCGGATGTTATATTTTATATTAGGTACGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCGGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAAAGAATGCATTAAATAATAATTCTGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCTCTGCTGACAGCCCTGTGATTGATGCAATTTACACTGCTTTGAAAGAGGCAGAGGCTCGTTCTACAGATATTTTACCATATTGGAAGGCACGTTTCCTTATGTATCTTGTTCTGCTTTTGACTACCATTTCTTTCAACTCATTTTTCACTTTTGCTTTTTGTTGTTGCAGATTAAAGCTCTATGTAAGATAATTCCAAGACAGATAAAAGCTGAAGAAGCAGTTACAGTGATCAGGAGAACTGTTGAGGAGCTAATTGCCAAATGCAAAGAAATTGTGGAAACTGAGGGTGAGCGTATCGATGAGGAGGAATATGTGAATGACGCTGATCCTAGCATCCTTCGTTTTCTGCTGGCCAGTAGACAAGAGGTCTGCTCTTGATAGCTTTAAAGTCTTCCGTTCATCACTGACATTTTGCAATGTAGTTCAAGCATTTGTTGTGCTATTTCAGTTCAGTGTTACATTTCCACTTGATGTACACTGAAGTCAGACAATTTCCTTGATATTTCTCAGATTCTTGCTAAGTACAACAATTATCCACTTTTATCTTGTTCTGTAAGCAGTTGATTCTTAAGTATGCTAAGTAACTGATCAAGGAATTTATTTCATGATTATGAAAGAGGTCCCATCGAATGGTGGGCTGAGATTATTATAGGTTTTGATCATGTTGTGAATTAATATACTAATATCGGGAAAGAAGCTTTTTTCCCCTTCCTATAAAGTTTTGTTTCTACAAAAAAGAAAAATGGAAATAAAAAATGTTCTAACAACGAGGCTGTTAGGAATACTAGTAGGGATATATTAGGTATATTAATGGTGATATAATTGTAAATTACCTAAGAAGTTTGTTTGAGTTGAGTGGTTATAAATAGAGTGAGTTAGGAGATTTAGAAGGTAGTTAGAATTTGGAGGGTAGTTTTCAGAGAGTCTAAGCCTCTCCCAAAGTGTTAGGGATGTTTTAATTTTCCTTTCTTATATTAAAAATTTTTCTGTTAATTTCTATTTTAATTTCGGGTTCCTACAGGAGGGCTGAACACTGGAATTAGGATATTTTGTTTCTCTTACCATATGTCAAACGAATGCTTAGTTCTCCCTCAATTTGCTAGATTCTTCCTATGGCGTGCTCATAATATTGGAAAAACCACTTATCCTTTAATGATCTGCCATGTTCTGTCACTGATATTTGCCTTTTGTGTCGTTTAGGTCTCAAGCGTACAATTGCGAGACGATCTCTTGTCAATGTTAGTTGCTGGACATGAGACTACTGGTTCTGTTCTGACTTGGACACTGTATCTCTTAAGTAAGGTAACCTTGTCATTTGCATATATGTAGAAATCGATAACTTTCACCCATTTATAGATCCTTGTCCCTTAGTAACTTTTTTCCTTTCTTTTAAGGGAAAAATTCATTTATGTCCTGTATTATATCAGTGAGTCAATTTTAATTTATAATTTCGTTGAATGGAACCCTAAATTTGAATAAGTGGTGTAATTGTTACTCTCTCCAAACGAAATTATCCCAAACTCAGCAAGAAAAATATTCAAATCTAGCCATAAAATCAAACCAGTAAAAAAAAACTCAAACTTAACACCAAAATGCCCAAACCTAGCTAAAAAGCATCCAACACAATCAAAACAATCATTATCCGTTAAAGTAGTTTTGTCCAGCTAATGCCCATAACTAGCATAAATCTATAAAAATATGAAAACTCAACTTATGAAGGTATAATTTCACTACCATTGACTCACTCAACTAGTATATGCGTAAAAAAAAAATGAAGCCTACCTTGAAATCACAAATGCCTTAGTGTGGTGGCTTGGCATGTTTGCCACTTGCCAGTCATAATGGGTCAAATTACATCTCTTTTTTATAATGAATAAGAAGCGTGAATGGATTATGAAAATGGTGAGGAAAGATTTTGGCTATTCACCATGATTTCTGTGTTTGTACTTTGTAGTTCTATCTTTCTGTTAATAATAGCTATCTTCAGGTACATGATTTAAATGAAGTTGTAGCTGGGGATTTTTTGTTTAATTGTGAAATTGAACTTTTTTTCTTTTCTTTTTATGAAAAACCAAGCTTTTATTGAGATGAATGTAAGAATCAAAGGGGAAAAATAAAAAAGAGGAAGGCCAAAACAAAAAGAGTCTGTATATAGAAAACTAACTACAAAAGATAGAAAGTTGCAAAAAAAAAGGGACTCCAATTGTTGAGAATAAGAGACTAATGATACTCACAAGAGATTAATGATACTAAGAAAATCCTTAGCTACAGAAACTCAAAGAGAAGCATTGAATCTAATGAAAGCCCAAACCTTCTTGAGAACTCTCCAGCCTTCCAAATATCTCCTATTTCTCTCCAGCTAAATATTCCATAATTTGAGTTCACTTCAGTTCAATATATATGTGCGTCTGTGCATTGACATTTGCTTTGTGGTTTCTTGAACGTGATCATCAATCTGCTCTAAGTTGATTTCTTTCATTATATTTTTCACTTTAGCAAGTTGTTTTTTATATGATTTTTTTTACTATTATTCTGCAAATCAGGATTCCTCATCATTGATCAAGGCTCAAAATGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGACACAAAGGAGCTTAAATTTTTGACACGTTGTATCCTCGAGTCAATGCGTCTCTACCCACATCCGCCTGTAGGTTGCAATACTTGGTTCTCATCCAAGTGAATTATGACATGAAGAGAGAAGAAGGAAAAATATGAATATTTATTCATCATTCTGCTAATCGAGTTTTTATCTTCATGTTTCTTAAACCTCTCAAGGTTTTGATAAGAAGAGCTCAAGTTGCTGACGTACTGCCCGGAAATTACAAGGTTAATGCTGGTCAAGACATCATGATTTCAGTATATAATATCCATCACTCTTCTCAGGTACATACTTCAAGGGTCATGTTTCTATGTTTGTGTTAGATAGACTCTTTTGTCCATTCTGGTTCAAGCTGCTCAATAAAAAATTCAGAAGATGTTTTCTGCTGTTATGTTGGTATACTGGTAGTAGCTGATAGAACTCAGTTGAAGGTTTCTGAGTATTGCTTGCAGGAATGCTTACACTTCATCAACCAGACATCACTTTCGATATCCAATTAGTTAACTGTTTTCCTTAATTAACTCCCCTGAAATGATGGCCTCAGAAATATTGAACTTGCACATGACATTATGTCTTTCTGCTTATTGTTCTCCTGATAGTAATTTGACAATTTCATCTTCCATGAACAGTTCGAGGATTTCTTACCAAATGAAATAACTTAGTAACAGTAATTAATTACCATAATTAGGAATTGGATGATAAGTTATGGCTCAAATTTGAGCAATGGAGGGGGAGGCTTTCTTTGATGTATATAATTTAAGGTGAAATTTCTTGCATGTTTGTGTCTACATTCTACGACATCCTTAGAAATCACAAACACATAAATACATATAAGACATCAGGAAAGGCATCATTTTCTTAAAACAGTCTCAAAGAATAAGTCGTTGGTCATTACAAAAGTCAAAATCAACCATGCGATAATGGCAAAACAAGCAAAATCCATAGAAAACTCAAATACAAAAAGAATACGTACTTACCAAAAATTAAGTAAAAATCATTGACTTTGTCTCTAGTGCTTTCCTTTTTTCTTTTTTGTTGCTAGTTTGGATATATATATTCAACATAGGGATAGGGACAGTCGAACATATTCTTTTCTTTGTAAAGGGTTCTTTTTGCAATATTATAACTCATACTTATGTTTCAAGGTTTGGGAACGTGCACAAGAGTTTATACCAGAAAGATTTGACTTGGATGGTCCTGTGCCTAATGAAAGCAATACAGATTTCAGGTACATATCCTGTTTGTGTTTGGGAACTACACTGTTTTCAAAAGCAAATTAGGCACTTGCCAGGCGCTCATAGGTATGTTTGGTGCGAGATGCTTGACCTGAAGCAAGTCATCCTCTTTAAGGCAAGCACCTAATACACCTGCTGACACGCTTGAATGGCAGGCATTTGAATGCTCTGGCACAACAGACACTCTCTTTGCGTGGCTGGCAGAAAACAAAAGCCAACATTATTTTCCTCTCATAGAAGTATTTGTCAGAACCCTTCCCGCACTTCCCTTTGGATGGTGCCCGTCCATCTCTCTCTGGCAAGAAAGTAACCAAGGACCAACATTTTTTCTCCTTCCTGCATATTAGTTTCCTAGAAATCCTTTGCCCCATTGTCTTTTTGACAGTTTCTGACAGTGCCTATCCATTTCCTGAGGTTTCTATTCCCCCTCTCCCTTTCCCTTTTTTATGTTCCCTCACCTTGATTTCACTTGTTTGATCTATTTATTGGCAATGAATTTTAGTTTTTAGTTATTCAGATTGTGAACTCAATTTCAAAAGCTTGAACTTTGTTGTATATATTAGTGACACTACTAGAGCAACATAATATTTGGATCATTTGCATTTTTGACTTCCATTTTGAATGATGCATGCTTTTGATCTGCCATCAGACATAGCTACTCAATTATCCTTGCTAGTTGACCGACCTGAATAACCAAATCATACAACACAGATTTATTCCCTTCAGCGGAGGGCCCCGAAAGTGCGTTGGTGATCAATTTGCACTGCTAGAAGCTATCGTTGCGCTTGCCATATTTCTGCAGCACATGAACTTCGAGCTGGTTCCAAATCAGACCATTGGGATGACTACTGGAGCGACTATACATACAACAAATGTAATGTTTTCTATGCTCGTGCAAGCATTGCTTCCTTTGATCCTACATTTCTCCCTAGAAATACCTACAACGGCAATTTCTTCTGATTATTATTTCAACAATTCATTTCAGGAATTGTGTGCTTGATATTTTAGCCCATGCTTGATAATTACTTTTTATAATCGCTTTTGAAATCTTAGCCAAATTAATATATATATATTTCTGAAAATTGTCTTTCTTAAGACCGTTTGATAAATCCATGTAGATCTCAATTTTGAATAATTTGTATTTTGTTTGCTTAAATTAGTATTGTCAGATCAAACTTTCGGCTGATATTTGAAAGATTATTACTGTAATAATGAGTTTGCTTATAATTTATAAGAAGAATAGATGATGAAATTGTAGAAAACTATGATTTATATGGACTACTAATACTTCTGGGTGCTAATTGCAAATGCAGGGTTTATACATGAAACTCAGCCAAAGGCAGCCGACCCCAGCTTTAGCTTCCTCTGCTTCAAGGTAAATGTTGAAAATGTACATAAATTCTGGTGAGGGAAGAGAGAGAAAGAGAGATTGACATTTTTATTTATTCTGCTTATTCTTTTGTTTATAGACAATCACATGATCAAATATGCAAATCCAATACATTAAAGAGTCGTGTGGTTTAAGATTTGGGTAGAAATAAGATGCTTGTAAACACAACCTAATCTAGAGGGATTTTATTTGAATTTGTACTAACCAAAACAAACTTATAACTCCATTCATCTCCAATGGTTAGAGTTGGGTTGATTAATCTTTTCGATCATGAGATTTTCTGAACACTCGTATTAAACAGTAAAAAGTCAACGAAGAAAAAAGAGAAGGATTGAGTGGCATCTATTATGTTGGGTACATGTGGAAATGAGAGAAAATAAAGGCAAAAAGAGATAAATGTAGTCACATGTTGAAAAGAGAGAGAAAAAAGGAAGCTAAAACAAAAAGGGGAGTAAGCCCACCCATTTTAAAATTAACTTTTGAAATGGGCTGTCCTAACTTGTTAGAATGACCCAGATTTTTTTTTTGCCCCAAATTAAAATTTTGGACATGGAAACAAATTTAAAAGGCTTAGATCACAAAAAATCATCTCCAAAATTTAAGAATAAAAAGGAATAAAATCTAGTACACTGAGATACAATTTAGCATCAATGGAGATGAGTTGGTGTGGTGTCATGGGATGTGGGGGAAAAAAAGATGCCAATGCCAAAGGCATATCCCTTGGAAGATGCTGAAAATTGCAGCCTTTTCAGTCGCTTTCTACTTTTCTCTCGATGTTCCCTTTAACAAAAAGTGGCCACAAACTCACTTCTTAGGTCATTTCTACCCTTTTCATTGCCTTCATAAACCAACCACATCCTATATAAAACCCAACCCAACCTTGATCATCAACCCCTCACAATTGTACCAACTCCACCGCCCCTGATGGCAGTGGCCGGAGAAGGAAGCCGGGGTGGCTGCTATGGCTTTCCATGGTGATGCCTCTGCTGCATATTTGCCTCACGTCAGGTGAGAGGTGGGAGATTGAAAGGGCAACTAACATCCTTTGGCATATTTTTCCAAAACTTTCCTCTGAATCTCACTATGCACTTTTCTGGTGATTTTGGGATCCAGGAACCTCTGAACAGGGCAAGTCAGATTATGGCCACCCCTCTCCCCCTCTCTTCTTCTTCTCCTTCTTTCTTGTTTCTTTTTTCTTTTTAAAAGTTTTAGATTGGGGTAGAATGAATAGCTATATCCTTTTCGCAAATGTATGATGGAGATTCTTATGACCATTTGAGTTAACATGTCGATTTCTCAACATCAATGTATGTCCAAGACATTTATGTATTAAAGTACTAAAGAATAATTGTTTACTAGTAGCATATTGAAAAGTGAGTTTTTTTTTTTCTAAAATTTTTCAATGTTTTTTAGTCAGAATTATAGCAAATTGTTACCATGCCTGTAGACACTAGTTCAAGTCTTGACTATGTCAACAATTTTCTCCCAGAAATATGTCAATGCAAATATTCCAAACTTATAACTGCTCAAATTATTGAATAGAAGAATGGAATAAAGTTTTAAAGTCTTGTTAGTTCATCAACGTAACTTTCGAATTGAGTATAGTTTAAATGGTTAGACATTAGTGTCTCTTATTATGGATTGTAGGATTGAATCTACCCTCCCTATTGTAATATAATATTTTAAAGAAAAAAGTTCATCAGCTTATACCATTTAGAATCAATTTCATCAAATCAAAATATAATCAAATTTTACAAATTAAAACGATCTCAATGTTAGTTGAGCAAGCCCTAGAATTCCTAAAAGGAGGGCTAAACATCATTTTGTCAAAAACTCTATATAGCTTCTTCAAGAGAACATGAATGTGCTGGGGTTGTTGCTTGACAAAAAGGGATATTATAAGAATTAACTACATTTGGCCAATGCAGAATATGCAACTATTATTGTGAAATGAGAGAAGAAATAACGTTAAAAAAATGGTAGCCAATATGAGCATAGTTATGGTTAAGACATTAGTAGCTTTCATATAGATCGAAGATTTAAATCTTCACATTTACAATTGTATAATGCTATATTCTAAAATAAAAAGAAAAATGATAATAATTATATCGAGATTTAAAAGAAAATTGATCTTTACAAAAAAATAATAAAGACATAATAATTTTATTGAAATTTAAACAAGTATTAATGTTTACAGAAAAAATATAGTTATAATTTTGATGTTTACAGAAAAATATAGTTACAATTTTTAAGATATGTTTTAGGTGCACGAAAGATACATACGGTATTCTCACGAAAAGATGAGACATAATCTCGCTGGAGCATACCTACATCAATTTTTCTGTGATTTTTTTTCTCTTTACTTTCTTTGTTATATATATATATATATTTTTTTTGGAAAAGAAAAATACATTTAAGCAACTAAACTAAACTAAACAAACATCCAACTGACACGAATGAAGCCAAGGCCGAAAAGGGGGGCTCCAGCCACAAGCCTGATCCCCAAAAGTCCAAAACTTGCTAAGGGAATGAAGTTGAACCACAAAATTGAAGTTGCCTTGACAAAAATAAAGGAAAAGCTATCAAATTTGAGAGAGATAGATCTCATGTTCTCAACTAACAAGTTGATCTCGGAGAAGTTCTCTGTTAGCTGGTCACCAAACTAGTTTTTAATAAGCAAGCAATCAATTTTGCAATAATACTTTTGAACTCTAAATTACGGGTATACTCCAGCGCCTCCCAAGTCGCAAATGCTTTAGCCATCATCACCCTTTTTTTTAATAAGATCATCATCACACTTTTTTTAATAAGATCATCATCACACTTTGACACAACTCTCCCTCCCCCTTTCCACTTATTTCAAAAGAAAAAATTCTCTTTCAAATCTTTGTTTCATCTCCGTTGTTACCACCACCCTTCCATGACTGCTAACTTCGTCTTATTTCCTTTAATGGCGCAAGGCCACAAGATTCCCATGGTGGATATCGCCAAGCTCTTGGCTGAAAGATCAGGCATCATTGTGACCATATTCACCACTCCAACAAAAACAATATTGCAGCAAACACATATATTGCAACAGTTAAAATATGCTGCTTCCAGCGGTTTACAAAACTGCTGCAATTAAGCTTTCAACTGTGGATTTCAAAGCGGTTTATGAACCGCTGGAATAGCCTTTGCAGCGGTTAGAAACTGCTGCAATAGACTTTGACAACCGCTGCAATAAGTGGTTTGTTGTAGTGCTTCAACAGCCAAAAAAGAAAAAAAGAGAAAAAAAAACATCATCAAGGACAAAAAAGCACAATATTTTAAAAATAGTTCAACAAATTTGTGTGTATATCTACTTGGTAAGCGTTAAAAAGAAAAAACGGAAAATGGTGGTCACCTAGGTAGTGTTCAATAATTCAATAATTTTTCCACCTATATAGTGTTCATTCTTACTCTGCCGACAATGTAGTTGAACAACAATCTGGATAAAATTTTGGGACGAAGAAAGTATATGGCACTGCAGATAACTTCTGTAGCTTTCCTGATAGCTAACTGTCCTGATGACTGTGAGTGTGAACTACATGAAGCACTTGGACCAGCATTTCATGAACAGAAACAGACTAATGAGTTCTCGTATGATTCATCTGTACTTACAAAACCAAGCTGAGAAACTGAATCACCTGGACCATGAAAATGGTTGTGCTCTACATCAAAATAAATGCCTTGTAGCCTTAGAAATTAGTTTACTGATTGTGCTCTTTAAATGTTTTGGAATGCTTTTACAGGATTCTGAATGTTTCAGCAAAATGGGGCTGGCTAGTTGCACTTGGGCCTTTGATATTGGAAGTGAGCTCCAAGTTTGCCCCAGTAGAAGATCTAAAATAACAGGGACATGCTTATAAAGGATTTTGTTTATTTTCTTTTTAAATGTCTTTTTTCTTAGGAGCTGCCAGAACATCCTTTAGATGATATGCAATGATGTGAGACTGTTCTTAGAGTTGGTTTGAATAATTTGGAATTTGAAATTAACAATATTGAAGGGTGTGATTGAACTTCATTCAAATAACTCTTGGGCTCATTTCATTGTCGAAGTATGCAGCTTACTATTCAATTTGGCAAAGGTAAAGTTAAATATCATATTGAAAATTTTTGTTGAATTTATCACAGGCACAAAGAGACTTTCATAGAAGTGGTGGTTTTATTATTTATTAATGAGCTCAAAATGGAAAGATACAATTTAAAAGTGCTATTAATTTTTTTCACTAAAAGAAATTATTGGTGATAAAAAATCAATAGGTGCACTTAGACATCCCAACTATGTTGACACACTTTTAGCACCCTCATCATCTAACTCATCCCAATATATATTTTACACGTGTCATCTAAAATTAAAAAAAAAAAAAAAAATTAGAGAAGACCACCATCGTGTAGCACAAACATTTGTTACAAGTTGAATGCTGATGCCAAAGATCTTGAACCGTTGCAAAGAAATCATGAAATTATGAAAAATTATGAGTAAAGGAAAAAAACACATGAAAAAAAGACAAGTTATGAAAATCATGAATATAAGAATGTTTATAAAAAACAAACTCACATTTATAATATTATCAATGATGAGTAGAGTTTGCATTGTTGTACAATTCAAGTTCCCTCTCTATTTCTTTTATGATCTTCTGTGAACTTCATGCTTTAGGGATCTAGATCACGGGATTCCATTATTATGCTATAAGCAAAATTGAAATAATCTTTCTTATGGTAATTATTAAAAAAAATTAATCTCACAAGTATAATACTATCTATGAAGAAACTCAGATGAGGCCTTAATGCTATTCATTATCAAAATTACAATCTTACCATATATTAGCAGCTCCATAAGTTCCTTTGTTATCTCATTAAACTTTATGTTTTTGGGACGGGGCTGGGCTGGGCTGGATGCCTTTTCAATATTCTTCTCCGCGAGATTTTAATCCCCATCCCTAATATCATAGGAGATTTCTATTGTCTGTTACTTTTTTTTTTAATTTTTCAATTAAATGTTAAATAACATACAAAATATAAAATTTATAGAAAATTAGTTTATTAAATTATTTATTATTTTTATAATTAATTTTATATTTTTTTCATCTATGAATATAAAACTTAATAAATATAGTAAAAAAATTATTAAAAAATTTAGAGACGGACGGGGAGGGAACACATTCTCCGTCCCCAAATGAAATTGAGGAAAAAAATTTAACATTCCTGAATTTTTAATTGTTATTTTAGGCCATTATTGTTACGATCACGTTGAAAGAAATCTAGAAAGGATATGCAAATTTCACATTCCTGAAATTTTTATCGTTATTTTAGGCCATTATTGTTATGATCACGTGGAAAGAAATCTAGAAAGGATATTGCAAATTCCAAATTCCAAATTCCAAATTCCCAAACCCTTTTCAGTCCTATCTTAAATTCTGTTTAAATCTTTGACTTTGTACTTATTTTAAAAGAAAAAGAACTCTTTTCAACTCTCTCAAATCACTGTTGGATCTTCGTCCTTCGAACTCATGGCTTCAAACTCCTCCGTCCACTTCGTCTTGTTCCCTTTAATGGCTCAAGGCCACATGATTCCCATGGTAGACATTGCCAAGCTCCTCGCCCAAAGACCAGGCGTCACCGTCACCATATTCACCACTCCCCACAATGCCGCCCGCTTCCAAAACTCACTCTCTCCTAACCTCCAAATCCGACTGCTTCTCCTCCATTTTCCCACCAACCAAACAGGACTACCAGATGGCTGTGAGAATCTCGACCAGCTCCCGTCCTTAAACTTCTCCCTCCCTTTCTTTTCCTCCCTCGCCTTCTCGAAGACCCGCTCAACACCTCTGCGAAACCCTAACCCCAAAACCCAGCTGCATCATCTCCGACTTGTGCCTCCCATGGACGCTCAACATCGCTCGCAAGCTTCGCCTTCCTTGGGTCTCCTTCCACGGCGGGTCTTGTTTCTGCGACGTGGTTATGAGTAAGATTCGAGGCTCCGGGATTTGCACAAGTATAGCCTCTGAGACGGAGTACTTCAAGGTGCCGGAGTTGCCGGATGAAATAGTGATAACCAAAGCTCAGTTGCCTACGAGCATTTTAAGTGGCCGGAGCATGATGAAGTTTCTGCAGCAGATCGGAGAAGCTGACATGGCGGCGTATGGGATGGTTGTGAATAGCTTCGAAGAGCTGGAGTCTTCTTACGTCCAAGAGTTGAAGAAGGAGAGAGGCAATAGAATCTGGTGCATTGGACCTGTTTCCTTATGCAGCAAAGACAACGTGGACAAAGCTCATAGAGGTACTGTGTTTAGTTTTTATTATAACTTTCCAATTGTTCGTTTAATTGCTTCAACTTACATCTATTGGTATGATGTTGAAGTCTAGAGCGGAATACCTGTAACAACTTGAGCAAGAATGACCTTCCTCAACCGAAACTCAAAATATGTTCGATGTATTGGGATCCTAATTCTAATGTTCTATATATAAATTGCAAACTGCACCTAGCCATATATCCTTTCTTGTTAGCACCCTAGCAACCTGCTCATGTGTTCTTCTAAAACGGGTAAAGCGAATAAGGAAGGAGGGAGACAATTGTCCCCTCTTGATTTGTCTTAGATTAAAAATATTATGAACAAAAAGATTAAGATTTTAAAGGTTTCTTTCTAATTTGTCCTTTAAACTTTTAGAATTATTGAAAGATTATATATAAAGTTAAAAGGTAAAGTTTTGTCCTTCATTATAAAAATTTTTGGCTCCACCCCTAGTTCAAAAATAGGTCTTGTATTCATGCGCCTGTTCATCACAATTCCACAATCATATAACTTCTCCTCAAATGCAAGTTTCCCATGTTGTGTTCTATAATGCAAGTTGTACCAAGACATTTCGAATTTTTAAGTCTCCTTTTCTATCCTAATCCCTTTGAGACACATGATAGTTTCGATTAAAACCTTCTAAATTATGATAAAAGTCCTATCACAATTCTTAATTTTCTCTACTTCCGATCCATATAAATACAATACTAAAATACCCAAGATTCATCTTTTCACATGAAGTAAAATTTCAACAGGCAAACCTCCACCACAGAGGACAACAATCATCACATGAAGTGGCTTGACTCTCAAGAGCCAAACTCAGTGCTCTATGTTTGTCTCGGAAGCCTCCACAACCTATCAACTCTGCAACTAATAGAGCTAGGGTTGGGCTTGGAGTCATCAAACAGGGCATTTATATGGGTGATAAGGGCCTCAACCAAGGTGGAAAAGTGGATTCAAGAAGAGGGTTTTGAAGAAAGAATCAAAGGGAGAGGGCTTCTTATAAGGGGTTGGGCTCCACAAGTTCTAATATTATCCCACCCTGCCGTCGGCGGCTTCCTCACTCACTGCGGTTGGAACTCAGTTCTTGAAGGAATCTCTGCTGGCTTGCCCATGGTGACCTGGCCTCTATTCTCAGAGCAGTTTCTGAACGAGAAAATGGTGGCAGATCTTCTAAATATAGCAGTGAAAGTTGGAGCAGAATATCCAGTGAAATCATAGAAAAGAACAAGAAGAAGAAGAGAAGGGGATACAGGTGAAGAAAGATGATGTTAGGGAAGCCATTGAAAGGGTAATGGGGGAAGGAGATGAAGCTGATAAAAGAAGGAAGAAAGCAAGAGAAATTGGGGTGATGGCAAAGAATGCCTTGGAAGAAGGTGGGTCTTCTGAGCTCAACATTATATTACTGATTGATGATATATTGAAAGAAGAGGAAATGAAGAATCTTTCATGTTAG

mRNA sequence

ATGTCTTCCACACTCTTTCTTCCCTCAATCTCTTCCCTCTCTGCTCCCCTCCATAAACGAGGCCCTCTCCGGCGAAGACCCCCACCTCCATATCTCTCCATTAAATCCTCCATAGACGAAGGGAATCCACCCACCCCCACGAAGCTTAAAAGCTCCACCAACAATGCAAAAACTGGTTCCTGGGTCAGCCCGGATTGGCTCACCTCTCTGACTCGCTACATTACCCTAGGCCAGGGCGACGACTCCGGCATTCCCATAGCAAGTGCGAAGCTCGATGACGTTTCTGATCTTCTGGGCGGCGCTCTTTTCCTTCCGCTGTTCAAGTGGATGAATGATTATGGACCGATTTACAGGCTCGCAGCTGGTCCTAGGAATTTCGTGGTGGTTAGTGACCCCGCCATTGCTAAGCATGTTCTGAGGAATTATGGGACGTATGCGAAGGGGCTTATTTCTGAGGTCTCCGAGTTCCTGTTCGGGTCGGGTTTCGCGATTGCAGAAGGCCCGCTTTGGACGGTACGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCGGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAAAGAATGCATTAAATAATAATTCTGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCTCTGCTGACAGCCCTGTGATTGATGCAATTTACACTGCTTTGAAAGAGGCAGAGGCTCGTTCTACAGATATTTTACCATATTGGAAGATTAAAGCTCTATGTAAGATAATTCCAAGACAGATAAAAGCTGAAGAAGCAGTTACAGTGATCAGGAGAACTGTTGAGGAGCTAATTGCCAAATGCAAAGAAATTGTGGAAACTGAGGGTGAGCGTATCGATGAGGAGGAATATGTGAATGACGCTGATCCTAGCATCCTTCGTTTTCTGCTGGCCAGTAGACAAGAGGTCTCAAGCGTACAATTGCGAGACGATCTCTTGTCAATGTTAGTTGCTGGACATGAGACTACTGGTTCTGTTCTGACTTGGACACTGTATCTCTTAAGTAAGGATTCCTCATCATTGATCAAGGCTCAAAATGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGACACAAAGGAGCTTAAATTTTTGACACGTTGTATCCTCGAGTCAATGCGTCTCTACCCACATCCGCCTGTTTTGATAAGAAGAGCTCAAGTTGCTGACGTACTGCCCGGAAATTACAAGGTTAATGCTGGTCAAGACATCATGATTTCAGTATATAATATCCATCACTCTTCTCAGGTTTGGGAACGTGCACAAGAGTTTATACCAGAAAGATTTGACTTGGATGGTCCTGTGCCTAATGAAAGCAATACAGATTTCAGATTTATTCCCTTCAGCGGAGGGCCCCGAAAGTGCGTTGGTGATCAATTTGCACTGCTAGAAGCTATCGTTGCGCTTGCCATATTTCTGCAGCACATGAACTTCGAGCTGGTTCCAAATCAGACCATTGGGATGACTACTGGAGCGACTATACATACAACAAATGGTTTATACATGAAACTCAGCCAAAGGCAGCCGACCCCAGCTTTAGCTTCCTCTGCTTCAAGTGGCCGGAGAAGGAAGCCGGGGTGGCTGCTATGGCTTTCCATGGTGATGCCTCTGCTGCATATTTGCCTCACGTCAGGAACCTCTGAACAGGGCAAGTCAGATTATGGCCACCCCTCTCCCCCTCTCTTCTTCTTCTCCTTCTTTCTTGTTTCTTTTTTCTTTTTAAAAGTTTTAGATTGGGGTAGAATGAATAGCTATATCCTTTTCGCAAATTTGAACAACAATCTGGATAAAATTTTGGGACGAAGAAAGTATATGGCACTGCAGATAACTTCTGTAGCTTTCCTGATAGCTAACTGTCCTGATGACTGTGAGTGTGAACTACATGAAGCACTTGGACCAGCATTTCATGAACAGAAACAGACTAATGAGTTCTCGATTCTGAATGTTTCAGCAAAATGGGGCTGGCTAGTTGCACTTGGGCCTTTGATATTGGAAGTGAAGAAAGATGATGTTAGGGAAGCCATTGAAAGGGTAATGGGGGAAGGAGATGAAGCTGATAAAAGAAGGAAGAAAGCAAGAGAAATTGGGGTGATGGCAAAGAATGCCTTGGAAGAAGGTGGGTCTTCTGAGCTCAACATTATATTACTGATTGATGATATATTGAAAGAAGAGGAAATGAAGAATCTTTCATGTTAG

Coding sequence (CDS)

ATGTCTTCCACACTCTTTCTTCCCTCAATCTCTTCCCTCTCTGCTCCCCTCCATAAACGAGGCCCTCTCCGGCGAAGACCCCCACCTCCATATCTCTCCATTAAATCCTCCATAGACGAAGGGAATCCACCCACCCCCACGAAGCTTAAAAGCTCCACCAACAATGCAAAAACTGGTTCCTGGGTCAGCCCGGATTGGCTCACCTCTCTGACTCGCTACATTACCCTAGGCCAGGGCGACGACTCCGGCATTCCCATAGCAAGTGCGAAGCTCGATGACGTTTCTGATCTTCTGGGCGGCGCTCTTTTCCTTCCGCTGTTCAAGTGGATGAATGATTATGGACCGATTTACAGGCTCGCAGCTGGTCCTAGGAATTTCGTGGTGGTTAGTGACCCCGCCATTGCTAAGCATGTTCTGAGGAATTATGGGACGTATGCGAAGGGGCTTATTTCTGAGGTCTCCGAGTTCCTGTTCGGGTCGGGTTTCGCGATTGCAGAAGGCCCGCTTTGGACGGTACGCCGTAGGGCTGTGGTTCCATCTCTTCACAAGAAGTACTTATCGGTTATTGTTGATCGAGTATTCTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGCAAAAGAATGCATTAAATAATAATTCTGTTAACATGGAGGAAAAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCTCTGCTGACAGCCCTGTGATTGATGCAATTTACACTGCTTTGAAAGAGGCAGAGGCTCGTTCTACAGATATTTTACCATATTGGAAGATTAAAGCTCTATGTAAGATAATTCCAAGACAGATAAAAGCTGAAGAAGCAGTTACAGTGATCAGGAGAACTGTTGAGGAGCTAATTGCCAAATGCAAAGAAATTGTGGAAACTGAGGGTGAGCGTATCGATGAGGAGGAATATGTGAATGACGCTGATCCTAGCATCCTTCGTTTTCTGCTGGCCAGTAGACAAGAGGTCTCAAGCGTACAATTGCGAGACGATCTCTTGTCAATGTTAGTTGCTGGACATGAGACTACTGGTTCTGTTCTGACTTGGACACTGTATCTCTTAAGTAAGGATTCCTCATCATTGATCAAGGCTCAAAATGAAGTTGATAGAGTCTTACAAGGAAGGCCTCCTTCCTACGAAGACACAAAGGAGCTTAAATTTTTGACACGTTGTATCCTCGAGTCAATGCGTCTCTACCCACATCCGCCTGTTTTGATAAGAAGAGCTCAAGTTGCTGACGTACTGCCCGGAAATTACAAGGTTAATGCTGGTCAAGACATCATGATTTCAGTATATAATATCCATCACTCTTCTCAGGTTTGGGAACGTGCACAAGAGTTTATACCAGAAAGATTTGACTTGGATGGTCCTGTGCCTAATGAAAGCAATACAGATTTCAGATTTATTCCCTTCAGCGGAGGGCCCCGAAAGTGCGTTGGTGATCAATTTGCACTGCTAGAAGCTATCGTTGCGCTTGCCATATTTCTGCAGCACATGAACTTCGAGCTGGTTCCAAATCAGACCATTGGGATGACTACTGGAGCGACTATACATACAACAAATGGTTTATACATGAAACTCAGCCAAAGGCAGCCGACCCCAGCTTTAGCTTCCTCTGCTTCAAGTGGCCGGAGAAGGAAGCCGGGGTGGCTGCTATGGCTTTCCATGGTGATGCCTCTGCTGCATATTTGCCTCACGTCAGGAACCTCTGAACAGGGCAAGTCAGATTATGGCCACCCCTCTCCCCCTCTCTTCTTCTTCTCCTTCTTTCTTGTTTCTTTTTTCTTTTTAAAAGTTTTAGATTGGGGTAGAATGAATAGCTATATCCTTTTCGCAAATTTGAACAACAATCTGGATAAAATTTTGGGACGAAGAAAGTATATGGCACTGCAGATAACTTCTGTAGCTTTCCTGATAGCTAACTGTCCTGATGACTGTGAGTGTGAACTACATGAAGCACTTGGACCAGCATTTCATGAACAGAAACAGACTAATGAGTTCTCGATTCTGAATGTTTCAGCAAAATGGGGCTGGCTAGTTGCACTTGGGCCTTTGATATTGGAAGTGAAGAAAGATGATGTTAGGGAAGCCATTGAAAGGGTAATGGGGGAAGGAGATGAAGCTGATAAAAGAAGGAAGAAAGCAAGAGAAATTGGGGTGATGGCAAAGAATGCCTTGGAAGAAGGTGGGTCTTCTGAGCTCAACATTATATTACTGATTGATGATATATTGAAAGAAGAGGAAATGAAGAATCTTTCATGTTAG

Protein sequence

MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMKLSQRQPTPALASSASSGRRRKPGWLLWLSMVMPLLHICLTSGTSEQGKSDYGHPSPPLFFFSFFLVSFFFLKVLDWGRMNSYILFANLNNNLDKILGRRKYMALQITSVAFLIANCPDDCECELHEALGPAFHEQKQTNEFSILNVSAKWGWLVALGPLILEVKKDDVREAIERVMGEGDEADKRRKKAREIGVMAKNALEEGGSSELNIILLIDDILKEEEMKNLSC
Homology
BLAST of Sgr026500 vs. NCBI nr
Match: XP_022143881.1 (carotene epsilon-monooxygenase, chloroplastic [Momordica charantia])

HSP 1 Score: 1047.0 bits (2706), Expect = 8.1e-302
Identity = 529/555 (95.32%), Postives = 543/555 (97.84%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS+S  SA LHKRGPLRRR PPP+LSIKSSIDEGNPPTPTKLK+STNNAK+GS
Sbjct: 1   MSSALCFPSLSFSSAHLHKRGPLRRRIPPPFLSIKSSIDEGNPPTPTKLKNSTNNAKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQK+ALNNNSVNMEEKFSQLTLD+IGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDIIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           LSADSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA
Sbjct: 241 LSADSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCKEIVETEGERIDEEEYVND DPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKEIVETEGERIDEEEYVNDTDPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFL RCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLMRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRA+VAD+LPGNYKVNAGQDIMISVYNIH SSQVWE+A+EFIPERFDL+GPVPNESNTDF
Sbjct: 421 RRARVADILPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEA+VALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAVVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQRQ TP LASSAS
Sbjct: 541 LSQRQLTPELASSAS 555

BLAST of Sgr026500 vs. NCBI nr
Match: XP_022972774.1 (carotene epsilon-monooxygenase, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1015.4 bits (2624), Expect = 2.6e-292
Identity = 515/555 (92.79%), Postives = 535/555 (96.40%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS++S S PLHKR PLRRR   P+LSIKSSIDE +PPTP KL +STN +K+GS
Sbjct: 1   MSSILCYPSLTSPSFPLHKRIPLRRRTQFPFLSIKSSIDERDPPTPAKLNNSTNTSKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQ++ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           L+ DSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVE+LIA
Sbjct: 241 LTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEDLIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCK IVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKAIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSS+L+KAQ EVDRVLQGRPPSY+DTKELKFLTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSALMKAQTEVDRVLQGRPPSYKDTKELKFLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVADVLPGNYKVNAGQDIMISVYNIH SSQVWERA+EFIPERFDLDGPVPNESNTDF
Sbjct: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLDGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVP+QTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPDQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQ+Q TP LA+ AS
Sbjct: 541 LSQKQMTPRLATPAS 555

BLAST of Sgr026500 vs. NCBI nr
Match: XP_022962840.1 (carotene epsilon-monooxygenase, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1010.4 bits (2611), Expect = 8.5e-291
Identity = 513/555 (92.43%), Postives = 533/555 (96.04%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS++S S PLHKR PLRRR   P+LSI+SSIDE +PPTP KL +STN +K+GS
Sbjct: 1   MSSILCYPSLTSPSFPLHKRIPLRRRTQFPFLSIRSSIDERDPPTPAKLNNSTNTSKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQK+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           L+ DSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVI+RTVE+LIA
Sbjct: 241 LTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIQRTVEDLIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCK IVE EGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSS+L KAQ EVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSALNKAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVADVLPGNYKVNAGQDIMISVYNIH SSQVWERA+EFIPERFDL+GPVPNESNTDF
Sbjct: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLEGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHINFELVPDQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQ+Q TP LAS AS
Sbjct: 541 LSQKQMTPGLASPAS 555

BLAST of Sgr026500 vs. NCBI nr
Match: KAG6595192.1 (Carotene epsilon-monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1006.9 bits (2602), Expect = 9.3e-290
Identity = 512/555 (92.25%), Postives = 533/555 (96.04%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS++S S PLHKR PLRRR   P+LSI+SSIDE + PTP KL +STN +K+GS
Sbjct: 1   MSSILCYPSLTSPSFPLHKRIPLRRRTQFPFLSIRSSIDERDLPTPAKLNNSTNTSKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQK+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           L+ DSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVI+RTVE+LIA
Sbjct: 241 LTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIQRTVEDLIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCK IVE EGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSS+L KAQ+EVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSALNKAQSEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVADVLPGNYKVNAGQDIMISVYNIH SSQVWERA+EFIPERFDL+GPVPNESNTDF
Sbjct: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLEGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHINFELVPDQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQ+Q TP LAS AS
Sbjct: 541 LSQKQMTPGLASPAS 555

BLAST of Sgr026500 vs. NCBI nr
Match: XP_038882587.1 (carotene epsilon-monooxygenase, chloroplastic [Benincasa hispida])

HSP 1 Score: 1003.4 bits (2593), Expect = 1.0e-288
Identity = 511/556 (91.91%), Postives = 533/556 (95.86%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           M+STL  PSI+  S+PLH R PLRRR P P+  IKSSIDEG    P+KLK+ST  AK+GS
Sbjct: 1   MASTLCFPSITFPSSPLHIRIPLRRRTPSPFPFIKSSIDEGQ--NPSKLKNSTKTAKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIA+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVD+VFCKCAMRLVEKL+K+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDQVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           LSADSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA
Sbjct: 241 LSADSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCKEIVETEGERIDEEEYVNDADPSILRFLLASR++VSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASREDVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSK SSSLIKA+NEVDRVLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKHSSSLIKARNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVAD+LPGNYKVNAGQDIMISVYNIH SSQVWE+A+EFIPERFDL GPVPNESNTDF
Sbjct: 421 RRAQVADILPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLGGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQ+IGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQSIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSASS 557
           LSQR+ TP L S ASS
Sbjct: 541 LSQRKLTPELVSPASS 554

BLAST of Sgr026500 vs. ExPASy Swiss-Prot
Match: Q6TBX7 (Carotene epsilon-monooxygenase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97C1 PE=1 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 4.3e-245
Identity = 426/546 (78.02%), Postives = 488/546 (89.38%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPP--YLSIKSSIDEGNPPTPTKLKSSTNNAKT 60
           M S+LF PS SS S+ L    P R   P P    SI+SSI++  P      K  TN++K+
Sbjct: 1   MESSLFSPSSSSYSS-LFTAKPTRLLSPKPKFTFSIRSSIEKPKP------KLETNSSKS 60

Query: 61  GSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYR 120
            SWVSPDWLT+LTR ++ G+ D+SGIPIA+AKLDDV+DLLGGALFLPL+KWMN+YGPIYR
Sbjct: 61  QSWVSPDWLTTLTRTLSSGKNDESGIPIANAKLDDVADLLGGALFLPLYKWMNEYGPIYR 120

Query: 121 LAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVV 180
           LAAGPRNFV+VSDPAIAKHVLRNY  YAKGL++EVSEFLFGSGFAIAEGPLWT RRRAVV
Sbjct: 121 LAAGPRNFVIVSDPAIAKHVLRNYPKYAKGLVAEVSEFLFGSGFAIAEGPLWTARRRAVV 180

Query: 181 PSLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSF 240
           PSLH++YLSVIV+RVFCKCA RLVEKLQ  A + ++VNME KFSQ+TLDVIGLS+FNY+F
Sbjct: 181 PSLHRRYLSVIVERVFCKCAERLVEKLQPYAEDGSAVNMEAKFSQMTLDVIGLSLFNYNF 240

Query: 241 DSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEEL 300
           DSL+ DSPVI+A+YTALKEAE RSTD+LPYWKI ALCKI+PRQ+KAE+AVT+IR TVE+L
Sbjct: 241 DSLTTDSPVIEAVYTALKEAELRSTDLLPYWKIDALCKIVPRQVKAEKAVTLIRETVEDL 300

Query: 301 IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTG 360
           IAKCKEIVE EGERI++EEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTG
Sbjct: 301 IAKCKEIVEREGERINDEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTG 360

Query: 361 SVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPV 420
           SVLTWTLYLLSK+SS+L KAQ EVDRVL+GR P++ED KELK++TRCI ESMRLYPHPPV
Sbjct: 361 SVLTWTLYLLSKNSSALRKAQEEVDRVLEGRNPAFEDIKELKYITRCINESMRLYPHPPV 420

Query: 421 LIRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNT 480
           LIRRAQV D+LPGNYKVN GQDIMISVYNIH SS+VWE+A+EF+PERFD+DG +PNE+NT
Sbjct: 421 LIRRAQVPDILPGNYKVNTGQDIMISVYNIHRSSEVWEKAEEFLPERFDIDGAIPNETNT 480

Query: 481 DFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLY 540
           DF+FIPFSGGPRKCVGDQFAL+EAIVALA+FLQ +N ELVP+QTI MTTGATIHTTNGLY
Sbjct: 481 DFKFIPFSGGPRKCVGDQFALMEAIVALAVFLQRLNVELVPDQTISMTTGATIHTTNGLY 539

Query: 541 MKLSQR 545
           MK+SQR
Sbjct: 541 MKVSQR 539

BLAST of Sgr026500 vs. ExPASy Swiss-Prot
Match: Q93VK5 (Protein LUTEIN DEFICIENT 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97A3 PE=1 SV=1)

HSP 1 Score: 446.4 bits (1147), Expect = 6.4e-124
Identity = 228/468 (48.72%), Postives = 322/468 (68.80%), Query Frame = 0

Query: 79  GDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHV 138
           G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+
Sbjct: 105 GSDQDYPKVPEAKGSIQAVRNEAFFIPLYELFLTYGGIFRLTFGPKSFLIVSDPSIAKHI 164

Query: 139 LR-NYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKC 198
           L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+VP+LH+KY++ ++  +F + 
Sbjct: 165 LKDNAKAYSKGILAEILDFVMGKGLIPADGEIWRRRRRAIVPALHQKYVAAMIS-LFGEA 224

Query: 199 AMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSPVIDAIYTALKE 258
           + RL +KL   AL    V ME  FS+LTLD+IG +VFNY FDSL+ D+ VI+A+YT L+E
Sbjct: 225 SDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVFNYDFDSLTNDTGVIEAVYTVLRE 284

Query: 259 AEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEE 318
           AE RS   +P W I     I PRQ K   ++ +I  T+++LIA CK +VE E E    EE
Sbjct: 285 AEDRSVSPIPVWDIPIWKDISPRQRKVATSLKLINDTLDDLIATCKRMVEEE-ELQFHEE 344

Query: 319 YVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIK 378
           Y+N+ DPSIL FLLAS  +VSS QLRDDL++ML+AGHET+ +VLTWT YLL+ + S + K
Sbjct: 345 YMNERDPSILHFLLASGDDVSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTTEPSVVAK 404

Query: 379 AQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNA 438
            Q EVD V+  R P+ +D K+LK+ TR + ES+RLYP PPVLIRR+   D+L G Y +  
Sbjct: 405 LQEEVDSVIGDRFPTIQDMKKLKYTTRVMNESLRLYPQPPVLIRRSIDNDIL-GEYPIKR 464

Query: 439 GQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQF 498
           G+DI ISV+N+H S   W+ A++F PER+ LDGP PNE+N +F ++PF GGPRKC+GD F
Sbjct: 465 GEDIFISVWNLHRSPLHWDDAEKFNPERWPLDGPNPNETNQNFSYLPFGGGPRKCIGDMF 524

Query: 499 ALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTNGLYMKLSQR 545
           A  E +VA+A+ ++  NF++ P    + MTTGATIHTT GL + +++R
Sbjct: 525 ASFENVVAIAMLIRRFNFQIAPGAPPVKMTTGATIHTTEGLKLTVTKR 569

BLAST of Sgr026500 vs. ExPASy Swiss-Prot
Match: O48921 (Cytochrome P450 97B2, chloroplastic OS=Glycine max OX=3847 GN=CYP97B2 PE=2 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 5.8e-109
Identity = 220/506 (43.48%), Postives = 319/506 (63.04%), Query Frame = 0

Query: 76  LGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIA 135
           L  G    +PIA      VSDLLG  LF  L+ W  ++G +Y+LA GP+ FVVVSDP +A
Sbjct: 71  LSGGSIGSMPIAEGA---VSDLLGRPLFFSLYDWFLEHGAVYKLAFGPKAFVVVSDPIVA 130

Query: 136 KHVLR-NYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVF 195
           +H+LR N  +Y KG+++++ E + G G   A+   W  RRR + P+ H  YL  +V ++F
Sbjct: 131 RHILRENAFSYDKGVLADILEPIMGKGLIPADLDTWKQRRRVIAPAFHNSYLEAMV-KIF 190

Query: 196 CKCAMRLVEKLQK-------NALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSPV 255
             C+ R + K  K       +  ++  +++E +FS L LD+IGL VFNY F S++ +SPV
Sbjct: 191 TTCSERTILKFNKLLEGEGYDGPDSIELDLEAEFSSLALDIIGLGVFNYDFGSVTKESPV 250

Query: 256 IDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEI-V 315
           I A+Y  L EAE RST  +PYWKI     I+PRQ K ++ + VI   ++ LI   KE   
Sbjct: 251 IKAVYGTLFEAEHRSTFYIPYWKIPLARWIVPRQRKFQDDLKVINTCLDGLIRNAKESRQ 310

Query: 316 ETEGERIDEEEYVNDADPSILRFLLASR-QEVSSVQLRDDLLSMLVAGHETTGSVLTWTL 375
           ET+ E++ + +Y+N  D S+LRFL+  R  +V   QLRDDL++ML+AGHETT +VLTW +
Sbjct: 311 ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAV 370

Query: 376 YLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQV 435
           +LL+++ S + KAQ EVD VL    P++E  KEL+++   ++E++RLYP PP+LIRR+  
Sbjct: 371 FLLAQNPSKMKKAQAEVDLVLGTGRPTFESLKELQYIRLIVVEALRLYPQPPLLIRRSLK 430

Query: 436 ADVLPGNYK-------VNAGQDIMISVYNIHHSSQVWERAQEFIPERF-------DLDG- 495
           +DVLPG +K       + AG D+ ISVYN+H S   W+R  +F PERF       +++G 
Sbjct: 431 SDVLPGGHKGEKDGYAIPAGTDVFISVYNLHRSPYFWDRPDDFEPERFLVQNKNEEIEGW 490

Query: 496 -----------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVP 545
                        PNE  +DF F+PF GGPRKCVGDQFAL+E+ VAL + LQ+ + EL  
Sbjct: 491 AGLDPSRSPGALYPNEVISDFAFLPFGGGPRKCVGDQFALMESTVALTMLLQNFDVELKG 550

BLAST of Sgr026500 vs. ExPASy Swiss-Prot
Match: O23365 (Cytochrome P450 97B3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97B3 PE=1 SV=2)

HSP 1 Score: 384.4 bits (986), Expect = 3.0e-105
Identity = 222/534 (41.57%), Postives = 318/534 (59.55%), Query Frame = 0

Query: 49  LKSSTNNAKTGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFK 108
           +K  +   KT   +  +    LT +  L  G    +P A      VSDL G  LFL L+ 
Sbjct: 51  IKCQSTEPKTNGNILDNASNLLTNF--LSGGSLGSMPTAEG---SVSDLFGKPLFLSLYD 110

Query: 109 WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLISEVSEFLFGSGFAIAEG 168
           W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ E + G G   A+ 
Sbjct: 111 WFLEHGGIYKLAFGPKAFVVISDPIIARHVLRENAFSYDKGVLAEILEPIMGKGLIPADL 170

Query: 169 PLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQK--------NALNNNSVNMEE 228
             W +RRRA+ P+ HK YL  +V +VF  C+ +++ K +K        +  +   +++E 
Sbjct: 171 DTWKLRRRAITPAFHKLYLEAMV-KVFSDCSEKMILKSEKLIREKETSSGEDTIELDLEA 230

Query: 229 KFSQLTLDVIGLSVFNYSFDSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIP 288
           +FS L LD+IGLSVFNY F S++ +SPVI A+Y  L EAE RST   PYW       I+P
Sbjct: 231 EFSSLALDIIGLSVFNYDFGSVTKESPVIKAVYGTLFEAEHRSTFYFPYWNFPPARWIVP 290

Query: 289 RQIKAEEAVTVIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-QEV 348
           RQ K +  + +I   ++ LI   KE   ET+ E++ E +Y N  D S+LRFL+  R  ++
Sbjct: 291 RQRKFQSDLKIINDCLDGLIQNAKETRQETDVEKLQERDYTNLKDASLLRFLVDMRGVDI 350

Query: 349 SSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTK 408
              QLRDDL++ML+AGHETT +VLTW ++LLS++   + KAQ E+D VL   PP+YE  K
Sbjct: 351 DDRQLRDDLMTMLIAGHETTAAVLTWAVFLLSQNPEKIRKAQAEIDAVLGQGPPTYESMK 410

Query: 409 ELKFLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHH 468
           +L+++   ++E +RL+P PP+LIRR    + LPG        +KV  G DI ISVYN+H 
Sbjct: 411 KLEYIRLIVVEVLRLFPQPPLLIRRTLKPETLPGGHKGEKEGHKVPKGTDIFISVYNLHR 470

Query: 469 SSQVWERAQEFIPERF-------DLDG------------PVPNESNTDFRFIPFSGGPRK 528
           S   W+   +F PERF        ++G              PNE   DF F+PF GGPRK
Sbjct: 471 SPYFWDNPHDFEPERFLRTKESNGIEGWAGFDPSRSPGALYPNEIIADFAFLPFGGGPRK 530

Query: 529 CVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTNGLYMKLSQR 545
           C+GDQFAL+E+ VALA+  Q  + EL    +++ + +GATIH  NG++ KL +R
Sbjct: 531 CIGDQFALMESTVALAMLFQKFDVELRGTPESVELVSGATIHAKNGMWCKLKRR 578

BLAST of Sgr026500 vs. ExPASy Swiss-Prot
Match: Q43078 (Cytochrome P450 97B1, chloroplastic OS=Pisum sativum OX=3888 GN=CYP97B1 PE=2 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 2.8e-95
Identity = 202/476 (42.44%), Postives = 290/476 (60.92%), Query Frame = 0

Query: 67  LTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNF 126
           LTSL     LG      +PIA      V+DL    LF  L+ W  ++G +Y+LA GP+ F
Sbjct: 76  LTSLLSGANLG-----SMPIAEGA---VTDLFDRPLFFSLYDWFLEHGSVYKLAFGPKAF 135

Query: 127 VVVSDPAIAKHVLR-NYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKY 186
           VVVSDP +A+H+LR N  +Y KG+++++ E + G G   A+   W  RRR + P  H  Y
Sbjct: 136 VVVSDPIVARHILRENAFSYDKGVLADILEPIMGKGLIPADLETWKQRRRVIAPGFHTSY 195

Query: 187 LSVIVDRVFCKCAMRLVEKLQ-------KNALNNNSVNMEEKFSQLTLDVIGLSVFNYSF 246
           L  +V ++F  C+ R V K+        ++   +  +++E +FS L L++IGL VFNY F
Sbjct: 196 LEAMV-QLFTSCSERTVLKVNELLEGEGRDGQKSVELDLEAEFSNLALEIIGLGVFNYDF 255

Query: 247 DSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEEL 306
            S++ +SPVI A+Y  L EAE RST  +PYWK      I+PRQ K ++ + VI   ++ L
Sbjct: 256 GSVTNESPVIKAVYGTLFEAEHRSTFYIPYWKFPLARWIVPRQRKFQDDLKVINTCLDGL 315

Query: 307 IAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-QEVSSVQLRDDLLSMLVAGHET 366
           I   KE   ET+ E++ + +Y N  D S+LRFL+  R  +V   QLRDDL++ML+AGHET
Sbjct: 316 IRNAKESRQETDVEKLQQRDYSNLKDASLLRFLVDMRGVDVDDRQLRDDLMTMLIAGHET 375

Query: 367 TGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHP 426
           T +VLTW ++LL+++   + KAQ EVD VL    P++E  K+L+++   ++E++RLYP P
Sbjct: 376 TAAVLTWAVFLLAQNPDKMKKAQAEVDLVLGMGKPTFELLKKLEYIRLIVVETLRLYPQP 435

Query: 427 PVLIRRAQVADVLPGNYK-------VNAGQDIMISVYNIHHSSQVWERAQEFIPERF--- 486
           P+LIRR+   DVLPG +K       + AG D+ ISVYN+H S   W+R  +F PERF   
Sbjct: 436 PLLIRRSLKPDVLPGGHKGDKDGYTIPAGTDVFISVYNLHRSPYFWDRPNDFEPERFLVQ 495

Query: 487 ----DLDG------------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVAL 507
               +++G              PNE  +DF F+PF GGPRKCVGDQFAL+E+ VAL
Sbjct: 496 NNNEEVEGWAGFDPSRSPGALYPNEIISDFAFLPFGGGPRKCVGDQFALMESTVAL 542

BLAST of Sgr026500 vs. ExPASy TrEMBL
Match: A0A6J1CRM7 (carotene epsilon-monooxygenase, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111013687 PE=3 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 3.9e-302
Identity = 529/555 (95.32%), Postives = 543/555 (97.84%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS+S  SA LHKRGPLRRR PPP+LSIKSSIDEGNPPTPTKLK+STNNAK+GS
Sbjct: 1   MSSALCFPSLSFSSAHLHKRGPLRRRIPPPFLSIKSSIDEGNPPTPTKLKNSTNNAKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQK+ALNNNSVNMEEKFSQLTLD+IGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDIIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           LSADSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA
Sbjct: 241 LSADSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCKEIVETEGERIDEEEYVND DPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKEIVETEGERIDEEEYVNDTDPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFL RCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLMRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRA+VAD+LPGNYKVNAGQDIMISVYNIH SSQVWE+A+EFIPERFDL+GPVPNESNTDF
Sbjct: 421 RRARVADILPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEA+VALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAVVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQRQ TP LASSAS
Sbjct: 541 LSQRQLTPELASSAS 555

BLAST of Sgr026500 vs. ExPASy TrEMBL
Match: A0A6J1ICJ1 (carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111471281 PE=3 SV=1)

HSP 1 Score: 1015.4 bits (2624), Expect = 1.3e-292
Identity = 515/555 (92.79%), Postives = 535/555 (96.40%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS++S S PLHKR PLRRR   P+LSIKSSIDE +PPTP KL +STN +K+GS
Sbjct: 1   MSSILCYPSLTSPSFPLHKRIPLRRRTQFPFLSIKSSIDERDPPTPAKLNNSTNTSKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQ++ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQEDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           L+ DSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVE+LIA
Sbjct: 241 LTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEDLIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCK IVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKAIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSS+L+KAQ EVDRVLQGRPPSY+DTKELKFLTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSALMKAQTEVDRVLQGRPPSYKDTKELKFLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVADVLPGNYKVNAGQDIMISVYNIH SSQVWERA+EFIPERFDLDGPVPNESNTDF
Sbjct: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLDGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVP+QTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPDQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQ+Q TP LA+ AS
Sbjct: 541 LSQKQMTPRLATPAS 555

BLAST of Sgr026500 vs. ExPASy TrEMBL
Match: A0A6J1HG80 (carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111463211 PE=3 SV=1)

HSP 1 Score: 1010.4 bits (2611), Expect = 4.1e-291
Identity = 513/555 (92.43%), Postives = 533/555 (96.04%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDEGNPPTPTKLKSSTNNAKTGS 60
           MSS L  PS++S S PLHKR PLRRR   P+LSI+SSIDE +PPTP KL +STN +K+GS
Sbjct: 1   MSSILCYPSLTSPSFPLHKRIPLRRRTQFPFLSIRSSIDERDPPTPAKLNNSTNTSKSGS 60

Query: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120
           WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA
Sbjct: 61  WVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLA 120

Query: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180
           AGPRNFVVVSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVPS
Sbjct: 121 AGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVPS 180

Query: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240
           LHKKYLSVIVDRVFCKCAMRLVEKLQK+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS
Sbjct: 181 LHKKYLSVIVDRVFCKCAMRLVEKLQKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS 240

Query: 241 LSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIA 300
           L+ DSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVI+RTVE+LIA
Sbjct: 241 LTTDSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIQRTVEDLIA 300

Query: 301 KCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360
           KCK IVE EGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV
Sbjct: 301 KCKAIVEMEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSV 360

Query: 361 LTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420
           LTWTLYLLSKDSS+L KAQ EVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI
Sbjct: 361 LTWTLYLLSKDSSALNKAQTEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLI 420

Query: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDF 480
           RRAQVADVLPGNYKVNAGQDIMISVYNIH SSQVWERA+EFIPERFDL+GPVPNESNTDF
Sbjct: 421 RRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWERAEEFIPERFDLEGPVPNESNTDF 480

Query: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYMK 540
           RFIPFSGGPRKCVGDQFALLEAIVALAIFLQH+NFELVP+QTIGMTTGATIHTTNGLYMK
Sbjct: 481 RFIPFSGGPRKCVGDQFALLEAIVALAIFLQHINFELVPDQTIGMTTGATIHTTNGLYMK 540

Query: 541 LSQRQPTPALASSAS 556
           LSQ+Q TP LAS AS
Sbjct: 541 LSQKQMTPGLASPAS 555

BLAST of Sgr026500 vs. ExPASy TrEMBL
Match: A0A1S3CIN0 (carotene epsilon-monooxygenase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500849 PE=3 SV=1)

HSP 1 Score: 987.6 bits (2552), Expect = 2.8e-284
Identity = 504/557 (90.48%), Postives = 527/557 (94.61%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDE-GNPPTPTKLKSSTNNAKTG 60
           M+STL   S++  S+ LHKR PL    P PY SIKSS+DE GNP TP KLK+ TN  K+ 
Sbjct: 1   MASTLCFSSLTFPSSSLHKRIPLTPTTPFPYPSIKSSLDERGNPSTPPKLKNPTNAPKSR 60

Query: 61  SWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRL 120
           SWVSPDWLTSLTR ITLGQGDDSGIPIA+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRL
Sbjct: 61  SWVSPDWLTSLTRSITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRL 120

Query: 121 AAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVP 180
           AAGPRNFV+VSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVP
Sbjct: 121 AAGPRNFVIVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVP 180

Query: 181 SLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD 240
           SLHKKYLSVIVDRVFCKCAMRLVEKL+K+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD
Sbjct: 181 SLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD 240

Query: 241 SLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI 300
           SLSADSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI
Sbjct: 241 SLSADSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI 300

Query: 301 AKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGS 360
           AKCKEIVE EGERI+EEEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGS
Sbjct: 301 AKCKEIVEAEGERINEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGS 360

Query: 361 VLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVL 420
           VLTWTLYLLSK SSSL+KAQNEVDRVLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVL
Sbjct: 361 VLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVL 420

Query: 421 IRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTD 480
           IRRAQVAD LPGNYKVNAGQDIMISVYNIH S QVWE+A+EFIPERFDL+GPVPNESNTD
Sbjct: 421 IRRAQVADTLPGNYKVNAGQDIMISVYNIHRSPQVWEQAEEFIPERFDLEGPVPNESNTD 480

Query: 481 FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYM 540
           FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTNGLYM
Sbjct: 481 FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPNQTIGMTTGATIHTTNGLYM 540

Query: 541 KLSQRQPTPALASSASS 557
           KLSQR+ TP L S A+S
Sbjct: 541 KLSQRKLTPELVSPATS 557

BLAST of Sgr026500 vs. ExPASy TrEMBL
Match: A0A5D3C7X9 (Carotene epsilon-monooxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G00730 PE=3 SV=1)

HSP 1 Score: 987.3 bits (2551), Expect = 3.7e-284
Identity = 504/559 (90.16%), Postives = 528/559 (94.45%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPPYLSIKSSIDE-GNPPTPTKLKSSTNNAKTG 60
           M+STL   S++  S+ LHKR PL    P PY SIKSS+DE GNP TP KLK+ TN  K+ 
Sbjct: 1   MASTLCFSSLTFPSSSLHKRIPLTPTTPFPYPSIKSSLDERGNPSTPPKLKNPTNAPKSR 60

Query: 61  SWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRL 120
           SWVSPDWLTSLTR ITLGQGDDSGIPIA+AKLDDVSDLLGGALFLPLFKWMNDYGPIYRL
Sbjct: 61  SWVSPDWLTSLTRSITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRL 120

Query: 121 AAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVP 180
           AAGPRNFV+VSDPAIAKHVLRNYGTYAKGL+SEVSEFLFGSGFAIAEGPLWTVRRRAVVP
Sbjct: 121 AAGPRNFVIVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWTVRRRAVVP 180

Query: 181 SLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD 240
           SLHKKYLSVIVDRVFCKCAMRLVEKL+K+ALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD
Sbjct: 181 SLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFD 240

Query: 241 SLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI 300
           SLSADSPVIDA+YTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI
Sbjct: 241 SLSADSPVIDAVYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELI 300

Query: 301 AKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGS 360
           AKCKEIVE EGERI+EEEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGS
Sbjct: 301 AKCKEIVEAEGERINEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGS 360

Query: 361 VLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVL 420
           VLTWTLYLLSK SSSL+KAQNEVD+VLQGRPPSYEDTKELK+LTRCILESMRLYPHPPVL
Sbjct: 361 VLTWTLYLLSKHSSSLVKAQNEVDKVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVL 420

Query: 421 IRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTD 480
           IRRAQVAD LPGNYKVNAGQDIMISVYNIH S QVWE+A+EFIPERFDL+GPVPNESNTD
Sbjct: 421 IRRAQVADTLPGNYKVNAGQDIMISVYNIHRSLQVWEQAEEFIPERFDLEGPVPNESNTD 480

Query: 481 FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLYM 540
           FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTNGLYM
Sbjct: 481 FRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHLNFELVPNQTIGMTTGATIHTTNGLYM 540

Query: 541 KLSQRQPTPALASSASSGR 559
           KLSQR+ TP L S A+S R
Sbjct: 541 KLSQRKLTPELVSPATSRR 559

BLAST of Sgr026500 vs. TAIR 10
Match: AT3G53130.1 (Cytochrome P450 superfamily protein )

HSP 1 Score: 849.0 bits (2192), Expect = 3.0e-246
Identity = 426/546 (78.02%), Postives = 488/546 (89.38%), Query Frame = 0

Query: 1   MSSTLFLPSISSLSAPLHKRGPLRRRPPPP--YLSIKSSIDEGNPPTPTKLKSSTNNAKT 60
           M S+LF PS SS S+ L    P R   P P    SI+SSI++  P      K  TN++K+
Sbjct: 1   MESSLFSPSSSSYSS-LFTAKPTRLLSPKPKFTFSIRSSIEKPKP------KLETNSSKS 60

Query: 61  GSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYR 120
            SWVSPDWLT+LTR ++ G+ D+SGIPIA+AKLDDV+DLLGGALFLPL+KWMN+YGPIYR
Sbjct: 61  QSWVSPDWLTTLTRTLSSGKNDESGIPIANAKLDDVADLLGGALFLPLYKWMNEYGPIYR 120

Query: 121 LAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVV 180
           LAAGPRNFV+VSDPAIAKHVLRNY  YAKGL++EVSEFLFGSGFAIAEGPLWT RRRAVV
Sbjct: 121 LAAGPRNFVIVSDPAIAKHVLRNYPKYAKGLVAEVSEFLFGSGFAIAEGPLWTARRRAVV 180

Query: 181 PSLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSF 240
           PSLH++YLSVIV+RVFCKCA RLVEKLQ  A + ++VNME KFSQ+TLDVIGLS+FNY+F
Sbjct: 181 PSLHRRYLSVIVERVFCKCAERLVEKLQPYAEDGSAVNMEAKFSQMTLDVIGLSLFNYNF 240

Query: 241 DSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEEL 300
           DSL+ DSPVI+A+YTALKEAE RSTD+LPYWKI ALCKI+PRQ+KAE+AVT+IR TVE+L
Sbjct: 241 DSLTTDSPVIEAVYTALKEAELRSTDLLPYWKIDALCKIVPRQVKAEKAVTLIRETVEDL 300

Query: 301 IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTG 360
           IAKCKEIVE EGERI++EEYVNDADPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTG
Sbjct: 301 IAKCKEIVEREGERINDEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTG 360

Query: 361 SVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPV 420
           SVLTWTLYLLSK+SS+L KAQ EVDRVL+GR P++ED KELK++TRCI ESMRLYPHPPV
Sbjct: 361 SVLTWTLYLLSKNSSALRKAQEEVDRVLEGRNPAFEDIKELKYITRCINESMRLYPHPPV 420

Query: 421 LIRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNT 480
           LIRRAQV D+LPGNYKVN GQDIMISVYNIH SS+VWE+A+EF+PERFD+DG +PNE+NT
Sbjct: 421 LIRRAQVPDILPGNYKVNTGQDIMISVYNIHRSSEVWEKAEEFLPERFDIDGAIPNETNT 480

Query: 481 DFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNGLY 540
           DF+FIPFSGGPRKCVGDQFAL+EAIVALA+FLQ +N ELVP+QTI MTTGATIHTTNGLY
Sbjct: 481 DFKFIPFSGGPRKCVGDQFALMEAIVALAVFLQRLNVELVPDQTISMTTGATIHTTNGLY 539

Query: 541 MKLSQR 545
           MK+SQR
Sbjct: 541 MKVSQR 539

BLAST of Sgr026500 vs. TAIR 10
Match: AT1G31800.1 (cytochrome P450, family 97, subfamily A, polypeptide 3 )

HSP 1 Score: 446.4 bits (1147), Expect = 4.5e-125
Identity = 228/468 (48.72%), Postives = 322/468 (68.80%), Query Frame = 0

Query: 79  GDDSGIPIASAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHV 138
           G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+
Sbjct: 105 GSDQDYPKVPEAKGSIQAVRNEAFFIPLYELFLTYGGIFRLTFGPKSFLIVSDPSIAKHI 164

Query: 139 LR-NYGTYAKGLISEVSEFLFGSGFAIAEGPLWTVRRRAVVPSLHKKYLSVIVDRVFCKC 198
           L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+VP+LH+KY++ ++  +F + 
Sbjct: 165 LKDNAKAYSKGILAEILDFVMGKGLIPADGEIWRRRRRAIVPALHQKYVAAMIS-LFGEA 224

Query: 199 AMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSPVIDAIYTALKE 258
           + RL +KL   AL    V ME  FS+LTLD+IG +VFNY FDSL+ D+ VI+A+YT L+E
Sbjct: 225 SDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVFNYDFDSLTNDTGVIEAVYTVLRE 284

Query: 259 AEARSTDILPYWKIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEE 318
           AE RS   +P W I     I PRQ K   ++ +I  T+++LIA CK +VE E E    EE
Sbjct: 285 AEDRSVSPIPVWDIPIWKDISPRQRKVATSLKLINDTLDDLIATCKRMVEEE-ELQFHEE 344

Query: 319 YVNDADPSILRFLLASRQEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIK 378
           Y+N+ DPSIL FLLAS  +VSS QLRDDL++ML+AGHET+ +VLTWT YLL+ + S + K
Sbjct: 345 YMNERDPSILHFLLASGDDVSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTTEPSVVAK 404

Query: 379 AQNEVDRVLQGRPPSYEDTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNA 438
            Q EVD V+  R P+ +D K+LK+ TR + ES+RLYP PPVLIRR+   D+L G Y +  
Sbjct: 405 LQEEVDSVIGDRFPTIQDMKKLKYTTRVMNESLRLYPQPPVLIRRSIDNDIL-GEYPIKR 464

Query: 439 GQDIMISVYNIHHSSQVWERAQEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQF 498
           G+DI ISV+N+H S   W+ A++F PER+ LDGP PNE+N +F ++PF GGPRKC+GD F
Sbjct: 465 GEDIFISVWNLHRSPLHWDDAEKFNPERWPLDGPNPNETNQNFSYLPFGGGPRKCIGDMF 524

Query: 499 ALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTNGLYMKLSQR 545
           A  E +VA+A+ ++  NF++ P    + MTTGATIHTT GL + +++R
Sbjct: 525 ASFENVVAIAMLIRRFNFQIAPGAPPVKMTTGATIHTTEGLKLTVTKR 569

BLAST of Sgr026500 vs. TAIR 10
Match: AT4G15110.1 (cytochrome P450, family 97, subfamily B, polypeptide 3 )

HSP 1 Score: 384.4 bits (986), Expect = 2.1e-106
Identity = 222/534 (41.57%), Postives = 318/534 (59.55%), Query Frame = 0

Query: 49  LKSSTNNAKTGSWVSPDWLTSLTRYITLGQGDDSGIPIASAKLDDVSDLLGGALFLPLFK 108
           +K  +   KT   +  +    LT +  L  G    +P A      VSDL G  LFL L+ 
Sbjct: 51  IKCQSTEPKTNGNILDNASNLLTNF--LSGGSLGSMPTAEG---SVSDLFGKPLFLSLYD 110

Query: 109 WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLISEVSEFLFGSGFAIAEG 168
           W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ E + G G   A+ 
Sbjct: 111 WFLEHGGIYKLAFGPKAFVVISDPIIARHVLRENAFSYDKGVLAEILEPIMGKGLIPADL 170

Query: 169 PLWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQK--------NALNNNSVNMEE 228
             W +RRRA+ P+ HK YL  +V +VF  C+ +++ K +K        +  +   +++E 
Sbjct: 171 DTWKLRRRAITPAFHKLYLEAMV-KVFSDCSEKMILKSEKLIREKETSSGEDTIELDLEA 230

Query: 229 KFSQLTLDVIGLSVFNYSFDSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIP 288
           +FS L LD+IGLSVFNY F S++ +SPVI A+Y  L EAE RST   PYW       I+P
Sbjct: 231 EFSSLALDIIGLSVFNYDFGSVTKESPVIKAVYGTLFEAEHRSTFYFPYWNFPPARWIVP 290

Query: 289 RQIKAEEAVTVIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-QEV 348
           RQ K +  + +I   ++ LI   KE   ET+ E++ E +Y N  D S+LRFL+  R  ++
Sbjct: 291 RQRKFQSDLKIINDCLDGLIQNAKETRQETDVEKLQERDYTNLKDASLLRFLVDMRGVDI 350

Query: 349 SSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPPSYEDTK 408
              QLRDDL++ML+AGHETT +VLTW ++LLS++   + KAQ E+D VL   PP+YE  K
Sbjct: 351 DDRQLRDDLMTMLIAGHETTAAVLTWAVFLLSQNPEKIRKAQAEIDAVLGQGPPTYESMK 410

Query: 409 ELKFLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHH 468
           +L+++   ++E +RL+P PP+LIRR    + LPG        +KV  G DI ISVYN+H 
Sbjct: 411 KLEYIRLIVVEVLRLFPQPPLLIRRTLKPETLPGGHKGEKEGHKVPKGTDIFISVYNLHR 470

Query: 469 SSQVWERAQEFIPERF-------DLDG------------PVPNESNTDFRFIPFSGGPRK 528
           S   W+   +F PERF        ++G              PNE   DF F+PF GGPRK
Sbjct: 471 SPYFWDNPHDFEPERFLRTKESNGIEGWAGFDPSRSPGALYPNEIIADFAFLPFGGGPRK 530

Query: 529 CVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTNGLYMKLSQR 545
           C+GDQFAL+E+ VALA+  Q  + EL    +++ + +GATIH  NG++ KL +R
Sbjct: 531 CIGDQFALMESTVALAMLFQKFDVELRGTPESVELVSGATIHAKNGMWCKLKRR 578

BLAST of Sgr026500 vs. TAIR 10
Match: AT2G26710.1 (Cytochrome P450 superfamily protein )

HSP 1 Score: 151.8 bits (382), Expect = 2.3e-36
Identity = 125/435 (28.74%), Postives = 202/435 (46.44%), Query Frame = 0

Query: 109 WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLISEVSEFLFGSGFAIAEGP 168
           W   YG  + +  GP   + V+DP + + +      Y K     + + L G G    +G 
Sbjct: 91  WRKIYGATFLVWFGPTFRLTVADPDLIREIFSKSEFYEKNEAHPLVKQLEGDGLLSLKGE 150

Query: 169 LWTVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNME--EKFSQLTL 228
            W   R+ + P+ H + L ++V  V  K    +V+K       N  V ++  E F  LT 
Sbjct: 151 KWAHHRKIISPTFHMENLKLLVP-VVLKSVTDMVDKWSDKLSENGEVEVDVYEWFQILTE 210

Query: 229 DVIGLSVFNYSFDSLSADSPVIDAIYTALKEAEARSTDILPYWKIKALCKIIPRQ--IKA 288
           DVI  + F  S++   A   +       L  AEA     +P +      +  P +  +K+
Sbjct: 211 DVISRTAFGSSYEDGRAVFRL--QAQQMLLCAEAFQKVFIPGY------RFFPTRGNLKS 270

Query: 289 EEAVTVIRRTVEELIAKCKE-IVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQLR 348
            +    IR+++ +LI + ++  ++ EGE   E    +      L  L+   + V+   + 
Sbjct: 271 WKLDKEIRKSLLKLIERRRQNAIDGEGEECKEPAAKD------LLGLMIQAKNVTVQDIV 330

Query: 349 DDLLSMLVAGHETTGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRP-PSYEDTKELKFL 408
           ++  S   AG +TT ++LTWT  LLS       KA++EV RV   R  P+ +   +LK L
Sbjct: 331 EECKSFFFAGKQTTSNLLTWTTILLSMHPEWQAKARDEVLRVCGSRDVPTKDHVVKLKTL 390

Query: 409 TRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQVW-ERAQEF 468
           +  + ES+RLYP     IRRA+ +DV  G YK+  G +++I +  +HH   +W     EF
Sbjct: 391 SMILNESLRLYPPIVATIRRAK-SDVKLGGYKIPCGTELLIPIIAVHHDQAIWGNDVNEF 450

Query: 469 IPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQ 528
            P RF  DG VP  +     FIPF  G R C+G   A+L+A + LA+ +Q   F L P  
Sbjct: 451 NPARF-ADG-VPRAAKHPVGFIPFGLGVRTCIGQNLAILQAKLTLAVMIQRFTFHLAPTY 507

Query: 529 TIGMTTGATIHTTNG 537
               T    ++  +G
Sbjct: 511 QHAPTVLMLLYPQHG 507

BLAST of Sgr026500 vs. TAIR 10
Match: AT2G44890.1 (cytochrome P450, family 704, subfamily A, polypeptide 1 )

HSP 1 Score: 149.8 bits (377), Expect = 8.8e-36
Identity = 121/455 (26.59%), Postives = 210/455 (46.15%), Query Frame = 0

Query: 115 PIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLISEVS-EFLFGSGFAIAEGPLWTV 174
           P +R  +  ++ +  +DP   +H+L+  +  Y+KG +  V+   L G G    +G  W  
Sbjct: 63  PTFRFLSPGQSEIFTADPRNVEHILKTRFHNYSKGPVGTVNLADLLGHGIFAVDGEKWKQ 122

Query: 175 RRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLQKNALNNNSVNMEEKFSQLTLDVIGLS 234
           +R+ V      + L      VF   A +LV  + + AL+  S + ++   + TLD I   
Sbjct: 123 QRKLVSFEFSTRVLRNFSYSVFRTSASKLVGFIAEFALSGKSFDFQDMLMKCTLDSIFKV 182

Query: 235 VFNYSFDSLSADSPVIDAIYTALKE----AEARSTDILPYWKIKALCKIIPRQIKAEEAV 294
            F      L   S   +    A  E      +R TD  P+WK+K     I  + + ++++
Sbjct: 183 GFGVELGCLDGFSKEGEEFMKAFDEGNGATSSRVTD--PFWKLKCFLN-IGSESRLKKSI 242

Query: 295 TVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASRQEVSSVQ---LRDD 354
            +I + V  LI        T+ + + +E+  +  +  + +FLL S ++  ++    LRD 
Sbjct: 243 AIIDKFVYSLIT-------TKRKELSKEQNTSVREDILSKFLLESEKDPENMNDKYLRDI 302

Query: 355 LLSMLVAGHETTGSVLTWTLYLLSKDSSSLIKAQNEVDRVLQGRPP-----------SYE 414
           +L+++VAG +TT + L+W LY+L K+     K   E+  V                 + E
Sbjct: 303 ILNVMVAGKDTTAASLSWFLYMLCKNPLVQEKIVQEIRDVTSSHEKTTDVNGFIESVTEE 362

Query: 415 DTKELKFLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHHSSQV 474
              ++++L   + E+MRLYP  P  +R A+  DVLP  ++V+ G +I    Y +   + +
Sbjct: 363 ALAQMQYLHAALSETMRLYPPVPEHMRCAENDDVLPDGHRVSKGDNIYYISYAMGRMTYI 422

Query: 475 W-ERAQEFIPERFDLDGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHM 534
           W + A+EF PER+  DG    ES   F+FI F  GPR C+G  FA  +  +     L   
Sbjct: 423 WGQDAEEFKPERWLKDGVFQPES--QFKFISFHAGPRICIGKDFAYRQMKIVSMALLHFF 482

Query: 535 NFELV-PNQTIGMTTGATIHTTNGLYMKLSQRQPT 548
            F++   N  +      T+H   GL++    R  T
Sbjct: 483 RFKMADENSKVSYKKMLTLHVDGGLHLCAIPRTST 505

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022143881.18.1e-30295.32carotene epsilon-monooxygenase, chloroplastic [Momordica charantia][more]
XP_022972774.12.6e-29292.79carotene epsilon-monooxygenase, chloroplastic [Cucurbita maxima][more]
XP_022962840.18.5e-29192.43carotene epsilon-monooxygenase, chloroplastic [Cucurbita moschata][more]
KAG6595192.19.3e-29092.25Carotene epsilon-monooxygenase, chloroplastic, partial [Cucurbita argyrosperma s... [more]
XP_038882587.11.0e-28891.91carotene epsilon-monooxygenase, chloroplastic [Benincasa hispida][more]
Match NameE-valueIdentityDescription
Q6TBX74.3e-24578.02Carotene epsilon-monooxygenase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN... [more]
Q93VK56.4e-12448.72Protein LUTEIN DEFICIENT 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP... [more]
O489215.8e-10943.48Cytochrome P450 97B2, chloroplastic OS=Glycine max OX=3847 GN=CYP97B2 PE=2 SV=1[more]
O233653.0e-10541.57Cytochrome P450 97B3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CYP97B3 P... [more]
Q430782.8e-9542.44Cytochrome P450 97B1, chloroplastic OS=Pisum sativum OX=3888 GN=CYP97B1 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A6J1CRM73.9e-30295.32carotene epsilon-monooxygenase, chloroplastic OS=Momordica charantia OX=3673 GN=... [more]
A0A6J1ICJ11.3e-29292.79carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC... [more]
A0A6J1HG804.1e-29192.43carotene epsilon-monooxygenase, chloroplastic OS=Cucurbita moschata OX=3662 GN=L... [more]
A0A1S3CIN02.8e-28490.48carotene epsilon-monooxygenase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC1035... [more]
A0A5D3C7X93.7e-28490.16Carotene epsilon-monooxygenase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... [more]
Match NameE-valueIdentityDescription
AT3G53130.13.0e-24678.02Cytochrome P450 superfamily protein [more]
AT1G31800.14.5e-12548.72cytochrome P450, family 97, subfamily A, polypeptide 3 [more]
AT4G15110.12.1e-10641.57cytochrome P450, family 97, subfamily B, polypeptide 3 [more]
AT2G26710.12.3e-3628.74Cytochrome P450 superfamily protein [more]
AT2G44890.18.8e-3626.59cytochrome P450, family 704, subfamily A, polypeptide 1 [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002401Cytochrome P450, E-class, group IPRINTSPR00463EP450Icoord: 341..358
score: 29.84
coord: 492..515
score: 29.89
coord: 444..468
score: 27.01
coord: 110..129
score: 32.62
coord: 361..387
score: 28.3
coord: 482..492
score: 53.34
IPR001128Cytochrome P450PRINTSPR00385P450coord: 352..369
score: 41.91
coord: 404..415
score: 35.27
coord: 483..492
score: 57.58
coord: 492..503
score: 38.58
IPR001128Cytochrome P450PFAMPF00067p450coord: 106..527
e-value: 6.2E-78
score: 262.6
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 740..753
e-value: 3.1E-8
score: 35.5
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 670..739
e-value: 3.1E-8
score: 35.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 17..57
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 38..57
NoneNo IPR availablePANTHERPTHR24291CYTOCHROME P450 FAMILY 4coord: 122..544
NoneNo IPR availablePANTHERPTHR24291:SF134CAROTENE EPSILON-MONOOXYGENASE, CHLOROPLASTICcoord: 122..544
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 699..761
IPR036396Cytochrome P450 superfamilyGENE3D1.10.630.10Cytochrome P450coord: 85..560
e-value: 8.2E-104
score: 349.6
IPR036396Cytochrome P450 superfamilySUPERFAMILY48264Cytochrome P450coord: 99..544
IPR017972Cytochrome P450, conserved sitePROSITEPS00086CYTOCHROME_P450coord: 485..494

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr026500.1Sgr026500.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016117 carotenoid biosynthetic process
molecular_function GO:0020037 heme binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0009974 zeinoxanthin epsilon hydroxylase activity
molecular_function GO:0004497 monooxygenase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen