Sgr027961 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr027961
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Descriptioncarotenoid 9,10(9',10')-cleavage dioxygenase 1-like
Locationtig00153056: 1952796 .. 1970609 (+)
RNA-Seq ExpressionSgr027961
SyntenySgr027961
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTGGAAGATCGAGTTACATATGGGTGGAAGGAGAAGGGATGCTTCACGCCTTGTATTTCAATAAAGACAGCCGAGGCACATGGAATCTCCTCTACAACAATAGATATGTCCAAACTGAAACATTTCGACTCGAAAAACTTGTCAGAAACCGACCATCTTTCCTTCCTGCTGTGGAGGGCGATTCTCTCGCTGTTCTCTCTGCCTTTTTCCTCAACTTGGTCTGCCCATTCTACTACACTCTCTCCCTGCATTCACCACCTTTTGTATTAAAATGTACATATTCTGTTGGGTTGGTTCATTTTACAGCTAAGATTCGGCAAAGTCTACAAAGACATCAGCAACACCAACGTGTTGGAGCACTCGGGGAAGTTTTACTCACTCGCAGAAAATTTTATGCCCCAAGAGTTCGACATCACGACGCTGGAAACTTTGGGCGATTGGGATGTCAATGGTGCTTGGAACAGACCTTTTACAAGCCATCCAAAGGTTTCTTCAATTTATATATGCCGGCCGGGGCGCTGATCTTTATTTCTAATAATTTCTCTTTTTGCCTGAATTCTTCTTCTTTTGCAATCACAGAAAGCTCCGGAAACCGGTGAGTTGGTGGTCTTGGGTGCCACTGCAACCAAACCCTTCATGGTATCGGAATCATTTCTGGTAATAATATTCTCCTACAAACAATTTCTTTATTATTTTTCTTTGACTAAAACTTTTGCCACCTATACTTATTTATTTATGTTTATTGCTTTGCAGAAGACGGAAAGCGAATGGTTCATAAAGTCGACGTCAAACTCAGTAGAAGTAGCCTCAGCCATGAGATCGGAGTCACACAGAGGTAAAGCCAGTTGCTTTCCCTTTGAAGAAAAACATATCGGCCGTGATATCTGATAAAAACTTGATATCATTAATAAGTAAATTCAATCCTAACTTATGGGATAATAATATTAGAACGATAAATCAAAATCTATTGGTTTGATCGATGTATGGAGCTGGAATAATGCAGGTACAATGTGATATTGGATTACCCCGTAACTGTCGACTTGAACAGACTTATCAGAGGCGGACCGTACGTTCTTTCTTTTGAGCCGATCATACTGTTTTATACGTAAAATAATGACGAATTTCTGACATATAAGCATGCATTGCAGATTAATAAAATACGATAAGGAAGGGTACGCCAGAATCGGAGTAATGCCTCGTTACGGAGATGCCGATTCAATTCAATGGTTTCAGGTGAAACCCAATTGCACGATGCATCTTTTCAACTGCTTTGAGGACAAGGATGAGGTTGGATTACTAAACTAATGCTTAATAAAGCTAAAAAAAAAAAACAAAATGAAGATGCTAATCAGTTGATAAACAATATAGGTTATGGTGTGGGGATGTAGAGCTTCTGATTCAGTCATACCTGGACCTGAAAAGGGACTCAACAAATTCGAGTGGTTCTCTAAGAGATTTAAGCCATTACCTGCAACTGATCATGAAGAACACAACACTGATTCCTCCATTGAAGATGGGTCGTTGTTTTCTCGTGCTTACGAGTGGAGGCTGAACCTCAAAACTGGAGAGGCCCGGGAGAGATCTCTCACCGGAACTCAATTTTCCATGGATTTTCCCTTCATAAATTCACGCTTCACTGGTCTTAAAAATAAATTTGGATACGCACAGGTTCTTGACTCCTTCGCTAGTTCTAACTCAGGTATGTCCAATCATCACTAATCTCAATAACTAAATTAACCCCGAGTCTTTGAAACTATTAATTATTCAATAGTAAACATTATATTTGCTTTGGTGTTCTAATGTTGTTTGATTTAGGCATGTTTAAATTTGGGGGCCTGGCGAAGCTACATTTTGAAGAGCCTCAAAGTTTTGAATTTTCGTTGGTAATTCCGCATTTTAATTATGAAAAGCTACCTTGTATACTTCTTCTGTACGTTTTCTGACTCCATGAATGGGAAGTTGCAGCCAAAAAAGTGTGAAGAACAGCGTATTATAAAAGTGGAATACCATATGTTTGAGAACAACTCCTTTTGCAGCGGAGCCTCCTTTGTGTCCAGAGAAGGCAGTCCCGAAGAAGATGATGGTTGGATAATCGCTCATGTTCACAATGAGATCACCAACACGTCTCAGGTTAACTCCATTTTTAACCCATTAATTTGTTGCTTAAATTCTATCTCTCTCTTCTTTTTAGTCTATATATAACCGCATTTGTTTTTTTTTTTTTTTTCGTGATCTCATCGATCAGGTATACATAATTGATGCAAAAAAGTTCAGCGAGGAGGCAGTTGCCAAAATCACACTGCCACACAGAGTTCCCTATGGATTCCATGGTGTTTTTATGCCCATGTCATGTAACAAATTATAATGAATGCAATAAATTTTAATTCAGAAAAAAGGTGCCCATCATAGCTCTGGTCCGTTTTTCAGTTGTTTAGATGGTGGTACCGACTTATGTAATTATGGCTCTTTAATATGTTTATTGAACCTTAATCAATAGCAACCCTGTTAATTAGAGTGTTCACATCACATTGGCATTTCAAAACGACCAGTTTACTAGTATTAGATGTTTTTTTTTTTTAATATCATATCACATGTAGAGGTGAGAATTCGATTTTGTGACCTCTTAAAAGAAATAAAATATTTTAACTACTAAACTACGCTCGTGTTGACTAGTATTGGACGTTATTTTGGAAGGAGATTATTATTTTATGTGACACATTATATAATAAAGCTGTGATTAGTTGAAAAAAGTTGATAATGAAGTTAAGGTTGGTAATTAAGAGTGTAAGAAAGGAAAAAGAAACTGGCATATAACATGGATTTATTTAGTGTGCCATGTGTCAAAACCTTACTAGAAGCGATAGTGGGTTGTGACCCACGAAATGTGTAGTCAAGTAGTGGCAACGTGTGTCATTGTACTTTGAACTATTAAGGGTCAAGAGCGAAGTGCCTCGATGCGACCAAGGAGGATGGTGCATCGGTTAACACACCAAGAAGAAATGCACTAGAAAGCTTAGCTAAGATGCAAGCAAGATTGAGCGTAATGGTGACGTCAGTTGTGTTAGTGACAAGCTTAGTGGGGGCCAGATGGGGCCCCAGGCCACGACCCCAAAATTTGGGGTTGCATCAAGAGTTTTGACGAGCAACTCTAGAGTTAGATGCATATTAGTGGGATATGTAAGCATGCCAGATTTCAGGTCAATCAAAGATCAGAAGTGCCCTCTACATTGGCGAGTGACTTCTCGCCACGTGCATGGGTGTTAGACCAAATATGCCTTAGAATGTACTATAGGGGGAGACATGATGTTGGCTCTCCACCACTCTTGGGCCTCATGGATTAAGTGAGCCACCCCATGACACATGCGTGTGTCATGGCGTGGGCAAGCGTTGCAAGACTAGCGTCTTGGGCGTCTTTTTTGGTAATGCGAATCTCAAGCACAGTATCGAGTGGCACTAATGTGCCGGCATTGTGCCTCTTCCTATGGGTCAGAGGTTTGGATATAGGGTTAGGCTCCGGCACGGTATGCACAAATGAGTTGGCACGAGCATGGGAGTCACCATACCACGATAGTGCTCGAATGGATGGATGTTTGGGGCAACCTCGATATGAAGTAAATACTCATGGTTCTCGGGATGTGTGCCCGAGCAAGCAGCAAGGTCCAATGACACTCCAATGTTCGGGTGGCATCCTAAAGCATCAGTGGGATTCGGGATTACCCAGTAAGTATTATAATCGCTCGATAGAGCCGATAAATGGCCGGTAAAGCCTGGTAAGCACGATAATGGCCTAGTAGGGCCAGTGCCGTATTTACCCGATAAGTCCTAATAAAGCATTATAAGCCTGTTCTGACCTAACGAAGGGTCAGAACTCATCCTGGTGAAGTTTCAGATCGATGTAGAAGTGAGACGACTGTGTCGTCAAAACCAAGATGAAGGAAAAAGATGTGACGATGTGTTCAAGATGACCTTCCTAGTGTAGAGGCAAGGCGGCTAGAGTGGCAAATGAGTTGAAATGCACGTACGGGCGATGTGTGTTGCTGAGGCAAGTGAAGTTGTACACATAGTTAAGCTCGGTTGAGGATGAAAGAATCTCGCCTATGGTTCAATGAAGCCACAAAGAGTCGGAAAAATTATATATGGGAGTAGATTAGTCCCAAGACTGGCCATAGTTGCGTATATCATAGTGTCGCACCATTGGAAAGAATACGACGTGACACCAATGTTGTTGAAATTATGAATAAAAGCAATGGAGCAAAGGATCGTGGGGAAAGTGAGGATGTAGTTGGGTTGAACAGTGAGTAGCCAGACATGTGAACATGCAAAGCCCTAAGCCTACTTGGGAATGTAGAAGCTATACATGCACAGATCAGTCCCAGCTGAAAATGGAATGAACTCCTTGCGGCATGTTTGCAAGAAGCAGAAATCATAATTAAACTCATACATAAGTGATTTTGATTTAATTTAAAGTGATTCTCGTCAAAATCACTCTCTAACTTTTTTCTCTCCAAAATTTTTGTTCCTCCCTAAATTTTTTTTCTTTAAATTAATTTTTGTATCTCGCCCAAAATTTTTTCTGCTCTCCAAATTTTTTCTTCCCTAACATTTTTCTCTTTCTAAAATTTTTTTCTCTCTCTAAATGTTTTTTCTCTCTTTCAATTTTTCCTTTCTAAAATTTTTTATCTCTCTTTAAATTTTTTCTCTTTCAATGTTTTCCTCTCTAAAATTTTGTCTCTCTCCATTTCTATTTTTTTCTCTCTTAAATTTTTATCCCTCTCTAAAATTTGTCCCTCTCTTTCTAATTTTTTTCACTCTCTAAAATTTTTGTCTCTCTCCGCTCTCCTTTTTTTTTCTCCCTCTTAAATTTTTATCCCTCTCTCTAACTTTTTTTACTCTTTTATAATTTTTGTCCCCCACTCTCTTAGCTTTTTTTTTCCTCTCAAATATAATGCATTCTTTTAGTAATTTAATGATTCTAAAATTACTTTTAAAGACAAAGATTAATGTAGTAAGCATTTTAATTATATTAAATCCCTTTTAAAAGTGTGGAATCAAATATTAAAATGATTTATTATTAAACACTTTTTACAAACATAATTATAATCAAATCACTTTGGCTAAAAACATTTATTCTAAAATGACTCCCTATGCCCTAAATCATACTCTTTTTTTCCTGCTTGCTTAACAAAATAATAATTTAAATGATTATATATATGTTAAAATATGAATGATTCTAGAATTAGAAATTCTCTCTCCTGACCCATACTTAGCATTTTCTTCAACTACCCACTCATTTTCAATCCCCACAGTCTTTAACAATTTTGGGAATCCAATTCATTTTCCGGGTCTACTCTTTAAATTTTACCAAATTTGTTCATGTTTATTTCATCAACAAGGAATAAAATCTGTACTCGTGGATGAATCTGGGCCATATCAAGTAAATGACCCCATTTGCAATTTACCCAAAAAAAAGCGACCTGTCGTTTCGCTTTCAAATTTTCAATGAGAGCTTGATCCATAATTAAATGTGTGGATGTGTTTCTGATGTTGAACTTGATCATTAGGAAGTATGAGTCGGCAAAGAATTTGGAGAGAGCTTATTTCTACAGATCTCAGTTTGATCAAATACAATTTACAGTCACTCAGATAAAGTCCGACCTTTTTTCGTTTTTTTTTTAGAATATTTTTATTGTTAGGAAGATAGGCAGCATGAATTTTCCCCTCGTCTTCTTTTAAGAAAAAGAAATGGAAAACCCTGTTCCATTAATAATGTCTTGTTTCTCCTAATACAAAGATTACAAAAGTAAAAGGCTTCTGTAACCCTAGCTCTTGAAGCAAAAATAACGCATATAGTTTATAAAATTTATGCAGATATCTCTTCAACCTGTTCTTATAAAGCAGGAGCGAGGAAAAGTGAATTCTTCTGGCGATGGATATGAACAGCAATGCGCCATTAACTACCGAACCCACATAATTCATTACGTTTTCTGATTCAGACCAAACCCAATCTTCAACTTAATTGCTATCCCATTAAAAAAATCCACTATACCAATTAATTCCAACCCATTTTCATGCTCTAAAACATCCAGCTAGGGTTTATTAAACTCTTATCAAGTAATATGACAGTAAATTATATATAGTTTGTTAATTGGTGGAAGAAATATTTTTTTAATGGGATAAGAATCTCGGGTGCATGTTTTTGTTTGTTATGCTTGATGAATCCCATTAACTCTCACATTAATTATGGAAATTTATACGTCAACTTCTTGTTTGCCTAGCTAGCCATGGCGACAGCCATGTACACCTCGCTTCAACTGTTTCTTAATTACAACACCCTCCTGCTCCTTCCTTCAATTCCCCATAACAAAAGAGCCTCCGTCACCTTTATGGCCTCCCAAGCCAATAAGTCTTTACTCAGCTTCATGGACGATGAGAAACTGAGGACTCCACAAATTCAATCCCCCGCCCAAGTTTCCTCAGAAACAACCCCACAAGTTATGTCTGATCAACAGGGCGTGGCGCCGTCGTCCAGCCCACATCACTCTAAAGTGCTCAAGTTAATCCCTCCGCCGAGTCCGGAATCGCCATGGACGCTGTCTCCTCTCCAGACCCCTTCGCCTCCTCTGCTTTACCACTGCATAGCGTCGCTTCATCGCGGCGAAGGGAATATCTACTCCATTGCTATTTCCAAGGGAGTGGTGTTTACTGGCTCCGAAACCAGCCGCATTCGTGCATGGAAGCAACCGGACTGCATGGAACGTGGGTATCTTAAGGCCGCCGCCGGCGGAGTCAAGGCCATTTCAGCTTCCGGTATATATTCGTCTCAGATTTTAGTACTAATTAAAATACCTAGATGTAATTTGTAATCCTGCCCTAACTTATAATAGGAAAGAAATAAAACTCAAACATTTAAAATTAGATTTACATAAGTTAAATTCAAGTTTGGTCCCTATTTAATTATTTTAAAAAAGATTCTAAAAAGTATTTGAACTTTTAAAGTTGTGTTTTTTTTTTTTAACTTTAAAATTTTTTAATGGGTACTTGAACTTTTAATTTTCTGTATAATTATCTCTTTCGTTAGTTTTGTCTATATGGATGTCATATTTAACTATTACATAGTAGACTAATTTCAAACAAGTTTAAGTGTTTGGTGGTGTAAGTGTTAAATTGATGGAGCTAACAACAAAGCCTATTAGACACAAGAGTTTATTAGAAGTTTTTCCAAGATTAATAACAACTCTTAGAATTTAAGAATTTATTAAAATTTTTTAAAGTATAGAGATTAAATAAACACTACCTAAATCTGGATCAAATTTATAATTTAACCATATTTGTATACATTATAGTATAATTTTTTTTTTAATTCATTAATTAGGGTTTAACGTAAATACTTTCACGTGTAACAGAAAACATGTTGTTCAGTTCACATAGAGATCTGAGAATCCGGGTTTGGAATTTCACGGTTTCGGAAAGTTTAATGTCCAAAAAGGTGTGTTCGCTTCCGAAAAGAAGCTCATTTCTGTTGTTCTCTCGAACGGAAACCGCCCACCTCCACAAAGATTCATATCTTGCATGGCTTATTACCACGCACAGGACCTCCTCTACACCGGCTCCCACGATAAAACCATCAAAGCCTGGCGAGTTTCCGACCGCAAATGCGTCGACTCCTTCGTCGCCCATGAAGATAACGTCAACTCCATAGCGGTCAACCAAAACGACGGCTGCCTTTTCACTTGCTCTTCAGACGGAACCGTCAAAATCTGGAGGAGGGTTTACCGGGAAAACTCACACACTCTCACCATGACGCTCAAATTCCAAACTTCTCCGGTAAACGCCGTCGCTTTATGCTCCCACTACGACACCACCTTTCTGTACTCCGGTTCCTCCGATGGGACCATAAACTTTTGGGAGAAGGAGAAGCTGTCTTACAGATATAACCACAGGGGGTTCTTACAAGGCCACCGATTTGCGGTTCTGTGTCTGGTAGTGGTGGAGAGGCTGATATTTAGTGGATCGGAGGACACGACGGTGAGGATATGGAGGAAGGAAGAGGGAAGCTATTACCATGAATGCTTGGCGGTGCTGGACGGCCACAGAGGGCCGGTGAGATGCTTGGCGGCGAGCCTGGAAGTGGAAAAGATTACGACGAGTTTTCTGGTTTACAGTGCTAGTCTGGACCAAACGTTTAAAGTATGGAGAGTGAAGGTATTGCCTGAAGAGAAAAGGTGCTTGGATTATGGGGAGGGTGGGGAGTCCAAGATGAAGAAGCTGGGGGAGCTCTATGAGATGAGCCCTGTGCTGTCTCCTTCCTGGGTCGAAAAAAAACTTCAAGCTAATTTTTTTTGAGTTCCAGATGAGGTTATTTAATCCGTTCAATTGTAAATTCAGTCATTGTAGTTTTAGTGTTTCTGTTTATTTAGGTTGAACTTTAAAAAAAAAAATTAATTATCTTCTTTAGTTTTATTAGTTTATGCCTTTAATATTATTTATTTATGGAAAGTTGAGGTGACATTATTCATTAATTTATCTCCTTATTCCAACAGCCACATCACGTCGATTCATTAATGAAACTTCTTTAAATTTAAGAAATATGCGCTACATTTTTTTAATATAATCTTTGTGAATTTTTTTTTGTACATCTTTGTGAAATGTTTAATGTGTGGATGACCATTATAGATAGAATATAACAGTGTGATGATATTTGAAGTTATAGATTTAAATTCTCATGTAGCAGTTAATGTATGTATTTAGTATGTATTTTCATTTATCAAACTCATTTGTTTTGATGAAGCCAATCTGACAAAACTTTGTTTTGCTATATATATATATTAGGACATTAGAAAACCAAATAAAAATTTTAATTTAAAATAGTTTAATAATCAAATTCAAAATGAGAAAAGAAAAATTCAGATTAGTTTCTTGTAGTGCATGAAAGGCTTGTCAGGGCTCAGGAGCAAGATTGGATCCTATGGAAACTATTGTACTGTAGAACTTGTGGGGCCTTTCAACTCTGTCAATATGATTGGCCGTTACAGTATAGTAGATAATTCTGATATTCTCTTTGAACAATCTATAAAAAAAAAAATAATAATAAATAAATAAATAAAAATAAAATAAAATTAGAAAAGACAACATTTTAGAGACATATTACCTTCTTACCTACAGGAAAACAATTTAATACTTTATTTTAGTAAACGGAGAATATCAATTTAAAGACCCAAATTAATTGTTTGTAGAAACTATGGAGACAGGACAAGAACTTGAAAGAGGTAGTGAGTGAAACATATTCAGAAAGGATAACACCTCTATAGAAACTATGTTCTTTCTAAATCATTTCTTAAATTGTAGTTTTCTCTACCTTAAAAGTTTTGCCTTTTTCAGGGAAAGCTTGCAAGTGGATCTTGAAAGTTTTGGAATGTGCATCATGATTTTTGAACCAATCCCTCATCTTTTAGATGCCTAAGTTATGAATTTAAATCTTTGTTAGATTCTTCATAACAGTCACTGACAGAAGAACACTACAAACTATAGAAAGCAGCTTCATTCATTTTATTCCAATGTTCAATATGCCTATTTTTCAGGGTCGGAATCATTTCATGTAGGCACGAAAGTTCAACATATATTGTATCCCCTCAATCCACATCTGTCTTTCTCCTTTCCCACTGCACTCAAACTCTATTGTTCTATCCGTTGTTACTATTCCAAAGTAAGCCCTTGCTCGCCATCCATTTCCCTTTCCCGCCCAGGCCAGGCTGCAACGTCACAGTTCACACCCGAGATTACACCTGAAACAGAACATGGTTCATCAATGTTGATGTTGTGCAAGCAACTGCAGAGAAGATTTGTTCTGTGTATGAAGCTTAGTCTTACATTTTTTGTTTTTTGTAAATGTTCCGGCCATGTGCCTGCTTTTCAATTTTGCCACAACCTGTATACACATTTTACTTCTGAACACTTATTTCCAAGGATGTTGATTAAGAGAAGATTAACATAATATAAAGAAACTTGCTGCATTTTTTTTTGTAAAAATAGAAATATCAATCACAACTTCATCACTTGATCATCTGGCACTCGCTTCAACTTCAATACAACTATCAACCGTTTCTTGCCTTACCTGCCAATTTGAATTTATGTTGAAAGAAACTTGCTTCCAGTGGAGAACCCCTGTCATCACCACGTTTCTAATCTCAGTGACAGCAGAGCATCAAAATTTGTGACACTATTCAAGCTAAAAATGCACCCAAACTGTATCATTTTTTCCTTGTCGTTGTAACCAACCTTTCCTTGTGCGTTTGAGAAGCTCTCCTCCTCTGGAGACATAATTAATTGCTATCAAGATGTTTGATTCTTTCCCTTCTTCTTCAACTTTATCCTCTCCCACCGCGAAGTTTGTGGCTCCCAACCCTTTCTCAAGCCTAGCTTTGAGGGTTGCAGCTCCTCTTAAAGCTGCACTAGTAAACAAGTGTTCTGAGTAGTTAGCATTTTACTTCATCTGTTCTGAGAATTAATTGTTTATGAAGTTTTCTTGGTACCTGTTGCTGCTCCAGCAGTTAGGGTCATGATATCTCCGTTAGTTCTAGCATTGATTGCAGAGTTAACCACATTCAAGATATGTTCATGGCTAGCTCCCATTTCCTCTGCCATCTCGATGCAATGAGATGCGACGAGAGCAGCTGCAGAAGCTATAGCTGCTGATGTCTTGGAAGGCCATTTTTGATTACCAGATGATGTTTCCCGCGACACAAGTGATGCAATAAATGCTGCTACTGAAGCAGCAACACCTGCCACAGATACAGCTGCATGTAATTGAGCATTCTGTGTTCTGATTTCCTGCTTCTTCCTTTCCTTTTGATCCTTCATCCACCTCCCTAATGTTTTATTACCTCTTAATATGCTCTTGTATAACTGCAGCCCAAAAAAAAAAAATGATGAATATGATAATTTTGCTTCTTTTTTCCCGTGTATGTGTGCGTGTTTCTAAGGAAGAGCGAGTGAGGCATACGCCATTTCCCAGTAGTTGCTGATTGGAAAGGAACTCTGGATTCAATGCTTGGTGAAGTAACAATAGTTCCTGCTCAAAATAAGATGCATGTGCAGTTAACTGAATTTGAGAGATACTCTGTACCAACATAACAAATGATGATGCAGGAAAAACTAGTTGGTTTGCAGAATCTATCTTTCTTCTGGTGTGTGTGCTCGACCGTGAGCTGCAGAGAGATTCAAAATAAAGTACCTTCATTTCGTCACTTCCTCTTGGTGAAGCTGGAGGGCTATCTCCACTGGGCAAGTGCTGAAGTAGCTGTAACATGAAGATCAGCTGAACAATTTATCAAATTCACATCTACCGAAGTGATGCAGCAAGATTCAAGTTTTGGCTAAGGGCTCAAGTTATTATCCTTCTTGTTCATTTCATAAAACTACATATCATAGAACAAGAGAATATTGCACATTTAACACTTTGATGTGGTTTTAATTCCCTGTCCTGTGACCTTCGGCTTTTTGCCAGACACAGCTAATACCTTAGATATAAAAGATACATGACTTAAAAGAGAGTTGGAAGAGGTATCTCACTGGTTCCCCTAAAACTGTGGAATTTGCATCATGTGCTTCAACACTAAGAAAGCCAACCACTGACTTTTGAACATGACTCGGAGTATCTTGGGCAGTGGAAAGAGCCTTGGAGAGCTCTTTGGCCGAAAGGCTCCACGATCTGCCAAGAAACTCCATGGACTCGATTGGAGTTTCAGGTAGAACACATGAACCCGGTAACCAACTTGCAGGGCCATTTTCATCTATGTTTTCCAGCCGAGTTGATGAACATTGAGCTAAACATGAGCTCACTGAAACCATTTCAACACACACAGAGATAGGTGATCACTGCTCTAGTCTGACTCTATAACATTTAGCCAAAAGAATGATTACAGTAATGTCAAGAGAAGTGACTATGCATGGCCTTACTTTCCCTGTCTGAGTTAAAGAGAGATGAGCATGACTAATTCGTTCAGTCTTCAGATTCTCATTATACAGAAAGAAACAGAGACCGAGCACTCCCGAGGAAATGAGAAAATCATTTGGGTATTTAACAGAGAATGATGAAATGGAGGTAACGTTTTGGTCTCGAGTTTCAAGATTCTTACTTTATTGAGGAAAGTGAAAGCCAAAAAGAGGACACGAATTCCATAAACATATAAAAATAAGGACCCAACCCATTACGAGGTCGATAGTGTATTTGTTTTTGCTGCTTTTGTTTTCCACTTAAACTGACATCGTGCATAAACTCAGCGGCAGTCTTCATGATTGCCATGGTACTCATGTCTGTCCTTTGTCAACTTTATCAGAACGTAAGGTAAGAAATCTTGTACGATTGTATCTGCGCCCATTGAATCTCCACTGAACACTGAACAGAACTAAGCAAGTAGTAAAAAGAATGTCGCTAAAAGCTATGGCTTTTGTTTTTATTCCCCGGAATAAACTCTCGTCCAGTCCCGATTGCTTCTTTATCTTAAACTTTTCACGGCGATCAACAATACTAGTTTTCACTCAAAAATAAAGTTAAGGGACTATGAACTTACGTCGAACAGTTAGCCTGCATTTGCAAAAAGAGCCAGAGATAGCTTAGAAAAAGAAAAGCACATCTTTTTTTAAATCTTTCCGGGGCTGAACTTTTACGGCAGAACATCTGATGATTGATTTATGTACATTATACCATACCAACAAAATATACTACTTTAACAAGGCTTTTAAAGTGTGCAGCATTTAAAAATGTTAAAGGCGCATGAATTGCGATACCTTTTGGCTTGTCTAAGAGAAGAGCCACGGGGAATCGATAATATATGATAATGATTATCAATAATAATGATATGATCCCAAGGATGTCGTCTCTCTGTAACCCAAAATGAGAGTGAGTGGTAGACCAGAGAGAGCATAGAGAAGAGATCCACCGAGGGAGACGCCATCTCTGCGCTTTGCAACTTTAATTATTGCTATTATGTCCAATCATGAAAGCGGAATATAGTGCACCATACCTAGTGCCTCCTGGATCTCAGGCACTCACCACAAACATTATCACAGTTTTCTTCCCTTTACCTATTTGATTGCGGCGCCACTCCAATTTATCGGCTTTTAATATTATTTTAAAGATTGAACGAGTTCTATTATTAAGATACACTAATATGAAAATGACAATTTGATAAAGGAGCCATAGCATGACATTTGAAATAATCATCGGATGCACAGCAATATCTTCCTCTTTTTGTTTCATCGAATAGAAGAAATTACTGTAGGTAAATTATTGCCAACATCAACCTAGGTCAGTGAATAAGACATATCTATTTCATCTAATAAGTAGTAGGTTCAAATTCTGTGGCTTGTAAGAAAAGTTGTTATGTTAAAATGTTAAAGTTGACCTGAAGTTATCTAATCACAAAAAAGTTCCAATTCAACAATAAAATAATAATAAAAAAACCTCAAATGTAGCACCAACATATCTAAACCTTGCCTCAATTGAATGTAATAAACTCAACATCTTTAAAAGGAGTTTCACCCAATTAGAAAAGCTTTTTTTTTTAGTTCAACAAATGTGGGATGAAATAAATCTTCAATCTCAAGCAAAGTAATAGAATGCTTTATCTAATGAACTATGTTTGGATTGGAATTGAAAAAACATTTATATCTTAGCAAAAAATAATTTCATTAATTATTCAAACTTAGAATTAAAAAAGAATAAGATTAAATTTAATTAAGTACAAGAATTAAAAAAAAATATGCCCCTTTTATTTTCCCTATCCTAACAAAGTAATTGGATGTACATCATATTAACACCATTCGAATATGGTAGATTTTTAAGCAGTGGCACCGCCTTGTTAGAACTATAATTTGAGATAGGGTGACGATGGCATGGAACCACACCATCCACTCGATTGCCATTGATTGAGCACGAGCAGCAGAAGCCAGAGAGTCAGCCCAAGCACGAGCACCATTATTGGTGAGAGAGAGAGAGAGACACACACACAGAGTTTCTTGAAAGTGGTGACAACATTCATGGCTTCCCATGCATACCAAAATCCCATTTTTCTTTTACCTTCATCTGGGGAAGCCTTATGGTGAATCTTAGATTTTGCTGTCGGAAAAGAGATTTTATACCCAAAACAACATCAACGCCGACACCCATTAGCCCATACACCATTAGAGACGCACCCAATGCAATCTCAGCTTTCCCTTCTTCATAATACACAGAGAGAGCTGAGAAAGCATCATGCACGTCGCCAACAAATACTGAAAAGAGAGCAAGGGAAGACGATAGTAAAACACAACCAAGACGACGAAGATGATAACAAGAAGAATGAGAGCGATGAAGGAGCCGTGAAGAAGCGCAAGAACAATCAGATCCGAACTGGGAACCATCGGTTCAGTTCCTGCGATCTTTTGGAAGCCAGTTTGGTAATGGTTAGAAGTTGAGGAAAGGAATGAGAATGGAGTGAGTCGCACTCATGCTATGCTTTGGAAAATGAACTAACGAAGCGGGAACCAAGTTTGTAATGGAGAATTTCACAGGATTGCAAAGCAAGGCAAATCCTTTGTATATCCTTGTTTGGCACAAAAGAGAGCGAGAGGTAAGAATCTGGTCACAGTAGAGTGTAGAAGTTTTCATGACGATACTCTTGGGCGTCAAGAACCTCCCATTTCTACGGAGGCAAATTTCAATCTTGTTTTAGATACATACATCTTCATTTTCATTATCATGAATAAAATAATGACAATTAAATGATAAAAATTAGAATCCATCAAATCAAATTGAAGGTTCTGATTAGTTTCCACCCCTTCCATTAATCTTGAGTTGTTCTGTAATGTTTAGTAGAGAGAATGTTAAATGCCTTCACCAAATAAATAACTCGAAAGCAAGTGTACAGGAGCAAGCCAAAGCAATGCAGTTTGACGCCTAAAAACACAGTCATTCAAAATTGCTTTTACTCTTGCAATCTAAAAAAATGAACTCAAAGCCGGTCCTCTTCAAGCGTCTCTCTGAATCTTAAGGGATGATTTCAAATTACAGTCTAAGCATCTATCCCTAACAAAGGGAAGGAAATGCAGCTGGGTCCGTCTGTACAATATCAAATGCCCTTCAAAAGAAATGGAGATCAATATTGAATTGTACTACAACAAGACTTACTTCCAAAAGCTTTGCCATTTTGGCAAAGTAGAAGAATGCATTAGCTACATCAGGTGAACAATCAGTCGAATAATGCTAGCAGGAAGTGCACCGTTCATCTCGCTCACCAACAATGCCAAACAATTTGTATGCAGGAGAAACCATTTGAAAAATAATGCTGGGCATTGGAAGGCCATCTGAGTAGATCAAGGTGCAGTGACAATGTAACTTTTTCTGGATGACATGAGGATACCATCATAGTTAACTCATTGGTTTTGAATCGCAGACCATAGAACAAAATTTTAAGCTTTTTGTGTAAGTCCATGAAGGCTAAAAAGAAGTCAACTAGATTTCAAACATCTAAAAGATGAGTTTTCTGTATATTGTGTCTTGGAAGTGGATCTGGAGTAAGAGATACAAGAGAATGTAAAGAATTTTAAATGGCCCACTAGACTCATCACACTTGAAAATCCAATGGGGTCTCTCTGCTGTTCATTTTCCCACTCAGTTGAATAGAAAAAGACATTCATTCACAAGTTAGCACAGAACAATCATCATTGAAAAAGAGAGCTTCAAGTACTCACCTTCCTTTCAGATTGAACTCCTTTCAGATCTAGGCTCTCATGTAAGTGAAGCAGTGACATCTAGATTATTCATCACGAACCAATTGCGAGGATAAACTGCGCTTGACGGTTTAAAATTAACATTCCAGACCATGATTCTCGAGCAGTCTTCCAGTAAGCCTTCCATTTGGCTTTCATCTTCCACAATTTGTTCGATGAAGTCACTTGCCTAAAAGAATTGGGAATTGGATTCAGTGGATTTTAATCTGGATATCTTGATCAAATCTCATTCACAAATCTATATTTTCAAATAATGATGTTGAATATGAATAAAGAAACACAGCTCAGTAGTATGAGTAACAATCCAAAGAATCAAATAGCTAAGCTATAGCACAATTTAAGAAAAAGTTTACATATTACTTTTCTAATTTTGTTAATTATTCATATTTTGTGTATTTTTCATGAAAAGTATGTAGCCCTAGATCAACAATGGACTGCAAATTATGAAGAACTTCAATTATTTCTTCTGCTATGTGGAATATTTTTTACCCCATTCTTTAGCTGATGGTCCATGATGATTCTCATAATGGTTTTCAAGTCACACAGTTCAGTACCACTACCATTCACTAGCAGTTGAAGTAAATAGATTATATATGAACAAGAAGAAAGACTAACTTGGAAGCCATCTCTGATGGTATCAAGCTGCCTAACAGGTTCTAAAGTGCGTCGACGCCACCTCTCTGGATCCCATAGTATGGTGCTATTGAAAGCAAAGCCTGACATTTCAGCATGAAATCTGTGGCGTCTCGTACTTGATTCATGTATGTGCCATCCAATTACAAGACTTTCATTACATATGGGGCCTTCCAGAATGGATCCACTGGTACCCCCTATTAGTTTGGCCACCGTCCAAGTTCCAAACCGTCTACGAATGAAAAATAAAAAATAAAAAGACTGGTTGCAAGGGCATTCATGAGATGAGAGATGCAGATATGATGATGCACATTAGTGCATATAGCCTACCCAAAGAGATGTCTAGCAACAAGGTTTTTAAAAATTAGAAGGAAAAAAAACGATTTCACTCTTCAAACATTTTAGAATCAGCATGTTCGAGAGAAGATACAAACAGTTTGTAATAGGTATCAACAGAATATCTTCCTGCATATGCAGAGAGAGAAAAGTAATGTTGACTCCACACTTTTTGAGGTAAAGGGGAAATTTGTTTACTCTTTATTTTTGTGATCATCTTCGTAAACAGAAGATCCAGAGTTCATGCTTTTAGAAAGCAAACAGCATAGACATGAATGTGCTTGAAAAACATGCTTGAGCACTGAGTTAATTTCATAGGCAAGCATGTGGAAATGATAGACAGACTCATGTTTCATCATAAATGTTAATCTTATGAATCGATATGTTGACAGTACCAAAAAATATTTAAAGAGCACGCACGGATCTGACAGGGTAGAATTATCTCTTATTACTATCTTGAGAATTGAGGACACTATACAAAAAGGTACTAAGGTCAAGTGTAAAGTCTGGTCCATCCCCATTGGTGCATTTTCTAACAAGGCAAGTATTTTATAGGCATTAGGAAATAACAGATCAGCTACATGTATGAATGGTTATTATAGTAAAAGTACCTGATCTCCCTCATTTTCTCGAAGAGATCAACCAAATAGATGTTATTCTCATCTGCAAAGTAAACAATTCCATCAAGGCGGTGGGTTTCAATGTGAGACAGAGCTAGATTTCTTTGATGAACTCTTCCATCTCTCGTATCCGTTAAATTTTTACTGCAAACAATATGCCTATACATAATCCAGTACTCCTCAATACATCAGCTGTTTCATCGGATTGGGAGGACATTTCCACAACAATCCATAGAAGAGGTGGTTGGACCAACTTTAGCGTGTGCGCCAAACGATCAAGATAATAGGATTGGAGTGGATGAGCAGATGTCGGAGTCACAATTATCAGTAGCTTTTGAGGCTCTAATTCTTGAGCGATTAGATGGTTATCCAGATTATCATAAGAAACATTATACATTGGCACTTCTGCATGGGTTGAGTTACTCTTCATGTCTTCACCCTCCAAGGGAATAG

mRNA sequence

ATGTTTGGAAGATCGAGTTACATATGGGTGGAAGGAGAAGGGATGCTTCACGCCTTGTATTTCAATAAAGACAGCCGAGGCACATGGAATCTCCTCTACAACAATAGATATGTCCAAACTGAAACATTTCGACTCGAAAAACTTGTCAGAAACCGACCATCTTTCCTTCCTGCTGTGGAGGGCGATTCTCTCGCTGTTCTCTCTGCCTTTTTCCTCAACTTGCTAAGATTCGGCAAAGTCTACAAAGACATCAGCAACACCAACGTGTTGGAGCACTCGGGGAAGTTTTACTCACTCGCAGAAAATTTTATGCCCCAAGAGTTCGACATCACGACGCTGGAAACTTTGGGCGATTGGGATGTCAATGGTGCTTGGAACAGACCTTTTACAAGCCATCCAAAGAAAGCTCCGGAAACCGGTGAGTTGGTGGTCTTGGGTGCCACTGCAACCAAACCCTTCATGGTATCGGAATCATTTCTGCGAATGGTTCATAAAGTCGACGTCAAACTCAGTAGAAGTAGCCTCAGCCATGAGATCGGAGTCACACAGAGGTACAATGTGATATTGGATTACCCCGTAACTGTCGACTTGAACAGACTTATCAGAGGCGGACCATTAATAAAATACGATAAGGAAGGGTACGCCAGAATCGGAGTAATGCCTCGTTACGGAGATGCCGATTCAATTCAATGGTTTCAGGTGAAACCCAATTGCACGATGCATCTTTTCAACTGCTTTGAGGACAAGGATGAGGTTATGGTGTGGGGATGTAGAGCTTCTGATTCAGTCATACCTGGACCTGAAAAGGGACTCAACAAATTCGAGTGGTTCTCTAAGAGATTTAAGCCATTACCTGCAACTGATCATGAAGAACACAACACTGATTCCTCCATTGAAGATGGGTCGTTGTTTTCTCGTGCTTACGAGTGGAGGCTGAACCTCAAAACTGGAGAGGCCCGGGAGAGATCTCTCACCGGAACTCAATTTTCCATGGATTTTCCCTTCATAAATTCACGCTTCACTGGTCTTAAAAATAAATTTGGATACGCACAGGTTCTTGACTCCTTCGCTAGTTCTAACTCAGGCATGTTTAAATTTGGGGGCCTGGCGAAGCTACATTTTGAAGAGCCTCAAAGTTTTGAATTTTCGTTGCCAAAAAAGTGTGAAGAACAGCGTATTATAAAAGTGGAATACCATATGTTTGAGAACAACTCCTTTTGCAGCGGAGCCTCCTTTGTGTCCAGAGAAGGCAGTCCCGAAGAAGATGATGGTTGGATAATCGCTCATGTTCACAATGAGATCACCAACACGTCTCAGCTAGCCATGGCGACAGCCATGTACACCTCGCTTCAACTGTTTCTTAATTACAACACCCTCCTGCTCCTTCCTTCAATTCCCCATAACAAAAGAGCCTCCGTCACCTTTATGGCCTCCCAAGCCAATAAGTCTTTACTCAGCTTCATGGACGATGAGAAACTGAGGACTCCACAAATTCAATCCCCCGCCCAAGTTTCCTCAGAAACAACCCCACAAGTTATGTCTGATCAACAGGGCGTGGCGCCGTCGTCCAGCCCACATCACTCTAAAGTGCTCAAGTTAATCCCTCCGCCGAGTCCGGAATCGCCATGGACGCTGTCTCCTCTCCAGACCCCTTCGCCTCCTCTGCTTTACCACTGCATAGCGTCGCTTCATCGCGGCGAAGGGAATATCTACTCCATTGCTATTTCCAAGGGAGTGGTGTTTACTGGCTCCGAAACCAGCCGCATTCGTGCATGGAAGCAACCGGACTGCATGGAACGTGGGTATCTTAAGGCCGCCGCCGGCGGAGTCAAGGCCATTTCAGCTTCCGGTGTGTTCGCTTCCGAAAAGAAGCTCATTTCTGTTGTTCTCTCGAACGGAAACCGCCCACCTCCACAAAGATTCATATCTTGCATGGCTTATTACCACGCACAGGACCTCCTCTACACCGGCTCCCACGATAAAACCATCAAAGCCTGGCGAGTTTCCGACCGCAAATGCGTCGACTCCTTCGTCGCCCATGAAGATAACGTCAACTCCATAGCGGTCAACCAAAACGACGGCTGCCTTTTCACTTGCTCTTCAGACGGAACCGTCAAAATCTGGAGGAGGGTTTACCGGGAAAACTCACACACTCTCACCATGACGCTCAAATTCCAAACTTCTCCGGTAAACGCCGTCGCTTTATGCTCCCACTACGACACCACCTTTCTGTACTCCGGTTCCTCCGATGGGACCATAAACTTTTGGGAGAAGGAGAAGCTGTCTTACAGATATAACCACAGGGGGTTCTTACAAGGCCACCGATTTGCGGTTCTGTGTCTGGTAGTGGTGGAGAGGCTGATATTTAGTGGATCGGAGGACACGACGGTGAGGATATGGAGGAAGGAAGAGGGAAGCTATTACCATGAATGCTTGGCGGTGCTGGACGGCCACAGAGGGCCGGTGAGATGCTTGGCGGCGAGCCTGGAAGTGGAAAAGATTACGACGAGTTTTCTGGTTTACAGTGCTAGTCTGGACCAAACGTTTAAAGTATGGAGAGTGAAGGTATTGCCTGAAGAGAAAAGGTGCTTGGATTATGGGGAGGGTGGGGAGTCCAAGATGAAGAAGCTGGGGGAGCTCTATGAGATGAGCCCTGTGCTGTCTCCTTCCTGGGCTGCAACGTCACAGTTCACACCCGAGATTACACCTGAAACAGAACATGGTTCATCAATGTTGATGTTGTGCAAGCAACTGCAGAGAAGATTTGTTCTGTATGATGTTTCCCGCGACACAAGTGATGCAATAAATGCTGCTACTGAAGCAGCAACACCTGCCACAGATACAGCTGCATTGGAAAGAGCCTTGGAGAGCTCTTTGGCCGAAAGGCTCCACGATCTGCCAAGAAACTCCATGGACTCGATTGGAGTTTCAGATTTTGCTGTCGGAAAAGAGATTTTATACCCAAAACAACATCAACGCCGACACCCATTAGCCCATACACCATTAGAGACGCACCCAATGCAATCTCAGCTTTCCCTTCTTCATAATACACAGAGAGAGCTGAGAAAGCATCATGCACGTCGCCAACAAATACTGAAAAGAGAGCAAGGGAAGACGATAGTAAAACACAACCAAGACGACGAAGATGATAACAAGAAGAATGAGAGCGATGAAGGAGCCGTGAAGAAGCGCAAGAACAATCAGATCCGAACTGGGAACCATCGGTTCAGTTCCTGCGATCTTTTGGAAGCCAGTTTGGTAATGGATTGCAAAGCAAGGCAAATCCTTTGTATATCCTTGTTTGGCACAAAAGAGAGCGAGAGGGATGATTTCAAATTACAGTCTAAGCATCTATCCCTAACAAAGGGAAGGAAATGCAGCTGGGTCCGTCTGTACAATATCAAATGCCCTTCAAAAGAAATGGAGATCAATATTGAATTGTACTACAACAAGACTTACTTCCAAAAGCTTTGCCATTTTGGCAAAGTAGAAGAATGCATTAGCTACATCAGTACCAAAAAATATTTAAAGAGCACGCACGGATCTGACAGGGTAGAATTATCTCTTATTACTATCTTGAGAATTGAGGACACTATACAAAAAGTAAAAGTACCTGATCTCCCTCATTTTCTCGAAGAGATCAACCAAATAGATGTTATTCTCATCTGCAAAGTAAACAATTCCATCAAGGCGGTGGGTTTCAATTACTCCTCAATACATCAGCTGTTTCATCGGATTGGGAGGACATTTCCACAACAATCCATAGAAGAGGTGGTTGGACCAACTTTAGCGTGTGCGCCAAACGATCAAGATAATAGGATTGGAGTGGATGAGCAGATGTCGGAGTCACAATTATCAGTAGCTTTTGAGGCTCTAATTCTTGAGCGATTAGATGGTTATCCAGATTATCATAAGAAACATTATACATTGGCACTTCTGCATGGGTTGAGTTACTCTTCATGTCTTCACCCTCCAAGGGAATAG

Coding sequence (CDS)

ATGTTTGGAAGATCGAGTTACATATGGGTGGAAGGAGAAGGGATGCTTCACGCCTTGTATTTCAATAAAGACAGCCGAGGCACATGGAATCTCCTCTACAACAATAGATATGTCCAAACTGAAACATTTCGACTCGAAAAACTTGTCAGAAACCGACCATCTTTCCTTCCTGCTGTGGAGGGCGATTCTCTCGCTGTTCTCTCTGCCTTTTTCCTCAACTTGCTAAGATTCGGCAAAGTCTACAAAGACATCAGCAACACCAACGTGTTGGAGCACTCGGGGAAGTTTTACTCACTCGCAGAAAATTTTATGCCCCAAGAGTTCGACATCACGACGCTGGAAACTTTGGGCGATTGGGATGTCAATGGTGCTTGGAACAGACCTTTTACAAGCCATCCAAAGAAAGCTCCGGAAACCGGTGAGTTGGTGGTCTTGGGTGCCACTGCAACCAAACCCTTCATGGTATCGGAATCATTTCTGCGAATGGTTCATAAAGTCGACGTCAAACTCAGTAGAAGTAGCCTCAGCCATGAGATCGGAGTCACACAGAGGTACAATGTGATATTGGATTACCCCGTAACTGTCGACTTGAACAGACTTATCAGAGGCGGACCATTAATAAAATACGATAAGGAAGGGTACGCCAGAATCGGAGTAATGCCTCGTTACGGAGATGCCGATTCAATTCAATGGTTTCAGGTGAAACCCAATTGCACGATGCATCTTTTCAACTGCTTTGAGGACAAGGATGAGGTTATGGTGTGGGGATGTAGAGCTTCTGATTCAGTCATACCTGGACCTGAAAAGGGACTCAACAAATTCGAGTGGTTCTCTAAGAGATTTAAGCCATTACCTGCAACTGATCATGAAGAACACAACACTGATTCCTCCATTGAAGATGGGTCGTTGTTTTCTCGTGCTTACGAGTGGAGGCTGAACCTCAAAACTGGAGAGGCCCGGGAGAGATCTCTCACCGGAACTCAATTTTCCATGGATTTTCCCTTCATAAATTCACGCTTCACTGGTCTTAAAAATAAATTTGGATACGCACAGGTTCTTGACTCCTTCGCTAGTTCTAACTCAGGCATGTTTAAATTTGGGGGCCTGGCGAAGCTACATTTTGAAGAGCCTCAAAGTTTTGAATTTTCGTTGCCAAAAAAGTGTGAAGAACAGCGTATTATAAAAGTGGAATACCATATGTTTGAGAACAACTCCTTTTGCAGCGGAGCCTCCTTTGTGTCCAGAGAAGGCAGTCCCGAAGAAGATGATGGTTGGATAATCGCTCATGTTCACAATGAGATCACCAACACGTCTCAGCTAGCCATGGCGACAGCCATGTACACCTCGCTTCAACTGTTTCTTAATTACAACACCCTCCTGCTCCTTCCTTCAATTCCCCATAACAAAAGAGCCTCCGTCACCTTTATGGCCTCCCAAGCCAATAAGTCTTTACTCAGCTTCATGGACGATGAGAAACTGAGGACTCCACAAATTCAATCCCCCGCCCAAGTTTCCTCAGAAACAACCCCACAAGTTATGTCTGATCAACAGGGCGTGGCGCCGTCGTCCAGCCCACATCACTCTAAAGTGCTCAAGTTAATCCCTCCGCCGAGTCCGGAATCGCCATGGACGCTGTCTCCTCTCCAGACCCCTTCGCCTCCTCTGCTTTACCACTGCATAGCGTCGCTTCATCGCGGCGAAGGGAATATCTACTCCATTGCTATTTCCAAGGGAGTGGTGTTTACTGGCTCCGAAACCAGCCGCATTCGTGCATGGAAGCAACCGGACTGCATGGAACGTGGGTATCTTAAGGCCGCCGCCGGCGGAGTCAAGGCCATTTCAGCTTCCGGTGTGTTCGCTTCCGAAAAGAAGCTCATTTCTGTTGTTCTCTCGAACGGAAACCGCCCACCTCCACAAAGATTCATATCTTGCATGGCTTATTACCACGCACAGGACCTCCTCTACACCGGCTCCCACGATAAAACCATCAAAGCCTGGCGAGTTTCCGACCGCAAATGCGTCGACTCCTTCGTCGCCCATGAAGATAACGTCAACTCCATAGCGGTCAACCAAAACGACGGCTGCCTTTTCACTTGCTCTTCAGACGGAACCGTCAAAATCTGGAGGAGGGTTTACCGGGAAAACTCACACACTCTCACCATGACGCTCAAATTCCAAACTTCTCCGGTAAACGCCGTCGCTTTATGCTCCCACTACGACACCACCTTTCTGTACTCCGGTTCCTCCGATGGGACCATAAACTTTTGGGAGAAGGAGAAGCTGTCTTACAGATATAACCACAGGGGGTTCTTACAAGGCCACCGATTTGCGGTTCTGTGTCTGGTAGTGGTGGAGAGGCTGATATTTAGTGGATCGGAGGACACGACGGTGAGGATATGGAGGAAGGAAGAGGGAAGCTATTACCATGAATGCTTGGCGGTGCTGGACGGCCACAGAGGGCCGGTGAGATGCTTGGCGGCGAGCCTGGAAGTGGAAAAGATTACGACGAGTTTTCTGGTTTACAGTGCTAGTCTGGACCAAACGTTTAAAGTATGGAGAGTGAAGGTATTGCCTGAAGAGAAAAGGTGCTTGGATTATGGGGAGGGTGGGGAGTCCAAGATGAAGAAGCTGGGGGAGCTCTATGAGATGAGCCCTGTGCTGTCTCCTTCCTGGGCTGCAACGTCACAGTTCACACCCGAGATTACACCTGAAACAGAACATGGTTCATCAATGTTGATGTTGTGCAAGCAACTGCAGAGAAGATTTGTTCTGTATGATGTTTCCCGCGACACAAGTGATGCAATAAATGCTGCTACTGAAGCAGCAACACCTGCCACAGATACAGCTGCATTGGAAAGAGCCTTGGAGAGCTCTTTGGCCGAAAGGCTCCACGATCTGCCAAGAAACTCCATGGACTCGATTGGAGTTTCAGATTTTGCTGTCGGAAAAGAGATTTTATACCCAAAACAACATCAACGCCGACACCCATTAGCCCATACACCATTAGAGACGCACCCAATGCAATCTCAGCTTTCCCTTCTTCATAATACACAGAGAGAGCTGAGAAAGCATCATGCACGTCGCCAACAAATACTGAAAAGAGAGCAAGGGAAGACGATAGTAAAACACAACCAAGACGACGAAGATGATAACAAGAAGAATGAGAGCGATGAAGGAGCCGTGAAGAAGCGCAAGAACAATCAGATCCGAACTGGGAACCATCGGTTCAGTTCCTGCGATCTTTTGGAAGCCAGTTTGGTAATGGATTGCAAAGCAAGGCAAATCCTTTGTATATCCTTGTTTGGCACAAAAGAGAGCGAGAGGGATGATTTCAAATTACAGTCTAAGCATCTATCCCTAACAAAGGGAAGGAAATGCAGCTGGGTCCGTCTGTACAATATCAAATGCCCTTCAAAAGAAATGGAGATCAATATTGAATTGTACTACAACAAGACTTACTTCCAAAAGCTTTGCCATTTTGGCAAAGTAGAAGAATGCATTAGCTACATCAGTACCAAAAAATATTTAAAGAGCACGCACGGATCTGACAGGGTAGAATTATCTCTTATTACTATCTTGAGAATTGAGGACACTATACAAAAAGTAAAAGTACCTGATCTCCCTCATTTTCTCGAAGAGATCAACCAAATAGATGTTATTCTCATCTGCAAAGTAAACAATTCCATCAAGGCGGTGGGTTTCAATTACTCCTCAATACATCAGCTGTTTCATCGGATTGGGAGGACATTTCCACAACAATCCATAGAAGAGGTGGTTGGACCAACTTTAGCGTGTGCGCCAAACGATCAAGATAATAGGATTGGAGTGGATGAGCAGATGTCGGAGTCACAATTATCAGTAGCTTTTGAGGCTCTAATTCTTGAGCGATTAGATGGTTATCCAGATTATCATAAGAAACATTATACATTGGCACTTCTGCATGGGTTGAGTTACTCTTCATGTCTTCACCCTCCAAGGGAATAG

Protein sequence

MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASGVFASEKKLISVVLSNGNRPPPQRFISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTSFLVYSASLDQTFKVWRVKVLPEEKRCLDYGEGGESKMKKLGELYEMSPVLSPSWAATSQFTPEITPETEHGSSMLMLCKQLQRRFVLYDVSRDTSDAINAATEAATPATDTAALERALESSLAERLHDLPRNSMDSIGVSDFAVGKEILYPKQHQRRHPLAHTPLETHPMQSQLSLLHNTQRELRKHHARRQQILKREQGKTIVKHNQDDEDDNKKNESDEGAVKKRKNNQIRTGNHRFSSCDLLEASLVMDCKARQILCISLFGTKESERDDFKLQSKHLSLTKGRKCSWVRLYNIKCPSKEMEINIELYYNKTYFQKLCHFGKVEECISYISTKKYLKSTHGSDRVELSLITILRIEDTIQKVKVPDLPHFLEEINQIDVILICKVNNSIKAVGFNYSSIHQLFHRIGRTFPQQSIEEVVGPTLACAPNDQDNRIGVDEQMSESQLSVAFEALILERLDGYPDYHKKHYTLALLHGLSYSSCLHPPRE
Homology
BLAST of Sgr027961 vs. NCBI nr
Match: KAG7031724.1 (Protein JINGUBANG, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1181.8 bits (3056), Expect = 0.0e+00
Identity = 621/943 (65.85%), Postives = 703/943 (74.55%), Query Frame = 0

Query: 15  MLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNL 74
           MLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVEGD+LAV++AF LN 
Sbjct: 1   MLHAIYFKKNINGEWHLYYNNRYVQTDSFQFEKHVKNRPSFLPAVEGDALAVVAAFVLNW 60

Query: 75  LRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPK 134
           LRFG+V KD+SNTNV+EHSG+FYS+AEN MPQE DI +L++LG WDV  AWNRPFTSHPK
Sbjct: 61  LRFGEVNKDMSNTNVIEHSGRFYSVAENSMPQEIDIMSLQSLGVWDVCRAWNRPFTSHPK 120

Query: 135 KAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILD 194
           KAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILD
Sbjct: 121 KAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLTHEIGVTRRFNVILD 180

Query: 195 YPVTVDLNRLIRGGP-------------------------------LIKYDKEGYARIGV 254
           YPVT+DLNRLIRGG                                LIK++KEGYA+IGV
Sbjct: 181 YPVTIDLNRLIRGGSYVFLPLNPIRLCFIAQIRFRILKFKQSLCCRLIKFEKEGYAKIGV 240

Query: 255 MPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSK 314
           MPRYGDADSIQWF+VKPNCTMHLFNCFE + EV+VWGCRA+DS IPGPEKGLNKFEWFS+
Sbjct: 241 MPRYGDADSIQWFEVKPNCTMHLFNCFEHEHEVLVWGCRATDSFIPGPEKGLNKFEWFSQ 300

Query: 315 RFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSR 374
           RFKP+  TD  E NT    EDGSL SRAY+WR+NLKTGE RER LTG QFSMDFPFINS 
Sbjct: 301 RFKPVRVTD--EQNT----EDGSLLSRAYQWRVNLKTGEVRERFLTGNQFSMDFPFINSA 360

Query: 375 FTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYH 434
           FTGLKNK+GYAQVLDS ASSNS      GLAKLHFEEPQS E SL K  EE+  IKVEYH
Sbjct: 361 FTGLKNKYGYAQVLDSVASSNS------GLAKLHFEEPQSGEVSLLKDSEEEP-IKVEYH 420

Query: 435 MFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTL 494
           M ENNSFC+GASFV RE S +EDDGW+IAHVHNEITNTSQ+ +  A   S +        
Sbjct: 421 MLENNSFCTGASFVPREDSLQEDDGWVIAHVHNEITNTSQIYIVDAKRFSDEPVAKITLP 480

Query: 495 LLLPSIPHNKR--ASVTFMASQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQ 554
             +P   H     +   F    A+ +L  ++    +    +    Q +     +VM +Q 
Sbjct: 481 HRVPYGFHGNHLGSGPYFSCLGASSNLCDYLSSRIV--IMLDDEKQTTMPQRERVMGEQP 540

Query: 555 GVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISK 614
           G         SK +K IPPPSPESPW LSPL+TPSP LL+HCIASLHRGEGNIYSIAIS 
Sbjct: 541 G---------SKAMKPIPPPSPESPWMLSPLRTPSPLLLHHCIASLHRGEGNIYSIAISN 600

Query: 615 GVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFAS-------------- 674
           GVVFTGS++ R+RAWKQPDCM+RGYLKAAAGGV AI+A G  +F S              
Sbjct: 601 GVVFTGSDSCRVRAWKQPDCMDRGYLKAAAGGVWAIAAYGNMLFTSHRDSRIRVWNFTAS 660

Query: 675 ----EKKLIS-------VVLSNGNRPPPQR-FISCMAYYHAQDLLYTGSHDKTIKAWRVS 734
                KKL S       ++ S        +  ISCMAYYHAQDLLYT SHDKT+KAWR+S
Sbjct: 661 HNLTSKKLCSLPKRTSFLLFSKTETTHLHKDTISCMAYYHAQDLLYTASHDKTVKAWRIS 720

Query: 735 DRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSP 794
           DRKCVDSF+AHED VNSI VNQNDGCLFTCSSDGTVKIWR +YRENSHTLTMTLKFQTSP
Sbjct: 721 DRKCVDSFLAHEDKVNSITVNQNDGCLFTCSSDGTVKIWRCLYRENSHTLTMTLKFQTSP 780

Query: 795 VNAVALCSHYD-TTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLI 854
           VNAV L SH D  TFLYSGSSDGTINFWEKE++SYRYNH GFLQGHRF VLCL VVERL+
Sbjct: 781 VNAVVLSSHSDGETFLYSGSSDGTINFWEKERVSYRYNHGGFLQGHRFGVLCLAVVERLV 840

Query: 855 FSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTSFLVYSASLDQT 892
            SGSEDTTVRIWRKEEG  YHECLAVLDGHRGPVRCLA SLE+E +  SFLVYS SLDQ+
Sbjct: 841 LSGSEDTTVRIWRKEEGGCYHECLAVLDGHRGPVRCLAGSLEMENMEMSFLVYSGSLDQS 900

BLAST of Sgr027961 vs. NCBI nr
Match: XP_022139412.1 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Momordica charantia])

HSP 1 Score: 729.9 bits (1883), Expect = 3.8e-206
Identity = 356/453 (78.59%), Postives = 390/453 (86.09%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPA 60
           +FGRS+Y WVEGEGMLHALYFNKD      WNL YNNRYVQT+TF+LEK V+N P FLPA
Sbjct: 118 IFGRSNYTWVEGEGMLHALYFNKDHTKLRKWNLFYNNRYVQTQTFQLEKHVKNGPYFLPA 177

Query: 61  VEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGD 120
           +EGDSLAVLSAFFLN+LRFG V KD+SNTNV+EHSGKFYS+A+N +PQ+ DI +L++LG 
Sbjct: 178 IEGDSLAVLSAFFLNMLRFGTVNKDMSNTNVIEHSGKFYSVADNSLPQQIDIMSLQSLGV 237

Query: 121 WDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSS 180
           WDVNGAWNRPFTSHPKKAP TGELV++G T  KPFM    +SE    MVHKVDVKLSR S
Sbjct: 238 WDVNGAWNRPFTSHPKKAPGTGELVIMGVTPAKPFMLLGIISEDGKEMVHKVDVKLSRGS 297

Query: 181 LSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQV 240
            SHEIGVT+RYNVILDYPVT+D NRLI GG LIKYDKEGYARIGVMPRYGD DSIQWFQV
Sbjct: 298 FSHEIGVTRRYNVILDYPVTIDFNRLITGGSLIKYDKEGYARIGVMPRYGDGDSIQWFQV 357

Query: 241 KPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT 300
           KPNCTMHLFN FE  DEV+VWGCRASDSVIPGPEKG NKFEWFS RFKPLP  D +  ++
Sbjct: 358 KPNCTMHLFNSFEQNDEVVVWGCRASDSVIPGPEKGHNKFEWFSNRFKPLPIADEQNSDS 417

Query: 301 -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVL 360
             SS+EDGSLFSRAYEWRLNLKTGEARER L+GTQFSMDFPFINSRFTGL+NKFGYAQ+L
Sbjct: 418 CSSSMEDGSLFSRAYEWRLNLKTGEARERFLSGTQFSMDFPFINSRFTGLQNKFGYAQIL 477

Query: 361 DSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASF 420
           DSFASSNSGMFKFGGLAKLHFEEP QS  FSLPKK   +  IKVEYHMFEN+SFCSGASF
Sbjct: 478 DSFASSNSGMFKFGGLAKLHFEEPDQSCGFSLPKKWGNE-AIKVEYHMFENSSFCSGASF 537

Query: 421 VSREGSPEEDDGWIIAHVHNEITNTSQLAMATA 446
           V RE   EEDDGWIIAH+HNEITNTSQ+ +  A
Sbjct: 538 VPREDGAEEDDGWIIAHLHNEITNTSQVYIIDA 569

BLAST of Sgr027961 vs. NCBI nr
Match: XP_022980961.1 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 713.0 bits (1839), Expect = 4.8e-201
Identity = 345/449 (76.84%), Postives = 391/449 (87.08%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVE 60
           MFGRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQTE+FR EK V+NRPSFLPAVE
Sbjct: 135 MFGRSSFIWVEGEGMLHAMYFKKNMDGKWHLSYNNRYVQTESFRFEKHVKNRPSFLPAVE 194

Query: 61  GDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWD 120
           GD++A+++AF LN LRFG+V KD+SNTNV+EHSGKFYS+AEN MPQ+ DI +L++LG WD
Sbjct: 195 GDAVAIVAAFLLNWLRFGEVNKDMSNTNVIEHSGKFYSVAENSMPQQIDIMSLQSLGVWD 254

Query: 121 VNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLS 180
           V  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+
Sbjct: 255 VCRAWNRPFTSHPKKAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLT 314

Query: 181 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKP 240
           HEIGVT+R+NVILDYPVT+DLNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKP
Sbjct: 315 HEIGVTRRFNVILDYPVTIDLNRLIRGGSLIKFEKEGYAKIGVMPRYGDADSIQWFEVKP 374

Query: 241 NCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS 300
           NCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Sbjct: 375 NCTMHLFNCFEHQNEVVVWGCRATDSIIPGPEKGLNKFEWFSQRFKPVHVTD--EQNT-- 434

Query: 301 SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSF 360
             EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS 
Sbjct: 435 --EDGSLLSCAYEWRLNLKTGEARERFLTGNQFSMDFPFINSAFTGLKNKYGYAQVLDSV 494

Query: 361 ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSRE 420
           ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKVEYHM ENNSFC+GASFV RE
Sbjct: 495 ASSNSGIFKFGGLAKLHFEEPQSGEVSLLKDSEEEP-IKVEYHMLENNSFCTGASFVPRE 554

Query: 421 GSPEEDDGWIIAHVHNEITNTSQLAMATA 446
            S +EDDGWIIAHVHNEIT TSQ+ +  A
Sbjct: 555 DSLQEDDGWIIAHVHNEITKTSQVYIVDA 576

BLAST of Sgr027961 vs. NCBI nr
Match: XP_023523500.1 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 713.0 bits (1839), Expect = 4.8e-201
Identity = 344/449 (76.61%), Postives = 390/449 (86.86%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVE 60
           MFGRSS+IWVEGEGMLHA+YF K   G W+LLYNNRYVQTE+FR EK V+NRPSFLPAVE
Sbjct: 98  MFGRSSFIWVEGEGMLHAIYFKKHINGEWHLLYNNRYVQTESFRFEKHVKNRPSFLPAVE 157

Query: 61  GDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWD 120
           GD++AV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+AEN MPQE DI +L++LG WD
Sbjct: 158 GDAVAVVAAFVLNWLRFGEVNKDMSNTNVIEHSGRFYSVAENSMPQEIDIMSLQSLGVWD 217

Query: 121 VNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLS 180
           V  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+
Sbjct: 218 VCRAWNRPFTSHPKKAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLT 277

Query: 181 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKP 240
           HEIG+T+R+NVILDYPVT+DLNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKP
Sbjct: 278 HEIGITRRFNVILDYPVTIDLNRLIRGGSLIKFEKEGYAKIGVMPRYGDADSIQWFEVKP 337

Query: 241 NCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS 300
           NCTMH+FNCFE + EV+VWGCRA+DS IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Sbjct: 338 NCTMHVFNCFEHEHEVVVWGCRATDSFIPGPEKGLNKFEWFSQRFKPVCVTD--EQNT-- 397

Query: 301 SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSF 360
             EDGSL SRAY+WRLNLKTGE RER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS 
Sbjct: 398 --EDGSLLSRAYQWRLNLKTGEVRERFLTGNQFSMDFPFINSAFTGLKNKYGYAQVLDSV 457

Query: 361 ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSRE 420
           ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKVEYHM ENNSFC+GASFV RE
Sbjct: 458 ASSNSGIFKFGGLAKLHFEEPQSGEVSLLKDSEEEP-IKVEYHMLENNSFCTGASFVPRE 517

Query: 421 GSPEEDDGWIIAHVHNEITNTSQLAMATA 446
            S +EDDGW+IAHVHNEITNTSQ+ +  A
Sbjct: 518 DSLQEDDGWVIAHVHNEITNTSQVYIVDA 539

BLAST of Sgr027961 vs. NCBI nr
Match: XP_022940198.1 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita moschata] >XP_022940199.1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita moschata])

HSP 1 Score: 711.4 bits (1835), Expect = 1.4e-200
Identity = 342/449 (76.17%), Postives = 390/449 (86.86%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVE 60
           M GRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVE
Sbjct: 128 MLGRSSFIWVEGEGMLHAIYFKKNINGEWHLYYNNRYVQTDSFQFEKHVKNRPSFLPAVE 187

Query: 61  GDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWD 120
           GD+LAV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+AEN MPQE DI +L++LG WD
Sbjct: 188 GDALAVVAAFVLNWLRFGEVNKDMSNTNVIEHSGRFYSVAENSMPQEIDIMSLQSLGVWD 247

Query: 121 VNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLS 180
           V  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+
Sbjct: 248 VCRAWNRPFTSHPKKAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLT 307

Query: 181 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKP 240
           HEIGVT+R+NVILDYPVT+DLNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKP
Sbjct: 308 HEIGVTRRFNVILDYPVTIDLNRLIRGGSLIKFEKEGYAKIGVMPRYGDADSIQWFEVKP 367

Query: 241 NCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS 300
           NCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD      + 
Sbjct: 368 NCTMHLFNCFEHQNEVVVWGCRATDSIIPGPEKGLNKFEWFSQRFKPVHVTD------EQ 427

Query: 301 SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSF 360
           + EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKN++GYAQVLDS 
Sbjct: 428 NAEDGSLLSCAYEWRLNLKTGEARERFLTGNQFSMDFPFINSAFTGLKNRYGYAQVLDSV 487

Query: 361 ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSRE 420
           ASSNSGMFKFGGLAKLHFEEPQS E SL K  EE+  IKVEYHM ENNSFC+GASFV RE
Sbjct: 488 ASSNSGMFKFGGLAKLHFEEPQSGEVSLLKGSEEEP-IKVEYHMLENNSFCTGASFVPRE 547

Query: 421 GSPEEDDGWIIAHVHNEITNTSQLAMATA 446
            S +EDDGWIIAHVHNEITNTSQ+ +  A
Sbjct: 548 DSLQEDDGWIIAHVHNEITNTSQVYIVDA 569

BLAST of Sgr027961 vs. ExPASy Swiss-Prot
Match: O48716 (Protein JINGUBANG OS=Arabidopsis thaliana OX=3702 GN=JGB PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 1.7e-52
Identity = 136/368 (36.96%), Postives = 200/368 (54.35%), Query Frame = 0

Query: 533 LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRA 592
           ++ P +  +P+T +   T    L  +  I SL R EG+IYS+A +K +++TGS++  IR 
Sbjct: 61  MMSPWNQATPFTQTQWSTVEENLPQNGLIGSLVREEGHIYSLAATKDLLYTGSDSKNIRV 120

Query: 593 WKQPDCMERGYLKAAAGGVKAISASG-------------VFASEKKLISVVLSNGNRP-- 652
           WK  +  E    K  +G VKAI  SG             V+    K  S+   +G  P  
Sbjct: 121 WK--NLKEFSAFKCNSGLVKAIVISGEKIFTGHQDGKIRVWKVSPKNQSLHKRSGTLPTL 180

Query: 653 --------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDR 712
                    P+ +                 +SC++    Q LLY+ S D+TIK WR++D 
Sbjct: 181 KDIFKASLKPRNYVEVKKHRTALWIKHADAVSCLSLNDEQGLLYSASWDRTIKVWRIADS 240

Query: 713 KCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSP 772
           KC++S  AH+D VNS+ V+  +  +F+ S+DGTVK W+R    +   HTL  TL  Q S 
Sbjct: 241 KCLESIPAHDDAVNSV-VSTTEAIVFSGSADGTVKAWKRDQQGKYTKHTLMQTLTKQESA 300

Query: 773 VNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF 832
           V A+A+    +   +Y GSSDG +NFWE+EK   + N+ G L+GH+ AVLCL V   L+F
Sbjct: 301 VTALAVSK--NGAAVYFGSSDGLVNFWEREK---QLNYGGILKGHKLAVLCLEVAGSLVF 360

Query: 833 SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLD 855
           SGS D T+ +W+++     H CL+VL GH GPV+CLA   +    E+    ++VYS SLD
Sbjct: 361 SGSADKTICVWKRD--GNIHTCLSVLTGHTGPVKCLAVEADREASERRDKKWIVYSGSLD 418

BLAST of Sgr027961 vs. ExPASy Swiss-Prot
Match: Q94IR2 (Carotenoid 9,10(9',10')-cleavage dioxygenase 1 OS=Phaseolus vulgaris OX=3885 GN=CCD1 PE=1 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 1.4e-38
Identity = 131/446 (29.37%), Postives = 208/446 (46.64%), Query Frame = 0

Query: 5   SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSL 64
           + Y W +G+GM+H L   KD + T    Y +R+V+T   + E+    R  F+   +   L
Sbjct: 94  AGYHWFDGDGMIHGLRI-KDGKAT----YVSRFVETSRLKQEEYF-GRSKFMKIGDLKGL 153

Query: 65  AVLSAFFLNLLRFGKVYKDIS------NTNVLEHSGKFYSLAENFMP---QEFDITTLET 124
             L    +++LR      D+S      NT ++ H GK  +L+E   P   + F+   L+T
Sbjct: 154 FGLLMVNIHMLRTKLKVLDLSYGGGTTNTALVYHHGKLLALSEADKPYAIKVFEDGDLQT 213

Query: 125 LGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKVDVKLSR 184
           LG  D +      FT+HPK  P TGE+   G   T P++      +   M   V + +S 
Sbjct: 214 LGMLDYDKRLGHSFTAHPKVDPFTGEMFSFGYAHTPPYITYRVISKDGYMHDPVPITISD 273

Query: 185 SSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG-DADSIQ 244
             + H+  +T+ Y V +D P+      +++   LI  +D    AR GV+PRY  D   I+
Sbjct: 274 PIMMHDFAITENYAVFMDLPLIFRPKEMVKNKTLIFSFDSTKKARFGVLPRYAKDEQHIR 333

Query: 245 WFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDH 304
           WF++ PNC   H  N +E++DEV++  CR  +                       P  D+
Sbjct: 334 WFEL-PNCFIFHNANAWEEEDEVVLITCRLQN-----------------------PKLDN 393

Query: 305 EEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGY 364
                   +E+ S  +  YE R N+KTGEA ++ L+ +  ++DFP +N  +TG K ++ Y
Sbjct: 394 VGGTVQEKLENFS--NELYEMRFNMKTGEASQKKLSAS--TVDFPRVNENYTGRKQRYVY 453

Query: 365 AQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSG 424
              LDS A   +G+ KF     LH E     E     K E    ++  Y +     F S 
Sbjct: 454 GTTLDSIAKV-TGIIKF----DLHAEPDHGKE-----KLEVGGNVQGLYDL-GPGKFGSE 494

Query: 425 ASFVSREG--SPEEDDGWIIAHVHNE 434
           A ++ R      EEDDG+++  VH+E
Sbjct: 514 AVYIPRVPGIESEEDDGYLVLFVHDE 494

BLAST of Sgr027961 vs. ExPASy Swiss-Prot
Match: O65572 (Carotenoid 9,10(9',10')-cleavage dioxygenase 1 OS=Arabidopsis thaliana OX=3702 GN=CCD1 PE=1 SV=2)

HSP 1 Score: 157.1 bits (396), Expect = 1.3e-36
Identity = 133/468 (28.42%), Postives = 213/468 (45.51%), Query Frame = 0

Query: 5   SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSL 64
           + Y W +G+GM+H +   KD + T    Y +RYV+T   + E+       F  A +   +
Sbjct: 92  AGYHWFDGDGMIHGVRI-KDGKAT----YVSRYVKTSRLKQEE-------FFGAAKFMKI 151

Query: 65  AVLSAFF----LNLLRFGKVYKDI--------SNTNVLEHSGKFYSLAENFMPQEFDIT- 124
             L  FF    +N+ +     K +        +NT ++ H GK  +L E   P    +  
Sbjct: 152 GDLKGFFGLLMVNVQQLRTKLKILDNTYGNGTANTALVYHHGKLLALQEADKPYVIKVLE 211

Query: 125 --TLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKV 184
              L+TLG  D +      FT+HPK  P TGE+   G + T P++      +   M   V
Sbjct: 212 DGDLQTLGIIDYDKRLTHSFTAHPKVDPVTGEMFTFGYSHTPPYLTYRVISKDGIMHDPV 271

Query: 185 DVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG- 244
            + +S   + H+  +T+ Y + +D P+      +++   +I  +D    AR GV+PRY  
Sbjct: 272 PITISEPIMMHDFAITETYAIFMDLPMHFRPKEMVKEKKMIYSFDPTKKARFGVLPRYAK 331

Query: 245 DADSIQWFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKP 304
           D   I+WF++ PNC   H  N +E++DEV++  CR  +                      
Sbjct: 332 DELMIRWFEL-PNCFIFHNANAWEEEDEVVLITCRLEN---------------------- 391

Query: 305 LPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGL 364
            P  D         +E  +  +  YE R N+KTG A ++ L+ +  ++DFP IN  +TG 
Sbjct: 392 -PDLDMVSGKVKEKLE--NFGNELYEMRFNMKTGSASQKKLSAS--AVDFPRINECYTGK 451

Query: 365 KNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFEN 424
           K ++ Y  +LDS A   +G+ KF     LH E          +  E    IK  Y + E 
Sbjct: 452 KQRYVYGTILDSIAKV-TGIIKF----DLHAEAETG-----KRMLEVGGNIKGIYDLGEG 507

Query: 425 NSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQ 452
             + S A +V RE + EEDDG++I  VH+E T  S + +  A   S +
Sbjct: 512 R-YGSEAIYVPRE-TAEEDDGYLIFFVHDENTGKSCVTVIDAKTMSAE 507

BLAST of Sgr027961 vs. ExPASy Swiss-Prot
Match: Q9LRR7 (9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=NCED3 PE=1 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 2.6e-32
Identity = 121/456 (26.54%), Postives = 203/456 (44.52%), Query Frame = 0

Query: 11  EGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV-EGDSLAVLSA 70
           +G+GM+HA+ F   S       Y  R+ QT  F  E+ +  RP F  A+ E      ++ 
Sbjct: 172 DGDGMVHAVKFEHGSAS-----YACRFTQTNRFVQERQL-GRPVFPKAIGELHGHTGIAR 231

Query: 71  FFLNLLRFGKVYKD------ISNTNVLEHSGKFYSLAENFMPQEFDIT---TLETLGDWD 130
             L   R      D      ++N  ++  +G+  +++E+ +P +  IT    L+T+G +D
Sbjct: 232 LMLFYARAAAGIVDPAHGTGVANAGLVYFNGRLLAMSEDDLPYQVQITPNGDLKTVGRFD 291

Query: 131 VNGAWNRPFTSHPKKAPETGELVVLG-ATATKPFMVSESFLRMVHK---VDVKLSRSSLS 190
            +G       +HPK  PE+GEL  L     +KP++    F     K   V+++L + ++ 
Sbjct: 292 FDGQLESTMIAHPKVDPESGELFALSYDVVSKPYLKYFRFSPDGTKSPDVEIQLDQPTMM 351

Query: 191 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYG-DADSIQWFQVK 250
           H+  +T+ + V+ D  V   L  +IRGG  + YDK   AR G++ +Y  D+ +I+W    
Sbjct: 352 HDFAITENFVVVPDQQVVFKLPEMIRGGSPVVYDKNKVARFGILDKYAEDSSNIKWIDAP 411

Query: 251 PNCTMHLFNCFE--DKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHN 310
                HL+N +E  + DEV+V G     S +  P+   N+                    
Sbjct: 412 DCFCFHLWNAWEEPETDEVVVIG-----SCMTPPDSIFNE-------------------- 471

Query: 311 TDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGT---QFSMDFPFINSRFTGLKNKFGYA 370
                 D +L S   E RLNLKTGE+  R +      Q +++   +N    G K KF Y 
Sbjct: 472 -----SDENLKSVLSEIRLNLKTGESTRRPIISNEDQQVNLEAGMVNRNMLGRKTKFAYL 531

Query: 371 QVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGA 430
            + + +        K  G AK+     +                 V+ H++ +N +    
Sbjct: 532 ALAEPWP-------KVSGFAKVDLTTGE-----------------VKKHLYGDNRYGGEP 566

Query: 431 SFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAM 447
            F+  EG  EED+G+I+  VH+E T  S+L +  A+
Sbjct: 592 LFLPGEGG-EEDEGYILCFVHDEKTWKSELQIVNAV 566

BLAST of Sgr027961 vs. ExPASy Swiss-Prot
Match: Q69NX5 (9-cis-epoxycarotenoid dioxygenase NCED4, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=NCED4 PE=2 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 5.4e-30
Identity = 118/464 (25.43%), Postives = 203/464 (43.75%), Query Frame = 0

Query: 4   RSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV---E 63
           R+ +   +G+GMLHA+       G     Y  R+ +T   R E+ +  RP F  A+    
Sbjct: 138 RAGHHLFDGDGMLHAVRL----AGGRAESYACRFTETARLRQEREM-GRPVFPKAIGELH 197

Query: 64  GDSLAVLSAFFLNLLRFGKVYKD----ISNTNVLEHSGKFYSLAENFMPQEFDIT---TL 123
           G S       F +    G +       ++N  ++ H G+  +++E+ +P    +T    L
Sbjct: 198 GHSGVARLLLFGSRALCGVLDASRGIGVANAGLVYHDGRLLAMSEDDLPYHVRVTHDGDL 257

Query: 124 ETLGDWDVNGAWNRPFT--SHPKKAPETGELVVLG-ATATKPFMVSESFL---RMVHKVD 183
           ET+G +D +G  +   T  +HPK  P TGEL  L     +KP++    F    R    VD
Sbjct: 258 ETVGRYDFHGQLDADGTMIAHPKLDPVTGELFALSYNVVSKPYLKYFYFTADGRKSRDVD 317

Query: 184 VKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMP-RYGDA 243
           + +   ++ H+  VT+ Y V+ D  +   L  ++RGG  + YD+E  +R GV+P R  DA
Sbjct: 318 IPVGAPTMIHDFAVTENYAVVPDQQIVFKLQEMVRGGSPVVYDREKASRFGVLPKRAADA 377

Query: 244 DSIQWFQVKPNCTMHLFNCFED--KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPL 303
             ++W +V      HL+N +ED    E++V G     S +  P+   N+           
Sbjct: 378 SELRWVEVPGCFCFHLWNAWEDDATGEIVVIG-----SCMTPPDAVFNE----------- 437

Query: 304 PATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSL---TGTQFSMDFPFINSRFT 363
           P+   EE +  S +          E RL+ +TG +R R +      Q +++   +N +  
Sbjct: 438 PSQSPEEESFRSVLS---------EIRLDPRTGVSRRRDVLRDAAEQVNLEAGMVNRQLL 497

Query: 364 GLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMF 423
           G K ++ Y  + + +        +  G AK+  E   + +F                 ++
Sbjct: 498 GRKTRYAYLAIAEPWP-------RVSGFAKVDLESGTAEKF-----------------IY 547

Query: 424 ENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA 446
               +     FV R G+  EDDG ++  VH+E   TS+L +  A
Sbjct: 558 GEGRYGGEPCFVPRAGAAAEDDGHVLCFVHDEERGTSELVVVDA 547

BLAST of Sgr027961 vs. ExPASy TrEMBL
Match: A0A803NFI9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 771.5 bits (1991), Expect = 5.5e-219
Identity = 402/774 (51.94%), Postives = 511/774 (66.02%), Query Frame = 0

Query: 104 MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESF 163
           MPQE DI TL+TL  W ++  WNRPFTSHPK+   +GELV +G   TKP+    ++S   
Sbjct: 1   MPQEIDIFTLQTLAHWHLSPIWNRPFTSHPKRVACSGELVTIGIAPTKPYVEIGVISADG 60

Query: 164 LRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGV 223
            +++H+ D+KL R  + H+IG+T+RYN+I+D P+T+D+ RL+RGGPLIKYDK+ YARIGV
Sbjct: 61  KKLIHRADLKLKRCPICHDIGITKRYNLIMDIPLTIDIMRLVRGGPLIKYDKDEYARIGV 120

Query: 224 MPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFS 283
           MPRYGDA SI+WF+V+PNCTMH FNCFED  DE++VWGCRA DS+I              
Sbjct: 121 MPRYGDAYSIKWFEVEPNCTMHFFNCFEDGDDEIVVWGCRALDSII-------------- 180

Query: 284 KRFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFIN 343
                  ++  E+ +T ++  + S     YEWRLN+K G  +E+ LT  +++SMD+P IN
Sbjct: 181 -------SSTQEQIDTPTTNANNSYHDSPYEWRLNMKNGIIKEKKLTLDSEYSMDYPTIN 240

Query: 344 SRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV 403
             + GLKNKFGYAQV+D   A++++ + K+ G+AKLHFEE ++   S     E + ++K+
Sbjct: 241 ENYIGLKNKFGYAQVVDPISATTSNNLLKYCGIAKLHFEELETRICS--TSGESKEMVKI 300

Query: 404 EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFL 463
           EYHMFE N+FCSGA+FVS +     +EDDGWII  VHNE TN SQ+ +  +   S +   
Sbjct: 301 EYHMFEKNTFCSGATFVSNDKKECVDEDDGWIITFVHNEDTNISQVLLIDSKNFSSEPVA 360

Query: 464 NYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMS 523
                  +P   H    S+   +S                     SP Q  ++  P    
Sbjct: 361 KITLPYRVPYGFHGTFVSIPLASS---------------------SPNQAHNQARP---- 420

Query: 524 DQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIA 583
                             LIPPPSPESPWT SPLQTPSP LLYHCIASLHR +G IYS+A
Sbjct: 421 -----------------SLIPPPSPESPWTCSPLQTPSPSLLYHCIASLHRHDGTIYSVA 480

Query: 584 ISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFASEKKL------I 643
           +S GVVFTGS+++RIRAW+ PDC+ERG LK+ +G V+AI A G  +F S K L       
Sbjct: 481 VSNGVVFTGSDSTRIRAWRPPDCVERGLLKSTSGEVRAILAYGDMLFTSHKDLKIRTWNF 540

Query: 644 SVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH 703
           +VV SN       + P  +R                      +SC+AYYHA+ +LYTGSH
Sbjct: 541 AVVSSNFLFKKLSSIPRKRRSSLSFQLFSSKPKLLTERHKDCVSCLAYYHAEGILYTGSH 600

Query: 704 DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTL 763
           D+T+KAWRVS RKCVDSFVAHEDNVN I VNQ DGC+FTCSSDG+VKIWRR+YRENSHTL
Sbjct: 601 DRTVKAWRVSTRKCVDSFVAHEDNVNGILVNQEDGCVFTCSSDGSVKIWRRIYRENSHTL 660

Query: 764 TMTLKFQTSPVNAVALCSHYD----------TTFLYSGSSDGTINFWEKEKLSYRYNHRG 823
           TM L+FQ SPVNA+AL S             T FLYSGSSDGTINFWEKE++SYR+NH G
Sbjct: 661 TMILRFQPSPVNAIALSSSSSSGSDGGGGGGTGFLYSGSSDGTINFWEKERMSYRFNHGG 709

BLAST of Sgr027961 vs. ExPASy TrEMBL
Match: A0A6J1CDW2 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like OS=Momordica charantia OX=3673 GN=LOC111010347 PE=3 SV=1)

HSP 1 Score: 729.9 bits (1883), Expect = 1.8e-206
Identity = 356/453 (78.59%), Postives = 390/453 (86.09%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPA 60
           +FGRS+Y WVEGEGMLHALYFNKD      WNL YNNRYVQT+TF+LEK V+N P FLPA
Sbjct: 118 IFGRSNYTWVEGEGMLHALYFNKDHTKLRKWNLFYNNRYVQTQTFQLEKHVKNGPYFLPA 177

Query: 61  VEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGD 120
           +EGDSLAVLSAFFLN+LRFG V KD+SNTNV+EHSGKFYS+A+N +PQ+ DI +L++LG 
Sbjct: 178 IEGDSLAVLSAFFLNMLRFGTVNKDMSNTNVIEHSGKFYSVADNSLPQQIDIMSLQSLGV 237

Query: 121 WDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSS 180
           WDVNGAWNRPFTSHPKKAP TGELV++G T  KPFM    +SE    MVHKVDVKLSR S
Sbjct: 238 WDVNGAWNRPFTSHPKKAPGTGELVIMGVTPAKPFMLLGIISEDGKEMVHKVDVKLSRGS 297

Query: 181 LSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQV 240
            SHEIGVT+RYNVILDYPVT+D NRLI GG LIKYDKEGYARIGVMPRYGD DSIQWFQV
Sbjct: 298 FSHEIGVTRRYNVILDYPVTIDFNRLITGGSLIKYDKEGYARIGVMPRYGDGDSIQWFQV 357

Query: 241 KPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT 300
           KPNCTMHLFN FE  DEV+VWGCRASDSVIPGPEKG NKFEWFS RFKPLP  D +  ++
Sbjct: 358 KPNCTMHLFNSFEQNDEVVVWGCRASDSVIPGPEKGHNKFEWFSNRFKPLPIADEQNSDS 417

Query: 301 -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVL 360
             SS+EDGSLFSRAYEWRLNLKTGEARER L+GTQFSMDFPFINSRFTGL+NKFGYAQ+L
Sbjct: 418 CSSSMEDGSLFSRAYEWRLNLKTGEARERFLSGTQFSMDFPFINSRFTGLQNKFGYAQIL 477

Query: 361 DSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASF 420
           DSFASSNSGMFKFGGLAKLHFEEP QS  FSLPKK   +  IKVEYHMFEN+SFCSGASF
Sbjct: 478 DSFASSNSGMFKFGGLAKLHFEEPDQSCGFSLPKKWGNE-AIKVEYHMFENSSFCSGASF 537

Query: 421 VSREGSPEEDDGWIIAHVHNEITNTSQLAMATA 446
           V RE   EEDDGWIIAH+HNEITNTSQ+ +  A
Sbjct: 538 VPREDGAEEDDGWIIAHLHNEITNTSQVYIIDA 569

BLAST of Sgr027961 vs. ExPASy TrEMBL
Match: A0A6J1IV34 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480255 PE=3 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 2.3e-201
Identity = 345/449 (76.84%), Postives = 391/449 (87.08%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVE 60
           MFGRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQTE+FR EK V+NRPSFLPAVE
Sbjct: 135 MFGRSSFIWVEGEGMLHAMYFKKNMDGKWHLSYNNRYVQTESFRFEKHVKNRPSFLPAVE 194

Query: 61  GDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWD 120
           GD++A+++AF LN LRFG+V KD+SNTNV+EHSGKFYS+AEN MPQ+ DI +L++LG WD
Sbjct: 195 GDAVAIVAAFLLNWLRFGEVNKDMSNTNVIEHSGKFYSVAENSMPQQIDIMSLQSLGVWD 254

Query: 121 VNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLS 180
           V  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+
Sbjct: 255 VCRAWNRPFTSHPKKAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLT 314

Query: 181 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKP 240
           HEIGVT+R+NVILDYPVT+DLNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKP
Sbjct: 315 HEIGVTRRFNVILDYPVTIDLNRLIRGGSLIKFEKEGYAKIGVMPRYGDADSIQWFEVKP 374

Query: 241 NCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS 300
           NCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Sbjct: 375 NCTMHLFNCFEHQNEVVVWGCRATDSIIPGPEKGLNKFEWFSQRFKPVHVTD--EQNT-- 434

Query: 301 SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSF 360
             EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS 
Sbjct: 435 --EDGSLLSCAYEWRLNLKTGEARERFLTGNQFSMDFPFINSAFTGLKNKYGYAQVLDSV 494

Query: 361 ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSRE 420
           ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKVEYHM ENNSFC+GASFV RE
Sbjct: 495 ASSNSGIFKFGGLAKLHFEEPQSGEVSLLKDSEEEP-IKVEYHMLENNSFCTGASFVPRE 554

Query: 421 GSPEEDDGWIIAHVHNEITNTSQLAMATA 446
            S +EDDGWIIAHVHNEIT TSQ+ +  A
Sbjct: 555 DSLQEDDGWIIAHVHNEITKTSQVYIVDA 576

BLAST of Sgr027961 vs. ExPASy TrEMBL
Match: A0A6J1FNM1 (carotenoid 9,10(9',10')-cleavage dioxygenase 1-like OS=Cucurbita moschata OX=3662 GN=LOC111445898 PE=3 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 6.8e-201
Identity = 342/449 (76.17%), Postives = 390/449 (86.86%), Query Frame = 0

Query: 1   MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVE 60
           M GRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVE
Sbjct: 128 MLGRSSFIWVEGEGMLHAIYFKKNINGEWHLYYNNRYVQTDSFQFEKHVKNRPSFLPAVE 187

Query: 61  GDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLETLGDWD 120
           GD+LAV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+AEN MPQE DI +L++LG WD
Sbjct: 188 GDALAVVAAFVLNWLRFGEVNKDMSNTNVIEHSGRFYSVAENSMPQEIDIMSLQSLGVWD 247

Query: 121 VNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLS 180
           V  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+
Sbjct: 248 VCRAWNRPFTSHPKKAPDTGELVIMGIAPTKPYMEIGIISEDGKRMVHKVDVKLSRSSLT 307

Query: 181 HEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKP 240
           HEIGVT+R+NVILDYPVT+DLNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKP
Sbjct: 308 HEIGVTRRFNVILDYPVTIDLNRLIRGGSLIKFEKEGYAKIGVMPRYGDADSIQWFEVKP 367

Query: 241 NCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS 300
           NCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD      + 
Sbjct: 368 NCTMHLFNCFEHQNEVVVWGCRATDSIIPGPEKGLNKFEWFSQRFKPVHVTD------EQ 427

Query: 301 SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSF 360
           + EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKN++GYAQVLDS 
Sbjct: 428 NAEDGSLLSCAYEWRLNLKTGEARERFLTGNQFSMDFPFINSAFTGLKNRYGYAQVLDSV 487

Query: 361 ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSRE 420
           ASSNSGMFKFGGLAKLHFEEPQS E SL K  EE+  IKVEYHM ENNSFC+GASFV RE
Sbjct: 488 ASSNSGMFKFGGLAKLHFEEPQSGEVSLLKGSEEEP-IKVEYHMLENNSFCTGASFVPRE 547

Query: 421 GSPEEDDGWIIAHVHNEITNTSQLAMATA 446
            S +EDDGWIIAHVHNEITNTSQ+ +  A
Sbjct: 548 DSLQEDDGWIIAHVHNEITNTSQVYIVDA 569

BLAST of Sgr027961 vs. ExPASy TrEMBL
Match: A0A7J6FQE6 (WD_REPEATS_REGION domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_018099 PE=3 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 1.5e-187
Identity = 382/867 (44.06%), Postives = 488/867 (56.29%), Query Frame = 0

Query: 104 MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESF 163
           MPQE DI TL+TLG+W ++  WNRPFTSHPK+   +GELV +G   TKP+    ++S   
Sbjct: 1   MPQEIDIFTLQTLGNWHLSPTWNRPFTSHPKRVACSGELVTIGIAPTKPYVEIGVISADG 60

Query: 164 LRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGV 223
            +++H+ D+KL R  + H+IG+T+R                     LIKYDK+ YARIGV
Sbjct: 61  KKLIHRADLKLKRCPICHDIGITKR---------------------LIKYDKDEYARIGV 120

Query: 224 MPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFS 283
           MPRYGDADSI+WF+++PNCTMH FNCFED  DE++VWGCRA DS+I              
Sbjct: 121 MPRYGDADSIKWFEIEPNCTMHFFNCFEDGDDEIVVWGCRALDSII-------------- 180

Query: 284 KRFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFIN 343
                  ++  E+ +T ++  + S     YEWRLN+K G  +E+ LT  +++SMD+P IN
Sbjct: 181 -------SSTQEQIDTPTTNANNSYHDSPYEWRLNMKNGIIKEKKLTLDSEYSMDYPTIN 240

Query: 344 SRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV 403
             + GLKNKFGYAQV+D   A++++ + K+ G+AKLHFEE ++   S     E + ++K+
Sbjct: 241 ENYIGLKNKFGYAQVVDPISANTSNNLLKYCGIAKLHFEELETRICS--TSGESKEMVKI 300

Query: 404 EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFL 463
           EYHMFE N+FCSGA+FVS +     +EDDGWII  VHNE TN SQ+ +            
Sbjct: 301 EYHMFEKNTFCSGATFVSNDKKECVDEDDGWIITFVHNEDTNISQVLL------------ 360

Query: 464 NYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMS 523
                                                            + S+       
Sbjct: 361 -------------------------------------------------IDSKN------ 420

Query: 524 DQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIA 583
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 584 ISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFASEKKL------I 643
                 F+ S+++RIRAW+ PDC+ERG LK+ +G V+AI A G  +F S K L       
Sbjct: 481 ------FSSSDSTRIRAWRPPDCVERGLLKSTSGEVRAILAYGDMLFTSHKDLKIRTWNF 540

Query: 644 SVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH 703
           +VV SN       + P  +R                      +SC+AYYHA+ +LYTGSH
Sbjct: 541 AVVSSNFLFKKLSSIPRKRRSSLSFQLFSSKPKLLTERHKDCVSCLAYYHAEGILYTGSH 600

Query: 704 DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTL 763
           D+T+KAWRVS RKCVDSFVAHEDNVN I VNQ DGC+FTCSSDG+VKIWRR+YRENSHTL
Sbjct: 601 DRTVKAWRVSTRKCVDSFVAHEDNVNGILVNQEDGCVFTCSSDGSVKIWRRIYRENSHTL 660

Query: 764 TMTLKFQTSPVNAVALCSHYD---------TTFLYSGSSDGTINFWEKEKLSYRYNHRGF 823
           TM L+FQ SPVNA+AL S            T FLYSGSSDGTINFWEKE++SYR+NH GF
Sbjct: 661 TMILRFQPSPVNAIALSSSSSSGSDAGGGGTGFLYSGSSDGTINFWEKERMSYRFNHGGF 690

Query: 824 LQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE 883
           LQGHRFAVLC+V VE+++FSGSEDTT+RIWR+EEG+  HECLAVLDGHRGPVRCLAA LE
Sbjct: 721 LQGHRFAVLCVVAVEKMVFSGSEDTTIRIWRREEGNCLHECLAVLDGHRGPVRCLAACLE 690

Query: 884 VEKITTS-FLVYSASLDQTFKVWRVKVLPEEK---------RCLDYGEGG---------- 892
            E +    FL+YSASLD+TFKVWRVK+LPEE+         R +  G GG          
Sbjct: 781 RENLVMGLFLIYSASLDRTFKVWRVKILPEEEPEPETEPENRSVYCGGGGGGSEFSSELD 690

BLAST of Sgr027961 vs. TAIR 10
Match: AT3G50390.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 223.0 bits (567), Expect = 1.4e-57
Identity = 162/449 (36.08%), Postives = 230/449 (51.22%), Query Frame = 0

Query: 505 SSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLH 564
           SS  +    S Q   +   SP+HS  +K+    + E           SP +L   + SL 
Sbjct: 47  SSPLSKSPWSVQVDPSAVDSPYHS--VKVTTDTTKEKS------TNHSPNIL---LGSLV 106

Query: 565 RGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG------- 624
           R EG+IYS+A S  +++TGS++  IR WK  + +E    K+ +G VKAI  +G       
Sbjct: 107 REEGHIYSLATSGDLLYTGSDSKNIRVWK--NHVEFSSFKSNSGLVKAIVLAGDKIFTGH 166

Query: 625 ------VFASEKKLISVVLSNGNRP-----------PPQRF------------------- 684
                 V+ +  K  +V    G  P           P   F                   
Sbjct: 167 QDGKIRVWKAASKESNVHRRVGTMPNLLDYIRNSIVPSSYFNFTRRNRSSAALGFRHLDA 226

Query: 685 ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSS 744
           ISC+A    + LLY+GS DKT K WRVSD +CV+S  AHED VN++ V+  DG +FT S+
Sbjct: 227 ISCLALSEDKRLLYSGSWDKTFKVWRVSDLRCVESVNAHEDAVNAV-VSGFDGLVFTGSA 286

Query: 745 DGTVKIWRR--VYRENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKE 804
           DGTVK+WRR    ++  H  + TL  Q   V A+A+      T +Y GSSDGT+NFWE+E
Sbjct: 287 DGTVKVWRREDQAKDTKHFFSETLLKQDCAVTAIAV--DQSATLVYCGSSDGTVNFWERE 346

Query: 805 KLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHR 864
                  + G L+GH+ AVLCLV    L+FSGS D  +R+WR+ EG   H CL+VL GH 
Sbjct: 347 N---NMKNGGVLKGHKLAVLCLVAAGNLMFSGSADLGIRVWRRPEGGGEHVCLSVLTGHA 406

Query: 865 GPVRCLAASLEVEKIT--TSFLVYSASLDQTFKVWRVK------VLPEEKRCLDYGEGGE 901
           GPV+CLA   + E ++    ++VYS SLD++ K+WRV       V  E K    +G GG 
Sbjct: 407 GPVKCLAVERDQESVSGERRWIVYSGSLDRSVKMWRVSESSPPMVNQEFKLPDQFGGGGG 466

BLAST of Sgr027961 vs. TAIR 10
Match: AT2G26490.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 209.9 bits (533), Expect = 1.2e-53
Identity = 136/368 (36.96%), Postives = 200/368 (54.35%), Query Frame = 0

Query: 533 LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRA 592
           ++ P +  +P+T +   T    L  +  I SL R EG+IYS+A +K +++TGS++  IR 
Sbjct: 61  MMSPWNQATPFTQTQWSTVEENLPQNGLIGSLVREEGHIYSLAATKDLLYTGSDSKNIRV 120

Query: 593 WKQPDCMERGYLKAAAGGVKAISASG-------------VFASEKKLISVVLSNGNRP-- 652
           WK  +  E    K  +G VKAI  SG             V+    K  S+   +G  P  
Sbjct: 121 WK--NLKEFSAFKCNSGLVKAIVISGEKIFTGHQDGKIRVWKVSPKNQSLHKRSGTLPTL 180

Query: 653 --------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDR 712
                    P+ +                 +SC++    Q LLY+ S D+TIK WR++D 
Sbjct: 181 KDIFKASLKPRNYVEVKKHRTALWIKHADAVSCLSLNDEQGLLYSASWDRTIKVWRIADS 240

Query: 713 KCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSP 772
           KC++S  AH+D VNS+ V+  +  +F+ S+DGTVK W+R    +   HTL  TL  Q S 
Sbjct: 241 KCLESIPAHDDAVNSV-VSTTEAIVFSGSADGTVKAWKRDQQGKYTKHTLMQTLTKQESA 300

Query: 773 VNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF 832
           V A+A+    +   +Y GSSDG +NFWE+EK   + N+ G L+GH+ AVLCL V   L+F
Sbjct: 301 VTALAVSK--NGAAVYFGSSDGLVNFWEREK---QLNYGGILKGHKLAVLCLEVAGSLVF 360

Query: 833 SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLD 855
           SGS D T+ +W+++     H CL+VL GH GPV+CLA   +    E+    ++VYS SLD
Sbjct: 361 SGSADKTICVWKRD--GNIHTCLSVLTGHTGPVKCLAVEADREASERRDKKWIVYSGSLD 418

BLAST of Sgr027961 vs. TAIR 10
Match: AT1G49450.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 201.4 bits (511), Expect = 4.4e-51
Identity = 132/382 (34.55%), Postives = 187/382 (48.95%), Query Frame = 0

Query: 540 ESPWTLSPLQTPSPPLLYH-------------CIASLHRGEGNIYSIAISKGVVFTGSET 599
           +SPW  +       P +Y               I ++ R EG++YS+A S  ++FTGS++
Sbjct: 94  QSPWNQTYSPYHKSPWIYQTRNSDFEDDPDNGLIGTVVRQEGHVYSLAASGDLLFTGSDS 153

Query: 600 SRIRAWKQPDCMERGYLKAAAGGVKAISAS--------------GVFASEKKLISVVLSN 659
             IR WK  D  +    K+ +G VKAI  +               V+   KK        
Sbjct: 154 KNIRVWK--DLKDFSGFKSTSGFVKAIVVTRDNRVFTGHQDGKIRVWRGSKKNPEKYSRV 213

Query: 660 GNRPPPQRF---------------------------ISCMAYYHAQDLLYTGSHDKTIKA 719
           G+ P  + F                           +SC++      LLY+GS DKT+K 
Sbjct: 214 GSLPTLKEFLTKSVNPRNYVEVRRRKNVLKIRHFDAVSCLSLNEDLGLLYSGSWDKTLKV 273

Query: 720 WRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTL 779
           WR+SD KC++S  AH+D VN++ V+  D  +FT S+DGT+K+W+R    +E  H L   L
Sbjct: 274 WRLSDSKCLESIEAHDDAVNTV-VSGFDDLVFTGSADGTLKVWKREVQGKEMKHVLVQVL 333

Query: 780 KFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVV 839
             Q + V A+A+  +     +Y GSSDGT+NFWE++K      H+G + GHR AVLCL  
Sbjct: 334 MKQENAVTALAV--NLTDAVVYCGSSDGTVNFWERQKY---LTHKGTIHGHRMAVLCLAT 393

Query: 840 VERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEV-----------E 855
              L+ SG  D  + +W K  G   H CL+VL  H GPV+CLAA  E            E
Sbjct: 394 AGSLLLSGGADKNICVW-KRNGDGSHTCLSVLMDHEGPVKCLAAVEEAEEDHNDGDDGGE 453

BLAST of Sgr027961 vs. TAIR 10
Match: AT3G18950.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 200.3 bits (508), Expect = 9.8e-51
Identity = 144/419 (34.37%), Postives = 210/419 (50.12%), Query Frame = 0

Query: 502 AQVSSETTPQVMSDQQGVAP---SSSPHHSKVLKLIPPPSPESPWTLSPLQTPSP----P 561
           +Q ++ TT    S  Q ++P   + SP+++      P  SP SPW     QT SP    P
Sbjct: 61  SQSNASTTSGESSPNQVLSPWNQTYSPYYTDTTN-TPSMSP-SPWN----QTYSPYYKSP 120

Query: 562 LLYH-----------CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYL 621
            +Y             I ++ R +G++YS+A S  ++FTGS++  IR WK  D  +    
Sbjct: 121 WIYQTRNIDDDTDNGLIGTIVRQDGHVYSLAASGDLLFTGSDSKNIRVWK--DLKDHTGF 180

Query: 622 KAAAGGVKAISASG--------------VFASEKKLISVVLSNGNRPPPQRF-------- 681
           K+ +G VKAI  +G              V+   K+        G+ P  + F        
Sbjct: 181 KSTSGLVKAIVITGDNRIFTGHQDGKIRVWRGSKRRTGGYSRIGSLPTLKEFLTKSVNPK 240

Query: 682 -------------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHED 741
                              +SC++      LLY+GS DKT+K WR+SD KC++S  AH+D
Sbjct: 241 NYVEVRRRKNVLKIRHYDAVSCLSLNEELGLLYSGSWDKTLKVWRLSDSKCLESIQAHDD 300

Query: 742 NVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYD 801
            +N++A   +D  LFT S+DGT+K+W+R    +   H L   L  Q + V A+A+  +  
Sbjct: 301 AINTVAAGFDD-LLFTGSADGTLKVWKRELQGKGTKHFLVNVLMKQENAVTALAV--NIT 360

Query: 802 TTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIW 855
              +Y GSSDGT+NFWE +K     +H G L+GHR AVLCL     L+ SG  D  + +W
Sbjct: 361 AAVVYCGSSDGTVNFWEGQKY---LSHGGTLRGHRLAVLCLAAAGSLVLSGGADKNICVW 420

BLAST of Sgr027961 vs. TAIR 10
Match: AT4G34380.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 196.1 bits (497), Expect = 1.8e-49
Identity = 139/433 (32.10%), Postives = 209/433 (48.27%), Query Frame = 0

Query: 485 LSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWT 544
           +S  D +++  P    P  ++S  T Q  S       SS P   +           SP  
Sbjct: 44  ISKTDHDEIFPP--DDPIIINSNVTHQRFSSVSASTMSSGPASGE----------GSPCV 103

Query: 545 LSPLQTPSPPL-----------LYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWK 604
           +SP    SPP                I S+ R EG+IYS+A S  +++TGS++  IR WK
Sbjct: 104 MSPWARVSPPWGADFNEDNVLETNGLIGSIVRKEGHIYSLAASGDLLYTGSDSKNIRVWK 163

Query: 605 QPDCMERGYLKAAAGGVKAISASG--VFASEKKLISVVLSNGNRPP-------------- 664
             +  E    K+++G +KAI   G  +F   +     +     R P              
Sbjct: 164 --NLKEHAGFKSSSGLIKAIVIFGDRIFTGHQDGKIRIWKVSKRKPGKHKRVGTLPTFKS 223

Query: 665 -------PQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKC 724
                  P+ F                 +S ++      LLY+ S D TIK WR++D KC
Sbjct: 224 MVKSSVNPKHFMEVRRNRNSVKTKHNDAVSSLSLDVELGLLYSSSWDTTIKVWRIADSKC 283

Query: 725 VDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVN 784
           ++S  AH+D +NS+ ++  D  +FT S+DGTVK+W+R    +   HTL   L  Q + V 
Sbjct: 284 LESIHAHDDAINSV-MSGFDDLVFTGSADGTVKVWKRELQGKGTKHTLAQVLLKQENAVT 343

Query: 785 AVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSG 844
           A+A+ S   ++ +Y GSSDG +N+WE+ K S+     G L+GH+ AVLCL +   L+ SG
Sbjct: 344 ALAVKS--QSSIVYCGSSDGLVNYWERSKRSFT---GGILKGHKSAVLCLGIAGNLLLSG 403

Query: 845 SEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLA----------ASLEVEKITTSFLVY 855
           S D  + +WR++     H+CL+VL GH GPV+CLA          A   V +    +++Y
Sbjct: 404 SADKNICVWRRDPSDKSHQCLSVLTGHMGPVKCLAVEEERACHQGAKASVAEGDRKWIIY 456

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7031724.10.0e+0065.85Protein JINGUBANG, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022139412.13.8e-20678.59carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Momordica charantia][more]
XP_022980961.14.8e-20176.84carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X1 [Cucurbita maxima... [more]
XP_023523500.14.8e-20176.61carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita pepo subsp. pepo][more]
XP_022940198.11.4e-20076.17carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita moschata] >XP_022... [more]
Match NameE-valueIdentityDescription
O487161.7e-5236.96Protein JINGUBANG OS=Arabidopsis thaliana OX=3702 GN=JGB PE=1 SV=1[more]
Q94IR21.4e-3829.37Carotenoid 9,10(9',10')-cleavage dioxygenase 1 OS=Phaseolus vulgaris OX=3885 GN=... [more]
O655721.3e-3628.42Carotenoid 9,10(9',10')-cleavage dioxygenase 1 OS=Arabidopsis thaliana OX=3702 G... [more]
Q9LRR72.6e-3226.549-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic OS=Arabidopsis thaliana O... [more]
Q69NX55.4e-3025.439-cis-epoxycarotenoid dioxygenase NCED4, chloroplastic OS=Oryza sativa subsp. ja... [more]
Match NameE-valueIdentityDescription
A0A803NFI95.5e-21951.94Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A6J1CDW21.8e-20678.59carotenoid 9,10(9',10')-cleavage dioxygenase 1-like OS=Momordica charantia OX=36... [more]
A0A6J1IV342.3e-20176.84carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X1 OS=Cucurbita maxi... [more]
A0A6J1FNM16.8e-20176.17carotenoid 9,10(9',10')-cleavage dioxygenase 1-like OS=Cucurbita moschata OX=366... [more]
A0A7J6FQE61.5e-18744.06WD_REPEATS_REGION domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_... [more]
Match NameE-valueIdentityDescription
AT3G50390.11.4e-5736.08Transducin/WD40 repeat-like superfamily protein [more]
AT2G26490.11.2e-5336.96Transducin/WD40 repeat-like superfamily protein [more]
AT1G49450.14.4e-5134.55Transducin/WD40 repeat-like superfamily protein [more]
AT3G18950.19.8e-5134.37Transducin/WD40 repeat-like superfamily protein [more]
AT4G34380.11.8e-4932.10Transducin/WD40 repeat-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 556..593
e-value: 2.2
score: 14.9
coord: 622..667
e-value: 1.0
score: 17.0
coord: 716..757
e-value: 1.0E-4
score: 31.6
coord: 670..709
e-value: 1.5E-9
score: 47.7
coord: 808..853
e-value: 0.04
score: 23.0
coord: 764..801
e-value: 0.048
score: 22.8
IPR001680WD40 repeatPFAMPF00400WD40coord: 672..708
e-value: 1.0E-5
score: 26.2
coord: 770..800
e-value: 0.027
score: 15.3
coord: 721..756
e-value: 0.016
score: 16.1
coord: 810..852
e-value: 0.099
score: 13.5
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 642..676
score: 9.004632
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 771..810
score: 10.274523
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 677..708
score: 12.446706
IPR004294Carotenoid oxygenasePFAMPF03055RPE65coord: 6..443
e-value: 5.8E-69
score: 233.4
IPR004294Carotenoid oxygenasePANTHERPTHR10543BETA-CAROTENE DIOXYGENASEcoord: 1..443
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 549..710
e-value: 5.0E-21
score: 76.8
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 711..883
e-value: 1.5E-24
score: 88.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 496..524
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1019..1069
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1032..1065
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 496..528
NoneNo IPR availablePANTHERPTHR10543:SF30OS06G0162550 PROTEINcoord: 1..443
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 677..708
score: 10.363778
NoneNo IPR availableCDDcd00200WD40coord: 564..853
e-value: 3.25878E-44
score: 160.578
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 558..855

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr027961.1Sgr027961.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen
molecular_function GO:0005515 protein binding