Sgr015962 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015962
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Cytochrome P450 704C1-like isoform X1
Location: tig00006406: 156059 .. 181848 (-)
RNA-Seq Expression: Sgr015962
Synteny: Sgr015962
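
The Location field gives the contig, the start and end coordinates, and the strand. Below is a minimal sketch of parsing that string and computing the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00006406: 156059 .. 181848 (-)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("tig00006406: 156059 .. 181848 (-)")
print(loc.contig, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 25790 bases. The "(-)" strand means the gene is annotated on the reverse strand of the contig.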
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCGGAAAATATTCCAAAGCAACGACGAGAGGATCAGAGTGATTCAGGCATTGTTGTTGCAGGAAATGAATATGACGCTGCAGGAGAGAAGGAGATGGAACCGAAAACTTGATTGTGAAGCCGAATTATCAAGTCAAGCTCGAAATCTTCTTCTTCGATTCGCAGCTCCGACCAAGCGAAAATTGAGAAAGAGGTCGTCGAGAAAATTAGCGGGGACGGGGAGAGAGAGGATGCGAATTGAGCGCGAGAGGGAAGGAAGATTACGGAGCTATTATGAGGCTCCGCCCGTTGGATTTGTTGCCGAGTCAAAAATTAAAATCCAAACATTTTTCCTAACAAAAAAATTAAAACAATTTTTTTTTACAACGTAGGCCAGGGAGGTGCGCTTTAAAGTGTAGAAAATAGAATAAATTCTGCTTTTTCTAAATCTCGGTTTCCACTGTTTTTTCCTTTTTTGTTTTTTTCTTTTTAATTAACAATTTAATCGGTATAGTTTTAGCTTATTTAGATTTGATCAGGTTTCAATTTACCCCTTCTAATTTGGATTTAGTTTTAATTTCATCCTATAGTTAAAAAAAGTTAACTTGGTCTCTATAATTTGATTAATATTATTATGCCTAAATCGTTTTTCACCACCTAAATTTATTATTTTTTTTCCTAATAAACTTTACACTATAATGCATTCAACTAATACAATTAATCAAATAAAAAACATCAGCTCATTTTTCATATAAGTACAAATTTTGCTACATATTCAATATAGTAATAACTTGTAATGGCGGTAACTATTCATGATGTTTGACTAAATTATATTCATTAAATTGAAACATTTAAAACTATCAATATCAAATTGGAACTCTCAATACTATAGGGACTAAATCGAATTATTTGCTAAAAAAAAAAGAAACTAAATCGAATCTTAATCCAAACCATGAGATCAAAACGGTAATTTAACTTTTTTGTTTTATATAACTATTTGCCAATTATTTATCTTGGCTTTAAAGTGAAGAAAATGGGACGAACCATAATTCTCCTATTTTATCAAATTACTTTCTGATACTGGAGTTGGAATTTGTAAACCATAGATAGCCATGATTTTGTGATATTTTACCGAACATAACTTATACATGCAAATACGAGCTTCTATTACTTTTTAATGTGATTTATTTGTATTCATTTATTTATGTAACTAGAATTATGACAAATAATAGCATCTGACGCTGTAATTATATGTATGTTGTTTTTTTTTTTTTTTTTTTTTTGCTTGGGCATGAGTTATTGAATTATAATAGAAAATATATTGAATTACTAGCAACTAAAATATTTAATTACAGTCTATTGGTCTGTTTTTTTATTTGAGTCTGTCATTTATCACTTGGTCAAATAAAGTGAGTAAATTTAATTTAATGGACGCTCATGCATGCTTTATTAAATTAAAACATTAGATAAATATAGCTTAGTTGCTAAAATTTATTTTAATATAAAATCAATAATTTATTAATAAAATTGAAACATATCATGTGATGATATTAAATTTTCAAAATATTAGAAATGAACTTTGAAAATAACACAAATATTATGAACAAAATTATAAACCTAACTTAATTTTACTTGCTCTTTATTTGTAGACTAAACAAAATTTATGACTAAGACTGTTTAAATATTTCAATAGATTTTATATATATTTATTTTGAATGCTAAATAAAGGTGCATTTATCAATGATAAGGTATTTGACATTTTCAATATCCCGCACTATAAGAAAGTATTATATCTGTCGGATTTAACCAATTTAATAACGACCACTTGTCGTTGTAAATGATGAACTGACGTATGCAAAATAGGATAACATTTTAATATTCTTTTAACTAATTATTTATATATCAAATCCCAATTGGGGATGTAGCTCACATGGTAGAGCGCTCGCTTTGCATGCGAGAGGTACGGGGTTCGATACCCGCATCTCCATTTTACTTAAATACTCTCTTTTTTTGCCTT
CGTTTTTATCCTTACTTTGTCCTTTTAACTTTTATTTTATTTTAATTTAAAATATGATTTTTTTCATATTAAATATATAATTAATTATATATATTATTGAAAAATATTGAAATTTCAAGATGCTCAAATCATAATTTTTCCATATTAATAATGAATGATATATCTTAATTCTTTAGCTGCAAATAAAATAATGGAAAATTTGTTATTTTAATCCTCGAATTTTCTACATGTCAATTTAATCCCCAAATTTTTAAAAGTTACAATTTAACTCATGAGTTTAGAAAATAGTTCTTTATATATGACGTTACTTTTCCTTTTAGCTACAATAACACTATCTTAGTAGATATTTTTATTACTTTTAAATGTTGATATGGTAAAAATAAATATGTTTTATTGTTTTTATGTGGCAAAAATATATATATTATTTTTCTACTCCTTCTTTCTTCATGTCTCGTCCTTCTCCCCCTCTCCCTTCCGCCACCCCCTTCTCCCTTCGATCGCTCCCTTCCCCTTCCCCGTTCTATTTTTGTCCCCAACTTCCTCCAATTTGTTGATAGACACCTCATTCAACAATCTGTCGTTTCAATAGTTGCCTTTGGAGGACAAAGCCATAAGATTAGCCTTTGGTGACCTCAAAGAGGGTCATTCCCCACCAATATCGAGTGGCAAAGATAGCGAAAACAACGTAGCATCGAACCATGTAGGTTAATGTCGATGCACAACTATCACTCATCCTTTGACATAATATGGGCTCTTACAGCCAAACAACTATCAATGTCGCATTATTGCCGGTACTAATACTACGATTTTCCTTTAATTTTTTTTCTTGAATCTTTCTCTTTCAATCTGTGAAGTTTTGATCTTTTAGGGTTGCATGTTGTGATTTTGTTGGGTTAGTGTTTATGTTTAACTTTAACTTTTGACAATTGAAATTTAGATTGCAACCTACAAGATAGTTTTGACTTTAAGAAATATATATGTGACCAAATTGGAGGGAGCAGCAGCAAAATAGAGGGGCAAGAAAGCAATGAAACGGGCAACGAAAGACAAAGGCGAAGGGGGCGCCGAGAGAGAGAAAGAGAGTGAACAAATAGATATTTTTGGTACATCATAATATGTGTGTGTATATATATATATTTGTCACATCAATATTATATATATAAAAAAAAAAGTACAATCAGCTACCGTCATTGTAGCTAACAAAAATATATATAACATTATTAGAGATCAAATTGAACAAGCTTTTATTGAAATATGGAAACTAAAAGAGGATGATAAAAGAAAGGAAAAGCATAGCTTTTGAAAATTCAGACAAACTTGACTCAAGTTAAAACTCAAAGACTAATTTTTCCTAAAATTACCGCAAACAAACTTGAAAACTAAAGATGCTAAAACAAAATACTACACACGTAGAAAATTAAAAAAAAAAGGTAATGGAATATACAAAGAGTAGCATGCTGCATATTGTAATAACGAGAATATAATTTATAAAAGTTGTCCGCAAGACATTTATAATGTATTTATAAAAAATGTTAAATATATAAATTAGAAGGTTTTTAACAAAATAAAATATTACATATTTTAAATTCTAAAGATATATTCAGCCAGTTGCAATTAACCATCATATTCAAAATAGTGAAACAAGAATGATGTCATTAACTGCAATTCGAGCATAGCTCGACAGATAAGACACATATTACCATTTTAAATGTAGATGGTTTGATCCCCACCCCACAATTATTGAACTTAAAAAAAAAAGAGCGATTCCATTAACTATCACAATCAACATTATGAAACAGTATATTAATATTGATTAATAATTAATGTTGGCTATTTGCTACTTTCTAGTTTCCTTTTCTTTATAAAATATATTTGACAACGTCAGAATTGTAAGAGATTCTTCAAAATATTCAGAATGTGCTCAAATTGTATGCTTCAAAATACTAATTAATTTGTCAAATCGAACATAACTCAATTGGTTAAGACATCCAATAACCCCT
TAAAAGGTCAGAGATTCGAACCCTCACCCCACCCTTGTTAAACTACAAAAAACTAAGTAATTTTAGATTAAAAAATATATTATCAACTACTTTCAACATTTGTATTTTTAAGGTATGCTCGCTAAATGAAAATATAATAACCAATTTGTGATTATTCAATTGATTTAAGATTTTCATGTAGAGTCCAAATCTCGTTATTCTCTTATATTTTTTTTGTTAAAAAATGAAAAAGAACAGGCAAAAATAGTCATCGTCTTGTATTTTCTTAAAGAAAAATTAAAGTTAACAAAATTTAAAATTAGGTAATCCTGCTCCCATGATCTCTCTCTTGGAAGGACCGTCTCTCCTAATCTCAATTTTGCCCATATTTGAAGCTCTTTATGCGATTGCATGAGCTGCACAATTCCCTTCCCTTTTTTGTGTGTTTGAAAAAGTCCACACCTAGGACATCAACAATTTGCAGCGTTTCCTTGATGGACATATCTAGATTTCTATGAGGTCTTCGACTTCCTTCCTCAGCAGTCGAATAGTTAATCGGCCATCTCCCCGTAACATTGCATATGGCAAGTTACATGGTTGTGATGTAAAAAAAGTAAAAAAATTTATTGATTAGCCATACTAGGGATGCATTCTCAAAACGAAAATTAATAAATCTTTAAATATTTGCCTGTATCATTAACGGGGATGTAGCTCAAACGGTAGAGCGCTCGCTTTGCATGCGAGAGGTACGGGGTTCGATACCCCGCATCTCCATTTCTTTTCGCTTTTGTTTTTTCTTTCTGGACACGATTAATATAATTAAAATTTTATTTTTTTGCAATGCGATCAAAGTTGGGGATGTAGCTCACATGGTAGAGCGCTCGCTTTGCATGCGAGAGGTACGGGGTTCGATACCCCGCATCTCCACATGTATTTTATCTTCTGCTTTTTTTTTTACAAGTTTTAAACTGCGAAAAAAAAAAAGGAGGAGTTGGGCTTTGGCCTTATGGGCTCAAGCTCGATGATACCACTAGGCCATTTTTTCCAAGTGAAATTCGATACGTATTTCTTTCATTATTGTTTTTATGGGATTTTTTTTGTGTCTCTATTTATTGTTTATCCTTTTTTTTTTTTAGAATATTGCTTATTTTTTTTCTTATTGAATTATTAAAACTGTTTTTCATTAAATTTACATAATCTAACAAATAATTTACTTTTTAGTTTTATCTTAATATATCACATAAAAAATACAAATAGGATTCAAATCTACAATCTTGAAGTGGATAGCTCGATAGATAAGACACCTATTACATTTTTTAAGGTTGAAAGTTTGAATCTCACCTCTACATTTATTATATTCAAAAAACAAATTTGTAGTGAGATGTCTTAACCACCAATTATGCTATTAAAGTTGTTTTTCATTAAATTTTCATAATCTAAAAAATAATTGGCTGGATTCACTTAAATATCTAATTTAGTATAACTCATGCGACCTTTTTTTTTTTCACACGTAATAAATTAATTTATGTACTCTAATTTGTCACGTGATAATTTTCGTTGAAAAAAAAACCCATAATAAATTGAATTTTTTTGCTATAAAAAAATGAGATTTTTTATTTTTAACGAAATGGATGAAAATTTAAGTAAACATTTGTGAAAGAAATTGCAGTTCCCCGGCTCCCAGTTTTCAGTTTTTGGTCGTCTGACTCTCCGGCGTCTCCGCTCCCTCTAATAGGCAAGCTCCGGTGCTGTGCATGCGAACTTGGGAAAAAACCATCGATTTCGCCGAAATTGAATTTTCCTTTTCGAAACGGAGCTCAATCTTCCCAATCATACACGACTGGAGCAACGCAGAGAATGAAGTTGGAAAATTTAGTAAACTGGTTAGAACTACCTGCCTTCTTTCCCCTCACTTGCCTTACCTTTACGGATTACTTGCTCACTGTGTTTGTATGCTTAGGTATGGATTTTGTCAGAGGTGTAGTTGACTATCTTGGATCAATCTTCTCAGAGACGAGCTC
AATTCACGAGTCGCCGCATAATCCTAGCGGTGTGGGTGCTTCAACCATGGAAGGCGTTAATGGAGTTCCTGTTTTGAACGAGCGCTATGCTTCCAAGCTCAAAGGGTACTTCGATTTGGCAAAGGAGGAGATCGCCAAGGCCGTCAGAGCGGAGGAGTGGGGCATAATCGACGATGCGATCCTGCACTACCAGAATGCTCAGCGCATTCTGGCCGAGGCCAGTTCAACCGCTGTGCCTTCGTTTATCAGTTCCAGGTATTCAGAATTCTGCCTATTTTCTGGATTATTAATTCCCAGTTCACCATTTTGTCAATTGAGGCTTTAAACTGAAAACATGTAGATGTCTAAATTTGAGGTTTAAAAGGTTAGGGAAGGAATCTATATCTGAACTTTGAGAAAATAAGTAAGTGGTACTTGGGAGGATGGGAAAAGTTAGAAAATATTTGGAGTTTTAGTTTAAATTATCATGTTTTGTCGTGCAAAATTGAGAAAATAAATAAGTGGTAACTTGGGAATCTTCTTTCATTTGTTTTATTTTGTACACTAATCTGTTTCACAGCTTGACAGTGTGGTATTGTCACGATAGCATTTAGCCATTATTTTGCAATCTCTTATCATTTGTTTAGGTACACTAATCTTTTATCTTCTATTATTTTGTAATCCACATCTGCCTTAGTTAATACTTTTACACCCCCTCAAAAAGATATACCTGTAAATTAATGACGTAACCAGTCAAAGTTCATATATTTTGTGCTCTGTCTGCCCTATGTATATTTTTTCATTTTCCAACAGCGAACAAGAGAAGGTGAAATCTCATAGACAAAAGATATCAAAGTGGCAGAGTCAAGTTTCTGAGAGATTACAAGCTTTAAATATGCGAGCAGGTTCAATCTTGTCCTCTAATAGCTTAAACAATTTCATCCATGATGATCTTGGTTGTCGGGGATATTTAATAACTTCTAATCTTTCTCTTATTTTGCTATCTTCTAATTTCAGGTGTTACATCCACAAACAAGGTATGGTTTCGTATTTCTGAGTATAAGTATTCAGTGTAAGATTCTATTATCTTACTTAAATTAAAACATAGCCATTCCTCCCCCTCCTGGTGGTGGTGAGTAATTGCTTAGATATATTATAAACCCTAAAGTTCAATGACTTGAACTTCAAATAGGAAAGAAAGAGAATGCAAACTTCACTGTTATCTTATAAATGCGGCGTGTTAGGATTATTAACCAACATAAGTAACAGATTATCTCTCTGGAGTTGGAAAAGTAGACTTTACAATCATTTGGCTGCATACTTGATGTTTATTCTTCCGAAAGGAAGGGGCCAATTAAGAATGCACTGTTGGCTCTCTTATGAATGAAGTATATTTTTCTTTATTTGTTTTTCCAATTTGAAATATGGTGGAGTTAGATTAACATGATGATTTCATATTCTATGTATTTTCATTTTATTAAATTTTATTTCAGAGCTCCTTGAATCATGTGCAAAGAGCTGGAATTGCTTCAACAATGTCAAATACTAAAAAAGCAGTGTTAAGGAGCTCTTCTCATAGTGTTGCAAGTAATCCAATAACAAGAAGTCAACCACCTAATGTTGGAACTTCAAAATCTATGCAAGAAGTTCCTAATGGATACGATGCAAAATTGGTTGAAATGATAAATACTGCTATAGTGGATCATAGTCCTTCTGTAAAGTGGGATGATATTGGTAGTGCATTTTTTCCCTTCCATTTCTCATTTACATCATTTTATTTCATCCTTTTTTCCGTTGTGCTCGTATTCTCAAATTGCACCGTGATTGGTCTGGTTCAATTGTAGCTGGACTTCAGAAGGCAAAACAAGCTCTATTGGAGATGGTTATTTTGCCTACAAAGAGAAGAGACTTATTCACTGGCCTCCGAAAGCCAGCTAGAGGTAGTGCTTTTAGAAATTGTCTCCTACTACTTGTTTCAAGATTGAAATAAAGTGTCATACCATCTACAAAAAATGGGTGCT
GCGTCAAGATATCATGTGCCTGGACATCATACATTACTGCAAGTTTGTGGTTTATGTTGAAACAATAAAGATGATTACATTTGATTTGAATGCTTTGGAAATGGGTAATTGAAACTACAAATTCTATCTGATTAAAGGAGTCATGGCTTGCAAGACTAGGTGCAGCAAACCAGATGGAGTGCTAAAATAATACTCGGCACCTACACAATCTTCATTCTGATTTACTTGATTTTCTGGGTCTTTTCTAACATCTCAACTCATCGGAGTAGCTTGTTGCAATTCTATTCTCGAAGCTGCTATATCATCTTCCACCCCTTATTAATTGGATTTGTCAGTTTCTAGCTAGTAATGCTCCATATAATAATCAACTTGATCTTCAATTTTAATATTTTATTAGATATGTAGTTTTTCAACTTATATTTTGCGGAGGCCTCCTATGAAAAAAAGACCTTTTTATTTTATGGAGTTCTAGGGGATTATTTGCGTTAAGATTTGCTTGTCAAATTAACATTTATTGCTCTAAGTAATCAACTGCATTGTAAAATACCATGATATCAGCACAATCTCAAGTCTTAAACATATTCATTTCAATATTTTGCACATATCTTATATCTTCTAAGGACAAGCAAGTACCTCCTTTTCTTGGCCGATTTCCCCTTTTTTTAAAAGGTGTGTTAGAGCCGAGGTTCTGGAAATTTTTGAGACTTTAATCTCATTAATCATTATTGTGGATCCAACATTGTCACATATTTTCCTCAGGTCTTCTCCTCTTTGGTCCACCCGGTAATGGAAAGACTATGCTTGCTAAAGCTGTAGCTTCAGAATCAGAAGCCACCTTTTTTAATGTTTCAGCCGCATCCTTAACATCAAAATGGGTGTGTCTCAGCTCCATGTTTAAGAATTCTATTCTTCCTGCGGAAAAAATTCTTCCTGTGTATTTTCCTTCCAGTTATGCCACGCAAATTCTAATAAGTTGTATTTGCTTTAAGATAGGTGGGGGAAGGTGAAAAGCTTGTACGGACTCTCTTCATGGTTGCTAAATCCAGGCAGCCCTCCGTAATTTTCATGGATGAAGTATGTCTCATTTTCTTGCCAAACCATTATTTTAAGGATGGGCTAACAGTACATTATTTTTGTATGCTTAATTCTTCTGTTTTATTTTGTTTTGGGTGCGATGTTGTTTTTGGTTATATGACATAAGGGAAATAAAAGTTGATAAATCACAGAAGGGTTTGAAAAGGTGGATATTTTAGTATTCTTAGCTTAGCTCTTGTTCAACAACTGAACTTTTTGGTCAGAAAATAGTTTTCTTTTTTTTTCCCCTGCTTTGCTGAGCCGCTGACATATTGCACAGATTTCATTCTTGTTAATAACTATTTCAAAATTTAGTAGGCTTGTCTAATCCTCTAGTATTTTCCCTTTGTTCTAATTAGAATCTCCTATATCGCCTAGTCAAACAAAAACAATCGGTGACCTGTAAAGACGAGTTATTTTGTCAAAATCTAATTTTATCTGTTCAATATAGTATCTGGTATTTAATGTGCAGAATGCCATGTAAGAAGACTAATAAGTATCCATATAAATTATGGAAAGGAAACCACTATCAGACAAAATACTGTCAGTGAAGTGCCTAGCAAAGGTGCAAGTCCTCTTACCAAAGCAAAGAAAGCTTGGACCCAAAACTGGAGATTGGGTCTTTATTGGGCATGCTGTTAATAGTCCATCATCTAGCCTTTTAGTCAAATAGGTACTTCATATTGACCATAATATTAGTATGAAATCAATAAATGCTAAGCGTATCCTCTACTGATGTTTTTGGATGTCGTGCAATCAATAGTTTATTTCACAAGTTATTTTCTGTGTGCAACATAATTGATGGCATTCTGCTTTTCTTTTTAATTTTGACCGTACAGATTGATAGTGTCATGTCAACAAGGCAGGCTAATGAAAATGAAGCTAGCCGGAGGCTGAAGTCAGAGTTTCTTGTACAGTTTGATGGA
GTAACATCAAATTCTACTGATCTTGTAATTGTAATTGGTAAGCTTTGGTGAAGCTACATTCTGTTCTGCTCTTTTGGCAATTCTTTCGTGTATAGTGAAGGAGAGGAATAACCTAATTGTTTAGCGCAGAAAAAAATTGGTTTCAATTTGGACAAAAAGTCTGTTTTCTAGCATTTCCATGTGTCTATTTGACTAAGGAGTTTTGGTTTTTTCTTTTTCTTTTTCATGATTATTGAAGGTTACATGTCTTGTTGTCTTTTGCAGTCCTTTTTCCTTTTCGAGGATTTTGTCCTGTGACTCTGTATATTCTCTTTTCTTATTATTAGGACACTGTACAAGTTGATGTATTATATTTATTAACATTTGCATAATATTCCTCAATCTCCATTTGCATTTTTTTTCCTTTTATTTTCTATTTATATTTTGTAGATTTTCATCTGTATATCCTTTTCTTAATTCATAGCCATTGTGACTTCCACATATATGATATGACCTAACTTACGTGTGCCTGCAGTCAGAAAAGATCTCCTTTTATATGTTGAAATGATAGATGGCATTATGGCAACAATCTAAAAGAAAAGCATGTGTTCTCTTCCTTTTGTCAAGATGAATGCATGAGCTAGTTGAAATCCATATGAACTTTTAGATTGTATTTTGATACGAGCTAAGTGGCTGGCATAAGCATGCTATTTATTCTTTTTGCAAATTTGTGTTTTATGTTTATCTCTTTTCCTTCGGAAGTACAAGGTTTTATGCTTTCGTGCAGGTGCTACTAATAAGCCACAAGAACTGGATGATGCAGTTCTCAGGAGATTGGTATGTAAACTTAATGGTAGAGGATGTAAGCGGATTGGAGAATGTCATTTATGTTATGCTTTCTGAATTGACCTCCTGCGTCCATCCATAAGTACCTCCGTTTTGAAAGGGAAAAGGGGAAAAAAATATTCAAAATGTTTAGTTTCAAGTGATATGGACCATATGGTATTCTGTGGTTGTATAAGTCCTAATTCAATTTGTGCCAATTGGAGACTGTTTCGATAGCCTCTTTGGCATTTCGAGTCTCTGTTTTGTAGATTTCATTGTTTCAATGACATGGTTCTTAGTCCCCACCAACAAAAGAAAGAAAAGTTATATGGCATTCTGGATTTGACGTTTTATGTGGTGCAAGTCGATTCCACTAGGCAATACTTAGGGTAGATATGATTGCTTCCAAAGTACGGGGGGCCGCCTTTTGACTTCAGCTTTGTGAAACACTGATTAAAGCACCATTTGAAGATGAGCAAAAATCTGTCAATATGGAACTGCTCTCTCTCTCTCTCACACACACGCACGCACACATGCATGCTGGCATGCATATCTGGATAGTGAGGTGTGACGTGTAGCTGATAGATTAGCATCCTTTTTGTATTTTACTTCATTGAAACGTCTATTGATTTCTCATTGACCACACCCCCACCCCACCCCCCCCAAAAAAAAAAAAAAAAACAGGTGAAGAGAATTTACATTCCCTTGCCAGATGATAATGTTAGAAGACTTCTTCTCAAGCACATACTCAAGGGACAGTCATTTTCCCTACCAAGTAAGAATGAGGTTTTAGTGAATTTTCGGTGAAAATTTCCTTAAATATTTTTAACTTTATTGTAAATATTTGATGTATCATAATTATTATGAGCTCTCAAAAAGTAGAATACATATTGGACTTCAATTAACCAACCCCAGTAGGATAAGTAGTTGACTTAGATGTAAAGTTTATGCATTCAAATTATGGTAGCAACTTACCTACGATTTCAACTTTTTATTGTATGGTAGGGTAATTATTCCAAGGGATTAGTCGGGTTGCGTGTAAGCCGATGCGATGGCTATTACCATGGACTTTTTATACCCCTCCCCTCAGTTACCAGCACCACTAGCCCCGTACTTGTTAACTCTTTCATTTTTGTCTTTTATATTCACCCAAATTTAGCCTTTTATCGTATTTAATGCTTTTGAGGCTGATTGAT
TAATTTAGTGTGCTAGTTAGAATTGGATGTCCTGTTTTGACCAATTGCATATATCACATTGTATTACATGCTTTCATTTGCTAATACAACGTGGTGGAACATTGCTCTTGTGCTATATGATTTGTGTCCCTTCAGTCGAACCATTATATCTTTTTGGTAATTTATTATAATTAACATAAATGACTCATTTTCCTCGACAGGTAGAGAAGTAGAAAGACTAGTTAGAGAGACTGAAGGTATTATTTTCTTTTTCTTTTTCTTTTTCCCTAAATTTCTTAACCTTTCCATGTTTCATCAGTTGGAAAACTTGCATCATCTAATGATCTTTACACAATCCATCCAGTGTATTTGTGCATATGGGAAACTTGTACTTGCAGTGAGATTTATTTTTAAATAAGAAACCAAACTTTTATTAAGAAGAATAAAAGAACAGAGGTTAAAAGTTAGAAGGGCCGTGCTGAAAAAGGGGAACAAACCAAAAGGGAACATTTATATCTGGTTTATTTCTGATTGGGTTGATTCCTTACCCCCCTTTTTGTGTACCCTTTTTTTTTTCCGCAATGAAAATGTTTTCTGGGGTAGGGAAACAATTGCAAGTAGGAACCCTAATGGTTAGTGATCAAGAACCGAGGAATGCCCTGAAAAAAGCTAGAAACAGTAGCCCCGCACCGAGCTGCCTCCCACTGAGCAAACTTATCCTAGAAAACCCTCCTGTTCCTCCTAATCCAGATGCCTCACATAATAGACAAAATGCAGCCTGATGGGGAGCGAACCTTATGGGCAATTTGAAGTTTGAACTAGTATTATTTTCATGTACAGGATACTCTGGAAGCGATCTACAAGCCTTGTGCGAGGAAGCTGCAATGATGCCAATTAGGGAGCTAGGTGGGAACATTCTCACAGTAAAGGCAAATCAGGTGATTCACATAAAAACTGTTGTCATACTTTTAAACTCTGTTCACTGACTCTATTCTCTCTGTTTCACTAAAGATTGGAGTGAAGTAGCATTTATCTTTTGTCAACTCATATCACATCACATGTAGTTCTTTTCAGAATAAAGCTCATTAATAATGTATATTAAACTCAAGTTTGGTCCTAAACTTTCGTAGTAGTGTCGCCCTTATACTTGAAAAATTTTCTAATAGGTCTTTAAACTTTCAAGGTTGTGTTTATCCGATCTTTGTGTTATAAAAAGTTCTAATAGATCTCTGAACTTTTAAGTTTGTGTCTAATAGGTTCTTATTGTTAACCCTGTCAATTTAATATTTTTGTGGCACACTGAATATATTATTTAAGGATAATGTAATGGGTTGATTTTGGGTAGGTTTAGGCATTTGGTGACCTAAACGTTAAACTAACGGACTTAGCACGGAAATCTATTAAACCCAAAATTGTTAGTTCAAAAACTTATTAGAAAATTTTTAAAGTACATGAACCAAATAAACACAATCTTAAAAGTTTAGAGACCTATTAAAAAAATTTTAAAATATAAGGACTAAATAGATACAACTCTAAAAGTTTAGAGACCAAACTTTTAATTTAATCTTAAACTTATATTGTATTTATGTCAAACTCTTCATATAGTTAATTTGGATTTTTTTTAATTCTCACTTCCCCCTTCTTTTGGACTGTTAGTCGCACTTTAAACCAGTCCTCAAATTGTGTGGGTAGTTGTCATCCTGTAGGACTTCATCCTTCATATCATCCTACTACAAGCTGATAATAGTTCTTGATGGATATTTTACTTTTTAAGAACTCTGAATTTGACTTGGTAATCTTACCTGTTCTGTTTCTCTGTGTGATTTTTCGACTCTGGTACGGGGTAATCTCGTGCAAAGATTGTGTCTAAAATTTTTACATCTGTACTATTTCATTGAAATTTTAAATTTTTTGTGTACCAGATAAGGCCATTAAGGTATGAAGATTTTAAAGAGGCAATGAAAGTCATCAGACCCAGTTTAAACAAAAGCAGGTGGGAGGAGCTTGAAGAATGGAACCAG
AGTTTTGGATCCAACTAGATGCGTGCAGAAATTTCCTTTTTGTTAAAGGCAAGCTGTAGTGAAGTAAACTGAAGGTTGAAGAACCCTTCGCTATGCTATATATTGTATTGTACAGTCTAACTTGAAAATAAGCTTCTAAATAGTTCGATTCTTTGTGTTTGAGATGGCTAAAATTCTTAATATGTGAAAATAATTTGTATATGAGAAAAAAAAAAAGGAAAAATGATGAAATAATTATTATTTCTGTTTGTCTTAAATGCAAACAAATGTTACTGAATGCTTGTACTTGTTGGCAGGGAATTTGATGCTGAAGTGAGTTTGAAGCCTCTGAGTAATTTTAACATTCAATTTGATTCAAGTATAGGAGCGATCATAAAAATTAAATTATAATTAAGATTGTAGTTTAATTTAGATTAAATTATAAATCTAGTCACTAAACAATAATTATATGTTTAATAAGTATCTAAATTTTAAATTTTGTGTTTAATAAGTCTGTAAATTTTAAAAATGTTTAATAAATTCTATTAAATATAAAATTAAAAATTTAAAGTCTTATTAAATATTTTTAAACGTCAGAAACCTGCACAAAATTAAAAATTTAGAGATTTATTAAACATTGGACATAATTTTAAAAACTTAATAACTACACTTATAATTTAATCTTTAATTTAATTGTAATTATTTATTTTTAATTAAAAATAATTATTTTTAACTTTAGTATGAAATAATTTACTAAACAATTCGATTAATTTTTAACCAAAAGACTGTAGATTAATAAATTTGTAAAAAACTTAATCAAATTTTAAACATTTTACAAATCTTATTTCTAGTTGAAATTGAAAACATATAAACTTATTCTCAACATTTACTCTTCGAATAGTCGATTTATTTCTCGAGTAGTCGAGTTTCTCCTGGCTTCGAATCTCTTATTGATATAATTCTTACCTAAAAAATTAGTTTTGCATAGATAACAACTGATAATAAAACATTAATTCGAGCATAACTTAATTAATAAAATATCAATTATCATTCTAAATTCTAAATTCGATCTCCCACCATTGTTGAACTTTAGAAAAAAACAAACAAACGATGAAACGATGACGACAAAGAAAACAAAAATCTGAAAGTTAAATGATCGCCCACTCTCGGTTGACCTTCTGAATCTGTCTTCTTCGGTCCAAATCCCCAACAATTTCTCCAACAGTATCTGCATTCTGCCGCTTGGTGATTCTCTCCTGAGGTTTAAACGTTCAATTTCATTCTCTCTACAACATTTTCTCCCGTTTCTAGAGGCAAAATCCATTACCTTCATGCGTAAACTTCGATCGAACAACATAATCAAATCGATATTTATGATCGTGATATAGGCATGGAAGTGGATATCAATATCTTCACCGTCTTTTCCTTCGTATTATGCACAGTCTTCCTCTTCTTTCTATCCTTCTTGATCCTCCTCCTCCTCCGAACGCTCGCCGGAAAATCCATAACGAGCTCCGAGTACACGCCGGTGTACGGCACCGTCTACGGTCAGGCTTTCTATTTCAACAACCTGTACGATCATCTAACGGAGGTGGCCAAGAGACATCGAACCTTCCGGCTGCTTGCGCCGGCGTACAGCGAGATATACACGACCGATCCGAGAAACATCGAGCATATGTTGAAGACGAAATTCGATAAGTATTCGAAAGGAAGCAAGGATCAAGAAATCGTTGGGGATCTGTTTGGAGAGGGGATATTTGCAGTCGATGGAGATAAGTGGAAGCAGCAGAGGAAGCTGGCTAGCTATGAATTCTCGACGAGGATTCTTAGGGATTTTAGCTGCTCGGTTTTCAGACGAAGTGCTGCTAAACTTGATGGAGTTGTTTCGGAGTTTTCCAGCATGGGTCGGGTTTTTGATATCCAGGTCAGCACATTCGAATCAGCAAAAGATATATATAAATGATTAATACTTGCATGTGGCATATTTTTAGAAAATGTTGTGTTGGCAAATATTTTTCTAT
ACGTGTGTGTGTGTTAAATAACAATTTCCTTTAGTATATCATACGGGATAAAAATAAGAAATAGAAAAACTTTAACTTCTAAGCGTCATCTTGGCCCGCATATGTGTAATAATATATTTTCAAAATAAAATTTGGATCACATGACTTGATCTTACCGTTTAATGAATTATAAAATTTGATAAAAGAGTGTCAACTAATCATTATACTTTAGCCTCAATATTAACCACTCTAATCAATGACTCCATCTAATTCAGTCCTAATTATATGTGTCATAATCTATTTAAATCACATCTTTTCTTTTTTTGAAAATATAGCTCTACAAATGTAGAGGTGTGAAATAGAATCTTCAACTTCTAAAAAAGATTTATGATATCAATTTATGCTCAAGTTGATATATTATCTCATTTAAATTTTATTTAATAGTGTATTATGCCAATATTTTTAATCTTATTTATCCTTTATCAAATTTGTACATTTTATTGAGCCAACCTAATGTTCGGTGGCTCTAGTTAGAAGTCATAACTTTGAATCACTTGTATTTGTGACGTAATATTTTAAAAAATGTGTTTTACATTGATTGAAAATTCTGGTCGCTTTTATTTTTAAAACTCAAAAACTTGCTTGCAATTGCAACATGAAAAAACAAAGCTGAAATTTGTCTTTCGCTTTGTTAGGATTTGCTAATGCGGTGCGCTTTGGACTCCATTTTTAAAGTGGGGTTCGGGGTTGATTTGAATTGCTTGGAGGAATCAAGCAAAGAAGGGAGCGATTTCATGAAAGCCTTCGATGATTCTAGCGCTCAGATTTTTTGGCGCTATATCGATCCCTTCTGGAAATTGAAGAGATTGCTTAACATCGGTTCCGAAGCTTCGTTTAGGAACAACATAAAAACCATAGATGCTTTTGTGCACCAGTTGATCAGAGACAAGAGAAAATTGCTTCAGCAACCGAATCACGTAAACTTCATTCTCTACTATCAATGCCTATAAACATTGTTTTCTGTAACTTTGTAGTGATTTTTGTTATTTATGTCAATAATTTGATAGCATAGTTCTTTATTTCTAATTGATTTTACTATTCTTACTGGAATAAAAGAGATTAATTCGTTTATGAAATGATTGAATTTGGAAATGTTGCGGTTAAGTTATTATTCGCTAGGTTCAAATTATGGCGATTACTTAACTAATTATATCCTACAAGTGGTTTTGGAACGTGTGTTGTAGCCTTGTAGGATCAGCCAATATCAAGTTGACGCGAACACTCTCGGTTACTTTTAGGACTATAAGCAAGTTTTTTATTCCAAATGTTGTAGTGTCAAGCGGTCCTAGTCGAGATATGTGTAAACTTTTGACACTTATGATTATTAATGAAAAAGAGCATCATGCCCAGAATGTACAAGTTAAACTAAGTAATACGTTCATTAACATTTTCTTAGTAGAATGTTGATGGGAAAAAATGAATTGAAATAGAATCGAAGCATGTATAGCTTTGTTTGAACTCTTTGCTTTTGTTAACTAGTTTGCTGAATGTTGTTTGGAACTTGAAGAAGAATGACAAAGAGGACATACTTTGGAGGTTTCTGATGGAAAGTGAGAAGGATCCAACAAGAATGAATGATCAATATCTAAGGGATATAGTCCTCAATTTCATGTTGGCTGGCAAAGATTCAAGTGGAGGAACTCTGTCCTGGTTCTTCTACATGCTATGCAAGAACCCTTTAATACAGGAAAAAGTTGCAGAAGAAGTGAGGCAAATTGTTGCGTTTGAAGGGGAAGAAGTTGACATCAATTTGTTCATACAAAACTTAACTGATTCAGCTCTTGACAAAATGCATTATCTTCATGCAGCATTGACCGAGACTCTGAGGCTATATCCTGCAGTCCCTTTGGTAAGTTTTCACAAAACTGACATTTTGGCTATGAATATGAGAATGGATCAACCGTAGATTAATGTTGTAGAGCAAGTTTAGGGACTATTCAATAGATTTCTCAATGAATACT
TGCAAATTATACCCCAAATTTGTGTAGTTTGTAAATTTTGTGGAATCTGCAAATCATGTCCTTCTGTAAATTAGGATCTTTAGTTTTTGATAGAATTCACTTGTTATTGATGGTAACAGAGAAGTACAAGTATATATCTAGGGAGAAATTTGAGAGCTCCTGTCTCTCCAGACTCCAAGACTACTATAGCCAAACACAACCATGACTTATATATCCACCACCCTATTCACTCACAACTTTTTCATGCTTACATCATTACTAAGTAAGAACTAACCCGCTATATGGGTCCCAAACTCTAACAGCTTATTCAATAGTTCCCTCGCTAGGAGAAATTATTTGAGGAGACTGAAACAATGTAGCTAGCAATTCTATACTGTTAGATGTATGAGAGAGAATATAAAAACTATATTTCAAACCTGTATCGAACGGTTATTAGGGTGGATAGATTGTGAAACTGCATTTAATGGTTATTAGGGAGCAATTATATTGTGACGGCCATATTGAATAATTTTTCCTCACTAGATCTCCCTTTTGACCTCTTCCTATGAATACATGTTAATATTGGGGCTATTGAATTCTTGGTAGAAATGGTCACATGACACACATAGAATAGCAATGTGTCAATTTTTTAAAAGATAATGGTAAAATTTAATAAATTTATATGATAATCATATGGATGTTTAGAACCATTTTTAAAATGAGGTTGTGGTTTGCATATTTCACAAACCATATAGTGTAGTTTGTAATTATTCCTTAATGTAATGTTATTAAAGCACCTATATGGTAACAAGAATTAGTGTTATGTGTTAAGGATGGAAGGACTGCAGAAATAGATGACATTCTTCCTGATGGCTATAAACTAAGAAAAGGGGATGGAGTATACTACATGGCCTATTCCATGGGCAGGATGTCCTCCCTTTGGGGAGAAGATGCTGAAGATTTTAAACCCGAAAGATGGCTTGAAATGGAACTTTTCAACCCGAATCACCTTTCAAATTCATCGCTTTCATGTTAGTATTGATTTTATGTTAATCATATCATCATTATATTTTATATGTCTTTACTAAGTTTCTTCATTTTAGGCGGGTCCTCGAATGTGTTTGGGAAAAGAGTTTGCTTATCGACAAATGAAGATAGTATCTGCTGCTTTCTTCAATTTTTCGATTCAAAGTAGCTGATACAACAGGAATGTGACTTATAGGATCATGCTTACCCTTCACATTGATGGAGGTCTCCCTCTTCTTGCAATTCCGAGAATTAGAAAATTTACCTAACGTTTGTATTAGCGTGACAAAGTGTATAATCTAAGAAGATTGGTTTGTAATTTGAATTTGCTTGTATATATAATAATTTACATATTTGAATATGATTTGCTATAGATTGTCAAAGTAATTAAACTAGTTAAGCCCTCCTCAATGGTCCATTTTGTTTAGAGATTTTTTTTTCCATTTGTTATATTTGTTTATTTAAATCCATTAAAACTAAGTGAATACTCTTTTATCGTTATTATTCGTTTTAAAAATATAATAGAGAAAACACTCTAAAAAGTAAATTTATTTTTTTATTTTTCTTTTATAAAACTTTTATTTTTTTAAATATTAAGTCGCAAATAAAGAAGATGGATCTAAACCTTCAGATACTTAATTACATTTTAGTTGTTCAGATTGGCATAAAACTTCTATATTTAAAATTGCCTACAAATATTTTTAATTAAAAATGCTTCTGGAATTAGCTGATTGGTATTACTTAAAAATTTATTCAAAAAGTTACTTTTGATTGGGCAGGGAAATATAATATATAATATAATATAAATAATAATGTCGGAAATGGAAGTTGAAACGGTGCGTTTAAGGTTCGACATGATATAAGAATTCAAATTCGTTCCGTTCGATATCCAACGTCCCCAATTCCCTCGCTTTCCTCTGTCCCCAGATTTCTTGCTATCGGAGTTCTACGGTGGCGAACCGATGAAATCTCAGGCGAAGGACGAAGATGAAACT
CCAAAGCGGCAGTGGTCTTTGCAAGACTTCGACGTAGGAAAGCCCCTCGGCAAAGGAAAATTCGGCAGAGTCTATCTAGCGAGAGAAGTCAAGGTTCCCTTTCCTTCCGTCTCATTTTCTGTTTGATTCTTATTGAAATGTTCTTGAATTGCTGATGATGCCGGATTTTTAGACCTTCGGTTCTTTGTTGTAGTACGATTGATTTTTGGATGGTGATATTTTGTTTTGGTCCACTCAATAGCTTCTAAATCTGAAGAGATTTTCGTCATTTTGATTTTTGTGGCTGCATGTGGATTTTTCTCTGGTTTGAATTGTTATTTGTTTGTATTTTATCTCCTCTTTTCCCGGAAACCAAGCAGAGAGTACTAGGTCTCCTTTTCTGGAAATTATCGTAGCTACCCAGTGCTATCACTTCTCAATGACCGAACAGAGTGATACAAACTAAATTTGCTTGGTAGCGAGTAGCGATAGCGTATGATTTCTCATAAATACGCAACTTATAATCTCTCCAGTCCATTTTGCATAAAAAAATAAACACGAAATTCCTTTGTATTTTCTCTGTTTCCAATCATAAGATTGTATATCTGCATTTAGGTTTGACCTTAGGAGCTTCGATTTGATTTTCAGAGCAAGTATATAGTGGCACTGAAGGTGATTTTCAAGGAGCAGATGGAGAAGTACAGGATTCATCATCAGTTAAGGAGAGAGATGGAGATACAGACCAGTCTTCGGCACCCGAACATCCTGCGCCTCTATGGATGGTTTCATGATGCTGAACGGATTTTCTTGATATTGGAATACGCTCACCGCGGCGAGCTCTATAGGGAGCTTAGGAAAAGTGGTCATCTCAGCGAGAAGCAAGCTGCCACTGTAATCTCTCTCTCTCTCTCTCCTCTCCTTGTGAACTTGTTTACTTTGGACTGAGGATGTTGAAGAAGTGAAGGATACCAGAATATTGAGCCAGAAACAAAATTGTAGGAACTCGTTTCATGAATTTCAGGATTTTCCATATTCTACTTCTCATCGATTTTCGTAATTTTTTGTCACTTCACAACTTCGTATTTGACTGAGATACAGGAAACCAGGATCTTGCAATCTGAACTTAGTCCTTACAATATATCGTTATCTAAAGGTTCTTTTATGTTTGTTTTTTGTGAATTTCAGTACATTTTAAGCCTCACCCAAGCATTGGCATACTGCCATGAGAAGCATGTGATTCATAGGGACATTAAGCCAGAAAACTTGTTGCTTGATCATGAGGTATAATTCTCTCTCTTTGTTATTCAACAGCTTAGATAATGTTTCTATGCAAAATTTTTTAAAACCTTAGTAAGCCAAATTTCACTACTCTTTGTAGACAGAGAAGTCAAATCTTTTTATTATTCTTATTGTGTAATTATGTTGTATTTGTAGGGCCGGTTGAAAATTGCAGATTTTGGATGGTCTGTTCAGTCAAGAAGCAAGAGACATACCATGTGCGGAACTCTGGATTATTTAGCCCCAGAAATGGTAGAGAACAAAGCTCACGACTACGCAGTAGATAATTGGACTCTGGGCATCCTATGCTATGAGTTCCTTTATGGTGTGCCTCCATTTGAAGCCGAAAGTCAAAGTGACACATTTAAAAGGTTAGGGACATAATTTGAATGAACACATCTTAATTAATCATTTAACTAAAAAATTTAACACATTTTAAAACTTGACATTGGATTTATTATATGTGATTTCAAATACTTTTCATCTTAGATATATGCTGATTTGATAAACAAATAAAAGATGTGTCTATGTTATGTTATGGTTCTGTTTAATAACTTAAAAAATTAAGACAATGACTAGAATAATTTGTATATAAAATTTAAGGATAGTACGTACAGTTTAGAAACTAAAATAGAATAAACACAAAAGTTTATGAATTAAAATTGAATAAAAACCTATAGAAGTTTAGTGAACTTTTTTCTTCTTAATTAACAACAAAAACTTGGTCTAAATAACCGTTTGGA
ATCAAATGCAGGATAATGAAGGTTGACTTGAATTTTCCGTTGACCCGTCACGTTTCCCCAGAAGCAAAAGATCTCATTGGCCGGGTTAGTTAAATTGTGAACATGTCTTTGATTGGTAGGACTAGGAGCAGAGAAAATGATGTGCAGTATTTTTGGTTTGGTCCTGAATCTGTAAAGTATTTTTCGTCAGCTCTTGGTGAAGGATTCCTCTAGAAGACTTTCACTTCAGAAGATAGTGGAGCACCCATGGATAATCAGGAATGCAGATCCATCTGGTATTTGCAATAGGTAGAAGAAACATCACTTCACTTTTAATAATTGGTCTTAATCATCATGGCTCCTTTCATATAAATAGAGTTTTGATTAAGATCTGTGTAATTTCCCAGTGTTGTAGCATTCATGATTTATAACCATAACGTTGAAAAAGTTGCTAGTAGTATTCCCCTAGTATTACAAGCTAGATCATGCCTTGGGGCTAAATATTCCAATTTTTCCCCTTTTTGGTGTGATTTCGAGTTTGAAATAGTATTATGAGATGTATGAAATTTAACATCTTGATAGATCTTTAATCAATCTTAGTTGATCTCAATCATTCGCATCTTGTTTGTTCAACTTGCTCGTCTTGCAAATAAAGTTGAGCTCAATTGATTATCAATCAATCTTAGACAGACTTGAAAATTTTCTCAAAGTTTTTAAAAGGATGAGAATAAGATGAGATAAAGTGGGAAAAAAAGTAGAATATTATAAAAAAGTGAGCGACGCCCAAATTTTGCTTTGTATAGTACGTATTAGGCTATTTCTCCAAAAACCTGTTTCATTTATTTCGTTTGGATTCTAAAACAATAAATAAACGAAAAATGTTGTAATGAGTAGCTTTTTTAATTTAAAATATAATTTCTTTTACAGTAATTTAAAAGTATGATCAATTTAAGTTAATATGCTAATCATATGACTATACTCTTATTAATGATTAATAAGTATATTTAATATCAATTAAATATCACCAATCTGTGCATAGCTCAACTGATTAAATATTAAAAATCTCTTAAAGGTTAGATTCTAATTTACGCCTCATGCCTTGTCGAACTCAAAAAAAAAAAAATTCAATTATGTTAATATATCAAATATCAATAACATATTTCAATTCGCTATTTTTACGATTTCACTTTGAATCTATTTATTTCTTTAATTTTACATCTTAATAAGGTCTTTTCACCCCCTAATCTTGAATAAACACAAAAATAACAAAGTAGGTTTTATTTTTTTTTAGTTCAACAAATGTGTGGGGTAGAAGATCAAACTTTCAACCTTGGTAAAGGTAATAACAAAATAGGGTTTGGGGACTGATAAACATTCACAATTTATATTGTTCATAAATCCAAAACAGAATCACTTGACGATCCGATTACTTGATTAGAGCTATAGACCAGTCAGCATAGCATGGTTACCTTTTATTGATATGCAATGCAACCAAATCGAAGAGAAAATGCACCCTTTTTTCTAACAACTCCATCGCGTTCATATCCTCCACGCCCGCAGCTGATGAATCAGGACAGCAGGAACGGAACTGAAGAGTCGCTTTAGAAAATTGGCGCACGTTTGTTAAGATCAAAACATATATATAAACGGATGTCCAAAGTTATCTTTTCACTTTTCAGTGAGGTTTTAGGGGTCAAATATGACATAACAGAGACTTGCAGGAACGTTACTCGAAGATTTACCTCTTACCGTAATTACAAGAAATTGACTATGCCGAAATGGTACAAAACTGAAATAAAACAGCAGATATAGAATAATTCCAGGGCCAAATGATTCTGCTTCGTACCTTCTTATCAGGGGGAAATGTTTTCGTAGGCAGTGATAGTGGTGTCGGTAATGGTGGTCTCACTGCTACTGCTGACATTGCTCCTTTCTTCATCATGTCTCTGTTTATTGCTTGCCCAGATCGACATTGAGAGATTTGTGTTTGGGGTTACAGGCTCGGGTTTTGTGAGCAGTGT
TCCTGGTGAAGGTAGCAGGGTAGTGCCAGAGGTAGCAGCAGAAGTAGCAAGAACAGCTTCCTTTTGGGAGGCCTTCAAATGCTTCTTTGAACCTCTGTGCATGTGCAGTCCACAATACTTCTGATTTGGAATAACATCTCTCCGGCATCGCCATTTCTTCCCATCCGTTCTTCTACACCTTCCTGGTTCAAGTTCAAGGTTGATTTTACTATCACAGCACGAGCTGCTGCAACCTGGAACTGGAAAATAAAAATGACAGCATAATTTAAAACTTCACTTTCTAAATAAACCAGCAATTATCGATAAAACTTTCATGTAGAAACTATGAATCCATTACTTGAGTAAGAGGCAGCAGGGGATATAGATCAGAACAAACACAACAGATTTAAAATTTTAGCTGAAAAAAAAATCCAGAAAATTTTGGCATTCTACTCTATAAATTATAAAGATTTTACTTGGGGTAAAATCATAAAAATTGGAATATGTATCTCTTCTTCGTTAGATGCCTTGGGGGGAAGAGCAGTAAATGTTATGTATAAAAGAAGGCATGAGTCCTTCAAAACCAATAAATTGTTTGAAAGTCCTATGTTGCTAATTGCAGCATTCAAGTTCCACTTTCAGAGATCAGATAACATCTACACCTAGAAGATCACGAGTATAAAAGCTTCTCTCAAAGCTGGAACAACAAAAGAAAACAGGTTAAAATGTAAAAACATTTTAGGGATTAACTTTTGATACAAAAGTGATCTTCATGGAGAGTAAGGTTCCATAATGGAGGGCATAAGAGAATACATTTTTTGCATTCAAGAGTTTTAACTTAGACGTCAATTAGGAGAGATGAAAGTTAACACTTCTTTTCCCTTCTGGGTGGGGGGGAATAAAAGCATGCTAAACTTTCCATAAGGAATGAACAGGCATAAAGTGTCGCACAAAGGCATAAGAAAAGAAGATGTACATGAAATAAATAAATAAACGGAAAGGGGAAGACTGTAGCCCACATAACGGGCAGCTTCTATCAAAATCATGATGCAAATAGCCTTCCAAACAATACCTCATAAAAAATAACCATTTTGCAGCACTCAAGGATTTGTTCCTGGAATGATACCTTGAAGAACACTGTTAGGAGAGAACCCTAAACCTGGAGATGGGATCTTCTGGACATTGATATCCTTTCTTCCAAAGCCATTGCTGTTTGTGCTGCAACTCTCACGAAAATTCATGTTGCAGACATCTTTGAAGTCAACTGAATAACCAGAGCGAACGCTGGTAGTGGCAGTTGAATTAGTGCTAGGAACTGTTGTAATATGTGTTGCAGTGGTGTTGGGGGCACTGGCAGGGACATTTCTGATGATTTTGGTTGCATCAGCACAGCTGGTAAGAATGGTTGTTGCTGCATTCTTGCTGCAGGTAATGGTTTCAGTAACTGTTTTAGGACCTGTTTTGGGACGGGGACTATGGCTTCGACGGGGGAACGGTGATAGCAGTCGGAAGATCACAACTTAAGTGATTCTTGGTTTTGTTATTCCCACAAATGGTGGAGGCAGGAGCATCAACCGTACATCTTACATCTCCGACTGGATTATCGAGTAGAATTGTCTTTGATGGAGAGTCAATTTCAGAAGCTTCCACAGGCTTTCTTGAACGCTGTCGGCCTCTATGCATGTGCCGTTCACAGTATTTCTGATGCAGAACAGTATTCCGGCTACATCTCCATTTCTTCCCATCAGTTCTTCTACATCTCCCTGGTTCAGGATCCATCATACTCCTGTAGTCGAATCCCAGAGGACTGA

mRNA sequence

ATGAAGCGGAAAATATTCCAAAGCAACGACGAGAGGATCAGAGTGATTCAGGCATTGTTGTTGCAGGAAATGAATATGACGCTGCAGGAGAGAAGGAGATGGAACCGAAAACTTGATTGTGAAGCCGAATTATCAAGTCAAGCTCGAAATCTTCTTCTTCGATTCGCAGCTCCGACCAAGCGAAAATTGAGAAAGAGGTCGTCGAGAAAATTAGCGGGGACGGGGAGAGAGAGGATGCGAATTGAGCGCGAGAGGGAAGGAAGATTACGGAGCTATTATGAGGCTCCGCCCGTTGGATTTGTTGCCGATTTTTGGTCGTCTGACTCTCCGGCGTCTCCGCTCCCTCTAATAGGCAAGCTCCGGTGCTGTGCATGCGAACTTGGGAAAAAACCATCGATTTCGCCGAAATTGAATTTTCCTTTTCGAAACGGAGCTCAATCTTCCCAATCATACACGACTGGAGCAACGCAGAGAATGAAGTTGGAAAATTTAGTAAACTGGTTAGAACTACCTGCCTTCTTTCCCCTCACTTGCCTTACCTTTACGGATTACTTGCTCACTGTGTTTGTATGCTTAGGTATGGATTTTGTCAGAGGTGTAGTTGACTATCTTGGATCAATCTTCTCAGAGACGAGCTCAATTCACGAGTCGCCGCATAATCCTAGCGGTGTGGGTGCTTCAACCATGGAAGGCGTTAATGGAGTTCCTGTTTTGAACGAGCGCTATGCTTCCAAGCTCAAAGGGTACTTCGATTTGGCAAAGGAGGAGATCGCCAAGGCCGTCAGAGCGGAGGAGTGGGGCATAATCGACGATGCGATCCTGCACTACCAGAATGCTCAGCGCATTCTGGCCGAGGCCAGTTCAACCGCTGTGCCTTCGTTTATCAGTTCCAGCGAACAAGAGAAGGTGAAATCTCATAGACAAAAGATATCAAAGTGGCAGAGTCAAGTTTCTGAGAGATTACAAGCTTTAAATATGCGAGCAGGTGTTACATCCACAAACAAGAGCTCCTTGAATCATGTGCAAAGAGCTGGAATTGCTTCAACAATGTCAAATACTAAAAAAGCAGTGTTAAGGAGCTCTTCTCATAGTGTTGCAAGTAATCCAATAACAAGAAGTCAACCACCTAATGTTGGAACTTCAAAATCTATGCAAGAAGTTCCTAATGGATACGATGCAAAATTGGTTGAAATGATAAATACTGCTATAGTGGATCATAGTCCTTCTGTAAAGTGGGATGATATTGCTGGACTTCAGAAGGCAAAACAAGCTCTATTGGAGATGGTTATTTTGCCTACAAAGAGAAGAGACTTATTCACTGGCCTCCGAAAGCCAGCTAGAGGTCTTCTCCTCTTTGGTCCACCCGGTAATGGAAAGACTATGCTTGCTAAAGCTGTAGCTTCAGAATCAGAAGCCACCTTTTTTAATGTTTCAGCCGCATCCTTAACATCAAAATGGGTGGGGGAAGGTGAAAAGCTTGTACGGACTCTCTTCATGGTTGCTAAATCCAGGCAGCCCTCCGTAATTTTCATGGATGAAATTGATAGTGTCATGTCAACAAGGCAGGCTAATGAAAATGAAGCTAGCCGGAGGCTGAAGTCAGAGTTTCTTGTACAGTTTGATGGAGTAACATCAAATTCTACTGATCTTGTAATTGTAATTGGTGCTACTAATAAGCCACAAGAACTGGATGATGCAGTTCTCAGGAGATTGGTGAAGAGAATTTACATTCCCTTGCCAGATGATAATGTTAGAAGACTTCTTCTCAAGCACATACTCAAGGGACAGTCATTTTCCCTACCAAGTAGAGAAGTAGAAAGACTAGTTAGAGAGACTGAAGGATACTCTGGAAGCGATCTACAAGCCTTGTGCGAGGAAGCTGCAATGATGCCAATTAGGGAGCTAGGTGGGAACATTCTCACAGTAAAGGCAAATCAGATAAGGCCATTAAGGTATGAAGATTTTAAAGAGGCAATGAAAGTCATCAGACCCAGTTTAAA
CAAAAGCAGGGAATTTGATGCTGAAGTGAGTTTGAAGCCTCTGAGTAATTTTAACATTCAATTTGATTCAAGCATGGAAGTGGATATCAATATCTTCACCGTCTTTTCCTTCGTATTATGCACAGTCTTCCTCTTCTTTCTATCCTTCTTGATCCTCCTCCTCCTCCGAACGCTCGCCGGAAAATCCATAACGAGCTCCGAGTACACGCCGGTGTACGGCACCGTCTACGGTCAGGCTTTCTATTTCAACAACCTGTACGATCATCTAACGGAGGTGGCCAAGAGACATCGAACCTTCCGGCTGCTTGCGCCGGCGTACAGCGAGATATACACGACCGATCCGAGAAACATCGAGCATATGTTGAAGACGAAATTCGATAAGTATTCGAAAGGAAGCAAGGATCAAGAAATCGTTGGGGATCTGTTTGGAGAGGGGATATTTGCAGTCGATGGAGATAAGTGGAAGCAGCAGAGGAAGCTGGCTAGCTATGAATTCTCGACGAGGATTCTTAGGGATTTTAGCTGCTCGGTTTTCAGACGAAGTGCTGCTAAACTTGATGGAGTTGTTTCGGAGTTTTCCAGCATGGGTCGGGTTTTTGATATCCAGGATTTGCTAATGCGGTGCGCTTTGGACTCCATTTTTAAAGTGGGGTTCGGGGTTGATTTGAATTGCTTGGAGGAATCAAGCAAAGAAGGGAGCGATTTCATGAAAGCCTTCGATGATTCTAGCGCTCAGATTTTTTGGCGCTATATCGATCCCTTCTGGAAATTGAAGAGATTGCTTAACATCGGTTCCGAAGCTTCGTTTAGGAACAACATAAAAACCATAGATGCTTTTGTGCACCAGTTGATCAGAGACAAGAGAAAATTGCTTCAGCAACCGAATCACAATGACAAAGAGGACATACTTTGGAGGTTTCTGATGGAAAGTGAGAAGGATCCAACAAGAATGAATGATCAATATCTAAGGGATATAGTCCTCAATTTCATGTTGGCTGGCAAAGATTCAAGTGGAGGAACTCTGTCCTGGTTCTTCTACATGCTATGCAAGAACCCTTTAATACAGGAAAAAGTTGCAGAAGAAGTGAGGCAAATTGTTGCGTTTGAAGGGGAAGAAGTTGACATCAATTTGTTCATACAAAACTTAACTGATTCAGCTCTTGACAAAATGCATTATCTTCATGCAGCATTGACCGAGACTCTGAGGCTATATCCTGCAGTCCCTTTGGATGGAAGGACTGCAGAAATAGATGACATTCTTCCTGATGGCTATAAACTAAGAAAAGGGGATGGAGTATACTACATGGCCTATTCCATGGGCAGGATGTCCTCCCTTTGGGGAGAAGATGCTGAAGATTTTAAACCCGAAAGATGGCTTGAAATGGAACTTTTCAACCCGAATCACCTTTCAAATTCATCGCTTTCATATTTCTTGCTATCGGAGTTCTACGGTGGCGAACCGATGAAATCTCAGGCGAAGGACGAAGATGAAACTCCAAAGCGGCAGTGGTCTTTGCAAGACTTCGACGTAGGAAAGCCCCTCGGCAAAGGAAAATTCGGCAGAGTCTATCTAGCGAGAGAAGTCAAGAGCAAGTATATAGTGGCACTGAAGGTGATTTTCAAGGAGCAGATGGAGAAGTACAGGATTCATCATCAGTTAAGGAGAGAGATGGAGATACAGACCAGTCTTCGGCACCCGAACATCCTGCGCCTCTATGGATGGTTTCATGATGCTGAACGGATTTTCTTGATATTGGAATACGCTCACCGCGGCGAGCTCTATAGGGAGCTTAGGAAAAGTGGTCATCTCAGCGAGAAGCAAGCTGCCACTTACATTTTAAGCCTCACCCAAGCATTGGCATACTGCCATGAGAAGCATGTGATTCATAGGGACATTAAGCCAGAAAACTTGTTGCTTGATCATGAGGGCCGGTTGAAAATTGCAGATTTTGGATGGTCTGTTCAGTCAAGAAGCAAGAGACATACCATGTGCGGAACTC
TGGATTATTTAGCCCCAGAAATGGTAGAGAACAAAGCTCACGACTACGCAGTAGATAATTGGACTCTGGGCATCCTATGCTATGAGTTCCTTTATGGTGTGCCTCCATTTGAAGCCGAAAGTCAAAGTGACACATTTAAAAGGATAATGAAGGTTGACTTGAATTTTCCGTTGACCCGTCACGTTTCCCCAGAAGCAAAAGATCTCATTGGCCGGCTCTTGGTGAAGGATTCCTCTAGAAGACTTTCACTTCAGAAGATAGTGGAGCACCCATGGATAATCAGGAATGCAGATCCATCTGGTATTTGCAATAGTGAGGTTTTAGGGGTCAAATATGACATAACAGAGACTTGCAGGAACGTTACTCGAAGATTTACCTCTTACCGGGGAAATGTTTTCGTAGGCAGTGATAGTGGTGTCGGCTCGGGTTTTGTGAGCAGTGTTCCTGGTGAAGGTAGCAGGGTAGTGCCAGAGGTAGCAGCAGAAGTAGCAAGAACAGCTTCCTTTTGGGAGGCCTTCAAATGCTTCTTTGAACCTCTGTGCATTGGTGTTGGGGGCACTGGCAGGGACATTTCTGATGATTTTGGTTGCATCAGCACAGCTGGTAAGAATGGTTGTTGCTGCATTCTTGCTGCAGGTAATGGTTTCAGTAACTGTTTTAGGACCTGTTTTGGGACGGGGACTATGGCTTCGACGGGGGAACGGCAGGAGCATCAACCGTACATCTTACATCTCCGACTGGATTATCGAGTAGAATTGTCTTTGATGGAGAGTCAATTTCAGAAGCTTCCACAGGCTTTCTTGAACGCTGTCGGCCTCTATGCATGTGCCGTTCACAGTATTTCTGATGCAGAACAGTATTCCGGCTACATCTCCATTTCTTCCCATCAGTTCTTCTACATCTCCCTGGTTCAGGATCCATCATACTCCTGTAGTCGAATCCCAGAGGACTGA

Coding sequence (CDS)

ATGAAGCGGAAAATATTCCAAAGCAACGACGAGAGGATCAGAGTGATTCAGGCATTGTTGTTGCAGGAAATGAATATGACGCTGCAGGAGAGAAGGAGATGGAACCGAAAACTTGATTGTGAAGCCGAATTATCAAGTCAAGCTCGAAATCTTCTTCTTCGATTCGCAGCTCCGACCAAGCGAAAATTGAGAAAGAGGTCGTCGAGAAAATTAGCGGGGACGGGGAGAGAGAGGATGCGAATTGAGCGCGAGAGGGAAGGAAGATTACGGAGCTATTATGAGGCTCCGCCCGTTGGATTTGTTGCCGATTTTTGGTCGTCTGACTCTCCGGCGTCTCCGCTCCCTCTAATAGGCAAGCTCCGGTGCTGTGCATGCGAACTTGGGAAAAAACCATCGATTTCGCCGAAATTGAATTTTCCTTTTCGAAACGGAGCTCAATCTTCCCAATCATACACGACTGGAGCAACGCAGAGAATGAAGTTGGAAAATTTAGTAAACTGGTTAGAACTACCTGCCTTCTTTCCCCTCACTTGCCTTACCTTTACGGATTACTTGCTCACTGTGTTTGTATGCTTAGGTATGGATTTTGTCAGAGGTGTAGTTGACTATCTTGGATCAATCTTCTCAGAGACGAGCTCAATTCACGAGTCGCCGCATAATCCTAGCGGTGTGGGTGCTTCAACCATGGAAGGCGTTAATGGAGTTCCTGTTTTGAACGAGCGCTATGCTTCCAAGCTCAAAGGGTACTTCGATTTGGCAAAGGAGGAGATCGCCAAGGCCGTCAGAGCGGAGGAGTGGGGCATAATCGACGATGCGATCCTGCACTACCAGAATGCTCAGCGCATTCTGGCCGAGGCCAGTTCAACCGCTGTGCCTTCGTTTATCAGTTCCAGCGAACAAGAGAAGGTGAAATCTCATAGACAAAAGATATCAAAGTGGCAGAGTCAAGTTTCTGAGAGATTACAAGCTTTAAATATGCGAGCAGGTGTTACATCCACAAACAAGAGCTCCTTGAATCATGTGCAAAGAGCTGGAATTGCTTCAACAATGTCAAATACTAAAAAAGCAGTGTTAAGGAGCTCTTCTCATAGTGTTGCAAGTAATCCAATAACAAGAAGTCAACCACCTAATGTTGGAACTTCAAAATCTATGCAAGAAGTTCCTAATGGATACGATGCAAAATTGGTTGAAATGATAAATACTGCTATAGTGGATCATAGTCCTTCTGTAAAGTGGGATGATATTGCTGGACTTCAGAAGGCAAAACAAGCTCTATTGGAGATGGTTATTTTGCCTACAAAGAGAAGAGACTTATTCACTGGCCTCCGAAAGCCAGCTAGAGGTCTTCTCCTCTTTGGTCCACCCGGTAATGGAAAGACTATGCTTGCTAAAGCTGTAGCTTCAGAATCAGAAGCCACCTTTTTTAATGTTTCAGCCGCATCCTTAACATCAAAATGGGTGGGGGAAGGTGAAAAGCTTGTACGGACTCTCTTCATGGTTGCTAAATCCAGGCAGCCCTCCGTAATTTTCATGGATGAAATTGATAGTGTCATGTCAACAAGGCAGGCTAATGAAAATGAAGCTAGCCGGAGGCTGAAGTCAGAGTTTCTTGTACAGTTTGATGGAGTAACATCAAATTCTACTGATCTTGTAATTGTAATTGGTGCTACTAATAAGCCACAAGAACTGGATGATGCAGTTCTCAGGAGATTGGTGAAGAGAATTTACATTCCCTTGCCAGATGATAATGTTAGAAGACTTCTTCTCAAGCACATACTCAAGGGACAGTCATTTTCCCTACCAAGTAGAGAAGTAGAAAGACTAGTTAGAGAGACTGAAGGATACTCTGGAAGCGATCTACAAGCCTTGTGCGAGGAAGCTGCAATGATGCCAATTAGGGAGCTAGGTGGGAACATTCTCACAGTAAAGGCAAATCAGATAAGGCCATTAAGGTATGAAGATTTTAAAGAGGCAATGAAAGTCATCAGACCCAGTTTAAA
CAAAAGCAGGGAATTTGATGCTGAAGTGAGTTTGAAGCCTCTGAGTAATTTTAACATTCAATTTGATTCAAGCATGGAAGTGGATATCAATATCTTCACCGTCTTTTCCTTCGTATTATGCACAGTCTTCCTCTTCTTTCTATCCTTCTTGATCCTCCTCCTCCTCCGAACGCTCGCCGGAAAATCCATAACGAGCTCCGAGTACACGCCGGTGTACGGCACCGTCTACGGTCAGGCTTTCTATTTCAACAACCTGTACGATCATCTAACGGAGGTGGCCAAGAGACATCGAACCTTCCGGCTGCTTGCGCCGGCGTACAGCGAGATATACACGACCGATCCGAGAAACATCGAGCATATGTTGAAGACGAAATTCGATAAGTATTCGAAAGGAAGCAAGGATCAAGAAATCGTTGGGGATCTGTTTGGAGAGGGGATATTTGCAGTCGATGGAGATAAGTGGAAGCAGCAGAGGAAGCTGGCTAGCTATGAATTCTCGACGAGGATTCTTAGGGATTTTAGCTGCTCGGTTTTCAGACGAAGTGCTGCTAAACTTGATGGAGTTGTTTCGGAGTTTTCCAGCATGGGTCGGGTTTTTGATATCCAGGATTTGCTAATGCGGTGCGCTTTGGACTCCATTTTTAAAGTGGGGTTCGGGGTTGATTTGAATTGCTTGGAGGAATCAAGCAAAGAAGGGAGCGATTTCATGAAAGCCTTCGATGATTCTAGCGCTCAGATTTTTTGGCGCTATATCGATCCCTTCTGGAAATTGAAGAGATTGCTTAACATCGGTTCCGAAGCTTCGTTTAGGAACAACATAAAAACCATAGATGCTTTTGTGCACCAGTTGATCAGAGACAAGAGAAAATTGCTTCAGCAACCGAATCACAATGACAAAGAGGACATACTTTGGAGGTTTCTGATGGAAAGTGAGAAGGATCCAACAAGAATGAATGATCAATATCTAAGGGATATAGTCCTCAATTTCATGTTGGCTGGCAAAGATTCAAGTGGAGGAACTCTGTCCTGGTTCTTCTACATGCTATGCAAGAACCCTTTAATACAGGAAAAAGTTGCAGAAGAAGTGAGGCAAATTGTTGCGTTTGAAGGGGAAGAAGTTGACATCAATTTGTTCATACAAAACTTAACTGATTCAGCTCTTGACAAAATGCATTATCTTCATGCAGCATTGACCGAGACTCTGAGGCTATATCCTGCAGTCCCTTTGGATGGAAGGACTGCAGAAATAGATGACATTCTTCCTGATGGCTATAAACTAAGAAAAGGGGATGGAGTATACTACATGGCCTATTCCATGGGCAGGATGTCCTCCCTTTGGGGAGAAGATGCTGAAGATTTTAAACCCGAAAGATGGCTTGAAATGGAACTTTTCAACCCGAATCACCTTTCAAATTCATCGCTTTCATATTTCTTGCTATCGGAGTTCTACGGTGGCGAACCGATGAAATCTCAGGCGAAGGACGAAGATGAAACTCCAAAGCGGCAGTGGTCTTTGCAAGACTTCGACGTAGGAAAGCCCCTCGGCAAAGGAAAATTCGGCAGAGTCTATCTAGCGAGAGAAGTCAAGAGCAAGTATATAGTGGCACTGAAGGTGATTTTCAAGGAGCAGATGGAGAAGTACAGGATTCATCATCAGTTAAGGAGAGAGATGGAGATACAGACCAGTCTTCGGCACCCGAACATCCTGCGCCTCTATGGATGGTTTCATGATGCTGAACGGATTTTCTTGATATTGGAATACGCTCACCGCGGCGAGCTCTATAGGGAGCTTAGGAAAAGTGGTCATCTCAGCGAGAAGCAAGCTGCCACTTACATTTTAAGCCTCACCCAAGCATTGGCATACTGCCATGAGAAGCATGTGATTCATAGGGACATTAAGCCAGAAAACTTGTTGCTTGATCATGAGGGCCGGTTGAAAATTGCAGATTTTGGATGGTCTGTTCAGTCAAGAAGCAAGAGACATACCATGTGCGGAACTC
TGGATTATTTAGCCCCAGAAATGGTAGAGAACAAAGCTCACGACTACGCAGTAGATAATTGGACTCTGGGCATCCTATGCTATGAGTTCCTTTATGGTGTGCCTCCATTTGAAGCCGAAAGTCAAAGTGACACATTTAAAAGGATAATGAAGGTTGACTTGAATTTTCCGTTGACCCGTCACGTTTCCCCAGAAGCAAAAGATCTCATTGGCCGGCTCTTGGTGAAGGATTCCTCTAGAAGACTTTCACTTCAGAAGATAGTGGAGCACCCATGGATAATCAGGAATGCAGATCCATCTGGTATTTGCAATAGTGAGGTTTTAGGGGTCAAATATGACATAACAGAGACTTGCAGGAACGTTACTCGAAGATTTACCTCTTACCGGGGAAATGTTTTCGTAGGCAGTGATAGTGGTGTCGGCTCGGGTTTTGTGAGCAGTGTTCCTGGTGAAGGTAGCAGGGTAGTGCCAGAGGTAGCAGCAGAAGTAGCAAGAACAGCTTCCTTTTGGGAGGCCTTCAAATGCTTCTTTGAACCTCTGTGCATTGGTGTTGGGGGCACTGGCAGGGACATTTCTGATGATTTTGGTTGCATCAGCACAGCTGGTAAGAATGGTTGTTGCTGCATTCTTGCTGCAGGTAATGGTTTCAGTAACTGTTTTAGGACCTGTTTTGGGACGGGGACTATGGCTTCGACGGGGGAACGGCAGGAGCATCAACCGTACATCTTACATCTCCGACTGGATTATCGAGTAGAATTGTCTTTGATGGAGAGTCAATTTCAGAAGCTTCCACAGGCTTTCTTGAACGCTGTCGGCCTCTATGCATGTGCCGTTCACAGTATTTCTGATGCAGAACAGTATTCCGGCTACATCTCCATTTCTTCCCATCAGTTCTTCTACATCTCCCTGGTTCAGGATCCATCATACTCCTGTAGTCGAATCCCAGAGGACTGA

Protein sequence

MKRKIFQSNDERIRVIQALLLQEMNMTLQERRRWNRKLDCEAELSSQARNLLLRFAAPTKRKLRKRSSRKLAGTGRERMRIEREREGRLRSYYEAPPVGFVADFWSSDSPASPLPLIGKLRCCACELGKKPSISPKLNFPFRNGAQSSQSYTTGATQRMKLENLVNWLELPAFFPLTCLTFTDYLLTVFVCLGMDFVRGVVDYLGSIFSETSSIHESPHNPSGVGASTMEGVNGVPVLNERYASKLKGYFDLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKISKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPITRSQPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDIAGLQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFLVQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQSFSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKEAMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVHQLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEMELFNPNHLSNSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGICNSEVLGVKYDITETCRNVTRRFTSYRGNVFVGSDSGVGSGFVSSVPGEGSRVVPEVAAEVARTASFWEAFKCFFEPLCIGVGGTGRDISDDFGCISTAGKNGCCCILAAGNGFSNCFRTCFGTGTMASTGERQEHQPYILHLRLDYRVELSLMESQFQKLPQAFLNAVGLYACAVHSISDAEQYSGYISISSHQFFYISLVQDPSYSCSRIPED
Homology
BLAST of Sgr015962 vs. NCBI nr
Match: KAA0049019.1 (cytochrome P450 704C1-like isoform X1 [Cucumis melo var. makuwa] >TYK17545.1 cytochrome P450 704C1-like protein isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1243.0 bits (3215), Expect = 0.0e+00
Identity = 620/801 (77.40%), Positives = 679/801 (84.77%), Query Frame = 0

Query: 692  MEVDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNN 751
            MEV+ NI T F+ VLC   LFFLSF I LLL+TLAGKSIT+S+Y+PVYGT+YGQAFY NN
Sbjct: 1    MEVNFNIITFFTVVLC---LFFLSFFI-LLLKTLAGKSITNSDYSPVYGTIYGQAFYLNN 60

Query: 752  LYDHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGE 811
            LYDHLT VAKRHRTFRLL  +YSEIYT DPRN+EH+LKTKF+ Y KGSKDQE+ GDLFGE
Sbjct: 61   LYDHLTAVAKRHRTFRLLGESYSEIYTVDPRNVEHILKTKFENYRKGSKDQEVCGDLFGE 120

Query: 812  GIFAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDL 871
            GIFAVDG+KWK+QRKLASYEFST+ILRDFSCSVFRR+A KL G+VSEFS+M RVFD+QDL
Sbjct: 121  GIFAVDGEKWKEQRKLASYEFSTKILRDFSCSVFRRNAEKLVGIVSEFSTMARVFDVQDL 180

Query: 872  LMRCALDSIFKVGFGVDLNCLEESSKEGS--DFMKAFDDSSAQIFWRYIDPFWKLKRLLN 931
            LMRC+LDSIFKVGFGVDLNC+EE SK      FM+AFDD+SAQ+FWRYIDPFWKLKR LN
Sbjct: 181  LMRCSLDSIFKVGFGVDLNCMEEPSKAAGRRGFMEAFDDASAQVFWRYIDPFWKLKRFLN 240

Query: 932  IGSEASFRNNIKTIDAFVHQLIRDKRKLLQQPNHN-DKEDILWRFLMESEKDPTRMNDQY 991
            IGSEASFRNN+K IDAFVHQLI  +RKLL QPN   DKEDIL RFLMESEKDPTRMNDQY
Sbjct: 241  IGSEASFRNNLKIIDAFVHQLISARRKLLHQPNLKIDKEDILSRFLMESEKDPTRMNDQY 300

Query: 992  LRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQN 1051
            LRDIVLNFMLAG+D+S GTLSWFFYMLCKNPLIQEKVAEEV QIV  +GEE DINLF+QN
Sbjct: 301  LRDIVLNFMLAGRDTSAGTLSWFFYMLCKNPLIQEKVAEEVSQIVGVQGEETDINLFVQN 360

Query: 1052 LTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGR 1111
            LTDSALDKMHYLHAALTETLRLYPAVP+DGRTAE DDILPDGYKLRKGDGVYY+AYSMGR
Sbjct: 361  LTDSALDKMHYLHAALTETLRLYPAVPIDGRTAETDDILPDGYKLRKGDGVYYLAYSMGR 420

Query: 1112 MSSLWGEDAEDFKPERWLEMELFNPNH--------------------------LSNSSLS 1171
            M  LWGEDAEDFKPERWLE   F P                            +S + L 
Sbjct: 421  MPCLWGEDAEDFKPERWLENGTFRPESPFKFISFHAGPRMCLGKDFAYRQMKIVSAALLQ 480

Query: 1172 YFLLS-------------------------EFYGGEPMKSQAKDEDETPKRQWSLQDFDV 1231
            +F                            EF GG+PMK  AK+ +E+PKRQWSL+DFDV
Sbjct: 481  FFRFKLADPTRNVTYRIMLTLHIDGDFSQPEFDGGDPMKYLAKNNNESPKRQWSLKDFDV 540

Query: 1232 GKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILR 1291
            GKPLGKGKFGRVYLAREV+SKYIVALKVIF+EQM+KY IH QL REMEIQTSLRHPNILR
Sbjct: 541  GKPLGKGKFGRVYLAREVRSKYIVALKVIFREQMKKYGIHRQLMREMEIQTSLRHPNILR 600

Query: 1292 LYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHR 1351
            LYGWFHDAERIF+ILEYAHRGELYRELRK+GHLSEKQAATY+LSL QALAYCHEK VIHR
Sbjct: 601  LYGWFHDAERIFMILEYAHRGELYRELRKNGHLSEKQAATYMLSLAQALAYCHEKDVIHR 660

Query: 1352 DIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLG 1411
            DIKPENLLLDHEGRLKI DFGW+VQSRSKR+TMCGTLDYLAPEMVENK HD+A+DNWT+G
Sbjct: 661  DIKPENLLLDHEGRLKIGDFGWAVQSRSKRYTMCGTLDYLAPEMVENKGHDFAIDNWTMG 720

Query: 1412 ILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSL 1439
            ILCYEFLYGVPPFEAESQ+DTFKRI KV+LNFP T H+S EAKDLIGRLLVKD+S+RLSL
Sbjct: 721  ILCYEFLYGVPPFEAESQNDTFKRIRKVELNFPSTPHISTEAKDLIGRLLVKDASKRLSL 780
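
The Identity and Positives figures in each HSP summary line are simple ratios over the alignment length, so they can be checked mechanically. The following is a minimal sketch, assuming the summary line has the layout shown in this report; the names `parse_hsp_stats` and `percentages_consistent` are hypothetical helpers introduced here for illustration and are not part of BLAST or of this database.

```python
import re

# Matches the HSP summary line as it appears in this report.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the identity/positive counts and reported percentages, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percentages_consistent(stats, tol=0.01):
    """Check that the reported percentages match the count/length ratios."""
    ident_calc = 100.0 * stats["identical"] / stats["length"]
    pos_calc = 100.0 * stats["positives"] / stats["positives_length"]
    return (
        abs(ident_calc - stats["identity_pct"]) <= tol
        and abs(pos_calc - stats["positives_pct"]) <= tol
    )


if __name__ == "__main__":
    line = "Identity = 620/801 (77.40%), Positives = 679/801 (84.77%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    print(stats, percentages_consistent(stats))
```

Applied to the first hit above, 620 identical residues over an alignment of 801 columns gives 77.40%, and 679 positives gives 84.77%, in agreement with the reported values.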

BLAST of Sgr015962 vs. NCBI nr
Match: CAB4269788.1 (unnamed protein product [Prunus armeniaca])

HSP 1 Score: 983.0 bits (2540), Expect = 3.1e-282
Identity = 608/1259 (48.29%), Positives = 694/1259 (55.12%), Query Frame = 0

Query: 194  MDFVRGVVDYLGSIFSETSSIHES-PHNPSGVGASTMEGV--NGVPVLNERYASKLKGYF 253
            M F++G++D  GS+FS  SS +ES P++ S   +S MEG+   G  V NER A KL+GYF
Sbjct: 1    MSFLKGIIDSFGSVFSAASSSYESHPNSDSPPNSSIMEGIAGPGASVSNERVAYKLRGYF 60

Query: 254  DLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKI 313
            DLAK+EIAKAVRAEEWG++DDAI HY NAQR+L EA+ST VPS+IS SE+EKVKS+RQKI
Sbjct: 61   DLAKDEIAKAVRAEEWGLVDDAIAHYNNAQRVLVEATSTPVPSYISPSEREKVKSYRQKI 120

Query: 314  SKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPI 373
            SKWQ +VSERLQAL+ RAG TS +KS+L H Q A +  T SN +K VL  S     + P 
Sbjct: 121  SKWQGEVSERLQALSRRAGGTSVSKSTLAHAQTAVVKPTTSNARKHVLPKSPRPTTNRPE 180

Query: 374  TRS--QPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDI-----------AG 433
            TR+  Q  N+ +SK +QE   GYDAKLVEMIN+AIVD SPSV+W+D+           AG
Sbjct: 181  TRNQIQTNNIVSSKPVQETGGGYDAKLVEMINSAIVDRSPSVQWEDVDYQVTHFSFSAAG 240

Query: 434  LQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNV 493
            L+K K+ L+EMVILPTKRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASESEATFFNV
Sbjct: 241  LEKVKKTLMEMVILPTKRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESEATFFNV 300

Query: 494  SAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFL 553
            SA+SLTSKWVGE EKLVRTLF+VA SRQPSVIFMDEIDS+MSTR ANEN+ASRRLKSEFL
Sbjct: 301  SASSLTSKWVGEAEKLVRTLFLVAISRQPSVIFMDEIDSIMSTRLANENDASRRLKSEFL 360

Query: 554  VQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQS 613
            +QFDGVTSN  DLVIVIGATNKPQELDDA     VKR+YIPLPD   RRLLL+H LKGQ+
Sbjct: 361  IQFDGVTSNPNDLVIVIGATNKPQELDDA-----VKRVYIPLPDLTARRLLLRHKLKGQA 420

Query: 614  FSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKE 673
            FSLPS ++ERL  ETEGYSGSDLQALCEEAAMMPIRELG NILT+KANQ+RPLRYEDF++
Sbjct: 421  FSLPSGDLERLAGETEGYSGSDLQALCEEAAMMPIRELGENILTIKANQVRPLRYEDFQK 480

Query: 674  AMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCTVFLFFLSFL 733
            AM VIRPSL+KS                                                
Sbjct: 481  AMTVIRPSLSKS------------------------------------------------ 540

Query: 734  ILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRLLAPAYSEIY 793
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 794  TTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLASYEFSTRIL 853
                                                      KW++  +      +T   
Sbjct: 601  ------------------------------------------KWEELERQPYLYAATGTK 660

Query: 854  RDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVDLNCLEESSK 913
             D                                                      E S+
Sbjct: 661  ND-----------------------------------------------------AEKSQ 720

Query: 914  EGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVHQLIRDKRKL 973
            EG                         +++ ++ S A                   KRK 
Sbjct: 721  EGLP-----------------------RKIYSLSSGA-------------------KRK- 780

Query: 974  LQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTLSWFFYMLCK 1033
                                                                        
Sbjct: 781  ------------------------------------------------------------ 824

Query: 1034 NPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLD 1093
                                        IQN                             
Sbjct: 841  ----------------------------IQNF---------------------------- 824

Query: 1094 GRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEMELFNPNHLS 1153
                                                                     H+ 
Sbjct: 901  ---------------------------------------------------------HIV 824

Query: 1154 NSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKS 1213
              SLS             KS+  +  E+PKR+WSL+DF++GKPLGKGKFGRVY+ARE KS
Sbjct: 961  VVSLSL-----------EKSEEMERQESPKREWSLRDFEIGKPLGKGKFGRVYVAREAKS 824

Query: 1214 KYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHR 1273
            KYIVALKVIFKEQ+EKY+I HQLRREMEIQTSLRHPNILRLYGWFHD ERIFLILEYAH 
Sbjct: 1021 KYIVALKVIFKEQIEKYKIQHQLRREMEIQTSLRHPNILRLYGWFHDDERIFLILEYAHG 824

Query: 1274 GELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADF 1333
            GELY  LRK+ +LSEKQAATYILSLTQALAYCHEK+VIHRDIKPENLLLDHEGRLKIADF
Sbjct: 1081 GELYGLLRKTNYLSEKQAATYILSLTQALAYCHEKNVIHRDIKPENLLLDHEGRLKIADF 824

Query: 1334 GWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSD 1393
            GWSVQSRSKR TMCGTLDYLAPEMVEN+ HDYAVDNWTLGILCYEFLYG+PPFEAESQ+D
Sbjct: 1141 GWSVQSRSKRQTMCGTLDYLAPEMVENRPHDYAVDNWTLGILCYEFLYGIPPFEAESQTD 824

Query: 1394 TFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGIC 1437
            TFKRI+KVDLNFP     S EAK LI RLLVKDSS+RLSLQ+I+EHPWI++NADPSGIC
Sbjct: 1201 TFKRIIKVDLNFPSEPQTSAEAKHLITRLLVKDSSKRLSLQRIMEHPWIVKNADPSGIC 824

BLAST of Sgr015962 vs. NCBI nr
Match: PQM41947.1 (hypothetical protein Pyn_04712 [Prunus yedoensis var. nudiflora])

HSP 1 Score: 961.8 bits (2485), Expect = 7.4e-276
Identity = 593/1248 (47.52%), Positives = 667/1248 (53.45%), Query Frame = 0

Query: 194  MDFVRGVVDYLGSIFSETSSIHES-PHNPSGVGASTMEGV--NGVPVLNERYASKLKGYF 253
            M F++G++D  GS+FS  SS +ES P++ S   +STMEG+   G  V NER A KL+GYF
Sbjct: 1    MSFLKGIIDSFGSVFSAASSSYESHPNSDSPPNSSTMEGIAGPGASVSNERVAYKLRGYF 60

Query: 254  DLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKI 313
            DLAK+EIAKAVRAEEWG++DDAI HY NAQR+L EA+ST VPS+IS SE+EKVKS+RQKI
Sbjct: 61   DLAKDEIAKAVRAEEWGLVDDAIAHYNNAQRVLVEATSTPVPSYISLSEREKVKSYRQKI 120

Query: 314  SKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPI 373
            SKWQ +VSERLQAL+ RAG TS +KS+L+H Q A +  T SN++K VL  S     + P 
Sbjct: 121  SKWQGEVSERLQALSRRAGGTSVSKSTLDHAQTAVVRPTTSNSRKHVLPKSPRPTTNRPE 180

Query: 374  TRS--QPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDIAGLQKAKQALLEM 433
            TR+  Q  N+ +SK +QE   GYDAKLVEMIN+AIVD SPSVKW+D+AGL+K K+ L+EM
Sbjct: 181  TRNQIQTNNIVSSKPVQETGGGYDAKLVEMINSAIVDRSPSVKWEDVAGLEKVKKTLMEM 240

Query: 434  VILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSAASLTSKWVG 493
            VILPTKRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASESEATFFNVSA+SLTSKWVG
Sbjct: 241  VILPTKRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSASSLTSKWVG 300

Query: 494  EGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFLVQFDGVTSNST 553
            E EKLVRTLFMVA SRQPSVIFMDEIDS+MSTR ANE++ASRRLKSEFL+QFDGVTSN  
Sbjct: 301  EAEKLVRTLFMVAISRQPSVIFMDEIDSIMSTRLANEHDASRRLKSEFLMQFDGVTSNPN 360

Query: 554  DLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQSFSLPSREVERL 613
            DLVIVIGATNKPQELDDAVLRRLVKR+YIPLPD   RRLLL+H LKGQ+FSLPS ++ERL
Sbjct: 361  DLVIVIGATNKPQELDDAVLRRLVKRVYIPLPDLTARRLLLRHKLKGQAFSLPSGDLERL 420

Query: 614  VRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKEAMKVIRPSLNK 673
             RETEGYSGSDLQALCEEAAMMPIRELG NILT+KANQ+RPLRYEDF++AM         
Sbjct: 421  ARETEGYSGSDLQALCEEAAMMPIRELGENILTIKANQVRPLRYEDFQKAM--------- 480

Query: 674  SREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGK 733
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 734  SITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHML 793
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 794  KTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRS 853
             TK                            KWK                          
Sbjct: 601  -TK----------------------------KWK-------------------------- 660

Query: 854  AAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVDLNCLEESSKEGSDFMKAFDD 913
                                                                        
Sbjct: 661  ------------------------------------------------------------ 720

Query: 914  SSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVHQLIRDKRKLLQQPNHNDKED 973
                                                                        
Sbjct: 721  ------------------------------------------------------------ 754

Query: 974  ILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEE 1033
                                                                        
Sbjct: 781  ------------------------------------------------------------ 754

Query: 1034 VRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILP 1093
                                                                        
Sbjct: 841  ------------------------------------------------------------ 754

Query: 1094 DGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEMELFNPNHLSNSSLSYFLLSE 1153
                                                                        
Sbjct: 901  ------------------------------------------------------------ 754

Query: 1154 FYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFK 1213
                 P K Q     E+PKR+WSL+DF++GKPLGKGKFGRVY+ARE KSKYIVALKVIFK
Sbjct: 961  -----PRKGQ-----ESPKREWSLRDFEIGKPLGKGKFGRVYVAREAKSKYIVALKVIFK 754

Query: 1214 EQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSG 1273
            EQ+EKY+I HQLRREMEIQTSLRHPNILRLYGWFHD ERIFLILEYAH GELY  LRK+ 
Sbjct: 1021 EQIEKYKIQHQLRREMEIQTSLRHPNILRLYGWFHDDERIFLILEYAHGGELYGLLRKTN 754

Query: 1274 HLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRH 1333
            +LSEKQAATYILSLTQALAYCHEK+VIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKR 
Sbjct: 1081 YLSEKQAATYILSLTQALAYCHEKNVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRQ 754

Query: 1334 TMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLN 1393
            TMCGTLDYLAPEMVEN+ HDYAVDNWTLGILCYEFLYGVPPFEAESQ+DTF+RI+KVDL+
Sbjct: 1141 TMCGTLDYLAPEMVENRPHDYAVDNWTLGILCYEFLYGVPPFEAESQTDTFRRIIKVDLS 754

Query: 1394 FPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGIC 1437
            FP     S EAK LI RLLVKDSS+RLSLQ+I+EHPWI++NADPSGIC
Sbjct: 1201 FPSEPQASAEAKHLITRLLVKDSSKRLSLQRIMEHPWIVKNADPSGIC 754

BLAST of Sgr015962 vs. NCBI nr
Match: OMP00538.1 (hypothetical protein COLO4_12580 [Corchorus olitorius])

HSP 1 Score: 893.6 bits (2308), Expect = 2.5e-255
Identity = 560/1206 (46.43%), Positives = 614/1206 (50.91%), Query Frame = 0

Query: 229  MEGVNGVPVLNERYASKLKGYFDLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASS 288
            M+GV G    NER A KL+GYFDLAKEEI KAVRAEEWG+I+DA++HY+NA+RIL EASS
Sbjct: 1    MDGVAG----NERVAYKLRGYFDLAKEEIDKAVRAEEWGLIEDALVHYRNAERILVEASS 60

Query: 289  TAVPSFISSSEQEKVKSHRQKISKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIAS 348
            T VP +ISSSEQEKVKS+RQKISKWQ QVSERLQ L  RAG  ST            I+ 
Sbjct: 61   TPVPLYISSSEQEKVKSYRQKISKWQGQVSERLQVLGRRAGGPST------------ISP 120

Query: 349  TMSNTKKAVLRSSSHSVASNPITRSQPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSP 408
              SN ++ VL+ S      N + R+Q   VGT K  QE  NGYD+K++EMINTAIVD SP
Sbjct: 121  RTSNPRRDVLQKSPR----NQVVRNQGDRVGTPKPAQESANGYDSKMIEMINTAIVDRSP 180

Query: 409  SVKWDDIAGLQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVAS 468
            SVKW+D+AGL KAKQAL+EMVILPT+RRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVAS
Sbjct: 181  SVKWEDVAGLDKAKQALMEMVILPTRRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVAS 240

Query: 469  ESEATFFNVSAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEA 528
            ES+ATFFNVSA+SLTSKWVGEGEKLVRTLFMVA S+QPSVIF+DEIDSV+STR  NE++A
Sbjct: 241  ESQATFFNVSASSLTSKWVGEGEKLVRTLFMVAISKQPSVIFIDEIDSVLSTRLENEHDA 300

Query: 529  SRRLKSEFLVQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLL 588
            SRRLKSEFL+QFDGVTSN  DLVIVIGATNKPQELDDAVLRRLVKRIY+PLPD+NVRRLL
Sbjct: 301  SRRLKSEFLIQFDGVTSNPNDLVIVIGATNKPQELDDAVLRRLVKRIYVPLPDENVRRLL 360

Query: 589  LKHILKGQSFSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIR 648
            L++ LKGQ+FSLP R++ERLVRETEGYSGSDLQALCEEAAMMPIRELG NILTVKANQ  
Sbjct: 361  LQNKLKGQAFSLPGRDLERLVRETEGYSGSDLQALCEEAAMMPIRELGSNILTVKANQ-- 420

Query: 649  PLRYEDFKEAMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCT 708
                                                                        
Sbjct: 421  ------------------------------------------------------------ 480

Query: 709  VFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRL 768
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 769  LAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLA 828
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 829  SYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVD 888
                                                                        
Sbjct: 601  ------------------------------------------------------------ 660

Query: 889  LNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVH 948
                                                                        
Sbjct: 661  ------------------------------------------------------------ 673

Query: 949  QLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTL 1008
                                                                        
Sbjct: 721  ------------------------------------------------------------ 673

Query: 1009 SWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETL 1068
                                                                        
Sbjct: 781  ------------------------------------------------------------ 673

Query: 1069 RLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEM 1128
                                                                        
Sbjct: 841  ------------------------------------------------------------ 673

Query: 1129 ELFNPNHLSNSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGR 1188
                                           K+E+   KR WS++DFD+GKPLGKGKFGR
Sbjct: 901  -------------------------------KEEETNVKRDWSIKDFDIGKPLGKGKFGR 673

Query: 1189 VYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERI 1248
            VYLAREVKSKYIVALKVIFKEQ+EKY+IHHQLRREMEIQTSLRHPNILRLYGWFHD ERI
Sbjct: 961  VYLAREVKSKYIVALKVIFKEQIEKYKIHHQLRREMEIQTSLRHPNILRLYGWFHDNERI 673

Query: 1249 FLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDH 1308
            FLILEYAH GELY+ELRK GHLSEKQAATYI SLT ALAYCHEK+VIHRDIKPENLLLDH
Sbjct: 1021 FLILEYAHGGELYKELRKKGHLSEKQAATYIASLTTALAYCHEKNVIHRDIKPENLLLDH 673

Query: 1309 EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVP 1368
            EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYG P
Sbjct: 1081 EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGSP 673

Query: 1369 PFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIR 1428
            PFEAESQ DTF+RIM VDL+FP T HVS EAK+LI RLLVKDS +RLSLQKI+EHPWII+
Sbjct: 1141 PFEAESQRDTFRRIMNVDLSFPPTPHVSMEAKNLISRLLVKDSYKRLSLQKIMEHPWIIK 673

Query: 1429 NADPSG 1435
            NADP G
Sbjct: 1201 NADPLG 673

BLAST of Sgr015962 vs. NCBI nr
Match: OMO78635.1 (hypothetical protein CCACVL1_14257 [Corchorus capsularis])

HSP 1 Score: 828.9 bits (2140), Expect = 7.5e-236
Identity = 536/1213 (44.19%), Positives = 590/1213 (48.64%), Query Frame = 0

Query: 229  MEGVNGVPVLNERYASKLKGYFDLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASS 288
            M+GV G    NER A KL+GYFDLAK EI KAVRAEEWG+IDDA++HY+NA+RIL EA+S
Sbjct: 1    MDGVAG----NERVAYKLRGYFDLAKGEIDKAVRAEEWGLIDDALVHYRNAERILVEANS 60

Query: 289  TAVPSFISSSEQEKVKSHRQKISKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIAS 348
            T VP +ISSSEQEKVKS+RQKISKWQ QVSERLQ L  RAG  ST+K++L H Q A ++ 
Sbjct: 61   TPVPLYISSSEQEKVKSYRQKISKWQGQVSERLQVLGRRAGGPSTSKNTLTHAQTAAVSP 120

Query: 349  TMSNTKKAVL-RSSSHSVASNPITR------SQPPNVGTSKSMQEVPNGYDAKLVEMINT 408
              SN ++ VL +S  + V  N   R      +Q   VGT K+ QE  NGYD+K++EMINT
Sbjct: 121  RTSNPRRDVLQKSPRNQVVRNQADRVGTLKPAQADRVGTPKAAQESANGYDSKMIEMINT 180

Query: 409  AIVDHSPSVKWDDIAGLQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTM 468
            AIVD SPSVKW+D+AGL KAKQAL+EMVILPT+RRDLFTGLR+PARGLLLFGPPGNGKTM
Sbjct: 181  AIVDRSPSVKWEDVAGLDKAKQALMEMVILPTRRRDLFTGLRRPARGLLLFGPPGNGKTM 240

Query: 469  LAKAVASESEATFFNVSAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTR 528
            LAKAVASES+ATFFNVSA+SLTSKWVGEGEKLVRTLFMVA S+QPSVIF+DEIDSV+STR
Sbjct: 241  LAKAVASESQATFFNVSASSLTSKWVGEGEKLVRTLFMVAISKQPSVIFIDEIDSVLSTR 300

Query: 529  QANENEASRRLKSEFLVQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPD 588
              NE++ASRRLKSEFL+QFDGVTSN  DLVIVIGATNKPQELDDAVLRRLVKRIY+PLPD
Sbjct: 301  LENEHDASRRLKSEFLIQFDGVTSNPNDLVIVIGATNKPQELDDAVLRRLVKRIYVPLPD 360

Query: 589  DNVRRLLLKHILKGQSFSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILT 648
            +NVRRLLL++ LKGQ+FSLP R++ERLVRETEGYSGSDLQALCEEAAMMPIRELG NILT
Sbjct: 361  ENVRRLLLQNKLKGQAFSLPGRDLERLVRETEGYSGSDLQALCEEAAMMPIRELGSNILT 420

Query: 649  VKANQIRPLRYEDFKEAMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTV 708
            VKANQ                                                       
Sbjct: 421  VKANQ------------------------------------------------------- 480

Query: 709  FSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAK 768
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 769  RHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKW 828
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 829  KQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIF 888
                                                                        
Sbjct: 601  ------------------------------------------------------------ 659

Query: 889  KVGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIK 948
                                                                        
Sbjct: 661  ------------------------------------------------------------ 659

Query: 949  TIDAFVHQLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGK 1008
                                                                        
Sbjct: 721  ------------------------------------------------------------ 659

Query: 1009 DSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLH 1068
                                                                        
Sbjct: 781  ------------------------------------------------------------ 659

Query: 1069 AALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFK 1128
                                                                        
Sbjct: 841  ------------------------------------------------------------ 659

Query: 1129 PERWLEMELFNPNHLSNSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPL 1188
                                                                        
Sbjct: 901  ------------------------------------------------------------ 659

Query: 1189 GKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGW 1248
                           SKYIVALKVIFKEQ+EKY+IHHQLRREMEIQTSLRHPNILRLYGW
Sbjct: 961  ---------------SKYIVALKVIFKEQIEKYKIHHQLRREMEIQTSLRHPNILRLYGW 659

Query: 1249 FHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKP 1308
            FHD ERIFLILEYAH GELY+ELRK GHLSEKQAATYI SLT ALAYCHEK+VIHRDIKP
Sbjct: 1021 FHDNERIFLILEYAHGGELYKELRKKGHLSEKQAATYIASLTTALAYCHEKNVIHRDIKP 659

Query: 1309 ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY 1368
            ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY
Sbjct: 1081 ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY 659

Query: 1369 EFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIV 1428
            EFLYG PPFEAESQ DTF+RIM VDL+FP T HVS EAK+LI RLLVKDS +RLSLQKI+
Sbjct: 1141 EFLYGSPPFEAESQRDTFRRIMNVDLSFPPTPHVSMEAKNLISRLLVKDSYKRLSLQKIM 659

Query: 1429 EHPWIIRNADPSG 1435
            EHPWII+NADP G
Sbjct: 1201 EHPWIIKNADPLG 659

BLAST of Sgr015962 vs. ExPASy Swiss-Prot
Match: O64629 (Serine/threonine-protein kinase Aurora-3 OS=Arabidopsis thaliana OX=3702 GN=AUR3 PE=2 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 1.2e-132
Identity = 227/283 (80.21%), Positives = 255/283 (90.11%), Query Frame = 0

Query: 1156 KSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYR 1215
            KS   D   T K QWSL DF++G+PLGKGKFGRVYLARE KSKYIVALKVIFKEQ+EKY+
Sbjct: 4    KSTESDAGNTEK-QWSLADFEIGRPLGKGKFGRVYLAREAKSKYIVALKVIFKEQIEKYK 63

Query: 1216 IHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQA 1275
            IHHQLRREMEIQTSLRHPNILRL+GWFHD ERIFLILEYAH GELY  L+++GHL+E+QA
Sbjct: 64   IHHQLRREMEIQTSLRHPNILRLFGWFHDNERIFLILEYAHGGELYGVLKQNGHLTEQQA 123

Query: 1276 ATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLD 1335
            ATYI SL+QALAYCH K VIHRDIKPENLLLDHEGRLKIADFGWSVQS +KR TMCGTLD
Sbjct: 124  ATYIASLSQALAYCHGKCVIHRDIKPENLLLDHEGRLKIADFGWSVQSSNKRKTMCGTLD 183

Query: 1336 YLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHV 1395
            YLAPEMVEN+ HDYAVDNWTLGILCYEFLYG PPFEAESQ DTFKRI+K+DL+FPLT +V
Sbjct: 184  YLAPEMVENRDHDYAVDNWTLGILCYEFLYGNPPFEAESQKDTFKRILKIDLSFPLTPNV 243

Query: 1396 SPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGICNS 1439
            S EAK+LI +LLVKD S+RLS++KI++HPWI++NADP G+C S
Sbjct: 244  SEEAKNLISQLLVKDPSKRLSIEKIMQHPWIVKNADPKGVCAS 285

BLAST of Sgr015962 vs. ExPASy Swiss-Prot
Match: Q50EK3 (Cytochrome P450 704C1 OS=Pinus taeda OX=3352 GN=CYP704C1 PE=2 SV=1)

HSP 1 Score: 402.5 bits (1033), Expect = 2.3e-110
Identity = 206/451 (45.68%), Positives = 308/451 (68.29%), Query Frame = 0

Query: 694  VDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLY 753
            +D+NI T+  FV  +      S  I   LR    K +    Y PV GT+   A  F  L+
Sbjct: 1    MDVNILTM--FVTVSALALACSLWIASYLRNWRKKGV----YPPVVGTMLNHAINFERLH 60

Query: 754  DHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGI 813
            D+ T+ A+R++TFR++ P  S ++TTDP N+EH+LKT F  Y KG+ + +I+ DL G+GI
Sbjct: 61   DYHTDQAQRYKTFRVVYPTCSYVFTTDPVNVEHILKTNFANYDKGTFNYDIMKDLLGDGI 120

Query: 814  FAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLM 873
            F VDGDKW+QQRKLAS EF++++L+DFS  VF  +AAKL  ++++ + +    ++QDL M
Sbjct: 121  FNVDGDKWRQQRKLASSEFASKVLKDFSSGVFCNNAAKLANILAQAAKLNLSVEMQDLFM 180

Query: 874  RCALDSIFKVGFGVDLNCLEESSKEG---SDFMKAFDDSSAQIFWRY-IDPFWKLKRLLN 933
            R +LDSI KV FG+D+N L  S  E    + F KAFD ++A +F R+ +  FWK++R  N
Sbjct: 181  RSSLDSICKVVFGIDINSLSSSKAESGPEASFAKAFDVANAMVFHRHMVGSFWKVQRFFN 240

Query: 934  IGSEASFRNNIKTIDAFVHQLIRDKR-KLLQQPNHNDKEDILWRFLMESEKDPT-RMNDQ 993
            +GSEA  R+NIK +D F++++I  +R ++      N + DIL R+++ S+K+   +++D+
Sbjct: 241  VGSEAILRDNIKMVDDFLYKVIHFRRQEMFSAEKENVRPDILSRYIIISDKETDGKVSDK 300

Query: 994  YLRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEV-----DI 1053
            YLRD++LNFM+A +D++   LSWF YMLCK+  +QEK+ EE+    +   ++      DI
Sbjct: 301  YLRDVILNFMVAARDTTAIALSWFIYMLCKHQHVQEKLLEEIISSTSVHEDQYSTECNDI 360

Query: 1054 NLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYM 1113
              F Q+LTD AL KMHYLHA+L+ETLRLYPA+P+DG+    +D LPDG+K++KGD V ++
Sbjct: 361  ASFAQSLTDEALGKMHYLHASLSETLRLYPALPVDGKYVVNEDTLPDGFKVKKGDSVNFL 420

Query: 1114 AYSMGRMSSLWGEDAEDFKPERWLEMELFNP 1134
             Y+MGRMS LWG+DA++FKPERW++  +F+P
Sbjct: 421  PYAMGRMSYLWGDDAKEFKPERWIQDGIFHP 445

BLAST of Sgr015962 vs. ExPASy Swiss-Prot
Match: Q9M077 (Serine/threonine-protein kinase Aurora-1 OS=Arabidopsis thaliana OX=3702 GN=AUR1 PE=1 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 2.0e-106
Identity = 177/269 (65.80%), Positives = 228/269 (84.76%), Query Frame = 0

Query: 1167 KRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEI 1226
            +++W+L DFD+GKPLG+GKFG VYLARE +S ++VALKV+FK Q+++ ++ HQLRRE+EI
Sbjct: 23   QKRWTLSDFDIGKPLGRGKFGHVYLAREKRSNHVVALKVLFKSQLQQSQVEHQLRREVEI 82

Query: 1227 QTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQAL 1286
            Q+ LRHPNILRLYG+F+D +R++LILEYA RGELY++L+K  + SE++AATY+ SL +AL
Sbjct: 83   QSHLRHPNILRLYGYFYDQKRVYLILEYAARGELYKDLQKCKYFSERRAATYVASLARAL 142

Query: 1287 AYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKA 1346
             YCH KHVIHRDIKPENLL+  +G LKIADFGWSV + ++R TMCGTLDYL PEMVE+  
Sbjct: 143  IYCHGKHVIHRDIKPENLLIGAQGELKIADFGWSVHTFNRRRTMCGTLDYLPPEMVESVE 202

Query: 1347 HDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRL 1406
            HD +VD W+LGILCYEFLYGVPPFEA   SDT++RI++VDL FP    +S  AKDLI ++
Sbjct: 203  HDASVDIWSLGILCYEFLYGVPPFEAMEHSDTYRRIVQVDLKFPPKPIISASAKDLISQM 262

Query: 1407 LVKDSSRRLSLQKIVEHPWIIRNADPSGI 1436
            LVK+SS+RL L K++EHPWI++NADPSGI
Sbjct: 263  LVKESSQRLPLHKLLEHPWIVQNADPSGI 291

BLAST of Sgr015962 vs. ExPASy Swiss-Prot
Match: Q683C9 (Serine/threonine-protein kinase Aurora-2 OS=Arabidopsis thaliana OX=3702 GN=AUR2 PE=1 SV=2)

HSP 1 Score: 389.0 bits (998), Expect = 2.6e-106
Identity = 178/272 (65.44%), Positives = 229/272 (84.19%), Query Frame = 0

Query: 1164 ETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRRE 1223
            E  +++W+  DFD+GKPLG+GKFG VYLARE +S +IVALKV+FK Q+++ ++ HQLRRE
Sbjct: 8    EAAQKRWTTSDFDIGKPLGRGKFGHVYLAREKRSDHIVALKVLFKAQLQQSQVEHQLRRE 67

Query: 1224 MEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLT 1283
            +EIQ+ LRHPNILRLYG+F+D +R++LILEYA RGELY+EL+K  + SE++AATY+ SL 
Sbjct: 68   VEIQSHLRHPNILRLYGYFYDQKRVYLILEYAVRGELYKELQKCKYFSERRAATYVASLA 127

Query: 1284 QALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVE 1343
            +AL YCH KHVIHRDIKPENLL+  +G LKIADFGWSV + ++R TMCGTLDYL PEMVE
Sbjct: 128  RALIYCHGKHVIHRDIKPENLLIGAQGELKIADFGWSVHTFNRRRTMCGTLDYLPPEMVE 187

Query: 1344 NKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLI 1403
            +  HD +VD W+LGILCYEFLYGVPPFEA   S+T+KRI++VDL FP    VS  AKDLI
Sbjct: 188  SVEHDASVDIWSLGILCYEFLYGVPPFEAREHSETYKRIVQVDLKFPPKPIVSSSAKDLI 247

Query: 1404 GRLLVKDSSRRLSLQKIVEHPWIIRNADPSGI 1436
             ++LVK+S++RL+L K++EHPWI++NADPSG+
Sbjct: 248  SQMLVKESTQRLALHKLLEHPWIVQNADPSGL 279

BLAST of Sgr015962 vs. ExPASy Swiss-Prot
Match: P97477 (Aurora kinase A OS=Mus musculus OX=10090 GN=Aurka PE=1 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 1.1e-96
Identity = 174/294 (59.18%), Positives = 223/294 (75.85%), Query Frame = 0

Query: 1151 GGEPMKSQA--KDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFK 1210
            G +  K QA  +  ++T KRQW+L+DFD+G+PLGKGKFG VYLARE +SK+I+ALKV+FK
Sbjct: 98   GNDSEKEQASLQKTEDTKKRQWTLEDFDIGRPLGKGKFGNVYLARERQSKFILALKVLFK 157

Query: 1211 EQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSG 1270
             Q+EK  + HQLRRE+EIQ+ LRHPNILRLYG+FHDA R++LILEYA  G +YREL+K  
Sbjct: 158  TQLEKANVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLS 217

Query: 1271 HLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQS-RSKR 1330
               E++ ATYI  L  AL+YCH K VIHRDIKPENLLL   G LKIADFGWSV +  S+R
Sbjct: 218  KFDEQRTATYITELANALSYCHSKRVIHRDIKPENLLLGSNGELKIADFGWSVHAPSSRR 277

Query: 1331 HTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDL 1390
             TMCGTLDYL PEM+E + HD  VD W+LG+LCYEFL G+PPFEA +  +T++RI +V+ 
Sbjct: 278  TTMCGTLDYLPPEMIEGRMHDEKVDLWSLGVLCYEFLVGMPPFEAHTYQETYRRISRVEF 337

Query: 1391 NFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNAD--PSGICNSE 1440
             FP    V+  A+DLI RLL  ++S+RL+L +++EHPWI  N+   P+G  + E
Sbjct: 338  TFP--DFVTEGARDLISRLLKHNASQRLTLAEVLEHPWIKANSSKPPTGHTSKE 389

BLAST of Sgr015962 vs. ExPASy TrEMBL
Match: A0A5A7U692 (Cytochrome P450 704C1-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G004140 PE=3 SV=1)

HSP 1 Score: 1243.0 bits (3215), Expect = 0.0e+00
Identity = 620/801 (77.40%), Positives = 679/801 (84.77%), Query Frame = 0

Query: 692  MEVDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNN 751
            MEV+ NI T F+ VLC   LFFLSF I LLL+TLAGKSIT+S+Y+PVYGT+YGQAFY NN
Sbjct: 1    MEVNFNIITFFTVVLC---LFFLSFFI-LLLKTLAGKSITNSDYSPVYGTIYGQAFYLNN 60

Query: 752  LYDHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGE 811
            LYDHLT VAKRHRTFRLL  +YSEIYT DPRN+EH+LKTKF+ Y KGSKDQE+ GDLFGE
Sbjct: 61   LYDHLTAVAKRHRTFRLLGESYSEIYTVDPRNVEHILKTKFENYRKGSKDQEVCGDLFGE 120

Query: 812  GIFAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDL 871
            GIFAVDG+KWK+QRKLASYEFST+ILRDFSCSVFRR+A KL G+VSEFS+M RVFD+QDL
Sbjct: 121  GIFAVDGEKWKEQRKLASYEFSTKILRDFSCSVFRRNAEKLVGIVSEFSTMARVFDVQDL 180

Query: 872  LMRCALDSIFKVGFGVDLNCLEESSKEGS--DFMKAFDDSSAQIFWRYIDPFWKLKRLLN 931
            LMRC+LDSIFKVGFGVDLNC+EE SK      FM+AFDD+SAQ+FWRYIDPFWKLKR LN
Sbjct: 181  LMRCSLDSIFKVGFGVDLNCMEEPSKAAGRRGFMEAFDDASAQVFWRYIDPFWKLKRFLN 240

Query: 932  IGSEASFRNNIKTIDAFVHQLIRDKRKLLQQPNHN-DKEDILWRFLMESEKDPTRMNDQY 991
            IGSEASFRNN+K IDAFVHQLI  +RKLL QPN   DKEDIL RFLMESEKDPTRMNDQY
Sbjct: 241  IGSEASFRNNLKIIDAFVHQLISARRKLLHQPNLKIDKEDILSRFLMESEKDPTRMNDQY 300

Query: 992  LRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQN 1051
            LRDIVLNFMLAG+D+S GTLSWFFYMLCKNPLIQEKVAEEV QIV  +GEE DINLF+QN
Sbjct: 301  LRDIVLNFMLAGRDTSAGTLSWFFYMLCKNPLIQEKVAEEVSQIVGVQGEETDINLFVQN 360

Query: 1052 LTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGR 1111
            LTDSALDKMHYLHAALTETLRLYPAVP+DGRTAE DDILPDGYKLRKGDGVYY+AYSMGR
Sbjct: 361  LTDSALDKMHYLHAALTETLRLYPAVPIDGRTAETDDILPDGYKLRKGDGVYYLAYSMGR 420

Query: 1112 MSSLWGEDAEDFKPERWLEMELFNPNH--------------------------LSNSSLS 1171
            M  LWGEDAEDFKPERWLE   F P                            +S + L 
Sbjct: 421  MPCLWGEDAEDFKPERWLENGTFRPESPFKFISFHAGPRMCLGKDFAYRQMKIVSAALLQ 480

Query: 1172 YFLLS-------------------------EFYGGEPMKSQAKDEDETPKRQWSLQDFDV 1231
            +F                            EF GG+PMK  AK+ +E+PKRQWSL+DFDV
Sbjct: 481  FFRFKLADPTRNVTYRIMLTLHIDGDFSQPEFDGGDPMKYLAKNNNESPKRQWSLKDFDV 540

Query: 1232 GKPLGKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILR 1291
            GKPLGKGKFGRVYLAREV+SKYIVALKVIF+EQM+KY IH QL REMEIQTSLRHPNILR
Sbjct: 541  GKPLGKGKFGRVYLAREVRSKYIVALKVIFREQMKKYGIHRQLMREMEIQTSLRHPNILR 600

Query: 1292 LYGWFHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHR 1351
            LYGWFHDAERIF+ILEYAHRGELYRELRK+GHLSEKQAATY+LSL QALAYCHEK VIHR
Sbjct: 601  LYGWFHDAERIFMILEYAHRGELYRELRKNGHLSEKQAATYMLSLAQALAYCHEKDVIHR 660

Query: 1352 DIKPENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLG 1411
            DIKPENLLLDHEGRLKI DFGW+VQSRSKR+TMCGTLDYLAPEMVENK HD+A+DNWT+G
Sbjct: 661  DIKPENLLLDHEGRLKIGDFGWAVQSRSKRYTMCGTLDYLAPEMVENKGHDFAIDNWTMG 720

Query: 1412 ILCYEFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSL 1439
            ILCYEFLYGVPPFEAESQ+DTFKRI KV+LNFP T H+S EAKDLIGRLLVKD+S+RLSL
Sbjct: 721  ILCYEFLYGVPPFEAESQNDTFKRIRKVELNFPSTPHISTEAKDLIGRLLVKDASKRLSL 780

BLAST of Sgr015962 vs. ExPASy TrEMBL
Match: A0A6J5U5J8 (Serine/threonine-protein kinase ULK3 OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS15581 PE=4 SV=1)

HSP 1 Score: 983.0 bits (2540), Expect = 1.5e-282
Identity = 608/1259 (48.29%), Positives = 694/1259 (55.12%), Query Frame = 0

Query: 194  MDFVRGVVDYLGSIFSETSSIHES-PHNPSGVGASTMEGV--NGVPVLNERYASKLKGYF 253
            M F++G++D  GS+FS  SS +ES P++ S   +S MEG+   G  V NER A KL+GYF
Sbjct: 1    MSFLKGIIDSFGSVFSAASSSYESHPNSDSPPNSSIMEGIAGPGASVSNERVAYKLRGYF 60

Query: 254  DLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKI 313
            DLAK+EIAKAVRAEEWG++DDAI HY NAQR+L EA+ST VPS+IS SE+EKVKS+RQKI
Sbjct: 61   DLAKDEIAKAVRAEEWGLVDDAIAHYNNAQRVLVEATSTPVPSYISPSEREKVKSYRQKI 120

Query: 314  SKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPI 373
            SKWQ +VSERLQAL+ RAG TS +KS+L H Q A +  T SN +K VL  S     + P 
Sbjct: 121  SKWQGEVSERLQALSRRAGGTSVSKSTLAHAQTAVVKPTTSNARKHVLPKSPRPTTNRPE 180

Query: 374  TRS--QPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDI-----------AG 433
            TR+  Q  N+ +SK +QE   GYDAKLVEMIN+AIVD SPSV+W+D+           AG
Sbjct: 181  TRNQIQTNNIVSSKPVQETGGGYDAKLVEMINSAIVDRSPSVQWEDVDYQVTHFSFSAAG 240

Query: 434  LQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNV 493
            L+K K+ L+EMVILPTKRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASESEATFFNV
Sbjct: 241  LEKVKKTLMEMVILPTKRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESEATFFNV 300

Query: 494  SAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFL 553
            SA+SLTSKWVGE EKLVRTLF+VA SRQPSVIFMDEIDS+MSTR ANEN+ASRRLKSEFL
Sbjct: 301  SASSLTSKWVGEAEKLVRTLFLVAISRQPSVIFMDEIDSIMSTRLANENDASRRLKSEFL 360

Query: 554  VQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQS 613
            +QFDGVTSN  DLVIVIGATNKPQELDDA     VKR+YIPLPD   RRLLL+H LKGQ+
Sbjct: 361  IQFDGVTSNPNDLVIVIGATNKPQELDDA-----VKRVYIPLPDLTARRLLLRHKLKGQA 420

Query: 614  FSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKE 673
            FSLPS ++ERL  ETEGYSGSDLQALCEEAAMMPIRELG NILT+KANQ+RPLRYEDF++
Sbjct: 421  FSLPSGDLERLAGETEGYSGSDLQALCEEAAMMPIRELGENILTIKANQVRPLRYEDFQK 480

Query: 674  AMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCTVFLFFLSFL 733
            AM VIRPSL+KS                                                
Sbjct: 481  AMTVIRPSLSKS------------------------------------------------ 540

Query: 734  ILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRLLAPAYSEIY 793
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 794  TTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLASYEFSTRIL 853
                                                      KW++  +      +T   
Sbjct: 601  ------------------------------------------KWEELERQPYLYAATGTK 660

Query: 854  RDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVDLNCLEESSK 913
             D                                                      E S+
Sbjct: 661  ND-----------------------------------------------------AEKSQ 720

Query: 914  EGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVHQLIRDKRKL 973
            EG                         +++ ++ S A                   KRK 
Sbjct: 721  EGLP-----------------------RKIYSLSSGA-------------------KRK- 780

Query: 974  LQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTLSWFFYMLCK 1033
                                                                        
Sbjct: 781  ------------------------------------------------------------ 824

Query: 1034 NPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLD 1093
                                        IQN                             
Sbjct: 841  ----------------------------IQNF---------------------------- 824

Query: 1094 GRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEMELFNPNHLS 1153
                                                                     H+ 
Sbjct: 901  ---------------------------------------------------------HIV 824

Query: 1154 NSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKS 1213
              SLS             KS+  +  E+PKR+WSL+DF++GKPLGKGKFGRVY+ARE KS
Sbjct: 961  VVSLSL-----------EKSEEMERQESPKREWSLRDFEIGKPLGKGKFGRVYVAREAKS 824

Query: 1214 KYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHR 1273
            KYIVALKVIFKEQ+EKY+I HQLRREMEIQTSLRHPNILRLYGWFHD ERIFLILEYAH 
Sbjct: 1021 KYIVALKVIFKEQIEKYKIQHQLRREMEIQTSLRHPNILRLYGWFHDDERIFLILEYAHG 824

Query: 1274 GELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADF 1333
            GELY  LRK+ +LSEKQAATYILSLTQALAYCHEK+VIHRDIKPENLLLDHEGRLKIADF
Sbjct: 1081 GELYGLLRKTNYLSEKQAATYILSLTQALAYCHEKNVIHRDIKPENLLLDHEGRLKIADF 824

Query: 1334 GWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSD 1393
            GWSVQSRSKR TMCGTLDYLAPEMVEN+ HDYAVDNWTLGILCYEFLYG+PPFEAESQ+D
Sbjct: 1141 GWSVQSRSKRQTMCGTLDYLAPEMVENRPHDYAVDNWTLGILCYEFLYGIPPFEAESQTD 824

Query: 1394 TFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGIC 1437
            TFKRI+KVDLNFP     S EAK LI RLLVKDSS+RLSLQ+I+EHPWI++NADPSGIC
Sbjct: 1201 TFKRIIKVDLNFPSEPQTSAEAKHLITRLLVKDSSKRLSLQRIMEHPWIVKNADPSGIC 824

BLAST of Sgr015962 vs. ExPASy TrEMBL
Match: A0A314UWL0 (Serine/threonine-protein kinase ULK3 OS=Prunus yedoensis var. nudiflora OX=2094558 GN=Pyn_04712 PE=4 SV=1)

HSP 1 Score: 961.8 bits (2485), Expect = 3.6e-276
Identity = 593/1248 (47.52%), Positives = 667/1248 (53.45%), Query Frame = 0

Query: 194  MDFVRGVVDYLGSIFSETSSIHES-PHNPSGVGASTMEGV--NGVPVLNERYASKLKGYF 253
            M F++G++D  GS+FS  SS +ES P++ S   +STMEG+   G  V NER A KL+GYF
Sbjct: 1    MSFLKGIIDSFGSVFSAASSSYESHPNSDSPPNSSTMEGIAGPGASVSNERVAYKLRGYF 60

Query: 254  DLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKI 313
            DLAK+EIAKAVRAEEWG++DDAI HY NAQR+L EA+ST VPS+IS SE+EKVKS+RQKI
Sbjct: 61   DLAKDEIAKAVRAEEWGLVDDAIAHYNNAQRVLVEATSTPVPSYISLSEREKVKSYRQKI 120

Query: 314  SKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPI 373
            SKWQ +VSERLQAL+ RAG TS +KS+L+H Q A +  T SN++K VL  S     + P 
Sbjct: 121  SKWQGEVSERLQALSRRAGGTSVSKSTLDHAQTAVVRPTTSNSRKHVLPKSPRPTTNRPE 180

Query: 374  TRS--QPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDIAGLQKAKQALLEM 433
            TR+  Q  N+ +SK +QE   GYDAKLVEMIN+AIVD SPSVKW+D+AGL+K K+ L+EM
Sbjct: 181  TRNQIQTNNIVSSKPVQETGGGYDAKLVEMINSAIVDRSPSVKWEDVAGLEKVKKTLMEM 240

Query: 434  VILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSAASLTSKWVG 493
            VILPTKRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASESEATFFNVSA+SLTSKWVG
Sbjct: 241  VILPTKRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSASSLTSKWVG 300

Query: 494  EGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFLVQFDGVTSNST 553
            E EKLVRTLFMVA SRQPSVIFMDEIDS+MSTR ANE++ASRRLKSEFL+QFDGVTSN  
Sbjct: 301  EAEKLVRTLFMVAISRQPSVIFMDEIDSIMSTRLANEHDASRRLKSEFLMQFDGVTSNPN 360

Query: 554  DLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQSFSLPSREVERL 613
            DLVIVIGATNKPQELDDAVLRRLVKR+YIPLPD   RRLLL+H LKGQ+FSLPS ++ERL
Sbjct: 361  DLVIVIGATNKPQELDDAVLRRLVKRVYIPLPDLTARRLLLRHKLKGQAFSLPSGDLERL 420

Query: 614  VRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKEAMKVIRPSLNK 673
             RETEGYSGSDLQALCEEAAMMPIRELG NILT+KANQ+RPLRYEDF++AM         
Sbjct: 421  ARETEGYSGSDLQALCEEAAMMPIRELGENILTIKANQVRPLRYEDFQKAM--------- 480

Query: 674  SREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCTVFLFFLSFLILLLLRTLAGK 733
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 734  SITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHML 793
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 794  KTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLASYEFSTRILRDFSCSVFRRS 853
             TK                            KWK                          
Sbjct: 601  -TK----------------------------KWK-------------------------- 660

Query: 854  AAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVDLNCLEESSKEGSDFMKAFDD 913
                                                                        
Sbjct: 661  ------------------------------------------------------------ 720

Query: 914  SSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVHQLIRDKRKLLQQPNHNDKED 973
                                                                        
Sbjct: 721  ------------------------------------------------------------ 754

Query: 974  ILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEE 1033
                                                                        
Sbjct: 781  ------------------------------------------------------------ 754

Query: 1034 VRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETLRLYPAVPLDGRTAEIDDILP 1093
                                                                        
Sbjct: 841  ------------------------------------------------------------ 754

Query: 1094 DGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEMELFNPNHLSNSSLSYFLLSE 1153
                                                                        
Sbjct: 901  ------------------------------------------------------------ 754

Query: 1154 FYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGRVYLAREVKSKYIVALKVIFK 1213
                 P K Q     E+PKR+WSL+DF++GKPLGKGKFGRVY+ARE KSKYIVALKVIFK
Sbjct: 961  -----PRKGQ-----ESPKREWSLRDFEIGKPLGKGKFGRVYVAREAKSKYIVALKVIFK 754

Query: 1214 EQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERIFLILEYAHRGELYRELRKSG 1273
            EQ+EKY+I HQLRREMEIQTSLRHPNILRLYGWFHD ERIFLILEYAH GELY  LRK+ 
Sbjct: 1021 EQIEKYKIQHQLRREMEIQTSLRHPNILRLYGWFHDDERIFLILEYAHGGELYGLLRKTN 754

Query: 1274 HLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRH 1333
            +LSEKQAATYILSLTQALAYCHEK+VIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKR 
Sbjct: 1081 YLSEKQAATYILSLTQALAYCHEKNVIHRDIKPENLLLDHEGRLKIADFGWSVQSRSKRQ 754

Query: 1334 TMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVPPFEAESQSDTFKRIMKVDLN 1393
            TMCGTLDYLAPEMVEN+ HDYAVDNWTLGILCYEFLYGVPPFEAESQ+DTF+RI+KVDL+
Sbjct: 1141 TMCGTLDYLAPEMVENRPHDYAVDNWTLGILCYEFLYGVPPFEAESQTDTFRRIIKVDLS 754

Query: 1394 FPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIRNADPSGIC 1437
            FP     S EAK LI RLLVKDSS+RLSLQ+I+EHPWI++NADPSGIC
Sbjct: 1201 FPSEPQASAEAKHLITRLLVKDSSKRLSLQRIMEHPWIVKNADPSGIC 754

BLAST of Sgr015962 vs. ExPASy TrEMBL
Match: A0A1R3K0M8 (Serine/threonine-protein kinase ULK3 OS=Corchorus olitorius OX=93759 GN=COLO4_12580 PE=3 SV=1)

HSP 1 Score: 893.6 bits (2308), Expect = 1.2e-255
Identity = 560/1206 (46.43%), Positives = 614/1206 (50.91%), Query Frame = 0

Query: 229  MEGVNGVPVLNERYASKLKGYFDLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASS 288
            M+GV G    NER A KL+GYFDLAKEEI KAVRAEEWG+I+DA++HY+NA+RIL EASS
Sbjct: 1    MDGVAG----NERVAYKLRGYFDLAKEEIDKAVRAEEWGLIEDALVHYRNAERILVEASS 60

Query: 289  TAVPSFISSSEQEKVKSHRQKISKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIAS 348
            T VP +ISSSEQEKVKS+RQKISKWQ QVSERLQ L  RAG  ST            I+ 
Sbjct: 61   TPVPLYISSSEQEKVKSYRQKISKWQGQVSERLQVLGRRAGGPST------------ISP 120

Query: 349  TMSNTKKAVLRSSSHSVASNPITRSQPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSP 408
              SN ++ VL+ S      N + R+Q   VGT K  QE  NGYD+K++EMINTAIVD SP
Sbjct: 121  RTSNPRRDVLQKSPR----NQVVRNQGDRVGTPKPAQESANGYDSKMIEMINTAIVDRSP 180

Query: 409  SVKWDDIAGLQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVAS 468
            SVKW+D+AGL KAKQAL+EMVILPT+RRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVAS
Sbjct: 181  SVKWEDVAGLDKAKQALMEMVILPTRRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVAS 240

Query: 469  ESEATFFNVSAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEA 528
            ES+ATFFNVSA+SLTSKWVGEGEKLVRTLFMVA S+QPSVIF+DEIDSV+STR  NE++A
Sbjct: 241  ESQATFFNVSASSLTSKWVGEGEKLVRTLFMVAISKQPSVIFIDEIDSVLSTRLENEHDA 300

Query: 529  SRRLKSEFLVQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLL 588
            SRRLKSEFL+QFDGVTSN  DLVIVIGATNKPQELDDAVLRRLVKRIY+PLPD+NVRRLL
Sbjct: 301  SRRLKSEFLIQFDGVTSNPNDLVIVIGATNKPQELDDAVLRRLVKRIYVPLPDENVRRLL 360

Query: 589  LKHILKGQSFSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIR 648
            L++ LKGQ+FSLP R++ERLVRETEGYSGSDLQALCEEAAMMPIRELG NILTVKANQ  
Sbjct: 361  LQNKLKGQAFSLPGRDLERLVRETEGYSGSDLQALCEEAAMMPIRELGSNILTVKANQ-- 420

Query: 649  PLRYEDFKEAMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTVFSFVLCT 708
                                                                        
Sbjct: 421  ------------------------------------------------------------ 480

Query: 709  VFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKRHRTFRL 768
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 769  LAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWKQQRKLA 828
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 829  SYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFKVGFGVD 888
                                                                        
Sbjct: 601  ------------------------------------------------------------ 660

Query: 889  LNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKTIDAFVH 948
                                                                        
Sbjct: 661  ------------------------------------------------------------ 673

Query: 949  QLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGKDSSGGTL 1008
                                                                        
Sbjct: 721  ------------------------------------------------------------ 673

Query: 1009 SWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLHAALTETL 1068
                                                                        
Sbjct: 781  ------------------------------------------------------------ 673

Query: 1069 RLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFKPERWLEM 1128
                                                                        
Sbjct: 841  ------------------------------------------------------------ 673

Query: 1129 ELFNPNHLSNSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPLGKGKFGR 1188
                                           K+E+   KR WS++DFD+GKPLGKGKFGR
Sbjct: 901  -------------------------------KEEETNVKRDWSIKDFDIGKPLGKGKFGR 673

Query: 1189 VYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGWFHDAERI 1248
            VYLAREVKSKYIVALKVIFKEQ+EKY+IHHQLRREMEIQTSLRHPNILRLYGWFHD ERI
Sbjct: 961  VYLAREVKSKYIVALKVIFKEQIEKYKIHHQLRREMEIQTSLRHPNILRLYGWFHDNERI 673

Query: 1249 FLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKPENLLLDH 1308
            FLILEYAH GELY+ELRK GHLSEKQAATYI SLT ALAYCHEK+VIHRDIKPENLLLDH
Sbjct: 1021 FLILEYAHGGELYKELRKKGHLSEKQAATYIASLTTALAYCHEKNVIHRDIKPENLLLDH 673

Query: 1309 EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGVP 1368
            EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYG P
Sbjct: 1081 EGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCYEFLYGSP 673

Query: 1369 PFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIVEHPWIIR 1428
            PFEAESQ DTF+RIM VDL+FP T HVS EAK+LI RLLVKDS +RLSLQKI+EHPWII+
Sbjct: 1141 PFEAESQRDTFRRIMNVDLSFPPTPHVSMEAKNLISRLLVKDSYKRLSLQKIMEHPWIIK 673

Query: 1429 NADPSG 1435
            NADP G
Sbjct: 1201 NADPLG 673

BLAST of Sgr015962 vs. ExPASy TrEMBL
Match: A0A1R3I7R1 (Serine/threonine-protein kinase ULK3 OS=Corchorus capsularis OX=210143 GN=CCACVL1_14257 PE=3 SV=1)

HSP 1 Score: 828.9 bits (2140), Expect = 3.6e-236
Identity = 536/1213 (44.19%), Positives = 590/1213 (48.64%), Query Frame = 0

Query: 229  MEGVNGVPVLNERYASKLKGYFDLAKEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASS 288
            M+GV G    NER A KL+GYFDLAK EI KAVRAEEWG+IDDA++HY+NA+RIL EA+S
Sbjct: 1    MDGVAG----NERVAYKLRGYFDLAKGEIDKAVRAEEWGLIDDALVHYRNAERILVEANS 60

Query: 289  TAVPSFISSSEQEKVKSHRQKISKWQSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIAS 348
            T VP +ISSSEQEKVKS+RQKISKWQ QVSERLQ L  RAG  ST+K++L H Q A ++ 
Sbjct: 61   TPVPLYISSSEQEKVKSYRQKISKWQGQVSERLQVLGRRAGGPSTSKNTLTHAQTAAVSP 120

Query: 349  TMSNTKKAVL-RSSSHSVASNPITR------SQPPNVGTSKSMQEVPNGYDAKLVEMINT 408
              SN ++ VL +S  + V  N   R      +Q   VGT K+ QE  NGYD+K++EMINT
Sbjct: 121  RTSNPRRDVLQKSPRNQVVRNQADRVGTLKPAQADRVGTPKAAQESANGYDSKMIEMINT 180

Query: 409  AIVDHSPSVKWDDIAGLQKAKQALLEMVILPTKRRDLFTGLRKPARGLLLFGPPGNGKTM 468
            AIVD SPSVKW+D+AGL KAKQAL+EMVILPT+RRDLFTGLR+PARGLLLFGPPGNGKTM
Sbjct: 181  AIVDRSPSVKWEDVAGLDKAKQALMEMVILPTRRRDLFTGLRRPARGLLLFGPPGNGKTM 240

Query: 469  LAKAVASESEATFFNVSAASLTSKWVGEGEKLVRTLFMVAKSRQPSVIFMDEIDSVMSTR 528
            LAKAVASES+ATFFNVSA+SLTSKWVGEGEKLVRTLFMVA S+QPSVIF+DEIDSV+STR
Sbjct: 241  LAKAVASESQATFFNVSASSLTSKWVGEGEKLVRTLFMVAISKQPSVIFIDEIDSVLSTR 300

Query: 529  QANENEASRRLKSEFLVQFDGVTSNSTDLVIVIGATNKPQELDDAVLRRLVKRIYIPLPD 588
              NE++ASRRLKSEFL+QFDGVTSN  DLVIVIGATNKPQELDDAVLRRLVKRIY+PLPD
Sbjct: 301  LENEHDASRRLKSEFLIQFDGVTSNPNDLVIVIGATNKPQELDDAVLRRLVKRIYVPLPD 360

Query: 589  DNVRRLLLKHILKGQSFSLPSREVERLVRETEGYSGSDLQALCEEAAMMPIRELGGNILT 648
            +NVRRLLL++ LKGQ+FSLP R++ERLVRETEGYSGSDLQALCEEAAMMPIRELG NILT
Sbjct: 361  ENVRRLLLQNKLKGQAFSLPGRDLERLVRETEGYSGSDLQALCEEAAMMPIRELGSNILT 420

Query: 649  VKANQIRPLRYEDFKEAMKVIRPSLNKSREFDAEVSLKPLSNFNIQFDSSMEVDINIFTV 708
            VKANQ                                                       
Sbjct: 421  VKANQ------------------------------------------------------- 480

Query: 709  FSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAK 768
                                                                        
Sbjct: 481  ------------------------------------------------------------ 540

Query: 769  RHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKW 828
                                                                        
Sbjct: 541  ------------------------------------------------------------ 600

Query: 829  KQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIF 888
                                                                        
Sbjct: 601  ------------------------------------------------------------ 659

Query: 889  KVGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIK 948
                                                                        
Sbjct: 661  ------------------------------------------------------------ 659

Query: 949  TIDAFVHQLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGK 1008
                                                                        
Sbjct: 721  ------------------------------------------------------------ 659

Query: 1009 DSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLH 1068
                                                                        
Sbjct: 781  ------------------------------------------------------------ 659

Query: 1069 AALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFK 1128
                                                                        
Sbjct: 841  ------------------------------------------------------------ 659

Query: 1129 PERWLEMELFNPNHLSNSSLSYFLLSEFYGGEPMKSQAKDEDETPKRQWSLQDFDVGKPL 1188
                                                                        
Sbjct: 901  ------------------------------------------------------------ 659

Query: 1189 GKGKFGRVYLAREVKSKYIVALKVIFKEQMEKYRIHHQLRREMEIQTSLRHPNILRLYGW 1248
                           SKYIVALKVIFKEQ+EKY+IHHQLRREMEIQTSLRHPNILRLYGW
Sbjct: 961  ---------------SKYIVALKVIFKEQIEKYKIHHQLRREMEIQTSLRHPNILRLYGW 659

Query: 1249 FHDAERIFLILEYAHRGELYRELRKSGHLSEKQAATYILSLTQALAYCHEKHVIHRDIKP 1308
            FHD ERIFLILEYAH GELY+ELRK GHLSEKQAATYI SLT ALAYCHEK+VIHRDIKP
Sbjct: 1021 FHDNERIFLILEYAHGGELYKELRKKGHLSEKQAATYIASLTTALAYCHEKNVIHRDIKP 659

Query: 1309 ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY 1368
            ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY
Sbjct: 1081 ENLLLDHEGRLKIADFGWSVQSRSKRHTMCGTLDYLAPEMVENKAHDYAVDNWTLGILCY 659

Query: 1369 EFLYGVPPFEAESQSDTFKRIMKVDLNFPLTRHVSPEAKDLIGRLLVKDSSRRLSLQKIV 1428
            EFLYG PPFEAESQ DTF+RIM VDL+FP T HVS EAK+LI RLLVKDS +RLSLQKI+
Sbjct: 1141 EFLYGSPPFEAESQRDTFRRIMNVDLSFPPTPHVSMEAKNLISRLLVKDSYKRLSLQKIM 659

Query: 1429 EHPWIIRNADPSG 1435
            EHPWII+NADP G
Sbjct: 1201 EHPWIIKNADPLG 659

BLAST of Sgr015962 vs. TAIR 10
Match: AT2G45500.1 (AAA-type ATPase family protein)

HSP 1 Score: 614.4 bits (1583), Expect = 2.7e-175
Identity = 330/499 (66.13%), Positives = 395/499 (79.16%), Query Frame = 0

Query: 194 MDFVRGVVDYLGSIFSETSSIHESPHNPSGVGASTMEGVNGVPVLNERYASKLKGYFDLA 253
           M F+RG++D   SI +E S    S  + S   + +M G++GVPV NER A KLKGYFDLA
Sbjct: 1   MSFLRGIIDSFSSILNEESKKDPSV-SSSSTSSESMNGIDGVPVTNERIAYKLKGYFDLA 60

Query: 254 KEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKISKW 313
           KEEIAK VRAEEWG+ DDA+LHY+NAQRI+ EA+ST  PS+ISSSE+EKV+S+R+KIS W
Sbjct: 61  KEEIAKGVRAEEWGLHDDALLHYRNAQRIMNEATSTPSPSYISSSEKEKVRSYREKISNW 120

Query: 314 QSQVSERLQALNMRAGV-TSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPITR 373
           Q+QVSERLQAL  R GV  S NK ++ +   A ++ST S  +K + + +  +       R
Sbjct: 121 QNQVSERLQALGKRTGVGMSENKRTVAYPSSASVSSTASRYRKTLSQKTPVARGGVATPR 180

Query: 374 SQPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDIAGLQKAKQALLEMVILP 433
           +      + K ++E  N YD KLVEMINT IVD SPSVKWDD+AGL  AKQALLEMVILP
Sbjct: 181 NPKDAAASPKPVKESGNVYDDKLVEMINTTIVDRSPSVKWDDVAGLNGAKQALLEMVILP 240

Query: 434 TKRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSAASLTSKWVGEGEK 493
            KRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASES+ATFFNVSA+SLTSKWVGE EK
Sbjct: 241 AKRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESQATFFNVSASSLTSKWVGEAEK 300

Query: 494 LVRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFLVQFDGVTSNSTDLVI 553
           LV+TLF VA SRQPSVIFMDEIDS+MSTR  +ENEASRRLKSEFL+QFDGVTSN  DLVI
Sbjct: 301 LVKTLFQVAISRQPSVIFMDEIDSIMSTRSTSENEASRRLKSEFLIQFDGVTSNPDDLVI 360

Query: 554 VIGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQSFSLPSREVERLVRET 613
           +IGATNKPQELDDAVLRRLVKRIY+PLPD NVR+LL K  LK Q  SL   +++++V+ET
Sbjct: 361 IIGATNKPQELDDAVLRRLVKRIYVPLPDSNVRKLLFKTKLKCQPHSLSDGDIDKIVKET 420

Query: 614 EGYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKEAMKVIRPSLNKSREF 673
           EGYSGSDLQALCEEAAMMPIRELG NILT++AN++R LRY+DF+++M VIRPSL+KS+  
Sbjct: 421 EGYSGSDLQALCEEAAMMPIRELGANILTIQANKVRSLRYDDFRKSMAVIRPSLSKSK-- 480

Query: 674 DAEVSLKPLSNFNIQFDSS 692
                 + L  +N +F S+
Sbjct: 481 -----WEELERWNSEFGSN 491

BLAST of Sgr015962 vs. TAIR 10
Match: AT2G45500.2 (AAA-type ATPase family protein)

HSP 1 Score: 607.8 bits (1566), Expect = 2.5e-173
Identity = 327/498 (65.66%), Positives = 393/498 (78.92%), Query Frame = 0

Query: 194 MDFVRGVVDYLGSIFSETSSIHESPHNPSGVGASTMEGVNGVPVLNERYASKLKGYFDLA 253
           M F+RG++D   SI +E S    S  + S   + +M G++GVPV NER A KLKGYFDLA
Sbjct: 1   MSFLRGIIDSFSSILNEESKKDPSV-SSSSTSSESMNGIDGVPVTNERIAYKLKGYFDLA 60

Query: 254 KEEIAKAVRAEEWGIIDDAILHYQNAQRILAEASSTAVPSFISSSEQEKVKSHRQKISKW 313
           KEEIAK VRAEEWG+ DDA+LHY+NAQRI+ EA+ST  PS+ISSSE+EKV+S+R+KIS W
Sbjct: 61  KEEIAKGVRAEEWGLHDDALLHYRNAQRIMNEATSTPSPSYISSSEKEKVRSYREKISNW 120

Query: 314 QSQVSERLQALNMRAGVTSTNKSSLNHVQRAGIASTMSNTKKAVLRSSSHSVASNPITRS 373
           Q+QVSERLQAL +     S NK ++ +   A ++ST S  +K + + +  +       R+
Sbjct: 121 QNQVSERLQALGVG---MSENKRTVAYPSSASVSSTASRYRKTLSQKTPVARGGVATPRN 180

Query: 374 QPPNVGTSKSMQEVPNGYDAKLVEMINTAIVDHSPSVKWDDIAGLQKAKQALLEMVILPT 433
                 + K ++E  N YD KLVEMINT IVD SPSVKWDD+AGL  AKQALLEMVILP 
Sbjct: 181 PKDAAASPKPVKESGNVYDDKLVEMINTTIVDRSPSVKWDDVAGLNGAKQALLEMVILPA 240

Query: 434 KRRDLFTGLRKPARGLLLFGPPGNGKTMLAKAVASESEATFFNVSAASLTSKWVGEGEKL 493
           KRRDLFTGLR+PARGLLLFGPPGNGKTMLAKAVASES+ATFFNVSA+SLTSKWVGE EKL
Sbjct: 241 KRRDLFTGLRRPARGLLLFGPPGNGKTMLAKAVASESQATFFNVSASSLTSKWVGEAEKL 300

Query: 494 VRTLFMVAKSRQPSVIFMDEIDSVMSTRQANENEASRRLKSEFLVQFDGVTSNSTDLVIV 553
           V+TLF VA SRQPSVIFMDEIDS+MSTR  +ENEASRRLKSEFL+QFDGVTSN  DLVI+
Sbjct: 301 VKTLFQVAISRQPSVIFMDEIDSIMSTRSTSENEASRRLKSEFLIQFDGVTSNPDDLVII 360

Query: 554 IGATNKPQELDDAVLRRLVKRIYIPLPDDNVRRLLLKHILKGQSFSLPSREVERLVRETE 613
           IGATNKPQELDDAVLRRLVKRIY+PLPD NVR+LL K  LK Q  SL   +++++V+ETE
Sbjct: 361 IGATNKPQELDDAVLRRLVKRIYVPLPDSNVRKLLFKTKLKCQPHSLSDGDIDKIVKETE 420

Query: 614 GYSGSDLQALCEEAAMMPIRELGGNILTVKANQIRPLRYEDFKEAMKVIRPSLNKSREFD 673
           GYSGSDLQALCEEAAMMPIRELG NILT++AN++R LRY+DF+++M VIRPSL+KS+   
Sbjct: 421 GYSGSDLQALCEEAAMMPIRELGANILTIQANKVRSLRYDDFRKSMAVIRPSLSKSK--- 480

Query: 674 AEVSLKPLSNFNIQFDSS 692
                + L  +N +F S+
Sbjct: 481 ----WEELERWNSEFGSN 487

BLAST of Sgr015962 vs. TAIR 10
Match: AT2G45510.1 (cytochrome P450, family 704, subfamily A, polypeptide 2)

HSP 1 Score: 531.6 bits (1368), Expect = 2.3e-150
Identity = 252/439 (57.40%), Positives = 328/439 (74.72%), Query Frame = 0

Query: 696  INIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDH 755
            + I T  +  + T     L F I L++R   GKS     Y PV+ TV+   F+ + LYD+
Sbjct: 1    MEILTSIAITVATTIFIVLCFTIYLMIRIFTGKSRNDKRYAPVHATVFDLLFHSDELYDY 60

Query: 756  LTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFA 815
             TE+A+   T+R L+P  SEI T DPRN+EH+LKT+FD YSKG   +E + DL G GIFA
Sbjct: 61   ETEIAREKPTYRFLSPGQSEILTADPRNVEHILKTRFDNYSKGHSSRENMADLLGHGIFA 120

Query: 816  VDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRC 875
            VDG+KW+QQRKL+S+EFSTR+LRDFSCSVFRR+A+KL G VSEF+  G+ FD QDLLMRC
Sbjct: 121  VDGEKWRQQRKLSSFEFSTRVLRDFSCSVFRRNASKLVGFVSEFALSGKAFDAQDLLMRC 180

Query: 876  ALDSIFKVGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEAS 935
             LDSIFKVGFGV+L CL+  SKEG +FM+AFD+ +     R+IDP WKLK   NIGS++ 
Sbjct: 181  TLDSIFKVGFGVELKCLDGFSKEGQEFMEAFDEGNVATSSRFIDPLWKLKWFFNIGSQSK 240

Query: 936  FRNNIKTIDAFVHQLIRDKRK-LLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVL 995
             + +I TID FV+ LI  KRK L ++ N   +EDIL RFL+ESEKDP  MND+YLRDI+L
Sbjct: 241  LKKSIATIDKFVYSLITTKRKELAKEQNTVVREDILSRFLVESEKDPENMNDKYLRDIIL 300

Query: 996  NFMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSAL 1055
            NFM+AGKD++   LSWF YMLCKNPL+QEK+ +E+R +     +  D+N F++++ + AL
Sbjct: 301  NFMIAGKDTTAALLSWFLYMLCKNPLVQEKIVQEIRDVTFSHEKTTDVNGFVESINEEAL 360

Query: 1056 DKMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWG 1115
            D+MHYLHAAL+ETLRLYP VP+D R AE DD+LPDG+++ KGD +YY+AY+MGRM+ +WG
Sbjct: 361  DEMHYLHAALSETLRLYPPVPVDMRCAENDDVLPDGHRVSKGDNIYYIAYAMGRMTYIWG 420

Query: 1116 EDAEDFKPERWLEMELFNP 1134
            +DAE+FKPERWL+  LF P
Sbjct: 421  QDAEEFKPERWLKDGLFQP 439

BLAST of Sgr015962 vs. TAIR 10
Match: AT2G44890.1 (cytochrome P450, family 704, subfamily A, polypeptide 1 )

HSP 1 Score: 505.8 bits (1301), Expect = 1.4e-142
Identity = 244/451 (54.10%), Positives = 325/451 (72.06%), Query Frame = 0

Query: 703  SFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDHLTEVAKR 762
            + ++ T     LSF + L +R   GKS     YTPV+ T++   F+ + LYD+ TE+A+ 
Sbjct: 2    AIIVVTTIFILLSFALYLTIRIFTGKSRNDKRYTPVHATIFDLFFHSHKLYDYETEIART 61

Query: 763  HRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFAVDGDKWK 822
              TFR L+P  SEI+T DPRN+EH+LKT+F  YSKG      + DL G GIFAVDG+KWK
Sbjct: 62   KPTFRFLSPGQSEIFTADPRNVEHILKTRFHNYSKGPVGTVNLADLLGHGIFAVDGEKWK 121

Query: 823  QQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRCALDSIFK 882
            QQRKL S+EFSTR+LR+FS SVFR SA+KL G ++EF+  G+ FD QD+LM+C LDSIFK
Sbjct: 122  QQRKLVSFEFSTRVLRNFSYSVFRTSASKLVGFIAEFALSGKSFDFQDMLMKCTLDSIFK 181

Query: 883  VGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEASFRNNIKT 942
            VGFGV+L CL+  SKEG +FMKAFD+ +     R  DPFWKLK  LNIGSE+  + +I  
Sbjct: 182  VGFGVELGCLDGFSKEGEEFMKAFDEGNGATSSRVTDPFWKLKCFLNIGSESRLKKSIAI 241

Query: 943  IDAFVHQLIRDKRK-LLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLNFMLAGK 1002
            ID FV+ LI  KRK L ++ N + +EDIL +FL+ESEKDP  MND+YLRDI+LN M+AGK
Sbjct: 242  IDKFVYSLITTKRKELSKEQNTSVREDILSKFLLESEKDPENMNDKYLRDIILNVMVAGK 301

Query: 1003 DSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALDKMHYLH 1062
            D++  +LSWF YMLCKNPL+QEK+ +E+R + +   +  D+N FI+++T+ AL +M YLH
Sbjct: 302  DTTAASLSWFLYMLCKNPLVQEKIVQEIRDVTSSHEKTTDVNGFIESVTEEALAQMQYLH 361

Query: 1063 AALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGEDAEDFK 1122
            AAL+ET+RLYP VP   R AE DD+LPDG+++ KGD +YY++Y+MGRM+ +WG+DAE+FK
Sbjct: 362  AALSETMRLYPPVPEHMRCAENDDVLPDGHRVSKGDNIYYISYAMGRMTYIWGQDAEEFK 421

Query: 1123 PERWLEMELFNPNHLSNSSLSYFLLSEFYGG 1153
            PERWL+  +F P        S F    F+ G
Sbjct: 422  PERWLKDGVFQPE-------SQFKFISFHAG 445

BLAST of Sgr015962 vs. TAIR 10
Match: AT2G44890.2 (cytochrome P450, family 704, subfamily A, polypeptide 1 )

HSP 1 Score: 498.4 bits (1282), Expect = 2.2e-140
Identity = 238/432 (55.09%), Positives = 316/432 (73.15%), Query Frame = 0

Query: 696  INIFTVFSFVLCTVFLFFLSFLILLLLRTLAGKSITSSEYTPVYGTVYGQAFYFNNLYDH 755
            + I T  + ++ T     LSF + L +R   GKS     YTPV+ T++   F+ + LYD+
Sbjct: 1    MEILTSMAIIVVTTIFILLSFALYLTIRIFTGKSRNDKRYTPVHATIFDLFFHSHKLYDY 60

Query: 756  LTEVAKRHRTFRLLAPAYSEIYTTDPRNIEHMLKTKFDKYSKGSKDQEIVGDLFGEGIFA 815
             TE+A+   TFR L+P  SEI+T DPRN+EH+LKT+F  YSKG      + DL G GIFA
Sbjct: 61   ETEIARTKPTFRFLSPGQSEIFTADPRNVEHILKTRFHNYSKGPVGTVNLADLLGHGIFA 120

Query: 816  VDGDKWKQQRKLASYEFSTRILRDFSCSVFRRSAAKLDGVVSEFSSMGRVFDIQDLLMRC 875
            VDG+KWKQQRKL S+EFSTR+LR+FS SVFR SA+KL G ++EF+  G+ FD QD+LM+C
Sbjct: 121  VDGEKWKQQRKLVSFEFSTRVLRNFSYSVFRTSASKLVGFIAEFALSGKSFDFQDMLMKC 180

Query: 876  ALDSIFKVGFGVDLNCLEESSKEGSDFMKAFDDSSAQIFWRYIDPFWKLKRLLNIGSEAS 935
             LDSIFKVGFGV+L CL+  SKEG +FMKAFD+ +     R  DPFWKLK  LNIGSE+ 
Sbjct: 181  TLDSIFKVGFGVELGCLDGFSKEGEEFMKAFDEGNGATSSRVTDPFWKLKCFLNIGSESR 240

Query: 936  FRNNIKTIDAFVHQLIRDKRKLLQQPNHNDKEDILWRFLMESEKDPTRMNDQYLRDIVLN 995
             + +I  ID FV+ LI  KRK L +  +    DIL +FL+ESEKDP  MND+YLRDI+LN
Sbjct: 241  LKKSIAIIDKFVYSLITTKRKELSKEQNT---DILSKFLLESEKDPENMNDKYLRDIILN 300

Query: 996  FMLAGKDSSGGTLSWFFYMLCKNPLIQEKVAEEVRQIVAFEGEEVDINLFIQNLTDSALD 1055
             M+AGKD++  +LSWF YMLCKNPL+QEK+ +E+R + +   +  D+N FI+++T+ AL 
Sbjct: 301  VMVAGKDTTAASLSWFLYMLCKNPLVQEKIVQEIRDVTSSHEKTTDVNGFIESVTEEALA 360

Query: 1056 KMHYLHAALTETLRLYPAVPLDGRTAEIDDILPDGYKLRKGDGVYYMAYSMGRMSSLWGE 1115
            +M YLHAAL+ET+RLYP VP   R AE DD+LPDG+++ KGD +YY++Y+MGRM+ +WG+
Sbjct: 361  QMQYLHAALSETMRLYPPVPEHMRCAENDDVLPDGHRVSKGDNIYYISYAMGRMTYIWGQ 420

Query: 1116 DAEDFKPERWLE 1128
            DAE+FKPERWL+
Sbjct: 421  DAEEFKPERWLK 429
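
The percentages on each HSP statistics line follow directly from the match counts and the alignment length. As a minimal sketch (the function name `check_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool), the figures can be recomputed like this:

```python
import re

# Matches the statistics line printed under each "HSP" header.
# "Posi?tives" also accepts the "Postives" spelling some viewers emit.
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_stats(line):
    """Return recomputed (identity %, positives %), rounded to two decimals."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

# Statistics line of the AT2G45510.1 hit shown above.
line = "Identity = 252/439 (57.40%), Positives = 328/439 (74.72%), Query Frame = 0"
print(check_stats(line))
```

Both values are fractions of the aligned length (439 columns here), not of the full query or subject length, which is why a hit covering only part of the protein can still report a high identity.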

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA0049019.1 | 0.0e+00 | 77.40 | cytochrome P450 704C1-like isoform X1 [Cucumis melo var. makuwa] >TYK17545.1 cyt...
CAB4269788.1 | 3.1e-282 | 48.29 | unnamed protein product [Prunus armeniaca]
PQM41947.1 | 7.4e-276 | 47.52 | hypothetical protein Pyn_04712 [Prunus yedoensis var. nudiflora]
OMP00538.1 | 2.5e-255 | 46.43 | hypothetical protein COLO4_12580 [Corchorus olitorius]
OMO78635.1 | 7.5e-236 | 44.19 | hypothetical protein CCACVL1_14257 [Corchorus capsularis]

Match Name | E-value | Identity | Description
O64629 | 1.2e-132 | 80.21 | Serine/threonine-protein kinase Aurora-3 OS=Arabidopsis thaliana OX=3702 GN=AUR3...
Q50EK3 | 2.3e-110 | 45.68 | Cytochrome P450 704C1 OS=Pinus taeda OX=3352 GN=CYP704C1 PE=2 SV=1
Q9M077 | 2.0e-106 | 65.80 | Serine/threonine-protein kinase Aurora-1 OS=Arabidopsis thaliana OX=3702 GN=AUR1...
Q683C9 | 2.6e-106 | 65.44 | Serine/threonine-protein kinase Aurora-2 OS=Arabidopsis thaliana OX=3702 GN=AUR2...
P97477 | 1.1e-96 | 59.18 | Aurora kinase A OS=Mus musculus OX=10090 GN=Aurka PE=1 SV=1

Match Name | E-value | Identity | Description
A0A5A7U692 | 0.0e+00 | 77.40 | Cytochrome P450 704C1-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A6J5U5J8 | 1.5e-282 | 48.29 | Serine/threonine-protein kinase ULK3 OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCU...
A0A314UWL0 | 3.6e-276 | 47.52 | Serine/threonine-protein kinase ULK3 OS=Prunus yedoensis var. nudiflora OX=20945...
A0A1R3K0M8 | 1.2e-255 | 46.43 | Serine/threonine-protein kinase ULK3 OS=Corchorus olitorius OX=93759 GN=COLO4_12...
A0A1R3I7R1 | 3.6e-236 | 44.19 | Serine/threonine-protein kinase ULK3 OS=Corchorus capsularis OX=210143 GN=CCACVL...

Match Name | E-value | Identity | Description
AT2G45500.1 | 2.7e-175 | 66.13 | AAA-type ATPase family protein
AT2G45500.2 | 2.5e-173 | 65.66 | AAA-type ATPase family protein
AT2G45510.1 | 2.3e-150 | 57.40 | cytochrome P450, family 704, subfamily A, polypeptide 2
AT2G44890.1 | 1.4e-142 | 54.10 | cytochrome P450, family 704, subfamily A, polypeptide 1
AT2G44890.2 | 2.2e-140 | 55.09 | cytochrome P450, family 704, subfamily A, polypeptide 1
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002401 | Cytochrome P450, E-class, group I | PRINTS | PR00463 | EP450I | coord: 1008..1034, score: 30.49; coord: 1061..1079, score: 29.95; coord: 988..1005, score: 23.42
IPR000719 | Protein kinase domain | SMART | SM00220 | serkin_6 | coord: 1175..1426, e-value: 4.0E-97, score: 338.6
IPR000719 | Protein kinase domain | PFAM | PF00069 | Pkinase | coord: 1176..1426, e-value: 3.9E-74, score: 249.4
IPR000719 | Protein kinase domain | PROSITE | PS50011 | PROTEIN_KINASE_DOM | coord: 1175..1426, score: 51.071102
IPR007330 | MIT domain | SMART | SM00745 | smart | coord: 246..323, e-value: 4.0E-15, score: 66.3
IPR003593 | AAA+ ATPase domain | SMART | SM00382 | AAA_5 | coord: 445..581, e-value: 2.9E-15, score: 66.7
None | No IPR available | GENE3D | 1.10.8.60 | - | coord: 580..662, e-value: 9.8E-108, score: 361.6
None | No IPR available | GENE3D | 1.10.510.10 | Transferase(Phosphotransferase) domain 1 | coord: 1259..1455, e-value: 5.9E-57, score: 194.8
None | No IPR available | GENE3D | 1.20.58.80 | - | coord: 239..328, e-value: 2.8E-6, score: 29.0
None | No IPR available | GENE3D | 3.30.200.20 | Phosphorylase Kinase; domain 1 | coord: 1163..1258, e-value: 3.6E-33, score: 115.4
None | No IPR available | PIRSR | PIRSR037014-1 | PIRSR037014-1 | coord: 1175..1427, e-value: 2.8E-48, score: 162.7
None | No IPR available | PIRSR | PIRSR037568-2 | PIRSR037568-2 | coord: 1171..1424, e-value: 1.2E-38, score: 130.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 360..385
None | No IPR available | PANTHER | PTHR24296 | CYTOCHROME P450 | coord: 701..1134
None | No IPR available | PANTHER | PTHR24296:SF163 | CYTOCHROME P450, FAMILY 704, SUBFAMILY A, POLYPEPTIDE 1 | coord: 701..1134
None | No IPR available | CDD | cd00009 | AAA | coord: 416..579, e-value: 9.78117E-25, score: 99.9131
None | No IPR available | CDD | cd14007 | STKc_Aurora | coord: 1174..1426, e-value: 1.24033E-156, score: 475.041
None | No IPR available | CDD | cd02679 | MIT_spastin | coord: 246..324, e-value: 2.86671E-26, score: 101.585
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 | - | coord: 408..669, e-value: 9.8E-108, score: 361.6
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 408..670
IPR036396 | Cytochrome P450 superfamily | GENE3D | 1.10.630.10 | Cytochrome P450 | coord: 734..1145, e-value: 1.1E-85, score: 289.8
IPR036396 | Cytochrome P450 superfamily | SUPERFAMILY | 48264 | Cytochrome P450 | coord: 736..1131
IPR041569 | AAA ATPase, AAA+ lid domain | PFAM | PF17862 | AAA_lid_3 | coord: 605..638, e-value: 2.4E-10, score: 40.1
IPR003959 | ATPase, AAA-type, core | PFAM | PF00004 | AAA | coord: 449..579, e-value: 2.0E-38, score: 131.8
IPR001128 | Cytochrome P450 | PFAM | PF00067 | p450 | coord: 747..1130, e-value: 5.4E-55, score: 187.0
IPR008271 | Serine/threonine-protein kinase, active site | PROSITE | PS00108 | PROTEIN_KINASE_ST | coord: 1294..1306
IPR003960 | ATPase, AAA-type, conserved site | PROSITE | PS00674 | AAA | coord: 551..570
IPR017441 | Protein kinase, ATP binding site | PROSITE | PS00107 | PROTEIN_KINASE_ATP | coord: 1181..1208
IPR011009 | Protein kinase-like domain superfamily | SUPERFAMILY | 56112 | Protein kinase-like (PK-like) | coord: 1167..1432
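
The coordinates in the InterPro annotations can be sorted to read off the order of the domains along the predicted polypeptide. The sketch below is illustrative only: it uses one representative hit per domain, copied from the annotations above (SMART SM00745 for the MIT domain, labelled "MIT" here as shorthand, and Pfam PF00004, PF00067 and PF00069 for the rest), and the function name `domain_order` is an assumption.

```python
# Representative hits copied from the InterPro annotations: (name, start, end).
hits = [
    ("Pkinase", 1176, 1426),  # Pfam PF00069
    ("p450", 747, 1130),      # Pfam PF00067
    ("AAA", 449, 579),        # Pfam PF00004
    ("MIT", 246, 323),        # SMART SM00745 (shorthand label)
]

def domain_order(hits):
    """Sort hits by start coordinate and list neighbouring hits that overlap."""
    ordered = sorted(hits, key=lambda h: h[1])
    overlaps = [
        (a[0], b[0])
        for a, b in zip(ordered, ordered[1:])
        if b[1] <= a[2]  # next hit starts before the previous one ends
    ]
    return [h[0] for h in ordered], overlaps

names, overlaps = domain_order(hits)
print(" -> ".join(names))
print("overlapping neighbours:", overlaps)
```

With these four hits the regions do not overlap, which matches the TAIR 10 hits listed earlier: the AAA-type ATPase and cytochrome P450 matches cover separate stretches of the query.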

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr015962.1 | Sgr015962.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006468 | protein phosphorylation
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0015630 | microtubule cytoskeleton
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0016887 | ATP hydrolysis activity
molecular_function | GO:0020037 | heme binding
molecular_function | GO:0005506 | iron ion binding
molecular_function | GO:0004497 | monooxygenase activity
molecular_function | GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
molecular_function | GO:0004672 | protein kinase activity