Sgr022791 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022791
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Adenosine kinase
Location: tig00000589: 1801831 .. 1824541 (+)
RNA-Seq Expression: Sgr022791
Synteny: Sgr022791
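
The Location value in the overview can be read programmatically. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state.

```python
import re

# Matches strings such as "tig00000589: 1801831 .. 1824541 (+)".
LOCATION_RE = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end, strand = parse_location("tig00000589: 1801831 .. 1824541 (+)")
print(contig, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 22711 bases on the plus strand of tig00000589. If the coordinates were instead half-open, the span would be one base shorter.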
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCAACCCTCTTCTTGACATCTCGGCCGTCGTCGACGATGATTTTTTGCAGAGGTATTCCCTCATTGTTCCTCGGTTACTTTATCCAGCTTTGCTGTGATTTCCTATAAAACCCAGATCTCTATTGTACTTTTTTTTTTTTTTTTTTTTTATGTATCGTGTGCGCCTAGCTTTCCTGTTATAGGCTATCTCTATCGCCAATGTTAATTTCGTTGTTGCTTTTCGTTCATTTCGTAGCTTGTTGGTTGTAGTGTTCAAGATATTAATGCGTGTGAGGAGAGGATTTGTTAATTGTAACTTTGATCCGAAATAGCAACACCGTGTAATGTTTATCAGGGTGTTGGGATTTCACTTTTTATTGTATTGAATAACTCATTGAATAACTCATTTCGTGCAGATACGACATCAAGCCGAACAATGCTATTCTCGCCGAGGAGAAACATCTTCCAATGTAGGGGATTGGTTTTCAATTTTCTGACGGACATCTTCCCCCGCTTTGTTTTATTTTTATATTTAACTGGCTGGTGTCGTGAACTTTATTCTTTTAAGATAACGTGTTCTCCTTGTTTTAACTTTTAAGAGTGGATGCCATTAGGTCTTGAGAGAATCATTTATAATATCCGACAATGTCTTATGAGTTTGTTTGAAATCAAGTTTCTCATTACAATTTAGCTGATATGACATCAATGAATAAGGGAAATAGGAAGATGAGTGCTGTCAGAAGTTTGATGTTAAAAGATACTAGATCAACCAATATTGATATTACTTGCTGTTGTGTCTTGATTGAGGATAATGTTTATTCTAACTTCTACTTGAGGTTATTTACTTAATGGAGTTTTTCAATATGAATCTTATGTCAATGAATCGAACTACAGGACTAGTTTCAAAAACGACCAGAGAACAGCAAGAAGCTAATAATAGCTTGGTTAGGATGTGATGTGTGAGCTTAAACTCAATTTTGGCAATAAAAGTATACTGATGTCGTCAAAATTTACTAAGCCTCAGGCATACTGAACTTTAGGGGAAAAATAGAAAAAAAAAAAAGAAGAACCAGTTTGTCTGGCTTGGATTGTGTAAGCATCTGATACATTAGGCGTTACAATTAGATTGTTGTAATTTATTATTTCATTTTCATTTTTGGGAATGCTTTTGGAATGTTGTGGACATGCTCTGACCCATTAACAGGGACAAGTAAAAGGGAAAGATAAACAAATGATAGTGGTGAAAATTTAGAGTTGTTGGAGGAAGAGTTCAAACAAACATAACAAATGAAATCAAATAGTTACAACACAATGAAATGAAAACCTTTTTTTTTATGATAAAGTAAGGGTCATAATTTAGTTTGTAATAATGATTGTCTTTTTTCTTCTTCTTCTACTTCATTTTTCCCCCTTATTTTTTTCTAATGTTGGATTAATATTTATGAACTTTTATTATACACTGTTTTTTTTTTTAATTCCAATTTCCCAGAAAAATTTCCCATTGAAGAAAATTTTATATTCAGTAGCTTGAGAGATCATCTATGACAGCTTTATGATGGTGATGGAAATTAGGAAGTGGTGGTCTTACTTCAAGCCACAGTAGAATTGTAATACTTATTCAATTGTAGTATTGATCCCTTCGTGCATAATTCTTTTTGTAGGTATGAAGAATTGGCAAACAATCCTAATGTGGAGTACATTGCTGGAGGTGACGTATGGATGATTGAATATATTTGTGTTATTATGGTTTTGGCACATTTGTGAATTTCATGTACTTTTATTAAGTGCAGGTGCTACACAAAACTCAATTAAAGTGGCTCAGGTATACTAATTTTGTTCCCTGGTGAATATACACATATCCCTCAAGTTTTATTTTCCATTGTGGGATGAACCTTTTCTTGTCTTACCAGTGGATGCTTCAATATCCTGGTGCAACAAGCTATATGGGTTGCATCGGAAAGGACAAGTTCGGGGAGGAGATGAAGAAAAACTCAAAAAGTGCTGGAGTTAATGTAAG
TTCCATATTCTATCACTATTTTCCCATTTTAGTGATAATTTTATGTTTCAAATTCCCATTATTAAAAACTTTTTTTGCTCGTTTGGCCATTGGTATTTGAGAAATTAGGTTCAATATTATGAGGTTGAATCTACACCAACAGGAACCTGTGCTGTTTGTGTCGTGGGTGGTGAAAGGTTAGTTAAAATCTTGCTTGCTTTGATGTTTTCTCTTCCATTAAGTCAAACATATAAAGTTTGCTTCCTTCATAGTGAAAGGTTATTTCTCAAATTGTTACTTGATTTAACCGGTAACCAGGCGTATCATATTTCAGTTAGATGTACTAACAAGAGTCTGTTTTTATGACATACAGGTCACTTGTTGCCAACTTGTCAGCTGCAAATTGCTACAAGTCAGATCATTTGAAAAGACCTGAGAATTGGGCATTAGGTAAACACAGAACTCATATTTATGGCTTTAGAGGGCTTTTTGGAATTTATTCTTAAAAATCTTTTTTTGTTTTTTAGAACAAAAATTTAGTGAAAACATATTTGGTTAACTGTGATTTAGAAGTTCCCTTAAAAAAACTGTTCTTTAAAAGCTCTTTTCCCTTGTTTCTCCTTAATTTGTTCTCATGGTGGCCTATTTTTCAGTACAATTTGTTCTTCCCTTGAAAATCTTGGACTAGTGAATAGTTTTTGTACAAGAACCTTGTACATTTTCAATCCTCTTAATCATAGTTCTTTTAAAAACTGTTCTGCTGAACAAGTACCAAACAACCCCATGATTTTTTTTGAAATATTTTAGGGACCTGTAGATAGGATAGACGTGCATTCTTACGAGCTAACTTGTTCATAATTTGCCGTGAGGACAGTTGAAAGGCCAAGTATTTCTATATTGCCGGGTTTTTTCTCACTGTATCACCAGACTCCGTACTGCTTGTAGCTGAACATGCAGCTGCAAACAAAAAGGTACACCATGTGACATAATCAATATAATTATTGTCTATCAAGGCTTCTGAACAAGTAACTAATATTGCCAATCATTGCAGTATTTCTCGATGAACCTGTCTGCTCCATTTATCTGTGAGTTCTTCAAGATGCACTGGAGAAAGTTTTGCCGTAAGTATTAATTATGTTAATTGGTAAATGCAAGGGACTTTGACATGTGGACTCAGGACATATCTTCTTTTTTGTGTTGCAGGTATATGGACTTCATTTTTGGTAATGAAACTGAAGCAAGGACGTTCTCAAAAGTTCAGGGCTGGGAGGTAATGGTTTGCGTGGTCTCCCATAAAGACTTGGATAGTGGCTTTCATTTTATGTATAAATAGATTGGTGAACTTTTTATGACCAGACTGAGAATGTTGAGGAGATAGCTCTAAAGATTGCTCAGTGGCCTAAAGCATCAGGAACACACAAGAGGATTGCTGTTATTACTCAAGGTCCAGATCCAGTTATAGTTGCTGAGGATGGAAACGTGAAAAAGTTCCCAGTCATTTTGTTGCCAAAGGAGAAACTTGTTGACACAAATGGAGCAGGTATGTTCTGTGCCGCCCTTGAAATTTAATGGAGCTTAGTGAACATGAACCATGCTCAAGCACATCCAAGTTTGACTTTGATTACATTTTATCTATTGAGATCTCTCTCTCTGTGATACCCAAAAAATTCACTTTTATGATAATTTTAGTTACATGAATGAACGCCAGAAACATGTGCATATACATTCGTCGGTTTTAAGATCATATTTTCATCTCGTTTTTTTTTTCTTTTTTTAGGAGATGCATTCGTCGGAGGATTTCTCTCTCAGTTGGTTCAAGATAAACCCATTGAAGACTGCGTAAGAGCTGGCTGTTATGCATCAAATGTTATAATCCAAAGGTCTGGCTGCACGTTCCCTGAGAAGCCTGATTTTAGTTGAGCATTTTCGTCTTGCGAGACGTTTTAGCTTCCTCTCCTAGTAGTACTGAATCTCGAGCCTATTACCAATTGTTTTATCTTTTAATTCTTGTCGCTTCCGCT
TCTTCCACCACTGCTCGTGTGGGTGATGGCTATTCTGTTCATCCCAGTGACGTCCACTATATCCCTGTTATATTCAGTTACTTTAAGAAATGAGATTTTGATTATAGAGTCCCTTAACTTTGAAGCATCTTGTACTAAAATGTTAAATGATTCACAATTACTATATGGAGAGGTACTGTCTTTGGGACCACTGTCCTGCAGGCACACCATAAGTTTGGCCTTTAAGAGCAATTTTGTTAATCTTTTCTGAATTTTCTTGCATTTCCCTTCCATGTGATGGCATTACAAAACAAACAATGAGGCAGCAAACTGAGGTCTGCAAAAGCTACAGCAGCTTTTGGAGCCATGGTTGCCTCTGAGACCCACAATTCACATGCTTTTGCTCTGGTCTTTATAATGATTGATGTGATGTTCTTGTCATTCCGACAGTTTTGGGAATTTTACATTTTCTTTATCTTATCTAAAGACATCTCGTTTGAATGATGTCTCTTCTTTTTTCTTGGAACTGAAAAAATAATGCAGCTCTTGTCGAGTTCTGTTTGGCAAACCTCGAATTGAGCTCCCATTATTAATGGGTTTCGATGTTCTTTGCATTACTTTTTTCAACGATTGATTTGGAATCATGTTAAATGATGAAGTCTTTAGAACTTTGTAATGGTTTACGTGGTGCCTTTTTAGGAACTCCAAATTTATTGTAGTATGAATGATAGTTATCTGCAAAAAAAAAAAAAAAAAAAAAAAGCCCACACTTAATTATATATTTTTGTTTCACCCTTCTAAATGAAGGATATTAGAGAAATAATCCATTGTTTTTAATAATAACGTACAATTAAATCCTCTATTTGGCAAACGTAACTGTTCATCAATTGAGGCAATGCAACAATTCCAAATCTACTACATTTTGTCAATGTTATGACTTGTTTGATTTATATCGTGCCTAATTCTAATAAAAATTGGATATATATATATGTAATAAATATTTTTTAAAATGCCATTTGATGATGTTAGTGGTTTTTTAATGTTAGGGGTGTAAACATGTTGGTTTGATTTGATTCGATTTTTTACTAAAACCGATATTGAATTGATCTATTGATATAGAAATTGTTGTTGTTTTTTTTAAATCAACTGAGATTTATAAAATCAACCAAAATTTAAATAAGCTAATTTTTTTGAGTTCAATAATAAGCTAATTGAAACAAATTAAATCTATTTAAACCAAAAGGTTTAAATTTTCAAATTGATTGAACTAATTTGATCGATTCATCATTTATCTACTCTTAATCTTTTGATACCATCAAATATTGGAAATTTTTATTGTTGATAATCTGTTTTATATTATTAAAATACAGCTGTTTTGAGTAAAGATGCAAGGATAAAATTTTTATATGCACTTTCAAATAATTTTCCTTCCCAAAGTTTTGTCATCATTTCCTCTAAACTTTGAAAATATCAAATTCTGTATTTTATCACTTTTTTTAAAAAAAGTATGCATGTATTTATTTATTTATTTTTAACAGTGATATGAGGTATTTTATTACATAACTTCAACTTTTCAGTAATGCCATGTTAGCTTTGGTTACTATCTAATATTGTGGCCTATAGTGAGATATTTGTAAAAGTATATCAAATAACAAACAAGATTTAAAAATTAAAATTGACAAAAATGTGAAAAACTCTAATTAATTTTAATTTCATCAAATTAAACTTTAATTTTAAATGAACAGTGAAATTGATACATTTTTGGGAATTAAATAAAACATGGACTAAAATGTAGACAATAAAATATCTGAAATCCAACCATATCATAAAAGCCTAACTTCACCTAGGAACAAGATCATTAAAAAAATAAAAACAAAAAACTCAGAGCTAGAAATTAAAATCTCGGAAACGAAAAAATAGTTTTATCCAAACCAGTTAAATCTTATCAGAGATTTTTAAAAAAGGAAACTCAATTAGTGATGGGTATTGATCTCAAGAAACGTGTCCGTGCCTCATACGTG
TCGGATATTCCAACACTAGATAGACACATGTCGAACACTTATTAGCACAATAAACATGTTTAGTTATTAGTTGTACAAATTGAATATAAGTCAAAGACTTGTTAAGCACATATCTAACACTTAATAAGCATAATAAATAGACACAAATAGTAATATAAGACAAAAATAATAAATTTTAAGATGAAATACATCAAACTCATTTTTTCAAACATATAAATGCTTGCTTATTGCCAACTGCCAACTGAACATAATTCAACTAGTTAAGAAATTAGCGACTTCTCATAAAAGTTAGAGGTTCAAATCTTTACCTCTCGTTGAATTTTTTTAAAAATGCTTATTGAAAACGATATATATTTTTAAAAATATATTCAATAAACGTATCATTGTCCTGTCCATATCCTGAATTTTTGAAAAATGATGTGTCATTGTGTCCATGTTGTATCGAGTCTGTATTCGTGCTTCTTAGGTGTTAGATTTTATTTTTATCGCTTATTCAAAAGTTCAAAGGTTTTTTTTTTTTTTTTTTGGTGAAACTATAATTTAAAAACGAAATGTAACTCACTCAAATAGGTTGAGAGGTTTGAGGAAAAAATATATATATATATATATATATAATTTTAGTTAAATTATAAATTTAGTCACTATGATTTGACAATAGTTATAATTTGATTTCTGTAGTTTTAAAAATTTCAATTTAGTCCCTTTATTTTGGCTTAATTTTAATTTGGTTCTTATAATTTGGACTTCGTTTCAATTTGATCTTTATAGTTTTAAAAGTTTTAATTTGATCATTATATTTTAGTCAAAATTTACAAAGGGTCAATGTCGTCATCATATTGGCACATTTTTGTTACTTGACAATAAGTTGATATTTTATATTTACTAATTATTTTAATTGACTGTATTAAAGTTTATGACTGAGTAAAAAATATAAGGCAATAAATAGGTTTTTTAGGTGTGTGAAACGCAGTTTACGAATGACTTGTGAGATTTTATTAAACTATAAAAACTAAATTAAAACTAAGTTCAAATTGTATGACTAAATTAAACATTTTAAAATTATATGGACCAATTAAAGCTTTTTACAGATTATAAGTAACAAATTTGTAATTTAATCTATCATTTAACGTTTGCAAATAGACAAAAAAAAAAAAAAAAAAAAAGCACGCTCTGGTAACGTCTCTGTTCGCTGCCACGACCTTTCTGAGGTCGTCTATCCCTATTTGACAATTGAGAATCACGAGCACTGTCTTACTCTCTGCTCCATTGCAACTCTCTCTCGCCCGCCACTAATCCGCCGTACTCAGATCATCGGACGGCATATTGCACGAATCCGGCGCCGGAAACTCTCCCTCTCACGGCGGTGGACCAGAAGGAATTGCAGGCCACAACCCGCCATGAGCGACGATCATCCTAGGCTCGATAACCTCCGCAGTACGGCCCAGCTTCTCAGAGAAGCTACGGCGTCCTTTACCTCCAACCTCTTCACCTTCCTCTTCCTTTCCCTCCTTATCCTCTCCTTTCGCGTCGTCGTCGAGAATGGCACCCAATACGTCACCTCCTTCATTGATCGCGACCCTTCCCTTAAGGCTCTTCTTTCTCGCCTTGACATTGCAGGGGAGCAACGCCTCCTAAGATCTTCCGAGGACCCGTCTATGCCTGCTGCCGTTGCCCGTCACCACCGTCGCCGACCTTTCCTACATCTTACCCGCGTCGGAACCCTAGATGATGATATCTTTTCCGGAGATGGGGATGACGAACGCGGCCTCTTTGGGTCTAATCGGAATCACCCACCTAATGCCAGCTTCCTGATTTTCACCCAATTCGGTTCGATCTCAGGGTTTTCGGATCTGGTCGTCGATAATGGAATTAGGGTTTCGGAAGTTGTTCGGCCGGGAGTGGGATTCAAGGCCCGGAGTTCATCTTTCTCTAATCATAAAGACAGCGCTGATGATCAGGAAGAGAAGGATAGAAGACCTGGAGGGGAAAATGTGCACCAA
GATATGGATAGGCTTGTAGATTTGCAATTTTTCGTCAAAGGACTGGAACTTGGTCGTCGGGACGCTTCCGCCTTGTTCTTCTTTGTGAGTTTTCTGTCTGCAGCCTATATCTGGGTGATGCTTGGCTTTCTTGTTACATATTCATGGGCTTCGGGCATTGTGTTCATCGCAGTTCTTAATGATCTAATAGGGAGGTTTGGTTCATTTGTTGGTATGGTATGGGATGGATCAAGATTGGGGTTCAAGAGGCTGTCTGGGTTTATCTTGATGAGATGGGCTGTGAGAGATGCGCTAACTCAGCTCCTTGGGTTGTGGTATTTTGGTGAAATTGAAGATCAGTATTCATTTTTCAAGCTATTCGTGAGGTTGAAATTGATGCCATTTTCCATAATGTCTCCTTGGATTCGTGGTTACGAAAAAGAGATCTCTGGGTTTCTGTTTGCCTGGTTTTTGCTAGATACTCTTGTTGCATTCATCTTTGCTGTGGATGCTTGGGTTGTCATTGTGGATGCAAGGCGGACAGGCAGAGAGATCTTGAAGGAAGGCTGTTATTTGATATTGACAATGTTAAACCAGGCTATTCAAATCAAGTGTTTGGAAGCCATCTGTTGTGGCTCTTTCATGAGATGGGCTCTTGCTCGAGTTTGTGGAAAACACGTCGCCATGTTTTTTCAGTCCGTGGGAGAGGTCTATTTTATGGTGGTTTGGCTTATCTTCTACTTTGCTGCTAGATGTAGAGATGCCAAAGTGCAGGGACTGAGGTTTGGTCGTAGAGAGCTGGAAGGTTTGATCGAGGGGGTTAGATAATGGTGGATTGGCGGCTTTATAGGGGCAAATTGCTTAACGCAAAGGAAAGTTACCGGTAATTTTTCCATTGCTCTGTGCATGCCTATCTTGCATTTCCAGATATCCAGCCGTACTTGTTTCTGTAGTGTCACGAGCAATGTTCTGTATGTAACTTCTTTCTAGGCTGCATGCTACTTGTGTATATTTTACATCAGAATACTTGACATGAATCAAATCCATCCCAGTGGGATGCCTTGCTGACATATCTGTTTTCTGGTGATAAAAGCAAATTTTAGGACTGTCATAAGAATCTGCTCTGTAGATAGAACTATAGGTCTGTTTTAGTATTTTCATTTCATTCAGTATGCTAGATTGAACTATATAGGCTTTAATATGGGAAAGCATAGAAGATTTTCTACTCGTCTGCTGGCATTCTTAGTGTTGATTACTTAAATCTATCTCGTAAATATTATAACTGGTCATGATAATTGTCAAAGAAAGTTGTGCAAATGTTGTAAACACCCAATTATTGGACTATCATTATATCCTTTGGGATCATTGAAGACTGCTGGACGAGGAAATCACCAAGTCAAGTTGGTCTTTTTCTTTATGGATCTGTAACTTCGGGAATCAAAGATCAAGGTAATCATCTAAGTAAGGATTTTATGTGCTCTTGCTGTTAAAGTAAAAACTAGATACCGGGGTTTTGTCACAGCCAAATTAGTTTGGGAGGATGGGAAGTGTGGTCTTTTTCCTTTTTCCTCCATTGTACAATCTAAATCATCTGAATTGAAATTTTAACACCTTATAGTTATGACGCACAGAGGTTTCTTCATCCAACTTTTGGCTAGTCAACCTGAGTCTGAAAGTAGAGAAATTAACCATAGTCTGCGGTCTGCTATTATTATAGATTTACCTTTCCCCAGCTCCTCACAGTACGTCAATGTAATAAATGCATGTTACTTCTGCACAAGACAAAATTTAGCACATCCAGATTAAAGAATAGCTCAATAATTTTCGGCGGGGGTTGGGGGTGGGTAAGTGTGTGGGTTCAGATTGATTTTGTTCCAAAACCGATGTCATAACAACATGTCAATTTAAGCGGGAAGAAAAGAACTTTTTTTTTTAACCAAACCATACACTAACCAAACAGATTGTAACCAACAAAATCTATCCAAATTGTATTGGTGGACTACCTTCCAGTTTTATAATTT
AGCAAGTTTAAACCAACAGAACTGACTCAATTTGGTTGGTTTTTTCGATTTCGACCTGCACTCCTAATTGCTAGTAGATTTTTCTTTGTTCTTTTGTTTTTCTTTTGGAGCAGAGGACAATAGAGATAGAGAGATTCTGAATTGAATCATGGATAGAAATCTTAAATTTATTCAAGTTTACCCATTTGGGTTTTCAAGTGGAAGGTTTGAATTTGTATGAATTCTCGCCATCTGAGCCAAATGTGGGGAGTAGAAACATTGGTGAGGGGCACAAGAAATGCTCGGATTTGCTCGACCAACCATTTAGATTTTCAGTACCTTTCTGATGGGTTCTGACGTATGACAAAAGGGTTCACATTTCATCGTACATGAACTTGGGAAAGGAAAAATTCAGATATTTTTTTACAACTTTTAAAATCAACATGTACGTTGTGTGGCTTGATTTGCTTGGCATCTTTCAAGGTCAACCATTTTTTAATATGCCATTTTTCTTTTATCACTTGGGATTATTGTGGACGTTATTATCAAAGTGATTGAGAACTGTTTGTACAAAACAAACATATCTCGAGACCTGTTCTTCAAAATTTTCCTACCATATTCCATTTCAACTTTCTTCTTTACTGTTCAGACAACTTTTAGAGGTTGTTTGTCATGGACAACCAAGGGATGAATATCCAAATTATGTGTTCACTTTGTAGTACTTAAAGATATTAAATCTAGTTGATTTTTTATACTCTCTTGAAATTTTGAGATTATTATAATTTTTTTTTAATCTCATTTAACAATATTAAGATCTACTTATTGAATTTTTACATTTCACTTGAATGGGGCACCAATCGAGATCGGCCATTCAAACTTGTGGTGCAGTATAAGTACAATGAGCAAACTAAGCTTGGTATGACACGCGCTCCTCCTTGGCGAGGTAGGATGAGCACACCAAGCTTAGTAGGGCTCGTGGGACCTCTTCTCTTGCAAGGTAGGATGAGCATAGCATGGCCTACAAGGTAGGATGGGCATACGATAGCCTACAAGGTAGGTAAGAGAGTTCAATCCAAGTGTGGTGCTATCCTTGTGTCCAAGTGAGGAGCTTTCAAACTAGATAGGATGACACCGATGGACCTAATAGGACATGCACTCTTCTCTCGGAAGGCAGGATGAGCACATTAAGCCTAGTAGTAAAGTATGTAAAAGAACTCGCCCCAAGTTCGGTGCCAACTTCATGCCTAGGTGAAGAGTTGTGTAGTCTCCACCACCGACATAGTAGCACGACTCAGTGTTCCAACAAATGTGCCTTTAATTGAGGAGAAAGGATATCCTTCGAAAACCCCTCTCCCTTGATGAATCAAAGGATAACATATTTTTGCTAGGAACTTAATATCTATTACTAACACCTACTTTATAGCGGTGAAATAATCTTCATTCCCTTTTATAAATACGAGGTGCGTGATTACTATAGAGGTATGCGATATTTTTCATCGTTCAATCTAAATTAAGCATTGAAAGGTGATTAATAAACACCACATAAGTACTCAGTTCCTTTTCCATACAGTCATTTTCTAATTTCTTTTTTTTTATCCTTTATCATTTCTTTTTATGATGGTTGGACCTTCTCAACTAAAATTTTTCAGCATCTAATTTAATCACTTAGGATTGGCAATGACGTTTTTATCAAAGAGATGGAAAAATATTGGTACAAAGGTAAACATATCCAAGTTCCGTTTTTCCAAATTCTCTACCATATTCTCATTTCTAACTTTCTTCTTTACTGTTCAAAGAATAAAGTTGTAACTTAACGTAGAAGAGATCAAAGAGGTTTCTTCATGGTTAATCCAGGCGTATCTTCCTTTTCATATCCTCTAGAATTTTGAGATTACGATAATATTTTCTTCTTTTTTTTTTCTTTCTTTTTCATATCAACTTACCATTTAATAATATCATTTACAGCCTCTTCTCAACTCTCAAGTTCCCTTCACTCCCATTTTCACGTTGAATAGTTCCAT
GTGCTTTAATAAATGACATTAAAAATTAATCAAATTGACTGAAATATTAAAAAAATGAGTTGGATTACGAGAAGTATTGTTTAATTTGATATAAACAGAAACATGGTTTTTTTTTTTAATTAAAAAACATGGGCATTATTGAAATTTGATAATTCTATTAAATTTTCATGGATGAATACCTCAATCCAAACATAATAATCTTTCGTTTTCGAACATTTTTTTCAAGATATCTATTGGGAATTCCTCTTAAACAAATATCATAATCTTACATTCTTAGGTATTAAATACTCAACCGTCCTAAACTACATGGCATTGGAGAATACTGTAAAGAAATAGTGCCAAAGAAGGTCTATACTAATTTTCACAAGTTATTCGTTCTGATAAAAACGACTAGGGAAAAAATGCAAACATTTGAGGTTGTGAAAAAATATATATATATATATATATATATATTTTTAAAAAAGACTAGATCCCCATTTGAAAAATAGCTTTTTTTATTTAAAAACTCCGATTGACAATCGATCATGAAAGACAAAATCAAATTTTATTTTAAAATACTCATATACATTAGTAAAAATAGATATTCAAGCGACACTGATACCTACTGTTTGTTCACACTATGAATTTTCTTTTATAAATAATTTTTTAATGCTATCGAACCTCACGTCATTCTTAAATAAAACTAATGTGGCTAACAATAAATAAGTAGTATCCATATTACATGTTCCTTCCCTCACCCATATAAAGTTGCATCTGCCCTAGCTTCATTATCAATCACAATTTCAGGGATAATTGCATTTGCAAACCTTTAACAATGTAAATTATTCAAGAAGTGTACCTTATAAGACCATCTCGTGGTGGATAGTACGAGGTTACCGCTGCCCTGCAAGGTCATGAAAAATTACTTTGTAATCCATTCGAGTAACCTGCAAGTAGCTTAATTAACATCATGATTTGAGAATTTGAAATAAAAATGTGGGTAAACTAATCAAATTAACAACTTTTAAGATAGGGTCAATCACATGAAACCAGTAACTTAAATACAAGGAGCGACTTGTAAACTAGACTATAATGCCTAAGCATTAAGCACCATTCCCTTGGACAATGACCACTGCAAGCTAAGATTAATACCAAACCAGGTACCATTTTTCATCTTTATACCAAACAAAACTTGTTATCCAAGAAGAACATCTTTGAGTTCTGACATAATGAAGACAAGCAGGGACCGTTTCCCCTATTTCCTTCTGTGTGCTGGTGCTGGTAAGACAGTCAATGATAGTTTCCGCATTTAATCACCAGTAGCCAACAGCCCTCGAATCTTGACATCATAGTATACTTCTTCGACTCTTCGCAGGTCATACTTCATGCCTATTGCAATGACATGATTCATATGAGAAGATGAAATGCAGAAGGAAAAAAATATATAGAAGTAAAGACTGTACCATCGAATTTCTTGCGGAGAAAATCATTGCGAAGATTAAGCATACGGAAGGCTGCATGAAGATCTGTAAAAAACTTGAGCACCTTTCTTGGACAATCATAGTCTCCAACTGTCACTTGGTTGACAACATATCGCGGCTACAAATACAGTTATCACCAAATTTTATTACAAGTTATAAATGCGAGACCTTATGCATGATATATACCAACTCAAATGAATATCTCCTTTATCCAACTACTATGTAAACTTTTTGTAAGCCTGATATATTCATTTTTCTCCAATTTACAGAATCAATAACAATTCTACAAATTCTCACCAATTCATTGGACATGAAACATATTCCTGCAAAATTAATAAAAATATCCAAATTAGCAACTCGGCATAGTCATCAAATTGTTGAAGAATGCGAGGACAGACAGCAGACTGGCTCAGAAGTACAGAGCACAGTTGAATGGAAAGCGGACACAGAAAACTCTAACATGAACTACCAATTCCATCATCAAACCAACATGATTCTGTGTATGGTGCTTTTTATTTTTATCAATCACATCACAGACTC
TCCTATGGCCATATCTCCCAATTATTGGAACTTGGATTCATCTCTGATAAGTAACCATGCGATTCATTAAGAATAAAAAAGTTATTGTTGGAACTACGTGAACATACTATTAAGATACTAATTATTTTATCTGTATGTACACCATTTTTGCATTCTCTTGAAACTAGGTTAACGTACCATCACCAAGAAGGAAAGATTTTTTTTTTTAAAAAAGGGTGATGCACTTCTACCAAGGAAAACAAAGAAATGAAAGTTTCTTTCTTCCAAGAGAGAGCATGGGTTACAAAACTTCTAAACAGAAGGAACCAATATTTCAAATCACAAGGCATGCTCCATGGCGAATCTCTCCTTGCTTGCCTAGAGGCCTACGTATAAAAAAAACATTGCCGGACTGAAGTGGTGTGCAAGATGTAAAATAAAAAATTATGATTACATATCGAAAATAGAAAATAGGAATACATTATAACATAAAAGAAATATAAATTAGACAAATAAGTTTGTATGTGATAAAAATACTTCAATGCCACGAATTTTCAAGAGATAATGAATCTTAAAGGATTTTAGTTAAAAGTCCTAAAAATGCAGAAGGTCCCGTTTCTTCATGGAGGTGCAAAAGCGATGAAAAGGCAACCAAAAAGCAACCAAAAGCGAACTGATAGACAACTAAGGTGACCCACTCAAGTGAACGTGCTAGCCAGAGACAACCATCAGGCGCACTACTTCACCCTTGCTAAAGTGATCGACTTGCCTTGAAAACACTGAAAGCAATATAATCTCAATGTCTCATTGAACCTTGAAGTAAGAAATATATGAAACTGTTGAAAACTCACCAATAAGATAGTCTTCAACATCCAAACTGAAATCAGATTCATTTACTGCAAATAGACCAAAAGAATATAATCATGAGTTAATATCCAAGTTCTTCATTTCAGAGTATATGTAAGATAGGCATAGCATAATGAGATCATGAACAATTCATTAACAGCCGAAAGTGAAATTTATAAGATTAAAAAGCATGAAGCATATACATTTGGCAATCAGTTCCCTCTGTTCCATAGTGGCGGAAACATTGATATAGAGAACAATGATTTGGAAAAAGGATGAATTAGAATGTTATTTAGAAGTAAATATTGAGGTAAGATATATAGTAAAGAAAAAAAATACGGTTAAATTACAATATTGAGGTTAAATATGCAAAACAATATTAATATATGCATGACATGACACAGTAATAACGCCAACAAAGAAAAAGTAAAGTAAAGGTGCTCATCTTCTTTAGCAAGTTGCTTTCAGTTTCCAGATAGGCTTGGGAATCTGGGCAAAGGCACTATTTTTGTTTTGGGTGAGGAAAAGATCTCTAATTTTTTATGTTTCGGTGAGGACAAAATTCTAAAATGTTCCCAACGAGGATAGACATTAATTAAATCAATCCTCAAATTTCTTATACAATATTGATTAAAGATTTTCTAGATTTCTTTTAGATCAGAAAAAACTAAAACTAAAAGGCGGTGCTATGAAGGACTACTGTAGTAATCTGGGTATGGAGCACATAAAAGATATTCCATCTTTTTTGCAACTAATCCAGTCAACCTCCAGAAGACCTGTATCTAAATTCTAATCATTTTCACAACTGACTACAATCCTGAACTGTGTGTTGTTTAAGAGGATGTCGGATACCACCAAAAAATTATGAGGACAACACAAAATTGATGGTTACTTCAAATTGTTTTAGTTATATAGGCAGTCTTGTCTGAGAGAAAATTACCAGCAAACCCTCCATATGACTGGTTATACTGCACAATCTAGTTGAAGGATTGAGCATGATGTAGGAGAGCTGGTGACTGTACAATCTAGTGGAAGAACGAGTAAATTCATCTATTCAGTTTCTGTTTTATTGGTTCTACAGGGTTTGTATATCAATTCAATCAGGTCATCGAATAGAATATGATCATGATGTTAGATAGTTCAAACATGACACAACTGCCGAAGAGAGACACAGC
ACAGCATCCTATAAAGTTCACTAAATTCCAACACTTACAGTTAAAGTCAAATGTGAACAGAAAAAATACAATGACAAAGAACATACACCCAAGTTTTTCCTCAGCTTCGGTGTGCAAAAGAAGTTCTCCTGTTTCTAGCCAATGAATAAAAGCAAGCAGTGAAACAGCTGTCTGCGTCTCACTCCTCCAGTCTCCATGATACCTAAATGAAAATTCAACGAACGCACTAGCAATTTGCCGAAAACAAGATCCACAGAAAGACGAATTGTGAAAGGATAAACCAAGAAAAATGTCAACCATGATTAGTTGTCTAACCTATAGTAAAGACCAGGGCTTTCATGAAGAATTTCAGCAAGTCGCTTGTAGAGCGACTTTAATAAACGAACCTCAGCTTTTGGCTTTTCAAGAACCTCTACACCAGAGAAATTAGGGAAACTAGAAAAGTCAGATTTTGACAACCTTAATATGGTGATCCGCAACTAACTACATTTAGTCTCATGAGTCAATAAGTTGAACATAGGGAAGCAGAACGAGAGAGAGTCCTCGTTTCCCACAAAACCCAGAACCGCACAAACACAATCCTCGAAACCAATGAGCATTATATAGACCATTACAGTATCATGCAATGAAAATTCATACCAGGAGTCGGACGAGACTGGTGAACCAAAAGTAAGCTGGCCTGCATAAGCCTCGTGGAGGACTCGATCTCCATTGCCACACTTCGAATGCGCTCCCGTAAGCTTCCAGACTCCTCAAGCTGAACTCTGAAATCCTCGAATTGCTTCTCCACCGAAGAAGACGAGGCAGGTGCCTCGGCTTCACTGCCTGCCATGGAAGAGGACGAACAGAAAGAAGAAGCAGGGGCCGATGGATACGTTTCACGTCTGGTCTGCAATGGAAGGCGAGAAACAGTAATGGATGGAAGGGAATGTAAGCAGAGGATTAGGGGAAAAGATTTAGGGTTTGAAGCAGGGTTTAAGGAGTGAGAGAAAATGCAGCAGGCATTTCGAAGCGCTGAGTTCATCGTTTGGCGGTGGTCACAACTCACAGGGTATTTATTGGATGTTACTATTATGTAATGTAAAAGCTTATCAATTGGTCCGATATCTGTGTCTCTTTTATGAACTAAAGGATATTAAAAAATTATATATATTCAAATTAAATCGAACGTAATTCAACTAATTAAATATTAAAATTGATTTTTATTTTAGATTTTAAGCAAACATTAAAGATGGATTAATTTTTTAATAGAGTCTCTAGCGTACGAATGACCTTAAAATGATTGTTACCTTTATGGTGTGTTAGATTAATTTACCGTAGAATAATCTTAATTTGTGAAACCAACACATTCTTCATAATTGATTCTTATGGTTAGAAATGTCCCTACCACTTGTTATTTTTATTATTTTTTTAATAACTAACTTTAAATGCTACATAAAAAAGTATGGTTTCGATCGAACTTTAAAGTTAATACACATACGTACTCCCATATTTTGTTCACATATGAGAAGTCGTTTGGTATGCAAACACAATAACTTATTTTAACAAACTTTTTTTTAGGATATTTTAACAATCTTATATTCAAACTTTCTATTTTATTGTAAATTATATTACCATCCCTTTGATTATTGATGAGACATTTAAATTGTCCATTTCAAAGAGTCATTTTCTTCCCAATAATTTTATATCCCTAATTTTATATTATCAAAGTTTTTATTCTGATGTCATCGTCCATTTAATCGTGACATTATCTCATTCTCAATGCAATACAAAAAAATATTTGATGTTAAATTATAAATTTAGTGAATATAGTTTTACCATTTTACTTATTTGGTTTATAAACTTTAAATTTTTTTATCTTGCTTCAATGTTAAAAATTATACTTCTACTCAATTCATATCATTACCAACATCAACGAAATATTAACATGACATATTTTAATTATTTACCTCATCACGTTAAATTAGTCAAATATTATTTAATGGCATCACACATTAATA
CTTCATTAATAATATTAACAATAAGGGTTAAATTTAAAATCTACGAACGTAATTAAAATTTTTTAAAATTTAAAATCTAAAATAGACACGGTTAAAGTTATAATTTAGTCAATTTTATTTTTAGATATATATTTTTTAAAGTACAATAATTGTGAATTATTTTAAAAATTGTACTACATCATCGTCTACTTTTTTTTGCTTGGGCATTTCTGATGAACTGTGACGTCAGTGGGGACCTTTCCGCCTCAGATTGAATTTGAAGCAAAAGCCATAAATTTTTTATTTTTCTTTTTCCTAATCAATATTCAATCAGAAGTCGTAAAAGTTCGGCCACACAACGATCGCAGCCGCTCGATTTTCCACCCTTCATCCGAAATCCAATCTCGCCCGTTCCCCTCGGCCTACGCTCTTTGACCATCGCCGATCTCCACCGTCAAATCCAACCTCTCAATTAACAGCCATGGTTTCCTGATTAAGACGTGGTTTCTTAAACTATAATAAATCGTACACTTAGGAATTGCAGACCGTTCAAGAAGTTTGTGCGTAGTTATTTCCGTCTCCAGAGCTTTCAGCATGTTACTGGGCAGCCAGCCATAGCCATTACTTTTGAAACACTCGCTGGCGTTTAAACCGCGTGGGAAGGGAACTGTGTTATATAAATGGCATGAATTGCGACTATTTCTTTTTTTCTTTTTTTGCCCATTTTCGTTTTCAATGCGTTCATCGTCATCGGATGGTGTTCATTAACTTTCTGTAAATGCACACTGACCTGTTAACCTCTTTTTCTCTGTACCCACCTTGTTCGTTCGAGCTAAACGGAGCTTCGTTCGTATTTTTGATCGTATTGGCTGTGAATTGAAGGTAATTTGCTGCTGAAAATTAAACGAATATACTTGGGTTATGTCAAAATCGTTTTGTTCATTGAATTACGAGATAGTATTGCTATATATCTATTCTCAAGTATTCGTTTCTTGTACATAAGTTGAAATTCTGTTCAGAGTTCTTGAGTTGGGTGTATTAGTAAAGCCCAAATGTAGAGTCGTTTGTTTTGATTTGTTAACGGCACTCAATATGGATGACTTGTTTGTTTGTATCTAGATGGTTCCGTCTGGTTATGAATCGGATGTCATTCGGTGGGGTCTTCGTCTTTTTGACGGTGATTCAGTTTTTAATTCTGGGTATTACGGTGAGATGACAGCTGTAGATGACCATTACCCTGGGAATTATTACAGAGATCATTACAATTTAGAATCCACTTGTGTGGAGAATGATGAGATTATTGCACGCACTCTTCAAGAAAACTTGTCGCAGTTATCAATTACAGAGTCCTCGGGATGCGCTCTTGAAAGAGAGGAGCAATCGCAAGCATCCACATACACAACTGATTGGCACAATCCATTTCCAAGTAATAACAGTTCAGGTATTCAATATGCCTTCTATCCTAGTATTCATCTCAGACTCGTGTAGCTAATTTTTGTTATCTTGATGACTTTTAACTTCTTACTGCAGAGAGTATTTCTGTTGAGGAAGATGCTGAAACTCTGGATCCTTCTAGTTCATGTTCAAGTCCTGGGGATGACGACTTCTCCTATTCCCATGCAGTTGATGGTGAAGAATTATGGAGATTCAATCAAATGATTCCTGTTCCTGTAGGTTTAAATATATTTAGTATCACCTCTCTGCAACGTACCCATTTCTTGCATAATCATATTAAATTATTTGGTGAAATTTCCTCTATAATCATTCATGGTTTTCCCCCCTACCTGTTCTGTACTTGCAGAGTCTTGGAGATGGCTGTTTTTTCCTACTAGTTGTCATGGCTATTGACTTGTGTAAAATCTCCTTATGTAATGTGTTATAGTCATACTGATGACTGGCAACTAATTATTTGTTCTAAAAATGTTTACTAGTACATCCCTTGTGCCTTGCTTTTACTTCCCTGTATGACATTTACATTCTATCTTGAATCCTCAACTCATAGAAACTCAAAACTTTTGAAG
AGGTTTGTGTTTGGTGGTGAATATAGAAACTAAACTAACTAGGACACTTCACTTAATCTTGTTGCTTTAAAACCATCCAAGGCATAGGGTCATTCTTTGGATCATTTATTTTCTTACTTATCTATGCAGCTCAAGTGCTCTTCATATAAATTCTACTTACAATGAAAGGAACTTTTCTTTGTAATTTCTACTGCTTGTTGCTGTAGTTCTTTGTGAATTATAAGAAGTGATTTGCATTTCTTTAATTGGCATTTTTGCTGAGAGTTCAATTTCAGACTAACCCTTTCGGTGAATTGAGCAGCATGTTCCTAGAATTAACAGAGAAATTCCTTCAGTTGATGAAGCAGCTTCTGATCATGAAAGGCTTCTGAACAGGTATCTTCTAAAAATTTTGTACTTTGAAAAAGTTGTTACAGTACAATAAGAGGAATCTACGGGGTCATAGCCTTCTTGATATAGTTACATTCTTTCATTACTGTATTTCTGCATATCACATTTAGATATAGCTTTCCTCATGTCACAATATATCTGGCATTGCGATTAGAAAAATGTTGGTATGATATCATTGTGCTGGATATACAAGTTCCTGTACAATACACTGTTAAATAGATTGATTTTAGCTATTAAAACTGATGAGGAAATGTTTTCTCAATTCAGATTGCAAGTGTATGACTTTGTTGAACGCAAGGTTCAAGGTGATGGCAATTGTCAGGTTCGTTGTTCCCAGCTCTCTCTCTCTCTCTCTCTTGTTCAATTGAATTCTGATTCATATTAAATGGTGGGTAATGGGCGCTGTCATATCAATCCTTTGGATTTCTGTATTTTTTGCCAGTTTCGTGCATTATCTGATCAATTATATGGAACACCCGACAATCACGAATTGGTGAGACAAAAAGTTGTAAATCAGGTTTGTCATCTTGGAAAAACACAGTCCTTATACGAATTAGTTTAGTTATCTTGCATTTGACCACAGCTCTAGATCTTTGATTTGTTTGTTTTCTTGCCCAAAGCTTATGTCACATCCAGAGATTTATGAGGGATATGTTCCAATGGCATATGATGATTACTTGGAGAAGATGTCCAGGTTCTAATACAGCTTTATCTTTCATTTCTTTTCCTCTTTTTTTTTTTTTTTTTTTTTTTATCTTACTCTCCTTCCAAACACCATCTTAATCTCTAGTTCTATGAGGCAGGAACGGTGAATGGGGTGATCATGTCACATTACAGGCAGCCGTGGATTCGGTATAATCTACATAAAATGCATTGCCTTGATTTTCTTTCACAACGAAGCTTTTAGCATTATGTTATGATATTATCTATTCTATATAAATTCCTTTGGTTATGTTCGTGTGGGGATTGATGGTGACGATAACTTATCGTCCTTCATCGAATGAAATCAATCCCACCTCTCTTCTTTCTGTTGCTCCAACACAAAAATTTGAAATACTAAGCTTAATCAATCAATGAATTTACATGTGAACTTCATCGTTGTCTTATCAACATTATCATCATTCATCATCATTATCTTCTGATGCAGTATGGTGTGAGGATATTTGTTATAACTTCTTTCAAGGATACCTGCTGCATAGAGATTCTTCCAAATTCTCAGAAGACAAAAGGAGGTAAACTTATACCTATTTGCAAGTAATGACTGGTGGCATTTTTGCCTACTCCAATAGCTATTTGTTTCCTGATTAAGGTTGCATGTTTTATCATTAGCTTCATTTTTAACAATTCCCGTCTCTTTTGATTTTTTTTCTTCAGTCATTGTAATGTTACATTTATGCTGCAATTAAAATTCTCTCATTGTTTACCTCTTCGATGGTTCGCTTGCTTATTGTGAGCCACAGTACATTACCTCGTTCCTCGGTTTTGTTTGCCGTTTATTTGCATTGAATACTTGCCTTGGATCCTCTTAGTAGTTTTTTTTTCAATGATTTTATGGTCTTTGACTATACTTATACCGTGTCTTTGCAGTAATTTTCTTGAGCTTTGGGCAG
AGGTGCATTACAACTCGATTCATCCTCAGGGAGGTAAATGGTGTTCTCTATTCTGTATTGTGTTTGTCGTTGCTTGAAATAGGGGTCTGATGAATTGCTAAGCCTAGAAAAAAGGAGGACAAGGAGGAAAAGAAAGGAGATTCATTTGCTTTAGCCAAACTAAGTTGGATGTAACACCCTGACCCTTCTCGGTTACTCTTGATTTGGAGATGCCTAATTTGTTTTTGAATGAAAATGAAAGAAAGAAAGAGACATTCACTTAGAAGTTCCTTTTATTTGTTTTTCTAACATGCCAACAAGGGACATTCACTTGGCTCAATTGACTTTGGATCCTTTCAAAATATTTTGAGAACGTTTGATTTAGCAAGCCTATATATGATATCCATGATATTAGGCCCTACCCACTACTCCACAGGGGAACTTTACCCCGAACATCTCAAATCGTTATTACTAGGGGGTGTGCGAGGCTAACAATAAAAAATCTATGTTTATTAACTCGAAATAGGAAAAGAAAATCATGAAAATTTTCAAGTCCTCAGATTCATAATTTAATTGTATGTTTGAGCTTTAACGTTGGTCCAAATTGATCTCCATGATTAAAATTTATGCTGTTGTTTTGATATACATTCAGGTATGCCATCAACTGGAGATTCCCCACCCAGTGAGTTGAGAAAGAAGAAAAGGTGGTGGAAATTTGGACAGAAGCATTGA

mRNA sequence

ATGGGCAACCCTCTTCTTGACATCTCGGCCGTCGTCGACGATGATTTTTTGCAGAGATACGACATCAAGCCGAACAATGCTATTCTCGCCGAGGAGAAACATCTTCCAATGTATGAAGAATTGGCAAACAATCCTAATGTGGAGTACATTGCTGGAGGTGCTACACAAAACTCAATTAAAGTGGCTCAGTGGATGCTTCAATATCCTGGTGCAACAAGCTATATGGGTTGCATCGGAAAGGACAAGTTCGGGGAGGAGATGAAGAAAAACTCAAAAAGTGCTGGAGTTAATGTTCAATATTATGAGGTTGAATCTACACCAACAGGAACCTGTGCTGTTTGTGTCGTGGGTGGTGAAAGGTCACTTGTTGCCAACTTGTCAGCTGCAAATTGCTACAAGTCAGATCATTTGAAAAGACCTGAGAATTGGGCATTAGACTCCGTACTGCTTGTAGCTGAACATGCAGCTGCAAACAAAAAGTATTTCTCGATGAACCTGTCTGCTCCATTTATCTGTGAGTTCTTCAAGATGCACTGGAGAAAGTTTTGCCGTAAGTATATGGACTTCATTTTTGGTAATGAAACTGAAGCAAGGACGTTCTCAAAAGTTCAGGGCTGGGAGACTGAGAATGTTGAGGAGATAGCTCTAAAGATTGCTCAGTGGCCTAAAGCATCAGGAACACACAAGAGGATTGCTGTTATTACTCAAGGTCCAGATCCAGTTATAGTTGCTGAGGATGGAAACGTGAAAAAGTTCCCAGTCATTTTGTTGCCAAAGGAGAAACTTGTTGACACAAATGGAGCAGGAGATGCATTCGTCGGAGGATTTCTCTCTCAGTTGGTTCAAGATAAACCCATTGAAGACTGCGTAAGAGCTGGCTACGTTTTAGCTTCCTCTCCTAGTAGTACTGAATCTCGAGCCTATTACCAATTGTTTTATCTTTTAATTCTTGTCGCTTCCGCTTCTTCCACCACTGCTCGTGTGGGTGATGGCTATTCTGTTCATCCCAGTGACGTCCACTATATCCCTGTCGTCTATCCCTATTTGACAATTGAGAATCACGAGCACTGTCTTACTCTCTGCTCCATTGCAACTCTCTCTCGCCCGCCACTAATCCGCCGTACTCAGATCATCGGACGGCATATTGCACGAATCCGGCGCCGGAAACTCTCCCTCTCACGGCGGTGGACCAGAAGGAATTGCAGGCCACAACCCGCCATGAGCGACGATCATCCTAGGCTCGATAACCTCCGCAGTACGGCCCAGCTTCTCAGAGAAGCTACGGCGTCCTTTACCTCCAACCTCTTCACCTTCCTCTTCCTTTCCCTCCTTATCCTCTCCTTTCGCGTCGTCGTCGAGAATGGCACCCAATACGTCACCTCCTTCATTGATCGCGACCCTTCCCTTAAGGCTCTTCTTTCTCGCCTTGACATTGCAGGGGAGCAACGCCTCCTAAGATCTTCCGAGGACCCGTCTATGCCTGCTGCCGTTGCCCGTCACCACCGTCGCCGACCTTTCCTACATCTTACCCGCGTCGGAACCCTAGATGATGATATCTTTTCCGGAGATGGGGATGACGAACGCGGCCTCTTTGGGTCTAATCGGAATCACCCACCTAATGCCAGCTTCCTGATTTTCACCCAATTCGGTTCGATCTCAGGGTTTTCGGATCTGGTCGTCGATAATGGAATTAGGGTTTCGGAAGTTGTTCGGCCGGGAGTGGGATTCAAGGCCCGGAGTTCATCTTTCTCTAATCATAAAGACAGCGCTGATGATCAGGAAGAGAAGGATAGAAGACCTGGAGGGGAAAATGTGCACCAAGATATGGATAGGCTTGTAGATTTGCAATTTTTCGTCAAAGGACTGGAACTTGGTCGTCGGGACGCTTCCGCCTTGTTCTTCTTTGTGAGTTTTCTGTCTGCAGCCTATATCTGGGTGATGCTTGGCTTTCTTGTTACATATTCATGGGCTTCGGGCATTGTGTTCATCGCAGTTCTTAA
TGATCTAATAGGGAGGTTTGGTTCATTTGTTGGTATGGTATGGGATGGATCAAGATTGGGGTTCAAGAGGCTGTCTGGGTTTATCTTGATGAGATGGGCTGTGAGAGATGCGCTAACTCAGCTCCTTGGGTTGTGGTATTTTGGTGAAATTGAAGATCAGTATTCATTTTTCAAGCTATTCGTGAGGTTGAAATTGATGCCATTTTCCATAATGTCTCCTTGGATTCGTGGTTACGAAAAAGAGATCTCTGGGTTTCTGTTTGCCTGGTTTTTGCTAGATACTCTTGTTGCATTCATCTTTGCTGTGGATGCTTGGGTTGTCATTGTGGATGCAAGGCGGACAGGCAGAGAGATCTTGAAGGAAGGCTGTTATTTGATATTGACAATGTTAAACCAGGCTATTCAAATCAAGTGTTTGGAAGCCATCTGTTGTGGCTCTTTCATGAGATGGGCTCTTGCTCGAGTTTGTGGAAAACACGTCGCCATGTTTTTTCAGTCCGTGGGAGAGGTCTATTTTATGGTGGTTTGGCTTATCTTCTACTTTGCTGCTAGATGTAGAGATGCCAAAGTGCAGGGACTGAGGTTTGGTCGTAGAGAGCTGGAAGATCGGCCATTCAAACTTGTGGTGCAGTATAAGTACAATGAGCAAACTAAGCTTGGTATGACACGCGCTCCTCCTTGGCGAGAAGACGAGGCAGGTGCCTCGGCTTCACTGCCTGCCATGGAAGAGGACGAACAGAAAGAAGAAGCAGGGGCCGATGGATACGTTTCACGTCTGGTCTGCAATGGAAGGCGAGAAACAATGGTTCCGTCTGGTTATGAATCGGATGTCATTCGGTGGGGTCTTCGTCTTTTTGACGGTGATTCAGTTTTTAATTCTGGGTATTACGGTGAGATGACAGCTGTAGATGACCATTACCCTGGGAATTATTACAGAGATCATTACAATTTAGAATCCACTTGTGTGGAGAATGATGAGATTATTGCACGCACTCTTCAAGAAAACTTGTCGCAGTTATCAATTACAGAGTCCTCGGGATGCGCTCTTGAAAGAGAGGAGCAATCGCAAGCATCCACATACACAACTGATTGGCACAATCCATTTCCAAGTAATAACAGTTCAGAGAGTATTTCTGTTGAGGAAGATGCTGAAACTCTGGATCCTTCTAGTTCATGTTCAAGTCCTGGGGATGACGACTTCTCCTATTCCCATGCAGTTGATGGTGAAGAATTATGGAGATTCAATCAAATGATTCCTGTTCCTCATGTTCCTAGAATTAACAGAGAAATTCCTTCAGTTGATGAAGCAGCTTCTGATCATGAAAGGCTTCTGAACAGATTGCAAGTGTATGACTTTGTTGAACGCAAGGTTCAAGGTGATGGCAATTGTCAGTTTCGTGCATTATCTGATCAATTATATGGAACACCCGACAATCACGAATTGGTGAGACAAAAAGTTGTAAATCAGCTTATGTCACATCCAGAGATTTATGAGGGATATGTTCCAATGGCATATGATGATTACTTGGAGAAGATGTCCAGGAACGGTGAATGGGGTGATCATGTCACATTACAGGCAGCCGTGGATTCGTATGGTGTGAGGATATTTGTTATAACTTCTTTCAAGGATACCTGCTGCATAGAGATTCTTCCAAATTCTCAGAAGACAAAAGGAGGTATGCCATCAACTGGAGATTCCCCACCCAGTGAGTTGAGAAAGAAGAAAAGGTGGTGGAAATTTGGACAGAAGCATTGA

Coding sequence (CDS)

ATGGGCAACCCTCTTCTTGACATCTCGGCCGTCGTCGACGATGATTTTTTGCAGAGATACGACATCAAGCCGAACAATGCTATTCTCGCCGAGGAGAAACATCTTCCAATGTATGAAGAATTGGCAAACAATCCTAATGTGGAGTACATTGCTGGAGGTGCTACACAAAACTCAATTAAAGTGGCTCAGTGGATGCTTCAATATCCTGGTGCAACAAGCTATATGGGTTGCATCGGAAAGGACAAGTTCGGGGAGGAGATGAAGAAAAACTCAAAAAGTGCTGGAGTTAATGTTCAATATTATGAGGTTGAATCTACACCAACAGGAACCTGTGCTGTTTGTGTCGTGGGTGGTGAAAGGTCACTTGTTGCCAACTTGTCAGCTGCAAATTGCTACAAGTCAGATCATTTGAAAAGACCTGAGAATTGGGCATTAGACTCCGTACTGCTTGTAGCTGAACATGCAGCTGCAAACAAAAAGTATTTCTCGATGAACCTGTCTGCTCCATTTATCTGTGAGTTCTTCAAGATGCACTGGAGAAAGTTTTGCCGTAAGTATATGGACTTCATTTTTGGTAATGAAACTGAAGCAAGGACGTTCTCAAAAGTTCAGGGCTGGGAGACTGAGAATGTTGAGGAGATAGCTCTAAAGATTGCTCAGTGGCCTAAAGCATCAGGAACACACAAGAGGATTGCTGTTATTACTCAAGGTCCAGATCCAGTTATAGTTGCTGAGGATGGAAACGTGAAAAAGTTCCCAGTCATTTTGTTGCCAAAGGAGAAACTTGTTGACACAAATGGAGCAGGAGATGCATTCGTCGGAGGATTTCTCTCTCAGTTGGTTCAAGATAAACCCATTGAAGACTGCGTAAGAGCTGGCTACGTTTTAGCTTCCTCTCCTAGTAGTACTGAATCTCGAGCCTATTACCAATTGTTTTATCTTTTAATTCTTGTCGCTTCCGCTTCTTCCACCACTGCTCGTGTGGGTGATGGCTATTCTGTTCATCCCAGTGACGTCCACTATATCCCTGTCGTCTATCCCTATTTGACAATTGAGAATCACGAGCACTGTCTTACTCTCTGCTCCATTGCAACTCTCTCTCGCCCGCCACTAATCCGCCGTACTCAGATCATCGGACGGCATATTGCACGAATCCGGCGCCGGAAACTCTCCCTCTCACGGCGGTGGACCAGAAGGAATTGCAGGCCACAACCCGCCATGAGCGACGATCATCCTAGGCTCGATAACCTCCGCAGTACGGCCCAGCTTCTCAGAGAAGCTACGGCGTCCTTTACCTCCAACCTCTTCACCTTCCTCTTCCTTTCCCTCCTTATCCTCTCCTTTCGCGTCGTCGTCGAGAATGGCACCCAATACGTCACCTCCTTCATTGATCGCGACCCTTCCCTTAAGGCTCTTCTTTCTCGCCTTGACATTGCAGGGGAGCAACGCCTCCTAAGATCTTCCGAGGACCCGTCTATGCCTGCTGCCGTTGCCCGTCACCACCGTCGCCGACCTTTCCTACATCTTACCCGCGTCGGAACCCTAGATGATGATATCTTTTCCGGAGATGGGGATGACGAACGCGGCCTCTTTGGGTCTAATCGGAATCACCCACCTAATGCCAGCTTCCTGATTTTCACCCAATTCGGTTCGATCTCAGGGTTTTCGGATCTGGTCGTCGATAATGGAATTAGGGTTTCGGAAGTTGTTCGGCCGGGAGTGGGATTCAAGGCCCGGAGTTCATCTTTCTCTAATCATAAAGACAGCGCTGATGATCAGGAAGAGAAGGATAGAAGACCTGGAGGGGAAAATGTGCACCAAGATATGGATAGGCTTGTAGATTTGCAATTTTTCGTCAAAGGACTGGAACTTGGTCGTCGGGACGCTTCCGCCTTGTTCTTCTTTGTGAGTTTTCTGTCTGCAGCCTATATCTGGGTGATGCTTGGCTTTCTTGTTACATATTCATGGGCTTCGGGCATTGTGTTCATCGCAGTTCTTAA
TGATCTAATAGGGAGGTTTGGTTCATTTGTTGGTATGGTATGGGATGGATCAAGATTGGGGTTCAAGAGGCTGTCTGGGTTTATCTTGATGAGATGGGCTGTGAGAGATGCGCTAACTCAGCTCCTTGGGTTGTGGTATTTTGGTGAAATTGAAGATCAGTATTCATTTTTCAAGCTATTCGTGAGGTTGAAATTGATGCCATTTTCCATAATGTCTCCTTGGATTCGTGGTTACGAAAAAGAGATCTCTGGGTTTCTGTTTGCCTGGTTTTTGCTAGATACTCTTGTTGCATTCATCTTTGCTGTGGATGCTTGGGTTGTCATTGTGGATGCAAGGCGGACAGGCAGAGAGATCTTGAAGGAAGGCTGTTATTTGATATTGACAATGTTAAACCAGGCTATTCAAATCAAGTGTTTGGAAGCCATCTGTTGTGGCTCTTTCATGAGATGGGCTCTTGCTCGAGTTTGTGGAAAACACGTCGCCATGTTTTTTCAGTCCGTGGGAGAGGTCTATTTTATGGTGGTTTGGCTTATCTTCTACTTTGCTGCTAGATGTAGAGATGCCAAAGTGCAGGGACTGAGGTTTGGTCGTAGAGAGCTGGAAGATCGGCCATTCAAACTTGTGGTGCAGTATAAGTACAATGAGCAAACTAAGCTTGGTATGACACGCGCTCCTCCTTGGCGAGAAGACGAGGCAGGTGCCTCGGCTTCACTGCCTGCCATGGAAGAGGACGAACAGAAAGAAGAAGCAGGGGCCGATGGATACGTTTCACGTCTGGTCTGCAATGGAAGGCGAGAAACAATGGTTCCGTCTGGTTATGAATCGGATGTCATTCGGTGGGGTCTTCGTCTTTTTGACGGTGATTCAGTTTTTAATTCTGGGTATTACGGTGAGATGACAGCTGTAGATGACCATTACCCTGGGAATTATTACAGAGATCATTACAATTTAGAATCCACTTGTGTGGAGAATGATGAGATTATTGCACGCACTCTTCAAGAAAACTTGTCGCAGTTATCAATTACAGAGTCCTCGGGATGCGCTCTTGAAAGAGAGGAGCAATCGCAAGCATCCACATACACAACTGATTGGCACAATCCATTTCCAAGTAATAACAGTTCAGAGAGTATTTCTGTTGAGGAAGATGCTGAAACTCTGGATCCTTCTAGTTCATGTTCAAGTCCTGGGGATGACGACTTCTCCTATTCCCATGCAGTTGATGGTGAAGAATTATGGAGATTCAATCAAATGATTCCTGTTCCTCATGTTCCTAGAATTAACAGAGAAATTCCTTCAGTTGATGAAGCAGCTTCTGATCATGAAAGGCTTCTGAACAGATTGCAAGTGTATGACTTTGTTGAACGCAAGGTTCAAGGTGATGGCAATTGTCAGTTTCGTGCATTATCTGATCAATTATATGGAACACCCGACAATCACGAATTGGTGAGACAAAAAGTTGTAAATCAGCTTATGTCACATCCAGAGATTTATGAGGGATATGTTCCAATGGCATATGATGATTACTTGGAGAAGATGTCCAGGAACGGTGAATGGGGTGATCATGTCACATTACAGGCAGCCGTGGATTCGTATGGTGTGAGGATATTTGTTATAACTTCTTTCAAGGATACCTGCTGCATAGAGATTCTTCCAAATTCTCAGAAGACAAAAGGAGGTATGCCATCAACTGGAGATTCCCCACCCAGTGAGTTGAGAAAGAAGAAAAGGTGGTGGAAATTTGGACAGAAGCATTGA

Protein sequence

MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIKVAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGERSLVANLSAANCYKSDHLKRPENWALDSVLLVAEHAAANKKYFSMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWPKASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQDKPIEDCVRAGYVLASSPSSTESRAYYQLFYLLILVASASSTTARVGDGYSVHPSDVHYIPVVYPYLTIENHEHCLTLCSIATLSRPPLIRRTQIIGRHIARIRRRKLSLSRRWTRRNCRPQPAMSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRDPSLKALLSRLDIAGEQRLLRSSEDPSMPAAVARHHRRRPFLHLTRVGTLDDDIFSGDGDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSFSNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAYIWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRDALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLVAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVCGKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELEDRPFKLVVQYKYNEQTKLGMTRAPPWREDEAGASASLPAMEEDEQKEEAGADGYVSRLVCNGRRETMVPSGYESDVIRWGLRLFDGDSVFNSGYYGEMTAVDDHYPGNYYRDHYNLESTCVENDEIIARTLQENLSQLSITESSGCALEREEQSQASTYTTDWHNPFPSNNSSESISVEEDAETLDPSSSCSSPGDDDFSYSHAVDGEELWRFNQMIPVPHVPRINREIPSVDEAASDHERLLNRLQVYDFVERKVQGDGNCQFRALSDQLYGTPDNHELVRQKVVNQLMSHPEIYEGYVPMAYDDYLEKMSRNGEWGDHVTLQAAVDSYGVRIFVITSFKDTCCIEILPNSQKTKGGMPSTGDSPPSELRKKKRWWKFGQKH
Homology
BLAST of Sgr022791 vs. NCBI nr
Match: KAA8518739.1 (hypothetical protein F0562_016487 [Nyssa sinensis])

HSP 1 Score: 1026.9 bits (2654), Expect = 1.4e-295
Identity = 555/890 (62.36%), Positives = 637/890 (71.57%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVD +FL +YDIK NNAILAE+KH+ MY+E+A+  +VEYIAGGATQNSI+
Sbjct: 10  MGNPLLDISAVVDQEFLDKYDIKLNNAILAEDKHVGMYDEMASKYSVEYIAGGATQNSIR 69

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ P ATSYMGCIGKDKFGEEMKKNSK AGVNV YYE ESTPTGTCAVCVVGGER
Sbjct: 70  VAQWMLQIPAATSYMGCIGKDKFGEEMKKNSKLAGVNVHYYEDESTPTGTCAVCVVGGER 129

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SLVANLSAANCYKS+HLKRPENWAL                  DS+ LVAEHAAAN K F
Sbjct: 130 SLVANLSAANCYKSEHLKRPENWALVEKAKYFYIAGFFLTVSPDSIQLVAEHAAANNKVF 189

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
            MNLSAPFICEFFK    K    YMDF+FGNETEARTFSKV GW TENVEEIALKI+Q P
Sbjct: 190 MMNLSAPFICEFFKDAQDKIL-PYMDFVFGNETEARTFSKVHGWGTENVEEIALKISQCP 249

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 300
           KASGTHKRI VITQG DPV+VAEDG VK FPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ
Sbjct: 250 KASGTHKRITVITQGADPVVVAEDGKVKLFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 309

Query: 301 DKPIEDCVRAG-YVLASSPSSTESRAYYQLFYLLILVASASSTTARVGDGYSVHPSDVHY 360
           +KPI DC+RAG Y    +P +  S        L IL   ++  T +     S + S+   
Sbjct: 310 EKPIVDCIRAGCYAANEAPLARTS--------LSILSPHSADQTFK-----SPNQSNRAL 369

Query: 361 IPVVYPYLTIENHEHCLTLCSIATLSRPPLIRRTQIIGRHIARIRRRKLSLSRRWTRRNC 420
            P       + NH H                                             
Sbjct: 370 KP------AMNNHHH--------------------------------------------- 429

Query: 421 RPQPAMSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTS 480
            P P+     P   +LR+T QL+++ T+ F+S+LFTFLFL+ LIL+FR  VENGT YVT+
Sbjct: 430 -PPPSPPSPPPPFLDLRTTTQLIKQTTSVFSSHLFTFLFLAFLILTFRTNVENGTHYVTA 489

Query: 481 FIDRDPSLKALLSRLDIAGEQRLLRSSEDPSMPAAVARHHR---RRPFLHLTRVGTLDDD 540
           FIDRDPSLKALLSRLDI+G   L  SS DP     +  HHR   RRPFLHLTRVGTLDDD
Sbjct: 490 FIDRDPSLKALLSRLDISGND-LHSSSSDP-----LVNHHRRRHRRPFLHLTRVGTLDDD 549

Query: 541 IFSGDGDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKA 600
            FSGD D++R LFG+N    PN SF+I   F S  GFS+ V DNGI+ SE+VRPG  FKA
Sbjct: 550 FFSGDEDNDRSLFGANPKTKPNGSFVILNNFDSKLGFSNFVTDNGIKFSEIVRPGFSFKA 609

Query: 601 RSSSFSNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSF 660
              S  + +  ++D E        +    +  ++VD QF +KGL+LGRRDASALFF V  
Sbjct: 610 PDGSLRSIEGESNDDEGNKMNGSVKEEKDESKKVVDFQFLIKGLQLGRRDASALFFLVCL 669

Query: 661 LSAAYIWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMR 720
           LSAAY +V+LGFLVTYSW  GIVF+AV+ DL+GR+ SF G VWDGSR+G +RLSGFILMR
Sbjct: 670 LSAAYCYVILGFLVTYSWVLGIVFVAVVYDLLGRYRSFTGTVWDGSRMGLQRLSGFILMR 729

Query: 721 WAVRDALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFL 780
           WAVRDALTQ+LGLW+F EIEDQYSFFK+FVRLKLMPFSI  PWI+G+E+E+ GFLFAWF 
Sbjct: 730 WAVRDALTQVLGLWFFSEIEDQYSFFKIFVRLKLMPFSITFPWIKGFERELWGFLFAWFF 789

Query: 781 LDTLVAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWA 840
           LDT V FIF+VD+WV IVD+RR+GREI+KEGCYL+ TMLNQAI IKCLE++ CGSF RW 
Sbjct: 790 LDTFVGFIFSVDSWVAIVDSRRSGREIVKEGCYLLSTMLNQAINIKCLESMLCGSFTRWI 827

Query: 841 LARVCGKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           L+ + GK  A  FQSV EVYFMV WLIFYFA R +D+   G  FGRRELE
Sbjct: 850 LSGIFGKFFASAFQSVMEVYFMVAWLIFYFAVRSKDSTSLGRTFGRRELE 827

BLAST of Sgr022791 vs. NCBI nr
Match: XP_011651956.1 (uncharacterized protein LOC101218916 [Cucumis sativus] >KGN58965.1 hypothetical protein Csa_000801 [Cucumis sativus])

HSP 1 Score: 856.3 bits (2211), Expect = 3.3e-244
Identity = 438/466 (93.99%), Positives = 451/466 (96.78%), Query Frame = 0

Query: 406 AMSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR 465
           AM+D+HPRLDNLRST+QLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR
Sbjct: 4   AMTDNHPRLDNLRSTSQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR 63

Query: 466 DPSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSG 525
           DPSLKALLSRLDIAGEQRLLR+SED S+ A+VA   R  RRRPFLHLTRVGTLDDDIFSG
Sbjct: 64  DPSLKALLSRLDIAGEQRLLRTSEDSSLSASVARRQRRQRRRPFLHLTRVGTLDDDIFSG 123

Query: 526 DGDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSS 585
           DGDDERGLFG+NRNHPPNASF+ FTQF SISGFSDLVVD+GIRVSEVVRPGVGFKARSSS
Sbjct: 124 DGDDERGLFGTNRNHPPNASFVFFTQFSSISGFSDLVVDDGIRVSEVVRPGVGFKARSSS 183

Query: 586 FSNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAA 645
           FSN K+SADDQEEKDRR GGENVHQDMDRLVDLQFFVKGLELGRRDA+ALFFFVSFLSAA
Sbjct: 184 FSNDKESADDQEEKDRRLGGENVHQDMDRLVDLQFFVKGLELGRRDAAALFFFVSFLSAA 243

Query: 646 YIWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVR 705
           YIWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVR
Sbjct: 244 YIWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVR 303

Query: 706 DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL 765
           DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL
Sbjct: 304 DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL 363

Query: 766 VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV 825
           VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV
Sbjct: 364 VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV 423

Query: 826 CGKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           CGK+VAMFFQSVGEVYFMVVWL FYFAA+CRDAKVQG RFGRRELE
Sbjct: 424 CGKNVAMFFQSVGEVYFMVVWLTFYFAAKCRDAKVQGQRFGRRELE 469

BLAST of Sgr022791 vs. NCBI nr
Match: XP_038903973.1 (uncharacterized protein LOC120090409 [Benincasa hispida])

HSP 1 Score: 854.7 bits (2207), Expect = 9.7e-244
Identity = 439/465 (94.41%), Positives = 446/465 (95.91%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M D+HPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MPDNHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLR+SED SM  AVA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRTSEDSSMSGAVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
           GDDERGLFG+NRNHPPNASF+  TQFGSISGFSDLVVD+GIRVSEVVR GVGFKARSSSF
Sbjct: 121 GDDERGLFGTNRNHPPNASFVFLTQFGSISGFSDLVVDDGIRVSEVVRSGVGFKARSSSF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN K+S DDQEEKDRR GGENVHQDMDRLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDKESTDDQEEKDRRLGGENVHQDMDRLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GKHVAMFFQSVGEVYFMVVWL FYFAARCRDAKVQG RFGRRELE
Sbjct: 421 GKHVAMFFQSVGEVYFMVVWLTFYFAARCRDAKVQGQRFGRRELE 465

BLAST of Sgr022791 vs. NCBI nr
Match: KAA0044005.1 (uncharacterized protein E6C27_scaffold236G003510 [Cucumis melo var. makuwa] >TYK25135.1 uncharacterized protein E5676_scaffold352G002600 [Cucumis melo var. makuwa])

HSP 1 Score: 850.1 bits (2195), Expect = 2.4e-242
Identity = 435/465 (93.55%), Positives = 449/465 (96.56%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M+D+HPRLDNLRST+QLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MTDNHPRLDNLRSTSQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLR+SED S+ A+VA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRTSEDSSLSASVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
           GDDERGLFG+NRNHPPNASF+ FTQF SISGFSDLVVD+GIRVSEVVR GVGFKARSSSF
Sbjct: 121 GDDERGLFGTNRNHPPNASFVFFTQFSSISGFSDLVVDDGIRVSEVVRSGVGFKARSSSF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN K+SADDQEEKDRR GGENVHQDMDRLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDKESADDQEEKDRRIGGENVHQDMDRLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSF+RWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFLRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GK+VAMFFQSVGEVYFMVVWL FYFAA+CRDAKVQG RFGRRELE
Sbjct: 421 GKNVAMFFQSVGEVYFMVVWLTFYFAAKCRDAKVQGQRFGRRELE 465

BLAST of Sgr022791 vs. NCBI nr
Match: XP_023527234.1 (uncharacterized protein LOC111790531 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 845.5 bits (2183), Expect = 5.9e-241
Identity = 434/465 (93.33%), Positives = 444/465 (95.48%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M+D+HPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MTDNHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLRS+ED SM  AVA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRSAEDSSMSGAVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
            DDERGLFG+NRNHPPNASF+ FTQFGSISGFS+LVVD+GIRVSEVVRPGVGFKARSS F
Sbjct: 121 VDDERGLFGTNRNHPPNASFVFFTQFGSISGFSNLVVDDGIRVSEVVRPGVGFKARSSPF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN  +S DDQEE DRR G ENVHQDM+RLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDNESGDDQEENDRRLGDENVHQDMERLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GKHVAMFFQSVGEVYFMVVWL FYFAARCRDAKVQG RFGRRELE
Sbjct: 421 GKHVAMFFQSVGEVYFMVVWLTFYFAARCRDAKVQGQRFGRRELE 465

BLAST of Sgr022791 vs. ExPASy Swiss-Prot
Match: Q9LZG0 (Adenosine kinase 2 OS=Arabidopsis thaliana OX=3702 GN=ADK2 PE=1 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 3.7e-129
Identity = 230/311 (73.95%), Positives = 262/311 (84.24%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVDD+FL +YDIK NNAILAE+KHLPMY+E+++  NVEYIAGGATQNSIK
Sbjct: 14  MGNPLLDISAVVDDEFLTKYDIKLNNAILAEDKHLPMYDEMSSKFNVEYIAGGATQNSIK 73

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ PGATSYMG IGKDK+GE MKK++ +AGVNV YYE ES PTGTC VCVVGGER
Sbjct: 74  VAQWMLQIPGATSYMGSIGKDKYGEAMKKDATAAGVNVHYYEDESAPTGTCGVCVVGGER 133

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SL+ANLSAANCYK DHLK+PENWAL                  +S+ LV+EHAAAN K F
Sbjct: 134 SLIANLSAANCYKVDHLKKPENWALVEKAKFYYIAGFFLTVSPESIQLVSEHAAANNKVF 193

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
           +MNLSAPFICEFFK    KF   YMDF+FGNETEARTFS+V GWETE+VE+IA+KI+Q P
Sbjct: 194 TMNLSAPFICEFFKDVQEKFL-PYMDFVFGNETEARTFSRVHGWETEDVEQIAIKISQLP 253

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 294
           KA+GT+KR  VITQG DPV+VAEDG VKK+PVI LPKEKLVDTNGAGDAFVGGF+SQLV+
Sbjct: 254 KATGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDAFVGGFMSQLVK 313

BLAST of Sgr022791 vs. ExPASy Swiss-Prot
Match: Q9SF85 (Adenosine kinase 1 OS=Arabidopsis thaliana OX=3702 GN=ADK1 PE=1 SV=1)

HSP 1 Score: 454.5 bits (1168), Expect = 3.8e-126
Identity = 226/311 (72.67%), Positives = 256/311 (82.32%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLD+SAVVD  FL +YDIK NNAILAE+KHLPMY+E++   NVEYIAGGATQNSIK
Sbjct: 13  MGNPLLDVSAVVDQQFLDKYDIKLNNAILAEDKHLPMYDEMSQKFNVEYIAGGATQNSIK 72

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ PGATSYMG IGKDK+GE MKK++ +AGV V YYE E+TPTGTC VCV+GGER
Sbjct: 73  VAQWMLQVPGATSYMGSIGKDKYGEAMKKDATAAGVYVHYYEDEATPTGTCGVCVLGGER 132

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SL+ANLSAANCYK +HLK+PENWAL                  +S+ LV EHAAAN K F
Sbjct: 133 SLIANLSAANCYKVEHLKKPENWALVEKAKFYYIAGFFLTVSPESIQLVREHAAANNKVF 192

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
           +MNLSAPFICEFFK    K C  YMD+IFGNETEARTFS+V GWET++VE+IA+K++Q P
Sbjct: 193 TMNLSAPFICEFFKDVQEK-CLPYMDYIFGNETEARTFSRVHGWETDDVEQIAIKMSQLP 252

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 294
           KASGT+KR  VITQG DPV+VAEDG VKK+PVI LPKEKLVDTNGAGDAFVGGFLSQLV 
Sbjct: 253 KASGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDAFVGGFLSQLVH 312

BLAST of Sgr022791 vs. ExPASy Swiss-Prot
Match: O49923 (Adenosine kinase OS=Physcomitrium patens OX=3218 GN=ADK PE=2 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 6.8e-107
Identity = 198/316 (62.66%), Positives = 236/316 (74.68%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDIS VVDD FL++Y +  NNAILAE+KHLPMY+ELA NP+VEYIAGGATQN+I+
Sbjct: 10  MGNPLLDISCVVDDAFLEKYGLTLNNAILAEDKHLPMYKELAANPDVEYIAGGATQNTIR 69

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           +AQWML    ATSY GC+GKD++G+ M K +   GVN++Y   E  PTGTC V VV GER
Sbjct: 70  IAQWMLGESNATSYFGCVGKDEYGDRMFKLASEGGVNIRYDVDEDLPTGTCGVLVVKGER 129

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SLVANLSAAN YK DHLK+PENWA                   +S++ VA+HAA   KY+
Sbjct: 130 SLVANLSAANKYKIDHLKKPENWAFVEKAKYIYSAGFFLTVSPESMMTVAKHAAETGKYY 189

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
            +NL+APFIC+FFK    +    Y+DFIFGNE+EAR F++VQGWETE+ + IA+K+A  P
Sbjct: 190 MINLAAPFICQFFKDPLMELF-PYVDFIFGNESEARAFAQVQGWETEDTKVIAVKLAALP 249

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 299
           KA GTHKR+AVITQG DP IVAEDG V +FPV  +PKEKLVDTN AGD+FVGGFLSQLV 
Sbjct: 250 KAGGTHKRVAVITQGTDPTIVAEDGKVTEFPVTPIPKEKLVDTNAAGDSFVGGFLSQLVL 309

BLAST of Sgr022791 vs. ExPASy Swiss-Prot
Match: P55264 (Adenosine kinase OS=Mus musculus OX=10090 GN=Adk PE=1 SV=2)

HSP 1 Score: 329.7 bits (844), Expect = 1.4e-88
Identity = 168/318 (52.83%), Positives = 217/318 (68.24%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVD DFL +Y +KPN+ ILAE+KH  +++EL     VEY AGG+TQNS+K
Sbjct: 28  MGNPLLDISAVVDKDFLDKYSLKPNDQILAEDKHKELFDELVKKFKVEYHAGGSTQNSMK 87

Query: 61  VAQWMLQYP-GATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGE 120
           VAQW++Q P  A ++ GCIG DKFGE +K+ +  A V+  YYE    PTGTCA C+ GG 
Sbjct: 88  VAQWLIQEPHKAATFFGCIGIDKFGEILKRKAADAHVDAHYYEQNEQPTGTCAACITGGN 147

Query: 121 RSLVANLSAANCYKSD-HLKRPENWAL------------------DSVLLVAEHAAANKK 180
           RSLVANL+AANCYK + HL    NW L                  +SVL VA +AA N +
Sbjct: 148 RSLVANLAAANCYKKEKHLDLERNWVLVEKARVYYIAGFFLTVSPESVLKVARYAAENNR 207

Query: 181 YFSMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQ 240
            F++NLSAPFI +FFK         Y+D +FGNETEA TF++ QG+ET++++EIA K   
Sbjct: 208 VFTLNLSAPFISQFFKEALMD-VMPYVDILFGNETEAATFAREQGFETKDIKEIAKKAQA 267

Query: 241 WPKASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQL 299
            PK +   +R  + TQG D  IVA + +V  FPV+   +E+++DTNGAGDAFVGGFLSQL
Sbjct: 268 LPKVNSKRQRTVIFTQGRDDTIVAAENDVTAFPVLDQNQEEIIDTNGAGDAFVGGFLSQL 327

BLAST of Sgr022791 vs. ExPASy Swiss-Prot
Match: P55263 (Adenosine kinase OS=Homo sapiens OX=9606 GN=ADK PE=1 SV=2)

HSP 1 Score: 327.8 bits (839), Expect = 5.4e-88
Identity = 166/318 (52.20%), Positives = 217/318 (68.24%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVD DFL +Y +KPN+ ILAE+KH  +++EL     VEY AGG+TQNSIK
Sbjct: 29  MGNPLLDISAVVDKDFLDKYSLKPNDQILAEDKHKELFDELVKKFKVEYHAGGSTQNSIK 88

Query: 61  VAQWMLQYP-GATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGE 120
           VAQWM+Q P  A ++ GCIG DKFGE +K+ +  A V+  YYE    PTGTCA C+ G  
Sbjct: 89  VAQWMIQQPHKAATFFGCIGIDKFGEILKRKAAEAHVDAHYYEQNEQPTGTCAACITGDN 148

Query: 121 RSLVANLSAANCYKSD-HLKRPENWAL------------------DSVLLVAEHAAANKK 180
           RSL+ANL+AANCYK + HL   +NW L                  +SVL VA HA+ N +
Sbjct: 149 RSLIANLAAANCYKKEKHLDLEKNWMLVEKARVCYIAGFFLTVSPESVLKVAHHASENNR 208

Query: 181 YFSMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQ 240
            F++NLSAPFI +F+K    K    Y+D +FGNETEA TF++ QG+ET++++EIA K   
Sbjct: 209 IFTLNLSAPFISQFYKESLMK-VMPYVDILFGNETEAATFAREQGFETKDIKEIAKKTQA 268

Query: 241 WPKASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQL 299
            PK +   +RI + TQG D  I+A +  V  F V+   +++++DTNGAGDAFVGGFLSQL
Sbjct: 269 LPKMNSKRQRIVIFTQGRDDTIMATESEVTAFAVLDQDQKEIIDTNGAGDAFVGGFLSQL 328

BLAST of Sgr022791 vs. ExPASy TrEMBL
Match: A0A5J4ZL70 (Adenosine kinase OS=Nyssa sinensis OX=561372 GN=F0562_016487 PE=3 SV=1)

HSP 1 Score: 1026.9 bits (2654), Expect = 6.9e-296
Identity = 555/890 (62.36%), Positives = 637/890 (71.57%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVD +FL +YDIK NNAILAE+KH+ MY+E+A+  +VEYIAGGATQNSI+
Sbjct: 10  MGNPLLDISAVVDQEFLDKYDIKLNNAILAEDKHVGMYDEMASKYSVEYIAGGATQNSIR 69

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ P ATSYMGCIGKDKFGEEMKKNSK AGVNV YYE ESTPTGTCAVCVVGGER
Sbjct: 70  VAQWMLQIPAATSYMGCIGKDKFGEEMKKNSKLAGVNVHYYEDESTPTGTCAVCVVGGER 129

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SLVANLSAANCYKS+HLKRPENWAL                  DS+ LVAEHAAAN K F
Sbjct: 130 SLVANLSAANCYKSEHLKRPENWALVEKAKYFYIAGFFLTVSPDSIQLVAEHAAANNKVF 189

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
            MNLSAPFICEFFK    K    YMDF+FGNETEARTFSKV GW TENVEEIALKI+Q P
Sbjct: 190 MMNLSAPFICEFFKDAQDKIL-PYMDFVFGNETEARTFSKVHGWGTENVEEIALKISQCP 249

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 300
           KASGTHKRI VITQG DPV+VAEDG VK FPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ
Sbjct: 250 KASGTHKRITVITQGADPVVVAEDGKVKLFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 309

Query: 301 DKPIEDCVRAG-YVLASSPSSTESRAYYQLFYLLILVASASSTTARVGDGYSVHPSDVHY 360
           +KPI DC+RAG Y    +P +  S        L IL   ++  T +     S + S+   
Sbjct: 310 EKPIVDCIRAGCYAANEAPLARTS--------LSILSPHSADQTFK-----SPNQSNRAL 369

Query: 361 IPVVYPYLTIENHEHCLTLCSIATLSRPPLIRRTQIIGRHIARIRRRKLSLSRRWTRRNC 420
            P       + NH H                                             
Sbjct: 370 KP------AMNNHHH--------------------------------------------- 429

Query: 421 RPQPAMSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTS 480
            P P+     P   +LR+T QL+++ T+ F+S+LFTFLFL+ LIL+FR  VENGT YVT+
Sbjct: 430 -PPPSPPSPPPPFLDLRTTTQLIKQTTSVFSSHLFTFLFLAFLILTFRTNVENGTHYVTA 489

Query: 481 FIDRDPSLKALLSRLDIAGEQRLLRSSEDPSMPAAVARHHR---RRPFLHLTRVGTLDDD 540
           FIDRDPSLKALLSRLDI+G   L  SS DP     +  HHR   RRPFLHLTRVGTLDDD
Sbjct: 490 FIDRDPSLKALLSRLDISGND-LHSSSSDP-----LVNHHRRRHRRPFLHLTRVGTLDDD 549

Query: 541 IFSGDGDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKA 600
            FSGD D++R LFG+N    PN SF+I   F S  GFS+ V DNGI+ SE+VRPG  FKA
Sbjct: 550 FFSGDEDNDRSLFGANPKTKPNGSFVILNNFDSKLGFSNFVTDNGIKFSEIVRPGFSFKA 609

Query: 601 RSSSFSNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSF 660
              S  + +  ++D E        +    +  ++VD QF +KGL+LGRRDASALFF V  
Sbjct: 610 PDGSLRSIEGESNDDEGNKMNGSVKEEKDESKKVVDFQFLIKGLQLGRRDASALFFLVCL 669

Query: 661 LSAAYIWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMR 720
           LSAAY +V+LGFLVTYSW  GIVF+AV+ DL+GR+ SF G VWDGSR+G +RLSGFILMR
Sbjct: 670 LSAAYCYVILGFLVTYSWVLGIVFVAVVYDLLGRYRSFTGTVWDGSRMGLQRLSGFILMR 729

Query: 721 WAVRDALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFL 780
           WAVRDALTQ+LGLW+F EIEDQYSFFK+FVRLKLMPFSI  PWI+G+E+E+ GFLFAWF 
Sbjct: 730 WAVRDALTQVLGLWFFSEIEDQYSFFKIFVRLKLMPFSITFPWIKGFERELWGFLFAWFF 789

Query: 781 LDTLVAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWA 840
           LDT V FIF+VD+WV IVD+RR+GREI+KEGCYL+ TMLNQAI IKCLE++ CGSF RW 
Sbjct: 790 LDTFVGFIFSVDSWVAIVDSRRSGREIVKEGCYLLSTMLNQAINIKCLESMLCGSFTRWI 827

Query: 841 LARVCGKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           L+ + GK  A  FQSV EVYFMV WLIFYFA R +D+   G  FGRRELE
Sbjct: 850 LSGIFGKFFASAFQSVMEVYFMVAWLIFYFAVRSKDSTSLGRTFGRRELE 827

BLAST of Sgr022791 vs. ExPASy TrEMBL
Match: A0A0A0LAG1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G740090 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 1.6e-244
Identity = 438/466 (93.99%), Positives = 451/466 (96.78%), Query Frame = 0

Query: 406 AMSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR 465
           AM+D+HPRLDNLRST+QLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR
Sbjct: 4   AMTDNHPRLDNLRSTSQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDR 63

Query: 466 DPSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSG 525
           DPSLKALLSRLDIAGEQRLLR+SED S+ A+VA   R  RRRPFLHLTRVGTLDDDIFSG
Sbjct: 64  DPSLKALLSRLDIAGEQRLLRTSEDSSLSASVARRQRRQRRRPFLHLTRVGTLDDDIFSG 123

Query: 526 DGDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSS 585
           DGDDERGLFG+NRNHPPNASF+ FTQF SISGFSDLVVD+GIRVSEVVRPGVGFKARSSS
Sbjct: 124 DGDDERGLFGTNRNHPPNASFVFFTQFSSISGFSDLVVDDGIRVSEVVRPGVGFKARSSS 183

Query: 586 FSNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAA 645
           FSN K+SADDQEEKDRR GGENVHQDMDRLVDLQFFVKGLELGRRDA+ALFFFVSFLSAA
Sbjct: 184 FSNDKESADDQEEKDRRLGGENVHQDMDRLVDLQFFVKGLELGRRDAAALFFFVSFLSAA 243

Query: 646 YIWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVR 705
           YIWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVR
Sbjct: 244 YIWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVR 303

Query: 706 DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL 765
           DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL
Sbjct: 304 DALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTL 363

Query: 766 VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV 825
           VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV
Sbjct: 364 VAFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARV 423

Query: 826 CGKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           CGK+VAMFFQSVGEVYFMVVWL FYFAA+CRDAKVQG RFGRRELE
Sbjct: 424 CGKNVAMFFQSVGEVYFMVVWLTFYFAAKCRDAKVQGQRFGRRELE 469

BLAST of Sgr022791 vs. ExPASy TrEMBL
Match: A0A5D3DNE7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002600 PE=4 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 1.2e-242
Identity = 435/465 (93.55%), Positives = 449/465 (96.56%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M+D+HPRLDNLRST+QLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MTDNHPRLDNLRSTSQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLR+SED S+ A+VA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRTSEDSSLSASVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
           GDDERGLFG+NRNHPPNASF+ FTQF SISGFSDLVVD+GIRVSEVVR GVGFKARSSSF
Sbjct: 121 GDDERGLFGTNRNHPPNASFVFFTQFSSISGFSDLVVDDGIRVSEVVRSGVGFKARSSSF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN K+SADDQEEKDRR GGENVHQDMDRLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDKESADDQEEKDRRIGGENVHQDMDRLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSF+RWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFLRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GK+VAMFFQSVGEVYFMVVWL FYFAA+CRDAKVQG RFGRRELE
Sbjct: 421 GKNVAMFFQSVGEVYFMVVWLTFYFAAKCRDAKVQGQRFGRRELE 465

BLAST of Sgr022791 vs. ExPASy TrEMBL
Match: A0A6J1F2Z3 (uncharacterized protein LOC111441707 OS=Cucurbita moschata OX=3662 GN=LOC111441707 PE=4 SV=1)

HSP 1 Score: 845.5 bits (2183), Expect = 2.8e-241
Identity = 433/465 (93.12%), Positives = 445/465 (95.70%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M+D+HPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MTDNHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLRS+ED SM +AVA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRSAEDSSMSSAVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
            DDERGLFG+NRNHPPNASF+ FTQFGSISGFS+LVVD+GIRVSEVVRPGVGFKARSS F
Sbjct: 121 VDDERGLFGTNRNHPPNASFVFFTQFGSISGFSNLVVDDGIRVSEVVRPGVGFKARSSPF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN  +S DDQEE DRR G ENVHQDM+RLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDNESGDDQEENDRRLGDENVHQDMERLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASG+VFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGVVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GKHVAMFFQSVGEVYFMVVWL FYFAARCRDAKVQG RFGRRELE
Sbjct: 421 GKHVAMFFQSVGEVYFMVVWLTFYFAARCRDAKVQGQRFGRRELE 465

BLAST of Sgr022791 vs. ExPASy TrEMBL
Match: A0A6J1J177 (uncharacterized protein LOC111482506 OS=Cucurbita maxima OX=3661 GN=LOC111482506 PE=4 SV=1)

HSP 1 Score: 844.3 bits (2180), Expect = 6.3e-241
Identity = 433/465 (93.12%), Positives = 444/465 (95.48%), Query Frame = 0

Query: 407 MSDDHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 466
           M+D+HPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD
Sbjct: 1   MTDNHPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRD 60

Query: 467 PSLKALLSRLDIAGEQRLLRSSEDPSMPAAVA---RHHRRRPFLHLTRVGTLDDDIFSGD 526
           PSLKALLSRLDIAGEQRLLRS+ED SM +AVA   R  RRRPFLHLTRVGTLDDDIFSGD
Sbjct: 61  PSLKALLSRLDIAGEQRLLRSAEDSSMSSAVARRQRRQRRRPFLHLTRVGTLDDDIFSGD 120

Query: 527 GDDERGLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSF 586
            DDERGLFG+NRNHPPNASF+ FTQFGSISGFS+LVVD+GIRVSEVVRPGVGFKARSS F
Sbjct: 121 VDDERGLFGTNRNHPPNASFVFFTQFGSISGFSNLVVDDGIRVSEVVRPGVGFKARSSPF 180

Query: 587 SNHKDSADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAY 646
           SN  +S DDQEE DRR G ENVHQDM+RLVDLQFFVKGLELGRRDA+ALFFFVSFLSAAY
Sbjct: 181 SNDNESVDDQEENDRRLGDENVHQDMERLVDLQFFVKGLELGRRDAAALFFFVSFLSAAY 240

Query: 647 IWVMLGFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 706
           IWVMLGFLVTYSWASGIVFIAVLNDL  RFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD
Sbjct: 241 IWVMLGFLVTYSWASGIVFIAVLNDLTERFGSFVGMVWDGSRLGFKRLSGFILMRWAVRD 300

Query: 707 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 766
           ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV
Sbjct: 301 ALTQLLGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLV 360

Query: 767 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 826
           AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC
Sbjct: 361 AFIFAVDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVC 420

Query: 827 GKHVAMFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
           GKHVAMFFQSVGEVYFMVVWL FYFAARCRDAKVQG RFGRRE E
Sbjct: 421 GKHVAMFFQSVGEVYFMVVWLTFYFAARCRDAKVQGQRFGRREFE 465

BLAST of Sgr022791 vs. TAIR 10
Match: AT5G03300.1 (adenosine kinase 2)

HSP 1 Score: 464.5 bits (1194), Expect = 2.6e-130
Identity = 230/311 (73.95%), Positives = 262/311 (84.24%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLDISAVVDD+FL +YDIK NNAILAE+KHLPMY+E+++  NVEYIAGGATQNSIK
Sbjct: 14  MGNPLLDISAVVDDEFLTKYDIKLNNAILAEDKHLPMYDEMSSKFNVEYIAGGATQNSIK 73

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ PGATSYMG IGKDK+GE MKK++ +AGVNV YYE ES PTGTC VCVVGGER
Sbjct: 74  VAQWMLQIPGATSYMGSIGKDKYGEAMKKDATAAGVNVHYYEDESAPTGTCGVCVVGGER 133

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SL+ANLSAANCYK DHLK+PENWAL                  +S+ LV+EHAAAN K F
Sbjct: 134 SLIANLSAANCYKVDHLKKPENWALVEKAKFYYIAGFFLTVSPESIQLVSEHAAANNKVF 193

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
           +MNLSAPFICEFFK    KF   YMDF+FGNETEARTFS+V GWETE+VE+IA+KI+Q P
Sbjct: 194 TMNLSAPFICEFFKDVQEKFL-PYMDFVFGNETEARTFSRVHGWETEDVEQIAIKISQLP 253

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 294
           KA+GT+KR  VITQG DPV+VAEDG VKK+PVI LPKEKLVDTNGAGDAFVGGF+SQLV+
Sbjct: 254 KATGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDAFVGGFMSQLVK 313

BLAST of Sgr022791 vs. TAIR 10
Match: AT3G09820.1 (adenosine kinase 1)

HSP 1 Score: 454.5 bits (1168), Expect = 2.7e-127
Identity = 226/311 (72.67%), Positives = 256/311 (82.32%), Query Frame = 0

Query: 1   MGNPLLDISAVVDDDFLQRYDIKPNNAILAEEKHLPMYEELANNPNVEYIAGGATQNSIK 60
           MGNPLLD+SAVVD  FL +YDIK NNAILAE+KHLPMY+E++   NVEYIAGGATQNSIK
Sbjct: 13  MGNPLLDVSAVVDQQFLDKYDIKLNNAILAEDKHLPMYDEMSQKFNVEYIAGGATQNSIK 72

Query: 61  VAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVNVQYYEVESTPTGTCAVCVVGGER 120
           VAQWMLQ PGATSYMG IGKDK+GE MKK++ +AGV V YYE E+TPTGTC VCV+GGER
Sbjct: 73  VAQWMLQVPGATSYMGSIGKDKYGEAMKKDATAAGVYVHYYEDEATPTGTCGVCVLGGER 132

Query: 121 SLVANLSAANCYKSDHLKRPENWAL------------------DSVLLVAEHAAANKKYF 180
           SL+ANLSAANCYK +HLK+PENWAL                  +S+ LV EHAAAN K F
Sbjct: 133 SLIANLSAANCYKVEHLKKPENWALVEKAKFYYIAGFFLTVSPESIQLVREHAAANNKVF 192

Query: 181 SMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEARTFSKVQGWETENVEEIALKIAQWP 240
           +MNLSAPFICEFFK    K C  YMD+IFGNETEARTFS+V GWET++VE+IA+K++Q P
Sbjct: 193 TMNLSAPFICEFFKDVQEK-CLPYMDYIFGNETEARTFSRVHGWETDDVEQIAIKMSQLP 252

Query: 241 KASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPKEKLVDTNGAGDAFVGGFLSQLVQ 294
           KASGT+KR  VITQG DPV+VAEDG VKK+PVI LPKEKLVDTNGAGDAFVGGFLSQLV 
Sbjct: 253 KASGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPKEKLVDTNGAGDAFVGGFLSQLVH 312

BLAST of Sgr022791 vs. TAIR 10
Match: AT2G37035.1 (unknown protein; Has 26 Blast hits to 26 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 441.0 bits (1133), Expect = 3.1e-123
Identity = 241/460 (52.39%), Positives = 302/460 (65.65%), Query Frame = 0

Query: 411 HPRLDNLRSTAQLLREATASFTSNLFTFLFLSLLILSFRVVVENGTQYVTSFIDRDPSLK 470
           H RL  LR+T QLLR+ T+SF+S+  TF+FL+ L+ SF  +V++ +  +TSF+D DPSL+
Sbjct: 12  HARLSRLRTTTQLLRQTTSSFSSHPLTFIFLTFLLFSFHSLVDHCSLLLTSFVDSDPSLR 71

Query: 471 ALLSRLDIAGEQRLLRSSEDPSMPAAVARHHRRRPFLHLTRVGTLDDDIFSGDGDD--ER 530
           +LLSRL +         S  P+       HHRR PFL LTR+GTLDDD FS D  D   R
Sbjct: 72  SLLSRLPLNSR------SHTPTR----FNHHRRAPFLQLTRLGTLDDDFFSTDEHDPHRR 131

Query: 531 GLFGSNRNHPPNASFLIFTQFGSISGFSDLVVDNGIRVSEVVRPGVGFKARSSSFSNHKD 590
            L GS+   P NA+ +  + F SISGFS  ++DNG+ + +++R GV  +          +
Sbjct: 132 SLQGSSFRSPINATTVFLSGFESISGFSRPIIDNGLLLPQIIRSGVVLRQLEKEDHGGDE 191

Query: 591 SADDQEEKDRRPGGENVHQDMDRLVDLQFFVKGLELGRRDASALFFFVSFLSAAYIWVML 650
              + +E +     E   +D +  VDL+   KGLELGR DA+ALFF VSFLSAAY WV+L
Sbjct: 192 DETELDESELDRESEKKDKDFESFVDLKMIFKGLELGRSDAAALFFLVSFLSAAYGWVIL 251

Query: 651 GFLVTYSWASGIVFIAVLNDLIGRFGSFVGMVWDGSRLGFKRLSGFILMRWAVRDALTQL 710
           GF   YS    I+F+ V+NDL+GRF SF+G+VW GSRLGFKR++GF+LMRWAVRDALTQL
Sbjct: 252 GFTTVYSLVLAIMFVTVINDLLGRFPSFLGVVWRGSRLGFKRVTGFVLMRWAVRDALTQL 311

Query: 711 LGLWYFGEIEDQYSFFKLFVRLKLMPFSIMSPWIRGYEKEISGFLFAWFLLDTLVAFIFA 770
           LGLWYFGE+EDQYSFF+LFVRLKLMPF++M PWIRG+EKEISGFLFAWFLLDTLV  I A
Sbjct: 312 LGLWYFGEVEDQYSFFRLFVRLKLMPFTVMPPWIRGFEKEISGFLFAWFLLDTLVGLILA 371

Query: 771 VDAWVVIVDARRTGREILKEGCYLILTMLNQAIQIKCLEAICCGSFMRWALARVCGKHVA 830
           VDA+V IVD+RR GREI+KE                                   GK  A
Sbjct: 372 VDAFVAIVDSRRRGREIVKE-----------------------------------GKSFA 426

Query: 831 MFFQSVGEVYFMVVWLIFYFAARCRDAKVQGLRFGRRELE 869
              QS  EVYFM  WL+FY AA+C+DA   G RFGRRE+E
Sbjct: 432 SVIQSALEVYFMAAWLVFYLAAKCKDAHADGRRFGRREME 426

BLAST of Sgr022791 vs. TAIR 10
Match: AT3G09820.2 (adenosine kinase 1)

HSP 1 Score: 392.9 bits (1008), Expect = 9.7e-109
Identity = 196/274 (71.53%), Positives = 223/274 (81.39%), Query Frame = 0

Query: 38  YEELANNPNVEYIAGGATQNSIKVAQWMLQYPGATSYMGCIGKDKFGEEMKKNSKSAGVN 97
           Y+E++   NVEYIAGGATQNSIKVAQWMLQ PGATSYMG IGKDK+GE MKK++ +AGV 
Sbjct: 8   YDEMSQKFNVEYIAGGATQNSIKVAQWMLQVPGATSYMGSIGKDKYGEAMKKDATAAGVY 67

Query: 98  VQYYEVESTPTGTCAVCVVGGERSLVANLSAANCYKSDHLKRPENWAL------------ 157
           V YYE E+TPTGTC VCV+GGERSL+ANLSAANCYK +HLK+PENWAL            
Sbjct: 68  VHYYEDEATPTGTCGVCVLGGERSLIANLSAANCYKVEHLKKPENWALVEKAKFYYIAGF 127

Query: 158 ------DSVLLVAEHAAANKKYFSMNLSAPFICEFFKMHWRKFCRKYMDFIFGNETEART 217
                 +S+ LV EHAAAN K F+MNLSAPFICEFFK    K C  YMD+IFGNETEART
Sbjct: 128 FLTVSPESIQLVREHAAANNKVFTMNLSAPFICEFFKDVQEK-CLPYMDYIFGNETEART 187

Query: 218 FSKVQGWETENVEEIALKIAQWPKASGTHKRIAVITQGPDPVIVAEDGNVKKFPVILLPK 277
           FS+V GWET++VE+IA+K++Q PKASGT+KR  VITQG DPV+VAEDG VKK+PVI LPK
Sbjct: 188 FSRVHGWETDDVEQIAIKMSQLPKASGTYKRTTVITQGADPVVVAEDGKVKKYPVIPLPK 247

Query: 278 EKLVDTNGAGDAFVGGFLSQLVQDKPIEDCVRAG 294
           EKLVDTNGAGDAFVGGFLSQLV  K IE+CVRAG
Sbjct: 248 EKLVDTNGAGDAFVGGFLSQLVHGKGIEECVRAG 280

BLAST of Sgr022791 vs. TAIR 10
Match: AT5G03330.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 275.4 bits (703), Expect = 2.3e-73
Identity = 163/359 (45.40%), Positives = 220/359 (61.28%), Query Frame = 0

Query: 935  MVPSGYESDVIRWGLRLFDGDSVFNSGYYG-EMTAVDD----------HYPGNYYRDHYN 994
            MV     + ++ W    F G   +    YG EM   DD          H  G YYR++ +
Sbjct: 1    MVSHEENTSIVEW----FLGPHPYTYPPYGIEMIHEDDEVAVAHHHHHHQSGEYYREYED 60

Query: 995  LESTCVENDEIIARTLQENLSQLSITESSGCALE-REEQSQASTYTTD---------WHN 1054
              S+ V+NDEIIARTLQ++  QL I ES+  + + +++Q Q   YT +         W++
Sbjct: 61   HRSSDVDNDEIIARTLQDDFLQLEIAESNDYSHQNQQQQHQQEGYTNNYSNNNNGYAWND 120

Query: 1055 PFPS-NNSSESISVEEDAETLDPSS----SCSSPGD-DDFSYSHA-----VDGEELWRFN 1114
              P+ + SSE I  + D +     S    SCSSP D D++ YS        DGE   R N
Sbjct: 121  QSPAVDYSSEWIGNDNDQDGRSDDSVNVFSCSSPSDTDEYVYSWESDQCDADGEFGRRLN 180

Query: 1115 QMIPVPHVPRINREIPSVDEAASDHERLLNRLQVYDFVERKVQGDGNCQFRALSDQLYGT 1174
            QM+P+P++P+IN EIP  +EA SDHERL NRL+++DF E KV GDGNCQFRAL+DQLY T
Sbjct: 181  QMVPIPYIPKINGEIPPEEEAVSDHERLRNRLEMFDFTEVKVPGDGNCQFRALADQLYKT 240

Query: 1175 PDNHELVRQKVVNQLMSHPEIYEGYVPMAYDDYLEKMSRNGEWGDHVTLQAAVDSYGVRI 1234
             D H+ VR+++V QL S P+ Y+GYVPM + DYL KMSR+GEWGDHVTLQAA D+Y V+I
Sbjct: 241  ADRHKHVRRQIVKQLKSRPDSYQGYVPMDFSDYLRKMSRSGEWGDHVTLQAAADAYRVKI 300

Query: 1235 FVITSFKDTCCIEILPNSQKTKG-------------GMPSTGDSPPSELRKKKRWWKFG 1249
             V+TSFKDTC IEILP SQ++KG              +    D+  +EL++K++WW+FG
Sbjct: 301  VVLTSFKDTCYIEILPTSQESKGVIFLSFWAEVHYNAIYLNRDTSETELQRKRKWWRFG 355
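
The Identity and Positives percentages in the HSP headers above are the matching counts divided by the alignment length. The following is a minimal Python sketch of that arithmetic, using the counts reported for the TAIR 10 hits. The function name `percent` and the `hsps` dictionary are illustrative and not part of any BLAST tool.

```python
def percent(matches: int, aligned: int) -> float:
    """Return matches/aligned as a percentage rounded to two decimals."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return round(100.0 * matches / aligned, 2)


# (identical, positive, alignment length), copied from the HSP header lines.
hsps = {
    "AT5G03300.1": (230, 262, 311),
    "AT3G09820.1": (226, 256, 311),
    "AT2G37035.1": (241, 302, 460),
    "AT3G09820.2": (196, 223, 274),
    "AT5G03330.1": (163, 220, 359),
}

for name, (identical, positive, aligned) in hsps.items():
    # Prints the hit name, then the identity and positives percentages.
    print(name, percent(identical, aligned), percent(positive, aligned))
```

For AT5G03300.1 this reproduces the reported 73.95 and 84.24.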

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA8518739.1 | 1.4e-295 | 62.36 | hypothetical protein F0562_016487 [Nyssa sinensis]
XP_011651956.1 | 3.3e-244 | 93.99 | uncharacterized protein LOC101218916 [Cucumis sativus] >KGN58965.1 hypothetical ...
XP_038903973.1 | 9.7e-244 | 94.41 | uncharacterized protein LOC120090409 [Benincasa hispida]
KAA0044005.1 | 2.4e-242 | 93.55 | uncharacterized protein E6C27_scaffold236G003510 [Cucumis melo var. makuwa] >TYK...
XP_023527234.1 | 5.9e-241 | 93.33 | uncharacterized protein LOC111790531 [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity | Description
Q9LZG0 | 3.7e-129 | 73.95 | Adenosine kinase 2 OS=Arabidopsis thaliana OX=3702 GN=ADK2 PE=1 SV=1
Q9SF85 | 3.8e-126 | 72.67 | Adenosine kinase 1 OS=Arabidopsis thaliana OX=3702 GN=ADK1 PE=1 SV=1
O49923 | 6.8e-107 | 62.66 | Adenosine kinase OS=Physcomitrium patens OX=3218 GN=ADK PE=2 SV=1
P55264 | 1.4e-88 | 52.83 | Adenosine kinase OS=Mus musculus OX=10090 GN=Adk PE=1 SV=2
P55263 | 5.4e-88 | 52.20 | Adenosine kinase OS=Homo sapiens OX=9606 GN=ADK PE=1 SV=2

Match Name | E-value | Identity | Description
A0A5J4ZL70 | 6.9e-296 | 62.36 | Adenosine kinase OS=Nyssa sinensis OX=561372 GN=F0562_016487 PE=3 SV=1
A0A0A0LAG1 | 1.6e-244 | 93.99 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G740090 PE=4 SV=1
A0A5D3DNE7 | 1.2e-242 | 93.55 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1F2Z3 | 2.8e-241 | 93.12 | uncharacterized protein LOC111441707 OS=Cucurbita moschata OX=3662 GN=LOC1114417...
A0A6J1J177 | 6.3e-241 | 93.12 | uncharacterized protein LOC111482506 OS=Cucurbita maxima OX=3661 GN=LOC111482506...

Match Name | E-value | Identity | Description
AT5G03300.1 | 2.6e-130 | 73.95 | adenosine kinase 2
AT3G09820.1 | 2.7e-127 | 72.67 | adenosine kinase 1
AT2G37035.1 | 3.1e-123 | 52.39 | unknown protein; Has 26 Blast hits to 26 proteins in 11 species: Archae - 0; Bac...
AT3G09820.2 | 9.7e-109 | 71.53 | adenosine kinase 1
AT5G03330.1 | 2.3e-73 | 45.40 | Cysteine proteinases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001805 | Adenosine kinase | PRINTS | PR00989 | ADENOKINASE | coord: 74..88, score: 59.17; coord: 57..66, score: 76.25; coord: 105..120, score: 54.69
IPR003323 | OTU domain | PFAM | PF02338 | OTU | coord: 1125..1211, e-value: 3.1E-9, score: 37.4
IPR003323 | OTU domain | PROSITE | PS50802 | OTU | coord: 1119..1251, score: 12.147105
IPR029056 | Ribokinase-like | GENE3D | 3.40.1190.20 | - | coord: 1..298, e-value: 8.1E-104, score: 348.9
IPR029056 | Ribokinase-like | SUPERFAMILY | 53613 | Ribokinase-like | coord: 1..298
None | No IPR available | GENE3D | 3.90.70.80 | - | coord: 1100..1231, e-value: 5.6E-33, score: 115.8
None | No IPR available | GENE3D | 3.30.1110.10 | - | coord: 6..125, e-value: 8.1E-104, score: 348.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1222..1251
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 584..605
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1014..1065
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1014..1067
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 580..605
None | No IPR available | PANTHER | PTHR36353 | TRANSMEMBRANE PROTEIN | coord: 407..869
None | No IPR available | CDD | cd01168 | adenosine_kinase | coord: 1..298, e-value: 9.99301E-103, score: 326.108
IPR011611 | Carbohydrate kinase PfkB | PFAM | PF00294 | PfkB | coord: 20..298, e-value: 3.5E-56, score: 190.7
IPR002173 | Carbohydrate/purine kinase, PfkB, conserved site | PROSITE | PS00584 | PFKB_KINASES_2 | coord: 264..277
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 1106..1218
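
The alignment entries above give each match as a `coord: start..end` range on the protein. The following is a minimal Python sketch for turning such a string into a start, an end and a span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper name `span` is hypothetical.

```python
import re
from typing import Tuple

# Matches strings such as "coord: 1125..1211".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span(text: str) -> Tuple[int, int, int]:
    """Return (start, end, length) for the first 'coord: a..b' in text.

    Assumption: positions are 1-based and inclusive, so length = end - start + 1.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no 'coord: a..b' range found")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start")
    return start, end, end - start + 1


# A few entries copied from the InterPro alignment column.
for entry in ("coord: 1..298", "coord: 407..869", "coord: 1125..1211"):
    print(entry, "->", span(entry))
```

Under that assumption, the Pfam OTU match at 1125..1211 spans 87 residues.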

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr022791.1 | Sgr022791.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006166 | purine ribonucleoside salvage
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0004001 | adenosine kinase activity
molecular_function | GO:0016773 | phosphotransferase activity, alcohol group as acceptor