Sgr021201 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021201
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Tic22-like family protein
Location: tig00153648: 640351 .. 663765 (-)
RNA-Seq Expression: Sgr021201
Synteny: Sgr021201
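
The Location field packs the contig name, coordinates, and strand into a single string. The following is a minimal sketch of how such a string could be parsed, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page). The names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "tig00153648: 640351 .. 663765 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Split a location string into (contig, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end, strand = parse_location("tig00153648: 640351 .. 663765 (-)")
print(contig, strand, span_length(start, end))
```

Under that assumption the feature would span 23,415 bases of the contig. The `(-)` marker indicates the minus strand, so the gene sequence is expected to read as the reverse complement of the contig sequence over that interval.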
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCTGGACCAAGAGTTGGCTTTTACTTCCATTTACTTCAAGTTTGGCAATCTTAGCTACTCAATCTCGTTCGTTTGATGCTTTTTCCACCCACATACTAGTATGGGATTTTTTTCTCCTTGAAGAACTTGTGGGTGAGGGGAAGATTCTTAAGCCTCTCAAAGCAACCTTCCAACTGAATGATGGAGTCCTAGTGAAGTGAGAACGAAAAAATTTGTCTCTTTTATGTTTAGAAATCCTAGTGCTACATCTCCTAAAATTTTCAACTTCTATGGAGAAAGACCTTTTCCCTTGTAACTGAATGTCGGCTATAGTCCCTCCCAGTCGATGTACGTTAGCGTAAGGGGTGAGAGTTGAAAAAAGGGTTGGAAGGAGAATCAGGGGGTGATCAAAGGAGATAGGAGAGGCCATGAAAGGGGAAGGAAGGCAAGGAGAGGAAGGGGAGGGAAGGGACTGGGAGAGCATTTCAAAAGATTTTATAATATTAAAGATACTAACAACTCACGGCTAGTTTGTGGCCATCCTTAGGATGAAAAGCAAAACTTTAATTGAATAATGCTATTAAGGCCTTATCATGGAAATGTTGGATGGAAAAACTAAGAGGGCATTTAGTAATATATAAGGATCTAAATGGTCTTTTGGATTCTTTAAGATTTTTAACCTCTTTGTGGTGTACTCTCTACAGTGATTTTTGTAGCTACACCTTTTTTCTTGTTCTTTTTGGGTTAATTGGAGCTCCTTCTGTAAGACTTGCTGTGCTATTAGGTTGACAACATGTCTTTCCTTTTTTGTATTCATTTTTTACTTATATAATTGGTATTTTCTCTTGAAGAAACTTTATTATCCTCAATTTGGGGTGGCATGTTGGTTTAATCTTCCGTTTGAGTGAAGCTGGTTGATTTTACGTTACGCACATTCTATACCAATTCAATAATTGTTATGACTGTGACAGTATTTGCATTATCAGTTAGGCTGATGAAATTTTGAGATATAAAGTGTTGTTTGGACTTTTTAGTTTCGCTGAAATTATTTTTAGTTATTGAAACTGAAAGAAAGAAGCATTTGCAGTTTTGCTATTAACTACTACCGTTGTTCTGAGTGAAGCATCCTTCCGAATGCCATCTCTTATTTGTTGATTCCATTCTTTTAGAACCCCTGAGTGGCTTAAGATGATGTTCTCAACCATTACTAAGAGTGAAAGAAATGGCCCCATATTCCGCTTCTTTACTGATTTAGGTGATGCAGGTATCAATTCATTAAATTTTTATGTTTATGCTACTCTTTATCCTCATTTCTTTTGTAGTATTTATGTGGTTTATGTACTTCTAGACTTTCAAATGCATTTGGAACAAACTACATTTAAGAAACGAAATTTTCAAACAAGCATGACCTTGTTTCTAATGTTAGCAATTTATCTTGACATTTATAAAAAACATTTCAAATCATTAAAATTTTACCCGCTATTCTGTTTTTTTAAGGCTGTGTTACCGGAATGACAAACCTGTTTATATTGGAGCATCCCTCGCAAATTTTATCATTGTTTTCTATCTATTAGTGTCTTAGTATTGATTTATCAATCTGACTTCAGCCACTATGCATTACATTCTGACTCAAAATATTTGCTTCTTAGTTACATACGTTAAGAGACTGAATATTCCAAGTGCTGTGGTGGGTGCATGTCGTCTTGATTTAGCGTATGAGCATTTCAAGGTACTAAATTCATATTGGTGGACCAATCTTTATTTATTGGTTGCATTGTCAACTTTGGAATTTTAGATGTAATAAGGTTCCTTTCCTTTTTTATCCCCTTCTCATCTTTAGATAGTCTATAGCTATTGCATACTCTTTTTAGTGATATAGGTAGCTGTGATTTGATTTGTTTGTGTTCTATGAGCTATCTTCATATAATCACATTGTGAGTAAATTAATAGTATGTACAGGTTCCCTTGATTTCTCATATGAGCTTCTGACATTATTGCTTCATTTGGTTGGAGAGTT
GAGTAGTTCAATCTGCATTTGCTGTATTTTAAATGAAAGCTTCACATATCATCCTCCTTTATTGTATTTCATTTACTTGTATATTTTTTTTGGATGAAAAACTATGCTTTCATTACTTGTATACTTTTTGGAAAATATATATTATATTTTATTAGTTTTTAAATAGTTTTATTGTAGTGCATAAGTACATTTCATTCATCAGTGAAAGTGTTTCTAAAAATGAGCTTTTGAATAGGGTTGTTGTCAGCTTATATGGCAAGGAGGAAGTTGGATGGTTCTCCAAGCAAGTGAAGGGTGGTTGGGCAAGTATTAAAACCATTGTTTAAAAAGTTTTCCTATGTGAGATTTGGTAATGGATCATTTTGGAAAGATAGGTGGGTTGTCTCTATCCCGCTTTTGATTGCTTTTTTCAGAATCTTTGCTATCACATAGGAAAAAGAATATCTTGGTACATGAGGCTTGTGATGTGAGGACCAAAAGTTGTGGAATGGAAGGTAACAAGGAGTAATTTGAAGGAATAGGAGATGGAGGAATTTGGGTGCCTGTTGGCAGTACTAAATGGAATTTTTCCGAGAAGATAGAAGAAGTTGGTCTCTACAATTGTTTGGACGTTGTTCAATCAAGTCTCTCTTTGTGAATTACTAGATAAACTGAAATGAGTAGATCTTCGGTCTTCATTTGAGTAATGGCCAATTTACCTAATGTAATATGAGGGAGGAGGTGATGCAGTCCTTGATTGATAGCTTCTTAGTAAGTAATAACCAGAGACTGGATGGATAAATTCAAAGACTACTAGTTTTGAAGAAGATGAACTATCTCAGATCACACACCATTATTGTTGGAAGCCAATGGTATTGCATGCTCCAACCCCTTCTGGATTCAAAATGTTTGGCTTGGAGATTTATACTTACAGAAGCCAGCGAGTAAGGCTTGGTAGGAGGATGGGAGGAAAGGGTGGCCAGGGTATAAATTTATGAAAAAGCTGAAAACTTGAAGGAGAAAATCAAATTTTGAAACAAGCAGAACTATGTTAACCTAATGAAAGGGGAGCTTAGGAAAATATAAATTTCTTAGATCTGAAGGAAGAAATGGAGGCCTTATCAGTACAGGAGTAAGAACTAAGGAAAAAGTGGAAAAAGGAATCAAATGAGTTATGGTGTAAAGTCATGACGAGAATCTATATCAAAGAGGAGTTATGTTGGTTCACAAAGAAGAAGATAAGAGGAAATATTTGTAGCCCATGGATTAGTATTGCTAAATAGAAATTGTTTATTGAAGAAATTATATAAACTTAAGGTGAGGTCTAACTTAAGTATACAACTCTGGAGGGATATATAACTAGAGGATCAACAATTGATGATAAAATTTTCAAATTTGTTTGCTGTCACCTAGGGAAGGAAATTAAAGTAGGTGTGGGAGGACGAAACAAGATCATGAAATATAATGGCAAGTTGGAATTTAATATATGTGGAAGCCCATGAATATGGGGAGATGTTGGTTTTACTTAATGGAGTGGTGGTTAGAAGGGTAGAGGAAAGGATAATTTGGAAATTTAAGCCATAGGACATAGTTCAAAAACTATCCAATTTGGCTGATTCATATTCATCTCAGCATGTGACGAAACGAGAACAAGATACAGTAGGTAAAAAATTCAAAAAAAGAGTGACTTGATCTGGCTTTGACTATATCAACCCGATTTGGACTATATTAGTCATGATTCAGGATTTATCTGCCATGACTCAGGATCACTAGCATTGTATCATACTTTTCAATGAGATTATTGATAATTAATAATAACTACAACATGTTGCTATCTCAAATGAGATACTATGGAAATTTTTTTTTGGTGTTCTTATTCAAACCGTTTGTCATATGTAAACTTAATAACTTATGTAGTGCTTCTTTTTTTCGTAGTTAGTATCTTTATAATAAGTTACAACTTTTCACTAATTTTATGATAGAATCTTAATCTACATATATATTCTTTTTAATACTTTGTT
TTGGCCAAGTTCCAATGCTCTTTTTGGTGTTCAATCTCTAATTCTTTTTGTAACTTTACCTCCTTTGTAGTTAACACAATTTGGGGTGGTTTTTTTTGTTAACTCCCATGGAGGTGGATAGTTAGGTTGTATATTGTTACAACTTTTCATCAAATCAATGAGAAGTCAATTTCTGTTTCAAAAAAATAAAAAATAATAAAAAAAACAATTCAGTGAGCAAAAATTATGTTTGGCATTTTGGAAAGGCAGGGGTCCCAAGAAGGATAAATTTTTTCAATGAATTATTTATCATGAAGATTTGAATACAGCTAAGAAAGTTCAAAAGAGATGCCCCAACTTTAGTCTTCAACTGAACTTGTGTTTTTAATGTCACAAAGCTGAAGAAGCCGTTTGAAATTTGTTTTTTCACTGCTGGTACAGAGAAAATGTTGGGAATTTTTGTTGGAACAATTTCATATGTCATGGGCTATGAGACAAACAGCAAAGGAGAATCTTATTCAATTCTCGTGTGATGGAGATGAAGGAAGATTTTGTGGGTAAATGTAGTAAAGGCATTCTTATTGAATATTTGGATGGAATGGAATTATAGAACATTTAATGACAAGTAAAAAGAGTGCGAGGAATTGGCTGATTTGGCCAAGATTCGTATTTCTTCTGGGTGTTCTTTGTCTAACTCTTTCTGTAATTATTCTACTTTTGAAAAAAAAAAAACATTCTCTCTGACAAAAAGATGCCATCAGTTAATTTTGACATTTTGTTCATGAGATTATCCATAAGTTAAGATATTGTAGTATATTATACAAACCTGTTTGGTTTTTGAACTGATACATGGTTGTAATGTAATTTCCAAAATCTGCTGAAGATCTGCTGCTAAATTTTTAGAGATGGTCTTCCATGATCTAACTTCTAATGTTACTTGAACCATGCAGGAAAAACCTCATTTATTTCAGTTCATTCCGAATGAGAAGCAAGTATGATTTTTCTTATCTGAAGCAAGAAAACTTTCATTAATATTGAAAATATTACAAGTATGCTACTGAATACCACCACCTAAACATCATGAAAATTACTCGTGCCCTCAGCCCTTGAATGCTTCTCTAATGTAGGTTAAAGCAGCAAACAAGCTTCTTAAAGGGCTACCGCAAAATGGTGGAAGCAAAAGGATTGATGGTGTTCCTGTGTTCAGTGCCCAAAACTTGGATATTGCAATAGCAACTACTGATGGGATTAAGTGGTATGTCATTTTCTCATGTTAGTTATGTAGTTTTTGCTATGGCGAAAGTTACTAAATTGTTGTAAAGGCTGGTCCTTGGCATACCAGATCATAATGTGCTTGGAAGCACAGACACTTCGTTATAGGCAAAGACTCTGTGTCAGACACGGATTTGTCTGGACACGCCCCAGACACGTGTCTGACATACCAAATCAGTGTCCAATTTATTTATTTGTTTTTATTTTCGGACACGCTTGCAACACGGTTGGGACACTAGACACGGGAGGGACACAACTGAAAAAAATGATGGAGAAAAACGCCCAATTGAAAGTCCTAGAAGCCCAACTAAAAGCCCTAAAAATCTGAATAAACCTTTCAAATTAAATTTAAAAGAAAGAAATAAATAAAAGAAACCCTAATTTCTTCAACCCACATTGCTGCGATAAGAGAGAAAGCCCCCACACCCCACGACGTTGCTTCGATTTCGCTCAGCCCCCGCGTTGCCACCGGCCACCATCGAAGTCACTGTCTTTCGCCTTTCAAGTTGCCGTCGTCGCTGCTGTCTGCCGTCCAAATCCACCTCCACGTTGCTGCCTTGCCCTAACCTTACGGCAAGGGTAGAATTTTTTATTTTTATTTTTATTTTTTTCAATTCATAAAATACATAATTTATTTGACTTCTAGATTTCAACTCTTCTCATTTAAATATATATTTTCAGCCCAAGAACATTGAACTAAATACTAAACTATTGACCATTACTTATTGATTTGACATTTTATGTTATCT
AGTGAAATATATTTTGTCGTTTATGTTAAACTTACATGTTTTTGTCGATTATGAAGTATTAAATATATTCATATTGTAGAGACTAAAGGTTTGAGAGAATCCAATGATTCTATTCCATTCACTAAGAGAAATATATAGACAAGTATACAAGGTTAACCTAAAGTAAAGAATGTAAAAGGACGATAAAGGACCAATGACTAAGACTGAGATAATTACAATAAGTACAAAATATAATAATATAAACACTATAACACTCCCCCTCAAGTTGGAGCATATATGTCAATCATGCCCAGCTTGTTAGAGAGATAATCTATACGTGCTTCATTTAATGCTTTTGTGAAGATATCTCCTAATTGCTCTTCAGTCTTCACATACCCTGTGGACACCAAACCTTGCTGTATTTTCTCACGTACAAAATGACAATCAACCTCAATTGTGTTTGGTTCGTTCATGAAATATTGGATTAAATGCAATATGGAGAGTTGCTTGATTATCACACCAGAGTTTGGTTGGAGTTGTGATATCAAATCCCAATTCAGTCAGAAGTTGATATATTCACACTAATTCACACAGATTTTCCTATCGCTCTATAATCTGATTCAGCACTTGGACGAGACACCACATTTTGTTTCTTATTCTTCCAGGAAACTAGATTACGTCCGACAAATACACAATATCCTGAAGTTGATCTTCTGTCTTCCTTAGATTCTGCCCAATCAACATCTGAGAAACATTCAATATTAGTGTGATCATGATCTTCATATAATAAACCACGCCCAGGAGCAACTTTCAAATAACATAGAATATGTTCTAATGCGGCCAAATGATCAACAGTAGGAGAAGACGTATACTGACTCACAATACTCACTGCATAAGCTATGTCTGGCCTAGTCACTATAAGATAATTTAGCTTTCCTACTAACCTCCTATACCTTTTAGGATCTTCTAGCAATTCTACCTCTTTTGTGAGCTGTAAATTAGGCATCATCGGGGTATTACGTGGCTTAGCACCTAACTTCCCTGTCTCGATTAACAAATCAAGTGCATATTTTCCCTATGATAAAAGAATTCCCTTCTTACTTCGTATTAACTCAATTCCCAAGAAGTATTTCAACATTTCCAAATCTTTTGTATGGAATTGACTATGAAGAAAAGTCTTAAGAGATAGAATACCTGATGTATCATTACTAGTAATGAAAATTTCATCAACATACACAACTAGTAAGATGACACCAGTCTCAGATCGTTTATAAAAGACAGAATGATCTGACTTACTTTTCTTCATTCCAACGTTTTCAATCACCTAACTAAATCTTTCAAACCATGCTCGTGGGCTTTGCTTTAAACCATACAAGGGTTTACGAAGACGACATACCTTTCCATTCTCCCCCAGAGCAACAAAACCTGGTGGTTGCTCCATATACCCTCCTTCTAGAAGATTACCATGTAGAAAGACATTTTTAATATCAAGCTGATGCAAGGGCCAATGATAGATTTATGCCAACGAAATGAATAACCTGACAGAAATCAATTTAGCAATAGGAGAAAAAGTATCAGAATAGTCAACTCTATAAGTCTGCGCGTAGCCTTTAGCAACAAGACGAGCTTTCAATCGCGCGACAGATCCATTAGGATTAACTTTAATGGAAAACACCCATTTACAACCGATAGGCTTCTTTCCTGCAGGAAGAAAAACTAAATCCCAAGTACCATTGTCATCTAAGGCAGTCATCTCCTCTAATATTGCAGCACGCCAACCAGAATGAGACAAAGCTTCATGAACAATGTTAGGAACAGATGAAGACTCAAGAAATGCAATGAACGATCAAGTAGGAGATGACAAATGGTTATATGAAACAAAAGAGGAAACAGGATAAGTACACTGACGTTTACCTTTGCATAGAGCAATAGGAAGATCATTGCTCGTTCCTAGATCTAATGACGAAGAAGCCTCTGGTATACGACGTGACCGAAGGAGGTTGCCGTCGAGAATAAACCTG
AGTAATAGGTGGACGAGGAGGATCATGTCCAGAGGGAGATGTATTGCTAGTGAGCACTTCCTCAGAAGAAGAGATAATTGAATAGACAAGAAAATCATTGTCTTCCTCTGGACGCTCCACCTGATTGTTGCTTGAAGAAGATGAAAAGAAAGAAGCATTCTCAAAGAACGTAACGTAAGGAGAGACGAGATATCTATTGAGATCAGGACAATAACACTGATACCTTTTTTGAACACGAGAATAACCAAGAAAAATGCATTTCAAGGACTTGGGGTCCAATTTTGTGAGTTGGGGGCGAACATCCCGAACGAAACAAGTACAACCAAATATTTAGGTTTGATAGAAAATAAAGGTTGTGTAGGACACAAAGTATGATAAGGTATCTCATCTTTAAGAACTGATGAAGGCATGCGATTAATTAAGAAACAAGTTGTCGAAACAGCATCAGCCCAAAAATATTTTGGAACATGCATTTGAAACATTAAGGCCCTTGCTGTTTCGTGGAGATGTCTATTTTTTCGTTCTGCAACTCCATTTTGAGATGGAGTATCAACACACAAGGATTGATGAAGAATACCATGTTCACCTAAGTAAGAGCCAAGAACATTAGAGAAATTTTTTTTAACATTATCGCTCTGTGAAACTTTAAGAGACCCATCAAATTGAGTTTGAATTTCAGCGTAAAAGTTACGAAAATGAGAAAGCAACTCAGAACGATTTTTCATTAAATATAGCCAAGTTACACGAGAAAAATCATCGACAAAAGTAACAAAATACCGAAACCCACTTTTGGACTCAACAGGACATGGACCCCAAACATCAGAATGAACTAACTCAAAAGGAGCACAAATTCGTTTATTGACTCTAGGATACGAACTTAGACGATGAAATTTAGAAAATTGACATGACTCACAATCTAAAGAAGACAAATTATGAAACTGGGGACGAAGACTCTTCAACACTGAGATTGATGGATGACCCAAACGACAATGTTCTTCCAAAGGAGATGACACTCTAGAACATGGGATGACTGCAGGTATTTGTATATCAAACGTGTAGAGACCTCCGGATTCACGCCCTTTACCAATAGTCCTCTTCGTCATAAGATCCTGAAATAAGCAATAACCAGAGAATAATAAGACACAACAATTAAGATCACGAGTAAGTTACTAACAGAGATCAAATTAAAAGAGAACTGTGGCAAATTTAAAACAGAGGTCAATGAAAGAGAGTTGGTAATACAAACTGTGCCCGATTCTAGAACAGAAGAGGTGGTTCCATTCGCTATAGTAACATCAGGCAAAGAAGTAGAGGGTGAAAGGGTAGAAAATAAACTAGGATTACCTATCATATGACCTGTAGCACCAGAGTCAATGACCCATTTGGATGAGAATGAAAGAAGGCATTTATTTGTGTTATCTGATCCAGCGATGGCTGTAATTGGATTAGAGGAAGATGTCGTCAATGACTCTTGATACTGTTGAAACTTAGCAAACTCTTATGCAGAGATCGTAATTAGCTTTTCGGGATCATCAGAAGTAGACGCAACATGCGCAGACTGAGTTCTTTGACCTTTATTCAACAACTTCCTACAATCACGCTTTGAATGGCTTGACTTATGACAATAATGACACCCAACATCACCTAAGTTCAATTTCTGAGTATTAGAATTCTCTCTTGGGTTATTTGAACCCACTCCTCTGTTACCTCGATACTCATTCGTTCGTCCAACCAAAGCACTGTTGGACTAGAATGATGTGACTGTTTGTGACTTCTCAGTACGAAGTATTCTAGTATATGCCTCTTCCAATGATTAACAAACTCACAATGATTAACTAGGCCAACAATCTCACTCTCAATCGAATTCTTGATCTGTAGAATCATCCTAGAATCATTCCGCAGCCAAGCTCTTCTGGTGCCGTCAGTCGGTGGATTTTCAGTCATGTGATCCTTCATATCAATGCTTCGTACAAAATGACGAACGTTTGTTCTCCATGCAT
AGTACTTGGATCCATTCAACTTATGTTCTGTGATTTTTGACATCATTGGAATCATCTCGGACGTCACTACTGATTTCTTCTCAGCCATAACCTCGTAAACAAGCACAAAACTTCAAAACTGAACCAAAATTGATCAGAATCACAAAACTCTAATAGACCCGAAATGTTTTTCCTTCAAACAGCCGCAATTTGACCAAGGTATGACTAAAAGACCAACACCATTAGAAAGAGGATGGAGCGGTGAACAAGAGCTCGTTGATTGGAGGCGATTCCGGCGGCGCACAAGCCCACACGTGCCGGAGCGTGAGATGGGGTGTCGCGGAGGTTGGCGGCGCGTGAGACTCACGTGCAAGTCGGATTTTGACCAAATTTTAGGGTTTTGCAGATCGGCTTGTGGGCTGAGCGATGTTGAAGACAGACGACGGCGACGACGGCGGTGCGGTGGTGGCGATGGCGATGACGTGGCAGCTGCGGTAGATTCACAATACCAGGTGAACAACGAGCACTTTCTAAGTGTTCCAAACTTCAAACCCTAAAGGTTTGAAAAATCACAAAACCCTTAGGTTAACAAAATCCTAATTGCCTTAGGAAAAGACCTAGCTCTGATACCATGTAGAGACTAAAGGTTTGAGAGAATCCAATAATTCTATTCCATTCACTAAGAGATATATATAGACAAGTATATAAGGTTAACCTAGAGTAAAGAATGTAAAAGGACGATAAAGGACAAATGACTAAGACTGAGATAAGTACAATAAATACAAAATATAATAATATAAATACTATAACACATATGTACACATATATTCTAAAAAAAAAAAAAAAGAACGTATCCCTGATGTGTTCGTATTCTACTTTTTTAAAAATTGACGTATCACTGTGTCGTGTCGCGTCTGTGTCCATATCTGTGCTTCTTACATAATGTGTCTTCAAGACTCAAGGAAGCATTCTTTCTTTGCTTATGACTATAAGAATGGTGCTATTGTATCTTACCTTAAAAAGTAAAATTTAGTATTTCCTAATTCCCACAGGGAGCATGTTGGTAAGACAACGTTAAATTTCAAGAGATGGGTTGAAGAAGCTAGTTTAGTTAGAAGCTAGAAAGCACAGACACTTCTTATAGGCAGAGAATATGTGTCGGACACGTGTCGGACACGGACAAGCCCGGGCACGCCCCGGACACGTGTCCGACACACCAAATAAGTGTCCGAATTTTTTTTTTTTTTTCAGACACGGGAGGGACACTGCTGGGATACGGCTGTAAAAAAAAAAAAAGAAAAACAGTGGCCAACTAAAAGCCCATGAAATTCAATTAAAAGCCCATTTTAGAAGCCCAACAGAAAGCTTAAAACAACCTAAAAAATCTGACATATTAAATTAAAAAATAATAATAAAACAAACCTAATTAATATTTAAAAGGAGACCCACGACTTCTTCTTCAAGAAAAGAAGAAAATCCATTCTGCGAAGAGAGAAGTCCTCCCTCCCCCAAACTGCGACTCTGTCTCTCAGCCCCCCACGTCGCGCCGTCGGCCTCGGCCTTTCAGATCGCCGCTAGCCACCACCTTTCAGGTCGCCGCTGTCTGCCTCCAAGTTGCCGCCGTTTGTCGCGTCTGCCCCCCAAATCTGCTTCCTTGCTGCCTTGCCACCCGAACCTTCGGCAATGTTAGTTTTTTATCTTTTTGTTTTTCTTTTCATTTCACAAAAAACAAAACTTAACTTCTCCTTTTAGATTTCAACTCTTCTCATTTAGATGTATATTTTCAGCTCAAAAACATTGAACTACATACTAAACTTTTGAGTTTTGACTATTACTTATTAGGATGACATCTTCTTCCTCTAGTCATTGTCAATCAACTCCTTCCATATCTTCAACTTCTAGTAATGTTGAGGATGAATCAAAACCACTTTGGCAATATGTGACTAAGCTTCAAAAATTGAGTGAAGGAGGTGGAAATTTTTTATGGCAATGTAATTTTTGTCATGCCGTAAAGAAGAGTT
CGTATACAAGAGTTAGAGCTCACTTGTTGAAGATAAGCGGCCAAGGAACTGGAGTTTGTCCAAAAGTCACACCTAAAGATATTGTCAATATGCAAAAGTTGGAAGATGAGGCGAAATATCGGAGTGAAAGGAAAGCTCTTAAAAATGTTCCTCTACCACGTTCAATCATATCTATTGGTGGTGTTAGTGTGAGTAATTCTCCTATCACAAGCAGTTTTGAGCATAAGAAAAGGAAGGGTAGTTCAAATGTAATTGAAAGATCATTTAACAAGGCATCAAGGGACCAATTGCATGCCCTTATTGCTCGAATGTTTTATTCTGCTGGCCTACCATTTCATTTAGCGAGGAACCCACATTTCATAGGTGCTTTTACTTATAAATAATATGTTATCGGGATATATTCCTCCTGGATATAATTTGTTGAGGACAAGTTTGCTTCAAAGAGAAAAAGCGAATATAGAGAGATTATTGACGGCGGTTAAATCCATATGGAGTCAGAAAGGTGTGAGTATTGTGAGCGATGGATGGAGTGACTCACAAAGGGGACCTTTGATTAACTTTATGGCTGTTACAGATGGTATACCAATGTTTCTAAAAGCAGTGGATTGTTCTGGCGAGATCAAAGATAAGTATTTTATTGCAAATTTGATGGAAGAAGTGATTAACGAGGTCGGTCATCAAAATGTGATTCAAGTGATAACAGATAATGCTCCTAAATGTAAAGGTGCAAGCCAAATTATTGGAGCACAATTTTCGACTATTGTATGGACACCGTGTGTAGTTCATACCCTCAATCTTGTCTTGAAGAATATATGTGCTGCCAAGAATGTTGAAAATAATCAACTTGTATATGAGGAATGCAGTTGGATCTCTGACATTGCTGGCGATGTCATGGTAGTGAAAAATTTCATCATGAATCATTCTATGAGGCTCGCTATGTTTAATGAGTTTGTATCTCTAAAGTTGCTGTCTATGGCTGAAACATGTATTGCATCAGTTATCATTATGCTTAGGAGGTTTAAGCTTATTAAAGGTGGATTACAAGCTATGGTGATTAGTGATCAATGGGCAAATTATAGAGAAGATGACGTGGGAAGAGCGAGACATGTGAAGGAGTTCGTGCTTGATGACATATGGTGGGACAAAATTGAGTATATTATTTCCTTCACTGGACCTATATATGACATGATCCGAGCTTGTGATACGGACAAACCTTGTCTTCATTTGGTGTATGATATGTGGGACACTATGATTGAAAAGGTGAAGACAACAATATATAGACATGGAGGATTGCCACCAAATGAATCCTCCTCTTTTTATAATGTGGTGCATAATATTTTTATTGATCGTTGGAACAAAAATAATACTCCACTTCATTGTCTAGCGCATTCCTTGAATCCAAGGTACTTTTTAGTTACTATTCTTTTGTCTATATTGTAATTGTATTATAATTTATAAGAGTTTAATATTTTTTAAACTTATATTAGGCAGTGTTTTTTAAAGCTCGAGGCGCACTAAAGCGCAATAGCCATCTGGGGCTTAAGCGCGAGGCGCAAAAAAAAGCGCGAGCTTTTTTTTGTGAGGCGCACTATATGTAAAAAATACATAAATATATATGTACATGTACATATTCAAAATAAAAAACATCAATGAAAAGAAATGAGATATAATAAAGAAAAAGATAATAAAATATAACATATAAGAGACAAACAAGGTTCAAAGATTTTAAGTCTTTCAACACCAAGAGTCAATGCTCTATAAAAACATAAAAACAATCCTAACATAGTATGATAGGTATCACAAATAAAAGTCTAAAGATCAAGCTCATCATCACTAAATAGGTCTTCATTTTCATTCTCTCCATCATTAGACTTATAGCTATCTGCATCTTCCTCTTCGGTCTCATCTCTATCGTCATCCAAATCTATTGGTGTGGTCTGTGTGGATGAACATGAAGCTCTAGTCCTGGTTCTTGAAGTACTTGCTCTAGAATGATAGGT
GGGTTCTTCTGCTCCAGCAGCTCTCGAGACATCGCCCCATGTTAAAGAATCATCACCGAATACAAGATCGTCATCCTCTTCAGAATCATCCATTTTTCCAGTCAACCACTCATTACTATTGTCGATGTCCTTTAAAGAGATGAGATCAATGATGTCTCGAATATTATATCGACGTTTCAATGCTCTATTATATTTTATGAAAACCAAATCATTCAGACGACTTTGAGCAAGCCTATTCCTTTTTTTGCTATGAAGCTGCAAGTATAAAAGACAAAAGTTCTATAATCATTTAAACCAAGAAGTTAAAACTTAATTCTCAAGTTATATGGGGAGAACTAACCTGTTCAAAGACACTCCAATTACGTTCACAACCAGAGGCACTACAAGTTAGACCTAAAATTCTCACGGCAAATATTCGCAAGTTTGGAGTTGAAGTTCCAAAATCATTCCACCATTCCACTATTGAACTAAACTATGTTAATAACTTTCAATCAAACAAAATAATTTACTTAATATTATCAAACACTCAAAGTTAAACGTTACCTGGAGATCTTTTATTTCTTTGCCTAATAGCCATGGTTTGTCCAAATAGTCCTTCGGCCTTCTTGTACTTGGTCAGTTCTTCTAATATCTTGTCTTGTATCTCTGGAGAAGAGACCATTTTCGTTATACATGAGTACAACCCATTAACAACTTCATCATCACCTTCGATACTAGGATTTGAATAAAAGAATTCTGGGTTTAAATAATATCCCGCTGCATGCAAGGGACGGTGCAATTGAAGCTCCCATCTTCGATCAATGATTTGAAAAATATCCTTGTATTTTTCTTCCTTTTCATTAAAGGATTTAGCTATGGCCTCCTTAGCCCTATCCATGGCCTCATAAATATATCCCATAGGGGGCTTCCTCTCGCCATCCACCAACCTAAGAACTCGCACTAGAGGGCCTGAAACTTTAAGAGCAAACACAACTGTATTCCAAAAAGTAGTCATCAATATAGTTTGAACAATTCGCTTGCCTTGTTGCTCCTTGCTCCATTTGCTATCCTTCCATTCATCTGAAGTAAACATCTTCCTCAAGTTGTTTTTTTGACGATGTATACTTGATAATGTGATGCAAGCTGTAGCAAATCGAGTCTTAGCTGGCCTAACTAACTCCTTTTGATTAGTAAACCGTCTCATCATATTTACCAATCCAGGACGAACATAAATGAAATTACTGATCTCTATGCCCCTTTTCAATGTGTTGTGAATGTTTGGGATCTTGAATATGTCCTCCAACATCAAGTCTAAGCAATGAGCGGCACATGGTGACCAAACTAAATGTGGTCGTTTTGCTTCTAACAATCTCCCTAGTAACAAAGTAAAATATGAACATATTAGGCAACTTGAAATTTGTAAGACTAATTTGAAGCCTTAAACATATATTACTTTAAGAGAAAGCTAGGAATTTGCATTCTTACCTGCCATTACATTTGCTGAGGCACTATCAGTAACAACTTGTACAACATTGGCTTCTCCAATGCGCTCTACAAAATCGTCAAGCAATTCAAACATTTTTTTTCCACTCTTCACATAAGATGAAGCATCAATGGACTCAATAAACATTGTGCCTTTTGGACTATTAACTAAGAAGTTAATTAATGTCCTATCTCTTCTATCCGTCCATCCATCGGCCACGATAGTGCATCCAACCTTGGCCCACTCTTCCTTATGACTCTTCAACAACTCATGTGTTGTTGCTAACTCTTTTTTCAACAATGGCACTCTCAGCTCATGATAAGATGGTGGCTTCAATCCAGGGCCAAATTGTCATATTGCTTCAATCATAGGACCAAAGCTTTCATAAGTGCAAGCGTTTAGAGGTATTCCAGTATCATAAAACCATCGAGCAATTCTTTGAACTGTGCGGTCCCTCATTTCCTTTTTGTAGGCTTCATTCAATGTTGTTTGCTTTCCTTTCTCATTCTTTCGATTTTGAATAACTGTTTCCGGATTA
GGAGTAAAAAATGCATCCATTGGACCCTTCTGTCTTGGCTTCTTAAAAGACGACTTAATTGATTCTAAACTTGCCCCTTGAGATGACGATGTTCTCTTACTCGAATTATTTACCACCAAATCGTCCTCATCCTCATCCTCAATACCAAAATCTTCATCGTCAATATCAGGTATAAGATTTCTTTGTTCTTTGATCTCCTTTTTCTTGGACATATATTCTTTAATTTCTTCCTTCACATGGTTCGGGCATTTCTTGCACGCAGTTGTATTTCTATAACCACCAACAAGGTGTTGCTTCACTCTATAAACACCTCCTTTTGTTAATTTTGAACAAAAACTACAAACAAATGTAGTTATATCTTGTGGATTCTCCAAACGAGCATACTTCCATGCCGGATCTTTTCTCGGGCCGTCGTTAGACATCTATGTAATGTAATCAATACCAACCTAATTAATTAAATCAAATTGACCAATTGTTAGACTTAGCAAAATTCAAAGAGTTAGTGAAAAAGTGAGATAGAGACAGCAAGCGAGAATGGAGAGAGAGCTTACCCGAAGCTTCTGTCGAAAGAGGAGAAAGGAAGAAAATCGTAGAGAGAAGAGAAGCTCAAATATTCTGCCGTGAAGAGGATAAAGGTAGAGAAAAATCGGGAGAGAGGAGGGAGGAAGAGAAAAAGGAGGGAGGAAGGAAGAGAAAAATCGAGAGAGAGAGGAGGGAGGAAGAAAAAATCGAGAGAGAGAGGAGGGAGAGGGAGGAAGAAAAAATCGAGAGAGAGGAGGGAGAGGGGAGAGGGAGGAAGAAAAAATCGAGAGAGGAGGGAGAGGGAGGAAGAAAAAATCGAGAGAGAGAAGAGGGCTGGAGAAATCGGGAGAAAGGGGAGACTTCAATGAAGAAATCGGGGGAGACTGGGAGAGACAGGACCAGAAGAAATAAAACCTGTTTAAATTAAAAAAAAAAACCGATAGCCCTAAACGCAAGGATTTCTGCGTTTAGGGCATAATTTTTTTTTTTTATTATTAAAGAAGCGTCGAGGCGCCGCAATTCAGCGCCTCGACGCTTCGCGGGAAGGCGCGCGCTTAACGCGCGCCTTCCTATCGGCGCCTCGCTTATCCAAAGCGAGGCGCAATTCTTGCGCTTAGCGCCTAGGCGCGCCTCAGGGCGCTTTTCAAAACCCTGATATTAGGTACTATAGTGAAGAATGGCTTAAAGAAGACCCTAATCGAGTGCCTCCGCATAAAGTTTTGGAAGTAACTCGAGAGAGAATGAAGTGTTTTAAGAGATACTTTAGCACCAATGAGGAGCGTGCAAAAGTGAAGATTGAATTTGCTAACTTTTCAACAAATGCTGGAGATTTTGCTGATTATGAATCCATAATTGATAGGCATCAGTTAGATCCTAAAAGTTGGTGGGCCACTCATGGTGTTTATGCACCAACTCTTCAATCAATTGCTTTTAAACTATTAGTGCAGCCTTCCTCCTCTTCATGTTGTGAGAGAAATTGGAGCACATACTCATTTGTTAATTCGATTAGACGAAATAAAATGACGCCACAACGTGCAGAAGATTTGGTATTTATCCACAGTAATCTTCGTCTTTTGTCAAGAAGAACTCCAGAATATTTAAAAGGAGAGACTAAGTCATGGGATATTGCTGGAGATTCTTTTGATTCTTTTGAAGACGTTGGTATGCTTGAAGTAGCTAATTTATCATTGGATAAACCAGATTTGGAGGCTGTATGATGATGGTGGTGAGCTTGATCGAAGTGATGGAGATGAAGATGTTATGAAAATTTAAGATGAGAGCTTTAGTTTTTTTGTATGAAAAACTATTTTCTATTTTCTTGTTGTTCTACTCTACTTTGTGTTATATACTTTGTTGGATCTTTGTTTTGTTTTGTACTATTGTCAATTGATTTGACATTTTATATTATCTAGTGAAATACATCGTCGTTTATGTTAAATTTACATGTGTTTGTCTATTATGAAGTATTAAATATATTC
ATATATACACATATATTTAAAAAAAAAAAAGAAAAAATTACATATCCCCGACGTGTCTGTGTCCTACTTTTTTAGAAATTGACGTATCGCTGTGTCGTGTCGTGTCGTATCCGTGTCCGTGTCCCGTATCCGTATCCGTGTCCGTGTCCCGTATCCGTATCCGTGTAGTTAGAAGACAATATGGAGAAGCGTGCAGGAGAAAATGCTCTTAAAAGGAAGAACACTTTCACGGCAAGATAATTTGCATCTTCAAAAAAATATATCAATCTTATTTACAATTTTGAAAAAAGCACAGATATTTGTCATCTTATTAGTTTATAACCAACAAAACCCCCTTATGTTAGGAACAACAATAGCTCTTTCCTTTCCCATTTTTTTAGCGCCTTTCTTTTTTTTTTTTTAATTTGCCAATTAGGTTCATTTAGTAGGACGATTCATTGGTTGAGATAATTAGAATTCGGAATTGGAAGCTTAAGAGGATTTGTTTGCATCGTCTTGGCCTTTCTTTGATCTTTTTTCTTTTCTTTCTCCATATGTTCAATAAAGTTCTGATATTTTTTTTCCTTATGGAAGTTTTTCTCTGCATATTTTATCTTCAGTATTTTCATTTTTTTATTCAGGTACACCCCTTATTTTTTTGATAAAAATATGCTCGATAATATTCTTGAAGAATCTGTTGATCAGCATTTTCATGCTTTAATCCAAACTCGGCGTTTGCAGCGCCGTAGGGAGATAGTTGATGATAATGCAGCAGCAGAAGTTCTTGAAGAGATTGGCGATAGTTTGTTGGAGCCTCCAGAGGTATGATATTCAATAGTCTGCTAAAATTTCTTGAACAGTTTATAGTAAGGAGGTACTGGGTATTTTATTTATATTTCAATGCCCAGCATTTCTACGGCCTCTCTCATTTCTTGAGTTCAAACTGGGACTAGCTCCTTACACAATGTTTGGTAGTTAGGGAACTCATGGAAATTGGATATCAACATCCTATATACGTAGCCTTAAAGTATTGAATGGTCTTCGTTTAGGCAGTGGTTGGGTTGGGTCATCTTTGGCTTAAGTATTTCCGTCTTCCCCTTCTAATAGAAAATAATGCATACGGTTAGTTCCAAGTATCATAGCTTTCAGTAAACATATATCCTATTTTTCAAGACGAGGGAGGCTGACTTTGATGTATAAGCTACTCTTACTAGTACTCCATTTTATTACCTACCAGAAAATTTTACTCCCTTTTATTTCCTCCTAGAGAATTTTTACGTGCAATTGCACGTGTGATGTATTTTAACGTGTTTTTAAATTCTAAAACTTCATACGGTAAAAATTTTATATTTTGTTTTAAAAGTTATTTTTGAAATCTTATCCAAATTGAAAAACAAAACATAGGTTCTTTAAACTAACTTTTTCGTTTTTGAAATTTGATTAAAATATCTAAAATATTTTTAACTTAACGTGATTCATTTTAATTCTAGGACATAAATTTTATTTTAATTTTAACAAAAGTAATGGAATAAATAAAATATAAACTATCATTATTAGGAGCATGTTTGGAATTTTAAAATTATTTAAAGGACAAATATGTCCAAAAATGATTCTTAACTATCCCACCTTTTAATAGTAAAGATTTATGTTTAAGATGCTCTCTAAACTAGCCAAGGCAGTTGAGAGAATTTCAGAAATTTTGGGGGAAGGCACGCACGTTGAAGGAGGAAATATTCTGATTTAGTGGGATCGGGTTTCTTTACCTAAGGAGTTTGGTAGTCTAGGTGTTCGCAAGAACAACGCTACCTTGTATTGTAGAGATATCCTAAGGAGTTTCTGAGGTCATTGAAATCAAACACGATAAGGTTTCCCTGATTCTGTATAGCCGATGTAAAATTGTTTTGGCCTAGTGTATTCCCTTAAAAGTAGTTGACGCAGCTGGATTAAGTTTAGGATGATCTTTGAGGAACTTAGCTATGATCAATTTCCTGGACTGTACAAGCTTGCTGATAGCCACCATC
AGCAAGGATCAAGGAGGCTTTGAATAATGTGGAGCTTAAGTGGGATCATAGCTTCAGTAGGAATTTGGGATGAAGAGGGCTATTGAATGGTAAGACTTTTGAATCTATATTCCCACTCAAAAGGAGGATTCTGGAACTTTGAAAGCTCAGGGTCATATTCTGCAAGTTCATGTTCATTCACCTGGTAGACAACACAGGCAATTTTTGCACATTTAATGGGTCTTGAAGTGTAAGGCTCTTAAAAAAAGTAAGGCCTTTTTTCTTTCCTCTTTTAAATTTGAACTTTTAATTAAGGCCCTGGGAGTTGGTTTTAAAAGCTGTTTTTTTTTTTTTTTTTCAAAATTAATGTTTTTCTTTTCAATTAAAATATGTTGAATAATTCTAAACTTAGTTTTTCTTATTTTATTCCAACATATTTTATTTTTTCTTTGTTTGAAACAAAAATGAACTTTTCATTGAGATAGTGAGAAGTTGCACAAAATTTAATGAAACTAAAAACTTGCAAGACAAGTCACTCTACCCAAACAAAACCCTTAGAAAAAACTTTCCCCACTAAAATTTATTACAGAACAAAGTAACCAACAAACTCTATTTTATGAGACCCACAAAGAACAATTGGCTTTTGCACAATCTACCTCCTTCCCCCCATTCTATAGTAACCCTGTTAAACAATCTATTATTCTTCTCCTTTCAAAGATTCCATTTAAAATATTTTACAGAGTTGTTCTACGACTTCCTTGCTCTCCCCTTGAGATAGGTACCACATAACCAAATGCTTTACATTTTCCCTCGTTGAATGATGGGTGACCCGCCATCAGTAAGTAGCTCCACAATTTTCTGCTCAAAAAACATGAAAAAAAACAAATGGGACATGTCTTCACTATCTGAATAACACAACATACACTGAGAAGGATTTAAACTCGTATAAATAATTTTCTTTCCAAACTCCACTGTAAAGGCTCTCTACACATGCTGGCCGTATAAACACTTCTACCTTTTCTGGACAAGTTCCTTTCCGTATCTTAGAGCAGAGACTTATTTCCAATGCCCTTTTATCATCCATCAACTTACAAACAGGAGATTTAACCGAAAAGCAACCACTCGATTTCAGCAGTCATACTCTTTTTTCTATTTCTTCCTTTAGCTCATGGAACTTAATTGACTCTGGCAGTAGCTTCACCTTCATCTCCTCTTTTTGCTTCAGATTTCTTCTAGTTATGATACTCCTAAACATATCTATTAGAACAGCTGAAGTTATTCGCAAAAATAGCTTTGATGCATAGTATCATCATATTTTAGAAAGAATCTCATCAATTTTTAAAAATAACTCTAAAAAGGACTTCTAAAAAATTGTATGCCAAAAAAAAAAATCCAATCAGTCCTAAATTTCTTCATTTGACTCTTTCTTATGGGAAAAATACCCTGGACAGGACCCCAAATTCTATCTCCCACTCGTTGTTTATAACAACGATAATATGAACCAAGGACACCTGTTTCTTGCTTGCAAGTTTTCCAGGAAAGTGTAGGAGACTAATTTCATCTACTCAAACACTTATTTGAATCAAGATTCCTACTTCAAATCCCTCGGGGGACTATTTGGGGTGTGGGTTTGGGTTTTGGTAGGTTTGGTTGGTCTCAAAGTAAAATCCTCGACAAGCTCACGACATGTATTCTCACTGCTCAGATAACTTATTGGTGTTGGACCACATCTTTTGGATCAGGCATGTCAATCCAAAAAGGCAAGTAACGGAATCATCCAGTGATGGCTTGATGTCTTTGCTGAATATCATCTTTTCGTGATAGTAAATGTAAATAGTTGAAATCAGTTATGACGGATAGTTGTAGGTTGGCAGCAAGTGCGCAGGAGTGGAGGGTTCAAAATAGGAGAGGAGGGGAAAGGAATCAGAGTAGGCAGACTGAGGCTTCTTTTTTCCATTTTTAGAGAGAGAATTTTTCATTCTCCCAACTAAAAAACAAACAATACAAAAAACACCTAAATC
CTCAACCGAAAGACCATCAAGGATTAATACAACCCATATCACCCTAACGCTTACAATAAATACCAACTAAAAAACCTAAGAAAAAAGCCCAAGGAACTTAGAAAAGCAAAAACTAAGCCACCAAATGGATAATTTTTAACAGAACTTTTCAGCAAAAATTGAGCTCCACTCCTCATATAGAGATCTTCAACCAGAGTTTGATGTGAGAACAGCAACTTCAATAGCACCAAAGAGAACCATCATAAATAACCCAAATAAGCCCATCACAACAACAAGACCAAAGCTTCACCCCTCCTACTTGCTGCTTAAAAACTCTAACCCTTTTGTTTGTTCGTTTGTTTTTTCTGGTAAGAAACTACCGTTTCATTATCGATGAACTTTTATTTGTTACCATTTCTTTACTGTATGAGATCGAAACCAAACTCAAAAGGTAAACCTCTAATTCTTCATGCTTTCTCCTTTTTTTTTTTTTCCTTCTTTTTCCCTCTCTCTCCTCCTTGACTGCACATGCTGATATCACAATGTTGAGATAAAGCATATCAATCGATGGCCACATGCAGTAGCAAATATTGGTTTATGGCCACATGGAGCAATGAATATTGGTTGACAGCCTTTGTTTCTAATTTGTGAAGCATGTTTAAAATTTATGTAATCATTTATATGTTCATAATACTTTCATTTTAAATTAAGTTAACTAGTATTACGAAGATGGTGTAATATACTCATTTGTATTACAATTTATTTCGATGAATATATACATATACACATGTACGTACTTTTTCATCCATATTCCTAAACACTCCTTGACTGCACAAATTGATAGAGTTCTAGCTTTGCGTTTACTACACACGATTGTGAACACTATCCTACAAACACATGATGCCAAATTTTAACTTAAATCTTAAATTTAGTAGTAATTCGAATATGCTGTAATATACTCATTTATATTACAATTTATCTTGATGATGCCAACTTTGGCTGCAATCTTAAATAAGGAAAGGTGGAATGCTAGGATGCTAGAAAGAACTGGCTGTTCCAAGGTTAATCTGCTTTTTCTTTTTCTTTTTCTTTTTGAAGTTAATGGAGAGGCTTTTTGTAACTCGTTTGGTTCCTTAGCAGGATAATGTTTTATTCTCCCACTTTTATACTCTTTCTGGGGTTCTGAGAAACATTATCATAGGATTGACTATTGCGCACTTCACCTACATATATTGTCTCACATATTCTTCCCTGGTAGGTTCAAGAAGTGGTGGATGAGATGGGCAATCCTGGAATACCCTTGAGTGTCATCTTTAAGGTTACTGAAATGCAGCTTCTTTATGCTGTTGATAAAGTAATTCTTGGAAATAGGTGGTTGCGTAAAGCTGTTGGTATTCAGCCAAAATTCCTTACATGGTCGACTCATTTGAGAGAAG

mRNA sequence

ATGACCTGGACCAAGAGTTGGCTTTTACTTCCATTTACTTCAAGTTTGGCAATCTTAGCTACTCAATCTCGTTCGTTTGATGCTTTTTCCACCCACATACTAGTATGGGATTTTTTTCTCCTTGAAGAACTTGTGGGTGAGGGGAAGATTCTTAAGCCTCTCAAAGCAACCTTCCAACTGAATGATGGAGTCCTAGTGAAAACCCCTGAGTGGCTTAAGATGATGTTCTCAACCATTACTAAGAGTGAAAGAAATGGCCCCATATTCCGCTTCTTTACTGATTTAGGTGATGCAGTTACATACGTTAAGAGACTGAATATTCCAAGTGCTGTGGTGGGTGCATGTCGTCTTGATTTAGCGTATGAGCATTTCAAGGAAAAACCTCATTTATTTCAGTTCATTCCGAATGAGAAGCAAGTTAAAGCAGCAAACAAGCTTCTTAAAGGGCTACCGCAAAATGGTGGAAGCAAAAGGATTGATGGTGTTCCTGTGTTCAGTGCCCAAAACTTGGATATTGCAATAGCAACTACTGATGGGATTAAGTGGTACACCCCTTATTTTTTTGATAAAAATATGCTCGATAATATTCTTGAAGAATCTGTTGATCAGCATTTTCATGCTTTAATCCAAACTCGGCGTTTGCAGCGCCGTAGGGAGATAGTTGATGATAATGCAGCAGCAGAAGTTCTTGAAGAGATTGGCGATAGTTTGTTGGAGCCTCCAGAGGTTCAAGAAGTGGTGGATGAGATGGGCAATCCTGGAATACCCTTGAGTGTCATCTTTAAGGTTACTGAAATGCAGCTTCTTTATGCTGTTGATAAAGTAATTCTTGGAAATAGGTGGTTGCGTAAAGCTGTTGGTATTCAGCCAAAATTCCTTACATGGTCGACTCATTTGAGAGAAG

Coding sequence (CDS)

ATGACCTGGACCAAGAGTTGGCTTTTACTTCCATTTACTTCAAGTTTGGCAATCTTAGCTACTCAATCTCGTTCGTTTGATGCTTTTTCCACCCACATACTAGTATGGGATTTTTTTCTCCTTGAAGAACTTGTGGGTGAGGGGAAGATTCTTAAGCCTCTCAAAGCAACCTTCCAACTGAATGATGGAGTCCTAGTGAAAACCCCTGAGTGGCTTAAGATGATGTTCTCAACCATTACTAAGAGTGAAAGAAATGGCCCCATATTCCGCTTCTTTACTGATTTAGGTGATGCAGTTACATACGTTAAGAGACTGAATATTCCAAGTGCTGTGGTGGGTGCATGTCGTCTTGATTTAGCGTATGAGCATTTCAAGGAAAAACCTCATTTATTTCAGTTCATTCCGAATGAGAAGCAAGTTAAAGCAGCAAACAAGCTTCTTAAAGGGCTACCGCAAAATGGTGGAAGCAAAAGGATTGATGGTGTTCCTGTGTTCAGTGCCCAAAACTTGGATATTGCAATAGCAACTACTGATGGGATTAAGTGGTACACCCCTTATTTTTTTGATAAAAATATGCTCGATAATATTCTTGAAGAATCTGTTGATCAGCATTTTCATGCTTTAATCCAAACTCGGCGTTTGCAGCGCCGTAGGGAGATAGTTGATGATAATGCAGCAGCAGAAGTTCTTGAAGAGATTGGCGATAGTTTGTTGGAGCCTCCAGAGGTTCAAGAAGTGGTGGATGAGATGGGCAATCCTGGAATACCCTTGAGTGTCATCTTTAAGGTTACTGAAATGCAGCTTCTTTATGCTGTTGATAAAGTAATTCTTGGAAATAGGTGGTTGCGTAAAGCTGTTGGTATTCAGCCAAAATTCCTTACATGGTCGACTCATTTGAGAGAAG

Protein sequence

MTWTKSWLLLPFTSSLAILATQSRSFDAFSTHILVWDFFLLEELVGEGKILKPLKATFQLNDGVLVKTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKEKPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEVVDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKFLTWSTHLREX
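
The relationship between the CDS and the protein sequence above can be sketched with a small translation routine. This is an illustration under the assumption that the standard genetic code (NCBI translation table 1) applies and that an incomplete final codon is reported as `X`; the page does not say which tool produced the protein sequence. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence; unknown or incomplete codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        # A trailing fragment shorter than three bases is not in the table.
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)

print(translate("ATGACCTGGACCAAGAGT"))  # first six codons of the CDS
print(translate("CATTTGAGAGAAG"))       # last four codons plus one trailing base
```

The first call yields `MTWTKS` and the second `HLREX`, matching the start and end of the protein sequence listed above. The trailing `X` is consistent with the CDS ending on a single unpaired base rather than a stop codon, which suggests the annotated gene model is partial at its 3' end.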
Homology
BLAST of Sgr021201 vs. NCBI nr
Match: XP_022157864.1 (uncharacterized protein LOC111024477 [Momordica charantia])

HSP 1 Score: 445.3 bits (1144), Expect = 4.3e-121
Identity = 219/226 (96.90%), Positives = 222/226 (98.23%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWLK MFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLKKMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIP+EKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY
Sbjct: 211 KPHLFQFIPSEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376
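
The identity and positive percentages reported for each HSP are simple ratios over the alignment length. The sketch below shows how they could be recomputed from one alignment row, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch). The helper names `alignment_stats` and `percent` are hypothetical.

```python
def alignment_stats(query, midline, subject):
    """Return (identities, positives, aligned_length) for one alignment row."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("alignment rows must have equal length")
    aligned = len(query)
    # Identities: same residue in both rows, ignoring gap characters.
    identities = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    # Positives: every non-blank midline column (identities plus '+').
    positives = sum(1 for m in midline if m != " ")
    return identities, positives, aligned

def percent(count, total):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)

# Last row of the alignment above (Momordica charantia hit).
query   = "VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF"
midline = "+DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF"
subject = "MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF"

print(alignment_stats(query, midline, subject))
print(percent(219, 226), percent(222, 226))
```

Summing the per-row counts over all rows of an HSP gives the totals in its header; for this hit, 219 identities and 222 positives over 226 aligned columns correspond to the reported 96.90% and 98.23%.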

BLAST of Sgr021201 vs. NCBI nr
Match: XP_038874546.1 (uncharacterized protein LOC120067161 isoform X2 [Benincasa hispida])

HSP 1 Score: 444.9 bits (1143), Expect = 5.6e-121
Identity = 218/226 (96.46%), Positives = 222/226 (98.23%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWLK MFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLKKMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLPQNGGSK+IDGVPVFSAQNLDIAIATTDGIKWYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIAIATTDGIKWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. NCBI nr
Match: XP_038874545.1 (uncharacterized protein LOC120067161 isoform X1 [Benincasa hispida])

HSP 1 Score: 438.3 bits (1126), Expect = 5.2e-119
Identity = 218/232 (93.97%), Positives = 222/232 (95.69%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWLK MFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLKKMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKW---- 186
           KPHLFQFIPNEKQVKAANKLLKGLPQNGGSK+IDGVPVFSAQNLDIAIATTDGIKW    
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIAIATTDGIKWCVIF 270

Query: 187 --YTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEP 246
             YTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEE+GDSLLEP
Sbjct: 271 LGYTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEMGDSLLEP 330

Query: 247 PEVQEVVDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           PEVQEV+DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 PEVQEVMDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 382

BLAST of Sgr021201 vs. NCBI nr
Match: XP_004145902.1 (uncharacterized protein LOC101215938 [Cucumis sativus] >KGN49918.1 hypothetical protein Csa_000071 [Cucumis sativus])

HSP 1 Score: 434.5 bits (1116), Expect = 7.5e-118
Identity = 213/226 (94.25%), Positives = 219/226 (96.90%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +T EWLK MFSTITKS+RN PIFRFFTDLGDAVTYVKRLNIPSAVVG CRLDLAYEHFKE
Sbjct: 151 RTEEWLKKMFSTITKSKRNAPIFRFFTDLGDAVTYVKRLNIPSAVVGVCRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLPQNGGSK+IDGVPVFSAQNLDIAIATT+GIKWYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIAIATTNGIKWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. NCBI nr
Match: XP_022996000.1 (uncharacterized protein LOC111491342 [Cucurbita maxima])

HSP 1 Score: 431.4 bits (1108), Expect = 6.4e-117
Identity = 209/226 (92.48%), Positives = 219/226 (96.90%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWL+ +FSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLRKLFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLP+NGGSK+IDGVPVFSAQNLDIAIATTDG++WYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPENGGSKKIDGVPVFSAQNLDIAIATTDGVQWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREI DDNAAAEV EE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREITDDNAAAEVFEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGN GIPL+VI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNLGIPLNVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. ExPASy TrEMBL
Match: A0A6J1DXS3 (uncharacterized protein LOC111024477 OS=Momordica charantia OX=3673 GN=LOC111024477 PE=4 SV=1)

HSP 1 Score: 445.3 bits (1144), Expect = 2.1e-121
Identity = 219/226 (96.90%), Positives = 222/226 (98.23%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWLK MFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLKKMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIP+EKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY
Sbjct: 211 KPHLFQFIPSEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. ExPASy TrEMBL
Match: A0A0A0KK08 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G140440 PE=4 SV=1)

HSP 1 Score: 434.5 bits (1116), Expect = 3.6e-118
Identity = 213/226 (94.25%), Positives = 219/226 (96.90%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +T EWLK MFSTITKS+RN PIFRFFTDLGDAVTYVKRLNIPSAVVG CRLDLAYEHFKE
Sbjct: 151 RTEEWLKKMFSTITKSKRNAPIFRFFTDLGDAVTYVKRLNIPSAVVGVCRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLPQNGGSK+IDGVPVFSAQNLDIAIATT+GIKWYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIAIATTNGIKWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. ExPASy TrEMBL
Match: A0A6J1K5H1 (uncharacterized protein LOC111491342 OS=Cucurbita maxima OX=3661 GN=LOC111491342 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 3.1e-117
Identity = 209/226 (92.48%), Positives = 219/226 (96.90%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWL+ +FSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLRKLFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLP+NGGSK+IDGVPVFSAQNLDIAIATTDG++WYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPENGGSKKIDGVPVFSAQNLDIAIATTDGVQWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREI DDNAAAEV EE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREITDDNAAAEVFEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGN GIPL+VI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNLGIPLNVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. ExPASy TrEMBL
Match: A0A6J1H266 (uncharacterized protein LOC111459695 OS=Cucurbita moschata OX=3662 GN=LOC111459695 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 3.1e-117
Identity = 209/226 (92.48%), Positives = 219/226 (96.90%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWL+ +FSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE
Sbjct: 151 RTPEWLRKLFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 210

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQFIPNEKQVKAANKLLKGLP+NGGSK+IDGVPVFSAQNLDIAIATTDG++WYTPY
Sbjct: 211 KPHLFQFIPNEKQVKAANKLLKGLPENGGSKKIDGVPVFSAQNLDIAIATTDGVQWYTPY 270

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDKNMLDNILEESVDQHFHALIQTRRLQRRREI DDNAAAEV EE+GDSLLEPPEVQEV
Sbjct: 271 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREITDDNAAAEVFEEMGDSLLEPPEVQEV 330

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           +DEMGN GIPL+VI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 331 MDEMGNLGIPLNVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 376

BLAST of Sgr021201 vs. ExPASy TrEMBL
Match: E5GBD3 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 1.2e-116
Identity = 213/234 (91.03%), Positives = 221/234 (94.44%), Query Frame = 0

Query: 59  QLNDGVLVKTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLD 118
           QL  G +V T EWLK MFSTITKS+RNGPIFRFFTDLGDAVTYVKRLNIPSAVVG CRLD
Sbjct: 51  QLPFGEMV-TQEWLKKMFSTITKSKRNGPIFRFFTDLGDAVTYVKRLNIPSAVVGVCRLD 110

Query: 119 LAYEHFKEKPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTD 178
           LAYEHFKEKP LFQFIPNEKQV+AANKLLKGLPQNGGSK+IDGVPVFSAQNLDIAIATTD
Sbjct: 111 LAYEHFKEKPDLFQFIPNEKQVQAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIAIATTD 170

Query: 179 GIKWYTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLL 238
           G+KWYTPYFFDKNMLD +LEESVDQHFH LIQTRRLQRRREIVDDNAAAEVLEE+GDSLL
Sbjct: 171 GVKWYTPYFFDKNMLDKVLEESVDQHFHGLIQTRRLQRRREIVDDNAAAEVLEEMGDSLL 230

Query: 239 EPPEVQEVVDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPKF 293
           EPPEVQEV+DEMGNPGIPLSVI KV EMQLLY VDKVILGNRWLRKAVGIQPKF
Sbjct: 231 EPPEVQEVMDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKF 283

BLAST of Sgr021201 vs. TAIR 10
Match: AT5G62650.1 (Tic22-like family protein )

HSP 1 Score: 383.3 bits (983), Expect = 1.9e-106
Identity = 176/225 (78.22%), Positives = 206/225 (91.56%), Query Frame = 0

Query: 67  KTPEWLKMMFSTITKSERNGPIFRFFTDLGDAVTYVKRLNIPSAVVGACRLDLAYEHFKE 126
           +TPEWLK MFSTITKSERNGP+FRFF DLGDAV+YVK+LNIPS VVGACRLDLAYEHFKE
Sbjct: 122 RTPEWLKKMFSTITKSERNGPVFRFFMDLGDAVSYVKKLNIPSGVVGACRLDLAYEHFKE 181

Query: 127 KPHLFQFIPNEKQVKAANKLLKGLPQNGGSKRIDGVPVFSAQNLDIAIATTDGIKWYTPY 186
           KPHLFQF+PNE+QVKAANKLLK +PQNG +++++GVPVF AQNLDIA+AT DGIKWYTPY
Sbjct: 182 KPHLFQFVPNERQVKAANKLLKSMPQNGKTQKVEGVPVFGAQNLDIAVATADGIKWYTPY 241

Query: 187 FFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEIGDSLLEPPEVQEV 246
           FFDK +LDNILEESVDQHFH LIQTR +QRRR++VDD+ A+EV+EE+GDS+LEPPEVQE 
Sbjct: 242 FFDKAVLDNILEESVDQHFHTLIQTRHVQRRRDVVDDSLASEVMEEMGDSMLEPPEVQEA 301

Query: 247 VDEMGNPGIPLSVIFKVTEMQLLYAVDKVILGNRWLRKAVGIQPK 292
           ++E+G  GIPLSV+ K  E+QLLYAVD+V+LG+RW RKA GIQPK
Sbjct: 302 MEEIGTSGIPLSVVAKAAEIQLLYAVDRVLLGSRWFRKATGIQPK 346
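
The Identity and Positives percentages in the HSP summary lines above are simple ratios of matching alignment columns to alignment length (for example, 176/225 for the TAIR 10 hit). The following is a minimal Python sketch, not part of the database page, that recomputes them from a summary line and counts matches in a midline. The helper names `hsp_percentages` and `count_midline` are hypothetical, and the midline string in the test is a made-up toy example.

```python
import re

# Matches the summary line printed under each "HSP 1 Score" header.
# "Posi?tives" accepts both "Postives" (as spelled on the page) and "Positives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(summary_line):
    """Recompute (identity %, positives %) from an HSP summary line."""
    match = HSP_RE.search(summary_line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % summary_line)
    identical, id_len, positives, pos_len = map(int, match.groups())
    return (
        round(100.0 * identical / id_len, 2),
        round(100.0 * positives / pos_len, 2),
    )


def count_midline(midline):
    """Count (identical, positive) columns in one alignment midline.

    In BLAST protein output a letter marks an identical residue, '+' marks
    a positively scoring substitution, and a space marks a mismatch or gap.
    Positives include the identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, identical + midline.count("+")


if __name__ == "__main__":
    line = ("Identity = 176/225 (78.22%), Postives = 206/225 (91.56%), "
            "Query Frame = 0")
    print(hsp_percentages(line))
```

Summing `count_midline` over every midline row of one HSP should reproduce the numerators in that HSP's summary line, provided the midline text is copied with its spaces intact.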

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity  Description
XP_022157864.1  4.3e-121  96.90     uncharacterized protein LOC111024477 [Momordica charantia]
XP_038874546.1  5.6e-121  96.46     uncharacterized protein LOC120067161 isoform X2 [Benincasa hispida]
XP_038874545.1  5.2e-119  93.97     uncharacterized protein LOC120067161 isoform X1 [Benincasa hispida]
XP_004145902.1  7.5e-118  94.25     uncharacterized protein LOC101215938 [Cucumis sativus] >KGN49918.1 hypothetical ...
XP_022996000.1  6.4e-117  92.48     uncharacterized protein LOC111491342 [Cucurbita maxima]

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1DXS3      2.1e-121  96.90     uncharacterized protein LOC111024477 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A0A0KK08      3.6e-118  94.25     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G140440 PE=4 SV=1
A0A6J1K5H1      3.1e-117  92.48     uncharacterized protein LOC111491342 OS=Cucurbita maxima OX=3661 GN=LOC111491342...
A0A6J1H266      3.1e-117  92.48     uncharacterized protein LOC111459695 OS=Cucurbita moschata OX=3662 GN=LOC1114596...
E5GBD3          1.2e-116  91.03     Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity  Description
AT5G62650.1     1.9e-106  78.22     Tic22-like family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description   Source   Source Term  Source Description    Alignment
IPR007378  Tic22-like        PFAM     PF04278      Tic22                 coord: 124..200; e-value: 1.6E-7; score: 30.9
None       No IPR available  PANTHER  PTHR35138    OS01G0225300 PROTEIN  coord: 66..292

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr021201.1   Sgr021201.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015031      protein transport
cellular_component  GO:0009507      chloroplast