Sgr015599 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015599
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: TPR_REGION domain-containing protein
Location: tig00004836: 170125 .. 189508 (-)
RNA-Seq Expression: Sgr015599
Synteny: Sgr015599
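
The Location field packs the contig name, the coordinates, and the strand into one string. As a minimal sketch, the Python below splits such a string and derives the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state. The helper names (`parse_location`, `span_length`, `reverse_complement`) are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Split a 'contig: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq):
    """Reverse-complement a DNA string (useful for '-' strand features)."""
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]


contig, start, end, strand = parse_location("tig00004836: 170125 .. 189508 (-)")
print(contig, strand, span_length(start, end))
```

Because the feature lies on the minus strand, a gene sequence reported 5' to 3' would normally correspond to the reverse complement of the contig interval; `reverse_complement` is included for that case.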
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATTCTCAGAGACCGATCGCAGAGGATGGGATATTCATACTGAAGAAGAAGATCGAAAGGAGGAAACTTTGGTGGGTTTTTTTTTCTTTTTTTCCAAATTTTCCTTAATTAATTTGCTTTTTGTTCATAGTTGTTTTTTGTTGTTGTAAATAGTGAAGGTCTTTTGCCCAAGCTTTTTTTGTTTTTTGTTTTTGTTTTTGTTTTTTGGCCCTTGTTTTTGGGTCTAATGTTGGGCAATCGATCGGTTTTGTTCAGGTGTAAGTATGTTGTTTGGTTGATGGGAAAGTTTAAAAAGAAAAAAAAAAGGTTTCTTTTTGGGATTTTCTCAGTGATGCTATGGTTTTTATTTGATAGTAGAGAGGAAGTTGGAATTGAATTTTCCTTTCCCGAAAAATGTTGCAGAGGAAGAGATGTGAAAGTCTCTTTCATAATTAATATATTTCTTTCAACCCTTTGAATTTGACTGAGAATCATAATGAAAATTGAAGGTTGTGTTTTTTAACCTTGACTTGAACTTGCTTTTGTTTTCATTTTTCCATTTAATTTTTGGAAAAAAAAATCACTTTCTATCCTATTTAATGGAAGGAATTAAAATTTATCTTTAAAATTTTGGTTTTATTAAATTGAATTTTGAATTTAAATAAGTGGTAAAATTGATTCTCCCTATGAGTTTTTTTTTCCTTTTTTTGAAGATTTGTCCTTGAATTTGATATTTTTTTATTTGTGTTCAAAGTTGAGTTCAAATGTTTGTTTTCCAAGAACTGGTTTTGTGGTTAGGTTAAGTTATTTTTGTGATTGTGTTCGAGCATTTTTGTGGTTAGATTTTTTGTATTTTATTAGTTGAATTTTGTGTGATTTACGAGTTGATTCATGGACAATTTTCTAATTTAATTTTAAAAATCATCAAATTTTTAATTTAATTTTTAAAATATATTAAGAGAAATTACACTGACTTAATCAGTAGCTATGCCTATGCTTTAGGTCACAATAGGTTAAAATTGATTTGCTAATGAGTTTGCGAGGTTTATATTTTGAATTGACATTTATTTTGTCCGAAAATATAAAAATTACAAAATTAATTATTTAAGTCATTTAACACGTCACATATAAGTATTTTGTCAAAAAATCATTTAACAAGTTGTTACTAGGAAAGAAAAAAAAAAGACTTCACATGCATAGCGTGACACATATCAAATAATAGACCAAATTAAATAAAGAAATAATAGTAACAAGAAAATTTTAAATCATGTGTTAATTGCAATAATGAAAATTGAAAAGACATGTAAAAGCAAAACCTGCTAATTAATAAAATCAAAATTCAATCCGAACATAACTTAGTAGATAAAGTATTATTACCATCTTAAAAATCGATGATTCTATCTCTCATTCCGCAAGTGTTGAACTAAAAAAATGAAAATTAAGTATTTAAATTAATATTTAATGGAAAAAAATAGTTGGAGAATTCGACTCTTGACCTCGAGAGATTGAATAAAAGATAAGAAGACATTTTTTTCTTTAGGGAAAGTTGCATTTGTTAATTAATTAATTTTTTTTTATAAAAGGAATGCAAATGGGACATGAAAGCGATTATGATATTATTTAAAGTGTTTAAAATTTCCACTTTATTCTTACAATTAATGCTAGCTGGACCAAGGGTTAATTATAATAATTTTTTTCTTTCATCTCCCTAGCTTACACGGTTGAGATTTTGAGAATATGGGTGGCATGTAGTTGGTATGTTGTTGTTGTTTTTTTTTTATCAAGATATTTGGTATTTCATAATTAAAAAAAAAAAAGTTCTCTCTCTAAAATGAGAACGTGAGTTCATAGGTGGTTGGCATTTGAAAATTTTGAATTTATAAGATATTGGCATTTGAAAAACTTGCGCATATTTATTTTGTGGGATCCACCGTTATTTGAAAATTATTTTCAAAAACCTTTTCATGTGTCTAATTTTACCATTTAATATATCCAGTTTTATTTCTCTTGGTTTTAA
ATAAAATTGTTGTCAAATAAAAATTTAAAATTTAAAAACATATGTCAATATACATTATTATTTCATGTAAAAATTTGTAGTGCGTATGTGTTTTTTTTCCTTTTTTTAGTTCAACAAATATTGGTGGGGATCAAATTATTATTTTTGGGATGGTAATAGATATCTTATCCACTGAGTTATGTTTAAATTGTGTATTTATGTGTACATACAATTATAAATAAAATAAAGTATATGTGACCCACCAATTATATTTGGCAAATTGATATCATTTGGACGGTTCACATGTGTTTTGCAAAAGAATGATTAATGGGTTGCCATTTACAATTTGTATGTTTTAGCTAAAAAGCAACCCAAAAAGACCAATAGCTTAAATTAAAGTGGAGATATAAATAATAATAATAATATTATTAATTAATTAATTAATTAAAGTGGACAAAAGAGAGAGAAAAAAAAAAGATAAACCCATCAATTAAAAAGGTGTCAACAATTCAATCATACAATGTTTCCCTTTCGCATCATATTATCCTTTAATTATGCCCTCTGATCTCTTCATTTATTCAAATTAAATTTAGTCAAAATATTTTAAAAATTGTATTTTAAAACGATTAAAATTATTTATTTTTTAAATTTAAATTTAATTTTAAAATTGAAAGCAAAAGGCAAACAGGTCTGGTACGCTCCTTGAACTGTAATGGCCGATGGAATTTGTCACAAGAATCAAAGGAACTTTAAATAAATGAAAAATAATTATTTATTTACCATAAAAAATTGTCAATTCAATATAGTTCATAATAAAATGGATGTTCGATCTTCTAGCTAAAACTGGTGAACTTAAAAAAAATGTCAATTAAAATTCTAAAATTTTAGTGCAATTATATATATATATTCCTCAAGATCAGCGACGGAATATTTGATTAATCAGATTTTTGAGATTGTTAATTAAACTGTGCGGATTACCCAAACGTGCCAGCAAGAGCAACTCATTAAAATAAGAGAACAACAATCACTGCTTTTTGTTTATTTATTTATTTGTTTGGTAATTAATTATCATTTCTTGTTTTTTTAATTAAAAAAAATTACACTATAAATATTTATTTTAAAATAAAATATTTAAAATTATTATGTTATATTTACTTATTGTAGTATTTGTTATCAATTTCATAAGTAAAAAGTGATAAAACAATGCTTTTATTTATCATTTGAAAACTTGGAAACAAGAAACTCTATTTTGTTGTTTCAAGATTTACATACAATTTTAAGAATATTTGTTAAAAATAAGAAACACAAAATAGAAATAATTACCAAACACATATCATTTTGAAAAATAAGGGCGAAGCCAGAAATTTTCATACGGAGGGACAAGTTTTTACATAATAACTTTTGGTATATATAATATTTCAATTATGAAAATTTAAGAAAAGATTAGGGAGAATTGTTAAAATATTAACTTTTTAATACAATTTTTTTAATCAAACACAAACTGGGGGGGGGGGGGGGGGGCGCCATTGCTCTCCCCATCCTTCTTCTCTACCTCTGCTAAAAATTTTATAACAAAAAATAGGAATTGAGAATCATAAAAACATTACCCCGAAAAGGCCTATAACTGACATAGTAGAAAAATATTACGCCTTTTGTTTTATTTATCATAGCATGCATTATTTTAATTATAAAAATAATTAAAAAGGTCAAATTCAATCATAGAAATATCACGTCCCAAATTGTTAAAAAAAAAAACGTATAACCTATTTTATAATAATATTAATAAAAAATACAAAAAGTGCGGTTATTTATAAAAATTCATTTTTTATTATGGTATTTAATTAAATTACTTGGGTAAAAAAAGAAAAAGGAAACTGCAGAGCAGCTGAGCCCCGTTCTAGGAATAAAAGGAAGCTTTTCATCTTTCTCCAGTCTCCTTTCTATTCCCCTTCTTTATTTCTTCCACTTGCCAGAGAAGAAAGTTGCAGAAGAAGAAGAAGAAGCAGCCTGGATAAAAGGGG
GAGCTTTCGATTCCCGGACGGAGGTCGGAGAGTGATAGACAACGACATATAATTTCAAGGGTCTGTCTTGTTTTCCTTGGAACAAACTTGAATTTTCTCTCTCTTTCTCTGTCTTTCCTTTAGATTTCCTTTCACTAATCTTTTTCTTTATTGGGGGGTTTTCAAGTGCATTATCTTTATTACCATGACTCTTCTGATTGTCAGGAACTACTCTTTGATCTTCTTTCCTGGAATCGGGTTTTCCTTGGATTTTGTTTTCCCCTGCAAATCCTTATCAGTTTCGTAGCTCAGGCTCCTGCGTCGTGGGTCTTCTGCGTGTTTTGCATTAATGGGGTAAGTTTTCTAATTATCTTTATTGATGATATTGTTCTGGCGATTGCTTCCTGTTCTAGCAGTATTCTTGTTGTGCAAATTGGGGAGTCTTCTGCCCGATTCATTTGTTCAATGGTTTAAATTCTTCGATATATTGAGAGTAATCGAAATGAAATTCAGCATTTGAGTAGTTTGCTTTTGGCTGCAAACAAGATCGGAAGAGAAGGGGAAAATGCTATGGTTTAGTTTAGCCTCTTTTATTAAGATTCTAATAGTTTAGCTCTATTAAAGGATTCTTAAGGACTTGTCCAAGCTGATTTCCTGGTGCTGGTGTGTTCTCCATTGTTCAATTAATTTAAAAGGATAAGGCATCTTACTGTTAGCTACTTTATGCTTAAAGTTAACTTAAAAAAGAGTTCTCCATTGTAATTGGCTATGCTTGATGTTTGTTGACTATATAGAAAAGCTAACAACTTTAAGCTTTTAAGTCTAGTATTGATTTAACGTGTCATCTATCTCCAGTTCAAACTTCTACAATGTTATTCCTTCCCACCTTTCATATTAATAGTTCATATGTTAGGCTTTGTATTAATATCTTAGCCCACTTGTGAGGCAGAGCATTGAGATTATATTGAAAATGTTAAATTTACCTTAACCTATAGATTTAATTTACTTTTGGGTTTAGTTGTGATGTTACACTATCATTATTATGACTATTGCTATGTGAGGCTGGTGCAAACTAATGGGCTTTTGATTGATTTTGTCTCAACTGTGCAAATATCAACAGTGATGGAATGATGGCATATGATGCTAAATGCTTTGTCATCTTAAAAACTTAAAGTTCTCGATCATTCAGATATTCTATGATTCGTGTTTTATTTTCTGAAAGTCCTGTCAAACTATTTGTAGTCATCTTCTCCTCTCCCTAGCTTACATGGTTGGAACTTGGAAGGGATTGGCTGGTTAGATCAGATGTATATCTGGAGTGCACCCACCCATCTAGACAAGATTCTAAAAGAATATTGTTCAGCACAATTTCAAGATTTCTATGTATGTATATATTTTTAGTCCAATTATGAGCACAATCTTTACAAGATTTCTACAACAGAAAACCAAGCAAGGCCATTAGTTTTCTGCCACCTAGCCATGGAGAAAGATTAAGAACAGCAATATAATGCAATTGTATTGCAAAATGAGGGAGGATTCTGAATGAATAAAGGAAGTAAGGTGAAATTCAGTCTGCTGTAGAAAGTAACCACACCTAATTTATCCTAACGCATCACCTACTGCATGGTAGGATACTTTTCTATAATTAGTTTTGAGGAGATGCCAAACATCATTGACCCCATGTTACCCACTTGTTAACTGCCCTTTAAATCTGGTTCAATGTGGATGGACGTTAAGCTTAAACCTTCCGTCTTATGGATGCATCTGTCATCAATAATTATTTGTTTATTCGTTGATCATTTCAATCTGTTGATATTCAAGTAGCTTGTATTGCGTGCTGATCATTTCAATCTGTTGATACTCAAGTAGCTTGTATTGCGTGCAGTGTATACAATAGCTTTAAAATTCAATATAAATCATCAGGTTTCCCATTAACCATATCATGATTTTTAAATCCTGGTTTATGAGACATAACTACAACTATGCTATCATTTATAACAAAAAAATACAATATCTAGTCA
AATCTGGTTCGGTTTTTCCAGTCCCTTCCCAAATATTTCCTTTTAAATTCATGTCTGTCTATTAGTATTATTTTTTCGTATTTGAAAAGGTTTGATTCTTGGAAGACTTGTTTGTGTGATGAACTAAGGAATTGATTCTACCGTAAATGGTTCTCAATTGTACACAATTTACATGATGGATATGAGTAGTGTCTTCGAGTTTCCTGGCATTGATTTGATGCAATGAGGTACTTGAGGCGTTTTGTGAATATTTGAGCCGGTATTTATCAGTTATATGATCACGTTTGGTTTGTAACAGAACAGTGGTTGTCTTGGATGCTACACGAAGCCGAAACTAAGAACTAAGCTGAATGAACCATCAAAAGGTCTACCAATTCAATGTCATGGACTAAAGAAACCCAGCATATCAGAGGATTTCTGGACTACTAGTACATTTGATGTGGACAACAGTGCAGGTCAATCACAAGGAAGTATGTCATCAATGAGTACAATCAACCAGATGCACGACCACCACGGTAGCTCAGGCAATGTACACAACCCTTCAGAGTTTATAAATCACGGTGATTTTCTCCCCACTCTGAACTTTCAGTTACTCAATTGGCCATGTGCAAGATAGTTAAGGAGAAGAAATTATAAGCAAAACTAACTGCACTTTGTGACTTGGACTGTAAATGGATGATTTAACGGATTAAGTTTTCTGTGTGCATTTCGCCAATTTGTTCATATCCTATACCTTTTTACGACCTGGTAGTCTCTTTAAATTTTTCTATGCATTATCCCATTGTTGCTTTGAAGAATGACATTGGCATAAGTAACTATATTTGGATCTTGGTTGTAGGCCTTCTTCTGTGGAATCAAACTAGGCAACGCTGGTCGGGGAATAAACAGTCTCAGAACCGAGCACCGCAGTTTCAAGAACCCAAGTTAGAGTAAGCTGCTTTCTAATCTTTCAGATCAATGAAATCGAATATCAGTACCTCATAGTGATAATCCTACTCGGGTTTCCTCTTGCAGTTGGAATGCAACCTATGAAAGTTTATTAGGGAGCAACAAGCCATTCCGTCAACCTATTTCACTAGGCGTAAGTTCTCGACTCATCTCAACTCTTTCATTTTCTGGAAATCGTACTTGAATCTAGTCATAGATTGATGTTCATTCCAACGTGCAGGAAATGGTAGATTTTCTTGTGGATGTATGGGAACAAGAAGGGTTGTATGATTGATTATTTTAAAGATTGCAAGGCTTTATTTGGAAGAAGTTGAAACGTAGCCGCAGTGGCTTTTTCTCTGTCGAGGGCTTCCTTTGAACAATGGAAATCTGCCTTGTATTTGCTGAAACGCATTCAAATGAACATTTTCCCCGCTGCCTATTTTTGGTGCTTGGTTTGTTTTTTTTTGTTGACGTCATCCTCAATGCCTTGTTTGGTGTTGCTGTAGAGTTTTATATCTTAAACTTGTTGTAGAAATATCAGCTATGATGTGATGGAAAATACTACTTTTGCAAATCGACTTTTTTTTTTCCCCTCCAAGTGACAACAAACATATCTAAAGCATAAAGTTTCACGAGAACAAAACACAGATAAACGAGTTATTCTTTACGACTATGAATCTAAGATTTTGTTTTTTGTTGTATGATTTGAATCGAATAACTAATTGGGGTGATTGCACGCATGCACATTGACGCAGTTAGAGATGACATGTTGAACTTGATCGGTAGTTATATATAAGAATGGATAAGTGTTAGATACTCTGTGTGAAATTGGTCAAATATCCTAATATCTAAAGTTAAGTGGATTAAATAGAAAAAAAAAGGTTGCGAATTTGAGATGATATTTATCACATTGTACTAATTGGTGGGGGGTTAAATTTTGAAAATTTGATGAAGTTAAAAAGGCAATTTGAGAAGTGATAGCCACAGCCACAATCTTTCCCAAATAAAGAAATCTACGAATGATTGTGACTTGTCGTTGCCAGTAGAAATGATTTGGGCCCACATCCACC
ACACAGGTTTTGACACGTGTATCTTGATTGGATTTCTTGCTTATGATTAGTACAGTAAAAAATATTTCGGACCCCAACAAAACGAGTGCAGCCAAGGTCTTTTCCATCATCATTTCACGGATTACAATGATCAAAATCCTAGCCGTCCAATTAGATGACCAAACTTTGCTAATCTGAAATCATCACCGTTAATTCCTATCCATTGTGGAAAAACATATTAACTTCTGTTTTATTTCATCTCATTTTAATACTAAACTTTTATATTTAGTTTGTTTCAACCATTACATCTTTAATGGTTAAAATTTGCATAGAAAATTTATTTTGGCTATTTGCCATATAATTTTCATTGCATTTTTTTTAATTTGATATCAGGGTAAATATGACTTGAGATGTTTGTTTAATCCACACAGACCCAAACCAATTCTCGTACAAAATTTTAAAATAAAAGTTAAATGAAAAAATCCTCAAAGAGGCACATGTACAAAATATTATTTTTAATTTACTTTGTCACTTTTGTTGTTGTTGTGCTTTGGCGACAAATAAGAGATTTTATCGTTGTTGACATTTAGCAATGAATAAGTAATTTCGTCCTTATTAACCTTTAGTGACAGAATTAACTATTTTGTCACTATTAACCTTGTCTTTGTTAACCTATAGCCACATAATAAGTAATTTTTTCACTGTTAGCTCAAGTAAATGAAATAAGCAATTTTATTGTGTTGAACTTTAGTGACGGAATAAGTATTTCGTCACTTGCATGTAGCAACGATTTGTTATTTTGTCTCCTTTGATTCTTAGTGTCACACTTTCAACATATTGTGGCAACCATTTTTTAGGTCGCATGAACTTTTTAGCAACATTTTTTTTCCTTTTATGAAAACCTTTTTTCTTGTAATAAAAGTAATAATATTTCTGTAAAAGTAAAGAGAAAAGCATAAAGCCCTTGTATATTGATCTTTCTCTTACTTGCTTGTACTGTGTTTTGTTTGGTATGGATGTATAAAGCTGATCTATTTAGCCAATCTCTCTCTCTCTCTCGAGATCATGAAGTAATAAAGCATGCAGCAGTTCCAATAACAAAGAGAAACCCACCATTTTGCATCTCTATTCTATTCTATTCCCACAGAAGCTCATGCATGCTAAACAGTGAACAAGGTTAAACACATTTTGAGAATTGAAAAGGAAAGAAAGAGAGTGATTGGGCAAAAATCTTCAAAGATTTAGGTTCTTTGGAAATATGAAGAGAGATGATGCTGTTTCCAAGGGGAGGAGTGAAGTTCCACTGAAACTGCTCTCATCCTTCTTTCACTGAAAAAGGTTGTTTAACCCTGCCACACAACAATTTATTACCCTCTCAATCCTCTCACCTACTTTTTCACTTCCTAATATCTTCTAATTAGATCTCATATTAATGGATGAAATATGTACTATGAGATCGAATATTAGCTTATCGGGGATAATTCAAATCATAACTTCATTTTTTTGAACAGGTATTAAAAAAAAAAGGAAAAAGATTGAGAAGCCCATTTAACTTGTGAAGTTGGGTTTATGAAAAGGGATAGGAGAAAGCAGATTTTCCGAAAAGGAAAAGGTGAAAATGGTGAAGAGGAGCAGAAATGGAACTCATTTTGGCACTGTTATGTAGCAGTGTAGCACTCACTAGCTGGGGTGGTTCAGTGCTGTATGGGAGAAGCTTAGCAACTGACAGATATCTACTTGAACTGCTCTTATGTTTTCGGTACCCCATTTCTCTCTTCCTTTTCCCTTTTTCACCACCTCTTCATATCGCTTTATTTCACTTTTACATATATGTAACTCGTGGATGGGATTAAAAAAAATTGCATACCAAAGGCTTGACGTTATAATAGGTGATGAGTGGGCATTCAATATGAAATATGTGTGTAAGTTGAATGGTCGACTATATATATTAGCTTGAATGTATTAGAGTTTGAATTGTATAAAAAATAGGAAGATATATGTGGGGTTTTGGGTTAGGAGAA
AATGGGTAGGTTTTGGCTTTAAGGGGATGATTTACTTTCGTCTCAAGCTGCTGCTCGATTTTCTCCACCTCTTTTTTTGTATAAAATCCGGACATCAAACCTAGCAAAATTGAACATCTTTCTCTTTCTATTTCTTCATGAAAACTAACTTCCTAAGCCCAGTCTTTTTTTTTCAATCACAAATCACATTCCAATTCCTCTTTCTAATTTCAAAAATTCGAACCCCCAAAACCCAACCCTTTTCTTAGCAACCCATTTCCCTATACTGAAAAGCCCATTCATCCCTCTATTTTTTGTCCAATTAAGACCCTAATATAGTTAATCAATGTAAACTTGACAAATATTTATACAGAACATCGAACTCGTCACCTTCCTTACTACATTCAAAGAAAATGTGTCACAAAAAAACGTTAACACATGATAATGACACATGCATTTTGGAAGATTTAATTAACTATATGAATTAAATAAATATTTAAAAAATATAGATATCCAAATTGAAACAATACAGCTATGAAGACTAAAATAATAATTTGAAGGAAGAAAATGGTATTTCTTAAAAATCAACCCATACAATTTTCCATGATCACAAGAGACAGTAGTCAGATTACAAAACCCATGTTAAACATCTATCATTTTGTAATGGGTATTTTCCCTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGAGAGTCCCACACCACAGCAGTCCCAGCTCGAGAAACTAGGTATTGTCACCTTATAATGGACAAGTCATGCAGCCCAAAAGAAAAAAAAAATCAAATGGTCACTTCAGTCATTCACTAACAACTTAAAGTCAGAGTACCACTTTCCCAAAACCAACGCAACTAAATCAATCCCGCTTTTTCCTTCCTCAGACACCTTCTCTTTCATTTCTCCCTCTCCCACTTCCAACACCACAAACACAGAAATCTTATTTATGTCTCTCTCACAGGAAACAGCTTTCAACCTTGTCGAAAAACACAAGCACTACCCTCCACTCCCACCATTAATATCATTCAACCCCTCCCTCTCTCTCTCTCTCTCTCTTCTCCAACACTCAGATCTTAACCCTTCTCTCTCTCGACTGGATTGACTGTGAAACTTACACATCACATGAAGCTTCAAAGTCCCACCACCAATCACTGATGGCTACGTCTACATCTCATCCACGTTCCCTCAAACTTGCTATCACTGTAGTAGCTTTCATCTTGATCTATCTCACATTTCTTCCTCCTAAATCAGGTAATAAGAGAATCAAACGACACCCACTTGGCTTGTTCTTGTGTTTTCTCATTTGATAACTCTGTACTTTCTTTTTCGGGTTCAGTTGGGTCGCTTTTGTCCACGAGCGGCGGAGGAGGGAAAGAATCGAAGTTTTCTGAGAAGGGTGAAAGTGGGAGTTATGTGGGAGAGAAGAAAATGGTGCTGGGTTCGAGGCCGCCGGGGTGCGAGAACAAGTGCATGAGTGCCGGCCGTGCATTGCCGCGCTGGTGGTGCCGGTGCACCGGATGAAGGGTTTCGAAGCAGCATTGTCTCTTCTCTCGCGAGGAAGACGATAGCTATTATCTGCTGTCGTGGAAATGCAGATGTGGGAACAAGATCTATCAACCCTAGCAACATCATGTGTTTCATGTCGTGGAAAGTAACCAAAAATGTATTAATTAAAATATATGTATATGTATTTTTCATTTTGATCTCTCTTCTCTCTCTCTCTGTTTGATGGAATATATCTAAGTATTGAGAATGGAAGGTAAAGTTCCTAATTCTAGTAATAAACAAATGTAATGTAGAATGAATTCTACAACCATTTTGAGTTTATGAATGATGTTAAATGGCCTACATTTTCAATTCTTTTTAAATAAATAAATTAGCAAGGTTGTGTCATTTGGGTGAAACAACTGCATGAACTAATTGCGTGTCGGAGGTAGTAACGATAAAGTGTGCGATCAAAAGTCACCATGTTTGACATTAATGTGGTGTTTTTGT
TAGACATTCTTAGAAATTAATAAGTTAATTTGACTCTTGAGTTTGATAGAATTTTGATAGCGTAGAATCATTTTTTTTTTCCATTATCATATCTGTAATATATATAAAAGAGAGAAAATGTGCTGAAATTATCGTCAATTGTCGGCTATTGATAGCAATATGTATTGATTGCGACTTCCCTTGGGCGGTCATCAGTAACTTGTTAGGAAGTCGTAGATGAGTTGGTGGGCTTGTAAAGGCTACTTGGAGGCACATTAATCACAATCTAATATCAAGTCTGACTGCTTCCACGAGACTAATTTAGTGACTTTAGAAAAAAATATTTTTTAAATAAAATTATTTTTTTTATAAGTACTTTTAAAAATTATTAATTTAAGTGTGTTTAGTATAGTTTTTTAAGCGATTTTAGATGACTATTTTTTTTTTAAATCTACAGTGTTTGGTAATACATAAAAAATATTTTTAAAAATCAGATTGATTTAAAATCTATTTTTGAATAAATACTTGAGAGGTGATTAATCAAAAAATAACTTTTTAAATAGTTGACCAAAAATCATTTTTAAGAAAAAAAAAATATTTGTAACTAGTAGAATCACTTTTTAATGTAGTGTAACCAAACAAATTTGTTATTAAAAGATTAAAAAAAATGTAATCAAATGCATAATTAAAAATATTTTTATTGAAAAATATTTATAAAAAAAAGTACTTAATGCTAAACTCACCCTACCCTAAATGATATATGATGTGATATTTTTGACCTCATATACTCTATTGGGTATGTTTCGAGTATTGCCAGCTAGGCTAAGCAGCTTCTTCTCTCACCCAAATGGTTTCGTCTCAAGTTTAAGATATTTTAGCATATCTCATTTTTATTTGTCAATTATTTAAAAAAAAAATTCTTAAAACTAGTCATTTCTTCAAGTTACCATTTTTTAGATATTTTTAATTTAGTATTTTTATATATTAAAACTGCGATATTTTGATCAATTTGGGCCGAGTTGACCCATATAGGCCACAAAGATGACTCACTGGATTTGGGCCGGCCCATATGGCTGAAGTCATCATGTATCTAAAGGAACAATGGAGAAATGGAATCTTCGTCGATTTCTAACTCCTCCGGATGGCACGCTCTCTCTCTCTACTGGAAGCTCGCTCTACCATTCTACGCACTCCGTTACCGCCTCTTCAATCGCCGTTCTCTCTGCTCAAGCTTCCATGCTCCGCCGGAAATCCGCTTCTCATCGCATCATCGCCGCCGTTCATCTTTCTTCATCTCCTCCAATCCATATGCTAAATCCGGCGTTTGCAGCGCAGCAGTACGGCCACCACCGGACTCCGATCCGCCTCCTGAGAAGGATCCGATTCGTCTAAAAGGTTCATATTGTTTTCTGATGTTTTCTGTGCTTAATTCGTTCAATGGATTGAATTTCTGTTGATAAGGCCTCTAGGATAAGTTTTAAGTTTAACGCGTAGAGCTTGGATACGTGTTTGCTTCTTTCTATTTCGTTTTTGCACATAAAACCGAAATGAAATTATTAGCTCCTGAATTTAGCTCAGATTGTGCTAGCGGTTCTTATCACAGCTACGATTAATAACGATTTTATTTATCCATTCATGTGTTTTCTTCTGAATTATCCGAGTATTCGTGTTTAGCTGCAAGTAGAAAGCCTTTTTCCCCTTCCATGACAGTTATGCCAGTTCGTAGGATGCAGCTTCCGAATGTAAGATGAGGGAAGCGTGTCTTCTTTTGTAGCCCCTATCATATAGCTGCTGGAGATTTTGGTCTTTAATGGTGTTACGTCACAATTACACCCAAAAGTTTAAGTTGATAGGGCATGATAACTTTAATTTTTATCAATTCATTTCTTTACCACCCCCTTCCACGTATTGGCCAAATATTAATTTTGGTTCACGGTGCCCAACATGTGGTCTATTAACATGAAATGGGATGGAAATGACGTTATTGATTTGAGAGTTATCTGGGTTTCCTACCTTAAATT
GGTGTACTCTTTCCAAGCTTTTTTATTATTATTATTATTTTTTTCTTGTAACTATAGCTCCTCCTTGATTATGGACGATTGGAGTGCTTTTTAGCAGTTTTCTTTGTTTCTGGGGGATTACTTCTAAGCCTATCTTTGTCTTTTCTACTTATCTCTTATTGATGAATATGGTTTCTTATTTAATAAAAAAGTAACGGAGAGACAGAGAGAGTGAAGGAAGAGCCTTTCATAAGTTGTTATATTTTTTAGAGAGAATCATTTGAATATGTTGATCAGAGTAGATAAAGGGATTATTAGACTGAAACCGTTGGTAATACTCAAGAATCCTACCAAGTTACTATATATTTTTAATTTTTGAACTTAATAATATTTTTCTTACTTGATAATTCATGTTGTAACTTGTTGCATGTAATTTGAATTGCTTTGGCACCATTGTAAGATTCTCTAATGTGAATGAACTATGTTCCCTTACTTCTCTTAATCTCATTATTTGAGCAGGTTTTCCCGCAGTTTTCTCAAAATTTCAGGACAGAGTACAGATTTTCTTTGCTGTGTTATTCTGGATGTCCCTCTTCTTCTGGACTTCTGCATTGGATGGAAAAAATAGACCTAATAAGGGCTCTCGATTTAGACGATAGTTTTGTAAACAAGCATCCTTTTGTAAAGATTTGTAGGTTAGTAGTGTTAGACTACTTGATTATATTTAAGATATATTGAAGCCTTTGATAAGGTAGAGAAAACTTGCCCCATTAGATGATGATACCAAAGATGGAAGGCTATATACCCTGTATATTGTGTAGCTGCAAAGGATACTGAGTGGTTAAGAAACAAATGAACGTAAACCGTCATGGAAAGTCGTATTCTGTTTTATATTGTTATCAAGTTTCATGAATTCCCTGAATCCCTCCCTCCTGATCGATCCATCTTGAATTCGAAAACTGGGGGATTAGGAAAGTATCAGAGTTGAAATCGTTGAAGATATTTTGTTTGAACTTGTGTTGCCAGAACCATTTAGCTTTACCCTCTCTATGACTAATCTTCTTCACTGTTATGTTACAATTTCTTATAGATTTGTCTTGTAGATTTTGGTGCCTTGTCTTTCCTTGATTTGAATTTGCATGTTGACTAAGAATGCCTGAGATGGATATTTGCTCGCTTCCATGGGCTAAAATGAGGCAGAGAAGCCTCCTATTGAGCCTAAGGATGCTCCTGTCCACGAGATTTGGAGTTAAGATATTTGGAACCACTGTGTTGCTTTCAACGTCAGAATCATGGGAGGTGTGATTGGTTTGTTCCTGCAATGATTAGAGGTGACCGCCACAATAAGGTTTGACCCTAGAAGTGACTGCGACAATATAGTTGATATCAGAAGAATGTGGTTGACATTAGAAGTGACTGCCACAATGTGATTGACACGTTGCTAATGGTTGGATGTATGACTAAAAAATATGTTTTTGTCATTCTCTCACTTATACGTTTTCAATGGTTGGTTGACACGTTGCTAATGGTTGGATTTTAAAAGTAATAGTCATATTGTTTTGTGCTAGTTTTGTGCGAAGCTGAAAGTGAGAGTTAGTTGGGGGAGGGGGAGGGGTGTGAAGATTTTTTTAACAGTTTTAATTTTTGATTTGTTTTTGACTTGCCACTAATCAAAAGGGAGGAAATATGAAATTTATAAACGGAAAGCACAAAAAGCAATGTAAAATTACTCAGAAATATTAGCTTTTAAACATGAATTAATGAAGTGTCATCATGTGAATTATGTAATCTATATTTAATTTGTGAATTTATGATTTGTAAGGATTAACATGTTATTAGTATAAAATCTGAGTTTTGTAAGTGGATTTATGTGGTTAAAGAAATTAATAAAATTGTAATATGAGATAAGGAAATTTACGTATGAACTATAGTATTACCATACAAGTCTATTTTGTGACTCCTGTTAAGTGCTTGTAAATAAATTCAATCTGTAATTTATCTCCAAATATAAGAATGTATTT
CACATTGAGTTCACAATAATACTTGTAATATTATTATTATTATTAAAACTACAGTCGGAAAATGAGGCTAGAAGTATTCCTCGTAGGCTTAACTCTTTGACCAAAATGTATAACAATTTTTTTTAATATCAGAATTGACATATAAAATAAGAATTTGAATATGCGATCAATAAGAAATGTTACTAATGTTTTGATCAGTTAAATTATGTTAGAGTCAATAACTAATTATTCCATTCTAATGTTATATTGCTTTAATTTTTTTTTTTGGATAAGAATGTTATATCTCAATCACATTCACACAATTATTTAATTTAATTGTTCGACGTTACTTTAAATACTAGTAATTTAATTAAATATTAATTGAAATAATTTTGTGGGAAAATTTTCTGACAATAAGAAGGTTAAAATTGAGAAGCTTATCTCTATTGTTCTTAGGTTCGAATATCTACTCTAAAAAAAAAAATAATCCCAACCTTCGTATCTGTAACATAAAATAAAAAAAAAAAAAAAAAACTTCCGTAGTAGTGTAATATTTAAAAAAAAACGACAACTTTGTATTTCCGCTACTCCCAAAATTTATCTAGAGCCGTCCACGTGTACAACGAAACTGGAATGACGGGTCCCATTTCTGTGTCTTGTGTCGACGTGTCGTACTGTTGGAGGTGAAAGCTTGGTGTCGTCATACGTTGGCCCTGTGGTGGGACGACAAGAAACCAAAATATCTATTTCACTGCCTCTGCTTCTTTTTCGTCTTCTTCCTTTGATGCATATATCTACGATACCAAATGTTTCTAATAATGAATGTGTCCACTCCGCATTAAAATTGATCAACTTTCTTACCTTCAGTCAGATTTTTTCTTTAAAAAAAATTTTAAATAATAATTTTGATTCTTAAAATCTTTATTATTCTATATGTACCACCACATTCATTGTATGGCTAATAGAAGGGATTTTATTCTTAAAAAGAGTAATTAATTAAAATTTACATTAATTTGCCAAACTTATATATTCCCATGTTAAAAAACCCCAAAACAACAACACCTTATTACAATATTTGGATGATGTCAATAGAATTTTTCTTATAGTAAATTTCTCTTATGACTTCAATATGTTTTAGATGACTCGATAATGAAATGCATATGTAGTGACAAAACTCGCTGAAGACAATTATCATGTCTACCGTAAAATGCTGAGAGGATTGCTACTACATGGATACAAAAAGATGAAGAGTTAGATAGTATTCGACTAAAGATTTAATTCATCAAATGCTTGAGTTGGCTTATAAGGAGGAACTGATCTCTTCGACTCTAAAAGCTTCTATTCGTGAGGGTTTGTCTTTTGTCCTTTACTTTTGGAGTTCTTTCGTAAAAGAAAAAAAAAAAAAAAAACTCTCTTTGGAGTCCCTTGTTTAGATTTCATTGACACGTTTGCATAGAAGCCTCAAAAATGATTTCACAGTCATGTTATATGTAATTTTAAGTTGGGATTAAATATATATATATATAATTCAAATATAGAAAATAATTTTTTTAACAAAAAAATAGTATTTGGAGGAGAAAACAAAAATTGATATCAAGCGAGCATCCCTTCCATCGCTTCTATTAAAATCCACTTGCCACTTCGTTTCGAAGAAGGGAGGCGTACCTATAGCCGGCTGTCTCCGCCATTTTTGGTCCCATGAACTTGCATCAATAATTTTAGTCCTTCTCAATTCACACATTCAATTCTCACCCAACTCTCTGATCTGATCGCTCCCTTCTCCTTCAATCCCATCTTCTTTCAACTTCCTGGTCTCTCAGCGCATAAGCATGCTGTTGAGAAGTTCTTCAACCCCAATTCTCAATTCATGGCTTTGCCAAACCAAAGCCTCGCCATCAGAATCGGACCAAATTCACCACCTTCAAAGGACTAAATCCCTCTCGCTGACCACTTCTTTTCATCCTCCATCACCTCCTTCGGATGACTCGGCAAAGAGAGTAACCCAAAACTTGCTAGAATCA
GATACGGCAGATCCCAGAAGGAAGAATCGGGTACCGAAGAGTTGTAAAATTCGAACGAAGGTGAAGTCCAAGGAGAATGGAGTAGCAGTGCGAGATCAAGAGCTCAGACCCACTTCAGATTCGTCTTCGTCTTCCATTCATGGACTATTTTCAGGCTCTGGGTTGGGTGCGAGAGTGCCAAATGATGATGAGGCGCATGATGTGAGGCGGCATGAATGTGTACTGCAGACGCTGGTGGTCGGCGGTGGAATGGGAAGCGACGGTGGCCGGGTGTGTGGCGCCGGAGGTGGCGGTAGAGGTTCCGATGGTGGAGGCGGAGGGGATAATGATAGGTCGGGGTTTTCTGAGAATAACAATCACCATGGAAGCAATAGCACCGATGCTTATTACCAGAAGATGATTGAAGCGAACCCTGAAAATGCCCTTCTGCTTGGAATTATGCCAAGTTCCTGATAGAGGTACTATAAATTATCTTAAAACTCCTTGGCAAATACTTCAATGGCTTCCCTTCTGATCATGCTTTGCCATTAGGTCATGGAGATTTTGCCAAAGCAGAAGAGTTTTGTGGAAGAGCAATTCTGGCCGACCCAAATGATGCAAATGTTCTATCACTTTATGCTGATATAATATGGCATACACAAAGGATGCTCAACGAGCCGAGTCCTATTTCGATCAAGCTGTTAAAAGTTCCCCAGATGATTGGTAAGAATATGAATATATATTAGTTAACTTCTCAGCTCACTAAAAAACAGATTGAAATTTCATTTTGAGGGGCTGCTTCTCCTTTCTATGGTTTGATTGCTAGAAAGAGATTGTATTGACTGGTGATTCCTACGTTTACTTTTGATTCTTTTTGATAATGGGTCAAGTTGATTTCGCCATGAACACCATAACACGAACAAATTCTGAGTTCAGTATCAATATCAATTCAATTCCTAAACATTCATTGGTTGTGAAGTGAATATTCCAATTTGATTGTGATTTGTGTCTTTTCTTCAAGAGTTAAGAAAGTTATCTCTGTTAGTATGTGGATCATGGATCATATGGTATAGTTTATCATCAATTTCAGTCAAATACATTTCATTATTGGAATCTTTCAGTGAGCCTACAAGTTCTTGATCGAGTCAAGTGTTGAAATGGAGAGAATATTTCATAGGTCTTTGCTACTCTGATTGGGATATTGTTTTAAAGTTGATTTCATTAGTGCAAAATCTTCTTTGTCTACAGCTATCTCCTAGCCTCATATGCACGATTTCTCTGGGATACCGATGTCGATGAGGAAGACGATAAGGCAGACCAGTATGAAACAGAGGAAAGCCGCCCATCTCCGCCCGGTTTCTCACATGGAGTTCCCCACCACTCTCCTCTGGCTGCAGCTTCCTAA

mRNA sequence

ATGGAATTCTCAGAGACCGATCGCAGAGGATGGGATATTCATACTGAAGAAGAAGATCGAAAGGAGGAAACTTTGGAACTACTCTTTGATCTTCTTTCCTGGAATCGGGTTTTCCTTGGATTTTGTTTTCCCCTGCAAATCCTTATCAGTTTCGTAGCTCAGGCTCCTGCGTCGTGGGTCTTCTGCGTGTTTTGCATTAATGGGAACAGTGGTTGTCTTGGATGCTACACGAAGCCGAAACTAAGAACTAAGCTGAATGAACCATCAAAAGGTCTACCAATTCAATGTCATGGACTAAAGAAACCCAGCATATCAGAGGATTTCTGGACTACTAGTACATTTGATGTGGACAACAGTGCAGGTCAATCACAAGGAAGTATGTCATCAATGAGTACAATCAACCAGATGCACGACCACCACGGTAGCTCAGGCAATGTACACAACCCTTCAGAGTTTATAAATCACGGCCTTCTTCTGTGGAATCAAACTAGGCAACGCTGGTCGGGGAATAAACAGTCTCAGAACCGAGCACCGCAGTTTCAAGAACCCAAGTTAGATTGGAATGCAACCTATGAAAGTTTATTAGGGAGCAACAAGCCATTCCGTCAACCTATTTCACTAGGCATCTTAACCCTTCTCTCTCTCGACTGGATTGACTGTGAAACTTACACATCACATGAAGCTTCAAAGTCCCACCACCAATCACTGATGGCTACGTCTACATCTCATCCACGTTCCCTCAAACTTGCTATCACTGTAGTAGCTTTCATCTTGATCTATCTCACATTTCTTCCTCCTAAATCAGTTGGGTCGCTTTTGTCCACGAGCGGCGGAGGAGGGAAAGAATCGAAGTTTTCTGAGAAGGGTGAAAGTGGGAGTTATGTGGGAGAGAAGAAAATGGTGCTGGGTTCGAGGCCGCCGGGGTGCGAGAACAAGTGCATGAGTGCCGGCCGTGCATTGCCGCGCTGGAACAATGGAGAAATGGAATCTTCGTCGATTTCTAACTCCTCCGGATGGCACGCTCTCTCTCTCTACTGGAAGCTCGCTCTACCATTCTACGCACTCCGTTACCGCCTCTTCAATCGCCGTTCTCTCTGCTCAAGCTTCCATGCTCCGCCGGAAATCCGCTTCTCATCGCATCATCGCCGCCGTTCATCTTTCTTCATCTCCTCCAATCCATATGCTAAATCCGGCGTTTGCAGCGCAGCAGTACGGCCACCACCGGACTCCGATCCGCCTCCTGAGAAGGATCCGATTCGTCTAAAAGCTGCAAGTAGAAAGCCTTTTTCCCCTTCCATGACAGTTATGCCAGTTCGTAGGATGCAGCTTCCGAATAATCATGGGAGGTGTGATTGGTTTGTTCCTGCAATGATTAGAGGTGACCGCCACAATAAGCGCATAAGCATGCTGTTGAGAAGTTCTTCAACCCCAATTCTCAATTCATGGCTTTGCCAAACCAAAGCCTCGCCATCAGAATCGGACCAAATTCACCACCTTCAAAGGACTAAATCCCTCTCGCTGACCACTTCTTTTCATCCTCCATCACCTCCTTCGGATGACTCGGCAAAGAGAGTAACCCAAAACTTGCTAGAATCAGATACGGCAGATCCCAGAAGGAAGAATCGGGTACCGAAGAGTTGTAAAATTCGAACGAAGGTGAAGTCCAAGGAGAATGGAGTAGCAGTGCGAGATCAAGAGCTCAGACCCACTTCAGATTCGTCTTCGTCTTCCATTCATGGACTATTTTCAGGCTCTGGGTTGGGTGCGAGAGTGCCAAATGATGATGAGGCGCATGATGTGAGGCGGCATGAATGTGTACTGCAGACGCTGGTGGTCGGCGGTGGAATGGGAAGCGACGGTGGCCGGGTGTGTGGCGCCGGAGGTGGCGGTAGAGGTTCCGATGGTGGAGGCGGAGGGGATAATGATAGGTCGGGGTTTTCTGAGAATAACAATCACCATGGAAGCAATAGCACCGATGCTTATTACCAGAAGATGATTGA
AGCGAACCCTGAAAATGCCCTTCTGCTTGGAATTATGCCAAGTCATGGAGATTTTGCCAAAGCAGAAGAGTTTTGTGGAAGAGCAATTCTGGCCGACCCAAATGATGCAAATGTTCTATCACTTTATGCTGATATAATATGGCATACACAAAGGATGCTCAACGAGCCGAGTCCTATTTCGATCAAGCTGTTAAAAGTTCCCCAGATGATTGTTGATTTCGCCATGAACACCATAACACGAACAAATTCTGAGTTCAGTATCAATATCAATTCAATTCCTAAACATTCATTGGTTGTGAACTATCTCCTAGCCTCATATGCACGATTTCTCTGGGATACCGATGTCGATGAGGAAGACGATAAGGCAGACCAGTATGAAACAGAGGAAAGCCGCCCATCTCCGCCCGGTTTCTCACATGGAGTTCCCCACCACTCTCCTCTGGCTGCAGCTTCCTAA

Coding sequence (CDS)

ATGGAATTCTCAGAGACCGATCGCAGAGGATGGGATATTCATACTGAAGAAGAAGATCGAAAGGAGGAAACTTTGGAACTACTCTTTGATCTTCTTTCCTGGAATCGGGTTTTCCTTGGATTTTGTTTTCCCCTGCAAATCCTTATCAGTTTCGTAGCTCAGGCTCCTGCGTCGTGGGTCTTCTGCGTGTTTTGCATTAATGGGAACAGTGGTTGTCTTGGATGCTACACGAAGCCGAAACTAAGAACTAAGCTGAATGAACCATCAAAAGGTCTACCAATTCAATGTCATGGACTAAAGAAACCCAGCATATCAGAGGATTTCTGGACTACTAGTACATTTGATGTGGACAACAGTGCAGGTCAATCACAAGGAAGTATGTCATCAATGAGTACAATCAACCAGATGCACGACCACCACGGTAGCTCAGGCAATGTACACAACCCTTCAGAGTTTATAAATCACGGCCTTCTTCTGTGGAATCAAACTAGGCAACGCTGGTCGGGGAATAAACAGTCTCAGAACCGAGCACCGCAGTTTCAAGAACCCAAGTTAGATTGGAATGCAACCTATGAAAGTTTATTAGGGAGCAACAAGCCATTCCGTCAACCTATTTCACTAGGCATCTTAACCCTTCTCTCTCTCGACTGGATTGACTGTGAAACTTACACATCACATGAAGCTTCAAAGTCCCACCACCAATCACTGATGGCTACGTCTACATCTCATCCACGTTCCCTCAAACTTGCTATCACTGTAGTAGCTTTCATCTTGATCTATCTCACATTTCTTCCTCCTAAATCAGTTGGGTCGCTTTTGTCCACGAGCGGCGGAGGAGGGAAAGAATCGAAGTTTTCTGAGAAGGGTGAAAGTGGGAGTTATGTGGGAGAGAAGAAAATGGTGCTGGGTTCGAGGCCGCCGGGGTGCGAGAACAAGTGCATGAGTGCCGGCCGTGCATTGCCGCGCTGGAACAATGGAGAAATGGAATCTTCGTCGATTTCTAACTCCTCCGGATGGCACGCTCTCTCTCTCTACTGGAAGCTCGCTCTACCATTCTACGCACTCCGTTACCGCCTCTTCAATCGCCGTTCTCTCTGCTCAAGCTTCCATGCTCCGCCGGAAATCCGCTTCTCATCGCATCATCGCCGCCGTTCATCTTTCTTCATCTCCTCCAATCCATATGCTAAATCCGGCGTTTGCAGCGCAGCAGTACGGCCACCACCGGACTCCGATCCGCCTCCTGAGAAGGATCCGATTCGTCTAAAAGCTGCAAGTAGAAAGCCTTTTTCCCCTTCCATGACAGTTATGCCAGTTCGTAGGATGCAGCTTCCGAATAATCATGGGAGGTGTGATTGGTTTGTTCCTGCAATGATTAGAGGTGACCGCCACAATAAGCGCATAAGCATGCTGTTGAGAAGTTCTTCAACCCCAATTCTCAATTCATGGCTTTGCCAAACCAAAGCCTCGCCATCAGAATCGGACCAAATTCACCACCTTCAAAGGACTAAATCCCTCTCGCTGACCACTTCTTTTCATCCTCCATCACCTCCTTCGGATGACTCGGCAAAGAGAGTAACCCAAAACTTGCTAGAATCAGATACGGCAGATCCCAGAAGGAAGAATCGGGTACCGAAGAGTTGTAAAATTCGAACGAAGGTGAAGTCCAAGGAGAATGGAGTAGCAGTGCGAGATCAAGAGCTCAGACCCACTTCAGATTCGTCTTCGTCTTCCATTCATGGACTATTTTCAGGCTCTGGGTTGGGTGCGAGAGTGCCAAATGATGATGAGGCGCATGATGTGAGGCGGCATGAATGTGTACTGCAGACGCTGGTGGTCGGCGGTGGAATGGGAAGCGACGGTGGCCGGGTGTGTGGCGCCGGAGGTGGCGGTAGAGGTTCCGATGGTGGAGGCGGAGGGGATAATGATAGGTCGGGGTTTTCTGAGAATAACAATCACCATGGAAGCAATAGCACCGATGCTTATTACCAGAAGATGATTGA
AGCGAACCCTGAAAATGCCCTTCTGCTTGGAATTATGCCAAGTCATGGAGATTTTGCCAAAGCAGAAGAGTTTTGTGGAAGAGCAATTCTGGCCGACCCAAATGATGCAAATGTTCTATCACTTTATGCTGATATAATATGGCATACACAAAGGATGCTCAACGAGCCGAGTCCTATTTCGATCAAGCTGTTAAAAGTTCCCCAGATGATTGTTGATTTCGCCATGAACACCATAACACGAACAAATTCTGAGTTCAGTATCAATATCAATTCAATTCCTAAACATTCATTGGTTGTGAACTATCTCCTAGCCTCATATGCACGATTTCTCTGGGATACCGATGTCGATGAGGAAGACGATAAGGCAGACCAGTATGAAACAGAGGAAAGCCGCCCATCTCCGCCCGGTTTCTCACATGGAGTTCCCCACCACTCTCCTCTGGCTGCAGCTTCCTAA

Protein sequence

MEFSETDRRGWDIHTEEEDRKEETLELLFDLLSWNRVFLGFCFPLQILISFVAQAPASWVFCVFCINGNSGCLGCYTKPKLRTKLNEPSKGLPIQCHGLKKPSISEDFWTTSTFDVDNSAGQSQGSMSSMSTINQMHDHHGSSGNVHNPSEFINHGLLLWNQTRQRWSGNKQSQNRAPQFQEPKLDWNATYESLLGSNKPFRQPISLGILTLLSLDWIDCETYTSHEASKSHHQSLMATSTSHPRSLKLAITVVAFILIYLTFLPPKSVGSLLSTSGGGGKESKFSEKGESGSYVGEKKMVLGSRPPGCENKCMSAGRALPRWNNGEMESSSISNSSGWHALSLYWKLALPFYALRYRLFNRRSLCSSFHAPPEIRFSSHHRRRSSFFISSNPYAKSGVCSAAVRPPPDSDPPPEKDPIRLKAASRKPFSPSMTVMPVRRMQLPNNHGRCDWFVPAMIRGDRHNKRISMLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPPSDDSAKRVTQNLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGLGARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFSENNNHHGSNSTDAYYQKMIEANPENALLLGIMPSHGDFAKAEEFCGRAILADPNDANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKHSLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS
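
The protein sequence follows from the coding sequence by reading it three bases at a time. The sketch below illustrates this with the standard genetic code (NCBI translation table 1); the page does not state which table was used, so that choice is an assumption, and the function name `translate` is hypothetical. Only short fragments copied from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="*"):
    """Translate a DNA coding sequence codon by codon.

    Translation ends at the first stop codon; a trailing partial codon
    is ignored. Unknown codons (e.g. containing N) become 'X'.
    """
    protein = []
    cds = cds.upper()
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == stop_symbol:
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the CDS listed on this page.
print(translate("ATGGAATTCTCAGAGACCGATCGC"))
# Last 15 bases of the CDS, ending in the stop codon TAA.
print(translate("GCTGCAGCTTCCTAA"))
```

The first fragment yields MEFSETDR and the second AAAS, which agree with the first eight and last four residues of the protein sequence.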
Homology
BLAST of Sgr015599 vs. NCBI nr
Match: XP_022143958.1 (uncharacterized protein LOC111013740 [Momordica charantia])

HSP 1 Score: 427.6 bits (1098), Expect = 2.5e-115
Identity = 248/355 (69.86%), Positives = 261/355 (73.52%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPPSDDSAKRVTQN 528
           MLLR+SSTPILNSWL QTK+ PSESDQ H LQR KSLSL  SFHPP PPSDDSAK+ TQN
Sbjct: 1   MLLRTSSTPILNSWLHQTKSPPSESDQCHQLQRAKSLSLAASFHPPPPPSDDSAKKSTQN 60

Query: 529 LLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGLG 588
           LL+SD ADPRRKNRVPK      KVKSKENG AVRD+EL P SDSSSSSIH LFS SGLG
Sbjct: 61  LLQSDAADPRRKNRVPK------KVKSKENGAAVRDRELSPASDSSSSSIHRLFSSSGLG 120

Query: 589 ARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFSE 648
             VPN  EA D RR ECVLQT+VVGGGMGSDGGRVCG GGGGRGSDGGGGGDNDRSGF  
Sbjct: 121 VNVPN-YEARDERRDECVLQTMVVGGGMGSDGGRVCGGGGGGRGSDGGGGGDNDRSGF-- 180

Query: 649 NNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPNDA 708
            NNHHGS STDAYYQ+MIEANP NALLLG     +   HGDFAKAEEFCGRAILADPNDA
Sbjct: 181 -NNHHGSKSTDAYYQRMIEANPNNALLLGNYAKFLKEVHGDFAKAEEFCGRAILADPNDA 240

Query: 709 NVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKHS 768
           NVLSLYAD+IW TQ+                            R  + F   I + P   
Sbjct: 241 NVLSLYADLIWCTQK-------------------------DAQRAETYFDQAIKTSPDDC 300

Query: 769 LVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
               YLLASYARFLWDTDVDEEDDKA     EES  SPPGF+HG PHHSPLAAAS
Sbjct: 301 ----YLLASYARFLWDTDVDEEDDKA-----EESCSSPPGFAHGTPHHSPLAAAS 311
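
The percentages in each HSP summary are the match counts divided by the alignment length, which includes gap columns. The sketch below recomputes them from a summary line of the form shown above. The function names are hypothetical, and the regular expression only targets the `label = matches/length` pattern that appears on this page.

```python
import re


def hsp_fractions(summary_line):
    """Extract 'label = matches/length' pairs from an HSP summary line."""
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", summary_line)
    return {label: (int(num), int(den)) for label, num, den in pairs}


def percent(matches, aligned_length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


line = "Identity = 248/355 (69.86%), Positives = 261/355 (73.52%), Query Frame = 0"
for label, (matches, length) in hsp_fractions(line).items():
    print(label, percent(matches, length))
```

For the first HSP this reproduces 69.86 for identity and 73.52 for positives, matching the reported values.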

BLAST of Sgr015599 vs. NCBI nr
Match: KAA0058754.1 (aspartate, glycine, lysine and serine-rich protein-like [Cucumis melo var. makuwa] >TYK10548.1 aspartate, glycine, lysine and serine-rich protein-like [Cucumis melo var. makuwa])

HSP 1 Score: 423.3 bits (1087), Expect = 4.7e-114
Identity = 240/356 (67.42%), Positives = 268/356 (75.28%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPP-SDDSAKRVTQ 528
           MLLR+SSTPILNSWL Q+K+SPSESDQIHHLQRTKS+SLT+SFH P P  S++S  RVTQ
Sbjct: 1   MLLRTSSTPILNSWLHQSKSSPSESDQIHHLQRTKSISLTSSFHLPPPSFSNESPNRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLLESD+ DPR+K  +PKS ++++KVKSKENGV+VRDQ L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  NLLESDSRDPRKKIPIPKSSEVQSKVKSKENGVSVRDQHLKPTSDSSSSSIHGVFLNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  C+LQTLVVGGGMG+DGGRVC  GG GRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DEVCDEKRDGCILQTLVVGGGMGNDGGRVC--GGSGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IWHTQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWHTQR-------------------------DAQRAETYFDQAVKSAPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDT+VDEEDD  DQYETEES  S PGFSHG PHHSPLAA S
Sbjct: 301 C----YLLASYARFLWDTEVDEEDDTEDQYETEESHRSHPGFSHGAPHHSPLAATS 321

BLAST of Sgr015599 vs. NCBI nr
Match: KAG7014137.1 (hypothetical protein SDJN02_24310 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 422.5 bits (1085), Expect = 8.0e-114
Identity = 241/356 (67.70%), Positives = 265/356 (74.44%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPP-SDDSAKRVTQ 528
           MLLRSSSTPI NSWL QTK+SPSESDQ+H LQRTKSL    SFHPP  P   +SA RVTQ
Sbjct: 1   MLLRSSSTPIFNSWLHQTKSSPSESDQVHQLQRTKSL----SFHPPPVPLLKESANRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLL+SD+ADPR+K  VP+SCK R+KVKS+ENGV VRDQEL+P  DSSSSSIHG+FS SGL
Sbjct: 61  NLLDSDSADPRKKIPVPRSCKARSKVKSRENGVPVRDQELKPALDSSSSSIHGVFSNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  CVLQTLVVGGGMGSDGGRVC   GGGRGSDGGGGGDN RSGF 
Sbjct: 121 GLKSPN-DEVRDEKRDACVLQTLVVGGGMGSDGGRVC---GGGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           AN+LSLYAD+IWHTQ+                            R  S F   + S P  
Sbjct: 241 ANILSLYADLIWHTQK-------------------------DAQRAESYFDQAVKSSPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYA+FLW+ DVDE+D+ ADQYETEES PSPPGF+HG PHHSPLAAAS
Sbjct: 301 C----YLLASYAQFLWNADVDEDDNPADQYETEESCPSPPGFAHGAPHHSPLAAAS 316

BLAST of Sgr015599 vs. NCBI nr
Match: XP_008461118.1 (PREDICTED: uncharacterized protein LOC103499799 [Cucumis melo])

HSP 1 Score: 421.4 bits (1082), Expect = 1.8e-113
Identity = 239/356 (67.13%), Positives = 267/356 (75.00%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPP-SDDSAKRVTQ 528
           MLLR+SSTPILNSWL Q+K+SPSESDQIHHLQRTKS+SLT+SFH P P  S++S  RVTQ
Sbjct: 1   MLLRTSSTPILNSWLHQSKSSPSESDQIHHLQRTKSISLTSSFHLPPPSFSNESPNRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLLESD+ DPR+K  +PKS ++++KVK KENGV+VRDQ L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  NLLESDSRDPRKKIPIPKSSEVQSKVKPKENGVSVRDQHLKPTSDSSSSSIHGVFLNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  C+LQTLVVGGGMG+DGGRVC  GG GRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DEVCDEKRDGCILQTLVVGGGMGNDGGRVC--GGSGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IWHTQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWHTQR-------------------------DAQRAETYFDQAVKSAPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDT+VDEEDD  DQYETEES  S PGFSHG PHHSPLAA S
Sbjct: 301 C----YLLASYARFLWDTEVDEEDDTEDQYETEESHRSHPGFSHGAPHHSPLAATS 321

BLAST of Sgr015599 vs. NCBI nr
Match: XP_038898606.1 (uncharacterized protein LOC120086170 [Benincasa hispida] >XP_038898607.1 uncharacterized protein LOC120086170 [Benincasa hispida])

HSP 1 Score: 421.0 bits (1081), Expect = 2.3e-113
Identity = 239/356 (67.13%), Positives = 267/356 (75.00%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSP-PSDDSAKRVTQ 528
           MLLR+SSTPILNSWL Q+K+SPSESDQIH LQR KS+SL +SFHPP P  S++S KRV Q
Sbjct: 1   MLLRTSSTPILNSWLHQSKSSPSESDQIHQLQRAKSISLISSFHPPPPSSSNESPKRVIQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           +LLESD+ADPRRK  +PK CK+R+KVKS+ENGV+VRDQ+L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  SLLESDSADPRRKIPLPKCCKVRSKVKSRENGVSVRDQDLKPTSDSSSSSIHGVFFNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN D+  D +R  CVLQTLVVGGGMG+DGGRVC   GGGRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DQVCDEKRDACVLQTLVVGGGMGNDGGRVC---GGGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IW TQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWRTQR-------------------------DAQRAEAYFDQAVKSSPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDTDVDEEDD  DQYETEES  S  G+SHG PHHSPLAAAS
Sbjct: 301 C----YLLASYARFLWDTDVDEEDDTVDQYETEESHMSQSGYSHGAPHHSPLAAAS 320

BLAST of Sgr015599 vs. ExPASy TrEMBL
Match: A0A6J1CS01 (uncharacterized protein LOC111013740 OS=Momordica charantia OX=3673 GN=LOC111013740 PE=4 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 1.2e-115
Identity = 248/355 (69.86%), Positives = 261/355 (73.52%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPPSDDSAKRVTQN 528
           MLLR+SSTPILNSWL QTK+ PSESDQ H LQR KSLSL  SFHPP PPSDDSAK+ TQN
Sbjct: 1   MLLRTSSTPILNSWLHQTKSPPSESDQCHQLQRAKSLSLAASFHPPPPPSDDSAKKSTQN 60

Query: 529 LLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGLG 588
           LL+SD ADPRRKNRVPK      KVKSKENG AVRD+EL P SDSSSSSIH LFS SGLG
Sbjct: 61  LLQSDAADPRRKNRVPK------KVKSKENGAAVRDRELSPASDSSSSSIHRLFSSSGLG 120

Query: 589 ARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFSE 648
             VPN  EA D RR ECVLQT+VVGGGMGSDGGRVCG GGGGRGSDGGGGGDNDRSGF  
Sbjct: 121 VNVPN-YEARDERRDECVLQTMVVGGGMGSDGGRVCGGGGGGRGSDGGGGGDNDRSGF-- 180

Query: 649 NNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPNDA 708
            NNHHGS STDAYYQ+MIEANP NALLLG     +   HGDFAKAEEFCGRAILADPNDA
Sbjct: 181 -NNHHGSKSTDAYYQRMIEANPNNALLLGNYAKFLKEVHGDFAKAEEFCGRAILADPNDA 240

Query: 709 NVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKHS 768
           NVLSLYAD+IW TQ+                            R  + F   I + P   
Sbjct: 241 NVLSLYADLIWCTQK-------------------------DAQRAETYFDQAIKTSPDDC 300

Query: 769 LVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
               YLLASYARFLWDTDVDEEDDKA     EES  SPPGF+HG PHHSPLAAAS
Sbjct: 301 ----YLLASYARFLWDTDVDEEDDKA-----EESCSSPPGFAHGTPHHSPLAAAS 311

BLAST of Sgr015599 vs. ExPASy TrEMBL
Match: A0A5A7UU96 (Aspartate, glycine, lysine and serine-rich protein-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G002010 PE=4 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 2.3e-114
Identity = 240/356 (67.42%), Positives = 268/356 (75.28%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPP-SDDSAKRVTQ 528
           MLLR+SSTPILNSWL Q+K+SPSESDQIHHLQRTKS+SLT+SFH P P  S++S  RVTQ
Sbjct: 1   MLLRTSSTPILNSWLHQSKSSPSESDQIHHLQRTKSISLTSSFHLPPPSFSNESPNRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLLESD+ DPR+K  +PKS ++++KVKSKENGV+VRDQ L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  NLLESDSRDPRKKIPIPKSSEVQSKVKSKENGVSVRDQHLKPTSDSSSSSIHGVFLNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  C+LQTLVVGGGMG+DGGRVC  GG GRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DEVCDEKRDGCILQTLVVGGGMGNDGGRVC--GGSGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IWHTQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWHTQR-------------------------DAQRAETYFDQAVKSAPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDT+VDEEDD  DQYETEES  S PGFSHG PHHSPLAA S
Sbjct: 301 C----YLLASYARFLWDTEVDEEDDTEDQYETEESHRSHPGFSHGAPHHSPLAATS 321

BLAST of Sgr015599 vs. ExPASy TrEMBL
Match: A0A1S3CF71 (uncharacterized protein LOC103499799 OS=Cucumis melo OX=3656 GN=LOC103499799 PE=4 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 8.7e-114
Identity = 239/356 (67.13%), Positives = 267/356 (75.00%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPP-SDDSAKRVTQ 528
           MLLR+SSTPILNSWL Q+K+SPSESDQIHHLQRTKS+SLT+SFH P P  S++S  RVTQ
Sbjct: 1   MLLRTSSTPILNSWLHQSKSSPSESDQIHHLQRTKSISLTSSFHLPPPSFSNESPNRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLLESD+ DPR+K  +PKS ++++KVK KENGV+VRDQ L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  NLLESDSRDPRKKIPIPKSSEVQSKVKPKENGVSVRDQHLKPTSDSSSSSIHGVFLNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  C+LQTLVVGGGMG+DGGRVC  GG GRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DEVCDEKRDGCILQTLVVGGGMGNDGGRVC--GGSGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IWHTQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWHTQR-------------------------DAQRAETYFDQAVKSAPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDT+VDEEDD  DQYETEES  S PGFSHG PHHSPLAA S
Sbjct: 301 C----YLLASYARFLWDTEVDEEDDTEDQYETEESHRSHPGFSHGAPHHSPLAATS 321

BLAST of Sgr015599 vs. ExPASy TrEMBL
Match: A0A0A0K6Z4 (TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G432550 PE=4 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 1.5e-110
Identity = 236/356 (66.29%), Positives = 262/356 (73.60%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFH-PPSPPSDDSAKRVTQ 528
           MLLR++STPILNSWL Q K+SPSES+QIHHLQRTKS+SL +SFH PP   S +S+ RVTQ
Sbjct: 1   MLLRTNSTPILNSWLHQFKSSPSESNQIHHLQRTKSISLISSFHLPPPSVSTESSNRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLLESD+ DPR+K  + KS K+  KVKS+ENGV+VRDQ L+PTSDSSSSSIHG+F  SGL
Sbjct: 61  NLLESDSTDPRKKIPITKSSKV--KVKSRENGVSVRDQHLKPTSDSSSSSIHGVFLNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  C+LQTLVVGGGMG+DGGRVC  GG GRGSDGGGGGDN RSGF 
Sbjct: 121 GLKFPN-DEVCDEKRDACILQTLVVGGGMGNDGGRVC--GGSGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           A+VLSLYAD+IWHTQR                            R  + F   + S P  
Sbjct: 241 ASVLSLYADLIWHTQR-------------------------DARRAETYFDQAVKSAPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYARFLWDTDVD EDD  DQYETEES P  PGFSHG PHHSPLAA S
Sbjct: 301 C----YLLASYARFLWDTDVDNEDDTEDQYETEESHPLHPGFSHGAPHHSPLAATS 319

BLAST of Sgr015599 vs. ExPASy TrEMBL
Match: A0A6J1JVS8 (uncharacterized protein LOC111487984 OS=Cucurbita maxima OX=3661 GN=LOC111487984 PE=4 SV=1)

HSP 1 Score: 399.4 bits (1025), Expect = 3.5e-107
Identity = 234/356 (65.73%), Positives = 257/356 (72.19%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWLCQTKASPSESDQIHHLQRTKSLSLTTSFH-PPSPPSDDSAKRVTQ 528
           MLLRSSSTPI NSWL QTK+SPSESDQIH LQRTKSL    SFH PP+P   +SA RVTQ
Sbjct: 1   MLLRSSSTPIFNSWLHQTKSSPSESDQIHQLQRTKSL----SFHPPPAPLLKESANRVTQ 60

Query: 529 NLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGSGL 588
           NLL+SD+ADPR+K  VP+SCK R+KVKS+ENGV           DSSSSSIHG+FS SGL
Sbjct: 61  NLLDSDSADPRKKIPVPRSCKARSKVKSRENGV----------PDSSSSSIHGVFSNSGL 120

Query: 589 GARVPNDDEAHDVRRHECVLQTLVVGGGMGSDGGRVCGAGGGGRGSDGGGGGDNDRSGFS 648
           G + PN DE  D +R  CVLQTLVVGGGMGSDGGRVC   GGGRGSDGGGGGDN RSGF 
Sbjct: 121 GLKSPN-DEVRDEKRDACVLQTLVVGGGMGSDGGRVC---GGGRGSDGGGGGDNGRSGF- 180

Query: 649 ENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILADPND 708
             NNHHGSNSTDAYYQKMIEANP NALLLG     +   HGDF+KAEEFCGRAILADPND
Sbjct: 181 --NNHHGSNSTDAYYQKMIEANPNNALLLGNYAKFLKEVHGDFSKAEEFCGRAILADPND 240

Query: 709 ANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININSIPKH 768
           AN+LSLYAD+IWHTQ+                            R  + F   + S P  
Sbjct: 241 ANILSLYADLIWHTQK-------------------------DAQRAETYFDQAVKSSPDD 300

Query: 769 SLVVNYLLASYARFLWDTDVDEEDDKADQYETEESRPSPPGFSHGVPHHSPLAAAS 819
                YLLASYA+FLW+ DVDE+D+ ADQYETEES PSPPGF HG PHHSPLAAAS
Sbjct: 301 C----YLLASYAQFLWNADVDEDDNTADQYETEESCPSPPGFGHGAPHHSPLAAAS 306

BLAST of Sgr015599 vs. TAIR 10
Match: AT5G25360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 153.3 bits (386), Expect = 8.5e-37
Identity = 82/150 (54.67%), Positives = 104/150 (69.33%), Query Frame = 0

Query: 58  SWVFCVFCINGNSGCLGCYTKPKLRTKLNEPSKGLPIQCHGLKKPSISEDFWTTSTFDVD 117
           SW++ +F   G  GC GC  KP L   ++EPSKGL IQ   +KKPS+SEDFW+TST ++D
Sbjct: 9   SWIYQLFGCMG--GCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMD 68

Query: 118 NSAGQSQGSMSSMSTINQMHDHHGSSGNVHNPSEFINHGLLLWNQTRQRWSGNKQSQNRA 177
           NS  QSQ SMSS+S  N    +  +S +  NP+EF+NHGL LWNQTRQ+W  N  SQ +A
Sbjct: 69  NSTLQSQRSMSSISFTN----NTSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKKA 128

Query: 178 PQFQEPKLDWNATYESLLGSNKPFRQPISL 208
            + +EP + WNATYESLLG NK F +PI L
Sbjct: 129 -KVREPTISWNATYESLLGMNKRFSRPIPL 151

BLAST of Sgr015599 vs. TAIR 10
Match: AT5G25360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1). )

HSP 1 Score: 153.3 bits (386), Expect = 8.5e-37
Identity = 82/150 (54.67%), Positives = 104/150 (69.33%), Query Frame = 0

Query: 58  SWVFCVFCINGNSGCLGCYTKPKLRTKLNEPSKGLPIQCHGLKKPSISEDFWTTSTFDVD 117
           SW++ +F   G  GC GC  KP L   ++EPSKGL IQ   +KKPS+SEDFW+TST ++D
Sbjct: 9   SWIYQLFGCMG--GCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMD 68

Query: 118 NSAGQSQGSMSSMSTINQMHDHHGSSGNVHNPSEFINHGLLLWNQTRQRWSGNKQSQNRA 177
           NS  QSQ SMSS+S  N    +  +S +  NP+EF+NHGL LWNQTRQ+W  N  SQ +A
Sbjct: 69  NSTLQSQRSMSSISFTN----NTSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKKA 128

Query: 178 PQFQEPKLDWNATYESLLGSNKPFRQPISL 208
            + +EP + WNATYESLLG NK F +PI L
Sbjct: 129 -KVREPTISWNATYESLLGMNKRFSRPIPL 151

BLAST of Sgr015599 vs. TAIR 10
Match: AT3G15770.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 125.9 bits (315), Expect = 1.4e-28
Identity = 68/145 (46.90%), Positives = 95/145 (65.52%), Query Frame = 0

Query: 70  SGCLGCYTKPKLRTKLNEPSKG-----LPIQCHGLKKPSI--SEDFWTTSTFDVDNSAGQ 129
           S CL C+ K K +T ++ P  G     +      L+KPS+  SEDFWT +T D++++A  
Sbjct: 3   SSCLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA-- 62

Query: 130 SQGSMSSMSTINQMHDHHGSSGNVHNPSEFINHGLLLWNQTRQRWSGNKQSQNRAPQFQE 189
             GS+SS+ST N   D  G   + + P+EF+NHGL+LWNQTRQ+W G+K+S++R    +E
Sbjct: 63  -HGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVGRE 122

Query: 190 PKLDWNATYESLLGSNKPFRQPISL 208
           P L+ N TYESLLGSNK F +PI L
Sbjct: 123 PILNENVTYESLLGSNKRFPRPIPL 144

BLAST of Sgr015599 vs. TAIR 10
Match: AT1G80130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 124.8 bits (312), Expect = 3.2e-28
Identity = 118/364 (32.42%), Positives = 165/364 (45.33%), Query Frame = 0

Query: 469 MLLRSSSTPILNSWL---CQTKASPSESDQIHHLQRTKSLSLTTSFHPPSPPSDDSAKRV 528
           MLLRS+S PILNSWL   C  ++SP    Q+   +R+ SLSL +S          + +++
Sbjct: 1   MLLRSTSAPILNSWLPQHCSRESSPEPESQL--WRRSTSLSLFSS----KSIDGHTGEQL 60

Query: 529 TQNLLESDTADPRRKNRVPKSCKIRTKVKSKENGVAVRDQELRPTSDSSSSSIHGLFSGS 588
            Q L ++      +      S K  T  + + + +       +    SS   +  LFS S
Sbjct: 61  HQALSDNKEIIILKSKSNEHSYKTPTSSRQRRSSLDETRYTKKTLDRSSPFLVERLFSSS 120

Query: 589 GLGARVPNDDEAHDVRRHECVLQTLVV--GGGMGSDGGRVCGAGGGGRGSDGGGGGDNDR 648
           G G +  ++D           L+TLV   GGGMG  GG +C  GGG     GG G D  R
Sbjct: 121 GQGDKASSNDR----------LETLVSGGGGGMGGSGGNICNGGGG----VGGSGVDGGR 180

Query: 649 SGFSENNNHHGSNSTDAYYQKMIEANPENALLLG-----IMPSHGDFAKAEEFCGRAILA 708
           S           ++TD YY++MI++NP N+LL G     +    GD  KAEE+C RAIL 
Sbjct: 181 S----------EDATDTYYREMIDSNPGNSLLTGNYAKFLKEVKGDMKKAEEYCERAILG 240

Query: 709 DPNDANVLSLYADIIWHTQRMLNEPSPISIKLLKVPQMIVDFAMNTITRTNSEFSININS 768
           + ND NVLSLYAD+I H  +                            R +S +   +  
Sbjct: 241 NTNDGNVLSLYADLILHNHQ-------------------------DRQRAHSYYKQAVKM 300

Query: 769 IPKHSLVVNYLLASYARFLWDTDVDEEDDKADQYE----TEESRPSPPGFSHGVPHHSPL 819
            P+      Y+ ASYARFLWD D DEED+   + E     E     P       P H+ +
Sbjct: 301 SPEDC----YVQASYARFLWDVDEDEEDEALGEEEENLSDETGHVPPTTMFRDFPQHTSI 305

BLAST of Sgr015599 vs. TAIR 10
Match: AT3G15770.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 124.4 bits (311), Expect = 4.2e-28
Identity = 67/143 (46.85%), Positives = 94/143 (65.73%), Query Frame = 0

Query: 72  CLGCYTKPKLRTKLNEPSKG-----LPIQCHGLKKPSI--SEDFWTTSTFDVDNSAGQSQ 131
           CL C+ K K +T ++ P  G     +      L+KPS+  SEDFWT +T D++++A    
Sbjct: 4   CLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA---H 63

Query: 132 GSMSSMSTINQMHDHHGSSGNVHNPSEFINHGLLLWNQTRQRWSGNKQSQNRAPQFQEPK 191
           GS+SS+ST N   D  G   + + P+EF+NHGL+LWNQTRQ+W G+K+S++R    +EP 
Sbjct: 64  GSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVGREPI 123

Query: 192 LDWNATYESLLGSNKPFRQPISL 208
           L+ N TYESLLGSNK F +PI L
Sbjct: 124 LNENVTYESLLGSNKRFPRPIPL 143

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022143958.1  2.5e-115  69.86     uncharacterized protein LOC111013740 [Momordica charantia]
KAA0058754.1    4.7e-114  67.42     aspartate, glycine, lysine and serine-rich protein-like [Cucumis melo var. makuw...
KAG7014137.1    8.0e-114  67.70     hypothetical protein SDJN02_24310 [Cucurbita argyrosperma subsp. argyrosperma]
XP_008461118.1  1.8e-113  67.13     PREDICTED: uncharacterized protein LOC103499799 [Cucumis melo]
XP_038898606.1  2.3e-113  67.13     uncharacterized protein LOC120086170 [Benincasa hispida] >XP_038898607.1 unchara...

Match Name      E-value   Identity  Description
A0A6J1CS01      1.2e-115  69.86     uncharacterized protein LOC111013740 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A5A7UU96      2.3e-114  67.42     Aspartate, glycine, lysine and serine-rich protein-like OS=Cucumis melo var. mak...
A0A1S3CF71      8.7e-114  67.13     uncharacterized protein LOC103499799 OS=Cucumis melo OX=3656 GN=LOC103499799 PE=...
A0A0A0K6Z4      1.5e-110  66.29     TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G432550 ...
A0A6J1JVS8      3.5e-107  65.73     uncharacterized protein LOC111487984 OS=Cucurbita maxima OX=3661 GN=LOC111487984...

Match Name      E-value   Identity  Description
AT5G25360.1     8.5e-37   54.67     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G25360.2     8.5e-37   54.67     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G15770.1     1.4e-28   46.90     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G80130.1     3.2e-28   32.42     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15770.2     4.2e-28   46.85     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source       Source Term      Source Description                                 Alignment
IPR025124  Domain of unknown function DUF4050  PFAM         PF13259          DUF4050                                            coord: 112..183; e-value: 6.3E-10; score: 39.7
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 567..582
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 785..818
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 624..657
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 493..594
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 399..425
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 643..657
None       No IPR available                    MOBIDB_LITE  mobidb-lite      disorder_prediction                                coord: 500..532
None       No IPR available                    PANTHER      PTHR26312        TETRATRICOPEPTIDE REPEAT PROTEIN 5                 coord: 469..718; coord: 742..812
None       No IPR available                    PANTHER      PTHR26312:SF179  REPEAT-LIKE SUPERFAMILY PROTEIN, PUTATIVE-RELATED  coord: 469..718; coord: 742..812

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr015599.1   Sgr015599.1  mRNA