Sgr014827 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr014827
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: PLATZ transcription factor family protein
Location: tig00001291: 464409 .. 481190 (-)
RNA-Seq Expression: Sgr014827
Synteny: Sgr014827
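The Location field can be turned into a gene span length with one subtraction. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF3; the record itself does not state this. The function name `span_length` is made up for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: the database reports coordinates the way GFF3 does.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field; the strand (-) does not change the length.
gene_length = span_length(464409, 481190)
print(gene_length)  # 16782
```

Under that assumption the gene, introns included, spans 16782 bp on contig tig00001291.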
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCGATCTGGTTCAAGAATCAGAGGACCCCAAAAAGCCAGAAGAGCTCTACGATCCCTTCGATTCCCGCTTCTCCATGCTCTGCCTCAACGGCTCCATTCCCTCGCCCACCGTCCACGGCGACGGGCATGCCCACCACCCTCGACCCTGTGCTTCATGCGGCTGCCGCGAACCTCGCTCCGCCGCCGCAACCCCTTCCATTCCCATGAAACGACGCTCCCCGACGCCCTCCTACGCCCTTCAAGACCCCACCGACTCCTCACACCCAGAGCCCAAGCCCAAGCGGCTCTTTCTTGATCAAGAAGACAGTCTCCCCGGCTTCTCCAAGATTGATCTTCCTATTCCTTTTGCCGCCGCCCACAGCCAGGGCCGGCCCCCTTCGGTCTCTCCTCTCCGCCGCTCTGTTTCCGATCCTACCGAAGCCCGTAATGGAGCCACTTTCACTCCCTCGTTTGATTATCTACTTGATTCCTCTCCTCCGGTGAATACCGATCAATCACCCGACAATGTGAAGGCGAGTGTACCTCCGTCGTCGGCCAAACGGTTGTGTGCGAACTCGGCGCTGCCGCCGCTGCCTCTCCGTCGTACGGTTTCAGACCCAAATCCCTCCCCTGACAAGATTTCCCGATCACCAAATAAAGTAGGGAGAGTCAGTGATTTGATTAAGGAAGACAGTCCCGATTCAAAGGTTTGTTATTTGTATTCTCTAAACCATCGAAAACAGTATATTCTTTGGGCTCGTTATTGACTGATTATGGGGGGATCTTGGGGACTTTTTCAGAGGCTGAGAAGGATCAAGGATCGTTTGAAGGAGATGAACCAATGGTGGAACGAAGTCATGAGCGAACAAGAACATGAAGAAGAGGACGAAGACGCCAAAAAGGTATGATTTTTATAATAAAAAAAACAATGTTCATCTTCATCTTCATCTTCTTGGTTTGCTGAATTTGGGATTTTGATATGATAATGGTGCAGAATAATGTGTGTTGTAAGGAAGAAGAAGATGAAGAAACTGTGGGAGTGGAGAGGGTGGGAGATTCATTGGAGCTTCATCTGAAGTGCCCCTGTGGGAAAGGCTTTGAGATTCTTCTCTGTGGAAGAAGCTGTTTCTACAAGCTGCTGTAGAATTGAGTATTTTCAACTTTTAACATTTTTTTTTAGTACATATTTTGAACTTTTAACTGATACCCAATTCAATTGATCACTTTGCTCTTTCATTTCAAAAATTGAGTTTGGGTTTTGAAAAGCTTCACACAAGACAACAGTTCAAGAAGTCAATGGGTTAAATTTCTAGTGTATTAGCTTTTGTTTTGAAGTTTTGCTAAAACATTATTAATATTTCGTGACTTTTTTGTTTTTTAAAATTCTAACAACCTCGTGCTAGGAAAAAAAAATCAATTTTTACTTGTGTAAGATGAGTCAACTTTTACTTTAAAGTTTCAATTTCATCAAATTGAGCCTTAAACTTAAACTTTTTATCATATGTTTGACTTTTACATAAAGAGTTGTTAAAAAAAACATTTTTTCGTGTATTCAATGAATAATACTAATACTAAATTGATAGAGCTTTAGAGAAAAACAAATACTAGAGTTAGATTTCTGTTAAAATTTGTGAGTTTTGTTCAATGGTTGATTGTTGTACATAGATTTGAACCAGACTTAATTTTGTGCATTTATTAGTTAGAGTTCAATTCTCGTTAAAATTAATAAATTTTGAGAGATTTTTATGTTACTTTTTAGTGACATTAGTAAGTAAGAACATAATAAAAGTTGAACAGAAAGTAAAAATTACAACGCTTATATTTAGATTTAAGGTTGAATCGGATGCAATTGATAGTTAATGATTTAAAAGCTAAATATAGAGGGACATAAAGAAATATGTAAAAGAACTTTTTGTGGGCGAGGATGCATGCAATTTGGTTTGTGAAAATAAGGGACAAACAATAATTTAAATATGTCAATAAATATGCAGAAGGTACAACAGCTCCCATTTTAGCA
TTACGCTTTTCTTACAATATATTTTATAATCATCCTCTTTCCCCAGTCTTTAATATATATTTTCTTCTAAATACTTCAGGGAAAGAAGGATAAATGATGATATTTATCTGTTTTTTTCTTTATGATGGTAAATAGTATATTATATAGCAGTGCAGTGGTGAGAAGTTCATTTTCAAGGGGCATATCCACCGTGGCCACCGCCCTCTCCGCTACCACCACCTCCTCCATATCCACTGCCTCCTTCGTGACCACCGTATGCACCTCCTCCACCATAGCCCGATCCTCCTCCTCCTCCGCCTCCGCCTCCGCTTCCTTCACCATTGCTTCCTCCTCCTGCACCATGTCCTCCACCTGCTCCGCTTCCATAACCAGCACCATGCTCTCCTCCTGGACCGTATCCTGCTCCTCCACCACCACCATTTCCACCGCCTCCACCACCACCATAGCCGCCTCCATGTTCTCCACCATACCCAGAACCACCCCCTTCACCTCCTCCAGTGCCATACCCGGCACGACCAGGACCATAACCAGCTCCACCTCCCCCTCCTCCTCCATTTCCTCCACCACCTCCATCATATCCTGCACCTCCTGGACCATAGCCAGCTCCACCGCCATTTCCACCACCTCCACCATATCCGGCACCTCCATGACCATAACCAGCTCCACCACCACTTCCTCCACCTCCCCCGTATCCACCACCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCTCCTCCTCCATATCCACCACCGCCTCCAGGACCATAACCACCGCCACCTGGACCATAGCTAGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCTCCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCACCTCCAGGACCATAACCTGCTCTTCCACCATTTCCACCACCTCCTCCATATCCAGCTCCTCCACCACCTCCGTATCCGCCACCACCAGCACCATAAGCAGAGCCACCGCCACTTCCGCCGCCTCCTCCACCACCATGGCCACCAGCAAGACCATAACCTTCACCACCTCCACTTCCACCTCCACCTCCACTTCCATATCCTCCAACACCGTATTCTCCTATGGAACCATAGCTGCTACCACCCCCACCTCCATAGCCTCCACCACCACCATGACCACCAAGAGGATCATAGCCACCACCACTACCACCGCCTGCACCATGGCCTACACCATAAGCAAATGCAGGAACATTAACAGGTTCTCCTACACCATAAGTCAAGAGAGCTCTGGCTGCAGAAGAAACCCCAATACCTAATAAGAGAAAGAACACAAGAGAAGCAAGTTTGTGGGAAGCCATTTGTGGATGCTTGAAGTGAAACGGAAGGGAGTGGAGAGGAGCATATATAAGGAGAGGAAGGGGGGCCAAAAGCGCAATTTCTGTGCAGAGAGTGGTTGAGGCACATGCAAAAGAGAGAAAGTGGGTGGCCCAAAATCTAAAGATTGCACACCTTCTCCATAGCCTTATTCACTCCATATTACAAAATCAGTGTGTTTAAGCTGTCCTCCACTCTCTTCCCTCTTCACCCCCATCTAAACTTTCAATGTCATATCCATCTTCCAGGCCACCTCCTTGCAATGGTAACTCAAATGTAGTTTTCAACATCAATTTTCTCTTCCTTCTAAAACACATTCCTCTAAATCCACCTTTTGGTAACTCAAGTGTACAAAATTGCTTCATAACGCTCATGCTCGAGATGATCGGTTAGTGTATGTTTTTGTAACCATTATAAATTTCTTTCTTAATGGTTTAAATGTGCCTTTATAACATTAGTATGTTTATTTTGAAAAACTTATAATCCATTATCTTTGAGTTTTATGCATGAATATTGATAAATTATTTAATAATATGTTTCCTTTAATGCCATAATTTGAATGTTGAAAACCTTAGTTGATTTGCTCCA
TGATTTAGTTGCGATGAACTGTAATGTATTATGGTGTGAATGGGTGGGATGTGTGAGCATATATTGTTCCTTGGTTAGTGGATCTTCTTTTCTTTCATCCAATATTATATATTTTTATTTCCTCTTCTCACATCATACTGACGTCTACTTCCAATATTCTACACTACATATAATTTCGGTAATTTTGTTTTTATTTTTACTTCTTTTGTTTTTCGTTCAAGTAGGAGATCGAACCATCGACCTTTGAGATGATAATAGTTACCTTTATCTACTGAACTATGCTCGGATTGACAGTTTTTATTTTTACTTGACAGTAGGAATGAGAAATCTAATTTTGATGAGTTTTACTTGGATTTATTATGTTTATTTAAATTTACTTTTGATATTTTAAAATTGCTTTTAGAATTGTTTGATGGTTGTAAAGATTAACACTAATATGTTTCTGAAATTTTTACGATTTTTTTAATGATGAACACCGGCCCTCCAATTGTTCTGAAGCTTAATCATTCGTGAAAATGGATTAAAAAGTGAAAAGGTTTGTGTTTTTGAAAACAACTTTGGAACAATTTCTTAAATTTTTTATGTTGGAGAAAAAAGGACGAATGCCGTATAGCTTATGTGAGGATCTACCATGTAATGAAATATTTATTTTTAAATCCTATTTGGTCATTGTATTTTCCATTTTTGTCTATTTTAGTTCCTATATCAAAACATTCATTTTTATTCATGTAGTTTCAAAACATCAATTTTAGTTCTTACACTGTCAACTTTTGTCTGTTTTAGTTTATGTACTTTCAAATAGTTTATTTTGATTTTAAAAGGTGATCATTTTGATATTTTTTTTCATAATCCTAACACAATTTTTATACACAACGAAAGCCTTTTAATACAAAATTTGGCCAATATTTCAAGAACTTTATTAAATATAGTGTTGTGCTAAAACTTGTGATAAAAAAATGAGATAAGAAAAACATAAGGAGCAGAACGGCCATTTAAGGATCACTCTATAGTCCATTTTTCCATGAAGTAAAATTTAAAATAGTTTTTTTTTCCAATATACCTTTATATACAGCTTACATTGCAAAGGAAGAAAAAACCCTACATTTGTAATGTTTATGATTTTTACTTTCCTAGATTTGACATCAGAACAAATTTAACCCTGAATATAAATATATTAGACAAAATTTCTCTAACAATACAAAACTAAATAATCACTGAAGAGTCACTAAATTTAAAATGTGATATAATACTTGGTGTGTGAGAGAGAGTGTATAATTGAAAATATTAATTAAAAAAAAAAAAGAATTAACAGTGTTAAATTGAAATATTAATAATTGGAAATAAAATATTTTTCCAAGTATACAGTAGAGCCATAGACTACTCGTGGATATCTGGGTCCGGCCCAATAGAGACCTAAGCGGACGTGGCTGGCGAATCCTCAATTACCAAACTACCCTCAAACTTCATCGAATTTTCATATCCAACCCCGATGCCCCCAAACACCTGCGATTAATCTTTTCTTACAGAGAACATGAATTGCCGAGGTTCATGGAAGACTCTTCGATCTTTGATCCCTTTTCACATACTCTACCGCCTCGCGTTCCGTATACTCCTCTAATCCCTCTCTGCCTTTGCCATTCCACAGCCCTTCTCCGATGCTCTTTCCGGCATAATAGCACTCGGCTTCGATCGCCACCGCGCCGTCCGCCCGGGCATCGATGGCTACGACTTGAACCGGCAGTCGGTCGTAGTGGCCGACAGTAGCGCCTTCCTACTCGCAGTCATCGATTCGATCAAGTCCACGATCTGAGACGGAGTAGAGCTCGCCGCGGATCTGCCGGCCAAATCCGGGGAGATTGAGGAGATAAGGAATGCCATAAGGCCCCAACACGAGCGGGAAGGAGAATCGCGTCGAGTAGTAACCCACATATACGGCGTCGTTAGGGTCCTGCATTAGGCTGTGATTTGCAAACCCTCTTTTCAGTGTGCCGTAGCCCGTA
GGTGAAGATCCGATGTCGTTTCGTCTCCTCCATGGCAAATGCCTCTAAATAATCCAAGGTTCGTATCCTGATCGTTTTAAAAGTTTTGAGTACTATTTTAGTTTCCGTCTACTTTTCGTTTTTATCCATTTTAATCTATTTATTTTTAAAACATCTATTTTAATCTATTTATTTTTAAATTATTTTGATCTAGTAAAATCTTAATATCAATATTATACACTGTTAAATGAAAACTTTTAAATATCAACATATAATAATATTTTAAAAAATTTATTGAGTGTAGTATTGTTTTAAAATTGTGATTAAAAAAAATTGAGATAAAAAAAAAACAATATAGCTACAGTAGTCACTTTTTCAAAAATTATATAGATCAAAATAGATGTTTTAAAATAATATAACCCAAATAATTATTTGAAAGTAAAGGAATAAAAATAGACGATTTGTAAGTATAAAAATCAAAATAAATCAAAGCTAAAAATACGTGCGTAAAACTATAGGATTTAAACATAATCTTATTTGTTCTAACCATTTGGGCCGAAACTGGCTCGGTTCGATTCTCTTTCAGCCCATATTTTTGGAGCCCAAAATTGTTCTCGGCCCATGAACTGGATCGAAGAGTAATTGCATGGACCTCCATCCAGGCCCACTTTATATGTTGCACGTCATCGACATTTGAGTGTCAAAAGCTTTTGAATAATGCATGGAATTTTAAGCAAATAAAGACGACACTTTATTACAATTTGTCGTCATTGTTGTAGTTGTTAAGGTATTTTAATTAAGAAATGACACAAAATTAAATATGTGGTATTATTATAATTAGGCGATGATGAAATGGAATATTAGAAGTTGTCCTCCAATAAAAAGAAAGGCGATTGTATGATTGCCACAAAAGACATCATGCACTCTTTCTCTTTGTAATAATGATGAAGTACCAATAATTATAAGATTAAAATTAATTTGTATGTAGAATTTGCTTTTTCTTTATAGTCATTATCAAAGCATACACATTTTTTAAACTTCAAAAAGTTGGAAGCAAAAAAAAAAAAAGAAAAAGAAACAAGGACCAAAATATATTGCCATGCTGTTTCTATAGCTTTAACATTTGGCTTCCTTTATTTTATGCCTTTTATGTCTAGCATTCATGAGTATTTGAGTTTGTTTATTCTATCTAATTATGACTCATGCAAATATATCAAATTTCTATTGGTTTTGTATCTATTTAAGTATGGTTGTATTTGTTTTTAGATAAAAGCAATTGGGGGAGGGAGATTTGAACGCAAAATGAGTTCTTTAAACAATTAGTCTTTAGCGGATGAACTATTCCTATAAGATGAGTATTGTCTTAAAGGCAAAGCATAATCAATAACGACACTCAATCACGACAATTTTGATTAGTATAACAAGCATAAAGTTTTGATTTTTATGAAACAGAAGTTTTAAAGTCCAATAATTATAACTAAGAGATGGAATGTAGCTATCCTTATCCTTTGGGAATAAATAAGTACAACTAATTAGACATATCAAAAAAAAAAAAAAGAAAATTAAAAGACCAAAACAACATTATTAAGAATTGGATTGGCCAATGATAATGATATGGCTTTATGTGATTTTTAGTTGTCCAAACAAGGGAAGAAAAAAATCTCTCTCAATCAACTACCATCCACCATAGAAATCTAAGTCCTTCATTAAGAAAAAAAAAAAAGAGAGAAAAGAAAAAAAATAGGCCAATTAAAATGAGAAAGCAGCATATACTTTATTATTTTTGATAATGAAAGTGATTATAGAAAATGGGACAAAGAAAATTGGGTCAAATTGTTTTGTCGCTGGTGGCCACAAAGTCTTGCCACTTACCTTGGCAATTTCTCTGGCCCTCCTCTCTCCAATTCCTTCTTCTTCTTCTTTTATTATTATTATTATTATTATTATTTGTTAAATTATAAATTTGATCCATTAATTTTAAAGATTGTGTCTAATAAGTCTGTAAACTTTCAATTTTAGGT
CTTGTAGGTCCCTAAATTTTAAAAAGTATAATAAATTTCTCAACTTTCAATTTTGTGTCTGATAGATCTCTGACATATTAGAAATTTTTTTAAAATTTACAAATCTATTAGACACAAAATTAAAAGGTTAAAGATCTATTAGACACAGCATTAAAAGTTTAAGAATCTATTAGATACTTTTTAACTTTTAGAGACTTAATAGACATAACTCTGAATGTTTAAGGACTAAACTCGTAATTTAACATTTTTTTTTTAATTAACTAATAGCTCTATAAATTGACACATAATTTAATGTAAAAATTCGAAATAATTGCCATACACTTCAATTGTATATTATTCATGTGGGCATGATTATGATTATAAATTCATAGTACCATCTTTATATCTCCTTTCCTTTTTTGTTATAAATATGCATGTCATGTGAATGATGACGAAGAAAATACTAAGAATAAAATACGCAACAATATATTCCCATTCTATAGTCAGCTGCCTATTATTGAGTATCTAATTCTACTTTTTCAGCATTGTATATGCACCACCTATTTATTGAAAGCATTATTCCTTTATAAAAATAAATTTTCAATTTTTTATTATTTTTTTAAAATGCTTTTTTTGAGTTTTGAGCTTGATTTTGGAAAGTAGTTTTTCAAAAGCTTCATTAGGAATGTTTCTAATAAATGCATAAAAATGTTCATGAATTTTTTTGAGATAAAAATCCTTATGTTCTATAAATAATACTCATTTTAATTATAATAATTAAGAGAATGATCATTATTAATCATTGCATTCTATTAATATTAGAAAAGTAAGTAATTTTTCTAAGAAGCGTTTTTATTATAAACACACAAATAGTGAAATTGATAATAATAATAAAGTTCTTGATTCTTGAACTTGCAGAGTAATGTCTAATAAGTATCAATACTTAAAAAAATGTCTAATAAGTATTTAAACTTTTAAAGTTGTATCTAATAAGTTTTTGTATATTATAAAATTTTTAATAGACTCATGAACTTTTAATTTCATGTATAGTAGGTCTCTATCATTAATTTCGACAATTTGACAGCACATTGCAGGTTGATTGAATTATTTAACGATTATGTGTCGGATTAATTTCGAGCAAGTTTAGACATTTAATAACGTAAGTATTGAACTGAAAAAGTTAATGACAATGATCAATTAAATATAAAATTAAAAGTTTAGAAATTTATTAAAAAAATTTTAAGGTACACTAATCTATTAGATATAATTTTGAAAATTTCTTAAAACCTTCTAAAGTACAAGAGTTTATTAAATACAATCCTGAAAGTTTATATACCGCACTAATAATTAAACGCACCTAATAATAGTAATAATGGGATATAATGATTAATTTTAGGAGATTTCACGTCAGGCAAACCTCATCTCCTCGAGAATCGAGAGTGAAAAAAGGGACGTGGGAAGGGGTTATGTACAGGCGATGCCACTTGGACCTTTGGGCCGGCAGATCGATCGGTTGTGGGACCCAGAGGGCTACGGGAAGGCCCACAGATGGGGCCCAAAGAACTTAAAAGAAGGCCCACACGTGTGAAACACGTACGACAAACGCATTAGTCATGCCTGGTGTGCGCGTGGGTTGCTTGACCGCGTGTTTTAAAAAAACCTCACCGTTGACTCCCATGTAACTAACAAGCGCCCAAAACGCACCGTCACATTCGAATTCCCGCCATTATTTTGCTTCTATTTTCTATTTTCCTTTTTTAATTTCTTAGAAATTATTATAAATTCTTAAAATAAAATTCCTTTTCATGACCAGGTCAAAGTCCTCTCTCTTTAAAAATGTATTTTTTAAAACTGGAAGGCATCGACTCTTCTACTACAATCGGCTGATCAAACAGCAACATTTTTTTTTTCTGATTAAATCATATTTTTTCTTTTTATTATGGATCAATGTTAGGAAATTTTTGCAATTAAATCTCAATAAAAAGTTCATATTTTATTCTTATTAACTATGTAAGTAATTC
ATTTATAACATACTTATTGGCACCTTGATTAAAGTTTAAACGTATATATGCTTCAAGTGGTTAAATGATTTTTTAAAAAATAATTTTTATAATTTAACACAGTTGTGACAATATACCCTTTTTTTATTTCTGAAACAATGACAATATACATTGTCTGATTAATTTAGCGACTGTCTTTTAAAAAATAATAATAATAATAATGGCATTGTTAGTGTATAACTGGCTTGTTTAAAATAAAGACAAACTTTAATTATAGAGACAAAAAGGATATGTTTTAGACTTGCTGTTCATAAATAAGATGCCTAAATTTGTGTGAAAGCTACAAATTAAGAAAACGTGTTTAAAAATATTAGTCATTTTAGGAAAATTTGCTGTTGGACATAGAAATGTTGCAATTATTATATTTTATCCACTTTTGGAGTAAATATTAATTAAATGTTTTTAAGTATTTTTAGAATTGTCATTTGCTATGTAATCAAATACGATAAACAAAATTTTGAAAAGAATTTTTATTTATTAAAAATATTTTTCACTATTGCAAACCTTATATTAAATTCATAGTCTAAATTTTAATAGAGGCAACGAATAAATGTCCCATAATAAATAGTACTCTTTTAAGACACGGATTAATTTGTTATCAACAAGATAATTATTGTTTTTTTAATGAATGATTCTTTTTTTTCTCTTCTTTTTACTTTTCTTTTCTTTTGTGGTTGTGGTATGATTGTTTTACCAAGAATGTTGCCATAATTTAGATTCATATTATTTCTTAATCCACTTTGTCACTCCTTTCAAATCCTGCCTCTTTTGGTTAAAAAAGCAAAAAAAAACTTTTAGAGTGCAACAAATGTGAATTAGATTTTGGTAAAATTAAATTGATAGACAAAATTAAGCGAGATAGAATTGATCACATACTTAATTAATCTACCTTACATAAATATGAAATTGATATATAAAACAACTTATTAACTAGTAAAACAAGATCAGTACAAGAATAAATTATAAGACTTCAATATATTTTTATCGACCAAGCATGCCAAATAAATATAGATATTAGGATACTATATTTACATTAAAAGAAAAGATGTGACTATTCTTTTTTATTTTTCATTTCAAGATGATGTGGTGCCTAAAAACAACCTCACATTACAAAATTTAAACCATTAGAACTGCAATGTCATTAATTAGAGAATAAAGTTAATTATATATGTGAAAGGCAAAAAACTCTGTTTGATAACCAATATCACCTTATCAGATTTTTTTTTTTTCTAAAAATAGGGCAGTGGAAGTTAATAACTAATGTATTAATTATGAATTATTATCTTAATTTAATGGATTAGGTTTCATGTCTAAATCACCTCAATCCCAATGGCCAAGTCATTACTATTTCATGTGAAAAAATCGTCATCAAATTGATTGGTGGTATAAAGTTGGAAATGGTATTTGAAATTGGAGTTTTGTTATATCAAAAGATTCCATTATTGTATAAAATTCAAACAAAGAAAGGGAATCGCCAATTTTTTTAATTAATTAAAGTGTTGTACTAATTCAATGTGACAGCATTAATTATCTGATTAATAATTTTATTAATTGTGTAATTATTCAGTAGTTAATGGCATGGATCCTGTGATTTCATTTTCTTAATTCATTCTGCAGCCCTTCGTTTAAATATATGTTGGTTATGTTCTATATATATTTTTGGCTAAAACTTCAATTGGTCAACTCCGAGATTTAATTATTTAATATTTTTAGTTTTGTGTCCTTTCAATAGCAACGTCTGATTATTATCAAATCAAAATAATAAGTATGATTAATCTTTCATTAAAAATATTGGGCCTCTAGTTTATAGCCAAATATACATTTTTTTTGGGCACTATTTAAATTTTCACTCTTAAACTTAAAGGAGTGATCGAGTCAAATTTATACATCAAAATTTCAATTTCATCAAATTACTTAATTCAAACCTCAAGTTTGATAAGTGATGAAATATTTAACGAAGA
GTTAAGTATCAAGGTTAGATTTCATTAGTATGATCCCAAGAGTCATATGTCAATGAAGTAACTTAACATATCTATAGTTGTTTTAAAATTATCGTAGTAATTTCTATATTGAGTTGTCAGTCAATGATTGTTATACTATAAAAATGTTGAATCGAATAATGAAGAGAATACTATGTCATGAATTTAAAAGGTGGATCAACTTTGTTTTTAAAAAAATACATCTGAACAGCCTCTAAATGAACAAATAATGGAGCATAGTGGCCTAGAATTTGGTGTGTGTTGAGTAGGGTGATGATGGGGAGTAAATTGAATTACTTAAACAAATGGTCCAAAATTGAACTCAAGCTATGCGATGCCCACGTACCACTTTTAGGACATTTTGGAAATATTTTGAGGACCAAACTTTCAATAAGTTGACTCCTAACATGTGTCTGTCATCCTTGAATATGCATGCCCTTGCCCAATTGTTTCATCTGCCAATTCAAATTTGTTCTTTCTCTATCTTCAATCTATAGAGAAAGACTTGTATGCATAAATTTGAAGATGAATAGGAACTCTATTATCTTCTTTTTACTACAACTAGAAAAAAAAAATCTGAAAGCAGCCTCGAGACGAAACGACAAACGAAGGAAAATCGATCTTCTCATGAATCTCTTGAAGTGTGATGTACAATCTTCAATCTGAACCACGTGCATTGTTGATGATTGAGCTAACTTTGCAAGGAGATTCACCAGTTGGGTTCCTTCTCTTTTGTTAAAGTTGGAAAAAAAATACAACATTTATTAAATATTTTTGCTTTTAATTATGCAATTATAAACTCTAGCTTGGATCATCTTCTTAATTTTTATCGTGTAGAATAAAAACCTGTTGGAACTGTATCATCAATTTGTTGATTCTCATTCCATTCTAAAGTTTCACCATTTTATATTAAAAAAATGCACATATGATTTTTTAAATTATTGTTACCTAATCCCTAATGCTTGTGATTGCACAATACCCAATCTCATGTCTTCAAAAGAGAGATCAACCACATATATGTTGTCCAAAGCAAATCCCCATTATTTAGGGAGATAATGTAATTATAAAATTCTTTCTATAAATTATTATTTTTAGGATATAGATTATAAATCGAGTTTTTGTCAATTCAATAAGATTAATACTAAACTTAGTTCAATAAGGTAGTATTGTACTTTTTTCTTTTGAGAATATTATATCATAAATGTAGAGGTGAGGATTAAAACCTATAACCTATTAAAAGAAGCAATGATGTCTTAACTACTTAGTTATGTTTATGTTGGCGTAGTATTGTAATTTTCAATTGTATCTAGTATAAATTTCATATGAATAAATGAGGACTAACAAATAATTTTAGAATTATGACAATGTTAATTAAGTGAATATTGATGTAATTCACAATCAAAGGACAGAACTCATGAAGATTTGGTATAAGCATGTCTCACAACACCCTAACTAGGCTTAGGGATCTACAACCTTCTCCACATATCAAATTGCATTAGGTTGCCCAATTCTTCATGTTTTATAAATTAGTTAATGAGGATTGATGGTTTTAAAAAGTAAAAGATAATGAAATTTTTGGTTGGCTATTTGACCAAAGAGATTTAATATTTATTTCTTTCATATGGAAGAATAAATACAAATATATACTTCCAAAATATGGCTTCCGGAAATAGGGTCATAATATTGAATAAGACCGGAACTTTGAATAAATAAAATATGGCGGAAAATAATATCTTATATTTCTCCCCCCTTCCACACCCAACCAAAATAAATAAATAAATAAAATAAAATTAAAGGGATTGTTTGGTAATCATAATTTTTTATAGTCTTTTTTTTAAAAAATTTTTTAGAAAAACAAGTTTGGTTACAAATAATTTCTTTGTCTTAATAATAAATTTATTAAAAGTATTTTAGAAATCTAACCAAATTTCAAAAACTAAAAAAATTAATGTTTTTGTTTTTTACAGTATTCTTAAAAGATG
AAGACATAAACATGTTTTTATTAACTTTTTTTTTTTTTTTTGAAAAAAGACATTATAAACTCAATCTAAAAAAAACACTTATTTTACATATTTGGTTCATCGAAGTATATTAACGTAAGAACACGCAAAAAAAAAGGGAAAGAAACTAATGGTTTTTGCTAAAAGCATTCTATTAGAGCTAAGTTTAGTAAATAAAATAAAAATATTTAATCACGAAAACAACTAAAATCTTGCAGTTACTATAATATACATTAAAAATATTAAATATGAAAATTGAAGATGGAAAAAATATTGTTGACTTCATCTTCACTCCATAATATTATATTTTAAAAGAAAAAAGGAAGGATGGATTTAATTACATCATAATACATTTATTGTGGAGTTTAAGGTATGGAATAAAAAATAAGAAAGAGGAAGATGAATAATGAAGAAAAGATGTAAAAAAATTGATTGGAAAATAAAGGGAAGAGCAATTGATGCATAGGACAAGACAAATGGACAAACAAGTCCTAAATATAGTGTGTCCATCCAAATGTTGGGTTGTTCCAAAACCTTTTTTCTGATTCCCTTAAGTCGAGACGAGAGAATGAGAATTTTTGGCCGGGAAAGTCAAGGAATCCGATTGCCCCTTTTTCAATTTCCGGGAACTCTTCCGGCCAACCAAACACACAAATTCAAACATAAATTAATCGATGCGAAATAAATTAATCAGTGGTTGAAATTTTGTGTTAATTCTACAGAAATAATGAAGGAAGATGGGATTAAAATAATTGGAATAGAAAAATATGCGATAAGACCGAAGCAACGAAAATGAAAGCAAATTTATAAAGGAAAGAAGGAGAAGAGTTCCTTGCTCCCAACGCATCTTTTCCTTTTGCTTCGGCTTCGGACCTTTTTGCTTCGGCTTAGACTTTGAGCAGATAAGAACGACTTCCCCCCCCTCTCTCTCTCTCTCTCTCTCTAAACACTCCTTTCCATTCCACCTACAAGAACCCGTTTCCTTCCCCCTCGCCGCCCCCATCTTTTCTTCTTCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTACAGACTCGTACAATCGTTTTCCCTCTTCTTCTTGCTGCTGTTGGTTCTGTTTCTACACATTTTGAGAAGCAAAAAGACTGTTCTTCTTCGTCATTCAAGCTGGCTATCGAAGACCAGGAAGCTACTGTGCGAGAAATCAAGCCCAAGAGTAGGAGAATCATGGTATGATTTCTCTGTTTTTCTTTTGGGCTTTTTTTTTTTTTTATACTTACGGGCATTCTGTTTCTGTTGTTTTTCCCATCTGGGTCTGTGTTTTAATGGTGGATTCTTGTGGGGTTGTTGAGCAGGGTGCTGGAGGTCCTGATGATGAAGACAACAGGTGGCCGCCATGGCTGAAACCTCTGCTTAGAGAAAGCTTCTTCGTTCAATGCAAGTTCCATGCCGATTCCCACAAGAGCGAATGCAATATGTATTGTTTGGATTGCATGAATGGTGCTCTCTGTTCTCTTTGCCTTGCTTATCACAAGGACCATCGTGCTATTCAGGTTTGTTGTTCCCATAATTTCTCTGTGTTTTTTTTACCTTGAGGGTTTCATTCATTCAATCTTTGGGGGTTCCCCTTTTCTCCATCACACTTTTGGTCAATTTTCCTTTCCGGGAAAAAACTCGTGGAATTTCCCCTGGAAAATCGTATTTACATTGTTTCTGAAGAGAATCTATCAAGGGAAGGTCGTTCATCCTTCAAAAAGTTTCGTTCTCTTCTTACTGTTTTTCAAACTCCGATTCAAATCCAGCCATAAATTCGCCGGAGTTTCCCAATCATCTAGTTTCAAAAAGTTGTTTAGAAGCATTATCACTCTCAGCAAATGTACTTTTTCTGACCGAATTAGCACAAACCCTTTATTCACCAATCGAGATTACTAACGGGTTTGCTCTGTTTTTTCGTTTTGGTTCTGTTTGGCGCAGATTAGAAGGTCGTCGTACCACGA
CGTTATCAGGGTATCGGAGATTCAGAAAGTTTTGGACATCACCGGCGTTCAAACCTACATTATCAACAGCGCCAGAGTCGTCTTCTTGAACGAACGGCCGCAGCCGAGGCCGGGGAAAGGCGTCACCAACACCTGCGAAGTCTGCGAGCGGAGCCTCCTCGATTCATTCCGGTTTTGCTCTCTCGGTTGCAAGGTACTATTATTAATTCATCAATGTACACTTTCTCGTTTCCCACTTTTTCTGTTTGGTTGCCGAGAAAATACACATAATCAAACGAAACACGAAATCCTCTTTGATACCGAAAAAAGGGTTAAAAATATCCATCTCGGAGTTAATGCCATTAATATGCTTGGTCGGAATGGGTTTGACTTTTAGTCCCCGATGCTTGAGGCGGTCGATAAAGACCGGCCGAACGGAAGCAGACTTCCGGTCAACTTCCTGAAAAATTAGGGTTTATTTGAGCAGCTTCAGATTTCGTTAATTTCCTCAGCATTATGAGATTGGTAATCGGAAAATTGATTTGGATTTTGGAATCTATTGTTTTCAGATCGTTGGAACTTCGAAGAATTTTGAGAAGAAGCGGCGGTCGCCGAGGTCGGACTCGGAGGATTCGTACAGCAGCAGCAGCCAAGGGCAATCGATGAATAAAAACAAGAACAAAGTGGTGCAGAGCTTCACGCCGTCAACCCCACCCCCAACTTTGGTTAATTACAGAACGGCCAAACGAAGAAAGGGAATTCCACACAGAGCCCCAATGGGAGGACTCATCATCGAATATTGA

mRNA sequence

ATGAGCGATCTGGTTCAAGAATCAGAGGACCCCAAAAAGCCAGAAGAGCTCTACGATCCCTTCGATTCCCGCTTCTCCATGCTCTGCCTCAACGGCTCCATTCCCTCGCCCACCGTCCACGGCGACGGGCATGCCCACCACCCTCGACCCTGTGCTTCATGCGGCTGCCGCGAACCTCGCTCCGCCGCCGCAACCCCTTCCATTCCCATGAAACGACGCTCCCCGACGCCCTCCTACGCCCTTCAAGACCCCACCGACTCCTCACACCCAGAGCCCAAGCCCAAGCGGCTCTTTCTTGATCAAGAAGACAGTCTCCCCGGCTTCTCCAAGATTGATCTTCCTATTCCTTTTGCCGCCGCCCACAGCCAGGGCCGGCCCCCTTCGGTCTCTCCTCTCCGCCGCTCTGTTTCCGATCCTACCGAAGCCCGTAATGGAGCCACTTTCACTCCCTCGTTTGATTATCTACTTGATTCCTCTCCTCCGGTGAATACCGATCAATCACCCGACAATGTGAAGGCGAGTGTACCTCCGTCGTCGGCCAAACGGTTGTGTGCGAACTCGGCGCTGCCGCCGCTGCCTCTCCGTCGTACGGTTTCAGACCCAAATCCCTCCCCTGACAAGATTTCCCGATCACCAAATAAAGTAGGGAGAGTCAGTGATTTGATTAAGGAAGACAGTCCCGATTCAAAGAGGCTGAGAAGGATCAAGGATCGTTTGAAGGAGATGAACCAATGGTGGAACGAAGTCATGAGCGAACAAGAACATGAAGAAGAGGACGAAGACGCCAAAAAGAATAATGTGTGTTGTAAGGAAGAAGAAGATGAAGAAACTGTGGGAGTGGAGAGGGTGGGAGATTCATTGGAGCTTCATCTGAAGTGCCCCTGTGGGAAAGGCTTTGAGATTCTTCTCTGTGGAAGAAGCTGTTTCTACAAGCTGCTTATATTATATAGCAGTGCAGTGGTGAGAAGTTCATTTTCAAGGGGCATATCCACCGTGGCCACCGCCCTCTCCGCTACCACCACCTCCTCCATATCCACTGCCTCCTTCGTGACCACCGTATGCACCTCCTCCACCATAGCCCGATCCTCCTCCTCCTCCGCCTCCGCCTCCGCTTCCTTCACCATTGCTTCCTCCTCCTGCACCATGTCCTCCACCTGCTCCGCTTCCATAACCAGCACCATGCTCTCCTCCTGGACCGTATCCTGCTCCTCCACCACCACCATTTCCACCGCCTCCACCACCACCATAGCCGCCTCCATGTTCTCCACCATACCCAGAACCACCCCCTTCACCTCCTCCAGTGCCATACCCGGCACGACCAGGACCATAACCAGCTCCACCTCCCCCTCCTCCTCCATTTCCTCCACCACCTCCATCATATCCTGCACCTCCTGGACCATAGCCAGCTCCACCGCCATTTCCACCACCTCCACCATATCCGGCACCTCCATGACCATAACCAGCTCCACCACCACTTCCTCCACCTCCCCCGTATCCACCACCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCTCCTCCTCCATATCCACCACCGCCTCCAGGACCATAACCACCGCCACCTGGACCATAGCTAGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCTCCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCACCTCCAGGACCATAACCTGCTCTTCCACCATTTCCACCACCTCCTCCATATCCAGCTCCTCCACCACCTCCGTATCCGCCACCACCAGCACCATAAGCAGAGCCACCGCCACTTCCGCCGCCTCCTCCACCACCATGGCCACCAGCAAGACCATAACCTTCACCACCTCCACTTCCACCTCCACCTCCACTTCCATATCCTCCAACACCGTATTCTCCTATGGAACCATAGCTGCTACCACCCCCACCTCCATAGC
CTCCACCACCACCATGACCACCAAGAGGATCATAGCCACCACCACTACCACCGCCTGCACCATGGCCTACACCATAAGCAAATGCAGGAACATTAACAGTGTGTTTAAGCTGTCCTCCACTCTCTTCCCTCTTCACCCCCATCTAAACTTTCAATGTCATATCCATCTTCCAGGCCACCTCCTTGCAATGCCCTTCTCCGATGCTCTTTCCGGCATAATAGCACTCGGCTTCGATCGCCACCGCGCCGTCCGCCCGGGCATCGATGGCTACGACTTGAACCGGCAGTCGGTCGTAGTGGCCGACAACGGAGTAGAGCTCGCCGCGGATCTGCCGGCCAAATCCGGGGAGATTGAGGAGATAAGGAATGCCATAAGGCCCCAACACGAGCGGGAAGGAGAATCGCGTCGAGTAGTAACCCACATATACGGCGTCGTTAGGGAGATTTCACGTCAGGCAAACCTCATCTCCTCGAGAATCGAGAGTGAAAAAAGGGACGTGGGAAGGGGTTATGTACAGGCGATGCCACTTGGACCTTTGGGCCGGCAGATCGATCGGTTGTGGGACCCAGAGGGCTACGGGAAGGCCCACAGATGGGGCCCAAAGAACTTAAAAGAAGGCCCACACACTCGTACAATCGTTTTCCCTCTTCTTCTTGCTGCTGTTGGTTCTGTTTCTACACATTTTGAGAAGCAAAAAGACTGTTCTTCTTCGTCATTCAAGCTGGCTATCGAAGACCAGGAAGCTACTGTGCGAGAAATCAAGCCCAAGAGTAGGAGAATCATGGGTGCTGGAGGTCCTGATGATGAAGACAACAGGTGGCCGCCATGGCTGAAACCTCTGCTTAGAGAAAGCTTCTTCGTTCAATGCAAGTTCCATGCCGATTCCCACAAGAGCGAATGCAATATGTATTGTTTGGATTGCATGAATGGTGCTCTCTGTTCTCTTTGCCTTGCTTATCACAAGGACCATCGTGCTATTCAGATTAGAAGGTCGTCGTACCACGACGTTATCAGGGTATCGGAGATTCAGAAAGTTTTGGACATCACCGGCGTTCAAACCTACATTATCAACAGCGCCAGAGTCGTCTTCTTGAACGAACGGCCGCAGCCGAGGCCGGGGAAAGGCGTCACCAACACCTGCGAAGTCTGCGAGCGGAGCCTCCTCGATTCATTCCGGTTTTGCTCTCTCGGTTGCAAGATCGTTGGAACTTCGAAGAATTTTGAGAAGAAGCGGCGGTCGCCGAGGTCGGACTCGGAGGATTCGTACAGCAGCAGCAGCCAAGGGCAATCGATGAATAAAAACAAGAACAAAGTGGTGCAGAGCTTCACGCCGTCAACCCCACCCCCAACTTTGGTTAATTACAGAACGGCCAAACGAAGAAAGGGAATTCCACACAGAGCCCCAATGGGAGGACTCATCATCGAATATTGA

Coding sequence (CDS)

ATGAGCGATCTGGTTCAAGAATCAGAGGACCCCAAAAAGCCAGAAGAGCTCTACGATCCCTTCGATTCCCGCTTCTCCATGCTCTGCCTCAACGGCTCCATTCCCTCGCCCACCGTCCACGGCGACGGGCATGCCCACCACCCTCGACCCTGTGCTTCATGCGGCTGCCGCGAACCTCGCTCCGCCGCCGCAACCCCTTCCATTCCCATGAAACGACGCTCCCCGACGCCCTCCTACGCCCTTCAAGACCCCACCGACTCCTCACACCCAGAGCCCAAGCCCAAGCGGCTCTTTCTTGATCAAGAAGACAGTCTCCCCGGCTTCTCCAAGATTGATCTTCCTATTCCTTTTGCCGCCGCCCACAGCCAGGGCCGGCCCCCTTCGGTCTCTCCTCTCCGCCGCTCTGTTTCCGATCCTACCGAAGCCCGTAATGGAGCCACTTTCACTCCCTCGTTTGATTATCTACTTGATTCCTCTCCTCCGGTGAATACCGATCAATCACCCGACAATGTGAAGGCGAGTGTACCTCCGTCGTCGGCCAAACGGTTGTGTGCGAACTCGGCGCTGCCGCCGCTGCCTCTCCGTCGTACGGTTTCAGACCCAAATCCCTCCCCTGACAAGATTTCCCGATCACCAAATAAAGTAGGGAGAGTCAGTGATTTGATTAAGGAAGACAGTCCCGATTCAAAGAGGCTGAGAAGGATCAAGGATCGTTTGAAGGAGATGAACCAATGGTGGAACGAAGTCATGAGCGAACAAGAACATGAAGAAGAGGACGAAGACGCCAAAAAGAATAATGTGTGTTGTAAGGAAGAAGAAGATGAAGAAACTGTGGGAGTGGAGAGGGTGGGAGATTCATTGGAGCTTCATCTGAAGTGCCCCTGTGGGAAAGGCTTTGAGATTCTTCTCTGTGGAAGAAGCTGTTTCTACAAGCTGCTTATATTATATAGCAGTGCAGTGGTGAGAAGTTCATTTTCAAGGGGCATATCCACCGTGGCCACCGCCCTCTCCGCTACCACCACCTCCTCCATATCCACTGCCTCCTTCGTGACCACCGTATGCACCTCCTCCACCATAGCCCGATCCTCCTCCTCCTCCGCCTCCGCCTCCGCTTCCTTCACCATTGCTTCCTCCTCCTGCACCATGTCCTCCACCTGCTCCGCTTCCATAACCAGCACCATGCTCTCCTCCTGGACCGTATCCTGCTCCTCCACCACCACCATTTCCACCGCCTCCACCACCACCATAGCCGCCTCCATGTTCTCCACCATACCCAGAACCACCCCCTTCACCTCCTCCAGTGCCATACCCGGCACGACCAGGACCATAACCAGCTCCACCTCCCCCTCCTCCTCCATTTCCTCCACCACCTCCATCATATCCTGCACCTCCTGGACCATAGCCAGCTCCACCGCCATTTCCACCACCTCCACCATATCCGGCACCTCCATGACCATAACCAGCTCCACCACCACTTCCTCCACCTCCCCCGTATCCACCACCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCTCCTCCTCCATATCCACCACCGCCTCCAGGACCATAACCACCGCCACCTGGACCATAGCTAGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCTCCTCCAGGACCATATCCACCACCACCTGGACCATAGCCCGCTCCACCACCACTTCCTCCACCTCCTCCATATCCACCACCACCTCCAGGACCATAACCTGCTCTTCCACCATTTCCACCACCTCCTCCATATCCAGCTCCTCCACCACCTCCGTATCCGCCACCACCAGCACCATAAGCAGAGCCACCGCCACTTCCGCCGCCTCCTCCACCACCATGGCCACCAGCAAGACCATAACCTTCACCACCTCCACTTCCACCTCCACCTCCACTTCCATATCCTCCAACACCGTATTCTCCTATGGAACCATAGCTGCTACCACCCCCACCTCCATAGC
CTCCACCACCACCATGACCACCAAGAGGATCATAGCCACCACCACTACCACCGCCTGCACCATGGCCTACACCATAAGCAAATGCAGGAACATTAACAGTGTGTTTAAGCTGTCCTCCACTCTCTTCCCTCTTCACCCCCATCTAAACTTTCAATGTCATATCCATCTTCCAGGCCACCTCCTTGCAATGCCCTTCTCCGATGCTCTTTCCGGCATAATAGCACTCGGCTTCGATCGCCACCGCGCCGTCCGCCCGGGCATCGATGGCTACGACTTGAACCGGCAGTCGGTCGTAGTGGCCGACAACGGAGTAGAGCTCGCCGCGGATCTGCCGGCCAAATCCGGGGAGATTGAGGAGATAAGGAATGCCATAAGGCCCCAACACGAGCGGGAAGGAGAATCGCGTCGAGTAGTAACCCACATATACGGCGTCGTTAGGGAGATTTCACGTCAGGCAAACCTCATCTCCTCGAGAATCGAGAGTGAAAAAAGGGACGTGGGAAGGGGTTATGTACAGGCGATGCCACTTGGACCTTTGGGCCGGCAGATCGATCGGTTGTGGGACCCAGAGGGCTACGGGAAGGCCCACAGATGGGGCCCAAAGAACTTAAAAGAAGGCCCACACACTCGTACAATCGTTTTCCCTCTTCTTCTTGCTGCTGTTGGTTCTGTTTCTACACATTTTGAGAAGCAAAAAGACTGTTCTTCTTCGTCATTCAAGCTGGCTATCGAAGACCAGGAAGCTACTGTGCGAGAAATCAAGCCCAAGAGTAGGAGAATCATGGGTGCTGGAGGTCCTGATGATGAAGACAACAGGTGGCCGCCATGGCTGAAACCTCTGCTTAGAGAAAGCTTCTTCGTTCAATGCAAGTTCCATGCCGATTCCCACAAGAGCGAATGCAATATGTATTGTTTGGATTGCATGAATGGTGCTCTCTGTTCTCTTTGCCTTGCTTATCACAAGGACCATCGTGCTATTCAGATTAGAAGGTCGTCGTACCACGACGTTATCAGGGTATCGGAGATTCAGAAAGTTTTGGACATCACCGGCGTTCAAACCTACATTATCAACAGCGCCAGAGTCGTCTTCTTGAACGAACGGCCGCAGCCGAGGCCGGGGAAAGGCGTCACCAACACCTGCGAAGTCTGCGAGCGGAGCCTCCTCGATTCATTCCGGTTTTGCTCTCTCGGTTGCAAGATCGTTGGAACTTCGAAGAATTTTGAGAAGAAGCGGCGGTCGCCGAGGTCGGACTCGGAGGATTCGTACAGCAGCAGCAGCCAAGGGCAATCGATGAATAAAAACAAGAACAAAGTGGTGCAGAGCTTCACGCCGTCAACCCCACCCCCAACTTTGGTTAATTACAGAACGGCCAAACGAAGAAAGGGAATTCCACACAGAGCCCCAATGGGAGGACTCATCATCGAATATTGA

Protein sequence

MSDLVQESEDPKKPEELYDPFDSRFSMLCLNGSIPSPTVHGDGHAHHPRPCASCGCREPRSAAATPSIPMKRRSPTPSYALQDPTDSSHPEPKPKRLFLDQEDSLPGFSKIDLPIPFAAAHSQGRPPSVSPLRRSVSDPTEARNGATFTPSFDYLLDSSPPVNTDQSPDNVKASVPPSSAKRLCANSALPPLPLRRTVSDPNPSPDKISRSPNKVGRVSDLIKEDSPDSKRLRRIKDRLKEMNQWWNEVMSEQEHEEEDEDAKKNNVCCKEEEDEETVGVERVGDSLELHLKCPCGKGFEILLCGRSCFYKLLILYSSAVVRSSFSRGISTVATALSATTTSSISTASFVTTVCTSSTIARSSSSSASASASFTIASSSCTMSSTCSASITSTMLSSWTVSCSSTTTISTASTTTIAASMFSTIPRTTPFTSSSAIPGTTRTITSSTSPSSSISSTTSIISCTSWTIASSTAISTTSTISGTSMTITSSTTTSSTSPVSTTSRTISTTTWTIARSTTTSSSSSISTTASRTITTATWTIASSTTTSSTSSISTTSSRTISTTTWTIARSTTTSSTSSISTTTSRTITCSSTISTTSSISSSSTTSVSATTSTISRATATSAASSTTMATSKTITFTTSTSTSTSTSISSNTVFSYGTIAATTPTSIASTTTMTTKRIIATTTTTACTMAYTISKCRNINSVFKLSSTLFPLHPHLNFQCHIHLPGHLLAMPFSDALSGIIALGFDRHRAVRPGIDGYDLNRQSVVVADNGVELAADLPAKSGEIEEIRNAIRPQHEREGESRRVVTHIYGVVREISRQANLISSRIESEKRDVGRGYVQAMPLGPLGRQIDRLWDPEGYGKAHRWGPKNLKEGPHTRTIVFPLLLAAVGSVSTHFEKQKDCSSSSFKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSPRSDSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY
Homology
BLAST of Sgr014827 vs. NCBI nr
Match: XP_022159845.1 (uncharacterized protein LOC111026143, partial [Momordica charantia])

HSP 1 Score: 469.9 bits (1208), Expect = 6.1e-128
Identity = 240/268 (89.55%), Positives = 246/268 (91.79%), Query Frame = 0

Query: 882  PLLLAAVGSVSTHFE----KQKDCSSSSFKLAIEDQEATVREIKPKSRRIMGAGGPDDED 941
            PL L+   S+ THF       KDCS+ SFKLAIEDQEATVREIKPKSRRIM   GPD+ED
Sbjct: 7    PLSLSLSISLYTHFNIHSPSSKDCSTLSFKLAIEDQEATVREIKPKSRRIM---GPDEED 66

Query: 942  NRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR 1001
            +RWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR
Sbjct: 67   HRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR 126

Query: 1002 SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS 1061
            SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS
Sbjct: 127  SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS 186

Query: 1062 FRFCSLGCKIVGTSKNFEKKRRSPRSDSED-SYSSSSQGQSMNK-NKNKVVQSFTPSTPP 1121
            FRFCSLGCKIVGTSKNF KKRR   SDSED SYSSSSQG+SMNK NKNKVVQSFTPSTPP
Sbjct: 187  FRFCSLGCKIVGTSKNFVKKRRMTGSDSEDSSYSSSSQGKSMNKNNKNKVVQSFTPSTPP 246

Query: 1122 PTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            PTLVNYRTAKRRKGIPHRAPMGGLIIEY
Sbjct: 247  PTLVNYRTAKRRKGIPHRAPMGGLIIEY 271

BLAST of Sgr014827 vs. NCBI nr
Match: XP_014517396.1 (uncharacterized protein LOC106774881 [Vigna radiata var. radiata])

HSP 1 Score: 432.6 bits (1111), Expect = 1.1e-116
Identity = 210/242 (86.78%), Positives = 225/242 (92.98%), Query Frame = 0

Query: 904  SSFKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSH 963
            ++FKLA ++Q  TVREIKPK+RRIMG+GGPDDEDNRWPPWLKPLL+ESFFVQCK HADSH
Sbjct: 24   TTFKLAFQNQATTVREIKPKNRRIMGSGGPDDEDNRWPPWLKPLLKESFFVQCKLHADSH 83

Query: 964  KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYII 1023
            KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYII
Sbjct: 84   KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYII 143

Query: 1024 NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP-- 1083
            NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NFEKK++S   
Sbjct: 144  NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFEKKKKSATM 203

Query: 1084 RSDSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLII 1143
             SDSEDSYSS+S   S  KN +  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+I
Sbjct: 204  ASDSEDSYSSNS---SHGKNNSNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLVI 262

BLAST of Sgr014827 vs. NCBI nr
Match: XP_007156423.1 (hypothetical protein PHAVU_003G2848001g, partial [Phaseolus vulgaris] >ESW28417.1 hypothetical protein PHAVU_003G2848001g, partial [Phaseolus vulgaris])

HSP 1 Score: 431.0 bits (1107), Expect = 3.2e-116
Identity = 210/243 (86.42%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 903  SSSFKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADS 962
            S+SFKLA ++Q  TVREIKPK+RRIMG+GGPDD DNRWPPWLKPLL+ESFFVQCK HADS
Sbjct: 20   STSFKLAFQNQTITVREIKPKNRRIMGSGGPDDADNRWPPWLKPLLKESFFVQCKLHADS 79

Query: 963  HKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYI 1022
            HKSECNMYCLDCMNGALCSLCL YHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYI
Sbjct: 80   HKSECNMYCLDCMNGALCSLCLGYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYI 139

Query: 1023 INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP- 1082
            INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NFEKK++S  
Sbjct: 140  INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFEKKKKSTT 199

Query: 1083 -RSDSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLI 1142
              SDSEDSYSS+S   S  KN +  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+
Sbjct: 200  MASDSEDSYSSNS---SHGKNNSNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLV 259

Query: 1143 IEY 1144
            IEY
Sbjct: 260  IEY 259

BLAST of Sgr014827 vs. NCBI nr
Match: RDY00613.1 (hypothetical protein CR513_16178, partial [Mucuna pruriens])

HSP 1 Score: 429.9 bits (1104), Expect = 7.0e-116
Identity = 210/240 (87.50%), Positives = 222/240 (92.50%), Query Frame = 0

Query: 906  FKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKS 965
            FKLA E+Q+ TVREIKPK+RRIMGAGGP DEDNRWPPWLKPLL+ESFFVQCK HADSHKS
Sbjct: 83   FKLAFENQDITVREIKPKNRRIMGAGGPGDEDNRWPPWLKPLLKESFFVQCKLHADSHKS 142

Query: 966  ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINS 1025
            ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYIINS
Sbjct: 143  ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYIINS 202

Query: 1026 ARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP--RS 1085
            ARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NF+KK++S    S
Sbjct: 203  ARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFQKKKKSATMAS 262

Query: 1086 DSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            DSEDS SS+S   S  KN N  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+IEY
Sbjct: 263  DSEDSCSSNS---SHGKNNNNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLVIEY 319

BLAST of Sgr014827 vs. NCBI nr
Match: TKY58532.1 (hypothetical protein E2542_SST15595 [Spatholobus suberectus])

HSP 1 Score: 429.1 bits (1102), Expect = 1.2e-115
Identity = 215/278 (77.34%), Positives = 233/278 (83.81%), Query Frame = 0

Query: 875  HTRTIVFPLL-------LAAVGSVSTHFEKQKDCSSSSFKLAIEDQEATVREIKPKSRRI 934
            HT   +F LL       L    S S    +      ++FKLA E+Q+ TVREIKPK+RRI
Sbjct: 174  HTYLFLFSLLFFLQNNILTPTSSHSPQHTQHNLSPITAFKLAFENQDTTVREIKPKNRRI 233

Query: 935  MGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYH 994
            MGAGGPDDEDNRWPPWLKPLL+ESFFVQC  HADSHKSECNMYCLDCMNGALCSLCLAYH
Sbjct: 234  MGAGGPDDEDNRWPPWLKPLLKESFFVQCNVHADSHKSECNMYCLDCMNGALCSLCLAYH 293

Query: 995  KDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTC 1054
            KDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTC
Sbjct: 294  KDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTC 353

Query: 1055 EVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP--RSDSEDSYSSSSQGQSMNKNKNKV 1114
            EVCERSLLDS+RFCSLGCKIVGTS+NF KK++S    SDSEDSYSS+S     N NK   
Sbjct: 354  EVCERSLLDSYRFCSLGCKIVGTSRNFLKKKKSAGMASDSEDSYSSNSSHGKKNSNK--- 413

Query: 1115 VQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            VQSFTP+TPPPT VNY+TAKRRKGIPHRAPMGGL+IE+
Sbjct: 414  VQSFTPTTPPPTSVNYKTAKRRKGIPHRAPMGGLVIEF 448

BLAST of Sgr014827 vs. ExPASy TrEMBL
Match: A0A6J1DZX6 (uncharacterized protein LOC111026143 OS=Momordica charantia OX=3673 GN=LOC111026143 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 3.0e-128
Identity = 240/268 (89.55%), Positives = 246/268 (91.79%), Query Frame = 0

Query: 882  PLLLAAVGSVSTHFE----KQKDCSSSSFKLAIEDQEATVREIKPKSRRIMGAGGPDDED 941
            PL L+   S+ THF       KDCS+ SFKLAIEDQEATVREIKPKSRRIM   GPD+ED
Sbjct: 7    PLSLSLSISLYTHFNIHSPSSKDCSTLSFKLAIEDQEATVREIKPKSRRIM---GPDEED 66

Query: 942  NRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR 1001
            +RWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR
Sbjct: 67   HRWPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRR 126

Query: 1002 SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS 1061
            SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS
Sbjct: 127  SSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS 186

Query: 1062 FRFCSLGCKIVGTSKNFEKKRRSPRSDSED-SYSSSSQGQSMNK-NKNKVVQSFTPSTPP 1121
            FRFCSLGCKIVGTSKNF KKRR   SDSED SYSSSSQG+SMNK NKNKVVQSFTPSTPP
Sbjct: 187  FRFCSLGCKIVGTSKNFVKKRRMTGSDSEDSSYSSSSQGKSMNKNNKNKVVQSFTPSTPP 246

Query: 1122 PTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            PTLVNYRTAKRRKGIPHRAPMGGLIIEY
Sbjct: 247  PTLVNYRTAKRRKGIPHRAPMGGLIIEY 271

BLAST of Sgr014827 vs. ExPASy TrEMBL
Match: A0A1S3VGB4 (uncharacterized protein LOC106774881 OS=Vigna radiata var. radiata OX=3916 GN=LOC106774881 PE=4 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 5.2e-117
Identity = 210/242 (86.78%), Positives = 225/242 (92.98%), Query Frame = 0

Query: 904  SSFKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSH 963
            ++FKLA ++Q  TVREIKPK+RRIMG+GGPDDEDNRWPPWLKPLL+ESFFVQCK HADSH
Sbjct: 24   TTFKLAFQNQATTVREIKPKNRRIMGSGGPDDEDNRWPPWLKPLLKESFFVQCKLHADSH 83

Query: 964  KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYII 1023
            KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYII
Sbjct: 84   KSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYII 143

Query: 1024 NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP-- 1083
            NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NFEKK++S   
Sbjct: 144  NSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFEKKKKSATM 203

Query: 1084 RSDSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLII 1143
             SDSEDSYSS+S   S  KN +  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+I
Sbjct: 204  ASDSEDSYSSNS---SHGKNNSNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLVI 262

BLAST of Sgr014827 vs. ExPASy TrEMBL
Match: V7CE01 (Uncharacterized protein (Fragment) OS=Phaseolus vulgaris OX=3885 GN=PHAVU_003G2848001g PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.5e-116
Identity = 210/243 (86.42%), Positives = 224/243 (92.18%), Query Frame = 0

Query: 903  SSSFKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADS 962
            S+SFKLA ++Q  TVREIKPK+RRIMG+GGPDD DNRWPPWLKPLL+ESFFVQCK HADS
Sbjct: 20   STSFKLAFQNQTITVREIKPKNRRIMGSGGPDDADNRWPPWLKPLLKESFFVQCKLHADS 79

Query: 963  HKSECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYI 1022
            HKSECNMYCLDCMNGALCSLCL YHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYI
Sbjct: 80   HKSECNMYCLDCMNGALCSLCLGYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYI 139

Query: 1023 INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP- 1082
            INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NFEKK++S  
Sbjct: 140  INSARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFEKKKKSTT 199

Query: 1083 -RSDSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLI 1142
              SDSEDSYSS+S   S  KN +  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+
Sbjct: 200  MASDSEDSYSSNS---SHGKNNSNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLV 259

Query: 1143 IEY 1144
            IEY
Sbjct: 260  IEY 259

BLAST of Sgr014827 vs. ExPASy TrEMBL
Match: A0A371HCT3 (Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_16178 PE=4 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 3.4e-116
Identity = 210/240 (87.50%), Positives = 222/240 (92.50%), Query Frame = 0

Query: 906  FKLAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKS 965
            FKLA E+Q+ TVREIKPK+RRIMGAGGP DEDNRWPPWLKPLL+ESFFVQCK HADSHKS
Sbjct: 83   FKLAFENQDITVREIKPKNRRIMGAGGPGDEDNRWPPWLKPLLKESFFVQCKLHADSHKS 142

Query: 966  ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINS 1025
            ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRV+EIQKVLDITGVQTYIINS
Sbjct: 143  ECNMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVNEIQKVLDITGVQTYIINS 202

Query: 1026 ARVVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP--RS 1085
            ARVVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTS+NF+KK++S    S
Sbjct: 203  ARVVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSRNFQKKKKSATMAS 262

Query: 1086 DSEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            DSEDS SS+S   S  KN N  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGL+IEY
Sbjct: 263  DSEDSCSSNS---SHGKNNNNKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLVIEY 319

BLAST of Sgr014827 vs. ExPASy TrEMBL
Match: A0A151THE9 (Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_012709 PE=4 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 4.9e-115
Identity = 207/239 (86.61%), Positives = 221/239 (92.47%), Query Frame = 0

Query: 908  LAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKSEC 967
            +AIE+Q+ TVREIKPK+RRIMGAGGPDDEDNRWPPWLKPLL+ESFFVQCK HADSHKSEC
Sbjct: 1    MAIENQDTTVREIKPKNRRIMGAGGPDDEDNRWPPWLKPLLKESFFVQCKLHADSHKSEC 60

Query: 968  NMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSAR 1027
            NMYCLDCMNG LCSLCL +H+DHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSAR
Sbjct: 61   NMYCLDCMNGPLCSLCLVHHRDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSAR 120

Query: 1028 VVFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSP---RSD 1087
            VVFLNERPQPRPGKGVTNTCEVCERSLLDS+RFCSLGCKIVGTSKNF+KK++ P    SD
Sbjct: 121  VVFLNERPQPRPGKGVTNTCEVCERSLLDSYRFCSLGCKIVGTSKNFQKKKKHPAAMSSD 180

Query: 1088 SEDSYSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            SEDSYSS+S   +  + KN  VQSFTPSTPPPT VNYRTAKRRKGIPHRAPMGGLIIEY
Sbjct: 181  SEDSYSSNS---AHGRQKNFKVQSFTPSTPPPTSVNYRTAKRRKGIPHRAPMGGLIIEY 236

BLAST of Sgr014827 vs. TAIR 10
Match: AT4G17900.1 (PLATZ transcription factor family protein )

HSP 1 Score: 353.6 bits (906), Expect = 6.0e-97
Identity = 172/232 (74.14%), Positives = 197/232 (84.91%), Query Frame = 0

Query: 908  LAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKSEC 967
            +AIEDQE T+REIKPK+RRIMGAGGP++E+NRWPPWLKPLL+E FFV CKFH DSHKSEC
Sbjct: 1    MAIEDQENTIREIKPKNRRIMGAGGPEEEENRWPPWLKPLLKEQFFVHCKFHGDSHKSEC 60

Query: 968  NMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSAR 1027
            NMYCLDC NG LCSLCLA+HKDHR IQIRRSSYHDVIRV+EIQK LDI G+QTY+INSA+
Sbjct: 61   NMYCLDCTNGPLCSLCLAHHKDHRTIQIRRSSYHDVIRVNEIQKYLDIGGIQTYVINSAK 120

Query: 1028 VVFLNERPQPRPGKGVTNTCEVCERSLL-DSFRFCSLGCKIVGTSKNFEKKRRSPRSDSE 1087
            VVFLNERPQPRPGKGVTNTC+VC RSL+ DSFRFCSLGCKI GTS+ FEK R +   ++E
Sbjct: 121  VVFLNERPQPRPGKGVTNTCKVCYRSLVDDSFRFCSLGCKIAGTSRGFEKGRENLLMETE 180

Query: 1088 DSYSSSSQGQSMNKNKNKVVQSFTPSTPP-PTLVNYRTAKRRKGIPHRAPMG 1138
            DS SS + G+++       +QSF+PSTPP  T  N R  KRRKGIPHR+PMG
Sbjct: 181  DSSSSIAIGKNITN-----LQSFSPSTPPLTTSSNCRIVKRRKGIPHRSPMG 227

BLAST of Sgr014827 vs. TAIR 10
Match: AT1G32700.1 (PLATZ transcription factor family protein )

HSP 1 Score: 326.6 bits (836), Expect = 7.8e-89
Identity = 162/213 (76.06%), Positives = 180/213 (84.51%), Query Frame = 0

Query: 932  GPDDEDNR-WPPWLKPLLRESFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDH 991
            G ++E N+ +P WLKPLLRE FFVQCK HADSHKSECNMYCLDC NG LCSLCL++HKDH
Sbjct: 2    GAEEETNKTYPHWLKPLLREKFFVQCKLHADSHKSECNMYCLDCTNGPLCSLCLSFHKDH 61

Query: 992  RAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVC 1051
             AIQIRRSSYHDVIRVSEIQK LDITGVQTY+INSA+VVFLNERPQPRPGKGV NTCEVC
Sbjct: 62   HAIQIRRSSYHDVIRVSEIQKFLDITGVQTYVINSAKVVFLNERPQPRPGKGVINTCEVC 121

Query: 1052 ERSLLDSFRFCSLGCKIVGTSKNFEKKRRSPRSDSEDSYSSSSQGQSMNKNKNKVVQSFT 1111
             RSL+DSFRFCSLGCKI G SK   K+  +  SDS+DSYSS+S G+ + KN + +  SFT
Sbjct: 122  YRSLVDSFRFCSLGCKISGISKKKRKEWTNNLSDSDDSYSSTSIGR-LKKNDDIMNNSFT 181

Query: 1112 PSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            PSTPP + VN R AKRRKGIPHRAP GGLIIEY
Sbjct: 182  PSTPPLSAVNRRIAKRRKGIPHRAPFGGLIIEY 213

BLAST of Sgr014827 vs. TAIR 10
Match: AT1G32700.2 (PLATZ transcription factor family protein )

HSP 1 Score: 265.4 bits (677), Expect = 2.1e-70
Identity = 134/175 (76.57%), Positives = 148/175 (84.57%), Query Frame = 0

Query: 969  MYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSARV 1028
            MYCLDC NG LCSLCL++HKDH AIQIRRSSYHDVIRVSEIQK LDITGVQTY+INSA+V
Sbjct: 1    MYCLDCTNGPLCSLCLSFHKDHHAIQIRRSSYHDVIRVSEIQKFLDITGVQTYVINSAKV 60

Query: 1029 VFLNERPQPRPGKGVTNTCEVCERSLLDSFRFCSLGCKIVGTSKNFEKKRRSPRSDSEDS 1088
            VFLNERPQPRPGKGV NTCEVC RSL+DSFRFCSLGCKI G SK   K+  +  SDS+DS
Sbjct: 61   VFLNERPQPRPGKGVINTCEVCYRSLVDSFRFCSLGCKISGISKKKRKEWTNNLSDSDDS 120

Query: 1089 YSSSSQGQSMNKNKNKVVQSFTPSTPPPTLVNYRTAKRRKGIPHRAPMGGLIIEY 1144
            YSS+S G+ + KN + +  SFTPSTPP + VN R AKRRKGIPHRAP GGLIIEY
Sbjct: 121  YSSTSIGR-LKKNDDIMNNSFTPSTPPLSAVNRRIAKRRKGIPHRAPFGGLIIEY 174

BLAST of Sgr014827 vs. TAIR 10
Match: AT5G46710.1 (PLATZ transcription factor family protein )

HSP 1 Score: 246.1 bits (627), Expect = 1.3e-64
Identity = 127/233 (54.51%), Positives = 163/233 (69.96%), Query Frame = 0

Query: 908  LAIEDQEATVREIKPKSRRIMGAGGPDDEDNRWPPWLKPLLRESFFVQCKFHADSHKSEC 967
            +AIED E   REIKPK+RR M      + +N+WP WLKPLL + FF QCKFH    ++EC
Sbjct: 1    MAIEDYENPNREIKPKNRRFM------EGENQWPIWLKPLLNQHFFAQCKFHGHLPRTEC 60

Query: 968  NMYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSAR 1027
             MYCLDC N + CSLCL+ H++HR IQIR SSYH+V +V EIQK LDI+ +QTY+INS++
Sbjct: 61   KMYCLDCTNDSFCSLCLSEHENHRTIQIRISSYHNVTKVDEIQKYLDISSIQTYVINSSK 120

Query: 1028 VVFLNERPQPRPGKGVTNTCEVCERSLLDS-FRFCSLGCKIVGTSKNFEKKRRSPRSDSE 1087
            V+FLNERPQ +PGKG TN C VC R L ++ FRFCS+GCK+ GTS  F+K+ +   +DS+
Sbjct: 121  VLFLNERPQSKPGKGFTNACMVCYRGLAENCFRFCSIGCKVAGTSGVFQKRVKHTTNDSD 180

Query: 1088 DSYSSS--SQGQSMNKNKNKVVQSFTPSTP--PPTLVNYRTAKRRKGIPHRAP 1136
            +S +SS      S  +N N  +QS +P TP  PP  +  R    RKGIPHRAP
Sbjct: 181  NSNNSSGVENNSSGAENGNSNLQSLSPPTPQFPPRSLRKRL---RKGIPHRAP 224

BLAST of Sgr014827 vs. TAIR 10
Match: AT4G17900.2 (PLATZ transcription factor family protein )

HSP 1 Score: 237.3 bits (604), Expect = 6.2e-62
Identity = 122/171 (71.35%), Positives = 140/171 (81.87%), Query Frame = 0

Query: 969  MYCLDCMNGALCSLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKVLDITGVQTYIINSARV 1028
            MYCLDC NG LCSLCLA+HKDHR IQIRRSSYHDVIRV+EIQK LDI G+QTY+INSA+V
Sbjct: 1    MYCLDCTNGPLCSLCLAHHKDHRTIQIRRSSYHDVIRVNEIQKYLDIGGIQTYVINSAKV 60

Query: 1029 VFLNERPQPRPGKGVTNTCEVCERSLL-DSFRFCSLGCKIVGTSKNFEKKRRSPRSDSED 1088
            VFLNERPQPRPGKGVTNTC+VC RSL+ DSFRFCSLGCKI GTS+ FEK R +   ++ED
Sbjct: 61   VFLNERPQPRPGKGVTNTCKVCYRSLVDDSFRFCSLGCKIAGTSRGFEKGRENLLMETED 120

Query: 1089 SYSSSSQGQSMNKNKNKVVQSFTPSTPP-PTLVNYRTAKRRKGIPHRAPMG 1138
            S SS + G+++       +QSF+PSTPP  T  N R  KRRKGIPHR+PMG
Sbjct: 121  SSSSIAIGKNITN-----LQSFSPSTPPLTTSSNCRIVKRRKGIPHRSPMG 166
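
Each hit above carries two summary lines: a score line (bit score, raw score, Expect) and an identity line (identical and positive positions over the alignment length). The following is a minimal Python sketch of how those two lines could be read and the percentages recomputed. The function name `parse_hsp` and the regular expressions are illustrative assumptions inferred from the lines on this page, not part of any BLAST library, and the label after `Identity` is matched loosely.

```python
import re

# Patterns inferred from the summary lines shown on this page (assumption).
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp(score_line, identity_line):
    """Hypothetical helper: extract HSP statistics from the two summary lines."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "align_len": length,
        # Percentages are recomputed from the counts rather than copied.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

# Summary lines of the AT4G17900.2 hit, copied from this page.
hsp = parse_hsp(
    "HSP 1 Score: 237.3 bits (604), Expect = 6.2e-62",
    "Identity = 122/171 (71.35%), Postives = 140/171 (81.87%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the raw counts is a cheap consistency check on the reported values; for the hit used here the recomputed figures agree with the 71.35% and 81.87% printed in the summary line.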

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity  Description
XP_022159845.1  6.1e-128  89.55     uncharacterized protein LOC111026143, partial [Momordica charantia]
XP_014517396.1  1.1e-116  86.78     uncharacterized protein LOC106774881 [Vigna radiata var. radiata]
XP_007156423.1  3.2e-116  86.42     hypothetical protein PHAVU_003G2848001g, partial [Phaseolus vulgaris] >ESW28417....
RDY00613.1      7.0e-116  87.50     hypothetical protein CR513_16178, partial [Mucuna pruriens]
TKY58532.1      1.2e-115  77.34     hypothetical protein E2542_SST15595 [Spatholobus suberectus]

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1DZX6      3.0e-128  89.55     uncharacterized protein LOC111026143 OS=Momordica charantia OX=3673 GN=LOC111026...
A0A1S3VGB4      5.2e-117  86.78     uncharacterized protein LOC106774881 OS=Vigna radiata var. radiata OX=3916 GN=LO...
V7CE01          1.5e-116  86.42     Uncharacterized protein (Fragment) OS=Phaseolus vulgaris OX=3885 GN=PHAVU_003G28...
A0A371HCT3      3.4e-116  87.50     Uncharacterized protein (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_16178 P...
A0A151THE9      4.9e-115  86.61     Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_012709 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity  Description
AT4G17900.1     6.0e-97   74.14     PLATZ transcription factor family protein
AT1G32700.1     7.8e-89   76.06     PLATZ transcription factor family protein
AT1G32700.2     2.1e-70   76.57     PLATZ transcription factor family protein
AT5G46710.1     1.3e-64   54.51     PLATZ transcription factor family protein
AT4G17900.2     6.2e-62   71.35     PLATZ transcription factor family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term     Source Description                         Alignment
IPR006734  PLATZ transcription factor  PFAM         PF04640         PLATZ                                      coord: 996..1067; e-value: 4.9E-29; score: 100.8
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 128..174
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 432..456
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 482..501
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 1086..1119
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 1074..1130
None       No IPR available            MOBIDB_LITE  mobidb-lite     disorder_prediction                        coord: 33..229
None       No IPR available            PANTHER      PTHR31065:SF57  PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 916..1138
None       No IPR available            PANTHER      PTHR31065       PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 916..1138
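
The InterPro alignments above are given as `coord: start..end` spans on the protein. As a rough illustration, the Python sketch below parses two of those spans and checks how they relate. The helper names are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Hypothetical helper: return (start, end) from a 'coord: a..b' entry."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinate span found")
    return int(m.group(1)), int(m.group(2))

def span_length(span):
    # Assumes both endpoints are included in the span.
    return span[1] - span[0] + 1

def contains(outer, inner):
    """True if 'inner' lies entirely within 'outer'."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

# Spans copied from the PFAM (PF04640) and PANTHER (PTHR31065) rows.
pfam = parse_coord("coord: 996..1067")
panther = parse_coord("coord: 916..1138")

print(span_length(pfam))        # length of the PFAM PLATZ match
print(contains(panther, pfam))  # is the PFAM match inside the PANTHER match?
```

Under the inclusive-coordinate assumption the PFAM match covers 72 residues and sits entirely inside the PANTHER family match.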

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr014827.1   Sgr014827.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane