Sgr026053 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr026053
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Peroxidase
Location: tig00153031: 1288285 .. 1307499 (-)
RNA-Seq Expression: Sgr026053
Synteny: Sgr026053
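
The Location field packs the contig, the start and end coordinates, and the strand into one string. The following is a minimal sketch of how it could be unpacked and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

# Location value taken from the Overview above.
LOCATION = "tig00153031: 1288285 .. 1307499 (-)"

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location(LOCATION)
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, strand, span)
```

Under that assumption the gene spans roughly 19.2 kb on the minus strand of tig00153031.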
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCTTCACATGGGAAGCGCAGTCCCATACACCAACTTTACGTTTGTGCTATTCGATTCCTATACCAATCCTTCTCTTCAATGTCAGAATCTCAAGGTTCATCTCAATCTCTCGCAGTCCGTGGTTTGCGTGACTTGGTTCGAAGACCTCAAAGTGTCGATTCGAGTTCCTGTTCCTTCGGTTTTGGTTGACGCGGAGTCGCCCTTGAGTTTTAGAGCTTTTGAAGATCATATTGAGGTCAAGCTCGTCTTGCTTCTTCCGGTCGATCACCCAATTATTCTCAACTTCGACAATGTGCTGAATTTCTCCGAAGAGCGAGGAAATGACTACTCTAAGGCGTCGAAGCCACTTTTGATGGACTCTGGTGCGCTAAAATTTCAATTTTTTGTTTTATCTCTATTGATAGTCGCTGCATTTTGTAAAAATGACTTCGGGGAAAGGTGAAACCCACGGTGGCGTAGCTACTTAGATATTAAAATCCTATGAATTTCTTGTAAATATTGTTGGTAAATGGTTGTCAAAATAGTTGCGAACTAAGCCTAGCTGCCTCCTCTAATAGGTTATAAGTTGGAATCACCGCACTGAATTTTTTTTATAAAAAAGTGGTTGTCAAAATAGTTGCTGCATTTTCCCTTGTGCTTCCTAAGAAATTTTGTTGAATTATAAGGTGAAAGCTATAACAGATATGCATTACCATTTGCAAAAGGTGCTAAATTTGAAACTTAAAGGTTTTTGCGCTGTCTATCTCTGATATGTATGATCAATATCAAACTTTACGGAGAGTTAATTGCAATGCTAAAGAGCCAGAAAGCTTGGACTTTTATTATGTTCAATCTTTTAATTTCTCATGTCTGGTGGCCATTGCTCTACCATCTACTATTTTTTCATTAAGTTGTATGTTTTTCCCTTCTGACATGCTTGCGTTTCTTAGACAAAACAGTTTATCACGCAGTGGTGGCATCCACTTTTATTGCAGAAATTGTTCGTTCAGGCTGAGTAAATCTCCGCTCAGGTAGAAATTGTTCAATCACACTATTGACAGGATTTATGATTTCCTAGAGCAGTTGATTTGCATTCCTCCAGTCCATAAATTTTGGTTTTTGACTTTTCACAATTTATATGTATGTGTGTAGAGATTTTGTTGAAATGCCATCAGTCAACTGGCGAGAGGTGGCTGATAACTGGTTTGGGTCTTGCTGCTGCTCCTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACAAATTCCTATAGATGTGCAAAGGGTGTCTGCCTACTCACTCTAACAACTATTACTCTTTCCAAGGATGATCTTATTGGACATATGTTCCCAGACTATGATGGGACCCGAGAATTCAAGGATGAATCAGATTTTACTGATGGAAATTGGTTAAGTGAAGCTAAGCAGGAATCACAATGTAATCATACATCTACGAAGGAGGTAAAATCTAAGAAGTTTAATGATGAAAACCTTGCTGCAAACATGGAGGGTGATGCTGCTGGGAAAGAAAGTGATGAAGTTAATTCACCTCATATGACTCCGATTCCTGAGTGTTGTCATCATGGAGAAAGTAATGTTTTAAATAATCTTGATAGAGACTGCATGCATCACACATGTGGCACATATAAGTTAGACCCAAAGCCTATTAATACTATAGATCTTTCGGACAATCAGAGATCCTTTCTTAATGGTTTCCTTGGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCAGATTTTGAGTGGGTTGAGTTTTTTTGCCCCCAGTGCTCAACTCTGATTGGGGCTTACCCTTGCAGTAATGGCTGCGGACCTACAGATGGTGGAGTTCGACTCTTTAAATGTTATGTCTCAGCATGTTCATCAGTTGAATCTAGAAATTTTTTGAGGTTAGCATATTATTATGTTATGCCTTTCTTTTAGTCGTCTCTGGAAATCATGTCTGCATAATGTAATAAGTGACTTTGAGATGTTTCTTATTTCATTGAGTG
CTGAAGATTAGCTGACTTATGCTCACAGGGAGTACACCTTGGAAAGAATGTTTGCTAATCAGCTACTGGAAAGTGCAAATGAAGAATCATCATTTCGTACTGTGGTTAAGGAGCTGAAAACCAAGTCTTCCATGCTGCACATTGTTCTCATTAATTCAAATTCTTGGTCTTGTAGTGGTTATTGTTTGGGCATGAAGGATACTGCAGAATCAGTTCCAAAGATGGAGTTAAATCCTGTCATCAAGGTGCTATTTTCCGATTGCAACAAAAGTGCAGAATCTCATTTGAGGTTGTTTAGAAATTATTCTCTCATCTTTCCTTCTTGTGTGATGTCAAGTATCCTTATAGAGACCTATTTACACCGGTAGATCATAGGTGTTCTCATTTTCTGCAATTTCCCTGTTTCTAGTACCCCCTTTCCTTTCTTATATATCAGTACATGGTTTTGTGATTTTTACCTATTTGTAGTTGCTGCATTCTATTCCCAATACGTACTTCTCTGTTTTTATTATATAAATTGATCATGATTGAATTTGAAATGCAGGAAACTTGAAGAGTTGGTGACAAAAGATATAGCAGATGAAGTTTTTATGTTAGCCCATCAAATAGAGGAATTAGTTGAAATCCTAGTTTCAAGAATGATACCCTTCCATCTTCTTGTTCTTCCCTTGATGGTTTATCTTTGGCATCTATCCTGAGGTGACATTTATCTTTCTCATCTTCTTTGGATTCCCACTCAAATCATGGATTAGTCATGATCTTCTAAGATATGAACTTGATAAAGCCGAGTGGCAGCTTTGAAGAACTCATCATTGTAAATCGTGTTTCTGATATTTTAAGGAGCCGCCAACCAAATATGTAAGGCCTTTAGATCTGTTTATCAGCAAGCCAGACTTTGTATCGATGGGAACCTTATTAATGGTGCTGGCTTATCAGATCTTGGATGTGTTTCAATCACGTGGAGGATGTTACTTTATATTTGCAGTTAAACTACATGTTTCGGGTATATTTTATTTTGTGTTTATTCGGTTCTCTGCTTGGTTAGGGCATTAGTCGTCAAGGTGCTTGACTGTAAGATACTTGATATTTTATGCCAGCATTTTAGTCGGATTTGAGTGAGCTTGTTACAAAAGCTTCCCCACAAGCCATATCTACTAGGGGATATCAGCTAATTATGCCGGAACATTTAAACCTTTATTTTAAGGTTGATGTTGGCCTTTAGGCACCTGTAGAGAATGTAGAGTTTTAATGACGTCTCTTTGCTTAGTCTCTGTACCAGGTACGGTGTTAATTTTCAAGCTTTTTAGGGGATGTTTTAGTCTCTGTTTACCAAAACACTTCCCATTCAAAATCTCATTGTGTAGGACAGGAGTGATTTAAAATTGCTTTTGATTAAAGTCAAAATCACAAAATGCTTTGTGAGAAAGTACTTGTAGAGTGTTTTTGGTCTAAGTGCTTGTTAGGAAAGCACTTAAAAGTACTTTTTAAGAGCGTAGAATCAAATATAAAAATAATTTGTCATTGAGTATGTTTACTACAGTACTTGTAATCAAATTACTTAGGCTAAAATCACATCTTTTAAAAGCATTTTCGGGCACCATTAGTTCTTTCTTAGATTTCTCTTTTTCCTTCATGGATTTGACATCTTTTTATTCCAAGGAGTTTGCTGATCATGGGCTGCCAGCAAGTTTCACAAGGATTACTTTTATTATTACCTTTCTTAGCTTTTCAAACTCAGTTGAAGATGCCATGTTCCTTAGTTTTGTAATATGAAGAATTAGAGCTTTGAATGGTGCAGGTGAGCAGTTGCCCATGTGAAAGAATTAGAGCTTTTAATCAAAGATTAGAGTAAAAAAAAAAAAAAAGGGGTTGTCGTCCTTCTCCTAATTTTGTAAATAGCCTTCTGGCAAGTTAACCAATTAAATAAATATCTAAATTATGACGTTACGCTGCGTTTTTCTTTTCTTTTATTATTCATATATTTTTTTCTCATATCTTTC
TACGACTAACAAAATATATTTCTACAGCTCAAAATCAAATTTGGTTTCTTCATTTTTACATTTTTTTCTCATCAAGCCAAAAGATAGACATATCTTATGATTATTATGACTTTTCACAGTGTCAAAAAAACTATTATTGGTAACGAAAAATTTTGTCAATGAGAAAAAAAAAAATTTGGTAACAAAAAGTCGCTCTCTCTCTATTCTTGCTTCTTGTTTTTTTTTTTTTTTTTTTCTGCTCAAGATAAATTTTTGTTAACAATATATATATTCTTTTCTTGGTGACAAGCAAAAATTTCTTGTTACATAATAATTTTGAGATATGCGATAATCTTGTGAGAAAATAGAAAATAATTTTAGGTACCATAAGGAACATGATAGAGAAAAATAACTGAAAAAGGATCCATATTTAAAATCTTAGAAGTGGGATAATTTATGGCCTGTTTGGTAATCATTTTGTTTTAGTTTTTTGTTTTTGAAAATTAAACTTATAAACACTATTTCTACCTATTGTTTTGTTATCTACTTTTTATAAATATTTTCAAAATATAAGCCAAATTTTAAAAACTGAAAAAAAAAGTTATTTTCAAAAATTTGTTTTTGTTTTTAAAATTTGGCTAAGAATTCAAATGTGAGTTTTTTTTTTAATTTTTAGAATATCACAAATTAAGGGTAGGGGATTCGAACGTACGCCTTCTAAGAAAAAATAGAGATGCCTTAACGTGTTTTTAAGAACGGTGAAAAATACATCAAAGAAATTGTAGGAAACATGCATAATTTTTAAAAACAGAAATTAAAAATGAAATTGTTATCAAAAGGACCTTAGTATTTTACTAACTGGTGAATTGTTAGAAGGATTGCAATTGAAAAAGAGTAATATAATTATAAGATTTTTCTTAGAAAAAGGGTTTTTTTTTAATTATAAAAACTCTAAATTTAATAGAATGAGGTTGAATGTTTCGAAAAAGTTAAAATATTTAACTAAATTAAGTTTACCACTTGATGAAGACGAACAAGGGATATTCTTATTATTGGCGTTTATGTAAAAAAAACTAAATTAATGTGGAATTTTATGTTTTTCTAATACAATATATTTAAATTAATAAAATAGTCACATGAAGACCTACTTCTTCCCCGATGAAGTTCCGAGTTTTATCGAAATTTAAAATGTATAATTTTTTGTCTTATTTTCAGTAAGATTCTACATTTTTTGTTAATTGAAATTTATTACCAGGCAAATGCCCGCGTTGCTGGTATCATGGCCTGACATTTCCCATAATCGCAGGTTTTCGCGTATTCATCCTGTATTTATCGAAATCCAGTCCTTTGATTCAGTGCTGGGCAACAAAACAGAGCTATTCACTTGTAGGTTTCAGGTAATTTCGTTTCGTTTTTCTCGATCATCATTTTCGGAAGTAGTTTGGAGGCTCGAATTCAGCTGGACTCGAACATAATTAAACTTTCCATTTTCTCGTGTTGGAACAGAAACTGATCCGAATTCACAAATGTAGTGTTGAATATTAGTTTTTATTAATGTTGTTACGATTTGTTCCTTGAATTGGATATCGTTTTCAATTGGTTCTACTTTCCTTTTCAAATTTTGCACATATGTTTTTTTTGTGTGAGGATATTTTAAACACGCAAGCGTTATTCGATGCTATAATCTCCCTGCCGCGTTGTCTTTCTCTCTGAATTAAAGGTTACACTGACTTAACAAATGCTTTAACTCGAGCTTCTTGTTTTATGGCAGCAATAGGCATAGTAACTGGTATTGGATATTAACCACTATTCATGTTATTTACCTTGATTTGTTTATTTACTGGTCATGAATCCATGATGTCTTAGCTGTTAAGTTTACAAATCTTTAATTTGCAGGACGTTCAAAACGTTGAAAGCAAGACAGAAGTATGAGTGCAAGATATCATATTGGTGTTTGTTCTTTACATTTACTACTCTTCACTGCATTAATTAGTAAGCTTGCTTCATTCAGCTTATTTTGT
CCCTTTTCCAAATGTTGTATTTATTTTGTTCTCTTAAAGGATTTTTGTTTGTTTCAGTTGTACTTTTACAGGCTGTTGAAATGAACGCAATTGCAGTGCCAAGTTCCAGTTGCTATGTTTTTGACAATTCTAGTCACATTGTTGATTTTGTATCCTACATTGTATGGTCTGTCTTACAACTTACAAGGAACTGGATTTGGTAACATGTATTACCAAATGCACAGTTAAGTGGGAGATGGGAAAACAATTTGCATGCATAAAACAATCAAGATTATGTTGATCTTTCTGTAACTTTCCTGATTGTGATGATTCTTGACTTTATCGCGAATGGATCCTTTTAGCGGTTTGTGATTAGCAATGGATTCATTTTAGGAAACCTTTTCCATAAAGTTTCTTGTGTCTTTAAGTTGAATTTTCTTGATATAGTGGTTAATGTTGGTGGACCGGTCAATCTGTTAGCATCTTCATCTGTCATATTGAACTTTCTGTTTTTGTTTGCTAATCTGAACCGAAATTACTTCCCCTTCACATTTTATTTGCTTGTGCCTCAATTATGCTTCTGGTGTTATGATGTTTTTAGCTTTACTTAACTCGACAATTAGAGTAGCTGGATTGGACAGCCATTTGAATATGATGGGAAAGTGAGATTCTCTTTTAACTTAAATAATGCAACTTGAGACATTTTATGATATTTTAAGAATCCCGCCTATGAATACATGAATTCAGGCAGTATTGAGTCAAACATGTCTTGATATAGGGTTCTGACTTGGTGGTTCGATTTTGCAAAGATGTGGAAAGTAGATCGCAAACGGTAATACTGCTTTGCTTGATTACTTTACCATTACCTTTGTTTACTTTGAAGAAATGTTTTGTCAGAGCAATATTAGCGCCTTATTTGATGAAGAAAGATTTATATATGCATGTCATTTTTTATTCTAGATCTACCTACTGTTGGAGTTCCATCTCTAATGGGGTAAGACTTTGGTCATAGAGTGTTGGAATTCTTGAAAATGAATATATGTTATGGAACTTTTTCTTCAGCCAAAGGAAGTTGCTCTCCACTTCATTTTGGGTTAACCATTCTCAAGGAACTGTCAAAATTAAATGAGAATTGCATTGATTTCATATCATTCTTTTATACTCAGCAATAGTTAAGGAGTTTTGAAATACTTTGTGCCCTTTATTACTTAGTTGCAAGTTCTCAATGAAGTTAGCTCTGAAAGGGCTAATATAGTTTTATTTTGGCAAGTAACAAAAAGGTCATGAGTTCGAATCCCTTTTTTCATATCTTCTTATAAATAATGTTTTGAAGGGACAATTTTGGCCTGTAAAAAAATAATTCTTGATATGTGAGTTATGTGTGCGTGACTGTGGTGAATCAAGGTCAAAAGGCTATTTCATTCTCAGTGTCACTATTTTATACATCTTGAACATGTTCAAGCTATACATGTTTGGTCTCTGGAAAATTTATATTTTCATTATTGTTCTCTTCTTTGTTGTCAACGGATTGAGCAACCTCATTCACATTTAGTGCATTGTTCCGGATTAACCAGGGATATGTAGATTTTGGTCGATTTGACAAATTCAACTACTTTGTCACAGGTTCAGGACACATCAACTTTGTTCAAGTAAGCTTTGTTTTTAATGTAAATAGATGTTCATCTTTAGTTTTTTCTCTCATGGATGAGAACGTTAGTTATTCCATTATGTAAAAGGAAAAAAAAAAATCAACATTCTTTTCTCTTGCTATAACTTGTTGGATGTTATGGCCCTGGACATTGTAATCATTCCAATTTCCATCTGTTTAGGGTTATTACAATGGCGACCTGACTTCTTGTGAGCAGAGTTATGACAAATTGGGAAGGACTGCTCAGGTTGGTTTTTATATTGAAACAATCTCTCAATCCTTAAAACTTATGTATTTGAATCTATGGCTTTCTCATGGACCATTGTATTCTAATTCAGGTAAATGTCATATGTGGAGGTTGTTTAAATGGACA
ATGTAAAGGTTAGTCAGCTCTTCATGCCCAAGTCTTCTTCGTATATATGAACTCCTGAAATTTCTGGAAATTATGTTAATGCAATTTTGGCTGCCTGGAAGATGTGTTTTTTGGGATCCATGATAGCTTGAAAGGAATTGTATTTGGAACAATTAACTAAAAAGCAATAGAATCACACACGTGCTATACCAACTAAAAGATTTTTGAGATCGAATTTAGGCTAAAAGGCTCTGGCTCAAACACTGTCATAGGCTCATAGCCACCCTATGGAAGTTATGGCATGAAAGAAATAGAAAAACTTTAACGGAGATGATAAAAGATTTTTGGGTGTGTAAGTAATCTATTTCTTTAATTGTTTTGCAGTTTCCTTTTGGTATTCTACAGATAAGGCATTTTCTGATTACTATGTCTTCTTTGTAATCCTTCTAGCTTAAGGGGAAGGATCCCTTGGTCTTCCCTAGTTCTTTTTTGTAGTCTTTTCTTTTTGTTTTTAAAAATTTACTCCTTATCAAACAAAACACTTTGCATCTTCACAACTAAATTACACTTGCTACACTAACTCGGTGACAACAAATAAGAATTTTTTCTACCTTTAGTTTTGTTGCAAGAATTTAAGCCACCATGGAGAAGAGTTCAAACAAAAACTTCAAGTTTCCTAGGATATTTTTTTCTTCCTTTGGCTCTTGCTCAAGAGTGGGGGCATAGCCACCTCCAAAGTTGTAAAATCGTGAAAAATGATTCCTAAATTTTCCAACTTACCAGTTCTAGAATCCTGAGGACCCAAAGCCTGTTAATCACTTTGTTTGTTCATGAGAGTCGACCATCGTTCCAACCAACCTTCCTCAAAATTTCTCAATATTAGGTCCACCCACCATCTCTCACTCTAGAAATCCTTGAACATATATTATTGCTTCCAGCTAGACTGTAGCTGTACAAACTGGGGGAAGCAAGGTAAAGTAAAATGAAAGGCCCTCCAGCCATTGCCAAAACCTTATGGCTGAAGCTTAATCATCACCAACAACGAACCCTCTGTCTCTCATATTTCCCCAACTTTTTTTAGTAGCAATTTTTTATATGCTTTTTCAATATTTATTATTCGCACTGTCTAATTCTTAGATCTTTTAGTTCTTTGAGCAGTGCTCCTATCTGCCTTTTCTGAGAATTGTATGCACATGTTTGATCTTGGCGCACTTGGAATATGTTTCAACTAATGACTATATATATTCATTCATCTTATTATAGGTGGTCTGGGATGCATCTGCAATATCACTTATGAGTCCAGTTGCAGGTCTTGCACCGAATACCTTTATTTCTTTAAAGAGTTAATTTTCAGTTTGCTGAATCTATACATGTAAGCAATCCTTGAATGATGATTTTAAAGTTTTCTTGATGAGTAGATTAAAGACAAGGGCTTATTTAATTGAAATTCATGGAACTCTATGCAGAACATGAGTAGATCAATGGAAACTTTTTTCTTCCTTTGAAGCATGGGTGCCAAATCTTCTTGAAAATCATGGATGGTTGTTTCTTTCAAGTGCTTATTTGTTGTTGGCTGGGCTAAATATTTCCTTTTACAAAACGAAGAGAAAGTACAGAACATGAAAGTAAAGAATCTGTCATCTGCAGTTCTTAAAATTTGAGTCTAGTTTATGGCAGTGGTAATGGTAAACGCTCACTATTGCTTGGTTTGCCTGTGCTAACAATTGGCCTCATTGCTACTATCGTATTAAGTGGAAATGTAAAGGATCAATGGCAATGGCTTGAGGTAGAAATCTCAGCTCACTGTTACAGGAATTGGAGAACCTGTGTCTCCCAAGAACTATCCAAATTTTTACCCATAACCCGTCTCTTTCCTGCCTTCTCCTTCTTTAATACTCGCCCTAGTTTCTAACAAATGTCACTCCTATCTAACTCTTTGATTGGGCCCACTTTAAACCAAAAATTCATGTAATATCATTATGCAGCAAAATACGAAACTGCACTCACCCCTTTAAACATGC
ATTTTGTCCTGTACTGCAGTCATACCACCTCCCCTGCGCTCCCTTCCTTCTTTGGATGCAAGAAACATGGAACGGGGGCTTATCAGCCTGATAAAGAAATTTCTAGATAATGTTATGTTACGAGTTAGTTTTGTAGTGTAAACCACATCTAAATCAGACATACAACATCTCTATCCTGCATAATCCATCTTGTTGCAGTGAGTGAAGCATGGAAGCCAGTCTTGTTTCATTAATATTTGTAATAACTAGAATGATTGGAGATTTTTTTTTTGGAAGGAAAACAAAGCAAAGTGTCAATATTGACAAGTGCATTCCAAAATTTAGAATGCTGATTAGGGACGCTTCATAATGCTCTGGCATCTGGGACATAATTCTTGAAAGGGTTTCGTTCACTTGGCGCTGAGAAAATAACTCACTTATAATTATTATCTGCCTCAGAGTTATTGTCGATCTTGCCATCCCTTGTGAGATACAAGGCCCACGTGTTTTCAAAGGATTTACTGTTGGTTTTCACCCTCGATCCTGGGAAATTGTAAGAAATTTATGTTAACTCAAGTTGGTATAGGAAAAAAAAAAATCTAAGATTGAATTATTTACGAAGATATACCTTATATAATACCACAGGTTTACAATGGCTTGACTCAATTAGGCTATGAGAAGCCACACCGTGCATTCAGGTAAAGGAACACGGTCTTATACCCCTCCATTATATTCTGTCTCTTTGATAATGATTGAAGATCTATTCATCTTAGTTATTAAATCTCACATTTACATCTTCAGCTTCAGCACAGAGCAGACTCGCGTGGTTCTTTATATGACTGCAATTGCGTCACTTTCCTCTTTGGTACAGAGACCAATCATTCAGGTATTGAATTCCCAGTCAATTGAATAAAGAATTAAAAAATTGATATTCTTTGTAGCTGATGCACTATGATCAAAACATTTGCAGGTTTCTCCAGAAAATGGACTAGAGGTGAAAGTATCAGGCTCAGGGGCAACTGGGAGCTACCCTACAACTCTGTCACCCTCAATGTTGACGATTGACTGGAGATGTATGGGTTTTTTTTTACAATGGCATTCTCAATAACTTATCAATATTTACAATCTTCAAATTTTCATAAACCTGTAATTATCTCAGGTGATATTGCCAGGAACATTCTATATGAAGTTAATGTCACGGTCCCTGTGGCTGATTACGAACCAATTAGTTTTTTCCTTACCAAAATTTGTGGTGAGAATCTGTCTCTCTCACGAAAGATATCAACCATGCAATTTTTCTTAAGTTTAAGTTGTAGTAAGAACTGAATTAAGTGTAAACCCAGGAAATCCAGCGTTGACTATCAATGATTTTTTCCCCCTCCAGGCGTGAAAGTTAAATGTAAAACTAGTTTTGTGTTTCATATTTTGCAGACAATAGGCAGGACCTAGAAGGAGATTCTATGAAAGGATGGGCCACATTTGGAATATTGTCTTGCATGTATGTCAAGTTTTTCTTGTTTGTCTTCTGCATATACACAAGTATTTATCATCATCTTTCTGATAAGTATCTAGACTTCAGTTATGAAAATAGTTGGGGTGCTGTAGAATGGATATCTCTCTGCAAAGTAACGGGGTCTCCCTCAGCATGCTTTGGGTGGCTTAGAAAATATTACAATGATCTAGATGATCGTGATTTGTGCTGGCTTGATTGAAGTAGCCTCATTCAGCTGCAGTATCTTGCTCTATTTGATCAACTAGAAAATAAGCTTGAATTAACTTTTCAACAGCATGTTTAACATAATTTTTACGGTTCTGATGCCTGAAAAAATTGGCATGAACAGATTCATAGTCGTAACATCACTACTTTGTTGTGGAGGATTTGTTTACAAGGCCCAAGTGCAAGGCCAGGTAAGCTATTTTTCTATCCCATTGCAATCAGATATTTGCTATGAAATGTAATGTTATTGTTCGATATTTATGTTGAGTCAATTGCATGTAGCGTGGAATTGATGCATTGCCGG
GCATGACACTAGTATCCGCTTGCTTGGAAACTGTAAGTTTACAGCCCCTTTGATGTAATATTAAATTACCATATTTCTTGTTCTTTTCAATGCACCATTATTTCAAGAAACCAAAAAGTTTATGGGAGTAAAAGGTTTCCTTTGGAGCATCTTGTTCTGTTCGTTGATTATTCTTATTCATTATTTATACGCTTCCTTGGAAACTGAAAGTTACAGCCCCCTCTTCGATGTAATGTTAAGTTACCATATTTCTTGGCCTTGTCAATGCAACCATTATTTCAAGAAACCAAGAAGTTTCCTTTGGAGCATCTTGTTCTGTTCGTTGGTTATTCTTATTCATTATTCTTGGCATTAGGTTATGCATTTATGTCATCTCATCTTTGGAGTTTTTTCCCAGTATTAAAATATAAGTCCTTGGAGTCTCTTTTCATCTCATTCTTCAAGTTGGTTGATCTAGCATAATTCACATTACCACTGATAAGTTTCGGAGTGAAACTGGCTTATACTTACAGTGTCTTTTCAAGGCTTGTATTTTCTCGTCAGGCGTAGCAACACCTTGTTATGTTCTATTTGACATGTTTGAATTTGTATCCATGAAATTAGATAAGTGGAGGAGGACAAGGCTACTACCCGAGAGCGGAAGGCATCAACAATGCGTTCGTCAGTGAAGCCTCCTGGGAACAACGCCCATCATCTTCTTCTTCTTCTCGACGGACATGGACACCATCTGAGAAAAATTATGGTTCAATATGAGAAAAATTAGAGGGTTCATTTAACGAATCCGGTTTGTATGACACTGGGTCAAGGAGAATATAGGCAAAGCTTTGGGTTTTGAAGTTATCCACGTATTGGCATTTTCATAAATTGAAGAGTTCATCATGAATATGAGGGAGTAATGTTCTTTTTTTTTTTTTTTTTTTTGTGTGGTAGTGTTAACAAAAAGTTGAAAATGTCTGTAAGTATTAGAAATGTTGACATTTATCAATAAAAGTACTAATAATGATGAGATAAAATTTTGAAAAATTGTAATGGGTGTCAAAATTAATTTTTATATTTGCAAATATGTGATAGAGAAAAATAATTTGCAAATATGATAATTTTTTTAAACGTGTCTATTTAAACAATATTACGTAATTCTTTTTTTACTATTGTATATCACTGATATATCCTTTTTATAGTATATCACTGATATATCGCTTTAATAGTATATCAATGAGTTATTCTTCTTTATTATTGATACATGGTTATCAGTGATTGACCTTTATTTATTATTGATACATGACTATCGACAATTATACTTTTTCTTATTGATATATGATTATTAACTTACAATTGAATTTTGTTTTTCAAATTTTATTTTAAATTTTGGCCAATCAAACACACATTTCACAAGTTATATCAGCAAAGGACAATGTAAAGAAAAATGTGCTTCTTTCATAATGTGTATATTAGCAAGGAATTCGAAGATGGATATATTAGCAAGGTAGTGTAAAAGAAAATCAAAAGAAATAGAATGTGAAAGAAACAATAGAGTTATGAAAGGAGAAATTGAAAGAAGAGGAAGAAATTAGGGTAAACGACAAGAGACAAAACATAAGAAAACTCAAGAAATTAGGATAAAAATCAAGAGAGAAATAACAAAAAAATTAAGGTAAACGTGTTTAAGGAGAAGGGAAACGTTTCGTGCTATTGAAATTACTGATATATCATTTATGTGGTAAAATATAACACTGATATGTCTATATATAGTATATCATTAATATACCATTTGAATCATAAATAAAGTATATCACTGATATACTTAAGGTTATTTTAGACTTTTTACATATTCTACTTAGGCTAGACTAACTTTCTTGTTATATTTGCAATTTTTATAATGTGTGTGCTAGATTTGTTATTTATTTAGTTGTTTTTGCCAGATCTGCAAGTACCCCTAAAATTTTATTTTATTTTATCTACTTAGTTTAGGGAGTTAAAACGGTTCAATTTAAACTTTTAAT
TTAACCAAATCAAATGTTAAATATCTATTTAAAATAAATAAAAATTATCAACATGAGTATAGTTCAGTGATTAAAATATTTTTACTTCTTCTAAGGGTATTGTAGGTTTGAATCTCCATCTTTGTAGTTATGATGTAATATTCTATAAAAAAAAAAATAGAAGTGGTGCAATTTTCATGAGCTAAACTAAGTTCTTAAAATGTTGTTTTTATTTTTCTATGATATAATAAATAAATAAGTTCTTGAAGAACTATATTTGTATCGTTTCATTGACATATATTCCAAAAACGACTAGCAATATGAACCAAATCTATAGATTAAACCTGAGTGAAAATTATCCATCATCTTAACTAAATTCAAATAAGTTATTTAGAATCGCTCGTTAACATCTAACAAACATATCAATCTTCGTTCAAAGTTGATAACGAATGCTTGTTTGTTTGAACAGAACTATTTAATCAAGTTGAAACATTTGAAAATATGATTTGTGCTCTTGATTAAAAATGGAAGAAAGGGCCAAAAAAATTAATTGACATGATGTTTCACTTGAGTCTTAGGGGTCCCTTGGAAATGAAAAATTTATTTGGTCAAAAGAAGTGGTTGTCCTAACTGCTACCCCACAACTTTTCTTTTTTTGTTTTTTTGTTTTGTTTAGTTTTGTTTTTTTTTTTGTCTTCTTATCTTATATTATCTGGTTGGTTATGTTGACCTAATTTTCTACTTCCATTCATTAAGCATAATGGATATAAATTATCAAAGTTTTGGGAGATGACATGATTAAAGAGTACAAAACAAGGACTTTTGCTTTCTTTAGTGAAGATTGCCTTCTCCATAAACTTACCTCAACAATGACCAAAATATCAAATTAAATTTATAAAATGAAGAGTGTAGATAAAACCTAGATGGGTACCAGTTGTCATTTAGGGAATTTGAATGAATCTTGCCCATCTCCCTTTTGACATTTGAAACATGTATACTTTTAGTTTCTAGAAACTAAGAAGGAAGCAAAGAGACGACTGAAAGTCCCCCCGGAAAGGTCCCCCACATATTAATTATCAGATGAAGAAAATGCAAAAGAGAAACAAAAGGGTCAAAAGTACAACAATCTTTAACCTCACCTGTTCTGTTGACAATCTCACCCAAAAATTATTTTACTCAAAAGCTTGCCAATATTTTGATGGAGCCTGCTAAGTTTGACAAGTAAACTTAATGAACATATTCATTGTTCTGAGGTATGCAATGATTTATATGCCATTCAGTTTGTCAAAACATGCACATGCAAAGTAATTTCTTCTCCCCTTAACCAATACTTTAACATGCTGGTAATGCACAAGACTAATATTACAGCACAGAAAAAAAGCTACCAACTCTTACCTGCATTGCAAGTTAGCTACCAAATCCATTTCTTTCCGTCCCCTTTTTTCAACAGACACGACCATTCCACAAAGTCTCCACTATAAATAGCCACTACCCATTTCAACTCCTTTCACAGAAACATTTCAAGTAAGGGTTTTGGCCATATTATAAGATTTCAATATGGCTCAGACTTTGAGCTTTGTCTTTGTTCTTTCCCTTCTTGCCTTTGCTCCACTTTGTATCTCTAGCAGCAGTGGTGGCTATGGCTATCTATACCCGCAGTATTACGATCATTCATGCCCCAGAGCTAAAGACATAGTGAAGTCCATTCTAGCAAAGGCTTTTGCAAGAGAAGCTCGTATTGCTGCCTCTATCCTTAGACTTCACTTCCATGACTGTTTTGTTCAGGTCGTTTAAACAATCTTCTTTGTCGCCCTTTTGGCTTCGTTTTCTTTCAATCTTGCATGGGTTTTTGTGGGTATAAAGTAACTTAGAAAGTAATATGATGATTGTTTTCAGGGGTGTGATGCATCTTTACTATTGGATAGCAGTGGGAGCATCAATAGCGAAAAGAATTCGAACCCAAACAGGAACTCGGCTAGAGGGTTTGAGGTGATTGATGAGATGAAATCTGCACTGGAGAA
AGAGTGCCCACAAACTGTTTCTTGTGCTGATATCTTAACTTTGGCTGCAAGAGACTCCACTGTCATTGTAAGTGCTGAGTAGCCAAATCCATTCTTCTCAATCTATCTGTTGTATTGCATTTTCTTATGGAAATATATATTGGAATTGGTAGACGGGCGGACCCTATTGGGAGGTTCCATTGGGGAGGAGGGACTCGAAAACTGCGAGTCTGAGTGGCTCCAACAACAACATTCCAGCTCCAAACAACACATTTCAAACCATTCTCTCCAGATTCCAAAACCAAGGGCTTGACATAGTTGATCTTGTTGCCTTATCTGGTAAATTTTTTGGGGCTGTTTGTTGTAGGCCCAGTTGGTCTAACGTTTTAGTAGCGTTTTCTGTTTGTTTTCTAAAACTATTTTTGAAAATAGAAATAAACCCATTTAGTTTAAATATCATTTTGTCTATATAACCTTTCAAAATAGTACTATTTATTTTAGTCTCGTTTTGACTTTCTATTATTTATCATAAATACCATTTACTCAATAATTATTTTAACTAATCATATAACTTTAATATATATCTACCTAAACATATGCTAATACATGTTCTCGATTGAAATTATGTTATCTATTTGTGAAAAGTAAAATAATAATAATGATCATAATGATTATTTTAATAAAGTTTGATAATAAAATAGACATCTAAACATTCAAAAATCAAAATAAAATAATTAAAAAATTCAGAAATCAAAATAAAATAATGTTAAAGCTAATATTTTTTGGGTTTTAAACTTTTAAGCCCATATTCAGATGTGATCATCCTGTTTTGCATGCAGGAGGGCACACCATCGGAAATTCCAGGTGCACCAGCTTCAGGCAGAGGCTGTACAACCAGAACGGCAACGGACAACCCGACAAAACCCTTCCGGCGTCGTTAGCGGCCGAGCTCCGGACCCGCTGCCCAAGATCCGGCGGCGACAATAACCTCTTTTTCCTGGACTTCTTCACCCCGACCAAGTTCGACAACAGCTACTTCAAGAACATAGTTGCTTACAAAGGCCTGCTCAACTCCGACCAAGTTCTCCTCACCAGCAACGACGCGTCGGCTGCTTTGGTAAAGAAATACGCAGAAGACAACGAGCTGTTCTTCGAGCAATTCGCCAAATCTATGATCAAGATGGGCAATATCTCGCCATTGACGGGTCAAGTGGAGAGATCAGAAAAACTTGCAGGAAGATCAACAATTAATCAGTTGCGTTTCGTATTTTTCCAGAAAGAAATAAATATGAATGCACAGACGACTGAGGGAGAGTGCAGATTTTCTGGGTTTTTATTTCAATTGTTTTTCAAGAATTCGTGTCTGATCTGATCTGATCTGGGTTTTGTTTTCTATTCTAAGATATTTTAATTTGAAGCAGTGTATGTGTCGGCTGTCCATTGGTCCCTATGTATTTGCTCAATTTTATAGTTCTTGTCTTGTTCACTATAATGCTAAACAAATATGTTGAATTTACTTCAAATCTGATCATTTTCTCGTCAAATAAGAGACTTCTCAAATTTTCATAAGTTTAATTTAATTTTGGTAAATTATAAGTTTAGTCCTATACTAATAATGTGTGCATTTGATACTTAAATTTTAAAATTTGTCAGTTACATATATGCCATTAAGTTAAATTTTGTGATGAAAATTTCAACATTTCAATACCATGTAGATGTCAATTGAAATGTTGGTAACACGAGAATATTGAAAAATAACGACTTAATTAAAATATTGAAAATACAAAAATTTAAATGACAATGAATATTGATATATTTTTTTAAAACTACAAATCAAACAAAAGTAGAAATTGAAACCTACAAGGTTTAAATGAGTTGTAAATGTCTTGACTCTTGACCAATTGAATATTAATATTAAATAATAAATAATTAAAATAAATGAGTAATTTGATATTAATTAAAGGATGATAATTGATAAATATAAAAATTCTTTAGATAAATTAAAACTTAGCCCCTGAACT
TTTTGTGTCTAATAGATTCTAATTTATTAGAAATTTTTAAAATTCACCGATCTATTAGATATAAAATTAAATTTTGTGTGTAATATATCCCCAGCCTTTCAATTTTATTATAACATAAAATTGAAAGTTCATGAACCTATTAAACACAAAATTAAAAGTTTAAGTATCTATTAAAAACATTTTAAAGTTTAAAAATTACTAAATATATCTTTAAAAGTTTAAGATCAAATTGTAATTTAACTAAAATATTCATTTTAACGAGATAATGTAAAGATGCGTACCGAAAATTACCCCCCTTTTTAGGGGAAAAAAAACGAAACATCTAATTTACTATTGCGGAATGAGATGGACCGTTCTTCACAGGGACGGCCCAATATTTGTGGATCTACCAGATTCCAAAGCCCAACTAAAAAGCCCACCCAGCGAATTGACTGGCTCCTTCATTCTCATCGGTAGGGTTTATGCCTCCTTCCTGCGATTCCCGGCATCTGAACGACGGGCAACAGTCTAAATTCCAGCTCCAAAGTCATTAACCATGAATGCCCAGAGCTCATCGCCTCACTGTTCCTGCTGACTGAACATTCACGATTTATCACACACTAAAAAGATGAGACATTTGCGAATTCTGTCCCCTCATTTCTCGAGGTTCGCAGAACCTTCGCGCAAAACCCTTGCTGTGATTCTCCGTCTACTCATAGTTTAAATCTTCTAGGATTGGAGAGCTTGGCATCTTCTTCATTCCTGCACCATCGGACTTACACCTCTGCTTCACCGGAAGCAAGAGCGGCCCCTTCTGAGAGAGTTTCGGCCATTGTGGATGAGATCTCTGGTCTCACGCTACTCGAGGTCGCGGACCTCACGGAGGTTCTGCGCGAAAAATTGGATGTAAAGGATATGCCGGTAATGACGGTCATGATGCCTGGGATGGGGTTCGGCGGCTTGCAAGGCGCCGGCAAGGGTGGTCCTGGGGCAAAGGGCGGTGAGGAGAAAAAGGCAGAGAAGACGGCGTTTGATGTGAAGCTTGAAGCTTTTGATGCTGCGTCGAAGATCAAGGTCATTAAGGAAGTGCGGACGTTTACGAATTTGGGGTTGAAGGAGGCTAAGGATTTGGTGGAGAAGGCGCCAACGCTTCTGAAGAAGGGAGTGACAAAAGAGGAAGCTGATACCATTATTGCCAAGATGAAGGAGGTTGGCGCCAAAGTTTCAATGGAGTGA

mRNA sequence

ATGGCGCTTCACATGGGAAGCGCAGTCCCATACACCAACTTTACGTTTGTGCTATTCGATTCCTATACCAATCCTTCTCTTCAATGTCAGAATCTCAAGGTTCATCTCAATCTCTCGCAGTCCGTGGTTTGCGTGACTTGGTTCGAAGACCTCAAAGTGTCGATTCGAGTTCCTGTTCCTTCGGTTTTGGTTGACGCGGAGTCGCCCTTGAGTTTTAGAGCTTTTGAAGATCATATTGAGGTCAAGCTCGTCTTGCTTCTTCCGGTCGATCACCCAATTATTCTCAACTTCGACAATGTGCTGAATTTCTCCGAAGAGCGAGGAAATGACTACTCTAAGGCGTCGAAGCCACTTTTGATGGACTCTGATTTTGTTGAAATGCCATCAGTCAACTGGCGAGAGGTGGCTGATAACTGGTTTGGGTCTTGCTGCTGCTCCTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACAAATTCCTATAGATGTGCAAAGGGTGTCTGCCTACTCACTCTAACAACTATTACTCTTTCCAAGGATGATCTTATTGGACATATGTTCCCAGACTATGATGGGACCCGAGAATTCAAGGATGAATCAGATTTTACTGATGGAAATTGGTTAAGTGAAGCTAAGCAGGAATCACAATGTAATCATACATCTACGAAGGAGGTAAAATCTAAGAAGTTTAATGATGAAAACCTTGCTGCAAACATGGAGGGTGATGCTGCTGGGAAAGAAAGTGATGAAGTTAATTCACCTCATATGACTCCGATTCCTGAGTGTTGTCATCATGGAGAAAGTAATGTTTTAAATAATCTTGATAGAGACTGCATGCATCACACATGTGGCACATATAAGTTAGACCCAAAGCCTATTAATACTATAGATCTTTCGGACAATCAGAGATCCTTTCTTAATGGTTTCCTTGGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCAGATTTTGAGTGGGTTGAGTTTTTTTGCCCCCAGTGCTCAACTCTGATTGGGGCTTACCCTTGCAGTAATGGCTGCGGACCTACAGATGGTGGAGTTCGACTCTTTAAATGTTATGTCTCAGCATGTTCATCAGTTGAATCTAGAAATTTTTTGAGGGAGTACACCTTGGAAAGAATGTTTGCTAATCAGCTACTGGAAAGTGCAAATGAAGAATCATCATTTCGTACTGTGGTTAAGGAGCTGAAAACCAAGTCTTCCATGCTGCACATTGTTCTCATTAATTCAAATTCTTGGTCTTGTAGTGGTTATTGTTTGGGCATGAAGGATACTGCAGAATCAGTTCCAAAGATGGAGTTAAATCCTGTCATCAAGGTGCTATTTTCCGATTGCAACAAAAGTGCAGAATCTCATTTGAGGCAAATGCCCGCGTTGCTGGTATCATGGCCTGACATTTCCCATAATCGCAGGTTTTCGCGTATTCATCCTGTATTTATCGAAATCCAGTCCTTTGATTCAGTGCTGGGCAACAAAACAGAGCTATTCACTTGTAGGTTTCAGGACGTTCAAAACGTTGAAAGCAAGACAGAAGCTGTTGAAATGAACGCAATTGCAGTGCCAAGTTCCAGTTGCTATGTTTTTGACAATTCTAGTCACATTGTTGATTTTGTATCCTACATTGGATATGTAGATTTTGGTCGATTTGACAAATTCAACTACTTTGTCACAGGTTCAGGACACATCAACTTTGTTCAAGGTTATTACAATGGCGACCTGACTTCTTGTGAGCAGAGTTATGACAAATTGGGAAGGACTGCTCAGGTAAATGTCATATGTGGAGGTTGTTTAAATGGACAATGTAAAGGTGGTCTGGGATGCATCTGCAATATCACTTATGAGTCCAGTTGCAGAGTTATTGTCGATCTTGCCATCCCTTGTGAGATACAAGGCCCACGTGTTTTCAAAGGATTTACTGTTGGTTTTCACCCTCGATCCTGGGAAATTGTTTACAATGGCTTGACTCAATTAGG
CTATGAGAAGCCACACCGTGCATTCAGCTTCAGCACAGAGCAGACTCGCGTGGTTCTTTATATGACTGCAATTGCGTCACTTTCCTCTTTGGTACAGAGACCAATCATTCAGGTTTCTCCAGAAAATGGACTAGAGGTGAAAGTATCAGGCTCAGGGGCAACTGGGAGCTACCCTACAACTCTGTCACCCTCAATGTTGACGATTGACTGGAGATGTGATATTGCCAGGAACATTCTATATGAAGTTAATGTCACGGTCCCTGTGGCTGATTACGAACCAATTAGTTTTTTCCTTACCAAAATTTGTGACAATAGGCAGGACCTAGAAGGAGATTCTATGAAAGGATGGGCCACATTTGGAATATTGTCTTGCATATTCATAGTCGTAACATCACTACTTTGTTGTGGAGGATTTGTTTACAAGGCCCAAGTGCAAGGCCAGCGTGGAATTGATGCATTGCCGGGCATGACACTAGTATCCGCTTGCTTGGAAACTATAAGTGGAGGAGGACAAGGCTACTACCCGAGAGCGGAAGGCATCAACAATGCGTTCGTCAGTGAAGCCTCCTGGGAACAACGCCCATCATCTTCTTCTTCTTCTCGACGGACATGGACACCATCTGAGAAAAATTATGATTTCAATATGGCTCAGACTTTGAGCTTTGTCTTTGTTCTTTCCCTTCTTGCCTTTGCTCCACTTTGTATCTCTAGCAGCAGTGGTGGCTATGGCTATCTATACCCGCAGTATTACGATCATTCATGCCCCAGAGCTAAAGACATAGTGAAGTCCATTCTAGCAAAGGCTTTTGCAAGAGAAGCTCGTATTGCTGCCTCTATCCTTAGACTTCACTTCCATGACTGTTTTGTTCAGGGGTGTGATGCATCTTTACTATTGGATAGCAGTGGGAGCATCAATAGCGAAAAGAATTCGAACCCAAACAGGAACTCGGCTAGAGGGTTTGAGGTGATTGATGAGATGAAATCTGCACTGGAGAAAGAGTGCCCACAAACTGTTTCTTGTGCTGATATCTTAACTTTGGCTGCAAGAGACTCCACTGTCATTACGGGCGGACCCTATTGGGAGGTTCCATTGGGGAGGAGGGACTCGAAAACTGCGAGTCTGAGTGGCTCCAACAACAACATTCCAGCTCCAAACAACACATTTCAAACCATTCTCTCCAGATTCCAAAACCAAGGGCTTGACATAGTTGATCTTGTTGCCTTATCTGGAGGGCACACCATCGGAAATTCCAGGTGCACCAGCTTCAGGCAGAGGCTGTACAACCAGAACGGCAACGGACAACCCGACAAAACCCTTCCGGCGTCGTTAGCGGCCGAGCTCCGGACCCGCTGCCCAAGATCCGGCGGCGACAATAACCTCTTTTTCCTGGACTTCTTCACCCCGACCAAGTTCGACAACAGCTACTTCAAGAACATAGTTGCTTACAAAGGCCTGCTCAACTCCGACCAAGTTCTCCTCACCAGCAACGACGCGTCGGCTGCTTTGGTAAAGAAATACGCAGAAGACAACGAGCTGTTCTTCGAGCAATTCGCCAAATCTATGATCAAGATGGGCAATATCTCGCCATTGACGGGTCAAGTGGAGAGATCAGAAAAACTTGCAGGAAGATCAACAATTAATCAGTTGCGTTTCGTATTTTTCCAGAAAGAAATAAATATGAATGCACAGACGACTGAGGGAGAGTGCAGATTTTCTGGGGACGGCCCAATATTTGTGGATCTACCAGATTCCAAAGCCCAACTAAAAAGCCCACCCAGCGAATTGACTGGCTCCTTCATTCTCATCGATGAGACATTTGCGAATTCTGTCCCCTCATTTCTCGAGGTTCGCAGAACCTTCGCGCAAAACCCTTGCTGTGATTCTCCGTCTACTCATAGTTTAAATCTTCTAGGATTGGAGAGCTTGGCATCTTCTTCATTCCTGCACCATCGGACTTACACCTCTGCTTCACCGGAAGCAAGAGCGGCCCCTTCTGAGA
GAGTTTCGGCCATTGTGGATGAGATCTCTGGTCTCACGCTACTCGAGGTCGCGGACCTCACGGAGGTTCTGCGCGAAAAATTGGATGTAAAGGATATGCCGGTAATGACGGTCATGATGCCTGGGATGGGGTTCGGCGGCTTGCAAGGCGCCGGCAAGGGTGGTCCTGGGGCAAAGGGCGGTGAGGAGAAAAAGGCAGAGAAGACGGCGTTTGATGTGAAGCTTGAAGCTTTTGATGCTGCGTCGAAGATCAAGGTCATTAAGGAAGTGCGGACGTTTACGAATTTGGGGTTGAAGGAGGCTAAGGATTTGGTGGAGAAGGCGCCAACGCTTCTGAAGAAGGGAGTGACAAAAGAGGAAGCTGATACCATTATTGCCAAGATGAAGGAGGTTGGCGCCAAAGTTTCAATGGAGTGA

Coding sequence (CDS)

ATGGCGCTTCACATGGGAAGCGCAGTCCCATACACCAACTTTACGTTTGTGCTATTCGATTCCTATACCAATCCTTCTCTTCAATGTCAGAATCTCAAGGTTCATCTCAATCTCTCGCAGTCCGTGGTTTGCGTGACTTGGTTCGAAGACCTCAAAGTGTCGATTCGAGTTCCTGTTCCTTCGGTTTTGGTTGACGCGGAGTCGCCCTTGAGTTTTAGAGCTTTTGAAGATCATATTGAGGTCAAGCTCGTCTTGCTTCTTCCGGTCGATCACCCAATTATTCTCAACTTCGACAATGTGCTGAATTTCTCCGAAGAGCGAGGAAATGACTACTCTAAGGCGTCGAAGCCACTTTTGATGGACTCTGATTTTGTTGAAATGCCATCAGTCAACTGGCGAGAGGTGGCTGATAACTGGTTTGGGTCTTGCTGCTGCTCCTTTGGGGGGATAAGCGAGAAGCTGGTAACTAGGTATACAAATTCCTATAGATGTGCAAAGGGTGTCTGCCTACTCACTCTAACAACTATTACTCTTTCCAAGGATGATCTTATTGGACATATGTTCCCAGACTATGATGGGACCCGAGAATTCAAGGATGAATCAGATTTTACTGATGGAAATTGGTTAAGTGAAGCTAAGCAGGAATCACAATGTAATCATACATCTACGAAGGAGGTAAAATCTAAGAAGTTTAATGATGAAAACCTTGCTGCAAACATGGAGGGTGATGCTGCTGGGAAAGAAAGTGATGAAGTTAATTCACCTCATATGACTCCGATTCCTGAGTGTTGTCATCATGGAGAAAGTAATGTTTTAAATAATCTTGATAGAGACTGCATGCATCACACATGTGGCACATATAAGTTAGACCCAAAGCCTATTAATACTATAGATCTTTCGGACAATCAGAGATCCTTTCTTAATGGTTTCCTTGGAAATATCTTTATGGCTAGACTGTCAAATCTTTCAGCAGATTTTGAGTGGGTTGAGTTTTTTTGCCCCCAGTGCTCAACTCTGATTGGGGCTTACCCTTGCAGTAATGGCTGCGGACCTACAGATGGTGGAGTTCGACTCTTTAAATGTTATGTCTCAGCATGTTCATCAGTTGAATCTAGAAATTTTTTGAGGGAGTACACCTTGGAAAGAATGTTTGCTAATCAGCTACTGGAAAGTGCAAATGAAGAATCATCATTTCGTACTGTGGTTAAGGAGCTGAAAACCAAGTCTTCCATGCTGCACATTGTTCTCATTAATTCAAATTCTTGGTCTTGTAGTGGTTATTGTTTGGGCATGAAGGATACTGCAGAATCAGTTCCAAAGATGGAGTTAAATCCTGTCATCAAGGTGCTATTTTCCGATTGCAACAAAAGTGCAGAATCTCATTTGAGGCAAATGCCCGCGTTGCTGGTATCATGGCCTGACATTTCCCATAATCGCAGGTTTTCGCGTATTCATCCTGTATTTATCGAAATCCAGTCCTTTGATTCAGTGCTGGGCAACAAAACAGAGCTATTCACTTGTAGGTTTCAGGACGTTCAAAACGTTGAAAGCAAGACAGAAGCTGTTGAAATGAACGCAATTGCAGTGCCAAGTTCCAGTTGCTATGTTTTTGACAATTCTAGTCACATTGTTGATTTTGTATCCTACATTGGATATGTAGATTTTGGTCGATTTGACAAATTCAACTACTTTGTCACAGGTTCAGGACACATCAACTTTGTTCAAGGTTATTACAATGGCGACCTGACTTCTTGTGAGCAGAGTTATGACAAATTGGGAAGGACTGCTCAGGTAAATGTCATATGTGGAGGTTGTTTAAATGGACAATGTAAAGGTGGTCTGGGATGCATCTGCAATATCACTTATGAGTCCAGTTGCAGAGTTATTGTCGATCTTGCCATCCCTTGTGAGATACAAGGCCCACGTGTTTTCAAAGGATTTACTGTTGGTTTTCACCCTCGATCCTGGGAAATTGTTTACAATGGCTTGACTCAATTAGG
CTATGAGAAGCCACACCGTGCATTCAGCTTCAGCACAGAGCAGACTCGCGTGGTTCTTTATATGACTGCAATTGCGTCACTTTCCTCTTTGGTACAGAGACCAATCATTCAGGTTTCTCCAGAAAATGGACTAGAGGTGAAAGTATCAGGCTCAGGGGCAACTGGGAGCTACCCTACAACTCTGTCACCCTCAATGTTGACGATTGACTGGAGATGTGATATTGCCAGGAACATTCTATATGAAGTTAATGTCACGGTCCCTGTGGCTGATTACGAACCAATTAGTTTTTTCCTTACCAAAATTTGTGACAATAGGCAGGACCTAGAAGGAGATTCTATGAAAGGATGGGCCACATTTGGAATATTGTCTTGCATATTCATAGTCGTAACATCACTACTTTGTTGTGGAGGATTTGTTTACAAGGCCCAAGTGCAAGGCCAGCGTGGAATTGATGCATTGCCGGGCATGACACTAGTATCCGCTTGCTTGGAAACTATAAGTGGAGGAGGACAAGGCTACTACCCGAGAGCGGAAGGCATCAACAATGCGTTCGTCAGTGAAGCCTCCTGGGAACAACGCCCATCATCTTCTTCTTCTTCTCGACGGACATGGACACCATCTGAGAAAAATTATGATTTCAATATGGCTCAGACTTTGAGCTTTGTCTTTGTTCTTTCCCTTCTTGCCTTTGCTCCACTTTGTATCTCTAGCAGCAGTGGTGGCTATGGCTATCTATACCCGCAGTATTACGATCATTCATGCCCCAGAGCTAAAGACATAGTGAAGTCCATTCTAGCAAAGGCTTTTGCAAGAGAAGCTCGTATTGCTGCCTCTATCCTTAGACTTCACTTCCATGACTGTTTTGTTCAGGGGTGTGATGCATCTTTACTATTGGATAGCAGTGGGAGCATCAATAGCGAAAAGAATTCGAACCCAAACAGGAACTCGGCTAGAGGGTTTGAGGTGATTGATGAGATGAAATCTGCACTGGAGAAAGAGTGCCCACAAACTGTTTCTTGTGCTGATATCTTAACTTTGGCTGCAAGAGACTCCACTGTCATTACGGGCGGACCCTATTGGGAGGTTCCATTGGGGAGGAGGGACTCGAAAACTGCGAGTCTGAGTGGCTCCAACAACAACATTCCAGCTCCAAACAACACATTTCAAACCATTCTCTCCAGATTCCAAAACCAAGGGCTTGACATAGTTGATCTTGTTGCCTTATCTGGAGGGCACACCATCGGAAATTCCAGGTGCACCAGCTTCAGGCAGAGGCTGTACAACCAGAACGGCAACGGACAACCCGACAAAACCCTTCCGGCGTCGTTAGCGGCCGAGCTCCGGACCCGCTGCCCAAGATCCGGCGGCGACAATAACCTCTTTTTCCTGGACTTCTTCACCCCGACCAAGTTCGACAACAGCTACTTCAAGAACATAGTTGCTTACAAAGGCCTGCTCAACTCCGACCAAGTTCTCCTCACCAGCAACGACGCGTCGGCTGCTTTGGTAAAGAAATACGCAGAAGACAACGAGCTGTTCTTCGAGCAATTCGCCAAATCTATGATCAAGATGGGCAATATCTCGCCATTGACGGGTCAAGTGGAGAGATCAGAAAAACTTGCAGGAAGATCAACAATTAATCAGTTGCGTTTCGTATTTTTCCAGAAAGAAATAAATATGAATGCACAGACGACTGAGGGAGAGTGCAGATTTTCTGGGGACGGCCCAATATTTGTGGATCTACCAGATTCCAAAGCCCAACTAAAAAGCCCACCCAGCGAATTGACTGGCTCCTTCATTCTCATCGATGAGACATTTGCGAATTCTGTCCCCTCATTTCTCGAGGTTCGCAGAACCTTCGCGCAAAACCCTTGCTGTGATTCTCCGTCTACTCATAGTTTAAATCTTCTAGGATTGGAGAGCTTGGCATCTTCTTCATTCCTGCACCATCGGACTTACACCTCTGCTTCACCGGAAGCAAGAGCGGCCCCTTCTGAGA
GAGTTTCGGCCATTGTGGATGAGATCTCTGGTCTCACGCTACTCGAGGTCGCGGACCTCACGGAGGTTCTGCGCGAAAAATTGGATGTAAAGGATATGCCGGTAATGACGGTCATGATGCCTGGGATGGGGTTCGGCGGCTTGCAAGGCGCCGGCAAGGGTGGTCCTGGGGCAAAGGGCGGTGAGGAGAAAAAGGCAGAGAAGACGGCGTTTGATGTGAAGCTTGAAGCTTTTGATGCTGCGTCGAAGATCAAGGTCATTAAGGAAGTGCGGACGTTTACGAATTTGGGGTTGAAGGAGGCTAAGGATTTGGTGGAGAAGGCGCCAACGCTTCTGAAGAAGGGAGTGACAAAAGAGGAAGCTGATACCATTATTGCCAAGATGAAGGAGGTTGGCGCCAAAGTTTCAATGGAGTGA

Protein sequence

MALHMGSAVPYTNFTFVLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFEDHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGVCLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKSKKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYKLDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTKSSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQMPALLVSWPDISHNRRFSRIHPVFIEIQSFDSVLGNKTELFTCRFQDVQNVESKTEAVEMNAIAVPSSSCYVFDNSSHIVDFVSYIGYVDFGRFDKFNYFVTGSGHINFVQGYYNGDLTSCEQSYDKLGRTAQVNVICGGCLNGQCKGGLGCICNITYESSCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFSTEQTRVVLYMTAIASLSSLVQRPIIQVSPENGLEVKVSGSGATGSYPTTLSPSMLTIDWRCDIARNILYEVNVTVPVADYEPISFFLTKICDNRQDLEGDSMKGWATFGILSCIFIVVTSLLCCGGFVYKAQVQGQRGIDALPGMTLVSACLETISGGGQGYYPRAEGINNAFVSEASWEQRPSSSSSSRRTWTPSEKNYDFNMAQTLSFVFVLSLLAFAPLCISSSSGGYGYLYPQYYDHSCPRAKDIVKSILAKAFAREARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEKECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTILSRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCPRSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDNELFFEQFAKSMIKMGNISPLTGQVERSEKLAGRSTINQLRFVFFQKEINMNAQTTEGECRFSGDGPIFVDLPDSKAQLKSPPSELTGSFILIDETFANSVPSFLEVRRTFAQNPCCDSPSTHSLNLLGLESLASSSFLHHRTYTSASPEARAAPSERVSAIVDEISGLTLLEVADLTEVLREKLDVKDMPVMTVMMPGMGFGGLQGAGKGGPGAKGGEEKKAEKTAFDVKLEAFDAASKIKVIKEVRTFTNLGLKEAKDLVEKAPTLLKKGVTKEEADTIIAKMKEVGAKVSME
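
The protein sequence above can be cross-checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and uses only the first 36 and last 15 nucleotides of the CDS shown on this page; `translate` is an illustrative helper, not the tool the database used to generate the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGCGCTTCACATGGGAAGCGCAGTCCCATACACC"
cds_end = "GTTTCAATGGAGTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve and last four residues of the listed protein, which is consistent with the CDS ending in a TGA stop codon.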
Homology
BLAST of Sgr026053 vs. NCBI nr
Match: QCD81783.1 (Ubiquitin-conjugating enzyme E2-binding protein [Vigna unguiculata])

HSP 1 Score: 826.2 bits (2133), Expect = 4.3e-235
Identity = 449/936 (47.97%), Positives = 595/936 (63.57%), Query Frame = 0

Query: 7   SAVPYTNFTFVLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDA 66
           S VP         D   NPSL C +L ++L+ S S + +T      +S+RVP+P+VL+DA
Sbjct: 18  SHVPTLRLMVFPNDKTLNPSLHCHDLAINLHSSHSFLTLT---TSSLSLRVPLPAVLLDA 77

Query: 67  ESPLSFRAFEDHIEVKLVLLLPVDHPIILNF--------DNVLNFSEERG-------NDY 126
           +SP++FR   DHIEVKL+LLLPVDHPI+ +         D +++ S+ +        + Y
Sbjct: 78  DSPVTFRPLSDHIEVKLLLLLPVDHPILSSLHPSPTPLPDPLVSESDVKKLSSAGEVDFY 137

Query: 127 SKASKPLLMD---SDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 186
            +     L +    +FVEMPSVNWREVADNWFG+CCCSFGGISEK+V RY +SY C  GV
Sbjct: 138 CRTCTFKLTEIPLRNFVEMPSVNWREVADNWFGACCCSFGGISEKMVMRYVSSYTCMPGV 197

Query: 187 CLLTLTTITLSKDDLIGHMFPD---------------YDG------TREFKDE-----SD 246
           CLLT  ++T+ KDDL+ + FP+                DG      + E  DE     SD
Sbjct: 198 CLLTSASVTICKDDLVEYNFPEGCAKQECTSVAENPRDDGIVKLLRSCELNDERTSTCSD 257

Query: 247 FTDGNWLSEAKQESQC------NHTSTKEVKSKKFNDENLAANMEGDAA----------G 306
               +  S+ ++ S C      N       + ++  DE L+  +  + A           
Sbjct: 258 DERTSTCSDDERTSTCSDDGGVNLAFDSNYRFERSEDEKLSMKLRSEVAKSKPDCGHFSD 317

Query: 307 KESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYKLDPKPINTIDLSDNQRSF 366
              D   +  +T IP CC H ++N+    D D  HH+CGT   +  P  T+++  NQ++ 
Sbjct: 318 SHPDSNGTKDVTEIPSCCAHMKNNL---GDEDSEHHSCGTAGREGMPTETLEILGNQKTL 377

Query: 367 LNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLFKCYVSAC 426
           LNGFL +IFMARLSNL+ D +W EF CPQC+++IGAYPC  G  P DGGVRLFKCY+S C
Sbjct: 378 LNGFLEDIFMARLSNLTKDIDWREFTCPQCASIIGAYPCCEGHTPVDGGVRLFKCYISTC 437

Query: 427 SSV-ESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTKSSMLHIVLINSNSWSCS 486
             V  S +   +YTL +MFAN+L+E AN+ES FR V+++L TK+ +L I+L+N ++WSCS
Sbjct: 438 LPVGGSEDMFSKYTLGKMFANRLMECANDESLFRFVIRDLTTKAPVLQIILLNPDTWSCS 497

Query: 487 GYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQMPALL---VSWPDISHNRRFS 546
           G C G +D  ESV K++L P+IKVL+SD + + ES  R   +L    +++P I    +  
Sbjct: 498 GNCSGTED-KESVHKLKLQPIIKVLYSDFHNATESQSRCRFSLTPPHIAFPKIPIKNQSK 557

Query: 547 RIHPVFIEIQSFDSVLGNKTELFTCRFQDVQNVESKTEAVEMNAIAVPSSSCYVFDNSSH 606
             + +   ++ F    G+  E FT   Q   ++  +                +  D    
Sbjct: 558 --NALLDGVRRFMPRPGHAAEFFTVLMQQGSDLIVR----------------FCKD---- 617

Query: 607 IVDFVSYIGYVDFGRFDKFNYFVTGSGHINFVQGYYNGDLTSCEQSYDKLGRTAQVNVIC 666
            V+  S  GYV FGRFDKFN FV GSG  +F+Q YYNGDL  CEQSYDK+GRTAQVN++C
Sbjct: 618 -VESRSQTGYVGFGRFDKFNNFVAGSGQHDFMQEYYNGDLMGCEQSYDKMGRTAQVNMVC 677

Query: 667 GGCLNGQCKGGLGCICNITYESSCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNG 726
           G C NG CKG  GCICN+T+ES+CRV+VDLAIPC+  GP VF+GFTVGFHPRSWE+VYNG
Sbjct: 678 GSCSNGLCKGRPGCICNVTHESNCRVLVDLAIPCDKPGPHVFQGFTVGFHPRSWELVYNG 737

Query: 727 LTQLGYEKPHRAFSFSTEQTRVVLYMTAIASLSSLVQRPIIQVSPENGLEVKVSGSGATG 786
           LTQ+G+E+PH  FSF T QT+VVL+MTA+ASLSSLVQ+P ++V P+ GLEV++SGS   G
Sbjct: 738 LTQIGFEEPHHDFSFHTGQTQVVLFMTAVASLSSLVQKPSLKVHPDKGLEVRLSGSAIKG 797

Query: 787 SYPTTLSPSMLTIDWRCDIARNILYEVNVTVPVADYEPISFFLTKICDNRQDLEGDSMKG 846
             PTTLSPSML +DWRC++AR+  YEVN+T+PV  YEPI F LTK CD  QD  G   +G
Sbjct: 798 MPPTTLSPSMLIVDWRCEVARDTPYEVNITIPVQGYEPIQFVLTKQCDYTQDPGGGRTRG 857

Query: 847 WATFGILSCIFIVVTSLLCCGGFVYKAQVQGQRGIDALPGMTLVSACLETISGGGQGYYP 879
           WA FG+LSCIF V ++L CCGGFVYK +V+ QRGIDALPGMT++SACLET+SG G G Y 
Sbjct: 858 WAIFGVLSCIFFVSSTLFCCGGFVYKTKVERQRGIDALPGMTILSACLETVSGVGHG-YS 917

BLAST of Sgr026053 vs. NCBI nr
Match: XP_022133273.1 (uncharacterized protein LOC111005900 [Momordica charantia])

HSP 1 Score: 809.7 bits (2090), Expect = 4.2e-230
Identity = 395/477 (82.81%), Positives = 424/477 (88.89%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDS+TNPSLQCQNLKVHLNL QSVVC TW +DL+VSIRVP+P VLVD+ESPLSFRAFE
Sbjct: 30  LLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           DHIEVKL LLLPVDHPI+LNFDNVLN SEERGN YSKASKPLLMDSD             
Sbjct: 90  DHIEVKLFLLLPVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDSDQNSLSRTGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSESPLRNFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITLSKDD+IGH+FPDYDGTR+FKDESDF DGNWL+EAKQE QCN TS K+VK 
Sbjct: 210 CLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKP 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+ ND+ LAANMEGDA  KE +EV+SP+MTPIP+CCHHGESNVLN+LDRDCMHHTC TYK
Sbjct: 270 KQSNDKTLAANMEGDATEKEREEVDSPNMTPIPDCCHHGESNVLNHLDRDCMHHTCSTYK 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPINTIDLSD+QRSFLNGFLGNIFMARLSNLSADFEWVEFFCP+CSTLIGAYPCSN 
Sbjct: 330 LDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNR 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS CSSVES N LREYTLERMFANQLLESAN+ESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCSSVESGNLLREYTLERMFANQLLESANDESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
           S MLHIVLINS SWSCSGYCLGM+DTAESV K++L+PVIKVLFSDC+KSAESHLR++
Sbjct: 450 SPMLHIVLINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKL 506

BLAST of Sgr026053 vs. NCBI nr
Match: KAG6603940.1 (hypothetical protein SDJN03_04549, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 800.4 bits (2066), Expect = 2.5e-227
Identity = 406/553 (73.42%), Positives = 443/553 (80.11%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDSYTNPSLQCQNLKVHLNL QSVVCV W +DL++SIRVP+P VLVDAESPLSFRAFE
Sbjct: 30  LLFDSYTNPSLQCQNLKVHLNLQQSVVCVAWLQDLEMSIRVPMPPVLVDAESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           DHIEVKLVLLLPVDHPIILNF+NVL+FSE+RG+  SKA KPL MD D             
Sbjct: 90  DHIEVKLVLLLPVDHPIILNFNNVLDFSEKRGHSNSKALKPLSMDYDQSSLSRSGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFG+CCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCYFRLSESPLRNFVEMPSVNWREVADNWFGTCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITL KDDLIGH FPDYDGTRE KDESDFTDGNW +EAKQESQCNHTST EVKS
Sbjct: 210 CLLTLTTITLYKDDLIGHAFPDYDGTRELKDESDFTDGNWSTEAKQESQCNHTSTGEVKS 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+FN +NL A  EG+AA K SDEV+SP +T IP+   HGESNVL++LDRDCMHHTCGTY+
Sbjct: 270 KQFNYKNLVAKTEGNAAVKGSDEVDSPLVTSIPDLHQHGESNVLHDLDRDCMHHTCGTYE 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPINT+D+SD+Q SFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NG
Sbjct: 330 LDPKPINTVDVSDDQISFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPCRNG 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS C S E  N  REYTLE+MFA+QLLESANEESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCLSTEPENLFREYTLEKMFASQLLESANEESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLR----- 496
           S+MLHIVLINSNSWSCSGYCLGM+DTAE VPK++LNP+IKVLFSDCNKSAESHLR     
Sbjct: 450 STMLHIVLINSNSWSCSGYCLGMEDTAEVVPKVDLNPIIKVLFSDCNKSAESHLRKLEEW 509

Query: 497 -----------------QMPALLVSWPD-------------ISHNRRFSRIHPVFIEIQS 507
                            ++  +LVS  D             ++   RFSRIH V ++ QS
Sbjct: 510 VTKDIAEEVFMLAHQIEELNEILVSRNDTLPSSCSSLDGLTLTSILRFSRIHSVVLKFQS 569

BLAST of Sgr026053 vs. NCBI nr
Match: XP_023543348.1 (uncharacterized protein LOC111803252 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 788.1 bits (2034), Expect = 1.3e-223
Identity = 385/477 (80.71%), Positives = 418/477 (87.63%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDSYTNPSLQCQNLKVHLNL QSVVCV W +DL++SIRVP+P VLVDAESPLSFRAFE
Sbjct: 30  LLFDSYTNPSLQCQNLKVHLNLQQSVVCVAWLQDLEMSIRVPMPPVLVDAESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           DHIEVKLVLLLPVDHPIILNFDNVL+FSE RG+  SKA KPL MD D             
Sbjct: 90  DHIEVKLVLLLPVDHPIILNFDNVLDFSETRGHSNSKALKPLSMDYDQSSLSRSGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFG+CCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSESPLRNFVEMPSVNWREVADNWFGTCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITLSKDDLIGH+FPDYDGTRE KDESDFTDGNWL+EAKQESQCNHTST+EVKS
Sbjct: 210 CLLTLTTITLSKDDLIGHVFPDYDGTRELKDESDFTDGNWLTEAKQESQCNHTSTEEVKS 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+FN +NL A  EG+AA K SDEV+SP +T IP+   HGESNVL++LDRDCMHHTCGTY+
Sbjct: 270 KQFNYKNLVAKTEGNAAVKGSDEVDSPLVTSIPDLHQHGESNVLHDLDRDCMHHTCGTYE 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPINT+D+SD+QRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NG
Sbjct: 330 LDPKPINTVDVSDDQRSFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPCRNG 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS C S ES N  R+YTLE+MFA+QLLESANEESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCLSTESENLFRDYTLEKMFASQLLESANEESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
           S+MLHIVLINSNSWSCSGYCLGM+DTAE VPK++LNP+IKVLFSDCNKSAESHLR++
Sbjct: 450 SAMLHIVLINSNSWSCSGYCLGMEDTAEVVPKVDLNPIIKVLFSDCNKSAESHLRKL 506

BLAST of Sgr026053 vs. NCBI nr
Match: XP_038883816.1 (uncharacterized protein LOC120074678 isoform X1 [Benincasa hispida])

HSP 1 Score: 779.6 bits (2012), Expect = 4.6e-221
Identity = 379/477 (79.45%), Positives = 413/477 (86.58%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDSYTNPSLQCQNLKVHLNL QSVVCV W +DL +SIRVP+P VLVDAESPLSFRAFE
Sbjct: 30  LLFDSYTNPSLQCQNLKVHLNLQQSVVCVAWLQDLDMSIRVPMPPVLVDAESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMD--------------- 136
           DHIEVKLVLLLPVDHPIILNFDNVL+F +ERGN +SKA+KPL MD               
Sbjct: 90  DHIEVKLVLLLPVDHPIILNFDNVLDFPQERGNSHSKATKPLSMDFDQISLSRSGGVHFY 149

Query: 137 -------------SDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                         DFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSKAPLRDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITLSKDDL GH+FPDYDGTREFKDESD TDGN L+EAKQES CNHTS ++VKS
Sbjct: 210 CLLTLTTITLSKDDLNGHVFPDYDGTREFKDESDLTDGNCLTEAKQESPCNHTSAEKVKS 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+FN +N  A+MEG+AA K ++EV+SP +TP P+CCHH ES+VL++LDRDCMHHTCGTY 
Sbjct: 270 KQFNYKNFVADMEGNAAEKGNEEVDSPILTPFPDCCHHEESSVLHHLDRDCMHHTCGTYN 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPIN++D+SD+QRSFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC  G
Sbjct: 330 LDPKPINSVDISDDQRSFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPCKKG 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTD GVRLFKCYVS C S ES N LREYTLERMFANQLLESANEESSFRTVVKELKTK
Sbjct: 390 CGPTDCGVRLFKCYVSTCLSAESGNLLREYTLERMFANQLLESANEESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
             MLHIVLINSNSWSCSGYCLGM+D AE VPK++LNP+IKVLFSDCNKSAESHLR++
Sbjct: 450 FPMLHIVLINSNSWSCSGYCLGMEDNAEFVPKVDLNPIIKVLFSDCNKSAESHLRKL 506

BLAST of Sgr026053 vs. ExPASy Swiss-Prot
Match: Q9FJZ9 (Peroxidase 72 OS=Arabidopsis thaliana OX=3702 GN=PER72 PE=1 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 1.8e-130
Identity = 230/320 (71.88%), Positives = 274/320 (85.62%), Query Frame = 0

Query: 882  MAQTLS-FVFVLSLLAFAPLCISSSS-GGYGYLYPQYYDHSCPRAKDIVKSILAKAFARE 941
            MA++L+  +  LSL+AF+P C+ S + G  GYL+PQ+YD SCP+A++IV+SI+AKAF  +
Sbjct: 1    MAKSLNILIAALSLIAFSPFCLCSKAYGSGGYLFPQFYDQSCPKAQEIVQSIVAKAFEHD 60

Query: 942  ARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEK 1001
             R+ AS+LRLHFHDCFV+GCDAS+LLDSSG+I SEK SNPNRNSARGFE+I+E+K ALE+
Sbjct: 61   PRMPASLLRLHFHDCFVKGCDASILLDSSGTIISEKRSNPNRNSARGFELIEEIKHALEQ 120

Query: 1002 ECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTIL 1061
            ECP+TVSCADIL LAARDSTVITGGP WEVPLGRRD++ ASLSGSNN+IPAPNNTFQTIL
Sbjct: 121  ECPETVSCADILALAARDSTVITGGPSWEVPLGRRDARGASLSGSNNDIPAPNNTFQTIL 180

Query: 1062 SRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCP 1121
            ++F+ QGLD+VDLV+LSG HTIGNSRCTSFRQRLYNQ+GNG+PD TL    A  LR RCP
Sbjct: 181  TKFKRQGLDLVDLVSLSGSHTIGNSRCTSFRQRLYNQSGNGKPDMTLSQYYATLLRQRCP 240

Query: 1122 RSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDNEL 1181
            RSGGD  LFFLDF TP KFDN YFKN++ YKGLL+SD++L T N  S  LV+ YAE+ E 
Sbjct: 241  RSGGDQTLFFLDFATPFKFDNHYFKNLIMYKGLLSSDEILFTKNKQSKELVELYAENQEA 300

Query: 1182 FFEQFAKSMIKMGNISPLTG 1200
            FFEQFAKSM+KMGNISPLTG
Sbjct: 301  FFEQFAKSMVKMGNISPLTG 320

BLAST of Sgr026053 vs. ExPASy Swiss-Prot
Match: O23237 (Peroxidase 49 OS=Arabidopsis thaliana OX=3702 GN=PER49 PE=2 SV=2)

HSP 1 Score: 456.1 bits (1172), Expect = 1.5e-126
Identity = 226/318 (71.07%), Positives = 261/318 (82.08%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCISSSSGGYGYLYPQYYDHSCPRAKDIVKSILAKAFAREAR 941
            MA+  SF+ +LSL+ F PLC+   S G G L+P YY HSCP+  +IV+S++AKA ARE R
Sbjct: 1    MARLTSFLLLLSLICFVPLCLCDKSYG-GKLFPGYYAHSCPQVNEIVRSVVAKAVARETR 60

Query: 942  IAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEKEC 1001
            +AAS+LRLHFHDCFVQGCD SLLLDSSG + +EKNSNPN  SARGF+V+D++K+ LEK+C
Sbjct: 61   MAASLLRLHFHDCFVQGCDGSLLLDSSGRVATEKNSNPNSKSARGFDVVDQIKAELEKQC 120

Query: 1002 PQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTILSR 1061
            P TVSCAD+LTLAARDS+V+TGGP W VPLGRRDS++ASLS SNNNIPAPNNTFQTILS+
Sbjct: 121  PGTVSCADVLTLAARDSSVLTGGPSWVVPLGRRDSRSASLSQSNNNIPAPNNTFQTILSK 180

Query: 1062 FQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCPRS 1121
            F  QGLDI DLVALSG HTIG SRCTSFRQRLYNQ+GNG PD TL  S AA LR RCP+S
Sbjct: 181  FNRQGLDITDLVALSGSHTIGFSRCTSFRQRLYNQSGNGSPDMTLEQSFAANLRQRCPKS 240

Query: 1122 GGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDNELFF 1181
            GGD  L  LD  +   FDNSYFKN++  KGLLNSDQVL +SN+ S  LVKKYAED   FF
Sbjct: 241  GGDQILSVLDIISAASFDNSYFKNLIENKGLLNSDQVLFSSNEKSRELVKKYAEDQGEFF 300

Query: 1182 EQFAKSMIKMGNISPLTG 1200
            EQFA+SMIKMGNISPLTG
Sbjct: 301  EQFAESMIKMGNISPLTG 317

BLAST of Sgr026053 vs. ExPASy Swiss-Prot
Match: Q9SI16 (Peroxidase 15 OS=Arabidopsis thaliana OX=3702 GN=PER15 PE=2 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 4.2e-124
Identity = 226/323 (69.97%), Positives = 263/323 (81.42%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCI-----SSSSGGYGYLYPQYYDHSCPRAKDIVKSILAKAF 941
            MA+  SF+ +L L+    LCI     S+  G  G L+P +Y  SCPRA++IV+S++AKA 
Sbjct: 1    MARIGSFLIILYLIYALTLCICDDDESNYGGDKGNLFPGFYRSSCPRAEEIVRSVVAKAV 60

Query: 942  AREARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSA 1001
            ARE R+AAS++RLHFHDCFVQGCD SLLLD+SGSI +EKNSNPN  SARGFEV+DE+K+A
Sbjct: 61   ARETRMAASLMRLHFHDCFVQGCDGSLLLDTSGSIVTEKNSNPNSRSARGFEVVDEIKAA 120

Query: 1002 LEKECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQ 1061
            LE ECP TVSCAD LTLAARDS+V+TGGP W VPLGRRDS +ASLSGSNNNIPAPNNTF 
Sbjct: 121  LENECPNTVSCADALTLAARDSSVLTGGPSWMVPLGRRDSTSASLSGSNNNIPAPNNTFN 180

Query: 1062 TILSRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRT 1121
            TI++RF NQGLD+ D+VALSG HTIG SRCTSFRQRLYNQ+GNG PD+TL  S AA LR 
Sbjct: 181  TIVTRFNNQGLDLTDVVALSGSHTIGFSRCTSFRQRLYNQSGNGSPDRTLEQSYAANLRQ 240

Query: 1122 RCPRSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAED 1181
            RCPRSGGD NL  LD  +  +FDNSYFKN++   GLLNSD+VL +SN+ S  LVKKYAED
Sbjct: 241  RCPRSGGDQNLSELDINSAGRFDNSYFKNLIENMGLLNSDEVLFSSNEQSRELVKKYAED 300

Query: 1182 NELFFEQFAKSMIKMGNISPLTG 1200
             E FFEQFA+SMIKMGNISPLTG
Sbjct: 301  QEEFFEQFAESMIKMGNISPLTG 323

BLAST of Sgr026053 vs. ExPASy Swiss-Prot
Match: Q9SI17 (Peroxidase 14 OS=Arabidopsis thaliana OX=3702 GN=PER14 PE=3 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 1.4e-116
Identity = 214/322 (66.46%), Positives = 253/322 (78.57%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCISSSSGGYG----YLYPQYYDHSCPRAKDIVKSILAKAFA 941
            MA+  SF+ +LSL     LCI  ++  +G     L+P +Y  SCPRA++IV+S++AKAF 
Sbjct: 1    MARIGSFLILLSLTYALTLCICDNASNFGGNKRNLFPDFYRSSCPRAEEIVRSVVAKAFE 60

Query: 942  REARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSAL 1001
            RE R+AAS++RLHFHDCFVQGCD SLLLD+SGSI +EKNSNPN  SARGFEV+DE+K+AL
Sbjct: 61   RETRMAASLMRLHFHDCFVQGCDGSLLLDTSGSIVTEKNSNPNSRSARGFEVVDEIKAAL 120

Query: 1002 EKECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQT 1061
            E ECP TVSCAD LTLAARDS+V+TGGP W VPLGRRDS TAS +  N ++P P+N F T
Sbjct: 121  ENECPNTVSCADALTLAARDSSVLTGGPSWTVPLGRRDSATASRAKPNKDLPEPDNLFDT 180

Query: 1062 ILSRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTR 1121
            I  RF N+GL++ DLVALSG HTIG SRCTSFRQRLYNQ+G+G PD TL  S AA LR R
Sbjct: 181  IFLRFSNEGLNLTDLVALSGSHTIGFSRCTSFRQRLYNQSGSGSPDTTLEKSYAAILRQR 240

Query: 1122 CPRSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDN 1181
            CPRSGGD NL  LD  +  +FDNSYFKN++   GLLNSDQVL +SN+ S  LVKKYAED 
Sbjct: 241  CPRSGGDQNLSELDINSAGRFDNSYFKNLIENMGLLNSDQVLFSSNEQSRELVKKYAEDQ 300

Query: 1182 ELFFEQFAKSMIKMGNISPLTG 1200
            E FFEQFA+SMIKMG ISPLTG
Sbjct: 301  EEFFEQFAESMIKMGKISPLTG 322

BLAST of Sgr026053 vs. ExPASy Swiss-Prot
Match: Q96512 (Peroxidase 9 OS=Arabidopsis thaliana OX=3702 GN=PER9 PE=1 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 1.1e-108
Identity = 195/300 (65.00%), Positives = 241/300 (80.33%), Query Frame = 0

Query: 903  SSSSGG--YGYLYPQYYDHSCPRAKDIVKSILAKAFAREARIAASILRLHFHDCFVQGCD 962
            +S  GG  Y  LYPQ+Y  SCP+A +IV ++L KA A+E R+AAS+LRLHFHDCFVQGCD
Sbjct: 34   NSPIGGSFYSNLYPQFYQFSCPQADEIVMTVLEKAIAKEPRMAASLLRLHFHDCFVQGCD 93

Query: 963  ASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEKECPQTVSCADILTLAARDSTV 1022
            AS+LLD S +I SEKN+ PN+NS RGF+VIDE+K+ LE+ CPQTVSCADIL LAAR ST+
Sbjct: 94   ASILLDDSATIRSEKNAGPNKNSVRGFQVIDEIKAKLEQACPQTVSCADILALAARGSTI 153

Query: 1023 ITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTILSRFQNQGLDIVDLVALSGGHT 1082
            ++GGP WE+PLGRRDS+TASL+G+N NIPAPN+T Q +L+ FQ +GL+  DLV+LSGGHT
Sbjct: 154  LSGGPSWELPLGRRDSRTASLNGANTNIPAPNSTIQNLLTMFQRKGLNEEDLVSLSGGHT 213

Query: 1083 IGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCPRSGGDNNLFFLDFFTPTKFDN 1142
            IG +RCT+F+QRLYNQNGN QPD+TL  S    LR+ CP +GGDNN+  LD  +P +FDN
Sbjct: 214  IGVARCTTFKQRLYNQNGNNQPDETLERSYYYGLRSICPPTGGDNNISPLDLASPARFDN 273

Query: 1143 SYFKNIVAYKGLLNSDQVLLTSN-DASAALVKKYAEDNELFFEQFAKSMIKMGNISPLTG 1200
            +YFK ++  KGLL SD+VLLT N   + ALVK YAED  LFF+QFAKSM+ MGNI PLTG
Sbjct: 274  TYFKLLLWGKGLLTSDEVLLTGNVGKTGALVKAYAEDERLFFQQFAKSMVNMGNIQPLTG 333

BLAST of Sgr026053 vs. ExPASy TrEMBL
Match: A0A4D6KYD9 (Ubiquitin-conjugating enzyme E2-binding protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG2g2113 PE=4 SV=1)

HSP 1 Score: 826.2 bits (2133), Expect = 2.1e-235
Identity = 449/936 (47.97%), Positives = 595/936 (63.57%), Query Frame = 0

Query: 7   SAVPYTNFTFVLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDA 66
           S VP         D   NPSL C +L ++L+ S S + +T      +S+RVP+P+VL+DA
Sbjct: 18  SHVPTLRLMVFPNDKTLNPSLHCHDLAINLHSSHSFLTLT---TSSLSLRVPLPAVLLDA 77

Query: 67  ESPLSFRAFEDHIEVKLVLLLPVDHPIILNF--------DNVLNFSEERG-------NDY 126
           +SP++FR   DHIEVKL+LLLPVDHPI+ +         D +++ S+ +        + Y
Sbjct: 78  DSPVTFRPLSDHIEVKLLLLLPVDHPILSSLHPSPTPLPDPLVSESDVKKLSSAGEVDFY 137

Query: 127 SKASKPLLMD---SDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 186
            +     L +    +FVEMPSVNWREVADNWFG+CCCSFGGISEK+V RY +SY C  GV
Sbjct: 138 CRTCTFKLTEIPLRNFVEMPSVNWREVADNWFGACCCSFGGISEKMVMRYVSSYTCMPGV 197

Query: 187 CLLTLTTITLSKDDLIGHMFPD---------------YDG------TREFKDE-----SD 246
           CLLT  ++T+ KDDL+ + FP+                DG      + E  DE     SD
Sbjct: 198 CLLTSASVTICKDDLVEYNFPEGCAKQECTSVAENPRDDGIVKLLRSCELNDERTSTCSD 257

Query: 247 FTDGNWLSEAKQESQC------NHTSTKEVKSKKFNDENLAANMEGDAA----------G 306
               +  S+ ++ S C      N       + ++  DE L+  +  + A           
Sbjct: 258 DERTSTCSDDERTSTCSDDGGVNLAFDSNYRFERSEDEKLSMKLRSEVAKSKPDCGHFSD 317

Query: 307 KESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYKLDPKPINTIDLSDNQRSF 366
              D   +  +T IP CC H ++N+    D D  HH+CGT   +  P  T+++  NQ++ 
Sbjct: 318 SHPDSNGTKDVTEIPSCCAHMKNNL---GDEDSEHHSCGTAGREGMPTETLEILGNQKTL 377

Query: 367 LNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLFKCYVSAC 426
           LNGFL +IFMARLSNL+ D +W EF CPQC+++IGAYPC  G  P DGGVRLFKCY+S C
Sbjct: 378 LNGFLEDIFMARLSNLTKDIDWREFTCPQCASIIGAYPCCEGHTPVDGGVRLFKCYISTC 437

Query: 427 SSV-ESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTKSSMLHIVLINSNSWSCS 486
             V  S +   +YTL +MFAN+L+E AN+ES FR V+++L TK+ +L I+L+N ++WSCS
Sbjct: 438 LPVGGSEDMFSKYTLGKMFANRLMECANDESLFRFVIRDLTTKAPVLQIILLNPDTWSCS 497

Query: 487 GYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQMPALL---VSWPDISHNRRFS 546
           G C G +D  ESV K++L P+IKVL+SD + + ES  R   +L    +++P I    +  
Sbjct: 498 GNCSGTED-KESVHKLKLQPIIKVLYSDFHNATESQSRCRFSLTPPHIAFPKIPIKNQSK 557

Query: 547 RIHPVFIEIQSFDSVLGNKTELFTCRFQDVQNVESKTEAVEMNAIAVPSSSCYVFDNSSH 606
             + +   ++ F    G+  E FT   Q   ++  +                +  D    
Sbjct: 558 --NALLDGVRRFMPRPGHAAEFFTVLMQQGSDLIVR----------------FCKD---- 617

Query: 607 IVDFVSYIGYVDFGRFDKFNYFVTGSGHINFVQGYYNGDLTSCEQSYDKLGRTAQVNVIC 666
            V+  S  GYV FGRFDKFN FV GSG  +F+Q YYNGDL  CEQSYDK+GRTAQVN++C
Sbjct: 618 -VESRSQTGYVGFGRFDKFNNFVAGSGQHDFMQEYYNGDLMGCEQSYDKMGRTAQVNMVC 677

Query: 667 GGCLNGQCKGGLGCICNITYESSCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNG 726
           G C NG CKG  GCICN+T+ES+CRV+VDLAIPC+  GP VF+GFTVGFHPRSWE+VYNG
Sbjct: 678 GSCSNGLCKGRPGCICNVTHESNCRVLVDLAIPCDKPGPHVFQGFTVGFHPRSWELVYNG 737

Query: 727 LTQLGYEKPHRAFSFSTEQTRVVLYMTAIASLSSLVQRPIIQVSPENGLEVKVSGSGATG 786
           LTQ+G+E+PH  FSF T QT+VVL+MTA+ASLSSLVQ+P ++V P+ GLEV++SGS   G
Sbjct: 738 LTQIGFEEPHHDFSFHTGQTQVVLFMTAVASLSSLVQKPSLKVHPDKGLEVRLSGSAIKG 797

Query: 787 SYPTTLSPSMLTIDWRCDIARNILYEVNVTVPVADYEPISFFLTKICDNRQDLEGDSMKG 846
             PTTLSPSML +DWRC++AR+  YEVN+T+PV  YEPI F LTK CD  QD  G   +G
Sbjct: 798 MPPTTLSPSMLIVDWRCEVARDTPYEVNITIPVQGYEPIQFVLTKQCDYTQDPGGGRTRG 857

Query: 847 WATFGILSCIFIVVTSLLCCGGFVYKAQVQGQRGIDALPGMTLVSACLETISGGGQGYYP 879
           WA FG+LSCIF V ++L CCGGFVYK +V+ QRGIDALPGMT++SACLET+SG G G Y 
Sbjct: 858 WAIFGVLSCIFFVSSTLFCCGGFVYKTKVERQRGIDALPGMTILSACLETVSGVGHG-YS 917

BLAST of Sgr026053 vs. ExPASy TrEMBL
Match: A0A6J1BYQ5 (uncharacterized protein LOC111005900 OS=Momordica charantia OX=3673 GN=LOC111005900 PE=4 SV=1)

HSP 1 Score: 809.7 bits (2090), Expect = 2.0e-230
Identity = 395/477 (82.81%), Positives = 424/477 (88.89%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDS+TNPSLQCQNLKVHLNL QSVVC TW +DL+VSIRVP+P VLVD+ESPLSFRAFE
Sbjct: 30  LLFDSHTNPSLQCQNLKVHLNLPQSVVCATWLQDLEVSIRVPIPPVLVDSESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           DHIEVKL LLLPVDHPI+LNFDNVLN SEERGN YSKASKPLLMDSD             
Sbjct: 90  DHIEVKLFLLLPVDHPIVLNFDNVLNSSEERGNKYSKASKPLLMDSDQNSLSRTGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSESPLRNFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITLSKDD+IGH+FPDYDGTR+FKDESDF DGNWL+EAKQE QCN TS K+VK 
Sbjct: 210 CLLTLTTITLSKDDIIGHVFPDYDGTRQFKDESDFADGNWLTEAKQELQCNLTSMKKVKP 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+ ND+ LAANMEGDA  KE +EV+SP+MTPIP+CCHHGESNVLN+LDRDCMHHTC TYK
Sbjct: 270 KQSNDKTLAANMEGDATEKEREEVDSPNMTPIPDCCHHGESNVLNHLDRDCMHHTCSTYK 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPINTIDLSD+QRSFLNGFLGNIFMARLSNLSADFEWVEFFCP+CSTLIGAYPCSN 
Sbjct: 330 LDPKPINTIDLSDDQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPKCSTLIGAYPCSNR 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS CSSVES N LREYTLERMFANQLLESAN+ESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCSSVESGNLLREYTLERMFANQLLESANDESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
           S MLHIVLINS SWSCSGYCLGM+DTAESV K++L+PVIKVLFSDC+KSAESHLR++
Sbjct: 450 SPMLHIVLINSYSWSCSGYCLGMEDTAESVSKIDLSPVIKVLFSDCSKSAESHLRKL 506

BLAST of Sgr026053 vs. ExPASy TrEMBL
Match: A0A6J1IL55 (uncharacterized protein LOC111478431 OS=Cucurbita maxima OX=3661 GN=LOC111478431 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 5.0e-221
Identity = 379/477 (79.45%), Positives = 417/477 (87.42%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDSYTNPSLQCQNLKVHLNL QSVVCV W +D+++SIRVP+P VLVDAESPLSFRAFE
Sbjct: 30  LLFDSYTNPSLQCQNLKVHLNLQQSVVCVAWLQDVEMSIRVPMPPVLVDAESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           +HIEVKLVLLLPVDHPIILNFDNVL+FSE+RG++ SKA KPL MD D             
Sbjct: 90  NHIEVKLVLLLPVDHPIILNFDNVLDFSEKRGHNNSKALKPLSMDYDQSSLSRSGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFG+CCCSFGG+SEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSESPLRNFVEMPSVNWREVADNWFGTCCCSFGGVSEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITLSKDDLIGH FPDYDGTRE K+ESDFTDGNWL+EAKQESQCNHTST EVKS
Sbjct: 210 CLLTLTTITLSKDDLIGHAFPDYDGTRELKEESDFTDGNWLTEAKQESQCNHTSTGEVKS 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+FN +NL A  EG+A+ K SDEV+SP +T IP+   HGESNVL++LDRDCMHHTCGTY+
Sbjct: 270 KQFNYKNLVAKTEGNASVKGSDEVDSPLVTSIPDLHQHGESNVLHDLDRDCMHHTCGTYE 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKP+NT+D+SD+Q SFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NG
Sbjct: 330 LDPKPLNTVDVSDDQISFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPCRNG 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS C S ES N  REYTLE+MFA+QLLESANEESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCLSTESENLFREYTLEKMFASQLLESANEESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
           S+MLHIVLINSNSWSCSGYCLGM+DTAE VPK++LNP+IKVLFSDCNKSAESHLR++
Sbjct: 450 STMLHIVLINSNSWSCSGYCLGMEDTAEVVPKVDLNPIIKVLFSDCNKSAESHLRKL 506

BLAST of Sgr026053 vs. ExPASy TrEMBL
Match: A0A151SJN5 (Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_001194 PE=4 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 8.6e-221
Identity = 426/913 (46.66%), Positives = 552/913 (60.46%), Query Frame = 0

Query: 20  DSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFEDHI 79
           D   NPSLQC +L V+L+ S + + +T      +S+RVP+P VL+D +SPL+FR   DHI
Sbjct: 30  DKTLNPSLQCHDLTVNLHSSHAFLTLT---ASSLSLRVPLPVVLLDGDSPLTFRPLSDHI 89

Query: 80  EVKLVLLLPVDHPIILNFDNVLNFSEE---RGNDYSKASKPLLMD--------------- 139
           EVKL+LLLPVDHPI+   D   +   +     +D  K S    +D               
Sbjct: 90  EVKLLLLLPVDHPILSTLDPSPSPPTDPLVSQSDVEKLSSAGEVDFYCRTCAFKLTKVPL 149

Query: 140 SDFVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGVCLLTLTTITLSKD 199
             FVEMPSVNWREVADNWFG+CCCSFGGISEK+V RY +SY CA G+CLL+ T++TL KD
Sbjct: 150 RTFVEMPSVNWREVADNWFGACCCSFGGISEKMVMRYVSSYMCAPGMCLLSSTSVTLCKD 209

Query: 200 DLIGHMFPDYDGTREF----------KDESDFTDG------NWLSEAKQESQCNHTSTKE 259
           DL+   F +  G RE+            E+   DG      N     +++  C+  S   
Sbjct: 210 DLVECDFLEGCGEREWSCVTENPGDDSGENPRDDGVGKGMRNCELNGERDLTCSDDSGVT 269

Query: 260 VKSKKFNDENLAANMEGDAAGKES-------------------DEVNSPHMTPIPECCHH 319
           +    F++ +  A+ E D A   S                   D  ++  +     CC H
Sbjct: 270 L---AFDESSRFAHPENDKASVNSRCEFVKNEPDHSDFSDSRPDSDDAKDVNQASSCCAH 329

Query: 320 GESNVLNNLDRDCMHHTCGTYKLDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADF 379
             SN L N DR+  HH CGT   +     T+++  NQ+SFLNGFL +IFMARLSNLS D 
Sbjct: 330 MTSN-LGNGDRE--HHLCGTDGKEGMSTETVEILGNQKSFLNGFLEDIFMARLSNLSKDI 389

Query: 380 EWVEFFCPQCSTLIGAYPCSNGCGPTDGGVRLFKCYVSACSSV-ESRNFLREYTLERMFA 439
           +W EF CPQC+  +GAYPC +G    DGG+RLFKCY+S    V  S +   +YT+ +MFA
Sbjct: 390 DWREFRCPQCTCFLGAYPCFDGHTLVDGGIRLFKCYISTSVPVGGSGDMFSKYTMGKMFA 449

Query: 440 NQLLESANEESSFRTVVKELKTKSSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNP 499
           NQL+E AN+ESSFR V+++L+T+S +L I+L+N ++WSCSG C   +D  + V K++L P
Sbjct: 450 NQLMECANDESSFRFVIRDLRTQSPVLQIILLNPDTWSCSGNCSSAED-KDPVTKLKLQP 509

Query: 500 VIKVLFSDCNKSAESHLRQMPALLVSWPDISHNRRFSRIHPVFIEIQSFDSVLGNKTELF 559
           +IK                                                         
Sbjct: 510 IIK--------------------------------------------------------- 569

Query: 560 TCRFQDVQNVESKTEAVEMNAIAVPSSSCYVFDNSSHIVDFVSYIGYVDFGRFDKFNYFV 619
                                                        GYVDFGRFDKFNYFV
Sbjct: 570 ---------------------------------------------GYVDFGRFDKFNYFV 629

Query: 620 TGSGHINFVQGYYNGDLTSCEQSYDKLGRTAQVNVICGGCLNGQCKGGLGCICNITYESS 679
           +G+G  +F+Q +YNGDL SCEQSYDK+GRTAQVN+ICG CLNGQCKG  GCICN+T+ES+
Sbjct: 630 SGTGQSDFIQEFYNGDLMSCEQSYDKMGRTAQVNIICGSCLNGQCKGLPGCICNVTFESN 689

Query: 680 CRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFSTEQTRVV 739
           CRV+++LAI C+  GP+VF+GFTVGFHPRSWE+VYNG+TQ+G+EKP+  FSF T QT+VV
Sbjct: 690 CRVLIELAISCDKPGPQVFQGFTVGFHPRSWELVYNGMTQIGFEKPYNEFSFPTGQTQVV 749

Query: 740 LYMTAIASLSSLVQRPIIQVSPENGLEVKVSGSGATGSYPTTLSPSMLTIDWRCDIARNI 799
           L+MTA+ASLSSLVQ P ++V P+NGLEVK+SGS A G  PTTLSPSML +DWRC++AR+ 
Sbjct: 750 LFMTAVASLSSLVQNPSLKVLPDNGLEVKLSGSAAKGKPPTTLSPSMLVVDWRCEVARDT 809

Query: 800 LYEVNVTVPVADYEPISFFLTKICDNRQDLEGDSMKGWATFGILSCIFIVVTSLLCCGGF 859
            YEVN+T+PV  YEPI F LTK CD +Q + G   +GWA FG+LSCIF V ++L CCGGF
Sbjct: 810 PYEVNITIPVEGYEPIHFVLTKSCDYKQAIGGGRTRGWAIFGVLSCIFFVSSTLFCCGGF 826

Query: 860 VYKAQVQGQRGIDALPGMTLVSACLETISGGGQGYYPRAEGINNAFVSEASWEQRPSSSS 879
           +YK +V+ QRGIDALPGMT++SACLET+SG GQG Y R E IN+A  SE SWE+ P  S 
Sbjct: 870 IYKMKVERQRGIDALPGMTILSACLETVSGAGQG-YSRPEDINSAVASETSWERPPGPSQ 826

BLAST of Sgr026053 vs. ExPASy TrEMBL
Match: A0A6J1GEG6 (uncharacterized protein LOC111453202 OS=Cucurbita moschata OX=3662 GN=LOC111453202 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 1.9e-220
Identity = 381/477 (79.87%), Positives = 413/477 (86.58%), Query Frame = 0

Query: 17  VLFDSYTNPSLQCQNLKVHLNLSQSVVCVTWFEDLKVSIRVPVPSVLVDAESPLSFRAFE 76
           +LFDSYTNPSLQCQNLKVHLNL QSVVCV W +DL++SIRVP+P VLVDAESPLSFRAFE
Sbjct: 30  LLFDSYTNPSLQCQNLKVHLNLQQSVVCVAWLQDLEMSIRVPMPPVLVDAESPLSFRAFE 89

Query: 77  DHIEVKLVLLLPVDHPIILNFDNVLNFSEERGNDYSKASKPLLMDSD------------- 136
           DH+EVKLVLLLPVDHPIILNFDNVL+FSE+RG+  SKA KPL MD D             
Sbjct: 90  DHMEVKLVLLLPVDHPIILNFDNVLDFSEKRGHRNSKALKPLSMDYDQSSLSRSGGVHFY 149

Query: 137 ---------------FVEMPSVNWREVADNWFGSCCCSFGGISEKLVTRYTNSYRCAKGV 196
                          FVEMPSVNWREVADNWFG+CCCSFGGISEKLVTRYTNSYRCAKGV
Sbjct: 150 CRNCSFRLSESPLRNFVEMPSVNWREVADNWFGTCCCSFGGISEKLVTRYTNSYRCAKGV 209

Query: 197 CLLTLTTITLSKDDLIGHMFPDYDGTREFKDESDFTDGNWLSEAKQESQCNHTSTKEVKS 256
           CLLTLTTITL KDDLIGH FPDYDGTRE KDESDFTDGNW +EAKQESQCNHTST EVKS
Sbjct: 210 CLLTLTTITLYKDDLIGHAFPDYDGTRELKDESDFTDGNWSTEAKQESQCNHTSTGEVKS 269

Query: 257 KKFNDENLAANMEGDAAGKESDEVNSPHMTPIPECCHHGESNVLNNLDRDCMHHTCGTYK 316
           K+FN +NL A  EG+AA K SDEV+SP +T IP+   HGESNVL++LDRDCMHHTCGTY+
Sbjct: 270 KQFNYKNLVAKTEGNAAVKGSDEVDSPLVTSIPDLHQHGESNVLHDLDRDCMHHTCGTYE 329

Query: 317 LDPKPINTIDLSDNQRSFLNGFLGNIFMARLSNLSADFEWVEFFCPQCSTLIGAYPCSNG 376
           LDPKPINT+D+SD+Q SFLNGFLGNIFMARLSNLSADFEW EFFCPQCSTLIGAYPC NG
Sbjct: 330 LDPKPINTVDVSDDQISFLNGFLGNIFMARLSNLSADFEWAEFFCPQCSTLIGAYPCRNG 389

Query: 377 CGPTDGGVRLFKCYVSACSSVESRNFLREYTLERMFANQLLESANEESSFRTVVKELKTK 436
           CGPTDGGVRLFKCYVS C S E  N  REYTLE+MFA+QLLESANEESSFRTVVKELKTK
Sbjct: 390 CGPTDGGVRLFKCYVSTCLSTEPENLFREYTLEKMFASQLLESANEESSFRTVVKELKTK 449

Query: 437 SSMLHIVLINSNSWSCSGYCLGMKDTAESVPKMELNPVIKVLFSDCNKSAESHLRQM 466
           S+MLHIVLINSNSWSCSGYCLGM+DTAE VPK++LNP+IKVLFSDCNKSAESHLR++
Sbjct: 450 STMLHIVLINSNSWSCSGYCLGMEDTAEVVPKVDLNPIIKVLFSDCNKSAESHLRKL 506

BLAST of Sgr026053 vs. TAIR 10
Match: AT5G66390.1 (Peroxidase superfamily protein)

HSP 1 Score: 469.2 bits (1206), Expect = 1.3e-131
Identity = 230/320 (71.88%), Positives = 274/320 (85.62%), Query Frame = 0

Query: 882  MAQTLS-FVFVLSLLAFAPLCISSSS-GGYGYLYPQYYDHSCPRAKDIVKSILAKAFARE 941
            MA++L+  +  LSL+AF+P C+ S + G  GYL+PQ+YD SCP+A++IV+SI+AKAF  +
Sbjct: 1    MAKSLNILIAALSLIAFSPFCLCSKAYGSGGYLFPQFYDQSCPKAQEIVQSIVAKAFEHD 60

Query: 942  ARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEK 1001
             R+ AS+LRLHFHDCFV+GCDAS+LLDSSG+I SEK SNPNRNSARGFE+I+E+K ALE+
Sbjct: 61   PRMPASLLRLHFHDCFVKGCDASILLDSSGTIISEKRSNPNRNSARGFELIEEIKHALEQ 120

Query: 1002 ECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTIL 1061
            ECP+TVSCADIL LAARDSTVITGGP WEVPLGRRD++ ASLSGSNN+IPAPNNTFQTIL
Sbjct: 121  ECPETVSCADILALAARDSTVITGGPSWEVPLGRRDARGASLSGSNNDIPAPNNTFQTIL 180

Query: 1062 SRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCP 1121
            ++F+ QGLD+VDLV+LSG HTIGNSRCTSFRQRLYNQ+GNG+PD TL    A  LR RCP
Sbjct: 181  TKFKRQGLDLVDLVSLSGSHTIGNSRCTSFRQRLYNQSGNGKPDMTLSQYYATLLRQRCP 240

Query: 1122 RSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDNEL 1181
            RSGGD  LFFLDF TP KFDN YFKN++ YKGLL+SD++L T N  S  LV+ YAE+ E 
Sbjct: 241  RSGGDQTLFFLDFATPFKFDNHYFKNLIMYKGLLSSDEILFTKNKQSKELVELYAENQEA 300

Query: 1182 FFEQFAKSMIKMGNISPLTG 1200
            FFEQFAKSM+KMGNISPLTG
Sbjct: 301  FFEQFAKSMVKMGNISPLTG 320

BLAST of Sgr026053 vs. TAIR 10
Match: AT4G36430.1 (Peroxidase superfamily protein )

HSP 1 Score: 456.1 bits (1172), Expect = 1.1e-127
Identity = 226/318 (71.07%), Positives = 261/318 (82.08%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCISSSSGGYGYLYPQYYDHSCPRAKDIVKSILAKAFAREAR 941
            MA+  SF+ +LSL+ F PLC+   S G G L+P YY HSCP+  +IV+S++AKA ARE R
Sbjct: 1    MARLTSFLLLLSLICFVPLCLCDKSYG-GKLFPGYYAHSCPQVNEIVRSVVAKAVARETR 60

Query: 942  IAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSALEKEC 1001
            +AAS+LRLHFHDCFVQGCD SLLLDSSG + +EKNSNPN  SARGF+V+D++K+ LEK+C
Sbjct: 61   MAASLLRLHFHDCFVQGCDGSLLLDSSGRVATEKNSNPNSKSARGFDVVDQIKAELEKQC 120

Query: 1002 PQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQTILSR 1061
            P TVSCAD+LTLAARDS+V+TGGP W VPLGRRDS++ASLS SNNNIPAPNNTFQTILS+
Sbjct: 121  PGTVSCADVLTLAARDSSVLTGGPSWVVPLGRRDSRSASLSQSNNNIPAPNNTFQTILSK 180

Query: 1062 FQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTRCPRS 1121
            F  QGLDI DLVALSG HTIG SRCTSFRQRLYNQ+GNG PD TL  S AA LR RCP+S
Sbjct: 181  FNRQGLDITDLVALSGSHTIGFSRCTSFRQRLYNQSGNGSPDMTLEQSFAANLRQRCPKS 240

Query: 1122 GGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDNELFF 1181
            GGD  L  LD  +   FDNSYFKN++  KGLLNSDQVL +SN+ S  LVKKYAED   FF
Sbjct: 241  GGDQILSVLDIISAASFDNSYFKNLIENKGLLNSDQVLFSSNEKSRELVKKYAEDQGEFF 300

Query: 1182 EQFAKSMIKMGNISPLTG 1200
            EQFA+SMIKMGNISPLTG
Sbjct: 301  EQFAESMIKMGNISPLTG 317

BLAST of Sgr026053 vs. TAIR 10
Match: AT4G36440.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 41 Blast hits to 41 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 451.4 bits (1160), Expect = 2.7e-126
Identity = 217/380 (57.11%), Positives = 272/380 (71.58%), Query Frame = 0

Query: 526 AIAVPSSSCYVFDNSSHIVDFVSYI---------------------------GYVDFGRF 585
           ++ VP S+CY  DNSS +VDF S+I                           GYVDFGRF
Sbjct: 25  SVPVPDSNCYALDNSSRLVDFSSWIGHPFEYDGKEFDLVVRFCKDVETRGQAGYVDFGRF 84

Query: 586 DKFNYFVTGSGHINFVQGYYNGDLTSCEQSYDKLGRTAQVNVICGGCLNGQCKGGLGCIC 645
           D  +YFV+ S + +FV            QSYDKLGRTAQVN+ICG C +G+CKGGLGCIC
Sbjct: 85  DPLSYFVSSSENFDFV------------QSYDKLGRTAQVNIICGNCSDGRCKGGLGCIC 144

Query: 646 NITYESSCRVIVDLAIPCEIQGPRVFKGFTVGFHPRSWEIVYNGLTQLGYEKPHRAFSFS 705
           ++T +S+CRV VDLAIPCE  GPRVFKGFTVG HPRSWEI+YNG+TQ G++KP R FSF 
Sbjct: 145 SVTQDSTCRVTVDLAIPCEKPGPRVFKGFTVGLHPRSWEIIYNGMTQFGFDKPRREFSFK 204

Query: 706 TEQTRVVLYMTAIASLSSLVQRPIIQVSPENGLEVKVSGSGATGSYPTTLSPSMLTIDWR 765
           TEQT + LYMTAIASLS+LV +PII+VSPENGL+VK++GS  TG++PTTLSPS L +DW 
Sbjct: 205 TEQTHLTLYMTAIASLSTLVGKPIIKVSPENGLDVKIAGSSLTGNHPTTLSPSTLVLDWN 264

Query: 766 CDIARNILYEVNVTVPVADYEPISFFLTKICDNRQDLEGDSMKGWATFGILSCIFIVVTS 825
           C+ +R   YEVNVT+PV  Y+P+ FFLTK+C+  Q  EG S KGWA FG+ SC+F+V ++
Sbjct: 265 CEKSRRTPYEVNVTIPVDGYDPVQFFLTKLCEYNQGNEGGSAKGWAIFGVFSCVFLVASA 324

Query: 826 LLCCGGFVYKAQVQGQRGIDALPGMTLVSACLETISGGGQGYYPRAEGINNAFVSEASWE 879
           L CCGGF+YK +V+  RG DALPGM+L+S  LET+SG GQ  Y R E INNAF +E SW+
Sbjct: 325 LFCCGGFIYKTRVERVRGTDALPGMSLLSGLLETVSGSGQS-YSRTEDINNAFANEVSWD 384

BLAST of Sgr026053 vs. TAIR 10
Match: AT2G18150.1 (Peroxidase superfamily protein )

HSP 1 Score: 448.0 bits (1151), Expect = 3.0e-125
Identity = 226/323 (69.97%), Positives = 263/323 (81.42%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCI-----SSSSGGYGYLYPQYYDHSCPRAKDIVKSILAKAF 941
            MA+  SF+ +L L+    LCI     S+  G  G L+P +Y  SCPRA++IV+S++AKA 
Sbjct: 1    MARIGSFLIILYLIYALTLCICDDDESNYGGDKGNLFPGFYRSSCPRAEEIVRSVVAKAV 60

Query: 942  AREARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSA 1001
            ARE R+AAS++RLHFHDCFVQGCD SLLLD+SGSI +EKNSNPN  SARGFEV+DE+K+A
Sbjct: 61   ARETRMAASLMRLHFHDCFVQGCDGSLLLDTSGSIVTEKNSNPNSRSARGFEVVDEIKAA 120

Query: 1002 LEKECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQ 1061
            LE ECP TVSCAD LTLAARDS+V+TGGP W VPLGRRDS +ASLSGSNNNIPAPNNTF 
Sbjct: 121  LENECPNTVSCADALTLAARDSSVLTGGPSWMVPLGRRDSTSASLSGSNNNIPAPNNTFN 180

Query: 1062 TILSRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRT 1121
            TI++RF NQGLD+ D+VALSG HTIG SRCTSFRQRLYNQ+GNG PD+TL  S AA LR 
Sbjct: 181  TIVTRFNNQGLDLTDVVALSGSHTIGFSRCTSFRQRLYNQSGNGSPDRTLEQSYAANLRQ 240

Query: 1122 RCPRSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAED 1181
            RCPRSGGD NL  LD  +  +FDNSYFKN++   GLLNSD+VL +SN+ S  LVKKYAED
Sbjct: 241  RCPRSGGDQNLSELDINSAGRFDNSYFKNLIENMGLLNSDEVLFSSNEQSRELVKKYAED 300

Query: 1182 NELFFEQFAKSMIKMGNISPLTG 1200
             E FFEQFA+SMIKMGNISPLTG
Sbjct: 301  QEEFFEQFAESMIKMGNISPLTG 323

BLAST of Sgr026053 vs. TAIR 10
Match: AT2G18140.1 (Peroxidase superfamily protein )

HSP 1 Score: 422.9 bits (1086), Expect = 1.0e-117
Identity = 214/322 (66.46%), Positives = 253/322 (78.57%), Query Frame = 0

Query: 882  MAQTLSFVFVLSLLAFAPLCISSSSGGYG----YLYPQYYDHSCPRAKDIVKSILAKAFA 941
            MA+  SF+ +LSL     LCI  ++  +G     L+P +Y  SCPRA++IV+S++AKAF 
Sbjct: 1    MARIGSFLILLSLTYALTLCICDNASNFGGNKRNLFPDFYRSSCPRAEEIVRSVVAKAFE 60

Query: 942  REARIAASILRLHFHDCFVQGCDASLLLDSSGSINSEKNSNPNRNSARGFEVIDEMKSAL 1001
            RE R+AAS++RLHFHDCFVQGCD SLLLD+SGSI +EKNSNPN  SARGFEV+DE+K+AL
Sbjct: 61   RETRMAASLMRLHFHDCFVQGCDGSLLLDTSGSIVTEKNSNPNSRSARGFEVVDEIKAAL 120

Query: 1002 EKECPQTVSCADILTLAARDSTVITGGPYWEVPLGRRDSKTASLSGSNNNIPAPNNTFQT 1061
            E ECP TVSCAD LTLAARDS+V+TGGP W VPLGRRDS TAS +  N ++P P+N F T
Sbjct: 121  ENECPNTVSCADALTLAARDSSVLTGGPSWTVPLGRRDSATASRAKPNKDLPEPDNLFDT 180

Query: 1062 ILSRFQNQGLDIVDLVALSGGHTIGNSRCTSFRQRLYNQNGNGQPDKTLPASLAAELRTR 1121
            I  RF N+GL++ DLVALSG HTIG SRCTSFRQRLYNQ+G+G PD TL  S AA LR R
Sbjct: 181  IFLRFSNEGLNLTDLVALSGSHTIGFSRCTSFRQRLYNQSGSGSPDTTLEKSYAAILRQR 240

Query: 1122 CPRSGGDNNLFFLDFFTPTKFDNSYFKNIVAYKGLLNSDQVLLTSNDASAALVKKYAEDN 1181
            CPRSGGD NL  LD  +  +FDNSYFKN++   GLLNSDQVL +SN+ S  LVKKYAED 
Sbjct: 241  CPRSGGDQNLSELDINSAGRFDNSYFKNLIENMGLLNSDQVLFSSNEQSRELVKKYAEDQ 300

Query: 1182 ELFFEQFAKSMIKMGNISPLTG 1200
            E FFEQFA+SMIKMG ISPLTG
Sbjct: 301  EEFFEQFAESMIKMGKISPLTG 322
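
The percentages on each HSP statistics line follow directly from the residue counts printed beside them (for example, 230 identical residues over an alignment length of 320). The following is a minimal sketch of how such a line could be parsed and its percentages recomputed. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers written for this illustration and are not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP as printed in this report.
# The pattern accepts both "Positives" and the "Postives" spelling
# that some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return residue counts plus reported and recomputed percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recomputed from the counts rather than trusted from the text.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
    }


if __name__ == "__main__":
    line = "Identity = 230/320 (71.88%), Positives = 274/320 (85.62%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    print(f"{stats['identity_pct']:.3f}% identity over {stats['aligned_length']} positions")
```

Recomputing from the counts makes it easy to check that a reported percentage agrees with its fraction to within rounding.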

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
QCD81783.1 | 4.3e-235 | 47.97 | Ubiquitin-conjugating enzyme E2-binding protein [Vigna unguiculata]
XP_022133273.1 | 4.2e-230 | 82.81 | uncharacterized protein LOC111005900 [Momordica charantia]
KAG6603940.1 | 2.5e-227 | 73.42 | hypothetical protein SDJN03_04549, partial [Cucurbita argyrosperma subsp. sorori...
XP_023543348.1 | 1.3e-223 | 80.71 | uncharacterized protein LOC111803252 [Cucurbita pepo subsp. pepo]
XP_038883816.1 | 4.6e-221 | 79.45 | uncharacterized protein LOC120074678 isoform X1 [Benincasa hispida]
Match Name | E-value | Identity | Description
Q9FJZ9 | 1.8e-130 | 71.88 | Peroxidase 72 OS=Arabidopsis thaliana OX=3702 GN=PER72 PE=1 SV=1
O23237 | 1.5e-126 | 71.07 | Peroxidase 49 OS=Arabidopsis thaliana OX=3702 GN=PER49 PE=2 SV=2
Q9SI16 | 4.2e-124 | 69.97 | Peroxidase 15 OS=Arabidopsis thaliana OX=3702 GN=PER15 PE=2 SV=1
Q9SI17 | 1.4e-116 | 66.46 | Peroxidase 14 OS=Arabidopsis thaliana OX=3702 GN=PER14 PE=3 SV=1
Q96512 | 1.1e-108 | 65.00 | Peroxidase 9 OS=Arabidopsis thaliana OX=3702 GN=PER9 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A4D6KYD9 | 2.1e-235 | 47.97 | Ubiquitin-conjugating enzyme E2-binding protein OS=Vigna unguiculata OX=3917 GN=...
A0A6J1BYQ5 | 2.0e-230 | 82.81 | uncharacterized protein LOC111005900 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1IL55 | 5.0e-221 | 79.45 | uncharacterized protein LOC111478431 OS=Cucurbita maxima OX=3661 GN=LOC111478431...
A0A151SJN5 | 8.6e-221 | 46.66 | Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_001194 PE=4 SV=1
A0A6J1GEG6 | 1.9e-220 | 79.87 | uncharacterized protein LOC111453202 OS=Cucurbita moschata OX=3662 GN=LOC1114532...
Match Name | E-value | Identity | Description
AT5G66390.1 | 1.3e-131 | 71.88 | Peroxidase superfamily protein
AT4G36430.1 | 1.1e-127 | 71.07 | Peroxidase superfamily protein
AT4G36440.1 | 2.7e-126 | 57.11 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G18150.1 | 3.0e-125 | 69.97 | Peroxidase superfamily protein
AT2G18140.1 | 1.0e-117 | 66.46 | Peroxidase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 375..395
None | No IPR available | GENE3D | 1.10.420.10 | Peroxidase, domain 2 | coord: 1049..1192, e-value: 4.3E-119, score: 399.2
None | No IPR available | GENE3D | 1.10.520.10 | | coord: 916..1200, e-value: 4.3E-119, score: 399.2
None | No IPR available | PIRSR | PIRSR600823-3 | PIRSR600823-3 | coord: 884..1200, e-value: 2.2E-129, score: 428.9
None | No IPR available | PIRSR | PIRSR600823-2 | PIRSR600823-2 | coord: 884..1200, e-value: 2.2E-129, score: 428.9
None | No IPR available | PANTHER | PTHR31388 | PEROXIDASE 72-RELATED | coord: 886..1200
None | No IPR available | PANTHER | PTHR31388:SF188 | PEROXIDASE | coord: 886..1200
IPR002016 | Haem peroxidase | PRINTS | PR00458 | PEROXIDASE | coord: 1071..1086, score: 55.87; coord: 1005..1022, score: 45.83; coord: 1023..1035, score: 57.12; coord: 943..957, score: 54.15; coord: 1131..1146, score: 45.01
IPR002016 | Haem peroxidase | PFAM | PF00141 | peroxidase | coord: 928..1176, e-value: 3.5E-75, score: 252.6
IPR002016 | Haem peroxidase | PROSITE | PS50873 | PEROXIDASE_4 | coord: 911..1217, score: 77.278992
IPR000823 | Plant peroxidase | PRINTS | PR00461 | PLPEROXIDASE | coord: 1186..1199, score: 65.06; coord: 1145..1162, score: 47.17; coord: 1023..1038, score: 67.22; coord: 1129..1144, score: 47.86; coord: 985..998, score: 56.44; coord: 921..940, score: 29.12; coord: 945..965, score: 78.38; coord: 1070..1082, score: 74.89; coord: 1004..1014, score: 80.12
IPR000206 | Ribosomal protein L7/L12 | TIGRFAM | TIGR00855 | TIGR00855 | coord: 1336..1470, e-value: 9.6E-25, score: 85.4
IPR000206 | Ribosomal protein L7/L12 | HAMAP | MF_00368 | Ribosomal_L7_L12 | coord: 1332..1471, score: 18.616758
IPR000206 | Ribosomal protein L7/L12 | CDD | cd00387 | Ribosomal_L7_L12 | coord: 1333..1470, e-value: 3.5304E-52, score: 177.348
IPR019193 | Ubiquitin-conjugating enzyme E2-binding protein | PFAM | PF09814 | HECT_2 | coord: 125..460, e-value: 6.4E-14, score: 51.6
IPR036235 | Ribosomal protein L7/L12, oligomerisation domain superfamily | GENE3D | 1.20.5.710 | Single helix bin | coord: 1334..1400, e-value: 4.7E-7, score: 31.7
IPR036235 | Ribosomal protein L7/L12, oligomerisation domain superfamily | SUPERFAMILY | 48300 | Ribosomal protein L7/12, oligomerisation (N-terminal) domain | coord: 1335..1406
IPR014719 | Ribosomal protein L7/L12, C-terminal/adaptor protein ClpS-like | GENE3D | 3.30.1390.10 | | coord: 1401..1470, e-value: 4.2E-23, score: 83.2
IPR014719 | Ribosomal protein L7/L12, C-terminal/adaptor protein ClpS-like | SUPERFAMILY | 54736 | ClpS-like | coord: 1400..1470
IPR013823 | Ribosomal protein L7/L12, C-terminal | PFAM | PF00542 | Ribosomal_L12 | coord: 1404..1470, e-value: 4.2E-23, score: 81.5
IPR008932 | Ribosomal protein L7/L12, oligomerisation | PFAM | PF16320 | Ribosomal_L12_N | coord: 1336..1376, e-value: 9.8E-10, score: 38.0
IPR019794 | Peroxidase, active site | PROSITE | PS00436 | PEROXIDASE_2 | coord: 943..954
IPR019793 | Peroxidases heam-ligand binding site | PROSITE | PS00435 | PEROXIDASE_1 | coord: 1071..1081
IPR033905 | Secretory peroxidase | CDD | cd00693 | secretory_peroxidase | coord: 912..1199, e-value: 1.45186E-163, score: 492.029
IPR010255 | Haem peroxidase superfamily | SUPERFAMILY | 48113 | Heme-dependent peroxidases | coord: 912..1201
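
The InterPro hits listed for this feature fall at widely separated coordinates on the predicted protein. One way to summarise this is to merge overlapping hit intervals into contiguous annotated regions. The sketch below does so for a handful of hits copied from the table above. `SAMPLE_HITS` and `merge_hits` are hypothetical names introduced for this illustration, and the selection of hits is only a sample, not the full annotation.

```python
from typing import List, Tuple

Hit = Tuple[int, int, str]           # (start, end, source term)
Region = Tuple[int, int, List[str]]  # (start, end, source terms covered)

# A sample of hits taken from the InterPro table (coordinates on the protein).
SAMPLE_HITS: List[Hit] = [
    (125, 460, "PF09814"),
    (375, 395, "Coil"),
    (928, 1176, "PF00141"),
    (912, 1199, "cd00693"),
    (1336, 1470, "TIGR00855"),
    (1404, 1470, "PF00542"),
]


def merge_hits(hits: List[Hit]) -> List[Region]:
    """Merge overlapping hit intervals into contiguous regions."""
    regions: List[Region] = []
    for start, end, term in sorted(hits):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend it and record the term.
            prev_start, prev_end, terms = regions[-1]
            regions[-1] = (prev_start, max(prev_end, end), terms + [term])
        else:
            # No overlap: open a new region.
            regions.append((start, end, [term]))
    return regions


if __name__ == "__main__":
    for start, end, terms in merge_hits(SAMPLE_HITS):
        print(f"{start}..{end}: {', '.join(terms)}")
```

Sorting by start coordinate first means a single pass is enough, since each hit can only overlap the most recently opened region.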

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr026053.1 | Sgr026053.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0042744 | hydrogen peroxide catabolic process
biological_process | GO:0043161 | proteasome-mediated ubiquitin-dependent protein catabolic process
biological_process | GO:0051865 | protein autoubiquitination
biological_process | GO:0006513 | protein monoubiquitination
biological_process | GO:0000209 | protein polyubiquitination
biological_process | GO:0006979 | response to oxidative stress
biological_process | GO:0006412 | translation
cellular_component | GO:0000151 | ubiquitin ligase complex
cellular_component | GO:0005840 | ribosome
cellular_component | GO:0005634 | nucleus
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005829 | cytosol
molecular_function | GO:0030332 | cyclin binding
molecular_function | GO:0020037 | heme binding
molecular_function | GO:0004601 | peroxidase activity
molecular_function | GO:0003735 | structural constituent of ribosome
molecular_function | GO:0031624 | ubiquitin conjugating enzyme binding
molecular_function | GO:0061630 | ubiquitin protein ligase activity