Sgr012060 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr012060
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: tig00153204: 181908 .. 195089 (+)
RNA-Seq Expression: Sgr012060
Synteny: Sgr012060
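
The location above can be turned into a span length. The following is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper, and the length calculation assumes 1-based inclusive coordinates (the GFF3 convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00153204: 181908 .. 195089 (+)")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(contig, strand, span_length)
```

Under that assumption the gene spans 13,182 bp on the forward strand of contig tig00153204.
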
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGGTAACCACTCTGAATTTTGTGTTAAATTTACGGGAAAAAATTATTCGGCATGGAAATTTCAATTCTGTTTATATGTTACTGGAAAAGAGTTATGAGGACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTCATGTCTTGGATTACTGGGTCATGTGATCCTCAAATTGTTCTTAATTTACGTTCCTATAGCATCGCTCAAACCATGTGGAACTATTTGAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATTTCAAATTATACATAGGGGAGTCTTTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTGCAGTATCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAGCAAGCGTGATCAGTTCTTGATGAAGCTACGATCAGATTTTGAAAATGCTCGTTCTAATTTGATGAATCGTCATCCTTCTCCTACCTTGGATGTATGTTTTAGTGAATTGCTTCGTGAGGAACAACACCTTCTTACACAAACCACTCTTGAACAGGAGAAAATGACTACAACACAAATGGCATTTTTGGCTCACGGGAAAAGCAAGGGTAAAGATATGAGTAAGGTTCAGTGCTTTAGTTGCAAAAATTATGGGCATATTGCAGCAAATTGTACCCAGAAATTTTGCAATTATTGCAAGAAGCATGGGCACATCATCAAAGATTGTTTCATTCGCCCTCAAAATCGTCAAAACACTGCTTTTCAAGCAACAGCTAGCACTTCTCCTACAGGTCAGTCTCACATTGGGCCCCCAACTATGTCAGCTGTTAATGAAAATGCTCAATCCACTGTTCTGACTCCTGAAATGGTGCAACAAATGATTGTTTCAGCCTTCTCAGCTCTGAAACTCCACGGTAATGGTAATTCTTTATCTAAGTCTTGGCTTGTTGATTCTGCTGCATCAAATCATATGACTAGTTCTGCCAATTTGTTACAAAATGTCCGACCCTATCATGGTTTGGAAAATATTCAAGTTGCTAATGGAAATCAATTACTGGTGTTGGTGATATTACTTCAGTTTTCAACAATGCTTTTCTCTCACCTGGACTTTCTGATAATCTTCTTTCTGTTGGCCAATTAGTAGACAATAACTGTAATGTCAATTTTTCTCGTAATGGTTGTTGTGTGCAGGATCAGGACTCGGGGACGGTGATCGCGAAGGGGCCTAAAGTCGGACGTTTATTTCCTTTGCATATTTCCATTCCTAGTAATGTGTCTCTAGCATGTTCTGTGGTTATCAATCAAAATGAGTTGTGGCATAAACGTTTGGGACACCCCAATTCTGCTATATTGTCTTACTTATTGACCTCTGGTTTATTAGGCAAAAATAATAAATTTTCAGGCCTGTCTTTTGATTGTTCAACTTGTAAATTGGGCAAAAGTAAAATTCTTCCTTTTCCCCTTGCTGGTAGTCGTGCAAATAAATGCTTTGATATTATTCATAGTGATGTATGGGGGATTACACCTATTATTTCTCATGCACACTATAAATATTTTGTGACATTTATTGATGACTATAGTCACTTTACTTAGATATATTTTCTTCGTTCCAAATCTGATGTTTTTTCGGTATTCAAGACTTTTGTTGCATATATTGAAACTCAATTTTCCACTGTCATCAAAGTTCTTCGATCTGATTCTGGTGGAGAATACATGTCTCATGCACTTCATGATTTTTTAAATGACAAAGGTATTCTCTCACAGCGCTCTTGTCCACACACCCCCCAGCAAAATGGTGTGGCTGAGAGAAAGAGCCGTCATCTCTTAGAAGTTTCCCGTACCTTGCTACTTGAATCTTCCGTTCCACCTCAGTTTTGGGTAGAAGCTTTGTCTACAGCAGTTTATCTTATTAATAGATTGCCTTCTCAAACTCTCAAC
CTTGAGTCTCCTTACTTTCGTCTTTATCAACAGCATCCTCTGTATGGGCATTTGCATACATTTGGTTGTGTTTGTTTCCTTCACTTGCCTCCTCTTGAACGTCATAAACTTTCTACTCAGTCTGTTAAATGTACATTTATGGGATATAGTCCAAATCATAAAGGTTTGTTTGTTATGATTCTTCTTCTCAAAAGCTTCGTGTCTCTCGCAATGTTGTTTTCTTTGAAAATCAGTTTTTCTTTCCTACTTGTGATCAGTCATTCTCCGATATTGCTATTCTTCCTAGCTTTGATGAAACGTCTTCTTCTCCTGAACGATTCAAGCCTGGATATGTGTATGAACAACGACATTCACCACCACCCCTTCCGACTCCAGATCCGTCACCTGATCCTGCTCCGACTCTCTTGAGACGGTCCACTAGAGTCTCCCGTCCTCCTAATTGGTATGGCTCCTATCATACATCCTTTAGTGCTGCTTTATCCTCTTTTTCAGTTCCATCTTCTTACTCACAGGCAGTTAAGCATGAGAGTTGGCAACAAGCAATGTAAGAAGAACTTCGAGCTCTTCAGGATAATCATACATGGGATATCGTTCCATGTCCACCCAAAATTAAACCTATTGGGTGTAAATGGGTTTATTCAGTGAAACTTCACTCTGATGGGACTCTGGAACACTATAAGGCAAGGTTGGTTGCTCTTGGTAATAGACAAGAATATGGGGTGGACTATGAGGAGACATATGCTCCTGTGGCTAAAATGACCACTGTGCGAACAATTCTTGCCATTGCTGCTTCACAGGGTTGGCCACTTAAACAAATGGATGTCAAAAACGCTTTTCTCCATGGTGATCTCAAAGAAGATATCTACATGACACCTCCTCCTGGCTTATTTTCCTCTTCTACTTTGGAAGTTTGTAAGTTGAATCGTTCACTATATGGCTTAAAGCAAGCTCCCCGTGCATGGTTTGAGAAGTTCCGATCTACCTTGTTACAATTTCATTTTGTGCAAAGTCAATATGACTCTTCTCTTTTTCTTCATAAAACTTCTACTGGTACTGTGCTTCTTTTGGTCTATGTTGATGACATAGTTATAACTGGACAAATTCTGTATTGATTACTCAGCTTCAGCAACATCTTCGAGAGTCTTTTCATATGAAGGATCTTGGTCATCTGACATATTTCTTGGGATTAGAAGTTCGATCTAACTCCTCTGGCATATTTCTTAATCAGCATAAATACACTCAAGATTTAATTGCATTGGCTGGTCTTCAGGATTCTTCTTTGGTTGATACTCCTCTCGAAGTAAATGTCAAGTATCACTCCGATGAGGGAGCACTCCTTTCTGATCCATCTTTGTATCGTCAATTAGTGGGTAGCTTAAACTACCTAACTATTACAAGACCTGACATTTCCTTTGCTGTTCAGCAAGTTAGTCAGTTTATGCACTCACCTCGCCATCTTCACTTGGCTGCAGTTCGTCGTATTATAAAATATCTTCGGGGTACTCCATGTCGTGGTTTATTTTTTTCCTCTGCGTCTTCTCTGCGCCTTAGTGCTTTTAGCGATGCTGATTGGGCTGGTTGCCCAAATACTCGTCGCTCTGTTACAGGTTGGTGTATGTTTCTTGGGAATTCTTTGATATCGTGGAAGAGTAAGAAACAAGATCGGGTCTCTAAATCCTCTACTGAATCTCAATATCGTGCTATGTCTGCTGCCTGCTCTGAGATCACTTGGCTTCGAGGATTGTTGACTGAACTGGGATTTCCTCAAACAAATCCTACCCCTTTACATGCTGACAACACCAGTGCTATTCAAATTGCCACTAATCCCGTTTACCATGAACGCACTAAGCATATTGAAGTTGATTGTCATTATATTCGAGACGCTGTCAATAATCGAGTTATTTCTCTTCCACATGTTTCCACAGCTTTGCAAATTGCCGATGTATTTACCAAGTCTCTTACCCGACAGCGCCATCAGTTTCTTATTGGCAAATTGATGC
TTCTTGATCCACCAGCATCAATTTGAGGGGGGATGTTGCCATATATAGCAAACATTTATTTTGTACAGCCTATTTATTTTGTATTCATATTCTCTTTCCATAAGTAGATTGTATAGCTGTAGGTAGAAATACATTAGCCTCAAATCTCTGTAAATCAGCCTTTAATTACTGTATATTCAGCTATATATTCAAGGAAATCAGAATGAACAGGCATTGGGGCCATTCAATCAAAATTCTCTCACAACAAAGGTTATCAGTGTTGTTACCAATTTCGATGAGTAGATGCGAGACCAAAACAAGAGAAGTAATACAAGATAATATGGACTAGAATTATCTTCTACAGAATGATGAACTGCAAATTGAGAGGTGCGAGAAGAATCGACTGTAACGTCTCCTATCGCCATGAAAGAAGAGTGGAGAAAACGAGAAGAAGAAGAGAAAATCCAAAGGACCAAGACAAAGACTTTTGATACCATATTAAGCTAGAGATGTGGAGAAAATTCTATTCATTTCTCAACAATATTACAATCCACGAATATAAGTTAGTTTACTGCCACGTGGCTAACTTACAAGATTAAAACAAAAACTAATTGTTTAACTAATAGAAATAACAAGAATAACTGAAGGTAAACTCTAATAGAAGCAACTTAACATGAAGAAGCTATTTTTTAGGGTGGCCCTAATACTCTAGGCCAATAACTTTTTCTGAATTTAATGAGTAGTTTTAATTTTTTAAAAAAAATTTATAGATAGATCAAAATCACTTTTCCGACCACGTTATATATCAGTTCAAATTGACCAGCTTTCAATCGATTTGATCTATTTTTTAAATTTGAATGTTCACCCCATTATTTACGGGGATATCATCGATATCAATATTTGATATTTTTGGCCTAAAACTTTAACCTTGATATTTTGTCAACGTGGACATTTAAAAACAAGACCCTCAAAGTCAAATAGCCCACAGGACATTTAAAAAAAAAAAAAAAAAAACATGTGGCTTTACACGCTTTGTTTATGCAACTGATATTTTGATCTTGAAGGCAGTTGATTCCGCTGACTGAACCGAGCCCCCGAAACCCATAAGACCCTTCTTCTCTTCTCTCTCAAATTGAGTTCGTCTTCTGAATACTTCTTTGGTTTCGTTTGCTCTCTCTCTCTCTCTCTACAATGGCTAGCAGTTGGAGAAGATCCTTCGGGAATGTCAGGTCCTTCATCGGCAATTCAATGGGCGGTCTCAGAGGCGGCAGCAATCTCGCTTCCTGGGTTGTCGCCGGAACCCTTGCCTACTTCCTCTGGGTCAAGCCCTCCCAAGACCTCAAACGCGAGCAGCAGGTTCGTCTCTCCCGGAAATTATTTCACTCAACTTTCGAAACGTTTCTGCTCCGTGCTGATTTCTGATCTTCTTCCCCTTCTTCTCTCCTGGATAGTTGTTTGCCATGTCCTTTGTAATGCTTTATTTATTCTCAGGAAAGGGCTGCTCTTGCTGCTTCGGATCCTCATCGGTATATTGAGAAAAGGAAACCCATTCCTGATCCCCAGGTAGTTCAATTTCTACTCTCTGACTTTTACTTCATGTGTGGCTGAATAACTTGCAATGATTGTGTTCTTTTGATGAGATGTTCAGGTTTCGTATATCATACGTTTCTTAAACAATCGAATTACTTAAGTTCGATCTCGTTTTCTTGCTTCCCCCAGGAAACTGGTTTACTATACGGAAACAAGAATACACCTCGAAAACAGGAGGAATAATCTGTAGAGAGTGAAAACTTTGTAAGTTTTGTTTATTTTGTTTGGTAAATTGACCATGAGAAGGTTCTCTACGTGATGTATTTCTAACGTTCTTGGAACTATATTTTATGTCATGTTTATTTAGGCTAAATATATCATTTCATCCTATAAGTTTACACTTACACCCATTTTTCAGTTTTGTTTTCAATTTTATTCTCAAGGTACATGAAAATTTTCAATTAAGTTTCTACTCAATAACATCATCGAAA
TTGAAAATTGAATAAAGGAAGAGTGATGTAATATTTACAAACAAAGTTTTCTTTGAGCTGACCTCATCCTGCTAGATTAAAAAAATCCTACTGGTCCTATCCAATTAAAAAATCCAACCCTTTCGGGTAGAGTGAGATGGATATTTAATATTCATAGGTTATAGAAAATAAATGCCATGTTATCATTTCTTTGTTCAATTTGGAAATTGTATTGAGTAGAGATTTGATTCAGAATTTTTTATATTTTGAAGGTGAAATTGAACTTTTAAATTTAAGGACAAAATTGAAAATGGTATATACTTTAGAGGCCAAATTATATATTTAGCCCATTTTTATGATACGATAAAAAATTTCATGAAAAGAATGAAAAATATGAAATTTGATAGGGCACTTCAAGGAGAGGCAGCTGGTCAATAAGATCCCAAAGCTCCAAAACATGCCTATCTTTCCCTGAAAAGCTTTTATGTTCCTTTCAGACCAAAGTTTTTATGAAATAGCTATAATAGTTAAAAAAAAAAAGAGGTTGCCCTGCATATTAAAGCATTTGCATTTAGAAATCCTATAGAAAACTTGGACACTGAACTAGTTAGAAAGTCTCTCTCCACATATTGATGGAGGAAGAATGTCACAAAAATAAATGGTAAGTTTTGATAGACCCAAGTAACCAAGGTGTATAGGCGTAGGCGTGTGAAGAATGGGGGGAACGCGTAAGAAGGGGGGCGGTATATAATATCCTAGTGGGGCCCAGAGTATATTGTTAGTTGAGGGAAAATATATAGTGAGTTAGTTGTGGAGGGGGGATTATTTTGATAGTTTTCTTATGCATTCTCTATTGAGAAGAGGGAGAGGCGGGGAGCTCTCAAATCTCCCGGCTTGTAATACTGTTGATAAATATATTGATAGGTTGAGGTCTATCAAGTTTCCTCACGACTTATTGCATTTGTTCCACCTACCAGGAAGAGTCTAAACAAACTCCTCCACTTTCTTCAAATAATTGGCCTTCCATATCTCGTTGCTCAATAGCAATCTCTGTTGCTGCTAAGTTTTGGAAGAGTGAGACTTGCTAATAAAGCGATGGTGAATTTTAAAGCTTTCACATCCTTTACCCATGAAAAGTATCCACCCAGCCTCTCTCACGCATTTAAATTTTTCTTGGAATTTAGAATCCATCTACCAGGAATCTGGCTCCAACAATCATGTATGGACTTGTTTTTATTTATTCACAAGAGCTTTTAGCCTTGTAAATGTGCAGCAAAAATGGGGTGTTGCAATCCAACTTTTCTAGAATCTGATAATCGATCTATCACTCACTAGCAAAGTTAACCTAACAAAAATTAGACTGTCATCTGGCAATAAAATGGAATATGCTCCTTTAGCTACTATAATAAAGCAGGAAGTGAACCAACTAAACCATCGTATGCCATTCTTGCTAATAATGTTCTTGAACAATAGAGCTTCTTTTCCACTTGTTCCACAGTCACTTGGCAACAAAGCGTTATTTCTCTGCCTTATAATGCCAGGCTCTCCTACTTCAAATGAAGGGAAATAAGCTTCCAGTTCATTTGATGGCTACCTCTGTTTATATTTGCATCTCTTATAATCTCTTCAATCCCTTTGGCCATATTGTAAGGAATTTTAAATAGTAAATGGAATAAGAGGAAATGTGAAGTTGTCTATTAGCACATTTCAGTTTTGTATTTTATTGCTGCTTCTGTGACCTACTTACAGGGGCGATGTGAAGTAGTCCATTAAAAAGTGTGTTAGCCAGTAGACTGTTGGGCCGTTAGTTACTTAGCAAGTTCTTGTTTTTAGTTAGCTAACTAATCCGATATGCATTGGAGCCTTTATAAGGTTTGGGATGTAATATATCATCTTTGGAATGAAGAAAATATTAGTTGCATTGGGATATTTGACAATACAGCGTGGGAAATGGTAGCAGGGTGAGTCTTTCACCTCCTGAAATAGAAATTTTGTTTTTCCACTTTTGCACTTTATTCACA
ACTTTTTCAACCATGTGTACTCAAAACTCCATTGCACATAGCAATGCCTATTAAGGAATGCCGAGATATTTGATGGACCACTGGCCAACACTTTGGGAGAACCAGTCAACTATCTCTTTCACTTCCTCTTGGGAACAGTTGATGGCGACAACCACTTATGTAGTAAGATTCACCTTTGACTCATAGGCTAATTAAAAGAATTTCACACCTTAGACAACATTTTGCAAAGAGCTTCCTCTCATGTCTTAGGCCCAAGTTCGTGCGTTCTTTTCACGAGTGATTCTGAGGACCTAAATCATTTGTTGTGGTTATGTCCTTTTACCACTAGATTATGGAGGCATATGTGGCAATATTTCAATTTAGCCCATGTGTCGAGTATCCAAGAGATGCTGGAGGGGGTTATTCAAGACCCCTTTTCAAGGAAGGGGTCATGTTCTTTGGCAGGCTGATTCCTTTGTAGTTTTATGATGGTTTATTTGGTTGGAAAGGAATATCAGAGTGTTTTTGTGGAAAGATAGTTCGCGGGAACATGTTTGGAATTTGGTGCATTTTCACACTTCTCTTTGGGCTCTTAATTTAAAATTATTTTTTTATATCTTTTTTTTTAACCAAAAGTTGGGGATCTAACGGTAAGTTTGTTCTCCCCATTTTGGTCAGGTTGTTTTTGCAAGCCTTTTCTTGTTTTTTATTCTTTCATCTTCTTAATAAAAGTTGGTTTTTCAATTAAAAGAGTGTATCTGTTTTGATTGAGAAGTTTTCCTTGAGACAAAAATGTATCTGTTCGGTCATAGGCATCAATTGTGAGTCCTCTAAGATTAGTAGATGGGCATCCTTAGCAGGTTGTGATGTGGGTACTTTTCCTACCTCTTGGTCTTCCTCTTGGTCATAACCTCAAGTGGGCTTTGTTTAGGAGAACTGACTTGGAAAAGGTCCAAGAGCGTCTTTCGTCTTGGAAAATGGCATTCTTCTCTAAGGGGGTAGGTTGACTCTTATTCAATCTGTTTTAAGTGGAATCTCGACTTACTTTCTGTGTTTGTTTAGAATTCTTGTAGTTGTCAGTAAGAGCTTGGAGAAGATTTTGAGAGACTTTCTATGGGAGGGTGTAGAGGAGGATGGAGGTCCCCATTTGATTAGTTGGGAGGTGGTGGCGAGACCAGTTGACTTTGGAAGGCTGGGCATAGGTAATTTGAGACTTTGTAATGAGGCCATGCGGTCTAAATGGTTGTGGCATTTACCATTGGAGTCGAACACTCTGTGACATAAGGTCATAGTCAACATGTACGGTCCTCACCCTTTGGAGTGGGCCTCACCTATGGGGGTCTAAAGGCATATTCAGGAACTCCTAGAAAGCTATCTCTTTGGGTGTTCCCTTGTTTTCTAACTTTGTGAAATGCATTGTTGGGGATGGCTCAAACACTTATTTCTAGAAGGATCGGTGGTTGGGTGATAGTCTTCTTAGGATCTTGTTCCCTCATTTATACCACCTTTCTTCCTTGAGGACACATTTATACCACCTTTCTTCCTTGAGGTTATGTCCTGTGACATCAATCATCTCATACTCAGGCAATTCTTATTCTTTTAATTTTAGTTTTGCCGCTCTCTTTTGGATAAGGAGGCGTTGGTTGTATCTGCACTTTTTTCTCTTTTGGGTGATTTCCATCCTCACTCGTAGTAGAGATATTTGTTTTTGGTCTCCCAACCCCTCTAGAGGGTTCTCTTGCCATTCTTCCTTCTCCTTTGGGCAATAGCATACATCTAGAGACCCTTTATGTTCCTCACTGTGGAATCCCAAGAAAGATAAGTTTTTTGCTTGGCAGGACTTATATGGGAGAGTCGACACTATGGTCTGTATTCAAAGGCACTCTCCCTTCTTGTTGGGGCCGCGATGGTACATTCTTTGTAGGGAGGCGGCTGAGGATTTGGGTCATATCCTTTGGAGGTGCAAGTTTTCTCTCGCGATTTAGAATTCTTTCTTAGAGATTTTTGGGGTGTGCTTGGCT
TGTAATAGCGACGGTTGTCGTATGATGGAGGAAGTTCTATTCCATGCACCCTTTCATGATAAGGGGCGTTTTCTCTAGCATGCTAGGTTTTTTGTTGTTTTATGGTGTATTTGGCTTCAGAGGAATGAGAGAATTTTTAGAGGGATTGAGAGGTCTTGGGAAGAGATGTGGGCTCTTGCCAAGTTTAATGTCTCTCCTTAGGCGGCGTATGTTTCTAAGGACTTTAGTAGTCATCCTCTCGGTCTTATTATTTTGGATTAGAGTTTTAGCTTGTTAGGTGGTTTTGGCTCCCTACTATGGGCTGTGGTTTTTGTAAGCCCCTTTTGTATTCTTTCATTTTTCTCAATGAAAGCTCAGTTTTTGTATTAAAAAAAAAGTCTATATTTCTAGACAAAAAGTTTTTAAGGCAATCTGCCACTGAAACGAAAAAGGTTTCCTTGTCTAAGACCTCTAGAAGTTGGAGTTTTCCCTTAACTTTCATTCAAAATAATCAGAAATTTGTGGAAGAGGTTACTTAGAAAGAAAACACCTTTCTCTATTCAAATTCCTTTTTTCTTTTTTGTGTGTATTTGACCATAATAACATTAAGGATGGATGGACCAAGCGACCATTTCATAGGCTTTCTCCAAGTTGATCTTGAAAATATTACACTTTTTCTTTGTCCTTTTCCATTCATTTGTTACCTTATTGTTATTAGTAGCCAAGTTTGGGTCAACAATTCGTTTGTTATCTACAAAAGCTTCATAGAAAACAGAGATAGTGAAGGGCAACATTACCTTTGTTTTGTTAGCCAAGACCTAGCATGATTTTATAAGAAATGGAAGAAAACTGATGAGAAAAGAATTTAAAAAAATTATAAGGAAAAGTATAAAGGAAAAAATTATTGGAAAAATGATATGGAATAAAACGAATATAAATTCAACCGTCACTAGGTTTGCTCAAATTAGTCGTTATTTGTGTCAGTGTTTTCAAGATGAGCCTAGGTGATTGACTAAGACAATGGGCGAGGCAAGGAGGTATTTCATGAACTAACTAGACTTATTGAAGTCAATATTCACTACAACAAATTTAAGATTTAATAGCTTTAAACTATAGCATTATAAAAAAATGCTATAGAAAATTTCAAAATTTTGAGGTATGCGCCTACGTTCCCGTCTACGTAAGTAGCGCACGCTATTGAAAAGGAAAATTAGGGAGGTCACTTTATTTTCTCGGTTTGCCTTTCATTTCTCTTCCATACCCTATCATACAGTTTCTCGAAAAAATCAACAAATCGCCTCAGCCTCTTCAATCTTTCTCTGGAAAAACATCTCTCAGGCATTCCTCCAAGCTTCCATGATTTCTTGTCCCTCAAATCTTCCACACCTTCATCGATTCAGTGTTGGTTGATTGTTTTGAAGGAGATGTATTTTGACTGTTATCCAAAAGTATCGATTTGCATTTTTTTTTGTTTTCATTTCTTGCTTGCTCAAGATTGTGTATGTTTCTTTATCTTTGTTCTTTTTTGATTTTCAACTCTCGCAAGCTGTGTAGTCTGATTAGTTTGTGTTAGATTTTCTTTAAATTGATGATTCAATGCCACTGTAATATATGTGTCTACTGTCTATGTATATTTTTGCTAGATTTTGTCTTATTTTGTTGCAATAACATTCTTTATTTTTAAGTATGAACATATTATTTGTAAGTGCGCAAGGACTATTCTTCTCTATAATCTATTTGTGCTTGATTTTCTTTAACTTTAGGCTGAAATGTTGTTGAAACGTATGTATAATTTTGTATGCATATATTTTTGCTAAATATTGTCTTATTTTTCTAGATATTCTCTTGTCGAAATGTATGTTTATATGTAGGCTATTGTGTTCGTTGTCCCTCTTTCTCTTCTTTTATTTCATTTCAGTTCTTCTCCCTTTCTTCCTCTCTTTTCTTCTCCTCTAGGTACCGTATCTATATATTTTTTCTCTCTCCTTGTTTTCTTCATCCTTTCCATTGCTACTACTTTC
TTTTATTTCTTTTTCTCTTTTTATTTGTCTCTCTTGGTTAAAGTCTGATACTCTTTGTACTATTATTTAGTTAGAGACGCTAATTGTCACTGGAGTGGGTTTTGCTAGCATGACAACGGTCAAGAAACCTCAAAAAAGCATCTTTTATTTACCACACAATTAATCCCGTCTTTGTTAAGAATTGATAATCTTAACTACTCCGTTTGAAAATCATGAAGTAGACAACAAAAACAAACCAAATTTCTTAGTTTTGTTTTGCTTAAACTATGCACTCACTTGTCAAATGGGAGGGTTATTTAAGACATGTAAATCTTGAACACATCCAAACATTCCTTAAACTTATCTTACTGTTTTTTTTCTTTTCTGTCTGAAAATGGTGTTTTTTTACCTTTAGTGATTGTTCCAACTGTTATGGGAACTTGGAAAGACAAGACTAAGCATTTTTAAATATTATATTTAATGAGATTGTTAAACAATATCCTTGGACTTGTATCTTTTTCTTTCATATCAGAATGATTGAAGTGATTAAATAAGGAACCGTGATTAAAGAAAAAAAACGAATATTTTTTATTGAAAAAAGGAGAAAGGTTCTTCCATAGAATGACATAATCTACTTTCTTTTCTCTTCCTTTTTCTCCTTCACCAACTTACACTATTTAAAAAGAGCTTAGAGCGGCCTCAAACCTTCATTCTTAAATATAAAAACCTCTAAAAACTTTCTTATCTTCTGCATTGTCCTTCCCCTTCATACTTAACTAGATTTTTTATTCATTTTTTTGTTCCAATGGCTACCTCCAAAGCCTCGGATAAAGCCTACCCAGTTTATGAAACTGCTCAATCTCAAATGGGTTTTGCTCTAATTCAAGGAAATTCACCTCTAATATCTAAACCTGGGGCAAATATGCTCTTGAAATTAGAGATCCAACAACCAAAGAGAGGAAATGGCTTGGCACTTTTGACACTGCCCATGTAACAGCTTTAGCTTATGACAGAGCTACCCTGTCAATGAAGGGCACCCTAGCAAGAACCAACTTCATTTACTCTGACAGCTCAACTTTCCACTCTCTTCTCACTGCTCTTGATGTCCAAGCTTTGCTTCCTTCTGATTCTCCTCATTCCAAGCAACACTCCCCATTGGCAACCAAAACACCCCATTTTCTCAAGTCAGCCTTCCACTGCTGA

mRNA sequence

ATGGCTGGACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGGGAGTCTTTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTGCAGTATCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAGCAAGCGTGATCAGTTCTTGATGAAGCTACGATCAGATTTTGAAAATGCTCGTTCTAATTTGATGAATCGTCATCCTTCTCCTACCTTGGATGTATGTTTTAGTGAATTGCTTCGTGAGGAACAACACCTTCTTACACAAACCACTCTTGAACAGGAGAAAATGACTACAACACAAATGGCATTTTTGGCTCACGGGAAAAGCAAGGGTAAAGATATGAGTAAGGTTCAGTGCTTTAGTTGCAAAAATTATGGGCATATTGCAGCAAATTGTACCCAGAAATTTTGCAATTATTGCAAGAAGCATGGGCACATCATCAAAGATTGTTTCATTCGCCCTCAAAATCGTCAAAACACTGCTTTTCAAGCAACAGCTAGCACTTCTCCTACAGGTCAGTCTCACATTGGGCCCCCAACTATGTCAGCTGTTAATGAAAATGCTCAATCCACTGTTCTGACTCCTGAAATGGTGCAACAAATGATTGTTTCAGCCTTCTCAGCTCTGAAACTCCACGGTAATGGTAATTCTTTATCTAAGTCTTGGCTTGTTGATTCTGCTGCATCAAATCATATGACTAGTTCTGCCAATTTGTTACAAAATGTCCGACCCTATCATGGTTTGGAAAATATTCAAGTTGCTAATGGAAATCAATTACTGGTGTTGGATCAGGACTCGGGGACGGTGATCGCGAAGGGGCCTAAAGTCGGACGTTTATTTCCTTTGCATATTTCCATTCCTAGTAATGTGTCTCTAGCATGTTCTGTGGTTATCAATCAAAATGAGTTGTGGCATAAACGTTTGGGACACCCCAATTCTGCTATATTGTCTTACTTATTGACCTCTGGTTTATTAGGCAAAAATAATAAATTTTCAGGCCTGTCTTTTGATTGTTCAACTTGTAAATTGGGCAAAAGTAAAATTCTTCCTTTTCCCCTTGCTGGTAGTCGTGCAAATAAATGCTTTGATATTATTCATAGTGATTCATTCTCCGATATTGCTATTCTTCCTAGCTTTGATGAAACGTCTTCTTCTCCTGAACGATTCAAGCCTGGATATGTGTATGAACAACGACATTCACCACCACCCCTTCCGACTCCAGATCCGTCACCTGATCCTGCTCCGACTCTCTTGAGACGGTCCACTAGAGTCTCCCGTCCTCCTAATTGGTATGGCTCCTATCATACATCCTTTAGTGCTGCTTTATCCTCTTTTTCAGTTCCATCTTCTTACTCACAGGCAGATTCTTCTTTGGTTGATACTCCTCTCGAAGTAAATGTCAAGTATCACTCCGATGAGGGAGCACTCCTTTCTGATCCATCTTTGTATCGTCAATTAGTGGGTAGCTTAAACTACCTAACTATTACAAGACCTGACATTTCCTTTGCTGTTCAGCAAGTTAGTCAGTTTATGCACTCACCTCGCCATCTTCACTTGGCTGCAGTTCGTCGTATTATAAAATATCTTCGGGCAGTTGGAGAAGATCCTTCGGGAATGTCAGGTCCTTCATCGGCAATTCAATGGGCGGTCTCAGAGGCGGCAGCAATCTCGCTTCCTGGGTTGTCGCCGGAACCCTTGCCTACTTCCTCTGGGTCAAGCCCTCCCAAGACCTCAAACGCGAGCAGCAGGTTCGTCTCTCCCGGAAATTATTTCACTCAACTTTCGAAACGTTTCTGCTCCGTGCTGATTTCTGATCTTCTTCCCCTTCTTCTCTCCTGGATAGAAAGGGCTGCTCTTGCTGCTTCGGATCCTCATCGGTATATTGAGAAAAGGAAACCCATTCCTGATCCCCAGGCTCT
CCTACTTCAAATGAAGGGAAATAAGCTTCCAGTTCATTTGATGGCTACCTCTGTTTATATTTGCATCTCTTATAATCTCTTCAATCCCTTTGGCCATATTGTTTGGGATGTAATATATCATCTTTGGAATGAAGAAAATATTAGTTGCATTGGGATATTTGACAATACAGCGTGGGAAATGGTAGCAGGAGATCCAACAACCAAAGAGAGGAAATGGCTTGGCACTTTTGACACTGCCCATGTAACAGCTTTAGCTTATGACAGAGCTACCCTGTCAATGAAGGGCACCCTAGCAAGAACCAACTTCATTTACTCTGACAGCTCAACTTTCCACTCTCTTCTCACTGCTCTTGATGTCCAAGCTTTGCTTCCTTCTGATTCTCCTCATTCCAAGCAACACTCCCCATTGGCAACCAAAACACCCCATTTTCTCAAGTCAGCCTTCCACTGCTGA

Coding sequence (CDS)

ATGGCTGGACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGGGAGTCTTTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTGCAGTATCTAAAGAATCTCTTACTGATGTTTTGGCTATTCATGAGATTAGCAAGCGTGATCAGTTCTTGATGAAGCTACGATCAGATTTTGAAAATGCTCGTTCTAATTTGATGAATCGTCATCCTTCTCCTACCTTGGATGTATGTTTTAGTGAATTGCTTCGTGAGGAACAACACCTTCTTACACAAACCACTCTTGAACAGGAGAAAATGACTACAACACAAATGGCATTTTTGGCTCACGGGAAAAGCAAGGGTAAAGATATGAGTAAGGTTCAGTGCTTTAGTTGCAAAAATTATGGGCATATTGCAGCAAATTGTACCCAGAAATTTTGCAATTATTGCAAGAAGCATGGGCACATCATCAAAGATTGTTTCATTCGCCCTCAAAATCGTCAAAACACTGCTTTTCAAGCAACAGCTAGCACTTCTCCTACAGGTCAGTCTCACATTGGGCCCCCAACTATGTCAGCTGTTAATGAAAATGCTCAATCCACTGTTCTGACTCCTGAAATGGTGCAACAAATGATTGTTTCAGCCTTCTCAGCTCTGAAACTCCACGGTAATGGTAATTCTTTATCTAAGTCTTGGCTTGTTGATTCTGCTGCATCAAATCATATGACTAGTTCTGCCAATTTGTTACAAAATGTCCGACCCTATCATGGTTTGGAAAATATTCAAGTTGCTAATGGAAATCAATTACTGGTGTTGGATCAGGACTCGGGGACGGTGATCGCGAAGGGGCCTAAAGTCGGACGTTTATTTCCTTTGCATATTTCCATTCCTAGTAATGTGTCTCTAGCATGTTCTGTGGTTATCAATCAAAATGAGTTGTGGCATAAACGTTTGGGACACCCCAATTCTGCTATATTGTCTTACTTATTGACCTCTGGTTTATTAGGCAAAAATAATAAATTTTCAGGCCTGTCTTTTGATTGTTCAACTTGTAAATTGGGCAAAAGTAAAATTCTTCCTTTTCCCCTTGCTGGTAGTCGTGCAAATAAATGCTTTGATATTATTCATAGTGATTCATTCTCCGATATTGCTATTCTTCCTAGCTTTGATGAAACGTCTTCTTCTCCTGAACGATTCAAGCCTGGATATGTGTATGAACAACGACATTCACCACCACCCCTTCCGACTCCAGATCCGTCACCTGATCCTGCTCCGACTCTCTTGAGACGGTCCACTAGAGTCTCCCGTCCTCCTAATTGGTATGGCTCCTATCATACATCCTTTAGTGCTGCTTTATCCTCTTTTTCAGTTCCATCTTCTTACTCACAGGCAGATTCTTCTTTGGTTGATACTCCTCTCGAAGTAAATGTCAAGTATCACTCCGATGAGGGAGCACTCCTTTCTGATCCATCTTTGTATCGTCAATTAGTGGGTAGCTTAAACTACCTAACTATTACAAGACCTGACATTTCCTTTGCTGTTCAGCAAGTTAGTCAGTTTATGCACTCACCTCGCCATCTTCACTTGGCTGCAGTTCGTCGTATTATAAAATATCTTCGGGCAGTTGGAGAAGATCCTTCGGGAATGTCAGGTCCTTCATCGGCAATTCAATGGGCGGTCTCAGAGGCGGCAGCAATCTCGCTTCCTGGGTTGTCGCCGGAACCCTTGCCTACTTCCTCTGGGTCAAGCCCTCCCAAGACCTCAAACGCGAGCAGCAGGTTCGTCTCTCCCGGAAATTATTTCACTCAACTTTCGAAACGTTTCTGCTCCGTGCTGATTTCTGATCTTCTTCCCCTTCTTCTCTCCTGGATAGAAAGGGCTGCTCTTGCTGCTTCGGATCCTCATCGGTATATTGAGAAAAGGAAACCCATTCCTGATCCCCAGGCTCT
CCTACTTCAAATGAAGGGAAATAAGCTTCCAGTTCATTTGATGGCTACCTCTGTTTATATTTGCATCTCTTATAATCTCTTCAATCCCTTTGGCCATATTGTTTGGGATGTAATATATCATCTTTGGAATGAAGAAAATATTAGTTGCATTGGGATATTTGACAATACAGCGTGGGAAATGGTAGCAGGAGATCCAACAACCAAAGAGAGGAAATGGCTTGGCACTTTTGACACTGCCCATGTAACAGCTTTAGCTTATGACAGAGCTACCCTGTCAATGAAGGGCACCCTAGCAAGAACCAACTTCATTTACTCTGACAGCTCAACTTTCCACTCTCTTCTCACTGCTCTTGATGTCCAAGCTTTGCTTCCTTCTGATTCTCCTCATTCCAAGCAACACTCCCCATTGGCAACCAAAACACCCCATTTTCTCAAGTCAGCCTTCCACTGCTGA

Protein sequence

MAGHIDGTTPAPTDATQLAQWKIKDARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHSPPPLPTPDPSPDPAPTLLRRSTRVSRPPNWYGSYHTSFSAALSSFSVPSSYSQADSSLVDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAAVRRIIKYLRAVGEDPSGMSGPSSAIQWAVSEAAAISLPGLSPEPLPTSSGSSPPKTSNASSRFVSPGNYFTQLSKRFCSVLISDLLPLLLSWIERAALAASDPHRYIEKRKPIPDPQALLLQMKGNKLPVHLMATSVYICISYNLFNPFGHIVWDVIYHLWNEENISCIGIFDNTAWEMVAGDPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALLPSDSPHSKQHSPLATKTPHFLKSAFHC
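
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), `translate` is a hypothetical helper, and only the first 45 and last 9 nucleotides of the CDS listed above are used.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_prefix = "ATGGCTGGACATATTGATGGTACTACTCCAGCACCAACAGATGCT"  # first 15 codons of the CDS
cds_suffix = "CACTGCTGA"                                      # last 3 codons of the CDS
print(translate(cds_prefix))  # MAGHIDGTTPAPTDA
print(translate(cds_suffix))  # HC
```

Both results agree with the start (MAGHIDGTTPAPTDA...) and end (...HC) of the protein sequence listed above. The final TGA is a stop codon.
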
Homology
BLAST of Sgr012060 vs. NCBI nr
Match: KAA8529702.1 (hypothetical protein F0562_034198 [Nyssa sinensis])

HSP 1 Score: 438.7 bits (1127), Expect = 1.1e-118
Identity = 250/495 (50.51%), Positives = 299/495 (60.40%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GH+DG+ PAPTD  +L QWK+KDAR                                   
Sbjct: 7   GHVDGSDPAPTDPMKLVQWKVKDARVMTWILGSVDPLLILNLKPHKTAKSMWEYLKKVYH 66

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LS+Q+Y+ GFQNLWAEFSDIV A VS ESL+ V A+HE 
Sbjct: 67  QDHSARRFQLETDLAAYSQGTLSVQEYFCGFQNLWAEFSDIVYANVSAESLSAVQAVHEA 126

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           SKRDQFLMKLR +FE+ RSNLM+R PSP+LDVCF  LLREEQ LLTQ++L QE +    +
Sbjct: 127 SKRDQFLMKLRPEFESIRSNLMHRDPSPSLDVCFGALLREEQRLLTQSSLPQENV----V 186

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
           A+ A GK +G+DM  VQC+SCK+YGHIA +C +KFCNYCK+ GHIIK+C  RPQ R   A
Sbjct: 187 AYAAQGKGRGRDMRTVQCYSCKDYGHIAVHCAKKFCNYCKQKGHIIKECPTRPQIRPVNA 246

Query: 243 FQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK 302
           + ATA T  T         +++V+ +  +T  LT EMVQQMIVSAFSAL L G G   S+
Sbjct: 247 YHATA-TGHTSDGVTSTQNLASVSPSTAATPALTLEMVQQMIVSAFSALGLQGKGTIPSQ 306

Query: 303 SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL--------------------- 362
            WLVDSAASNHMTSS  +L NVR Y G  NIQVAN + L                     
Sbjct: 307 PWLVDSAASNHMTSSPTILSNVRKYTGSSNIQVANDHLLPITGVGDIAPSLTNIFVSPGL 366

Query: 363 ------------------------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACS 396
                                    V D  SG  IAKGPKVGRLFPL+ SIPS +SLAC+
Sbjct: 367 STSLISVGQLVDDNYNVQFSRDGCHVQDPVSGRTIAKGPKVGRLFPLYFSIPSIISLACT 426
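
The percentages in an HSP summary line are the identical or positive-scoring columns divided by the alignment length, and the alignment length includes gap columns. The sketch below recomputes them for the Nyssa sinensis hit above. `hsp_percentages` is a hypothetical helper written for this text layout, not part of any BLAST tool.

```python
import re

def hsp_percentages(stats_line):
    """Recompute identity and positives percentages from an HSP summary line."""
    # The pattern also tolerates the misspelling 'Postives'.
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", stats_line)
    if m is None:
        raise ValueError("no HSP statistics found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 250/495 (50.51%), Positives = 299/495 (60.40%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)  # 50.51 60.4
```
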

BLAST of Sgr012060 vs. NCBI nr
Match: XP_021654098.1 (uncharacterized protein LOC110645300 [Hevea brasiliensis])

HSP 1 Score: 428.3 bits (1100), Expect = 1.5e-115
Identity = 289/705 (40.99%), Positives = 359/705 (50.92%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDARGSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDV 62
           GH DG+ PAPTD+ +L QW +KD RG+LSIQ+Y+SGFQNLWAEF+D+V A V  ESL+ +
Sbjct: 36  GH-DGSDPAPTDSKELLQWNVKDTRGNLSIQEYFSGFQNLWAEFTDLVYAKVPAESLSVI 95

Query: 63  LAIHEISKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEK 122
            AIHE SKRDQFLMKLRSDFE  RSNLM+R PSP+LDVCF ELLREEQ  LT++T +QE 
Sbjct: 96  QAIHEQSKRDQFLMKLRSDFETIRSNLMSRDPSPSLDVCFGELLREEQRPLTKSTFKQE- 155

Query: 123 MTTTQMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQ 182
             T  +AF+A GK KG+DM+ + C+SCK YGHIAANC +KF NY K+ GHIIK+C  RPQ
Sbjct: 156 -NTVMVAFVAKGKGKGRDMNNILCYSCKEYGHIAANCGKKFYNYYKQLGHIIKECPTRPQ 215

Query: 183 NRQNTAFQATASTSPTGQSHIGPPTMSAV-NENAQSTVLTPEMVQQMIVSAFSALKLHGN 242
           NR+  A  A  ++S    +H   P +S   +  A+  VLTPEMVQQMIVSAFSAL L  N
Sbjct: 216 NRRANASSAAMNSS----NHFAAPAISTTPSTAAEPVVLTPEMVQQMIVSAFSALGLQVN 275

Query: 243 GNSLSKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGP 302
             + S+ WLVDSAASNHMT+S+++L+NV  YHG   IQ+AN + + +      T   K  
Sbjct: 276 DIASSQFWLVDSAASNHMTNSSSMLKNVHKYHGSIEIQIANESNIPITKVGDLTPSFKNI 335

Query: 303 KVG-RLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSG 362
            +  +L    IS+   V   C V  +            N  ++   ++  ++ K  K S 
Sbjct: 336 FISPKLSTKLISVDQLVDNNCDVHFSH-----------NGYLVQDQVSGTVIAKGPKPS- 395

Query: 363 LSFDCSTCKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGY 422
                                                S I ILP+F++  S P  FKPG+
Sbjct: 396 -------------------------------------SVITILPTFEDLPSLPNWFKPGF 455

Query: 423 VYEQRH--------SPPPLPTPDPSPD------PAPTLLRRSTRVSRPPNWYGSYHTSFS 482
           VYE+R          PP  PT +P+ +      P   +LRRSTRVSR PNWYG     FS
Sbjct: 456 VYERRQQTLLFLETDPPLAPTSEPTFEISYELAPPEPILRRSTRVSRAPNWYG-----FS 515

Query: 483 AALSSFSVPSSYSQA--------------------------------------------- 542
             LS  SV S YSQA                                             
Sbjct: 516 TTLSDISVASCYSQASKHECWQKAIQGELQLRSNGTLDRYKARLVALGNKQEYGVEYEET 575

Query: 543 ------------------------------------------------------------ 557
                                                                       
Sbjct: 576 FAPVAKMTTVRTVIAIATSQVWPLNQMDIKNAFLYGDLKEDIYMVPPPEDIVITKTDSSL 635

BLAST of Sgr012060 vs. NCBI nr
Match: TXG67369.1 (hypothetical protein EZV62_008644 [Acer yangbiense])

HSP 1 Score: 424.5 bits (1090), Expect = 2.1e-114
Identity = 252/508 (49.61%), Positives = 293/508 (57.68%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GHIDG+ PAPT+  +LA WK+KDAR                                   
Sbjct: 36  GHIDGSDPAPTEPKELANWKVKDARVMSWILGSVDPLIVLNLRPYKTAKTMWEYLLKVYH 95

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LSIQDY+S FQNLW EFSD+V A V   SL+ V A+HE 
Sbjct: 96  QDNTACRFQLEYEIANYTQGNLSIQDYFSSFQNLWGEFSDMVYAKVPAASLSAVQAVHEQ 155

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           SKRDQFLMKLR +FE  RSNLMNR PSP+LDVCF ELLREEQ LLTQ   +Q+      +
Sbjct: 156 SKRDQFLMKLRPEFEITRSNLMNRDPSPSLDVCFGELLREEQRLLTQAMFQQDS-NPNPI 215

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
           A+ A+GK KG+DM KVQCFSCK YGHIAANC +K CNYCKK GH IK+C  +PQNRQ TA
Sbjct: 216 AYAAYGKGKGRDMRKVQCFSCKEYGHIAANCAKKSCNYCKKQGHFIKECPTQPQNRQATA 275

Query: 243 FQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS 302
           +QA  +TS        P   S  +     + LTPEMVQQMI+SAFSAL L GN  +LSKS
Sbjct: 276 YQAAVNTSSV------PKMPSTSSSTDGLSALTPEMVQQMIMSAFSALGLQGNDTTLSKS 335

Query: 303 WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFP 362
           WL+DSAASNHMT S++ L N                     DQ SG ++AKGPKVGRLFP
Sbjct: 336 WLIDSAASNHMTRSSDTLCN---------------------DQVSGKILAKGPKVGRLFP 395

Query: 363 LHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLG-KNNKFSGLSFDCST 422
           LH SIPS +SLAC  V +QNE+WHKRLGHPNS +LS++L SGLLG K   +  LSFDC  
Sbjct: 396 LHFSIPSCLSLACMTVNSQNEVWHKRLGHPNSVVLSHMLNSGLLGNKEQVYKNLSFDCFV 455

Query: 423 CKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHS 456
           CKLGKSK L FP  GSRA                                          
Sbjct: 456 CKLGKSKTLSFPPHGSRAANF--------------------------------------- 470

BLAST of Sgr012060 vs. NCBI nr
Match: KAG6501099.1 (hypothetical protein ZIOFF_040967 [Zingiber officinale])

HSP 1 Score: 409.1 bits (1050), Expect = 9.2e-110
Identity = 233/505 (46.14%), Positives = 295/505 (58.42%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GHIDG+  AP +A  L QW+ KDAR                                   
Sbjct: 102 GHIDGSLMAPENAKDLGQWETKDARIISWLLGSIEAHMVNNLRPFNTTKEMWDYLKRIYH 161

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G LSI+ YYSGF NLW E+S+I+ + V KE+L  + AIHE+
Sbjct: 162 QDNTAKRFQLELEIGNLSQGDLSIEQYYSGFLNLWGEYSNIIYSKVPKEALASIQAIHEV 221

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTT 182
           SKRDQFLMKLRSDF+ AR+ L+NR+P P+LD+C  ELLREEQ L TQ  L    EK T  
Sbjct: 222 SKRDQFLMKLRSDFDVARAGLLNRNPVPSLDICLGELLREEQRLATQAVLGASLEKSTVI 281

Query: 183 QMAFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQN 242
            +A+ A G+++GKD  ++QC+SCK +GHIA NC++KFCNYCK+HGHIIK+C  RP+NR+ 
Sbjct: 282 NVAYAAQGRNRGKD--QLQCYSCKEFGHIARNCSKKFCNYCKQHGHIIKECPTRPENRRT 341

Query: 243 TAFQATASTSPTGQSHIGP-PTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSL 302
            AFQAT        + IGP  T++  N+    +VLTPEMVQQMI++AFS L L G G ++
Sbjct: 342 QAFQATI----PNLNVIGPTSTVTGTNQ----SVLTPEMVQQMILTAFSTLTLQGQGMNI 401

Query: 303 SKSWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL------------------- 362
           S SW+VDS ASNHMT S + L NVR Y+G +NIQ+ANG+ L                   
Sbjct: 402 SSSWIVDSRASNHMTGSPDQLHNVRQYNGSQNIQIANGSNLPITAIGDIGSSFSHVFISP 461

Query: 363 --------------------------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLA 404
                                     +V DQ SG VIAKGPKVGRLFPL  S+P N+S +
Sbjct: 462 GLSTNLISVGQMVDNHCDVHFSRDGCIVQDQVSGQVIAKGPKVGRLFPLQFSVPRNLSFS 521

BLAST of Sgr012060 vs. NCBI nr
Match: KAG6536639.1 (hypothetical protein ZIOFF_001697 [Zingiber officinale])

HSP 1 Score: 406.4 bits (1043), Expect = 5.9e-109
Identity = 230/477 (48.22%), Positives = 287/477 (60.17%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GHIDG+  AP +A  L QW+ KDAR                                   
Sbjct: 42  GHIDGSLMAPENAKDLGQWETKDARIISWLLGSIEAHMVNNLHNTARRFQLELEIGNLSQ 101

Query: 63  GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARS 122
           G LSI+ YYSGF NLW E+S+I+ + V KE+L  + AIHE+SK DQFLMKLRSDF+ AR+
Sbjct: 102 GDLSIEQYYSGFLNLWGEYSNIIYSKVPKEALASIQAIHEVSKHDQFLMKLRSDFDVARA 161

Query: 123 NLMNRHPSPTLDVCFSELLREEQHLLTQTTL--EQEKMTTTQMAFLAHGKSKGKDMSKVQ 182
            L+NR+  P+LD+C  ELLREEQ L TQ  L    EK T   +A+ A G+++GKD  ++Q
Sbjct: 162 GLLNRNLVPSLDICLGELLREEQRLATQAVLGASLEKSTVINVAYAAQGRNRGKD--QLQ 221

Query: 183 CFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQATASTSPTGQSHIGP 242
           C+SCK +GHIA NC++KFCNYCK+HGHIIK+C  RP+NR+  AFQAT        + IGP
Sbjct: 222 CYSCKEFGHIAHNCSKKFCNYCKQHGHIIKECPTRPENRRTQAFQATI----PNLNVIGP 281

Query: 243 PTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSANL 302
              S V    QS VLTPEMVQQMI++AFS L L G G ++S SW+VDS ASNH+T S +L
Sbjct: 282 --TSTVTGTHQS-VLTPEMVQQMILTAFSTLTLQGQGMNISSSWIVDSGASNHITGSPDL 341

Query: 303 LQNVRPYHGLENIQVANGNQL--------------------------------------- 362
           L NVR Y+G +NIQ+AN + L                                       
Sbjct: 342 LHNVRQYNGSQNIQIANASNLPITAIGDIGSSFSHVFISPGLSANLISVGQMVDNHCDVH 401

Query: 363 ------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACSVVINQNELWHKRLGHPNS 396
                 +V DQ SG VIAKGPKVGRLFPL  S+P N+S +  V  N+ ++WHKRLGHPN+
Sbjct: 402 FSRDGCIVQDQVSGQVIAKGPKVGRLFPLQFSVPRNLSFSSIVTANKADIWHKRLGHPNN 461

BLAST of Sgr012060 vs. ExPASy Swiss-Prot
Match: Q6J9Q2 (Ethylene-responsive transcription factor ERF086 OS=Arabidopsis thaliana OX=3702 GN=ERF086 PE=1 SV=2)

HSP 1 Score: 85.5 bits (210), Expect = 3.0e-15
Identity = 44/81 (54.32%), Positives = 58/81 (71.60%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALL 790
           DPTTKER WLGTFDTAH  ALAYDRA LSM+GT ARTNF+Y+ +   H++LT  ++ +L+
Sbjct: 72  DPTTKERHWLGTFDTAHEAALAYDRAALSMRGTQARTNFVYTPTDV-HTILTNPNLHSLI 131

Query: 791 PSDSPHSKQHSPLATKTPHFL 812
              SP++   S L   +P F+
Sbjct: 132 V--SPYNNNQSFLPNSSPQFV 149

BLAST of Sgr012060 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 8.3e-13
Identity = 37/71 (52.11%), Positives = 49/71 (69.01%), Query Frame = 0

Query: 486  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLH 545
            V TP+  + K     G  L+DP+ YR +VGSL YL  TRPDIS+AV ++SQFMH P   H
Sbjct: 1221 VTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEH 1280

Query: 546  LAAVRRIIKYL 557
            L A++RI++YL
Sbjct: 1281 LQALKRILRYL 1291

BLAST of Sgr012060 vs. ExPASy Swiss-Prot
Match: Q9M644 (Ethylene-responsive transcription factor LEP OS=Arabidopsis thaliana OX=3702 GN=LEP PE=2 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 5.4e-12
Identity = 41/70 (58.57%), Positives = 48/70 (68.57%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQ 790
           DPTTKER WLGTFDTA   ALAYDRA  SM+GT ARTNF+YSD   SS+  S+++  D  
Sbjct: 37  DPTTKERHWLGTFDTAEEAALAYDRAARSMRGTRARTNFVYSDMPPSSSVTSIVSPDDPP 96

Query: 791 ALLPSDSPHS 798
              P  +P S
Sbjct: 97  PPPPPPAPPS 106

BLAST of Sgr012060 vs. ExPASy Swiss-Prot
Match: Q8H3Q1 (Ethylene-responsive transcription factor FZP OS=Oryza sativa subsp. japonica OX=39947 GN=FZP PE=1 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 3.5e-11
Identity = 33/45 (73.33%), Positives = 37/45 (82.22%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSS 776
           DPTTKER WLGTFDTA   ALAYDRA LSMKG  ARTNF+Y+ ++
Sbjct: 75  DPTTKERHWLGTFDTAQEAALAYDRAALSMKGAQARTNFVYTHAA 119

BLAST of Sgr012060 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 4.5e-11
Identity = 33/71 (46.48%), Postives = 47/71 (66.20%), Query Frame = 0

Query: 486  VDTPLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLH 545
            V TP+  + K     G  L DP+ YR +VGSL YL  TRPD+S+AV ++SQ+MH P   H
Sbjct: 1204 VATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDH 1263

Query: 546  LAAVRRIIKYL 557
              A++R+++YL
Sbjct: 1264 WNALKRVLRYL 1274

BLAST of Sgr012060 vs. ExPASy TrEMBL
Match: A0A5J5AIJ4 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 5.2e-119
Identity = 250/495 (50.51%), Positives = 299/495 (60.40%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GH+DG+ PAPTD  +L QWK+KDAR                                   
Sbjct: 7   GHVDGSDPAPTDPMKLVQWKVKDARVMTWILGSVDPLLILNLKPHKTAKSMWEYLKKVYH 66

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LS+Q+Y+ GFQNLWAEFSDIV A VS ESL+ V A+HE 
Sbjct: 67  QDHSARRFQLETDLAAYSQGTLSVQEYFCGFQNLWAEFSDIVYANVSAESLSAVQAVHEA 126

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           SKRDQFLMKLR +FE+ RSNLM+R PSP+LDVCF  LLREEQ LLTQ++L QE +    +
Sbjct: 127 SKRDQFLMKLRPEFESIRSNLMHRDPSPSLDVCFGALLREEQRLLTQSSLPQENV----V 186

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
           A+ A GK +G+DM  VQC+SCK+YGHIA +C +KFCNYCK+ GHIIK+C  RPQ R   A
Sbjct: 187 AYAAQGKGRGRDMRTVQCYSCKDYGHIAVHCAKKFCNYCKQKGHIIKECPTRPQIRPVNA 246

Query: 243 FQATASTSPTGQSHIGPPTMSAVNENAQST-VLTPEMVQQMIVSAFSALKLHGNGNSLSK 302
           + ATA T  T         +++V+ +  +T  LT EMVQQMIVSAFSAL L G G   S+
Sbjct: 247 YHATA-TGHTSDGVTSTQNLASVSPSTAATPALTLEMVQQMIVSAFSALGLQGKGTIPSQ 306

Query: 303 SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL--------------------- 362
            WLVDSAASNHMTSS  +L NVR Y G  NIQVAN + L                     
Sbjct: 307 PWLVDSAASNHMTSSPTILSNVRKYTGSSNIQVANDHLLPITGVGDIAPSLTNIFVSPGL 366

Query: 363 ------------------------LVLDQDSGTVIAKGPKVGRLFPLHISIPSNVSLACS 396
                                    V D  SG  IAKGPKVGRLFPL+ SIPS +SLAC+
Sbjct: 367 STSLISVGQLVDDNYNVQFSRDGCHVQDPVSGRTIAKGPKVGRLFPLYFSIPSIISLACT 426

BLAST of Sgr012060 vs. ExPASy TrEMBL
Match: A0A5J5AIJ4 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 9.5e-04
Identity = 30/61 (49.18%), Positives = 38/61 (62.30%), Query Frame = 0

Query: 400 IAILPSFDETSSSPERFKPGYVYEQRHSPPPLP----TPDPSPDPAPTLLRRSTRVSRPP 457
           +++LP FD+    PERFK G+VYE+R    PLP     PDP PDP     RRS+R S PP
Sbjct: 640 VSVLPRFDDLICPPERFKLGFVYERRQPTLPLPKSDLPPDPDPDPVLHPPRRSSRASHPP 699


HSP 2 Score: 424.5 bits (1090), Expect = 1.0e-114
Identity = 252/508 (49.61%), Positives = 293/508 (57.68%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GHIDG+ PAPT+  +LA WK+KDAR                                   
Sbjct: 36  GHIDGSDPAPTEPKELANWKVKDARVMSWILGSVDPLIVLNLRPYKTAKTMWEYLLKVYH 95

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LSIQDY+S FQNLW EFSD+V A V   SL+ V A+HE 
Sbjct: 96  QDNTACRFQLEYEIANYTQGNLSIQDYFSSFQNLWGEFSDMVYAKVPAASLSAVQAVHEQ 155

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           SKRDQFLMKLR +FE  RSNLMNR PSP+LDVCF ELLREEQ LLTQ   +Q+      +
Sbjct: 156 SKRDQFLMKLRPEFEITRSNLMNRDPSPSLDVCFGELLREEQRLLTQAMFQQDS-NPNPI 215

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
           A+ A+GK KG+DM KVQCFSCK YGHIAANC +K CNYCKK GH IK+C  +PQNRQ TA
Sbjct: 216 AYAAYGKGKGRDMRKVQCFSCKEYGHIAANCAKKSCNYCKKQGHFIKECPTQPQNRQATA 275

Query: 243 FQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS 302
           +QA  +TS        P   S  +     + LTPEMVQQMI+SAFSAL L GN  +LSKS
Sbjct: 276 YQAAVNTSSV------PKMPSTSSSTDGLSALTPEMVQQMIMSAFSALGLQGNDTTLSKS 335

Query: 303 WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLVLDQDSGTVIAKGPKVGRLFP 362
           WL+DSAASNHMT S++ L N                     DQ SG ++AKGPKVGRLFP
Sbjct: 336 WLIDSAASNHMTRSSDTLCN---------------------DQVSGKILAKGPKVGRLFP 395

Query: 363 LHISIPSNVSLACSVVINQNELWHKRLGHPNSAILSYLLTSGLLG-KNNKFSGLSFDCST 422
           LH SIPS +SLAC  V +QNE+WHKRLGHPNS +LS++L SGLLG K   +  LSFDC  
Sbjct: 396 LHFSIPSCLSLACMTVNSQNEVWHKRLGHPNSVVLSHMLNSGLLGNKEQVYKNLSFDCFV 455

Query: 423 CKLGKSKILPFPLAGSRANKCFDIIHSDSFSDIAILPSFDETSSSPERFKPGYVYEQRHS 456
           CKLGKSK L FP  GSRA                                          
Sbjct: 456 CKLGKSKTLSFPPHGSRAANF--------------------------------------- 470

BLAST of Sgr012060 vs. ExPASy TrEMBL
Match: A0A2N9GB15 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS24542 PE=3 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 3.9e-106
Identity = 329/1015 (32.41%), Positives = 401/1015 (39.51%), Query Frame = 0

Query: 3    GHIDGTTPAPTDATQLAQWKIKD-----------------------------------AR 62
            GHIDG++   +     A W  KD                                    +
Sbjct: 307  GHIDGSSKGGSSEADKAAWAAKDNQIMSAKAMWDYLKQVYHQDNNARRFHLELAIANYTQ 366

Query: 63   GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEISKRDQFLMKLRSDFENARS 122
            G LS+QDYYSGF  LW ++SD+V A VS E L  V  +H  S+RDQFLMKLR +FE+ R+
Sbjct: 367  GDLSVQDYYSGFLTLWNDYSDLVTAKVSAEGLASVQHVHRTSQRDQFLMKLRPEFESIRA 426

Query: 123  NLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKM--TTTQMAFLAHGKSKGKDMSKVQ 182
            +L+NR P PTL+ CF ELLREEQ L TQ  +EQ ++   T  +A+ AHGK KG+DMS  Q
Sbjct: 427  SLVNRDPVPTLEACFGELLREEQRLNTQNLMEQSRIASNTVSVAYAAHGKGKGRDMSTTQ 486

Query: 183  CFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTAFQA--TASTSPTGQSHI 242
            C+SCK YGHIA NC QKFCNYCK+ GHIIK+C IRP +R   A+ A  T  + P   +  
Sbjct: 487  CYSCKKYGHIAPNCPQKFCNYCKQPGHIIKECTIRP-SRSTKAYHAVVTDDSQPAANA-- 546

Query: 243  GPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKSWLVDSAASNHMTSSA 302
                 SAV     +  LT EMVQ+MIVSAFSAL   G G+  S SW++DS ASNHMT+S 
Sbjct: 547  ---VSSAVVLQPPAPSLTREMVQEMIVSAFSALGFQGTGS--SPSWILDSGASNHMTNSL 606

Query: 303  NLLQNVRPYHGLENIQVANGNQLLVLDQD----------------SGTVIAKGPKVGRLF 362
            + L NVR Y G  +IQ AN +   ++DQD                SG VIAKGPK GRLF
Sbjct: 607  HGLSNVREYCGSSHIQTANVSVGQLVDQDYGVNFTHDGCVVQDQMSGQVIAKGPKHGRLF 666

Query: 363  PLHISIPSNVSLACSVVINQN----ELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSF 422
             L I  P N+    S++ N++    E+WHKRLGHPNS ILSYLL SGLL     FS   F
Sbjct: 667  SLQIPAPRNLPPFLSLLCNKSRVSPEVWHKRLGHPNSRILSYLLKSGLLNNKEHFSSAVF 726

Query: 423  -DCSTCKLGKSKILPFPLAGSRANKCFDIIHSD--------------------------- 482
             DC+TCKLGKSKILPFP  GSRA   F+IIHSD                           
Sbjct: 727  SDCATCKLGKSKILPFPSEGSRATHSFEIIHSDVWGISPTISHAQYKYFVTFIDDYSKYT 786

Query: 483  ------------------------------------------------------------ 542
                                                                        
Sbjct: 787  WVYFLRHKSEVFPMFKLFLALVQTQFSATVKILRSDSGGEYMSHEFQSFLHSKGIISQRS 846

Query: 543  ------------------------------------------------------------ 571
                                                                        
Sbjct: 847  CPYTPQQNGVAERKNRHLLDVVRTLLIESSVPPKFWVEALTTATFLINRLPSQALNLESP 906

BLAST of Sgr012060 vs. ExPASy TrEMBL
Match: A0A5C7IEK2 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008121 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 3.4e-102
Identity = 245/615 (39.84%), Positives = 319/615 (51.87%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GH+DG++ APTD  +L+ W+ KDA+                                   
Sbjct: 36  GHVDGSSTAPTDPKELSSWEGKDAKIASWLLSSVEPHMVNNLRGFTTVKQMWDYLRRIYY 95

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LSI+ +YSGF NLW++++ +V + V KE+L  + A+H  
Sbjct: 96  QDNSARKFQLELDIGNYRQGNLSIEQFYSGFLNLWSDYTRLVHSTVPKEALAALQAVHSE 155

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           S+RDQFLMKLR +FE+AR+ L+NR P P+LDVC  ELLREEQ L +Q  + Q+   T  +
Sbjct: 156 SQRDQFLMKLRPEFESARAGLINRTPVPSLDVCLGELLREEQRLASQLGIAQDAGGTEMV 215

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
                   KG+  S  QC+SCK  GHIA +C +KFCNYCKK GHIIKDC +R QNR   A
Sbjct: 216 NMAYAAYDKGRYKSPPQCYSCKEVGHIAKHCRKKFCNYCKKEGHIIKDCRVRLQNRSAPA 275

Query: 243 FQATASTSPTGQSHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSKS 302
           F     +S    S   P T+   + N     +TPE VQQMIVSA  AL L G    LS  
Sbjct: 276 FHTAVQSSFVPASFAQPTTVPGSSSN-----ITPEQVQQMIVSALFALGLQGKQYLLSSP 335

Query: 303 WLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQL---------------------- 362
           WL+DSAASNHMT S+  LQ+VR Y G ++IQ+A+GN L                      
Sbjct: 336 WLIDSAASNHMTGSSTALQDVRKYDGEQHIQIADGNTLPITAVGNLGSSFTNVFVSPALS 395

Query: 363 -----------------------LVLDQDSGTVIAKGPKVGRLFPLH-ISIPSNVSLACS 422
                                   V DQ SG  IAKGPKVGRLFPL   SIP ++S+  S
Sbjct: 396 ANLISVGQLVEENFSLHFDRSGCRVQDQASGLEIAKGPKVGRLFPLQSFSIPCSISVGYS 455

Query: 423 VVINQNELWHKRLGHPNSAILSYLLTSGLLGKNNKFSGLSFDCSTCKLGKSKIL------ 482
            + N +  WHK+LGHPNS IL++L+  G L   N FS LSFDC+ CKL +   L      
Sbjct: 456 AIANNSHFWHKKLGHPNSVILTHLMKHGHLSNTNAFSSLSFDCAPCKLVQCAFLGYSNSH 515

Query: 483 -PFPLAGSRANKC-------------FDIIHSDSFSDIAILPSFDETSSSPERFKPGYVY 484
             F    + ANK              F    +++ S   +LP FD+ SS+P RF+PG VY
Sbjct: 516 KGFVCYDADANKIRISRNVIFFENQYFFPSRTNTVSSSVLLPPFDDVSSTPTRFRPGIVY 575

BLAST of Sgr012060 vs. ExPASy TrEMBL
Match: A5B7U3 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_031906 PE=4 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 7.6e-102
Identity = 211/483 (43.69%), Positives = 278/483 (57.56%), Query Frame = 0

Query: 3   GHIDGTTPAPTDATQLAQWKIKDAR----------------------------------- 62
           GHIDGT+    D   L  W+ KDAR                                   
Sbjct: 10  GHIDGTSSTLKDTKGLGLWEAKDARIVSWLLGSIKPHMVNNLRSFSTAKEMWEYLRRIYN 69

Query: 63  -------------------GSLSIQDYYSGFQNLWAEFSDIVCAAVSKESLTDVLAIHEI 122
                              G+LSI+ YYSGF NLW E+S I+ A V KE L+ +  ++E 
Sbjct: 70  QDNNACCFQLELEIAYFTQGNLSIEQYYSGFLNLWGEYSGIIYAKVPKEVLSALQTVYEE 129

Query: 123 SKRDQFLMKLRSDFENARSNLMNRHPSPTLDVCFSELLREEQHLLTQTTLEQEKMTTTQM 182
           S+ DQFLMKLR+++E  ++ L+ R+P PTLD+C  ELLREEQ L TQ  + QE++ +  +
Sbjct: 130 SRCDQFLMKLRAEYETTQAGLLKRNPVPTLDICLGELLREEQRLATQAEMGQERLHSEIV 189

Query: 183 AFLAHGKSKGKDMSKVQCFSCKNYGHIAANCTQKFCNYCKKHGHIIKDCFIRPQNRQNTA 242
                 + +G++  ++QC+SCK +GHIA +CT+ +CNYC+K GHIIK+C IRPQNRQ  A
Sbjct: 190 NVAYATQGRGREKGQIQCYSCKEFGHIATSCTKPYCNYCRKRGHIIKECPIRPQNRQAQA 249

Query: 243 FQATASTSPTGQ-SHIGPPTMSAVNENAQSTVLTPEMVQQMIVSAFSALKLHGNGNSLSK 302
           FQA    +P  Q S I  PT++ V  +    VLTPEMVQQM +SAFS   L GNG ++S 
Sbjct: 250 FQAAVQATPAAQXSPIVGPTVTPV--STSQAVLTPEMVQQMXISAFSTFGLQGNGKTVSS 309

Query: 303 SWLVDSAASNHMTSSANLLQNVRPYHGLENIQVANGNQLLV------------------- 362
            W VDS ASNHMT  +  L NV+ Y+G + IQ+ NG+ L +                   
Sbjct: 310 PWFVDSGASNHMTGQSESLLNVQSYNGPQYIQIGNGSHLPINAVGDIGPSFQNVFVSPGL 369

Query: 363 ----------LDQDSGTVIAK----GPKVGRLFPLHISIPSNVSLACSVVINQNELWHKR 396
                     +D +     +     GPKVG+LFPL  SIPS +SLACS V NQ+E+WHKR
Sbjct: 370 SANLISVGQLVDNNCNVSFSHGGCLGPKVGQLFPLQFSIPSALSLACSTVSNQSEVWHKR 429

BLAST of Sgr012060 vs. TAIR 10
Match: AT5G18560.1 (Integrase-type DNA-binding superfamily protein )

HSP 1 Score: 85.5 bits (210), Expect = 2.2e-16
Identity = 44/81 (54.32%), Positives = 58/81 (71.60%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSDSSTFHSLLTALDVQALL 790
           DPTTKER WLGTFDTAH  ALAYDRA LSM+GT ARTNF+Y+ +   H++LT  ++ +L+
Sbjct: 72  DPTTKERHWLGTFDTAHEAALAYDRAALSMRGTQARTNFVYTPTDV-HTILTNPNLHSLI 131

Query: 791 PSDSPHSKQHSPLATKTPHFL 812
              SP++   S L   +P F+
Sbjct: 132 V--SPYNNNQSFLPNSSPQFV 149

BLAST of Sgr012060 vs. TAIR 10
Match: AT5G13910.1 (Integrase-type DNA-binding superfamily protein )

HSP 1 Score: 74.7 bits (182), Expect = 3.8e-13
Identity = 41/70 (58.57%), Positives = 48/70 (68.57%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIYSD---SSTFHSLLTALDVQ 790
           DPTTKER WLGTFDTA   ALAYDRA  SM+GT ARTNF+YSD   SS+  S+++  D  
Sbjct: 37  DPTTKERHWLGTFDTAEEAALAYDRAARSMRGTRARTNFVYSDMPPSSSVTSIVSPDDPP 96

Query: 791 ALLPSDSPHS 798
              P  +P S
Sbjct: 97  PPPPPPAPPS 106

BLAST of Sgr012060 vs. TAIR 10
Match: AT1G28160.1 (Integrase-type DNA-binding superfamily protein )

HSP 1 Score: 69.3 bits (168), Expect = 1.6e-11
Identity = 36/65 (55.38%), Positives = 42/65 (64.62%), Query Frame = 0

Query: 712 ENISCIGIFDNTAWEMVAGD---PTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTN 771
           E I  +G+     W   A +   PTTKER WLGTFDTA   ALAYDRA  S++G  ARTN
Sbjct: 35  EEIKYVGV-RRRPWGRYAAEIRNPTTKERYWLGTFDTAEEAALAYDRAARSIRGLTARTN 94

Query: 772 FIYSD 774
           F+YSD
Sbjct: 95  FVYSD 98

BLAST of Sgr012060 vs. TAIR 10
Match: AT1G24590.1 (DORNROSCHEN-like )

HSP 1 Score: 64.3 bits (155), Expect = 5.2e-10
Identity = 36/78 (46.15%), Positives = 47/78 (60.26%), Query Frame = 0

Query: 731 DPTTKERKWLGTFDTAHVTALAYDRATLSMKGTLARTNFIY---SDSSTFHSLLTALDVQ 790
           DP +KER+WLGTFDTA   A AYD A  +M+G  ARTNF+Y   S  S  H + ++  + 
Sbjct: 75  DPLSKERRWLGTFDTAEEAACAYDCAARAMRGLKARTNFVYPMPSLDSYHHRIFSSPPMN 134

Query: 791 ALLPSDSPHSKQHSPLAT 806
             L  D  +S+  SPL T
Sbjct: 135 MFLLRDVLNSQSLSPLTT 152

BLAST of Sgr012060 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 62.8 bits (151), Expect = 1.5e-09
Identity = 30/69 (43.48%), Positives = 44/69 (63.77%), Query Frame = 0

Query: 489 PLEVNVKYHSDEGALLSDPSLYRQLVGSLNYLTITRPDISFAVQQVSQFMHSPRHLHLAA 548
           P++ +V + +  G    D   YR+L+G L YL ITR DISFAV ++SQF  +PR  H  A
Sbjct: 357 PMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQA 416

Query: 549 VRRIIKYLR 558
           V +I+ Y++
Sbjct: 417 VMKILHYIK 425

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAA8529702.1 | 1.1e-118 | 50.51 | hypothetical protein F0562_034198 [Nyssa sinensis]
XP_021654098.1 | 1.5e-115 | 40.99 | uncharacterized protein LOC110645300 [Hevea brasiliensis]
TXG67369.1 | 2.1e-114 | 49.61 | hypothetical protein EZV62_008644 [Acer yangbiense]
KAG6501099.1 | 9.2e-110 | 46.14 | hypothetical protein ZIOFF_040967 [Zingiber officinale]
KAG6536639.1 | 5.9e-109 | 48.22 | hypothetical protein ZIOFF_001697 [Zingiber officinale]

Match Name | E-value | Identity (%) | Description
Q6J9Q2 | 3.0e-15 | 54.32 | Ethylene-responsive transcription factor ERF086 OS=Arabidopsis thaliana OX=3702 ...
Q94HW2 | 8.3e-13 | 52.11 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9M644 | 5.4e-12 | 58.57 | Ethylene-responsive transcription factor LEP OS=Arabidopsis thaliana OX=3702 GN=...
Q8H3Q1 | 3.5e-11 | 73.33 | Ethylene-responsive transcription factor FZP OS=Oryza sativa subsp. japonica OX=...
Q9ZT94 | 4.5e-11 | 46.48 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

Match Name | E-value | Identity (%) | Description
A0A5J5AIJ4 | 5.2e-119 | 50.51 | Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1
A0A5J5AIJ4 | 9.5e-04 | 49.18 | Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_034198 PE=4 SV=1
A0A2N9GB15 | 3.9e-106 | 32.41 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS24542 PE=3 SV=1
A0A5C7IEK2 | 3.4e-102 | 39.84 | Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008121 PE=4 SV=1
A5B7U3 | 7.6e-102 | 43.69 | Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_031906 PE=4 SV=1

Match Name | E-value | Identity (%) | Description
AT5G18560.1 | 2.2e-16 | 54.32 | Integrase-type DNA-binding superfamily protein
AT5G13910.1 | 3.8e-13 | 58.57 | Integrase-type DNA-binding superfamily protein
AT1G28160.1 | 1.6e-11 | 55.38 | Integrase-type DNA-binding superfamily protein
AT1G24590.1 | 5.2e-10 | 46.15 | DORNROSCHEN-like
AT4G23160.1 | 1.5e-09 | 43.48 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 145..161; e-value: 0.0076; score: 24.3
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 163..179; e-value: 0.67; score: 12.3
IPR001878 | Zinc finger, CCHC-type | PFAM | PF00098 | zf-CCHC | coord: 145..161; e-value: 8.9E-4; score: 19.2
IPR001878 | Zinc finger, CCHC-type | PROSITE | PS50158 | ZF_CCHC | coord: 146..161; score: 8.729682
IPR001471 | AP2/ERF domain | SMART | SM00380 | rav1_2 | coord: 722..776; e-value: 3.8E-7; score: 39.8
IPR001471 | AP2/ERF domain | PROSITE | PS51032 | AP2_ERF | coord: 700..770; score: 10.59952
IPR001471 | AP2/ERF domain | CDD | cd00018 | AP2 | coord: 725..769; e-value: 3.1004E-14; score: 65.7559
IPR036955 | AP2/ERF domain superfamily | GENE3D | 3.30.730.10 | AP2/ERF domain | coord: 720..771; e-value: 2.1E-10; score: 42.6
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 305..373; e-value: 6.4E-13; score: 48.4
None | No IPR available | GENE3D | 4.10.60.10 | - | coord: 138..195; e-value: 2.5E-9; score: 38.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 425..441
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 420..447
None | No IPR available | PANTHER | PTHR31677:SF49 | ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF086 | coord: 731..803
None | No IPR available | PANTHER | PTHR31677 | AP2 DOMAIN CLASS TRANSCRIPTION FACTOR | coord: 731..803
IPR036875 | Zinc finger, CCHC-type superfamily | SUPERFAMILY | 57756 | Retrovirus zinc finger-like domains | coord: 137..180
IPR016177 | DNA-binding domain superfamily | SUPERFAMILY | 54171 | DNA-binding domain | coord: 725..771
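
The InterPro coordinates above place the CCHC zinc finger (145..161), the GAG-pre-integrase domain (305..373) and the AP2/ERF domain (722..776) in separate regions of the predicted protein. A minimal Python sketch of ordering these hits along the sequence and testing for overlap follows. The coordinates are copied from the table; the dictionary labels and the function names `parse_coord`, `overlaps` and `domain_order` are hypothetical and chosen only for this illustration.

```python
import re

# Alignment strings copied from the InterPro table; the labels are
# hypothetical short names chosen for this sketch.
HITS = {
    "zf-CCHC (PF00098)": "coord: 145..161",
    "gag_pre-integrs (PF13976)": "coord: 305..373",
    "AP2/ERF (SM00380)": "coord: 722..776",
}


def parse_coord(text):
    """Extract (start, end) from a 'coord: START..END' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(m.group(1)), int(m.group(2))


def overlaps(a, b):
    """True if two inclusive (start, end) intervals share a position."""
    return a[0] <= b[1] and b[0] <= a[1]


def domain_order(hits):
    """Return (label, (start, end)) pairs sorted by start coordinate."""
    spans = {name: parse_coord(text) for name, text in hits.items()}
    return sorted(spans.items(), key=lambda item: item[1][0])


ordered = domain_order(HITS)
```

Under these coordinates the retrotransposon-type domains and the AP2/ERF domain do not overlap, which matches the BLAST results: the Pol polyprotein hits align to query positions 486..557, while the ethylene-responsive transcription factor hits align to positions 731 onward.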

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr012060.1 | Sgr012060.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0003700 | DNA-binding transcription factor activity
molecular_function | GO:0008270 | zinc ion binding
molecular_function | GO:0003676 | nucleic acid binding