Sgr023804 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr023804
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: eukaryotic translation initiation factor 4B3-like
Location: tig00000892: 6840384 .. 6851308 (-)
RNA-Seq Expression: Sgr023804
Synteny: Sgr023804
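
The Location field packs contig, coordinates, and strand into one string. A minimal sketch of how it could be split and the genomic span computed, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on this page); the function name `parse_location` is hypothetical:

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into typed parts.

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00000892: 6840384 .. 6851308 (-)")
span = end - start + 1  # inclusive span in base pairs, under the assumption above
print(contig, strand, span)
```

Under that assumption the feature spans 10,925 bp on the minus strand of tig00000892.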
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCAACGGTGTCACCGTGGGGCAAGCCCGGAGCTTGGGCTCTGGATGCGGAAGAGCACGAAGCGGAGCTCCTCAAGGACCAGCAAGATCAGGCTCGCCAGCAATCGGAGCCCAGTGCTGAGTTTCCGTCCCTTGCTGCCGCGGCCGCCACCAAGCCGAAGAAGAAGAAAGGGCAGCCCATCCCCCTTGGGGAGTTTCAGACCTATGGAGGTTTTAAGCCGTCGTCGGCTCAATCCAGTGAGCCCAAGGGTCTTACCACCGAAGATCTCATGATGCTCCCAACCGGTCCTCGTCAGAGGACCGCCGAGGAGCTCGACCGCAACCGGCTGGGCGGAGGTTTCAAGAATTACGGTCAGAACAGTTTGTATGACCGGGGGAATCGGTATTCTAATGGTGAAGATTCATCGAATTCTCAGTGGGGTTCGTCGTCTAGGGTTAATGATGATTCAAGGAGGAGCAATGATGGATCCAATAGGGAATTCAGAAGGGAATCAGCGCCCTCTAGGGCTGATGAGATTGATGATTGGGGCGCCGGGAAGAAGCCAATGGCGGGAAATGGGTTTGAGAGGAGAGAGAGAGGAGGATTTTTCGATTCGAATTCGTCGAAAGCGGACGAATCTGATAGTTGGGTATCAAATAAGAGCTTTACACCTTCTGAGGGTCGGAGATCAGGTGGTATAGGTGGCGGTTTTGAGAGAGAGCGAAGAGGAGGATTTACCTCGAACGGTGGTGGCGCTGATTCGGATAATTGGGGCAAAAAAATTGAAGGGGGTAGAGGTGGAAGTGGTGAGAGGCCGAGGCTTAATCTGCAGCCTCGTACAATACCTTTGAACGATGGGAATCAGGAGGCTTCTGGTGCGGTAGTAAAGCCAAAGGGTTCCAATCCTTTTGGAGATGCAAGACCAAGGGAAGAGGTATTGGCTGAGAAGGGGCAGGACTGGAAGAAGATTGATGAGCACCTCGAGTCAATGAAAATTAAGGATACGGTGGAGAAGGCCGAGAGATCAAGTGGGGCTTCGTTCGAGAAAAAAGGCTTTGGAGTTCGGAGTGGTCGGTCACCAGATGATCGGATGGGGAGGAGTTGGAGGAAGCCAGAGTCTGTGGAATCTCGTCCCCAAAGGTTTGCGTTATCCTTAAAAGTTTTGCTTCTTTTGATGGCTAAATCGTCGGTAGATTTTGATTATAGTTGTCGTTACCTTGTTTTTGTTTGTGCCTTGTGGTGCTTTATACATATGCTGTTGGTTGCCCTTTGGAGAAATTGTTTTATTTGCTGCTTTCTTCTAGGATTCTTGTAGATTCAGTTCTTGATCTTAGAAGGTATGGTAGTGCAAACTAAATAAGCAACACTTGAGATGGAAATTAACTACATATGTCAGTTATGTTGGTTACCTGAATAAACCAGACAGCAAGTTCTCGTATTTTGTACTTGTTGATGCATGACCTTTTTCTCATCTCAAGTATAATTTGAAAGTGGAGATGCGTGTGACAAATCAAATTCTGTGTAAATGATAGTTTTGGTGGCTTTTTGTTATTGTTTGGATCAGAACGGAAAATAATGTTCATGTCAACTTGCTTTGCAAGGATGTTTTCTCCATTTAGATTTATCTCATCTTGAACTAGAAGGTATTGACCAAATTAGAATTCGGTTGAAACTTGAATGTTCTTTCTTCTAGCAATCAAGATGTCATAGTGGGGTAAAAGAAATTGCTTTGCATTTGCATGGTCTTTACACAGCAGGGAGTTTATTGGGGATTAAAATTTGGCAAATCTGTAGACTGCAGTGCTTTTTGTACTGTGTCCTTCTATGGAAAGATTCAGGAGGGTGAAACCTGCAAATTGTTTTTTCCTTGAACTTATTTTTTTAGCCTTTTTGGTTGATTGATTATTCTCTAGCATGTTGATTCAGTCATTTGATTCCGTGTTGCAGTGCTGAGATGGTTGATGATGGACCTGCTGAGGAAAACTGAAGACGGTATTCTATGAATTTGATACTAAA
GTTGAAATAATTAAATTGCTGATGCGTAGTGGTGAAGTTTGCTGATATTCATTATTGAAAGCTCTTGAATTATTCACACTGTTGTTTTTCCGTGAGTTCTTTGTGAGTTATCCCCCCTCCCCCCCCCAAAAAAAAAAACAAAAACAGGAATGAACTATACGTAAAAATTTTGCTGCAGTGAGTGTTATTGATGTGGACCAAGTACAGGACATTGTGCAATTTTGGTAGGTTTATGGTAGCTTTGCTAATTAGGCCTTTTATGGATCCGTTTGTACCTCGTTGATGTTCTTGAGAAATACCGTTTGACTATACTGTCGTTGTTACTAATATCAGTTTTGTGTTCCGTTGAGAAATTCTAATGAGTTTGATTTTGCTTCTGACCTGAATTATATTGACTAAACAAATTATGATAATTTTTCATTGCCCGGCATTATGCTTGCCACGGTTTTTATTATAATTCTGGGGTTTTTATTATTTGCAGTTTGAATCTTGCATTATCCTCTGAAAGTTTGATTGTTTGCTGAACTCATTAAAGCATCACAAAATTTATTATTTGCTGTCTTGGATGTACAAATGAAGCTATTTTGTTATCGGACAAATAGAATATCTGCCTCTTGAATAATTAAATGGGAAGTAAAACTTTGGAGCAATGGTGAAAATGGGGTCGAATAAAAATATAGGAGAAAGTTATATATACTATAAAAATATCAATCTTCATTTTGATTGAATTTCATGAAAATTTAGCTAATTTAGAGAATTACATAGGAAGGTTCATTATACACTAACTAGAGTAGTTCAACCTTTTACTTTTGATTTATCCATTGATTGTTGTTCATCTTCAATGAATTATTTTACGTAGATGTTGATCAATAGAAAAAATTATATTTGTATATTTGAGATTTAGTGGTTGATGACTTGTTTGGACATTTTGACCGGAGGAAAATATAAATAAATGGTTTGCCTTCGTAACTCACAACTTTCTTAATTCTAATCTTCAATATCCTTCAAGTATATTGGAATGCAAACGTATGTGACGGATGGTTAGTGGCAAAATTAATTGAAAATAGAAGTTTAGAAAATTTTTAACACGTTTATAACCATTATTCAGAGCCCTTTTGCACATGGATGCTTCGTATTTTATAATGTTTTGGTGGGATGTGAATGTATTCATTCACTAAACAACAAATAGGGCGGAAAAGAAAGGGTTGCAAACAATGAAATTCAACATGAATAAAGCATATGATCTAAGCAGAATCATGGGGAATAAAGAGATTTGAAGACAAATGGATCTGTTCGGTGATGAGTAGATCAAACATGAACTAAGGCAAACTAACCTGTCGTCATCTTATTCATTTTTAATACGCACAAAATGTGTATGGAACTTTTTAAAATGGAGGAGAAACAAAAATAGGATCATAAGCAACATAAGAAGATTGCTTAACTATAAAAAATATCCTAGAGAAAACATCTTGATAGATGATCAACTTCTTAAAACTATTTTGATGACTAGCCATAACAGGAATGAAATGGTGAATTGCATAATGAACTAAAGATTCACTAACTAAAATATTTGGTCAATAGATACTTTGGTCTTTCCTCACAAACCTAGGAAAAAAAAAGAGATTTTTGAAGTGTCAGGACAAAATTTAGAAGGTATTCAAGGGCGAAGCTACATCTCAAACTATGCAATGATCTCAACTATGTGTACAAAATGGAGAAGTATCTCTTCATAAAAAGTTAGAAGGTATAGTTTTCAAGGATGACCTTCATATCTTCAACTAGATGATGTTAGCCGAACAACTAACAAGGATGGAGACTTCTAAGAACCCTCAAAGTTTAATAACATGAGGTTTCGAAGGGGGCAATACTAAAATTCTAAAGTTGCATAGAAAATCTTTTAAAATCGAAATAATTTGGAGAAGGGGAGGAAGTTACTAAAACAAGAGTTTGTTGGAGTGGGGAATGAGCTTTCACTAAATATTGGATTACATATTTGGTTT
CTTAAAGATTATAATTGAAAATTAGTTCAGGCCTATTCAACTTGATTGAGAATTGTATTACTAACATTCTTAACGAGGTAGGTTTGTGAGAGTCATAAATTCTCTCCATCTCTAATATGAAGAAAAGATTCATCTGAAACATAAGTGAATTCAAATTGTGAAATTAGTTGTCAGAGTTTTAAAAAATCATAAATATGTTAGACCTACAAGAAAAAAACTTGATTAAATTCAAAACAAATTGTACTAATTTATGTCAAGGCTAGACCGCTAGATTTTGAAAAAAAAGGGTAAAAAGGAAATTTTAATGTTGGTAAAGGACACAAGTGAAATAAAATTAGGTGCTCAAATATTTTGAAAATCGCAATAAAAACAAAAAAAAAAAAGGAAGGGATTTATAAAAATTGTGCATTTATTATTGGTTGTTGAACTCTTCATCAATTGCTCACCGCCACAATCTCCTGAGATTGCAGCAACCGTTGAGCTGCCAATCCTACGGAATTATTTGCAAGCAAGGGCTCCTCCTACTTTCAAGTAAAGGACCTACTAATAGGGATCTATATTTTTATCGATATTTTCATGAATTAAAGGTTTGATATCAATATTAGAAATTAATTACTATTTTTAATACATCACAAAAATACCAATAATGTATAAAAAAAATTCAATAAAATATTACCAGTGTAAATATAAAAATTAACGATAAGATTTATAATTCATTTAAGAAGCGTAAAATACTTAATTATTAATAAATTTTTTTTATAAATTTGTTAAATTTTATGACTTTTTAAAGAATATTTGTCGATTTTGATGTTTTGTCAATCTTTCTAAAGACATCTCTAGAAAATTAAGATCTGCACATTTTTGTTACCATCGTTTTAATATTTAGGTGAGACTTGTAGTTTGTAACTTCCAACGACTATCTTTAAATGGTTTAGCCCTTGTGAGGCACGTGATATAAAGCATGGCTGGGACCCACAACGCTAATTTTTGCAACTTGTTTTTTGAAATTATCCGACTTCTTAAAGTCATATTTGAAGCAATTTTATATCTTTGGAAATAAAAAATATTAGAAACAATATATTATTATTTTTACTCGTGTTTCTTTGTTTTTTTTTTCATTTAATTATTCGTAAGTAATCATTAATTTTTTTTTAATTACAACTTGATTCAAACACATTATATTTTTAAATTAATGATTAATAAAAAATTGAAGACACAAGAACTAAAAACTAGAACAAACATTTGAAACTTTAAAATCCAAAATATAATAAATATGAAAGTTTATATTGGATTAACACTAGAGTATCCTCTTTCCTCTCTCTTTAATTAACATTTTACTTTGAGATTTTGTCTCTTTTTTTGGGTGGAGGATAGAGTTTTAATCTTTTATATTTATTCTTTACACTTTCTCTAACTTGTGAGCTTGAGAATTAACACAAGATCGAACAAATGATTATTATATAAAATTGAGAAGAAACAATAATTGGAAAGATTAGAACTAAAGACTTCTTACTCTAACATTACATTAAACTATCATTGAACTCAAATGTTTAAGGTGATAAATTAGAGTACATTTAATCTTTCATATATATTCTTATCATAAAGTGAAACACATCTTACAAGTTAAAAGTAAGAAAATTAGGAAATATTCTTAAAAAGAAAAAAAAAGAGTGAGAAAAGAAAGAAATTTTAAAAAATATTTATTTTTTTTCGGCGTCCCATTAATTTGGACAGTTGAGAGTTGAATTTCAATCTTCTTTGCCCCTCGGGTTCATTTACTTTCATCTGACTTTTTGTCAGCTTCCACCACGTTGCCGTCACAAAAAAGTGTCCAACACCATCTGCATTCTTCACCATCTCTTAAATTTACTTCCAATTCGTCATCTTCTCTCTCGCAGATCCCAATCCAACTTCCATGGCTCCTCTCTGTTTGCTGCATTTCTCTTCTTCGCCGGAGCCATCGTTGGAGCCCAAGGATACGCCGTCGTCACTGGCACTGT
CTTCTGCGATCAGTGCAAAGATGGCCAAATATCCTTGTTCGACTACCCCATGAACGGTGACCTTCCTTATTTAATGCTTTGTTTGTTTATTGGAAAGGCGAAAAGAAACTGAAGCCCTAATTTTGGTGGGTTTATTTGTTGCAGGAGTTAAAGTGATGATGGCTTGCGGCGACGGCAATGGGGGAGTTACATATTCGAGGGAAGAGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGGACGCCGATCTTAGCAGCTGTTATGCGGCGGTCGGCGGGAGTGGGCAGGGAACGACGACGGACTGTGGCGGAGCCGCCGGCCCTGCGAAGAGTCTCCGGCTGATGTTCAGGATGTTTGACATGGAAATGTATGTGGTTGATTCGTTGCTTTCTCAGCCTGCTCAACCAATGCCGTTCTGTTCGGCGTCTGTTAATCCAGTCCCGGCGCCGGTTACAATAATACCTCCACCACCTCCACTTCCGCCGTTGTCTCCGCCGCCTACTTTCAGGCTGCCGCCACTTCCTCAACTGCCACCGCTTCCGCCGTTGGCTCCGTTGCCCCAGTGCCATTCTTGGAAGGTTCTGCTTGTCAACATGAGTAAGTAAGCTTTTTTTTAGTTTCCTTTGATAAGTGATAAGTGAAGCATTGCTTCTTAAATTATTTGCTAAATAACATTTTTTTTTTCATTTTCATCCCAATTTTTGGAGTTTCAATTTTAGTTTAGAATTTTTACAATCAAAGTCTAAATATATTGTTAAAAAGAGATTTTTAAAATCCTTTTATGATAACGGTTTCAATCTAAATTTTTACTTTTAAGAATTAGGCTTTTTAATTCATATTTCTTTCTTTTTCTTTTTTTACCATTTTAGAATTCTTAGTGAAAGATGGGTAAATAAATTATTAGTGAATAAGCTTAAATTTTAAAAAAGGAAAACCATTCGGAAAAAAAAAATCCTTACTAACTTGTTTCATACCACCCACCACTAAAATTTTGACAAGTGAGTATTTTTTAAAAAAATAGAAAAGTCTCTCTTCTCTTCCACCCTCCCTCCCGCCTCCTCCTCTCATACCCGTAAAAAAAAACTTAGTGGTAGGTAGCTACTGCTTGTTCTTTTCAAACGGACCTTCAATGATTATATTCACGAGGTTTTAATTTGGGTTAATTTTATTTTACAACTATCCATTCTTCTATTTTGAATTTTGAACAAAAATAGATCACATTTAAACTCAAAGACAATGGGCATTGTGGAGATTTAAACGTAAATTCAGTTTACAAAATATTTAAAACTGCCAAAAAAACATATGCCTTAAAAAATATGTAGCATAAGTCGAATATTTATTTATAATGTGTTAATTTTTACAATTTCCATTTACAGGAATTGGACAAACCCAGATTACAGATGCTACTGGAGGGCAGTGAACCCAGACACGAAGGTCGCAGTTGTTTTTGGAATAGTTGCGGCCAACCGATATGGGACCGATTTGACCCTATGGAATGGTCTGCAGGGCCGAGGGGACCCTTACAGGACCCTCTTAAGGGAAGCCATAACGGCCTTCCTTAACTCCTACAATTCCCTTCAGTTCCCCTACCCTACGATTTCTGTGCTTCAGCGAATGAATTGGGCCTTGTTGGGCTCCCCACGGGCCGTCCTCCTCACTGCCCTTCGTTCAAACGGGCCAACTCGGGTTATGGCCACGTCACCTGCAAATTTGATCCTTGCCAGTAAAATAAAATTCATATTTCATATCTACAATTTAATTACTCATAAATATTAATTATTACTACTACGTGTCTTTTTTTTTCATGGTATATTTAATTTACTCACATGTAAGATCTCTGAATGTCTTTTTTTTTTTTTAAGTTCAACAAATATGAGGTGTAAAATCAAACCTTCAACTTCATGTCAGGTAATAAATGTTTTATCTATTGAGATATGTTCAGATTGGTAGATTTCTGAATGATATTTTAGTTTGTAAAATTTGTGAAGTTTA
TTCAAATTTGAATAAATCTTTATAGAGCAATAGATTTGAAAGACCTATTAATATATGAAATCCTAGATGATAATAATTTTCTCTCATTGAATGAATTTGTTTTGTTGGAGCCAAAAAGTTTTTGGATTATGGCATGATCCATATTTTATGCTGAAAAATAGAGTGTATGCTACTTGAAATGTTGTTAATAAATTCATTATTATCTTGTGCTAAAATAAGTTTTTTTTGTATTTGCTTTACACTTTTACATACTAAACAAAGTTTATCTATTGGGTCCATGTCTTTCTAAATTTTGAAATTAATACATTCATACGAGATTTAATCTCATTACCTAATTTGATTTTTTTTTTTTAAGAATATTATATCACAATTGTAGGGGAGTAACAGAAGTAGGAATGCCTTAATCGAGTTACGCTCATTTTGGTCATAATTTACTTATTTGCAACTACTTTTTGTTACTAGGATTATATCAATATTTTGGTATAATTTGTATCATTCATACGTTATTATATTATAGGCTAAATATATCAAATGGTCCCTAATATTTTAGTTATTCTCAATTTGGTCCTTAATCTTTTAAAATTCTCAATCTTACCCCTAACGTATAGAAAAATTCTCAATTAGGTCATTGCATAAAGAAATTGTTAAAATTTGTTAACAAAAAATTGATATATTATCAAATTATTGTTTTATTAATATATTTGAACACATGGGCATGCCACTTGAACATGACATTTGAAAAAAAATGGATTTTGTTGATGAGCGGGTCGGATTTTGTTGATGAGCTGGGTTTCATGTATCAAGTAGCATTCGAGTGACATATTCACGTGTCCAAATATGTTAATAAAACAATATTTGATGATATATTAATTTTTTGTTAACAAATTTTAACAATTTCTTTATGAAAGGACCTAATTGAGAATTTTTCTAAACATTAGTGGTAAATTTGAGAATTTTAAAAGATTAAGGATCAAATTGATAATAACTTAAACATTAGGACTATTGATATATTTAGCCTATATTATATGTCAATGCTCAATACCTTTTGAATCAATTAGTGTAATAATCAAATAATAAGATACCTGAAAAATTAATGAAATATCCAAAAAAAGAAGCATAAGAAAAATTTAATGACCTTCGACTAATAATCAAGGCTATAATTATCACCCAATGCATTTAAAACGTTGTATTGAGATGCTTGGAACGGTTTGGATTGCCAATTGGTTGGTTATGATATAAGCTTGCCCCTTAATATTTCTATAAAATAAGAAATGAACTACAGTTAAAACCAATAATTTTTTTTTAATTCAACAATTGCAGATGAGGATCGAATCATTGAACTTTTAAATGATAATATATTTTTATTCACTGAATTATGCTCAGATTGACATCAAGAAAATTATTGTCGTTGCTATTATCATTATTATTTTATATATATATATATATATATTTTAGAATATCACATCCCAATAATGGGGTGGGAATTCAAATTAAAACTTGAGACCATTTAGATAAAGTAGAGTGTGTCTTTAACTACTTAAATATGTTCATATTGATATTTTTCTAAAATATAGAAATTAGCCCATTAGCCAAACATGATTTATTGAAATTATTTTTCTAAAAAAATAGAAACCTTTGGTAAAAACAGTTACGAAACAATCAAATTATGCTGTAAAATTTAAGAATGTTTTTGCAGAAGATTTTATTTCTTGAGGTTTATAACTCAACAAAGAAATTTCTTTGTCAATTTTTAATGAAACAAGACAACTATTCCAAAATAAATAAACATGCCCGTTCTATCATGTATATTAAATTATAAGTTAAATTGTAATTTTAGTCTATAAACTTTTAATTGTGTCAAATAGTCTCTGAATTTTAAAATATATCAAATAGGTCTCTAACTTTCAATTTTGTGTCTAATAGGTCTTGAACTTTTAATTTTTTTAAATAAATCGTAATTTATTGGACGAATTTCAAAATTAACAACCTAATATACACAA
AATTTAATTTTGTAAATAGTTTTCAACTTTTAATTTGTGTTAAAAAATTTATAAATTTTAAAAAATATCTAATAAGTCAATACCCTGTTAAACATAATCCTAAAAGTGGAGGGACTAAAGTTTAGGGTTTAGGGTTTATTACGAAGTTTTGCTACCATTAGAATCTGGACAGCGAAACGAATTATAAACTTGTCTGTTCTGCACGTAAGTTTAGAAAATTATAAAAGAAGTGGATCGATCGGCACGATGTAGTTTCAATCTTTTTATTGAAACAAAAAAGAAATTCAACGCCATTGAAGCATCTTGAAATAATAAAGTCACAGTGCACTGTGTTTAATAGACAAAAAAAGTGAAGCCAGGGTTAAGCTCTCCGTGGACACAGTAATATCACGATGAAGAAACTGCCTCTACCGGCGTTTATCGTGCTGTTGATCGTCGTCAGCGGCGCATTTAGCAGAATCAAAGGACAGGATTTCACGATTGAAGAAGCAACGATCGAGAAATCCAGAGAGCTTTTGCCGATCACAGGCTCACTTCGAGAACGCTGGTCGATTTCTACTTGAACCAAATCGAAACCCTAAATCCGCTGCTTCGCAGCGTGGTGGAGGTCAATCCGGACGCGAGGGAGGAAGCGGACGAGGCCGACCGGCGGAGAGAGGCGAATGAGAACCGTTCGCCGATCGGAGACCTCGACGGCATTCCGGTGCTGCTGAAGGATACTATTGCGACCAAGGACAAGCTGAACACGACGGCCGGATCGTACGCATTACTGGGATCAGTGGTGCCTCGGGACGCCGGCGTGGTGGAGAAGCTGAGACGAGCAGGGGCGGTGATCTTGGGGAAAGCGAGCCTGACTGAGTGGTATTCGTTTCGGGCTCTGGGCCACGTGCCGAATGGTTGGTGTGCACGTGCTGGTCAAGGTGTG

mRNA sequence

ATGGCGGCAACGGTGTCACCGTGGGGCAAGCCCGGAGCTTGGGCTCTGGATGCGGAAGAGCACGAAGCGGAGCTCCTCAAGGACCAGCAAGATCAGGCTCGCCAGCAATCGGAGCCCAGTGCTGAGTTTCCGTCCCTTGCTGCCGCGGCCGCCACCAAGCCGAAGAAGAAGAAAGGGCAGCCCATCCCCCTTGGGGAGTTTCAGACCTATGGAGGTTTTAAGCCGTCGTCGGCTCAATCCAGTGAGCCCAAGGGTCTTACCACCGAAGATCTCATGATGCTCCCAACCGGTCCTCGTCAGAGGACCGCCGAGGAGCTCGACCGCAACCGGCTGGGCGGAGGTTTCAAGAATTACGGTCAGAACAGTTTGTATGACCGGGGGAATCGGTATTCTAATGGTGAAGATTCATCGAATTCTCAGTGGGGTTCGTCGTCTAGGGTTAATGATGATTCAAGGAGGAGCAATGATGGATCCAATAGGGAATTCAGAAGGGAATCAGCGCCCTCTAGGGCTGATGAGATTGATGATTGGGGCGCCGGGAAGAAGCCAATGGCGGGAAATGGGTTTGAGAGGAGAGAGAGAGGAGGATTTTTCGATTCGAATTCGTCGAAAGCGGACGAATCTGATAGTTGGGTATCAAATAAGAGCTTTACACCTTCTGAGGGTCGGAGATCAGGTGGTATAGGTGGCGGTTTTGAGAGAGAGCGAAGAGGAGGATTTACCTCGAACGGTGGTGGCGCTGATTCGGATAATTGGGGCAAAAAAATTGAAGGGGGTAGAGGTGGAAGTGGTGAGAGGCCGAGGCTTAATCTGCAGCCTCGTACAATACCTTTGAACGATGGGAATCAGGAGGCTTCTGGTGCGGTAGTAAAGCCAAAGGGTTCCAATCCTTTTGGAGATGCAAGACCAAGGGAAGAGGTATTGGCTGAGAAGGGGCAGGACTGGAAGAAGATTGATGAGCACCTCGAGTCAATGAAAATTAAGGATACGGTGGAGAAGGCCGAGAGATCAAGTGGGGCTTCGTTCGAGAAAAAAGGCTTTGGAGTTCGGAGTGGTCGGTCACCAGATGATCGGATGGGGAGGAGTTGGAGGAAGCCAGAGTCTGTGGAATCTCGTCCCCAAAGGATTCTTGTAGATTCAGTTCTTGATCTTAGAAGATCCCAATCCAACTTCCATGGCTCCTCTCTGTTTGCTGCATTTCTCTTCTTCGCCGGAGCCATCGTTGGAGCCCAAGGATACGCCGTCGTCACTGGCACTGTCTTCTGCGATCAGTGCAAAGATGGCCAAATATCCTTGTTCGACTACCCCATGAACGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGGACGCCGATCTTAGCAGCTGTTATGCGGCGGTCGGCGGGAGTGGGCAGGGAACGACGACGGACTGTGGCGGAGCCGCCGGCCCTGCGAAGAGTCTCCGGCTGATGTTCAGGATGTTTGACATGGAAATGTATGTGGTTGATTCGTTGCTTTCTCAGCCTGCTCAACCAATGCCGTTCTGTTCGGCGTCTGTTAATCCAGTCCCGGCGCCGGTTACAATAATACCTCCACCACCTCCACTTCCGCCGTTGTCTCCGCCGCCTACTTTCAGGCTGCCGCCACTTCCTCAACTGCCACCGCTTCCGCCGTTGGCTCCGTTGCCCCAGTGCCATTCTTGGAAGGTTCTGCTTGTCAACATGAGTAAGAATTGGACAAACCCAGATTACAGATGCTACTGGAGGGCAGTGAACCCAGACACGAAGGTCGCAGTTGTTTTTGGAATAGTTGCGGCCAACCGATATGGGACCGATTTGACCCTATGGAATGGTCTGCAGGGCCGAGGGGACCCTTACAGGACCCTCTTAAGGGAAGCCATAACGGCCTTCCTTAACTCCTACAATTCCCTTCAGTTCCCCTACCCTACGATTTCTGTGCTTCAGCGAATGAATTGGGCCTTGTTGGGCTCCCCACGGGCCGTCCTCCTCAC
TGCCCTTCGTTCAAACGGGCCAACTCGGGTTATGGCCACGTCACCTGCAAATTTGATCCTTGCCAAAATCCAGAGAGCTTTTGCCGATCACAGGCTCACTTCGAGAACGCTGGTCGATTTCTACTTGAACCAAATCGAAACCCTAAATCCGCTGCTTCGCAGCGTGGTGGAGGTCAATCCGGACGCGAGGGAGGAAGCGGACGAGGCCGACCGGCGGAGAGAGGCGAATGAGAACCGTTCGCCGATCGGAGACCTCGACGGCATTCCGGTGCTGCTGAAGGATACTATTGCGACCAAGGACAAGCTGAACACGACGGCCGGATCGTACGCATTACTGGGATCAGTGGTGCCTCGGGACGCCGGCGTGGTGGAGAAGCTGAGACGAGCAGGGGCGGTGATCTTGGGGAAAGCGAGCCTGACTGAGTGGTATTCGTTTCGGGCTCTGGGCCACGTGCCGAATGGTTGGTGTGCACGTGCTGGTCAAGGTGTG

Coding sequence (CDS)

ATGGCGGCAACGGTGTCACCGTGGGGCAAGCCCGGAGCTTGGGCTCTGGATGCGGAAGAGCACGAAGCGGAGCTCCTCAAGGACCAGCAAGATCAGGCTCGCCAGCAATCGGAGCCCAGTGCTGAGTTTCCGTCCCTTGCTGCCGCGGCCGCCACCAAGCCGAAGAAGAAGAAAGGGCAGCCCATCCCCCTTGGGGAGTTTCAGACCTATGGAGGTTTTAAGCCGTCGTCGGCTCAATCCAGTGAGCCCAAGGGTCTTACCACCGAAGATCTCATGATGCTCCCAACCGGTCCTCGTCAGAGGACCGCCGAGGAGCTCGACCGCAACCGGCTGGGCGGAGGTTTCAAGAATTACGGTCAGAACAGTTTGTATGACCGGGGGAATCGGTATTCTAATGGTGAAGATTCATCGAATTCTCAGTGGGGTTCGTCGTCTAGGGTTAATGATGATTCAAGGAGGAGCAATGATGGATCCAATAGGGAATTCAGAAGGGAATCAGCGCCCTCTAGGGCTGATGAGATTGATGATTGGGGCGCCGGGAAGAAGCCAATGGCGGGAAATGGGTTTGAGAGGAGAGAGAGAGGAGGATTTTTCGATTCGAATTCGTCGAAAGCGGACGAATCTGATAGTTGGGTATCAAATAAGAGCTTTACACCTTCTGAGGGTCGGAGATCAGGTGGTATAGGTGGCGGTTTTGAGAGAGAGCGAAGAGGAGGATTTACCTCGAACGGTGGTGGCGCTGATTCGGATAATTGGGGCAAAAAAATTGAAGGGGGTAGAGGTGGAAGTGGTGAGAGGCCGAGGCTTAATCTGCAGCCTCGTACAATACCTTTGAACGATGGGAATCAGGAGGCTTCTGGTGCGGTAGTAAAGCCAAAGGGTTCCAATCCTTTTGGAGATGCAAGACCAAGGGAAGAGGTATTGGCTGAGAAGGGGCAGGACTGGAAGAAGATTGATGAGCACCTCGAGTCAATGAAAATTAAGGATACGGTGGAGAAGGCCGAGAGATCAAGTGGGGCTTCGTTCGAGAAAAAAGGCTTTGGAGTTCGGAGTGGTCGGTCACCAGATGATCGGATGGGGAGGAGTTGGAGGAAGCCAGAGTCTGTGGAATCTCGTCCCCAAAGGATTCTTGTAGATTCAGTTCTTGATCTTAGAAGATCCCAATCCAACTTCCATGGCTCCTCTCTGTTTGCTGCATTTCTCTTCTTCGCCGGAGCCATCGTTGGAGCCCAAGGATACGCCGTCGTCACTGGCACTGTCTTCTGCGATCAGTGCAAAGATGGCCAAATATCCTTGTTCGACTACCCCATGAACGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGGACGCCGATCTTAGCAGCTGTTATGCGGCGGTCGGCGGGAGTGGGCAGGGAACGACGACGGACTGTGGCGGAGCCGCCGGCCCTGCGAAGAGTCTCCGGCTGATGTTCAGGATGTTTGACATGGAAATGTATGTGGTTGATTCGTTGCTTTCTCAGCCTGCTCAACCAATGCCGTTCTGTTCGGCGTCTGTTAATCCAGTCCCGGCGCCGGTTACAATAATACCTCCACCACCTCCACTTCCGCCGTTGTCTCCGCCGCCTACTTTCAGGCTGCCGCCACTTCCTCAACTGCCACCGCTTCCGCCGTTGGCTCCGTTGCCCCAGTGCCATTCTTGGAAGGTTCTGCTTGTCAACATGAGTAAGAATTGGACAAACCCAGATTACAGATGCTACTGGAGGGCAGTGAACCCAGACACGAAGGTCGCAGTTGTTTTTGGAATAGTTGCGGCCAACCGATATGGGACCGATTTGACCCTATGGAATGGTCTGCAGGGCCGAGGGGACCCTTACAGGACCCTCTTAAGGGAAGCCATAACGGCCTTCCTTAACTCCTACAATTCCCTTCAGTTCCCCTACCCTACGATTTCTGTGCTTCAGCGAATGAATTGGGCCTTGTTGGGCTCCCCACGGGCCGTCCTCCTCAC
TGCCCTTCGTTCAAACGGGCCAACTCGGGTTATGGCCACGTCACCTGCAAATTTGATCCTTGCCAAAATCCAGAGAGCTTTTGCCGATCACAGGCTCACTTCGAGAACGCTGGTCGATTTCTACTTGAACCAAATCGAAACCCTAAATCCGCTGCTTCGCAGCGTGGTGGAGGTCAATCCGGACGCGAGGGAGGAAGCGGACGAGGCCGACCGGCGGAGAGAGGCGAATGAGAACCGTTCGCCGATCGGAGACCTCGACGGCATTCCGGTGCTGCTGAAGGATACTATTGCGACCAAGGACAAGCTGAACACGACGGCCGGATCGTACGCATTACTGGGATCAGTGGTGCCTCGGGACGCCGGCGTGGTGGAGAAGCTGAGACGAGCAGGGGCGGTGATCTTGGGGAAAGCGAGCCTGACTGAGTGGTATTCGTTTCGGGCTCTGGGCCACGTGCCGAATGGTTGGTGTGCACGTGCTGGTCAAGGTGTG

Protein sequence

MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAGKKPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERERRGGFTSNGGGADSDNWGKKIEGGRGGSGERPRLNLQPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQRILVDSVLDLRRSQSNFHGSSLFAAFLFFAGAIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDEIRRDADLSSCYAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDSLLSQPAQPMPFCSASVNPVPAPVTIIPPPPPLPPLSPPPTFRLPPLPQLPPLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPTISVLQRMNWALLGSPRAVLLTALRSNGPTRVMATSPANLILAKIQRAFADHRLTSRTLVDFYLNQIETLNPLLRSVVEVNPDAREEADEADRRREANENRSPIGDLDGIPVLLKDTIATKDKLNTTAGSYALLGSVVPRDAGVVEKLRRAGAVILGKASLTEWYSFRALGHVPNGWCARAGQGV
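
The protein sequence is the conceptual translation of the CDS. As a sanity check, the following sketch translates the first 30 nucleotides of the CDS using the standard genetic code; the helper `translate` is hypothetical and written here for illustration only, not taken from the database's own tooling:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCGGCAACGGTGTCACCGTGGGGCAAG"  # first 30 nt of the CDS above
print(translate(cds_start))
```

The result matches the first ten residues of the protein sequence above (MAATVSPWGK).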
Homology
BLAST of Sgr023804 vs. NCBI nr
Match: KAF4382046.1 (hypothetical protein G4B88_006678 [Cannabis sativa])

HSP 1 Score: 789.6 bits (2038), Expect = 2.5e-224
Identity = 523/1050 (49.81%), Positives = 620/1050 (59.05%), Query Frame = 0

Query: 1    MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
            MAATVSPW KPGAWALD+EE +AELLK+Q+ +A    EP  +FPSL+AAA  KPKKKKGQ
Sbjct: 1    MAATVSPWSKPGAWALDSEEQDAELLKEQEKKAAM--EPLHDFPSLSAAATAKPKKKKGQ 60

Query: 61   PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
             + L EF TYGG KP  AQ + P GLT ED M LP GPR+RTAEE++R R   GF++   
Sbjct: 61   TLSLAEFTTYGGPKP-VAQPTAPAGLTHEDRMALPKGPRERTAEEIERAR-HSGFRS--- 120

Query: 121  NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAG 180
               +DRG+R  NG+DSSNS+W SS        R+++G  ++  R+S PSRADE D+W + 
Sbjct: 121  ---FDRGDR--NGDDSSNSRWKSSD-------RNSNGFGKDAPRDSGPSRADEADNWAST 180

Query: 181  KKPM-AGNGF---ERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERER 240
            KK    GN F   ERRER GFF  + SKAD+SDSWV+NKSF PSEGRR G  GGGFERER
Sbjct: 181  KKSFGGGNDFDRGERRERRGFFPDSQSKADDSDSWVTNKSFVPSEGRRFGSSGGGFERER 240

Query: 241  RGGFTSNGGGADSDNWGKK------IEGGRGG--SGERPRLNLQPRTIP-LNDGNQEAS- 300
            + GFTSNGGGAD D WGK+      + G  GG   G RPRLNLQPRT+P ++DG    S 
Sbjct: 241  KVGFTSNGGGADGDFWGKRREESTGVTGNEGGIVGGGRPRLNLQPRTLPVVSDGGSPGSV 300

Query: 301  -----GAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERSSGAS 360
                   V KPKGSNPFG+ARPREEVLAEKGQDWKKIDE LES+KIK+  EK E   G+S
Sbjct: 301  TGTVAAIVTKPKGSNPFGEARPREEVLAEKGQDWKKIDEQLESLKIKEVNEKPE--GGSS 360

Query: 361  FEKKGFGVRSGRSPDDRMGRSWRKPESVE-SRPQRILVDSVLDLRRSQSNFHGSS----- 420
            F K+ FG+ +GRS DDR+ +SWRKP++ E +  QR    S   LRR++   H        
Sbjct: 361  FGKRSFGIGNGRS-DDRIEKSWRKPDTEEDTGSQRGFGLSDFVLRRARRMAHQLKIIEGI 420

Query: 421  --------------------------LFAAFL---------FFAGAIVGAQGYAVVTGTV 480
                                      LF+ +L         F   AI G QG A+VTGTV
Sbjct: 421  RQLPEKKRKKIEVIRSVDSLRLLFICLFSIYLAVTYNIELVFDNFAIDGIQGDAMVTGTV 480

Query: 481  FCDQCKDGQISLFDYPMNDYKFVWKLYDE-------------------IRRDA--DLSSC 540
            FCDQCKDGQ S FDYP++  K      D                    ++ D   DLS C
Sbjct: 481  FCDQCKDGQRSWFDYPISGVKVKVTCTDSNGQITMSREETTNWFGNYAVKFDGAPDLSGC 540

Query: 541  YAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDSLLSQPAQPMPFCSASVNPV 600
            ++ V  SGQG+ + C   AGPA+ LRLMFR FDMEMY  DSLLSQPAQPMP C  S NPV
Sbjct: 541  FSQVSDSGQGSESGCSAPAGPAQKLRLMFRFFDMEMYATDSLLSQPAQPMPNCPKSTNPV 600

Query: 601  PAPVTIIPPPP---PLPPLSPP--------------------------PTFRLPPLPQLP 660
            PAPVT   PPP   P+PP  PP                          P FRLPP+P LP
Sbjct: 601  PAPVTPAQPPPVAVPVPPAQPPPVATPAPPAQPQPVVSPVTPAIPPPSPAFRLPPMPPLP 660

Query: 661  PLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDLT 720
            PLPP AP  +  +        SK+WT P+YRCYWR VNP+TKVAVVFG +AA+RYGTDLT
Sbjct: 661  PLPP-APFLEASACP------SKSWTMPEYRCYWRMVNPETKVAVVFGSLAASRYGTDLT 720

Query: 721  LWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPTISVLQRMNWALLGSPRAVLLTALR 780
            +   LQGRGDPYRTLLRE +T+FLNSYN+L+FPY TI V+Q M+WAL+GS R+VLLT LR
Sbjct: 721  MMQALQGRGDPYRTLLREGVTSFLNSYNTLRFPYNTIGVVQHMDWALMGSTRSVLLTGLR 780

Query: 781  ----------------------SNGPTRVMATSPAN------------------------ 831
                                  SNG     A+S +N                        
Sbjct: 781  FMRANSGYGTDYCTAQNRSIFGSNGNQMTAASSSSNHPKAATIQDTLLPSLGEIQAKCGC 840

BLAST of Sgr023804 vs. NCBI nr
Match: KAF4356667.1 (hypothetical protein G4B88_009644 [Cannabis sativa])

HSP 1 Score: 784.6 bits (2025), Expect = 8.2e-223
Identity = 516/1032 (50.00%), Positives = 609/1032 (59.01%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
           MAATVSPW KPGAWALD+EE +AELLK+Q+ +A    EP  +FPSL+AAA  KPKKKKGQ
Sbjct: 1   MAATVSPWSKPGAWALDSEEQDAELLKEQEKKAAM--EPLHDFPSLSAAATAKPKKKKGQ 60

Query: 61  PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
            + L EF TYGG KP  AQ + P GLT ED M LP GPR+RTAEE++R R   GF++   
Sbjct: 61  TLSLAEFTTYGGPKP-VAQPTAPAGLTHEDRMALPKGPRERTAEEIERAR-HSGFRS--- 120

Query: 121 NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAG 180
              +DRG+R  NG+DSSNS+W SS        R+++G  ++  R+S PSRADE D+W + 
Sbjct: 121 ---FDRGDR--NGDDSSNSRWKSSD-------RNSNGFGKDAPRDSGPSRADEADNWAST 180

Query: 181 KKPM-AGNGF---ERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERER 240
           KK    GN F   ERRER GFF  + SKAD+SDSWV+NKSF PSEGRR G  GGGFERER
Sbjct: 181 KKSFGGGNDFDRGERRERRGFFPDSQSKADDSDSWVTNKSFVPSEGRRFGSSGGGFERER 240

Query: 241 RGGFTSNGGGADSDNWGKK------IEGGRGG--SGERPRLNLQPRTIP-LNDGNQEAS- 300
           + GFTSNGGGAD D WGK+      + G  GG   G RPRLNLQPRT+P ++DG    S 
Sbjct: 241 KVGFTSNGGGADGDFWGKRREESSGVTGNEGGIVGGGRPRLNLQPRTLPVVSDGGSPGSV 300

Query: 301 -----GAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERSSGAS 360
                  V KPKGSNPFG+ARPREEVLAEKGQDWKKIDE LES+KIK+  EK E   G+S
Sbjct: 301 TGTGAAIVTKPKGSNPFGEARPREEVLAEKGQDWKKIDEQLESLKIKEVNEKPE--GGSS 360

Query: 361 FEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQRILVDSVLDLRRSQSNFHGSSLFAAFL 420
           F K+ FG+ +GRS DDR+ +SWRKP++ E                SQ  F G S F    
Sbjct: 361 FGKRSFGIGNGRS-DDRIEKSWRKPDTEED-------------TGSQRGF-GLSDF---- 420

Query: 421 FFAGAIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDE------------ 480
               AI G QG A+VTGTVFCDQCKDGQ S FDYP++  K      D             
Sbjct: 421 ----AIDGIQGDAMVTGTVFCDQCKDGQRSWFDYPISGVKVKVTCTDSNGQITMSREETT 480

Query: 481 -------IRRDA--DLSSCYAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDS 540
                  ++ D   DLS C++ V  SGQG+ + C   AGPA+ LRLMFR FDMEMY  DS
Sbjct: 481 NWFGNYAVKFDGAPDLSGCFSQVSDSGQGSESGCSAPAGPAQKLRLMFRFFDMEMYATDS 540

Query: 541 LLSQPAQPMPFCSASVNPVPAPVTIIPPPP---PLPPLSPP------------------- 600
           LLSQPAQPMP C  S NP+PAPVT   PPP   P+PP  PP                   
Sbjct: 541 LLSQPAQPMPNCPKSTNPIPAPVTPAQPPPVAVPVPPAQPPPVATPAPPAQPQPVVSPVT 600

Query: 601 -------PTFRLPPLPQLPPLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRAVNPDT 660
                  P FRLPP+P LPPLPP AP  +  +        SK+WT P+YRCYWR VNP+T
Sbjct: 601 PAIPPPSPAFRLPPMPPLPPLPP-APFLEASACP------SKSWTMPEYRCYWRMVNPET 660

Query: 661 KVAVVFGIVAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPTISVLQ 720
           KVAVVFG +AA+RYGTDLT+   LQGRGDPYRTLLRE +T+FLNSYN+L+FPY TI V+Q
Sbjct: 661 KVAVVFGSLAASRYGTDLTMMQALQGRGDPYRTLLREGVTSFLNSYNTLRFPYNTIGVVQ 720

Query: 721 RMNWALLGSPRAVLLTALRS-----------------------NGPTRVMATSPAN---- 780
            M+WAL+GS R+VLLT LR                        NG     A+S +N    
Sbjct: 721 HMDWALMGSTRSVLLTGLRFMRANSGYGTVNCSFTPCKSIFGWNGNQMTAASSSSNHPMP 780

Query: 781 ------------------------------------------------------------ 831
                                                                       
Sbjct: 781 ATIQDTLLPSPGRFKLSVDASVREQSSRIGFGAVVQNCREEVVAGCYSSIPSGLPLISLK 840

BLAST of Sgr023804 vs. NCBI nr
Match: KAA8536008.1 (hypothetical protein F0562_028486 [Nyssa sinensis])

HSP 1 Score: 751.9 bits (1940), Expect = 5.9e-213
Identity = 480/912 (52.63%), Positives = 573/912 (62.83%), Query Frame = 0

Query: 14  WALDAEEHEAELLKDQQDQA-----------RQQSEPSAEFPSLAAAAATKPKKKKGQPI 73
           WALD+EEHE ELL+ Q++ A           R++    A+FPSL+AAAATK KKKKGQ +
Sbjct: 33  WALDSEEHETELLQQQKEDADSDFNGSDTSGRREPAALADFPSLSAAAATKTKKKKGQTL 92

Query: 74  PLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNS 133
            L EF T G  KP  AQ+S+ KGLT EDLMMLPTGPR+RTAEELDR+RLGGGF++ G   
Sbjct: 93  SLAEFTTLGTTKP--AQTSQTKGLTAEDLMMLPTGPRERTAEELDRSRLGGGFRSNG--- 152

Query: 134 LYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAGKK 193
            YDR NRYSN ++SSN +WG SSR +D+ RR   G  R+  RE APSRADEIDDW A KK
Sbjct: 153 -YDR-NRYSNSDESSNPRWG-SSRGSDEPRRQG-GLGRDSTRELAPSRADEIDDWAAAKK 212

Query: 194 PMAGNGFERR---ERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFE--RERR 253
              GNGFERR   ERGGFFDS  S+ADESDSWVSNKSF PSEGRR G  GG F+  RERR
Sbjct: 213 STVGNGFERRERGERGGFFDS-QSRADESDSWVSNKSFIPSEGRRFGSNGGAFDSLRERR 272

Query: 254 GGFTSNGGGADSDNWGKKIEGGR------------------------------------- 313
           GGF SNG G DSDNWG+K EGGR                                     
Sbjct: 273 GGFESNGSGTDSDNWGRKEEGGRKFGAAGGAFDSYRERRGGFESAMNGGADSDNWGKKRE 332

Query: 314 -----GGSGERPRLNLQPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDW 373
                G    RPRLNLQPRT+P+ +  Q  S A+ KPKGSNPFG+ARPREEVL EKGQDW
Sbjct: 333 EGSGAGAGTGRPRLNLQPRTVPVVNELQNGSAALAKPKGSNPFGEARPREEVLKEKGQDW 392

Query: 374 KKIDEHLESMKIKDTVEKAE-RSSGASFEKKGFGVRSGRSP----DDRMGRSWRKPESVE 433
           K+I+E LES+KIK+        + G SF K+ FG  +GR+          RSWRK +SV 
Sbjct: 393 KEIEEKLESVKIKEMGSAGTVTTDGPSFGKRSFGSGNGRASLSEGRPEAERSWRKTDSV- 452

Query: 434 SRPQRILVDSVLDLRRSQSNFHGSSL---FAAFLFFAGAIVGAQGYAVVTGTVFCDQCKD 493
                       D+  +     G+ L     AFL  A AI GAQG A+VTGTVFCDQCKD
Sbjct: 453 ------------DVGTTSGEKIGNGLVEEVEAFLLVAAAIHGAQGEAMVTGTVFCDQCKD 512

Query: 494 GQISLFDYPMNDYKFVWKLYDE-------------------IRRDA--DLSSCYAAVGGS 553
           G+ISLFDYP++  K      D                    +R D   DLS CYA V GS
Sbjct: 513 GRISLFDYPLHGIKVAMACADSNGKITMWREETTNWFGNYAMRFDGTPDLSGCYAQVSGS 572

Query: 554 GQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDSLLSQPAQPMPFCSASVNPVPAPVTII 613
           GQG +  CG  AGPAK  RLMFR+FDME+Y VD LLS PAQPM +C     PVPAPVT  
Sbjct: 573 GQG-SMGCGAVAGPAKYPRLMFRLFDMEIYNVDPLLSGPAQPMSYCQRPATPVPAPVT-- 632

Query: 614 PPPPPLPPLSPPPTFRLPPLPQLPPLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRA 673
             P   PP    P  RLPP+P LPP+PP+ P  Q  +         + W  P+Y+CYW+A
Sbjct: 633 --PVKSPPSRNRPAPRLPPMPPLPPMPPV-PFLQASACPY------QKWMMPEYKCYWKA 692

Query: 674 VNPDTKVAVVFGIVAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPT 733
            +PD KVAVVFG++AA RYGTD+TLW  +QGRG+PY+TLLRE  TA LNSYNS+QFP P+
Sbjct: 693 ASPDMKVAVVFGLIAARRYGTDITLWEAMQGRGEPYKTLLREGTTALLNSYNSIQFPLPS 752

Query: 734 ISVLQRMNWALLGSPRAVLLTALRSNGPTRVMATS-----PANLILAKIQ---RAFADHR 793
                     + G      L +   +  T + A+S     P + I +       +  D+ 
Sbjct: 753 PRCCAAHELGIDGFHSPSRLHSFEGSSLTPLTASSSLSRKPQSRISSAPSPKTNSRPDNS 812

Query: 794 LTSRTLVDFYLNQIETLNPLLRSVVEVNPDAREEADEADRRREANENRSPIGDLDGIPVL 831
           LTS  +      + + LNP+LR+VVEVNPDA+++AD+ADR R  NE    +GD+ GIPVL
Sbjct: 813 LTSTCI------KPKFLNPVLRAVVEVNPDAQDQADQADRERGRNEGCGSLGDMHGIPVL 872

BLAST of Sgr023804 vs. NCBI nr
Match: XP_022146306.1 (eukaryotic translation initiation factor 4B3-like [Momordica charantia] >XP_022146307.1 eukaryotic translation initiation factor 4B3-like [Momordica charantia])

HSP 1 Score: 642.5 bits (1656), Expect = 5.0e-180
Identity = 344/398 (86.43%), Positives = 358/398 (89.95%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
           MAATVSPWGKPGAWALDAEEHEAELLKDQ+DQA QQSEPSAEFPSLAAAAATKPKKKKGQ
Sbjct: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQKDQALQQSEPSAEFPSLAAAAATKPKKKKGQ 60

Query: 61  PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
            IPL EFQTYGG KP S QSSEPKGLTTEDLMMLPTGPRQRTAEE+DRNRLGGGFKNYGQ
Sbjct: 61  SIPLSEFQTYGGPKP-SPQSSEPKGLTTEDLMMLPTGPRQRTAEEMDRNRLGGGFKNYGQ 120

Query: 121 NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESA-PSRADEIDDWGA 180
           NSLYDRGNRYSNGEDSSNS+WG SSRV D+SRRSNDGSNRE RRE A PSRADEIDDWGA
Sbjct: 121 NSLYDRGNRYSNGEDSSNSRWG-SSRVFDESRRSNDGSNRELRREPAPPSRADEIDDWGA 180

Query: 181 GKKPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERERRGG 240
           GKKPM GNGFERRERGGFFDSNSSKAD+SDSWVSNKSF PSEGRRSGG+GGGFERERRGG
Sbjct: 181 GKKPMVGNGFERRERGGFFDSNSSKADDSDSWVSNKSFVPSEGRRSGGMGGGFERERRGG 240

Query: 241 FTS-----------------------NGGGADSDNWGKKIEGGRGGSGERPRLNLQPRTI 300
           FTS                       NGGGADSDNWGK+ EGGRGGSGERPRLNLQPRTI
Sbjct: 241 FTSNGGGADSDSWGKRSEGGRGGSGENGGGADSDNWGKRSEGGRGGSGERPRLNLQPRTI 300

Query: 301 PLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAER 360
           PLNDGNQEASGA VKPKGSNPFG+ARPREEVLAEKGQDWKKIDE LESMK+KDTVE+AER
Sbjct: 301 PLNDGNQEASGAAVKPKGSNPFGNARPREEVLAEKGQDWKKIDEQLESMKVKDTVERAER 360

Query: 361 SSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQ 375
            +GASFE+KGFGVRSGRSPDDRMGRSWRKP+SV+SRPQ
Sbjct: 361 PTGASFERKGFGVRSGRSPDDRMGRSWRKPDSVDSRPQ 396

BLAST of Sgr023804 vs. NCBI nr
Match: KAB2600730.1 (eukaryotic translation initiation factor 4B-like [Pyrus ussuriensis x Pyrus communis])

HSP 1 Score: 631.3 bits (1627), Expect = 1.2e-176
Identity = 412/781 (52.75%), Positives = 479/781 (61.33%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKD-QQDQARQQSEPSAEFPSLAAAAATKPKKKKG 60
           MAATVSPW KPGAWAL  EE EAEL +  +++Q R    PSA+FPSL+AAAATKPKKKK 
Sbjct: 1   MAATVSPWAKPGAWALATEEQEAELEQQAKEEQQRAVEPPSADFPSLSAAAATKPKKKK- 60

Query: 61  QPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYG 120
           Q I L EF  +G  KP  A+   P GLT ED M+LPTGPR+RTAEELDRNRLGGGF++YG
Sbjct: 61  QTISLAEFNNFGAPKPKPAE--HPVGLTHEDRMLLPTGPRERTAEELDRNRLGGGFRSYG 120

Query: 121 QNSLYDRGN-RYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWG 180
                DRGN RYSNG++SS S+WGS        +R   G  RE  R+  PSRADE+DDWG
Sbjct: 121 G----DRGNSRYSNGDESSESKWGS-------GQRKEGGFGRESNRDE-PSRADEVDDWG 180

Query: 181 AGKKPMAGNGFERRERGG----FFDSNSSKADESDSWVSNKSFTPSEGRRSGG------- 240
           A KK M GNGFERRERGG    FF  + SKADES+SWVSNKS   SEGRR GG       
Sbjct: 181 AKKKSMPGNGFERRERGGPGGSFFGGSQSKADESNSWVSNKSSMQSEGRRFGGGGSGFDR 240

Query: 241 -------------------------IGGGFERERRGGFTSNGGGADSDNWGKKIEGGRGG 300
                                     G GF+RER+ GF SNGGGADSD WGKK E   GG
Sbjct: 241 DRKVGFPSNGGADSDNWGRKKEETNGGSGFDRERKVGFVSNGGGADSDVWGKKREESNGG 300

Query: 301 SGE---RPRLNLQPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKID 360
             E   RPRLNLQPR++P+++     S    KPKGSNPFG ARPREEVLAEKG+DWK+ID
Sbjct: 301 LSESTARPRLNLQPRSLPVSNETSPGSTTPPKPKGSNPFGAARPREEVLAEKGKDWKEID 360

Query: 361 EHLESMKIKDTVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQRILVD 420
           E LES KIK+  E A   S      + FG+ +GR+  DR  R+WRKP+S +SRPQ     
Sbjct: 361 EQLESAKIKEVKEIANSESFG----RSFGMGNGRA-GDRTERAWRKPDSADSRPQSADKS 420

Query: 421 SVLDLRRSQSNFH--------------GSSLF------AAFLFFAG-------------- 480
                     N H               SSL         + +F G              
Sbjct: 421 ETRSNSEEPQNEHVEPYLLQAWIGWLCKSSLSLSWSHECVWCYFYGQRIPPNQQRLEIHI 480

Query: 481 ------------AIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDE---- 540
                       AI  AQ   +V+GTVFCDQCKDG+ SLFDYP+   K      D     
Sbjct: 481 IHLAANPPMSNNAIDAAQANTMVSGTVFCDQCKDGERSLFDYPIYGVKVQVACSDSNGQI 540

Query: 541 -IRRD----------------ADLSSCYAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFD 600
            + R+                 DLS CYA +  S   T + C  +AGPA+SLRL+FRMFD
Sbjct: 541 TMSREETTNWFGNYAIGFDGTPDLSGCYAQISSS---TGSGCVVSAGPAQSLRLVFRMFD 600

Query: 601 MEMYVVDSLLSQPAQPMPFCSASVNPVPAPVTIIPP-PPPLPPLSPPPTFRLPPLPQLPP 660
           + MYVVDSLL+QPA+PM FC  S NPVPAPVT + P P P+ P SPPP FRLPP+P+LPP
Sbjct: 601 VGMYVVDSLLTQPAEPMSFCPKSSNPVPAPVTPVNPLPKPVTPASPPP-FRLPPMPRLPP 660

Query: 661 LPPLAPLPQCHSWKVLLVNM--SKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDL 671
           LPPL  LP       L       + WT P+++CYWRAVNP+TKVAV+FG VAA RYGTDL
Sbjct: 661 LPPLPSLPPMPPMPFLEATACPHQQWTLPEHKCYWRAVNPNTKVAVIFGPVAAGRYGTDL 720

BLAST of Sgr023804 vs. ExPASy Swiss-Prot
Match: Q9SZP8 (Eukaryotic translation initiation factor 4B3 OS=Arabidopsis thaliana OX=3702 GN=EIF4B3 PE=1 SV=1)

HSP 1 Score: 315.5 bits (807), Expect = 1.8e-84
Identity = 216/396 (54.55%), Positives = 259/396 (65.40%), Query Frame = 0

Query: 2   AATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQS--EPSAEFPSLAAAAATKPKKKKG 61
           AA  S W KPGAWAL+AEEHEAE LK Q     Q+S  E S++FPSLAAAA TK KKKKG
Sbjct: 3   AAVSSVWAKPGAWALEAEEHEAE-LKQQPSPTNQKSSAEDSSDFPSLAAAATTKTKKKKG 62

Query: 62  QPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYG 121
           Q I L EF TYG  K   A  +E   LT  +L+ LPTGPR+R+AEELDR++LGGGF++YG
Sbjct: 63  QTISLAEFATYGTAKAKPAPQTE--RLTQAELVALPTGPRERSAEELDRSKLGGGFRSYG 122

Query: 122 QNSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSN--REFRRESAPSRADEIDDW 181
                  G RY  G++SS+S+WG SSRV++D  R   G N  RE  R+S PSRADE D+W
Sbjct: 123 -------GGRY--GDESSSSRWG-SSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNW 182

Query: 182 GAGKKPMAGNGFERRER---GGFFDSNS-SKADESDSWVSNKSFTPSEGRR--SGGIGGG 241
            A KKP++GNGFERRER   GGFF+S S SKADE DSWVS K   PSE RR  S   GGG
Sbjct: 183 AAAKKPISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRFVSSNGGGG 242

Query: 242 FERERRGGFTS----------NGGGADSDNWGKKIE--GGRGGS-----GERPRLNLQPR 301
              E+RG F S           GGG++SD WG++ E  G   GS     G RPRL LQPR
Sbjct: 243 DRFEKRGSFESLSRNRDSQYGGGGGSESDTWGRRREESGAANGSPPPSGGSRPRLVLQPR 302

Query: 302 TIPLN-----DGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKD 361
           T+P+               V KPKG+NPFG+ARPREEVLAEKGQDWK+IDE LE+ K+KD
Sbjct: 303 TLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQDWKEIDEKLEAEKLKD 362

Query: 362 TVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRK 366
                E+ +  S  K GFG+ +GR  ++R+ RSWRK
Sbjct: 363 IAAAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRK 382
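
Each HSP header on this page reports identities and positives as a count over the alignment length, followed by a percentage. The following is a minimal sketch, assuming the two-line header layout shown above, of how those fields could be parsed and the percentages recomputed from the counts. The helper name `parse_hsp_header` and the regular expressions are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Assumed header layout, copied from the HSP header lines on this page.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Posi?tives" accepts both the usual spelling and the "Postives" variant.
MATCH_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(score_line, identity_line):
    """Return the numeric fields of one HSP header as a dict (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    m = MATCH_RE.search(identity_line)
    if s is None or m is None:
        raise ValueError("unrecognised HSP header")
    identities, align_len = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identities": identities,
        "positives": positives,
        "align_len": align_len,
        # Recomputed from the counts rather than taken from the text.
        "pct_identity": round(100 * identities / align_len, 2),
        "pct_positive": round(100 * positives / align_len, 2),
    }


# Header of the Q9SZP8 hit shown above.
hsp = parse_hsp_header(
    "HSP 1 Score: 315.5 bits (807), Expect = 1.8e-84",
    "Identity = 216/396 (54.55%), Positives = 259/396 (65.40%), Query Frame = 0",
)
print(hsp["pct_identity"], hsp["pct_positive"])
```

Recomputing the percentages from the counts gives a quick consistency check on a header before its values are copied into a summary table.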

BLAST of Sgr023804 vs. ExPASy Swiss-Prot
Match: A0A1P8B760 (Probable amidase At4g34880 OS=Arabidopsis thaliana OX=3702 GN=At4g34880 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 1.1e-36
Identity = 89/166 (53.61%), Positives = 119/166 (71.69%), Query Frame = 0

Query: 665 LLTALRSNGPTRVMAT-SPANLILAKIQRAFADHRLTSRTLVDFYLNQIETLNPLLRSVV 724
           L+ ++ S    R+ +T S     +  I+ AF + RLTS+ LV+ YL  I  LNP+L +V+
Sbjct: 22  LIMSVGSASQIRLSSTFSIQEATIEDIRVAFNEKRLTSKQLVELYLEAISKLNPILHAVI 81

Query: 725 EVNPDAREEADEADRRREANENRSPIGDLDGIPVLLKDTIATKDKLNTTAGSYALLGSVV 784
           E NPDA  +A+ ADR R+  +N + +  L G+PVLLKD+I+TKDKLNTTAGS+ALLGSVV
Sbjct: 82  ETNPDALIQAEIADRERDL-KNTTKLPILHGVPVLLKDSISTKDKLNTTAGSFALLGSVV 141

Query: 785 PRDAGVVEKLRRAGAVILGKASLTEWYSFRALGHVPNGWCARAGQG 830
            RDAGVV++LR +GAVILGKASL+EW  FR+   +P+GW AR  QG
Sbjct: 142 ARDAGVVKRLRESGAVILGKASLSEWAHFRSFS-IPDGWSARGLQG 185

BLAST of Sgr023804 vs. ExPASy Swiss-Prot
Match: Q9AUJ7 (Eukaryotic translation initiation factor 4B1 OS=Triticum aestivum OX=4565 GN=EIF4B PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 5.6e-33
Identity = 135/369 (36.59%), Positives = 178/369 (48.24%), Query Frame = 0

Query: 7   PWGKPGAWALDAEEHEAELLKDQQDQARQQSEP------SAEFPSL---AAAAATKPKKK 66
           PWG  GAWALDAE  + E    +   A    +P      +A FPSL     A   K KKK
Sbjct: 4   PWGGVGAWALDAEREDEE---REHAAAFPAPDPPAAAGGAASFPSLKEAVVAGGGKQKKK 63

Query: 67  KGQPIPLGEFQTYGGF-KPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFK 126
           KG  + L EF TYG    P     +EPKGLT +++MMLPTGPR+R+ +ELDR+R   GF+
Sbjct: 64  KGTTLSLSEFTTYGAAGAPRRVAPAEPKGLTPQEMMMLPTGPRERSEDELDRSR---GFR 123

Query: 127 NYGQNSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDD 186
           +YG       G+R   G               DD RRS+  S+ +      PSRADE D+
Sbjct: 124 SYG-------GDREPRGGGF------------DDDRRSSRDSDLDM-----PSRADESDN 183

Query: 187 WGAGK--KPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSE----------GRR 246
           WG  K   P   +   R    G   S   ++D+ D+W  +K   PS              
Sbjct: 184 WGKNKSFSPAPTDSGRRDRLSG--PSPLGRSDDIDNWSRDKKPLPSRYPSLGTGGGFRES 243

Query: 247 SGG-----IGGGFERERRGGFTSNGGGADSDNWGK-KIEGGRGGSGERPRLNLQPRTIPL 306
           SGG      GGGF     GGF  + G +DSD W +  +      +G+RPRLNL P   P 
Sbjct: 244 SGGGFRESSGGGFRESSGGGFRDSPGPSDSDRWVRGAVPAPMTNNGDRPRLNLNP---PK 303

Query: 307 NDGN-QEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERS 347
            D +      A V     +PFG A+PREEVLAEKG DW+K++  +E    + T   + R 
Sbjct: 304 RDPSATPVPAAEVARSRPSPFGAAKPREEVLAEKGLDWRKMEGEIEKKTSRPTSSHSSRP 337

BLAST of Sgr023804 vs. ExPASy Swiss-Prot
Match: Q9SAD7 (Eukaryotic translation initiation factor 4B2 OS=Arabidopsis thaliana OX=3702 GN=EIF4B2 PE=1 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 1.3e-29
Identity = 130/370 (35.14%), Positives = 165/370 (44.59%), Query Frame = 0

Query: 7   PWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQPIPLGE 66
           PWG  GAWA +AE  + E    Q  +A   +  S  FPSL  AA  K  KKK + + L E
Sbjct: 4   PWGGIGAWADEAERADEE----QAAEATATAADSQSFPSLKEAATAKSSKKK-KKMTLSE 63

Query: 67  FQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNSLYDR 126
           F       PSSA      GLT E ++ LPTGPRQR+ +E+   RLGGGF +YG       
Sbjct: 64  FTKGAYTAPSSA------GLTREQMLQLPTGPRQRSEDEMQPGRLGGGFSSYGGGRSSGP 123

Query: 127 GNRYSNGEDSSNSQWG-------SSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGA 186
             R S   + S+  WG       S    +DD R    GSN         SRADE DDWG 
Sbjct: 124 PGRMSRDREDSDGSWGGGGGGRRSYGGFDDDQR----GSNSRVSDFPQVSRADEDDDWGK 183

Query: 187 GKKPM---------------------AGNGFERRERGGFFDSNS------SKADESDSWV 246
           GKK +                      G G      GG   + S      SKADE D+W 
Sbjct: 184 GKKSLPSFDQGRQGSRYGGGGGSFGGGGGGGAGSYGGGGAGAGSGGGGGFSKADEVDNWA 243

Query: 247 SNKSFTPSEGRRSGGIGGGFERERRGGFTSNGGGADSDNWGKKI-EGGRGGSGERPRLNL 306
           + K+       +S   G GF             G + D W + +   G G   ER RL  
Sbjct: 244 AGKA-------KSSTFGSGFRE----------SGPEPDRWARGVLPSGGGVQEERRRLVF 303

Query: 307 QPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTV 342
           +PR     D     +   VK    +PFG ARPRE+VLAEKG DWKK+D  +E+ K + + 
Sbjct: 304 EPRKA---DTEVSETPTAVKTSKPSPFGAARPREQVLAEKGLDWKKLDSDIEAKKGQTSR 338

BLAST of Sgr023804 vs. ExPASy Swiss-Prot
Match: Q9LIN5 (Eukaryotic translation initiation factor 4B1 OS=Arabidopsis thaliana OX=3702 GN=EIF4B1 PE=1 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 1.1e-28
Identity = 141/435 (32.41%), Positives = 190/435 (43.68%), Query Frame = 0

Query: 12  GAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSL---AAAAATKPKKKKGQPIPLGEFQ 71
           GAWA +AE  + E    Q  +A   +  +  FPSL   AAA AT  K +K + + L EF 
Sbjct: 11  GAWADEAERADEE----QAAEATAATADTQSFPSLREAAAATATSGKSRKMKKMSLSEFT 70

Query: 72  TYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNSLYDRGN 131
           T G +     ++S   GLT ++++ LPTGPRQR+ EE+   RLGGGF +YG  S    G 
Sbjct: 71  T-GAYTAPGGRNS--VGLTQQEILQLPTGPRQRSEEEMQPGRLGGGFSSYGGRS----GG 130

Query: 132 RYSNGEDSSNSQW-----GSSSRVN----DDSRRSNDGSNREFRRESAPSRADEIDDWGA 191
           R     D S+  W     G   R      DD RR N     +F +   PSRADE+DDWG 
Sbjct: 131 RIGRDRDDSDGSWSGGGGGGGRRPYGGGFDDDRRGNQSRVSDFPQ---PSRADEVDDWGK 190

Query: 192 GKKPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERERRGG 251
            KKP+    F++  +G +                       +G   GG G GF     GG
Sbjct: 191 EKKPLP--SFDQGRQGRY---------------------SGDGGGFGGGGSGF-----GG 250

Query: 252 FTSNGGGA-----DSDNWGK------------KIEGGRGGSGERPRLNLQPRTIPLNDGN 311
               GGG      D DNWG                 G  G  ER RL L+PR +    G 
Sbjct: 251 GGGGGGGGLSRADDVDNWGAGKRQAPVRSSTFGSSFGDSGQEERRRLVLEPRKV--ESGG 310

Query: 312 QEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKD------TVEKAER 371
            E    V K    NPFG ARPRE+VLAEKG DWKKID  +E+ K         +   +  
Sbjct: 311 SETPPVVEKTSKPNPFGAARPREDVLAEKGLDWKKIDSEIEAKKGSSQTSRPTSAHSSRP 370

Query: 372 SSGASFEKKGFGVRSGRSPD----------------DRMGRSWRK---------PESVES 387
           SS  S   +  G+ +   P                 +  G+ WRK          +  E+
Sbjct: 371 SSAQSNRSESSGLNNVVKPRPKVNPFGDAKPREVLLEEQGKDWRKMDLELEHRRVDRPET 401

BLAST of Sgr023804 vs. ExPASy TrEMBL
Match: A0A7J6GGG4 (Amidase domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_006678 PE=4 SV=1)

HSP 1 Score: 789.6 bits (2038), Expect = 1.2e-224
Identity = 523/1050 (49.81%), Positives = 620/1050 (59.05%), Query Frame = 0

Query: 1    MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
            MAATVSPW KPGAWALD+EE +AELLK+Q+ +A    EP  +FPSL+AAA  KPKKKKGQ
Sbjct: 1    MAATVSPWSKPGAWALDSEEQDAELLKEQEKKAAM--EPLHDFPSLSAAATAKPKKKKGQ 60

Query: 61   PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
             + L EF TYGG KP  AQ + P GLT ED M LP GPR+RTAEE++R R   GF++   
Sbjct: 61   TLSLAEFTTYGGPKP-VAQPTAPAGLTHEDRMALPKGPRERTAEEIERAR-HSGFRS--- 120

Query: 121  NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAG 180
               +DRG+R  NG+DSSNS+W SS        R+++G  ++  R+S PSRADE D+W + 
Sbjct: 121  ---FDRGDR--NGDDSSNSRWKSSD-------RNSNGFGKDAPRDSGPSRADEADNWAST 180

Query: 181  KKPM-AGNGF---ERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERER 240
            KK    GN F   ERRER GFF  + SKAD+SDSWV+NKSF PSEGRR G  GGGFERER
Sbjct: 181  KKSFGGGNDFDRGERRERRGFFPDSQSKADDSDSWVTNKSFVPSEGRRFGSSGGGFERER 240

Query: 241  RGGFTSNGGGADSDNWGKK------IEGGRGG--SGERPRLNLQPRTIP-LNDGNQEAS- 300
            + GFTSNGGGAD D WGK+      + G  GG   G RPRLNLQPRT+P ++DG    S 
Sbjct: 241  KVGFTSNGGGADGDFWGKRREESTGVTGNEGGIVGGGRPRLNLQPRTLPVVSDGGSPGSV 300

Query: 301  -----GAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERSSGAS 360
                   V KPKGSNPFG+ARPREEVLAEKGQDWKKIDE LES+KIK+  EK E   G+S
Sbjct: 301  TGTVAAIVTKPKGSNPFGEARPREEVLAEKGQDWKKIDEQLESLKIKEVNEKPE--GGSS 360

Query: 361  FEKKGFGVRSGRSPDDRMGRSWRKPESVE-SRPQRILVDSVLDLRRSQSNFHGSS----- 420
            F K+ FG+ +GRS DDR+ +SWRKP++ E +  QR    S   LRR++   H        
Sbjct: 361  FGKRSFGIGNGRS-DDRIEKSWRKPDTEEDTGSQRGFGLSDFVLRRARRMAHQLKIIEGI 420

Query: 421  --------------------------LFAAFL---------FFAGAIVGAQGYAVVTGTV 480
                                      LF+ +L         F   AI G QG A+VTGTV
Sbjct: 421  RQLPEKKRKKIEVIRSVDSLRLLFICLFSIYLAVTYNIELVFDNFAIDGIQGDAMVTGTV 480

Query: 481  FCDQCKDGQISLFDYPMNDYKFVWKLYDE-------------------IRRDA--DLSSC 540
            FCDQCKDGQ S FDYP++  K      D                    ++ D   DLS C
Sbjct: 481  FCDQCKDGQRSWFDYPISGVKVKVTCTDSNGQITMSREETTNWFGNYAVKFDGAPDLSGC 540

Query: 541  YAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDSLLSQPAQPMPFCSASVNPV 600
            ++ V  SGQG+ + C   AGPA+ LRLMFR FDMEMY  DSLLSQPAQPMP C  S NPV
Sbjct: 541  FSQVSDSGQGSESGCSAPAGPAQKLRLMFRFFDMEMYATDSLLSQPAQPMPNCPKSTNPV 600

Query: 601  PAPVTIIPPPP---PLPPLSPP--------------------------PTFRLPPLPQLP 660
            PAPVT   PPP   P+PP  PP                          P FRLPP+P LP
Sbjct: 601  PAPVTPAQPPPVAVPVPPAQPPPVATPAPPAQPQPVVSPVTPAIPPPSPAFRLPPMPPLP 660

Query: 661  PLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDLT 720
            PLPP AP  +  +        SK+WT P+YRCYWR VNP+TKVAVVFG +AA+RYGTDLT
Sbjct: 661  PLPP-APFLEASACP------SKSWTMPEYRCYWRMVNPETKVAVVFGSLAASRYGTDLT 720

Query: 721  LWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPTISVLQRMNWALLGSPRAVLLTALR 780
            +   LQGRGDPYRTLLRE +T+FLNSYN+L+FPY TI V+Q M+WAL+GS R+VLLT LR
Sbjct: 721  MMQALQGRGDPYRTLLREGVTSFLNSYNTLRFPYNTIGVVQHMDWALMGSTRSVLLTGLR 780

Query: 781  ----------------------SNGPTRVMATSPAN------------------------ 831
                                  SNG     A+S +N                        
Sbjct: 781  FMRANSGYGTDYCTAQNRSIFGSNGNQMTAASSSSNHPKAATIQDTLLPSLGEIQAKCGC 840

BLAST of Sgr023804 vs. ExPASy TrEMBL
Match: A0A7J6EDX3 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009644 PE=4 SV=1)

HSP 1 Score: 784.6 bits (2025), Expect = 3.9e-223
Identity = 516/1032 (50.00%), Positives = 609/1032 (59.01%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
           MAATVSPW KPGAWALD+EE +AELLK+Q+ +A    EP  +FPSL+AAA  KPKKKKGQ
Sbjct: 1   MAATVSPWSKPGAWALDSEEQDAELLKEQEKKAAM--EPLHDFPSLSAAATAKPKKKKGQ 60

Query: 61  PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
            + L EF TYGG KP  AQ + P GLT ED M LP GPR+RTAEE++R R   GF++   
Sbjct: 61  TLSLAEFTTYGGPKP-VAQPTAPAGLTHEDRMALPKGPRERTAEEIERAR-HSGFRS--- 120

Query: 121 NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAG 180
              +DRG+R  NG+DSSNS+W SS        R+++G  ++  R+S PSRADE D+W + 
Sbjct: 121 ---FDRGDR--NGDDSSNSRWKSSD-------RNSNGFGKDAPRDSGPSRADEADNWAST 180

Query: 181 KKPM-AGNGF---ERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERER 240
           KK    GN F   ERRER GFF  + SKAD+SDSWV+NKSF PSEGRR G  GGGFERER
Sbjct: 181 KKSFGGGNDFDRGERRERRGFFPDSQSKADDSDSWVTNKSFVPSEGRRFGSSGGGFERER 240

Query: 241 RGGFTSNGGGADSDNWGKK------IEGGRGG--SGERPRLNLQPRTIP-LNDGNQEAS- 300
           + GFTSNGGGAD D WGK+      + G  GG   G RPRLNLQPRT+P ++DG    S 
Sbjct: 241 KVGFTSNGGGADGDFWGKRREESSGVTGNEGGIVGGGRPRLNLQPRTLPVVSDGGSPGSV 300

Query: 301 -----GAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAERSSGAS 360
                  V KPKGSNPFG+ARPREEVLAEKGQDWKKIDE LES+KIK+  EK E   G+S
Sbjct: 301 TGTGAAIVTKPKGSNPFGEARPREEVLAEKGQDWKKIDEQLESLKIKEVNEKPE--GGSS 360

Query: 361 FEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQRILVDSVLDLRRSQSNFHGSSLFAAFL 420
           F K+ FG+ +GRS DDR+ +SWRKP++ E                SQ  F G S F    
Sbjct: 361 FGKRSFGIGNGRS-DDRIEKSWRKPDTEED-------------TGSQRGF-GLSDF---- 420

Query: 421 FFAGAIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDE------------ 480
               AI G QG A+VTGTVFCDQCKDGQ S FDYP++  K      D             
Sbjct: 421 ----AIDGIQGDAMVTGTVFCDQCKDGQRSWFDYPISGVKVKVTCTDSNGQITMSREETT 480

Query: 481 -------IRRDA--DLSSCYAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDS 540
                  ++ D   DLS C++ V  SGQG+ + C   AGPA+ LRLMFR FDMEMY  DS
Sbjct: 481 NWFGNYAVKFDGAPDLSGCFSQVSDSGQGSESGCSAPAGPAQKLRLMFRFFDMEMYATDS 540

Query: 541 LLSQPAQPMPFCSASVNPVPAPVTIIPPPP---PLPPLSPP------------------- 600
           LLSQPAQPMP C  S NP+PAPVT   PPP   P+PP  PP                   
Sbjct: 541 LLSQPAQPMPNCPKSTNPIPAPVTPAQPPPVAVPVPPAQPPPVATPAPPAQPQPVVSPVT 600

Query: 601 -------PTFRLPPLPQLPPLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRAVNPDT 660
                  P FRLPP+P LPPLPP AP  +  +        SK+WT P+YRCYWR VNP+T
Sbjct: 601 PAIPPPSPAFRLPPMPPLPPLPP-APFLEASACP------SKSWTMPEYRCYWRMVNPET 660

Query: 661 KVAVVFGIVAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPTISVLQ 720
           KVAVVFG +AA+RYGTDLT+   LQGRGDPYRTLLRE +T+FLNSYN+L+FPY TI V+Q
Sbjct: 661 KVAVVFGSLAASRYGTDLTMMQALQGRGDPYRTLLREGVTSFLNSYNTLRFPYNTIGVVQ 720

Query: 721 RMNWALLGSPRAVLLTALRS-----------------------NGPTRVMATSPAN---- 780
            M+WAL+GS R+VLLT LR                        NG     A+S +N    
Sbjct: 721 HMDWALMGSTRSVLLTGLRFMRANSGYGTVNCSFTPCKSIFGWNGNQMTAASSSSNHPMP 780

Query: 781 ------------------------------------------------------------ 831
                                                                       
Sbjct: 781 ATIQDTLLPSPGRFKLSVDASVREQSSRIGFGAVVQNCREEVVAGCYSSIPSGLPLISLK 840

BLAST of Sgr023804 vs. ExPASy TrEMBL
Match: A0A5J5B0E3 (Amidase domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_028486 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 2.8e-213
Identity = 480/912 (52.63%), Positives = 573/912 (62.83%), Query Frame = 0

Query: 14  WALDAEEHEAELLKDQQDQA-----------RQQSEPSAEFPSLAAAAATKPKKKKGQPI 73
           WALD+EEHE ELL+ Q++ A           R++    A+FPSL+AAAATK KKKKGQ +
Sbjct: 33  WALDSEEHETELLQQQKEDADSDFNGSDTSGRREPAALADFPSLSAAAATKTKKKKGQTL 92

Query: 74  PLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNS 133
            L EF T G  KP  AQ+S+ KGLT EDLMMLPTGPR+RTAEELDR+RLGGGF++ G   
Sbjct: 93  SLAEFTTLGTTKP--AQTSQTKGLTAEDLMMLPTGPRERTAEELDRSRLGGGFRSNG--- 152

Query: 134 LYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGAGKK 193
            YDR NRYSN ++SSN +WG SSR +D+ RR   G  R+  RE APSRADEIDDW A KK
Sbjct: 153 -YDR-NRYSNSDESSNPRWG-SSRGSDEPRRQG-GLGRDSTRELAPSRADEIDDWAAAKK 212

Query: 194 PMAGNGFERR---ERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFE--RERR 253
              GNGFERR   ERGGFFDS  S+ADESDSWVSNKSF PSEGRR G  GG F+  RERR
Sbjct: 213 STVGNGFERRERGERGGFFDS-QSRADESDSWVSNKSFIPSEGRRFGSNGGAFDSLRERR 272

Query: 254 GGFTSNGGGADSDNWGKKIEGGR------------------------------------- 313
           GGF SNG G DSDNWG+K EGGR                                     
Sbjct: 273 GGFESNGSGTDSDNWGRKEEGGRKFGAAGGAFDSYRERRGGFESAMNGGADSDNWGKKRE 332

Query: 314 -----GGSGERPRLNLQPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDW 373
                G    RPRLNLQPRT+P+ +  Q  S A+ KPKGSNPFG+ARPREEVL EKGQDW
Sbjct: 333 EGSGAGAGTGRPRLNLQPRTVPVVNELQNGSAALAKPKGSNPFGEARPREEVLKEKGQDW 392

Query: 374 KKIDEHLESMKIKDTVEKAE-RSSGASFEKKGFGVRSGRSP----DDRMGRSWRKPESVE 433
           K+I+E LES+KIK+        + G SF K+ FG  +GR+          RSWRK +SV 
Sbjct: 393 KEIEEKLESVKIKEMGSAGTVTTDGPSFGKRSFGSGNGRASLSEGRPEAERSWRKTDSV- 452

Query: 434 SRPQRILVDSVLDLRRSQSNFHGSSL---FAAFLFFAGAIVGAQGYAVVTGTVFCDQCKD 493
                       D+  +     G+ L     AFL  A AI GAQG A+VTGTVFCDQCKD
Sbjct: 453 ------------DVGTTSGEKIGNGLVEEVEAFLLVAAAIHGAQGEAMVTGTVFCDQCKD 512

Query: 494 GQISLFDYPMNDYKFVWKLYDE-------------------IRRDA--DLSSCYAAVGGS 553
           G+ISLFDYP++  K      D                    +R D   DLS CYA V GS
Sbjct: 513 GRISLFDYPLHGIKVAMACADSNGKITMWREETTNWFGNYAMRFDGTPDLSGCYAQVSGS 572

Query: 554 GQGTTTDCGGAAGPAKSLRLMFRMFDMEMYVVDSLLSQPAQPMPFCSASVNPVPAPVTII 613
           GQG +  CG  AGPAK  RLMFR+FDME+Y VD LLS PAQPM +C     PVPAPVT  
Sbjct: 573 GQG-SMGCGAVAGPAKYPRLMFRLFDMEIYNVDPLLSGPAQPMSYCQRPATPVPAPVT-- 632

Query: 614 PPPPPLPPLSPPPTFRLPPLPQLPPLPPLAPLPQCHSWKVLLVNMSKNWTNPDYRCYWRA 673
             P   PP    P  RLPP+P LPP+PP+ P  Q  +         + W  P+Y+CYW+A
Sbjct: 633 --PVKSPPSRNRPAPRLPPMPPLPPMPPV-PFLQASACPY------QKWMMPEYKCYWKA 692

Query: 674 VNPDTKVAVVFGIVAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLQFPYPT 733
            +PD KVAVVFG++AA RYGTD+TLW  +QGRG+PY+TLLRE  TA LNSYNS+QFP P+
Sbjct: 693 ASPDMKVAVVFGLIAARRYGTDITLWEAMQGRGEPYKTLLREGTTALLNSYNSIQFPLPS 752

Query: 734 ISVLQRMNWALLGSPRAVLLTALRSNGPTRVMATS-----PANLILAKIQ---RAFADHR 793
                     + G      L +   +  T + A+S     P + I +       +  D+ 
Sbjct: 753 PRCCAAHELGIDGFHSPSRLHSFEGSSLTPLTASSSLSRKPQSRISSAPSPKTNSRPDNS 812

Query: 794 LTSRTLVDFYLNQIETLNPLLRSVVEVNPDAREEADEADRRREANENRSPIGDLDGIPVL 831
           LTS  +      + + LNP+LR+VVEVNPDA+++AD+ADR R  NE    +GD+ GIPVL
Sbjct: 813 LTSTCI------KPKFLNPVLRAVVEVNPDAQDQADQADRERGRNEGCGSLGDMHGIPVL 872

BLAST of Sgr023804 vs. ExPASy TrEMBL
Match: A0A6J1CZ88 (eukaryotic translation initiation factor 4B3-like OS=Momordica charantia OX=3673 GN=LOC111015543 PE=4 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 2.4e-180
Identity = 344/398 (86.43%), Positives = 358/398 (89.95%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQ 60
           MAATVSPWGKPGAWALDAEEHEAELLKDQ+DQA QQSEPSAEFPSLAAAAATKPKKKKGQ
Sbjct: 1   MAATVSPWGKPGAWALDAEEHEAELLKDQKDQALQQSEPSAEFPSLAAAAATKPKKKKGQ 60

Query: 61  PIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQ 120
            IPL EFQTYGG KP S QSSEPKGLTTEDLMMLPTGPRQRTAEE+DRNRLGGGFKNYGQ
Sbjct: 61  SIPLSEFQTYGGPKP-SPQSSEPKGLTTEDLMMLPTGPRQRTAEEMDRNRLGGGFKNYGQ 120

Query: 121 NSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESA-PSRADEIDDWGA 180
           NSLYDRGNRYSNGEDSSNS+WG SSRV D+SRRSNDGSNRE RRE A PSRADEIDDWGA
Sbjct: 121 NSLYDRGNRYSNGEDSSNSRWG-SSRVFDESRRSNDGSNRELRREPAPPSRADEIDDWGA 180

Query: 181 GKKPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERERRGG 240
           GKKPM GNGFERRERGGFFDSNSSKAD+SDSWVSNKSF PSEGRRSGG+GGGFERERRGG
Sbjct: 181 GKKPMVGNGFERRERGGFFDSNSSKADDSDSWVSNKSFVPSEGRRSGGMGGGFERERRGG 240

Query: 241 FTS-----------------------NGGGADSDNWGKKIEGGRGGSGERPRLNLQPRTI 300
           FTS                       NGGGADSDNWGK+ EGGRGGSGERPRLNLQPRTI
Sbjct: 241 FTSNGGGADSDSWGKRSEGGRGGSGENGGGADSDNWGKRSEGGRGGSGERPRLNLQPRTI 300

Query: 301 PLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTVEKAER 360
           PLNDGNQEASGA VKPKGSNPFG+ARPREEVLAEKGQDWKKIDE LESMK+KDTVE+AER
Sbjct: 301 PLNDGNQEASGAAVKPKGSNPFGNARPREEVLAEKGQDWKKIDEQLESMKVKDTVERAER 360

Query: 361 SSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQ 375
            +GASFE+KGFGVRSGRSPDDRMGRSWRKP+SV+SRPQ
Sbjct: 361 PTGASFERKGFGVRSGRSPDDRMGRSWRKPDSVDSRPQ 396

BLAST of Sgr023804 vs. ExPASy TrEMBL
Match: A0A5N5FHL1 (Eukaryotic translation initiation factor 4B-like OS=Pyrus ussuriensis x Pyrus communis OX=2448454 GN=D8674_038461 PE=4 SV=1)

HSP 1 Score: 631.3 bits (1627), Expect = 5.6e-177
Identity = 412/781 (52.75%), Positives = 479/781 (61.33%), Query Frame = 0

Query: 1   MAATVSPWGKPGAWALDAEEHEAELLKD-QQDQARQQSEPSAEFPSLAAAAATKPKKKKG 60
           MAATVSPW KPGAWAL  EE EAEL +  +++Q R    PSA+FPSL+AAAATKPKKKK 
Sbjct: 1   MAATVSPWAKPGAWALATEEQEAELEQQAKEEQQRAVEPPSADFPSLSAAAATKPKKKK- 60

Query: 61  QPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYG 120
           Q I L EF  +G  KP  A+   P GLT ED M+LPTGPR+RTAEELDRNRLGGGF++YG
Sbjct: 61  QTISLAEFNNFGAPKPKPAE--HPVGLTHEDRMLLPTGPRERTAEELDRNRLGGGFRSYG 120

Query: 121 QNSLYDRGN-RYSNGEDSSNSQWGSSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWG 180
                DRGN RYSNG++SS S+WGS        +R   G  RE  R+  PSRADE+DDWG
Sbjct: 121 G----DRGNSRYSNGDESSESKWGS-------GQRKEGGFGRESNRDE-PSRADEVDDWG 180

Query: 181 AGKKPMAGNGFERRERGG----FFDSNSSKADESDSWVSNKSFTPSEGRRSGG------- 240
           A KK M GNGFERRERGG    FF  + SKADES+SWVSNKS   SEGRR GG       
Sbjct: 181 AKKKSMPGNGFERRERGGPGGSFFGGSQSKADESNSWVSNKSSMQSEGRRFGGGGSGFDR 240

Query: 241 -------------------------IGGGFERERRGGFTSNGGGADSDNWGKKIEGGRGG 300
                                     G GF+RER+ GF SNGGGADSD WGKK E   GG
Sbjct: 241 DRKVGFPSNGGADSDNWGRKKEETNGGSGFDRERKVGFVSNGGGADSDVWGKKREESNGG 300

Query: 301 SGE---RPRLNLQPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKID 360
             E   RPRLNLQPR++P+++     S    KPKGSNPFG ARPREEVLAEKG+DWK+ID
Sbjct: 301 LSESTARPRLNLQPRSLPVSNETSPGSTTPPKPKGSNPFGAARPREEVLAEKGKDWKEID 360

Query: 361 EHLESMKIKDTVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVESRPQRILVD 420
           E LES KIK+  E A   S      + FG+ +GR+  DR  R+WRKP+S +SRPQ     
Sbjct: 361 EQLESAKIKEVKEIANSESFG----RSFGMGNGRA-GDRTERAWRKPDSADSRPQSADKS 420

Query: 421 SVLDLRRSQSNFH--------------GSSLF------AAFLFFAG-------------- 480
                     N H               SSL         + +F G              
Sbjct: 421 ETRSNSEEPQNEHVEPYLLQAWIGWLCKSSLSLSWSHECVWCYFYGQRIPPNQQRLEIHI 480

Query: 481 ------------AIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDE---- 540
                       AI  AQ   +V+GTVFCDQCKDG+ SLFDYP+   K      D     
Sbjct: 481 IHLAANPPMSNNAIDAAQANTMVSGTVFCDQCKDGERSLFDYPIYGVKVQVACSDSNGQI 540

Query: 541 -IRRD----------------ADLSSCYAAVGGSGQGTTTDCGGAAGPAKSLRLMFRMFD 600
            + R+                 DLS CYA +  S   T + C  +AGPA+SLRL+FRMFD
Sbjct: 541 TMSREETTNWFGNYAIGFDGTPDLSGCYAQISSS---TGSGCVVSAGPAQSLRLVFRMFD 600

Query: 601 MEMYVVDSLLSQPAQPMPFCSASVNPVPAPVTIIPP-PPPLPPLSPPPTFRLPPLPQLPP 660
           + MYVVDSLL+QPA+PM FC  S NPVPAPVT + P P P+ P SPPP FRLPP+P+LPP
Sbjct: 601 VGMYVVDSLLTQPAEPMSFCPKSSNPVPAPVTPVNPLPKPVTPASPPP-FRLPPMPRLPP 660

Query: 661 LPPLAPLPQCHSWKVLLVNM--SKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDL 671
           LPPL  LP       L       + WT P+++CYWRAVNP+TKVAV+FG VAA RYGTDL
Sbjct: 661 LPPLPSLPPMPPMPFLEATACPHQQWTLPEHKCYWRAVNPNTKVAVIFGPVAAGRYGTDL 720

BLAST of Sgr023804 vs. TAIR 10
Match: AT4G38710.2 (glycine-rich protein)

HSP 1 Score: 317.4 bits (812), Expect = 3.4e-86
Identity = 218/402 (54.23%), Positives = 262/402 (65.17%), Query Frame = 0

Query: 2   AATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQS--EPSAEFPSLAAAAATKPKKKKG 61
           AA  S W KPGAWAL+AEEHEAE LK Q     Q+S  E S++FPSLAAAA TK KKKKG
Sbjct: 3   AAVSSVWAKPGAWALEAEEHEAE-LKQQPSPTNQKSSAEDSSDFPSLAAAATTKTKKKKG 62

Query: 62  QPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYG 121
           Q I L EF TYG  K   A  +E   LT  +L+ LPTGPR+R+AEELDR++LGGGF++YG
Sbjct: 63  QTISLAEFATYGTAKAKPAPQTE--RLTQAELVALPTGPRERSAEELDRSKLGGGFRSYG 122

Query: 122 QNSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSN--REFRRESAPSRADEIDDW 181
                  G RY  G++SS+S+WG SSRV++D  R   G N  RE  R+S PSRADE D+W
Sbjct: 123 -------GGRY--GDESSSSRWG-SSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNW 182

Query: 182 GAGKKPMAGNGFERRER---GGFFDSNS-SKADESDSWVSNKSFTPSEGRR--SGGIGGG 241
            A KKP++GNGFERRER   GGFF+S S SKADE DSWVS K   PSE RR  S   GGG
Sbjct: 183 AAAKKPISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRFVSSNGGGG 242

Query: 242 FERERRGGFTS----------NGGGADSDNWGKKIE--GGRGGS-----GERPRLNLQPR 301
              E+RG F S           GGG++SD WG++ E  G   GS     G RPRL LQPR
Sbjct: 243 DRFEKRGSFESLSRNRDSQYGGGGGSESDTWGRRREESGAANGSPPPSGGSRPRLVLQPR 302

Query: 302 TIPLN-----DGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKD 361
           T+P+               V KPKG+NPFG+ARPREEVLAEKGQDWK+IDE LE+ K+KD
Sbjct: 303 TLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQDWKEIDEKLEAEKLKD 362

Query: 362 TVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRKPESVES 372
                E+ +  S  K GFG+ +GR  ++R+ RSWRK  S+ S
Sbjct: 363 IAAAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRKSFSLHS 388

BLAST of Sgr023804 vs. TAIR 10
Match: AT4G38710.1 (glycine-rich protein)

HSP 1 Score: 315.5 bits (807), Expect = 1.3e-85
Identity = 216/396 (54.55%), Positives = 259/396 (65.40%), Query Frame = 0

Query: 2   AATVSPWGKPGAWALDAEEHEAELLKDQQDQARQQS--EPSAEFPSLAAAAATKPKKKKG 61
           AA  S W KPGAWAL+AEEHEAE LK Q     Q+S  E S++FPSLAAAA TK KKKKG
Sbjct: 3   AAVSSVWAKPGAWALEAEEHEAE-LKQQPSPTNQKSSAEDSSDFPSLAAAATTKTKKKKG 62

Query: 62  QPIPLGEFQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYG 121
           Q I L EF TYG  K   A  +E   LT  +L+ LPTGPR+R+AEELDR++LGGGF++YG
Sbjct: 63  QTISLAEFATYGTAKAKPAPQTE--RLTQAELVALPTGPRERSAEELDRSKLGGGFRSYG 122

Query: 122 QNSLYDRGNRYSNGEDSSNSQWGSSSRVNDDSRRSNDGSN--REFRRESAPSRADEIDDW 181
                  G RY  G++SS+S+WG SSRV++D  R   G N  RE  R+S PSRADE D+W
Sbjct: 123 -------GGRY--GDESSSSRWG-SSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNW 182

Query: 182 GAGKKPMAGNGFERRER---GGFFDSNS-SKADESDSWVSNKSFTPSEGRR--SGGIGGG 241
            A KKP++GNGFERRER   GGFF+S S SKADE DSWVS K   PSE RR  S   GGG
Sbjct: 183 AAAKKPISGNGFERRERGSGGGFFESQSQSKADEVDSWVSTK---PSEPRRFVSSNGGGG 242

Query: 242 FERERRGGFTS----------NGGGADSDNWGKKIE--GGRGGS-----GERPRLNLQPR 301
              E+RG F S           GGG++SD WG++ E  G   GS     G RPRL LQPR
Sbjct: 243 DRFEKRGSFESLSRNRDSQYGGGGGSESDTWGRRREESGAANGSPPPSGGSRPRLVLQPR 302

Query: 302 TIPLN-----DGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKD 361
           T+P+               V KPKG+NPFG+ARPREEVLAEKGQDWK+IDE LE+ K+KD
Sbjct: 303 TLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKGQDWKEIDEKLEAEKLKD 362

Query: 362 TVEKAERSSGASFEKKGFGVRSGRSPDDRMGRSWRK 366
                E+ +  S  K GFG+ +GR  ++R+ RSWRK
Sbjct: 363 IAAAMEKPNEKSTGKMGFGLGNGRKDEERIERSWRK 382

BLAST of Sgr023804 vs. TAIR 10
Match: AT2G16630.1 (Pollen Ole e 1 allergen and extensin family protein)

HSP 1 Score: 221.1 bits (562), Expect = 3.4e-57
Identity = 148/351 (42.17%), Positives = 188/351 (53.56%), Query Frame = 0

Query: 390 NFHGSSLFAAFLFFAGAIVGAQGYAVVTGTVFCDQCKDGQISLFDYPMNDYKFVWKLYDE 449
           N +   +F AF     A  G  GYA VTG+VFCDQCKDG+ SLFD+P++  K      DE
Sbjct: 4   NHNNLLIFVAFFVLCLATNGVTGYATVTGSVFCDQCKDGERSLFDFPVSGIKISVTCADE 63

Query: 450 -------------------IRRDA--DLSSCYAAVGGSG-QGTTTDCGGAAGPAKSLRLM 509
                              +R D   DLS+CYA V  +G Q   + C  A+GPA+ L+LM
Sbjct: 64  NGQVYMSREETTNWLGGYVMRFDGTPDLSNCYAQVSDNGVQQDPSSCSIASGPAQKLKLM 123

Query: 510 FRMFDMEMYVVDSLLSQPAQPMPFC----SASVNPVP----------------------- 569
           F  F +E +  D+LL+QP QP  FC    +A V P P                       
Sbjct: 124 FSFFGIETFAADALLAQPVQPSSFCPKPPTAPVMPPPQVPVMPPPQVPVKPHPKVPVISP 183

Query: 570 -APVTIIPP-------------PPP------LPPLSPPPTFRLPPLPQLPPLPPLAPLPQ 629
             P T+ PP             PPP      LPP++ PP F+LPPLPQ+PP+P + P   
Sbjct: 184 DPPATLPPPKVPVISPDPPTTLPPPLVPVINLPPVTSPPQFKLPPLPQIPPMPFVEPSAC 243

Query: 630 CHSWKVLLVNMSKNWTNPDYRCYWRAVNPDTKVAVVFGIVAANRYGTDLTLWNGLQGRGD 671
            H          + W  P+YRCYWRA+ PDTKVAV FG+VA   YGTD+T+   L GRG+
Sbjct: 244 SH----------QLWMKPEYRCYWRAIGPDTKVAVAFGLVAGRIYGTDMTVREALDGRGE 303

BLAST of Sgr023804 vs. TAIR 10
Match: AT1G13020.1 (eukaryotic initiation factor 4B2)

HSP 1 Score: 133.3 bits (334), Expect = 9.2e-31
Identity = 130/370 (35.14%), Positives = 165/370 (44.59%), Query Frame = 0

Query: 7   PWGKPGAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSLAAAAATKPKKKKGQPIPLGE 66
           PWG  GAWA +AE  + E    Q  +A   +  S  FPSL  AA  K  KKK + + L E
Sbjct: 4   PWGGIGAWADEAERADEE----QAAEATATAADSQSFPSLKEAATAKSSKKK-KKMTLSE 63

Query: 67  FQTYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNSLYDR 126
           F       PSSA      GLT E ++ LPTGPRQR+ +E+   RLGGGF +YG       
Sbjct: 64  FTKGAYTAPSSA------GLTREQMLQLPTGPRQRSEDEMQPGRLGGGFSSYGGGRSSGP 123

Query: 127 GNRYSNGEDSSNSQWG-------SSSRVNDDSRRSNDGSNREFRRESAPSRADEIDDWGA 186
             R S   + S+  WG       S    +DD R    GSN         SRADE DDWG 
Sbjct: 124 PGRMSRDREDSDGSWGGGGGGRRSYGGFDDDQR----GSNSRVSDFPQVSRADEDDDWGK 183

Query: 187 GKKPM---------------------AGNGFERRERGGFFDSNS------SKADESDSWV 246
           GKK +                      G G      GG   + S      SKADE D+W 
Sbjct: 184 GKKSLPSFDQGRQGSRYGGGGGSFGGGGGGGAGSYGGGGAGAGSGGGGGFSKADEVDNWA 243

Query: 247 SNKSFTPSEGRRSGGIGGGFERERRGGFTSNGGGADSDNWGKKI-EGGRGGSGERPRLNL 306
           + K+       +S   G GF             G + D W + +   G G   ER RL  
Sbjct: 244 AGKA-------KSSTFGSGFRE----------SGPEPDRWARGVLPSGGGVQEERRRLVF 303

Query: 307 QPRTIPLNDGNQEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKDTV 342
           +PR     D     +   VK    +PFG ARPRE+VLAEKG DWKK+D  +E+ K + + 
Sbjct: 304 EPRKA---DTEVSETPTAVKTSKPSPFGAARPREQVLAEKGLDWKKLDSDIEAKKGQTSR 338

BLAST of Sgr023804 vs. TAIR 10
Match: AT3G26400.1 (eukaryotic translation initiation factor 4B1)

HSP 1 Score: 130.2 bits (326), Expect = 7.8e-30
Identity = 141/435 (32.41%), Positives = 190/435 (43.68%), Query Frame = 0

Query: 12  GAWALDAEEHEAELLKDQQDQARQQSEPSAEFPSL---AAAAATKPKKKKGQPIPLGEFQ 71
           GAWA +AE  + E    Q  +A   +  +  FPSL   AAA AT  K +K + + L EF 
Sbjct: 11  GAWADEAERADEE----QAAEATAATADTQSFPSLREAAAATATSGKSRKMKKMSLSEFT 70

Query: 72  TYGGFKPSSAQSSEPKGLTTEDLMMLPTGPRQRTAEELDRNRLGGGFKNYGQNSLYDRGN 131
           T G +     ++S   GLT ++++ LPTGPRQR+ EE+   RLGGGF +YG  S    G 
Sbjct: 71  T-GAYTAPGGRNS--VGLTQQEILQLPTGPRQRSEEEMQPGRLGGGFSSYGGRS----GG 130

Query: 132 RYSNGEDSSNSQW-----GSSSRVN----DDSRRSNDGSNREFRRESAPSRADEIDDWGA 191
           R     D S+  W     G   R      DD RR N     +F +   PSRADE+DDWG 
Sbjct: 131 RIGRDRDDSDGSWSGGGGGGGRRPYGGGFDDDRRGNQSRVSDFPQ---PSRADEVDDWGK 190

Query: 192 GKKPMAGNGFERRERGGFFDSNSSKADESDSWVSNKSFTPSEGRRSGGIGGGFERERRGG 251
            KKP+    F++  +G +                       +G   GG G GF     GG
Sbjct: 191 EKKPLP--SFDQGRQGRY---------------------SGDGGGFGGGGSGF-----GG 250

Query: 252 FTSNGGGA-----DSDNWGK------------KIEGGRGGSGERPRLNLQPRTIPLNDGN 311
               GGG      D DNWG                 G  G  ER RL L+PR +    G 
Sbjct: 251 GGGGGGGGLSRADDVDNWGAGKRQAPVRSSTFGSSFGDSGQEERRRLVLEPRKV--ESGG 310

Query: 312 QEASGAVVKPKGSNPFGDARPREEVLAEKGQDWKKIDEHLESMKIKD------TVEKAER 371
            E    V K    NPFG ARPRE+VLAEKG DWKKID  +E+ K         +   +  
Sbjct: 311 SETPPVVEKTSKPNPFGAARPREDVLAEKGLDWKKIDSEIEAKKGSSQTSRPTSAHSSRP 370

Query: 372 SSGASFEKKGFGVRSGRSPD----------------DRMGRSWRK---------PESVES 387
           SS  S   +  G+ +   P                 +  G+ WRK          +  E+
Sbjct: 371 SSAQSNRSESSGLNNVVKPRPKVNPFGDAKPREVLLEEQGKDWRKMDLELEHRRVDRPET 401

The following BLAST results are available for this feature:
Match Name        E-value     Identity (%)   Description
KAF4382046.1      2.5e-224    49.81          hypothetical protein G4B88_006678 [Cannabis sativa]
KAF4356667.1      8.2e-223    50.00          hypothetical protein G4B88_009644 [Cannabis sativa]
KAA8536008.1      5.9e-213    52.63          hypothetical protein F0562_028486 [Nyssa sinensis]
XP_022146306.1    5.0e-180    86.43          eukaryotic translation initiation factor 4B3-like [Momordica charantia] >XP_0221...
KAB2600730.1      1.2e-176    52.75          eukaryotic translation initiation factor 4B-like [Pyrus ussuriensis x Pyrus comm...

Match Name        E-value     Identity (%)   Description
Q9SZP8            1.8e-84     54.55          Eukaryotic translation initiation factor 4B3 OS=Arabidopsis thaliana OX=3702 GN=...
A0A1P8B760        1.1e-36     53.61          Probable amidase At4g34880 OS=Arabidopsis thaliana OX=3702 GN=At4g34880 PE=2 SV=...
Q9AUJ7            5.6e-33     36.59          Eukaryotic translation initiation factor 4B1 OS=Triticum aestivum OX=4565 GN=EIF...
Q9SAD7            1.3e-29     35.14          Eukaryotic translation initiation factor 4B2 OS=Arabidopsis thaliana OX=3702 GN=...
Q9LIN5            1.1e-28     32.41          Eukaryotic translation initiation factor 4B1 OS=Arabidopsis thaliana OX=3702 GN=...


Match Name	E-value	Identity	Description
A0A7J6GGG4	1.2e-224	49.81	Amidase domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_006678 PE=...
A0A7J6EDX3	3.9e-223	50.00	Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009644 PE=4 SV=1
A0A5J5B0E3	2.8e-213	52.63	Amidase domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_028486 PE...
A0A6J1CZ88	2.4e-180	86.43	eukaryotic translation initiation factor 4B3-like OS=Momordica charantia OX=3673...
A0A5N5FHL1	5.6e-177	52.75	Eukaryotic translation initiation factor 4B-like OS=Pyrus ussuriensis x Pyrus co...

Match Name	E-value	Identity	Description
AT4G38710.2	3.4e-86	54.23	glycine-rich protein
AT4G38710.1	1.3e-85	54.55	glycine-rich protein
AT2G16630.1	3.4e-57	42.17	Pollen Ole e 1 allergen and extensin family protein
AT1G13020.1	9.2e-31	35.14	eukaryotic initiation factor 4B2
AT3G26400.1	7.8e-30	32.41	eukaryotic translation initiation factor 4B1

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 16..36
None	No IPR available	COILS	Coil	Coil	coord: 726..746
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 149..179
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 729..747
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 201..221
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 22..301
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 349..368
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 186..200
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 337..368
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 72..86
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 121..148
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 729..748
None	No IPR available	PANTHER	PTHR32091:SF17	EUKARYOTIC TRANSLATION INITIATION FACTOR 4B3	coord: 1..371
IPR036928	Amidase signature (AS) superfamily	GENE3D	3.90.1300.10	Amidase signature (AS) domain	coord: 668..830; e-value: 4.3E-42; score: 146.7
IPR036928	Amidase signature (AS) superfamily	SUPERFAMILY	75304	Amidase signature (AS) enzymes	coord: 688..813
IPR010433	Plant specific eukaryotic initiation factor 4B	PFAM	PF06273	eIF-4B	coord: 7..341; e-value: 5.5E-59; score: 200.6
IPR010433	Plant specific eukaryotic initiation factor 4B	PANTHER	PTHR32091	EUKARYOTIC TRANSLATION INITIATION FACTOR 4B	coord: 1..371
IPR023631	Amidase signature domain	PFAM	PF01425	Amidase	coord: 704..811; e-value: 1.1E-22; score: 80.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr023804.1	Sgr023804.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006413	translational initiation
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0003729	mRNA binding
molecular_function	GO:0003743	translation initiation factor activity