Sgr012579 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr012579
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: S4 RNA-binding domain-containing protein
Location: tig00153447: 33456 .. 44687 (+)
RNA-Seq Expression: Sgr012579
Synteny: Sgr012579
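
The Location field can be parsed to recover the contig, strand, and span of the gene. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention; this page does not state which convention it uses). `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'contig: start .. end (strand)' into (contig, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00153447: 33456 .. 44687 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, strand, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention is worth confirming before comparing against the sequence length.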
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGCTGCGAGCATTGGAGCTCTGTGGAGTCTAAGAAGGGCAGCTCAAGCGTCGAGTTTCCGCACTCCCCCCGCCGTTAACCTCAATAAGCTCTCCTTCCATGAGGCCCGCTTTCCCTCTTCTCTGTATTCTCCTCTGGGGTCCTCAGGTTTTGATTCGCTTTTCCTCCATGGTTCGTTCTTGAACAATGCAAAACCCTGTGTGAATGTAATTGAAAAATATGAATTATTGTTGTTTAGTAGGCGGATTTGTTCTTGAAGTGCGCTGTGATGGCTAAGATGAAAATAATTGAACAATTAGTGCAGGAGATAATAAGCTTTGTTTGTATTGGGAATCAATTGGGAGTAATCTTCCATTTATAAACTTATCTATAATTATTTAAATTCCATTTTAAATGGATGATAGAAAGGATGGTAGCTAATTGAGGATATAAGCCCGACGAGTATTCATTGAGAGAGCATGATGGATGGATTCCAAAATGTTCCTTTAGCACAATGTCCTCTACTGCATGAAGTGCTTCTGCAGCGTTCTGCTTGTGTAGTTCTACTTTTCCTTTGGGGATCTCATTCAATTATGCAAGAGCACTTGAGCTTAATATTTAGCTTGCAAGTACCCAAAGGGATGCCGAGCCTCCGGGATAAAGATTTAATGACTTAAATCCTTGTTGTCACTCTGAGCTGGTCTTGTAGTAGGTGAAGTCTTCTCAATTAGTGGAGATGGACCATGAGTAGAATGCAAGGATTGGGTGGGCCCCGATACCTTGAAAAAAAAAAAAAAAAAAAAATTTACATATATATATATATATATATAGATAGATAGATATATGTTGAGCTTGCAGGTAAAACACAACAAATTATGAGTAACATAACCACGGTTGGGCAAAAAGTATTTCCTATAACTATGCACCCACTAGCCTGTCTCCTACACCTATCAAATACTTGAATTATTTCATTTTGAAAATGCAGCAAGAATGCTAGAAATATGTCACTTGCATTTATAACTCTAAAAATGATACGCATTATACATGGGATAAAGGAAGACTTAGTTTTGCTTCTTCTTGACACAATCCTGAATGGTTACCCAAGTATTGTTACTTAAAATGACTCAATATTTGCATGTTAATAATCCACCCAATGCCATTTCAGGAATATGTCAATTAGTGCAAGCTGTGAAGGGAGACATTGATGTTTTACTCAACGGAGTAGGAGACAAAGGTGTTATTGTAGAAGTGAAACATATTCTTGTGATGGTATTGAATCCTTAAAACCTTGTACTAACATGTTAGTAAAAGAGCCTTAAGAATCTTGTTACACTCTATATTTGCTTATTCCTCATCCATGCTGTGAAGACTGAACAAGCATGGATCTACAAATTGCTAAAGAATTAAACGGAAAAAGCAGTCTTTTCCCCCTTCTAATTTGACGAAATGTCGATAGAAAATAGCAATCCAATAACCCTTTTGTCTCTTGGTTGTGAGGGATGATTATGGTCTTCACTCTTTTGGTTACGTTTAGGCCAAACGTTCATTATCAAGACGAGAAGTTCTTCATACGAACTTTCTCACCCCACCTGTAGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTGAAAGCAATAGCTCAGGGAGGATACCCGCAGGTGAAATATAAAAGTTTTTCTTTGTTTCTTCAGTTGGGGCTTGAGTATATGAATTTAAGATCTATTGCTGATACCACACTTCTTTCAAGTTGACGAAATGTTTAGGCAGAGCGTTGCCGGATTTCTGTTGGACATGCAGATGAACTAACAAGCGATCCAGATGTAATTTCGGCATTGAGGTATCATTCTTTAATCTTCCACATTAGTCCATCTTATTTGAGCTCTGGTAACATCCATTGTTCTTTCGTTCTTTCTTCTTTTTATTTTATTTTTTATTTTTTAATTTTCCCTTCCCAGTATCGCAGGAAATTTTACGTTTCACCCTTGCTCACATGGGGACTTCCTTGGAGC
AATTCTTGGTACAGGCATTGCGAGGGAAAAGCTTGGTGATATTATACTCCAGGTATCATTGTATTCAGTATTTACTTTGTTTACATTTGGAAGTATAATTACAACTGTTTTGTTCTTCAAATAATTGTTTGAATTTGTGACTCGTCAAGTATTATAAATGCATTTCACTTTGGCTATCGCTGCATTGGTCTTATACTGAGAAAGTATAATTCTCTTTAACAAATCCCAAAAACTGACGGTAAAAATTGTGTTTTGGGCTATCATGCTCTTGCCAACTGACTGTCTCCAAAAGTTTGATCAGAAATGCTACTTTTTAAAATTTATTTTTCAATCCTTCCATGTGGAACAAAACTTTATGAAAAAGATTCAATTTAGATAAACGCTAGAACGTCTAATTCATGCCTTGTCTATATCCAGAAGTAGCTTTGTGTAAATTTTTTAGGTTTTTGATCAAAATCTGTGACAAGGCTATCATGGGCTGGGTTTCTTTTATTCTCTTATTGTTACCTTATTTTCAGGAGAAAAGGGAGCTCAAGTAGTCATTGATCAGAACTTGTCGACTTCCTTGTATTGTCACTGCGCAAGGTAGTGTCATTGTCTCGGGTGATTGCTTCTAACACGATGATATGGAAATTAACACAAGAATGAAAATATGTTTTACCTCTGAGTGCATTTCTTGTGTGTATTAAACATTTAAAGACATATTTTGGTGAAGCCTACGTGGCCTAATAGCTATTCAAACTATTTGTATATTTGCTTCATTGATCTCTTTATTTGTTACGGCATTCTTAATTTCAGGTTGGTAATGTCACAGTTTCTTGTACGAGGATTCCATTGACAGCTCTTGATTACGAACCACCGAAGTATGAAGTCATATTGGTAATATTTCATATTGGTTTCCATGTTATTTTAATGTAGTATTTTTATCTCCTTTGCAGGACTAAGACATTTAAAACCATTGAGGCATCTCTTAGGGTGGATGCTCTAGCGAGTGCTGGATTCAAATTTCACGGTCTAAACTAGTGGATTTAATTAGGTACAAGGCTTCTACCAACCAATTGATCTTTCACATTGTGATTATTCTAGTATGGAATGAGTCGATCATTTACGCATTGCGATTATTTCTAGTATGAAATGAGTCAATCATCTACTTTTGTGATTATTTCTAGTATGAAATGAGTCAATCATCTACTTTTGTATCTTACTGAACAAAAATGGTGATATTGTTCGTTCATAGATAAAGAATTTTGATTTTTGGGAAAGTATCATTATTATCCAATCAGAAGGGTTTCAATTTTTTTCAAAGTAACGATTTGAAGAACTATTTTTAAGTTCCATTCCGGATAAGTGAAAAAATAATTTATAACAAAGTTTATGTGTTGTTGATTCCTACGTGCAGTGCTGTTTCCCCTCTATTACTTATTTTTACTAGTGGAGCAAAGTTGACTTAATCCATAAGTTACACCTTTACTCAACACTAAAATTGACAATTACAGCATAAGAGGTTACATTAATCGGGATACACAACAAACTTTTATGATTTGCTTTTATCCTTTGTTGATTTACTGTTTGACCTTAATTAATCTGAAATTTTAGGCATCCTTTTGAAGTATTTGTAGGTAGTATGCCATCATGTGGCAGTGAATGATAGAGAACTTTCTCAAAGGTTTTGGGGAATAGGGTGTGACAATGTACTGTCTTCTATTTCTTTTGTGAAAGTACATGAACCAGAAATTTTTATATGAGCAGCAACGGCGATGTTCGTGTCAATTGGTCGCCTGTTACAAAAAATGGAACCACACTGAAGACTGGTGATATTGTTTCTGTTAGTGGGAAAGGCAGACTAAAGGTGAGCTTTTCAATTCAGTAGAAACCAAAACTAGTTGGACTGAGTTCCTATATTTTCTAGCTTATCATTCCCGACTTTTGGTTATTTTCCCGATATTCAGATTGGAGAAATAAATTCTACGAAAAAGGGAAAATTTTCTGTTGAGCTTATC
AGGTACGTGTAAGCTCGGCGTTAATGGATCGTTTGGAGTTGACCTGAGTAGATATCAGTAGTAGAGGCTCCCTCAGGGGTGAACATTATGATTGCTTCCTTAGGAAGAAGGAAAGCTAGTCATTGTTCTATATGACATGAATCTAATTGTACATGTTAAAATCTGAACATCAATCCATCAATTCTTAAAATTTTCATTTGGAGAATCTGTCTCGTAACATTTCTATGCCAGGAAATATTGGTTTTTTTTTTTTTAAATTTTTGGGATATGTAATTTTCTTTCCTGGAATGAATTTAAAAGCAAACTAATGTTGTCAAAGGAGGGAATGTAAATACAAAGCTATAGCTTCACATTAACTTTTTGTAAATTCAATGATATTTGACTTATTTCTATTTCATAATATGTAGTTATCAATGTGTACTACTAGCCTGTTGTTTTAGTTGGTGTACAAGTGTCAATTTGTTATGTGATTAGCTGTTAGTTGACTGATCAAGTATGTAAGTCAGTTTTTTGTTCAATAAGATTGACTCCATTTTTCCCATATTTCACGTGCTCACAGTTAACATGATATGAATGCAGGAGCAGGAGGTCAGTATTTAATATGAAATGCTCACTGTTAATACAAGTGTCAATTTGTTATGTGGTTAGCAGTTAGTTGACTAATCGAGTATTTAAGTCAATGTGTTCTTCAAGATTGATTCTATTTTTTCCATATTTCACATGCTCATTGTTAACATAGTAAGAGATCTTAGGTTTAAACTCCTGCAGTATTGTTTTTTTCCCAAATAATATTAAATTATTAATATCATTTATTGGTTCTTGTGCTAATTTTCACCCATAAGTGAGAGATTGTTAAGAATATATATATAAAAAATCAAATATATCATAACTTAATAGATTAAGCTTTTGAATTTATTGGTGATCTAACCGTAGACAAAGAATTCTTACAAATTGAAGGAAAAACTAAGGAGCAAAACTGAACTTTGATGCTAAATGGACATAGAATATATGTGATTTGCTCTACTTACAATACTTCAAAAGTGAAAAACTGGCATTTTGAACTGATTATATTACATAGGCTTGACCAGGACAAGATTAAGTACAGCTATATGACACAAACAAAAAAACTATTTCTTCAGTTTAAACAACAATGACAGAGAGAGGGAGAAAACAGGGGAAACTTACTGAATGGAACAGTCTCTGGGTTAAGTTGTCAAACTATGATGTTATTGATAATTTTCATGTCCTTTGAATGTATTACAAAAAGTAGTTTCTCATTTCAGATTTCTGAGAAGGAATTGATCGTCTGGAACCCATGATTATCAGGCAGTGGACCATTTCCTGGTCGGATTGAGATTGCCACCTCCAACCCTGAAACATCGATTTACATTCGAAAAAGAATTATGTAATTAGTAAATTTGCATTATTTCGGTCCAGTCAAATTTCCCATTACTAAAAATTTATGCAAAGTTGACGTCAAAGACCAACTTAACGATTTTTTGGAACTTTTTAAATAGAAGTGTCATTAGGGATCGAGATGAAATCTAACCTTAACGTCAGTGTATCAAGAAGAAAATTTTCACAAGAACTTTTCAGGCAATGCAATTCCTATGGTGCCATGAATCTACTTTTCCCCATAAACTAACAAAAAACAATAGCTAAAAGTAGAAACTCATTACCTGCAGCTTTTGCTGCTACAGCTTCCTGATAGACATCTGTGATAAACAATATTTCTGATGGCTTATCAACTCCAACAGACTCTGAAATCTCAACATAACTGCTTGTTTCTCTTTTGTTCCTGGGATACAAGAGCAGTTCCACCATATGAATATGTGACATTGAGAATGATTAAGGCATGGAAAAGAAACAGCCATAGATTACCCCACAGCAGTGTCAAAGAATCCTGATAAATTTTTTCTTAATCACCATAATTAGTCTTTCCAAATATAAGCCTTTGTGCCAATCTACTACCGCTTGAATATATGTAGACCTATTAATGC
CAGAAACGATAAAAGTTATTAATGGTATAATTAAGCATCGGAAGAAAATTTGTTCAAATATCATCAAGCTTCTAATAAAAAGGGAAATCAAGCACAGAACATCTATTATCTAATGTCAGTTTCAGCTAATGACTACTTGAGTTTATCTTCCTAAATAGATTCCATTGTGAAAAAAAACTGGGAATACTTCTGCAAAATTTTTGCTTCTAAGCGACTTAATTAATCTTTCATTACAAGAGGGGCGTTAAAGTAGCAATGCAATTTTGTAGGACTCAATGAAAGGACATCAATTCAAATCCTAGTTTTTCGAATGGAGAGATCAAGAAAGATGATTTCGATTCCATTGGCAATTAGGATTTGGAATGAATATCACACATTGTTCCGTGGCTTTTTGCCACCTAGGGTTGTAGTATATCCGAGGGTTGGATTGTTCACCACTAACATAGTACGTGAGTTGGGTTCAGAATGTCATGAGACAGTTCCATCCATAACTCCATGCCGGTGTGGACATTTGGGCATTAGAGTATTGAGAGGACCTTTCTCCGGTTAGTCTCAAATAGCAACTTGTGAGTGTATATGTGATGTTTGCTTTTTTTTTTTTTTTTATCCTTCTTCTCATCACGTGACTGGGATTTAATGAAAATCAAACGTGGGGGAGGGGGGGATGACCTTAATGCATAAAGCATTCCACCTCTCAAGAGCCTCTGGCACATCATCAAAAACTACTCCCTCCAATTCATTGCTTAAAAATCCAGTTCTCCATATGTGACCCTGCATAGTTAGAAGTTAGCTCTCTCAAAAGATATGCTAAATGCATAGCAGTAAGATGAATTCAGAAAATCATTTTACTTGTATACCTGCAACTGTTTCAAGGCAGTTATCTTGCGGTCAGCTTTTATCATTGCTTCAACATTAGCAACCAATGCTGCAATGACCTCCCCCTTCCCAGCATTATCAGGAGGAATAGGCACAGCTCCAGGAACACCATTTTCCAAGTCATCCTCAACCTGAACACCTTTTTATTGATTATTACAAAGAAGGCCCTTATGCCAAAAAGTAAATGTATTTTCTGTATGACCAAATTGGTTTGACTATGAGATGCAAATCAATGTAGGATGCTCGCCAACAATGATGTAGTAATTTATTGGCAAATTTAGGGTATGACTGCTCTCTTTTCTGTTTAAGAAAGTGATGGGGTTGATTGTAAAGGCTGAGGTTAATGCGTACTAGAACAACATAAGTTGTTATTAAAAATAACCACACATGAAAACAAGCCAAGGAAAAACAATTTGTAATAACTTTTAAATACAGTTATATAAACATATTTTGCTAAATCTTTATACTGAAAAACAAAACAGTTTTGGAAAAACAACTCCCAAATAGCCTCTCAACTACCAAATTGAATTTGCTGCTAGGAGGAGCTTTTTGTATTTCACAGATTAACTCCTAGACTCGCAAGCTATTTTCAGTTCAACTAAGACCAATCTTCTAGGGTGGTCCTAGTGGTAAGACCTCGAGATGTTTAGACATATAGTCTAAAGTCCTGAGTTGAAGCTCCCAGGGAGAAAACTTAATACTTTCAATTCTAGTTGTCTCTTGGGGTAGGCCTTTGGGCTAAGTATCTCAGAGATTGATTTCTCAAGGATCTAAAACCTCAGCCCTTTTTGGGGCCATGTGGGGGTGAGCCATGAACATAATACAAGGAGTGGGATCACTTGATACCTCAATTATAAAGTAAAAGTCTAACTAATACCAACAATCCCAATTAAATATTATGAATTGAACACAAACATTCTATAGATAATAACAAAATCAAAATACTGTTCAACCAAAATATCTTACTTGTGAACGTAACAACTTAATGTCATCTTGAGTTTCTGCAGTGTCGTATGTTAAAGTCAAATGCTTTCCAACATTATCACGAGCATATGGAAAAAGTACATCAGCAACAAAAGATATTGGAGTAGTAGTTCCTTCGATATCAAGAACTATGCAACGCT
GCATCCAAAAATATAAATGAAAAATGAACTGCAATCTAAAAGACGACCGATTGAACTCAAACCATAAAGAAAAACCTCCTAACCGAGATACAAAAACGGCATACCTCAAAGGAAAAAGAAAAGAATGAAAAATTAAAGGCTGGAATCTTAAGCTTTCCAATTAAGTAGAATGCAATTTAACTCACTGGCAATGGTTCGTTTTTATTATCTAAACCTTGTGCCCCGGCCTTCACAGAAATGTTGATGCCAGCGTTATCTCCTGAAATTCCTTTGATGCTCCAAATCGGACCCTGAGTTGGAGAAGACTAATCAATGCCTAGTTGACGAAGCTTAATAGCAGCATCAATGAGATAGTGATAACACTCAGCCTGTAAGATTAGGTTATTGATTAAACTGAAACCAAATGAAAGATCAAGACAGACCCAGAATCAATTTCTTGAAATTGTATAAAATAAGAGGTTTCATCATCTTCAAAGGAACTTCTTATCAAAAGTATCTTATCAAATAAAAGGGGCTGATAAGAAAAAGTACATATAGTGCCGAAGCAATCTAGAGATTTAAGGCACAAAATCACAAGGCTAACCTTCCCTTTGCTAAAAATAATGACATAACTATAGTCTATAGTTATTATTGTTCTGTTTAATAAGAAACACTGTATTGAAATCATAAAATGTACAACAGAAGGAAGAAGGATTAGATTTCCACCCCAAGCTAAGAGCCACATAGACAAGAAAACGCTCTCCAGATACATGAAATCAAGAAAAAGTTATAAAATTCTTTGGTAACATGCAGCCAATCTGAAATATTAATGCTCATCTAATTCCAGCTTTTTAGGTTATGTCTACATAGCTGTAGTATGCAAGAACTATTTCCATTCAATCCTTCTCAGCTAGTTGATATTTAGTAATATAATATAGGCCACTTAATCCAACAAGAGAAGTACAAAAATTGATTTGTTTTTGAATGTTTAGCTGAAAAGTCTTCTACTCGGCAGAAGTTACACAGCATTGTTCAGGATTTCATGCTTAACATGAACGAAGGAGCAGATAGGACTTGTCAGCCTCATCAAAGGGCCCAAGGCCCAAAGGCCGGCCCAAGCCCAGCCTGGGCCCGATCCAAGCCCGATCCAAAGTCAAAGTTTGACCGGCCCGGCCCATCCATCAATTGGATTGGATATGGATCACCCTTTTGGAGGTTGGGCCGGCCCACGGGCCAGCCCGTTTGGGCCATGTTCGAGCCATATTTAGATCGGCCCGTCGGGCTGGCCCAATGGACTTCTAATTGAGCCATATATATATATATATATATATATAATAGTTTATATATTATATTTTATATTTTAAAAACTAAATTTTTTAATTTTTTTTAAAAAATATGTTTTTTTACTTTACATATTATAATTTAATAAATAGAAGTATAAATAATTAATAAAAAGTTTATTTAAAAATAAAAGAACAAATAAAAATTCATAATATTGATAATTTTATCAAGAAAGAGTATTTAAAATTATCAAAAAAATTCAAAATAAAATTAAATAGAAAATATCAATAATATAAATTTTAATAAAATTTAATTCAAAATAATATTAAAATATTTTTTAAAAAATTAATTAAAAAAGTAATTATCTATTGGACCAATCCAATGAGCCGGCCCAAGCTCGATCCAAACTTTGATCCATTGAACTAAGCCCATGGGCTTGAGAATAGAGGTTGGGCCGGCCCGACCCAAGCCCAAATAGTGTATTAAGCCCGCTGGGCTTGGGCCTGGGCCTTTACTTCTTTGGGCTGGGCCGCCCAGCCCGGCCCAATGATGAGGCCAAGATAGGAGTCTAACATTTACTTCTGATTAAAATGTAGAATTTTTCAAATTATAGATGCCACTAGATGAAACCTTACCCGGAATTCATTCGGATGATGGTTCATAATTGTTACAAGACAGGATGATTCCATACCATGACTATGGATAACTGCTCCAGCATTGTGCATCTCATATGCCTGGAA
AACAGGAAACAAATTTTGGTTGTTATGGATGGATTTGGACCAAACAAGAACGAGAAAGGCTGGTCCATCATGCAGAGATCACTAAAGTAAATGTCTAGATTCTTTTGCTAATATTACCACTAGTATTTGAAACTAGGTCATTCTCCATTCATTGCAAGTCGGTTATTCATCAGCATGGGTTGAGCTTCCAATGTTTTTGAAAGAACTCATTCAAATGAGTATTTTACGGAAGAAGTACTATTAACGGTACCTTCATAAAAAGAGGTCCACAGTCAGAACACTTTGGAGGCTTGTGTGGATATGGCTTTGGAGGTGGTGACGATACGATAGACCCACCATGAGATAACACATGCGTATCCTCCAGTACCATTCTCTCCTTCTGAACACCTAATTGCGGCAAAGAATATTAATCTAAATCCATGAATTCCCAGCAATCGACTTTTGAGAAGAAGATAAGAAAATAGAGAAACAAAGGAACAAGATCTCTATACTCTTATGCAAGGAGTTGAAATACACAATGAAAAATACACACTCTTTAATCCAGTTCTTGATGGCTTGATTAACAAACGAATACCCAGAAGCCAGGAAATAGAGAAGACTTGGGATAAGTTCGATATTCTTTTTGGAAGAGTGAAAAATGCTTTTATTTTGAAAAGTATTTTAGATAATAAAAAGACTTTAAGTTTTTTTAAGGTATTTAATTACAAAGTCAAAAGAGTGCTAATTAGAAATTAGTGTTTTTCTCGAGATTGTTTGTAGAAAAACACTGCTCTTAAAAAGTTCTTCTTTGGGAAATAAAGACAACTGAGAGCAGAGTTGACGAAGAAAGATTTACCTTAATAACAAAAATTCAGCACCAAAAGCCAGGAAAATTGGGGGAATCTTGAAAATCACTTTCCCACGAGACAAACAAAGTCAATTAGCTCGATTTGATGCAATGCAAGTAATAAAATGCCCAAAAATTGTACAACACTTGATCTTAGACAAAAGGCCGAGAAAGGTATAAATGCCCAAAATTGTAAGAAATTAAACAAGAAACAAAAAAATTGAGAGAGAGAGAGATGCCCACCAGAAGGTGACATAACGATTAGCTGTTGTCTCTTGGGAATGGAGTCGTGGTGAACCTTGATGGTAATGCTACCGCCGGTGCCGGAGACCCATCGGTGGCCGTAGAATTGGCGGCGGAGCTCCGACAACAGAGACCTGGTGTCGTTCGCGGCCTTGCTTTCTAA

mRNA sequence

ATGGCCGCTGCGAGCATTGGAGCTCTGTGGAGTCTAAGAAGGGCAGCTCAAGCGTCGAGTTTCCGCACTCCCCCCGCCGTTAACCTCAATAAGCTCTCCTTCCATGAGGCCCGCTTTCCCTCTTCTCTGTATTCTCCTCTGGGGTCCTCAGGTTTTGATTCGCTTTTCCTCCATGGAATATGTCAATTAGTGCAAGCTGTGAAGGGAGACATTGATGTTTTACTCAACGGAGTAGGAGACAAAGGTGTTATTGTAGAAGTGAAACATATTCTTGTGATGGCCAAACGTTCATTATCAAGACGAGAAGTTCTTCATACGAACTTTCTCACCCCACCTGTAGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTGAAAGCAATAGCTCAGGGAGGATACCCGCAGGCAGAGCGTTGCCGGATTTCTGTTGGACATGCAGATGAACTAACAAGCGATCCAGATGTAATTTCGGCATTGAGTATCGCAGGAAATTTTACGTTTCACCCTTGCTCACATGGGGACTTCCTTGGAGCAATTCTTGGTACAGGCATTGCGAGGGAAAAGCTTGGTGATATTATACTCCAGGTTGGTAATGTCACAGTTTCTTGTACGAGGATTCCATTGACAGCTCTTGATTACGAACCACCGAAGTATGAAGTCATATTGGTAGTATGCCATCATGTGGCAGTGAATGATAGAGAACTTTCTCAAAGGTTTTGGGGAATAGGCAACGGCGATGTTCGTGTCAATTGGTCGCCTGTTACAAAAAATGGAACCACACTGAAGACTGGTGATATTGTTTCTGTTAGTGGGAAAGGCAGACTAAAGCTGTTGTCTCTTGGGAATGGAGTCGTGGTGAACCTTGATGGTAATGCTACCGCCGGTGCCGGAGACCCATCGGTGGCCGTAGAATTGGCGGCGGAGCTCCGACAACAGAGACCTGGTGTCGTTCGCGGCCTTGCTTTCTAA

Coding sequence (CDS)

ATGGCCGCTGCGAGCATTGGAGCTCTGTGGAGTCTAAGAAGGGCAGCTCAAGCGTCGAGTTTCCGCACTCCCCCCGCCGTTAACCTCAATAAGCTCTCCTTCCATGAGGCCCGCTTTCCCTCTTCTCTGTATTCTCCTCTGGGGTCCTCAGGTTTTGATTCGCTTTTCCTCCATGGAATATGTCAATTAGTGCAAGCTGTGAAGGGAGACATTGATGTTTTACTCAACGGAGTAGGAGACAAAGGTGTTATTGTAGAAGTGAAACATATTCTTGTGATGGCCAAACGTTCATTATCAAGACGAGAAGTTCTTCATACGAACTTTCTCACCCCACCTGTAGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTGAAAGCAATAGCTCAGGGAGGATACCCGCAGGCAGAGCGTTGCCGGATTTCTGTTGGACATGCAGATGAACTAACAAGCGATCCAGATGTAATTTCGGCATTGAGTATCGCAGGAAATTTTACGTTTCACCCTTGCTCACATGGGGACTTCCTTGGAGCAATTCTTGGTACAGGCATTGCGAGGGAAAAGCTTGGTGATATTATACTCCAGGTTGGTAATGTCACAGTTTCTTGTACGAGGATTCCATTGACAGCTCTTGATTACGAACCACCGAAGTATGAAGTCATATTGGTAGTATGCCATCATGTGGCAGTGAATGATAGAGAACTTTCTCAAAGGTTTTGGGGAATAGGCAACGGCGATGTTCGTGTCAATTGGTCGCCTGTTACAAAAAATGGAACCACACTGAAGACTGGTGATATTGTTTCTGTTAGTGGGAAAGGCAGACTAAAGCTGTTGTCTCTTGGGAATGGAGTCGTGGTGAACCTTGATGGTAATGCTACCGCCGGTGCCGGAGACCCATCGGTGGCCGTAGAATTGGCGGCGGAGCTCCGACAACAGAGACCTGGTGTCGTTCGCGGCCTTGCTTTCTAA

Protein sequence

MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGICQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAIQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQVGNVTVSCTRIPLTALDYEPPKYEVILVVCHHVAVNDRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSVSGKGRLKLLSLGNGVVVNLDGNATAGAGDPSVAVELAAELRQQRPGVVRGLAF
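
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following Python sketch illustrates this on the first 21 and last 9 bases of the CDS above, assuming the standard nuclear genetic code (the page does not name the translation table used). `translate` is an illustrative helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCCGCTGCGAGCATTGGA"  # first 21 bases of the CDS above
cds_end = "GCTTTCTAA"                # last 9 bases of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven residues (MAAASIG) and the last two residues (AF) of the protein sequence, with the final TAA read as the stop codon.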
Homology
BLAST of Sgr012579 vs. NCBI nr
Match: XP_038899605.1 (putative RNA-binding protein YlmH [Benincasa hispida])

HSP 1 Score: 413.3 bits (1061), Expect = 1.9e-111
Identity = 226/309 (73.14%), Positives = 243/309 (78.64%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MAA SIGA WSLRRAAQ+SSFR+P AVNLNKLSFHEARFPSS  SPLGSS        GI
Sbjct: 1   MAATSIGARWSLRRAAQSSSFRSPLAVNLNKLSFHEARFPSSPSSPLGSS--------GI 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           CQLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRR VLHTNFLTPPVVKES+LA
Sbjct: 61  CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRRVVLHTNFLTPPVVKESILA 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           +QKLADVKAIAQGGYP+AERCRISVGHADEL SDPD++SALSI GNF FHPCSHGDFLGA
Sbjct: 121 LQKLADVKAIAQGGYPEAERCRISVGHADELLSDPDIVSALSITGNFMFHPCSHGDFLGA 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGIAREKLGDIILQ                       VGNVTVSCTRIPLTALDYEP
Sbjct: 181 ILGTGIAREKLGDIILQEEKGAQVVVVPELVDFLASSLRKVGNVTVSCTRIPLTALDYEP 240

Query: 241 PKYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVS 280
           PK +    +   + V+           S+    I +GDVRVNW+P+TKNGT LKTGDIVS
Sbjct: 241 PKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTPITKNGTILKTGDIVS 300

BLAST of Sgr012579 vs. NCBI nr
Match: XP_022142745.1 (uncharacterized protein LOC111012786 isoform X2 [Momordica charantia])

HSP 1 Score: 409.1 bits (1050), Expect = 3.6e-110
Identity = 227/308 (73.70%), Positives = 241/308 (78.25%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MAA SIGALWSLRRAAQASSFR P A+N   LSFH+ARFP S  SP GSS        GI
Sbjct: 1   MAATSIGALWSLRRAAQASSFRIPLALN---LSFHQARFPPSPSSPPGSS--------GI 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           CQLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA
Sbjct: 61  CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           IQKLADVKAIAQGGYP+AERCRISVGHADELTSDPD+ISALSI GNFTFHPCSHGDFLG+
Sbjct: 121 IQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIISALSITGNFTFHPCSHGDFLGS 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGIAREKLGDIILQ                       VGNVTVSCTRIPLTALDYEP
Sbjct: 181 ILGTGIAREKLGDIILQGENGAQVVLVPELVDFLISSLRKVGNVTVSCTRIPLTALDYEP 240

Query: 241 PK------YEVILVVCHHVAVNDRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           PK       E  L V    +   +    +   + +GDVRVNW+PVTKNGTTLKTGD+VSV
Sbjct: 241 PKTKTFNTIEASLRVDAIASAGFKISRSKLVDLISGDVRVNWTPVTKNGTTLKTGDVVSV 297

BLAST of Sgr012579 vs. NCBI nr
Match: XP_022142743.1 (uncharacterized protein LOC111012786 isoform X1 [Momordica charantia])

HSP 1 Score: 408.7 bits (1049), Expect = 4.7e-110
Identity = 227/309 (73.46%), Positives = 242/309 (78.32%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MAA SIGALWSLRRAAQASSFR P A+N   LSFH+ARFP S  SP GSS        GI
Sbjct: 1   MAATSIGALWSLRRAAQASSFRIPLALN---LSFHQARFPPSPSSPPGSS--------GI 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           CQLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA
Sbjct: 61  CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           IQKLADVKAIAQGGYP+AERCRISVGHADELTSDPD+ISALSI GNFTFHPCSHGDFLG+
Sbjct: 121 IQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIISALSITGNFTFHPCSHGDFLGS 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGIAREKLGDIILQ                       VGNVTVSCTRIPLTALDYEP
Sbjct: 181 ILGTGIAREKLGDIILQGENGAQVVLVPELVDFLISSLRKVGNVTVSCTRIPLTALDYEP 240

Query: 241 PKYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVS 280
           PK +    +   + V+           S+    I +GDVRVNW+PVTKNGTTLKTGD+VS
Sbjct: 241 PKTKTFNTIEASLRVDAIASAGFKISRSKLVDLISSGDVRVNWTPVTKNGTTLKTGDVVS 298

BLAST of Sgr012579 vs. NCBI nr
Match: KAG6600381.1 (hypothetical protein SDJN03_05614, partial [Cucurbita argyrosperma subsp. sororia] >KAG7031043.1 ylmH [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 407.5 bits (1046), Expect = 1.1e-109
Identity = 223/308 (72.40%), Positives = 240/308 (77.92%), Query Frame = 0

Query: 2   AAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGIC 61
           A ASIGALWSLRRAAQ+SSFRTP A+NL K+SFHEARFPSS  SPLG+S        G+C
Sbjct: 4   ATASIGALWSLRRAAQSSSFRTPLAINLGKVSFHEARFPSSPSSPLGAS--------GMC 63

Query: 62  QLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 121
           QLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI
Sbjct: 64  QLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 123

Query: 122 QKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAI 181
           QKLADVKAIAQGGYP+AERCRISVGHADELTSDPDV+SALSI GNF FH C+HGDFLGAI
Sbjct: 124 QKLADVKAIAQGGYPEAERCRISVGHADELTSDPDVVSALSITGNFVFHTCTHGDFLGAI 183

Query: 182 LGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEPP 241
           LGTGIAREKLGDIILQ                       VGNVTVSCTRIPLT LDYEPP
Sbjct: 184 LGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTDLDYEPP 243

Query: 242 KYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           K +    +   + V+           S+    I  GDVRVNW+ +TKNGT LKTGDIVSV
Sbjct: 244 KTKTFKTIEASLRVDALASAGFKISRSKLVDFISGGDVRVNWTTITKNGTILKTGDIVSV 303

BLAST of Sgr012579 vs. NCBI nr
Match: XP_022981895.1 (uncharacterized protein LOC111480897 isoform X1 [Cucurbita maxima])

HSP 1 Score: 404.4 bits (1038), Expect = 9.0e-109
Identity = 222/308 (72.08%), Positives = 241/308 (78.25%), Query Frame = 0

Query: 2   AAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGIC 61
           A ASIGALWSLRRAAQ+SS RTP AVNL+K+SFHEARFP+S  SPLGSS        G+C
Sbjct: 4   ATASIGALWSLRRAAQSSSLRTPLAVNLSKVSFHEARFPTSPSSPLGSS--------GMC 63

Query: 62  QLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 121
           QLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI
Sbjct: 64  QLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 123

Query: 122 QKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAI 181
           +KLADVKAIAQGGYP+AERCRISVGHADELTSDPDV+SALSI GNF FH C+HGDFLGAI
Sbjct: 124 KKLADVKAIAQGGYPEAERCRISVGHADELTSDPDVVSALSITGNFMFHTCTHGDFLGAI 183

Query: 182 LGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEPP 241
           LGTGIAREKLGDIILQ                       VGNVTVSCTRIPLT LDYEPP
Sbjct: 184 LGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLHKVGNVTVSCTRIPLTDLDYEPP 243

Query: 242 KYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           K +    +   + V+           S+    I +GDVRVNW+ +TKNGT LKTGDIVSV
Sbjct: 244 KTKTFKTIEASLRVDALASAGFKISRSKLVDFISSGDVRVNWTTITKNGTILKTGDIVSV 303

BLAST of Sgr012579 vs. ExPASy Swiss-Prot
Match: P71020 (Putative RNA-binding protein YlmH OS=Bacillus subtilis (strain 168) OX=224308 GN=ylmH PE=4 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 3.8e-09
Identity = 57/212 (26.89%), Positives = 91/212 (42.92%), Query Frame = 0

Query: 106 TNFLTPPVVKESML--AIQKLADVKAIAQGGYPQAERCRISVGHADELTSDPD--VISAL 165
           T+FL P   +E ++  A+   ADV     GGY +AER R ++   + +T +     + A 
Sbjct: 35  TDFLDP---REQVILSAVTGQADVGLAFSGGYDRAERKR-AILFPEYITPEESDFELQAF 94

Query: 166 SIAGNFTFHPCSHGDFLGAILGTGIAREKLGDIIL----------------------QVG 225
           ++     F    H   LGA++G G+ R+K GDI+                       Q G
Sbjct: 95  NVRYADKFVSVDHRSLLGALMGIGLKRQKFGDIVFSETAVQLIVSADTADFVAAQLTQAG 154

Query: 226 NVTVSCTRIPLTALDYEPPKYEV---------ILVVCHHVAVNDRELSQRFWGIGNGDVR 283
              VS  +I L+ L+      E+         +  VC  ++   R+ SQ    + NG V+
Sbjct: 155 KAAVSLEKIDLSDLNIPAVDVEIRDDTVSSLRLDAVCASMSRQSRQKSQTL--VKNGLVK 214

BLAST of Sgr012579 vs. ExPASy TrEMBL
Match: A0A6J1CLT4 (uncharacterized protein LOC111012786 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111012786 PE=4 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 1.8e-110
Identity = 227/308 (73.70%), Positives = 241/308 (78.25%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MAA SIGALWSLRRAAQASSFR P A+N   LSFH+ARFP S  SP GSS        GI
Sbjct: 1   MAATSIGALWSLRRAAQASSFRIPLALN---LSFHQARFPPSPSSPPGSS--------GI 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           CQLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA
Sbjct: 61  CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           IQKLADVKAIAQGGYP+AERCRISVGHADELTSDPD+ISALSI GNFTFHPCSHGDFLG+
Sbjct: 121 IQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIISALSITGNFTFHPCSHGDFLGS 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGIAREKLGDIILQ                       VGNVTVSCTRIPLTALDYEP
Sbjct: 181 ILGTGIAREKLGDIILQGENGAQVVLVPELVDFLISSLRKVGNVTVSCTRIPLTALDYEP 240

Query: 241 PK------YEVILVVCHHVAVNDRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           PK       E  L V    +   +    +   + +GDVRVNW+PVTKNGTTLKTGD+VSV
Sbjct: 241 PKTKTFNTIEASLRVDAIASAGFKISRSKLVDLISGDVRVNWTPVTKNGTTLKTGDVVSV 297

BLAST of Sgr012579 vs. ExPASy TrEMBL
Match: A0A6J1CP18 (uncharacterized protein LOC111012786 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111012786 PE=4 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 2.3e-110
Identity = 227/309 (73.46%), Positives = 242/309 (78.32%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MAA SIGALWSLRRAAQASSFR P A+N   LSFH+ARFP S  SP GSS        GI
Sbjct: 1   MAATSIGALWSLRRAAQASSFRIPLALN---LSFHQARFPPSPSSPPGSS--------GI 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           CQLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA
Sbjct: 61  CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           IQKLADVKAIAQGGYP+AERCRISVGHADELTSDPD+ISALSI GNFTFHPCSHGDFLG+
Sbjct: 121 IQKLADVKAIAQGGYPEAERCRISVGHADELTSDPDIISALSITGNFTFHPCSHGDFLGS 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGIAREKLGDIILQ                       VGNVTVSCTRIPLTALDYEP
Sbjct: 181 ILGTGIAREKLGDIILQGENGAQVVLVPELVDFLISSLRKVGNVTVSCTRIPLTALDYEP 240

Query: 241 PKYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVS 280
           PK +    +   + V+           S+    I +GDVRVNW+PVTKNGTTLKTGD+VS
Sbjct: 241 PKTKTFNTIEASLRVDAIASAGFKISRSKLVDLISSGDVRVNWTPVTKNGTTLKTGDVVS 298

BLAST of Sgr012579 vs. ExPASy TrEMBL
Match: A0A6J1J103 (uncharacterized protein LOC111480897 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480897 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 4.3e-109
Identity = 222/308 (72.08%), Positives = 241/308 (78.25%), Query Frame = 0

Query: 2   AAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGIC 61
           A ASIGALWSLRRAAQ+SS RTP AVNL+K+SFHEARFP+S  SPLGSS        G+C
Sbjct: 4   ATASIGALWSLRRAAQSSSLRTPLAVNLSKVSFHEARFPTSPSSPLGSS--------GMC 63

Query: 62  QLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 121
           QLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI
Sbjct: 64  QLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 123

Query: 122 QKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAI 181
           +KLADVKAIAQGGYP+AERCRISVGHADELTSDPDV+SALSI GNF FH C+HGDFLGAI
Sbjct: 124 KKLADVKAIAQGGYPEAERCRISVGHADELTSDPDVVSALSITGNFMFHTCTHGDFLGAI 183

Query: 182 LGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEPP 241
           LGTGIAREKLGDIILQ                       VGNVTVSCTRIPLT LDYEPP
Sbjct: 184 LGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLHKVGNVTVSCTRIPLTDLDYEPP 243

Query: 242 KYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           K +    +   + V+           S+    I +GDVRVNW+ +TKNGT LKTGDIVSV
Sbjct: 244 KTKTFKTIEASLRVDALASAGFKISRSKLVDFISSGDVRVNWTTITKNGTILKTGDIVSV 303

BLAST of Sgr012579 vs. ExPASy TrEMBL
Match: A0A6J1FU11 (uncharacterized protein LOC111446880 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111446880 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 9.7e-109
Identity = 222/308 (72.08%), Positives = 239/308 (77.60%), Query Frame = 0

Query: 2   AAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGIC 61
           A ASIGALWSLRRAA++SSFRTP AVNL K+SFHEARFPSS  SPLG+S        G+C
Sbjct: 4   ATASIGALWSLRRAARSSSFRTPLAVNLGKVSFHEARFPSSPSSPLGAS--------GMC 63

Query: 62  QLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 121
           QLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI
Sbjct: 64  QLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 123

Query: 122 QKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAI 181
           QKLADVKAIAQGGYP+AERCRISVGHADELTSDPDV+SALSI GNF F  C+HGDFLGAI
Sbjct: 124 QKLADVKAIAQGGYPEAERCRISVGHADELTSDPDVVSALSITGNFVFQTCTHGDFLGAI 183

Query: 182 LGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEPP 241
           LGTGIAREKLGDIILQ                       VGNVTVSCTRIPLT LDYEPP
Sbjct: 184 LGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTDLDYEPP 243

Query: 242 KYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVSV 280
           K +    +   + V+           S+    I  GDVRVNW+ +TKNGT LKTGDIVSV
Sbjct: 244 KTKTFKTIEASLRVDALASAGFKISRSKLVDFISGGDVRVNWTTITKNGTILKTGDIVSV 303

BLAST of Sgr012579 vs. ExPASy TrEMBL
Match: A0A6J1FSF6 (uncharacterized protein LOC111446880 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111446880 PE=4 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 2.2e-108
Identity = 223/310 (71.94%), Positives = 239/310 (77.10%), Query Frame = 0

Query: 2   AAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGIC 61
           A ASIGALWSLRRAA++SSFRTP AVNL K+SFHEARFPSS  SPLG+S        G+C
Sbjct: 4   ATASIGALWSLRRAARSSSFRTPLAVNLGKVSFHEARFPSSPSSPLGAS--------GMC 63

Query: 62  QLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 121
           QLVQAVKGDIDVLLNGVGDKGVIV+VKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI
Sbjct: 64  QLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLAI 123

Query: 122 QKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGAI 181
           QKLADVKAIAQGGYP+AERCRISVGHADELTSDPDV+SALSI GNF F  C+HGDFLGAI
Sbjct: 124 QKLADVKAIAQGGYPEAERCRISVGHADELTSDPDVVSALSITGNFVFQTCTHGDFLGAI 183

Query: 182 LGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEPP 241
           LGTGIAREKLGDIILQ                       VGNVTVSCTRIPLT LDYEPP
Sbjct: 184 LGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTDLDYEPP 243

Query: 242 KYEVILVVCHHVAVN---------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIV 280
           K +    +   + V+          R     F   G GDVRVNW+ +TKNGT LKTGDIV
Sbjct: 244 KTKTFKTIEASLRVDALASAGFKISRSKLVDFISSG-GDVRVNWTTITKNGTILKTGDIV 303

BLAST of Sgr012579 vs. TAIR 10
Match: AT1G53120.1 (RNA-binding S4 domain-containing protein )

HSP 1 Score: 269.2 bits (687), Expect = 4.2e-72
Identity = 153/309 (49.51%), Positives = 192/309 (62.14%), Query Frame = 0

Query: 1   MAAASIGALWSLRRAAQASSFRTPPAVNLNKLSFHEARFPSSLYSPLGSSGFDSLFLHGI 60
           MA  S+   W + R A  S   +       K        P+S   PL  S          
Sbjct: 1   MAVTSLAPPWVILRLAFRSVAASSCLHTNQKTLITNLSIPTSF--PLRQSALRR------ 60

Query: 61  CQLVQAVKGDIDVLLNGVGDKGVIVEVKHILVMAKRSLSRREVLHTNFLTPPVVKESMLA 120
           C   +A+KGD+D LL GVGD+ V  EVK IL MA+R+ S+REVLHT+FLTPP+VKES+  
Sbjct: 61  CYSAEAIKGDVDFLLKGVGDQAVAKEVKQILEMARRASSKREVLHTDFLTPPIVKESVSL 120

Query: 121 IQKLADVKAIAQGGYPQAERCRISVGHADELTSDPDVISALSIAGNFTFHPCSHGDFLGA 180
           ++K ADVK +AQGGYP+AERCRIS+GH D LTSDPD+++ALSI GNF F PCSHGDFLGA
Sbjct: 121 LEKFADVKIVAQGGYPEAERCRISIGHPDVLTSDPDIVAALSITGNFGFQPCSHGDFLGA 180

Query: 181 ILGTGIAREKLGDIILQ-----------------------VGNVTVSCTRIPLTALDYEP 240
           ILGTGI+REKLGDI++Q                       VGNV V+C++IPL AL+YEP
Sbjct: 181 ILGTGISREKLGDILIQEEKGAQVLIVPELVDFVVTALDKVGNVGVTCSKIPLLALEYEP 240

Query: 241 PKYEVILVVCHHVAVN-------DRELSQRFWGIGNGDVRVNWSPVTKNGTTLKTGDIVS 280
           P+      V   + ++           S+    I + DVRVNW+ VTKNGT +KTGD+VS
Sbjct: 241 PRTNSFKTVEASLRIDAVASAGFKISRSKLVDLISSKDVRVNWATVTKNGTIVKTGDVVS 300

The following BLAST results are available for this feature:

NCBI nr:
Match Name | E-value | Identity (%) | Description
XP_038899605.1 | 1.9e-111 | 73.14 | putative RNA-binding protein YlmH [Benincasa hispida]
XP_022142745.1 | 3.6e-110 | 73.70 | uncharacterized protein LOC111012786 isoform X2 [Momordica charantia]
XP_022142743.1 | 4.7e-110 | 73.46 | uncharacterized protein LOC111012786 isoform X1 [Momordica charantia]
KAG6600381.1 | 1.1e-109 | 72.40 | hypothetical protein SDJN03_05614, partial [Cucurbita argyrosperma subsp. sorori...
XP_022981895.1 | 9.0e-109 | 72.08 | uncharacterized protein LOC111480897 isoform X1 [Cucurbita maxima]

ExPASy Swiss-Prot:
Match Name | E-value | Identity (%) | Description
P71020 | 3.8e-09 | 26.89 | Putative RNA-binding protein YlmH OS=Bacillus subtilis (strain 168) OX=224308 GN...

ExPASy TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A6J1CLT4 | 1.8e-110 | 73.70 | uncharacterized protein LOC111012786 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1CP18 | 2.3e-110 | 73.46 | uncharacterized protein LOC111012786 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1J103 | 4.3e-109 | 72.08 | uncharacterized protein LOC111480897 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FU11 | 9.7e-109 | 72.08 | uncharacterized protein LOC111446880 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FSF6 | 2.2e-108 | 71.94 | uncharacterized protein LOC111446880 isoform X1 OS=Cucurbita moschata OX=3662 GN...

TAIR 10:
Match Name | E-value | Identity (%) | Description
AT1G53120.1 | 4.2e-72 | 49.51 | RNA-binding S4 domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | GENE3D | 3.30.70.330 | - | coord: 154..217; e-value: 7.8E-12; score: 46.9
IPR040591 | YlmH, putative RNA-binding domain | PFAM | PF17774 | YlmH_RBD | coord: 159..203; e-value: 3.9E-6; score: 27.1
None | No IPR available | PANTHER | PTHR32219:SF3 | RNA-BINDING PROTEIN YLMH-RELATED | coord: 197..279; coord: 59..197
None | No IPR available | PANTHER | PTHR32219 | RNA-BINDING PROTEIN YLMH-RELATED | coord: 197..279; coord: 59..197
None | No IPR available | PROSITE | PS50889 | S4 | coord: 244..284; score: 10.635284
None | No IPR available | SUPERFAMILY | 55174 | Alpha-L RNA-binding motif | coord: 134..282
IPR002942 | RNA-binding S4 domain | CDD | cd00165 | S4 | coord: 244..288; e-value: 2.40926E-5; score: 39.9254

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr012579.1 | Sgr012579.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003723 | RNA binding