Sgr015100 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015100
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: RING-H2 finger protein ATL16-like
Location: tig00002854: 740919 .. 752214 (+)
RNA-Seq Expression: Sgr015100
Synteny: Sgr015100
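
The genomic span implied by the Location field can be worked out directly from its coordinates. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00002854: 740919 .. 752214 (+)")
length = span_length(start, end)
print(contig, strand, length)
```

Under that assumption the gene, introns included, spans 11,296 bp on the plus strand of tig00002854.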
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGGTTGCCGAGAAAATCATCGGGCAGTGAGAGTGAGGTTTCTAGGGAGGTGTGAATCGAGAAAAGATCGCTCCAAGAGAAGGACGATTCCGGTGAACCGAACGAAGATTCCGTCGAGAAGTCTGAATCGGAGAATGAAAAGTAGGAAGACTCCATTTCAGGCGCTTCTGAAATCCGAGAGGAAATGGAATTTTTCCGGTGGTTTTGGGTTTTCTGGGAGTTGTTTGTTTCGTAAGAAAGTCTGTTTTCTCTCTGTTGCTCTGCTTAAAGAATTTGAATTTCTAAACCCTGTATATAACAGCGGAAAAAAGAAGAATTTCGGTCCACAAAATAATTAAATTAAATAATCTGGGCGGGCTTATCAATATTTTAAAATTGGGGGATTAAAAACAAATAACTTTTAGAATATGATAAACATTAAAGCAAAAAAGAAAAAAAGAAAAAAAGAAAAAAAATAGCAATAGCAAGGGAACTGCGTCACGTGCGTCAAAACAGGCCACAGGCCCGGCGGCGCCCTGGGACCCATTTCGAATTTTTTTAAAAAAAAATTAAGTATCCGAATCATCAATTTAGAGGAATGATTTGTGAATTAGCAAGCTACGTGGCAGAACACGTCATCAAGATATTTTGAAGAATTCACTGCAATTACGTTATTGACTTACAGCTAGGCATGGCACGAGTGCATTAATTTAGTGGTTATAGATCGAAAAAAATTAAAATAATTAATCAAATAATTGGTCTTTTTTAATGTCAAGTTGGTTTTAAAAGAGGGAATCTAATCAACAACCTATATTAAATGTTATAATCTTAATTTAATTAGAGAGAATAAATATTTGTTATTAATTGTTTCAATTAAGATTTAATCATTAATAAATTTTTACGTAATTATATAAAAGCACATAAAGAATTCTCGACTTTGTTGATTTTATAACCCCCAAAAAAAGTGTTCAACTTTTTAAAATGAATCATGAGATCAACTAATACCCATTAAATGATTATGTTTAGACCCCATTTAATGGTTAATATATTTATATATTTTTTGTTTTTCGTTTATCATTTATAATATTTTATTTCTTGATTTATTCTTGAGAAGGACGTTTTTACAAAAGTTTGTATAATCTTTATGCATTTTTATATAAAAATTTCATAAAATTCAAAAAAAATGAAGTAGAAAAACTCACATTATATAGGCTGTTCTCCCTCATTCATGATCAAAATAGTACTTCAAAAAAGTTATGCGTGGCGGTCTCATCTAGATATATATTCACCACTTGATAGCCATCCATTCTCAAAATATATCTTTGTTAAATGCAATTGAATTGAGTATTTTCAGTCTCACAATCAAATCGACCTAATTAGCTTGGACAAAGTGATTATGTTAAGCCTAAAACATATTATTCTATTTGCAAATTTCTTCTTGAGATAAAGATCCTATTTTCATGAGCCCTAATCCTTCTCAAATTCTTCCTTGAGGGTCTTATTCTTATCGTTACTCTGTTATCAAATAACTCTTTTATTTGATAAGAAACTCTATATTATCGAGAATTTTTAGTAATTCACCTCTATGCTCTCATTCAATGCAAGTATTTACATTAAATTTTGTCGTTTTTCACTCAATTTTTTTTTTTACCATCATCATGTGATTTGATTGCTAAAGTTACTTCTATAATCATTAAAAAAAAATTAATATTAGTCGAGCCCTCCGAACAATCTTTATTCTAGATGATTAAAAAATAGATCCTTTGCTTTTGAACTGCATAGAATTACTATATTGTTAGATAAAACTCACTTTATGACTTCATTTTTGTTTATAATTTAGTCTTTGCTTTTTTTGGTTAAGAGAGAATTGTGTTATTGCATCAATAATGTGAATATCATTTATATAATTAAACATTGTTTGACTAAATTTGCTTGAATTTTAGGGAGATGATGGCCGAAAGATATAAGAACAAAATTCTTACTTACTTTCTGTAGACAGAAGAAAACTTCCCATTTA
AGGAGCTGATGAAAATTATTTCATGTTTATATCATAAGGCCTCTCAAGTGATTTTTATTAACTTCAATTCAATTTCACATATCATTACAATATGTAATCAATTAATTAATTGATTAAAAATAATTAGGAGATGCTACTCATACAATATTTTACTAGTCATATGTTTCTTTTTCTAAACTAACTTTTCATATGCAATCAGCAACTTATTTTAAGTATAAGATTTAAATTATATGTGATATTTTTAAATTAAAATGCACAAGCTATACATGAAATCACACAAGAAAAAAAAAGAATCCACCTTTCCGAACCTTTTGTAAACTAAAAAAATATATAAAATAATAAATTATTTGCACAAAATTATTTTTAGAAAAAAAGCATATTAGGGAGCCAATTTTAACCGAAGAGGTTGGCTTTTGTAACAATTTATTAATCTTTAAGGACTATATAAAGAGAAGCTCTAAAACCTACATCTCAAACTATACATTATGATCTTTTACAAATTTTAAACTTATCCAGCTAAATGCAAGCTACACAAATTCATATTTCTCTAAACGTCATATTTTTCTGCACGCGATTCAAACACATCTACTACACGGGTTAGACTTCGACATCTCATGCTTTTTAAATTCGTGAACAAAATGACTTCTAAATACATGCCAAGAAAATAGGAATAGGAAAAATAATAAAATGAAAAGTTGGGAACCAGGATAGAAACACACAAGTACAGAGAAAGAAAAGGAAGTAGAGATATTGAATGGGTTAATAATTTTAAGAGGAAACAGTGACTGTTGAGGAATTAAAAAGAAGGGAAAGAGAAGAAATCGTCTGTTGTGATATAAACAAATCAAAGTAGAAATATTGGAAAAGTCAGAATTTTTTTTTTTTTCTCTAGGAGACGTACTTGAAAGAATTCAATATTTGACAAATTTTTGTTTTTGTTTTTGTTTTTTTTTTTCTATTAGCCGTCAACCCATTTAGGACATTGAAGACATTTCAGACAGCTTATAATGAAGAGGGTTTGCATCTGGTCTGGTCTCTGTCTATTTCCTTTGCTCACTCCCAATCTACCCCTTTCATTAAGTTTTTTTGTATATTAAATCCCACTTGCATGTCCTTTCTTTATCCTTTTCACACACAACACAAACACATAAGGGTACTTTTTATATTTTACCTCACATATTTTAGAATTAAACTTTATTCAATAATAATTATTAAAATGATTCGCTAGGTAAATGAATCACGACTCTTAAAATTCTACACGTAAGATGATAATATTAAAAAAGATAAAAAAAAAAAAAATCATTACAATTGATTGTTTTCACCGAGAATGAACTATATCTTGATTATCTTATATTCATTGTGAAAGGGGTTGGAGGTTGATCTATAAGTGCAAGGAAAAAAATTACCTAACAATAGTATTGGTCAATATTCAATATCGACAAGTTTGAACCGTTGATTTCTTTTTTGATAAAAAAAAATAGTTTGATATACCTTTTTTGCTCTAATTATTGGGACAATCAAGGAGAATCCAAACCCATTTCTACTGTGTGGCAATACTTTTTGTAGATATGAATTGGTCTAAAAATTTATTAATTTTAGAAATGGAAAAAGCAAAGAGAGAAAATTGAATTTAGGGGTTGTTGAAGGTATTACAGCACTTGAAACAAATTGCAAGTAATGGGTCCAAAGAAATCTCCTTAATGTGATTAATCATTACGAATCCCCCAACCACTTGATTTTATTTTAGCTACCCCTAACTCCTTTGTTGCTTTGAATCTTTTCGCATAAATTATTGCAAGTGCCAACCTCTCTCCCCCTTCACAAATCTTTGGCTCACTAAAAAAGAAAAAAGTTAAATTACAAATTTAATTCTTTAGTTATGTTTATTTAGTCTCTAACTTTGAATAGTGGTTACACGTAATAACTAACACCATCGACTATATTTATATCGAACAAAAGTTAGTTGATTTGACATGACATAACATCAAATAGAAGCCAAG
AAATAGATAGTTATAGACAAAGTAGAGTGACACGGTCTACATTGGCTAAATTCATATTGAACAAAGGTTAGTTGATTTGATATGACATAACATTAAATGGAAGTCAAGAAGTAGATAGTTATAGACAAAGTAGAGTGACACAGTCTACATCGACTAAATTCATATCGGACAAAGGTTAATTGATTTGACATGAGATAATATTAAATGGAAGCCAAGAAGTAGATAGTTATAGACAAAGTAGAGTGACACGGTCTATAAAAGAAATTTGATATCATACACTATTTCGAAAAGAGAGGATATTACACATGCTACTTTATTTTGTCTAATTGTCTTTTTTTTTTTTGTCATCTCTACATACAATGTCGTGGGCTTCGGATAGTTTGGGTTAGATATGTACTATTTATGAGTAGTTTATTAGATGGTGTCGTCATTTAGATTTAGATGGGCGAGTAGGGTTTTTGAGTGGCAAGCCACCCAACCAAACCAAGCAACTTGCTATAGAGACTACTGATGTAATCACCTCATTCCCATTACATGGGACCACGAATAAAAATGAACCATTTTGGGGCATGTTCATGAACCCAACTTCCACCCCCACAACCTTTTCTTTTTTTCTTTTTTTCTCTTTTGGCATCTTCATCACTGGTCTCTTTATTTTTATTTTTTTAAAAAAAACCCTAATTTTATATACTATATATATGCACATTGATTTTGTTGACCATCTTGCTTCCAAATTTGGTTTCTTTGAGTTTTATAATGTTTAATTTTTGGTGAGTCCTATCCATATAGGTATATAAAGCTAGCAAGTAGCTAACTAAGATAAAAATAATCATCTCCTTCCTTTTTATTTTTTTTCCCCCCAATTATTATAACAATTTTTTTTTGTTTAAGTAGCCCTCGTGTTAAAAATCAGTGGAGATGACTTCTTAAGTTCATTGCATCTTTACATGGTTCCATGCATTCATTTTGAAACTCTTGGCTGCGAAATTTTTGTCATTTTATCATCCCTCTTTATTAATTATATGCTTTCTTCTAGATGAAAAGGCTGAGATGTGAGCCTTCAGGTTTTTTGAACCCAATTTTTGTCATTGTCTTTTATATATTTTTTCTCAGGCATTTTTTTCGAGGGACTTAAATAATAATGTTCTTCAACATGTAGCTGAAGTTTAGTTTCCTTTTGGGTCTTGGTTGGTCCTTCTAGAAGGATAGCTGGTTTTAGTATATTAGATAAAAAGGGTCCTTTTTTATTTTAAAAGTATTATGATTTAATTTATATGTATTTTAGAATTTGAGTTGGAAGTTATTGAGGAAGTAGGGTCAGATTTTGGTGTATATTGAACTTCAATTCAAACATTAAAAACCCTTTTTCTTAAGATATTAGAAAAACCCATTTACATTTATAAGGGGAAATGAAATAATAATCTTACTACATTCTCTCTCTTTCTCTCTCTCTTTCTCTTTCTGAAATCATCATTATCCCTTTGAGAAACTTCCAATAAAATTCTTTGATTTGAAAGTGATGGGTTCTGAATACATTCAAGCAATGTTTGAAGAAACCTTCCAAAGACCTGATTTTATATGGAAACATATAAAAATGGAATATAATCAGGTTTTAGAACTCTCTTTAAACCCATATTTTTCACAAGCTATTGACTTTTGCTCAACAAAGAGACCTTAATTCTTTGCTTTTTTTAAGTTGTGATGGTCAGTTTCCTTCTTTGTTAGGCAAGGGGGAACTTGGCTGAATGCATTGCATGAAACAAAATACCAAAGTGATTGAGAGAGAGAGAGAATTGTATTATATGTAACAAAAAGAGGGGGAAATATAGAAGTTCAAAAGTTGAGACTGTGATGAGGCATTGCATTGATCTGATGGACATGGGCAGCCAAAAAGGACTTAGTAGTTCTAGTTAATAAGTATGACAATTCAAGGAACTACCATTAAAAATATCCCATAGCCAAAAGCTTGGCCAAACACAAATGAAGAGGTTTTCAATATT
GGCAACCTTTTCAGTGAAAGTCAACAAGAATGAAGCAATTATCTTCCCAACCAAAAAACAAACAAACAAACAACAGAACCAAGGTTTTACATGTGATTTGTAGACTTGCAGTTTCAATGGAAAACTCTGTTTCTGATGAATTTGGACTGCAACAAAACCCAAGAACAGAAAAGGGAAAAAAAGGAAGTTGGTGCTTGGGTTTGATCTAAGCATTTTGAGTGTAATAAAGCTTGAACTGAACTTCCAAACACAGAGTCAAGTAAGATTTGAGATTATTAAAACACCATTCTTAAGCTTCCTTAGATTCACACCCAATTATTGGCTTTAACTTTTTCACTATCTATTCATTGCATTGTTCCTTTAAAATGAGCAGTATTTCTGATAAATGGAAAACAGAGTATGCTTCTAACCGTCACACTTCATAAACTCAACTGATTTTTCTTCTTTTTCCTTTTATTTGTTTGTATAAATTTGTCATGCGTATCAAAGTAAGGTTTTTGTCCAAATAACTTTGTTTTGCAATTTAAGGGCCCGGCATTGTTAAGTTTGAGGGTGTAAGTGAAGGGTTTAGAAAACTAGAAAACCTAAATTTAGAAAACATGTGTTTGGAGTACAGATTTCAAAAGCTTTAGTAAAGAAATATTGTGTGAGAAAGTGGGGTCTAAGGACTAGGTTTGGAGTACTCTTGAAACCTATGAATTTGTCAAACAAATTGTAAGCTTGACAAACCCAAAAGGCAAATATGGATTCAGCATCTTAAGCCAAATCTACCCAAATTCAACCCTCACCTCAAACACTCATAAATTTAAGAGAAAGAAATATGTCTTTTATGGTCTAATTAGCTATAAGTTAATTTTAACCATCAAATTTCAATTCCATCAATTTACAATAGCAGCTAATTACTTAAGAGTTCTTCTTTTCACTCATTTCTTATATGTAAATTTTGTATATCAAATATAGGGCCAGTTATAGTTTCAACTTTTAGATTCCATTCAAAAAATGTTGGTTTGAAACTTTAGCTACTTTTATTTTTTAAACATGTGAGCAAAAAGGAACTAGTGATTTGAATGGGTACTTGATTCAAATTTAGGGTTCATATTCTATTATTAAAATGATAAAGAAAATACCTAGAAGTGTAATAATATCAATATTCTATCTTTTTAATATAAAATAATAATAATGATATCATATTAAATGATGATGATGATGAATTTAAGTGTTAAATTATATAAAAGTTGATATTGTCTAAAAAAAATCCTCCAAATTTTAGAATATTATATGTATTATATATTAGAGATTAGAGATTACACGTGCAATATATTTTCTTTGTTTTTCTGTTCTAAGTTGTAACCTGAAGGGTTTGGTCAGAGAGCAATATAAAGCTTGACCAAAGGAAGACGCCATAGAAGAAGAAACACGTGTCATTTGCTAAGATTTTACAAGTTTAATAATATTTGTCTCATTTGGTCTGCAGAAGATATGTTTGAAATTACAGCCTCGACTTTATCAGTAGTACTGCTTTGTCATTTTTCTCAGTGGCCAAATGGTTAGGCAGATAAATATTTCTACTGGATCCTTTTGGGCTAATGGAAAGATTTTCAAGGCCCAAGTCCAAAGGATGGGCCTTGAGTGGGCTTTGGATAGGAGAAACGACACTTCAAACCTTCAAATGATGGGCCTTGATTTTCTTTTAAAAAACAAGATTTTTTAGGGATTAAATCCAATGAATAATAATATTCAAAGGGCAAAATAGAAAATAGCTACTTTGTGGTTAAGAGATGATATGAAATTGAACTTAATTTAATTGAAAAATACTTAATTTATAGCTTTTTCAAGATATTTTAATTTTTAGATGTCCCTCAATTTTTTAAGTTATGTTTATTTAGTCCCCATATTTTAAATTTGCTTTAATAAGTTTATAAACTTTCAAAGTTATATCTATCTAATCCTTCTAATTTAAAGTTTCTAATAAATTGTTCAACTTATAAATTTATGTTTGA
TAGGTTTCTATTGTTAGGTTCGTAGTTTAATATTTATGTGGCATGTTGACTGTATTATTTAAATATTATTTGACAGGTTAATTTTAAATAGATTTAGACAATTTATATCATAAATTTTAAACTTAGATATCTATTAAACATAAATTAAAAATTTAAACTTATTATAATTTTTTTAAAGTATAAGAATAAAATACAACACCGAAAATTTAGAGAATTGGTGACATTTATTGAATTGATAAAAAAAAAAGGTAACATTAAAAAAAAAAAAGGGGGGGGGGTGACAGAAGCGCACGAGCATGATAACTGGCAAAAGAAAAGAAAAAAAAAAAAAAAGTGTTCGCGATTCGGTTACCTCTTATCTTCAACACAGACAGGCACTTCAATTTTCACAGGAGCTAAAATGGCTGCTAATTAAGCTTAACGAACCGGCCCATTTGCAGGCGGAGCCACCTCGACCACCGTTCTTCAGCTCAATTTACTTTCTTTCGTTCATCCTTCCTTTTCTGAACTGCCCCCAATGAGATCAGCCGCATTTCTTAGAACCCATTGCTCGTACTTCTTGCTTCCCGTTCATGGCTTCAGCTCTTCGACGGCGAAGAACTCGATTCTTTTTGTCGCACCCTGTAAATTTAAACCCACTTTCTTGAATTTGCAAGCTAAACCGGCCCGGTTGGTTGTGCTTTGTTATCGGGATTCCGAGAAATCTTCTCGGGATGAAGAATCCGGGGGTCGTGACTCGGAAGCTATGGGGGTCGAAGATTCGAATGGGTCGTTCATGGAGGAGAATATAGAGCAAAACCAATGGAGCGTCGGAGTGGGTAGCCCTAGTGGTGGATTTCCGCCTCTGGCCAAATTGAGCTTCAGCGACAAGGCTTTTCTTCTCTTGACATTCATCGCCTTAACGGTAAATTTAACTATAAGCTTGAATTTGAAAATTAGATCTTTGGCAATTTTTGTAACGTAAAACTCTCTGTATTCGTTGTGTGTTGGACGAATTTCTCTGTATTTTCGTAGTTTGTTGACTATGCCCTGATAAATGCGGGTTATATAACCCATAATATCATTGAGATTTGAGAATTATTGGGAGACATGCGGCCTTCTTTGTTTTGTTTTGTTTTGTTTTCTCTTCTTTTCCTTCCTTTTTCCTAGTTGATTATTAGGAGTAGTTCTTAACAAATATAATAATTCTTGGTCTAACCGGTCTTTAATAGTTTCTCCTAACAACGTTGGACTATTATTAATGTTAACTAACAGATCGATTTCTGATTATAGAGATGTTATATTTTTTTAAAAAAATATATAATACAAACATATATTAAATAAACAATATTACAAACCAATGAAAGCATGTTAGCTTAGCTGATCATTTCTAGCTGGGTCTTCCCCTTGCAAATTACATATCGTTTGAGATGGAGAGGACAATCCTTAGGTGCCATGGTTGGATTTCCATTGAAGCTCAAGCAGTTCCATGTGGTTTTAGGTTCCATAGATGGATATGTCGCAGATATTAGTAATTATCTGATTGATACCCTCTATATTATCTTTTATTGCAAAATCTAGAGTTGAGCTCTGGATTGAAAAATTACTTGAGGATATTTGTTGCCATAACTCCTTATAACAGAGTTAATTATTTGATTCTGTTTTTTCTGTTATGTTGGATGTTTCTGAATTGTGCCCTTGGTAGGAGGAGCTGACCAATGCTTAATACTGGGAACATGTCATGAATACCGGCTTTCATGTATTTTTAACTAGTTATTTGGTCATTGTGCATATGTATAAACATTGATTAGTTCTAATTGGTGCAGACTTCTGTGGCATTTACAAGCCTTGTCATTGCGGCTGTGCCGACGCTTTATGTAAGTTCATACTATCTTGAGCTGAAATTAGCCTTCAGGCACATTTTTTCCCTTTTTGCTTTTATCCATCTATTTTTCTCTTTTTACAAATCCATGTATTGAGCAAGAAAGTCCATAGTTTAACAAGACAATTTGGGCATAACA
TACTACTCCACAATGTTATACAGTATGAAGCTGAGACTTAATTTTCTATAAATTTAATGTAGGCAATGCGTAGAGCTGCCCTTTCACTTTCAAAGCTAGCTGACACAGCTCGTGAAGAACTTCCTGGTACAATGGCTGCCATTAGACTTTCAGGCATGGAGATCAGTGATCTTACTCTTGAACTAAGCGATCTAAGGTGAGAATTAGATTCTCACTAGGGCCATCTCTTCCTAATGATGGTTTGATGGTTAGAAATATATTTCACATTTGTTTGAGATTGGCATTGTCTTCTGTGACAAGCTGGATTTCTTGCAATTCTTTAGGATATTTGGTTCTAGAGAAATAGGAGGATTTTTCAGGGTATTGAGAATACTTGGGAGTAAGTTTGGGATGTAGCTCACTTTAATGCTTCTCTTATCATCCATGTTCTGTAATTACCCATTGTTTTTTTATCATAATGGTTAGAGTCATATATCTTAATTTCCCTTTTGTGGTGGTTCCATTGCTTGGTTTTGACTTGTGTTCTTGTATGACCCTCTTTTGATTGTACTCTCATTCTTCTTAATAAAAGCATTTTTTAAGCAGTTCAAGAGATTTATTTCTTGAGAACTAATCTGGAAACCAAGTTGTAATGTTAAAATCTTGTTCAGGTGGTTCGTGCTCTTTACCTTTTAAATATGATCATAGCTGTTGTCACTAAAAAGAATATTCTATATGGAGGCCACTGCAGACAAAAATGTGAATACAGATCAAATATGAATAAGGGTTTTATTGATTGAAGCACTTATTTACTCGTACTTATATGGCTTTGAATCTTATTTATGTGTGTTTCTTGTATCATTGTCTCCCCTTTTCAGCAAGCACAATCGTTCCAACTTTTCTGAGTAATCCGTCTCGATTCTGTTTTGAACGCAGCCAGGAGATAGCTGATGGGGTAAACAAATCTGCTCAGGCTGTCCAAGCAGCAGAAGCTGGAATTCGACAGATCGGTGCACTTGCCCACCAACAAACTATGTGTATGTTTGACTACAAGCAATCTAAGATAACTTATTGTAAACGTTCTTATGACTAATTACCAGAGCTTAACAGCTTATATTATGTTGTCTCCATGATACAGCGATGATTGAAGAGAGAGCAAGCCTGCCCATTATTTCTTTGCAGCCTGTTGTTGCTGGAGCTGCAAAGAAGACTTCTCGCGCGGTTAGCAAAGCCACACGAACCATCATGAAAATGATCTCACGAGGAGAAAGTACAGAGAATGAGGATGACAATAGTATAGATAGACTGGAAGTTTAA

mRNA sequence

ATGTTTGGTTGCCGAGAAAATCATCGGGCAGTGAGAGTGAGGTTTCTAGGGAGGTGTGAATCGAGAAAAGATCGCTCCAAGAGAAGGACGATTCCGGTGAACCGAACGAAGATTCCGTCGAGAAGTCTGAATCGGAGAATGAAAAGTAGGAAGACTCCATTTCAGGCGCTTCTGAAATCCGAGAGGAAATGGAATTTTTCCGGTGGTTTTGGGTTTTCTGGGAGTTGTTTGTTTCGCGGAGCCACCTCGACCACCGTTCTTCAGCTCAATTTACTTTCTTTCGTTCATCCTTCCTTTTCTGAACTGCCCCCAATGAGATCAGCCGCATTTCTTAGAACCCATTGCTCGTACTTCTTGCTTCCCGTTCATGGCTTCAGCTCTTCGACGGCGAAGAACTCGATTCTTTTTGTCGCACCCTGTAAATTTAAACCCACTTTCTTGAATTTGCAAGCTAAACCGGCCCGGTTGGTTGTGCTTTGTTATCGGGATTCCGAGAAATCTTCTCGGGATGAAGAATCCGGGGGTCGTGACTCGGAAGCTATGGGGGTCGAAGATTCGAATGGGTCGTTCATGGAGGAGAATATAGAGCAAAACCAATGGAGCGTCGGAGTGGGTAGCCCTAGTGGTGGATTTCCGCCTCTGGCCAAATTGAGCTTCAGCGACAAGGCTTTTCTTCTCTTGACATTCATCGCCTTAACGACTTCTGTGGCATTTACAAGCCTTGTCATTGCGGCTGTGCCGACGCTTTATGCAATGCGTAGAGCTGCCCTTTCACTTTCAAAGCTAGCTGACACAGCTCGTGAAGAACTTCCTGGTACAATGGCTGCCATTAGACTTTCAGGCATGGAGATCAGTGATCTTACTCTTGAACTAAGCGATCTAAGCCAGGAGATAGCTGATGGGGTAAACAAATCTGCTCAGGCTGTCCAAGCAGCAGAAGCTGGAATTCGACAGATCGGTGCACTTGCCCACCAACAAACTATGTCGATGATTGAAGAGAGAGCAAGCCTGCCCATTATTTCTTTGCAGCCTGTTGTTGCTGGAGCTGCAAAGAAGACTTCTCGCGCGGTTAGCAAAGCCACACGAACCATCATGAAAATGATCTCACGAGGAGAAAGTACAGAGAATGAGGATGACAATAGTATAGATAGACTGGAAGTTTAA

Coding sequence (CDS)

ATGTTTGGTTGCCGAGAAAATCATCGGGCAGTGAGAGTGAGGTTTCTAGGGAGGTGTGAATCGAGAAAAGATCGCTCCAAGAGAAGGACGATTCCGGTGAACCGAACGAAGATTCCGTCGAGAAGTCTGAATCGGAGAATGAAAAGTAGGAAGACTCCATTTCAGGCGCTTCTGAAATCCGAGAGGAAATGGAATTTTTCCGGTGGTTTTGGGTTTTCTGGGAGTTGTTTGTTTCGCGGAGCCACCTCGACCACCGTTCTTCAGCTCAATTTACTTTCTTTCGTTCATCCTTCCTTTTCTGAACTGCCCCCAATGAGATCAGCCGCATTTCTTAGAACCCATTGCTCGTACTTCTTGCTTCCCGTTCATGGCTTCAGCTCTTCGACGGCGAAGAACTCGATTCTTTTTGTCGCACCCTGTAAATTTAAACCCACTTTCTTGAATTTGCAAGCTAAACCGGCCCGGTTGGTTGTGCTTTGTTATCGGGATTCCGAGAAATCTTCTCGGGATGAAGAATCCGGGGGTCGTGACTCGGAAGCTATGGGGGTCGAAGATTCGAATGGGTCGTTCATGGAGGAGAATATAGAGCAAAACCAATGGAGCGTCGGAGTGGGTAGCCCTAGTGGTGGATTTCCGCCTCTGGCCAAATTGAGCTTCAGCGACAAGGCTTTTCTTCTCTTGACATTCATCGCCTTAACGACTTCTGTGGCATTTACAAGCCTTGTCATTGCGGCTGTGCCGACGCTTTATGCAATGCGTAGAGCTGCCCTTTCACTTTCAAAGCTAGCTGACACAGCTCGTGAAGAACTTCCTGGTACAATGGCTGCCATTAGACTTTCAGGCATGGAGATCAGTGATCTTACTCTTGAACTAAGCGATCTAAGCCAGGAGATAGCTGATGGGGTAAACAAATCTGCTCAGGCTGTCCAAGCAGCAGAAGCTGGAATTCGACAGATCGGTGCACTTGCCCACCAACAAACTATGTCGATGATTGAAGAGAGAGCAAGCCTGCCCATTATTTCTTTGCAGCCTGTTGTTGCTGGAGCTGCAAAGAAGACTTCTCGCGCGGTTAGCAAAGCCACACGAACCATCATGAAAATGATCTCACGAGGAGAAAGTACAGAGAATGAGGATGACAATAGTATAGATAGACTGGAAGTTTAA

Protein sequence

MFGCRENHRAVRVRFLGRCESRKDRSKRRTIPVNRTKIPSRSLNRRMKSRKTPFQALLKSERKWNFSGGFGFSGSCLFRGATSTTVLQLNLLSFVHPSFSELPPMRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDSEKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAFLLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEISDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQPVVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV
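
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not say which table was used), the first and last codons of the CDS can be translated and compared with the ends of the protein. The function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGTTTGGTTGCCGAGAAAATCATCGG"
cds_end = "GATAGACTGGAAGTTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MFGCRENHR and DRLEV, matching the first nine and last five residues of the protein sequence.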
Homology
BLAST of Sgr015100 vs. NCBI nr
Match: XP_038889010.1 (uncharacterized protein LOC120078775 [Benincasa hispida])

HSP 1 Score: 414.5 bits (1064), Expect = 1.0e-111
Identity = 233/283 (82.33%), Positives = 251/283 (88.69%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA  L  H  +F LPVHGF+SST+KN ILFV+ CKFKPTF NL+AKP RLVV CYRDS
Sbjct: 1   MRSATLL--HSPFFFLPVHGFTSSTSKNPILFVSHCKFKPTFFNLRAKPDRLVVFCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS RDE+S       +GVEDSNG+ MEEN+E+NQW+V VGSP  GF  L +L+ SDKAF
Sbjct: 61  EKSVRDEQS-------VGVEDSNGALMEENVERNQWNVEVGSPGVGFRSLPRLNLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LLLTFIALTTSVAFTSLVIAAVPTL AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLLTFIALTTSVAFTSLVIAAVPTLNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMI+ERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIQERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VVAGAAKKTSRAV KATRTIMKMIS GE+ EN+DDNS+DRLEV
Sbjct: 241 VVAGAAKKTSRAVGKATRTIMKMISGGENMENDDDNSLDRLEV 274
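
The Identity and Positives percentages in an HSP header are the match counts divided by the alignment length. A small sketch using the counts from the XP_038889010.1 HSP above; the helper name is illustrative.

```python
def percent(matches, aligned_length):
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header: 233 identical and 251 positive
# columns out of 283 aligned columns.
identity = percent(233, 283)
positives = percent(251, 283)
print(identity, positives)
```

This reproduces the reported 82.33% identity and 88.69% positives.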

BLAST of Sgr015100 vs. NCBI nr
Match: XP_023533033.1 (uncharacterized protein LOC111795040 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 413.3 bits (1061), Expect = 2.3e-111
Identity = 230/283 (81.27%), Positives = 247/283 (87.28%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA FL+THC Y  LP+HGFS S  KNS + V PCKF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATFLKTHCPYVSLPLHGFSFSATKNSFVSVTPCKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDSNG+   EN EQNQWSV VGSPS GF PLAKLS +DKAF
Sbjct: 61  EKSALEQQS-------LGVEDSNGALTVENTEQNQWSVEVGSPSFGFWPLAKLSLNDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TFIALTTSVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFIALTTSVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLS+EIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP
Sbjct: 181 SDLTLELSDLSREIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. NCBI nr
Match: KAG6606380.1 (hypothetical protein SDJN03_03697, partial [Cucurbita argyrosperma subsp. sororia] >KAG7036320.1 hypothetical protein SDJN02_03123 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 411.4 bits (1056), Expect = 8.7e-111
Identity = 231/283 (81.63%), Positives = 247/283 (87.28%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA FL+THC    LP+HGFSSS  KNSI+ V P KF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATFLKTHCPCVSLPLHGFSSSPTKNSIVSVTPSKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDSNG+   EN EQNQWSV VGSPS GF PLAKLS SDKAF
Sbjct: 61  EKSALEQQS-------VGVEDSNGALTVENTEQNQWSVEVGSPSFGFWPLAKLSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TF+ALTTSVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFVALTTSVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. NCBI nr
Match: XP_022931045.1 (uncharacterized protein LOC111437355 [Cucurbita moschata])

HSP 1 Score: 410.6 bits (1054), Expect = 1.5e-110
Identity = 230/283 (81.27%), Positives = 247/283 (87.28%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA FL+THC    LP+HGFSSS  KNSI+ V PCKF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATFLKTHCPCVSLPLHGFSSSPTKNSIVSVTPCKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDS+G+   EN EQNQWSV +GSPS GF PLAKLS SDKAF
Sbjct: 61  EKSALEQQS-------VGVEDSSGALTVENTEQNQWSVELGSPSFGFWPLAKLSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TFIAL TSVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFIALATSVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. NCBI nr
Match: XP_022996244.1 (uncharacterized protein LOC111491526 [Cucurbita maxima])

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-109
Identity = 228/283 (80.57%), Positives = 244/283 (86.22%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA  L+THC Y  LP+HGFS S  KNS + V PCKF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATSLKTHCPYVSLPLHGFSFSATKNSSVSVTPCKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDSNG+   EN EQNQWSV VGSPS GF PLAK S SDKAF
Sbjct: 61  EKSALEQQS-------VGVEDSNGALTVENTEQNQWSVEVGSPSFGFWPLAKFSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TFIALT SVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFIALTISVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMI+ERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIQERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. ExPASy TrEMBL
Match: A0A6J1EYE8 (uncharacterized protein LOC111437355 OS=Cucurbita moschata OX=3662 GN=LOC111437355 PE=4 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 7.2e-111
Identity = 230/283 (81.27%), Positives = 247/283 (87.28%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA FL+THC    LP+HGFSSS  KNSI+ V PCKF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATFLKTHCPCVSLPLHGFSSSPTKNSIVSVTPCKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDS+G+   EN EQNQWSV +GSPS GF PLAKLS SDKAF
Sbjct: 61  EKSALEQQS-------VGVEDSSGALTVENTEQNQWSVELGSPSFGFWPLAKLSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TFIAL TSVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFIALATSVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. ExPASy TrEMBL
Match: A0A6J1K868 (uncharacterized protein LOC111491526 OS=Cucurbita maxima OX=3661 GN=LOC111491526 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 1.0e-109
Identity = 228/283 (80.57%), Positives = 244/283 (86.22%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA  L+THC Y  LP+HGFS S  KNS + V PCKF+P   NL+AKPAR +VLCYRDS
Sbjct: 1   MRSATSLKTHCPYVSLPLHGFSFSATKNSSVSVTPCKFEPFLFNLRAKPARFLVLCYRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS+ +++S       +GVEDSNG+   EN EQNQWSV VGSPS GF PLAK S SDKAF
Sbjct: 61  EKSALEQQS-------VGVEDSNGALTVENTEQNQWSVEVGSPSFGFWPLAKFSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LL TFIALT SVAFTSLVIAAVPT  AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLFTFIALTISVAFTSLVIAAVPTFNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMI+ERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIQERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VV GAAKKTSRAV KATRTIMKMIS GESTEN+DDNS+DRLEV
Sbjct: 241 VVVGAAKKTSRAVGKATRTIMKMISGGESTENDDDNSLDRLEV 276

BLAST of Sgr015100 vs. ExPASy TrEMBL
Match: A0A5D3DAC3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold237G001000 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 5.2e-109
Identity = 228/283 (80.57%), Positives = 249/283 (87.99%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MR++ FL  H +YFL PV GF+ ST+K SILFVAPCKFKP F NL+AKP RLVV CY DS
Sbjct: 1   MRTSTFL--HSAYFLFPVRGFTCSTSKKSILFVAPCKFKPAFFNLRAKPDRLVVFCYGDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           E+S RDE+S       +GVEDSN + +EEN+E+N+W+V +G+PS GF  L KLS SDKAF
Sbjct: 61  ERSVRDEQS-------IGVEDSNVTLVEENVERNRWNVELGTPSVGFQLLPKLSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LLLTFIALTTSVAFTSLVIAAVPTL AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLLTFIALTTSVAFTSLVIAAVPTLNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMI+ERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIQERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VVAGAAKKTS AV KATRTIMKMIS GES EN+DDNS+DRLEV
Sbjct: 241 VVAGAAKKTSHAVGKATRTIMKMISGGESVENDDDNSLDRLEV 274

BLAST of Sgr015100 vs. ExPASy TrEMBL
Match: A0A1S3BHG6 (uncharacterized protein LOC103489884 OS=Cucumis melo OX=3656 GN=LOC103489884 PE=4 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 5.2e-109
Identity = 228/283 (80.57%), Positives = 249/283 (87.99%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MR++ FL  H +YFL PV GF+ ST+K SILFVAPCKFKP F NL+AKP RLVV CY DS
Sbjct: 1   MRTSTFL--HSAYFLFPVRGFTCSTSKKSILFVAPCKFKPAFFNLRAKPDRLVVFCYGDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           E+S RDE+S       +GVEDSN + +EEN+E+N+W+V +G+PS GF  L KLS SDKAF
Sbjct: 61  ERSVRDEQS-------IGVEDSNVTLVEENVERNRWNVELGTPSVGFQLLPKLSLSDKAF 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LLLTFIALTTSVAFTSLVIAAVPTL AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLLTFIALTTSVAFTSLVIAAVPTLNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMI+ERASLPIISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIQERASLPIISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VVAGAAKKTS AV KATRTIMKMIS GES EN+DDNS+DRLEV
Sbjct: 241 VVAGAAKKTSHAVGKATRTIMKMISGGESVENDDDNSLDRLEV 274

BLAST of Sgr015100 vs. ExPASy TrEMBL
Match: A0A6J1HHL4 (uncharacterized protein LOC111464126 OS=Cucurbita moschata OX=3662 GN=LOC111464126 PE=4 SV=1)

HSP 1 Score: 387.9 bits (995), Expect = 5.0e-104
Identity = 225/283 (79.51%), Positives = 243/283 (85.87%), Query Frame = 0

Query: 105 MRSAAFLRTHCSYFLLPVHGFSSSTAKNSILFVAPCKFKPTFLNLQAKPARLVVLCYRDS 164
           MRSA FL   C YFLLPVHGFSSST+KNSI      KFKPTF N +AKPARLVV C RDS
Sbjct: 1   MRSATFL--CCPYFLLPVHGFSSSTSKNSI------KFKPTFFNSRAKPARLVVFCCRDS 60

Query: 165 EKSSRDEESGGRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAF 224
           EKS R+E+S       +G++DSN + ME+NIE+ QW+  VG PS GF PLAKLS SDKA 
Sbjct: 61  EKSVRNEQS-------VGIDDSNRALMEKNIERMQWNEEVGCPSVGFQPLAKLSLSDKAS 120

Query: 225 LLLTFIALTTSVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEI 284
           LLLTFIALT SVAFTSLVIAAVPTL AMRRAA+SLSKLADTAREELPGTMAAIRLSGMEI
Sbjct: 121 LLLTFIALTASVAFTSLVIAAVPTLNAMRRAAISLSKLADTAREELPGTMAAIRLSGMEI 180

Query: 285 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQP 344
           SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQI A+AHQ+TMSMI+ERA LP+ISLQP
Sbjct: 181 SDLTLELSDLSQEIADGVNKSAQAVQAAEAGIRQISAVAHQKTMSMIQERADLPVISLQP 240

Query: 345 VVAGAAKKTSRAVSKATRTIMKMISRGESTENEDDNSIDRLEV 388
           VVAGAAKKTSRAV KATRTIMKMIS GES EN+DDNS+DRLEV
Sbjct: 241 VVAGAAKKTSRAVGKATRTIMKMISGGESMENDDDNSLDRLEV 268

BLAST of Sgr015100 vs. TAIR 10
Match: AT1G08530.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G09995.2); Has 140 Blast hits to 140 proteins in 53 species: Archae - 0; Bacteria - 63; Metazoa - 0; Fungi - 0; Plants - 76; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 217.6 bits (553), Expect = 1.7e-56
Identity = 131/200 (65.50%), Positives = 153/200 (76.50%), Query Frame = 0

Query: 185 DSNGSFMEENI------EQNQWSVGVGSPSG--GFPPLAKLSFSDKAFLLLTFIALTTSV 244
           +SNG     +I        N   + VGSP       PLAKLS SD+AFLLL FI  TTSV
Sbjct: 58  NSNGGMSRASISVFGGTSLNNLKMQVGSPISLHSINPLAKLSLSDQAFLLLAFIVCTTSV 117

Query: 245 AFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEISDLTLELSDLSQ 304
           AFTSLVI A+PTL AM RAA S +KLADTAR+ELP T+AA+RLSGMEISDLTLELSDLSQ
Sbjct: 118 AFTSLVITAIPTLVAMGRAATSFAKLADTARKELPSTLAAVRLSGMEISDLTLELSDLSQ 177

Query: 305 EIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQPVVAGAAKKTSRA 364
           +I DG+NKSA+AVQAAEAGI+QIG LA QQT+SMIEERA+LP ISLQPVVAGAA+KTS A
Sbjct: 178 DITDGINKSAKAVQAAEAGIKQIGTLAQQQTLSMIEERANLPEISLQPVVAGAAEKTSHA 237

Query: 365 VSKATRTIMKMISRGESTEN 377
           +  AT+ +M +I+ G   E+
Sbjct: 238 IGSATKRLMNIITGGNKDED 257

BLAST of Sgr015100 vs. TAIR 10
Match: AT5G09995.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 100.9 bits (250), Expect = 2.4e-21
Identity = 77/195 (39.49%), Positives = 110/195 (56.41%), Query Frame = 0

Query: 175 GRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAFLLLTFIALTT 234
           G DS A     +  S  +E    +Q +  VG P      L++ +F+ K F+LL  +A  T
Sbjct: 50  GSDSIASSTPSALYSNPQEPSISSQLTSSVGQPP---LQLSQWTFTQKHFVLLNVVACVT 109

Query: 235 SVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEISDLTLELSDL 294
           +++ + L  AA+PTL A ++AA SL KL D  REELP TMAA+RLSGMEISDLT+ELSDL
Sbjct: 110 AISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLTMELSDL 169

Query: 295 SQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQPVVAGAAKKTS 354
            Q I  GV  S +A++ AE  +R+   L +    SM E          +P++A  A+   
Sbjct: 170 GQGITQGVKSSTRAIRVAEDRLRR---LTNMNPASMQEVMRQTKTDETEPMLAKQARSFR 229

Query: 355 RAVSKATRTIMKMIS 370
             V K  R++ ++ S
Sbjct: 230 EGVVKG-RSLWQLFS 237

BLAST of Sgr015100 vs. TAIR 10
Match: AT5G09995.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 100.5 bits (249), Expect = 3.1e-21
Identity = 66/145 (45.52%), Positives = 91/145 (62.76%), Query Frame = 0

Query: 175 GRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAFLLLTFIALTT 234
           G DS A     +  S  +E    +Q +  VG P      L++ +F+ K F+LL  +A  T
Sbjct: 50  GSDSIASSTPSALYSNPQEPSISSQLTSSVGQPP---LQLSQWTFTQKHFVLLNVVACVT 109

Query: 235 SVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEISDLTLELSDL 294
           +++ + L  AA+PTL A ++AA SL KL D  REELP TMAA+RLSGMEISDLT+ELSDL
Sbjct: 110 AISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLTMELSDL 169

Query: 295 SQEIADGVNKSAQAVQAAEAGIRQI 320
            Q I  GV  S +A++ AE  +R++
Sbjct: 170 GQGITQGVKSSTRAIRVAEDRLRRL 191

BLAST of Sgr015100 vs. TAIR 10
Match: AT5G09995.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 99.4 bits (246), Expect = 6.9e-21
Identity = 76/195 (38.97%), Positives = 110/195 (56.41%), Query Frame = 0

Query: 175 GRDSEAMGVEDSNGSFMEENIEQNQWSVGVGSPSGGFPPLAKLSFSDKAFLLLTFIALTT 234
           G DS A     +  S  +E    +Q +  VG P      L++ +F+ K F+LL  +A  T
Sbjct: 50  GSDSIASSTPSALYSNPQEPSISSQLTSSVGQPP---LQLSQWTFTQKHFVLLNVVACVT 109

Query: 235 SVAFTSLVIAAVPTLYAMRRAALSLSKLADTAREELPGTMAAIRLSGMEISDLTLELSDL 294
           +++ + L  AA+PTL A ++AA SL KL D  REELP TMAA+RLSGMEISDLT+ELSDL
Sbjct: 110 AISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLTMELSDL 169

Query: 295 SQEIADGVNKSAQAVQAAEAGIRQIGALAHQQTMSMIEERASLPIISLQPVVAGAAKKTS 354
            Q I  GV  S +A++ AE  +R++  +      SM E          +P++A  A+   
Sbjct: 170 GQGITQGVKSSTRAIRVAEDRLRRLTNM--NPVASMQEVMRQTKTDETEPMLAKQARSFR 229

Query: 355 RAVSKATRTIMKMIS 370
             V K  R++ ++ S
Sbjct: 230 EGVVKG-RSLWQLFS 238

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038889010.1  1.0e-111  82.33         uncharacterized protein LOC120078775 [Benincasa hispida]
XP_023533033.1  2.3e-111  81.27         uncharacterized protein LOC111795040 [Cucurbita pepo subsp. pepo]
KAG6606380.1    8.7e-111  81.63         hypothetical protein SDJN03_03697, partial [Cucurbita argyrosperma subsp. sorori...
XP_022931045.1  1.5e-110  81.27         uncharacterized protein LOC111437355 [Cucurbita moschata]
XP_022996244.1  2.2e-109  80.57         uncharacterized protein LOC111491526 [Cucurbita maxima]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1EYE8      7.2e-111  81.27         uncharacterized protein LOC111437355 OS=Cucurbita moschata OX=3662 GN=LOC1114373...
A0A6J1K868      1.0e-109  80.57         uncharacterized protein LOC111491526 OS=Cucurbita maxima OX=3661 GN=LOC111491526...
A0A5D3DAC3      5.2e-109  80.57         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BHG6      5.2e-109  80.57         uncharacterized protein LOC103489884 OS=Cucumis melo OX=3656 GN=LOC103489884 PE=...
A0A6J1HHL4      5.0e-104  79.51         uncharacterized protein LOC111464126 OS=Cucurbita moschata OX=3662 GN=LOC1114641...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT1G08530.1     1.7e-56   65.50         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G09995.3     2.4e-21   39.49         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G09995.1     3.1e-21   45.52         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G09995.2     6.9e-21   38.97         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term     Source Description      Alignment
None      No IPR available  PANTHER  PTHR33825:SF14  CHITINASE-LIKE PROTEIN  coord: 136..387
None      No IPR available  PANTHER  PTHR33825       CHITINASE-LIKE PROTEIN  coord: 136..387

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr015100.1   Sgr015100.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016567      protein ubiquitination
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0016740      transferase activity