Sgr024810 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr024810
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: BHLH domain-containing protein
Location: tig00002486: 3027017 .. 3029500 (+)
RNA-Seq Expression: Sgr024810
Synteny: Sgr024810
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACGGCTGCACCGACTCTGTGATTCCACTTTTGCCGCAGATAATGCCAGTGCATGATTCTGAAGCAGTCGACGAGAAGGCTTCGTTCTCAAGAAAGCGTCGCAGAGCCCTGGAAGCCAACGGAGGTATACAGAAGGGGAGAGAGAAGAGGAAGGAGATGAGCGAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATATCTCTCCCAAGGTTAATCGATTTGATTCTCGTTGTCGTATATACTTAGACGTTGTCAAGATTTTGCTTCTTCTACATTTCGACTTCGCTATTTTAATCCAAGTTTTTGTGCTGAAATTATCAGGCTACGAGGGAGAATATTGTTTCCGAGACGATCCAGTTCATCGAGTTTCTGCAGAAGCAGTTGATGAGGCTGGAGATGAAGAAGAAACCATCGGAATCGGTGACAATGCTTCCCAGTACGAACTCGGATTCATCAGGCGGCGTCATCGTCTCGGTCTCCGGCAACATTGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGGTATGGTGACACAGATTTTAATGGTGTTTGAAAGACACCGGGCTGAAGTTCTAGCAGCAAATGTTGCAGTCGGCCATGGCAAATTAACTTTAACAGTCACAGCTTCTGTACACGGTTACACAGAAAATACCGTAGAGCAGATTAAAAACGATATCCTCAGCTTAAAGAAATTATAAATTCCATTTTTCACGAGATTCTTCCCTCCTCATTTACTACGAGGGATTTAATATTTTTATAGAACTAGACAGTGAAAGAATATAATATAGTTGTCATATATTTTGGAGTTTACTTCAAAATCATGTTATTGAATATTGATTAAGTTCCAAGTTGGGTTCATCTTTTGATTCTACATAATATTATTCTTATTTGGCAGTAAAGATCTCTCTCTCTACACTGATAATTTTGGACGAAACTGTTCTGTTATTGAGATGAAGTTGTATTTCACCTTTTAGTTGTAGTTGTTATCATTCCTGTTTTCCGAGAATCTCTCCCCCTCTCTCTACACTGATTATATATATATATATGTTGTAATATATATTTTATTATATGTATGCCTGCATATATATATTTATTTGTGTGTATATATATACATATGTATATTAGATGTATATACATGAATATATATACTAAACATACATTATATATAATGGTGTATACATATGCTCATAGATATATACTTTCTTACAAGTTTGTTGCCCATCTAACATTGAGAGCCGACCGACCTGAACAAGTTAAAGCTATAGTAATGGTTCGAATCCTCACTCTTTATTTGAATTATAATATTTAAAAAAAATATTGTGTATACTAATGAAGTCTACTTTAATCGTGAAATGTCTTCTCATTTGGGTAAAGATTATCTATTTAGAAGTGAAATATTTTTAACAAAGAGGTTGATGGTACTTTTGCTTGGACGAAAACCAAATTTATGAATTTGGTCATAAATTGTTGTAAAAGATTCCACAGACCCGCATGAGGAAACTCTTATCTGATGGTATGGTTAGGATTCAGCACAGTCCACTGGAGCAAAAGAAATAACTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGATGAAAGTTCCCCAATACAGAAATTGGAGATCCAGAACGCAGAAGGGCAAGGCAGATAACTCCATCATCCTCTATCTTCACGAAGGTACTTTCTACTTTAAGAACTAAATCTGCACCATCAAAATGAACTGTAACATCTGGAAATGACTCCAAATCATCTGCATTTGCTGCAAAGCACAATTCAAATCTGTTTCTAGGATCATCTTTTCTCTTTGGTAAATCTGGTAGTGTAAGGAATTTATCTAGCAAACTGTCGAATGCATCTGTTTCAAGACTTGAGTATGTTGTCCCTGAATCTACGATCCATCCATCTCCGACATCGTATACATCAAAAACTCCATTTAAGTAGAGCTCATCGTCACCGACACTAATTCCGAGAACCTTGACATAATAAGCAT
CTAAATTGGGATATAGCAGAGGAGTTTGACCCCCAGAAGTTACAGGTAATGATCCAAAATACATTTTACTTGCTGATCCCAAATTGAAAGGAACCAAGCAGTAGGAGAACTTTTTGATACCCAGTTGAGAAATTAATGACAGGGGTGTTTGGTTCAAGCCCACACTGCCCATATAACTCTGCATACCTCCTGTTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATTTGAGGTATCAAAACTAAAACTATCAGATGAAAGAATCCCACTTGTTGCAGAATTGTCTTCATATTCTAATCGGTATTTGCACCATTTGTCAGAAGAATTGCAGGTTTGGAAGCCAGTCAAGGAATTGCAAAAGTTAGAGCCACATGGCTCCAACTCATAGGTGAAGGATTTGGAGGAGTGGAACTTGGTGTTGGTGGGGCCTTTTTCTGGCTCACATTGATTACTGCAGTTTGA

mRNA sequence

ATGGACGGCTGCACCGACTCTGTGATTCCACTTTTGCCGCAGATAATGCCAGTGCATGATTCTGAAGCAGTCGACGAGAAGGCTTCGTTCTCAAGAAAGCGTCGCAGAGCCCTGGAAGCCAACGGAGGTATACAGAAGGGGAGAGAGAAGAGGAAGGAGATGAGCGAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATATCTCTCCCAAGGTTAATCGATTTGATTCTCGTTGTCGTATATACTTAGACGCTACGAGGGAGAATATTGTTTCCGAGACGATCCAGTTCATCGAGTTTCTGCAGAAGCAGTTGATGAGGCTGGAGATGAAGAAGAAACCATCGGAATCGGTGACAATGCTTCCCAGTACGAACTCGGATTCATCAGGCGGCGTCATCGTCTCGGTCTCCGGCAACATTGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGGTATGGTGACACAGATTTTAATGGTGTTTGAAAGACACCGGGCTGAAGTTCTAGCAGCAAATGTTGCAGTCGGCCATGGCAAATTAACTTTAACAGTCACAGCTTCTGTACACGGATTCAGCACAGTCCACTGGAGCAAAAGAAATAACTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGATGAAAGTTCCCCAATACAGAAATTGGAGATCCAGAACGCAGAAGGGCAAGGCAGATAACTCCATCATCCTCTATCTTCACGAAGGGGTGTTTGGTTCAAGCCCACACTGCCCATATAACTCTGCATACCTCCTGTTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATTTGAGAAGAATTGCAGGTTTGGAAGCCAGTCAAGGAATTGCAAAAGTTAGAGCCACATGGCTCCAACTCATAGGTGAAGGATTTGGAGGAGTGGAACTTGGTGTTGGTGGGGCCTTTTTCTGGCTCACATTGATTACTGCAGTTTGA

Coding sequence (CDS)

ATGGACGGCTGCACCGACTCTGTGATTCCACTTTTGCCGCAGATAATGCCAGTGCATGATTCTGAAGCAGTCGACGAGAAGGCTTCGTTCTCAAGAAAGCGTCGCAGAGCCCTGGAAGCCAACGGAGGTATACAGAAGGGGAGAGAGAAGAGGAAGGAGATGAGCGAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATATCTCTCCCAAGGTTAATCGATTTGATTCTCGTTGTCGTATATACTTAGACGCTACGAGGGAGAATATTGTTTCCGAGACGATCCAGTTCATCGAGTTTCTGCAGAAGCAGTTGATGAGGCTGGAGATGAAGAAGAAACCATCGGAATCGGTGACAATGCTTCCCAGTACGAACTCGGATTCATCAGGCGGCGTCATCGTCTCGGTCTCCGGCAACATTGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGGTATGGTGACACAGATTTTAATGGTGTTTGAAAGACACCGGGCTGAAGTTCTAGCAGCAAATGTTGCAGTCGGCCATGGCAAATTAACTTTAACAGTCACAGCTTCTGTACACGGATTCAGCACAGTCCACTGGAGCAAAAGAAATAACTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGATGAAAGTTCCCCAATACAGAAATTGGAGATCCAGAACGCAGAAGGGCAAGGCAGATAACTCCATCATCCTCTATCTTCACGAAGGGGTGTTTGGTTCAAGCCCACACTGCCCATATAACTCTGCATACCTCCTGTTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATTTGAGAAGAATTGCAGGTTTGGAAGCCAGTCAAGGAATTGCAAAAGTTAGAGCCACATGGCTCCAACTCATAGGTGAAGGATTTGGAGGAGTGGAACTTGGTGTTGGTGGGGCCTTTTTCTGGCTCACATTGATTACTGCAGTTTGA

Protein sequence

MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQKGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVTMLPSTNSDSSGGVIVSVSGNIVLFGILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGFSTVHWSKRNNLSFKVIPNIVVLQMKVPQYRNWRSRTQKGKADNSIILYLHEGVFGSSPHCPYNSAYLLLKEHLNSQSSNSQHPQVSHLRRIAGLEASQGIAKVRATWLQLIGEGFGGVELGVGGAFFWLTLITAV
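The protein sequence above can be checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies. The `translate` helper and the table construction are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGACGGCTGCACCGACTCTGTGATTCCA"))  # MDGCTDSVIP
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above if the two records are consistent.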
Homology
BLAST of Sgr024810 vs. NCBI nr
Match: XP_038885840.1 (transcription factor bHLH95-like [Benincasa hispida])

HSP 1 Score: 268.9 bits (686), Expect = 5.9e-68
Identity = 159/213 (74.65%), Positives = 174/213 (81.69%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+PL P I+PVH SEA ++KAS SRKR RALEANGG+Q K REKRKEMSESFD
Sbjct: 1   MEACTDSVVPLFPLILPVHQSEATEKKASASRKRIRALEANGGVQKKEREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIEFLQKQLMRLEM+KK SESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEFLQKQLMRLEMEKKSSESVT 120

Query: 121 MLPSTNSDSS----GGVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVA 180
           MLPSTNSDSS    GGVIVSVSGNIVLFG I+ASV+RGMVTQILMVFERH+AEVLAANVA
Sbjct: 121 MLPSTNSDSSGGGGGGVIVSVSGNIVLFGIIIASVQRGMVTQILMVFERHQAEVLAANVA 180

Query: 181 VGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           V HG LTLTVTASVHG+  +T+   K + LS K
Sbjct: 181 VSHGNLTLTVTASVHGYVENTIELIKNDILSLK 200
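The percentages in each HSP summary line are the matching positions divided by the alignment length, gaps included. The sketch below recomputes them from the fractions. It assumes the summary line keeps the `label = matches/length` layout shown above, and `hsp_fractions` is a hypothetical helper name.

```python
import re


def hsp_fractions(line):
    """Return {label: (matches, alignment_length, percent)} for each 'label = a/b' pair."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


stats = hsp_fractions("Identity = 159/213 (74.65%), Query Frame = 0")
print(stats["Identity"])  # (159, 213, 74.65)
```

Fields without a fraction, such as `Query Frame = 0`, are skipped by the pattern.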

BLAST of Sgr024810 vs. NCBI nr
Match: XP_022955986.1 (uncharacterized protein LOC111457820 [Cucurbita moschata] >XP_022956015.1 uncharacterized protein LOC111457838 isoform X1 [Cucurbita moschata])

HSP 1 Score: 267.3 bits (682), Expect = 1.7e-67
Identity = 157/210 (74.76%), Positives = 171/210 (81.43%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVSLLPHILPVHQSEAAENKASTSRKRRRALEANGGVQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQLMRLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLMRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGH 180
           MLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV H
Sbjct: 121 MLPSTNSDSPGGGVIVSVSSNIVLFGIIFASVRRGMVTQILMAFERHQAEVLAANVAVSH 180

Query: 181 GKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           G LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 GNLTLTVTASVHGYVENTIEQIRNDILSLK 197

BLAST of Sgr024810 vs. NCBI nr
Match: KAG6582238.1 (Transcription factor basic helix-loop-helix 95, partial [Cucurbita argyrosperma subsp. sororia] >KAG7018637.1 Transcription factor bHLH95 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 267.3 bits (682), Expect = 1.7e-67
Identity = 157/210 (74.76%), Positives = 171/210 (81.43%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVTLLPHILPVHQSEAAENKASTSRKRRRALEANGGVQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQLMRLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLMRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGH 180
           MLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV H
Sbjct: 121 MLPSTNSDSPGGGVIVSVSSNIVLFGIIFASVRRGMVTQILMAFERHQAEVLAANVAVSH 180

Query: 181 GKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           G LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 GNLTLTVTASVHGYVENTIEQIRNDILSLK 197

BLAST of Sgr024810 vs. NCBI nr
Match: XP_023528352.1 (uncharacterized protein LOC111791298 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 263.5 bits (672), Expect = 2.5e-66
Identity = 155/213 (72.77%), Positives = 170/213 (79.81%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVSLLPHILPVHQSEAAENKASTSRKRRRALEANGGVQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQL RLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLTRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS----SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVA 180
           MLPSTNSDS     GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVA
Sbjct: 121 MLPSTNSDSPGGGDGGVIVSVSSNIVLFGIIFASVRRGMVTQILMAFERHQAEVLAANVA 180

Query: 181 VGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           V HG LTLTVTAS+HG+  +T+   + + LS K
Sbjct: 181 VSHGNLTLTVTASIHGYVENTIEQIRNDILSLK 200

BLAST of Sgr024810 vs. NCBI nr
Match: XP_022979931.1 (uncharacterized protein LOC111479473 [Cucurbita maxima])

HSP 1 Score: 258.5 bits (659), Expect = 8.0e-65
Identity = 155/215 (72.09%), Positives = 169/215 (78.60%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKR RALEANGG Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVSLLPHILPVHQSEAAENKASTSRKRWRALEANGGEQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQLMRLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLMRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAAN 180
           MLPSTNSDS       GGVIVSVS NIVLFGI+ ASVRRGMVT+ILM FERH+AEVLAAN
Sbjct: 121 MLPSTNSDSPGGDGGDGGVIVSVSSNIVLFGIIFASVRRGMVTRILMAFERHQAEVLAAN 180

Query: 181 VAVGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           VAV HG LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 VAVSHGNLTLTVTASVHGYVENTIEQIRNDILSLK 202

BLAST of Sgr024810 vs. ExPASy TrEMBL
Match: A0A6J1GWN9 (uncharacterized protein LOC111457820 OS=Cucurbita moschata OX=3662 GN=LOC111457838 PE=4 SV=1)

HSP 1 Score: 267.3 bits (682), Expect = 8.3e-68
Identity = 157/210 (74.76%), Positives = 171/210 (81.43%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVSLLPHILPVHQSEAAENKASTSRKRRRALEANGGVQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQLMRLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLMRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGH 180
           MLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV H
Sbjct: 121 MLPSTNSDSPGGGVIVSVSSNIVLFGIIFASVRRGMVTQILMAFERHQAEVLAANVAVSH 180

Query: 181 GKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           G LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 GNLTLTVTASVHGYVENTIEQIRNDILSLK 197

BLAST of Sgr024810 vs. ExPASy TrEMBL
Match: A0A6J1IS55 (uncharacterized protein LOC111479473 OS=Cucurbita maxima OX=3661 GN=LOC111479473 PE=4 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 3.9e-65
Identity = 155/215 (72.09%), Positives = 169/215 (78.60%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ CTDSV+ LLP I+PVH SEA + KAS SRKR RALEANGG Q KGREKRKEMSESFD
Sbjct: 1   MEACTDSVVSLLPHILPVHQSEAAENKASTSRKRWRALEANGGEQRKGREKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VLQSLVPN+SPK             ATRE IVSETIQFIE LQKQLMRLEM+KKP ESVT
Sbjct: 61  VLQSLVPNLSPK-------------ATRETIVSETIQFIEDLQKQLMRLEMEKKPLESVT 120

Query: 121 MLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAAN 180
           MLPSTNSDS       GGVIVSVS NIVLFGI+ ASVRRGMVT+ILM FERH+AEVLAAN
Sbjct: 121 MLPSTNSDSPGGDGGDGGVIVSVSSNIVLFGIIFASVRRGMVTRILMAFERHQAEVLAAN 180

Query: 181 VAVGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           VAV HG LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 VAVSHGNLTLTVTASVHGYVENTIEQIRNDILSLK 202

BLAST of Sgr024810 vs. ExPASy TrEMBL
Match: A0A6J1CA71 (uncharacterized protein LOC111009370 OS=Momordica charantia OX=3673 GN=LOC111009370 PE=4 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 1.4e-62
Identity = 151/199 (75.88%), Positives = 166/199 (83.42%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVD-EKASFSRKRRRA-LEANGGIQKGREKRKEMSESF 60
           M+ CTDSVIPLL QI+PV +SEA D  KAS SRKRRRA LEA GG+QKGR KRKEM++SF
Sbjct: 1   MEACTDSVIPLLTQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSF 60

Query: 61  DVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKK-PSES 120
           DVLQSLVPN+SPK             ATRENIVSETIQFI+FL+KQLMRLEMKKK PSES
Sbjct: 61  DVLQSLVPNLSPK-------------ATRENIVSETIQFIDFLEKQLMRLEMKKKLPSES 120

Query: 121 V--TML-PSTNSDSS--GGVIVSVSGNIVLFGILASVRRGMVTQILMVFERHRAEVLAAN 180
           V  TM+ PSTNSDSS  GGVIVS SGNIVLFGILASVRRGMVTQILM FER++AEVLAAN
Sbjct: 121 VIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAAN 180

Query: 181 VAVGHGKLTLTVTASVHGF 192
           VAV HG L+LT+TASVHG+
Sbjct: 181 VAVSHGNLSLTITASVHGY 186

BLAST of Sgr024810 vs. ExPASy TrEMBL
Match: A0A5A7U767 (Transcription factor bHLH95-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G00480 PE=4 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 2.0e-61
Identity = 143/213 (67.14%), Positives = 169/213 (79.34%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ C+DSV+PL P I+P+H  EA +++AS SRKR RALEANGG+Q K +EKRKEMSESFD
Sbjct: 1   MEACSDSVVPLFPLILPIHHFEATEKEASASRKRCRALEANGGVQKKEKEKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VL+SLVPN+SPK             ATRE IVS  IQFIEFLQKQLMRLEM+KK SESVT
Sbjct: 61  VLRSLVPNLSPK-------------ATRETIVSGAIQFIEFLQKQLMRLEMEKKSSESVT 120

Query: 121 MLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVA 180
           +LP++NSDSSG    GVIVS+SGNIVLFG I+ASV+RGMVTQIL+VFERH+ EVLAANV 
Sbjct: 121 LLPNSNSDSSGGNGDGVIVSISGNIVLFGVIIASVQRGMVTQILLVFERHKTEVLAANVV 180

Query: 181 VGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           V HG LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 VSHGNLTLTVTASVHGYVENTIEQIRNDILSLK 200

BLAST of Sgr024810 vs. ExPASy TrEMBL
Match: A0A1S4DSM0 (transcription factor bHLH95-like OS=Cucumis melo OX=3656 GN=LOC103483801 PE=4 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 2.0e-61
Identity = 143/213 (67.14%), Positives = 169/213 (79.34%), Query Frame = 0

Query: 1   MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFD 60
           M+ C+DSV+PL P I+P+H  EA +++AS SRKR RALEANGG+Q K +EKRKEMSESFD
Sbjct: 1   MEACSDSVVPLFPLILPIHHFEATEKEASASRKRCRALEANGGVQKKEKEKRKEMSESFD 60

Query: 61  VLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVT 120
           VL+SLVPN+SPK             ATRE IVS  IQFIEFLQKQLMRLEM+KK SESVT
Sbjct: 61  VLRSLVPNLSPK-------------ATRETIVSGAIQFIEFLQKQLMRLEMEKKSSESVT 120

Query: 121 MLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVA 180
           +LP++NSDSSG    GVIVS+SGNIVLFG I+ASV+RGMVTQIL+VFERH+ EVLAANV 
Sbjct: 121 LLPNSNSDSSGGNGDGVIVSISGNIVLFGVIIASVQRGMVTQILLVFERHKTEVLAANVV 180

Query: 181 VGHGKLTLTVTASVHGF--STVHWSKRNNLSFK 206
           V HG LTLTVTASVHG+  +T+   + + LS K
Sbjct: 181 VSHGNLTLTVTASVHGYVENTIEQIRNDILSLK 200

The following BLAST results are available for this feature:

BLAST vs. NCBI nr
Match Name | E-value | Identity | Description
XP_038885840.1 | 5.9e-68 | 74.65 | transcription factor bHLH95-like [Benincasa hispida]
XP_022955986.1 | 1.7e-67 | 74.76 | uncharacterized protein LOC111457820 [Cucurbita moschata] >XP_022956015.1 unchar...
KAG6582238.1 | 1.7e-67 | 74.76 | Transcription factor basic helix-loop-helix 95, partial [Cucurbita argyrosperma ...
XP_023528352.1 | 2.5e-66 | 72.77 | uncharacterized protein LOC111791298 [Cucurbita pepo subsp. pepo]
XP_022979931.1 | 8.0e-65 | 72.09 | uncharacterized protein LOC111479473 [Cucurbita maxima]

BLAST vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1GWN9 | 8.3e-68 | 74.76 | uncharacterized protein LOC111457820 OS=Cucurbita moschata OX=3662 GN=LOC1114578...
A0A6J1IS55 | 3.9e-65 | 72.09 | uncharacterized protein LOC111479473 OS=Cucurbita maxima OX=3661 GN=LOC111479473...
A0A6J1CA71 | 1.4e-62 | 75.88 | uncharacterized protein LOC111009370 OS=Momordica charantia OX=3673 GN=LOC111009...
A0A5A7U767 | 2.0e-61 | 67.14 | Transcription factor bHLH95-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A1S4DSM0 | 2.0e-61 | 67.14 | transcription factor bHLH95-like OS=Cucumis melo OX=3656 GN=LOC103483801 PE=4 SV...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036638 | Helix-loop-helix DNA-binding domain superfamily | GENE3D | 4.10.280.10 | | coord: 31..125; e-value: 1.2E-5; score: 27.2
IPR036638 | Helix-loop-helix DNA-binding domain superfamily | SUPERFAMILY | 47459 | HLH, helix-loop-helix DNA-binding domain | coord: 48..113
None | No IPR available | PANTHER | PTHR31945 | TRANSCRIPTION FACTOR SCREAM2-RELATED | coord: 6..190
None | No IPR available | PANTHER | PTHR31945:SF53 | HELIX LOOP HELIX DNA-BINDING DOMAIN PROTEIN | coord: 6..190
None | No IPR available | CDD | cd11393 | bHLH_AtbHLH_like | coord: 48..109; e-value: 1.90441E-7; score: 45.25
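The `coord` ranges in the InterPro annotations can be used to pull the matching stretch out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein listed earlier; the page itself does not state this. `subsequence` and `PROTEIN_PREFIX` are illustrative names.

```python
# First 120 residues of the protein sequence listed earlier on this page,
# which is enough to cover the CDD hit at 48..109.
PROTEIN_PREFIX = (
    "MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQKGREKRKEMSESFDV"
    "LQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEMKKKPSESVTM"
)


def subsequence(seq, start, end):
    """Return residues start..end, with both coordinates 1-based and inclusive (assumed)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}..{end} is outside 1..{len(seq)}")
    return seq[start - 1:end]


cdd_hit = subsequence(PROTEIN_PREFIX, 48, 109)  # cd11393, bHLH_AtbHLH_like
print(len(cdd_hit), cdd_hit)
```

Ranges that extend past residue 120, such as the PANTHER hits at 6..190, need the full protein string rather than this prefix.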

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr024810.1 | Sgr024810.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005634 nucleus
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0043565 sequence-specific DNA binding