Sgr023301 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr023301
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionCoiled-coil domain-containing protein 96-like
Locationtig00000892: 2008675 .. 2009769 (+)
RNA-Seq ExpressionSgr023301
SyntenySgr023301
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG

mRNA sequence

ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG

Coding sequence (CDS)

ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG

Protein sequence

MLCEEEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISSVPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNGSVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYPT
Homology
BLAST of Sgr023301 vs. NCBI nr
Match: XP_022140313.1 (uncharacterized protein LOC111011014 isoform X1 [Momordica charantia])

HSP 1 Score: 526.6 bits (1355), Expect = 1.8e-145
Identity = 295/364 (81.04%), Postives = 312/364 (85.71%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEG+EAADRNLYP +T +MDSN    RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4   EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
           VPRQESCADSQI KLEFDG+VSG  GGFC DFN R GAS  EED ESVA VEN GGGGNG
Sbjct: 64  VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
            V VVSVGGS TEKC +G    DGENAK SESP   T+T+KYEVEEHMEK  EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183

Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
           HIQRG+    RF+ENESEKT  EEEGKKEKLTGSS GGSDDG VETAP++  + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243

Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
           LGLLIEAARLILGD GENEFETEST E  ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303

Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 364
           EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 362

BLAST of Sgr023301 vs. NCBI nr
Match: XP_022140314.1 (uncharacterized protein LOC111011014 isoform X2 [Momordica charantia])

HSP 1 Score: 522.7 bits (1345), Expect = 2.5e-144
Identity = 293/361 (81.16%), Postives = 310/361 (85.87%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEG+EAADRNLYP +T +MDSN    RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4   EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
           VPRQESCADSQI KLEFDG+VSG  GGFC DFN R GAS  EED ESVA VEN GGGGNG
Sbjct: 64  VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
            V VVSVGGS TEKC +G    DGENAK SESP   T+T+KYEVEEHMEK  EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183

Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
           HIQRG+    RF+ENESEKT  EEEGKKEKLTGSS GGSDDG VETAP++  + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243

Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
           LGLLIEAARLILGD GENEFETEST E  ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303

Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 362
           EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 359

BLAST of Sgr023301 vs. NCBI nr
Match: XP_022985998.1 (uncharacterized protein LOC111483864 isoform X2 [Cucurbita maxima])

HSP 1 Score: 517.3 bits (1331), Expect = 1.1e-142
Identity = 281/361 (77.84%), Postives = 308/361 (85.32%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYP 364
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSRY 
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSRYR 360

BLAST of Sgr023301 vs. NCBI nr
Match: XP_022985999.1 (uncharacterized protein LOC111483864 isoform X3 [Cucurbita maxima])

HSP 1 Score: 513.5 bits (1321), Expect = 1.5e-141
Identity = 279/358 (77.93%), Postives = 306/358 (85.47%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSR 362
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSR
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSR 357

BLAST of Sgr023301 vs. NCBI nr
Match: XP_022985997.1 (uncharacterized protein LOC111483864 isoform X1 [Cucurbita maxima])

HSP 1 Score: 512.3 bits (1318), Expect = 3.4e-141
Identity = 278/357 (77.87%), Postives = 305/357 (85.43%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 361
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRS
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRS 356

BLAST of Sgr023301 vs. ExPASy TrEMBL
Match: A0A6J1CHQ9 (uncharacterized protein LOC111011014 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011014 PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 8.5e-146
Identity = 295/364 (81.04%), Postives = 312/364 (85.71%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEG+EAADRNLYP +T +MDSN    RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4   EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
           VPRQESCADSQI KLEFDG+VSG  GGFC DFN R GAS  EED ESVA VEN GGGGNG
Sbjct: 64  VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
            V VVSVGGS TEKC +G    DGENAK SESP   T+T+KYEVEEHMEK  EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183

Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
           HIQRG+    RF+ENESEKT  EEEGKKEKLTGSS GGSDDG VETAP++  + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243

Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
           LGLLIEAARLILGD GENEFETEST E  ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303

Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 364
           EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 362

BLAST of Sgr023301 vs. ExPASy TrEMBL
Match: A0A6J1CFC4 (uncharacterized protein LOC111011014 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111011014 PE=4 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.2e-144
Identity = 293/361 (81.16%), Postives = 310/361 (85.87%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEG+EAADRNLYP +T +MDSN    RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4   EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
           VPRQESCADSQI KLEFDG+VSG  GGFC DFN R GAS  EED ESVA VEN GGGGNG
Sbjct: 64  VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
            V VVSVGGS TEKC +G    DGENAK SESP   T+T+KYEVEEHMEK  EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183

Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
           HIQRG+    RF+ENESEKT  EEEGKKEKLTGSS GGSDDG VETAP++  + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243

Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
           LGLLIEAARLILGD GENEFETEST E  ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303

Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 362
           EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 359

BLAST of Sgr023301 vs. ExPASy TrEMBL
Match: A0A6J1JF75 (uncharacterized protein LOC111483864 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)

HSP 1 Score: 517.3 bits (1331), Expect = 5.2e-143
Identity = 281/361 (77.84%), Postives = 308/361 (85.32%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYP 364
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSRY 
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSRYR 360

BLAST of Sgr023301 vs. ExPASy TrEMBL
Match: A0A6J1JEV5 (uncharacterized protein LOC111483864 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 7.4e-142
Identity = 279/358 (77.93%), Postives = 306/358 (85.47%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSR 362
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSR
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSR 357

BLAST of Sgr023301 vs. ExPASy TrEMBL
Match: A0A6J1J6D7 (uncharacterized protein LOC111483864 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.7e-141
Identity = 278/357 (77.87%), Postives = 305/357 (85.43%), Query Frame = 0

Query: 5   EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
           EEGDEAADR LYP KTL M+S+P  NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4   EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63

Query: 65  VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
           VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA   EED ESVAFVEN  GGGG+G
Sbjct: 64  VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123

Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
           SV  V V GSATEKCS G    DGENAKCSESP  +T KYEVEEH  +IPE S+IA+FH 
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183

Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
           QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV  SDD K ET P++  +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243

Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
           LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS  KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303

Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 361
           WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRS
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRS 356

BLAST of Sgr023301 vs. TAIR 10
Match: AT4G27910.1 (SET domain protein 16 )

HSP 1 Score: 42.7 bits (99), Expect = 7.2e-04
Identity = 36/103 (34.95%), Postives = 50/103 (48.54%), Query Frame = 0

Query: 242 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHS-SGSKREL 301
           L LL E A  I+   G N F      E           +P ++E+ +S+  S SG+ R+ 
Sbjct: 41  LNLLGEIAAGIVPGNGRNGFSASWCTE---------VTKPVEVEESLSKRRSDSGTVRDS 100

Query: 302 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWR 344
             + +          R PLVR+ RGR QVLP R+ DSVL+ WR
Sbjct: 101 PPAEV---------SRPPLVRTSRGRIQVLPSRFNDSVLDNWR 125

BLAST of Sgr023301 vs. TAIR 10
Match: AT5G53430.1 (SET domain group 29 )

HSP 1 Score: 42.4 bits (98), Expect = 9.4e-04
Identity = 18/28 (64.29%), Postives = 22/28 (78.57%), Query Frame = 0

Query: 316 RSPLVRSKRGRSQVLPCRYKDSVLEPWR 344
           R PLV++ RGR QVLP R+ DSV+E WR
Sbjct: 95  RPPLVKTSRGRVQVLPSRFNDSVIENWR 122

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022140313.11.8e-14581.04uncharacterized protein LOC111011014 isoform X1 [Momordica charantia][more]
XP_022140314.12.5e-14481.16uncharacterized protein LOC111011014 isoform X2 [Momordica charantia][more]
XP_022985998.11.1e-14277.84uncharacterized protein LOC111483864 isoform X2 [Cucurbita maxima][more]
XP_022985999.11.5e-14177.93uncharacterized protein LOC111483864 isoform X3 [Cucurbita maxima][more]
XP_022985997.13.4e-14177.87uncharacterized protein LOC111483864 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1CHQ98.5e-14681.04uncharacterized protein LOC111011014 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1CFC41.2e-14481.16uncharacterized protein LOC111011014 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A6J1JF755.2e-14377.84uncharacterized protein LOC111483864 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1JEV57.4e-14277.93uncharacterized protein LOC111483864 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1J6D71.7e-14177.87uncharacterized protein LOC111483864 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT4G27910.17.2e-0434.95SET domain protein 16 [more]
AT5G53430.19.4e-0464.29SET domain group 29 [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 272..297
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 259..301
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 138..162
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 345..364
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 185..225
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 133..162
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 185..212

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr023301.1Sgr023301.1mRNA