Sgr029749 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029749
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: PHD finger-like protein
Location: tig00153449: 2441047 .. 2441946 (+)
RNA-Seq Expression: Sgr029749
Synteny: Sgr029749
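The Location field combines a contig name, start and end coordinates, and a strand. As a rough sketch, the span length can be derived from it as follows. The helper name `parse_location` and the regular expression are illustrative assumptions, not part of the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Hypothetical helper: split a "contig: start .. end (strand)" string into parts.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

contig, start, end, strand = parse_location("tig00153449: 2441047 .. 2441946 (+)")
span = end - start + 1  # assumption: coordinates are 1-based and inclusive
codons = span // 3      # includes the stop codon if the span is a complete CDS
print(contig, strand, span, codons)
```

Under that assumption the span is 900 bp, or 300 codons, which fits a 299-residue protein plus a stop codon.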
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGCTAACATCTGTGATGTCAATCACCTTGATGCCGATGCCCTTCTACCTCCAAGAAAGCGTCTTCTTGCTGGCTTGAGAAAGCAAGGTGCCGATGGTGATGGTACTTTTAATCTGCCACCAGTTGCCTCCTCCTCTTGTTCTCCTCCTCCCTCTCCTTCCTATGCCTCCACTTCTATTGAATTCAATATACGGCTTAACAATCTGTTGAGTGCTCATTCCAATACTAACCTATCACCTGAGGAGATAGTGGAGGCCTCAAGATCAGCTGCAGCTGCAGCTGTGAAGGCTGCAGAGGCCGCCAGGGCAGCAGCTGAAGAGAAGGCTGGGATTGCAGCAAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTGCAAAGAAATAAACCTGAGAAAGAATAAGCTGAAGAAACATGTCCCAGTTCAGCTTCTGTACACAAAATATCAACCTGTTGAGAATTGCAAGACAGATGAAGAGTTAGCCCGCAAATTGCATAGGGCAATAAATAGCTCCCCAAGAATCTTGAAGAATTCATCTAGTTCTGATGTTAGAGGCCAAAAACATAAGCAGTTTAAAAGCTCACCTGGTTCTGAGAAAACTAGGGTTTCCAATTGTGGCATCTCACAGGAGTTGGACCCCACTCCAACATGCAATGGGCATGCTAAAATTAGCGAGGCTGGCTCTGAATGCAGCTTTCAGGAAGTATACAAGCTCAAAGCTGATGAGAAGACTAGCAAATATGAGAAGAACAATCAATCTCGGACAGATAATGGAGAAGCAGAGACAAGTCAGAAGGAGAAATTGTGCGACGATATTAGTGTCACTGTTAAAAAGAGGGGAAGAGTAAAGCTTAAAAACTGCCCTTGA

mRNA sequence

ATGGAGGCTAACATCTGTGATGTCAATCACCTTGATGCCGATGCCCTTCTACCTCCAAGAAAGCGTCTTCTTGCTGGCTTGAGAAAGCAAGGTGCCGATGGTGATGGTACTTTTAATCTGCCACCAGTTGCCTCCTCCTCTTGTTCTCCTCCTCCCTCTCCTTCCTATGCCTCCACTTCTATTGAATTCAATATACGGCTTAACAATCTGTTGAGTGCTCATTCCAATACTAACCTATCACCTGAGGAGATAGTGGAGGCCTCAAGATCAGCTGCAGCTGCAGCTGTGAAGGCTGCAGAGGCCGCCAGGGCAGCAGCTGAAGAGAAGGCTGGGATTGCAGCAAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTGCAAAGAAATAAACCTGAGAAAGAATAAGCTGAAGAAACATGTCCCAGTTCAGCTTCTGTACACAAAATATCAACCTGTTGAGAATTGCAAGACAGATGAAGAGTTAGCCCGCAAATTGCATAGGGCAATAAATAGCTCCCCAAGAATCTTGAAGAATTCATCTAGTTCTGATGTTAGAGGCCAAAAACATAAGCAGTTTAAAAGCTCACCTGGTTCTGAGAAAACTAGGGTTTCCAATTGTGGCATCTCACAGGAGTTGGACCCCACTCCAACATGCAATGGGCATGCTAAAATTAGCGAGGCTGGCTCTGAATGCAGCTTTCAGGAAGTATACAAGCTCAAAGCTGATGAGAAGACTAGCAAATATGAGAAGAACAATCAATCTCGGACAGATAATGGAGAAGCAGAGACAAGTCAGAAGGAGAAATTGTGCGACGATATTAGTGTCACTGTTAAAAAGAGGGGAAGAGTAAAGCTTAAAAACTGCCCTTGA

Coding sequence (CDS)

ATGGAGGCTAACATCTGTGATGTCAATCACCTTGATGCCGATGCCCTTCTACCTCCAAGAAAGCGTCTTCTTGCTGGCTTGAGAAAGCAAGGTGCCGATGGTGATGGTACTTTTAATCTGCCACCAGTTGCCTCCTCCTCTTGTTCTCCTCCTCCCTCTCCTTCCTATGCCTCCACTTCTATTGAATTCAATATACGGCTTAACAATCTGTTGAGTGCTCATTCCAATACTAACCTATCACCTGAGGAGATAGTGGAGGCCTCAAGATCAGCTGCAGCTGCAGCTGTGAAGGCTGCAGAGGCCGCCAGGGCAGCAGCTGAAGAGAAGGCTGGGATTGCAGCAAAGGCTGTTGCAGCTGCAAAGAGTGCCATGGACTTGGTTGCCTCGATTTCTGAAGAAGCAGCCTGCAAAGAAATAAACCTGAGAAAGAATAAGCTGAAGAAACATGTCCCAGTTCAGCTTCTGTACACAAAATATCAACCTGTTGAGAATTGCAAGACAGATGAAGAGTTAGCCCGCAAATTGCATAGGGCAATAAATAGCTCCCCAAGAATCTTGAAGAATTCATCTAGTTCTGATGTTAGAGGCCAAAAACATAAGCAGTTTAAAAGCTCACCTGGTTCTGAGAAAACTAGGGTTTCCAATTGTGGCATCTCACAGGAGTTGGACCCCACTCCAACATGCAATGGGCATGCTAAAATTAGCGAGGCTGGCTCTGAATGCAGCTTTCAGGAAGTATACAAGCTCAAAGCTGATGAGAAGACTAGCAAATATGAGAAGAACAATCAATCTCGGACAGATAATGGAGAAGCAGAGACAAGTCAGAAGGAGAAATTGTGCGACGATATTAGTGTCACTGTTAAAAAGAGGGGAAGAGTAAAGCTTAAAAACTGCCCTTGA

Protein sequence

MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTSIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAAKSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAINSSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSECSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP
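The protein sequence is the conceptual translation of the CDS. A minimal sketch of that translation, assuming the standard genetic code and using only short excerpts from the start and end of the CDS above (the function name `translate` is an illustrative choice, not a database API):

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGAGGCTAACATCTGTGATGTCAATCAC"  # first 30 nt of the CDS
cds_end = "AAGCTTAAAAACTGCCCTTGA"             # last 21 nt of the CDS
print(translate(cds_start))  # MEANICDVNH
print(translate(cds_end))    # KLKNCP*
```

The two excerpts reproduce the first ten and last six residues of the protein sequence, with the final TGA read as the stop codon.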
Homology
BLAST of Sgr029749 vs. NCBI nr
Match: XP_022158290.1 (uncharacterized protein LOC111024810 [Momordica charantia])

HSP 1 Score: 514.6 bits (1324), Expect = 5.7e-142
Identity = 277/299 (92.64%), Positives = 286/299 (95.65%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLDAD LLPPRKRLLAGLRKQGADGDGTFNLPPVAS SCSPPPSPSYA+TS
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLRKQGADGDGTFNLPPVASPSCSPPPSPSYATTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRL+NLLSAHSNTNLSPEEIVEASRSAAAAAVKAA++ARAAAEEKA IAAKAVAAA
Sbjct: 61  IEFNIRLSNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAQSARAAAEEKAAIAAKAVAAA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQP+ENCKTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPIENCKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRI KNSSS DVRGQKHK+ KSSPGSEKTRVSNC ISQELDPTP+CNGHAKISEA SE
Sbjct: 181 SSPRITKNSSSPDVRGQKHKKLKSSPGSEKTRVSNCDISQELDPTPSCNGHAKISEASSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSF+EVYKLKADEKTSKYEKN QSR DNGEAETS+KEK CDDISVTVKKRGRVKLK  P
Sbjct: 241 CSFREVYKLKADEKTSKYEKNYQSRMDNGEAETSRKEKTCDDISVTVKKRGRVKLKKLP 299
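The identity and positive percentages in each HSP summary are the matched counts divided by the alignment length. As a small sketch, the reported values can be recomputed from the counts. The function name `hsp_percentages` and the regular expression are assumptions made for illustration, and the pattern accepts either spelling of the second label.

```python
import re

# Hypothetical parser for a BLAST HSP summary line.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(line):
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 277/299 (92.64%), Positives = 286/299 (95.65%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)  # 92.64 95.65
```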

BLAST of Sgr029749 vs. NCBI nr
Match: XP_023540700.1 (uncharacterized protein LOC111800985 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 473.4 bits (1217), Expect = 1.4e-129
Identity = 258/297 (86.87%), Positives = 273/297 (91.92%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLD+D LLPPRKRLLAGLRK+GADGDGTFN+PPVAS+SCSPPPSPSY  TS
Sbjct: 1   MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAAR AAEEKA IAAKAVAAA
Sbjct: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEI  RKNKLKKHVPVQ LYTKYQP+EN +TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SD RG KHK+ K+SPGSEK  VSNCGISQEL+PTPTCNGHAK +EA SE
Sbjct: 181 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKE-KLCDDISVTVKKRGRVKLK 297
           CSFQEVYKLK+DEKT KYEKNNQS+ D GEAETSQKE K CDDI+VT KK+GRVKLK
Sbjct: 241 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLK 297

BLAST of Sgr029749 vs. NCBI nr
Match: XP_022947836.1 (uncharacterized protein LOC111451593 [Cucurbita moschata])

HSP 1 Score: 473.4 bits (1217), Expect = 1.4e-129
Identity = 258/297 (86.87%), Positives = 273/297 (91.92%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLD+D LLPPRKRLLAGLRK+GADGDGTFN+PPVAS+SCSPPPSPSY  TS
Sbjct: 1   MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAAR AAEEKA IAAKAVAAA
Sbjct: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEI  RKNKLKKHVPVQ LYTKYQP+EN +TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SD RG KHK+ K+SPGSEK  VSNCGISQEL+PTPTCNGHAK +EA SE
Sbjct: 181 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKE-KLCDDISVTVKKRGRVKLK 297
           CSFQEVYKLK+DEKT KYEKNNQS+ D GEAETSQKE K CDDI+VT KK+GRVKLK
Sbjct: 241 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLK 297

BLAST of Sgr029749 vs. NCBI nr
Match: XP_011650244.1 (uncharacterized protein LOC101214022 [Cucumis sativus] >KGN55595.1 hypothetical protein Csa_009725 [Cucumis sativus])

HSP 1 Score: 473.0 bits (1216), Expect = 1.9e-129
Identity = 259/299 (86.62%), Positives = 272/299 (90.97%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           ME NICDVNHL++D LLPPRKRLLAGLRKQG DGDGTFNLPPVASSSCSPPPSPSY  TS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSSCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLN+LLSAHSN+NLSPEEIV+ASRSAAAAAVKAAEAARAAAEEKA IAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVQASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEINLRKNKLKKHVPVQLLYTKYQP+EN KTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQLLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SDVR  KHK+ KSS  SEK RVSNCGISQ+LDPT TCNGHAK +E  SE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPTTTCNGHAKPNEVDSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSFQEVYKLK DEKTSKYEK+N S TDNGE ETSQKEK+CDDISVT+KKRGRVKLK  P
Sbjct: 241 CSFQEVYKLKPDEKTSKYEKSNPSLTDNGE-ETSQKEKMCDDISVTIKKRGRVKLKKLP 298

BLAST of Sgr029749 vs. NCBI nr
Match: KAG6596283.1 (hypothetical protein SDJN03_09463, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 472.2 bits (1214), Expect = 3.2e-129
Identity = 258/297 (86.87%), Positives = 273/297 (91.92%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLD+D LLPPRKRLLAGLRK+GADGDGTFN+PPVAS+SCSPPPSPSY  TS
Sbjct: 71  MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 130

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAAR AAEEKA IAAKAVAAA
Sbjct: 131 IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 190

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEI  RKNKLKKHVPVQ LYTKYQP+EN +TDEE+ARKLH+AIN
Sbjct: 191 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHQAIN 250

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SD RG KHK+ K+SPGSEK  VSNCGISQEL+PTPTCNGHAK +EA SE
Sbjct: 251 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEAVSE 310

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKE-KLCDDISVTVKKRGRVKLK 297
           CSFQEVYKLK+DEKT KYEKNNQS+ D GEAETSQKE K CDDISVT KK+GRVKLK
Sbjct: 311 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDISVTGKKKGRVKLK 367

BLAST of Sgr029749 vs. ExPASy TrEMBL
Match: A0A6J1DVF5 (uncharacterized protein LOC111024810 OS=Momordica charantia OX=3673 GN=LOC111024810 PE=4 SV=1)

HSP 1 Score: 514.6 bits (1324), Expect = 2.7e-142
Identity = 277/299 (92.64%), Positives = 286/299 (95.65%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLDAD LLPPRKRLLAGLRKQGADGDGTFNLPPVAS SCSPPPSPSYA+TS
Sbjct: 1   MEANICDVNHLDADVLLPPRKRLLAGLRKQGADGDGTFNLPPVASPSCSPPPSPSYATTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRL+NLLSAHSNTNLSPEEIVEASRSAAAAAVKAA++ARAAAEEKA IAAKAVAAA
Sbjct: 61  IEFNIRLSNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAQSARAAAEEKAAIAAKAVAAA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQP+ENCKTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPIENCKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRI KNSSS DVRGQKHK+ KSSPGSEKTRVSNC ISQELDPTP+CNGHAKISEA SE
Sbjct: 181 SSPRITKNSSSPDVRGQKHKKLKSSPGSEKTRVSNCDISQELDPTPSCNGHAKISEASSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSF+EVYKLKADEKTSKYEKN QSR DNGEAETS+KEK CDDISVTVKKRGRVKLK  P
Sbjct: 241 CSFREVYKLKADEKTSKYEKNYQSRMDNGEAETSRKEKTCDDISVTVKKRGRVKLKKLP 299

BLAST of Sgr029749 vs. ExPASy TrEMBL
Match: A0A6J1G7K8 (uncharacterized protein LOC111451593 OS=Cucurbita moschata OX=3662 GN=LOC111451593 PE=4 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 7.0e-130
Identity = 258/297 (86.87%), Positives = 273/297 (91.92%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           MEANICDVNHLD+D LLPPRKRLLAGLRK+GADGDGTFN+PPVAS+SCSPPPSPSY  TS
Sbjct: 1   MEANICDVNHLDSDVLLPPRKRLLAGLRKKGADGDGTFNVPPVASTSCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAAR AAEEKA IAAKAVAAA
Sbjct: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARTAAEEKAAIAAKAVAAA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEI  RKNKLKKHVPVQ LYTKYQP+EN +TDEE+ARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEIKQRKNKLKKHVPVQFLYTKYQPLENTRTDEEMARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SD RG KHK+ K+SPGSEK  VSNCGISQEL+PTPTCNGHAK +EA SE
Sbjct: 181 SSPRILKNSSGSDARGHKHKKLKTSPGSEKIMVSNCGISQELEPTPTCNGHAKTNEADSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKE-KLCDDISVTVKKRGRVKLK 297
           CSFQEVYKLK+DEKT KYEKNNQS+ D GEAETSQKE K CDDI+VT KK+GRVKLK
Sbjct: 241 CSFQEVYKLKSDEKTIKYEKNNQSQMDTGEAETSQKEKKTCDDINVTGKKKGRVKLK 297

BLAST of Sgr029749 vs. ExPASy TrEMBL
Match: A0A0A0L0X5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G000190 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 9.2e-130
Identity = 259/299 (86.62%), Positives = 272/299 (90.97%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           ME NICDVNHL++D LLPPRKRLLAGLRKQG DGDGTFNLPPVASSSCSPPPSPSY  TS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSSCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLN+LLSAHSN+NLSPEEIV+ASRSAAAAAVKAAEAARAAAEEKA IAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVQASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEINLRKNKLKKHVPVQLLYTKYQP+EN KTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQLLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SDVR  KHK+ KSS  SEK RVSNCGISQ+LDPT TCNGHAK +E  SE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPTTTCNGHAKPNEVDSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSFQEVYKLK DEKTSKYEK+N S TDNGE ETSQKEK+CDDISVT+KKRGRVKLK  P
Sbjct: 241 CSFQEVYKLKPDEKTSKYEKSNPSLTDNGE-ETSQKEKMCDDISVTIKKRGRVKLKKLP 298

BLAST of Sgr029749 vs. ExPASy TrEMBL
Match: A0A5D3CHI9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G00150 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 7.8e-129
Identity = 258/299 (86.29%), Positives = 270/299 (90.30%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           ME NICDVNHL++D LLPPRKRLLAGLRKQG DGDGTFNLPPVASS+CSPPPSPSY  TS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSTCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLN+LLSAHSN+NLSPEEIVEASRSAAAAAVKAAEAARAAAEEKA IAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEINLRKNKLKKHVPVQ LYTKYQP+EN KTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQFLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SDVR  KHK+ KSS  SEK RVSNCGISQ+LDP  TCNGHAK +EA SE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPATTCNGHAKSNEADSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSFQEVYK K DEKTSKYEKNNQS TD+GE ETSQKEK CDDISVT+KKRGRVKLK  P
Sbjct: 241 CSFQEVYKPKPDEKTSKYEKNNQSLTDDGE-ETSQKEKTCDDISVTIKKRGRVKLKKLP 298

BLAST of Sgr029749 vs. ExPASy TrEMBL
Match: A0A1S3BJP5 (uncharacterized protein LOC103490642 OS=Cucumis melo OX=3656 GN=LOC103490642 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 7.8e-129
Identity = 258/299 (86.29%), Positives = 270/299 (90.30%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           ME NICDVNHL++D LLPPRKRLLAGLRKQG DGDGTFNLPPVASS+CSPPPSPSY  TS
Sbjct: 1   MEGNICDVNHLNSDVLLPPRKRLLAGLRKQGGDGDGTFNLPPVASSTCSPPPSPSYGFTS 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           IEFNIRLN+LLSAHSN+NLSPEEIVEASRSAAAAAVKAAEAARAAAEEKA IAA+AV  A
Sbjct: 61  IEFNIRLNSLLSAHSNSNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAAIAARAVTVA 120

Query: 121 KSAMDLVASISEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRAIN 180
           KSAMDLVASISEEAA KEINLRKNKLKKHVPVQ LYTKYQP+EN KTDEELARKLHRAIN
Sbjct: 121 KSAMDLVASISEEAAYKEINLRKNKLKKHVPVQFLYTKYQPLENTKTDEELARKLHRAIN 180

Query: 181 SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKISEAGSE 240
           SSPRILKNSS SDVR  KHK+ KSS  SEK RVSNCGISQ+LDP  TCNGHAK +EA SE
Sbjct: 181 SSPRILKNSSGSDVRSHKHKKLKSSTSSEKIRVSNCGISQDLDPATTCNGHAKSNEADSE 240

Query: 241 CSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETSQKEKLCDDISVTVKKRGRVKLKNCP 300
           CSFQEVYK K DEKTSKYEKNNQS TD+GE ETSQKEK CDDISVT+KKRGRVKLK  P
Sbjct: 241 CSFQEVYKPKPDEKTSKYEKNNQSLTDDGE-ETSQKEKTCDDISVTIKKRGRVKLKKLP 298

BLAST of Sgr029749 vs. TAIR 10
Match: AT4G35510.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17540.3); Has 182 Blast hits to 179 proteins in 73 species: Archae - 0; Bacteria - 87; Metazoa - 17; Fungi - 9; Plants - 50; Viruses - 0; Other Eukaryotes - 19 (source: NCBI BLink). )

HSP 1 Score: 157.9 bits (398), Expect = 1.3e-38
Identity = 134/305 (43.93%), Positives = 181/305 (59.34%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADG-DGTFNLPPVASSSCSPPPSPSYAST 60
           ME N CD+N LD+D+ LPPRKRLLAG +K  ++G +G+      +SSS S     S AST
Sbjct: 1   METNPCDMNQLDSDSHLPPRKRLLAGFKKFNSNGINGSSPSDFASSSSTSNSNGSSSAST 60

Query: 61  SIEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAA 120
           +++    L NLLS+  N + SPEE+VEA+RSAAA AVKAA+AARA A EKA I+AKA+AA
Sbjct: 61  NVQ--THLGNLLSSPFNNDQSPEELVEATRSAAALAVKAAKAARAIANEKALISAKAIAA 120

Query: 121 AKSAMDLVASISEEAA--CKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHR 180
           AK A++LV S  +EA   CKE + RKNK KKHVPV+LLY+K Q  +    +++LAR+LHR
Sbjct: 121 AKRALELVDSFPKEAMADCKERSPRKNKQKKHVPVELLYSKGQLRDE---EDDLARRLHR 180

Query: 181 AIN--SSPRILKNSSSSDVRGQKHKQFKSSPGSEKTRVSNCGISQELDPTPTCNGHAKIS 240
           AI+  S PR+L+ S  +  R +K K+ KS                         G + I 
Sbjct: 181 AIDNTSYPRVLRTSEENGQRYKKQKKNKS---------------------VVEGGSSSII 240

Query: 241 EAGSECSFQEVYKLKADEKTSKYEKNNQSRTDNGEAETS-QKEKLCDDISVTVKKRGRVK 300
             GS      V      +  S YE    +R++  EA++    EK  ++ +  VK+RGRVK
Sbjct: 241 VTGSMKDIAGVV-----DSDSSYEGLEIARSNRDEADSMLMMEKSGEESNSLVKRRGRVK 274

BLAST of Sgr029749 vs. TAIR 10
Match: AT2G17540.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G35510.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 84.3 bits (207), Expect = 1.8e-16
Identity = 79/207 (38.16%), Positives = 110/207 (53.14%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           M  N C    L++D+LLPPRKRLLAG + Q +        PP ASSS S   S   ++++
Sbjct: 1   MATNAC----LESDSLLPPRKRLLAGFKNQNSRISNESPSPPFASSSSSTTTSNGSSASA 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           +     L++LL+  +    SPEE+ +AS++ AA AVK A+AARA A EKA IA+KAVAAA
Sbjct: 61  VVHTHHLDHLLNDQTR---SPEELAQASKATAALAVKVAKAARATANEKAIIASKAVAAA 120

Query: 121 KSAMDLVASI--SEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRA 180
           K+A++L AS   +E  +CKE                                   +L RA
Sbjct: 121 KNALELFASFPAAETVSCKE----------------------------------PRLIRA 164

Query: 181 INSSPRILKNSSSSDVRGQKHKQFKSS 206
           IN+SPR+L + S    R +K K   S+
Sbjct: 181 INNSPRVLTDCSGH--RNKKQKTLTST 164

BLAST of Sgr029749 vs. TAIR 10
Match: AT2G17540.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G35510.1); Has 39 Blast hits to 39 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 84.3 bits (207), Expect = 1.8e-16
Identity = 79/207 (38.16%), Positives = 110/207 (53.14%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           M  N C    L++D+LLPPRKRLLAG + Q +        PP ASSS S   S   ++++
Sbjct: 1   MATNAC----LESDSLLPPRKRLLAGFKNQNSRISNESPSPPFASSSSSTTTSNGSSASA 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           +     L++LL+  +    SPEE+ +AS++ AA AVK A+AARA A EKA IA+KAVAAA
Sbjct: 61  VVHTHHLDHLLNDQTR---SPEELAQASKATAALAVKVAKAARATANEKAIIASKAVAAA 120

Query: 121 KSAMDLVASI--SEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRA 180
           K+A++L AS   +E  +CKE                                   +L RA
Sbjct: 121 KNALELFASFPAAETVSCKE----------------------------------PRLIRA 164

Query: 181 INSSPRILKNSSSSDVRGQKHKQFKSS 206
           IN+SPR+L + S    R +K K   S+
Sbjct: 181 INNSPRVLTDCSGH--RNKKQKTLTST 164

BLAST of Sgr029749 vs. TAIR 10
Match: AT2G17540.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G35510.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 84.3 bits (207), Expect = 1.8e-16
Identity = 79/207 (38.16%), Positives = 110/207 (53.14%), Query Frame = 0

Query: 1   MEANICDVNHLDADALLPPRKRLLAGLRKQGADGDGTFNLPPVASSSCSPPPSPSYASTS 60
           M  N C    L++D+LLPPRKRLLAG + Q +        PP ASSS S   S   ++++
Sbjct: 1   MATNAC----LESDSLLPPRKRLLAGFKNQNSRISNESPSPPFASSSSSTTTSNGSSASA 60

Query: 61  IEFNIRLNNLLSAHSNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAAKAVAAA 120
           +     L++LL+  +    SPEE+ +AS++ AA AVK A+AARA A EKA IA+KAVAAA
Sbjct: 61  VVHTHHLDHLLNDQTR---SPEELAQASKATAALAVKVAKAARATANEKAIIASKAVAAA 120

Query: 121 KSAMDLVASI--SEEAACKEINLRKNKLKKHVPVQLLYTKYQPVENCKTDEELARKLHRA 180
           K+A++L AS   +E  +CKE                                   +L RA
Sbjct: 121 KNALELFASFPAAETVSCKE----------------------------------PRLIRA 164

Query: 181 INSSPRILKNSSSSDVRGQKHKQFKSS 206
           IN+SPR+L + S    R +K K   S+
Sbjct: 181 INNSPRVLTDCSGH--RNKKQKTLTST 164

BLAST of Sgr029749 vs. TAIR 10
Match: AT5G66000.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17540.3); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 70.1 bits (170), Expect = 3.4e-12
Identity = 63/141 (44.68%), Positives = 90/141 (63.83%), Query Frame = 0

Query: 56  YASTSIEFNIRLNNLLSAH-SNTNLSPEEIVEASRSAAAAAVKAAEAARAAAEEKAGIAA 115
           + ST++++++ +  LL++H SN +L+P+EI +ASR  AAAA  AA+AARA A+EKA  AA
Sbjct: 18  HPSTNVDYHLSI--LLASHLSNPDLTPQEIADASRCTAAAAAIAAKAARATADEKAAAAA 77

Query: 116 KAVAAAKSAMDLVASI-SEEAACKEINLRKNK--LKKHVPVQLLYTKYQPVENCKTDEEL 175
           KAVAAAK+A+DL+AS    +   ++  L K+K   KKHV   LL++K         D+ L
Sbjct: 78  KAVAAAKTALDLIASFPPNQGLVQDACLHKDKKMKKKHVAADLLFSK---------DDAL 137

Query: 176 ARKLHRAINSSPRILKNSSSS 193
             KL      S  I+ NSSSS
Sbjct: 138 PSKLQLGGVVSQGIVSNSSSS 147

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022158290.1  5.7e-142  92.64     uncharacterized protein LOC111024810 [Momordica charantia]
XP_023540700.1  1.4e-129  86.87     uncharacterized protein LOC111800985 [Cucurbita pepo subsp. pepo]
XP_022947836.1  1.4e-129  86.87     uncharacterized protein LOC111451593 [Cucurbita moschata]
XP_011650244.1  1.9e-129  86.62     uncharacterized protein LOC101214022 [Cucumis sativus] >KGN55595.1 hypothetical ...
KAG6596283.1    3.2e-129  86.87     hypothetical protein SDJN03_09463, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity  Description
A0A6J1DVF5      2.7e-142  92.64     uncharacterized protein LOC111024810 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1G7K8      7.0e-130  86.87     uncharacterized protein LOC111451593 OS=Cucurbita moschata OX=3662 GN=LOC1114515...
A0A0A0L0X5      9.2e-130  86.62     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G000190 PE=4 SV=1
A0A5D3CHI9      7.8e-129  86.29     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BJP5      7.8e-129  86.29     uncharacterized protein LOC103490642 OS=Cucumis melo OX=3656 GN=LOC103490642 PE=...

Match Name      E-value   Identity  Description
AT4G35510.1     1.3e-38   43.93     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G17540.2     1.8e-16   38.16     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G17540.1     1.8e-16   38.16     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G17540.3     1.8e-16   38.16     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G66000.1     3.4e-12   44.68     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term    Source Description    Alignment
None      No IPR available  COILS        Coil           Coil                  coord: 89..109
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 182..214
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 252..277
None      No IPR available  PANTHER      PTHR35477:SF1  OS06G0728500 PROTEIN  coord: 1..297
None      No IPR available  PANTHER      PTHR35477      OS06G0728500 PROTEIN  coord: 1..297
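The Alignment column lists residue ranges on the protein. A short sketch of how the length of each predicted feature could be computed from those ranges, assuming the coordinates are 1-based and inclusive (the helper name `coord_length` is hypothetical):

```python
import re

# Source and Alignment values copied from the InterPro table above.
features = [
    ("COILS", "coord: 89..109"),
    ("MOBIDB_LITE", "coord: 182..214"),
    ("MOBIDB_LITE", "coord: 252..277"),
    ("PANTHER", "coord: 1..297"),
]

def coord_length(text):
    """Return the number of residues covered by a 'coord: a..b' string."""
    start, end = map(int, re.search(r"(\d+)\.\.(\d+)", text).groups())
    return end - start + 1  # assumption: both ends are inclusive

lengths = [(source, coord_length(coord)) for source, coord in features]
for source, length in lengths:
    print(f"{source}: {length} residues")
```

On that reading, the PANTHER family match covers 297 of the 299 residues, while the coiled-coil and disorder predictions are short internal segments.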

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr029749.1   Sgr029749.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane