Sgr026798 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr026798
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00153047: 994539 .. 997376 (+)
RNA-Seq Expression: Sgr026798
Synteny: Sgr026798
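The location string above can be split into contig, start, end, and strand. The following is a minimal sketch; `parse_location` and `span_length` are hypothetical helper names, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re

# Hypothetical pattern for strings of the form "contig: start .. end (strand)".
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Return (contig, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153047: 994539 .. 997376 (+)")
print(contig, strand, span_length(start, end))
```

Under that assumption the gene spans 2838 bases on the forward strand of tig00153047.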
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAATTAGGCTGTCGAAAATACATATTCTTTTCTTCACTCGCCGCTTGAGGCTACCAATAGACACAACAAAGGAGGCGGCTCTGTTGCTTCGCTTCGCTATTTCTTCTCCTTTTTCAAACCCACTCTCCAAAATCGTCGAATTCCTTCGACTTTCCCCTCATCTTATTCCTCTGCATAAGAATTTTCATCATGGGTATTCATAAATTGTGCTTCAAATTTAATGGGCTCGCATTGGAGTTCATCAAAATGGATTGGTAGAGACGGCACTGGACTTGGCTGCTTAATTTTTTCTCTTTCTCGGACATGTCTGTCAAATTTGCTTCTTTTTCTTGCTCGGTTCCTTCTCTATCTTCACCCTCTGATTGCCGTAGAAACTTACAGGTGCTTTTTGGTCCTATTCGCTTACTCATATGTTAATGCTTTCTCTGTAAATTTTAATGCTTTCTCTGTAAATTGATGTTCTATAAATTGATGTTTCAACATTGTTCCTGTGGGTTAGAACCTTCCTTTCTTTTTATTTTCTTATTTTTCTTTCTCATATGGGCGTGGAGTGAAAGCCCTGGTAAATTTAATAACGTAGTAAGCAAACACAAGCTAGGTTTAGGATTGTGCGCAGGCGCGCTCTGGTGATACCTTTATCAAGAATCAATAATGTATTGAGTGTCTATAATGATTGGCCATACTGATATGATATCTTTATTGAGAGCCTTACTTGCCATCTCTAACTTTTTACTTTGACTTCTACTGACTAGTGAGGGAATTCTAGTATGAAAATCTTATGATTTTTAAATTATTGCTGTGACTTTATGCTATATGAGCTAAAATGTTCAATATTTTCTTAAAGATATCATTTGATTTAATGGGCTCTGTGCATTTGTTTTGATTTCCAGCTGAGTTTTCCACAAGGATTATATGGTAGCCAGCAACATAATTTCCAGAGGCTAACATATGCTAAAGGTAGGTTTTTCTTATACTTTCTTTTTTTCATAATTTCATGAAATAACAGGCGATTTAAATCCTTAGGTTGTTTCTTTATATACAAAATGGCTATAAAATGGGCCATCTGGTTTCTATTTTCTAGACTTCTCATAGTCAGCATGTCTATTATTATTTTTCTGTTCACACTGTGGGTTACCTGGTGAGATGCCTATTGTGACTTATTTCCCAAAATGTCTATTTTGATCAAATCCAATGCTTTTGATGTTTTTCTACTACGAAATGATATGTTTCTTGGTTCTCTTTTGGAATTGTGGATAGTTGAATCATTCAAGTTAATGCATGGATTGCTGAGGTCTTGGAGAAGATTCCCAAAGAGGCAAAATGTTCTCTCTGCTATTTCGGAGGACCAGTCTCAATGCAGTGAGTCAAATGATCCAGAGCAAGTAAATCAGCATCTCACTGATGAAGATATCTCTCTTGAAAGCAATTCATCTTTATATTATGAGGGCACTGGTGGTAAACCGGGTTTTATCTCATTTTACAACCATTCTTATAAAGAAGGAAATCAGATTCCTTTATCTAGCGCACAAAGCAATCAATACAATTTCTTATGGTTTGTTGGTCCAGCGCTCCTCGTAGCCTCTTTCATTTTCCCCTCTCTTTATCTGCGAAAAATACTTTCAAATATTTTTGAGGACTCTCTGTTAACAGGTCAGTACCCTTTCTATAGTCTCTTAACTATGTTAGTATATTGATATATATTTAGTAAAATTTCCTGTGTGTGAACGTGCCCCTTATCAGCAAGGTCAAAAAAGGCCAAAGGAGACATTTAAGATATACCATCTATTTATGGCTTTTTATTGAGTATGCGTATAAAGGCAATTAAGTATAGTGTTTTAAAAATCATTCTTTAAGAATATTCAAGGCAAAGGAGATTATTGTAGTCACCACTCTTAGATCTTAAAGGAGTTCGGTGATTTTTTAAAAATTTTCATGCTCTGTTAAATATGAAATTTATTTGCTCATGCTAAATATCTACATGACGAAACCTTGATTC
TGTTGAAATTCTTACTTGGCGTTTGAAATCCTCTTTTGTAAACTACAAACTATCTTCTGATTGCAGACTTCCTTATATTGTTCTTCACGGAAGCACTGTTCTATTGTGGTGTTGCGGTGTTTCTTTTTTTAATAGACCGTTTGAGGAGGCTTACTGAAACTGCTAATGTGACAAACAAGTACGAAATTCTGTCTAATCAATTGGGACAGAGGATCTCTTCTGTAGCTGCATTAGTACTCAGTTTAATAATTCCCATGGTCACTATGGGATTAGTATGGCCATGGACTGGCCCTGCGGCATCTGCAACTCTTGCACCTTACCTTGTTGGCATTGTTGTCCAATTTGCATTTGAACAGTATGCGAGGCGCAAGAAATCACCTTCATGGCCTGTTATTCCAATTGTCTTTCAAGTAAGGACACTCGTATCTCATCATTCTCTCTCCACTTCTTTCTGTTGCCTTGCCCGTCTATGCTAGATTTTTTTTCATCAGTTGTATACGAAAATTCGTATATTAGTAAGACATTCCTGGTTATCATTTCTCTCTATGGATAATTTTTATGGCCTTGTATTATATATTTTTCAATATTTTGGCCTTTGAATTGAAGAGATCTTGTTAAACAGGTTTATAGATTGCATCAACTTAATCGAGCAGCTCAACTGGTGACAGCACTTTCATTTACCATAAAAGGAGCTGAGATGACGCCACACAACTTGGCAATAAACAGCTCTCTGGGTACACTGTTGAATGTTCTTCAATGTCTTGGGATAATCTGCATTTGGTCTCTCTCGAGCTTTCTTATGAGGTTTTTCCCGTCAAATGCTACAACGCTGCAGTAA

mRNA sequence

ATGGAAATTAGGCTGTCGAAAATACATATTCTTTTCTTCACTCGCCGCTTGAGGCTACCAATAGACACAACAAAGGAGGCGGCTCTGTTGCTTCGCTTCGCTATTTCTTCTCCTTTTTCAAACCCACTCTCCAAAATCGTCGAATTCCTTCGACTTTCCCCTCATCTTATTCCTCTGCATAAGAATTTTCATCATGGACGGCACTGGACTTGGCTGCTTAATTTTTTCTCTTTCTCGGACATGTCTGTCAAATTTGCTTCTTTTTCTTGCTCGGTTCCTTCTCTATCTTCACCCTCTGATTGCCGTAGAAACTTACAGCTGAGTTTTCCACAAGGATTATATGGTAGCCAGCAACATAATTTCCAGAGGCTAACATATGCTAAAGTTGAATCATTCAAGTTAATGCATGGATTGCTGAGGTCTTGGAGAAGATTCCCAAAGAGGCAAAATGTTCTCTCTGCTATTTCGGAGGACCAGTCTCAATGCAGTGAGTCAAATGATCCAGAGCAAGTAAATCAGCATCTCACTGATGAAGATATCTCTCTTGAAAGCAATTCATCTTTATATTATGAGGGCACTGGTGGTAAACCGGGTTTTATCTCATTTTACAACCATTCTTATAAAGAAGGAAATCAGATTCCTTTATCTAGCGCACAAAGCAATCAATACAATTTCTTATGGTTTGTTGGTCCAGCGCTCCTCGTAGCCTCTTTCATTTTCCCCTCTCTTTATCTGCGAAAAATACTTTCAAATATTTTTGAGGACTCTCTGTTAACAGACTTCCTTATATTGTTCTTCACGGAAGCACTGTTCTATTGTGGTGTTGCGGTGTTTCTTTTTTTAATAGACCGTTTGAGGAGGCTTACTGAAACTGCTAATGTGACAAACAAGTACGAAATTCTGTCTAATCAATTGGGACAGAGGATCTCTTCTGTAGCTGCATTAGTACTCAGTTTAATAATTCCCATGGTCACTATGGGATTAGTATGGCCATGGACTGGCCCTGCGGCATCTGCAACTCTTGCACCTTACCTTGTTGGCATTGTTGTCCAATTTGCATTTGAACAGTATGCGAGGCGCAAGAAATCACCTTCATGGCCTGTTATTCCAATTGTCTTTCAAGTTTATAGATTGCATCAACTTAATCGAGCAGCTCAACTGGTGACAGCACTTTCATTTACCATAAAAGGAGCTGAGATGACGCCACACAACTTGGCAATAAACAGCTCTCTGGGTACACTGTTGAATGTTCTTCAATGTCTTGGGATAATCTGCATTTGGTCTCTCTCGAGCTTTCTTATGAGGTTTTTCCCGTCAAATGCTACAACGCTGCAGTAA

Coding sequence (CDS)

ATGGAAATTAGGCTGTCGAAAATACATATTCTTTTCTTCACTCGCCGCTTGAGGCTACCAATAGACACAACAAAGGAGGCGGCTCTGTTGCTTCGCTTCGCTATTTCTTCTCCTTTTTCAAACCCACTCTCCAAAATCGTCGAATTCCTTCGACTTTCCCCTCATCTTATTCCTCTGCATAAGAATTTTCATCATGGACGGCACTGGACTTGGCTGCTTAATTTTTTCTCTTTCTCGGACATGTCTGTCAAATTTGCTTCTTTTTCTTGCTCGGTTCCTTCTCTATCTTCACCCTCTGATTGCCGTAGAAACTTACAGCTGAGTTTTCCACAAGGATTATATGGTAGCCAGCAACATAATTTCCAGAGGCTAACATATGCTAAAGTTGAATCATTCAAGTTAATGCATGGATTGCTGAGGTCTTGGAGAAGATTCCCAAAGAGGCAAAATGTTCTCTCTGCTATTTCGGAGGACCAGTCTCAATGCAGTGAGTCAAATGATCCAGAGCAAGTAAATCAGCATCTCACTGATGAAGATATCTCTCTTGAAAGCAATTCATCTTTATATTATGAGGGCACTGGTGGTAAACCGGGTTTTATCTCATTTTACAACCATTCTTATAAAGAAGGAAATCAGATTCCTTTATCTAGCGCACAAAGCAATCAATACAATTTCTTATGGTTTGTTGGTCCAGCGCTCCTCGTAGCCTCTTTCATTTTCCCCTCTCTTTATCTGCGAAAAATACTTTCAAATATTTTTGAGGACTCTCTGTTAACAGACTTCCTTATATTGTTCTTCACGGAAGCACTGTTCTATTGTGGTGTTGCGGTGTTTCTTTTTTTAATAGACCGTTTGAGGAGGCTTACTGAAACTGCTAATGTGACAAACAAGTACGAAATTCTGTCTAATCAATTGGGACAGAGGATCTCTTCTGTAGCTGCATTAGTACTCAGTTTAATAATTCCCATGGTCACTATGGGATTAGTATGGCCATGGACTGGCCCTGCGGCATCTGCAACTCTTGCACCTTACCTTGTTGGCATTGTTGTCCAATTTGCATTTGAACAGTATGCGAGGCGCAAGAAATCACCTTCATGGCCTGTTATTCCAATTGTCTTTCAAGTTTATAGATTGCATCAACTTAATCGAGCAGCTCAACTGGTGACAGCACTTTCATTTACCATAAAAGGAGCTGAGATGACGCCACACAACTTGGCAATAAACAGCTCTCTGGGTACACTGTTGAATGTTCTTCAATGTCTTGGGATAATCTGCATTTGGTCTCTCTCGAGCTTTCTTATGAGGTTTTTCCCGTCAAATGCTACAACGCTGCAGTAA

Protein sequence

MEIRLSKIHILFFTRRLRLPIDTTKEAALLLRFAISSPFSNPLSKIVEFLRLSPHLIPLHKNFHHGRHWTWLLNFFSFSDMSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLYGSQQHNFQRLTYAKVESFKLMHGLLRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPGFISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLLTDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLSLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLHQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFPSNATTLQ
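The protein sequence corresponds to a translation of the coding sequence. The sketch below illustrates this on the first and last few codons of the CDS only; it assumes the standard nuclear genetic code, and `translate` is a hypothetical helper written for this illustration, not the annotation pipeline's own method.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last six codons of the CDS shown above.
print(translate("ATGGAAATTAGGCTGTCGAAAATACAT"))
print(translate("GCTACAACGCTGCAGTAA"))
```

The two fragments translate to the start (MEIRLSKIH) and the end (ATTLQ) of the listed protein sequence.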
Homology
BLAST of Sgr026798 vs. NCBI nr
Match: XP_022135241.1 (uncharacterized protein LOC111007252 [Momordica charantia])

HSP 1 Score: 625.5 bits (1612), Expect = 3.4e-175
Identity = 326/366 (89.07%), Positives = 339/366 (92.62%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLYGSQQHNFQRLTYAKVESFKLMHGLLR 140
           MSVKFASFSCS+PSL SPSD RR+LQLS  QGLYGSQQHNFQRLT+AKVES KL+HG L+
Sbjct: 1   MSVKFASFSCSLPSLPSPSDTRRSLQLSLSQGLYGSQQHNFQRLTFAKVESIKLVHGSLK 60

Query: 141 SWRRFPKRQNVLSAISEDQSQ-CSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPGF 200
           SWR FPKRQNVLSAISEDQS  CSE  DPEQVNQH TDEDISLE+NS LYYEGTGGKPGF
Sbjct: 61  SWRGFPKRQNVLSAISEDQSTLCSELVDPEQVNQHPTDEDISLENNSFLYYEGTGGKPGF 120

Query: 201 ISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLLT 260
           ISFYNHSYKEGN++PLSS Q N+YNFLWFVGPA+LVASFIFPSLYLRKILSNIFEDSLLT
Sbjct: 121 ISFYNHSYKEGNRVPLSSTQRNEYNFLWFVGPAVLVASFIFPSLYLRKILSNIFEDSLLT 180

Query: 261 DFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLSL 320
           DFLILFFTEALFYCGVAVFLFLID  RR TE   V N YE LSNQ GQRISSVA LVLSL
Sbjct: 181 DFLILFFTEALFYCGVAVFLFLIDCSRRPTEPDTVKNYYETLSNQFGQRISSVATLVLSL 240

Query: 321 IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLH 380
           IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIP+VFQVYRLH
Sbjct: 241 IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPVVFQVYRLH 300

Query: 381 QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFPS 440
           QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQ LGIICIWSLSSFLMR+FPS
Sbjct: 301 QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQGLGIICIWSLSSFLMRYFPS 360

Query: 441 NATTLQ 446
           NATT+Q
Sbjct: 361 NATTMQ 366
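The percentages on the HSP summary line are the match counts divided by the alignment length. The sketch below reproduces them from the reported counts and shows how identities can be counted for one aligned block; `percent` and `count_identities` are hypothetical helper names, and treating a gap character (`-`) as a non-match is an assumption of this sketch.

```python
def percent(matches, aligned_length):
    """Matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query, subject):
    """Count aligned positions where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))


# Counts reported for the Momordica charantia hit (XP_022135241.1).
print(percent(326, 366))  # identity
print(percent(339, 366))  # similar (conservative) substitutions included

# Last alignment block of the same hit: query 441-446 vs. subject 361-366.
print(count_identities("NATTLQ", "NATTMQ"))
```

The two percentages come out as 89.07 and 92.62, matching the summary line, and the final block contributes 5 identical positions out of 6.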

BLAST of Sgr026798 vs. NCBI nr
Match: XP_008464550.1 (PREDICTED: uncharacterized protein LOC103502400 [Cucumis melo])

HSP 1 Score: 597.8 bits (1540), Expect = 7.6e-167
Identity = 314/367 (85.56%), Positives = 335/367 (91.28%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQRLT+AKVES KLMHGL
Sbjct: 1   MSVKFASISCSVPTLSSPSDARRSFQLSFSRGLYGIGSQQRSFQRLTFAKVESIKLMHGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSWRRFP+R+NVLSAISE+QSQCSE  + E+V +H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWRRFPRRKNVLSAISENQSQCSEQVELERVTEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYN+S KEGN+IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNNS-KEGNRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLID  RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDCSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKS SWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSSSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATTLQ 446
           SNA T+Q
Sbjct: 361 SNAATVQ 366

BLAST of Sgr026798 vs. NCBI nr
Match: KAA0057843.1 (uncharacterized protein E6C27_scaffold274G001300 [Cucumis melo var. makuwa] >TYJ98527.1 uncharacterized protein E5676_scaffold350G001320 [Cucumis melo var. makuwa])

HSP 1 Score: 595.1 bits (1533), Expect = 4.9e-166
Identity = 313/365 (85.75%), Positives = 333/365 (91.23%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQRLT+AKVES KLMHGL
Sbjct: 1   MSVKFASISCSVPTLSSPSDARRSFQLSFSRGLYGIGSQQRSFQRLTFAKVESIKLMHGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSWRRFP+R+NVLSAISE+QSQCSE  + E+V +H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWRRFPRRKNVLSAISENQSQCSEQVELERVTEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYN+S KEGN+IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNNS-KEGNRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLID  RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDCSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKS SWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSSSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATT 444
           SNA T
Sbjct: 361 SNAAT 364

BLAST of Sgr026798 vs. NCBI nr
Match: XP_038879012.1 (uncharacterized protein LOC120071064 isoform X1 [Benincasa hispida])

HSP 1 Score: 587.4 bits (1513), Expect = 1.0e-163
Identity = 312/366 (85.25%), Positives = 330/366 (90.16%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVK AS SCSVPSLSS SD RR+ QLSF +GLY   SQQ +FQRLT+AK ESFKLM+GL
Sbjct: 1   MSVKSASLSCSVPSLSSTSDRRRSFQLSFSRGLYDIDSQQRSFQRLTFAKAESFKLMYGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSWRRFP RQNVLSAISEDQS+C E ++ EQVNQH T+EDISLE++S L YEGTGGKPG
Sbjct: 61  LRSWRRFP-RQNVLSAISEDQSECREQSELEQVNQHPTNEDISLENDSFLLYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYNH YKEGN+IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNH-YKEGNRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTE LFYCGVAVFLFLIDR RR TE   +   Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEVLFYCGVAVFLFLIDRSRRPTEPDTMKTSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATTL 445
           SN  T+
Sbjct: 361 SNVATV 364

BLAST of Sgr026798 vs. NCBI nr
Match: XP_011656390.2 (uncharacterized protein LOC105435726 [Cucumis sativus] >KGN63569.2 hypothetical protein Csa_013797 [Cucumis sativus])

HSP 1 Score: 587.4 bits (1513), Expect = 1.0e-163
Identity = 310/367 (84.47%), Positives = 332/367 (90.46%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQ LT+A+VES KLM+GL
Sbjct: 1   MSVKFASISCSVPTLSSPSDGRRSFQLSFSRGLYGIGSQQRSFQNLTFAEVESIKLMYGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSW RFP+R+NVLSAISE+QSQ  E  + E+VN+H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWSRFPRRKNVLSAISENQSQWCEQVELERVNEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYNHS KEG +IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNHS-KEGKRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLIDR RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDRSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKS SWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSCSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATTLQ 446
           SNA T+Q
Sbjct: 361 SNAATVQ 366

BLAST of Sgr026798 vs. ExPASy TrEMBL
Match: A0A6J1C043 (uncharacterized protein LOC111007252 OS=Momordica charantia OX=3673 GN=LOC111007252 PE=4 SV=1)

HSP 1 Score: 625.5 bits (1612), Expect = 1.6e-175
Identity = 326/366 (89.07%), Positives = 339/366 (92.62%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLYGSQQHNFQRLTYAKVESFKLMHGLLR 140
           MSVKFASFSCS+PSL SPSD RR+LQLS  QGLYGSQQHNFQRLT+AKVES KL+HG L+
Sbjct: 1   MSVKFASFSCSLPSLPSPSDTRRSLQLSLSQGLYGSQQHNFQRLTFAKVESIKLVHGSLK 60

Query: 141 SWRRFPKRQNVLSAISEDQSQ-CSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPGF 200
           SWR FPKRQNVLSAISEDQS  CSE  DPEQVNQH TDEDISLE+NS LYYEGTGGKPGF
Sbjct: 61  SWRGFPKRQNVLSAISEDQSTLCSELVDPEQVNQHPTDEDISLENNSFLYYEGTGGKPGF 120

Query: 201 ISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLLT 260
           ISFYNHSYKEGN++PLSS Q N+YNFLWFVGPA+LVASFIFPSLYLRKILSNIFEDSLLT
Sbjct: 121 ISFYNHSYKEGNRVPLSSTQRNEYNFLWFVGPAVLVASFIFPSLYLRKILSNIFEDSLLT 180

Query: 261 DFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLSL 320
           DFLILFFTEALFYCGVAVFLFLID  RR TE   V N YE LSNQ GQRISSVA LVLSL
Sbjct: 181 DFLILFFTEALFYCGVAVFLFLIDCSRRPTEPDTVKNYYETLSNQFGQRISSVATLVLSL 240

Query: 321 IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLH 380
           IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIP+VFQVYRLH
Sbjct: 241 IIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPVVFQVYRLH 300

Query: 381 QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFPS 440
           QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQ LGIICIWSLSSFLMR+FPS
Sbjct: 301 QLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQGLGIICIWSLSSFLMRYFPS 360

Query: 441 NATTLQ 446
           NATT+Q
Sbjct: 361 NATTMQ 366

BLAST of Sgr026798 vs. ExPASy TrEMBL
Match: A0A1S3CLW8 (uncharacterized protein LOC103502400 OS=Cucumis melo OX=3656 GN=LOC103502400 PE=4 SV=1)

HSP 1 Score: 597.8 bits (1540), Expect = 3.7e-167
Identity = 314/367 (85.56%), Positives = 335/367 (91.28%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQRLT+AKVES KLMHGL
Sbjct: 1   MSVKFASISCSVPTLSSPSDARRSFQLSFSRGLYGIGSQQRSFQRLTFAKVESIKLMHGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSWRRFP+R+NVLSAISE+QSQCSE  + E+V +H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWRRFPRRKNVLSAISENQSQCSEQVELERVTEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYN+S KEGN+IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNNS-KEGNRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLID  RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDCSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKS SWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSSSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATTLQ 446
           SNA T+Q
Sbjct: 361 SNAATVQ 366

BLAST of Sgr026798 vs. ExPASy TrEMBL
Match: A0A5D3BHC7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G001320 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 2.4e-166
Identity = 313/365 (85.75%), Positives = 333/365 (91.23%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQRLT+AKVES KLMHGL
Sbjct: 1   MSVKFASISCSVPTLSSPSDARRSFQLSFSRGLYGIGSQQRSFQRLTFAKVESIKLMHGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSWRRFP+R+NVLSAISE+QSQCSE  + E+V +H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWRRFPRRKNVLSAISENQSQCSEQVELERVTEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYN+S KEGN+IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNNS-KEGNRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLID  RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDCSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKS SWPVIPIVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSSSWPVIPIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATT 444
           SNA T
Sbjct: 361 SNAAT 364

BLAST of Sgr026798 vs. ExPASy TrEMBL
Match: A0A0A0LS19 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G004340 PE=4 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 3.6e-162
Identity = 308/367 (83.92%), Positives = 330/367 (89.92%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVP+LSSPSD RR+ QLSF +GLY  GSQQ +FQ LT+A+VES KLM+GL
Sbjct: 1   MSVKFASISCSVPTLSSPSDGRRSFQLSFSRGLYGIGSQQRSFQNLTFAEVESIKLMYGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPG 200
           LRSW RFP+R+NVLSAISE+QSQ  E  + E+VN+H TDEDISLE+NS L+YEGTGGKPG
Sbjct: 61  LRSWSRFPRRKNVLSAISENQSQWCEQVELERVNEHPTDEDISLENNSFLHYEGTGGKPG 120

Query: 201 FISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLL 260
           FISFYNHS KEG +IPLSS QSNQY FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSLL
Sbjct: 121 FISFYNHS-KEGKRIPLSSVQSNQYKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSLL 180

Query: 261 TDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLS 320
           TDFLILFFTEALFYCGVAVFLFLIDR RR  E   + N Y+ LSNQ GQRISSVA L LS
Sbjct: 181 TDFLILFFTEALFYCGVAVFLFLIDRSRRTAEPDTLKNSYQTLSNQFGQRISSVATLALS 240

Query: 321 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRL 380
           LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYAR KKS SWPVI IVFQVYRL
Sbjct: 241 LIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARCKKSCSWPVIQIVFQVYRL 300

Query: 381 HQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 440
           HQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP
Sbjct: 301 HQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFFP 360

Query: 441 SNATTLQ 446
           SNA T+Q
Sbjct: 361 SNAATVQ 366

BLAST of Sgr026798 vs. ExPASy TrEMBL
Match: A0A6J1E2A7 (uncharacterized protein LOC111430132 OS=Cucurbita moschata OX=3662 GN=LOC111430132 PE=4 SV=1)

HSP 1 Score: 570.5 bits (1469), Expect = 6.3e-159
Identity = 310/368 (84.24%), Positives = 327/368 (88.86%), Query Frame = 0

Query: 81  MSVKFASFSCSVPSLSSPSDCRRNLQLSFPQGLY--GSQQHNFQRLTYAKVESFKLMHGL 140
           MSVKFAS SCSVPS SSPS  RR+ QL F Q LY  GSQQ +FQRLT+AK ESFKLM+GL
Sbjct: 1   MSVKFASLSCSVPSPSSPSGARRSSQLGFSQALYGIGSQQRSFQRLTFAKAESFKLMYGL 60

Query: 141 LRSWRRFPKRQNVLSAISEDQSQCSESNDPEQVNQHLTDEDISLESNSSLYY-EGTGGKP 200
           L   RRFP+RQN+LSAISEDQSQCSE  + EQ++QH TDEDISLE+NS L+  E TGGKP
Sbjct: 61  L---RRFPRRQNILSAISEDQSQCSEQGELEQIDQHPTDEDISLENNSFLHSDECTGGKP 120

Query: 201 GFISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSL 260
           GFISFYNHS KEG Q  LSS QSNQ+ FLWFVGPA+LVASFIFPSLYLRK+LSNIFEDSL
Sbjct: 121 GFISFYNHS-KEGYQTRLSSVQSNQHKFLWFVGPAVLVASFIFPSLYLRKLLSNIFEDSL 180

Query: 261 LTDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVL 320
           LTDFLILFFTEALFYCGVAVFLFLIDR RR TE   V N Y+ LSNQ GQRISSVAAL L
Sbjct: 181 LTDFLILFFTEALFYCGVAVFLFLIDRSRRPTEPDTVRNSYQTLSNQFGQRISSVAALAL 240

Query: 321 SLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYR 380
           SLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRK+SPSWPVIPIVFQVYR
Sbjct: 241 SLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKESPSWPVIPIVFQVYR 300

Query: 381 LHQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLMRFF 440
           LHQLNRAAQLVTALSFTIKGAEMTP+NLAINSSLGTLLNVLQ LGIICIWSLSSFLMRFF
Sbjct: 301 LHQLNRAAQLVTALSFTIKGAEMTPNNLAINSSLGTLLNVLQFLGIICIWSLSSFLMRFF 360

Query: 441 PSNATTLQ 446
           PSNA TLQ
Sbjct: 361 PSNAATLQ 364

BLAST of Sgr026798 vs. TAIR 10
Match: AT5G63040.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 336.3 bits (861), Expect = 3.8e-92
Identity = 203/361 (56.23%), Positives = 248/361 (68.70%), Query Frame = 0

Query: 81  MSVKFASFSCS--VPSLSSPSDCRRNLQLSFPQGLYGSQQHNFQRLTYAKVESFKLMHGL 140
           M+ K  S S S  VPS+ SP   R + Q   P+ +  +  H +  L + K +    +H  
Sbjct: 1   MAAKLVSSSASSLVPSIFSPFQPRSSFQFRSPR-VQPAHLHCYHSL-HLKTDKIGSLHIS 60

Query: 141 LRSWRRFP---KRQNVLSAISEDQSQCSE--SNDPEQVNQHLTDEDISLESNSSLYYEGT 200
            R  +      KR  +  A SE++   +E  +NDP+ V+  +  E    + +S++ Y   
Sbjct: 61  SRGQKPSEVSRKRTYLPLATSEEKFHYTEDTTNDPDTVSPQIGTEATPRDDDSTIQYNRN 120

Query: 201 GGKPGFISFYNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIF 260
            GKPGFISFYN   K  + I     QS     LW +GPA+LV+SFI P +YLR+I+S +F
Sbjct: 121 DGKPGFISFYNPRNKTEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAVF 180

Query: 261 EDSLLTDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVA 320
           EDSLLTDFLILFFTEALFYCGVA FL +IDR R+    +    +  I  +QLGQRISSVA
Sbjct: 181 EDSLLTDFLILFFTEALFYCGVAAFLLIIDRSRK---GSGKVPQNRINPSQLGQRISSVA 240

Query: 321 ALVLSLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVF 380
            LVLSL+IPMVTMG VWPWTGPAASATLAPYLVGIVVQFAFEQYAR + SPS P+IPI+F
Sbjct: 241 TLVLSLMIPMVTMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARYRNSPSSPIIPIIF 300

Query: 381 QVYRLHQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFL 435
           QVYRLHQLNRAAQLVTALSFT+KGAE T +NLAI  SLGTLLNV+Q LG+I IWS+SSFL
Sbjct: 301 QVYRLHQLNRAAQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQVLGVISIWSISSFL 356

BLAST of Sgr026798 vs. TAIR 10
Match: AT5G63040.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 330.5 bits (846), Expect = 2.1e-90
Identity = 185/290 (63.79%), Positives = 219/290 (75.52%), Query Frame = 0

Query: 147 KRQNVLSAISEDQSQCSE--SNDPEQVNQHLTDEDISLESNSSLYYEGTGGKPGFISFYN 206
           KR  +  A SE++   +E  +NDP+ V+  +  E    + +S++ Y    GKPGFISFYN
Sbjct: 70  KRTYLPLATSEEKFHYTEDTTNDPDTVSPQIGTEATPRDDDSTIQYNRNDGKPGFISFYN 129

Query: 207 HSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNIFEDSLLTDFLIL 266
              K  + I     QS     LW +GPA+LV+SFI P +YLR+I+S +FEDSLLTDFLIL
Sbjct: 130 PRNKTEDIIIPPETQSPWGRLLWLIGPAVLVSSFILPPVYLRRIVSAVFEDSLLTDFLIL 189

Query: 267 FFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSVAALVLSLIIPMV 326
           FFTEALFYCGVA FL +IDR R+    +    +  I  +QLGQRISSVA LVLSL+IPMV
Sbjct: 190 FFTEALFYCGVAAFLLIIDRSRK---GSGKVPQNRINPSQLGQRISSVATLVLSLMIPMV 249

Query: 327 TMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLHQLNRA 386
           TMG VWPWTGPAASATLAPYLVGIVVQFAFEQYAR + SPS P+IPI+FQVYRLHQLNRA
Sbjct: 250 TMGFVWPWTGPAASATLAPYLVGIVVQFAFEQYARYRNSPSSPIIPIIFQVYRLHQLNRA 309

Query: 387 AQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSFLM 435
           AQLVTALSFT+KGAE T +NLAI  SLGTLLNV+Q LG+I IWS+SSFLM
Sbjct: 310 AQLVTALSFTVKGAEATVNNLAIKKSLGTLLNVIQVLGVISIWSISSFLM 356

BLAST of Sgr026798 vs. TAIR 10
Match: AT1G48460.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast envelope; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G63040.1); Has 60 Blast hits to 60 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 60; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 143.3 bits (360), Expect = 4.8e-34
Identity = 83/247 (33.60%), Positives = 137/247 (55.47%), Query Frame = 0

Query: 195 GKPGFISF--YNHSYKEGNQIPLSSAQSNQYNFLWFVGPALLVASFIFPSLYLRKILSNI 254
           GK G +SF    H   E +++  +  Q  + +FLW + P +L++S I P  +L  I+   
Sbjct: 88  GKSGSVSFNGLTHQLVEESKLVSAPFQEEKGSFLWVLAPVVLISSLILPQFFLSGIIEAT 147

Query: 255 FEDSLLTDFLILFFTEALFYCGVAVFLFLIDRLRRLTETANVTNKYEILSNQLGQRISSV 314
           F++  + + +  F  E +FY G+A+FL + DR++R     + + ++ +++   G   S+ 
Sbjct: 148 FKNDTVAEIVTSFCFETVFYAGLAIFLSVTDRVQRPYLDFS-SKRWGLITGLRGYLTSAF 207

Query: 315 AALVLSLIIPMVTMGLVWPWTGPAASATLAPYLVGIVVQFAFEQYARRKKSPSWPVIPIV 374
             + L +++P+  + + WP  G  A   + P+LVG  VQ  FE    R+ S  WP++PIV
Sbjct: 208 LTMGLKVVVPVFAVYMTWPALGIDALIAVLPFLVGCAVQRVFEARLERRGSSCWPIVPIV 267

Query: 375 FQVYRLHQLNRAAQLVTALSFTIKGAEMTPHNLAINSSLGTLLNVLQCLGIICIWSLSSF 434
           F+VYRL+Q+ RAA  V  L F +K A  T        +L  L+  LQ L ++C+WS  +F
Sbjct: 268 FEVYRLYQVTRAATFVQRLMFMMKDAATTAEITERGVALVGLVVTLQFLAVMCLWSFITF 327

Query: 435 LMRFFPS 440
           LMR FPS
Sbjct: 328 LMRLFPS 333

BLAST of Sgr026798 vs. TAIR 10
Match: AT3G60590.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 7.7e-08
Identity = 49/171 (28.65%), Positives = 82/171 (47.95%), Query Frame = 0

Query: 226 LWFVGPALLVASFIFPSLYLRKILSNIFEDSLLTDFLILFFTEALFYCGVAVFLFLIDRL 285
           +W +GP++L+ S + P+L+L   LS++F  S +   L L   + +F  G  +FL + D  
Sbjct: 186 IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 245

Query: 286 RRLTETANVTNKYEILSNQLGQRISSVAALVLSLIIPMVTM-----GL---VWPWTGPAA 345
            R  + +   N     S +     S    L++  ++PM+ +     GL   + P     +
Sbjct: 246 ARPKDPSQSCNSKPPFSYKFWNMFS----LIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 305

Query: 346 SAT-LAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLHQLNRAAQL 388
           SA  L PY + + VQ   E      +SP W V P+V++ YR+ QL R   L
Sbjct: 306 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTL 350

BLAST of Sgr026798 vs. TAIR 10
Match: AT3G60590.1 (unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G48460.1); Has 81 Blast hits to 81 proteins in 19 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 7.7e-08
Identity = 49/171 (28.65%), Positives = 82/171 (47.95%), Query Frame = 0

Query: 226 LWFVGPALLVASFIFPSLYLRKILSNIFEDSLLTDFLILFFTEALFYCGVAVFLFLIDRL 285
           +W +GP++L+ S + P+L+L   LS++F  S +   L L   + +F  G  +FL + D  
Sbjct: 18  IWLLGPSVLLTSGMAPTLWLP--LSSVFLGSNVVSLLSLIGLDCIFNLGATLFLLMADSC 77

Query: 286 RRLTETANVTNKYEILSNQLGQRISSVAALVLSLIIPMVTM-----GL---VWPWTGPAA 345
            R  + +   N     S +     S    L++  ++PM+ +     GL   + P     +
Sbjct: 78  ARPKDPSQSCNSKPPFSYKFWNMFS----LIIGFLVPMLLLFGSQSGLLASLQPQIPFLS 137

Query: 346 SAT-LAPYLVGIVVQFAFEQYARRKKSPSWPVIPIVFQVYRLHQLNRAAQL 388
           SA  L PY + + VQ   E      +SP W V P+V++ YR+ QL R   L
Sbjct: 138 SAVILFPYFILLAVQTLTEILTWHWQSPVWLVTPVVYEAYRILQLMRGLTL 182

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022135241.1  3.4e-175  89.07         uncharacterized protein LOC111007252 [Momordica charantia]
XP_008464550.1  7.6e-167  85.56         PREDICTED: uncharacterized protein LOC103502400 [Cucumis melo]
KAA0057843.1    4.9e-166  85.75         uncharacterized protein E6C27_scaffold274G001300 [Cucumis melo var. makuwa] >TYJ...
XP_038879012.1  1.0e-163  85.25         uncharacterized protein LOC120071064 isoform X1 [Benincasa hispida]
XP_011656390.2  1.0e-163  84.47         uncharacterized protein LOC105435726 [Cucumis sativus] >KGN63569.2 hypothetical ...

Match Name      E-value   Identity (%)  Description
A0A6J1C043      1.6e-175  89.07         uncharacterized protein LOC111007252 OS=Momordica charantia OX=3673 GN=LOC111007...
A0A1S3CLW8      3.7e-167  85.56         uncharacterized protein LOC103502400 OS=Cucumis melo OX=3656 GN=LOC103502400 PE=...
A0A5D3BHC7      2.4e-166  85.75         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LS19      3.6e-162  83.92         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G004340 PE=4 SV=1
A0A6J1E2A7      6.3e-159  84.24         uncharacterized protein LOC111430132 OS=Cucurbita moschata OX=3662 GN=LOC1114301...

Match Name      E-value   Identity (%)  Description
AT5G63040.1     3.8e-92   56.23         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G63040.2     2.1e-90   63.79         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G48460.1     4.8e-34   33.60         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G60590.3     7.7e-08   28.65         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G60590.1     7.7e-08   28.65         unknown protein; LOCATED IN: chloroplast, chloroplast inner membrane, chloroplas...
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term    Source Description    Alignment
None      No IPR available  PANTHER  PTHR33918:SF2  OS01G0704200 PROTEIN  coord: 87..443
None      No IPR available  PANTHER  PTHR33918      OS01G0704200 PROTEIN  coord: 87..443
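The alignment coordinates of the PANTHER matches can be related to the full protein. The sketch below computes how much of the protein the match covers; `parse_coord` and `coverage_percent` are hypothetical helpers, the coordinates are assumed to be 1-based and inclusive, and the protein length of 446 residues is inferred from the final query position in the BLAST alignments rather than stated directly on this page.

```python
import re

PROTEIN_LENGTH = 446  # inferred from the last query coordinate in the alignments


def parse_coord(text):
    """Parse a string such as 'coord: 87..443' into (start, end)."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def coverage_percent(start, end, total_length):
    """Share of the protein covered, ASSUMING 1-based inclusive coordinates."""
    covered = end - start + 1
    return round(100.0 * covered / total_length, 2)


start, end = parse_coord("coord: 87..443")
print(end - start + 1, coverage_percent(start, end, PROTEIN_LENGTH))
```

Under these assumptions the PTHR33918 match spans 357 residues, about 80% of the protein.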

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr026798.1   Sgr026798.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane