Sgr028225 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr028225
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Location: tig00153056: 4807181 .. 4812564 (+)
RNA-Seq Expression: Sgr028225
Synteny: Sgr028225
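
As a quick check on the Location field, the length of the gene span follows directly from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is common for GFF-style annotation but is not stated on this page; the function name span_length is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: tig00153056: 4807181 .. 4812564 (+)
gene_length = span_length(4807181, 4812564)
print(gene_length)  # 5384
```

If the coordinates were instead 0-based and half-open, the length would be end minus start with no added 1, so the convention is worth confirming before comparing against the sequence length.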
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCGAGTAATCAGGGGAAATGATGAGGAACAATTTTTTTTTTTTTTTTTTTTTTTATATACAGCGCGATTTTTCCATTTTCCTCGTCTCCTCGTTTTCGCTTCTTATTCTACTCCCAGTAAGGGGAGACGGCACACGCAACGCAACCATGATTCTTTTTTTTCTTTTTTTTTTTCTTTTTTGCTTTTTTTAATGTTTTAAAGTTGATGGATTTGATTAATGATAAAAGTTGATGGATTATTTTTTTTTTTTTTAAGAAGTAGTATGTTGGAATACGATCTAATTAAATTTTGTTATGATAATTGTATTGCTTTTAATTAGAGAAATTTGTTATTATTTCTTATTTGATAATTGATAGTCTACTAATCAAAATAAATTAAAATTTTGAGCTATATTTAAAGACAAATAAATGTTTTTAGATTTACATGAAATTTTAAGAGTGTTTTTCAAATATCAAAATAAATAAATCTAAACAATCTTCTTTTTTTTAATAGAATTGTGAAAATAAATTTATTATTAAATGTGGCCTTCCACTAACTTTGGCATGGCACCTATGGTTGAACCAAAAATGACATTATTTTGCTTGACCATCCATGATCGCTAATGTAATACTTTGCTCAAGCAACAAAATAGGCACATGCATGACTCTACTCGATCAATGGACGAGCCTAACTATATTAATAGAATATCGCGTTAGCACACCACTAAAATATATTGCAATACATGTGTTATTAGTATATACATAAACTAGATCGACACATTTTTTACAATGGTCAACGTGTTCCATATTTCTCGTTTAAATGGTTAACGACACTCTTAATATGATAGTATTGTCCAAGCCTAATGACCAACCTATTACCAAGGTAATTTTTGTAAAAATAGCATATAATCATCCAATGTTCACGATACTTCCCATTGTAATAGCAACTTCCTTAACATTGTCAATATCATTTTCTGCCACGATGGCTAAAGTAATGTAACTCACTTACACTCGTACAACCAGCTTGTTATTTCTTTATCAAGCTTAATTTAGTTCATTGATTCAACTGTTAGTCTCTTTAGCATGTATGAATCATACACTCTCAAACATACTTTATGTGTTGCAACTGCAAGTAATTTGATATTGATAGAGGCAACCCTTAGACTGTAATAGTGATTTTACTCATTAAAATTACTTTTAAAATATTTTCATAATGTTCAACTGCTTGATTAAAACTATTTTTAAAAGGTTTTAAAACTCCCAACATTTAGTTAGTATTTTTACAAAATATATTTAACTTTTAAATATTATTCCCAATAGTTATTCCAACACACTTTTATTAACTTCAATTTAGTCATTTAAATTCAATTTGTTGGGTCCTGACTGCATACAGGCCCAACTCAGTTTTAGGCCCATTAGTTAATAATAATAATAAAAGGAAAAAGCAAAATTATATGAAGCTTGCGGTGCAAGCTTGTTTGATATACAGGGGTGGCTACTAGTTTGTAAGATAGAGAGGTGAGGTGAGTAAGCGGTCGAGGAGAAGAAGCAGGGTGAAGAGAGAGAGCTCAAGTTCTTGCAACCGAGAAGAAGAAGGAAAAGCAAATTGAAGGTTTCTTTCAGTTTGTTGCCAAATCTATTTCTCCTTCGGTTGGTCTTCTTGAGTCGTCTTCTCCGGTGAGATCTCTCCTGCTATTAAGTTCCATCATTTGTAGATGAAGGGTAGCTTTATGTACCGATTGTTTGCCAGGATTTAGCGTGATGTACTTGTTGATAGTTGCTGTATCCTCTAGTTATCGCTAGGGAAGAACTTGTTGTAACTCATTTTCGTTATAGTGGAAGTTTTTCGGAGCTGTCTCGTGGTTTTTCCCTTACACTGAGGGATTTTCCACATAAATCTTGGTGTCACTATTTTTCTTGTTGTGTGCATGTGATCTATTATTCGCATTAATCTCATCGACTATACACATAAAATCCCAACACAATTAGACTCCACATTATTAGTTATCATAAAAAAAAATC
TATATCGATATCAATTCACATCTTATTTAATCGTCTTGTAAATAAAGCCTTAATTTTAACAAGATAGAGTTAAAAACACTTCACTTTAAGGGAAGTGAACAAGTGGCTCATTGCCTTCCTACTTCGAACTGCAATATTGTGCGGATTGAAGACCCAGAGATTGCCATAGATGAAGTCATTGCGAACATGTCGAAGTTGAGGTAGAGTTTATTGGAGTGTTGTTCAAATTGTTTTGTTGTGTCTTGATTATTTGTTGTTGTGTCTCAAAAAATAAAATAAATTTATTTATTATTATTTTTTAAGAAAATTAAGACAAAGTTATTGTAGTTAGATAATTGCAAAAGACATTATTTATCATAATTAGAATTATTAATTTGTCCGGCTAACATTTTTTTCATAATACGACATTATGAATGCGGGTGAAATTCAAACTTTTGACATAAAAAAAATGTAATTAATGTGTTGACCCATTTAACTACACTTGTATTTGTGAGGTTAACTAATAAATAATATGTTTGGAGTGTTTTTAAAATGGTTAAAATTATTTGCTCAATAAAATAAATATATGTGACTATAATTTATAAATAAAATTTTTAAAAATTAAGAAAATACCTAAATAAGTTTGGGATGATTTTGAGAGTATTTTTCAAACAACATTTTTTTATAAGTTTTGATTGAAAAATAATAATTAATTGCTTTAACTAAAGGCACTTTAAGTGTCTGTATAATTTCCCAAATTTGCTTTTGACTACGTAGTTGAATGTTATAAATTTTTAAAGTACGTTTAATTAGTTAAAAACATTGTTTATTTTTTAAAAGAAGTAATATTAAACGCATTTTCCTAATTGATTTTCAAAAGTCCTATTTTTTAGAATGATTTTCAAAAAGTTAATTGATTATATTTTATCAAGTACTTCTCAAAAAACAATATACTAGAGAATATAAATTTCTCTTATAACACACTTTTTTTAGTTAGAACATCTTGCAATTTTCAAAACATCACATAATCCTGTTATGACTTTTATATATATACACATCTAACATCTACATACATTATCTCTTAATTGTCTCCCTAAACATACATTTATTTCATTAATCTCTCCATACACGCTATTCATTTTTTTTATTCTTTTGATTTTTTTAGCAACGTGTTTATATTAATAATTCAAGAAACATTTATAATTATATATAAAAAAACACAGTGAAGTATTAGAATCAAACACACAAAAAAAAAAGCATTTTTTATTATATATATATATATGTGTGTATGTATGTATGTATGTGTATGTATGTGTTAGATCAATACAAAGAGAATGATGATGGGACAGAGATTTGAATGATTATCGACAAGATACATTATCTTCAACTTCAAAATGACAAAGTTGGTGATTAATAAAGGAATGGTGCATTATAAGACTCCATGACCATATTTGTGTTTATCCCTCAAATTTTGGCTATGTTTATGAGTGTTTTTGACATGATTAAAAATATTTTTGCAGTGTCAAAAATACTTCCGCTCATAATATCATATTTAGGGTAGCACAAAATGTTTTTTTTAGAAAAAATCAAAGTACTTAAAGTGTTTTGCAAAAAAAACTTGTAGAATGTTTTTATCTCAAGTGTTTGATCGGAAAGCACTTCAAAAGCACTTTTAAGAGTATAGAACCAAATATCAAGGTGATTTATTTAAGCATTTAGTACTAAAGTGATTAAGTCAAAACTATTTTTGCTAAAATCCCATATTTTAAAAGCACTCCTAAACACACACTAATTATGTGTTATCTCTACAAGCACTTTAAAGTGATTTTAGAGCTTTCAAAATAATTTTTAACTAAATAATCGAACATCATAAAACCTTCAAAAGTAATTCGTGAAAACCTTCAAAAGCAATTCTCACTATTTTAAAAGTCATTACAAACTCAGCATAAGAATTACAATTAAAACATACAAGAAATTAAGAGTTCTGTAAAAAAGACAAAACAGGTGTATGGATTATTCAA
GATATAAATATTGAGTATACTCTATAAATAAAGTGTCTATGTTTATTTAAATAGAGATTGGATTAATCCATGGCTATAATTACTCATTTCTTACTTAGTCATTTATTCCAGTGATCCAAAATTCATCTGTTTTATGGCGCTTCATTATTAGTTTAAAAACTTTCATCGGTAAAACCAGACGAAGCGCGAGCTAATCGTCGCCGGCCCTCAGCGCTCGAAAACTCAAACGCCACCCACTCATCTCCTCCATGAAACAACTGCAAACTGTTTCTTAGATTCTCTTTTTTCTTCTCGGATTCTCATTCGCCGTTTCTCAGATTTTCTTTCCCCTGTTTTCTAGCCTCCCGAACGCCTGCTAAATCCTCGCAACCTGCACAAAAACAGCCTCCAAATTCGAGTTTTATGTCGAATGGACTCCATTGCAACCAAAGAGAGGAAGAAAAGTAAATTTTTTCCGTGTTTCCGAGCGCCGGCCACCAGCAGCCATTTCAGGACGGAACGACGTAAGGATGCTCCAGACGAGCCGGTTTTTCCGTTCATGGCGGTGGGAGAGAGGGACGGTGTGATGTTCCACACTGTGCAACCGTTGGCTTCGCCGTCGGATGGAGACGATGAAGATCCCGGTCTCCGGAAAAAGAAAGGCGGCGGTGGTGCTTTATCGCGGGCAATTAAGGCCGTCTTATTCGGAACGTCATTGGTTTGTTCGCCGATCCAAAAAACCTTCTGAAATGAAACGCATCAATTTCATTCTGTGTTTTTTTTTTCCATGATTGCTTTCTTATAATCTTGGATCAAGTTTTCTAACGAAAATTCCGGATGTTCCGCGAGTGTTGCAGGGGAAGAAGATCAGAAACAGGAAAGCGAAACAAAAGCAAAATTCACAAATCGGAATTCGAAAAAGGAGAATCAGAGGCATCAAGCTATGTCCTCAATCAGCAACAGAAGAATTGCTTCAGATCTCTACCACAACTCTTCATTCTCTTCTTCTTCGCTCACTTCTGTGCCATTTTCATCCTCCTCGATCTGCAGTTCGTCCTGTTCCTCCTCAGACATCAACGACAGATCATTTCGATTTAATCCAACAGCCTCGAATCGATTGTTCAGACAGATAAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTGGTATGTCTTCTGAGCTTGATTTCATGGGGAAAAGTCGGTGCTATTGTCTGCACTTCTGCTTGGATCCTCTGTTTGCCTCGCCGGAGAATCGGATTCAAGTCGCCGGATGACAAGGCCAGTGAGGCGGCGGCGGCGGCGATTGATTCCAGTGAATACAAGAAGAGAATCGTAATCGAAGGGCTGCTGGCGAGGGACCGTTCAGCTGCTCAAAATTCAAGCTTACGCATTGATTGA

mRNA sequence

GCGAGTAATCAGGGGAAATGATGAGGAACAATTTTTTTTTTTTTTTTTTTTTTTATATACAGCGCGATTTTTCCATTTTCCTCGTCTCCTCGTTTTCGCTTCTTATTCTACTCCCAACCCAGAGATTGCCATAGATGAAGTCATTGCGAACATGTCGAAGTTGAGCCTCCAAATTCGAGTTTTATGTCGAATGGACTCCATTGCAACCAAAGAGAGGAAGAAAAGTAAATTTTTTCCGTGTTTCCGAGCGCCGGCCACCAGCAGCCATTTCAGGACGGAACGACGTAAGGATGCTCCAGACGAGCCGGTTTTTCCGTTCATGGCGGTGGGAGAGAGGGACGGTGTGATGTTCCACACTGTGCAACCGTTGGCTTCGCCGTCGGATGGAGACGATGAAGATCCCGGTCTCCGGAAAAAGAAAGGCGGCGGTGGTGCTTTATCGCGGGCAATTAAGGCCGTCTTATTCGGAACGTCATTGTTTTCTAACGAAAATTCCGGATGTTCCGCGAGTGTTGCAGGGGAAGAAGATCAGAAACAGGAAAGCGAAACAAAAGCAAAATTCACAAATCGGAATTCGAAAAAGGAGAATCAGAGGCATCAAGCTATGTCCTCAATCAGCAACAGAAGAATTGCTTCAGATCTCTACCACAACTCTTCATTCTCTTCTTCTTCGCTCACTTCTGTGCCATTTTCATCCTCCTCGATCTGCAGTTCGTCCTGTTCCTCCTCAGACATCAACGACAGATCATTTCGATTTAATCCAACAGCCTCGAATCGATTGTTCAGACAGATAAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTGGTATGTCTTCTGAGCTTGATTTCATGGGGAAAAGTCGGTGCTATTGTCTGCACTTCTGCTTGGATCCTCTGTTTGCCTCGCCGGAGAATCGGATTCAAGTCGCCGGATGACAAGGCCAGTGAGGCGGCGGCGGCGGCGATTGATTCCAGTGAATACAAGAAGAGAATCGTAATCGAAGGGCTGCTGGCGAGGGACCGTTCAGCTGCTCAAAATTCAAGCTTACGCATTGATTGA

Coding sequence (CDS)

GCGAGTAATCAGGGGAAATGATGAGGAACAATTTTTTTTTTTTTTTTTTTTTTTATATACAGCGCGATTTTTCCATTTTCCTCGTCTCCTCGTTTTCGCTTCTTATTCTACTCCCAACCCAGAGATTGCCATAGATGAAGTCATTGCGAACATGTCGAAGTTGAGCCTCCAAATTCGAGTTTTATGTCGAATGGACTCCATTGCAACCAAAGAGAGGAAGAAAAGTAAATTTTTTCCGTGTTTCCGAGCGCCGGCCACCAGCAGCCATTTCAGGACGGAACGACGTAAGGATGCTCCAGACGAGCCGGTTTTTCCGTTCATGGCGGTGGGAGAGAGGGACGGTGTGATGTTCCACACTGTGCAACCGTTGGCTTCGCCGTCGGATGGAGACGATGAAGATCCCGGTCTCCGGAAAAAGAAAGGCGGCGGTGGTGCTTTATCGCGGGCAATTAAGGCCGTCTTATTCGGAACGTCATTGTTTTCTAACGAAAATTCCGGATGTTCCGCGAGTGTTGCAGGGGAAGAAGATCAGAAACAGGAAAGCGAAACAAAAGCAAAATTCACAAATCGGAATTCGAAAAAGGAGAATCAGAGGCATCAAGCTATGTCCTCAATCAGCAACAGAAGAATTGCTTCAGATCTCTACCACAACTCTTCATTCTCTTCTTCTTCGCTCACTTCTGTGCCATTTTCATCCTCCTCGATCTGCAGTTCGTCCTGTTCCTCCTCAGACATCAACGACAGATCATTTCGATTTAATCCAACAGCCTCGAATCGATTGTTCAGACAGATAAATCTCAGAAAAATCTGCAGCGGTTGGTTTCTGCTACTGGTATGTCTTCTGAGCTTGATTTCATGGGGAAAAGTCGGTGCTATTGTCTGCACTTCTGCTTGGATCCTCTGTTTGCCTCGCCGGAGAATCGGATTCAAGTCGCCGGATGACAAGGCCAGTGAGGCGGCGGCGGCGGCGATTGATTCCAGTGAATACAAGAAGAGAATCGTAATCGAAGGGCTGCTGGCGAGGGACCGTTCAGCTGCTCAAAATTCAAGCTTACGCATTGATTGA

Protein sequence

RVIRGNDEEQFFFFFFFLYTARFFHFPRLLVFASYSTPNPEIAIDEVIANMSKLSLQIRVLCRMDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESETKAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID
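
The protein sequence above starts with R rather than M. This is consistent with the listed CDS being translated from its second nucleotide (frame offset 1) instead of from an ATG start codon. The sketch below checks this on the first 34 bases of the CDS, assuming the standard genetic code; the translate helper is illustrative and is not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons of dna starting at a 0-based offset."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    # Codons with non-ACGT characters are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 34 bases of the CDS listed above.
cds_prefix = "GCGAGTAATCAGGGGAAATGATGAGGAACAATTT"

print(translate(cds_prefix, offset=1))  # RVIRGNDEEQF
print(translate(cds_prefix, offset=0))  # ASNQGK**GTI
```

Offset 1 reproduces the first residues of the listed protein, while offset 0 runs into two stop codons within the first 24 bases. The BLAST hits further down align from query position 64, where the query reads MDSIATK.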
Homology
BLAST of Sgr028225 vs. NCBI nr
Match: XP_022139696.1 (uncharacterized protein LOC111010543 [Momordica charantia])

HSP 1 Score: 317.8 bits (813), Expect = 1.2e-82
Identity = 192/293 (65.53%), Positives = 217/293 (74.06%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A K RKK+K FPCFR+PA+  + RTER KD PDE VFP MAV E DG MFH+V+ +
Sbjct: 1   MDSDAAKSRKKTKLFPCFRSPASDCYVRTERCKDVPDEKVFPLMAVEEMDGTMFHSVRSV 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
           AS  D DDED G RK+KG GGALSRAIKAVLFGT+L                  K+  + 
Sbjct: 61  ASSKDADDEDSGCRKRKGSGGALSRAIKAVLFGTAL-----------------AKKMRKK 120

Query: 184 KAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCS 243
           KAK   +NSKKENQ RH ++SSIS+R RIASD YHN S + SS TS+PFSSSS CSSS S
Sbjct: 121 KAK-QKQNSKKENQIRHHSVSSISDRSRIASDPYHNYS-NFSSRTSMPFSSSSFCSSSPS 180

Query: 244 SSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILC 303
           SSDI++RSF   PTA  RLF QINLR+ICSGW + LVC+LSLI WGK+ AIVCTS WILC
Sbjct: 181 SSDISERSFAIYPTAPKRLFSQINLRRICSGWLMFLVCILSLILWGKICAIVCTSVWILC 240

Query: 304 LPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
              RR GFKSP+ KAS   AAAIDS E+ KRIVIEGLLARDRSAAQNSSLRID
Sbjct: 241 FSSRRFGFKSPEIKAS---AAAIDSGEHNKRIVIEGLLARDRSAAQNSSLRID 271
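
The percentages in each HSP header are the identical (or positive-scoring) columns divided by the alignment length, gaps included (293 columns in the hit above). The sketch below recomputes them from a header line. The regular expression and the function name are assumptions made for illustration, and the pattern tolerates the "Postives" spelling that some BLAST report renderers emit.

```python
import re

# Matches e.g. "Identity = 192/293 (65.53%), Positives = 217/293 (74.06%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


line = "Identity = 192/293 (65.53%), Positives = 217/293 (74.06%), Query Frame = 0"
print(recompute_percentages(line))
# {'identity_pct': 65.53, 'positives_pct': 74.06}
```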

BLAST of Sgr028225 vs. NCBI nr
Match: XP_038900051.1 (uncharacterized protein LOC120087211 [Benincasa hispida])

HSP 1 Score: 261.5 bits (667), Expect = 1.0e-65
Identity = 179/296 (60.47%), Positives = 201/296 (67.91%), Query Frame = 0

Query: 64  MDSI-ATKERKKSKFFPCFRAPATSSHFRTER-RKDAPDEPVFPFMAVGERDGVMFHTVQ 123
           MDSI A K +KK+K FPCFRA A+     T R ++DA DE +FPF+ V   DGV      
Sbjct: 1   MDSITAIKSKKKNKLFPCFRAAASGGPVGTARCKEDASDEHIFPFITVD--DGV------ 60

Query: 124 PLASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQES 183
                 DG D D G RKKK GGGALSRA+KAVLFGTSL                  K+  
Sbjct: 61  ---RSFDGGDGDSGHRKKKNGGGALSRAVKAVLFGTSL-----------------AKKIR 120

Query: 184 ETKAKFTNRNSKKENQRHQAMSSISNR--RIASDL-YHNSSFSSSSLTSVPFSSSSICSS 243
           + KAK   +NSK ENQRHQA  SISN   RIASDL YHNSS + SS TS PFSSSS CSS
Sbjct: 121 KRKAK-EKQNSKTENQRHQAPFSISNNRSRIASDLNYHNSS-TCSSRTSAPFSSSSFCSS 180

Query: 244 SCSSSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAW 303
           S SSS+++D SFRF PTASNRLFRQIN  KI SGWFLLLVCLLSL+ WGK+GAI+CTS W
Sbjct: 181 SPSSSEMSDISFRFYPTASNRLFRQINFGKIFSGWFLLLVCLLSLVLWGKIGAIICTSVW 240

Query: 304 ILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDRSAAQNSSLRI 354
           ILCL RRRIG KS DDK S   A A+ S E YK+R+V+EG L RD S AQNS LRI
Sbjct: 241 ILCLYRRRIGLKSSDDKVS---AGAMSSGEYYKRRVVMEGFLKRDHSGAQNSILRI 263

BLAST of Sgr028225 vs. NCBI nr
Match: XP_022981581.1 (uncharacterized protein LOC111480657 isoform X1 [Cucurbita maxima])

HSP 1 Score: 258.1 bits (658), Expect = 1.1e-64
Identity = 166/293 (56.66%), Positives = 200/293 (68.26%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A  + KK KFFPCFR+ A+SS  RT R  DA DE VFPFMAV ER+G+M H VQP 
Sbjct: 1   MDSFAATKSKK-KFFPCFRSTASSSPVRTVRGMDAADEQVFPFMAVEERNGMMLHNVQPF 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
               DG  +  G +KK GGGGALSRA+KAVLFGTSL              ++ +K++ + 
Sbjct: 61  ----DGCGDVSGRQKKGGGGGALSRALKAVLFGTSL-------------AKKIRKRKRKQ 120

Query: 184 KAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSS 243
           K      NS +ENQR Q +SSIS+R  +SD    +S + SS  S PFSS+S CSSS +SS
Sbjct: 121 K-----ENSNEENQRRQLLSSISSR--SSDPNFRNSSTCSSRISAPFSSTSFCSSSPTSS 180

Query: 244 DINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP 303
           +IN+ SFRF+PTASNRLFRQINLR     WF+LLVCLLSL+ W K+GA VCTS WILC  
Sbjct: 181 EINEISFRFHPTASNRLFRQINLRNTSRCWFVLLVCLLSLVLWEKIGATVCTSIWILCFH 240

Query: 304 --RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
             RR IGF+SPDDKAS   AAA+ S EY+KR ++EG L RDRSA +NS   ID
Sbjct: 241 FYRREIGFRSPDDKAS---AAAMSSDEYRKRTILEGFLNRDRSAVRNSITHID 265

BLAST of Sgr028225 vs. NCBI nr
Match: XP_023525890.1 (uncharacterized protein LOC111789371 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 250.4 bits (638), Expect = 2.4e-62
Identity = 165/293 (56.31%), Positives = 196/293 (66.89%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A  + KK KFFPCFR+ A+S   RT R  DA DE VFPFMAV ER+G+M H VQP 
Sbjct: 1   MDSFAANKSKK-KFFPCFRSTASSGPVRTVRGVDAADEQVFPFMAVEERNGMMLHNVQPF 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
               DG     G +KK GGGGALSRA+KAVLFGTSL              ++ +K++ + 
Sbjct: 61  ----DGCGNVSGRQKKGGGGGALSRALKAVLFGTSL-------------AKKIRKRKRKQ 120

Query: 184 KAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSS 243
           K      NS +ENQR Q +SSIS+R  +SD    +  + SS  S PFSS+S CSSS +S 
Sbjct: 121 K-----ENSNEENQRRQLLSSISSR--SSDPNFRNYSTCSSRISAPFSSASFCSSSPTSP 180

Query: 244 DINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP 303
           +IN+ SFRF+PTASNRLFRQINLRK    WF+LLV LLSLI W K+GA VCTS WILC  
Sbjct: 181 EINEISFRFHPTASNRLFRQINLRKTSRCWFVLLVGLLSLILWEKIGATVCTSIWILCFH 240

Query: 304 --RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
             RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Sbjct: 241 FYRREIGFRSPDDKAS---AAAMSSDEYKKRTILEGFLNRDRSAVRNSITHID 265

BLAST of Sgr028225 vs. NCBI nr
Match: XP_022941268.1 (uncharacterized protein LOC111446618 isoform X1 [Cucurbita moschata])

HSP 1 Score: 250.4 bits (638), Expect = 2.4e-62
Identity = 164/293 (55.97%), Positives = 196/293 (66.89%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A  + KK KFFPCFR+ A+S   RT R  +A DE VFPFMAV ER+G+M H VQP 
Sbjct: 1   MDSFAATKSKK-KFFPCFRSTASSGPVRTVRGMNAADEQVFPFMAVEERNGMMLHNVQPF 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
               DG     G +KK GGGGALSRA+KAVLFGTSL         A    ++ +KQ+   
Sbjct: 61  ----DGCGNVSGRQKKGGGGGALSRALKAVLFGTSL---------AKKIRKKKRKQK--- 120

Query: 184 KAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSS 243
                  NS +ENQR + +SSIS+R  +SD    +S + SS TS PFSS+S CSSS +S 
Sbjct: 121 ------ENSNEENQRRRLLSSISSR--SSDPNFRNSSTCSSRTSAPFSSASFCSSSPTSP 180

Query: 244 DINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP 303
           +I + SFRF+PTASNRLFRQINLRK    WF+LLV LLSL+ W K+GA VCTS WILC  
Sbjct: 181 EIKEISFRFHPTASNRLFRQINLRKTSRCWFVLLVGLLSLVLWEKIGATVCTSIWILCFH 240

Query: 304 --RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
             RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Sbjct: 241 FYRREIGFRSPDDKAS---AAAMSSDEYKKRTILEGFLNRDRSAVRNSITHID 265

BLAST of Sgr028225 vs. ExPASy TrEMBL
Match: A0A6J1CDH5 (uncharacterized protein LOC111010543 OS=Momordica charantia OX=3673 GN=LOC111010543 PE=4 SV=1)

HSP 1 Score: 317.8 bits (813), Expect = 5.8e-83
Identity = 192/293 (65.53%), Positives = 217/293 (74.06%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A K RKK+K FPCFR+PA+  + RTER KD PDE VFP MAV E DG MFH+V+ +
Sbjct: 1   MDSDAAKSRKKTKLFPCFRSPASDCYVRTERCKDVPDEKVFPLMAVEEMDGTMFHSVRSV 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
           AS  D DDED G RK+KG GGALSRAIKAVLFGT+L                  K+  + 
Sbjct: 61  ASSKDADDEDSGCRKRKGSGGALSRAIKAVLFGTAL-----------------AKKMRKK 120

Query: 184 KAKFTNRNSKKENQ-RHQAMSSISNR-RIASDLYHNSSFSSSSLTSVPFSSSSICSSSCS 243
           KAK   +NSKKENQ RH ++SSIS+R RIASD YHN S + SS TS+PFSSSS CSSS S
Sbjct: 121 KAK-QKQNSKKENQIRHHSVSSISDRSRIASDPYHNYS-NFSSRTSMPFSSSSFCSSSPS 180

Query: 244 SSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILC 303
           SSDI++RSF   PTA  RLF QINLR+ICSGW + LVC+LSLI WGK+ AIVCTS WILC
Sbjct: 181 SSDISERSFAIYPTAPKRLFSQINLRRICSGWLMFLVCILSLILWGKICAIVCTSVWILC 240

Query: 304 LPRRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
              RR GFKSP+ KAS   AAAIDS E+ KRIVIEGLLARDRSAAQNSSLRID
Sbjct: 241 FSSRRFGFKSPEIKAS---AAAIDSGEHNKRIVIEGLLARDRSAAQNSSLRID 271

BLAST of Sgr028225 vs. ExPASy TrEMBL
Match: A0A6J1IWY5 (uncharacterized protein LOC111480657 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480657 PE=4 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 5.5e-65
Identity = 166/293 (56.66%), Positives = 200/293 (68.26%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A  + KK KFFPCFR+ A+SS  RT R  DA DE VFPFMAV ER+G+M H VQP 
Sbjct: 1   MDSFAATKSKK-KFFPCFRSTASSSPVRTVRGMDAADEQVFPFMAVEERNGMMLHNVQPF 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
               DG  +  G +KK GGGGALSRA+KAVLFGTSL              ++ +K++ + 
Sbjct: 61  ----DGCGDVSGRQKKGGGGGALSRALKAVLFGTSL-------------AKKIRKRKRKQ 120

Query: 184 KAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSS 243
           K      NS +ENQR Q +SSIS+R  +SD    +S + SS  S PFSS+S CSSS +SS
Sbjct: 121 K-----ENSNEENQRRQLLSSISSR--SSDPNFRNSSTCSSRISAPFSSTSFCSSSPTSS 180

Query: 244 DINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP 303
           +IN+ SFRF+PTASNRLFRQINLR     WF+LLVCLLSL+ W K+GA VCTS WILC  
Sbjct: 181 EINEISFRFHPTASNRLFRQINLRNTSRCWFVLLVCLLSLVLWEKIGATVCTSIWILCFH 240

Query: 304 --RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
             RR IGF+SPDDKAS   AAA+ S EY+KR ++EG L RDRSA +NS   ID
Sbjct: 241 FYRREIGFRSPDDKAS---AAAMSSDEYRKRTILEGFLNRDRSAVRNSITHID 265

BLAST of Sgr028225 vs. ExPASy TrEMBL
Match: A0A6J1FRN0 (uncharacterized protein LOC111446618 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111446618 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 1.1e-62
Identity = 164/293 (55.97%), Positives = 196/293 (66.89%), Query Frame = 0

Query: 64  MDSIATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPL 123
           MDS A  + KK KFFPCFR+ A+S   RT R  +A DE VFPFMAV ER+G+M H VQP 
Sbjct: 1   MDSFAATKSKK-KFFPCFRSTASSGPVRTVRGMNAADEQVFPFMAVEERNGMMLHNVQPF 60

Query: 124 ASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESET 183
               DG     G +KK GGGGALSRA+KAVLFGTSL         A    ++ +KQ+   
Sbjct: 61  ----DGCGNVSGRQKKGGGGGALSRALKAVLFGTSL---------AKKIRKKKRKQK--- 120

Query: 184 KAKFTNRNSKKENQRHQAMSSISNRRIASDLYHNSSFSSSSLTSVPFSSSSICSSSCSSS 243
                  NS +ENQR + +SSIS+R  +SD    +S + SS TS PFSS+S CSSS +S 
Sbjct: 121 ------ENSNEENQRRRLLSSISSR--SSDPNFRNSSTCSSRTSAPFSSASFCSSSPTSP 180

Query: 244 DINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLP 303
           +I + SFRF+PTASNRLFRQINLRK    WF+LLV LLSL+ W K+GA VCTS WILC  
Sbjct: 181 EIKEISFRFHPTASNRLFRQINLRKTSRCWFVLLVGLLSLVLWEKIGATVCTSIWILCFH 240

Query: 304 --RRRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
             RR IGF+SPDDKAS   AAA+ S EYKKR ++EG L RDRSA +NS   ID
Sbjct: 241 FYRREIGFRSPDDKAS---AAAMSSDEYKKRTILEGFLNRDRSAVRNSITHID 265

BLAST of Sgr028225 vs. ExPASy TrEMBL
Match: A0A5A7SZR0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G001060 PE=4 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 6.1e-56
Identity = 164/298 (55.03%), Positives = 192/298 (64.43%), Query Frame = 0

Query: 64  MDSIAT-KERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQP 123
           MDSIAT K +KK+K FPCFRA A+ S     R KD   E VFPF+ V E        V+P
Sbjct: 1   MDSIATPKSKKKNKLFPCFRAAASGSGHVKVRSKD-DSEDVFPFITVDE-------NVRP 60

Query: 124 LASPSDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESE 183
           L     G D D G RKKKG  GALSRA KAVLFGTSL                  K+  +
Sbjct: 61  LC----GCDGDSGHRKKKGSAGALSRAFKAVLFGTSL-----------------AKKIRK 120

Query: 184 TKAKFTNRNSKKENQRHQAMSSISNRR-IASD---LYHNSSFSSSSLTSVPFSSSSICSS 243
            KAK    +  + NQ HQA+SSI NR   ASD   LYHNSS + SS TS PFSSSS CSS
Sbjct: 121 RKAKEKENSKNEINQMHQALSSIGNRSGTASDNLNLYHNSS-TRSSRTSAPFSSSSFCSS 180

Query: 244 SCSSSDINDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAW 303
           S +SS++++ SFRF P  SNRL RQINLRKI SGWF+LLVCLL+LI WGK+GAI+CTS W
Sbjct: 181 SPASSEMSEISFRFYPNGSNRLLRQINLRKILSGWFVLLVCLLNLILWGKLGAIMCTSVW 240

Query: 304 ILCLPRRRIGFKSPDDKASEAAAAAIDSSE-YKKRIVIEGLLARDR-SAAQNSSLRID 355
           ILCL RRR+G K       + +A A+ S E YK+RI +EG L R+R S+AQNS LRID
Sbjct: 241 ILCLYRRRMGLK-------KGSAVAMSSGEYYKRRIGMEGFLKRERSSSAQNSILRID 261

BLAST of Sgr028225 vs. ExPASy TrEMBL
Match: A0A6J1F8B9 (uncharacterized protein LOC111443063 OS=Cucurbita moschata OX=3662 GN=LOC111443063 PE=4 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 3.2e-49
Identity = 148/290 (51.03%), Positives = 178/290 (61.38%), Query Frame = 0

Query: 67  IATKERKKSKFFPCFRAPATSSHFRTERRKDAPDEPVFPFMAVGERDGVMFHTVQPLASP 126
           + TK +K++K FPCFRA A+ S         AP+E VFPFM V  RD V+         P
Sbjct: 5   VVTKSKKRNKLFPCFRAAASGSPV----GHGAPEEQVFPFMTV--RDNVL---------P 64

Query: 127 SDGDDEDPGLRKKKGGGGALSRAIKAVLFGTSLFSNENSGCSASVAGEEDQKQESETKAK 186
            D  DED    KKKGG GA SRAI+AV+FGTSL                  K+ ++ KAK
Sbjct: 65  VDRGDEDSSRWKKKGGRGAWSRAIRAVIFGTSL-----------------AKKIAKRKAK 124

Query: 187 FTNRNSKKENQRHQAMSSISNR-RIASDL-YHNSSFSSSSLTSVPFSSSSICSSSCSSSD 246
             +  + KE+QRH A S  S+R R  SDL Y N S  SS     PFSS S  SSS SS++
Sbjct: 125 --HYQNSKESQRHLAPSWFSSRSRSGSDLNYRNYSTRSSE----PFSSPSFYSSSPSSTE 184

Query: 247 INDRSFRFNPTASNRLFRQINLRKICSGWFLLLVCLLSLISWGKVGAIVCTSAWILCLPR 306
            +D SFR  PTASNRL+ QIN RKI SGWF+LLVCLLSL+ WGK GAI+CTS W+LCL R
Sbjct: 185 KSDSSFRLYPTASNRLYTQINFRKIFSGWFVLLVCLLSLVLWGKTGAIICTSVWLLCLYR 244

Query: 307 RRIGFKSPDDKASEAAAAAIDSSEYKKRIVIEGLLARDRSAAQNSSLRID 355
            R  F+SPDDKAS     A+ S EY    ++E  L RDR AA+NS+LRID
Sbjct: 245 WRFRFRSPDDKAS---TVAMSSGEYNDIEIMEEFLKRDRLAARNSTLRID 253

The following BLAST results are available for this feature:

NCBI nr:
Match Name | E-value | Identity (%) | Description
XP_022139696.1 | 1.2e-82 | 65.53 | uncharacterized protein LOC111010543 [Momordica charantia]
XP_038900051.1 | 1.0e-65 | 60.47 | uncharacterized protein LOC120087211 [Benincasa hispida]
XP_022981581.1 | 1.1e-64 | 56.66 | uncharacterized protein LOC111480657 isoform X1 [Cucurbita maxima]
XP_023525890.1 | 2.4e-62 | 56.31 | uncharacterized protein LOC111789371 [Cucurbita pepo subsp. pepo]
XP_022941268.1 | 2.4e-62 | 55.97 | uncharacterized protein LOC111446618 isoform X1 [Cucurbita moschata]

ExPASy TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A6J1CDH5 | 5.8e-83 | 65.53 | uncharacterized protein LOC111010543 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A6J1IWY5 | 5.5e-65 | 56.66 | uncharacterized protein LOC111480657 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FRN0 | 1.1e-62 | 55.97 | uncharacterized protein LOC111446618 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A5A7SZR0 | 6.1e-56 | 55.03 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1F8B9 | 3.2e-49 | 51.03 | uncharacterized protein LOC111443063 OS=Cucurbita moschata OX=3662 GN=LOC1114430...

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 121..143
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 166..204
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 175..197
None | No IPR available | PANTHER | PTHR34379:SF6 | PROTEIN, PUTATIVE-RELATED | coord: 64..344
IPR040411 | Uncharacterized protein At5g23160-like | PANTHER | PTHR34379 | OS07G0553800 PROTEIN | coord: 64..344
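
Two of the three predicted disorder regions overlap (175..197 lies inside 166..204), so summing the rows would double-count residues. The sketch below merges the intervals before counting, assuming the coordinates are 1-based inclusive residue positions; the helper name merge_intervals is illustrative.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite disorder predictions from the table above.
disorder = [(121, 143), (166, 204), (175, 197)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)

print(merged)   # [(121, 143), (166, 204)]
print(covered)  # 62

# Length of the PANTHER match (coord: 64..344).
print(344 - 64 + 1)  # 281
```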

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr028225.1 | Sgr028225.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016020 | membrane