Sgr025797 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr025797
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Late embryogenesis abundant protein
Location: tig00152936: 3167955 .. 3168602 (+)
RNA-Seq Expression: Sgr025797
Synteny: Sgr025797
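
The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database itself: it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a location string such as 'tig00152936: 3167955 .. 3168602 (+)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

contig, start, end, strand = parse_location("tig00152936: 3167955 .. 3168602 (+)")
length = span_length(start, end)
print(contig, strand, length)   # tig00152936 + 648
print(length % 3 == 0)          # True: the span is a whole number of codons
```

Under that assumption the feature spans 648 bp, which is consistent with an uninterrupted reading frame.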
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGAAAAGGAGCAAGCGCGACCACTTGCCCCAGCCGCCCACCATCTAAGCAGCAACAACGAGGAGACATCATTACACCTAAAGAGAATTCGACGAAGAAGATTCATAAAATGTTGTGGTTCCATCGTTGCCCTTCTTGTAGTACAAGCTGTGACAGTCATTATCTTGATGTTCACTGTGTTTCAAGTCAAAGATCCGATAATCAAGATGAACGGAATTTCAATCACCAACGTCGAGCTGATCAATGGCATCATCCCGAAGCCAGGAACCAATGTGTCGCTGACAGCGGACGTGTCTGTGAAAAACCCTAACATGGCGTCGTTCAAGTATAGTAACACGACGACAACTCTATATATTAAGGAAACTATGATAGGGGAGGCCAGAGGGCCGTCAGGGCAGGCCAAGGCACATCGGACATCGCGGATGAACATCACCATCGACATCATTGCCGACCAACTCTTGTCGAACCTCAACGTCAGCTCCGGGAAGCTGAGCTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGCTGCTGCATATTATAAGGAGACATATTGTCGTCAAAATGAACTGTACGTTGATTATCAATATCATGAACAGAACGATTGAGGATCAGAAATGCAAGAGGAAGGTGAAGCTGTAG

mRNA sequence

ATGGCTGAAAAGGAGCAAGCGCGACCACTTGCCCCAGCCGCCCACCATCTAAGCAGCAACAACGAGGAGACATCATTACACCTAAAGAGAATTCGACGAAGAAGATTCATAAAATGTTGTGGTTCCATCGTTGCCCTTCTTGTAGTACAAGCTGTGACAGTCATTATCTTGATGTTCACTGTGTTTCAAGTCAAAGATCCGATAATCAAGATGAACGGAATTTCAATCACCAACGTCGAGCTGATCAATGGCATCATCCCGAAGCCAGGAACCAATGTGTCGCTGACAGCGGACGTGTCTGTGAAAAACCCTAACATGGCGTCGTTCAAGTATAGTAACACGACGACAACTCTATATATTAAGGAAACTATGATAGGGGAGGCCAGAGGGCCGTCAGGGCAGGCCAAGGCACATCGGACATCGCGGATGAACATCACCATCGACATCATTGCCGACCAACTCTTGTCGAACCTCAACGTCAGCTCCGGGAAGCTGAGCTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGCTGCTGCATATTATAAGGAGACATATTGTCGTCAAAATGAACTGTACGTTGATTATCAATATCATGAACAGAACGATTGAGGATCAGAAATGCAAGAGGAAGGTGAAGCTGTAG

Coding sequence (CDS)

ATGGCTGAAAAGGAGCAAGCGCGACCACTTGCCCCAGCCGCCCACCATCTAAGCAGCAACAACGAGGAGACATCATTACACCTAAAGAGAATTCGACGAAGAAGATTCATAAAATGTTGTGGTTCCATCGTTGCCCTTCTTGTAGTACAAGCTGTGACAGTCATTATCTTGATGTTCACTGTGTTTCAAGTCAAAGATCCGATAATCAAGATGAACGGAATTTCAATCACCAACGTCGAGCTGATCAATGGCATCATCCCGAAGCCAGGAACCAATGTGTCGCTGACAGCGGACGTGTCTGTGAAAAACCCTAACATGGCGTCGTTCAAGTATAGTAACACGACGACAACTCTATATATTAAGGAAACTATGATAGGGGAGGCCAGAGGGCCGTCAGGGCAGGCCAAGGCACATCGGACATCGCGGATGAACATCACCATCGACATCATTGCCGACCAACTCTTGTCGAACCTCAACGTCAGCTCCGGGAAGCTGAGCTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGCTGCTGCATATTATAAGGAGACATATTGTCGTCAAAATGAACTGTACGTTGATTATCAATATCATGAACAGAACGATTGAGGATCAGAAATGCAAGAGGAAGGTGAAGCTGTAG

Protein sequence

MAEKEQARPLAPAAHHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVIILMFTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTLYIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLNVSSGKLSLRSFSRIPGRVKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL
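
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used, and the function name `translate` is hypothetical. Only the first 33 and last 18 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 11 codons of the CDS listed above.
print(translate("ATGGCTGAAAAGGAGCAAGCGCGACCACTTGCC"))  # MAEKEQARPLA
# Last 6 codons of the CDS, ending in the TAG stop codon.
print(translate("AGGAAGGTGAAGCTGTAG"))                 # RKVKL
```

Both fragments agree with the start (MAEKEQARPLA...) and end (...RKVKL) of the protein sequence shown above.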
Homology
BLAST of Sgr025797 vs. NCBI nr
Match: XP_038875202.1 (uncharacterized protein LOC120067718 [Benincasa hispida])

HSP 1 Score: 305.8 bits (782), Expect = 2.9e-79
Identity = 159/219 (72.60%), Positives = 193/219 (88.13%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAHH-LSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAV-TVIILM 60
           M +K+QA+PLAPA HH  SS+N ET+LHLKRI+RRRFIKCCG IV  L++  +  +IILM
Sbjct: 45  MVDKDQAQPLAPATHHRSSSDNGETNLHLKRIQRRRFIKCCGFIVVFLIIPTIMIIIILM 104

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQ+KDP+I+MN +SIT +ELING IPKPG+N+SLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 105 FTLFQIKDPVIRMNRVSITKLELINGAIPKPGSNMSLTADVSVKNPNMASFKYSNTTTTL 164

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNL--NVSSGKLSLRSFSRIPGR 180
           +I ET+IGEARGP G+AKA RT RMN+TIDI+AD++LSNL  +VS GK+ LRSFSRIPGR
Sbjct: 165 FINETVIGEARGPPGKAKARRTVRMNVTIDIVADRVLSNLDDDVSLGKVRLRSFSRIPGR 224

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLH+I R++VVKMNCT +INI NR+IEDQ+CKRKVK+
Sbjct: 225 VKLLHLIGRNVVVKMNCTFLINIFNRSIEDQECKRKVKM 263
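
The percentages on each HSP statistics line are the identical (or positive-scoring) positions divided by the alignment length. The following sketch recomputes them from the raw counts; the regular expression and the function name `recompute_percentages` are hypothetical helpers written for this report layout, not part of BLAST.

```python
import re

# Matches the statistics line of an HSP; "Posi?tives" also tolerates
# a misspelled "Postives" label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def recompute_percentages(stats_line):
    """Return identity and positives percentages recomputed from the counts."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("no Identity/Positives statistics found")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identity": f"{100 * identical / ident_len:.2f}",
        "positives": f"{100 * positive / pos_len:.2f}",
    }

line = "Identity = 159/219 (72.60%), Positives = 193/219 (88.13%), Query Frame = 0"
print(recompute_percentages(line))
# {'identity': '72.60', 'positives': '88.13'}
```

For the Benincasa hispida hit, 159 identical positions over a 219-column alignment gives 72.60%, matching the reported value.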

BLAST of Sgr025797 vs. NCBI nr
Match: XP_011656360.1 (uncharacterized protein LOC105435724 [Cucumis sativus])

HSP 1 Score: 296.2 bits (757), Expect = 2.3e-76
Identity = 156/219 (71.23%), Positives = 191/219 (87.21%), Query Frame = 0

Query: 1   MAEKEQARPLAPAA-HHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQA-VTVIILM 60
           M +K+QA+PL PA  + LSS+N ET LHLKRI+R+RFIKCC  IVALL++   V +IILM
Sbjct: 1   MVDKDQAQPLTPATLNRLSSDNGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQ+KDPII+MN +SIT +ELIN +IPKPG+NVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           +I ET+IGE RGPSG+AKA +T RMN+TIDI+AD++LSNLN  VS GK+ LRSFSRIPG+
Sbjct: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLH I R++VVKMNCT +INI +++IEDQKCKRK+K+
Sbjct: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of Sgr025797 vs. NCBI nr
Match: XP_008458164.1 (PREDICTED: uncharacterized protein LOC103497685 [Cucumis melo])

HSP 1 Score: 292.7 bits (748), Expect = 2.5e-75
Identity = 154/219 (70.32%), Positives = 188/219 (85.84%), Query Frame = 0

Query: 1   MAEKEQARPLAPAA-HHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQA-VTVIILM 60
           M  K+QA+PL PA    LSS+N ET LHLKRI+R+RFIKCC  I ALL++   V +IILM
Sbjct: 1   MVGKDQAQPLTPATLDRLSSDNGETELHLKRIQRKRFIKCCSFIAALLIIPTIVIIIILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQ+KDPII+MN +SIT +ELIN +IPKPG+NVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIRMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           +I ET+IGE RGP G+AKA +T RMN+TIDI+AD++LSNLN  VS GK+ LRSFSRIPG+
Sbjct: 121 FINETVIGEVRGPPGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLH+I R++VVKMNCT +INI +++IEDQKCKRK+K+
Sbjct: 181 VKLLHLIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of Sgr025797 vs. NCBI nr
Match: KAG7013763.1 (Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 290.4 bits (742), Expect = 1.3e-74
Identity = 159/219 (72.60%), Positives = 188/219 (85.84%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAH-HLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVI-ILM 60
           MA+K+QARPLAP  H   SS++ +  LHLKRI+RRRFIK    I+ LL++ +V VI ILM
Sbjct: 1   MADKDQARPLAPTTHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQVKDPII+MN ISIT +ELING+IPKPG+NVSLTADVSVKNPN+ASFKYSNTTTTL
Sbjct: 61  FTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           YI ET+IGEARGP GQAKA RT +MN+TI+I+ D+LL NLN  +SSGKL LRSFSR+PGR
Sbjct: 121 YINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGR 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLHI+RR+IVVKMNCT  INI N++IEDQ CKRKVK+
Sbjct: 181 VKLLHILRRNIVVKMNCTSTINIFNKSIEDQNCKRKVKI 219

BLAST of Sgr025797 vs. NCBI nr
Match: KAG6575200.1 (Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 289.7 bits (740), Expect = 2.1e-74
Identity = 159/219 (72.60%), Positives = 188/219 (85.84%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAH-HLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVI-ILM 60
           MA+K+QARPLAP  H   SS++ +  LHLKRI+RRRFIK    I+ LL++ +V VI ILM
Sbjct: 1   MADKDQARPLAPTIHCRPSSDDYQEQLHLKRIQRRRFIKLFCFIIGLLIILSVIVILILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQVKDPII+MN ISIT +ELING+IPKPG+NVSLTADVSVKNPN+ASFKYSNTTTTL
Sbjct: 61  FTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           YI ET+IGEARGP GQAKA RT +MN+TI+I+ D+LL NLN  +SSGKL LRSFSR+PGR
Sbjct: 121 YINETVIGEARGPPGQAKARRTVQMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGR 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLHI+RR+IVVKMNCT  INI N++IEDQ CKRKVK+
Sbjct: 181 VKLLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI 219

BLAST of Sgr025797 vs. ExPASy TrEMBL
Match: A0A0A0KD33 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G006820 PE=4 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 1.1e-76
Identity = 156/219 (71.23%), Positives = 191/219 (87.21%), Query Frame = 0

Query: 1   MAEKEQARPLAPAA-HHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQA-VTVIILM 60
           M +K+QA+PL PA  + LSS+N ET LHLKRI+R+RFIKCC  IVALL++   V +IILM
Sbjct: 1   MVDKDQAQPLTPATLNRLSSDNGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQ+KDPII+MN +SIT +ELIN +IPKPG+NVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           +I ET+IGE RGPSG+AKA +T RMN+TIDI+AD++LSNLN  VS GK+ LRSFSRIPG+
Sbjct: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLH I R++VVKMNCT +INI +++IEDQKCKRK+K+
Sbjct: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of Sgr025797 vs. ExPASy TrEMBL
Match: A0A1S3C8G8 (uncharacterized protein LOC103497685 OS=Cucumis melo OX=3656 GN=LOC103497685 PE=4 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 1.2e-75
Identity = 154/219 (70.32%), Positives = 188/219 (85.84%), Query Frame = 0

Query: 1   MAEKEQARPLAPAA-HHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQA-VTVIILM 60
           M  K+QA+PL PA    LSS+N ET LHLKRI+R+RFIKCC  I ALL++   V +IILM
Sbjct: 1   MVGKDQAQPLTPATLDRLSSDNGETELHLKRIQRKRFIKCCSFIAALLIIPTIVIIIILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQ+KDPII+MN +SIT +ELIN +IPKPG+NVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIRMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           +I ET+IGE RGP G+AKA +T RMN+TIDI+AD++LSNLN  VS GK+ LRSFSRIPG+
Sbjct: 121 FINETVIGEVRGPPGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLH+I R++VVKMNCT +INI +++IEDQKCKRK+K+
Sbjct: 181 VKLLHLIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of Sgr025797 vs. ExPASy TrEMBL
Match: A0A6J1H4K3 (uncharacterized protein LOC111460339 OS=Cucurbita moschata OX=3662 GN=LOC111460339 PE=4 SV=1)

HSP 1 Score: 287.3 bits (734), Expect = 5.1e-74
Identity = 158/219 (72.15%), Positives = 188/219 (85.84%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAH-HLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVI-ILM 60
           MA+K+QARPLAPA     SS++ +  LHLKRI+RRRFIK    I+ LL++ +V VI IL+
Sbjct: 1   MADKDQARPLAPATDCRPSSDDYQEKLHLKRIQRRRFIKLFCFIIGLLIILSVGVILILI 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQVKDPII+MN ISIT +ELING+IPKPG+NVSLTADVSVKNPN+ASFKYSNTTTTL
Sbjct: 61  FTLFQVKDPIIQMNNISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           YI ET+IGEARGP GQAKA RT RMN+TI+I+ D+LL NLN  +SSGKL LRSFSR+PGR
Sbjct: 121 YINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNSDMSSGKLRLRSFSRVPGR 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VK+LHI+RR+IVVKMNCT  INI N++IEDQ CKRKVK+
Sbjct: 181 VKVLHILRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI 219

BLAST of Sgr025797 vs. ExPASy TrEMBL
Match: A0A6J1L0R6 (uncharacterized protein LOC111499318 OS=Cucurbita maxima OX=3661 GN=LOC111499318 PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 1.3e-72
Identity = 159/219 (72.60%), Positives = 186/219 (84.93%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAH-HLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVI-ILM 60
           MA+K+QARPLA A     SS++ +  LHLK+I+R RFIK    I+ LLV+ +V VI ILM
Sbjct: 1   MADKDQARPLALATDCRPSSDDYQEKLHLKKIQRIRFIKFFCFIICLLVILSVVVILILM 60

Query: 61  FTVFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+FQVKDPII+MN ISIT +ELING+IPKPG+NVSLTADVSVKNPN+ASFKYSNTTTTL
Sbjct: 61  FTLFQVKDPIIQMNKISITKLELINGVIPKPGSNVSLTADVSVKNPNVASFKYSNTTTTL 120

Query: 121 YIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN--VSSGKLSLRSFSRIPGR 180
           YI ET+IGEARGP GQAKA RT RMN+TI+I+ D+LL NLN  +SSGKL LRSFSR+PGR
Sbjct: 121 YINETVIGEARGPPGQAKARRTVRMNLTINIVVDRLLLNLNNDMSSGKLRLRSFSRVPGR 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VKLLHIIRR+IVVKMNCT  INI N++IEDQ CKRKVK+
Sbjct: 181 VKLLHIIRRNIVVKMNCTSTINIFNKSIEDQDCKRKVKI 219

BLAST of Sgr025797 vs. ExPASy TrEMBL
Match: A0A2P5A832 (Immunoglobulin-like fold containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_359160 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 2.8e-72
Identity = 145/219 (66.21%), Positives = 183/219 (83.56%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAHHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQAVTVIILMFT 60
           MAEKEQARPLAPAA   SS++++ +  LK+IRRR+FIKCCG I AL+++QAV +IIL+FT
Sbjct: 1   MAEKEQARPLAPAADRPSSDDDDITAQLKKIRRRKFIKCCGCITALMLIQAVVIIILIFT 60

Query: 61  VFQVKDPIIKMNGISITNVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTLYI 120
           VF+VKDP+IKMN I++T +EL N   PKPGTN+SLTADVSVKNPN+ASFKY NTTTTLY 
Sbjct: 61  VFRVKDPVIKMNKITVTQLELANNTTPKPGTNMSLTADVSVKNPNVASFKYKNTTTTLYY 120

Query: 121 KETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLN----VSSGKLSLRSFSRIPGR 180
              ++GEARGP GQAK  RT RMNIT+DII D+L+S+ N    V SG L++ S+SRIPGR
Sbjct: 121 HGMVVGEARGPPGQAKPKRTMRMNITVDIITDRLMSSPNLVADVGSGLLTMSSYSRIPGR 180

Query: 181 VKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           VK+L+II+RH+VVKMNCT+ +NI ++TI++QKCKRKV L
Sbjct: 181 VKMLNIIKRHVVVKMNCTMKVNISSQTIQEQKCKRKVNL 219

BLAST of Sgr025797 vs. TAIR 10
Match: AT2G46150.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 188.0 bits (476), Expect = 8.1e-48
Identity = 101/223 (45.29%), Positives = 154/223 (69.06%), Query Frame = 0

Query: 1   MAEKEQARPLAPAAHHLSSNNEETSLHLKRIRR-RRFIKCCGSIVALLVVQAVTVIILMF 60
           MA+ E  RPLAPA   +   ++E++ ++K   R R  IKC   + A  ++    V+ L+F
Sbjct: 1   MADSEHVRPLAPAT--ILPVSDESASNIKNTHRSRNRIKCSICVTATSLILTTIVLTLVF 60

Query: 61  TVFQVKDPIIKMNGISITNVELINGI--IPKPGTNVSLTADVSVKNPNMASFKYSNTTTT 120
           TVF+VKDPIIKMNG+ +  ++ + G   +   GTN+S+  DVSVKNPN ASFKYSNTTT 
Sbjct: 61  TVFRVKDPIIKMNGVMVNGLDSVTGTNQVQLLGTNISMIVDVSVKNPNTASFKYSNTTTD 120

Query: 121 LYIKETMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNLNVS-----SGKLSLRSFSR 180
           +Y K T++GEA G  G+A+ HRTSRMN+T+DI+ D++LS+  +      SG +++ S++R
Sbjct: 121 IYYKGTLVGEAHGLPGKARPHRTSRMNVTVDIMLDRILSDPGLGREISRSGLVNVWSYTR 180

Query: 181 IPGRVKLLHIIRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
           + G+VK++ I+++H+ VKMNCT+ +NI  + I+D  CK+K+ L
Sbjct: 181 VGGKVKIMGIVKKHVTVKMNCTMAVNITGQAIQDVDCKKKIDL 221

BLAST of Sgr025797 vs. TAIR 10
Match: AT3G54200.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 101.7 bits (252), Expect = 7.7e-22
Identity = 65/213 (30.52%), Positives = 120/213 (56.34%), Query Frame = 0

Query: 9   PLAPAAHHLSSNNEETSLHLKRIRRRRFIKCCGSIVALLVVQ-AVTVIILMFTVFQVKDP 68
           P  P A  + + +  T    K++RR+R  K C     LL++  A+ ++IL FT+F+ K P
Sbjct: 24  PPKPNASSMETQSANTGT-AKKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKRP 83

Query: 69  IIKMNGISITNVEL-INGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTTTLYIKETMIG 128
              ++ +++  ++  +N ++ K   N++L  D+S+KNPN   F Y +++  L  +  +IG
Sbjct: 84  TTTIDSVTVDRLQASVNPLLLKVLLNLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIG 143

Query: 129 EARGPSGQAKAHRTSRMNITIDIIADQLLSNL----NVSSGKLSLRSFSRIPGRVKLLHI 188
           EA  P+ +  A +T  +NIT+ ++AD+LLS      +V +G + L +F ++ G+V +L I
Sbjct: 144 EAPLPANRIAARKTVPLNITLTLMADRLLSETQLLSDVMAGVIPLNTFVKVTGKVTVLKI 203

Query: 189 IRRHIVVKMNCTLIINIMNRTIEDQKCKRKVKL 216
            +  +    +C L I++ +R +  Q CK   KL
Sbjct: 204 FKIKVQSSSSCDLSISVSDRNVTSQHCKYSTKL 235

BLAST of Sgr025797 vs. TAIR 10
Match: AT4G23610.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 80.1 bits (196), Expect = 2.4e-15
Identity = 63/211 (29.86%), Positives = 113/211 (53.55%), Query Frame = 0

Query: 4   KEQARPLAPAAHHLSSN--NEETSLHLKRIR----RRRFIKCCGSIVALLVVQAVTVIIL 63
           ++QA+PLAP      S+  +EE   H  R +    + + I CCG I +L ++ AVT I+L
Sbjct: 10  EDQAKPLAPLFLTTRSDQPDEEDQYHHDRTKYVHSQTKLILCCGFIASLTMLIAVTFIVL 69

Query: 64  MFTVFQVKDPIIKMNGISIT-NVELINGIIPKPGTNVSLTADVSVKNPNMASFKYSNTTT 123
             TVF +  P + ++ IS     + +NG +     N +++ ++S+ NPN A F   N   
Sbjct: 70  SLTVFHLHSPNLTVDSISFNQRFDFVNGKV-NTNQNTTVSVEISLHNPNPALFIVKNVNV 129

Query: 124 TLYIKE-TMIGEARGPSGQAKAHRTSRMNITIDIIADQLLSNL-----NVSSGKLSLRSF 183
           + Y  E  ++GE+   S    A RT +MN+T +I+  +LL++L     +++   + L+S 
Sbjct: 130 SFYHGELVVVGESIRRSETIPAKRTVKMNLTAEIVKTKLLASLPGLMEDLNGRGVDLKSS 189

Query: 184 SRIPGRVKLLHIIRRHIVVKMNCTLIINIMN 202
             + GRVK + I R+ + ++ +C + +   N
Sbjct: 190 VEVRGRVKKMKIFRKTVHLQTDCFMKMTTNN 219

BLAST of Sgr025797 vs. TAIR 10
Match: AT3G05975.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 68.9 bits (167), Expect = 5.5e-12
Identity = 49/188 (26.06%), Positives = 96/188 (51.06%), Query Frame = 0

Query: 33  RRRFIKCCGSIVALLVVQAVTVIILMFTVFQVKDPIIKMNGISITNVELINGIIPKPGTN 92
           +RR       I+ +L V  +T +IL   VF+ K PI++    ++  +     +  +   N
Sbjct: 3   KRRICCIVSGIIFVLFVIFMTALILA-QVFKPKHPILQTVSSTVDGISTNISLPYEVQLN 62

Query: 93  VSLTADVSVKNPNMASFKYSNTTTTLYIKETMIGEARGPSGQAKAHRTSRMNITIDIIAD 152
            +LT ++ +KNPN+A F+Y      +Y ++T++G    PS    A  +  +   + +  D
Sbjct: 63  FTLTLEMLLKNPNVADFEYKTVENLVYYRDTLVGNLTLPSSTLPAKGSVLLPCPLFLQLD 122

Query: 153 QLLSNL-----NVSSGKLSLRSFSRIPGRVKLLHIIRRHIVVKMNCTLIINIMNRTIEDQ 212
           + ++NL     +V  GK+ + + +++PG++ LL I +  +    +C L++   +  +EDQ
Sbjct: 123 KFVANLGDIVQDVLHGKIVMETRAKMPGKITLLGIFKIPLDSISHCNLVLGFPSMVVEDQ 182

Query: 213 KCKRKVKL 216
            C  K KL
Sbjct: 183 VCDLKTKL 189

BLAST of Sgr025797 vs. TAIR 10
Match: AT1G64450.1 (Glycine-rich protein family)

HSP 1 Score: 45.4 bits (106), Expect = 6.5e-05
Identity = 31/121 (25.62%), Positives = 57/121 (47.11%), Query Frame = 0

Query: 29  KRIRRRRFIKCCGSIVALLVVQAVTVIILMFTVFQVKDPIIKMNGISITNVELINGIIPK 88
           +R   R  +  C      L++  V ++++ FTVF+ KDP I +N + + +  + N     
Sbjct: 8   RRSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSFAVSNNT--- 67

Query: 89  PGTNVSLTADVSVKNPNMASFKYSNTTTTLYIKETMIGEARGPSGQAKAHRTSRMNITID 148
              N S +  V+V+NPN A F + +++  L      +G    P+G+  + R   M  T  
Sbjct: 68  --ANFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRIQYMAATFT 123

Query: 149 I 150
           +
Sbjct: 128 V 123

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038875202.1  2.9e-79  72.60         uncharacterized protein LOC120067718 [Benincasa hispida]
XP_011656360.1  2.3e-76  71.23         uncharacterized protein LOC105435724 [Cucumis sativus]
XP_008458164.1  2.5e-75  70.32         PREDICTED: uncharacterized protein LOC103497685 [Cucumis melo]
KAG7013763.1    1.3e-74  72.60         Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argy...
KAG6575200.1    2.1e-74  72.60         Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. soro...

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0KD33      1.1e-76  71.23         LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G006820 PE=4 ...
A0A1S3C8G8      1.2e-75  70.32         uncharacterized protein LOC103497685 OS=Cucumis melo OX=3656 GN=LOC103497685 PE=...
A0A6J1H4K3      5.1e-74  72.15         uncharacterized protein LOC111460339 OS=Cucurbita moschata OX=3662 GN=LOC1114603...
A0A6J1L0R6      1.3e-72  72.60         uncharacterized protein LOC111499318 OS=Cucurbita maxima OX=3661 GN=LOC111499318...
A0A2P5A832      2.8e-72  66.21         Immunoglobulin-like fold containing protein OS=Parasponia andersonii OX=3476 GN=...

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G46150.1     8.1e-48  45.29         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G54200.1     7.7e-22  30.52         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT4G23610.1     2.4e-15  29.86         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G05975.1     5.5e-12  26.06         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT1G64450.1     6.5e-05  25.62         Glycine-rich protein family

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                      Source       Source Term      Source Description                                                         Alignment
IPR004864  Late embryogenesis abundant protein, LEA_2 subgroup  PFAM         PF03168          LEA_2                                                                      coord: 99..193; e-value: 1.6E-9; score: 38.3
None       No IPR available                                     GENE3D       2.60.40.1820     -                                                                          coord: 48..184; e-value: 2.1E-9; score: 39.4
None       No IPR available                                     PANTHER      PTHR31852:SF212  LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 17..214
None       No IPR available                                     PANTHER      PTHR31852        LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 17..214
None       No IPR available                                     SUPERFAMILY  117070           LEA14-like                                                                 coord: 60..157

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr025797.1   Sgr025797.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane