Sgr028804 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr028804
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: glycosyltransferase family 64 protein C4
Location: tig00153206: 2228640 .. 2231126 (-)
RNA-Seq Expression: Sgr028804
Synteny: Sgr028804
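
The Location field packs a contig name, a start, an end, and a strand into one string. The following is a minimal sketch of how such a string might be parsed, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into (contig, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

contig, start, end, strand = parse_location("tig00153206: 2228640 .. 2231126 (-)")

# Span of the gene on the contig, assuming inclusive coordinates.
span = end - start + 1
print(contig, strand, span)
```

The "(-)" strand means the sequences listed below are presumably reported as the reverse complement of the contig.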
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGCGGCTCCGGCAATTCGCCGTCACAACCGTCGGATCCATCAAGATCAAGCTTCTCCTCTGTTGCTGCATCGGATTTGCCGTCGTCCTCTTCGCAGGCCGCGGCTCGGATCTCATGGGATGGACCGGCGACCGCGCTACGGTGGACCGATACTCTACGTCGCGGTTCGTACTGTGATCTTTCTCCCGAACCATTAACAATTTTCTTTTAACTCATGAATCGCTCGCCTATACATTATATGATCAAGATTTCTTGATCTCCATTCTGATTTTCAACAAACTTTGAATAATGTAGGAATCGTTATGCAATCGTAATCAACACATGGAAAAGATACGATCTCCTGAAGAGGTCCATTTCTCACTACACGACGTGTTCGGGAGTTGAGTCGATACATATCGTATGGAGCGAACCCGATCCTCCGACAGAATCTCTGGTAACTTTCCTGCAACAGACGGCGGAGGCGAATTCCGGACATGGGCGAGAAGCGGAGTTGAGATTTGAGATCAACGAAGAAGATAGCTTGAACAATCGGTTTAAAGAAATAAAGGGTTTGAAGACGGAGGCCATTTTTTCAGTGGACGATGATGTTATATTTCCTTGTTCTTCGGTGGAGTTTGCTTTCAGCGTTTGGCAGAGCGCGCCTGAGACGATGGTTGGGTTTGTGCCCCGCATGCATTGGCTTGATCGCTCGGTATGTTACTTTCATTTTGTGTGCATGTTCATGATATCTTTAGCTGCAATTTTTTTTTTTTTTTTTTTATACTTTCAAAAGTTCTTTTTAGTTTCCATAACACTTCTTCAATTTTCTTACACCGTTAGTTTATGAAAATTCGATACTCAGACCAGATATATGTTGTCAAAACTCGCGACTTTGTATAGTCATGAAAATCTATTTTGTTTATTATTGTTTAATTTTCTTAAACATTCTTAAAAATTAAAATCTTACACTTTATGTACATGTGACTTGAACAAACAAGTTTGCATAGTGAAAATATACTTCCGACGTTGTTAAATTTTTCTAGACATTCATAAAAATTTTAGGCTATTCTCCGTTCTAAAAACCCAACATTCTGAAGAGGTATATTTTTTTTACTTAAACAGACATGTCTATCTATTTGAAATCTATTTTGGTCTCTACTGTTAAAGCTTTCATGAACTATTCATAAAACTCTTAGGTTATTCTGCTCTCGAAACCCAATACTTGATTAGATACATTTGACCTAAACACACAAGCGTATGTAGTGAAAGTCTAATTTGGTCGTGATTGTTAAATTCTTCTTACCCTTCATAAAAATCTTAGACTTAATCCTTGACCAGAAACATTTTGACCTAAATATACAAAACCCTTAGACGATTTCTTTCTGAAAACCCAATATACTCAGATCAAATACGTGCGACTTAAACCTACAAATCAACATAGTGTCATGTAAGATTTTTGTTAAATCTTAAATGTAATGGAGAAATTTAATGAAGTATGAACAAAAGTTTTAAGTATGAAAATAGGGTTTAAAGTTGAAATGTTTGGTTAATGTTGTTGAGTAGTAATGCAGAAGGGAGGCATGGGGGGATACAGATATGGGGGTTGGTGGTCAGTGTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAGGCTGCATTTTTTCACTCAAAGTATTTGGGCATTTACACTAACCACATGCCTGCTTCCATCAGAGATTATGTTACCAAAAACAGGTTTCTTTTCTTTTCTTTTCCTTTTTTCCTTTCCTTTCTGAACCTTTCTTCTGTCAAATTGTCATCTTGAGATTGAATTTTGCAGGAACTGTGAAGACATTGCAATGGCCTTTCTTGTTGCTAATGCAAGTGGTACTCCCCCTATATGGGTCAAAGGTTTGAAATGATCAAATTCTAATTTTAATCAATTTTATTTTAAGGTGTAAATGAGTTGACTATTTTAGATTTTTGGTCAAAACTGACGTCAAACATATTATGTTGTTTTACAAAAATCGAAATTCAA
TATTTTCTAACCTTGAAGTGAGTAATTTGGTTAAAAAAACCAACCAACCAAAATCAAACATCATAATATTTTTCGATGTTGTAATTATATAAAATAAAACATTCAACACTACATAACTAAATCGAACCGAACTAGTTTTGATTTATCTTTGATCACTTATATTGTCAACCTCAACCTCAAGAAATGCAGTGAGAATTCTCATCTCATGATCTAAATATGTTTATCAGCCAAAGATGTGAATTTCCCTTGTCGATTCAAAACTATATAAGCAGTAGCTCCCTTATCATTGAAGCTCATTGATGTGTATTCTTTTTTGGTAATTGTAGGGAAGATATTTGAAATAGGCTCAAGTGGAATCAGTAGCATAGGAGGTCACAGTGAAAGAAGAAGCCAATGCGTCAATAGGTTTGTTGCAGAGTATGGTGGAATGCCTTTGCTATCTTCAACCGTGAAGGCTGTTGATAGTCGTAACATTTGGTTTTGGTAA

mRNA sequence

ATGCAGCGGCTCCGGCAATTCGCCGTCACAACCGTCGGATCCATCAAGATCAAGCTTCTCCTCTGTTGCTGCATCGGATTTGCCGTCGTCCTCTTCGCAGGCCGCGGCTCGGATCTCATGGGATGGACCGGCGACCGCGCTACGGTGGACCGATACTCTACGTCGCGGAATCGTTATGCAATCGTAATCAACACATGGAAAAGATACGATCTCCTGAAGAGGTCCATTTCTCACTACACGACGTGTTCGGGAGTTGAGTCGATACATATCGTATGGAGCGAACCCGATCCTCCGACAGAATCTCTGGTAACTTTCCTGCAACAGACGGCGGAGGCGAATTCCGGACATGGGCGAGAAGCGGAGTTGAGATTTGAGATCAACGAAGAAGATAGCTTGAACAATCGGTTTAAAGAAATAAAGGGTTTGAAGACGGAGGCCATTTTTTCAGTGGACGATGATGTTATATTTCCTTGTTCTTCGGTGGAGTTTGCTTTCAGCGTTTGGCAGAGCGCGCCTGAGACGATGGTTGGGTTTGTGCCCCGCATGCATTGGCTTGATCGCTCGAAGGGAGGCATGGGGGGATACAGATATGGGGGTTGGTGGTCAGTGTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAGGCTGCATTTTTTCACTCAAAGTATTTGGGCATTTACACTAACCACATGCCTGCTTCCATCAGAGATTATGTTACCAAAAACAGGAACTGTGAAGACATTGCAATGGCCTTTCTTGTTGCTAATGCAAGTGGTACTCCCCCTATATGGGTCAAAGGGAAGATATTTGAAATAGGCTCAAGTGGAATCAGTAGCATAGGAGGTCACAGTGAAAGAAGAAGCCAATGCGTCAATAGGTTTGTTGCAGAGTATGGTGGAATGCCTTTGCTATCTTCAACCGTGAAGGCTGTTGATAGTCGTAACATTTGGTTTTGGTAA

Coding sequence (CDS)

ATGCAGCGGCTCCGGCAATTCGCCGTCACAACCGTCGGATCCATCAAGATCAAGCTTCTCCTCTGTTGCTGCATCGGATTTGCCGTCGTCCTCTTCGCAGGCCGCGGCTCGGATCTCATGGGATGGACCGGCGACCGCGCTACGGTGGACCGATACTCTACGTCGCGGAATCGTTATGCAATCGTAATCAACACATGGAAAAGATACGATCTCCTGAAGAGGTCCATTTCTCACTACACGACGTGTTCGGGAGTTGAGTCGATACATATCGTATGGAGCGAACCCGATCCTCCGACAGAATCTCTGGTAACTTTCCTGCAACAGACGGCGGAGGCGAATTCCGGACATGGGCGAGAAGCGGAGTTGAGATTTGAGATCAACGAAGAAGATAGCTTGAACAATCGGTTTAAAGAAATAAAGGGTTTGAAGACGGAGGCCATTTTTTCAGTGGACGATGATGTTATATTTCCTTGTTCTTCGGTGGAGTTTGCTTTCAGCGTTTGGCAGAGCGCGCCTGAGACGATGGTTGGGTTTGTGCCCCGCATGCATTGGCTTGATCGCTCGAAGGGAGGCATGGGGGGATACAGATATGGGGGTTGGTGGTCAGTGTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAGGCTGCATTTTTTCACTCAAAGTATTTGGGCATTTACACTAACCACATGCCTGCTTCCATCAGAGATTATGTTACCAAAAACAGGAACTGTGAAGACATTGCAATGGCCTTTCTTGTTGCTAATGCAAGTGGTACTCCCCCTATATGGGTCAAAGGGAAGATATTTGAAATAGGCTCAAGTGGAATCAGTAGCATAGGAGGTCACAGTGAAAGAAGAAGCCAATGCGTCAATAGGTTTGTTGCAGAGTATGGTGGAATGCCTTTGCTATCTTCAACCGTGAAGGCTGTTGATAGTCGTAACATTTGGTTTTGGTAA

Protein sequence

MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYAIVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVTKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGGMPLLSSTVKAVDSRNIWFW
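
The protein sequence above should follow from the CDS by translation with the standard genetic code. As a rough check, the sketch below translates only the first 30 and last 15 nucleotides of the CDS listed on this page; the helper `translate` is illustrative and stops at the first stop codon.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGCAGCGGCTCCGGCAATTCGCCGTCACA"  # first 30 nt of the CDS
cds_end = "ATTTGGTTTTGGTAA"                   # last 15 nt, ending in the stop codon TAA

print(translate(cds_start))  # MQRLRQFAVT
print(translate(cds_end))    # IWFW
```

Both fragments agree with the start (MQRLRQFAVT) and end (IWFW) of the protein sequence listed above.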
Homology
BLAST of Sgr028804 vs. NCBI nr
Match: XP_038893187.1 (glycosyltransferase family 64 protein C4 [Benincasa hispida])

HSP 1 Score: 534.6 bits (1376), Expect = 5.6e-148
Identity = 261/321 (81.31%), Positives = 284/321 (88.47%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTG-DRATVDRYSTSRNRY 60
           +QRL Q AVT    IKIKLLLCCCIG AV+LF  R SDLMGWT  D A+  RYST R RY
Sbjct: 12  IQRLWQVAVT----IKIKLLLCCCIGLAVILFTARASDLMGWTSDDGASALRYSTPRKRY 71

Query: 61  AIVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGRE 120
           AI++NTWKRYDLLK+SISHYTTC GVESIHIVWSEP PP  SLV+FLQ TA+ANS  GRE
Sbjct: 72  AILMNTWKRYDLLKKSISHYTTCLGVESIHIVWSEPAPPPVSLVSFLQGTAKANSRDGRE 131

Query: 121 AELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFV 180
           AELRFE+NEEDSLNNRFKEIKGLKTEAIFSVDDDVIF CS++EFAFSVWQSAP+TMVGFV
Sbjct: 132 AELRFEMNEEDSLNNRFKEIKGLKTEAIFSVDDDVIFGCSTLEFAFSVWQSAPQTMVGFV 191

Query: 181 PRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYV 240
           PRMHW+DRSKG MG +RYGGWWSVWWTGTYSMVLSKAAFFHSKYL IYTNHMP+SIRDY+
Sbjct: 192 PRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLSIYTNHMPSSIRDYI 251

Query: 241 TKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYG 300
           TKNRNCEDIAM+FLVAN SG PP+WV+GKI+EIGSSGISS+GGHSERRSQC+N FV EYG
Sbjct: 252 TKNRNCEDIAMSFLVANVSGAPPVWVQGKIYEIGSSGISSLGGHSERRSQCLNIFVEEYG 311

Query: 301 G-MPLLSSTVKAVDSRNIWFW 320
           G MPLL ST+KAVDSR  W W
Sbjct: 312 GIMPLLPSTLKAVDSRQFWSW 328
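
The percentages in an HSP statistics line are the match counts divided by the alignment length reported in the same line. A small sketch, using the figures of the Benincasa hispida hit above, recomputes them; the function name `hsp_percentages` is made up for illustration.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: (matches, alignment_length, percent)} for each 'Label = a/b' field."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        matches, length = int(matches), int(length)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result

stats = hsp_percentages("Identity = 261/321 (81.31%), Positives = 284/321 (88.47%)")
print(stats["Identity"])   # (261, 321, 81.31)
print(stats["Positives"])  # (284, 321, 88.47)
```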

BLAST of Sgr028804 vs. NCBI nr
Match: XP_022156919.1 (glycosyltransferase family 64 protein C4 [Momordica charantia])

HSP 1 Score: 526.2 bits (1354), Expect = 2.0e-145
Identity = 259/322 (80.43%), Positives = 286/322 (88.82%), Query Frame = 0

Query: 2   QRLRQFAVTTVGSIKIK-LLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 61
           QRLRQ+  T +GS+KIK +LL CCIGFAVV+FA R  +L+ WT  R  V+ +S  R RYA
Sbjct: 12  QRLRQW--TALGSMKIKIILLFCCIGFAVVVFAARAPNLVVWTAGRLMVEPFSLPRKRYA 71

Query: 62  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANS-GHGRE 121
           I+INTWKRYDLLK+SI HY+TC GVESIHIVWSEP+PP+ESLVTFL+Q A+ NS  HGRE
Sbjct: 72  ILINTWKRYDLLKQSIFHYSTCLGVESIHIVWSEPEPPSESLVTFLRQRAKENSRRHGRE 131

Query: 122 A--ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVG 181
              ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCS+VEFAFSVWQSAPETMVG
Sbjct: 132 LEDELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSTVEFAFSVWQSAPETMVG 191

Query: 182 FVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRD 241
           FVPRMHW D S+G MG +RYGGWWSVWW+GTYSMVLSKAAFFHSKYL IY+N+MPASIRD
Sbjct: 192 FVPRMHWPDHSEGEMGRFRYGGWWSVWWSGTYSMVLSKAAFFHSKYLAIYSNNMPASIRD 251

Query: 242 YVTKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAE 301
           YV+KNRNCEDIAM+FLVAN SGTPPIWVKGKIFEIGSSGISS+GGH+ERRS+CVNRFV E
Sbjct: 252 YVSKNRNCEDIAMSFLVANVSGTPPIWVKGKIFEIGSSGISSLGGHNERRSECVNRFVEE 311

Query: 302 YGGMPLLSSTVKAVDSRNIWFW 320
           YGGMPLL STVKAVDSR  WFW
Sbjct: 312 YGGMPLLPSTVKAVDSRYTWFW 331

BLAST of Sgr028804 vs. NCBI nr
Match: XP_030490104.1 (glycosyltransferase family 64 protein C4 [Cannabis sativa] >KAF4386121.1 hypothetical protein F8388_016373 [Cannabis sativa] >KAF4395708.1 hypothetical protein G4B88_013482 [Cannabis sativa])

HSP 1 Score: 514.2 bits (1323), Expect = 7.9e-142
Identity = 234/319 (73.35%), Positives = 279/319 (87.46%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 60
           +QR RQ A++TVGS+KIKLLLCCCIGF+++  A R    +GWTG   ++   S SR  Y+
Sbjct: 11  VQRFRQVAISTVGSLKIKLLLCCCIGFSLIAVASRAPAFLGWTGSSISMPPISDSRKGYS 70

Query: 61  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 120
           IV+NTWKRYDLLK+SISHYTTCSG++SIHIVWSEPDPP++SL  FL    E+N+  GR  
Sbjct: 71  IVMNTWKRYDLLKQSISHYTTCSGLDSIHIVWSEPDPPSDSLKKFLNHIVESNAKDGRLV 130

Query: 121 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 180
           EL+F+IN+EDSLNNRFKEI  LKT+AIFS+DDDVIFPCSSVEFAFSVWQSAP+TMVG+VP
Sbjct: 131 ELKFDINKEDSLNNRFKEITDLKTDAIFSIDDDVIFPCSSVEFAFSVWQSAPDTMVGYVP 190

Query: 181 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 240
           R+HW+D +K  +G Y YGGWWSVWWTG+YSMVLSKAAFFH KYL +YTN MPASIR+Y+T
Sbjct: 191 RIHWVDSTKDNLGSYIYGGWWSVWWTGSYSMVLSKAAFFHKKYLSLYTNEMPASIREYIT 250

Query: 241 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 300
           KNRNCEDIAM+FLVANA+G PPIWVKGKIFEIGS+GISS+GGHSE+R++CVNRFVAE+GG
Sbjct: 251 KNRNCEDIAMSFLVANATGAPPIWVKGKIFEIGSTGISSLGGHSEKRTECVNRFVAEFGG 310

Query: 301 MPLLSSTVKAVDSRNIWFW 320
           MPL+S++VKAVDSR IWFW
Sbjct: 311 MPLVSTSVKAVDSRKIWFW 329

BLAST of Sgr028804 vs. NCBI nr
Match: KAG7031573.1 (Glycosyltransferase family 64 protein C4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 511.5 bits (1316), Expect = 5.1e-141
Identity = 248/321 (77.26%), Positives = 280/321 (87.23%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDR-ATVDRYSTSRNRY 60
           +QRLRQ AVT    IKIK LLCCCI  A VLFA R S+LMGWT D  AT  RYST R RY
Sbjct: 12  IQRLRQIAVT----IKIKHLLCCCIVLAFVLFATRASNLMGWTSDEDATALRYSTPRKRY 71

Query: 61  AIVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGRE 120
           AI++NTWKR+DLLK+SISHYTTC GVESIHIVWSEPDPP +SLV FLQ+T + NS + RE
Sbjct: 72  AILMNTWKRHDLLKQSISHYTTCMGVESIHIVWSEPDPPPDSLVAFLQRTVKENSRNDRE 131

Query: 121 AELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFV 180
            ELRFE+N+EDSLNNRFKEIK L T+AIFSVDDDVIF CS++EFAF+VWQSA ETMVGFV
Sbjct: 132 IELRFEMNKEDSLNNRFKEIKNLNTDAIFSVDDDVIFACSTLEFAFTVWQSASETMVGFV 191

Query: 181 PRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYV 240
           PRMHW+D+++  MG Y YGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYT+HMPASIR+YV
Sbjct: 192 PRMHWIDQAQEEMGKYIYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTHHMPASIRNYV 251

Query: 241 TKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYG 300
           TKNRNCEDIAM+F+VAN SG PP+WVKG+I+EIGS GISS+GGHSERRS+CVN FV EYG
Sbjct: 252 TKNRNCEDIAMSFVVANVSGAPPVWVKGRIYEIGSGGISSLGGHSERRSKCVNTFVEEYG 311

Query: 301 G-MPLLSSTVKAVDSRNIWFW 320
           G MPL+ STVKAVDSR++WFW
Sbjct: 312 GMMPLVPSTVKAVDSRHLWFW 328

BLAST of Sgr028804 vs. NCBI nr
Match: PON97367.1 (Exostosin, C-terminal [Trema orientale])

HSP 1 Score: 511.1 bits (1315), Expect = 6.7e-141
Identity = 233/319 (73.04%), Positives = 275/319 (86.21%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 60
           +QR RQ A++TVGS+KIKL+LCCCIG  ++  A R    +GWTG   ++   S SR  YA
Sbjct: 11  VQRFRQVAISTVGSLKIKLVLCCCIGLTLIAVASRAPGFLGWTGPSVSMPPISDSRKGYA 70

Query: 61  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 120
           IV+NTWKRYDLLK+SISHYTTCSG++SIHIVWSEPDPP++S+   L    E+N+  GR+ 
Sbjct: 71  IVMNTWKRYDLLKQSISHYTTCSGLDSIHIVWSEPDPPSDSVKKVLNHILESNARDGRQV 130

Query: 121 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 180
           EL+F+IN EDSLNNRFKEI  LKT+AIFS+DDDVIFPCSSVEFAF+VWQSAP+TMVGFVP
Sbjct: 131 ELKFDINTEDSLNNRFKEINDLKTDAIFSIDDDVIFPCSSVEFAFNVWQSAPDTMVGFVP 190

Query: 181 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 240
           R+HW+D +K  MG Y YGGWWSVWWTG+YSMVLSKAAFFH KYL +YTN MPASIR+Y+T
Sbjct: 191 RIHWVDSTKNNMGRYIYGGWWSVWWTGSYSMVLSKAAFFHKKYLSLYTNEMPASIREYIT 250

Query: 241 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 300
           KNRNCEDIAM+FLVANA+G PPIWVKGKIFEIGS+GISS+GGHSERR++CVNRFV EYGG
Sbjct: 251 KNRNCEDIAMSFLVANATGAPPIWVKGKIFEIGSTGISSLGGHSERRTECVNRFVTEYGG 310

Query: 301 MPLLSSTVKAVDSRNIWFW 320
           MPL+S++VKAVDSR IWFW
Sbjct: 311 MPLVSTSVKAVDSRKIWFW 329

BLAST of Sgr028804 vs. ExPASy Swiss-Prot
Match: Q9LY62 (Glycosylinositol phosphorylceramide mannosyl transferase 1 OS=Arabidopsis thaliana OX=3702 GN=GMT1 PE=1 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 5.2e-120
Identity = 202/319 (63.32%), Positives = 251/319 (78.68%), Query Frame = 0

Query: 2   QRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDR-ATVDRYSTSRNRYA 61
           Q+LR+F    V +   K LL CCI F +V    R S    W     A  DR S SR  Y 
Sbjct: 22  QKLRKF----VTARSTKFLLFCCIAFVLVTIVCRSS--RPWVNSSIAVADRISGSRKGYT 81

Query: 62  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 121
           +++NTWKRYDLLK+S+SHY +CS ++SIHIVWSEP+PP+ESL  +L    +  +  G E 
Sbjct: 82  LLMNTWKRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLKEYLHNVLKKKTRDGHEV 141

Query: 122 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 181
           ELRF+IN+EDSLNNRFKEIK LKT+A+FS+DDD+IFPC +V+FAF+VW+SAP+TMVGFVP
Sbjct: 142 ELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDFAFNVWESAPDTMVGFVP 201

Query: 182 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 241
           R+HW ++S      Y Y GWWSVWW+GTYSMVLSKAAFFH KYL +YTN MPASIR++ T
Sbjct: 202 RVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKYLSLYTNSMPASIREFTT 261

Query: 242 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 301
           KNRNCEDIAM+FL+ANA+  P IWVKGKI+EIGS+GISSIGGH+E+R+ CVNRFVAE+G 
Sbjct: 262 KNRNCEDIAMSFLIANATNAPAIWVKGKIYEIGSTGISSIGGHTEKRTHCVNRFVAEFGK 321

Query: 302 MPLLSSTVKAVDSRNIWFW 320
           MPL+ +++KAVDSRN+WFW
Sbjct: 322 MPLVYTSMKAVDSRNLWFW 334

BLAST of Sgr028804 vs. ExPASy Swiss-Prot
Match: P70428 (Exostosin-2 OS=Mus musculus OX=10090 GN=Ext2 PE=1 SV=2)

HSP 1 Score: 129.4 bits (324), Expect = 7.2e-29
Identity = 83/253 (32.81%), Positives = 132/253 (52.17%), Query Frame = 0

Query: 59  YAIVINTWKRYDLLKRSISHYTTCSGVESIHIVWS--EPDPPTESLVTFLQQTAEANSGH 118
           +  ++ T+ R + L R I+  +    +  + +VW+    +PP ESL   ++         
Sbjct: 456 FTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEESLWPKIR--------- 515

Query: 119 GREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCS-SVEFAFSVWQSAPETM 178
                L+     E+ L+NRF     ++TEA+ ++DDD+I   S  ++F + VW+  P+ +
Sbjct: 516 ---VPLKVVRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRL 575

Query: 179 VGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASI 238
           VG+  R+H  D     M  ++Y       WT   SMVL+ AAF+H  +  +YT  MP  I
Sbjct: 576 VGYPGRLHLWDHE---MNKWKY----ESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDI 635

Query: 239 RDYVTKNRNCEDIAMAFLVANASGTPPIWV----KGKIFEIGS-SGISSIGGHSERRSQC 298
           +++V  + NCEDIAM FLVAN +G   I V    K K  E  +  G+S    H   RS+C
Sbjct: 636 KNWVDTHMNCEDIAMNFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSEC 689

Query: 299 VNRFVAEYGGMPL 304
           +N+F + +G MPL
Sbjct: 696 INKFASVFGTMPL 689

BLAST of Sgr028804 vs. ExPASy Swiss-Prot
Match: O77783 (Exostosin-2 OS=Bos taurus OX=9913 GN=EXT2 PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 1.6e-28
Identity = 82/253 (32.41%), Positives = 132/253 (52.17%), Query Frame = 0

Query: 59  YAIVINTWKRYDLLKRSISHYTTCSGVESIHIVWS--EPDPPTESLVTFLQQTAEANSGH 118
           +  ++ T+ R + L R I+  +    +  + +VW+    +PP +SL   ++         
Sbjct: 456 FTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEDSLWPKIR--------- 515

Query: 119 GREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCS-SVEFAFSVWQSAPETM 178
                L+     E+ L+NRF     ++TEA+ ++DDD+I   S  ++F + VW+  P+ +
Sbjct: 516 ---VPLKVVRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRL 575

Query: 179 VGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASI 238
           VG+  R+H  D     M  ++Y       WT   SMVL+ AAF+H  +  +YT  MP  I
Sbjct: 576 VGYPGRLHLWDHE---MNKWKY----ESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDI 635

Query: 239 RDYVTKNRNCEDIAMAFLVANASGTPPIWV----KGKIFEIGS-SGISSIGGHSERRSQC 298
           +++V  + NCEDIAM FLVAN +G   I V    K K  E  +  G+S    H   RS+C
Sbjct: 636 KNWVDAHMNCEDIAMNFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSEC 689

Query: 299 VNRFVAEYGGMPL 304
           +N+F + +G MPL
Sbjct: 696 INKFASVFGTMPL 689

BLAST of Sgr028804 vs. ExPASy Swiss-Prot
Match: Q93063 (Exostosin-2 OS=Homo sapiens OX=9606 GN=EXT2 PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 1.6e-28
Identity = 82/253 (32.41%), Positives = 132/253 (52.17%), Query Frame = 0

Query: 59  YAIVINTWKRYDLLKRSISHYTTCSGVESIHIVWS--EPDPPTESLVTFLQQTAEANSGH 118
           +  ++ T+ R + L R I+  +    +  + +VW+    +PP +SL   ++         
Sbjct: 456 FTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEDSLWPKIR--------- 515

Query: 119 GREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCS-SVEFAFSVWQSAPETM 178
                L+     E+ L+NRF     ++TEA+ ++DDD+I   S  ++F + VW+  P+ +
Sbjct: 516 ---VPLKVVRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRL 575

Query: 179 VGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASI 238
           VG+  R+H  D     M  ++Y       WT   SMVL+ AAF+H  +  +YT  MP  I
Sbjct: 576 VGYPGRLHLWDHE---MNKWKY----ESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDI 635

Query: 239 RDYVTKNRNCEDIAMAFLVANASGTPPIWV----KGKIFEIGS-SGISSIGGHSERRSQC 298
           +++V  + NCEDIAM FLVAN +G   I V    K K  E  +  G+S    H   RS+C
Sbjct: 636 KNWVDAHMNCEDIAMNFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSEC 689

Query: 299 VNRFVAEYGGMPL 304
           +N+F + +G MPL
Sbjct: 696 INKFASVFGTMPL 689

BLAST of Sgr028804 vs. ExPASy Swiss-Prot
Match: Q9WVL6 (Exostosin-like 3 OS=Mus musculus OX=10090 GN=Extl3 PE=1 SV=2)

HSP 1 Score: 127.1 bits (318), Expect = 3.6e-28
Identity = 88/265 (33.21%), Positives = 134/265 (50.57%), Query Frame = 0

Query: 53  STSRNRYAIVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEA 112
           +  R ++ +V+ T++R ++L  S+        +  + +VW+ P  P+E L+         
Sbjct: 656 NVQREQFTVVMLTYEREEVLMNSLERLNGLPYLNKVVVVWNSPKLPSEDLLW-------P 715

Query: 113 NSGHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAP 172
           + G      +     E++SLNNRF     ++TEAI S+DDD       + F F VW+ A 
Sbjct: 716 DIG----VPIMVVRTEKNSLNNRFLPWNEIETEAILSIDDDAHLRHDEIMFGFRVWREAR 775

Query: 173 ETMVGFVPRMHWLDRSKGGMGGYRYGGW-WSVWWTGTYSMVLSKAAFFHSKYLGIYTNHM 232
           + +VGF  R H  D          +  W ++  ++   SMVL+ AAFFH  Y  +Y+  M
Sbjct: 776 DRIVGFPGRYHAWD--------IPHQSWLYNSNYSCELSMVLTGAAFFHKYYAYLYSYVM 835

Query: 233 PASIRDYVTKNRNCEDIAMAFLVANASGTPPIWVKGK-IFEIGS--SGISSIGGHSERRS 292
           P +IRD V +  NCEDIAM FLV++ +  PPI V  +  F        +S    H   R 
Sbjct: 836 PQAIRDMVDEYINCEDIAMNFLVSHITRKPPIKVTSRWTFRCPGCPQALSHDDSHFHERH 895

Query: 293 QCVNRFVAEYGGMPLLSSTVKAVDS 314
           +C+N FV  YG MPLL +  + VDS
Sbjct: 896 KCINFFVKVYGYMPLLYTQFR-VDS 900

BLAST of Sgr028804 vs. ExPASy TrEMBL
Match: A0A6J1DV09 (glycosyltransferase family 64 protein C4 OS=Momordica charantia OX=3673 GN=LOC111023748 PE=3 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 9.7e-146
Identity = 259/322 (80.43%), Positives = 286/322 (88.82%), Query Frame = 0

Query: 2   QRLRQFAVTTVGSIKIK-LLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 61
           QRLRQ+  T +GS+KIK +LL CCIGFAVV+FA R  +L+ WT  R  V+ +S  R RYA
Sbjct: 12  QRLRQW--TALGSMKIKIILLFCCIGFAVVVFAARAPNLVVWTAGRLMVEPFSLPRKRYA 71

Query: 62  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANS-GHGRE 121
           I+INTWKRYDLLK+SI HY+TC GVESIHIVWSEP+PP+ESLVTFL+Q A+ NS  HGRE
Sbjct: 72  ILINTWKRYDLLKQSIFHYSTCLGVESIHIVWSEPEPPSESLVTFLRQRAKENSRRHGRE 131

Query: 122 A--ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVG 181
              ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCS+VEFAFSVWQSAPETMVG
Sbjct: 132 LEDELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSTVEFAFSVWQSAPETMVG 191

Query: 182 FVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRD 241
           FVPRMHW D S+G MG +RYGGWWSVWW+GTYSMVLSKAAFFHSKYL IY+N+MPASIRD
Sbjct: 192 FVPRMHWPDHSEGEMGRFRYGGWWSVWWSGTYSMVLSKAAFFHSKYLAIYSNNMPASIRD 251

Query: 242 YVTKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAE 301
           YV+KNRNCEDIAM+FLVAN SGTPPIWVKGKIFEIGSSGISS+GGH+ERRS+CVNRFV E
Sbjct: 252 YVSKNRNCEDIAMSFLVANVSGTPPIWVKGKIFEIGSSGISSLGGHNERRSECVNRFVEE 311

Query: 302 YGGMPLLSSTVKAVDSRNIWFW 320
           YGGMPLL STVKAVDSR  WFW
Sbjct: 312 YGGMPLLPSTVKAVDSRYTWFW 331

BLAST of Sgr028804 vs. ExPASy TrEMBL
Match: A0A7J6GTL7 (Glyco_transf_64 domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_016373 PE=3 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 3.8e-142
Identity = 234/319 (73.35%), Positives = 279/319 (87.46%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 60
           +QR RQ A++TVGS+KIKLLLCCCIGF+++  A R    +GWTG   ++   S SR  Y+
Sbjct: 11  VQRFRQVAISTVGSLKIKLLLCCCIGFSLIAVASRAPAFLGWTGSSISMPPISDSRKGYS 70

Query: 61  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 120
           IV+NTWKRYDLLK+SISHYTTCSG++SIHIVWSEPDPP++SL  FL    E+N+  GR  
Sbjct: 71  IVMNTWKRYDLLKQSISHYTTCSGLDSIHIVWSEPDPPSDSLKKFLNHIVESNAKDGRLV 130

Query: 121 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 180
           EL+F+IN+EDSLNNRFKEI  LKT+AIFS+DDDVIFPCSSVEFAFSVWQSAP+TMVG+VP
Sbjct: 131 ELKFDINKEDSLNNRFKEITDLKTDAIFSIDDDVIFPCSSVEFAFSVWQSAPDTMVGYVP 190

Query: 181 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 240
           R+HW+D +K  +G Y YGGWWSVWWTG+YSMVLSKAAFFH KYL +YTN MPASIR+Y+T
Sbjct: 191 RIHWVDSTKDNLGSYIYGGWWSVWWTGSYSMVLSKAAFFHKKYLSLYTNEMPASIREYIT 250

Query: 241 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 300
           KNRNCEDIAM+FLVANA+G PPIWVKGKIFEIGS+GISS+GGHSE+R++CVNRFVAE+GG
Sbjct: 251 KNRNCEDIAMSFLVANATGAPPIWVKGKIFEIGSTGISSLGGHSEKRTECVNRFVAEFGG 310

Query: 301 MPLLSSTVKAVDSRNIWFW 320
           MPL+S++VKAVDSR IWFW
Sbjct: 311 MPLVSTSVKAVDSRKIWFW 329

BLAST of Sgr028804 vs. ExPASy TrEMBL
Match: A0A803P0W9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 3.8e-142
Identity = 234/319 (73.35%), Positives = 279/319 (87.46%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 60
           +QR RQ A++TVGS+KIKLLLCCCIGF+++  A R    +GWTG   ++   S SR  Y+
Sbjct: 11  VQRFRQVAISTVGSLKIKLLLCCCIGFSLIAVASRAPAFLGWTGSSISMPPISDSRKGYS 70

Query: 61  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 120
           IV+NTWKRYDLLK+SISHYTTCSG++SIHIVWSEPDPP++SL  FL    E+N+  GR  
Sbjct: 71  IVMNTWKRYDLLKQSISHYTTCSGLDSIHIVWSEPDPPSDSLKKFLNHIVESNAKDGRLV 130

Query: 121 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 180
           EL+F+IN+EDSLNNRFKEI  LKT+AIFS+DDDVIFPCSSVEFAFSVWQSAP+TMVG+VP
Sbjct: 131 ELKFDINKEDSLNNRFKEITDLKTDAIFSIDDDVIFPCSSVEFAFSVWQSAPDTMVGYVP 190

Query: 181 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 240
           R+HW+D +K  +G Y YGGWWSVWWTG+YSMVLSKAAFFH KYL +YTN MPASIR+Y+T
Sbjct: 191 RIHWVDSTKDNLGSYIYGGWWSVWWTGSYSMVLSKAAFFHKKYLSLYTNEMPASIREYIT 250

Query: 241 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 300
           KNRNCEDIAM+FLVANA+G PPIWVKGKIFEIGS+GISS+GGHSE+R++CVNRFVAE+GG
Sbjct: 251 KNRNCEDIAMSFLVANATGAPPIWVKGKIFEIGSTGISSLGGHSEKRTECVNRFVAEFGG 310

Query: 301 MPLLSSTVKAVDSRNIWFW 320
           MPL+S++VKAVDSR IWFW
Sbjct: 311 MPLVSTSVKAVDSRKIWFW 329

BLAST of Sgr028804 vs. ExPASy TrEMBL
Match: A0A2P5FHX5 (Exostosin, C-terminal OS=Trema orientale OX=63057 GN=TorRG33x02_068340 PE=3 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 3.2e-141
Identity = 233/319 (73.04%), Positives = 275/319 (86.21%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYA 60
           +QR RQ A++TVGS+KIKL+LCCCIG  ++  A R    +GWTG   ++   S SR  YA
Sbjct: 11  VQRFRQVAISTVGSLKIKLVLCCCIGLTLIAVASRAPGFLGWTGPSVSMPPISDSRKGYA 70

Query: 61  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 120
           IV+NTWKRYDLLK+SISHYTTCSG++SIHIVWSEPDPP++S+   L    E+N+  GR+ 
Sbjct: 71  IVMNTWKRYDLLKQSISHYTTCSGLDSIHIVWSEPDPPSDSVKKVLNHILESNARDGRQV 130

Query: 121 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 180
           EL+F+IN EDSLNNRFKEI  LKT+AIFS+DDDVIFPCSSVEFAF+VWQSAP+TMVGFVP
Sbjct: 131 ELKFDINTEDSLNNRFKEINDLKTDAIFSIDDDVIFPCSSVEFAFNVWQSAPDTMVGFVP 190

Query: 181 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 240
           R+HW+D +K  MG Y YGGWWSVWWTG+YSMVLSKAAFFH KYL +YTN MPASIR+Y+T
Sbjct: 191 RIHWVDSTKNNMGRYIYGGWWSVWWTGSYSMVLSKAAFFHKKYLSLYTNEMPASIREYIT 250

Query: 241 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 300
           KNRNCEDIAM+FLVANA+G PPIWVKGKIFEIGS+GISS+GGHSERR++CVNRFV EYGG
Sbjct: 251 KNRNCEDIAMSFLVANATGAPPIWVKGKIFEIGSTGISSLGGHSERRTECVNRFVTEYGG 310

Query: 301 MPLLSSTVKAVDSRNIWFW 320
           MPL+S++VKAVDSR IWFW
Sbjct: 311 MPLVSTSVKAVDSRKIWFW 329

BLAST of Sgr028804 vs. ExPASy TrEMBL
Match: A0A6J1JG04 (glycosyltransferase family 64 protein C4 OS=Cucurbita maxima OX=3661 GN=LOC111484781 PE=3 SV=1)

HSP 1 Score: 508.4 bits (1308), Expect = 2.1e-140
Identity = 246/321 (76.64%), Positives = 278/321 (86.60%), Query Frame = 0

Query: 1   MQRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTG-DRATVDRYSTSRNRY 60
           +QRLRQ AVT    IKIK LLCCCI  A VLFA R S+LMGWT  D AT  RYST R RY
Sbjct: 12  IQRLRQIAVT----IKIKHLLCCCIVLAFVLFATRASNLMGWTSDDGATALRYSTPRKRY 71

Query: 61  AIVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGRE 120
           AI++NTWKR+DLLK+S+SHYTTC GVESIHIVWSEPDPP +SLV FLQ+T + NS + RE
Sbjct: 72  AILMNTWKRHDLLKQSVSHYTTCMGVESIHIVWSEPDPPPDSLVAFLQRTVKENSRNDRE 131

Query: 121 AELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFV 180
            +LRFE+NEEDSLNNRFKEIK L T+AIFSVDDDVIF CS++EFAF+VWQSA ETMVGFV
Sbjct: 132 IKLRFEMNEEDSLNNRFKEIKDLNTDAIFSVDDDVIFACSTLEFAFTVWQSASETMVGFV 191

Query: 181 PRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYV 240
           PRMHW+D++K  MG Y YGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYT+HMPASIR+YV
Sbjct: 192 PRMHWIDQAKEEMGRYIYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTHHMPASIRNYV 251

Query: 241 TKNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYG 300
           TKNRNCEDIAM+F+VAN SG PP+WVKG+I+EIGS GISS+GGHSERR +CVN FV EYG
Sbjct: 252 TKNRNCEDIAMSFVVANVSGAPPVWVKGRIYEIGSGGISSLGGHSERRGKCVNTFVEEYG 311

Query: 301 G-MPLLSSTVKAVDSRNIWFW 320
           G MPL+ STVKAVD R++WFW
Sbjct: 312 GMMPLVPSTVKAVDGRHLWFW 328

BLAST of Sgr028804 vs. TAIR 10
Match: AT3G55830.1 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 432.2 bits (1110), Expect = 3.7e-121
Identity = 202/319 (63.32%), Positives = 251/319 (78.68%), Query Frame = 0

Query: 2   QRLRQFAVTTVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDR-ATVDRYSTSRNRYA 61
           Q+LR+F    V +   K LL CCI F +V    R S    W     A  DR S SR  Y 
Sbjct: 22  QKLRKF----VTARSTKFLLFCCIAFVLVTIVCRSS--RPWVNSSIAVADRISGSRKGYT 81

Query: 62  IVINTWKRYDLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREA 121
           +++NTWKRYDLLK+S+SHY +CS ++SIHIVWSEP+PP+ESL  +L    +  +  G E 
Sbjct: 82  LLMNTWKRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLKEYLHNVLKKKTRDGHEV 141

Query: 122 ELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVP 181
           ELRF+IN+EDSLNNRFKEIK LKT+A+FS+DDD+IFPC +V+FAF+VW+SAP+TMVGFVP
Sbjct: 142 ELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDFAFNVWESAPDTMVGFVP 201

Query: 182 RMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYTNHMPASIRDYVT 241
           R+HW ++S      Y Y GWWSVWW+GTYSMVLSKAAFFH KYL +YTN MPASIR++ T
Sbjct: 202 RVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKYLSLYTNSMPASIREFTT 261

Query: 242 KNRNCEDIAMAFLVANASGTPPIWVKGKIFEIGSSGISSIGGHSERRSQCVNRFVAEYGG 301
           KNRNCEDIAM+FL+ANA+  P IWVKGKI+EIGS+GISSIGGH+E+R+ CVNRFVAE+G 
Sbjct: 262 KNRNCEDIAMSFLIANATNAPAIWVKGKIYEIGSTGISSIGGHTEKRTHCVNRFVAEFGK 321

Query: 302 MPLLSSTVKAVDSRNIWFW 320
           MPL+ +++KAVDSRN+WFW
Sbjct: 322 MPLVYTSMKAVDSRNLWFW 334

BLAST of Sgr028804 vs. TAIR 10
Match: AT5G04500.1 (glycosyltransferase family protein 47 )

HSP 1 Score: 108.6 bits (270), Expect = 9.3e-24
Identity = 87/304 (28.62%), Positives = 142/304 (46.71%), Query Frame = 0

Query: 11  TVGSIKIKLLLCCCIGFAVVLFAGRGSDLMGWTGDRATVDRYSTSRNRYAIVINTWKRYD 70
           T+G I I  LL  C+G   + + G G+           V+ Y    +     + T   YD
Sbjct: 480 TLGVIVILGLLLTCVGVRYI-YGGSGA-----------VEPYPFKGHLSQFTLAT-MTYD 539

Query: 71  L----LKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANSGHGREAELRFEI 130
                LK  +  Y+ C  V+ I ++W++  PP           +E +S       +R  +
Sbjct: 540 ARLWNLKMYVKRYSRCPSVKEIVVIWNKGPPP---------DLSELDSA----VPVRIRV 599

Query: 131 NEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPETMVGFVPRMHWLD 190
            +++SLNNRF+    +KT A+  +DDD++ PC  +E  F VW+  PE +VGF PR  ++D
Sbjct: 600 QKQNSLNNRFEIDPLIKTRAVLELDDDIMMPCDDIEKGFRVWREHPERLVGFYPR--FVD 659

Query: 191 RSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKY-LGIYTNHMPASIRDYVTKNRNC 250
           ++        Y           Y+M+L+ AAF   ++   +Y +      R +V +  NC
Sbjct: 660 QT------MTYSAEKFARSHKGYNMILTGAAFMDVRFAFDMYQSDKAKLGRVFVDEQFNC 719

Query: 251 EDIAMAFLVANASGTPPI--WVKGKIFEIGSSGISSI------GGHSERRSQCVNRFVAE 302
           EDI + FL ANASG+     +V+  +  I +S  S +        H  +RS+C+ RF   
Sbjct: 720 EDILLNFLYANASGSGKAVEYVRPSLVTIDTSKFSGVAISGNTNQHYRKRSKCLRRFSDL 749

BLAST of Sgr028804 vs. TAIR 10
Match: AT1G80290.1 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 100.9 bits (250), Expect = 1.9e-21
Identity = 83/278 (29.86%), Positives = 122/278 (43.88%), Query Frame = 0

Query: 57  NRYAIVINTWKRY--DLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANS 116
           ++  ++IN +  Y   LL+  ++ Y++ S V SI ++W  P  P + L    Q   + + 
Sbjct: 46  DQITVLINGYSEYRIPLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSP 105

Query: 117 GHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPET 176
           G    A +        SLN RF     + T A+   DDDV     S+EFAFSVW+S P+ 
Sbjct: 106 G---SASISLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDR 165

Query: 177 MVGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYT---NHM 236
           +VG   R H  D         +   W        YS+VL+K       YL  Y+      
Sbjct: 166 LVGTFVRSHGFD--------LQGKEWIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVE 225

Query: 237 PASIRDYVTKNRNCEDIAMAFLVANASGTPPI---------W-------VKGKIFEIGSS 296
              +R  V + RNCEDI M F+ A+     PI         W       V+ ++ ++G S
Sbjct: 226 MEEMRMIVDQMRNCEDILMNFVAADRLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLS 285

Query: 297 GISSIGGHSERRSQCVNRFVAEYGGMPLLSSTVKAVDS 314
             S    H +RR  C+  F    G MPL+ S  K V+S
Sbjct: 286 --SRRVEHRKRRGNCIREFHRVMGKMPLMYSYGKVVNS 310

BLAST of Sgr028804 vs. TAIR 10
Match: AT1G80290.2 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 100.9 bits (250), Expect = 1.9e-21
Identity = 83/278 (29.86%), Positives = 122/278 (43.88%), Query Frame = 0

Query: 57  NRYAIVINTWKRY--DLLKRSISHYTTCSGVESIHIVWSEPDPPTESLVTFLQQTAEANS 116
           ++  ++IN +  Y   LL+  ++ Y++ S V SI ++W  P  P + L    Q   + + 
Sbjct: 54  DQITVLINGYSEYRIPLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSP 113

Query: 117 GHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFPCSSVEFAFSVWQSAPET 176
           G    A +        SLN RF     + T A+   DDDV     S+EFAFSVW+S P+ 
Sbjct: 114 G---SASISLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDR 173

Query: 177 MVGFVPRMHWLDRSKGGMGGYRYGGWWSVWWTGTYSMVLSKAAFFHSKYLGIYT---NHM 236
           +VG   R H  D         +   W        YS+VL+K       YL  Y+      
Sbjct: 174 LVGTFVRSHGFD--------LQGKEWIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVE 233

Query: 237 PASIRDYVTKNRNCEDIAMAFLVANASGTPPI---------W-------VKGKIFEIGSS 296
              +R  V + RNCEDI M F+ A+     PI         W       V+ ++ ++G S
Sbjct: 234 MEEMRMIVDQMRNCEDILMNFVAADRLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLS 293

Query: 297 GISSIGGHSERRSQCVNRFVAEYGGMPLLSSTVKAVDS 314
             S    H +RR  C+  F    G MPL+ S  K V+S
Sbjct: 294 --SRRVEHRKRRGNCIREFHRVMGKMPLMYSYGKVVNS 318

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity  Description
XP_038893187.1  5.6e-148  81.31     glycosyltransferase family 64 protein C4 [Benincasa hispida]
XP_022156919.1  2.0e-145  80.43     glycosyltransferase family 64 protein C4 [Momordica charantia]
XP_030490104.1  7.9e-142  73.35     glycosyltransferase family 64 protein C4 [Cannabis sativa] >KAF4386121.1 hypothe...
KAG7031573.1    5.1e-141  77.26     Glycosyltransferase family 64 protein C4 [Cucurbita argyrosperma subsp. argyrosp...
PON97367.1      6.7e-141  73.04     Exostosin, C-terminal [Trema orientale]

vs. ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q9LY62          5.2e-120  63.32     Glycosylinositol phosphorylceramide mannosyl transferase 1 OS=Arabidopsis thalia...
P70428          7.2e-29   32.81     Exostosin-2 OS=Mus musculus OX=10090 GN=Ext2 PE=1 SV=2
O77783          1.6e-28   32.41     Exostosin-2 OS=Bos taurus OX=9913 GN=EXT2 PE=1 SV=1
Q93063          1.6e-28   32.41     Exostosin-2 OS=Homo sapiens OX=9606 GN=EXT2 PE=1 SV=1
Q9WVL6          3.6e-28   33.21     Exostosin-like 3 OS=Mus musculus OX=10090 GN=Extl3 PE=1 SV=2

vs. ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1DV09      9.7e-146  80.43     glycosyltransferase family 64 protein C4 OS=Momordica charantia OX=3673 GN=LOC11...
A0A7J6GTL7      3.8e-142  73.35     Glyco_transf_64 domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_01...
A0A803P0W9      3.8e-142  73.35     Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A2P5FHX5      3.2e-141  73.04     Exostosin, C-terminal OS=Trema orientale OX=63057 GN=TorRG33x02_068340 PE=3 SV=1
A0A6J1JG04      2.1e-140  76.64     glycosyltransferase family 64 protein C4 OS=Cucurbita maxima OX=3661 GN=LOC11148...

vs. TAIR 10
Match Name      E-value   Identity  Description
AT3G55830.1     3.7e-121  63.32     Nucleotide-diphospho-sugar transferases superfamily protein
AT5G04500.1     9.3e-24   28.62     glycosyltransferase family protein 47
AT1G80290.1     1.9e-21   29.86     Nucleotide-diphospho-sugar transferases superfamily protein
AT1G80290.2     1.9e-21   29.86     Nucleotide-diphospho-sugar transferases superfamily protein
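
The summary tables mix one close hit with several distant ones, so a simple threshold on E-value and identity separates them. The sketch below applies this to the Swiss-Prot hits listed above; the cut-offs (E-value at most 1e-50, identity at least 50%) are arbitrary illustrative assumptions, and `Hit` and `filter_hits` are hypothetical names.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    match: str
    evalue: float
    identity: float  # percent identity

# Swiss-Prot hits as listed in the summary table above.
swissprot_hits = [
    Hit("Q9LY62", 5.2e-120, 63.32),
    Hit("P70428", 7.2e-29, 32.81),
    Hit("O77783", 1.6e-28, 32.41),
    Hit("Q93063", 1.6e-28, 32.41),
    Hit("Q9WVL6", 3.6e-28, 33.21),
]

def filter_hits(hits, max_evalue=1e-50, min_identity=50.0):
    """Keep hits that pass both thresholds, best (smallest) E-value first."""
    kept = [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]
    return sorted(kept, key=lambda h: h.evalue)

print([h.match for h in filter_hits(swissprot_hits)])  # ['Q9LY62']
```

With these thresholds only the Arabidopsis thaliana entry Q9LY62 remains; the exostosin hits fall below both cut-offs.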
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                          Source       Source Term      Source Description                                            Alignment
IPR029044  Nucleotide-diphospho-sugar transferases  GENE3D       3.90.550.10      Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A  coord: 55..315; e-value: 7.0E-87; score: 292.7
IPR029044  Nucleotide-diphospho-sugar transferases  SUPERFAMILY  53448            Nucleotide-diphospho-sugar transferases                       coord: 57..309
IPR015338  Glycosyl transferase 64 domain           PFAM         PF09258          Glyco_transf_64                                               coord: 59..310; e-value: 2.2E-77; score: 259.7
IPR004263  Exostosin-like                           PANTHER      PTHR11062        EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED        coord: 2..316
None       No IPR available                         PANTHER      PTHR11062:SF286  GLYCOSYLTRANSFERASE FAMILY 64 PROTEIN C4                      coord: 2..316
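
The "coord" values give the protein positions covered by each signature. A short sketch, assuming the coordinates are 1-based and inclusive (not stated on this page), computes the span of each match and checks how the matches nest; `parse_coord`, `span_length`, and `contains` are illustrative helper names.

```python
import re

def parse_coord(alignment):
    """Extract (start, end) from a string such as 'coord: 59..310'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", alignment)
    if m is None:
        raise ValueError(f"no coordinates in {alignment!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(coord):
    """Number of residues covered, assuming inclusive coordinates."""
    start, end = coord
    return end - start + 1

def contains(outer, inner):
    """True if the inner interval lies entirely within the outer one."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

pfam = parse_coord("coord: 59..310")     # PF09258, Glyco_transf_64
gene3d = parse_coord("coord: 55..315")   # 3.90.550.10
panther = parse_coord("coord: 2..316")   # PTHR11062

print(span_length(pfam), span_length(gene3d), span_length(panther))  # 252 261 315
print(contains(gene3d, pfam), contains(panther, gene3d))             # True True
```

Under that assumption the Pfam domain lies inside the Gene3D match, which in turn lies inside the PANTHER family match.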

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr028804.1   Sgr028804.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006486 protein glycosylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity