HG10004259 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004259
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Glycosyltransferase family 92 protein
Location: Chr08: 15319076 .. 15320794 (-)
RNA-Seq Expression: HG10004259
Synteny: HG10004259
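
As a quick sanity check on the Location field, the feature length can be derived from its coordinates. The following is a minimal Python sketch, not part of the database: the function name `parse_location` and the regular expression are illustrative assumptions, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr08: 15319076 .. 15320794 (-)'.

    Returns (chromosome, start, end, strand). Hypothetical helper for
    illustration only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr08: 15319076 .. 15320794 (-)")
span = end - start + 1  # assumes 1-based, inclusive coordinates
print(chrom, strand, span, span % 3 == 0)
```

Under these assumptions the span is 1719 bp, a multiple of three (573 codons), which would be consistent with an intronless gene encoding 572 residues plus a stop codon.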
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGCGGAAGCCGCGCTCCACCGGCCTCCTCTTCTCCCTCGCACTCTTTCTCCTCTTCGCCTTTCAATTGTTCCGTAAAGCTTTCTTCTACGTCGCCGATTTCCCTTTTCCTTCTTCAGCTAATCTTCTCCCTTCTCGCTCTGCTAATTCCGTAGCTCATTACGCTCTCTATGAAACTAATTTCAAATTCCCACACCAACTCACTCACCGGACTCGCCACGTCTCTTCGATTCTCGATCCTATTCCGACGGTTTCGCTTCTTCTACCCGATTGGGAGGTTCTTCTAATCTCTTCAATCGACACGCCGCTCTCTTCGCCAGATTCTCTCCGGGATTTTCTCTGTTTGTTTCAGAACAATGCTACTTCATCGGCTAATTTCTCGGGGATTTTGGATTCCACTGGTCGGCTTACGTTTAAGTGCCTTATGCCGGATTCCGTCCGTCGTCTCCGCCCGTTTTTTCAACCGGCGCTTACCAAGTCGCCGGAAAAGGAGTTTTCGTCTTCTTCTTTTTCTTCTTCTTCTTCTCTAGCGCCGGAATTGATGAGGTGGACGTTTTTCGCTTATGAAGCTTTCGAAACAGAGGACGACATCCTTCTGTTCGTCAAAGGCGTGAACTATCGTCAAGGGAATAACCGGGCACCGACTGATTTGAACTGCGTGTTTGGCGATGGTGATGACGCCGTTAGAACTGCCGTTACCAGCTCTGCGCAGGAGGTTTTTCGATGCGGTCATCCGAATTTAACGACAAGAGAACATCACGACAAAATCAAAATCACTCTCGAAATTCTCGACGCGAAAGGGAAAAGCGTCCTCGTGCCGTCCGTCGCTTATTACTCGCCGCGCCGCGGCGGTGGTGGCTTAGTGGAGGCTCAATCGATGATTTGTGCTTGTACGATGGTGTATAACGTCGGAAAGTTCTTGAAGGAGTGGGTGATGTACTACTCGAGAATCGGCGTCGAGAAGTTCATTCTTTACGACAACGGCAGCGATGACGAAATTTCAGCCGTCGTGAAGGAATTGAAACTAGAAGGATACAATATCGAGATCGTGTTCTGGATTTGGCCTAAAACGCAGGAGGCTGGATTTTCTCACAGCGTTGAATATTCGAAGAAATCGTGCAAATGGATGATGTTCTTCGACGTCGATGAATTCATCTTCTCTCCATTGTGGTTGAATTCATTGAAACCGTCGAAGAACATGGTGAAATCTCTGCTTCCTGCGGAGAATAACGAAATCGGAATGATCACGGTCATGTGTAACGATTACGGGCCGTCGGATCGTATCTCGCATCCGGCGGAGGGAGTAACACAGGGCTATAATTGCAGGAGAAAATTAGAGGAGAGACATAAATCGATTGTGTTATTGGAGGCTGTGGATCCATCGCTGTTGAATGTGATTCACCATTTTAAATTGAGGAAGGAATTTCGGATGAGGCAGATGACGCCGGAAGAGGCGGTGGTGAATCATTACAAGTACCAGGCGTGGCCGGAGTTTCGGACGAAGTTCCGGCGGCGCGTGTCGGCGTACGTGGTGGATTGGAAGGATTCAGCGAATCCAACATCGAAGGACAGGGCGCCGGGGCTGGGAAACGCCGCCGTGGAGCCGCCGGAGTGGCCGAGGAAGTTCTGTGAGGTTAGAGATGATCGGTTGAGATTGTTGACGCAGAGATGGTTCGGGTTTCAGACGGCGGACAGGTATAGAATGGCGTGGCAGTGA

mRNA sequence

ATGCGGCGGAAGCCGCGCTCCACCGGCCTCCTCTTCTCCCTCGCACTCTTTCTCCTCTTCGCCTTTCAATTGTTCCGTAAAGCTTTCTTCTACGTCGCCGATTTCCCTTTTCCTTCTTCAGCTAATCTTCTCCCTTCTCGCTCTGCTAATTCCGTAGCTCATTACGCTCTCTATGAAACTAATTTCAAATTCCCACACCAACTCACTCACCGGACTCGCCACGTCTCTTCGATTCTCGATCCTATTCCGACGGTTTCGCTTCTTCTACCCGATTGGGAGGTTCTTCTAATCTCTTCAATCGACACGCCGCTCTCTTCGCCAGATTCTCTCCGGGATTTTCTCTGTTTGTTTCAGAACAATGCTACTTCATCGGCTAATTTCTCGGGGATTTTGGATTCCACTGGTCGGCTTACGTTTAAGTGCCTTATGCCGGATTCCGTCCGTCGTCTCCGCCCGTTTTTTCAACCGGCGCTTACCAAGTCGCCGGAAAAGGAGTTTTCGTCTTCTTCTTTTTCTTCTTCTTCTTCTCTAGCGCCGGAATTGATGAGGTGGACGTTTTTCGCTTATGAAGCTTTCGAAACAGAGGACGACATCCTTCTGTTCGTCAAAGGCGTGAACTATCGTCAAGGGAATAACCGGGCACCGACTGATTTGAACTGCGTGTTTGGCGATGGTGATGACGCCGTTAGAACTGCCGTTACCAGCTCTGCGCAGGAGGTTTTTCGATGCGGTCATCCGAATTTAACGACAAGAGAACATCACGACAAAATCAAAATCACTCTCGAAATTCTCGACGCGAAAGGGAAAAGCGTCCTCGTGCCGTCCGTCGCTTATTACTCGCCGCGCCGCGGCGGTGGTGGCTTAGTGGAGGCTCAATCGATGATTTGTGCTTGTACGATGGTGTATAACGTCGGAAAGTTCTTGAAGGAGTGGGTGATGTACTACTCGAGAATCGGCGTCGAGAAGTTCATTCTTTACGACAACGGCAGCGATGACGAAATTTCAGCCGTCGTGAAGGAATTGAAACTAGAAGGATACAATATCGAGATCGTGTTCTGGATTTGGCCTAAAACGCAGGAGGCTGGATTTTCTCACAGCGTTGAATATTCGAAGAAATCGTGCAAATGGATGATGTTCTTCGACGTCGATGAATTCATCTTCTCTCCATTGTGGTTGAATTCATTGAAACCGTCGAAGAACATGGTGAAATCTCTGCTTCCTGCGGAGAATAACGAAATCGGAATGATCACGGTCATGTGTAACGATTACGGGCCGTCGGATCGTATCTCGCATCCGGCGGAGGGAGTAACACAGGGCTATAATTGCAGGAGAAAATTAGAGGAGAGACATAAATCGATTGTGTTATTGGAGGCTGTGGATCCATCGCTGTTGAATGTGATTCACCATTTTAAATTGAGGAAGGAATTTCGGATGAGGCAGATGACGCCGGAAGAGGCGGTGGTGAATCATTACAAGTACCAGGCGTGGCCGGAGTTTCGGACGAAGTTCCGGCGGCGCGTGTCGGCGTACGTGGTGGATTGGAAGGATTCAGCGAATCCAACATCGAAGGACAGGGCGCCGGGGCTGGGAAACGCCGCCGTGGAGCCGCCGGAGTGGCCGAGGAAGTTCTGTGAGGTTAGAGATGATCGGTTGAGATTGTTGACGCAGAGATGGTTCGGGTTTCAGACGGCGGACAGGTATAGAATGGCGTGGCAGTGA

Coding sequence (CDS)

ATGCGGCGGAAGCCGCGCTCCACCGGCCTCCTCTTCTCCCTCGCACTCTTTCTCCTCTTCGCCTTTCAATTGTTCCGTAAAGCTTTCTTCTACGTCGCCGATTTCCCTTTTCCTTCTTCAGCTAATCTTCTCCCTTCTCGCTCTGCTAATTCCGTAGCTCATTACGCTCTCTATGAAACTAATTTCAAATTCCCACACCAACTCACTCACCGGACTCGCCACGTCTCTTCGATTCTCGATCCTATTCCGACGGTTTCGCTTCTTCTACCCGATTGGGAGGTTCTTCTAATCTCTTCAATCGACACGCCGCTCTCTTCGCCAGATTCTCTCCGGGATTTTCTCTGTTTGTTTCAGAACAATGCTACTTCATCGGCTAATTTCTCGGGGATTTTGGATTCCACTGGTCGGCTTACGTTTAAGTGCCTTATGCCGGATTCCGTCCGTCGTCTCCGCCCGTTTTTTCAACCGGCGCTTACCAAGTCGCCGGAAAAGGAGTTTTCGTCTTCTTCTTTTTCTTCTTCTTCTTCTCTAGCGCCGGAATTGATGAGGTGGACGTTTTTCGCTTATGAAGCTTTCGAAACAGAGGACGACATCCTTCTGTTCGTCAAAGGCGTGAACTATCGTCAAGGGAATAACCGGGCACCGACTGATTTGAACTGCGTGTTTGGCGATGGTGATGACGCCGTTAGAACTGCCGTTACCAGCTCTGCGCAGGAGGTTTTTCGATGCGGTCATCCGAATTTAACGACAAGAGAACATCACGACAAAATCAAAATCACTCTCGAAATTCTCGACGCGAAAGGGAAAAGCGTCCTCGTGCCGTCCGTCGCTTATTACTCGCCGCGCCGCGGCGGTGGTGGCTTAGTGGAGGCTCAATCGATGATTTGTGCTTGTACGATGGTGTATAACGTCGGAAAGTTCTTGAAGGAGTGGGTGATGTACTACTCGAGAATCGGCGTCGAGAAGTTCATTCTTTACGACAACGGCAGCGATGACGAAATTTCAGCCGTCGTGAAGGAATTGAAACTAGAAGGATACAATATCGAGATCGTGTTCTGGATTTGGCCTAAAACGCAGGAGGCTGGATTTTCTCACAGCGTTGAATATTCGAAGAAATCGTGCAAATGGATGATGTTCTTCGACGTCGATGAATTCATCTTCTCTCCATTGTGGTTGAATTCATTGAAACCGTCGAAGAACATGGTGAAATCTCTGCTTCCTGCGGAGAATAACGAAATCGGAATGATCACGGTCATGTGTAACGATTACGGGCCGTCGGATCGTATCTCGCATCCGGCGGAGGGAGTAACACAGGGCTATAATTGCAGGAGAAAATTAGAGGAGAGACATAAATCGATTGTGTTATTGGAGGCTGTGGATCCATCGCTGTTGAATGTGATTCACCATTTTAAATTGAGGAAGGAATTTCGGATGAGGCAGATGACGCCGGAAGAGGCGGTGGTGAATCATTACAAGTACCAGGCGTGGCCGGAGTTTCGGACGAAGTTCCGGCGGCGCGTGTCGGCGTACGTGGTGGATTGGAAGGATTCAGCGAATCCAACATCGAAGGACAGGGCGCCGGGGCTGGGAAACGCCGCCGTGGAGCCGCCGGAGTGGCCGAGGAAGTTCTGTGAGGTTAGAGATGATCGGTTGAGATTGTTGACGCAGAGATGGTTCGGGTTTCAGACGGCGGACAGGTATAGAATGGCGTGGCAGTGA

Protein sequence

MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYETNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQNNATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQEVFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ
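
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The sketch below is a plain-Python illustration under stated assumptions: `translate` and `CODON_TABLE` are hypothetical names, the standard codon table is assumed to apply, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
import itertools

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCGGCGGAAGCCGCGCTCCACCGGCCTC"  # first 30 nt of the CDS
cds_end = "ATGGCGTGGCAGTGA"                   # last 15 nt of the CDS, in frame
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MRRKPRSTGL` and `MAWQ`, which match the start and end of the protein sequence listed above. Passing the full CDS to the same function would be the natural next step.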
Homology
BLAST of HG10004259 vs. NCBI nr
Match: XP_038884062.1 (glycosyltransferase family 92 protein At1g27200 [Benincasa hispida])

HSP 1 Score: 1043.1 bits (2696), Expect = 8.8e-301
Identity = 514/574 (89.55%), Positives = 534/574 (93.03%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYET 60
           MRRKPRSTGLLFS+A+FLLFAFQ  RKAFF  AD PFPSS NLLPSRSANS AHYAL+ET
Sbjct: 1   MRRKPRSTGLLFSVAVFLLFAFQFSRKAFFNGADLPFPSSYNLLPSRSANSEAHYALHET 60

Query: 61  NFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQNN 120
           N  FPHQLTHRTRHVSSILDP+PTVSLLLPDWEVLLISSIDTPL+SPDSLRDFLCLFQNN
Sbjct: 61  NIDFPHQLTHRTRHVSSILDPVPTVSLLLPDWEVLLISSIDTPLASPDSLRDFLCLFQNN 120

Query: 121 ATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPE 180
           ATSSANFSG+LD TGRLTF+C MP+SVRRLRPFFQP L KSP+KEFSS    SSSSLAPE
Sbjct: 121 ATSSANFSGVLDFTGRLTFRCFMPESVRRLRPFFQPLLIKSPDKEFSS---YSSSSLAPE 180

Query: 181 LMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQEV 240
           LMRWTFFAYEAFETEDD++LFVKGVN+RQGNNR PTDLNCVFGDGD+AVRTAVTSS QEV
Sbjct: 181 LMRWTFFAYEAFETEDDVVLFVKGVNHRQGNNRPPTDLNCVFGDGDNAVRTAVTSSVQEV 240

Query: 241 FRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRR--GGGGLVEAQSMICAC 300
           FRC HPNLTTRE HDK KITLEILD KGKSV+VPSVAYYSPRR  GGGG + AQSMICAC
Sbjct: 241 FRCRHPNLTTREDHDKFKITLEILD-KGKSVIVPSVAYYSPRRSDGGGGSLAAQSMICAC 300

Query: 301 TMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKT 360
           TMVYNVGKFLKEWVMYYSRIGV+KFILYDNGSDDEISAVVKELK EGYNIEIVFWIWPKT
Sbjct: 301 TMVYNVGKFLKEWVMYYSRIGVDKFILYDNGSDDEISAVVKELKQEGYNIEIVFWIWPKT 360

Query: 361 QEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITV 420
           QEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPSKNMVKSLLPAENN IGMITV
Sbjct: 361 QEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPSKNMVKSLLPAENNGIGMITV 420

Query: 421 MCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRM 480
           MCNDYGPSDRISHP EGVTQGYNCRRKLEERHKSIVLLEAVD SLLNVIHHFKL+ EFR 
Sbjct: 421 MCNDYGPSDRISHPTEGVTQGYNCRRKLEERHKSIVLLEAVDRSLLNVIHHFKLKTEFRS 480

Query: 481 RQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPE 540
           RQM PEEAVVNHYKYQAWPEFR KFRRRVSAYVVDWKDSANPTSKDRAPGLGN AVEPP+
Sbjct: 481 RQMRPEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKDSANPTSKDRAPGLGNTAVEPPD 540

Query: 541 WPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           WPRKFCEVRDDRLRLLTQRWFG QTAD YRMAWQ
Sbjct: 541 WPRKFCEVRDDRLRLLTQRWFGSQTADGYRMAWQ 570
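
The summary line of each HSP reports identities and positives as a fraction of the alignment length, so the percentages can be recomputed from the counts. The following Python sketch shows one way to parse such a line; the names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern is an assumption based on the line layout shown on this page rather than on any formal BLAST output specification.

```python
import re

# The "i" in "Positives" is optional because some BLAST front ends
# print the label as "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity and positive fractions from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": int(ident) / int(ident_len),
        "positives": int(pos) / int(pos_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 514/574 (89.55%), Positives = 534/574 (93.03%), Query Frame = 0"
)
print(round(100 * stats["identity"], 2), round(100 * stats["positives"], 2))
```

For the Benincasa hispida hit, the recomputed values agree with the reported 89.55% identity and 93.03% positives. Note that the denominator is the alignment length including gaps, not the length of the query protein.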

BLAST of HG10004259 vs. NCBI nr
Match: XP_004144291.1 (glycosyltransferase family 92 protein At1g27200 [Cucumis sativus] >KGN47528.1 hypothetical protein Csa_018985 [Cucumis sativus])

HSP 1 Score: 1013.1 bits (2618), Expect = 9.7e-292
Identity = 499/577 (86.48%), Positives = 530/577 (91.85%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPF-PSSANLLPSRSANSVAHYALYE 60
           MRRKPR TGL+FS+A FLLFAFQ  RK FF+ AD     SS+NLLPSRSANSV HYAL+E
Sbjct: 1   MRRKPRFTGLIFSVAFFLLFAFQFSRKVFFFGADLNISSSSSNLLPSRSANSVVHYALHE 60

Query: 61  TNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQN 120
           TN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDS RDFLCLFQN
Sbjct: 61  TNLDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSFRDFLCLFQN 120

Query: 121 NATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAP 180
           NATSSANFSG+LD TGR+TFKCLMP+SVRRLRPFFQP LTKSP+KEFSSS   SSSS AP
Sbjct: 121 NATSSANFSGVLDFTGRVTFKCLMPESVRRLRPFFQPLLTKSPDKEFSSS--LSSSSPAP 180

Query: 181 ELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQE 240
           ELMRWTFFAYEAFETE+D++LFVKGVN RQG+NR PTDLNCVFGDGDDA+RTAVTSS QE
Sbjct: 181 ELMRWTFFAYEAFETEEDVVLFVKGVNNRQGSNRQPTDLNCVFGDGDDAIRTAVTSSVQE 240

Query: 241 VFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG--GGGLV--EAQSMI 300
           VFRC HPNLTT E HDK KITLEILDA+GK++LVPSVAYYSPRR   GGGLV  EAQSMI
Sbjct: 241 VFRCRHPNLTTSEDHDKFKITLEILDARGKNILVPSVAYYSPRRSGDGGGLVETEAQSMI 300

Query: 301 CACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIW 360
           CACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGS+DEISAV+KELK EGYNIEIVFWIW
Sbjct: 301 CACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSEDEISAVLKELKQEGYNIEIVFWIW 360

Query: 361 PKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGM 420
           PKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPSKNM+ SLLP +N+ IGM
Sbjct: 361 PKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPSKNMLNSLLPTKNSGIGM 420

Query: 421 ITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKE 480
           +TVMCNDYGPSDRISHPAEGVTQGYNCRRK+EERHKSIVLLEAVD SLLNVIHHFKLRKE
Sbjct: 421 VTVMCNDYGPSDRISHPAEGVTQGYNCRRKVEERHKSIVLLEAVDRSLLNVIHHFKLRKE 480

Query: 481 FRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVE 540
           F+ RQM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWK+SANPTSKDRAPGLGN AVE
Sbjct: 481 FQSRQMRVEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKNSANPTSKDRAPGLGNTAVE 540

Query: 541 PPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           PPEWPRKFCEVRDDRLRLLTQRWFG++TAD YRMAWQ
Sbjct: 541 PPEWPRKFCEVRDDRLRLLTQRWFGYETADGYRMAWQ 575

BLAST of HG10004259 vs. NCBI nr
Match: XP_008464892.1 (PREDICTED: glycosyltransferase family 92 protein At1g27200 [Cucumis melo])

HSP 1 Score: 1008.1 bits (2605), Expect = 3.1e-290
Identity = 501/587 (85.35%), Positives = 528/587 (89.95%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPF-------PSSANLLPSRSANSVA 60
           MRRKPR TGLLFS+A F LF FQL RKAFF+ +D  F        SS+NLLPSRSANSV 
Sbjct: 1   MRRKPRFTGLLFSVAFFFLFVFQLSRKAFFFGSDLNFSSSSSSSSSSSNLLPSRSANSVV 60

Query: 61  HYALYETNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF 120
           HYAL+ETN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF
Sbjct: 61  HYALHETNPDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF 120

Query: 121 LCLFQNNATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSS 180
           LCLFQNNATSSANFSG+LD TGRLTFKCLMP+SVRRLRPF QP LTKSP+KEFSSS   S
Sbjct: 121 LCLFQNNATSSANFSGVLDFTGRLTFKCLMPESVRRLRPFLQPLLTKSPDKEFSSS--LS 180

Query: 181 SSSLAPELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAV 240
           SSS APELMRWTFFAYEAFETEDD++LFVKGVN RQG+NR PTDLNCVFGDGD AVRTAV
Sbjct: 181 SSSPAPELMRWTFFAYEAFETEDDVVLFVKGVNNRQGSNRQPTDLNCVFGDGDGAVRTAV 240

Query: 241 TSSAQEVFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG------GGG 300
           TSS QEVFRC HPNLTTR+ HDK K+TLEILDA+GK++LVPSVAYYSPRR       GGG
Sbjct: 241 TSSVQEVFRCRHPNLTTRDDHDKFKVTLEILDARGKNILVPSVAYYSPRRSGSGSGDGGG 300

Query: 301 LV--EAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEG 360
           LV  EAQSMICACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGSDDEISAV K LK EG
Sbjct: 301 LVKTEAQSMICACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSDDEISAVAKGLKQEG 360

Query: 361 YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSL 420
           YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPSKNM+KSL
Sbjct: 361 YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPSKNMLKSL 420

Query: 421 LPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLN 480
           LP +N+ IGMITVMCNDYGPSDRISHP EGVTQGYNCRRK+EERHKSIVLL+AVD SLLN
Sbjct: 421 LPTQNSGIGMITVMCNDYGPSDRISHPVEGVTQGYNCRRKIEERHKSIVLLDAVDRSLLN 480

Query: 481 VIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDR 540
           VIHHFKLR EF+ +QM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWKDSANPTSKDR
Sbjct: 481 VIHHFKLRNEFQSKQMRLEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKDSANPTSKDR 540

Query: 541 APGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           APGLGN AVEPPEWPRKFCEVRDDRLRLLTQRWFGF+TAD YRMAWQ
Sbjct: 541 APGLGNRAVEPPEWPRKFCEVRDDRLRLLTQRWFGFETADGYRMAWQ 585

BLAST of HG10004259 vs. NCBI nr
Match: KAA0038445.1 (glycosyltransferase family 92 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1005.4 bits (2598), Expect = 2.0e-289
Identity = 501/594 (84.34%), Positives = 528/594 (88.89%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSA--------------NLLPS 60
           MRRKPR TGLLFS+A F LF FQL RKAFF+ +D  F SS+              NLLPS
Sbjct: 1   MRRKPRFTGLLFSVAFFFLFVFQLSRKAFFFGSDLNFSSSSSSSSSSSSSSSSSFNLLPS 60

Query: 61  RSANSVAHYALYETNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS 120
           RSANSV HYAL+ETN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS
Sbjct: 61  RSANSVVHYALHETNPDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS 120

Query: 121 PDSLRDFLCLFQNNATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEF 180
           PDSLRDFLCLFQNNATSSANFSG+LD TGRLTFKCLMP+SVRRLRPF QP LTKSP+KEF
Sbjct: 121 PDSLRDFLCLFQNNATSSANFSGVLDFTGRLTFKCLMPESVRRLRPFLQPLLTKSPDKEF 180

Query: 181 SSSSFSSSSSLAPELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGD 240
           SSS   SSSS APELMRWTFFAYEAFETEDD++LFVKGVN RQG+NR PTDLNCVFGDGD
Sbjct: 181 SSS--LSSSSPAPELMRWTFFAYEAFETEDDVVLFVKGVNNRQGSNRQPTDLNCVFGDGD 240

Query: 241 DAVRTAVTSSAQEVFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG-- 300
            AVRTAVTSS QEVFRC HPNLTTR+ HDK K+TLEILDA+GK++LVPSVAYYSPRR   
Sbjct: 241 GAVRTAVTSSVQEVFRCRHPNLTTRDDHDKFKVTLEILDARGKNILVPSVAYYSPRRSGS 300

Query: 301 ----GGGLV--EAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVV 360
               GGGLV  EAQSMICACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGSDDEISAV 
Sbjct: 301 GSGDGGGLVKTEAQSMICACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSDDEISAVA 360

Query: 361 KELKLEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPS 420
           K LK EGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPS
Sbjct: 361 KGLKQEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPS 420

Query: 421 KNMVKSLLPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEA 480
           KNM+KSLLP +N+ IGMITVMCNDYGPSDRISHP EGVTQGYNCRRK+EERHKSIVLL+A
Sbjct: 421 KNMLKSLLPTQNSGIGMITVMCNDYGPSDRISHPVEGVTQGYNCRRKIEERHKSIVLLDA 480

Query: 481 VDPSLLNVIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSA 540
           VD SLLNVIHHFKLR EF+ +QM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWKDSA
Sbjct: 481 VDRSLLNVIHHFKLRNEFQSKQMRLEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKDSA 540

Query: 541 NPTSKDRAPGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           NPTSKDRAPGLGN AVEPPEWPRKFCEVRDDRLRLLTQRWFGF+TAD YRMAWQ
Sbjct: 541 NPTSKDRAPGLGNRAVEPPEWPRKFCEVRDDRLRLLTQRWFGFETADGYRMAWQ 592

BLAST of HG10004259 vs. NCBI nr
Match: XP_022999662.1 (glycosyltransferase family 92 protein At1g27200-like [Cucurbita maxima])

HSP 1 Score: 926.4 bits (2393), Expect = 1.2e-265
Identity = 457/572 (79.90%), Positives = 496/572 (86.71%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYET 60
           MRRKP   GLL S A+FL+F+FQ+ RKA F   D P  SS  LLPSRS NSVAHYA++E 
Sbjct: 1   MRRKPCFAGLLLSCAVFLIFSFQISRKAIFSGGDLPSLSSDKLLPSRSKNSVAHYAIHEA 60

Query: 61  NFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQNN 120
           N      L HR R +SS+L  IPT+SLLLPDWE+LLISSI TPLSSPDSLRDFLCLF NN
Sbjct: 61  N------LAHRNRQLSSVLGSIPTLSLLLPDWEILLISSIHTPLSSPDSLRDFLCLFHNN 120

Query: 121 ATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPE 180
           ATS ANFSG+LD TGR TFKC MP SVRRLRPFFQP LTKSP+ E S    SSSSS A E
Sbjct: 121 ATSPANFSGVLDFTGRATFKCRMPPSVRRLRPFFQPLLTKSPKNELS----SSSSSPAME 180

Query: 181 LMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQEV 240
           LMRWTF AYEA ETEDD++LFVKGVN+R+G NR P+DL CVFGDG DA+RTAVTSS QEV
Sbjct: 181 LMRWTFLAYEALETEDDVVLFVKGVNHRRGINRPPSDLKCVFGDGYDAIRTAVTSSEQEV 240

Query: 241 FRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQSMICACTM 300
           FRC HPNLTTR+ ++K+KITLEI DAKGKS+LVPSVAYYSPR GG   VEA+SMICACTM
Sbjct: 241 FRCRHPNLTTRDDYEKMKITLEIFDAKGKSILVPSVAYYSPRYGGDS-VEAKSMICACTM 300

Query: 301 VYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQE 360
           VYNVGKFLKEWV+YYSRIGVEKFILYDNGSDDEIS + KELKLEGY IEIVFWIWPKTQE
Sbjct: 301 VYNVGKFLKEWVIYYSRIGVEKFILYDNGSDDEISEIAKELKLEGYIIEIVFWIWPKTQE 360

Query: 361 AGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMC 420
           AGFSHS EYSKKSCKWMM  D+DEF+FSP WLNSL+PSKNM+KSL+PAE N IGMIT+MC
Sbjct: 361 AGFSHSAEYSKKSCKWMMIVDIDEFVFSPSWLNSLEPSKNMLKSLIPAEKNGIGMITIMC 420

Query: 421 NDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQ 480
           NDYGPSDRISHPAEGVTQGYNCR K EERHKSIVLLEAVDPSLLNVIHHF+LRKEFR R+
Sbjct: 421 NDYGPSDRISHPAEGVTQGYNCRIKAEERHKSIVLLEAVDPSLLNVIHHFRLRKEFRWRK 480

Query: 481 MTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWP 540
           M   EAVVNHYKYQAWPEF+ KFRRRVS YVVDWKDSANPTSKDRAPGLGN+AVEPP+WP
Sbjct: 481 MKSSEAVVNHYKYQAWPEFQMKFRRRVSTYVVDWKDSANPTSKDRAPGLGNSAVEPPDWP 540

Query: 541 RKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           RKFCEVRDDRLRLLT+RWFGFQTA+ YRMAWQ
Sbjct: 541 RKFCEVRDDRLRLLTRRWFGFQTAEGYRMAWQ 561

BLAST of HG10004259 vs. ExPASy Swiss-Prot
Match: Q94K98 (Glycosyltransferase family 92 protein At1g27200 OS=Arabidopsis thaliana OX=3702 GN=At1g27200 PE=2 SV=2)

HSP 1 Score: 309.7 bits (792), Expect = 6.9e-83
Identity = 165/401 (41.15%), Positives = 229/401 (57.11%), Query Frame = 0

Query: 184 WTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVF--GDGDDAVRTAVTSSAQEVF 243
           W    YEA    D +++FVKG+  R      P+   C F   + ++   T   ++AQEV 
Sbjct: 176 WEKVGYEAVIDGDTVVVFVKGLTRRPHKESDPSYYKCQFEIENSEEKEVTQAIAAAQEVV 235

Query: 244 RCGHPNLTTREHHDKIKITLEILDAKGKSV-LVPSVAYYSPRRGGGGLVEAQSM------ 303
           RCG P           ++++  +D +G++   +PSVA    R  G   +E +        
Sbjct: 236 RCGLPESLKLNPEMMFRVSVIHIDPRGRTTPALPSVA----RIYGSDSIEKKEKKSGVKH 295

Query: 304 -ICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFW 363
            +C CTM++N   FL+EW+MY+S +GVE++ +YDN SDD I   ++ L  E YN+    W
Sbjct: 296 ELCVCTMLWNQAPFLREWIMYHSWLGVERWFIYDNNSDDGIQEEIELLSSENYNVSRHVW 355

Query: 364 IWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLP--AENN 423
            W KTQEAGFSH    +K+ C W+ FFDVDEF + P   +   PSKN +KSL+      +
Sbjct: 356 PWIKTQEAGFSHCAVRAKEECNWVGFFDVDEFYYFPTHRSQGLPSKNALKSLVSNYTSWD 415

Query: 424 EIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFK 483
            +G I   C+ YGPS   S P++GVT GY CR+   ERHKSI+  E +  SLLN +HHF+
Sbjct: 416 LVGEIRTDCHSYGPSGLTSVPSQGVTVGYTCRQANPERHKSIIRPELLTSSLLNEVHHFQ 475

Query: 484 LRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGN 543
           L++      +    AVVNHYKYQ W  F+ KF RRV+ YVVDW+++ N  SKDRAPGLG 
Sbjct: 476 LKEGVGHMSLVESVAVVNHYKYQVWDTFKAKFYRRVATYVVDWQENQNQGSKDRAPGLGT 535

Query: 544 AAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
            A+EPP+W R+FCEV D  L+ L    F  Q      + WQ
Sbjct: 536 EAIEPPDWKRRFCEVWDTGLKDLVMSNFADQVTG--YLPWQ 570

BLAST of HG10004259 vs. ExPASy Swiss-Prot
Match: B9S2H4 (Glycosyltransferase family 92 protein RCOM_0699480 OS=Ricinus communis OX=3988 GN=RCOM_0699480 PE=3 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 9.4e-80
Identity = 158/386 (40.93%), Positives = 226/386 (58.55%), Query Frame = 0

Query: 181 LMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFG----DGDDAV--RTAVT 240
           ++ W    YEA    + + +FVKG+N R       +   C FG    D D+ +   T   
Sbjct: 172 VVSWDRVVYEAMLDWNTVAVFVKGLNLRPHKESDSSKFRCHFGLSKFDKDEGIVFTTEAI 231

Query: 241 SSAQEVFRCGHPNLTTREHHDK---IKITLEILDA--KGKSVLVPSVA-YYSPRRGGGGL 300
           ++AQEV RC  P  + R +  K   I++T+  ++A   G    +PSVA  Y  +      
Sbjct: 232 TAAQEVIRCLLPR-SIRNNPVKAQGIRVTVSRINAGEDGVDAPLPSVAKVYGAKSYEKRS 291

Query: 301 VEAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNI 360
              +  +CACTM++N   FL EW+ Y++ +GV+++ +YDN SDD I  VV EL L+ YN+
Sbjct: 292 NRGKYELCACTMLWNQASFLHEWITYHAWLGVQRWFIYDNNSDDGIQEVVDELNLQNYNV 351

Query: 361 EIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLP- 420
               W W K QEAGFSH    ++  CKW+ FFDVDEF + P         +N +++L+  
Sbjct: 352 TRHSWPWIKAQEAGFSHCALRARSECKWLGFFDVDEFFYLPRHRGQDMLGENSLRTLVAN 411

Query: 421 -AENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNV 480
            ++++    I  +C+ +GPS   S P++GVT GY CR +  ERHKSIV  E +D +LLNV
Sbjct: 412 YSDSSTYAEIRTICHSFGPSGLTSAPSQGVTVGYTCRLQAPERHKSIVRPELLDTTLLNV 471

Query: 481 IHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRA 540
           +HHFKL++ +R   +    AVVNHYKYQ W  F+ KF RRVS YV +W++  N  SKDRA
Sbjct: 472 VHHFKLKEGYRYLNVPESTAVVNHYKYQVWDTFKAKFFRRVSTYVANWQEDQNQGSKDRA 531

Query: 541 PGLGNAAVEPPEWPRKFCEVRDDRLR 553
           PGLG  A+EPP+W  +FCEV D  L+
Sbjct: 532 PGLGTVAIEPPDWRLRFCEVWDTGLK 556

BLAST of HG10004259 vs. ExPASy Swiss-Prot
Match: B9SLR1 (Glycosyltransferase family 92 protein RCOM_0530710 OS=Ricinus communis OX=3988 GN=RCOM_0530710 PE=3 SV=1)

HSP 1 Score: 295.4 bits (755), Expect = 1.4e-78
Identity = 164/381 (43.04%), Positives = 219/381 (57.48%), Query Frame = 0

Query: 183 RWTFFAYEA-FETEDDILLFVKGVNYRQGNNRAPTDLNCVFG----DGDDAVRTAVTSSA 242
           RW    YEA  + ++  ++FVKG N R       +   CV+G         +R+ V S A
Sbjct: 161 RWDSLVYEAMIDRDNTTVVFVKGFNLRADRIYNASKFECVYGWDFRKTKFVLRSNVISIA 220

Query: 243 QEVFRCGHPNLTTREHHDKIKITLEI-LDAKGK----SVLVPSVAYYS-PRRGGGGLVEA 302
           QE+ RC  P L+   +  K+   +++ +  KGK    S+  P V   + P  G  G  E 
Sbjct: 221 QEIVRCQTP-LSILNNQLKVNNAIKVSIRLKGKGTLHSIARPGVQLLTDPEPGLRG--EK 280

Query: 303 QSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIV 362
              +C CTM+ N G+FLKEWVMY+S+IGVE++ +YDN S+D+I +V++ L    +NI   
Sbjct: 281 PHEMCICTMLRNQGRFLKEWVMYHSQIGVERWFIYDNNSEDDIDSVIESLIDAKFNISRH 340

Query: 363 FWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENN 422
            W W K QEAGF+H    ++  C+W+ F DVDEF   P  LN     KN   S      N
Sbjct: 341 VWPWVKAQEAGFAHCALRARGLCEWVGFIDVDEFFHLPTGLNLQDAVKNQSNS-----GN 400

Query: 423 EIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFK 482
            +  + V C+ +GPS     PA+GVT GY CR  L ERHKSIV  EA++ +L+NV+HHF 
Sbjct: 401 NVAELRVSCHSFGPSGLKHVPAQGVTVGYTCRMMLPERHKSIVKPEALNSTLINVVHHFH 460

Query: 483 LRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGN 542
           LR  FR         V+NHYKYQ W  F+ KF RRV+ YVVDW++  N  SKDRAPGLG 
Sbjct: 461 LRDGFRYVNADKGILVINHYKYQVWEVFKEKFYRRVATYVVDWQNEQNVGSKDRAPGLGT 520

Query: 543 AAVEPPEWPRKFCEVRDDRLR 553
            AVEPP+W  +FCEV D  LR
Sbjct: 521 RAVEPPDWSSRFCEVSDTGLR 533

BLAST of HG10004259 vs. ExPASy Swiss-Prot
Match: Q6YRM6 (Glycosyltransferase family 92 protein Os08g0121900 OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0121900 PE=2 SV=1)

HSP 1 Score: 287.3 bits (734), Expect = 3.7e-76
Identity = 160/417 (38.37%), Positives = 236/417 (56.59%), Query Frame = 0

Query: 162 PEKEFSSSSFSSSSSLAPELMRWTFFAYEAF--ETEDDILLFVKGVNYRQGNNRAPTDLN 221
           P +   S S + S  +AP  ++W    Y A     ++  ++F KG+N R G    P+   
Sbjct: 167 PSRVAVSLSLAQSVPVAP--LQWDRLVYTALIDSKDNSTVVFAKGMNLRPGRLGVPSRYE 226

Query: 222 CVFGDGDD----AVRTAVTSSAQEVFRCGHP-------NLTT---REHHDKIKITLEILD 281
           CVFG         V + V S+AQE+FRC  P        +TT      ++  K  L  + 
Sbjct: 227 CVFGRDFSKPKLVVTSPVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNNDDKPMLVSIR 286

Query: 282 AKGK-SVLVPSVAYYS--PRRGGGGLVEAQSMICACTMVYNVGKFLKEWVMYYSRIGVEK 341
            KG+ S  +PS+A     PR       +A SM C CTM+ N  +FL+EW++Y+SRIGV++
Sbjct: 287 TKGRGSSTLPSIAQPEPLPRYNKHWRRKAHSM-CVCTMLRNQARFLREWIIYHSRIGVQR 346

Query: 342 FILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDV 401
           + +YDN SDD I  V+  +    YN+    W W K+QEAGF+H    +++SC+W+ F D+
Sbjct: 347 WFIYDNNSDDGIEEVLNTMDSSRYNVTRYLWPWMKSQEAGFAHCALRARESCEWVGFIDI 406

Query: 402 DEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNC 461
           DEF+  P      +  ++++++   +    IG +   C+ +GPS R   P +GVT GY C
Sbjct: 407 DEFLHFP----GNQTLQDVLRNY--SVKPRIGELRTACHSFGPSGRTKIPKKGVTTGYTC 466

Query: 462 RRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTK 521
           R    ERHKSIV  +A++PSL+NV+HHF L++  +   +     ++NHYKYQ W  F+ K
Sbjct: 467 RLAAPERHKSIVRPDALNPSLINVVHHFHLKEGMKYVNIGQGMMLINHYKYQVWEVFKDK 526

Query: 522 FRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWF 560
           F  RV+ YV DW+D  N  S+DRAPGLG   VEP +WPR+FCEV D+ L+   Q+ F
Sbjct: 527 FSGRVATYVADWQDEENVGSRDRAPGLGTKPVEPEDWPRRFCEVYDNGLKDFVQKVF 574

BLAST of HG10004259 vs. ExPASy TrEMBL
Match: A0A0A0KDF9 (Glycosyltransferase family 92 protein OS=Cucumis sativus OX=3659 GN=Csa_6G355960 PE=3 SV=1)

HSP 1 Score: 1013.1 bits (2618), Expect = 4.7e-292
Identity = 499/577 (86.48%), Positives = 530/577 (91.85%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPF-PSSANLLPSRSANSVAHYALYE 60
           MRRKPR TGL+FS+A FLLFAFQ  RK FF+ AD     SS+NLLPSRSANSV HYAL+E
Sbjct: 1   MRRKPRFTGLIFSVAFFLLFAFQFSRKVFFFGADLNISSSSSNLLPSRSANSVVHYALHE 60

Query: 61  TNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQN 120
           TN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDS RDFLCLFQN
Sbjct: 61  TNLDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSFRDFLCLFQN 120

Query: 121 NATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAP 180
           NATSSANFSG+LD TGR+TFKCLMP+SVRRLRPFFQP LTKSP+KEFSSS   SSSS AP
Sbjct: 121 NATSSANFSGVLDFTGRVTFKCLMPESVRRLRPFFQPLLTKSPDKEFSSS--LSSSSPAP 180

Query: 181 ELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQE 240
           ELMRWTFFAYEAFETE+D++LFVKGVN RQG+NR PTDLNCVFGDGDDA+RTAVTSS QE
Sbjct: 181 ELMRWTFFAYEAFETEEDVVLFVKGVNNRQGSNRQPTDLNCVFGDGDDAIRTAVTSSVQE 240

Query: 241 VFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG--GGGLV--EAQSMI 300
           VFRC HPNLTT E HDK KITLEILDA+GK++LVPSVAYYSPRR   GGGLV  EAQSMI
Sbjct: 241 VFRCRHPNLTTSEDHDKFKITLEILDARGKNILVPSVAYYSPRRSGDGGGLVETEAQSMI 300

Query: 301 CACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIW 360
           CACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGS+DEISAV+KELK EGYNIEIVFWIW
Sbjct: 301 CACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSEDEISAVLKELKQEGYNIEIVFWIW 360

Query: 361 PKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGM 420
           PKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPSKNM+ SLLP +N+ IGM
Sbjct: 361 PKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPSKNMLNSLLPTKNSGIGM 420

Query: 421 ITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKE 480
           +TVMCNDYGPSDRISHPAEGVTQGYNCRRK+EERHKSIVLLEAVD SLLNVIHHFKLRKE
Sbjct: 421 VTVMCNDYGPSDRISHPAEGVTQGYNCRRKVEERHKSIVLLEAVDRSLLNVIHHFKLRKE 480

Query: 481 FRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVE 540
           F+ RQM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWK+SANPTSKDRAPGLGN AVE
Sbjct: 481 FQSRQMRVEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKNSANPTSKDRAPGLGNTAVE 540

Query: 541 PPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           PPEWPRKFCEVRDDRLRLLTQRWFG++TAD YRMAWQ
Sbjct: 541 PPEWPRKFCEVRDDRLRLLTQRWFGYETADGYRMAWQ 575

BLAST of HG10004259 vs. ExPASy TrEMBL
Match: A0A1S3CMN3 (Glycosyltransferase family 92 protein OS=Cucumis melo OX=3656 GN=LOC103502651 PE=3 SV=1)

HSP 1 Score: 1008.1 bits (2605), Expect = 1.5e-290
Identity = 501/587 (85.35%), Positives = 528/587 (89.95%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPF-------PSSANLLPSRSANSVA 60
           MRRKPR TGLLFS+A F LF FQL RKAFF+ +D  F        SS+NLLPSRSANSV 
Sbjct: 1   MRRKPRFTGLLFSVAFFFLFVFQLSRKAFFFGSDLNFSSSSSSSSSSSNLLPSRSANSVV 60

Query: 61  HYALYETNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF 120
           HYAL+ETN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF
Sbjct: 61  HYALHETNPDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDF 120

Query: 121 LCLFQNNATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSS 180
           LCLFQNNATSSANFSG+LD TGRLTFKCLMP+SVRRLRPF QP LTKSP+KEFSSS   S
Sbjct: 121 LCLFQNNATSSANFSGVLDFTGRLTFKCLMPESVRRLRPFLQPLLTKSPDKEFSSS--LS 180

Query: 181 SSSLAPELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAV 240
           SSS APELMRWTFFAYEAFETEDD++LFVKGVN RQG+NR PTDLNCVFGDGD AVRTAV
Sbjct: 181 SSSPAPELMRWTFFAYEAFETEDDVVLFVKGVNNRQGSNRQPTDLNCVFGDGDGAVRTAV 240

Query: 241 TSSAQEVFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG------GGG 300
           TSS QEVFRC HPNLTTR+ HDK K+TLEILDA+GK++LVPSVAYYSPRR       GGG
Sbjct: 241 TSSVQEVFRCRHPNLTTRDDHDKFKVTLEILDARGKNILVPSVAYYSPRRSGSGSGDGGG 300

Query: 301 LV--EAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEG 360
           LV  EAQSMICACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGSDDEISAV K LK EG
Sbjct: 301 LVKTEAQSMICACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSDDEISAVAKGLKQEG 360

Query: 361 YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSL 420
           YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPSKNM+KSL
Sbjct: 361 YNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPSKNMLKSL 420

Query: 421 LPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLN 480
           LP +N+ IGMITVMCNDYGPSDRISHP EGVTQGYNCRRK+EERHKSIVLL+AVD SLLN
Sbjct: 421 LPTQNSGIGMITVMCNDYGPSDRISHPVEGVTQGYNCRRKIEERHKSIVLLDAVDRSLLN 480

Query: 481 VIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDR 540
           VIHHFKLR EF+ +QM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWKDSANPTSKDR
Sbjct: 481 VIHHFKLRNEFQSKQMRLEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKDSANPTSKDR 540

Query: 541 APGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           APGLGN AVEPPEWPRKFCEVRDDRLRLLTQRWFGF+TAD YRMAWQ
Sbjct: 541 APGLGNRAVEPPEWPRKFCEVRDDRLRLLTQRWFGFETADGYRMAWQ 585

BLAST of HG10004259 vs. ExPASy TrEMBL
Match: A0A5A7TAP9 (Glycosyltransferase family 92 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold119G00110 PE=3 SV=1)

HSP 1 Score: 1005.4 bits (2598), Expect = 9.8e-290
Identity = 501/594 (84.34%), Positives = 528/594 (88.89%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSA--------------NLLPS 60
           MRRKPR TGLLFS+A F LF FQL RKAFF+ +D  F SS+              NLLPS
Sbjct: 1   MRRKPRFTGLLFSVAFFFLFVFQLSRKAFFFGSDLNFSSSSSSSSSSSSSSSSSFNLLPS 60

Query: 61  RSANSVAHYALYETNFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS 120
           RSANSV HYAL+ETN  FP QLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS
Sbjct: 61  RSANSVVHYALHETNPDFPQQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSS 120

Query: 121 PDSLRDFLCLFQNNATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEF 180
           PDSLRDFLCLFQNNATSSANFSG+LD TGRLTFKCLMP+SVRRLRPF QP LTKSP+KEF
Sbjct: 121 PDSLRDFLCLFQNNATSSANFSGVLDFTGRLTFKCLMPESVRRLRPFLQPLLTKSPDKEF 180

Query: 181 SSSSFSSSSSLAPELMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGD 240
           SSS   SSSS APELMRWTFFAYEAFETEDD++LFVKGVN RQG+NR PTDLNCVFGDGD
Sbjct: 181 SSS--LSSSSPAPELMRWTFFAYEAFETEDDVVLFVKGVNNRQGSNRQPTDLNCVFGDGD 240

Query: 241 DAVRTAVTSSAQEVFRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRG-- 300
            AVRTAVTSS QEVFRC HPNLTTR+ HDK K+TLEILDA+GK++LVPSVAYYSPRR   
Sbjct: 241 GAVRTAVTSSVQEVFRCRHPNLTTRDDHDKFKVTLEILDARGKNILVPSVAYYSPRRSGS 300

Query: 301 ----GGGLV--EAQSMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVV 360
               GGGLV  EAQSMICACTMVYNVGKFL+EWVMYYSRIGVEKFILYDNGSDDEISAV 
Sbjct: 301 GSGDGGGLVKTEAQSMICACTMVYNVGKFLREWVMYYSRIGVEKFILYDNGSDDEISAVA 360

Query: 361 KELKLEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPS 420
           K LK EGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMF D+DEF+FSP WLNSLKPS
Sbjct: 361 KGLKQEGYNIEIVFWIWPKTQEAGFSHSVEYSKKSCKWMMFVDIDEFVFSPSWLNSLKPS 420

Query: 421 KNMVKSLLPAENNEIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEA 480
           KNM+KSLLP +N+ IGMITVMCNDYGPSDRISHP EGVTQGYNCRRK+EERHKSIVLL+A
Sbjct: 421 KNMLKSLLPTQNSGIGMITVMCNDYGPSDRISHPVEGVTQGYNCRRKIEERHKSIVLLDA 480

Query: 481 VDPSLLNVIHHFKLRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSA 540
           VD SLLNVIHHFKLR EF+ +QM  EEAVVNHYKYQAWPEFR KFRRRVSAYVVDWKDSA
Sbjct: 481 VDRSLLNVIHHFKLRNEFQSKQMRLEEAVVNHYKYQAWPEFRMKFRRRVSAYVVDWKDSA 540

Query: 541 NPTSKDRAPGLGNAAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           NPTSKDRAPGLGN AVEPPEWPRKFCEVRDDRLRLLTQRWFGF+TAD YRMAWQ
Sbjct: 541 NPTSKDRAPGLGNRAVEPPEWPRKFCEVRDDRLRLLTQRWFGFETADGYRMAWQ 592

BLAST of HG10004259 vs. ExPASy TrEMBL
Match: A0A6J1KBF7 (Glycosyltransferase family 92 protein OS=Cucurbita maxima OX=3661 GN=LOC111493951 PE=3 SV=1)

HSP 1 Score: 926.4 bits (2393), Expect = 5.8e-266
Identity = 457/572 (79.90%), Positives = 496/572 (86.71%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYET 60
           MRRKP   GLL S A+FL+F+FQ+ RKA F   D P  SS  LLPSRS NSVAHYA++E 
Sbjct: 1   MRRKPCFAGLLLSCAVFLIFSFQISRKAIFSGGDLPSLSSDKLLPSRSKNSVAHYAIHEA 60

Query: 61  NFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQNN 120
           N      L HR R +SS+L  IPT+SLLLPDWE+LLISSI TPLSSPDSLRDFLCLF NN
Sbjct: 61  N------LAHRNRQLSSVLGSIPTLSLLLPDWEILLISSIHTPLSSPDSLRDFLCLFHNN 120

Query: 121 ATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPE 180
           ATS ANFSG+LD TGR TFKC MP SVRRLRPFFQP LTKSP+ E S    SSSSS A E
Sbjct: 121 ATSPANFSGVLDFTGRATFKCRMPPSVRRLRPFFQPLLTKSPKNELS----SSSSSPAME 180

Query: 181 LMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQEV 240
           LMRWTF AYEA ETEDD++LFVKGVN+R+G NR P+DL CVFGDG DA+RTAVTSS QEV
Sbjct: 181 LMRWTFLAYEALETEDDVVLFVKGVNHRRGINRPPSDLKCVFGDGYDAIRTAVTSSEQEV 240

Query: 241 FRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQSMICACTM 300
           FRC HPNLTTR+ ++K+KITLEI DAKGKS+LVPSVAYYSPR GG   VEA+SMICACTM
Sbjct: 241 FRCRHPNLTTRDDYEKMKITLEIFDAKGKSILVPSVAYYSPRYGGDS-VEAKSMICACTM 300

Query: 301 VYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQE 360
           VYNVGKFLKEWV+YYSRIGVEKFILYDNGSDDEIS + KELKLEGY IEIVFWIWPKTQE
Sbjct: 301 VYNVGKFLKEWVIYYSRIGVEKFILYDNGSDDEISEIAKELKLEGYIIEIVFWIWPKTQE 360

Query: 361 AGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMC 420
           AGFSHS EYSKKSCKWMM  D+DEF+FSP WLNSL+PSKNM+KSL+PAE N IGMIT+MC
Sbjct: 361 AGFSHSAEYSKKSCKWMMIVDIDEFVFSPSWLNSLEPSKNMLKSLIPAEKNGIGMITIMC 420

Query: 421 NDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQ 480
           NDYGPSDRISHPAEGVTQGYNCR K EERHKSIVLLEAVDPSLLNVIHHF+LRKEFR R+
Sbjct: 421 NDYGPSDRISHPAEGVTQGYNCRIKAEERHKSIVLLEAVDPSLLNVIHHFRLRKEFRWRK 480

Query: 481 MTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWP 540
           M   EAVVNHYKYQAWPEF+ KFRRRVS YVVDWKDSANPTSKDRAPGLGN+AVEPP+WP
Sbjct: 481 MKSSEAVVNHYKYQAWPEFQMKFRRRVSTYVVDWKDSANPTSKDRAPGLGNSAVEPPDWP 540

Query: 541 RKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           RKFCEVRDDRLRLLT+RWFGFQTA+ YRMAWQ
Sbjct: 541 RKFCEVRDDRLRLLTRRWFGFQTAEGYRMAWQ 561

BLAST of HG10004259 vs. ExPASy TrEMBL
Match: A0A6J1G319 (Glycosyltransferase family 92 protein OS=Cucurbita moschata OX=3662 GN=LOC111450330 PE=3 SV=1)

HSP 1 Score: 921.0 bits (2379), Expect = 2.4e-264
Identity = 454/572 (79.37%), Positives = 493/572 (86.19%), Query Frame = 0

Query: 1   MRRKPRSTGLLFSLALFLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYET 60
           MRRKP   GLL S A+F +F+FQ+ RKAFF   D P  SS  LLPSRS NSVAHYA++E 
Sbjct: 1   MRRKPCFAGLLLSCAVFFIFSFQISRKAFFSGGDLPSLSSDKLLPSRSTNSVAHYAIHEA 60

Query: 61  NFKFPHQLTHRTRHVSSILDPIPTVSLLLPDWEVLLISSIDTPLSSPDSLRDFLCLFQNN 120
           N      L HR R +SS+L  IPT+S+LLPDWE+LLISSI TPLSSPDSLRDFLCLF NN
Sbjct: 61  N------LAHRNRQLSSVLRSIPTLSILLPDWEILLISSIHTPLSSPDSLRDFLCLFHNN 120

Query: 121 ATSSANFSGILDSTGRLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPE 180
           ATS ANFSGILD TGR  FKC MP SVRRLRPFFQP LTKSP+ E S    SSSSS A E
Sbjct: 121 ATSPANFSGILDFTGRAKFKCRMPPSVRRLRPFFQPLLTKSPKNELS----SSSSSPAME 180

Query: 181 LMRWTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVFGDGDDAVRTAVTSSAQEV 240
           LMRWTF AYE+ ETEDD++LFVKGVN+R+G NR P+DL CVFGDGDDA+RTAVTSS QEV
Sbjct: 181 LMRWTFLAYESLETEDDVVLFVKGVNHRRGINRPPSDLKCVFGDGDDAIRTAVTSSEQEV 240

Query: 241 FRCGHPNLTTREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQSMICACTM 300
           FRC HPNLTTR+ ++K+KITLEI D+KGKS+LVPSVAYYSPR GG  L EA+SMICACTM
Sbjct: 241 FRCCHPNLTTRDDYNKMKITLEIFDSKGKSILVPSVAYYSPRYGGASL-EAKSMICACTM 300

Query: 301 VYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQE 360
           VYNVGKFLKEWV+YYS IGVEKFILYDNGSDDEIS +VKELKLEGY IEIVFWIWPKTQE
Sbjct: 301 VYNVGKFLKEWVIYYSSIGVEKFILYDNGSDDEISEIVKELKLEGYIIEIVFWIWPKTQE 360

Query: 361 AGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMC 420
           AGFSHS EYSKKSCKWMM  D+DEF+FSP WLNSL+PSKNM+KSL+P ENN IGMIT+MC
Sbjct: 361 AGFSHSAEYSKKSCKWMMIVDIDEFVFSPSWLNSLEPSKNMLKSLIPPENNGIGMITIMC 420

Query: 421 NDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQ 480
           NDYGPSDRISHPAEGVTQGYNCR K EERHKSIVLLEAVDPSLLNVIHHF+LRKEFR R+
Sbjct: 421 NDYGPSDRISHPAEGVTQGYNCRIKAEERHKSIVLLEAVDPSLLNVIHHFRLRKEFRWRK 480

Query: 481 MTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWP 540
           M   EAVVNHYKYQAWPEFR KFRRRVS YVVDWKD ANPTSKDRAPGLGN AVEPP+W 
Sbjct: 481 MKSSEAVVNHYKYQAWPEFRMKFRRRVSTYVVDWKDPANPTSKDRAPGLGNTAVEPPDWA 540

Query: 541 RKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           RKFCEVRDDRLRLLT+RWFGFQTA+ YRMAWQ
Sbjct: 541 RKFCEVRDDRLRLLTRRWFGFQTAEGYRMAWQ 561

BLAST of HG10004259 vs. TAIR 10
Match: AT4G37420.1 (Domain of unknown function (DUF23) )

HSP 1 Score: 491.1 bits (1263), Expect = 1.2e-138
Identity = 259/561 (46.17%), Positives = 349/561 (62.21%), Query Frame = 0

Query: 17  FLLFAFQLFRKAFFYVADFPFPSSANLLPSRSANSVAHYALYETNFKFPHQLTHRTRHVS 76
           F+L +  LF     Y +      +A    +R A S     +  T     HQL++ +R + 
Sbjct: 46  FILVSLSLFGFISLYYSPNTIYRAAFFATTRPAKSRFVSYVINTQDSNHHQLSNGSRRIR 105

Query: 77  SILDPIPTVSLLLPDWEVLLISSIDTPLSSPD-SLRDFLCLFQNNATSSANFSGILDSTG 136
           +        ++L P WE+L+I S +     P     +++C + N   S+A F+ IL  + 
Sbjct: 106 A-------EAVLWPGWEILVIVSPEEKAKPPPFPGENYICFYPNGEKSTARFAAILPFSN 165

Query: 137 RLTFKCLMPDSVRRLRPFFQPALTKSPEKEFSSSSFSSSSSLAPELMRWTFFAYEAFETE 196
           R +F+C +P   R   P   P L  S  K F      S  +  P+L  W F  +EA  TE
Sbjct: 166 RASFRCSLPGIYRHHHPIPTPILASS--KRFQ----LSPETRWPDLPLWNFVVFEAISTE 225

Query: 197 DDILLFVKGVNYRQGNNRAPTDLNCVFG-DGDDAVRTAVTSSAQEVFRCGHPNLTTREHH 256
            D++L VKG N   G+N+ P    CVFG + D A+RTAVTSS QEVFRC  PN+T     
Sbjct: 226 TDVVLLVKGPNRGLGSNKPPESFRCVFGEESDTAIRTAVTSSVQEVFRCSLPNITI---D 285

Query: 257 DKIKITLE-ILDAKGKSVLVPSVAYYSPRRGGGGLVE--AQSMICACTMVYNVGKFLKEW 316
             +KI LE +   K ++  VPSVAYYSP+R    LVE   +S++CA TMVYNV K+L+EW
Sbjct: 286 TPVKIYLEAVATGKEETKTVPSVAYYSPKR---TLVEPREKSLLCATTMVYNVAKYLREW 345

Query: 317 VMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWIWPKTQEAGFSHSVEYSK 376
           VMY++ IG+++FI+YDNGSDDE++ VVK L  E Y++  V WIWPKTQEAGFSH+  Y  
Sbjct: 346 VMYHAAIGIQRFIIYDNGSDDELNDVVKGLNSEKYDVIKVLWIWPKTQEAGFSHAAVYGN 405

Query: 377 KSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIGMITVMCNDYGPSDRISH 436
            +C WMM+ DVDEF+FSP W    +PS  M++SLLP++ + IG ++   +++GPS++  H
Sbjct: 406 DTCTWMMYLDVDEFLFSPAWDKQSQPSDQMIRSLLPSDQSMIGQVSFKSHEFGPSNQTKH 465

Query: 437 PAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRKEFRMRQMTPEEAVVNHY 496
           P  GVTQGY CRR+ ++RHKSIV L AV+ SL   IHHF L++E+  R    EE VVNHY
Sbjct: 466 PRGGVTQGYTCRREEDQRHKSIVRLSAVEHSLYTAIHHFGLKREYEWRVADTEEGVVNHY 525

Query: 497 KYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAVEPPEWPRKFCEVRDDRL 556
           KYQAW EF+ KF+RRVSAYVVDW   +NP S+DR PGLG   VEP  W  KFCEV D RL
Sbjct: 526 KYQAWQEFKAKFKRRVSAYVVDWTRVSNPKSRDRTPGLGFRPVEPEGWAHKFCEVEDLRL 585

Query: 557 RLLTQRWFGFQTADRYRMAWQ 573
           ++LT++WFG+   + YRMAWQ
Sbjct: 586 KILTRKWFGYPVKNGYRMAWQ 587

BLAST of HG10004259 vs. TAIR 10
Match: AT1G27200.1 (Domain of unknown function (DUF23) )

HSP 1 Score: 309.7 bits (792), Expect = 4.9e-84
Identity = 165/401 (41.15%), Positives = 229/401 (57.11%), Query Frame = 0

Query: 184 WTFFAYEAFETEDDILLFVKGVNYRQGNNRAPTDLNCVF--GDGDDAVRTAVTSSAQEVF 243
           W    YEA    D +++FVKG+  R      P+   C F   + ++   T   ++AQEV 
Sbjct: 176 WEKVGYEAVIDGDTVVVFVKGLTRRPHKESDPSYYKCQFEIENSEEKEVTQAIAAAQEVV 235

Query: 244 RCGHPNLTTREHHDKIKITLEILDAKGKSV-LVPSVAYYSPRRGGGGLVEAQSM------ 303
           RCG P           ++++  +D +G++   +PSVA    R  G   +E +        
Sbjct: 236 RCGLPESLKLNPEMMFRVSVIHIDPRGRTTPALPSVA----RIYGSDSIEKKEKKSGVKH 295

Query: 304 -ICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFW 363
            +C CTM++N   FL+EW+MY+S +GVE++ +YDN SDD I   ++ L  E YN+    W
Sbjct: 296 ELCVCTMLWNQAPFLREWIMYHSWLGVERWFIYDNNSDDGIQEEIELLSSENYNVSRHVW 355

Query: 364 IWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLP--AENN 423
            W KTQEAGFSH    +K+ C W+ FFDVDEF + P   +   PSKN +KSL+      +
Sbjct: 356 PWIKTQEAGFSHCAVRAKEECNWVGFFDVDEFYYFPTHRSQGLPSKNALKSLVSNYTSWD 415

Query: 424 EIGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFK 483
            +G I   C+ YGPS   S P++GVT GY CR+   ERHKSI+  E +  SLLN +HHF+
Sbjct: 416 LVGEIRTDCHSYGPSGLTSVPSQGVTVGYTCRQANPERHKSIIRPELLTSSLLNEVHHFQ 475

Query: 484 LRKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGN 543
           L++      +    AVVNHYKYQ W  F+ KF RRV+ YVVDW+++ N  SKDRAPGLG 
Sbjct: 476 LKEGVGHMSLVESVAVVNHYKYQVWDTFKAKFYRRVATYVVDWQENQNQGSKDRAPGLGT 535

Query: 544 AAVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
            A+EPP+W R+FCEV D  L+ L    F  Q      + WQ
Sbjct: 536 EAIEPPDWKRRFCEVWDTGLKDLVMSNFADQVTG--YLPWQ 570

BLAST of HG10004259 vs. TAIR 10
Match: AT5G40720.1 (Domain of unknown function (DUF23) )

HSP 1 Score: 294.7 bits (753), Expect = 1.6e-79
Identity = 162/402 (40.30%), Positives = 226/402 (56.22%), Query Frame = 0

Query: 183 RWTFFAYEA-FETEDDILLFVKGVNYRQGNNRAPTDLNCVFG----DGDDAVRTAVTSSA 242
           RW +  Y+A  + ++  ++FVKG+N R G     +   CV+G         +R    S+A
Sbjct: 161 RWDWLVYDAVIDDDNSTVVFVKGLNLRPGKVADASRYECVYGWDFTKPKLLLRAQAISAA 220

Query: 243 QEVFRCGHPNLTT-----REHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQ 302
           QE+ RC  P LT      R     +K+++ I   KG  +L PSVA +  +R G   V   
Sbjct: 221 QEIVRCKTP-LTVLDGPRRAQSQPVKVSVRI---KGSGML-PSVA-HPIKRPGRIKVSKT 280

Query: 303 SMICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVF 362
              C CTM  N    L+EWVMY++ IGV+++ +YDN SDD+I + +K L+  GYNI   F
Sbjct: 281 FETCVCTMTRNAANVLREWVMYHAGIGVQRWFIYDNNSDDDIVSEIKNLENRGYNISRHF 340

Query: 363 WIWPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNE 422
           W W KTQEAGF++    +K  C W+ F DVDEF + P         +N   +  P+ + E
Sbjct: 341 WPWIKTQEAGFANCAIRAKSDCDWVAFIDVDEFFYIPSGQTLTNVIRN--HTTTPSSSGE 400

Query: 423 IGMITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKL 482
           IG I   C+ +GPS     P  GVT  Y CR  L ERHKSI+  E+++ +L+NV+HHF L
Sbjct: 401 IGEIRTPCHSFGPSGLRDPPRSGVTAAYTCRMALPERHKSIIRPESLNATLINVVHHFHL 460

Query: 483 RKEFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNA 542
           ++EF    +     V+NHYKYQ W  F+ KF+RRV+ YV DW++  N  SKDRAPGLG  
Sbjct: 461 KEEFAFVDVDKSTMVINHYKYQVWDIFKEKFKRRVATYVADWQNEENVGSKDRAPGLGTR 520

Query: 543 AVEPPEWPRKFCEVRDDRLRLLTQRWFGFQTADR--YRMAWQ 573
            VEP +W  +FCEV D  LR     W   + +DR   R+ W+
Sbjct: 521 PVEPTDWAERFCEVSDIGLR----DWVLEKFSDRKTQRLVWE 550

BLAST of HG10004259 vs. TAIR 10
Match: AT3G27330.1 (zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 289.3 bits (739), Expect = 6.9e-78
Identity = 156/398 (39.20%), Positives = 223/398 (56.03%), Query Frame = 0

Query: 183 RWTFFAYEA-FETEDDILLFVKGVNYRQGNNRAPTDLNCVFG----DGDDAVRTAVTSSA 242
           R+ +  Y+A  + ++  ++FVKG+N R G     +   CV+G      +  +R+ V ++A
Sbjct: 165 RYDWLVYDAVIDYDNSTVVFVKGLNLRPGRVADVSRYECVYGWDFAKHNRLIRSDVITAA 224

Query: 243 QEVFRCGHPNLT---TREHHDKIKITLEILDAKGKSVLVPSVAYYSPRRGGGGLVEAQSM 302
           QE+ RC  P       +     +K+++ I   KG + ++PS+A   P R      +    
Sbjct: 225 QEIVRCRTPLAVLDGPKAARGPVKVSVRI---KGGTGMLPSIA--QPVRIINPPRKKPFQ 284

Query: 303 ICACTMVYNVGKFLKEWVMYYSRIGVEKFILYDNGSDDEISAVVKELKLEGYNIEIVFWI 362
           +C CTM  N    L+EWVMY++ IGV+++ +YDN SDD+I A ++ L+  GYNI   FW 
Sbjct: 285 MCVCTMTRNAAAVLREWVMYHAGIGVQRWFIYDNNSDDDIIAEIENLERRGYNISRHFWP 344

Query: 363 WPKTQEAGFSHSVEYSKKSCKWMMFFDVDEFIFSPLWLNSLKPSKNMVKSLLPAENNEIG 422
           W KTQEAGFS+    +K  C W+ F DVDEF + P         +N   +      + IG
Sbjct: 345 WIKTQEAGFSNCAIRAKSDCDWIAFIDVDEFFYIPSGETLTSVIRNYTTT------DSIG 404

Query: 423 MITVMCNDYGPSDRISHPAEGVTQGYNCRRKLEERHKSIVLLEAVDPSLLNVIHHFKLRK 482
            I   C+ +GPS   S P  GVT GY CR  L ERHKSI+  EA++ +L+NV+HHF LR 
Sbjct: 405 EIRTPCHSFGPSGLRSRPRSGVTSGYTCRVVLPERHKSIIRPEAMNATLINVVHHFHLRD 464

Query: 483 EFRMRQMTPEEAVVNHYKYQAWPEFRTKFRRRVSAYVVDWKDSANPTSKDRAPGLGNAAV 542
            F    M  +  V+NHYKYQ W  F+ KF RRV+ YV DW++  N  S+DRAPGLG   V
Sbjct: 465 GFTFADMDKDIMVINHYKYQVWEVFKEKFYRRVATYVADWQNEENVGSRDRAPGLGTRPV 524

Query: 543 EPPEWPRKFCEVRDDRLRLLTQRWFGFQTADRYRMAWQ 573
           EP +W  +FCEV D  LR   Q +  F+     R+ W+
Sbjct: 525 EPSDWAERFCEVNDTGLR--DQVFEKFKDKKTQRLVWE 549

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038884062.1	8.8e-301	89.55	glycosyltransferase family 92 protein At1g27200 [Benincasa hispida]
XP_004144291.1	9.7e-292	86.48	glycosyltransferase family 92 protein At1g27200 [Cucumis sativus] >KGN47528.1 hy...
XP_008464892.1	3.1e-290	85.35	PREDICTED: glycosyltransferase family 92 protein At1g27200 [Cucumis melo]
KAA0038445.1	2.0e-289	84.34	glycosyltransferase family 92 protein [Cucumis melo var. makuwa]
XP_022999662.1	1.2e-265	79.90	glycosyltransferase family 92 protein At1g27200-like [Cucurbita maxima]

Match Name	E-value	Identity	Description
Q94K98	6.9e-83	41.15	Glycosyltransferase family 92 protein At1g27200 OS=Arabidopsis thaliana OX=3702 ...
B9S2H4	9.4e-80	40.93	Glycosyltransferase family 92 protein RCOM_0530710 OS=Ricinus communis OX=3988 G...
B9SLR1	1.4e-78	43.04	Glycosyltransferase family 92 protein RCOM_0530710 OS=Ricinus communis OX=3988 G...
Q6YRM6	3.7e-76	38.37	Glycosyltransferase family 92 protein Os08g0121900 OS=Oryza sativa subsp. japoni...

Match Name	E-value	Identity	Description
A0A0A0KDF9	4.7e-292	86.48	Glycosyltransferase family 92 protein OS=Cucumis sativus OX=3659 GN=Csa_6G355960...
A0A1S3CMN3	1.5e-290	85.35	Glycosyltransferase family 92 protein OS=Cucumis melo OX=3656 GN=LOC103502651 PE...
A0A5A7TAP9	9.8e-290	84.34	Glycosyltransferase family 92 protein OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A6J1KBF7	5.8e-266	79.90	Glycosyltransferase family 92 protein OS=Cucurbita maxima OX=3661 GN=LOC11149395...
A0A6J1G319	2.4e-264	79.37	Glycosyltransferase family 92 protein OS=Cucurbita moschata OX=3662 GN=LOC111450...

Match Name	E-value	Identity	Description
AT4G37420.1	1.2e-138	46.17	Domain of unknown function (DUF23)
AT1G27200.1	4.9e-84	41.15	Domain of unknown function (DUF23)
AT5G40720.1	1.6e-79	40.30	Domain of unknown function (DUF23)
AT3G27330.1	6.9e-78	39.20	zinc finger (C3HC4-type RING finger) family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR008166	Glycosyltransferase family 92	PFAM	PF01697	Glyco_transf_92	coord: 294..511; e-value: 1.6E-33; score: 116.5
None	No IPR available	PANTHER	PTHR21461:SF60	GLYCOSYLTRANSFERASE FAMILY 92 PROTEIN	coord: 71..572
None	No IPR available	PANTHER	PTHR21461	UNCHARACTERIZED	coord: 71..572
None	No IPR available	CDD	cd00761	Glyco_tranf_GTA_type	coord: 301..421; e-value: 0.00587983; score: 35.945
IPR029044	Nucleotide-diphospho-sugar transferases	SUPERFAMILY	53448	Nucleotide-diphospho-sugar transferases	coord: 293..397

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10004259.1	HG10004259.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane
molecular_function GO:0016757 glycosyltransferase activity