HG10021007 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021007
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Glycosyltransferase
Location: Chr05: 4437288 .. 4439830 (+)
RNA-Seq Expression: HG10021007
Synteny: HG10021007
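
The Location field can be turned into a feature length with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), and the helper name `parse_location` is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr05: 4437288 .. 4439830 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 4437288 .. 4439830 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2,543 bp on the plus strand of Chr05.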
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGTACCTGATCATCATCTAGTTTTCATCTGTAATCCGGCAATCGGAAATTTAGTTCCGGCAGTCGAATTCGCCGTCCGATTAGTCAATCACGACTCTCGTTTCTTCGCTACATTTCTTGCCATCGACATCCCTGGAAGACCCCTCGTCAATGCCTATACCCAATCACGTTTCTCACTTTCCCCTTCTCCAAATCTCCAATTCATTCATCTCCCATCTCTCCAACCCCCATCTCCCAATCTCTACCATTCCCACATCGCTTATTTGTCCTTAATTTTTGAATCTCATAAGCCCAATGTCAAGCAAGCAATCTCCGACCTCCAAAAACTCCACAATAATTCTGGCCGTATCGTTGGGATTTTTGTCGATATGTTCACTACTGCTTTCATCGATGTTGCTAATGACCTCCAAATTCCTTCCTACCTTTTCTTTGCTTCTCCAGCCACTTTCCTTAGCCTCATGATTCATCTTTCTAAAACGGATCATGACCGATTTAACGCCTTGATCCGTGACTCGGATGCTGAGTTCGTTTTACCGAGTTATGTTCACTCGTTGACTGTCAATTTGTTGCCGCCTACTCTTTTGACGACGGAGGACGGTCTGTTTTGGTACGCCCATCACGGGCAACGGTATGGGGAAACGAAAGGTGTTGTTATAAACACGTTTGCAGAGCTTGAGCCTCACGCGCTGAGTTCGTTGGATGAAATTCCGCCGGTTTATGCTATTGGGCCTGTGGTTGATTTGGACGGCCCGGCCCAGTGGCAGCCTTGTCAGGGGACCCACCAGAGCGTGGTGAAGTGGCTGGATGGTCAGGCTGAGGGGTCGGTTGTTTTGTTGAGCTTTGGGAGTATGGGGAGTCTTGATAAAGGTCAAGTGAGAGAAATTGCGTTTGGGCTGGAGAGGGCGGGGTTCCGGTTCGTGTGGGCCGTACGGCAGCCTCCTAAGACCCATTTAGAACATCCCGACGACTACGACGATCTAAACGACGTGTTGCCGGAAGGGTTTCTAGCTCGTACGGCAGGGCGGGGATTGGTCTGTGGGTGGGTCCCGCAGGTGTGTATATTTCTTAAGAGATACTTGAATTGCATTCATCTTAGTTCCACCACCACACACAAAAATAATAATAATAATAAAAAAATATATATATATTTTTAAAAAATATAAGAAGAATTTATCATAAGAAAAATTATGATTAGACAATTTTGTGAAATATATAAATATAGGTATCGTCAAGATGCATCCAAATATTTTTCTTTTTTTGAGTTCCACTCAGTCATTTCATCTCCACTCCTTTCACCATTTTTTTGGTTTAATTTGTGAGGATAAAAGAGCAAATACTCAGTGGATCATTCTTTGTCACGTGATGAACTCTTTTTTTTTAAAAAAAAATAATAGTTTCGAGTTTTTTTTTTTTTTTTACTTTGTGACTAGAGCATGCAATAAAGAGGATTGGAGTCTGGATTAGGTTCTTGAAAAAAAAATATTTTATTTTAGGGGTCTTTTCACAAATAGAAAAAAAACAAAACTAATTACACATATAGAAAAAAAAGTAGAAAATATAAAAGCATGGGGTGTACTTTTTTCTATATGTGTAATTAGTTTTGTTTTTTTTCTATACACAATAATTTCTCTTTTTTATTTTATTTAATTTAATTTTTTAAATATATATTTTAAAATATATATTATATTTAAATGATTATTTTTTTATATGAAATTTATGTTTAATGAGAATTTTTTTAAGTACAACAATTTGGAGTTGGGTATTCAAATCTCCAATCTCACATCTCCAATCTCACAGTCAAGACAGAATATTTTATGTTTATTGAACTATGTTAGATATGTGTGTATATATATATTTCATAAAAAAAATGAGGTGAGGAAGATGGCAAATGTATTGTTCATTCTCGTCCTTGTTTTACTAACAGAAAAAACTCGTCCCAAATACATCTTTCCTGTTATTTGGAAAGCAAAAAGAATACATTTCTAAAAATTATAAT
ATTAGAACAATATGAAATTTAATTCAGTGTTAATCATGTATGAAATAAACATTTAAAAGGAATGTAAAAACTTTATGAAATAAAGGTAGTATATCACTAACTGGGTATGTATAGTCGCAATATAAGTTTTTTCTTTGAGAATTGATTTATGCTCACATTGACTTGCAGGTGACTATTTTGAGTCATCATGCAATTGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGATTTTAGAGAGTTTATGGTTTGGTGTGCCGATAGCAACATGGCCAGTGTATGCAGAGCAACAAATGAATGCCTTTGAAATGGTGAAGGAATTGGAATTGGCGGTGGAGTTGCGGCTCGATTACAGGGAAGGAAGCAAACTGGTGACGGCAAAGGAGCTTGAGACGGCACTGAGGCGCTTGATGGACGACGGAGATGAGATCAAATCGAGAGTGAAGCAAATGGGAGAGAAGTGCAGAGCAGTTCTCGTGGAAAATGGATCCTCGTATGGGGCACTTAATTCTCTAATTGAGAAATTAACGACTCAAATTTTGTAA

mRNA sequence

ATGGCCGTACCTGATCATCATCTAGTTTTCATCTGTAATCCGGCAATCGGAAATTTAGTTCCGGCAGTCGAATTCGCCGTCCGATTAGTCAATCACGACTCTCGTTTCTTCGCTACATTTCTTGCCATCGACATCCCTGGAAGACCCCTCGTCAATGCCTATACCCAATCACGTTTCTCACTTTCCCCTTCTCCAAATCTCCAATTCATTCATCTCCCATCTCTCCAACCCCCATCTCCCAATCTCTACCATTCCCACATCGCTTATTTGTCCTTAATTTTTGAATCTCATAAGCCCAATGTCAAGCAAGCAATCTCCGACCTCCAAAAACTCCACAATAATTCTGGCCGTATCGTTGGGATTTTTGTCGATATGTTCACTACTGCTTTCATCGATGTTGCTAATGACCTCCAAATTCCTTCCTACCTTTTCTTTGCTTCTCCAGCCACTTTCCTTAGCCTCATGATTCATCTTTCTAAAACGGATCATGACCGATTTAACGCCTTGATCCGTGACTCGGATGCTGAGTTCGTTTTACCGAGTTATGTTCACTCGTTGACTGTCAATTTGTTGCCGCCTACTCTTTTGACGACGGAGGACGGTCTGTTTTGGTACGCCCATCACGGGCAACGGTATGGGGAAACGAAAGGTGTTGTTATAAACACGTTTGCAGAGCTTGAGCCTCACGCGCTGAGTTCGTTGGATGAAATTCCGCCGGTTTATGCTATTGGGCCTGTGGTTGATTTGGACGGCCCGGCCCAGTGGCAGCCTTGTCAGGGGACCCACCAGAGCGTGGTGAAGTGGCTGGATGGTCAGGCTGAGGGGTCGGTTGTTTTGTTGAGCTTTGGGAGTATGGGGAGTCTTGATAAAGGTCAAGTGAGAGAAATTGCGTTTGGGCTGGAGAGGGCGGGGTTCCGGTTCGTGTGGGCCGTACGGCAGCCTCCTAAGACCCATTTAGAACATCCCGACGACTACGACGATCTAAACGACGTGTTGCCGGAAGGGTTTCTAGCTCGTACGGCAGGGCGGGGATTGGTCTGTGGGTGGGTCCCGCAGGTGACTATTTTGAGTCATCATGCAATTGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGATTTTAGAGAGTTTATGGTTTGGTGTGCCGATAGCAACATGGCCAGTGTATGCAGAGCAACAAATGAATGCCTTTGAAATGGTGAAGGAATTGGAATTGGCGGTGGAGTTGCGGCTCGATTACAGGGAAGGAAGCAAACTGGTGACGGCAAAGGAGCTTGAGACGGCACTGAGGCGCTTGATGGACGACGGAGATGAGATCAAATCGAGAGTGAAGCAAATGGGAGAGAAGTGCAGAGCAGTTCTCGTGGAAAATGGATCCTCGTATGGGGCACTTAATTCTCTAATTGAGAAATTAACGACTCAAATTTTGTAA

Coding sequence (CDS)

ATGGCCGTACCTGATCATCATCTAGTTTTCATCTGTAATCCGGCAATCGGAAATTTAGTTCCGGCAGTCGAATTCGCCGTCCGATTAGTCAATCACGACTCTCGTTTCTTCGCTACATTTCTTGCCATCGACATCCCTGGAAGACCCCTCGTCAATGCCTATACCCAATCACGTTTCTCACTTTCCCCTTCTCCAAATCTCCAATTCATTCATCTCCCATCTCTCCAACCCCCATCTCCCAATCTCTACCATTCCCACATCGCTTATTTGTCCTTAATTTTTGAATCTCATAAGCCCAATGTCAAGCAAGCAATCTCCGACCTCCAAAAACTCCACAATAATTCTGGCCGTATCGTTGGGATTTTTGTCGATATGTTCACTACTGCTTTCATCGATGTTGCTAATGACCTCCAAATTCCTTCCTACCTTTTCTTTGCTTCTCCAGCCACTTTCCTTAGCCTCATGATTCATCTTTCTAAAACGGATCATGACCGATTTAACGCCTTGATCCGTGACTCGGATGCTGAGTTCGTTTTACCGAGTTATGTTCACTCGTTGACTGTCAATTTGTTGCCGCCTACTCTTTTGACGACGGAGGACGGTCTGTTTTGGTACGCCCATCACGGGCAACGGTATGGGGAAACGAAAGGTGTTGTTATAAACACGTTTGCAGAGCTTGAGCCTCACGCGCTGAGTTCGTTGGATGAAATTCCGCCGGTTTATGCTATTGGGCCTGTGGTTGATTTGGACGGCCCGGCCCAGTGGCAGCCTTGTCAGGGGACCCACCAGAGCGTGGTGAAGTGGCTGGATGGTCAGGCTGAGGGGTCGGTTGTTTTGTTGAGCTTTGGGAGTATGGGGAGTCTTGATAAAGGTCAAGTGAGAGAAATTGCGTTTGGGCTGGAGAGGGCGGGGTTCCGGTTCGTGTGGGCCGTACGGCAGCCTCCTAAGACCCATTTAGAACATCCCGACGACTACGACGATCTAAACGACGTGTTGCCGGAAGGGTTTCTAGCTCGTACGGCAGGGCGGGGATTGGTCTGTGGGTGGGTCCCGCAGGTGACTATTTTGAGTCATCATGCAATTGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGATTTTAGAGAGTTTATGGTTTGGTGTGCCGATAGCAACATGGCCAGTGTATGCAGAGCAACAAATGAATGCCTTTGAAATGGTGAAGGAATTGGAATTGGCGGTGGAGTTGCGGCTCGATTACAGGGAAGGAAGCAAACTGGTGACGGCAAAGGAGCTTGAGACGGCACTGAGGCGCTTGATGGACGACGGAGATGAGATCAAATCGAGAGTGAAGCAAATGGGAGAGAAGTGCAGAGCAGTTCTCGTGGAAAATGGATCCTCGTATGGGGCACTTAATTCTCTAATTGAGAAATTAACGACTCAAATTTTGTAA

Protein sequence

MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDEIPPVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFGLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL
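
The protein sequence above is the conceptual translation of the CDS. As a rough check, the following sketch translates the first and last few codons of the CDS with the standard genetic code. It assumes the standard (NCBI table 1) code applies to this nuclear gene; the `translate` helper is written here for illustration and is not taken from any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = translate("ATGGCCGTACCTGATCATCAT")  # first seven codons of the CDS
tail = translate("ACTCAAATTTTGTAA")        # last five codons, ending in TAA
print(head, tail)
```

The two fragments translate to `MAVPDHH` and `TQIL`, matching the start and end of the protein sequence listed above.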
Homology
BLAST of HG10021007 vs. NCBI nr
Match: XP_038876972.1 (UDP-glycosyltransferase 43-like [Benincasa hispida])

HSP 1 Score: 861.3 bits (2224), Expect = 3.9e-246
Identity = 428/480 (89.17%), Positives = 446/480 (92.92%), Query Frame = 0

Query: 1   MAVPD----HHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQ 60
           MAVPD    HHLVFIC PAIGNLVPAVEFAVRLVNHDSRFF TFLAIDIPG PLVNAYTQ
Sbjct: 1   MAVPDDDDHHHLVFICTPAIGNLVPAVEFAVRLVNHDSRFFVTFLAIDIPGTPLVNAYTQ 60

Query: 61  SRFSLSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSG 120
           SRFSLS S ++QFIHLP L+PPSPNLY+S+I YLSL+FESHKPNVK +IS LQKLHN+S 
Sbjct: 61  SRFSLSLSQDIQFIHLPPLEPPSPNLYNSYIGYLSLLFESHKPNVKNSISHLQKLHNSS- 120

Query: 121 RIVGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAE 180
           RIVGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSK DHDRFNALI DSDAE
Sbjct: 121 RIVGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKMDHDRFNALILDSDAE 180

Query: 181 FVLPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDE 240
           FVLPSYVHSLTVNLLPPT L+TEDGLFWYAHHG+RYGETKGVVINTFAELEPHAL SLDE
Sbjct: 181 FVLPSYVHSLTVNLLPPT-LSTEDGLFWYAHHGRRYGETKGVVINTFAELEPHALRSLDE 240

Query: 241 IPPVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREI 300
           +PPVYAIGPVVDL GPAQWQP  GTHQSVVKWLDGQ EGSVVLLSFGSMGSLDK QVREI
Sbjct: 241 VPPVYAIGPVVDLGGPAQWQPSHGTHQSVVKWLDGQPEGSVVLLSFGSMGSLDKDQVREI 300

Query: 301 AFGLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTIL 360
           AFGLER GFRFVWAVRQPPKT LEHPDDY DLNDVLPEGFL RTAGRGLVCGWVPQVTIL
Sbjct: 301 AFGLERVGFRFVWAVRQPPKTKLEHPDDYSDLNDVLPEGFLTRTAGRGLVCGWVPQVTIL 360

Query: 361 SHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREG 420
           SH AIGGFVSHCGWNSILESLW GVPIATWPVYAEQQMNAFEMVKELELAVELRLDYR+G
Sbjct: 361 SHGAIGGFVSHCGWNSILESLWLGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYRKG 420

Query: 421 SKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           SKLVTA+ELETALRRLMD  ++++SRVKQMGEKCR VLVENGSSY ALNSLI+KLT QIL
Sbjct: 421 SKLVTAEELETALRRLMDGREDVRSRVKQMGEKCRTVLVENGSSYRALNSLIDKLTAQIL 478
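
The percentages on each HSP statistics line are the match counts divided by the alignment length, which includes gap columns. The sketch below recomputes them for the first hit. The function name `hsp_percentages` is hypothetical, and the parsing assumes the `a/b (p%)` layout used on this page.

```python
import re

def hsp_percentages(stats_line):
    """Return (matches, alignment_length, percent) for each 'a/b (' fraction found."""
    results = []
    for num, den in re.findall(r"(\d+)/(\d+)\s*\(", stats_line):
        matches, length = int(num), int(den)
        results.append((matches, length, round(100 * matches / length, 2)))
    return results

# First fraction is identical residues, second is identical plus similar residues.
stats = hsp_percentages("Identity = 428/480 (89.17%), Positives = 446/480 (92.92%)")
print(stats)
```

For this HSP the recomputed values, 89.17% and 92.92%, agree with the reported figures.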

BLAST of HG10021007 vs. NCBI nr
Match: KAA0055369.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 826.6 bits (2134), Expect = 1.1e-235
Identity = 401/478 (83.89%), Positives = 433/478 (90.59%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           M +P HHLVFIC PAIGNLVPAVEFA RL+NHDSRFF TFL+IDIPG  LV AYTQSR S
Sbjct: 1   MTIPHHHLVFICTPAIGNLVPAVEFATRLINHDSRFFVTFLSIDIPGTSLVTAYTQSRSS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
           LSPSPNLQFIHLPSLQPPSPNLYHS++AYLSLIF SHKPNVK AISDLQK  +NS RIVG
Sbjct: 61  LSPSPNLQFIHLPSLQPPSPNLYHSYVAYLSLIFNSHKPNVKHAISDLQKKLHNSSRIVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           IFVDMFTT FIDVANDLQIPSYLFFASPATFL LMIHLSKTDHDRFNALIR+S+AEFVLP
Sbjct: 121 IFVDMFTTTFIDVANDLQIPSYLFFASPATFLGLMIHLSKTDHDRFNALIRNSEAEFVLP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSS--LDEIP 240
           SYV SLTV++LPPTLLTTEDGLFWY +HG+RYGETKG+VINTF ELEPHAL S  LDE+P
Sbjct: 181 SYVQSLTVSMLPPTLLTTEDGLFWYGYHGRRYGETKGIVINTFEELEPHALRSLDLDEVP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GPVVDL GP QWQ  +G  + VVKWLDGQ EGSVVLLSFGSMGSLD+ QVREIAF
Sbjct: 241 PVYAVGPVVDLGGPGQWQAGEGRLERVVKWLDGQEEGSVVLLSFGSMGSLDEDQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKT +E PDDY DL+DVLPEGFL+RTAG+GLVCGW PQVTILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTTIEQPDDYSDLSDVLPEGFLSRTAGQGLVCGWAPQVTILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWP+YAEQQMNAFEMVKELELAVE+RLDYR+GSK
Sbjct: 361 RAIGGFVSHCGWNSILESLWFGVPIATWPLYAEQQMNAFEMVKELELAVEIRLDYRKGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           +VT +ELE ALRRLMDD +E+KSRVK+M EKCR VLVENGS+Y ALNSLIEKLT + L
Sbjct: 421 VVTGEELERALRRLMDDNNEVKSRVKRMREKCRVVLVENGSAYNALNSLIEKLTARTL 478

BLAST of HG10021007 vs. NCBI nr
Match: XP_008440340.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo] >TYJ99296.1 anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa])

HSP 1 Score: 824.7 bits (2129), Expect = 4.1e-235
Identity = 400/478 (83.68%), Positives = 433/478 (90.59%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           M +P HHLVFIC PAIGNLVPAVEFA RL+NHDSRFF TFL+IDIPG  LV AYTQSR S
Sbjct: 1   MTIPHHHLVFICTPAIGNLVPAVEFATRLINHDSRFFVTFLSIDIPGTSLVTAYTQSRSS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
           LSPSPNLQFIHLPSLQPPSPNLYHS++AYLSLIF SHKPNVK AISDLQK  ++S RIVG
Sbjct: 61  LSPSPNLQFIHLPSLQPPSPNLYHSYVAYLSLIFNSHKPNVKHAISDLQKKLHDSSRIVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           IFVDMFTT FIDVANDLQIPSYLFFASPATFL LMIHLSKTDHDRFNALIR+S+AEFVLP
Sbjct: 121 IFVDMFTTTFIDVANDLQIPSYLFFASPATFLGLMIHLSKTDHDRFNALIRNSEAEFVLP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSS--LDEIP 240
           SYV SLTV++LPPTLLTTEDGLFWY +HG+RYGETKG+VINTF ELEPHAL S  LDE+P
Sbjct: 181 SYVQSLTVSMLPPTLLTTEDGLFWYGYHGRRYGETKGIVINTFEELEPHALRSLDLDEVP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GPVVDL GP QWQ  +G  + VVKWLDGQ EGSVVLLSFGSMGSLD+ QVREIAF
Sbjct: 241 PVYAVGPVVDLGGPGQWQAGEGRLERVVKWLDGQEEGSVVLLSFGSMGSLDEDQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKT +E PDDY DL+DVLPEGFL+RTAG+GLVCGW PQVTILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTTIEQPDDYSDLSDVLPEGFLSRTAGQGLVCGWAPQVTILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWP+YAEQQMNAFEMVKELELAVE+RLDYR+GSK
Sbjct: 361 RAIGGFVSHCGWNSILESLWFGVPIATWPLYAEQQMNAFEMVKELELAVEIRLDYRKGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           +VT +ELE ALRRLMDD +E+KSRVK+M EKCR VLVENGS+Y ALNSLIEKLT + L
Sbjct: 421 VVTGEELERALRRLMDDNNEVKSRVKRMREKCRVVLVENGSAYNALNSLIEKLTARTL 478

BLAST of HG10021007 vs. NCBI nr
Match: XP_023519630.1 (UDP-glycosyltransferase 43-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 811.2 bits (2094), Expect = 4.7e-231
Identity = 396/478 (82.85%), Positives = 430/478 (89.96%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           MA P HHLVFICNPAIGNLVPAVEFAVRLV+HD RF ATFLA+DIPGRPLVNAYTQSRFS
Sbjct: 1   MATPHHHLVFICNPAIGNLVPAVEFAVRLVHHDPRFLATFLALDIPGRPLVNAYTQSRFS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHN--NSGRI 120
            S S N+QFI +PS QPPSP L+HSHIAYLSL FES+KP+VKQAI D   LHN  NS R+
Sbjct: 61  ASTSQNIQFIAIPSAQPPSPALFHSHIAYLSLYFESYKPHVKQAIID---LHNPQNSARV 120

Query: 121 VGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFV 180
           VG+FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLM+ L K D DR  +LIRDSDAEFV
Sbjct: 121 VGVFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMVQLPKMDRDRVESLIRDSDAEFV 180

Query: 181 LPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDEIP 240
           LPS+ HSLT +LLP T+L T DGLFWYAHHG+RYGET GVVIN+F ELEPHALSSL EIP
Sbjct: 181 LPSFAHSLTSSLLPSTVLKTRDGLFWYAHHGRRYGETNGVVINSFTELEPHALSSLGEIP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GP++DL GPAQWQP + THQSV+KWLDGQ E SVVLLSFGSMGSLDK Q+REIAF
Sbjct: 241 PVYAVGPLLDLGGPAQWQPSRATHQSVLKWLDGQPERSVVLLSFGSMGSLDKAQIREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GL+RAGFRFVW VRQPPK+ LEHPDDY DL+DVLP+GF+ RTAG GLVCGWVPQV+ILSH
Sbjct: 301 GLQRAGFRFVWVVRQPPKSQLEHPDDYSDLDDVLPDGFVTRTAGLGLVCGWVPQVSILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK
Sbjct: 361 GAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           LVTA+ELETALRRLMD GDE+++RVK+MGEKCR VL+ENGSSY ALNSLIEKLT QIL
Sbjct: 421 LVTAEELETALRRLMDGGDEVRARVKRMGEKCRTVLLENGSSYTALNSLIEKLTAQIL 475

BLAST of HG10021007 vs. NCBI nr
Match: XP_022923653.1 (UDP-glycosyltransferase 43-like [Cucurbita moschata])

HSP 1 Score: 805.4 bits (2079), Expect = 2.6e-229
Identity = 392/476 (82.35%), Positives = 427/476 (89.71%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           MA P HHLVFICNPAIGNLVPAVEFAVRLV+ D RF ATFLA+DIPGRPLVNAYTQSRF+
Sbjct: 1   MAKPHHHLVFICNPAIGNLVPAVEFAVRLVHQDPRFLATFLALDIPGRPLVNAYTQSRFA 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
            SPS N+QFI +PS  PPSP L+HSHIAYLSL FES+KP+VKQAI DLQ    NS R+VG
Sbjct: 61  ASPSQNIQFIAIPSAHPPSPALFHSHIAYLSLYFESYKPHVKQAIIDLQN-PQNSARVVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           +FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLM+ L + D DR  +LIRDSDAEF LP
Sbjct: 121 VFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMVQLPEMDRDRVESLIRDSDAEFALP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDEIPPV 240
           S+ HSLT +LLP T+L T DGLFWYAHHG+RYGET GVVIN+FAELEPHALSSL EIPPV
Sbjct: 181 SFAHSLTSSLLPSTVLKTRDGLFWYAHHGRRYGETNGVVINSFAELEPHALSSLGEIPPV 240

Query: 241 YAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFGL 300
           YA+GP++DL GPAQWQP + THQSV+KWLDGQ E SVVLLSFGSMGSLDK Q+REIAFGL
Sbjct: 241 YAVGPLLDLGGPAQWQPSRATHQSVLKWLDGQPERSVVLLSFGSMGSLDKAQIREIAFGL 300

Query: 301 ERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSHHA 360
           ERA FRFVW VRQPPK+ LEHPDDY DL+DVLP+GF+ RTAG GLVCGWVPQV+ILSH A
Sbjct: 301 ERARFRFVWVVRQPPKSQLEHPDDYSDLDDVLPDGFVTRTAGLGLVCGWVPQVSILSHGA 360

Query: 361 IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV 420
           IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV
Sbjct: 361 IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV 420

Query: 421 TAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           TA+ELETALRRLMD GDE+++RVK+MGEKCR VL+ENGSSY ALNSLIEKLT QIL
Sbjct: 421 TAEELETALRRLMDGGDEVRARVKRMGEKCRTVLLENGSSYTALNSLIEKLTAQIL 475

BLAST of HG10021007 vs. ExPASy Swiss-Prot
Match: A0A172J2G3 (UDP-glycosyltransferase 43 OS=Pueraria montana var. lobata OX=3893 GN=UGT43 PE=1 SV=2)

HSP 1 Score: 506.9 bits (1304), Expect = 2.5e-142
Identity = 250/475 (52.63%), Positives = 332/475 (69.89%), Query Frame = 0

Query: 6   HHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSP 65
           + +VFI  P +GNLVP VEFA  L  HD RF AT L + +P RPL+N Y Q+R   S + 
Sbjct: 4   YEVVFIAIPTLGNLVPQVEFANLLTKHDPRFSATILTVSMPQRPLMNTYVQAR--ASSAA 63

Query: 66  NLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKL----HNNSGRIVGI 125
           N++ + LP + PP+P  Y + + +LSL  ++HK +VK A+ +L K      +NS R+  I
Sbjct: 64  NIKLLQLPIVDPPAPEQYQTLVGFLSLHMQNHKHHVKHALLNLMKTTESNSSNSVRLAAI 123

Query: 126 FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLPS 185
           FVDMF+T  IDVA +L +P YLFFASPA+ L   + L + D       + +S +EF +P 
Sbjct: 124 FVDMFSTTLIDVAAELAVPCYLFFASPASCLGFTLDLPRFD-------LAESKSEFTVPC 183

Query: 186 YVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSL---DEIP 245
           + + L  ++ P  +L  +DG FW ++H +RY ETKG+VINT  ELE HAL SL    ++ 
Sbjct: 184 FKNLLPRSVFPNLVLDAKDGTFWLSYHARRYKETKGIVINTLQELETHALQSLHNDSQLQ 243

Query: 246 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 305
            VY IGP++DL G AQW P    ++ +++WLD Q   SVVLL FGSMGSL+  QV EIA 
Sbjct: 244 RVYPIGPILDLVGSAQWDPNPAQYKRIMEWLDQQPLSSVVLLCFGSMGSLEANQVEEIAI 303

Query: 306 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 365
           GLERAG RF+WA+R+ PK  LE+P DY++  DVLP+GFL RT   GLVCGWVPQ  +L+H
Sbjct: 304 GLERAGVRFLWALRESPKAQLEYPRDYENHKDVLPDGFLERTNNIGLVCGWVPQAVVLAH 363

Query: 366 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 425
            A+GGFVSHCGWNSILESLW GVP+ATWP+Y+EQQMNAF+MV++L LAVE+ +DYR G+ 
Sbjct: 364 KAVGGFVSHCGWNSILESLWHGVPVATWPLYSEQQMNAFQMVRDLGLAVEISVDYRVGAD 423

Query: 426 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTT 474
           LV A+E+E  LR LM  GDEI+ +VK+M + CR  L+ENGSSY  L SLI++LT+
Sbjct: 424 LVRAEEVENGLRSLMKGGDEIRRKVKEMSDTCRGALLENGSSYSNLVSLIQELTS 469

BLAST of HG10021007 vs. ExPASy Swiss-Prot
Match: D3UAG0 (UDP-glycosyltransferase 71K1 OS=Malus domestica OX=3750 GN=UGT71K1 PE=1 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 4.0e-108
Identity = 211/477 (44.23%), Positives = 301/477 (63.10%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSPNL 67
           LVFI +P  G+ +P ++F  RL++ + R   T LAI       +++YT+S    +  P +
Sbjct: 6   LVFIPSPGAGHHLPTLQFVKRLIDRNDRISITILAIQSYFPTTLSSYTKS--IAASEPRI 65

Query: 68  QFIHLPSLQP-PSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSG---RIVGIFV 127
           +FI +P  Q  P   +Y S     SL  ESH P+VK+ I++L     NS    R+  + V
Sbjct: 66  RFIDVPQPQDRPPQEMYKSRAQIFSLYIESHVPSVKKIITNLVSSSANSSDSIRVAALVV 125

Query: 128 DMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLPSYV 187
           D+F  + IDVA +L IPSYLF  S A +L+ M+HL    H++    + +SD ++ +P  V
Sbjct: 126 DLFCVSMIDVAKELNIPSYLFLTSNAGYLAFMLHLPIL-HEKNQIAVEESDPDWSIPGIV 185

Query: 188 HSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHAL---SSLDEIPPV 247
           H +   +LP  L  T+  L  Y     R+ ET+G+++NTF ELE HA+   S+ D +PPV
Sbjct: 186 HPVPPRVLPAAL--TDGRLSAYIKLASRFRETRGIIVNTFVELETHAITLFSNDDRVPPV 245

Query: 248 YAIGPVVDL-DGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFG 307
           Y +GPV+DL DG       Q     ++KWLD Q + SVV L FGSMGS    QV+EIA G
Sbjct: 246 YPVGPVIDLDDGQEHSNLDQAQRDKIIKWLDDQPQKSVVFLCFGSMGSFGAEQVKEIAVG 305

Query: 308 LERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAG-RGLVCGWVPQVTILSH 367
           LE++G RF+W++R P    +  P D  +L +VLP+GFL RT G +GL+CGW PQV IL+H
Sbjct: 306 LEQSGQRFLWSLRMPSPKGIV-PSDCSNLEEVLPDGFLERTNGKKGLICGWAPQVEILAH 365

Query: 368 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGS- 427
            A GGF+SHCGWNSILESLW GVPIATWP+YAEQQ+NAF MV+EL +A+E+RLDY+ GS 
Sbjct: 366 SATGGFLSHCGWNSILESLWHGVPIATWPMYAEQQLNAFRMVRELGMALEMRLDYKAGSA 425

Query: 428 KLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQ 475
            +V A E+E A+  +M+   E++ +V++MG+  R  + + GSS+ ++   IE +  Q
Sbjct: 426 DVVGADEIEKAVVGVMEKDSEVRKKVEEMGKMARKAVKDGGSSFASVGRFIEDVIGQ 476

BLAST of HG10021007 vs. ExPASy Swiss-Prot
Match: D3UAG2 (UDP-glycosyltransferase 71K2 OS=Pyrus communis OX=23211 GN=UGT71K2 PE=1 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 6.3e-106
Identity = 209/472 (44.28%), Positives = 297/472 (62.92%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSPNL 67
           LVFI +P  G+LVP ++FA RL++ + R   T LAI       +++YT+S    +  P +
Sbjct: 6   LVFIPSPGAGHLVPTLQFAKRLIDRNDRISITILAIQSYFPTTLSSYTKS--IAASEPRI 65

Query: 68  QFIHLPSLQP-PSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSG---RIVGIFV 127
           +FI +P  Q  P   +Y S   + SL  ES  P+VK+ I++L     NS    R+  + V
Sbjct: 66  RFIDVPQPQDRPPQEMYKSPAKFFSLYIESQVPSVKKIITNLVSSSANSSDSIRVAALVV 125

Query: 128 DMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLPSYV 187
           D+F  + IDVA +L IPSYLF  S A +L+ M+HL    +++    + +SD E+ +P  V
Sbjct: 126 DLFCVSMIDVAKELNIPSYLFLTSNAGYLAFMLHLPIV-NEKNQIAVEESDPEWSIPGIV 185

Query: 188 HSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHAL---SSLDEIPPV 247
           H +   + P  L  T+     Y     R+ ET+G+++NTF ELE HA+   S+ D IPPV
Sbjct: 186 HPVPPRVFPVAL--TDGRCSAYIKLASRFRETRGIIVNTFVELETHAITLFSTDDGIPPV 245

Query: 248 YAIGPVVDL-DGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFG 307
           Y +GPV+D+ DG A     Q     ++KWLD Q + SVV L FGSMGS    QV+EIA G
Sbjct: 246 YPVGPVIDMDDGQAHSNLDQAQRDRIIKWLDDQPQKSVVFLCFGSMGSFRAEQVKEIALG 305

Query: 308 LERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAG-RGLVCGWVPQVTILSH 367
           LE++G RF+W++R P       P D  +L +VLP+GFL RT G +GL+CGW PQV IL+H
Sbjct: 306 LEQSGQRFLWSLRMPSPIGTV-PCDCSNLEEVLPDGFLERTNGKKGLICGWAPQVEILAH 365

Query: 368 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGS- 427
            A GGF+SHCGWNSILESLW GVPI TWP+YAEQQ+NAF M +EL +A+E+RLDY+ GS 
Sbjct: 366 SATGGFLSHCGWNSILESLWHGVPITTWPMYAEQQLNAFRMARELGMALEMRLDYKRGSA 425

Query: 428 KLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIE 470
            +V A E+E A+  +M+   E++ +V++MG+  R  + + GSS+ ++   IE
Sbjct: 426 DVVGADEIERAVVGVMEKDSEVRKKVEEMGKMARKAVKDGGSSFASVGRFIE 471

BLAST of HG10021007 vs. ExPASy Swiss-Prot
Match: Q66PF3 (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX=3747 GN=GT3 PE=2 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 6.3e-106
Identity = 219/480 (45.62%), Positives = 308/480 (64.17%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPG-RPLVNAYTQS-RFSLSP-S 67
           LV I +P IG+LV  +E A  LV+ D + F T L +  P      +AY QS   S SP S
Sbjct: 7   LVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSSSPIS 66

Query: 68  PNLQFIHLP--SLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVGIF 127
             + FI+LP  ++     ++ +S + ++    ES +P+VK A+++L+   + + R+ G  
Sbjct: 67  QRINFINLPHTNMDHTEGSVRNSLVGFV----ESQQPHVKDAVANLR--DSKTTRLAGFV 126

Query: 128 VDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFN---ALIRDSDAEFVL 187
           VDMF T  I+VAN L +PSY+FF S A  L L+ HL +   D++N      +DSDAE ++
Sbjct: 127 VDMFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQEL-RDQYNKDCTEFKDSDAELII 186

Query: 188 PSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLD---E 247
           PS+ + L   +LP  +L  +D    + +  +R+ ETKG+++NTF +LE HAL +L    E
Sbjct: 187 PSFFNPLPAKVLPGRML-VKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHALSSDAE 246

Query: 248 IPPVYAIGPVVDLDGPAQWQPCQGTHQ--SVVKWLDGQAEGSVVLLSFGSMGSLDKGQVR 307
           IPPVY +GP+++L+            +   ++KWLD Q   SVV L FGSMGS D+ QVR
Sbjct: 247 IPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVR 306

Query: 308 EIAFGLERAGFRFVWAVRQ-PPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQV 367
           EIA  LE AG RF+W++R+ PP   +  P DYDD   VLPEGFL RT G G V GW PQV
Sbjct: 307 EIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQV 366

Query: 368 TILSHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDY 427
            +L+H ++GGFVSHCGWNS LESLW GVP+ATWP+YAEQQ+NAF+ VKELELAVE+ + Y
Sbjct: 367 AVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSY 426

Query: 428 REGSK-LVTAKELETALRRLMD-DGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKL 472
           R  S  LV+AKE+E  +R +M+ D  +I+ RVK+M EK +  L++ GSSY +L   I+++
Sbjct: 427 RSKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFIDQI 478

BLAST of HG10021007 vs. ExPASy Swiss-Prot
Match: Q2V6K0 (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=GT6 PE=1 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 3.5e-104
Identity = 197/474 (41.56%), Positives = 291/474 (61.39%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSPNL 67
           L+FI  P IG++V  VE A  L+  D   F T L +  P     +       ++ PS   
Sbjct: 7   LIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDPSLKT 66

Query: 68  QFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVGIFVDMFT 127
           Q I   +L  P  +   +         +SHK +VK A++ L +  + + RI G  +DMF 
Sbjct: 67  QRIRFVNL--PQEHFQGTGATGFFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVIDMFC 126

Query: 128 TAFIDVANDLQIPSYLFFASPATFLSLMIHLS--KTDHDRFNALIRDSDAEFVLPSYVHS 187
           T  ID+AN+  +PSY+F+ S A  L LM HL   + + ++     +DSDAE V+ S+V+ 
Sbjct: 127 TGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSSFVNP 186

Query: 188 LTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLD---EIPPVYA 247
           L    + P+++  ++G  ++ +  +RY ETKG+++NTF ELEPHA+ SL    +I PVY 
Sbjct: 187 LPAARVLPSVVFEKEGGNFFLNFAKRYRETKGILVNTFLELEPHAIQSLSSDGKILPVYP 246

Query: 248 IGPVVDLDGPAQWQPCQGTHQ--SVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFGL 307
           +GP++++         + + Q   +++WLD Q   SVV L FGSMG   + QV+EIA  L
Sbjct: 247 VGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKEIAHAL 306

Query: 308 ERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSHHA 367
           E+ G RF+W++RQP K  +  P DY D   VLPEGFL RT   G V GW PQ+ IL+H A
Sbjct: 307 EQGGIRFLWSLRQPSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQLAILAHPA 366

Query: 368 IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV 427
           +GGFVSHCGWNS LES+W+GVPIATWP YAEQQ+NAFE+VKEL+LAVE+ + YR+ S ++
Sbjct: 367 VGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYRKDSGVI 426

Query: 428 TAKE-LETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTT 474
            ++E +E  ++ +M+   E++ RVK+M +  R  L E+GSSY +L   ++++ T
Sbjct: 427 VSRENIEKGIKEVMEQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFLDQIQT 478

BLAST of HG10021007 vs. ExPASy TrEMBL
Match: A0A5A7UPH3 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold80G002140 PE=3 SV=1)

HSP 1 Score: 826.6 bits (2134), Expect = 5.2e-236
Identity = 401/478 (83.89%), Positives = 433/478 (90.59%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           M +P HHLVFIC PAIGNLVPAVEFA RL+NHDSRFF TFL+IDIPG  LV AYTQSR S
Sbjct: 1   MTIPHHHLVFICTPAIGNLVPAVEFATRLINHDSRFFVTFLSIDIPGTSLVTAYTQSRSS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
           LSPSPNLQFIHLPSLQPPSPNLYHS++AYLSLIF SHKPNVK AISDLQK  +NS RIVG
Sbjct: 61  LSPSPNLQFIHLPSLQPPSPNLYHSYVAYLSLIFNSHKPNVKHAISDLQKKLHNSSRIVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           IFVDMFTT FIDVANDLQIPSYLFFASPATFL LMIHLSKTDHDRFNALIR+S+AEFVLP
Sbjct: 121 IFVDMFTTTFIDVANDLQIPSYLFFASPATFLGLMIHLSKTDHDRFNALIRNSEAEFVLP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSS--LDEIP 240
           SYV SLTV++LPPTLLTTEDGLFWY +HG+RYGETKG+VINTF ELEPHAL S  LDE+P
Sbjct: 181 SYVQSLTVSMLPPTLLTTEDGLFWYGYHGRRYGETKGIVINTFEELEPHALRSLDLDEVP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GPVVDL GP QWQ  +G  + VVKWLDGQ EGSVVLLSFGSMGSLD+ QVREIAF
Sbjct: 241 PVYAVGPVVDLGGPGQWQAGEGRLERVVKWLDGQEEGSVVLLSFGSMGSLDEDQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKT +E PDDY DL+DVLPEGFL+RTAG+GLVCGW PQVTILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTTIEQPDDYSDLSDVLPEGFLSRTAGQGLVCGWAPQVTILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWP+YAEQQMNAFEMVKELELAVE+RLDYR+GSK
Sbjct: 361 RAIGGFVSHCGWNSILESLWFGVPIATWPLYAEQQMNAFEMVKELELAVEIRLDYRKGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           +VT +ELE ALRRLMDD +E+KSRVK+M EKCR VLVENGS+Y ALNSLIEKLT + L
Sbjct: 421 VVTGEELERALRRLMDDNNEVKSRVKRMREKCRVVLVENGSAYNALNSLIEKLTARTL 478

BLAST of HG10021007 vs. ExPASy TrEMBL
Match: A0A5D3BJL3 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005090 PE=3 SV=1)

HSP 1 Score: 824.7 bits (2129), Expect = 2.0e-235
Identity = 400/478 (83.68%), Positives = 433/478 (90.59%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           M +P HHLVFIC PAIGNLVPAVEFA RL+NHDSRFF TFL+IDIPG  LV AYTQSR S
Sbjct: 1   MTIPHHHLVFICTPAIGNLVPAVEFATRLINHDSRFFVTFLSIDIPGTSLVTAYTQSRSS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
           LSPSPNLQFIHLPSLQPPSPNLYHS++AYLSLIF SHKPNVK AISDLQK  ++S RIVG
Sbjct: 61  LSPSPNLQFIHLPSLQPPSPNLYHSYVAYLSLIFNSHKPNVKHAISDLQKKLHDSSRIVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           IFVDMFTT FIDVANDLQIPSYLFFASPATFL LMIHLSKTDHDRFNALIR+S+AEFVLP
Sbjct: 121 IFVDMFTTTFIDVANDLQIPSYLFFASPATFLGLMIHLSKTDHDRFNALIRNSEAEFVLP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSS--LDEIP 240
           SYV SLTV++LPPTLLTTEDGLFWY +HG+RYGETKG+VINTF ELEPHAL S  LDE+P
Sbjct: 181 SYVQSLTVSMLPPTLLTTEDGLFWYGYHGRRYGETKGIVINTFEELEPHALRSLDLDEVP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GPVVDL GP QWQ  +G  + VVKWLDGQ EGSVVLLSFGSMGSLD+ QVREIAF
Sbjct: 241 PVYAVGPVVDLGGPGQWQAGEGRLERVVKWLDGQEEGSVVLLSFGSMGSLDEDQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKT +E PDDY DL+DVLPEGFL+RTAG+GLVCGW PQVTILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTTIEQPDDYSDLSDVLPEGFLSRTAGQGLVCGWAPQVTILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWP+YAEQQMNAFEMVKELELAVE+RLDYR+GSK
Sbjct: 361 RAIGGFVSHCGWNSILESLWFGVPIATWPLYAEQQMNAFEMVKELELAVEIRLDYRKGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           +VT +ELE ALRRLMDD +E+KSRVK+M EKCR VLVENGS+Y ALNSLIEKLT + L
Sbjct: 421 VVTGEELERALRRLMDDNNEVKSRVKRMREKCRVVLVENGSAYNALNSLIEKLTARTL 478

BLAST of HG10021007 vs. ExPASy TrEMBL
Match: A0A1S3B0G6 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103484818 PE=3 SV=1)

HSP 1 Score: 824.7 bits (2129), Expect = 2.0e-235
Identity = 400/478 (83.68%), Positives = 433/478 (90.59%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           M +P HHLVFIC PAIGNLVPAVEFA RL+NHDSRFF TFL+IDIPG  LV AYTQSR S
Sbjct: 1   MTIPHHHLVFICTPAIGNLVPAVEFATRLINHDSRFFVTFLSIDIPGTSLVTAYTQSRSS 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
           LSPSPNLQFIHLPSLQPPSPNLYHS++AYLSLIF SHKPNVK AISDLQK  ++S RIVG
Sbjct: 61  LSPSPNLQFIHLPSLQPPSPNLYHSYVAYLSLIFNSHKPNVKHAISDLQKKLHDSSRIVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           IFVDMFTT FIDVANDLQIPSYLFFASPATFL LMIHLSKTDHDRFNALIR+S+AEFVLP
Sbjct: 121 IFVDMFTTTFIDVANDLQIPSYLFFASPATFLGLMIHLSKTDHDRFNALIRNSEAEFVLP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSS--LDEIP 240
           SYV SLTV++LPPTLLTTEDGLFWY +HG+RYGETKG+VINTF ELEPHAL S  LDE+P
Sbjct: 181 SYVQSLTVSMLPPTLLTTEDGLFWYGYHGRRYGETKGIVINTFEELEPHALRSLDLDEVP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GPVVDL GP QWQ  +G  + VVKWLDGQ EGSVVLLSFGSMGSLD+ QVREIAF
Sbjct: 241 PVYAVGPVVDLGGPGQWQAGEGRLERVVKWLDGQEEGSVVLLSFGSMGSLDEDQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKT +E PDDY DL+DVLPEGFL+RTAG+GLVCGW PQVTILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTTIEQPDDYSDLSDVLPEGFLSRTAGQGLVCGWAPQVTILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWP+YAEQQMNAFEMVKELELAVE+RLDYR+GSK
Sbjct: 361 RAIGGFVSHCGWNSILESLWFGVPIATWPLYAEQQMNAFEMVKELELAVEIRLDYRKGSK 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           +VT +ELE ALRRLMDD +E+KSRVK+M EKCR VLVENGS+Y ALNSLIEKLT + L
Sbjct: 421 VVTGEELERALRRLMDDNNEVKSRVKRMREKCRVVLVENGSAYNALNSLIEKLTARTL 478

BLAST of HG10021007 vs. ExPASy TrEMBL
Match: A0A6J1EA82 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431301 PE=3 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 1.2e-229
Identity = 392/476 (82.35%), Positives = 427/476 (89.71%), Query Frame = 0

Query: 1   MAVPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFS 60
           MA P HHLVFICNPAIGNLVPAVEFAVRLV+ D RF ATFLA+DIPGRPLVNAYTQSRF+
Sbjct: 1   MAKPHHHLVFICNPAIGNLVPAVEFAVRLVHQDPRFLATFLALDIPGRPLVNAYTQSRFA 60

Query: 61  LSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRIVG 120
            SPS N+QFI +PS  PPSP L+HSHIAYLSL FES+KP+VKQAI DLQ    NS R+VG
Sbjct: 61  ASPSQNIQFIAIPSAHPPSPALFHSHIAYLSLYFESYKPHVKQAIIDLQN-PQNSARVVG 120

Query: 121 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLP 180
           +FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLM+ L + D DR  +LIRDSDAEF LP
Sbjct: 121 VFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMVQLPEMDRDRVESLIRDSDAEFALP 180

Query: 181 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDEIPPV 240
           S+ HSLT +LLP T+L T DGLFWYAHHG+RYGET GVVIN+FAELEPHALSSL EIPPV
Sbjct: 181 SFAHSLTSSLLPSTVLKTRDGLFWYAHHGRRYGETNGVVINSFAELEPHALSSLGEIPPV 240

Query: 241 YAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFGL 300
           YA+GP++DL GPAQWQP + THQSV+KWLDGQ E SVVLLSFGSMGSLDK Q+REIAFGL
Sbjct: 241 YAVGPLLDLGGPAQWQPSRATHQSVLKWLDGQPERSVVLLSFGSMGSLDKAQIREIAFGL 300

Query: 301 ERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSHHA 360
           ERA FRFVW VRQPPK+ LEHPDDY DL+DVLP+GF+ RTAG GLVCGWVPQV+ILSH A
Sbjct: 301 ERARFRFVWVVRQPPKSQLEHPDDYSDLDDVLPDGFVTRTAGLGLVCGWVPQVSILSHGA 360

Query: 361 IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV 420
           IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV
Sbjct: 361 IGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSKLV 420

Query: 421 TAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           TA+ELETALRRLMD GDE+++RVK+MGEKCR VL+ENGSSY ALNSLIEKLT QIL
Sbjct: 421 TAEELETALRRLMDGGDEVRARVKRMGEKCRTVLLENGSSYTALNSLIEKLTAQIL 475

BLAST of HG10021007 vs. ExPASy TrEMBL
Match: A0A6J1KGB4 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111495522 PE=3 SV=1)

HSP 1 Score: 805.1 bits (2078), Expect = 1.6e-229
Identity = 393/478 (82.22%), Positives = 425/478 (88.91%), Query Frame = 0

Query: 1   MAVP--DHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSR 60
           MA P   HHLVFICNPAIGNLVPAVEFAVRLV HD RF ATFLA+DIPGRPLVNAYTQSR
Sbjct: 1   MATPHHHHHLVFICNPAIGNLVPAVEFAVRLVRHDPRFLATFLALDIPGRPLVNAYTQSR 60

Query: 61  FSLSPSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSGRI 120
           FS SPS N+QFI +PS+QPPSP L+HSHI YLSL F+S+KP+VKQAI DLQ    NSGR+
Sbjct: 61  FSASPSQNIQFIAIPSVQPPSPALFHSHIGYLSLYFDSYKPHVKQAIIDLQN-PRNSGRV 120

Query: 121 VGIFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFV 180
           VG+FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLM+ L K D DR  + IRDSDAEFV
Sbjct: 121 VGVFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMVQLPKMDRDRVESSIRDSDAEFV 180

Query: 181 LPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSLDEIP 240
           LPS+ H LT +LLP T+L T DGL WY HHG+RYGET GVVIN+F ELEPHALSSL EIP
Sbjct: 181 LPSFAHPLTSSLLPSTVLKTRDGLLWYTHHGRRYGETNGVVINSFTELEPHALSSLGEIP 240

Query: 241 PVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAF 300
           PVYA+GP++DL GPAQWQP + THQSV+KWLDGQ E SVVLLSFGSMGSLDK QVREIAF
Sbjct: 241 PVYAVGPLLDLGGPAQWQPSRATHQSVLKWLDGQPERSVVLLSFGSMGSLDKAQVREIAF 300

Query: 301 GLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSH 360
           GLERAGFRFVW VRQPPKTHLEHPDDY DL+ VLP+GF+ RTAG GLVCGWVPQV+ILSH
Sbjct: 301 GLERAGFRFVWVVRQPPKTHLEHPDDYSDLDHVLPDGFVTRTAGLGLVCGWVPQVSILSH 360

Query: 361 HAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSK 420
            AIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGS 
Sbjct: 361 SAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREGSN 420

Query: 421 LVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKLTTQIL 477
           LVTAKELETALRRLMD GD++++RVK+MGEKCR VL+ENGSSY ALNSLIEKLT QIL
Sbjct: 421 LVTAKELETALRRLMDGGDDVRARVKRMGEKCRTVLLENGSSYTALNSLIEKLTAQIL 477

BLAST of HG10021007 vs. TAIR 10
Match: AT1G07250.1 (UDP-glucosyl transferase 71C4 )

HSP 1 Score: 354.4 bits (908), Expect = 1.5e-97
Identity = 197/485 (40.62%), Positives = 288/485 (59.38%), Query Frame = 0

Query: 3   VPDHHLVFICNPAIGNLVPAVEFAVRLVNHDSRFFA-TFLAIDIPGRPLVNAYTQSRFSL 62
           V +  L+FI  P+ G+++  +EFA RL+N D R    T L +  P  P  + + +S   +
Sbjct: 2   VKETELIFIPVPSTGHILVHIEFAKRLINLDHRIHTITILNLSSPSSPHASVFARS--LI 61

Query: 63  SPSPNLQFIHLPSLQPPSP-NLY-HSHIAYLSLIFESHKPNVKQAISDL---QKLHNNSG 122
           +  P ++   LP +Q P P +LY  +  AY+  + + + P +K A+S +   ++  ++S 
Sbjct: 62  ASQPKIRLHDLPPIQDPPPFDLYQRAPEAYIVKLIKKNTPLIKDAVSSIVASRRGGSDSV 121

Query: 123 RIVGIFVDMFTTAFI-DVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNAL-----I 182
           ++ G+ +D+F  + + DV N+L +PSY++    A +L +M ++     DR   +     +
Sbjct: 122 QVAGLVLDLFCNSLVKDVGNELNLPSYIYLTCNARYLGMMKYIP----DRHRKIASEFDL 181

Query: 183 RDSDAEFVLPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHA 242
              D E  +P +++++    +PP L   E     Y     R+ + KG+++N+F ELEPH 
Sbjct: 182 SSGDEELPVPGFINAIPTKFMPPGLFNKE-AYEAYVELAPRFADAKGILVNSFTELEPHP 241

Query: 243 ---LSSLDEIPPVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGS 302
               S L++ PPVY +GP++ L   A           +V WLD Q E SVV L FGS GS
Sbjct: 242 FDYFSHLEKFPPVYPVGPILSLKDRASPNEEAVDRDQIVGWLDDQPESSVVFLCFGSRGS 301

Query: 303 LDKGQVREIAFGLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVC 362
           +D+ QV+EIA  LE  G RF+W++R          D   + NDVLPEGF+ R AGRGLVC
Sbjct: 302 VDEPQVKEIARALELVGCRFLWSIR-------TSGDVETNPNDVLPEGFMGRVAGRGLVC 361

Query: 363 GWVPQVTILSHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAV 422
           GW PQV +L+H AIGGFVSHCGWNS LESLWFGVP+ATWP+YAEQQ+NAF +VKEL LAV
Sbjct: 362 GWAPQVEVLAHKAIGGFVSHCGWNSTLESLWFGVPVATWPMYAEQQLNAFTLVKELGLAV 421

Query: 423 ELRLDY-REGSKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNS 472
           +LR+DY      LVT  E+  A+R LMD GDE + +VK+M +  R  L++ GSS  A   
Sbjct: 422 DLRMDYVSSRGGLVTCDEIARAVRSLMDGGDEKRKKVKEMADAARKALMDGGSSSLATAR 472

BLAST of HG10021007 vs. TAIR 10
Match: AT2G29730.1 (UDP-glucosyl transferase 71D1 )

HSP 1 Score: 340.9 bits (873), Expect = 1.7e-93
Identity = 187/468 (39.96%), Positives = 275/468 (58.76%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSRFSLSPSPNL 67
           L+FI  P +G+LVP +EFA RL+  D R   T L + + G+  ++ Y +S    S  P +
Sbjct: 6   LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITILLMKLQGQSHLDTYVKS--IASSQPFV 65

Query: 68  QFIHLPSL-QPPSPNLYHSHIAYLSLIFESHKPNVKQAISD-LQKLHNNSGRIVGIFVDM 127
           +FI +P L + P+     S  AY+  + E + P V+  + D L  L  +  ++ G+ VD 
Sbjct: 66  RFIDVPELEEKPTLGSTQSVEAYVYDVIERNIPLVRNIVMDILTSLALDGVKVKGLVVDF 125

Query: 128 FTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFVLPSYVHS 187
           F    IDVA D+ +P Y+F  + + FL++M +L+       +  +R+S+    +P +V+ 
Sbjct: 126 FCLPMIDVAKDISLPFYVFLTTNSGFLAMMQYLADRHSRDTSVFVRNSEEMLSIPGFVNP 185

Query: 188 LTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSL---DEIPPVYA 247
           +  N+LP  L   EDG   Y      + +  G+++N+  ++EP++++        P VYA
Sbjct: 186 VPANVLPSALF-VEDGYDAYVKLAILFTKANGILVNSSFDIEPYSVNHFLQEQNYPSVYA 245

Query: 248 IGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIAFGLER 307
           +GP+ DL      +        ++KWLD Q E SVV L FGSM  L    V+EIA GLE 
Sbjct: 246 VGPIFDLKAQPHPEQDLTRRDELMKWLDDQPEASVVFLCFGSMARLRGSLVKEIAHGLEL 305

Query: 308 AGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILSHHAIG 367
             +RF+W++R+   T            D LPEGFL R  GRG++CGW PQV IL+H A+G
Sbjct: 306 CQYRFLWSLRKEEVT-----------KDDLPEGFLDRVDGRGMICGWSPQVEILAHKAVG 365

Query: 368 GFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYR-EGSKLVT 427
           GFVSHCGWNSI+ESLWFGVPI TWP+YAEQQ+NAF MVKEL+LAVEL+LDYR    ++V 
Sbjct: 366 GFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELKLAVELKLDYRVHSDEIVN 425

Query: 428 AKELETALRRLMD-DGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLI 469
           A E+ETA+R +MD D + ++ RV  + +  +      GSS+ A+   I
Sbjct: 426 ANEIETAIRYVMDTDNNVVRKRVMDISQMIQRATKNGGSSFAAIEKFI 459

BLAST of HG10021007 vs. TAIR 10
Match: AT1G07260.1 (UDP-glucosyl transferase 71C3 )

HSP 1 Score: 340.1 bits (871), Expect = 2.8e-93
Identity = 193/475 (40.63%), Positives = 279/475 (58.74%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFA-TFLAIDIPGRPLVNAYTQSRFSLSPSPN 67
           ++F+  P+ G+L+ ++EFA  L+  D R    T L   +P  P  + + +S   ++  P 
Sbjct: 7   IIFVTYPSPGHLLVSIEFAKSLIKRDDRIHTITILYWALPLAPQAHLFAKS--LVASQPR 66

Query: 68  LQFIHLPSLQ--PPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQKLHNNSG--RIVGIF 127
           ++ + LP +Q  PP    + +  AY+    +   P V+ A+S L      SG  R+VG+ 
Sbjct: 67  IRLLALPDVQNPPPLELFFKAPEAYILESTKKTVPLVRDALSTLVSSRKESGSVRVVGLV 126

Query: 128 VDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNAL-IRDSDAEFVLPS 187
           +D F    I+VAN+L +PSY+F    A FLS+M +L +      + L +   + E  +P 
Sbjct: 127 IDFFCVPMIEVANELNLPSYIFLTCNAGFLSMMKYLPERHRITTSELDLSSGNVEHPIPG 186

Query: 188 YVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHA---LSSLDE-I 247
           YV S+   +LPP L   E    W     +++   KG+++N+   LE +A    + LDE  
Sbjct: 187 YVCSVPTKVLPPGLFVRESYEAW-VEIAEKFPGAKGILVNSVTCLEQNAFDYFARLDENY 246

Query: 248 PPVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREIA 307
           PPVY +GPV+ L               +++WL+ Q E S+V + FGS+G + K Q+ EIA
Sbjct: 247 PPVYPVGPVLSLKDRPSPNLDASDRDRIMRWLEDQPESSIVYICFGSLGIIGKLQIEEIA 306

Query: 308 FGLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTILS 367
             LE  G RF+W++R  P    E    Y    D+LPEGFL RTA +GLVC W PQV +L+
Sbjct: 307 EALELTGHRFLWSIRTNP---TEKASPY----DLLPEGFLDRTASKGLVCDWAPQVEVLA 366

Query: 368 HHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDYREG- 427
           H A+GGFVSHCGWNS+LESLWFGVPIATWP+YAEQQ+NAF MVKEL LAVELRLDY    
Sbjct: 367 HKALGGFVSHCGWNSVLESLWFGVPIATWPMYAEQQLNAFSMVKELGLAVELRLDYVSAY 426

Query: 428 SKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKL 472
            ++V A+E+  A+R LMD  D  + RVK+M E  R  L++ GSS+ A+   +++L
Sbjct: 427 GEIVKAEEIAGAIRSLMDGEDTPRKRVKEMAEAARNALMDGGSSFVAVKRFLDEL 471

BLAST of HG10021007 vs. TAIR 10
Match: AT3G21760.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 339.7 bits (870), Expect = 3.7e-93
Identity = 205/490 (41.84%), Positives = 287/490 (58.57%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHDSRFFATFLAIDIPGRPLVNAYTQSR-----FSLS 67
           LVFI +P  G+L P VE A   V+ D     T + I     P ++ ++ S       SLS
Sbjct: 5   LVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIII-----PQMHGFSSSNSSSYIASLS 64

Query: 68  PSPNLQFIHLPSLQPPSPNLYHSHIAYLSLIFESHKPNVKQAISDLQK--LHNNSGRIVG 127
                +  +     P  P+   +   +   I ++ KP VK  +  L      ++  R+ G
Sbjct: 65  SDSEERLSYNVLSVPDKPDSDDTKPHFFDYI-DNFKPQVKATVEKLTDPGPPDSPSRLAG 124

Query: 128 IFVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKT-DHDRFNAL-IRDSD-AEF 187
             VDMF    IDVAN+  +PSY+F+ S ATFL L +H+    D   ++   ++DSD  E 
Sbjct: 125 FVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDSDTTEL 184

Query: 188 VLPSYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHAL---SSL 247
            +P     L V   P  LLT E  L       +R+ ETKG+++NTFAELEP A+   S +
Sbjct: 185 EVPCLTRPLPVKCFPSVLLTKE-WLPVMFRQTRRFRETKGILVNTFAELEPQAMKFFSGV 244

Query: 248 DE-IPPVYAIGPVVDL--DGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKG 307
           D  +P VY +GPV++L  +GP            +++WLD Q   SVV L FGSMG   +G
Sbjct: 245 DSPLPTVYTVGPVMNLKINGP---NSSDDKQSEILRWLDEQPRKSVVFLCFGSMGGFREG 304

Query: 308 QVREIAFGLERAGFRFVWAVRQ-PPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWV 367
           Q +EIA  LER+G RFVW++R+  PK  +  P+++ +L ++LPEGFL RTA  G + GW 
Sbjct: 305 QAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWA 364

Query: 368 PQVTILSHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELR 427
           PQ  IL++ AIGGFVSHCGWNS LESLWFGVP+ATWP+YAEQQ+NAFEMV+EL LAVE+R
Sbjct: 365 PQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVR 424

Query: 428 LDYR-----EGSKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALN 476
             +R        +L+TA+E+E  +R LM+   +++SRVK+M EK    L++ GSS+ AL 
Sbjct: 425 NSFRGDFMAADDELMTAEEIERGIRCLMEQDSDVRSRVKEMSEKSHVALMDGGSSHVALL 484

BLAST of HG10021007 vs. TAIR 10
Match: AT2G29740.1 (UDP-glucosyl transferase 71C2 )

HSP 1 Score: 333.2 bits (853), Expect = 3.5e-91
Identity = 191/476 (40.13%), Positives = 278/476 (58.40%), Query Frame = 0

Query: 8   LVFICNPAIGNLVPAVEFAVRLVNHD-SRFFA-TFLAIDIPGRPLVNAYTQSRFSLSPSP 67
           L+FI  P  G+++  +E A RL++H  SR    T L   +P  P  +     +  +    
Sbjct: 9   LIFIPFPIPGHILATIELAKRLISHQPSRIHTITILHWSLPFLPQSDTIAFLKSLIETES 68

Query: 68  NLQFIHLPSLQ--PPSPNLYHSHIAYLSLIFESHKPNVKQAISDL--QKLHNNSGRIVGI 127
            ++ I LP +Q  PP      +  +Y+    +   P V+ A+S L   +  ++S  + G+
Sbjct: 69  RIRLITLPDVQNPPPMELFVKASESYILEYVKKMVPLVRNALSTLLSSRDESDSVHVAGL 128

Query: 128 FVDMFTTAFIDVANDLQIPSYLFFASPATFLSLMIHLSKTDHDRFNALIRDSDAEFV-LP 187
            +D F    IDV N+  +PSY+F    A+FL +M +L + + +    L R SD E + +P
Sbjct: 129 VLDFFCVPLIDVGNEFNLPSYIFLTCSASFLGMMKYLLERNRETKPELNRSSDEETISVP 188

Query: 188 SYVHSLTVNLLPPTLLTTEDGLFWYAHHGQRYGETKGVVINTFAELEPHALSSL----DE 247
            +V+S+ V +LPP L TTE    W     +R+ E KG+++N+F  LE +A        D 
Sbjct: 189 GFVNSVPVKVLPPGLFTTESYEAW-VEMAERFPEAKGILVNSFESLERNAFDYFDRRPDN 248

Query: 248 IPPVYAIGPVVDLDGPAQWQPCQGTHQSVVKWLDGQAEGSVVLLSFGSMGSLDKGQVREI 307
            PPVY IGP++  +        +     ++KWLD Q E SVV L FGS+ SL   Q++EI
Sbjct: 249 YPPVYPIGPILCSNDRPNLDLSE--RDRILKWLDDQPESSVVFLCFGSLKSLAASQIKEI 308

Query: 308 AFGLERAGFRFVWAVRQPPKTHLEHPDDYDDLNDVLPEGFLARTAGRGLVCGWVPQVTIL 367
           A  LE  G RF+W++R  PK       +Y   N++LP+GF+ R  G GLVCGW PQV IL
Sbjct: 309 AQALELVGIRFLWSIRTDPK-------EYASPNEILPDGFMNRVMGLGLVCGWAPQVEIL 368

Query: 368 SHHAIGGFVSHCGWNSILESLWFGVPIATWPVYAEQQMNAFEMVKELELAVELRLDY-RE 427
           +H AIGGFVSHCGWNSILESL FGVPIATWP+YAEQQ+NAF +VKEL LA+E+RLDY  E
Sbjct: 369 AHKAIGGFVSHCGWNSILESLRFGVPIATWPMYAEQQLNAFTIVKELGLALEMRLDYVSE 428

Query: 428 GSKLVTAKELETALRRLMDDGDEIKSRVKQMGEKCRAVLVENGSSYGALNSLIEKL 472
             ++V A E+  A+R LMD  D  + ++K++ E  +  +++ GSS+ A+   I+ L
Sbjct: 429 YGEIVKADEIAGAVRSLMDGEDVPRRKLKEIAEAGKEAVMDGGSSFVAVKRFIDGL 474

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038876972.1  3.9e-246  89.17     UDP-glycosyltransferase 43-like [Benincasa hispida]
KAA0055369.1    1.1e-235  83.89     anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo var. makuwa]
XP_008440340.1  4.1e-235  83.68     PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo] >TYJ99296...
XP_023519630.1  4.7e-231  82.85     UDP-glycosyltransferase 43-like [Cucurbita pepo subsp. pepo]
XP_022923653.1  2.6e-229  82.35     UDP-glycosyltransferase 43-like [Cucurbita moschata]
Match Name  E-value   Identity  Description
A0A172J2G3  2.5e-142  52.63     UDP-glycosyltransferase 43 OS=Pueraria montana var. lobata OX=3893 GN=UGT43 PE=1...
D3UAG0      4.0e-108  44.23     UDP-glycosyltransferase 71K1 OS=Malus domestica OX=3750 GN=UGT71K1 PE=1 SV=1
D3UAG2      6.3e-106  44.28     UDP-glycosyltransferase 71K2 OS=Pyrus communis OX=23211 GN=UGT71K2 PE=1 SV=1
Q66PF3      6.3e-106  45.63     Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX...
Q2V6K0      3.5e-104  41.56     UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=...
Match Name  E-value   Identity  Description
A0A5A7UPH3  5.2e-236  83.89     Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold80G0...
A0A5D3BJL3  2.0e-235  83.68     Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G...
A0A1S3B0G6  2.0e-235  83.68     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103484818 PE=3 SV=1
A0A6J1EA82  1.2e-229  82.35     Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111431301 PE=3 SV=1
A0A6J1KGB4  1.6e-229  82.22     Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111495522 PE=3 SV=1
Match Name   E-value  Identity  Description
AT1G07250.1  1.5e-97  40.62     UDP-glucosyl transferase 71C4
AT2G29730.1  1.7e-93  39.96     UDP-glucosyl transferase 71D1
AT1G07260.1  2.8e-93  40.63     UDP-glucosyl transferase 71C3
AT3G21760.1  3.7e-93  41.84     UDP-Glycosyltransferase superfamily protein
AT2G29740.1  3.5e-91  40.13     UDP-glucosyl transferase 71C2
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                 Source       Source Term    Source Description                              Alignment
None       No IPR available                                COILS        Coil           Coil                                            coord: 422..442
None       No IPR available                                GENE3D       3.40.50.2000   Glycogen Phosphorylase B;                       coord: 9..464; e-value: 9.6E-136; score: 455.4
None       No IPR available                                GENE3D       3.40.50.2000   Glycogen Phosphorylase B;                       coord: 254..452; e-value: 9.6E-136; score: 455.4
None       No IPR available                                PANTHER      PTHR48049      GLYCOSYLTRANSFERASE                             coord: 6..472
None       No IPR available                                PANTHER      PTHR48049:SF5  GLYCOSYLTRANSFERASE                             coord: 6..472
None       No IPR available                                SUPERFAMILY  53756          UDP-Glycosyltransferase/glycogen phosphorylase  coord: 5..470
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201        UDPGT                                           coord: 256..435; e-value: 4.6E-23; score: 81.8
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        CDD          cd03784        GT1_Gtf-like                                    coord: 7..471; e-value: 1.43785E-70; score: 227.819
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375        UDPGT                                           coord: 349..392

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10021007.1  HG10021007.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006974      cellular response to DNA damage stimulus
biological_process  GO:0006259      DNA metabolic process
molecular_function  GO:0008194      UDP-glycosyltransferase activity