Sgr021325 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr021325
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: N-acetyltransferase domain-containing protein
Location: tig00153654: 984928 .. 986351 (+)
RNA-Seq Expression: Sgr021325
Synteny: Sgr021325
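
The Location field can be turned into a contig name, a coordinate pair, and a strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, not stated on this page). The function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(text):
    """Split 'contig: start .. end (strand)' into (contig, start, end, strand)."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153654: 984928 .. 986351 (+)")
print(contig, strand, span_length(start, end))
```

Under that assumption the gene spans 1424 bases on the plus strand of tig00153654, introns included.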
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAGCAGAGGAAGAAGAAGAAGATGAAGGAGATACTGATCAGAGAATTCGACCCCATTAACGATAGCGTCGGAGTCGAAGATGTCGAGCGACGATGCGAAGTTGGGTCCACCAAAAAGCTCTCCCTCTTCACCGATCTCCTCGGAGACCCAATCTGCAGGGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTAATCTCTTCTTCCATCCCTCTCTCCTTAATTAATTTTTTCTAACCTCCAATGAATGAATAAACGAAGAACGCACTCAAAAACATTGATCAGGTCGCAGAGATTGGCGCCGAGAAGGAGATTGCGGGGATGATCAGAGGCTGCATCAAGACCGTTAGTTGTTGCTCCAAGCTCCCCAGAAATGGCGGAAACGACGGCTCTGATGTCATCAAACTCCTCCCCGTTTTCACTAAACTCGCTTACATCTTGGGCCTTCGCGTCTCCCCCGCTCATCGGTACCACTCTCAGAGCTTTTTTCTTCCATCGGATTGATTATGAATACGTTTTCCAATCTGATAATATTCTGGGTTTGTTTTTTCTGTAGGAGATTGGGCATTGGGTTGAAGCTGGTGCGGCGGATGGAGGAGTGGTTCAGAGAGAACAGCGCCGAATATTCGTACATCGCGACGGAGAGCGATAACGCCGCCTCCATTAAGCTGTTTACTGATAAATGCGGCTATTCGAAGTTCCGTACGCCGTCGATCCTCGTCAACCCGGTGTTCGCTCATCGGCTCCGCCTCTCCGACCGAGTCACCGTCCTCCGACTCGGCCTCGCCGACGCCGAAACGCTCTACCGCCGCCGCTTCGCCACGACCGAGTTCTTCCCCCGCGATATCGACGCGGTGCTCTGCAACCGGCTGAGTCTCGGCACGTTCCTGGCGGTGCCACGTGGGACGTTTTATGATAAATTGTGGGCCGGGTCGGATCGGTTCTTGTCGTACCCGCCCGAGTCGTGGGCGGTGCTGAGCGCGTGGAACTGCAAGGACGTGTTCGCACTCGAGGTGCGCGGCGCGTCGGCTACGAAGCGGGCTCTGGCGAAAATCAGCCGCGTGATCGACCGCGGCCTCCCGTGGCTGCGGCTGCCGTCGGTGCCGGAGGTGTTCGCACCGTTCGGGGTGCTGTTTTTGTACGGATTGGGAGGGGAAGGCCCACGCGCCGGGAAGTTGATGAAGGCGTTGTGTAACCACGCGCACAACCTGGCGAAGGAGCGTGGGTGTGGAGTAGTGGCGACGGAGGTTTCCAGCGACGAACCGCTTAAGTCGGGAATACCGCATTGGAAAAGACTATCTTGCCCCGAGGATTTATGGTGTATAAAGCGGCTGGGGGAAGGCTACAGTGACAGCTCCGTCGGTGACTGGACTAAATCGCCACCTGGCTTGTCCATATTCGTTGATCCCAGGGAGTTCTAA

mRNA sequence

ATGGGAGAGCAGAGGAAGAAGAAGAAGATGAAGGAGATACTGATCAGAGAATTCGACCCCATTAACGATAGCGTCGGAGTCGAAGATGTCGAGCGACGATGCGAAGTTGGGTCCACCAAAAAGCTCTCCCTCTTCACCGATCTCCTCGGAGACCCAATCTGCAGGGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTCGCAGAGATTGGCGCCGAGAAGGAGATTGCGGGGATGATCAGAGGCTGCATCAAGACCGTTAGTTGTTGCTCCAAGCTCCCCAGAAATGGCGGAAACGACGGCTCTGATGTCATCAAACTCCTCCCCGTTTTCACTAAACTCGCTTACATCTTGGGCCTTCGCGTCTCCCCCGCTCATCGGAGATTGGGCATTGGGTTGAAGCTGGTGCGGCGGATGGAGGAGTGGTTCAGAGAGAACAGCGCCGAATATTCGTACATCGCGACGGAGAGCGATAACGCCGCCTCCATTAAGCTGTTTACTGATAAATGCGGCTATTCGAAGTTCCGTACGCCGTCGATCCTCGTCAACCCGGTGTTCGCTCATCGGCTCCGCCTCTCCGACCGAGTCACCGTCCTCCGACTCGGCCTCGCCGACGCCGAAACGCTCTACCGCCGCCGCTTCGCCACGACCGAGTTCTTCCCCCGCGATATCGACGCGGTGCTCTGCAACCGGCTGAGTCTCGGCACGTTCCTGGCGGTGCCACGTGGGACGTTTTATGATAAATTGTGGGCCGGGTCGGATCGGTTCTTGTCGTACCCGCCCGAGTCGTGGGCGGTGCTGAGCGCGTGGAACTGCAAGGACGTGTTCGCACTCGAGGTGCGCGGCGCGTCGGCTACGAAGCGGGCTCTGGCGAAAATCAGCCGCGTGATCGACCGCGGCCTCCCGTGGCTGCGGCTGCCGTCGGTGCCGGAGGTGTTCGCACCGTTCGGGGTGCTGTTTTTGTACGGATTGGGAGGGGAAGGCCCACGCGCCGGGAAGTTGATGAAGGCGTTGTGTAACCACGCGCACAACCTGGCGAAGGAGCGTGGGTGTGGAGTAGTGGCGACGGAGGTTTCCAGCGACGAACCGCTTAAGTCGGGAATACCGCATTGGAAAAGACTATCTTGCCCCGAGGATTTATGGTGTATAAAGCGGCTGGGGGAAGGCTACAGTGACAGCTCCGTCGGTGACTGGACTAAATCGCCACCTGGCTTGTCCATATTCGTTGATCCCAGGGAGTTCTAA

Coding sequence (CDS)

ATGGGAGAGCAGAGGAAGAAGAAGAAGATGAAGGAGATACTGATCAGAGAATTCGACCCCATTAACGATAGCGTCGGAGTCGAAGATGTCGAGCGACGATGCGAAGTTGGGTCCACCAAAAAGCTCTCCCTCTTCACCGATCTCCTCGGAGACCCAATCTGCAGGGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTCGCAGAGATTGGCGCCGAGAAGGAGATTGCGGGGATGATCAGAGGCTGCATCAAGACCGTTAGTTGTTGCTCCAAGCTCCCCAGAAATGGCGGAAACGACGGCTCTGATGTCATCAAACTCCTCCCCGTTTTCACTAAACTCGCTTACATCTTGGGCCTTCGCGTCTCCCCCGCTCATCGGAGATTGGGCATTGGGTTGAAGCTGGTGCGGCGGATGGAGGAGTGGTTCAGAGAGAACAGCGCCGAATATTCGTACATCGCGACGGAGAGCGATAACGCCGCCTCCATTAAGCTGTTTACTGATAAATGCGGCTATTCGAAGTTCCGTACGCCGTCGATCCTCGTCAACCCGGTGTTCGCTCATCGGCTCCGCCTCTCCGACCGAGTCACCGTCCTCCGACTCGGCCTCGCCGACGCCGAAACGCTCTACCGCCGCCGCTTCGCCACGACCGAGTTCTTCCCCCGCGATATCGACGCGGTGCTCTGCAACCGGCTGAGTCTCGGCACGTTCCTGGCGGTGCCACGTGGGACGTTTTATGATAAATTGTGGGCCGGGTCGGATCGGTTCTTGTCGTACCCGCCCGAGTCGTGGGCGGTGCTGAGCGCGTGGAACTGCAAGGACGTGTTCGCACTCGAGGTGCGCGGCGCGTCGGCTACGAAGCGGGCTCTGGCGAAAATCAGCCGCGTGATCGACCGCGGCCTCCCGTGGCTGCGGCTGCCGTCGGTGCCGGAGGTGTTCGCACCGTTCGGGGTGCTGTTTTTGTACGGATTGGGAGGGGAAGGCCCACGCGCCGGGAAGTTGATGAAGGCGTTGTGTAACCACGCGCACAACCTGGCGAAGGAGCGTGGGTGTGGAGTAGTGGCGACGGAGGTTTCCAGCGACGAACCGCTTAAGTCGGGAATACCGCATTGGAAAAGACTATCTTGCCCCGAGGATTTATGGTGTATAAAGCGGCTGGGGGAAGGCTACAGTGACAGCTCCGTCGGTGACTGGACTAAATCGCCACCTGGCTTGTCCATATTCGTTGATCCCAGGGAGTTCTAA

Protein sequence

MGEQRKKKKMKEILIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF
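
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below assumes the standard genetic code and is not the pipeline used to build this record; `translate` is an illustrative helper. It is applied to the first 30 and the last 9 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGGAGAGCAGAGGAAGAAGAAGAAGATG"))  # start of the CDS
print(translate("GAGTTCTAA"))                       # end of the CDS, with stop codon
```

The first call yields MGEQRKKKKM and the second yields EF, matching the first ten and last two residues of the protein sequence.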
Homology
BLAST of Sgr021325 vs. NCBI nr
Match: XP_023512928.1 (probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo] >XP_023522343.1 probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 700.3 bits (1806), Expect = 1.0e-197
Identity = 343/420 (81.67%), Positives = 366/420 (87.14%), Query Frame = 0

Query: 1   MGEQRKKKKMK-----EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPI 60
           M + +KKKK K     +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPI
Sbjct: 1   MADHKKKKKKKRKNNIQILIREFDPIKDRTAVEDVERRCEVGSTKNKNFSLFTDLLGDPI 60

Query: 61  CRVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKL 120
           CRVRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ +L PV+TKL
Sbjct: 61  CRVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQLAPVYTKL 120

Query: 121 AYILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSK 180
           AYILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSK
Sbjct: 121 AYILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSK 180

Query: 181 FRTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSL 240
           FRTPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSL
Sbjct: 181 FRTPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSL 240

Query: 241 GTFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKIS 300
           GTFLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+S
Sbjct: 241 GTFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLS 300

Query: 301 RVIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVV 360
           RVIDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVV
Sbjct: 301 RVIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVV 360

Query: 361 ATEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           ATEVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 ATEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 420
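
The percentages in each HSP header are the match count divided by the alignment length, gaps included. The sketch below recomputes them for the first hit (343 and 366 out of 420 aligned columns). The helpers `parse_fractions` and `percent` are illustrative names, not part of any BLAST tool.

```python
import re


def parse_fractions(header_line):
    """Extract every 'Label = matches/length' pair from an HSP header line."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", header_line)
    return {label: (int(matches), int(length)) for label, matches, length in pairs}


def percent(matches, length):
    """Share of alignment columns that match, as a percentage with two decimals."""
    return round(100 * matches / length, 2)


line = "Identity = 343/420 (81.67%), Positives = 366/420 (87.14%), Query Frame = 0"
for label, (matches, length) in parse_fractions(line).items():
    print(label, percent(matches, length))
```

Because the denominator is the alignment length rather than the query length, hits of different lengths (420, 419, 418 columns) can show slightly different percentages for nearly the same set of matches.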

BLAST of Sgr021325 vs. NCBI nr
Match: XP_022932903.1 (probable N-acetyltransferase HLS1 isoform X1 [Cucurbita moschata] >KAG7011045.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 698.7 bits (1802), Expect = 2.9e-197
Identity = 342/419 (81.62%), Positives = 366/419 (87.35%), Query Frame = 0

Query: 1   MGEQRKKKKMK----EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPIC 60
           M + +KKKK +    +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPIC
Sbjct: 1   MADHKKKKKKRKNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPIC 60

Query: 61  RVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLA 120
           RVRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ +L PV+TKLA
Sbjct: 61  RVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQLAPVYTKLA 120

Query: 121 YILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKF 180
           YILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSKF
Sbjct: 121 YILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISR 300
           TFLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSR 300

Query: 301 VIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVA 360
           VIDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           TEVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 419

BLAST of Sgr021325 vs. NCBI nr
Match: KAG6571244.1 (putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 696.0 bits (1795), Expect = 1.9e-196
Identity = 341/418 (81.58%), Positives = 365/418 (87.32%), Query Frame = 0

Query: 1   MGEQRKKKKMK---EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPICR 60
           M + +KKKK K   +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPICR
Sbjct: 1   MADHKKKKKRKNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAY 120
           VRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ ++ PV+TKLAY
Sbjct: 61  VRHSPAFLMLVAEISAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQIAPVYTKLAY 120

Query: 121 ILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFR 180
           ILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSKFR
Sbjct: 121 ILGLRVSPDHRRLGIGLNLVRRMEEWFREKRAEYSYIATEIDNVASIKLFTHKCGYSKFR 180

Query: 181 TPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGT 240
           TPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSLGT
Sbjct: 181 TPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGT 240

Query: 241 FLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRV 300
           FLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+SRV
Sbjct: 241 FLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRV 300

Query: 301 IDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVAT 360
           IDR LPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVAT
Sbjct: 301 IDRVLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVAT 360

Query: 361 EVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           EVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 EVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Sgr021325 vs. NCBI nr
Match: XP_022986693.1 (probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 694.5 bits (1791), Expect = 5.5e-196
Identity = 342/418 (81.82%), Positives = 361/418 (86.36%), Query Frame = 0

Query: 1   MGEQRKKKKMK---EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPICR 60
           M +Q+KKKK K   +ILIREFDP  D   VEDVERRCEVGST  K  SLFTDLLGDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAY 120
           VRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+  L PV+TKLAY
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAY 120

Query: 121 ILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFR 180
           ILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGY KFR
Sbjct: 121 ILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFR 180

Query: 181 TPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGT 240
           TPSILVNPVFAHRLRLSD+VT+LRL L  AETLYR RFA  EFFPRDIDAVL NRLSLGT
Sbjct: 181 TPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGT 240

Query: 241 FLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRV 300
           FLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK SRV
Sbjct: 241 FLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRV 300

Query: 301 IDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVAT 360
           IDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVAT
Sbjct: 301 IDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVAT 360

Query: 361 EVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           EVS+DEPLKS IPHWK+LSCPEDLWCIKRLGE Y+D+S+GDWTKSPP  SIFVDPREF
Sbjct: 361 EVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Sgr021325 vs. NCBI nr
Match: XP_022932904.1 (probable N-acetyltransferase HLS1 isoform X2 [Cucurbita moschata])

HSP 1 Score: 692.6 bits (1786), Expect = 2.1e-195
Identity = 341/419 (81.38%), Positives = 365/419 (87.11%), Query Frame = 0

Query: 1   MGEQRKKKKMK----EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPIC 60
           M + +KKKK +    +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPIC
Sbjct: 1   MADHKKKKKKRKNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPIC 60

Query: 61  RVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLA 120
           RVRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ +L PV+TKLA
Sbjct: 61  RVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQLAPVYTKLA 120

Query: 121 YILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKF 180
           YILGLRVSP H RLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSKF
Sbjct: 121 YILGLRVSPDH-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISR 300
           TFLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSR 300

Query: 301 VIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVA 360
           VIDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           TEVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Sgr021325 vs. ExPASy Swiss-Prot
Match: Q42381 (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 SV=1)

HSP 1 Score: 603.6 bits (1555), Expect = 1.7e-171
Identity = 287/404 (71.04%), Positives = 338/404 (83.66%), Query Frame = 0

Query: 14  LIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIGAE- 73
           ++RE+DP  D VGVEDVERRCEVG + KLSLFTDLLGDPICR+RHSP++LMLVAE+G E 
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTEK 62

Query: 74  KEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLGIGL 133
           KEI GMIRGCIKTV+C  KL  N  +  +DV+K  P++TKLAY+LGLRVSP HRR GIG 
Sbjct: 63  KEIVGMIRGCIKTVTCGQKLDLNHKSQ-NDVVK--PLYTKLAYVLGLRVSPFHRRQGIGF 122

Query: 134 KLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHRLRLS 193
           KLV+ MEEWFR+N AEYSYIATE+DN AS+ LFT KCGYS+FRTPSILVNPV+AHR+ +S
Sbjct: 123 KLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNVS 182

Query: 194 DRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFYDK---LW 253
            RVTV++L   DAETLYR RF+TTEFFPRDID+VL N+LSLGTF+AVPRG+ Y      W
Sbjct: 183 RRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSW 242

Query: 254 AGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLPSVP 313
            GS +FL YPPESWAVLS WNCKD F LEVRGAS  +R +AK +RV+D+ LP+L+LPS+P
Sbjct: 243 PGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIP 302

Query: 314 EVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKSGIPH 373
            VF PFG+ F+YG+GGEGPRA K++K+LC HAHNLAK  GCGVVA EV+ ++PL+ GIPH
Sbjct: 303 SVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGIPH 362

Query: 374 WKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           WK LSC EDLWCIKRLG+ YSD  VGDWTKSPPG+SIFVDPREF
Sbjct: 363 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of Sgr021325 vs. ExPASy Swiss-Prot
Match: O64815 (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 590.9 bits (1522), Expect = 1.1e-167
Identity = 279/407 (68.55%), Positives = 332/407 (81.57%), Query Frame = 0

Query: 15  IREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIG--AE 74
           +RE+DP  D   VEDVERRCEVG   KLSLFTDLLGDPICRVRHSP++LMLVAEIG   +
Sbjct: 7   VREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPKEK 66

Query: 75  KEIAGMIRGCIKTVSC---CSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLG 134
           KE+ GMIRGCIKTV+C     +L        +DV+   P++TKLAYILGLRVSP HRR G
Sbjct: 67  KELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQG 126

Query: 135 IGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHRL 194
           IG KLV+ ME+WF +N AEYSY ATE+DN AS+ LFT KCGY++FRTPSILVNPV+AHR+
Sbjct: 127 IGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHRV 186

Query: 195 RLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFY---D 254
            +S RVTV++L  +DAE LYR RF+TTEFFPRDID+VL N+LSLGTF+AVPRG+ Y    
Sbjct: 187 NISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGS 246

Query: 255 KLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLP 314
           + W GS +FL YPP+SWAVLS WNCKD F LEVRGAS  +R ++K +R++D+ LP+L++P
Sbjct: 247 RSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIP 306

Query: 315 SVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKSG 374
           S+P VF PFG+ F+YG+GGEGPRA K++KALC+HAHNLAKE GCGVVA EV+ +EPL+ G
Sbjct: 307 SIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEPLRRG 366

Query: 375 IPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           IPHWK LSC EDLWCIKRLGE YSD SVGDWTKSPPG SIFVDPREF
Sbjct: 367 IPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Sgr021325 vs. ExPASy TrEMBL
Match: A0A6J1F329 (probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439465 PE=4 SV=1)

HSP 1 Score: 698.7 bits (1802), Expect = 1.4e-197
Identity = 342/419 (81.62%), Positives = 366/419 (87.35%), Query Frame = 0

Query: 1   MGEQRKKKKMK----EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPIC 60
           M + +KKKK +    +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPIC
Sbjct: 1   MADHKKKKKKRKNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPIC 60

Query: 61  RVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLA 120
           RVRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ +L PV+TKLA
Sbjct: 61  RVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQLAPVYTKLA 120

Query: 121 YILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKF 180
           YILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSKF
Sbjct: 121 YILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISR 300
           TFLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSR 300

Query: 301 VIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVA 360
           VIDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           TEVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 419

BLAST of Sgr021325 vs. ExPASy TrEMBL
Match: A0A6J1JHA4 (probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484371 PE=4 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 2.7e-196
Identity = 342/418 (81.82%), Positives = 361/418 (86.36%), Query Frame = 0

Query: 1   MGEQRKKKKMK---EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPICR 60
           M +Q+KKKK K   +ILIREFDP  D   VEDVERRCEVGST  K  SLFTDLLGDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAY 120
           VRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+  L PV+TKLAY
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAY 120

Query: 121 ILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFR 180
           ILGLRVSP HRRLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGY KFR
Sbjct: 121 ILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFR 180

Query: 181 TPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGT 240
           TPSILVNPVFAHRLRLSD+VT+LRL L  AETLYR RFA  EFFPRDIDAVL NRLSLGT
Sbjct: 181 TPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGT 240

Query: 241 FLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRV 300
           FLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK SRV
Sbjct: 241 FLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRV 300

Query: 301 IDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVAT 360
           IDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVAT
Sbjct: 301 IDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVAT 360

Query: 361 EVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           EVS+DEPLKS IPHWK+LSCPEDLWCIKRLGE Y+D+S+GDWTKSPP  SIFVDPREF
Sbjct: 361 EVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Sgr021325 vs. ExPASy TrEMBL
Match: A0A6J1EYB0 (probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439465 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 1.0e-195
Identity = 341/419 (81.38%), Positives = 365/419 (87.11%), Query Frame = 0

Query: 1   MGEQRKKKKMK----EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPIC 60
           M + +KKKK +    +ILIREFDPI D   VEDVERRCEVGST  K  SLFTDLLGDPIC
Sbjct: 1   MADHKKKKKKRKNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPIC 60

Query: 61  RVRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLA 120
           RVRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+ +L PV+TKLA
Sbjct: 61  RVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGINASDLPQLAPVYTKLA 120

Query: 121 YILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKF 180
           YILGLRVSP H RLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGYSKF
Sbjct: 121 YILGLRVSPDH-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VT+L+L L  AETLYR RFAT EFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISR 300
           TFLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK+SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSR 300

Query: 301 VIDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVA 360
           VIDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           TEVS+DEPLKS IPHWK+LSCPEDLWCIKRLGEGY+++S+GDWTKSPP  SIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Sgr021325 vs. ExPASy TrEMBL
Match: A0A6J1JGS4 (probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111484371 PE=4 SV=1)

HSP 1 Score: 688.3 bits (1775), Expect = 1.9e-194
Identity = 341/418 (81.58%), Positives = 360/418 (86.12%), Query Frame = 0

Query: 1   MGEQRKKKKMK---EILIREFDPINDSVGVEDVERRCEVGST--KKLSLFTDLLGDPICR 60
           M +Q+KKKK K   +ILIREFDP  D   VEDVERRCEVGST  K  SLFTDLLGDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAY 120
           VRHSPAFLMLVAEI A  EI GMIRGCIKTV+CCSK  RN G + SD+  L PV+TKLAY
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAY 120

Query: 121 ILGLRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFR 180
           ILGLRVSP H RLGIGL LVRRMEEWFRE  AEYSYIATE DN ASIKLFT KCGY KFR
Sbjct: 121 ILGLRVSPDH-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFR 180

Query: 181 TPSILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGT 240
           TPSILVNPVFAHRLRLSD+VT+LRL L  AETLYR RFA  EFFPRDIDAVL NRLSLGT
Sbjct: 181 TPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGT 240

Query: 241 FLAVPRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRV 300
           FLAVPRG+F D LW  SDRFLS  PESWAVLSAWNCKDV+ALEVRG S  KRA+AK SRV
Sbjct: 241 FLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRV 300

Query: 301 IDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVAT 360
           IDRGLPWLRLPSVPEVF PFGVLFLYG+GGEGP AGKLMKALC+HAHNLAKERGCGVVAT
Sbjct: 301 IDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVAT 360

Query: 361 EVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           EVS+DEPLKS IPHWK+LSCPEDLWCIKRLGE Y+D+S+GDWTKSPP  SIFVDPREF
Sbjct: 361 EVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 417

BLAST of Sgr021325 vs. ExPASy TrEMBL
Match: A0A803PEC7 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 650.2 bits (1676), Expect = 5.8e-183
Identity = 312/413 (75.54%), Positives = 354/413 (85.71%), Query Frame = 0

Query: 2   GEQRKKKKMKEILIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPA 61
           GE+RK      +++R +DP  D +GVEDVERRCEVG + +LSLFTDLLGDPICRVR+SPA
Sbjct: 3   GEERK----VSLVVRNYDPKTDLLGVEDVERRCEVGPSGELSLFTDLLGDPICRVRNSPA 62

Query: 62  FLMLVAE--IGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGL 121
           +LMLVAE  IG EKEI GMIRGCIKTV+C  KL RNG N+     K +PV+TK+AY+LGL
Sbjct: 63  YLMLVAESIIGEEKEIVGMIRGCIKTVTCGKKLSRNGKNNNEATPKPVPVYTKVAYVLGL 122

Query: 122 RVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSI 181
           RVSP+HRR+GIGLKLVR MEEWFR+N AEYSY+AT++DN AS+ LFTDKCGYSKFRTPSI
Sbjct: 123 RVSPSHRRMGIGLKLVRDMEEWFRKNGAEYSYLATDNDNQASVNLFTDKCGYSKFRTPSI 182

Query: 182 LVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAV 241
           LVNPVFAHR+R+S RVTV++L  +DAE LYR+R+ATTEFFPRDID+VL N+L+LGTFLAV
Sbjct: 183 LVNPVFAHRVRVSSRVTVIKLPPSDAEFLYRKRYATTEFFPRDIDSVLNNKLTLGTFLAV 242

Query: 242 PRGTFYDKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRG 301
           PRGT+  + W GS+RFLS PPESWA+LS WN KDVF LEVRG S  KR LAK +RV+DR 
Sbjct: 243 PRGTYTAQTWPGSNRFLSSPPESWALLSVWNAKDVFTLEVRGVSRVKRTLAKTTRVLDRA 302

Query: 302 LPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSS 361
           LPWLRLPSVPEVF PFG+ FLYGLGGEGPRA K +KALC HAHNLAKERGCGVVATEVSS
Sbjct: 303 LPWLRLPSVPEVFKPFGLHFLYGLGGEGPRAVKFVKALCAHAHNLAKERGCGVVATEVSS 362

Query: 362 DEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPRE 413
            EPL+ GIPHWKRLSC EDLWCIKRLGE YSD SVGDWTKSPPGLSIFVDPRE
Sbjct: 363 HEPLRLGIPHWKRLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGLSIFVDPRE 411

BLAST of Sgr021325 vs. TAIR 10
Match: AT4G37580.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 603.6 bits (1555), Expect = 1.2e-172
Identity = 287/404 (71.04%), Positives = 338/404 (83.66%), Query Frame = 0

Query: 14  LIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIGAE- 73
           ++RE+DP  D VGVEDVERRCEVG + KLSLFTDLLGDPICR+RHSP++LMLVAE+G E 
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTEK 62

Query: 74  KEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLGIGL 133
           KEI GMIRGCIKTV+C  KL  N  +  +DV+K  P++TKLAY+LGLRVSP HRR GIG 
Sbjct: 63  KEIVGMIRGCIKTVTCGQKLDLNHKSQ-NDVVK--PLYTKLAYVLGLRVSPFHRRQGIGF 122

Query: 134 KLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHRLRLS 193
           KLV+ MEEWFR+N AEYSYIATE+DN AS+ LFT KCGYS+FRTPSILVNPV+AHR+ +S
Sbjct: 123 KLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNVS 182

Query: 194 DRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFYDK---LW 253
            RVTV++L   DAETLYR RF+TTEFFPRDID+VL N+LSLGTF+AVPRG+ Y      W
Sbjct: 183 RRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSW 242

Query: 254 AGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLPSVP 313
            GS +FL YPPESWAVLS WNCKD F LEVRGAS  +R +AK +RV+D+ LP+L+LPS+P
Sbjct: 243 PGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIP 302

Query: 314 EVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKSGIPH 373
            VF PFG+ F+YG+GGEGPRA K++K+LC HAHNLAK  GCGVVA EV+ ++PL+ GIPH
Sbjct: 303 SVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGIPH 362

Query: 374 WKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           WK LSC EDLWCIKRLG+ YSD  VGDWTKSPPG+SIFVDPREF
Sbjct: 363 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of Sgr021325 vs. TAIR 10
Match: AT2G23060.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 590.9 bits (1522), Expect = 8.0e-169
Identity = 279/407 (68.55%), Positives = 332/407 (81.57%), Query Frame = 0

Query: 15  IREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIG--AE 74
           +RE+DP  D   VEDVERRCEVG   KLSLFTDLLGDPICRVRHSP++LMLVAEIG   +
Sbjct: 7   VREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPKEK 66

Query: 75  KEIAGMIRGCIKTVSC---CSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLG 134
           KE+ GMIRGCIKTV+C     +L        +DV+   P++TKLAYILGLRVSP HRR G
Sbjct: 67  KELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQG 126

Query: 135 IGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHRL 194
           IG KLV+ ME+WF +N AEYSY ATE+DN AS+ LFT KCGY++FRTPSILVNPV+AHR+
Sbjct: 127 IGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHRV 186

Query: 195 RLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFY---D 254
            +S RVTV++L  +DAE LYR RF+TTEFFPRDID+VL N+LSLGTF+AVPRG+ Y    
Sbjct: 187 NISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGS 246

Query: 255 KLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLP 314
           + W GS +FL YPP+SWAVLS WNCKD F LEVRGAS  +R ++K +R++D+ LP+L++P
Sbjct: 247 RSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIP 306

Query: 315 SVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKSG 374
           S+P VF PFG+ F+YG+GGEGPRA K++KALC+HAHNLAKE GCGVVA EV+ +EPL+ G
Sbjct: 307 SIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEPLRRG 366

Query: 375 IPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           IPHWK LSC EDLWCIKRLGE YSD SVGDWTKSPPG SIFVDPREF
Sbjct: 367 IPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Sgr021325 vs. TAIR 10
Match: AT2G23060.2 (Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 511.9 bits (1317), Expect = 4.7e-145
Identity = 242/358 (67.60%), Positives = 291/358 (81.28%), Query Frame = 0

Query: 64  MLVAEIG--AEKEIAGMIRGCIKTVSC---CSKLPRNGGNDGSDVIKLLPVFTKLAYILG 123
           MLVAEIG   +KE+ GMIRGCIKTV+C     +L        +DV+   P++TKLAYILG
Sbjct: 1   MLVAEIGPKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILG 60

Query: 124 LRVSPAHRRLGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPS 183
           LRVSP HRR GIG KLV+ ME+WF +N AEYSY ATE+DN AS+ LFT KCGY++FRTPS
Sbjct: 61  LRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPS 120

Query: 184 ILVNPVFAHRLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLA 243
           ILVNPV+AHR+ +S RVTV++L  +DAE LYR RF+TTEFFPRDID+VL N+LSLGTF+A
Sbjct: 121 ILVNPVYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVA 180

Query: 244 VPRGTFY---DKLWAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRV 303
           VPRG+ Y    + W GS +FL YPP+SWAVLS WNCKD F LEVRGAS  +R ++K +R+
Sbjct: 181 VPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRM 240

Query: 304 IDRGLPWLRLPSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVAT 363
           +D+ LP+L++PS+P VF PFG+ F+YG+GGEGPRA K++KALC+HAHNLAKE GCGVVA 
Sbjct: 241 VDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAA 300

Query: 364 EVSSDEPLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPREF 414
           EV+ +EPL+ GIPHWK LSC EDLWCIKRLGE YSD SVGDWTKSPPG SIFVDPREF
Sbjct: 301 EVAGEEPLRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 358

BLAST of Sgr021325 vs. TAIR 10
Match: AT5G67430.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 453.8 bits (1166), Expect = 1.5e-127
Identity = 235/407 (57.74%), Positives = 295/407 (72.48%), Query Frame = 0

Query: 8   KKMKEILIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVA 67
           K    +++RE+DP  D   VE++E  CEVG     SL  DL+GDP+ R+R SP+F MLVA
Sbjct: 3   KGFNVVVVREYDPKRDLTSVEELEESCEVG-----SLLVDLMGDPLARIRQSPSFHMLVA 62

Query: 68  EIGAEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRR 127
           EIG   EI GMIRG IK V+      R   +D S  I      TKLA++ GLRVSP +RR
Sbjct: 63  EIG--NEIVGMIRGTIKMVTRGVNALRQ-ADDVSPEINT----TKLAFVSGLRVSPFYRR 122

Query: 128 LGIGLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAH 187
           +GIGLKLV+R+EEWF  N A YSY+ TE+DN AS+KLFT+K GYSKFRTP+ LVNPVF H
Sbjct: 123 MGIGLKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNH 182

Query: 188 RLRLSDRVTVLRLGLADAETLYRRRFATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFYDK 247
           R+ +S RV +++L  +DAE+LYR RF+TTEFFP DI+++L N+LSLGT+LAVPRG     
Sbjct: 183 RVTVSRRVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRG----- 242

Query: 248 LWAGSDRFLSYPPE--SWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRL 307
              G +   S P +  SWAV+S WN KDV+ L+V+GAS  KR LAK +RV D   P+L++
Sbjct: 243 ---GDNVSGSLPDQTGSWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKI 302

Query: 308 PSVPEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKERGCGVVATEVSSDEPLKS 367
           PS P +F  F + F+YG+GGEGPRA ++++ALC+HAHNLA++ GC VVA EV+S EPL+ 
Sbjct: 303 PSFPNLFKSFAMHFMYGIGGEGPRAAEMVEALCSHAHNLARKSGCAVVAAEVASCEPLRV 362

Query: 368 GIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPRE 413
           GIPHWK LS PEDLWC+KRL   Y D  V DWTKSPPGLSIFVDPRE
Sbjct: 363 GIPHWKVLS-PEDLWCLKRL--RYDDDGV-DWTKSPPGLSIFVDPRE 385

BLAST of Sgr021325 vs. TAIR 10
Match: AT2G30090.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 307.4 bits (786), Expect = 1.8e-83
Identity = 174/411 (42.34%), Positives = 244/411 (59.37%), Query Frame = 0

Query: 11  KEILIREFDPINDSVGVEDVERRCEVGSTKKLSLFTDLLGDPICRVRHSPAFLMLVAEIG 70
           +E++IR +D   D + +  +E+ CE+G   +  LFTD LGDPICR+R+SP F+MLVA +G
Sbjct: 11  EEVVIRCYDDRRDRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIMLVAGVG 70

Query: 71  AEKEIAGMIRGCIKTVSCCSKLPRNGGNDGSDVIKLLPVFTKLAYILGLRVSPAHRRLGI 130
              ++ G I+G +K V    K  R G                  Y+LGLRV P++RR GI
Sbjct: 71  --NKLVGSIQGSVKPVEFHDKSVRVG------------------YVLGLRVVPSYRRRGI 130

Query: 131 GLKLVRRMEEWFRENSAEYSYIATESDNAASIKLFTDKCGYSKFRTPSILVNPVFAHR-L 190
           G  LVR++EEWF  ++A+Y+Y+ATE DN AS  LF  + GY  FR P+ILVNPV   R L
Sbjct: 131 GSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRGL 190

Query: 191 RLSDRVTVLRLGLADAETLYRRRF-ATTEFFPRDIDAVLCNRLSLGTFLAVPRGTFYDKL 250
           +L   + + +L + +AE+LYRR   ATTEFFP DI+ +L N+LS+GT++A     +Y+ +
Sbjct: 191 KLPSDIGIRKLKVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVA-----YYNNV 250

Query: 251 WAGSDRFLSYPPESWAVLSAWNCKDVFALEVRGASATKRALAKISRVIDRGLPWLRLPSV 310
                        SWA+LS W+   VF L +  A  +   L K+S++    L  L L  +
Sbjct: 251 ---------DNTRSWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTVL 310

Query: 311 PEVFAPFGVLFLYGLGGEGPRAGKLMKALCNHAHNLAKER---GCGVVATEV----SSDE 370
           P++F PFG  FLYG+  EGP  GKL++ALC H HN+A       C VV  EV    + D+
Sbjct: 311 PDLFTPFGFYFLYGVHSEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGSNGDD 370

Query: 371 PLKSGIPHWKRLSCPEDLWCIKRLGEGYSDSSVGDWTKSPPGLSIFVDPRE 413
            L+  IPHWK LSC +D+WCIK L    +   + + +KS    S+FVDPRE
Sbjct: 371 SLQRCIPHWKMLSCDDDMWCIKPLKCEKNKFDLSERSKSRS--SLFVDPRE 385
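
The percentages in each HSP summary line follow directly from the counts: the number of matching columns divided by the alignment length, which includes gap columns. The sketch below re-derives them from the summary lines of the three TAIR 10 hits shown above. `parse_hsp_summary` is a hypothetical helper written for this page's line layout, not part of any BLAST toolkit.

```python
import re

# Some report renderings misspell "Positives" as "Postives"; accept both.
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line: str) -> dict:
    """Extract counts and reported percentages from one HSP summary line."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: " + line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    if length != pos_length:
        raise ValueError("identity and positive denominators differ")
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


lines = [
    "Identity = 242/358 (67.60%), Positives = 291/358 (81.28%), Query Frame = 0",
    "Identity = 235/407 (57.74%), Positives = 295/407 (72.48%), Query Frame = 0",
    "Identity = 174/411 (42.34%), Positives = 244/411 (59.37%), Query Frame = 0",
]

for line in lines:
    hsp = parse_hsp_summary(line)
    # Recompute each percentage from its counts and compare with the report.
    identity = 100 * hsp["identities"] / hsp["length"]
    positive = 100 * hsp["positives"] / hsp["length"]
    assert abs(identity - hsp["identity_pct"]) < 0.005
    assert abs(positive - hsp["positive_pct"]) < 0.005
    print(f"{identity:.2f} {positive:.2f}")
```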

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_023512928.1  1.0e-197  81.67         probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo] >XP_023522343.1 p...
XP_022932903.1  2.9e-197  81.62         probable N-acetyltransferase HLS1 isoform X1 [Cucurbita moschata] >KAG7011045.1 ...
KAG6571244.1    1.9e-196  81.58         putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma ...
XP_022986693.1  5.5e-196  81.82         probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima]
XP_022932904.1  2.1e-195  81.38         probable N-acetyltransferase HLS1 isoform X2 [Cucurbita moschata]

Match Name      E-value   Identity (%)  Description
Q42381          1.7e-171  71.04         Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 S...
O64815          1.1e-167  68.55         Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23...

Match Name      E-value   Identity (%)  Description
A0A6J1F329      1.4e-197  81.62         probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1JHA4      2.7e-196  81.82         probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A6J1EYB0      1.0e-195  81.38         probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1JGS4      1.9e-194  81.58         probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A803PEC7      5.8e-183  75.54         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT4G37580.1     1.2e-172  71.04         Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1     8.0e-169  68.55         Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.2     4.7e-145  67.60         Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1     1.5e-127  57.74         Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G30090.1     1.8e-83   42.34         Acyl-CoA N-acyltransferases (NAT) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term     Source Description                     Alignment
IPR000182  GNAT domain                 PFAM         PF00583         Acetyltransf_1                         coord: 59..171; e-value: 4.0E-16; score: 59.3
IPR000182  GNAT domain                 PROSITE      PS51186         GNAT                                   coord: 13..192; score: 14.343189
None       No IPR available            GENE3D       3.40.630.30     -                                      coord: 11..177; e-value: 9.2E-15; score: 56.8
None       No IPR available            PANTHER      PTHR43072       N-ACETYLTRANSFERASE                    coord: 12..413
None       No IPR available            PANTHER      PTHR43072:SF42  N-ACETYLTRANSFERASE HLS1-LIKE-RELATED  coord: 12..413
None       No IPR available            CDD          cd04301         NAT_SF                                 coord: 103..149; e-value: 7.07112E-6; score: 41.4925
IPR016181  Acyl-CoA N-acyltransferase  SUPERFAMILY  55729           Acyl-CoA N-acyltransferases (Nat)      coord: 10..174
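
The `coord: start..end` values give each signature's span on the protein. Assuming they are 1-based, inclusive residue positions (the usual convention for InterProScan output, though this page does not state it), span lengths can be derived as sketched below. `span_length` is a hypothetical helper.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment: str) -> int:
    """Length in residues of a 'coord: start..end' span (assumed inclusive)."""
    m = COORD.search(alignment)
    if m is None:
        raise ValueError("no 'coord: start..end' field found")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start")
    # Both endpoints are counted under the inclusive assumption, hence the +1.
    return end - start + 1


print(span_length("coord: 59..171"))  # Pfam PF00583 hit: prints 113
print(span_length("coord: 13..192"))  # PROSITE PS51186 hit: prints 180
```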

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr021325.1   Sgr021325.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008080      N-acetyltransferase activity