HG10021803 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021803
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: N-acetyltransferase domain-containing protein
Location: Chr05: 16940764 .. 16942162 (-)
RNA-Seq Expression: HG10021803
Synteny: HG10021803
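
The location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly); the function name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr05: 16940764 .. 16942162 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 16940764 .. 16942162 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 - 1399
```

Under that assumption the feature spans 1399 bp on the minus strand of Chr05.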
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGATGTGAAGAAGAAATTTTGATAATAAGAAGCTATGATGGGCAATCTGCAGATAGAGGTAGAGTGGAAGATCTAGAGAGAAGATGCGAGGTAGGGCCATCTGAAAGAGTTTTTCTCTTCACAGACACTATGGGTGACCCCATTTGTAGGATCAGAAACAGTCCCTTGTACAAGATGCTGGTATTTCCCTCCCACCACTCCCACCTCTTTCATTAATTCATTCATTCCTTCCTTCTCTTGTTTTTAAATATACCGTAAACCTATTGACTTAAGTTTTTCGTTACTTCAGGTGGCCGAAGTGGATAACCAGTTGGTTGGTGTGATTCAAGGCTCCATAAAGGTGGTAACGGTTCACCAGGCACCGAAAGACCGAGCCAAGGTTGGGTATGTTTTAGGCCTTCGAGTTGCACCATTGTTTCGCCGTCGAGGGATTGGTTGTAACCTTGTGCGACGGCTCGAAGAGTGGTTTGCGGTTAATGATGTAGATTATGCTTATATGGCGACGGAGAAAGACAACGAAGCATCTGTGAAGCTATTCATCAACAAGCTTGGATACACTAATTTTAGAGTTCCAGCGATTTTGGTGAACCCAGTGAAACATTACCGATCATATCACATCCCTTCCAACATCCAAATTGCTAGTCTGAAAGTGGATATCGCCGAGTTTCTCTATCGGAAATTCATGGCCTCTACGGAGTTCTTCCCCCATGACATTGATCACGTGCTCAAACACAAGCTAAGCCTCGGCACATGGGTTGCTTACTATAAAGATGACGACACCTCTTCTGCCAAATTCGAAACAAACGGTAGCAAGTCGGAAATTGTAATACCAAAGAGCTGGGCAATGCTGAGTGTATGGAACAGTGGAGAGGTGAGTGCTAAACTACAACTTTCAACACAAGAAGCTTGCTTCTTAAGACGATAAAGTTACAGCATATTTAATCATTTCATTAATTTTTTCTACTAGGTGTTCAAGCTACGACTGGGCAAGGCACCATTGTCGTGCTTCATATACACAGAGAGCTCGAAGGTGATAGACAAGATCTTCCCATGTCTGAAGTTGCCATCAATACCAGATTTCTATGAGCCATTTGGATTCTACTTTATGTACGGGGTTCATCGGGAGGGGACGGGGACAGGGAAGCTGGTGAGAGCATTGTGCCAATACGTGCACAACATGGCAGCCGCGGCGAGGGACTGTAAAGTAATAGTAACAGAGATTGGAGGAGAAGACTCACTGAGAGAAGAGATTCCACATTGGAAATTGCTGTCATGCCCCGAAGATTTGTGGTGCATAAAGGCATTGAAGAAAGAAACAAGAAATAGCCTACATGAGTTGACAAAAACCCCACCAACTACAAGACCAGCCCTTTTTGTAGACCCAAGAGAGGTATGA

mRNA sequence

ATGGGATGTGAAGAAGAAATTTTGATAATAAGAAGCTATGATGGGCAATCTGCAGATAGAGGTAGAGTGGAAGATCTAGAGAGAAGATGCGAGGTAGGGCCATCTGAAAGAGTTTTTCTCTTCACAGACACTATGGGTGACCCCATTTGTAGGATCAGAAACAGTCCCTTGTACAAGATGCTGGTGGCCGAAGTGGATAACCAGTTGGTTGGTGTGATTCAAGGCTCCATAAAGGTGGTAACGGTTCACCAGGCACCGAAAGACCGAGCCAAGGTTGGGTATGTTTTAGGCCTTCGAGTTGCACCATTGTTTCGCCGTCGAGGGATTGGTTGTAACCTTGTGCGACGGCTCGAAGAGTGGTTTGCGGTTAATGATGTAGATTATGCTTATATGGCGACGGAGAAAGACAACGAAGCATCTGTGAAGCTATTCATCAACAAGCTTGGATACACTAATTTTAGAGTTCCAGCGATTTTGGTGAACCCAGTGAAACATTACCGATCATATCACATCCCTTCCAACATCCAAATTGCTAGTCTGAAAGTGGATATCGCCGAGTTTCTCTATCGGAAATTCATGGCCTCTACGGAGTTCTTCCCCCATGACATTGATCACGTGCTCAAACACAAGCTAAGCCTCGGCACATGGGTTGCTTACTATAAAGATGACGACACCTCTTCTGCCAAATTCGAAACAAACGGTAGCAAGTCGGAAATTGTAATACCAAAGAGCTGGGCAATGCTGAGTGTATGGAACAGTGGAGAGGTGTTCAAGCTACGACTGGGCAAGGCACCATTGTCGTGCTTCATATACACAGAGAGCTCGAAGGTGATAGACAAGATCTTCCCATGTCTGAAGTTGCCATCAATACCAGATTTCTATGAGCCATTTGGATTCTACTTTATGTACGGGGTTCATCGGGAGGGGACGGGGACAGGGAAGCTGGTGAGAGCATTGTGCCAATACGTGCACAACATGGCAGCCGCGGCGAGGGACTGTAAAGTAATAGTAACAGAGATTGGAGGAGAAGACTCACTGAGAGAAGAGATTCCACATTGGAAATTGCTGTCATGCCCCGAAGATTTGTGGTGCATAAAGGCATTGAAGAAAGAAACAAGAAATAGCCTACATGAGTTGACAAAAACCCCACCAACTACAAGACCAGCCCTTTTTGTAGACCCAAGAGAGGTATGA

Coding sequence (CDS)

ATGGGATGTGAAGAAGAAATTTTGATAATAAGAAGCTATGATGGGCAATCTGCAGATAGAGGTAGAGTGGAAGATCTAGAGAGAAGATGCGAGGTAGGGCCATCTGAAAGAGTTTTTCTCTTCACAGACACTATGGGTGACCCCATTTGTAGGATCAGAAACAGTCCCTTGTACAAGATGCTGGTGGCCGAAGTGGATAACCAGTTGGTTGGTGTGATTCAAGGCTCCATAAAGGTGGTAACGGTTCACCAGGCACCGAAAGACCGAGCCAAGGTTGGGTATGTTTTAGGCCTTCGAGTTGCACCATTGTTTCGCCGTCGAGGGATTGGTTGTAACCTTGTGCGACGGCTCGAAGAGTGGTTTGCGGTTAATGATGTAGATTATGCTTATATGGCGACGGAGAAAGACAACGAAGCATCTGTGAAGCTATTCATCAACAAGCTTGGATACACTAATTTTAGAGTTCCAGCGATTTTGGTGAACCCAGTGAAACATTACCGATCATATCACATCCCTTCCAACATCCAAATTGCTAGTCTGAAAGTGGATATCGCCGAGTTTCTCTATCGGAAATTCATGGCCTCTACGGAGTTCTTCCCCCATGACATTGATCACGTGCTCAAACACAAGCTAAGCCTCGGCACATGGGTTGCTTACTATAAAGATGACGACACCTCTTCTGCCAAATTCGAAACAAACGGTAGCAAGTCGGAAATTGTAATACCAAAGAGCTGGGCAATGCTGAGTGTATGGAACAGTGGAGAGGTGTTCAAGCTACGACTGGGCAAGGCACCATTGTCGTGCTTCATATACACAGAGAGCTCGAAGGTGATAGACAAGATCTTCCCATGTCTGAAGTTGCCATCAATACCAGATTTCTATGAGCCATTTGGATTCTACTTTATGTACGGGGTTCATCGGGAGGGGACGGGGACAGGGAAGCTGGTGAGAGCATTGTGCCAATACGTGCACAACATGGCAGCCGCGGCGAGGGACTGTAAAGTAATAGTAACAGAGATTGGAGGAGAAGACTCACTGAGAGAAGAGATTCCACATTGGAAATTGCTGTCATGCCCCGAAGATTTGTGGTGCATAAAGGCATTGAAGAAAGAAACAAGAAATAGCCTACATGAGTTGACAAAAACCCCACCAACTACAAGACCAGCCCTTTTTGTAGACCCAAGAGAGGTATGA

Protein sequence

MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIVIPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV
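
The protein sequence corresponds to a translation of the coding sequence. The sketch below assumes the standard genetic code and uses a hand-built codon table rather than any particular library; the names CODON_TABLE and translate are illustrative. Only the first 36 bases and the last 9 bases of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 36 bases of the CDS above.
print(translate("ATGGGATGTGAAGAAGAAATTTTGATAATAAGAAGC"))  # MGCEEEILIIRS
# Last 9 bases of the CDS above, ending in the TGA stop codon.
print(translate("GAGGTATGA"))  # EV
```

Both fragments agree with the start (MGCEEEILIIRS) and end (EV) of the listed protein sequence.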
Homology
BLAST of HG10021803 vs. NCBI nr
Match: XP_008457342.1 (PREDICTED: probable N-acetyltransferase HLS1 [Cucumis melo] >KAA0060026.1 putative N-acetyltransferase HLS1 [Cucumis melo var. makuwa] >TYJ97283.1 putative N-acetyltransferase HLS1 [Cucumis melo var. makuwa])

HSP 1 Score: 800.8 bits (2067), Expect = 5.3e-228
Identity = 385/397 (96.98%), Positives = 389/397 (97.98%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F +NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYH+PSNIQIA L
Sbjct: 121 FVINDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDD SSAKFETN SKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDVSSAKFETNSSKSEIT 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 397
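
The percentages in an HSP summary line follow directly from the match counts and the alignment length. As a rough sketch, the helper below (hypothetical name check_hsp_stats) parses such a line and recomputes both values; the regular expression is an assumption about the line layout seen on this page and accepts either spelling of the positives label.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line):
    """Recompute identity and positives percentages from an HSP summary line.

    Returns (identity_pct, positives_pct), each rounded to two decimals.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"unrecognized HSP summary: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct

line = "Identity = 385/397 (96.98%), Positives = 389/397 (97.98%), Query Frame = 0"
print(check_hsp_stats(line))  # (96.98, 97.98)
```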

BLAST of HG10021803 vs. NCBI nr
Match: XP_011658711.1 (probable N-acetyltransferase HLS1 [Cucumis sativus])

HSP 1 Score: 798.1 bits (2060), Expect = 3.4e-227
Identity = 384/397 (96.73%), Positives = 388/397 (97.73%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F +NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSY +PSNIQIA L
Sbjct: 121 FMINDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYQLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDD SS KFETNGSKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDVSSTKFETNGSKSEIT 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 397
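
The identity count can also be derived from the aligned rows themselves. The sketch below, using the hypothetical helper count_identities, compares one Query row with its Sbjct row column by column and treats "-" as a gap character; it is illustrated on the final alignment block above.

```python
def count_identities(query_row, sbjct_row):
    """Count aligned columns where both rows carry the same residue.

    Columns containing a gap ('-') in either row never count as identical.
    Returns (identical_columns, total_columns).
    """
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1
        for q, s in zip(query_row, sbjct_row)
        if q == s and q != "-"
    )
    return identical, len(query_row)

query = "DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV"
sbjct = "DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV"
print(count_identities(query, sbjct))  # (36, 37)
```

Summing these per-block counts over all blocks of an HSP should reproduce the numerator reported in its Identity field.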

BLAST of HG10021803 vs. NCBI nr
Match: XP_038894892.1 (probable N-acetyltransferase HLS1 [Benincasa hispida])

HSP 1 Score: 797.0 bits (2057), Expect = 7.6e-227
Identity = 384/397 (96.73%), Positives = 387/397 (97.48%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRV P FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVVPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYH+PSNIQIA L
Sbjct: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDID VLKHKLSLGTWVAYYKDDDTSS KFETNGSKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDQVLKHKLSLGTWVAYYKDDDTSSTKFETNGSKSEIA 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREG GTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGMGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRP LFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPGLFVDPREV 397

BLAST of HG10021803 vs. NCBI nr
Match: XP_022998707.1 (probable N-acetyltransferase HLS1 [Cucurbita maxima])

HSP 1 Score: 768.5 bits (1983), Expect = 2.9e-218
Identity = 372/399 (93.23%), Positives = 382/399 (95.74%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           M  EEEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MRYEEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIG NLVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGYNLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F  NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR YH+PSNIQI+SL
Sbjct: 121 FVANDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYHLPSNIQISSL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLG+WVAYYKDDD+++ KFETNG KSE+V
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGSWVAYYKDDDSTTPKFETNGGKSEMV 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPK WAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKCWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTG--KLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSC 360
           FMYGVHREG G G  KLV+ALCQYVHNMAAAARDCKVIVTEIGGEDSLR+EIPHWKLLSC
Sbjct: 301 FMYGVHREGKGMGTRKLVKALCQYVHNMAAAARDCKVIVTEIGGEDSLRDEIPHWKLLSC 360

Query: 361 PEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           PEDLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 PEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 399

BLAST of HG10021803 vs. NCBI nr
Match: KAG6607328.1 (putative N-acetyltransferase HLS1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 768.1 bits (1982), Expect = 3.8e-218
Identity = 372/399 (93.23%), Positives = 381/399 (95.49%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           M  EEEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MRYEEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIG NLVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGYNLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F  NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR YH+PSNIQI+SL
Sbjct: 121 FVANDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYHLPSNIQISSL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLG+WVAYYKDDD ++ KFETNG KSE+V
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGSWVAYYKDDDATTPKFETNGGKSEMV 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPK WAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKCWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTG--KLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSC 360
           FMYGVHREG G G  KLV+ALCQYVHNMAAAARDCKVIVTEIGGEDSLR+EIPHWKLLSC
Sbjct: 301 FMYGVHREGKGLGTRKLVKALCQYVHNMAAAARDCKVIVTEIGGEDSLRDEIPHWKLLSC 360

Query: 361 PEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           PEDLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 PEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 399
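
The five NCBI nr hits above can be compared by parsing their score lines. This is a small sketch that assumes the "Score: ... bits (...), Expect = ..." layout shown on this page; parse_score_line is a hypothetical helper, and the accession-to-line mapping simply copies the values listed above.

```python
import re

HSP_SCORE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")

def parse_score_line(line):
    """Extract bit score, raw score, and E-value from an HSP score line."""
    match = HSP_SCORE.search(line)
    if match is None:
        raise ValueError(f"unrecognized score line: {line!r}")
    return {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "evalue": float(match.group(3)),
    }

# Score lines copied from the NCBI nr hits listed above.
hits = {
    "XP_008457342.1": "HSP 1 Score: 800.8 bits (2067), Expect = 5.3e-228",
    "XP_011658711.1": "HSP 1 Score: 798.1 bits (2060), Expect = 3.4e-227",
    "XP_038894892.1": "HSP 1 Score: 797.0 bits (2057), Expect = 7.6e-227",
    "XP_022998707.1": "HSP 1 Score: 768.5 bits (1983), Expect = 2.9e-218",
    "KAG6607328.1": "HSP 1 Score: 768.1 bits (1982), Expect = 3.8e-218",
}

# Rank accessions from highest to lowest bit score.
ranked = sorted(
    hits,
    key=lambda accession: parse_score_line(hits[accession])["bits"],
    reverse=True,
)
for accession in ranked:
    stats = parse_score_line(hits[accession])
    print(accession, stats["bits"], stats["evalue"])
```

Ranking by bit score rather than E-value keeps the ordering comparable across databases, since E-values depend on the size of the database searched.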

BLAST of HG10021803 vs. ExPASy Swiss-Prot
Match: Q42381 (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 1.2e-97
Identity = 194/405 (47.90%), Positives = 260/405 (64.20%), Query Frame = 0

Query: 9   IIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV--- 68
           ++R YD  + D   VED+ERRCEVGPS ++ LFTD +GDPICRIR+SP Y MLVAE+   
Sbjct: 3   VVREYD-PTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTE 62

Query: 69  DNQLVGVIQGSIKVVTV-------HQAPKD-----RAKVGYVLGLRVAPLFRRRGIGCNL 128
             ++VG+I+G IK VT        H++  D       K+ YVLGLRV+P  RR+GIG  L
Sbjct: 63  KKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFKL 122

Query: 129 VRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPS 188
           V+ +EEWF  N  +Y+Y+ATE DN+ASV LF  K GY+ FR P+ILVNPV  +R  ++  
Sbjct: 123 VKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHR-VNVSR 182

Query: 189 NIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETN 248
            + +  L+   AE LYR   ++TEFFP DID VL +KLSLGT+VA  +     S      
Sbjct: 183 RVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWP 242

Query: 249 GSKSEIVI-PKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPD 308
           GS   +   P+SWA+LSVWN  + F L +  A     +  ++++V+DK  P LKLPSIP 
Sbjct: 243 GSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPS 302

Query: 309 FYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPH 368
            +EPFG +FMYG+  EG    K+V++LC + HN+A A   C V+  E+ GED LR  IPH
Sbjct: 303 VFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAG-GCGVVAAEVAGEDPLRRGIPH 362

Query: 369 WKLLSCPEDLWCIKALKKETRNS-LHELTKTPPTTRPALFVDPRE 397
           WK+LSC EDLWCIK L  +  +  + + TK+PP    ++FVDPRE
Sbjct: 363 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGV--SIFVDPRE 402

BLAST of HG10021803 vs. ExPASy Swiss-Prot
Match: O64815 (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 1.1e-95
Identity = 189/411 (45.99%), Positives = 257/411 (62.53%), Query Frame = 0

Query: 10  IRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---- 69
           +R YD  S D   VED+ERRCEVGP+ ++ LFTD +GDPICR+R+SP Y MLVAE+    
Sbjct: 7   VREYD-PSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPKE 66

Query: 70  DNQLVGVIQGSIKVVT----------VHQAPKD--------RAKVGYVLGLRVAPLFRRR 129
             +LVG+I+G IK VT           H   ++          K+ Y+LGLRV+P  RR+
Sbjct: 67  KKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQ 126

Query: 130 GIGCNLVRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR 189
           GIG  LV+ +E+WF+ N  +Y+Y ATE DN ASV LF  K GY  FR P+ILVNPV  +R
Sbjct: 127 GIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHR 186

Query: 190 SYHIPSNIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSS 249
             +I   + +  L+   AE LYR   ++TEFFP DID VL +KLSLGT+VA  +     S
Sbjct: 187 -VNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGS 246

Query: 250 AKFETNGSKSEIVI-PKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLK 309
                 GS   +   P SWA+LSVWN  + F+L +  A     + +++++++DK  P LK
Sbjct: 247 GSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLK 306

Query: 310 LPSIPDFYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSL 369
           +PSIP  + PFG +FMYG+  EG    K+V+ALC + HN+A     C V+  E+ GE+ L
Sbjct: 307 IPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEG-GCGVVAAEVAGEEPL 366

Query: 370 REEIPHWKLLSCPEDLWCIKALKKE-TRNSLHELTKTPPTTRPALFVDPRE 397
           R  IPHWK+LSC EDLWCIK L ++ +  S+ + TK+PP    ++FVDPRE
Sbjct: 367 RRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPP--GDSIFVDPRE 412

BLAST of HG10021803 vs. ExPASy TrEMBL
Match: A0A5D3BD96 (Putative N-acetyltransferase HLS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00770 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 2.5e-228
Identity = 385/397 (96.98%), Positives = 389/397 (97.98%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F +NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYH+PSNIQIA L
Sbjct: 121 FVINDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDD SSAKFETN SKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDVSSAKFETNSSKSEIT 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 397

BLAST of HG10021803 vs. ExPASy TrEMBL
Match: A0A1S3C5Y9 (probable N-acetyltransferase HLS1 OS=Cucumis melo OX=3656 GN=LOC103497055 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 2.5e-228
Identity = 385/397 (96.98%), Positives = 389/397 (97.98%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F +NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYH+PSNIQIA L
Sbjct: 121 FVINDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDD SSAKFETN SKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDVSSAKFETNSSKSEIT 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 397

BLAST of HG10021803 vs. ExPASy TrEMBL
Match: A0A0A0M0V6 (N-acetyltransferase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G533380 PE=4 SV=1)

HSP 1 Score: 798.1 bits (2060), Expect = 1.7e-227
Identity = 384/397 (96.73%), Positives = 388/397 (97.73%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIGC+LVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGCSLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F +NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSY +PSNIQIA L
Sbjct: 121 FMINDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYQLPSNIQIARL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDD SS KFETNGSKSEI 
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDVSSTKFETNGSKSEIT 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPKSWAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360
           FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE
Sbjct: 301 FMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSCPE 360

Query: 361 DLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           DLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 DLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 397

BLAST of HG10021803 vs. ExPASy TrEMBL
Match: A0A6J1K8Q3 (probable N-acetyltransferase HLS1 OS=Cucurbita maxima OX=3661 GN=LOC111493289 PE=4 SV=1)

HSP 1 Score: 768.5 bits (1983), Expect = 1.4e-218
Identity = 372/399 (93.23%), Positives = 382/399 (95.74%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           M  EEEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MRYEEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIG NLVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGYNLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F  NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR YH+PSNIQI+SL
Sbjct: 121 FVANDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYHLPSNIQISSL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLG+WVAYYKDDD+++ KFETNG KSE+V
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGSWVAYYKDDDSTTPKFETNGGKSEMV 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPK WAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKCWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTG--KLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSC 360
           FMYGVHREG G G  KLV+ALCQYVHNMAAAARDCKVIVTEIGGEDSLR+EIPHWKLLSC
Sbjct: 301 FMYGVHREGKGMGTRKLVKALCQYVHNMAAAARDCKVIVTEIGGEDSLRDEIPHWKLLSC 360

Query: 361 PEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           PEDLWCIKALKKE RNSLHELTKTPPTTRPALFVDPREV
Sbjct: 361 PEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV 399

BLAST of HG10021803 vs. ExPASy TrEMBL
Match: A0A6J1G9S2 (probable N-acetyltransferase HLS1 OS=Cucurbita moschata OX=3662 GN=LOC111452087 PE=4 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 2.6e-217
Identity = 370/399 (92.73%), Positives = 380/399 (95.24%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           M  EEEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM
Sbjct: 1   MRYEEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEW 120
           LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAP FRRRGIG NLVRRLEEW
Sbjct: 61  LVAEVDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPSFRRRGIGYNLVRRLEEW 120

Query: 121 FAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASL 180
           F  NDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR YH+PSNIQI+SL
Sbjct: 121 FVANDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYHLPSNIQISSL 180

Query: 181 KVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIV 240
           KVD+AEFLYRKFMASTEFFPHDIDHVLKHKLSLG+WVAYYKDDD ++ KFETNG KSE+V
Sbjct: 181 KVDVAEFLYRKFMASTEFFPHDIDHVLKHKLSLGSWVAYYKDDDATTPKFETNGGKSEMV 240

Query: 241 IPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300
           IPK WAMLSVWNSGEVFKLRLGKAPLSC IYTESSKVIDKIFPCLKLPSIPDFYEPFGFY
Sbjct: 241 IPKCWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVIDKIFPCLKLPSIPDFYEPFGFY 300

Query: 301 FMYGVHREGTGTG--KLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPHWKLLSC 360
           FMYGVHREG G G  KLV+ALCQYVHNMAAAARDCKVIVTEIGGEDSL +EIPHWKLLSC
Sbjct: 301 FMYGVHREGKGMGTRKLVKALCQYVHNMAAAARDCKVIVTEIGGEDSLGDEIPHWKLLSC 360

Query: 361 PEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           PEDLWCIKALKKE RN+LHELTKTPPTTRPALFVDPREV
Sbjct: 361 PEDLWCIKALKKEARNTLHELTKTPPTTRPALFVDPREV 399

BLAST of HG10021803 vs. TAIR 10
Match: AT2G30090.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 406.8 bits (1044), Expect = 2.1e-113
Identity = 217/401 (54.11%), Positives = 274/401 (68.33%), Query Frame = 0

Query: 5   EEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAE 64
           +E ++IR YD +  DR ++  +E+ CE+G   +  LFTDT+GDPICRIRNSP + MLVA 
Sbjct: 10  DEEVVIRCYDDR-RDRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIMLVAG 69

Query: 65  VDNQLVGVIQGSIKVVTVHQAPKDRAKVGYVLGLRVAPLFRRRGIGCNLVRRLEEWFAVN 124
           V N+LVG IQGS+K V  H       +VGYVLGLRV P +RRRGIG  LVR+LEEWF  +
Sbjct: 70  VGNKLVGSIQGSVKPVEFHD---KSVRVGYVLGLRVVPSYRRRGIGSILVRKLEEWFESH 129

Query: 125 DVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPSNIQIASLKVDI 184
           + DYAYMATEKDNEAS  LFI +LGY  FR PAILVNPV   R   +PS+I I  LKV  
Sbjct: 130 NADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRGLKLPSDIGIRKLKVKE 189

Query: 185 AEFLYRK-FMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETNGSKSEIVIPK 244
           AE LYR+   A+TEFFP DI+ +L++KLS+GTWVAYY + D +                +
Sbjct: 190 AESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNNVDNT----------------R 249

Query: 245 SWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPDFYEPFGFYFMY 304
           SWAMLSVW+S +VFKLR+ +APLS  + T+ SK+       L L  +PD + PFGFYF+Y
Sbjct: 250 SWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTVLPDLFTPFGFYFLY 309

Query: 305 GVHREGTGTGKLVRALCQYVHNMAAA--ARDCKVIVTEI----GGEDSLREEIPHWKLLS 364
           GVH EG   GKLVRALC++VHNMAA      CKV+V E+     G+DSL+  IPHWK+LS
Sbjct: 310 GVHSEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGSNGDDSLQRCIPHWKMLS 369

Query: 365 CPEDLWCIKALK-KETRNSLHELTKTPPTTRPALFVDPREV 398
           C +D+WCIK LK ++ +  L E +K    +R +LFVDPREV
Sbjct: 370 CDDDMWCIKPLKCEKNKFDLSERSK----SRSSLFVDPREV 386

BLAST of HG10021803 vs. TAIR 10
Match: AT4G37580.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 358.2 bits (918), Expect = 8.4e-99
Identity = 194/405 (47.90%), Positives = 260/405 (64.20%), Query Frame = 0

Query: 9   IIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV--- 68
           ++R YD  + D   VED+ERRCEVGPS ++ LFTD +GDPICRIR+SP Y MLVAE+   
Sbjct: 3   VVREYD-PTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTE 62

Query: 69  DNQLVGVIQGSIKVVTV-------HQAPKD-----RAKVGYVLGLRVAPLFRRRGIGCNL 128
             ++VG+I+G IK VT        H++  D       K+ YVLGLRV+P  RR+GIG  L
Sbjct: 63  KKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFKL 122

Query: 129 VRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYHIPS 188
           V+ +EEWF  N  +Y+Y+ATE DN+ASV LF  K GY+ FR P+ILVNPV  +R  ++  
Sbjct: 123 VKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHR-VNVSR 182

Query: 189 NIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKFETN 248
            + +  L+   AE LYR   ++TEFFP DID VL +KLSLGT+VA  +     S      
Sbjct: 183 RVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWP 242

Query: 249 GSKSEIVI-PKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSIPD 308
           GS   +   P+SWA+LSVWN  + F L +  A     +  ++++V+DK  P LKLPSIP 
Sbjct: 243 GSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPS 302

Query: 309 FYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEIPH 368
            +EPFG +FMYG+  EG    K+V++LC + HN+A A   C V+  E+ GED LR  IPH
Sbjct: 303 VFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAG-GCGVVAAEVAGEDPLRRGIPH 362

Query: 369 WKLLSCPEDLWCIKALKKETRNS-LHELTKTPPTTRPALFVDPRE 397
           WK+LSC EDLWCIK L  +  +  + + TK+PP    ++FVDPRE
Sbjct: 363 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGV--SIFVDPRE 402

BLAST of HG10021803 vs. TAIR 10
Match: AT2G23060.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 351.7 bits (901), Expect = 7.9e-97
Identity = 189/411 (45.99%), Positives = 257/411 (62.53%), Query Frame = 0

Query: 10  IRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---- 69
           +R YD  S D   VED+ERRCEVGP+ ++ LFTD +GDPICR+R+SP Y MLVAE+    
Sbjct: 7   VREYD-PSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPKE 66

Query: 70  DNQLVGVIQGSIKVVT----------VHQAPKD--------RAKVGYVLGLRVAPLFRRR 129
             +LVG+I+G IK VT           H   ++          K+ Y+LGLRV+P  RR+
Sbjct: 67  KKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQ 126

Query: 130 GIGCNLVRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR 189
           GIG  LV+ +E+WF+ N  +Y+Y ATE DN ASV LF  K GY  FR P+ILVNPV  +R
Sbjct: 127 GIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYAHR 186

Query: 190 SYHIPSNIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSS 249
             +I   + +  L+   AE LYR   ++TEFFP DID VL +KLSLGT+VA  +     S
Sbjct: 187 -VNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGS 246

Query: 250 AKFETNGSKSEIVI-PKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLK 309
                 GS   +   P SWA+LSVWN  + F+L +  A     + +++++++DK  P LK
Sbjct: 247 GSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLK 306

Query: 310 LPSIPDFYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSL 369
           +PSIP  + PFG +FMYG+  EG    K+V+ALC + HN+A     C V+  E+ GE+ L
Sbjct: 307 IPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEG-GCGVVAAEVAGEEPL 366

Query: 370 REEIPHWKLLSCPEDLWCIKALKKE-TRNSLHELTKTPPTTRPALFVDPRE 397
           R  IPHWK+LSC EDLWCIK L ++ +  S+ + TK+PP    ++FVDPRE
Sbjct: 367 RRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPP--GDSIFVDPRE 412

BLAST of HG10021803 vs. TAIR 10
Match: AT5G67430.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 317.4 bits (812), Expect = 1.6e-86
Identity = 175/407 (43.00%), Positives = 247/407 (60.69%), Query Frame = 0

Query: 1   MGCEEEILIIRSYDGQSADRGRVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKM 60
           MG    ++++R YD    D   VE+LE  CEVG      L  D MGDP+ RIR SP + M
Sbjct: 1   MGKGFNVVVVREYD-PKRDLTSVEELEESCEVGS-----LLVDLMGDPLARIRQSPSFHM 60

Query: 61  LVAEVDNQLVGVIQGSIKVVT-----VHQAPK-----DRAKVGYVLGLRVAPLFRRRGIG 120
           LVAE+ N++VG+I+G+IK+VT     + QA       +  K+ +V GLRV+P +RR GIG
Sbjct: 61  LVAEIGNEIVGMIRGTIKMVTRGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRRMGIG 120

Query: 121 CNLVRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRSYH 180
             LV+RLEEWF  ND  Y+Y+ TE DN ASVKLF  K GY+ FR P  LVNPV ++R   
Sbjct: 121 LKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNHR-VT 180

Query: 181 IPSNIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWVAYYKDDDTSSAKF 240
           +   ++I  L    AE LYR   ++TEFFP DI+ +L +KLSLGT++A  +  D  S   
Sbjct: 181 VSRRVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRGGDNVSGSL 240

Query: 241 ETNGSKSEIVIPKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSKVIDKIFPCLKLPSI 300
                        SWA++S+WNS +V++L++  A     +  +S++V D  FP LK+PS 
Sbjct: 241 PDQTG--------SWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSF 300

Query: 301 PDFYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVIVTEIGGEDSLREEI 360
           P+ ++ F  +FMYG+  EG    ++V ALC + HN+A  +  C V+  E+   + LR  I
Sbjct: 301 PNLFKSFAMHFMYGIGGEGPRAAEMVEALCSHAHNLARKS-GCAVVAAEVASCEPLRVGI 360

Query: 361 PHWKLLSCPEDLWCIKALKKETRNSLHELTKTPPTTRPALFVDPREV 398
           PHWK+LS PEDLWC+K L+ +      + TK+PP    ++FVDPRE+
Sbjct: 361 PHWKVLS-PEDLWCLKRLRYDDDGV--DWTKSPPGL--SIFVDPREI 386

BLAST of HG10021803 vs. TAIR 10
Match: AT2G23060.2 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 288.9 bits (738), Expect = 6.3e-78
Identity = 159/361 (44.04%), Positives = 219/361 (60.66%), Query Frame = 0

Query: 60  MLVAEV----DNQLVGVIQGSIKVVT----------VHQAPKD--------RAKVGYVLG 119
           MLVAE+      +LVG+I+G IK VT           H   ++          K+ Y+LG
Sbjct: 1   MLVAEIGPKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILG 60

Query: 120 LRVAPLFRRRGIGCNLVRRLEEWFAVNDVDYAYMATEKDNEASVKLFINKLGYTNFRVPA 179
           LRV+P  RR+GIG  LV+ +E+WF+ N  +Y+Y ATE DN ASV LF  K GY  FR P+
Sbjct: 61  LRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPS 120

Query: 180 ILVNPVKHYRSYHIPSNIQIASLKVDIAEFLYRKFMASTEFFPHDIDHVLKHKLSLGTWV 239
           ILVNPV  +R  +I   + +  L+   AE LYR   ++TEFFP DID VL +KLSLGT+V
Sbjct: 121 ILVNPVYAHR-VNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFV 180

Query: 240 AYYKDDDTSSAKFETNGSKSEIVI-PKSWAMLSVWNSGEVFKLRLGKAPLSCFIYTESSK 299
           A  +     S      GS   +   P SWA+LSVWN  + F+L +  A     + +++++
Sbjct: 181 AVPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATR 240

Query: 300 VIDKIFPCLKLPSIPDFYEPFGFYFMYGVHREGTGTGKLVRALCQYVHNMAAAARDCKVI 359
           ++DK  P LK+PSIP  + PFG +FMYG+  EG    K+V+ALC + HN+A     C V+
Sbjct: 241 MVDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEG-GCGVV 300

Query: 360 VTEIGGEDSLREEIPHWKLLSCPEDLWCIKALKKE-TRNSLHELTKTPPTTRPALFVDPR 397
             E+ GE+ LR  IPHWK+LSC EDLWCIK L ++ +  S+ + TK+PP    ++FVDPR
Sbjct: 301 AAEVAGEEPLRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPP--GDSIFVDPR 357
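
The percentages in the two TAIR 10 HSP headers above can be recomputed from the raw counts as a quick consistency check. The following is only a minimal sketch: the helper name `percent` and the `hsps` dictionary are hypothetical, and the counts are copied by hand from the headers rather than parsed from BLAST output.

```python
def percent(matches: int, aligned: int) -> str:
    """Return matches/aligned as a percentage string with two decimals."""
    return f"{100 * matches / aligned:.2f}"


# (identical residues, positive-scoring residues, alignment length),
# copied by hand from the HSP headers of the two TAIR 10 matches.
hsps = {
    "AT5G67430.1": (175, 247, 407),
    "AT2G23060.2": (159, 219, 361),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

The denominator is the alignment length including gap columns, so it can differ from the length of either sequence.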

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008457342.1  5.3e-228  96.98     PREDICTED: probable N-acetyltransferase HLS1 [Cucumis melo] >KAA0060026.1 putati...
XP_011658711.1  3.4e-227  96.73     probable N-acetyltransferase HLS1 [Cucumis sativus]
XP_038894892.1  7.6e-227  96.73     probable N-acetyltransferase HLS1 [Benincasa hispida]
XP_022998707.1  2.9e-218  93.23     probable N-acetyltransferase HLS1 [Cucurbita maxima]
KAG6607328.1    3.8e-218  93.23     putative N-acetyltransferase HLS1, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity  Description
Q42381          1.2e-97   47.90     Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 S...
O64815          1.1e-95   45.99     Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23...

Match Name      E-value   Identity  Description
A0A5D3BD96      2.5e-228  96.98     Putative N-acetyltransferase HLS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3C5Y9      2.5e-228  96.98     probable N-acetyltransferase HLS1 OS=Cucumis melo OX=3656 GN=LOC103497055 PE=4 S...
A0A0A0M0V6      1.7e-227  96.73     N-acetyltransferase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_...
A0A6J1K8Q3      1.4e-218  93.23     probable N-acetyltransferase HLS1 OS=Cucurbita maxima OX=3661 GN=LOC111493289 PE...
A0A6J1G9S2      2.6e-217  92.73     probable N-acetyltransferase HLS1 OS=Cucurbita moschata OX=3662 GN=LOC111452087 ...

Match Name      E-value   Identity  Description
AT2G30090.1     2.1e-113  54.11     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT4G37580.1     8.4e-99   47.90     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1     7.9e-97   45.99     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1     1.6e-86   43.00     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.2     6.3e-78   44.04     Acyl-CoA N-acyltransferases (NAT) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term    Source Description                                     Alignment
None       No IPR available            GENE3D       3.40.630.30    -                                                      coord: 31..156; e-value: 8.0E-17; score: 63.6
None       No IPR available            PANTHER      PTHR47370:SF1  ACYL-COA N-ACYLTRANSFERASES (NAT) SUPERFAMILY PROTEIN  coord: 6..397
None       No IPR available            PANTHER      PTHR47370      ACYL-COA N-ACYLTRANSFERASES (NAT) SUPERFAMILY PROTEIN  coord: 6..397
None       No IPR available            CDD          cd04301        NAT_SF                                                 coord: 60..130; e-value: 1.92616E-7; score: 45.7297
IPR000182  GNAT domain                 PFAM         PF00583        Acetyltransf_1                                         coord: 51..150; e-value: 1.1E-13; score: 51.5
IPR000182  GNAT domain                 PROSITE      PS51186        GNAT                                                   coord: 8..178; score: 15.01201
IPR016181  Acyl-CoA N-acyltransferase  SUPERFAMILY  55729          Acyl-CoA N-acyltransferases (Nat)                      coord: 19..151

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10021803.1  HG10021803.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0008080 N-acetyltransferase activity