Cla021794 (gene) Watermelon (97103) v1

NameCla021794
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionAcetyltransferase GNAT family protein (AHRD V1 *-** Q2QNL1_ORYSJ); contains Interpro domain(s) IPR000182 GCN5-related N-acetyltransferase
LocationChr5 : 6282882 .. 6285910 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGGAAAGACTCGAGAGAAGCTGTGAAATTGGGTCTAAATTAAGAGGGGCATCAATTTTCACCAACATGATGGGTGATCCTCTTTGTAGGATTAGATTCTTCCCTCTTCATATAATGTTGGTAAGTTCAAATATTCAACAAAAAAAAAAAAAAAAAAAAAAAAACTTTTTTTACAAAAAGTTTTTTAATTCTTTGTAGGATTAGATTCTTCCCTCTTCATATAATGTTGGTAAGTTCAAATATTCAACAAAAAAAAAAAAAAAAAAAAAAGACTTTTTTTAAAAAATGTTTTTAACTTTATAAATTCCATTTTCATTATGATATTCTTTTACTGTCTTTGTATCATATATATATATATAAACCATTTTTCTTTTGTTTCTCTAAACAGGTGGCTGAGCTGCCGGAAAAGGGTGAGATTGTGGGAGTGGTTAGAGGCTGTATTAAGTCTTTGGGGATTGCCGGCCCCCGTGCCGGGGCCGGAGAAGCTAATACCACGAAGATTGGCTGCATATTAGGACTTCGTGTTTCGCCCGCTCACAGGTTTTTTTTATAGTTATTTTAAATTGAATTTGAATTTCGAAGTAAATAGTATAAACTCCACGTCAATTCAACTATATTCATATAATTTAAACTTATGATAATAATCTTTTTCTAGGACACTGAATGATTTTCTTTTATTATGATTTACAATTAATTTGATTCACAAAATTCAACACACGAGGCTGCGAGCCATTCATCTTTACCATATTAAATGATTATTAACATTTAAAAACTTAGATTGCTAGATCATGATGTATTTATAGTAATTTTGGGATAATTTTTAAATATATAAAAGTGATTTCAAGGTAGCCACATTTTTTTGTAATCCCCCCACCCCCCAGTGCTCATTAGTACTTTATTTAACAGCACTCTAAAACACCATGAAATATTGTTATAATATATTAGAATTTAGTTTTTGTAAGTGTGGAGTAATAATTTGAAGGTTTGGAACTTTAGAAAGTCATAAAAAAACTCACATTTTTAACTGCTTATAGATGACCAATGTTTTTTTTTTTTTTTTTTAAAAATCTTTTTATATAAAGAAAATTTGGTTAAACTTTGAGTATTACCAATGATAAGTGTTTTAAGGAAGCATTTAACAAGAGTTTCCTATAGAAAATTTATATAACAAAGGTTTGAATGTAATTGACATGAACTTTCTCCTTTATGTAAGAAGTTTGATCTCCATCCCCACAAATATTGTATTTAAGTGTTTCAAATAAAAAAATATATTTTAAAAGGTCAACTATATATATATAAATAAAATCATTAATGGGTGAGATTTTGAACTTAAATTGTTGCATATGCAAAAGGCATGCTTAATTTTAACATTTACGAGTGAGCATGGTATTTTGCTTCTCTACTACTTCGATGGCTCATATGTCTAGGGTGCATGAAAAACTTGATATCCAACTTTGTTTTCATTGGAATTTAAATTCTAAATCTATTACCAAATACATATCTGGAAATCTGAAATCCAACTCTGTTTGTATTGAATCCCTTGTTTTCAAATTTCTATTTTTAGATTGCCTACCAAATAGACCATTAATATATATATATATATATATATATATATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATATATATATATATAAAGTTTAGATTTTGAAAAATTTGTGTGAAAGATCACATATTCAAACCGAGTCTCACATAATTCTATCTTTCCTTTATTTAACATTTGTTTTTTAGTTCAACAATCACAAGATAAGGGATCATATCATAGACCTTTGAAATAGTGGGATAAATATTTGAATTCCAAATCAACGCCGAACCCTATCAATTTTTTCATGGGATGGCATCAGTAAGATATTACAGTAAGGATGAGAATTCAAAGCTTCAACCTTCTGGTTGAGGATATATATACCTTAACTAATAGAGGGAACTATATTCGTGTTAACCTAACTTAACTATTGAAATGATTATAGATCTAATTTTCTAAAAGGCTTATTATATTTTATCTCTTATCATTGTTAATGTAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAACAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCATTAATTGTGTTCCCAACAAAAGACATTGTTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACGTAGAGCAAGCCATTTCATTCTACACACACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAATTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTTGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAGATTCTTGAAAAGTGCAAGAAAAAAGTTAATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTTGAGTCGATATGGATTTTTGCATCGAGATTGGCTGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACCACGTCCCACAAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGGTTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTACAAATGTGATTGTTGACCCAAGAGACTTCTAG

mRNA sequence

ATGGTGGAAAGACTCGAGAGAAGCTGTGAAATTGGGTCTAAATTAAGAGGGGCATCAATTTTCACCAACATGATGGGTGATCCTCTTTGTAGGATTAGATTCTTCCCTCTTCATATAATGTTGGTGGCTGAGCTGCCGGAAAAGGGTGAGATTGTGGGAGTGGTTAGAGGCTGTATTAAGTCTTTGGGGATTGCCGGCCCCCGTGCCGGGGCCGGAGAAGCTAATACCACGAAGATTGGCTGCATATTAGGACTTCGTGTTTCGCCCGCTCACAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAACAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCATTAATTGTGTTCCCAACAAAAGACATTGTTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACGTAGAGCAAGCCATTTCATTCTACACACACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAATTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTTGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAGATTCTTGAAAAGTGCAAGAAAAAAGTTAATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTTGAGTCGATATGGATTTTTGCATCGAGATTGGCTGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACCACGTCCCACAAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGGTTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTACAAATGTGATTGTTGACCCAAGAGACTTCTAG

Coding sequence (CDS)

ATGGTGGAAAGACTCGAGAGAAGCTGTGAAATTGGGTCTAAATTAAGAGGGGCATCAATTTTCACCAACATGATGGGTGATCCTCTTTGTAGGATTAGATTCTTCCCTCTTCATATAATGTTGGTGGCTGAGCTGCCGGAAAAGGGTGAGATTGTGGGAGTGGTTAGAGGCTGTATTAAGTCTTTGGGGATTGCCGGCCCCCGTGCCGGGGCCGGAGAAGCTAATACCACGAAGATTGGCTGCATATTAGGACTTCGTGTTTCGCCCGCTCACAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAACAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCATTAATTGTGTTCCCAACAAAAGACATTGTTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACGTAGAGCAAGCCATTTCATTCTACACACACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAATTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTTGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAGATTCTTGAAAAGTGCAAGAAAAAAGTTAATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTTGAGTCGATATGGATTTTTGCATCGAGATTGGCTGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACCACGTCCCACAAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGGTTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTACAAATGTGATTGTTGACCCAAGAGACTTCTAG

Protein sequence

MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIKSLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFYTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDNWYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF
BLAST of Cla021794 vs. Swiss-Prot
Match: HLS1_ARATH (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 5.0e-53
Identity = 143/402 (35.57%), Postives = 215/402 (53.48%), Query Frame = 1

Query: 2   VERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAEL-PEKGEIVGVVRGCIK 61
           VE +ER CE+G   +  S+FT+++GDP+CRIR  P ++MLVAE+  EK EIVG++RGCIK
Sbjct: 16  VEDVERRCEVGPSGK-LSLFTDLLGDPICRIRHSPSYLMLVAEMGTEKKEIVGMIRGCIK 75

Query: 62  SLGIAGPRAGAGEANT--------TKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNG 121
           ++   G +      +         TK+  +LGLRVSP HRR GIG KLV  +EEW  +NG
Sbjct: 76  TV-TCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFKLVKMMEEWFRQNG 135

Query: 122 AHYAFLAIEKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLN 181
           A Y+++A E  N+AS NLFT KC Y +F +  I   P  V+  +  V  +  +IK E ++
Sbjct: 136 AEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNP--VYAHRVNVSRRVTVIKLEPVD 195

Query: 182 VEQAISFYTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDS-EIYQ 241
            E   + Y    +T    +P D D +L  KLSLGT+V+      +     S   S +  +
Sbjct: 196 AE---TLYRIRFSTT-EFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWPGSAKFLE 255

Query: 242 RMPSSWVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVSFGKSFG 301
             P SW V S+WN   ++  ++R +   + +  +  +   K L    K+P+  S  + FG
Sbjct: 256 YPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTL-PFLKLPSIPSVFEPFG 315

Query: 302 FFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMS 361
             F+YGI GEG R  ++V+S+   A  LA+    C  +  E++  DP+   +P +   +S
Sbjct: 316 LHFMYGIGGEGPRAVKMVKSLCAHAHNLAK-AGGCGVVAAEVAGEDPLRRGIP-HWKVLS 375

Query: 362 RINDNWYLKRLSVHSDDEKDEILLS-KDMETATNVIVDPRDF 393
              D W +KRL    DD  D ++          ++ VDPR+F
Sbjct: 376 CDEDLWCIKRL---GDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of Cla021794 vs. Swiss-Prot
Match: HLS1L_ARATH (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.2e-51
Identity = 140/408 (34.31%), Postives = 217/408 (53.19%), Query Frame = 1

Query: 2   VERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAEL--PEKGEIVGVVRGCI 61
           VE +ER CE+G   +  S+FT+++GDP+CR+R  P ++MLVAE+   EK E+VG++RGCI
Sbjct: 19  VEDVERRCEVGPAGK-LSLFTDLLGDPICRVRHSPSYLMLVAEIGPKEKKELVGMIRGCI 78

Query: 62  KSL--GIAGPRAGAGEANT-----------TKIGCILGLRVSPAHRRLGIGLKLVHSVEE 121
           K++  GI   R       +           TK+  ILGLRVSP HRR GIG KLV ++E+
Sbjct: 79  KTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHRRQGIGFKLVKAMED 138

Query: 122 WVIRNGAHYAFLAIEKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEII 181
           W  +NGA Y++ A E  N AS NLFT KC Y +F +  I   P  V+  +  +  +  +I
Sbjct: 139 WFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNP--VYAHRVNISRRVTVI 198

Query: 182 KTEKLNVEQAISFYTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKD 241
           K E  + E  + +     TT+   +P D D +L  KLSLGT+V+      +     S   
Sbjct: 199 KLEPSDAE--LLYRLRFSTTE--FFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSRSWPG 258

Query: 242 S-EIYQRMPSSWVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVS 301
           S +  +  P SW V S+WN   +++ ++R +   + +  +  +   K L    K+P+  +
Sbjct: 259 SAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTL-PFLKIPSIPA 318

Query: 302 FGKSFGFFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQ 361
             + FG  F+YGI GEG R  ++V+++   A  LA+ E  C  +  E++  +P+   +P 
Sbjct: 319 VFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAK-EGGCGVVAAEVAGEEPLRRGIP- 378

Query: 362 NNSSMSRINDNWYLKRLSV-HSDDEKDEILLSKDMETATNVIVDPRDF 393
           +   +S   D W +KRL   +SD    +   S       ++ VDPR+F
Sbjct: 379 HWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKS---PPGDSIFVDPREF 413

BLAST of Cla021794 vs. TrEMBL
Match: A0A0A0LAQ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G166310 PE=4 SV=1)

HSP 1 Score: 716.5 bits (1848), Expect = 1.8e-203
Identity = 360/395 (91.14%), Postives = 375/395 (94.94%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           MVE+LERSCEIGSK++GASIFTNMMGDPLCRI FFPLHIMLVAELPE GEIVGVVRGCIK
Sbjct: 22  MVEKLERSCEIGSKIKGASIFTNMMGDPLCRITFFPLHIMLVAELPENGEIVGVVRGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
           SLGIA    G GEANT KIGCILGLRVSPAHRR+GIGLKLVHSVEEW+IRNGA+YAFLAI
Sbjct: 82  SLGIARAGVGVGEANTMKIGCILGLRVSPAHRRMGIGLKLVHSVEEWIIRNGANYAFLAI 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFP-TKDIVISKGEIIKTEKLNVEQAISF 180
           EKKNKASKNLF  KCNYVKFSSLVIFRQPLIVFP TK+++ISKGEIIKTEKLN+EQAISF
Sbjct: 142 EKKNKASKNLFAKKCNYVKFSSLVIFRQPLIVFPTTKEVIISKGEIIKTEKLNIEQAISF 201

Query: 181 YTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT-HLICSQKDS-EIYQRMPSSW 240
           YT+TLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT HLICSQKDS +IYQRMPSSW
Sbjct: 202 YTNTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSSW 261

Query: 241 VVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVSFGKSFGFFFLYG 300
           VVFSIWNTCKAYKFQIRESK+DQLLPLRF KSARKK ISCFKMPNSVSFGKSFGFFFLYG
Sbjct: 262 VVFSIWNTCKAYKFQIRESKNDQLLPLRFFKSARKKFISCFKMPNSVSFGKSFGFFFLYG 321

Query: 301 IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDNW 360
           IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVP+ N SMSR+NDN 
Sbjct: 322 IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDNL 381

Query: 361 YLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           YLKRLSVHSDDEKDE LLSKDMETA NVIVDPRDF
Sbjct: 382 YLKRLSVHSDDEKDETLLSKDMETAANVIVDPRDF 415

BLAST of Cla021794 vs. TrEMBL
Match: B9HX15_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s15480g PE=4 SV=2)

HSP 1 Score: 420.6 bits (1080), Expect = 2.1e-114
Identity = 224/396 (56.57%), Postives = 282/396 (71.21%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LER CEIGS  +  SIFTNMMGDPL RIRF+P+H+MLVAEL E GE+VGVV+GCIK
Sbjct: 22  VVGKLERKCEIGSN-KEVSIFTNMMGDPLSRIRFYPVHVMLVAELRENGELVGVVKGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
            +G    R GA   +  ++GCILGLRVSP HRR+GIGL+LV SVEEW+I NGAHY FLA 
Sbjct: 82  CVGT---RFGA---SYVRLGCILGLRVSPRHRRMGIGLELVKSVEEWLIGNGAHYTFLAT 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFT+KCNY+ F+SLVIF QP  + P K +     + IK EKL  +QAI  Y
Sbjct: 142 EKNNVASTNLFTSKCNYMNFTSLVIFVQPASL-PVKGL----SQDIKIEKLQTDQAIYLY 201

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVF 240
            +   +K  +YP D D ILKEKLS+GTWVSYF +E+W  L  ++++ +I  R PSSW +F
Sbjct: 202 NNKFKSKD-IYPTDVDAILKEKLSIGTWVSYFKEEEWISLHSNERNEDIITRTPSSWAMF 261

Query: 241 SIWNTCKAYKFQIRESKHDQLLPLRF----LKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           SIWN+C+AYK  IR+S H    P +F    L  AR K+  C K P   S  K FGF FL+
Sbjct: 262 SIWNSCEAYKLHIRKSHH----PFKFFHATLSHARDKIFPCLKFPICHSLQKPFGFLFLF 321

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           G++GEGER+ EL++SIW FASRLAE+ KDCK I++EL VSDP+I HVPQ  SSMS IND 
Sbjct: 322 GLYGEGERLQELMKSIWSFASRLAENVKDCKVIISELGVSDPLIEHVPQ-ESSMSFINDL 381

Query: 361 WYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           WYLK+++ +  D+ +E ++    +   NV VDPRDF
Sbjct: 382 WYLKKVNDNITDDNEEPVVMG--QVTGNVFVDPRDF 397

BLAST of Cla021794 vs. TrEMBL
Match: A0A061EQU3_THECC (Acyl-CoA N-acyltransferases superfamily protein OS=Theobroma cacao GN=TCM_021361 PE=4 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 4.6e-114
Identity = 225/398 (56.53%), Postives = 279/398 (70.10%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LE++C+IGS  +GASIFTNM GDPLCRI F+PLH+MLVAEL E GE+VGV+RGCIK
Sbjct: 48  VVGKLEKNCDIGSNNKGASIFTNMTGDPLCRIGFYPLHLMLVAELCENGELVGVIRGCIK 107

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
            +G    + G       K+GCILGLRVSP HRR+GIGLKLV ++EEW+I NGAHY FLA 
Sbjct: 108 HVGT---KFGGTHV---KLGCILGLRVSPRHRRMGIGLKLVRAMEEWLINNGAHYTFLAT 167

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFT KCNY   SSLVIF QP+I F  + +     + IK EKL+ +QAIS Y
Sbjct: 168 EKNNVASTNLFTAKCNYRNLSSLVIFVQPIISFAMEGL----SQDIKVEKLSTDQAISLY 227

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVF 240
            + L  K  +Y  D D ILKEKLSLGTWVSYF Q++W  L   +KD +I    P SW +F
Sbjct: 228 DNKLRGK-DIYLTDIDAILKEKLSLGTWVSYFKQDEWIGLHSKEKDGDIISTSPRSWAMF 287

Query: 241 SIWNTCKAYKFQIRESKHDQLLPLRF----LKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           SIWN+C+ YK  I++S      PL+F    L  AR K+  C K P   S  K FGF FLY
Sbjct: 288 SIWNSCETYKIHIKKSH-----PLKFFHATLSHARDKIFPCLKTPLCDSLEKPFGFLFLY 347

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           G+ GEGER+GEL++S W FASRLAE+ KDCK I+TEL VSDP+I HVP+  SSMSR++D 
Sbjct: 348 GLHGEGERLGELMKSAWSFASRLAENVKDCKVIITELGVSDPLIEHVPR-ESSMSRVDDL 407

Query: 361 WYLKRL--SVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           WYLK++  S+H   EK+++ +   M    NV+VDPRDF
Sbjct: 408 WYLKKVNGSIH---EKNDLGM---MGELGNVVVDPRDF 422

BLAST of Cla021794 vs. TrEMBL
Match: I1MRQ6_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_17G033300 PE=4 SV=2)

HSP 1 Score: 418.3 bits (1074), Expect = 1.0e-113
Identity = 226/401 (56.36%), Postives = 279/401 (69.58%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LE++CEIG+K +G SIFTNMMGDPL RIRF+PLH+MLVAEL E  E+VGVVRGCIK
Sbjct: 24  VVGKLEKNCEIGTK-KGVSIFTNMMGDPLSRIRFYPLHVMLVAELLESKELVGVVRGCIK 83

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
           S+            +  KIGCILGLRVSP HRR GIGLKLV+SVEEW++RNGA YAFLA 
Sbjct: 84  SMRTPSE-------SLLKIGCILGLRVSPTHRRKGIGLKLVNSVEEWMLRNGAEYAFLAT 143

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFTNKC YV  SSLVIF  P+I FP K I     + IK EK+N+EQAIS Y
Sbjct: 144 EKNNDASINLFTNKCKYVSLSSLVIFVHPIISFPAKHI----PKDIKIEKVNMEQAISLY 203

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQED-----WTHLICSQKDSEIYQRMPS 240
             TL  K  +YP+D D ILKEKLSLGTWVSY+  E        +++ S  +  I   + S
Sbjct: 204 RRTLRAK-ELYPLDMDSILKEKLSLGTWVSYYKDEGCRLNLQRNMVESVDEDIITNEITS 263

Query: 241 SWVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKS----ARKKLISCFKMPNSVSFGKSFG 300
           SW++FSIWNTC+AY+ Q+++S+     PLRFL +    AR K+  C +M  S S    FG
Sbjct: 264 SWIIFSIWNTCEAYRLQLKKSQ-----PLRFLHTTLNHARDKIFPCLRMSVSESLCTPFG 323

Query: 301 FFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMS 360
           F FLYG+ GEGE +GEL+ESIW F SRL E  KDC+ ++TEL   D ++NHVP   +SMS
Sbjct: 324 FLFLYGLHGEGENLGELMESIWRFTSRLGESLKDCRVVITELGFGDALVNHVPL-TASMS 383

Query: 361 RINDNWYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
            I+D WY KR+S HSD+  DE+L+ + +    NV VDPRDF
Sbjct: 384 CIDDIWYTKRISSHSDENDDELLMKRQI---GNVFVDPRDF 402

BLAST of Cla021794 vs. TrEMBL
Match: A0A0B2P6Q8_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_004323 PE=4 SV=1)

HSP 1 Score: 418.3 bits (1074), Expect = 1.0e-113
Identity = 226/401 (56.36%), Postives = 279/401 (69.58%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LE++CEIG+K +G SIFTNMMGDPL RIRF+PLH+MLVAEL E  E+VGVVRGCIK
Sbjct: 24  VVGKLEKNCEIGTK-KGVSIFTNMMGDPLSRIRFYPLHVMLVAELLESKELVGVVRGCIK 83

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
           S+            +  KIGCILGLRVSP HRR GIGLKLV+SVEEW++RNGA YAFLA 
Sbjct: 84  SMRTPSE-------SLLKIGCILGLRVSPTHRRKGIGLKLVNSVEEWMLRNGAEYAFLAT 143

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFTNKC YV  SSLVIF  P+I FP K I     + IK EK+N+EQAIS Y
Sbjct: 144 EKNNDASINLFTNKCKYVSLSSLVIFVHPIISFPAKHI----PKDIKIEKVNMEQAISLY 203

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQED-----WTHLICSQKDSEIYQRMPS 240
             TL  K  +YP+D D ILKEKLSLGTWVSY+  E        +++ S  +  I   + S
Sbjct: 204 RRTLRAK-ELYPLDMDSILKEKLSLGTWVSYYKDEGCRLNLQRNMVESVDEDIITNEITS 263

Query: 241 SWVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKS----ARKKLISCFKMPNSVSFGKSFG 300
           SW++FSIWNTC+AY+ Q+++S+     PLRFL +    AR K+  C +M  S S    FG
Sbjct: 264 SWIIFSIWNTCEAYRLQLKKSQ-----PLRFLHTTLNHARDKIFPCLRMSVSESLCTPFG 323

Query: 301 FFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMS 360
           F FLYG+ GEGE +GEL+ESIW F SRL E  KDC+ ++TEL   D ++NHVP   +SMS
Sbjct: 324 FLFLYGLHGEGENLGELMESIWRFTSRLGESLKDCRVVITELGFGDALVNHVPL-TASMS 383

Query: 361 RINDNWYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
            I+D WY KR+S HSD+  DE+L+ + +    NV VDPRDF
Sbjct: 384 CIDDIWYTKRISSHSDENDDELLMKRQI---GNVFVDPRDF 402

BLAST of Cla021794 vs. NCBI nr
Match: gi|449433437|ref|XP_004134504.1| (PREDICTED: probable N-acetyltransferase HLS1-like [Cucumis sativus])

HSP 1 Score: 716.5 bits (1848), Expect = 2.6e-203
Identity = 360/395 (91.14%), Postives = 375/395 (94.94%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           MVE+LERSCEIGSK++GASIFTNMMGDPLCRI FFPLHIMLVAELPE GEIVGVVRGCIK
Sbjct: 22  MVEKLERSCEIGSKIKGASIFTNMMGDPLCRITFFPLHIMLVAELPENGEIVGVVRGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
           SLGIA    G GEANT KIGCILGLRVSPAHRR+GIGLKLVHSVEEW+IRNGA+YAFLAI
Sbjct: 82  SLGIARAGVGVGEANTMKIGCILGLRVSPAHRRMGIGLKLVHSVEEWIIRNGANYAFLAI 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFP-TKDIVISKGEIIKTEKLNVEQAISF 180
           EKKNKASKNLF  KCNYVKFSSLVIFRQPLIVFP TK+++ISKGEIIKTEKLN+EQAISF
Sbjct: 142 EKKNKASKNLFAKKCNYVKFSSLVIFRQPLIVFPTTKEVIISKGEIIKTEKLNIEQAISF 201

Query: 181 YTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT-HLICSQKDS-EIYQRMPSSW 240
           YT+TLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT HLICSQKDS +IYQRMPSSW
Sbjct: 202 YTNTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSSW 261

Query: 241 VVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVSFGKSFGFFFLYG 300
           VVFSIWNTCKAYKFQIRESK+DQLLPLRF KSARKK ISCFKMPNSVSFGKSFGFFFLYG
Sbjct: 262 VVFSIWNTCKAYKFQIRESKNDQLLPLRFFKSARKKFISCFKMPNSVSFGKSFGFFFLYG 321

Query: 301 IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDNW 360
           IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVP+ N SMSR+NDN 
Sbjct: 322 IFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDNL 381

Query: 361 YLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           YLKRLSVHSDDEKDE LLSKDMETA NVIVDPRDF
Sbjct: 382 YLKRLSVHSDDEKDETLLSKDMETAANVIVDPRDF 415

BLAST of Cla021794 vs. NCBI nr
Match: gi|659076948|ref|XP_008438951.1| (PREDICTED: LOW QUALITY PROTEIN: probable N-acetyltransferase HLS1 [Cucumis melo])

HSP 1 Score: 713.8 bits (1841), Expect = 1.7e-202
Identity = 363/396 (91.67%), Postives = 375/396 (94.70%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           MVE+LERSCEIGSK++GASIFTNMMGDPLCRI FFPLHIMLVAELPE GEIVGVVRGCIK
Sbjct: 22  MVEKLERSCEIGSKIKGASIFTNMMGDPLCRITFFPLHIMLVAELPENGEIVGVVRGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
           SLGIA    G GEANT KIGCILGLRVSPAHRR+GIGLKLVHSVEEWVIRNGA+YAFLAI
Sbjct: 82  SLGIARSGVGVGEANTMKIGCILGLRVSPAHRRMGIGLKLVHSVEEWVIRNGANYAFLAI 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFP-TKD-IVISKGEIIKTEKLNVEQAIS 180
           EKKNKASKNLFT KCNYVKFSSLVIFRQPLIVFP TKD  +ISKGEIIKTEKLN+EQAIS
Sbjct: 142 EKKNKASKNLFTKKCNYVKFSSLVIFRQPLIVFPTTKDHNIISKGEIIKTEKLNIEQAIS 201

Query: 181 FYTHTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT-HLICSQKDS-EIYQRMPSS 240
           FYT+TLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWT HLICSQKDS +IYQRMPSS
Sbjct: 202 FYTNTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSS 261

Query: 241 WVVFSIWNTCKAYKFQIRESKHDQLLPLRFLKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           WVVFSIWNTCKAYKFQIRESK DQLLPLRFLKSARKK +SCFKMPNSVSFGKSFGFFFLY
Sbjct: 262 WVVFSIWNTCKAYKFQIRESKSDQLLPLRFLKSARKKFVSCFKMPNSVSFGKSFGFFFLY 321

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVP+ N SMSR+NDN
Sbjct: 322 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDN 381

Query: 361 WYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
            YLKRLSVHSDDEKDE LLSKDMETA NVIVDPRDF
Sbjct: 382 LYLKRLSVHSDDEKDETLLSKDMETAANVIVDPRDF 416

BLAST of Cla021794 vs. NCBI nr
Match: gi|802704350|ref|XP_012084085.1| (PREDICTED: probable N-acetyltransferase HLS1 [Jatropha curcas])

HSP 1 Score: 421.0 bits (1081), Expect = 2.3e-114
Identity = 222/396 (56.06%), Postives = 281/396 (70.96%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LE++CEIGS  +  SIFTNMMGDPLCRIRF+P+H+MLVAEL E GE+VGVVRGCIK
Sbjct: 22  VVGKLEKNCEIGSN-KEVSIFTNMMGDPLCRIRFYPVHVMLVAELRENGELVGVVRGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
                G R GA       +GCILGLRVSP +RR+GIGLKLV SVEEW++ NGA+Y F+A 
Sbjct: 82  LC--EGTRFGA---TFVSLGCILGLRVSPKYRRMGIGLKLVKSVEEWLVGNGANYIFIAT 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFT++CNY+ FSSLV+F QP      K++ +   E IK EKL + QAIS Y
Sbjct: 142 EKSNVASTNLFTSRCNYMNFSSLVVFVQPANSLTLKNLSL---EDIKIEKLQIRQAISLY 201

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVF 240
            +TL  K  +YP D D ILKE LSLGTWVSYF +E+W  L    K+ +I  + PSSW +F
Sbjct: 202 NNTLRGKD-IYPTDIDAILKENLSLGTWVSYFKEEEWIILHNDNKEEDIISKTPSSWAIF 261

Query: 241 SIWNTCKAYKFQIRESKHDQLLPLRF----LKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           SIWN+C+AYK  IR+S H    PL+F    L  AR K+  C K+P   S  K FGF FLY
Sbjct: 262 SIWNSCEAYKLHIRKSHH----PLKFFHATLSHARDKIFPCLKLPICDSLQKPFGFLFLY 321

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           G++GEG R+ EL+ SIW F SRLAED KDCK I+TEL VSDP+I++VP+   SMS I+D 
Sbjct: 322 GLYGEGTRLQELMNSIWSFTSRLAEDVKDCKVIITELGVSDPLIDYVPR-EPSMSFIDDL 381

Query: 361 WYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           WYLK+++ +S D  +++++ +    A +V VDPRDF
Sbjct: 382 WYLKKVNGNSGDRNEQVVMGQ----AGDVFVDPRDF 398

BLAST of Cla021794 vs. NCBI nr
Match: gi|645233456|ref|XP_008223355.1| (PREDICTED: probable N-acetyltransferase HLS1-like [Prunus mume])

HSP 1 Score: 420.6 bits (1080), Expect = 3.0e-114
Identity = 226/396 (57.07%), Postives = 284/396 (71.72%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LER+C++GSK RG SIFTNMMGDP CRIRF+PLH+MLVAEL E GE+VGVVRGC+K
Sbjct: 22  VVGKLERNCDLGSK-RGVSIFTNMMGDPCCRIRFYPLHVMLVAELLENGELVGVVRGCMK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
            +G        G   + +IGC+LGLRVSP HRR+GIGLKL++SVEEW++R GA Y FLA 
Sbjct: 82  HVG-------TGFGASYEIGCVLGLRVSPTHRRMGIGLKLMNSVEEWLLRKGAQYTFLAT 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFT KCN+V  SSLVIF QP I  P  D++  +   IK EKL+++QAI  Y
Sbjct: 142 EKSNIASTNLFTFKCNFVNLSSLVIFVQP-ICSPIDDLLPQE---IKIEKLHIDQAIFLY 201

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVF 240
            + L  K  +YP D D+ILKEKLSLGTWV YF ++ W +L   +   +I  +  SSWV+F
Sbjct: 202 KNKLRGK-DMYPTDIDVILKEKLSLGTWVCYFEEQGWINLNTEENGKDITSKTQSSWVIF 261

Query: 241 SIWNTCKAYKFQIRESKHDQLLPLR----FLKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           SIWNTC+AYK  IR+S      PLR     L  ARKK++SC K+P  VS   SFGF FLY
Sbjct: 262 SIWNTCEAYKLHIRKSH-----PLRSFHASLSHARKKILSCLKLPVRVSMQSSFGFLFLY 321

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           GI GEGE++GEL++S+W FASRL ++ KD K I+TEL + DP+I HVP+ +S+MS IND 
Sbjct: 322 GIHGEGEKLGELMKSVWNFASRLGQNVKDSKLILTELGLCDPLIKHVPK-DSNMSCINDV 381

Query: 361 WYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           WY+K L  H+ DEKDE+LL   +    NV VDPR+F
Sbjct: 382 WYVKSLISHA-DEKDELLLKGQL---GNVFVDPREF 394

BLAST of Cla021794 vs. NCBI nr
Match: gi|566190975|ref|XP_002314944.2| (hypothetical protein POPTR_0010s15480g [Populus trichocarpa])

HSP 1 Score: 420.6 bits (1080), Expect = 3.0e-114
Identity = 224/396 (56.57%), Postives = 282/396 (71.21%), Query Frame = 1

Query: 1   MVERLERSCEIGSKLRGASIFTNMMGDPLCRIRFFPLHIMLVAELPEKGEIVGVVRGCIK 60
           +V +LER CEIGS  +  SIFTNMMGDPL RIRF+P+H+MLVAEL E GE+VGVV+GCIK
Sbjct: 22  VVGKLERKCEIGSN-KEVSIFTNMMGDPLSRIRFYPVHVMLVAELRENGELVGVVKGCIK 81

Query: 61  SLGIAGPRAGAGEANTTKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAI 120
            +G    R GA   +  ++GCILGLRVSP HRR+GIGL+LV SVEEW+I NGAHY FLA 
Sbjct: 82  CVGT---RFGA---SYVRLGCILGLRVSPRHRRMGIGLELVKSVEEWLIGNGAHYTFLAT 141

Query: 121 EKKNKASKNLFTNKCNYVKFSSLVIFRQPLIVFPTKDIVISKGEIIKTEKLNVEQAISFY 180
           EK N AS NLFT+KCNY+ F+SLVIF QP  + P K +     + IK EKL  +QAI  Y
Sbjct: 142 EKNNVASTNLFTSKCNYMNFTSLVIFVQPASL-PVKGL----SQDIKIEKLQTDQAIYLY 201

Query: 181 THTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVF 240
            +   +K  +YP D D ILKEKLS+GTWVSYF +E+W  L  ++++ +I  R PSSW +F
Sbjct: 202 NNKFKSKD-IYPTDVDAILKEKLSIGTWVSYFKEEEWISLHSNERNEDIITRTPSSWAMF 261

Query: 241 SIWNTCKAYKFQIRESKHDQLLPLRF----LKSARKKLISCFKMPNSVSFGKSFGFFFLY 300
           SIWN+C+AYK  IR+S H    P +F    L  AR K+  C K P   S  K FGF FL+
Sbjct: 262 SIWNSCEAYKLHIRKSHH----PFKFFHATLSHARDKIFPCLKFPICHSLQKPFGFLFLF 321

Query: 301 GIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINHVPQNNSSMSRINDN 360
           G++GEGER+ EL++SIW FASRLAE+ KDCK I++EL VSDP+I HVPQ  SSMS IND 
Sbjct: 322 GLYGEGERLQELMKSIWSFASRLAENVKDCKVIISELGVSDPLIEHVPQ-ESSMSFINDL 381

Query: 361 WYLKRLSVHSDDEKDEILLSKDMETATNVIVDPRDF 393
           WYLK+++ +  D+ +E ++    +   NV VDPRDF
Sbjct: 382 WYLKKVNDNITDDNEEPVVMG--QVTGNVFVDPRDF 397

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HLS1_ARATH5.0e-5335.57Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1[more]
HLS1L_ARATH1.2e-5134.31Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2... [more]
Match NameE-valueIdentityDescription
A0A0A0LAQ6_CUCSA1.8e-20391.14Uncharacterized protein OS=Cucumis sativus GN=Csa_3G166310 PE=4 SV=1[more]
B9HX15_POPTR2.1e-11456.57Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s15480g PE=4 SV=2[more]
A0A061EQU3_THECC4.6e-11456.53Acyl-CoA N-acyltransferases superfamily protein OS=Theobroma cacao GN=TCM_021361... [more]
I1MRQ6_SOYBN1.0e-11356.36Uncharacterized protein OS=Glycine max GN=GLYMA_17G033300 PE=4 SV=2[more]
A0A0B2P6Q8_GLYSO1.0e-11356.36Uncharacterized protein OS=Glycine soja GN=glysoja_004323 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|449433437|ref|XP_004134504.1|2.6e-20391.14PREDICTED: probable N-acetyltransferase HLS1-like [Cucumis sativus][more]
gi|659076948|ref|XP_008438951.1|1.7e-20291.67PREDICTED: LOW QUALITY PROTEIN: probable N-acetyltransferase HLS1 [Cucumis melo][more]
gi|802704350|ref|XP_012084085.1|2.3e-11456.06PREDICTED: probable N-acetyltransferase HLS1 [Jatropha curcas][more]
gi|645233456|ref|XP_008223355.1|3.0e-11457.07PREDICTED: probable N-acetyltransferase HLS1-like [Prunus mume][more]
gi|566190975|ref|XP_002314944.2|3.0e-11456.57hypothetical protein POPTR_0010s15480g [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000182GNAT_dom
IPR016181Acyl_CoA_acyltransferase
Vocabulary: Molecular Function
TermDefinition
GO:0008080N-acetyltransferase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042967 acyl-carrier-protein biosynthetic process
biological_process GO:0006475 internal protein amino acid acetylation
biological_process GO:0008150 biological_process
biological_process GO:0018002 N-terminal peptidyl-glutamic acid acetylation
biological_process GO:0017198 N-terminal peptidyl-serine acetylation
biological_process GO:0006474 N-terminal protein amino acid acetylation
cellular_component GO:0031248 protein acetyltransferase complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0022626 cytosolic ribosome
cellular_component GO:0031415 NatA complex
molecular_function GO:0004596 peptide alpha-N-acetyltransferase activity
molecular_function GO:1990190 peptide-glutamate-N-acetyltransferase activity
molecular_function GO:1990189 peptide-serine-N-acetyltransferase activity
molecular_function GO:0008080 N-acetyltransferase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0016746 transferase activity, transferring acyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla021794Cla021794.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000182GNAT domainPFAMPF00583Acetyltransf_1coord: 47..137
score: 8.5
IPR000182GNAT domainPROFILEPS51186GNATcoord: 1..202
score: 1
IPR016181Acyl-CoA N-acyltransferaseGENE3DG3DSA:3.40.630.30coord: 21..131
score: 2.9
IPR016181Acyl-CoA N-acyltransferaseunknownSSF55729Acyl-CoA N-acyltransferases (Nat)coord: 34..144
score: 2.7
NoneNo IPR availablePANTHERPTHR23091N-TERMINAL ACETYLTRANSFERASEcoord: 1..196
score: 1.5
NoneNo IPR availablePANTHERPTHR23091:SF239SUBFAMILY NOT NAMEDcoord: 1..196
score: 1.5

The following gene(s) are paralogous to this gene:

None