Lsi05G015920 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G015920
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Acyl-CoA N-acyltransferases (NAT) superfamily protein
Location: chr05 : 23690802 .. 23693565 (+)
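As a quick sanity check on the location field, the sketch below parses a string in the `chr : start .. end (strand)` layout and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr05 : 23690802 .. 23693565 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1

chrom, start, end, strand = parse_location("chr05 : 23690802 .. 23693565 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 2764 bp; with half-open coordinates it would be one base shorter.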
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TTTCATTATGATATTCTTTTACTGTCTTTATATATTAACCATTTTTCTTTTGTTTCTCTAAACAGGTGGCCGAGCTGCCGGAAAAGGGAGAGATTGTGGGAGTGGTTAGAGGATGTATTAAGTCTTTGGGGATTGCCGGCCCCGGTGCCGGGGCCGGAGAAGCTAATACGATGAAGATTGGCTGCATACTGGGACTTCGTGTTTCCCCTGCACACAGGTTTTTTTTCTTAATTATTATTTTTAAGCGAATTCAACTTTTAAATGGATAGAATAAGTTTGATGTTAATTGAATTATGTTAATTTTAGTTAGGTTGATTTTGATATAAAACTTAAAATTAGTGATATTTTAAGGAGGAGAGTGAATGATTTTATTTTTTTTTATTTTTTTTATCATTATGATTTAAATATTAATTTGATTCACATGTTGTGAACCATTCATCTGTTAAATTATTACTAAAATTTTAAAACTTAAATTGTGGGATGAGTTTTTTTTAATATATAAAAGTGATTTCAAGGCAAGCACATCTTTTTGTAATTTTTTTGCATTTAAAAGAACTCTAAGGAACGTTTTAGATACGCGTAATATAATGTAATCCAAAATTCATGTTTTGATTGAGCGTTTTAAGTCTAATTCGTAATACTAAATTTATTTCGTTCTGACAGTTTCAATTTAATCCTATTTTCTATCGTTTTCATGCTTCACGATTCTATCTTCATTCTGCCTTTAATTCTACACGTATTTCAATACGCATTTGATAACTCTCCAACTCCCCAAACAATTCCTTTATCAAGAATTTTGTAATACCTCATGGGATTGTCTAACCAAACACCTTGAAATATTGTTATAATAATATTAGAATTTAGTTTTGTAAGTGTGGAGTAATGATTTGAAGGCTTGGAACTTTAGAGAAAGTCATAAACAAACTCAAATTTTTAACTGCTTATAGATGCCCAATGCTTTTTGCTCAAAATATGCATTTTTTTATTTTTATTTTTTTTAAAAAAAGACAATTTGGTTAAACTTTTAGAGTAGGTTTTACCAACAAAACTTTTTGTAAATTTTACCGTAATCAAAATTGGTTTAATTGTTGGAAATTTCTAAAAATAGACAAAAAAAAAAAAAAATTAAGTGTTTTAAGGAAGCATTTAACAAAAGCATCCTAAAGAATACTTTTATAACAAGTGTTTTAGTTAAAGTGAATTTAGCTCAGCGGTTATTAACATACACTCTTCCATCCCAACAATATTACACTAAAAATAGTTGTTTCAAATAAAAATATATTTTAAAAGGTCAACTTAAAATATATTTCTTCTTAATAAAATAATTAAATGTTAGTTGAGAATTCGAACTTAAATAGTTGCATAGGCAAATGCATGCTTAACCTTAACATTAGCGAGCGAGCATGCAATTTTGCTTCTCTACTATTTTAATAGGCTTAAGTTTTGGGAGCATGAAAAAACTTTATATCCAAAAAAAAAGATGTTTAGATTTTGAATAATTCTATCTTCACTTTATTTAATATTTGTTTTATTTCAACAATTACGAGATAGGGGATTGAATCATCGACTTTTGAAATAGTGAAATATATTTTTGAGATCCAAATCAATGTCGAACATCTCTATCAGTTTTCTCACGAGAGAGCATTGGGAACAAGGATGAAAATTTAAACCTTAAACCTTTTGGTTGAGGATATATATGCCTTAGCTAGTAGAGAGAACTGCGTTCGTGTCGGCTGAAACCTAAATATTAAAATGATTATAGATCTAATTTTCTGTAAAGCTTACAATATTTTATCTCTTTATCATTGGTGTGTAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAAGAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCACTTATTGTGTTC
CCAACAAAAGACATTATTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACATAGAGCAAGCAATTTCATTCTACACAAACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAACTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTCGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAAGTTCTTGAAAAGTGCAAGAAAAAAGTTCATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTCGAGTCGATATGGATTTTCGCGTCGAGATTGGCCGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACTACGTCCCACGAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGATTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTGCAAATGTGATTGTTGACCCAAGAGACTTCTAGATTTCAATATATATTCATACCCCCTTTCCTCCTTTTTGCACATC

mRNA sequence

TTTCATTATGATATTCTTTTACTGTCTTTATATATTAACCATTTTTCTTTTGTTTCTCTAAACAGGTGGCCGAGCTGCCGGAAAAGGGAGAGATTGTGGGAGTGGTTAGAGGATGTATTAAGTCTTTGGGGATTGCCGGCCCCGGTGCCGGGGCCGGAGAAGCTAATACGATGAAGATTGGCTGCATACTGGGACTTCGTGTTTCCCCTGCACACAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAAGAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCACTTATTGTGTTCCCAACAAAAGACATTATTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACATAGAGCAAGCAATTTCATTCTACACAAACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAACTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTCGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAAGTTCTTGAAAAGTGCAAGAAAAAAGTTCATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTCGAGTCGATATGGATTTTCGCGTCGAGATTGGCCGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACTACGTCCCACGAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGATTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTGCAAATGTGATTGTTGACCCAAGAGACTTCTAGATTTCAATATATATTCATACCCCCTTTCCTCCTTTTTGCACATC

Coding sequence (CDS)

ATGAAGATTGGCTGCATACTGGGACTTCGTGTTTCCCCTGCACACAGGAGGTTGGGAATTGGACTAAAGCTTGTACACTCAGTTGAAGAATGGGTTATAAGAAATGGAGCTCATTATGCATTTCTAGCAATAGAGAAGAAGAACAAAGCCTCAAAGAATCTCTTCACTAAGAAATGCAACTATGTAAAATTCAGCTCATTGGTGATTTTCAGACAGCCACTTATTGTGTTCCCAACAAAAGACATTATTATTTCTAAAGGAGAAATAATAAAAACAGAGAAACTCAACATAGAGCAAGCAATTTCATTCTACACAAACACTCTCACAACTAAAGGAGGAGTTTATCCAATGGATTTTGATATGATTTTGAAGGAAAAACTGAGTCTTGGCACATGGGTTTCTTATTTCAATCAAGAAGATTGGACCCATTTGATTTGTTCACAAAAAGATTCAGAGATTTACCAAAGAATGCCAAGTTCTTGGGTCGTGTTTAGCATATGGAATACCTGCAAAGCATACAAGTTTCAAATAAGGGAATCAAAACATGATCAATTATTACCTCTAAAGTTCTTGAAAAGTGCAAGAAAAAAGTTCATTTCCTGCTTCAAAATGCCAAATTCTGTGTCCTTTGGGAAGTCATTTGGATTCTTCTTCCTGTATGGGATATTTGGGGAGGGGGAGAGAGTGGGAGAGCTTGTCGAGTCGATATGGATTTTCGCGTCGAGATTGGCCGAAGACGAGAAGGATTGCAAGGCTATTGTTACTGAATTGTCTGTTTCTGATCCAATCATAAACTACGTCCCACGAAACAACTCGTCCATGTCTCGCATCAATGATAACTGGTACCTGAAAAGATTGAGTGTACATAGTGATGATGAAAAGGATGAAATATTGTTGTCAAAAGATATGGAAACAGCTGCAAATGTGATTGTTGACCCAAGAGACTTCTAG

Protein sequence

MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNYVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDMILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRESKHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEILLSKDMETAANVIVDPRDF
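The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is an illustrative helper, not a function from the database or from any particular library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGAAGATTGGCTGCATACTGGGACTTCGT"))
print(translate("CCAAGAGACTTCTAG"))
```

The two fragments translate to `MKIGCILGLR` and `PRDF`, which agree with the start and end of the protein sequence shown above. Passing the full CDS string to `translate` would allow the same comparison over the whole protein.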
BLAST of Lsi05G015920 vs. Swiss-Prot
Match: HLS1_ARATH (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 1.7e-35
Identity = 105/317 (33.12%), Positives = 162/317 (51.10%), Query Frame = 1

Query: 2   KIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNY 61
           K+  +LGLRVSP HRR GIG KLV  +EEW  +NGA Y+++A E  N+AS NLFT KC Y
Sbjct: 99  KLAYVLGLRVSPFHRRQGIGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGY 158

Query: 62  VKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDM 121
            +F +  I   P  V+  +  +  +  +IK E ++ E       +T       +P D D 
Sbjct: 159 SEFRTPSILVNP--VYAHRVNVSRRVTVIKLEPVDAETLYRIRFSTTE----FFPRDIDS 218

Query: 122 ILKEKLSLGTWVSYFNQEDWTHLICSQKDS-EIYQRMPSSWVVFSIWNTCKAYKFQIRES 181
           +L  KLSLGT+V+      +     S   S +  +  P SW V S+WN   ++  ++R +
Sbjct: 219 VLNNKLSLGTFVAVPRGSCYGSGSGSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGA 278

Query: 182 KHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFA 241
              + +  K  +    K +   K+P+  S  + FG  F+YGI GEG R  ++V+S+   A
Sbjct: 279 SRLRRVVAKTTRVV-DKTLPFLKLPSIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHA 338

Query: 242 SRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEILLS 301
             LA+    C  +  E++  DP+   +P +   +S   D W +KRL    DD  D ++  
Sbjct: 339 HNLAK-AGGCGVVAAEVAGEDPLRRGIP-HWKVLSCDEDLWCIKRL---GDDYSDGVVGD 398

Query: 302 -KDMETAANVIVDPRDF 317
                   ++ VDPR+F
Sbjct: 399 WTKSPPGVSIFVDPREF 403
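The percentages in an HSP summary line are simply the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them from the line text; the regular expression and the name `hsp_percentages` are illustrative assumptions about this page's layout, not part of BLAST itself.

```python
import re

# Accepts both "Positives" and the misspelling "Postives" that appears
# in some exports of this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, positive, pos_len = (int(g) for g in m.groups())
    return (round(100 * identical / ident_len, 2),
            round(100 * positive / pos_len, 2))

line = "Identity = 105/317 (33.12%), Positives = 162/317 (51.10%), Query Frame = 1"
print(hsp_percentages(line))
```

For the HLS1_ARATH hit this gives 33.12 and 51.1, matching the values reported in the summary line.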

BLAST of Lsi05G015920 vs. Swiss-Prot
Match: HLS1L_ARATH (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.1e-34
Identity = 105/317 (33.12%), Positives = 166/317 (52.37%), Query Frame = 1

Query: 2   KIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNY 61
           K+  ILGLRVSP HRR GIG KLV ++E+W  +NGA Y++ A E  N AS NLFT KC Y
Sbjct: 109 KLAYILGLRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGY 168

Query: 62  VKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDM 121
            +F +  I   P  V+  +  I  +  +IK E  + E  + +     TT+   +P D D 
Sbjct: 169 AEFRTPSILVNP--VYAHRVNISRRVTVIKLEPSDAE--LLYRLRFSTTE--FFPRDIDS 228

Query: 122 ILKEKLSLGTWVSYFNQEDWTHLICSQKDS-EIYQRMPSSWVVFSIWNTCKAYKFQIRES 181
           +L  KLSLGT+V+      +     S   S +  +  P SW V S+WN   +++ ++R +
Sbjct: 229 VLNNKLSLGTFVAVPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGA 288

Query: 182 KHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFA 241
              + +  K  +   K  +   K+P+  +  + FG  F+YGI GEG R  ++V+++   A
Sbjct: 289 SRLRRVVSKATRMVDKT-LPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHA 348

Query: 242 SRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSV-HSDDEKDEILL 301
             LA+ E  C  +  E++  +P+   +P +   +S   D W +KRL   +SD    +   
Sbjct: 349 HNLAK-EGGCGVVAAEVAGEEPLRRGIP-HWKVLSCAEDLWCIKRLGEDYSDGSVGDWTK 408

Query: 302 SKDMETAANVIVDPRDF 317
           S   +   ++ VDPR+F
Sbjct: 409 SPPGD---SIFVDPREF 413

BLAST of Lsi05G015920 vs. TrEMBL
Match: A0A0A0LAQ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G166310 PE=4 SV=1)

HSP 1 Score: 592.8 bits (1527), Expect = 2.4e-166
Identity = 300/319 (94.04%), Positives = 310/319 (97.18%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           MKIGCILGLRVSPAHRR+GIGLKLVHSVEEW+IRNGA+YAFLAIEKKNKASKNLF KKCN
Sbjct: 98  MKIGCILGLRVSPAHRRMGIGLKLVHSVEEWIIRNGANYAFLAIEKKNKASKNLFAKKCN 157

Query: 61  YVKFSSLVIFRQPLIVFPT-KDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF 120
           YVKFSSLVIFRQPLIVFPT K++IISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF
Sbjct: 158 YVKFSSLVIFRQPLIVFPTTKEVIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF 217

Query: 121 DMILKEKLSLGTWVSYFNQEDWTH-LICSQKDSE-IYQRMPSSWVVFSIWNTCKAYKFQI 180
           DMILKEKLSLGTWVSYFNQEDWTH LICSQKDS+ IYQRMPSSWVVFSIWNTCKAYKFQI
Sbjct: 218 DMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSSWVVFSIWNTCKAYKFQI 277

Query: 181 RESKHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW 240
           RESK+DQLLPL+F KSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW
Sbjct: 278 RESKNDQLLPLRFFKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW 337

Query: 241 IFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEI 300
           IFASRLAEDEKDCKAIVTELSVSDPIIN+VPR N SMSR+NDN YLKRLSVHSDDEKDE 
Sbjct: 338 IFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDNLYLKRLSVHSDDEKDET 397

Query: 301 LLSKDMETAANVIVDPRDF 317
           LLSKDMETAANVIVDPRDF
Sbjct: 398 LLSKDMETAANVIVDPRDF 415

BLAST of Lsi05G015920 vs. TrEMBL
Match: A0A061EQU3_THECC (Acyl-CoA N-acyltransferases superfamily protein OS=Theobroma cacao GN=TCM_021361 PE=4 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 1.4e-89
Identity = 182/322 (56.52%), Positives = 222/322 (68.94%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +K+GCILGLRVSP HRR+GIGLKLV ++EEW+I NGAHY FLA EK N AS NLFT KCN
Sbjct: 118 VKLGCILGLRVSPRHRRMGIGLKLVRAMEEWLINNGAHYTFLATEKNNVASTNLFTAKCN 177

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y   SSLVIF QP+I F  + +     + IK EKL+ +QAIS Y N L  K  +Y  D D
Sbjct: 178 YRNLSSLVIFVQPIISFAMEGL----SQDIKVEKLSTDQAISLYDNKLRGK-DIYLTDID 237

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKEKLSLGTWVSYF Q++W  L   +KD +I    P SW +FSIWN+C+ YK  I++S
Sbjct: 238 AILKEKLSLGTWVSYFKQDEWIGLHSKEKDGDIISTSPRSWAMFSIWNSCETYKIHIKKS 297

Query: 181 KHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
                 PLKF    L  AR K   C K P   S  K FGF FLYG+ GEGER+GEL++S 
Sbjct: 298 H-----PLKFFHATLSHARDKIFPCLKTPLCDSLEKPFGFLFLYGLHGEGERLGELMKSA 357

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRL--SVHSDDEK 300
           W FASRLAE+ KDCK I+TEL VSDP+I +VPR  SSMSR++D WYLK++  S+H   EK
Sbjct: 358 WSFASRLAENVKDCKVIITELGVSDPLIEHVPR-ESSMSRVDDLWYLKKVNGSIH---EK 417

Query: 301 DEILLSKDMETAANVIVDPRDF 317
           +++ +   M    NV+VDPRDF
Sbjct: 418 NDLGM---MGELGNVVVDPRDF 422

BLAST of Lsi05G015920 vs. TrEMBL
Match: B9HX15_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s15480g PE=4 SV=2)

HSP 1 Score: 337.8 bits (865), Expect = 1.4e-89
Identity = 177/320 (55.31%), Positives = 224/320 (70.00%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +++GCILGLRVSP HRR+GIGL+LV SVEEW+I NGAHY FLA EK N AS NLFT KCN
Sbjct: 91  VRLGCILGLRVSPRHRRMGIGLELVKSVEEWLIGNGAHYTFLATEKNNVASTNLFTSKCN 150

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y+ F+SLVIF QP  + P K +     + IK EKL  +QAI  Y N   +K  +YP D D
Sbjct: 151 YMNFTSLVIFVQPASL-PVKGL----SQDIKIEKLQTDQAIYLYNNKFKSKD-IYPTDVD 210

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKEKLS+GTWVSYF +E+W  L  ++++ +I  R PSSW +FSIWN+C+AYK  IR+S
Sbjct: 211 AILKEKLSIGTWVSYFKEEEWISLHSNERNEDIITRTPSSWAMFSIWNSCEAYKLHIRKS 270

Query: 181 KHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
            H    P KF    L  AR K   C K P   S  K FGF FL+G++GEGER+ EL++SI
Sbjct: 271 HH----PFKFFHATLSHARDKIFPCLKFPICHSLQKPFGFLFLFGLYGEGERLQELMKSI 330

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDE 300
           W FASRLAE+ KDCK I++EL VSDP+I +VP+  SSMS IND WYLK+++ +  D+ +E
Sbjct: 331 WSFASRLAENVKDCKVIISELGVSDPLIEHVPQ-ESSMSFINDLWYLKKVNDNITDDNEE 390

Query: 301 ILLSKDMETAANVIVDPRDF 317
            ++    +   NV VDPRDF
Sbjct: 391 PVVMG--QVTGNVFVDPRDF 397

BLAST of Lsi05G015920 vs. TrEMBL
Match: A0A151SSU0_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_004169 PE=4 SV=1)

HSP 1 Score: 336.7 bits (862), Expect = 3.2e-89
Identity = 176/320 (55.00%), Positives = 224/320 (70.00%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +KIGCILGLRVSP HRR G+GLKLV SVEEW++RNGA YA LA EK N AS+NLFT KC 
Sbjct: 31  LKIGCILGLRVSPTHRRKGVGLKLVTSVEEWMLRNGAEYACLATEKNNDASRNLFTNKCK 90

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           YV  SSLVIF  P I FP+K I     + IK EK+N++QAIS Y  TL  K  +YP+D D
Sbjct: 91  YVSLSSLVIFLHP-ISFPSKHI----SKDIKIEKVNMDQAISLYRRTLMAK-ELYPLDMD 150

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKE LSLGTWVSY+  E W +L     +  I   + SSW++FSIWNTC+AYK Q+++S
Sbjct: 151 AILKENLSLGTWVSYYKDEGWLNLQVESHEDLITNEITSSWIIFSIWNTCEAYKLQLKKS 210

Query: 181 KHDQLLPLKFLKS----ARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
           +     PL+FL +    AR K   C +M  S SF + FGF FLYG+ GEGE +GEL+ESI
Sbjct: 211 Q-----PLRFLHTTLNHARDKIFPCLRMSVSDSFCRPFGFLFLYGLHGEGENLGELMESI 270

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDE 300
           W F SRL E  +DC+ ++TEL   DP++N+VP+  +SMS I+D W+ KRLS +SD++ DE
Sbjct: 271 WRFTSRLGESLRDCRVVITELGFGDPLVNHVPQ-TASMSCIDDIWFTKRLSSNSDEKDDE 330

Query: 301 ILLSKDMETAANVIVDPRDF 317
           +L+ + +    NV VDPRDF
Sbjct: 331 LLMKRQI---GNVFVDPRDF 335

BLAST of Lsi05G015920 vs. TrEMBL
Match: B9RZQ2_RICCO (N-acetyltransferase, putative OS=Ricinus communis GN=RCOM_1000500 PE=4 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 7.0e-89
Identity = 179/323 (55.42%), Positives = 223/323 (69.04%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +++GCILGLRVSP HRR+GIGLKLV SVEEW++ NGAHY FLA EK N AS NLFT KCN
Sbjct: 91  VRLGCILGLRVSPKHRRMGIGLKLVKSVEEWLVGNGAHYFFLATEKNNVASTNLFTSKCN 150

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y+ F SLVIF Q   + P K +     E IK EKL I+QAIS Y N L  K  +YP D D
Sbjct: 151 YINFGSLVIFVQQASL-PVKSL----SEDIKIEKLQIDQAISLYNNKLRGKD-IYPTDID 210

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQK---DSEIYQRMPSSWVVFSIWNTCKAYKFQI 180
            +LKEKLSLGTWVSYF +++W  L  ++K   D +I  + PSSWV+FSIWN+C+AYK  I
Sbjct: 211 ALLKEKLSLGTWVSYFKEDEWIILHNNEKNHEDEDILSKTPSSWVIFSIWNSCEAYKLHI 270

Query: 181 RESKHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELV 240
           R+S H    PLKF    L  AR K + C K+P   S  K FGF FLYG++GEG R+ EL+
Sbjct: 271 RKSHH----PLKFFHATLSHARDKILPCLKLPICDSLQKPFGFLFLYGLYGEGARLQELM 330

Query: 241 ESIWIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDE 300
            +IWIF SR+AE+ KDCK I TEL V+DP++ YVP +  SMS I+D WYLK+++  +   
Sbjct: 331 RAIWIFTSRMAENVKDCKVITTELGVTDPLMQYVP-HEPSMSFIDDLWYLKKVNGITTGS 390

Query: 301 KDEILLSKDMETAANVIVDPRDF 317
            DE++    M  A N+ VDPRDF
Sbjct: 391 NDELMA---MGQAGNLFVDPRDF 399

BLAST of Lsi05G015920 vs. TAIR10
Match: AT4G37580.1 (AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 151.4 bits (381), Expect = 9.5e-37
Identity = 105/317 (33.12%), Positives = 162/317 (51.10%), Query Frame = 1

Query: 2   KIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNY 61
           K+  +LGLRVSP HRR GIG KLV  +EEW  +NGA Y+++A E  N+AS NLFT KC Y
Sbjct: 99  KLAYVLGLRVSPFHRRQGIGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGY 158

Query: 62  VKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDM 121
            +F +  I   P  V+  +  +  +  +IK E ++ E       +T       +P D D 
Sbjct: 159 SEFRTPSILVNP--VYAHRVNVSRRVTVIKLEPVDAETLYRIRFSTTE----FFPRDIDS 218

Query: 122 ILKEKLSLGTWVSYFNQEDWTHLICSQKDS-EIYQRMPSSWVVFSIWNTCKAYKFQIRES 181
           +L  KLSLGT+V+      +     S   S +  +  P SW V S+WN   ++  ++R +
Sbjct: 219 VLNNKLSLGTFVAVPRGSCYGSGSGSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGA 278

Query: 182 KHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFA 241
              + +  K  +    K +   K+P+  S  + FG  F+YGI GEG R  ++V+S+   A
Sbjct: 279 SRLRRVVAKTTRVV-DKTLPFLKLPSIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHA 338

Query: 242 SRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEILLS 301
             LA+    C  +  E++  DP+   +P +   +S   D W +KRL    DD  D ++  
Sbjct: 339 HNLAK-AGGCGVVAAEVAGEDPLRRGIP-HWKVLSCDEDLWCIKRL---GDDYSDGVVGD 398

Query: 302 -KDMETAANVIVDPRDF 317
                   ++ VDPR+F
Sbjct: 399 WTKSPPGVSIFVDPREF 403

BLAST of Lsi05G015920 vs. TAIR10
Match: AT2G23060.1 (AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 148.7 bits (374), Expect = 6.2e-36
Identity = 105/317 (33.12%), Positives = 166/317 (52.37%), Query Frame = 1

Query: 2   KIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNY 61
           K+  ILGLRVSP HRR GIG KLV ++E+W  +NGA Y++ A E  N AS NLFT KC Y
Sbjct: 109 KLAYILGLRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGY 168

Query: 62  VKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDM 121
            +F +  I   P  V+  +  I  +  +IK E  + E  + +     TT+   +P D D 
Sbjct: 169 AEFRTPSILVNP--VYAHRVNISRRVTVIKLEPSDAE--LLYRLRFSTTE--FFPRDIDS 228

Query: 122 ILKEKLSLGTWVSYFNQEDWTHLICSQKDS-EIYQRMPSSWVVFSIWNTCKAYKFQIRES 181
           +L  KLSLGT+V+      +     S   S +  +  P SW V S+WN   +++ ++R +
Sbjct: 229 VLNNKLSLGTFVAVPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGA 288

Query: 182 KHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFA 241
              + +  K  +   K  +   K+P+  +  + FG  F+YGI GEG R  ++V+++   A
Sbjct: 289 SRLRRVVSKATRMVDKT-LPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHA 348

Query: 242 SRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSV-HSDDEKDEILL 301
             LA+ E  C  +  E++  +P+   +P +   +S   D W +KRL   +SD    +   
Sbjct: 349 HNLAK-EGGCGVVAAEVAGEEPLRRGIP-HWKVLSCAEDLWCIKRLGEDYSDGSVGDWTK 408

Query: 302 SKDMETAANVIVDPRDF 317
           S   +   ++ VDPR+F
Sbjct: 409 SPPGD---SIFVDPREF 413

BLAST of Lsi05G015920 vs. TAIR10
Match: AT2G30090.1 (AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 148.3 bits (373), Expect = 8.1e-36
Identity = 102/320 (31.87%), Positives = 159/320 (49.69%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +++G +LGLRV P++RR GIG  LV  +EEW   + A YA++A EK N+AS  LF  +  
Sbjct: 91  VRVGYVLGLRVVPSYRRRGIGSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLG 150

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           YV F +  I   P  V P + + +     I   KL +++A S Y   +      +P D +
Sbjct: 151 YVVFRNPAILVNP--VNPGRGLKLPSD--IGIRKLKVKEAESLYRRNVAATTEFFPDDIN 210

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            IL+ KLS+GTWV+Y+N  D T                 SW + S+W++ K +K +I  +
Sbjct: 211 KILRNKLSIGTWVAYYNNVDNTR----------------SWAMLSVWDSSKVFKLRIERA 270

Query: 181 KHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWIFA 240
               LL  K  K     F+S   +         FGF+FLYG+  EG   G+LV ++    
Sbjct: 271 PLSYLLLTKVSK-LFGNFLSLLGLTVLPDLFTPFGFYFLYGVHSEGPHCGKLVRALCEHV 330

Query: 241 SRLA--EDEKDCKAIVTEL---SVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKD 300
             +A   D   CK +V E+   S  D  +     +   +S  +D W +K L      EK+
Sbjct: 331 HNMAALNDGCACKVVVVEVDKGSNGDDSLQRCIPHWKMLSCDDDMWCIKPLKC----EKN 385

Query: 301 EILLSKDMETAANVIVDPRD 316
           +  LS+  ++ +++ VDPR+
Sbjct: 391 KFDLSERSKSRSSLFVDPRE 385

BLAST of Lsi05G015920 vs. TAIR10
Match: AT5G67430.1 (AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 140.2 bits (352), Expect = 2.2e-33
Identity = 104/317 (32.81%), Positives = 155/317 (48.90%), Query Frame = 1

Query: 2   KIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCNY 61
           K+  + GLRVSP +RR+GIGLKLV  +EEW +RN A Y+++  E  N AS  LFT+K  Y
Sbjct: 95  KLAFVSGLRVSPFYRRMGIGLKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGY 154

Query: 62  VKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFDM 121
            KF +        +V P  +  ++    +K  KL    A S Y N  +T    +P D + 
Sbjct: 155 SKFRT-----PTFLVNPVFNHRVTVSRRVKIIKLAPSDAESLYRNRFSTT-EFFPSDINS 214

Query: 122 ILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRESK 181
           IL  KLSLGT+++     D          S        SW V SIWN+   Y+ Q++ + 
Sbjct: 215 ILTNKLSLGTYLAVPRGGD--------NVSGSLPDQTGSWAVISIWNSKDVYRLQVKGAS 274

Query: 182 HDQLLPLKFLKSARKKFISCF---KMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIWI 241
             +    + L  + + F   F   K+P+  +  KSF   F+YGI GEG R  E+VE++  
Sbjct: 275 RLK----RMLAKSTRVFDGAFPFLKIPSFPNLFKSFAMHFMYGIGGEGPRAAEMVEALCS 334

Query: 242 FASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEIL 301
            A  LA  +  C  +  E++  +P+   +P  +  +    D W LKRL  + DD  D   
Sbjct: 335 HAHNLAR-KSGCAVVAAEVASCEPLRVGIP--HWKVLSPEDLWCLKRLR-YDDDGVD--- 385

Query: 302 LSKDMETAANVIVDPRD 316
                    ++ VDPR+
Sbjct: 395 -WTKSPPGLSIFVDPRE 385

BLAST of Lsi05G015920 vs. NCBI nr
Match: gi|449433437|ref|XP_004134504.1| (PREDICTED: probable N-acetyltransferase HLS1-like [Cucumis sativus])

HSP 1 Score: 592.8 bits (1527), Expect = 3.5e-166
Identity = 300/319 (94.04%), Positives = 310/319 (97.18%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           MKIGCILGLRVSPAHRR+GIGLKLVHSVEEW+IRNGA+YAFLAIEKKNKASKNLF KKCN
Sbjct: 98  MKIGCILGLRVSPAHRRMGIGLKLVHSVEEWIIRNGANYAFLAIEKKNKASKNLFAKKCN 157

Query: 61  YVKFSSLVIFRQPLIVFPT-KDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF 120
           YVKFSSLVIFRQPLIVFPT K++IISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF
Sbjct: 158 YVKFSSLVIFRQPLIVFPTTKEVIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDF 217

Query: 121 DMILKEKLSLGTWVSYFNQEDWTH-LICSQKDSE-IYQRMPSSWVVFSIWNTCKAYKFQI 180
           DMILKEKLSLGTWVSYFNQEDWTH LICSQKDS+ IYQRMPSSWVVFSIWNTCKAYKFQI
Sbjct: 218 DMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSSWVVFSIWNTCKAYKFQI 277

Query: 181 RESKHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW 240
           RESK+DQLLPL+F KSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW
Sbjct: 278 RESKNDQLLPLRFFKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESIW 337

Query: 241 IFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDEI 300
           IFASRLAEDEKDCKAIVTELSVSDPIIN+VPR N SMSR+NDN YLKRLSVHSDDEKDE 
Sbjct: 338 IFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDNLYLKRLSVHSDDEKDET 397

Query: 301 LLSKDMETAANVIVDPRDF 317
           LLSKDMETAANVIVDPRDF
Sbjct: 398 LLSKDMETAANVIVDPRDF 415

BLAST of Lsi05G015920 vs. NCBI nr
Match: gi|659076948|ref|XP_008438951.1| (PREDICTED: LOW QUALITY PROTEIN: probable N-acetyltransferase HLS1 [Cucumis melo])

HSP 1 Score: 590.1 bits (1520), Expect = 2.3e-165
Identity = 303/320 (94.69%), Positives = 310/320 (96.88%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           MKIGCILGLRVSPAHRR+GIGLKLVHSVEEWVIRNGA+YAFLAIEKKNKASKNLFTKKCN
Sbjct: 98  MKIGCILGLRVSPAHRRMGIGLKLVHSVEEWVIRNGANYAFLAIEKKNKASKNLFTKKCN 157

Query: 61  YVKFSSLVIFRQPLIVFPT-KDI-IISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMD 120
           YVKFSSLVIFRQPLIVFPT KD  IISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMD
Sbjct: 158 YVKFSSLVIFRQPLIVFPTTKDHNIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMD 217

Query: 121 FDMILKEKLSLGTWVSYFNQEDWTH-LICSQKDSE-IYQRMPSSWVVFSIWNTCKAYKFQ 180
           FDMILKEKLSLGTWVSYFNQEDWTH LICSQKDS+ IYQRMPSSWVVFSIWNTCKAYKFQ
Sbjct: 218 FDMILKEKLSLGTWVSYFNQEDWTHHLICSQKDSDQIYQRMPSSWVVFSIWNTCKAYKFQ 277

Query: 181 IRESKHDQLLPLKFLKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
           IRESK DQLLPL+FLKSARKKF+SCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI
Sbjct: 278 IRESKSDQLLPLRFLKSARKKFVSCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 337

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDE 300
           WIFASRLAEDEKDCKAIVTELSVSDPIIN+VPR N SMSR+NDN YLKRLSVHSDDEKDE
Sbjct: 338 WIFASRLAEDEKDCKAIVTELSVSDPIINHVPR-NVSMSRVNDNLYLKRLSVHSDDEKDE 397

Query: 301 ILLSKDMETAANVIVDPRDF 317
            LLSKDMETAANVIVDPRDF
Sbjct: 398 TLLSKDMETAANVIVDPRDF 416

BLAST of Lsi05G015920 vs. NCBI nr
Match: gi|802704350|ref|XP_012084085.1| (PREDICTED: probable N-acetyltransferase HLS1 [Jatropha curcas])

HSP 1 Score: 342.4 bits (877), Expect = 8.3e-91
Identity = 179/320 (55.94%), Positives = 223/320 (69.69%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           + +GCILGLRVSP +RR+GIGLKLV SVEEW++ NGA+Y F+A EK N AS NLFT +CN
Sbjct: 92  VSLGCILGLRVSPKYRRMGIGLKLVKSVEEWLVGNGANYIFIATEKSNVASTNLFTSRCN 151

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y+ FSSLV+F QP      K++ +   E IK EKL I QAIS Y NTL  K  +YP D D
Sbjct: 152 YMNFSSLVVFVQPANSLTLKNLSL---EDIKIEKLQIRQAISLYNNTLRGKD-IYPTDID 211

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKE LSLGTWVSYF +E+W  L    K+ +I  + PSSW +FSIWN+C+AYK  IR+S
Sbjct: 212 AILKENLSLGTWVSYFKEEEWIILHNDNKEEDIISKTPSSWAIFSIWNSCEAYKLHIRKS 271

Query: 181 KHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
            H    PLKF    L  AR K   C K+P   S  K FGF FLYG++GEG R+ EL+ SI
Sbjct: 272 HH----PLKFFHATLSHARDKIFPCLKLPICDSLQKPFGFLFLYGLYGEGTRLQELMNSI 331

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDE 300
           W F SRLAED KDCK I+TEL VSDP+I+YVPR   SMS I+D WYLK+++ +S D  ++
Sbjct: 332 WSFTSRLAEDVKDCKVIITELGVSDPLIDYVPR-EPSMSFIDDLWYLKKVNGNSGDRNEQ 391

Query: 301 ILLSKDMETAANVIVDPRDF 317
           +++ +    A +V VDPRDF
Sbjct: 392 VVMGQ----AGDVFVDPRDF 398

BLAST of Lsi05G015920 vs. NCBI nr
Match: gi|566190975|ref|XP_002314944.2| (hypothetical protein POPTR_0010s15480g [Populus trichocarpa])

HSP 1 Score: 337.8 bits (865), Expect = 2.0e-89
Identity = 177/320 (55.31%), Positives = 224/320 (70.00%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +++GCILGLRVSP HRR+GIGL+LV SVEEW+I NGAHY FLA EK N AS NLFT KCN
Sbjct: 91  VRLGCILGLRVSPRHRRMGIGLELVKSVEEWLIGNGAHYTFLATEKNNVASTNLFTSKCN 150

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y+ F+SLVIF QP  + P K +     + IK EKL  +QAI  Y N   +K  +YP D D
Sbjct: 151 YMNFTSLVIFVQPASL-PVKGL----SQDIKIEKLQTDQAIYLYNNKFKSKD-IYPTDVD 210

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKEKLS+GTWVSYF +E+W  L  ++++ +I  R PSSW +FSIWN+C+AYK  IR+S
Sbjct: 211 AILKEKLSIGTWVSYFKEEEWISLHSNERNEDIITRTPSSWAMFSIWNSCEAYKLHIRKS 270

Query: 181 KHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
            H    P KF    L  AR K   C K P   S  K FGF FL+G++GEGER+ EL++SI
Sbjct: 271 HH----PFKFFHATLSHARDKIFPCLKFPICHSLQKPFGFLFLFGLYGEGERLQELMKSI 330

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRLSVHSDDEKDE 300
           W FASRLAE+ KDCK I++EL VSDP+I +VP+  SSMS IND WYLK+++ +  D+ +E
Sbjct: 331 WSFASRLAENVKDCKVIISELGVSDPLIEHVPQ-ESSMSFINDLWYLKKVNDNITDDNEE 390

Query: 301 ILLSKDMETAANVIVDPRDF 317
            ++    +   NV VDPRDF
Sbjct: 391 PVVMG--QVTGNVFVDPRDF 397

BLAST of Lsi05G015920 vs. NCBI nr
Match: gi|590661866|ref|XP_007035791.1| (Acyl-CoA N-acyltransferases superfamily protein [Theobroma cacao])

HSP 1 Score: 337.8 bits (865), Expect = 2.0e-89
Identity = 182/322 (56.52%), Positives = 222/322 (68.94%), Query Frame = 1

Query: 1   MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN 60
           +K+GCILGLRVSP HRR+GIGLKLV ++EEW+I NGAHY FLA EK N AS NLFT KCN
Sbjct: 118 VKLGCILGLRVSPRHRRMGIGLKLVRAMEEWLINNGAHYTFLATEKNNVASTNLFTAKCN 177

Query: 61  YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD 120
           Y   SSLVIF QP+I F  + +     + IK EKL+ +QAIS Y N L  K  +Y  D D
Sbjct: 178 YRNLSSLVIFVQPIISFAMEGL----SQDIKVEKLSTDQAISLYDNKLRGK-DIYLTDID 237

Query: 121 MILKEKLSLGTWVSYFNQEDWTHLICSQKDSEIYQRMPSSWVVFSIWNTCKAYKFQIRES 180
            ILKEKLSLGTWVSYF Q++W  L   +KD +I    P SW +FSIWN+C+ YK  I++S
Sbjct: 238 AILKEKLSLGTWVSYFKQDEWIGLHSKEKDGDIISTSPRSWAMFSIWNSCETYKIHIKKS 297

Query: 181 KHDQLLPLKF----LKSARKKFISCFKMPNSVSFGKSFGFFFLYGIFGEGERVGELVESI 240
                 PLKF    L  AR K   C K P   S  K FGF FLYG+ GEGER+GEL++S 
Sbjct: 298 H-----PLKFFHATLSHARDKIFPCLKTPLCDSLEKPFGFLFLYGLHGEGERLGELMKSA 357

Query: 241 WIFASRLAEDEKDCKAIVTELSVSDPIINYVPRNNSSMSRINDNWYLKRL--SVHSDDEK 300
           W FASRLAE+ KDCK I+TEL VSDP+I +VPR  SSMSR++D WYLK++  S+H   EK
Sbjct: 358 WSFASRLAENVKDCKVIITELGVSDPLIEHVPR-ESSMSRVDDLWYLKKVNGSIH---EK 417

Query: 301 DEILLSKDMETAANVIVDPRDF 317
           +++ +   M    NV+VDPRDF
Sbjct: 418 NDLGM---MGELGNVVVDPRDF 422

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name	E-value	Identity	Description
HLS1_ARATH	1.7e-35	33.12	Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1
HLS1L_ARATH	1.1e-34	33.12	Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2...

vs. TrEMBL
Match Name	E-value	Identity	Description
A0A0A0LAQ6_CUCSA	2.4e-166	94.04	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G166310 PE=4 SV=1
A0A061EQU3_THECC	1.4e-89	56.52	Acyl-CoA N-acyltransferases superfamily protein OS=Theobroma cacao GN=TCM_021361...
B9HX15_POPTR	1.4e-89	55.31	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s15480g PE=4 SV=2
A0A151SSU0_CAJCA	3.2e-89	55.00	Uncharacterized protein OS=Cajanus cajan GN=KK1_004169 PE=4 SV=1
B9RZQ2_RICCO	7.0e-89	55.42	N-acetyltransferase, putative OS=Ricinus communis GN=RCOM_1000500 PE=4 SV=1

vs. TAIR10
Match Name	E-value	Identity	Description
AT4G37580.1	9.5e-37	33.12	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1	6.2e-36	33.12	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G30090.1	8.1e-36	31.88	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1	2.2e-33	32.81	Acyl-CoA N-acyltransferases (NAT) superfamily protein

vs. NCBI nr
Match Name	E-value	Identity	Description
gi|449433437|ref|XP_004134504.1|	3.5e-166	94.04	PREDICTED: probable N-acetyltransferase HLS1-like [Cucumis sativus]
gi|659076948|ref|XP_008438951.1|	2.3e-165	94.69	PREDICTED: LOW QUALITY PROTEIN: probable N-acetyltransferase HLS1 [Cucumis melo]
gi|802704350|ref|XP_012084085.1|	8.3e-91	55.94	PREDICTED: probable N-acetyltransferase HLS1 [Jatropha curcas]
gi|566190975|ref|XP_002314944.2|	2.0e-89	55.31	hypothetical protein POPTR_0010s15480g [Populus trichocarpa]
gi|590661866|ref|XP_007035791.1|	2.0e-89	56.52	Acyl-CoA N-acyltransferases superfamily protein [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008080	N-acetyltransferase activity
Vocabulary: INTERPRO
Term	Definition
IPR016181	Acyl_CoA_acyltransferase
IPR000182	GNAT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0042967	acyl-carrier-protein biosynthetic process
biological_process	GO:0006474	N-terminal protein amino acid acetylation
biological_process	GO:0006475	internal protein amino acid acetylation
biological_process	GO:0018002	N-terminal peptidyl-glutamic acid acetylation
biological_process	GO:0017198	N-terminal peptidyl-serine acetylation
biological_process	GO:0008150	biological_process
cellular_component	GO:0031248	protein acetyltransferase complex
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0022626	cytosolic ribosome
cellular_component	GO:0031415	NatA complex
molecular_function	GO:0004596	peptide alpha-N-acetyltransferase activity
molecular_function	GO:0008080	N-acetyltransferase activity
molecular_function	GO:1990190	peptide-glutamate-N-acetyltransferase activity
molecular_function	GO:1990189	peptide-serine-N-acetyltransferase activity
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lsi05G015920.1	Lsi05G015920.1	mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000182	GNAT domain	PFAM	PF00583	Acetyltransf_1	coord: 5..61; score: 2.
IPR000182	GNAT domain	PROFILE	PS51186	GNAT	coord: 1..83; score: 10
IPR016181	Acyl-CoA N-acyltransferase	GENE3D	G3DSA:3.40.630.30		coord: 2..56; score: 3.2
IPR016181	Acyl-CoA N-acyltransferase	unknown	SSF55729	Acyl-CoA N-acyltransferases (Nat)	coord: 4..69; score: 3.9
None	No IPR available	PANTHER	PTHR23091	N-TERMINAL ACETYLTRANSFERASE	coord: 2..120; score: 6.9
None	No IPR available	PANTHER	PTHR23091:SF239	SUBFAMILY NOT NAMED	coord: 2..120; score: 6.9
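The `coord` values in the InterPro table can be mapped back onto the protein sequence to pull out the residues covered by each match. The sketch below assumes the coordinates are 1-based and inclusive protein positions, which is the usual convention but is not stated on the page; `domain_slice` is an illustrative name, and only the first 120 residues of the protein are embedded to keep the example short.

```python
import re

# First 120 residues of the Lsi05G015920 protein sequence listed above.
PROTEIN_PREFIX = (
    "MKIGCILGLRVSPAHRRLGIGLKLVHSVEEWVIRNGAHYAFLAIEKKNKASKNLFTKKCN"
    "YVKFSSLVIFRQPLIVFPTKDIIISKGEIIKTEKLNIEQAISFYTNTLTTKGGVYPMDFD"
)

def domain_slice(protein, coord):
    """Return the residues covered by a 'coord: start..end' entry."""
    m = re.fullmatch(r"(?:coord:\s*)?(\d+)\.\.(\d+)", coord.strip())
    if m is None:
        raise ValueError(f"unrecognised coordinate: {coord!r}")
    start, end = int(m.group(1)), int(m.group(2))
    # Assumption: 1-based inclusive positions, so shift the start by one.
    return protein[start - 1:end]

pfam_region = domain_slice(PROTEIN_PREFIX, "coord: 5..61")
print(len(pfam_region), pfam_region)
```

For the PFAM Acetyltransf_1 match (5..61) this returns a 57-residue segment beginning `CILGLR`.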