Tan0020944 (gene) Snake gourd v1

Overview
NameTan0020944
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionN-acetyltransferase domain-containing protein
LocationLG05: 38852000 .. 38853851 (+)
RNA-Seq ExpressionTan0020944
SyntenyTan0020944
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTAGACCCTTGATCCTTCTTTATACTTTGTTGGAGATGTAAATGAATCTTTCTTTCTTTCTTTTTTGGTTTCTCTTTCTTCACAAGTTTTGTTGTTAAATAAAAAAGAGAAAAACGGTTGAAGGAAAAGAGAAGAAGAAGGAGAAGAAGAAAAAGGCAGGTGGAGGTGGAGGTGGGGTTAAGAGAGAGAGACAAAAAGACAGCACCAACAAGCGGAAAAGAAAGCCCAAAAATCTCAAACTCCAAGACCATCCACACAACACTTCAAACGTGAACACACACAATATTTTGTTTTTTTCTCTTTTTATATTTCTTTATTTAATCCACAAAAAACTTACAAGTTACAATTACAAAGCAACAAAAGCTAATCCAAAATCCCTCTCCTCTTTGTTTTTTTTTTTTTTTTTTTTCCATTTCTAATATGAAACTACCCTCTCTGTAGCATCCTCTGTTTCTTCAGAGCTCTCTGAAATCCGCCATGGCAGAGCAGAAGATTGAGATTATGATTAGAGAATTCGACCCTGTTAAGGATAGAGCTGTAGTTGAAGATGTCGAACGACGATGTGAAGTTGGATCAACAAAAAACAAAAAGCTGTCTCTCTTCACTGATCTCCGCGGCGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTAAAAACTCTTTCCTCTTTTTCTTCTGATCCAAAAATTCAACAAAAATACTCAAAACAAATCATCCTTCATTCAGGTCGCCGAAATTCGTGACCGGAAGGAGATTGTGGGCATGGTCAGAGGCTGCATCAAGACCGTCAGCTGCTGCTCCAAACTCCCTAGAAATGGCGGCACTCATAACGCCTCTGACCCCATCCAACTCGTCCCTGTTTACACTAAACTTGCTTACATCTTAGGCCTTCGGGTCTCTCCCGATCATCGGTAAGAAACTCAAATAATGAGAGTGTGAGACGCAGTTTGAAAGTCAACTGAATTGGTTGTTATATTGTATTTGTAGGAGGTTGGGGATTGGATTGAAGCTAGTGAGGCGAATGGAGGAGTGGTTCAAAGAGAACAAAGCGGAATATTCGTACATAGCGACAGAGAATGACAATGTAGCTTCAATAAAGCTATTCGTGGATAAATGCGAGTATTCGAAATTTCGTACGCCGTCCATTCTTGTAAATCCAGTTTTCGCCCATCGACTCCGCCTGTCGGACCGAGTCACCATCCTCCAACTCGGGTTGGCGGATGCCGAGACGCTGTATCGACGTCGATTCGCAACGGCGGAGTTTTTCCCTCGCGACATTGACGCGGTGCTGTGTAACCGACTAAGCCTGGGGACATTTCTAGCAGTGCCACGTGGGAGTAATGAGTCAGAATCATGGGCGGTTGTAAGCGCGTGGAACTGTAAGGACATGTACGCGCTGGAGGTGCGTGGCGCGTCGACGACGAAGCGAGCGGTGGCGAAAGTGAGCCGTATGATCGACAGAGGGTTGCCGTGGTTGGGGCTGCCGTCGATACCGGAGGTGTTTTCACCATTCGGAGTGTTGTTTTTATATGGAGTAGGAGGGGAAGGTCCAGAGGCAGGGAAATTGATGAAGGCGCTTTGTAAGCACGTGCATAATTTGGCGAAGGAGCGTGGTTGTGGAGTGGTGGCTACGGAGGTTTCAAAGGAGGAGCCGCTGAAATCAAGTATTCCACATTGGAAAAAATTATCTTGCCTTGAAGATTTATGGTGTATGAAGCGCCTTAGGGAAGGCTATAACGACAACTCTATAGGTGACTGGACTAAATCACCACCGGGCTTCTCCATATTTGTTGATCCTAGGGAATTCTAATGCATATCCTTCCTTCCTAGGGTTAGAT

mRNA sequence

TTTAGACCCTTGATCCTTCTTTATACTTTGTTGGAGATGTAAATGAATCTTTCTTTCTTTCTTTTTTGGTTTCTCTTTCTTCACAAGTTTTGTTGTTAAATAAAAAAGAGAAAAACGGTTGAAGGAAAAGAGAAGAAGAAGGAGAAGAAGAAAAAGGCAGGTGGAGGTGGAGGTGGGGTTAAGAGAGAGAGACAAAAAGACAGCACCAACAAGCGGAAAAGAAAGCCCAAAAATCTCAAACTCCAAGACCATCCACACAACACTTCAAACCATCCTCTGTTTCTTCAGAGCTCTCTGAAATCCGCCATGGCAGAGCAGAAGATTGAGATTATGATTAGAGAATTCGACCCTGTTAAGGATAGAGCTGTAGTTGAAGATGTCGAACGACGATGTGAAGTTGGATCAACAAAAAACAAAAAGCTGTCTCTCTTCACTGATCTCCGCGGCGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTCGCCGAAATTCGTGACCGGAAGGAGATTGTGGGCATGGTCAGAGGCTGCATCAAGACCGTCAGCTGCTGCTCCAAACTCCCTAGAAATGGCGGCACTCATAACGCCTCTGACCCCATCCAACTCGTCCCTGTTTACACTAAACTTGCTTACATCTTAGGCCTTCGGGTCTCTCCCGATCATCGGAGGTTGGGGATTGGATTGAAGCTAGTGAGGCGAATGGAGGAGTGGTTCAAAGAGAACAAAGCGGAATATTCGTACATAGCGACAGAGAATGACAATGTAGCTTCAATAAAGCTATTCGTGGATAAATGCGAGTATTCGAAATTTCGTACGCCGTCCATTCTTGTAAATCCAGTTTTCGCCCATCGACTCCGCCTGTCGGACCGAGTCACCATCCTCCAACTCGGGTTGGCGGATGCCGAGACGCTGTATCGACGTCGATTCGCAACGGCGGAGTTTTTCCCTCGCGACATTGACGCGGTGCTGTGTAACCGACTAAGCCTGGGGACATTTCTAGCAGTGCCACGTGGGAGTAATGAGTCAGAATCATGGGCGGTTGTAAGCGCGTGGAACTGTAAGGACATGTACGCGCTGGAGGTGCGTGGCGCGTCGACGACGAAGCGAGCGGTGGCGAAAGTGAGCCGTATGATCGACAGAGGGTTGCCGTGGTTGGGGCTGCCGTCGATACCGGAGGTGTTTTCACCATTCGGAGTGTTGTTTTTATATGGAGTAGGAGGGGAAGGTCCAGAGGCAGGGAAATTGATGAAGGCGCTTTGTAAGCACGTGCATAATTTGGCGAAGGAGCGTGGTTGTGGAGTGGTGGCTACGGAGGTTTCAAAGGAGGAGCCGCTGAAATCAAGTATTCCACATTGGAAAAAATTATCTTGCCTTGAAGATTTATGGTGTATGAAGCGCCTTAGGGAAGGCTATAACGACAACTCTATAGGTGACTGGACTAAATCACCACCGGGCTTCTCCATATTTGTTGATCCTAGGGAATTCTAATGCATATCCTTCCTTCCTAGGGTTAGAT

Coding sequence (CDS)

ATGGCAGAGCAGAAGATTGAGATTATGATTAGAGAATTCGACCCTGTTAAGGATAGAGCTGTAGTTGAAGATGTCGAACGACGATGTGAAGTTGGATCAACAAAAAACAAAAAGCTGTCTCTCTTCACTGATCTCCGCGGCGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTCCTCATGCTCGTCGCCGAAATTCGTGACCGGAAGGAGATTGTGGGCATGGTCAGAGGCTGCATCAAGACCGTCAGCTGCTGCTCCAAACTCCCTAGAAATGGCGGCACTCATAACGCCTCTGACCCCATCCAACTCGTCCCTGTTTACACTAAACTTGCTTACATCTTAGGCCTTCGGGTCTCTCCCGATCATCGGAGGTTGGGGATTGGATTGAAGCTAGTGAGGCGAATGGAGGAGTGGTTCAAAGAGAACAAAGCGGAATATTCGTACATAGCGACAGAGAATGACAATGTAGCTTCAATAAAGCTATTCGTGGATAAATGCGAGTATTCGAAATTTCGTACGCCGTCCATTCTTGTAAATCCAGTTTTCGCCCATCGACTCCGCCTGTCGGACCGAGTCACCATCCTCCAACTCGGGTTGGCGGATGCCGAGACGCTGTATCGACGTCGATTCGCAACGGCGGAGTTTTTCCCTCGCGACATTGACGCGGTGCTGTGTAACCGACTAAGCCTGGGGACATTTCTAGCAGTGCCACGTGGGAGTAATGAGTCAGAATCATGGGCGGTTGTAAGCGCGTGGAACTGTAAGGACATGTACGCGCTGGAGGTGCGTGGCGCGTCGACGACGAAGCGAGCGGTGGCGAAAGTGAGCCGTATGATCGACAGAGGGTTGCCGTGGTTGGGGCTGCCGTCGATACCGGAGGTGTTTTCACCATTCGGAGTGTTGTTTTTATATGGAGTAGGAGGGGAAGGTCCAGAGGCAGGGAAATTGATGAAGGCGCTTTGTAAGCACGTGCATAATTTGGCGAAGGAGCGTGGTTGTGGAGTGGTGGCTACGGAGGTTTCAAAGGAGGAGCCGCTGAAATCAAGTATTCCACATTGGAAAAAATTATCTTGCCTTGAAGATTTATGGTGTATGAAGCGCCTTAGGGAAGGCTATAACGACAACTCTATAGGTGACTGGACTAAATCACCACCGGGCTTCTCCATATTTGTTGATCCTAGGGAATTCTAA

Protein sequence

MAEQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSNESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF
Homology
BLAST of Tan0020944 vs. ExPASy Swiss-Prot
Match: O64815 (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 546.2 bits (1406), Expect = 3.1e-154
Identity = 267/411 (64.96%), Postives = 317/411 (77.13%), Query Frame = 0

Query: 8   IMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEI- 67
           + +RE+DP KD A VEDVERRCEVG     KLSLFTDL GDPICRVRHSP++LMLVAEI 
Sbjct: 5   VEVREYDPSKDLATVEDVERRCEVGPA--GKLSLFTDLLGDPICRVRHSPSYLMLVAEIG 64

Query: 68  -RDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNAS--DPIQLVPVYTKLAYILGLRVSPDH 127
            +++KE+VGM+RGCIKTV+C     R   THN S  D +   P+YTKLAYILGLRVSP H
Sbjct: 65  PKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTH 124

Query: 128 RRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVF 187
           RR GIG KLV+ ME+WF +N AEYSY ATENDN AS+ LF  KC Y++FRTPSILVNPV+
Sbjct: 125 RRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVY 184

Query: 188 AHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGS-- 247
           AHR+ +S RVT+++L  +DAE LYR RF+T EFFPRDID+VL N+LSLGTF+AVPRGS  
Sbjct: 185 AHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCY 244

Query: 248 ---------------NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPW 307
                             +SWAV+S WNCKD + LEVRGAS  +R V+K +RM+D+ LP+
Sbjct: 245 GSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPF 304

Query: 308 LGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEP 367
           L +PSIP VF PFG+ F+YG+GGEGP A K++KALC H HNLAKE GCGVVA EV+ EEP
Sbjct: 305 LKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEP 364

Query: 368 LKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           L+  IPHWK LSC EDLWC+KRL E Y+D S+GDWTKSPPG SIFVDPREF
Sbjct: 365 LRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Tan0020944 vs. ExPASy Swiss-Prot
Match: Q42381 (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 SV=1)

HSP 1 Score: 538.1 bits (1385), Expect = 8.3e-152
Identity = 262/407 (64.37%), Postives = 317/407 (77.89%), Query Frame = 0

Query: 9   MIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEI-R 68
           ++RE+DP +D   VEDVERRCEVG   + KLSLFTDL GDPICR+RHSP++LMLVAE+  
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVG--PSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 69  DRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPDHRRLG 128
           ++KEIVGM+RGCIKTV+C  KL  N   H + + + + P+YTKLAY+LGLRVSP HRR G
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLN---HKSQNDV-VKPLYTKLAYVLGLRVSPFHRRQG 122

Query: 129 IGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVFAHRL 188
           IG KLV+ MEEWF++N AEYSYIATENDN AS+ LF  KC YS+FRTPSILVNPV+AHR+
Sbjct: 123 IGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRV 182

Query: 189 RLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGS------ 248
            +S RVT+++L   DAETLYR RF+T EFFPRDID+VL N+LSLGTF+AVPRGS      
Sbjct: 183 NVSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGS 242

Query: 249 -----------NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLGLP 308
                         ESWAV+S WNCKD + LEVRGAS  +R VAK +R++D+ LP+L LP
Sbjct: 243 GSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLP 302

Query: 309 SIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLKSS 368
           SIP VF PFG+ F+YG+GGEGP A K++K+LC H HNLAK  GCGVVA EV+ E+PL+  
Sbjct: 303 SIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRG 362

Query: 369 IPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           IPHWK LSC EDLWC+KRL + Y+D  +GDWTKSPPG SIFVDPREF
Sbjct: 363 IPHWKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of Tan0020944 vs. NCBI nr
Match: XP_023512928.1 (probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo] >XP_023522343.1 probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 666.8 bits (1719), Expect = 1.2e-187
Identity = 330/409 (80.68%), Postives = 349/409 (85.33%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KDR  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 13  KNNIQILIREFDPIKDRTAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 72

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  QL PVYTKLAYILGLRVSPD
Sbjct: 73  VAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQLAPVYTKLAYILGLRVSPD 132

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           HRRLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 133 HRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 192

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 193 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 252

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDRGLPWL 
Sbjct: 253 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRGLPWLR 312

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF+PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 313 LPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 372

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 373 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 420

BLAST of Tan0020944 vs. NCBI nr
Match: XP_022932903.1 (probable N-acetyltransferase HLS1 isoform X1 [Cucurbita moschata] >KAG7011045.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 664.1 bits (1712), Expect = 7.7e-187
Identity = 329/409 (80.44%), Postives = 348/409 (85.09%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KD+  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 12  KNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 71

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  QL PVYTKLAYILGLRVSPD
Sbjct: 72  VAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQLAPVYTKLAYILGLRVSPD 131

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           HRRLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 132 HRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 191

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 192 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 251

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDRGLPWL 
Sbjct: 252 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRGLPWLR 311

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 312 LPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 371

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 372 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 419

BLAST of Tan0020944 vs. NCBI nr
Match: KAG6571244.1 (putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 659.1 bits (1699), Expect = 2.5e-185
Identity = 326/409 (79.71%), Postives = 348/409 (85.09%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KD+  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 11  KNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 70

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  Q+ PVYTKLAYILGLRVSPD
Sbjct: 71  VAEISAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQIAPVYTKLAYILGLRVSPD 130

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           HRRLGIGL LVRRMEEWF+E +AEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 131 HRRLGIGLNLVRRMEEWFREKRAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 190

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 191 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 250

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDR LPWL 
Sbjct: 251 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRVLPWLR 310

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF+PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 311 LPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 370

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 371 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Tan0020944 vs. NCBI nr
Match: XP_022932904.1 (probable N-acetyltransferase HLS1 isoform X2 [Cucurbita moschata])

HSP 1 Score: 657.5 bits (1695), Expect = 7.2e-185
Identity = 328/409 (80.20%), Postives = 347/409 (84.84%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KD+  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 12  KNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 71

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  QL PVYTKLAYILGLRVSPD
Sbjct: 72  VAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQLAPVYTKLAYILGLRVSPD 131

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           H RLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 132 H-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 191

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 192 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 251

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDRGLPWL 
Sbjct: 252 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRGLPWLR 311

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 312 LPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 371

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 372 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Tan0020944 vs. NCBI nr
Match: XP_022986693.1 (probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 655.6 bits (1690), Expect = 2.7e-184
Identity = 330/419 (78.76%), Postives = 347/419 (82.82%), Query Frame = 0

Query: 1   MAEQK--------IEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICR 60
           MA+QK        I+I+IREFDP KDR  VEDVERRCEVGSTKNK  SLFTDL GDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLA 120
           VRHSPAFLMLVAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD   L PVYTKLA
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGI-NASDLPHLAPVYTKLA 120

Query: 121 YILGLRVSPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKF 180
           YILGLRVSPDHRRLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC Y KF
Sbjct: 121 YILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VTIL+L L  AETLYR RFA AEFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGSNES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSR 300
           TFLAVPRGS                 ESWAV+SAWNCKD+YALEVRG S  KRA+AK SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSR 300

Query: 301 MIDRGLPWLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVA 360
           +IDRGLPWL LPS+PEVF+PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSKEEPLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           TEVS +EPLKS IPHWKKLSC EDLWC+KRL E Y DNS+GDWTKSPP FSIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Tan0020944 vs. ExPASy TrEMBL
Match: A0A6J1F329 (probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439465 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 3.7e-187
Identity = 329/409 (80.44%), Postives = 348/409 (85.09%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KD+  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 12  KNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 71

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  QL PVYTKLAYILGLRVSPD
Sbjct: 72  VAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQLAPVYTKLAYILGLRVSPD 131

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           HRRLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 132 HRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 191

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 192 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 251

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDRGLPWL 
Sbjct: 252 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRGLPWLR 311

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 312 LPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 371

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 372 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 419

BLAST of Tan0020944 vs. ExPASy TrEMBL
Match: A0A6J1EYB0 (probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439465 PE=4 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 3.5e-185
Identity = 328/409 (80.20%), Postives = 347/409 (84.84%), Query Frame = 0

Query: 3   EQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLML 62
           +  I+I+IREFDP+KD+  VEDVERRCEVGSTKNK  SLFTDL GDPICRVRHSPAFLML
Sbjct: 12  KNNIQILIREFDPIKDKPSVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLML 71

Query: 63  VAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPD 122
           VAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD  QL PVYTKLAYILGLRVSPD
Sbjct: 72  VAEIAAHNEIVGMIRGCIKTVTCCSKSTRNAGI-NASDLPQLAPVYTKLAYILGLRVSPD 131

Query: 123 HRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPV 182
           H RLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC YSKFRTPSILVNPV
Sbjct: 132 H-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYSKFRTPSILVNPV 191

Query: 183 FAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGSN 242
           FAHRLRLSD+VTILQL L  AETLYR RFATAEFFPRDIDAVL NRLSLGTFLAVPRGS 
Sbjct: 192 FAHRLRLSDQVTILQLELTVAETLYRCRFATAEFFPRDIDAVLSNRLSLGTFLAVPRGSF 251

Query: 243 ES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLG 302
                           ESWAV+SAWNCKD+YALEVRG S  KRA+AK+SR+IDRGLPWL 
Sbjct: 252 TDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKLSRVIDRGLPWLR 311

Query: 303 LPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLK 362
           LPS+PEVF PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVATEVS +EPLK
Sbjct: 312 LPSVPEVFKPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 371

Query: 363 SSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           S IPHWKKLSC EDLWC+KRL EGY +NS+GDWTKSPP FSIFVDPREF
Sbjct: 372 SDIPHWKKLSCPEDLWCIKRLGEGYTNNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Tan0020944 vs. ExPASy TrEMBL
Match: A0A6J1JHA4 (probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484371 PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 1.3e-184
Identity = 330/419 (78.76%), Postives = 347/419 (82.82%), Query Frame = 0

Query: 1   MAEQK--------IEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICR 60
           MA+QK        I+I+IREFDP KDR  VEDVERRCEVGSTKNK  SLFTDL GDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLA 120
           VRHSPAFLMLVAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD   L PVYTKLA
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGI-NASDLPHLAPVYTKLA 120

Query: 121 YILGLRVSPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKF 180
           YILGLRVSPDHRRLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC Y KF
Sbjct: 121 YILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VTIL+L L  AETLYR RFA AEFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGSNES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSR 300
           TFLAVPRGS                 ESWAV+SAWNCKD+YALEVRG S  KRA+AK SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSR 300

Query: 301 MIDRGLPWLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVA 360
           +IDRGLPWL LPS+PEVF+PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSKEEPLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           TEVS +EPLKS IPHWKKLSC EDLWC+KRL E Y DNS+GDWTKSPP FSIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 418

BLAST of Tan0020944 vs. ExPASy TrEMBL
Match: A0A6J1JGS4 (probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111484371 PE=4 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 1.2e-182
Identity = 329/419 (78.52%), Postives = 346/419 (82.58%), Query Frame = 0

Query: 1   MAEQK--------IEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICR 60
           MA+QK        I+I+IREFDP KDR  VEDVERRCEVGSTKNK  SLFTDL GDPICR
Sbjct: 1   MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICR 60

Query: 61  VRHSPAFLMLVAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLA 120
           VRHSPAFLMLVAEI    EIVGM+RGCIKTV+CCSK  RN G  NASD   L PVYTKLA
Sbjct: 61  VRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGI-NASDLPHLAPVYTKLA 120

Query: 121 YILGLRVSPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKF 180
           YILGLRVSPDH RLGIGL LVRRMEEWF+E KAEYSYIATE DNVASIKLF  KC Y KF
Sbjct: 121 YILGLRVSPDH-RLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKF 180

Query: 181 RTPSILVNPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLG 240
           RTPSILVNPVFAHRLRLSD+VTIL+L L  AETLYR RFA AEFFPRDIDAVL NRLSLG
Sbjct: 181 RTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLG 240

Query: 241 TFLAVPRGSNES--------------ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSR 300
           TFLAVPRGS                 ESWAV+SAWNCKD+YALEVRG S  KRA+AK SR
Sbjct: 241 TFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSR 300

Query: 301 MIDRGLPWLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVA 360
           +IDRGLPWL LPS+PEVF+PFGVLFLYGVGGEGP AGKLMKALC H HNLAKERGCGVVA
Sbjct: 301 VIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVA 360

Query: 361 TEVSKEEPLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           TEVS +EPLKS IPHWKKLSC EDLWC+KRL E Y DNS+GDWTKSPP FSIFVDPREF
Sbjct: 361 TEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 417

BLAST of Tan0020944 vs. ExPASy TrEMBL
Match: A0A2I4HWA3 (probable N-acetyltransferase HLS1 OS=Juglans regia OX=51240 GN=LOC109022071 PE=4 SV=1)

HSP 1 Score: 587.0 bits (1512), Expect = 5.8e-164
Identity = 286/412 (69.42%), Postives = 334/412 (81.07%), Query Frame = 0

Query: 1   MAEQKIEIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFL 60
           M E+  EI++REFD  KD   VEDVERRCEVG   + KLSLFTDL GDPICRVR+SPAF 
Sbjct: 1   MGEENHEIVVREFDHKKDLLGVEDVERRCEVG--PSGKLSLFTDLLGDPICRVRNSPAFN 60

Query: 61  MLVAEIRDRKEIVGMVRGCIKTVSCCSKLPRNGG-THNASDPIQLVPVYTKLAYILGLRV 120
           MLVAEI ++KEIVGM+RGCIKT +C  KL RN   ++N ++ I+ VPVYTK+AYILGLRV
Sbjct: 61  MLVAEIGEQKEIVGMIRGCIKTATCGKKLSRNAKISNNNNELIKPVPVYTKVAYILGLRV 120

Query: 121 SPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILV 180
           SP HRRLGIGLKLVRRMEEWF++N AEYSY+ATENDN+AS+KLF DKC YSKFRTPSILV
Sbjct: 121 SPSHRRLGIGLKLVRRMEEWFRDNGAEYSYLATENDNLASVKLFTDKCGYSKFRTPSILV 180

Query: 181 NPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPR 240
           NPVFAHR+R+SD VT++QL  +DAE LYRRRF+T EFFPRDID+VL N+LSLGTFLAVPR
Sbjct: 181 NPVFAHRVRVSDLVTVIQLCPSDAEILYRRRFSTTEFFPRDIDSVLNNKLSLGTFLAVPR 240

Query: 241 G--------------SNESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLP 300
           G              S+  +SWAV+S WNCKD++ LEVRGAS  KR +AK +R++DR  P
Sbjct: 241 GTYTTESFPGSDRFLSDPCDSWAVLSIWNCKDVFTLEVRGASLAKRTLAKTTRIVDRAFP 300

Query: 301 WLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEE 360
           WL +PS+PE+F PFG+ FLYG+GGEG  A KL+KALC + HNLAK RGCGVVA EVS  E
Sbjct: 301 WLQVPSVPELFKPFGLHFLYGIGGEGTRAVKLVKALCGYAHNLAKARGCGVVAAEVSSRE 360

Query: 361 PLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           PL+  IPHWKKLSC EDLWC+KRL E Y+D S+GDWTKSPPG SIFVDPREF
Sbjct: 361 PLRLGIPHWKKLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGLSIFVDPREF 410

BLAST of Tan0020944 vs. TAIR 10
Match: AT2G23060.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 546.2 bits (1406), Expect = 2.2e-155
Identity = 267/411 (64.96%), Postives = 317/411 (77.13%), Query Frame = 0

Query: 8   IMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEI- 67
           + +RE+DP KD A VEDVERRCEVG     KLSLFTDL GDPICRVRHSP++LMLVAEI 
Sbjct: 5   VEVREYDPSKDLATVEDVERRCEVGPA--GKLSLFTDLLGDPICRVRHSPSYLMLVAEIG 64

Query: 68  -RDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNAS--DPIQLVPVYTKLAYILGLRVSPDH 127
            +++KE+VGM+RGCIKTV+C     R   THN S  D +   P+YTKLAYILGLRVSP H
Sbjct: 65  PKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTH 124

Query: 128 RRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVF 187
           RR GIG KLV+ ME+WF +N AEYSY ATENDN AS+ LF  KC Y++FRTPSILVNPV+
Sbjct: 125 RRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVY 184

Query: 188 AHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGS-- 247
           AHR+ +S RVT+++L  +DAE LYR RF+T EFFPRDID+VL N+LSLGTF+AVPRGS  
Sbjct: 185 AHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCY 244

Query: 248 ---------------NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPW 307
                             +SWAV+S WNCKD + LEVRGAS  +R V+K +RM+D+ LP+
Sbjct: 245 GSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPF 304

Query: 308 LGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEP 367
           L +PSIP VF PFG+ F+YG+GGEGP A K++KALC H HNLAKE GCGVVA EV+ EEP
Sbjct: 305 LKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEP 364

Query: 368 LKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           L+  IPHWK LSC EDLWC+KRL E Y+D S+GDWTKSPPG SIFVDPREF
Sbjct: 365 LRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Tan0020944 vs. TAIR 10
Match: AT4G37580.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 538.1 bits (1385), Expect = 5.9e-153
Identity = 262/407 (64.37%), Postives = 317/407 (77.89%), Query Frame = 0

Query: 9   MIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEI-R 68
           ++RE+DP +D   VEDVERRCEVG   + KLSLFTDL GDPICR+RHSP++LMLVAE+  
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVG--PSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 69  DRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPDHRRLG 128
           ++KEIVGM+RGCIKTV+C  KL  N   H + + + + P+YTKLAY+LGLRVSP HRR G
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLN---HKSQNDV-VKPLYTKLAYVLGLRVSPFHRRQG 122

Query: 129 IGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVFAHRL 188
           IG KLV+ MEEWF++N AEYSYIATENDN AS+ LF  KC YS+FRTPSILVNPV+AHR+
Sbjct: 123 IGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRV 182

Query: 189 RLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGS------ 248
            +S RVT+++L   DAETLYR RF+T EFFPRDID+VL N+LSLGTF+AVPRGS      
Sbjct: 183 NVSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGS 242

Query: 249 -----------NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLGLP 308
                         ESWAV+S WNCKD + LEVRGAS  +R VAK +R++D+ LP+L LP
Sbjct: 243 GSWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLP 302

Query: 309 SIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLKSS 368
           SIP VF PFG+ F+YG+GGEGP A K++K+LC H HNLAK  GCGVVA EV+ E+PL+  
Sbjct: 303 SIPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRG 362

Query: 369 IPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           IPHWK LSC EDLWC+KRL + Y+D  +GDWTKSPPG SIFVDPREF
Sbjct: 363 IPHWKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of Tan0020944 vs. TAIR 10
Match: AT2G23060.2 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 472.2 bits (1214), Expect = 4.0e-133
Identity = 229/358 (63.97%), Postives = 274/358 (76.54%), Query Frame = 0

Query: 61  MLVAEI--RDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNAS--DPIQLVPVYTKLAYILG 120
           MLVAEI  +++KE+VGM+RGCIKTV+C     R   THN S  D +   P+YTKLAYILG
Sbjct: 1   MLVAEIGPKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILG 60

Query: 121 LRVSPDHRRLGIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPS 180
           LRVSP HRR GIG KLV+ ME+WF +N AEYSY ATENDN AS+ LF  KC Y++FRTPS
Sbjct: 61  LRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPS 120

Query: 181 ILVNPVFAHRLRLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLA 240
           ILVNPV+AHR+ +S RVT+++L  +DAE LYR RF+T EFFPRDID+VL N+LSLGTF+A
Sbjct: 121 ILVNPVYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVA 180

Query: 241 VPRGS-----------------NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRM 300
           VPRGS                    +SWAV+S WNCKD + LEVRGAS  +R V+K +RM
Sbjct: 181 VPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRM 240

Query: 301 IDRGLPWLGLPSIPEVFSPFGVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVAT 360
           +D+ LP+L +PSIP VF PFG+ F+YG+GGEGP A K++KALC H HNLAKE GCGVVA 
Sbjct: 241 VDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAA 300

Query: 361 EVSKEEPLKSSIPHWKKLSCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPREF 398
           EV+ EEPL+  IPHWK LSC EDLWC+KRL E Y+D S+GDWTKSPPG SIFVDPREF
Sbjct: 301 EVAGEEPLRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 358

BLAST of Tan0020944 vs. TAIR 10
Match: AT5G67430.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 437.2 bits (1123), Expect = 1.4e-122
Identity = 219/397 (55.16%), Postives = 285/397 (71.79%), Query Frame = 0

Query: 8   IMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEIR 67
           +++RE+DP +D   VE++E  CEVG       SL  DL GDP+ R+R SP+F MLVAEI 
Sbjct: 8   VVVREYDPKRDLTSVEELEESCEVG-------SLLVDLMGDPLARIRQSPSFHMLVAEIG 67

Query: 68  DRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPDHRRLG 127
           +  EIVGM+RG IK V+      R       +D +      TKLA++ GLRVSP +RR+G
Sbjct: 68  N--EIVGMIRGTIKMVT------RGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRRMG 127

Query: 128 IGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVFAHRL 187
           IGLKLV+R+EEWF  N A YSY+ TENDN+AS+KLF +K  YSKFRTP+ LVNPVF HR+
Sbjct: 128 IGLKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNHRV 187

Query: 188 RLSDRVTILQLGLADAETLYRRRFATAEFFPRDIDAVLCNRLSLGTFLAVPRGS------ 247
            +S RV I++L  +DAE+LYR RF+T EFFP DI+++L N+LSLGT+LAVPRG       
Sbjct: 188 TVSRRVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRGGDNVSGS 247

Query: 248 --NESESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLGLPSIPEVFSPF 307
             +++ SWAV+S WN KD+Y L+V+GAS  KR +AK +R+ D   P+L +PS P +F  F
Sbjct: 248 LPDQTGSWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSFPNLFKSF 307

Query: 308 GVLFLYGVGGEGPEAGKLMKALCKHVHNLAKERGCGVVATEVSKEEPLKSSIPHWKKLSC 367
            + F+YG+GGEGP A ++++ALC H HNLA++ GC VVA EV+  EPL+  IPHWK LS 
Sbjct: 308 AMHFMYGIGGEGPRAAEMVEALCSHAHNLARKSGCAVVAAEVASCEPLRVGIPHWKVLS- 367

Query: 368 LEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPRE 397
            EDLWC+KRLR  Y+D+ + DWTKSPPG SIFVDPRE
Sbjct: 368 PEDLWCLKRLR--YDDDGV-DWTKSPPGLSIFVDPRE 385

BLAST of Tan0020944 vs. TAIR 10
Match: AT2G30090.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 310.1 bits (793), Expect = 2.6e-84
Identity = 168/399 (42.11%), Postives = 247/399 (61.90%), Query Frame = 0

Query: 7   EIMIREFDPVKDRAVVEDVERRCEVGSTKNKKLSLFTDLRGDPICRVRHSPAFLMLVAEI 66
           E++IR +D  +DR  +  +E+ CE+G   + +  LFTD  GDPICR+R+SP F+MLVA +
Sbjct: 12  EVVIRCYDDRRDRIQMGRMEKSCEIG--HDHQTLLFTDTLGDPICRIRNSPFFIMLVAGV 71

Query: 67  RDRKEIVGMVRGCIKTVSCCSKLPRNGGTHNASDPIQLVPVYTKLAYILGLRVSPDHRRL 126
            ++  +VG ++G +K                   P++      ++ Y+LGLRV P +RR 
Sbjct: 72  GNK--LVGSIQGSVK-------------------PVEFHDKSVRVGYVLGLRVVPSYRRR 131

Query: 127 GIGLKLVRRMEEWFKENKAEYSYIATENDNVASIKLFVDKCEYSKFRTPSILVNPVFAHR 186
           GIG  LVR++EEWF+ + A+Y+Y+ATE DN AS  LF+ +  Y  FR P+ILVNPV   R
Sbjct: 132 GIGSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGR 191

Query: 187 -LRLSDRVTILQLGLADAETLYRRRF-ATAEFFPRDIDAVLCNRLSLGTFLAVPRGSNES 246
            L+L   + I +L + +AE+LYRR   AT EFFP DI+ +L N+LS+GT++A     + +
Sbjct: 192 GLKLPSDIGIRKLKVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNNVDNT 251

Query: 247 ESWAVVSAWNCKDMYALEVRGASTTKRAVAKVSRMIDRGLPWLGLPSIPEVFSPFGVLFL 306
            SWA++S W+   ++ L +  A  +   + KVS++    L  LGL  +P++F+PFG  FL
Sbjct: 252 RSWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTVLPDLFTPFGFYFL 311

Query: 307 YGVGGEGPEAGKLMKALCKHVHNLAKER---GCGVVATEVSK----EEPLKSSIPHWKKL 366
           YGV  EGP  GKL++ALC+HVHN+A       C VV  EV K    ++ L+  IPHWK L
Sbjct: 312 YGVHSEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGSNGDDSLQRCIPHWKML 371

Query: 367 SCLEDLWCMKRLREGYNDNSIGDWTKSPPGFSIFVDPRE 397
           SC +D+WC+K L+   N   + + +KS    S+FVDPRE
Sbjct: 372 SCDDDMWCIKPLKCEKNKFDLSERSKSRS--SLFVDPRE 385

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O648153.1e-15464.96Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23... [more]
Q423818.3e-15264.37Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 S... [more]
Match NameE-valueIdentityDescription
XP_023512928.11.2e-18780.68probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo] >XP_023522343.1 p... [more]
XP_022932903.17.7e-18780.44probable N-acetyltransferase HLS1 isoform X1 [Cucurbita moschata] >KAG7011045.1 ... [more]
KAG6571244.12.5e-18579.71putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma ... [more]
XP_022932904.17.2e-18580.20probable N-acetyltransferase HLS1 isoform X2 [Cucurbita moschata][more]
XP_022986693.12.7e-18478.76probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1F3293.7e-18780.44probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LO... [more]
A0A6J1EYB03.5e-18580.20probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LO... [more]
A0A6J1JHA41.3e-18478.76probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1... [more]
A0A6J1JGS41.2e-18278.52probable N-acetyltransferase HLS1 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1... [more]
A0A2I4HWA35.8e-16469.42probable N-acetyltransferase HLS1 OS=Juglans regia OX=51240 GN=LOC109022071 PE=4... [more]
Match NameE-valueIdentityDescription
AT2G23060.12.2e-15564.96Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT4G37580.15.9e-15364.37Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT2G23060.24.0e-13363.97Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT5G67430.11.4e-12255.16Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
AT2G30090.12.6e-8442.11Acyl-CoA N-acyltransferases (NAT) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000182GNAT domainPFAMPF00583Acetyltransf_1coord: 55..164
e-value: 1.8E-15
score: 57.2
IPR000182GNAT domainPROSITEPS51186GNATcoord: 8..220
score: 13.162516
NoneNo IPR availableGENE3D3.40.630.30coord: 18..174
e-value: 7.9E-14
score: 53.8
NoneNo IPR availablePANTHERPTHR43072:SF42N-ACETYLTRANSFERASE HLS1-LIKE-RELATEDcoord: 3..397
NoneNo IPR availablePANTHERPTHR43072N-ACETYLTRANSFERASEcoord: 3..397
NoneNo IPR availableCDDcd04301NAT_SFcoord: 112..147
e-value: 1.52578E-6
score: 43.4185
IPR016181Acyl-CoA N-acyltransferaseSUPERFAMILY55729Acyl-CoA N-acyltransferases (Nat)coord: 55..163

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020944.1Tan0020944.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0008080 N-acetyltransferase activity