CmaCh20G009860 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G009860
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Acyl-CoA N-acyltransferases (NAT) superfamily protein
Location: Cma_Chr20 : 5316513 .. 5318439 (-)
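
The Location field above packs the chromosome, coordinates, and strand into one string. A minimal Python sketch of how it might be split into parts follows. It assumes the coordinates are 1-based and inclusive, as is common for GFF-style annotation (the record itself does not say). The names `Location` and `parse_location` are hypothetical and are not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid : start .. end (strand)' string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates (GFF-style).
        return self.end - self.start + 1


_LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Parse a location string such as 'Cma_Chr20 : 5316513 .. 5318439 (-)'."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr20 : 5316513 .. 5318439 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 1927 bases on the minus strand of Cma_Chr20.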
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGAAAGCCCCCCAAAAGCCCCAAATTGCTTTTAAATCCACCATTATAAACACCATCCACACAACACAACGGTTCATAACAAAAAAAGTTCCTTTTTTTTTTTTCTTTTCTTTTTTGATCTTAATCCAACAATCCCTCAACTCTTTGTCGTTCTCGTGTTCCCATCTCCAATAGTGAAATGAACCCACCATCTCTGTAGCATCCTCCTCTTCTGTTTCTCAAATCCGCCATGGCAGACCAGAAGAAGAAGAAGAAGAGGAAGAACAACATTCAAATTCTCATCAGAGAATTCGACCCCACTAAAGACAGACCTGCCGTTGAAGATGTTGAACGACGATGCGAAGTTGGCTCAACCAAAAACAAAAACTTCTCCCTCTTCACCGATCTCCTCGGTGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTTCTCATGCTCGTAACACTCTTTCCCCCCCCACCCTTTTTCTTCTCCCCCTCAAATTTCATCACCAATTCTCAAAAACTTCATTCATTCATTCAGGTCGCGGAAATTGCCGCCCACAACGAGATCGTCGGCATGATCAGAGGCTGCATCAAGACTGTTACTTGCTGCTCTAAATCCACTAGAAATTCCGGAATTAACGCCTCTGATCTTCCCCACCTCGCCCCTGTTTACACCAAACTCGCTTACATCTTAGGCCTTCGCGTCTCCCCCGATCACCGGTAAGGAATTCAAACAAAACAAAAAACAGAGTATTCATACATTAAGAGAAAACAGAGTATTCGGAAAAACAGAGTATTCATAGATTAGACTAAGAATTGATACTGTATTTGTAGGAGATTGGGGATTGGATTGAATCTGGTAAGGCGAATGGAGGAATGGTTCAGAGAGAAGAAAGCGGAATATTCATACATAGCTACAGAGATTGACAACGTCGCTTCAATTAAGCTATTCACTCATAAATGCGGCTATTTCAAGTTCAGAACGCCGTCGATTCTCGTGAACCCTGTTTTCGCCCACCGGCTCCGCCTCTCCGACCAAGTCACCATCCTCCGGCTCGAGCTGACTGTCGCCGAGACACTGTATCGATGTCGGTTCGCGGCGGCGGAGTTTTTCCCGCGCGACATTGACGCGGTGCTCTCCAACCGGCTAAGCCTCGGCACATTTCTAGCAGTGCCACGTGGGAGTTTTACAGATGGGTTGTGGGTCGAGTCTGATCGGTTTTTGTCGTGTCTGCCGGAATCGTGGGCGGTTTTAAGCGCGTGGAATTGCAAGGACGTGTATGCGCTGGAGGTTCGTGGCGTGTCCGTGGTGAAGCGAGCGATGGCAAAATTCAGCCGTGTGATTGATCGAGGGCTGCCGTGGTTGCGGCTGCCATCAGTTCCGGAGGTGTTTACGCCATTTGGAGTGCTGTTTTTGTATGGGGTAGGAGGAGAAGGCCCACTCGCAGGGAAGCTGATGAAGGCGCTTTGCCACCACGCGCACAATTTGGCAAAGGAGCGTGGCTGTGGTGTGGTGGCTACGGAGGTTTCAAACGATGAGCCGCTCAAATCCGACATTCCACATTGGAAAAAACTGTCATGTCCGGAGGATTTATGGTGCATTAAGCGCCTTGGTGAATGCTACACCGACAACTCCCTCGGCGATTGGACTAAATCGCCACCCAGCTTCTCCATATTCGTTGATCCTAGGGAATTCTAATTCACCCCCAACAGCTACCGCTTTCTTCCATTCCTTGTTTTCGATAACATAATCTAATCGTCCACGAAATAAAATAAGATATCATTTTTATAACGAACAATGTTGAATGAAAAGAATATGTAATACTATTATACGATAAAATTGGGTTCATTGCACGAGACCCACAATAGTACATTCGTATAGTTATTTGAGTACTTGTTGTGCGTTGGATGACCACTTTCAATTCATGTTGTGCATTCTAC

mRNA sequence

AAAGAAAGCCCCCCAAAAGCCCCAAATTGCTTTTAAATCCACCATTATAAACACCATCCACACAACACAACGGTTCATAACAAAAAAAGTTCCTTTTTTTTTTTTCTTTTCTTTTTTGATCTTAATCCAACAATCCCTCAACTCTTTGTCGTTCTCGTGTTCCCATCTCCAATAGTGAAATGAACCCACCATCTCTGTAGCATCCTCCTCTTCTGTTTCTCAAATCCGCCATGGCAGACCAGAAGAAGAAGAAGAAGAGGAAGAACAACATTCAAATTCTCATCAGAGAATTCGACCCCACTAAAGACAGACCTGCCGTTGAAGATGTTGAACGACGATGCGAAGTTGGCTCAACCAAAAACAAAAACTTCTCCCTCTTCACCGATCTCCTCGGTGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTTCTCATGCTCGTCGCGGAAATTGCCGCCCACAACGAGATCGTCGGCATGATCAGAGGCTGCATCAAGACTGTTACTTGCTGCTCTAAATCCACTAGAAATTCCGGAATTAACGCCTCTGATCTTCCCCACCTCGCCCCTGTTTACACCAAACTCGCTTACATCTTAGGCCTTCGCGTCTCCCCCGATCACCGGAGATTGGGGATTGGATTGAATCTGGTAAGGCGAATGGAGGAATGGTTCAGAGAGAAGAAAGCGGAATATTCATACATAGCTACAGAGATTGACAACGTCGCTTCAATTAAGCTATTCACTCATAAATGCGGCTATTTCAAGTTCAGAACGCCGTCGATTCTCGTGAACCCTGTTTTCGCCCACCGGCTCCGCCTCTCCGACCAAGTCACCATCCTCCGGCTCGAGCTGACTGTCGCCGAGACACTGTATCGATGTCGGTTCGCGGCGGCGGAGTTTTTCCCGCGCGACATTGACGCGGTGCTCTCCAACCGGCTAAGCCTCGGCACATTTCTAGCAGTGCCACGTGGGAGTTTTACAGATGGGTTGTGGGTCGAGTCTGATCGGTTTTTGTCGTGTCTGCCGGAATCGTGGGCGGTTTTAAGCGCGTGGAATTGCAAGGACGTGTATGCGCTGGAGGTTCGTGGCGTGTCCGTGGTGAAGCGAGCGATGGCAAAATTCAGCCGTGTGATTGATCGAGGGCTGCCGTGGTTGCGGCTGCCATCAGTTCCGGAGGTGTTTACGCCATTTGGAGTGCTGTTTTTGTATGGGGTAGGAGGAGAAGGCCCACTCGCAGGGAAGCTGATGAAGGCGCTTTGCCACCACGCGCACAATTTGGCAAAGGAGCGTGGCTGTGGTGTGGTGGCTACGGAGGTTTCAAACGATGAGCCGCTCAAATCCGACATTCCACATTGGAAAAAACTGTCATGTCCGGAGGATTTATGGTGCATTAAGCGCCTTGGTGAATGCTACACCGACAACTCCCTCGGCGATTGGACTAAATCGCCACCCAGCTTCTCCATATTCGTTGATCCTAGGGAATTCTAATTCACCCCCAACAGCTACCGCTTTCTTCCATTCCTTGTTTTCGATAACATAATCTAATCGTCCACGAAATAAAATAAGATATCATTTTTATAACGAACAATGTTGAATGAAAAGAATATGTAATACTATTATACGATAAAATTGGGTTCATTGCACGAGACCCACAATAGTACATTCGTATAGTTATTTGAGTACTTGTTGTGCGTTGGATGACCACTTTCAATTCATGTTGTGCATTCTAC

Coding sequence (CDS)

ATGGCAGACCAGAAGAAGAAGAAGAAGAGGAAGAACAACATTCAAATTCTCATCAGAGAATTCGACCCCACTAAAGACAGACCTGCCGTTGAAGATGTTGAACGACGATGCGAAGTTGGCTCAACCAAAAACAAAAACTTCTCCCTCTTCACCGATCTCCTCGGTGACCCAATTTGCAGAGTCCGCCATTCCCCTGCCTTTCTCATGCTCGTCGCGGAAATTGCCGCCCACAACGAGATCGTCGGCATGATCAGAGGCTGCATCAAGACTGTTACTTGCTGCTCTAAATCCACTAGAAATTCCGGAATTAACGCCTCTGATCTTCCCCACCTCGCCCCTGTTTACACCAAACTCGCTTACATCTTAGGCCTTCGCGTCTCCCCCGATCACCGGAGATTGGGGATTGGATTGAATCTGGTAAGGCGAATGGAGGAATGGTTCAGAGAGAAGAAAGCGGAATATTCATACATAGCTACAGAGATTGACAACGTCGCTTCAATTAAGCTATTCACTCATAAATGCGGCTATTTCAAGTTCAGAACGCCGTCGATTCTCGTGAACCCTGTTTTCGCCCACCGGCTCCGCCTCTCCGACCAAGTCACCATCCTCCGGCTCGAGCTGACTGTCGCCGAGACACTGTATCGATGTCGGTTCGCGGCGGCGGAGTTTTTCCCGCGCGACATTGACGCGGTGCTCTCCAACCGGCTAAGCCTCGGCACATTTCTAGCAGTGCCACGTGGGAGTTTTACAGATGGGTTGTGGGTCGAGTCTGATCGGTTTTTGTCGTGTCTGCCGGAATCGTGGGCGGTTTTAAGCGCGTGGAATTGCAAGGACGTGTATGCGCTGGAGGTTCGTGGCGTGTCCGTGGTGAAGCGAGCGATGGCAAAATTCAGCCGTGTGATTGATCGAGGGCTGCCGTGGTTGCGGCTGCCATCAGTTCCGGAGGTGTTTACGCCATTTGGAGTGCTGTTTTTGTATGGGGTAGGAGGAGAAGGCCCACTCGCAGGGAAGCTGATGAAGGCGCTTTGCCACCACGCGCACAATTTGGCAAAGGAGCGTGGCTGTGGTGTGGTGGCTACGGAGGTTTCAAACGATGAGCCGCTCAAATCCGACATTCCACATTGGAAAAAACTGTCATGTCCGGAGGATTTATGGTGCATTAAGCGCCTTGGTGAATGCTACACCGACAACTCCCTCGGCGATTGGACTAAATCGCCACCCAGCTTCTCCATATTCGTTGATCCTAGGGAATTCTAA

Protein sequence

MADQKKKKKRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIAAHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF
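
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below illustrates that relationship in Python, assuming the standard genetic code (NCBI translation table 1). The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last five codons of the CDS shown above.
cds_start = "ATGGCAGACCAGAAGAAG"
cds_end = "CCTAGGGAATTCTAA"
print(translate(cds_start))  # MADQKK
print(translate(cds_end))    # PREF
```

Both fragments agree with the start (`MADQKK...`) and end (`...PREF`) of the protein sequence listed in the record.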
BLAST of CmaCh20G009860 vs. Swiss-Prot
Match: HLS1_ARATH (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1)

HSP 1 Score: 546.2 bits (1406), Expect = 3.1e-154
Identity = 265/406 (65.27%), Positives = 318/406 (78.33%), Query Frame = 1

Query: 17  LIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIAA 76
           ++RE+DPT+D   VEDVERRCEVG +     SLFTDLLGDPICR+RHSP++LMLVAE+  
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGK--LSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 77  HN-EIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGI 136
              EIVGMIRGCIKTVTC  K   N   + S    + P+YTKLAY+LGLRVSP HRR GI
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLN---HKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGI 122

Query: 137 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 196
           G  LV+ MEEWFR+  AEYSYIATE DN AS+ LFT KCGY +FRTPSILVNPV+AHR+ 
Sbjct: 123 GFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVN 182

Query: 197 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFT---DG 256
           +S +VT+++LE   AETLYR RF+  EFFPRDID+VL+N+LSLGTF+AVPRGS      G
Sbjct: 183 VSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSG 242

Query: 257 LWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPS 316
            W  S +FL   PESWAVLS WNCKD + LEVRG S ++R +AK +RV+D+ LP+L+LPS
Sbjct: 243 SWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPS 302

Query: 317 VPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDI 376
           +P VF PFG+ F+YG+GGEGP A K++K+LC HAHNLAK  GCGVVA EV+ ++PL+  I
Sbjct: 303 IPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGI 362

Query: 377 PHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           PHWK LSC EDLWCIKRLG+ Y+D  +GDWTKSPP  SIFVDPREF
Sbjct: 363 PHWKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403
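
Each HSP summary line reports identities and positives as counts over the alignment length, with the percentage in parentheses. The Python sketch below recomputes those percentages from the raw counts. The function name `hsp_percentages` is hypothetical, and the regular expression is an assumption based only on the line format seen in this record.

```python
import re

# 'Posi?tives' also accepts the 'Postives' spelling seen in some rendered reports.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) from an HSP summary line."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


summary = "Identity = 265/406 (65.27%), Positives = 318/406 (78.33%), Query Frame = 1"
print(hsp_percentages(summary))  # (65.27, 78.33)
```

For the HLS1_ARATH hit, 265 identical and 318 positive positions over 406 aligned columns reproduce the reported 65.27% and 78.33%.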

BLAST of CmaCh20G009860 vs. Swiss-Prot
Match: HLS1L_ARATH (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 3.8e-152
Identity = 263/413 (63.68%), Positives = 320/413 (77.48%), Query Frame = 1

Query: 14  IQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAE 73
           + + +RE+DP+KD   VEDVERRCEVG       SLFTDLLGDPICRVRHSP++LMLVAE
Sbjct: 3   VLVEVREYDPSKDLATVEDVERRCEVGPAGK--LSLFTDLLGDPICRVRHSPSYLMLVAE 62

Query: 74  IAAHN--EIVGMIRGCIKTVTCCSKSTR---NSGINASDLPHLAPVYTKLAYILGLRVSP 133
           I      E+VGMIRGCIKTVTC   + R       + +D+    P+YTKLAYILGLRVSP
Sbjct: 63  IGPKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSP 122

Query: 134 DHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNP 193
            HRR GIG  LV+ ME+WF +  AEYSY ATE DN AS+ LFT KCGY +FRTPSILVNP
Sbjct: 123 THRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNP 182

Query: 194 VFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGS 253
           V+AHR+ +S +VT+++LE + AE LYR RF+  EFFPRDID+VL+N+LSLGTF+AVPRGS
Sbjct: 183 VYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGS 242

Query: 254 -FTDGL--WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGL 313
            +  G   W  S +FL   P+SWAVLS WNCKD + LEVRG S ++R ++K +R++D+ L
Sbjct: 243 CYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTL 302

Query: 314 PWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSND 373
           P+L++PS+P VF PFG+ F+YG+GGEGP A K++KALC HAHNLAKE GCGVVA EV+ +
Sbjct: 303 PFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGE 362

Query: 374 EPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           EPL+  IPHWK LSC EDLWCIKRLGE Y+D S+GDWTKSPP  SIFVDPREF
Sbjct: 363 EPLRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of CmaCh20G009860 vs. TrEMBL
Match: V4SFH0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025754mg PE=4 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 2.6e-168
Identity = 292/405 (72.10%), Positives = 330/405 (81.48%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP KD   VEDVERRCEVG +      LFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 6   IVVREFDPNKDCLGVEDVERRCEVGPSGK--LCLFTDLLGDPICRVRHSPAFLMLVAEVG 65

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDL--PHLAPVYTKLAYILGLRVSPDHRRL 135
             +EIVGMIRGCIKTVTC  + +RN+    +D+  P   PVYTKLAYILGLRVSP HRR+
Sbjct: 66  --DEIVGMIRGCIKTVTCGKRISRNTKYTTNDIEPPKPLPVYTKLAYILGLRVSPSHRRM 125

Query: 136 GIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHR 195
           GIGL LV+RMEEWFRE   EYSYIATE DN AS+KLFT KCGY KFRTPSILVNPVFAHR
Sbjct: 126 GIGLKLVKRMEEWFRESGVEYSYIATENDNYASVKLFTDKCGYSKFRTPSILVNPVFAHR 185

Query: 196 LRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGL 255
           L +  QVTI++L  + AE  YR +F+  EFFPRDID+VL+N+L+LGTFLAVPRG+++   
Sbjct: 186 LIVPKQVTIIQLNPSDAEAFYRRKFSTTEFFPRDIDSVLNNKLNLGTFLAVPRGTYSPDS 245

Query: 256 WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSV 315
           W  SD F SC PESWA+LS WN KDV+ LEVRG S VKR +AK +RV+DR LPWLR+PSV
Sbjct: 246 WAGSDSFFSCPPESWAILSVWNSKDVFKLEVRGASRVKRTLAKTTRVVDRVLPWLRIPSV 305

Query: 316 PEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIP 375
           PEVF+PFG+ FLYG+GGEGP A KL+KALC HAHNLAKERGCGVVATEVS+ EPLK  IP
Sbjct: 306 PEVFSPFGLHFLYGLGGEGPRAAKLVKALCGHAHNLAKERGCGVVATEVSSREPLKLGIP 365

Query: 376 HWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           HWK LSC EDLWCIKRLGE Y+D SLGDWTKSPP  SIFVDPREF
Sbjct: 366 HWKMLSCDEDLWCIKRLGEDYSDGSLGDWTKSPPGLSIFVDPREF 406

BLAST of CmaCh20G009860 vs. TrEMBL
Match: A0A067GGV7_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g014721mg PE=4 SV=1)

HSP 1 Score: 597.8 bits (1540), Expect = 1.0e-167
Identity = 290/405 (71.60%), Positives = 329/405 (81.23%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP KD   VEDVERRCEVG +      LFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 19  IVVREFDPNKDCLGVEDVERRCEVGPSGK--LCLFTDLLGDPICRVRHSPAFLMLVAEVG 78

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDL--PHLAPVYTKLAYILGLRVSPDHRRL 135
             +EIVGMIRGCIKTVTC  + +RN+    +D+  P   PVYTKLAYILGLRVSP HRR+
Sbjct: 79  --DEIVGMIRGCIKTVTCGKRISRNTKYTTNDIEPPKPLPVYTKLAYILGLRVSPSHRRM 138

Query: 136 GIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHR 195
           GIGL LV+RMEEWFRE   EYSYIATE DN AS+KLFT KCGY KFRTPSILVNPVFAHR
Sbjct: 139 GIGLKLVKRMEEWFRESGVEYSYIATENDNYASVKLFTDKCGYSKFRTPSILVNPVFAHR 198

Query: 196 LRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGL 255
           L +  QVTI++L  + AE  YR +F+  EFFPRDID+VL+N+L+LGTFLAVPRG+++   
Sbjct: 199 LIVPKQVTIIQLNPSDAEAFYRRKFSTTEFFPRDIDSVLNNKLNLGTFLAVPRGTYSPDS 258

Query: 256 WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSV 315
           W  SD F SC PESWA+LS WN KDV+ LEVRG S VKR +AK +R++DR  PWLR+PSV
Sbjct: 259 WAGSDSFFSCPPESWAILSVWNSKDVFKLEVRGASRVKRTLAKTTRIVDRVFPWLRIPSV 318

Query: 316 PEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIP 375
           PEVF+PFG+ FLYG+GGEGP A KL+KALC HAHNLAKERGCGVVATEVS+ EPLK  IP
Sbjct: 319 PEVFSPFGLHFLYGLGGEGPRAAKLVKALCGHAHNLAKERGCGVVATEVSSREPLKLGIP 378

Query: 376 HWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           HWK LSC EDLWCIKRLGE Y+D SLGDWTKSPP  SIFVDPREF
Sbjct: 379 HWKMLSCDEDLWCIKRLGEDYSDGSLGDWTKSPPGLSIFVDPREF 419

BLAST of CmaCh20G009860 vs. TrEMBL
Match: B9SGZ0_RICCO (N-acetyltransferase, putative OS=Ricinus communis GN=RCOM_0578960 PE=4 SV=1)

HSP 1 Score: 591.3 bits (1523), Expect = 9.4e-166
Identity = 291/403 (72.21%), Positives = 331/403 (82.13%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP++DR  VE+VERRCEVG +     SLFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 7   IVVREFDPSRDRVGVEEVERRCEVGPSGK--LSLFTDLLGDPICRVRHSPAFLMLVAELG 66

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGI 135
              EIVGMIRGCIKTVTC  K +R+  +  +D P   PVYTK+AYILGLRVSP HRR+GI
Sbjct: 67  --EEIVGMIRGCIKTVTCGRKLSRH--VKNNDPPKPLPVYTKVAYILGLRVSPSHRRMGI 126

Query: 136 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 195
           GL LVR +EEWFRE  AEYSY+ATE DN AS+KLFT KCGY KFRTPSILVNPVFAHRL 
Sbjct: 127 GLKLVRTIEEWFRENGAEYSYLATENDNHASVKLFTDKCGYTKFRTPSILVNPVFAHRLA 186

Query: 196 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGLWV 255
           +S++VTI +L    AE LYR RFA  EFFPRDID+VL+N+LSLGTFLAVPRGS+T   W 
Sbjct: 187 VSNRVTIFKLPPNDAELLYRRRFATTEFFPRDIDSVLNNKLSLGTFLAVPRGSYTHNSWP 246

Query: 256 ESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSVPE 315
             D+FLS  PESWAVLS WNCKDV+ LEVRG S VKR  AK +R++D+ LP+L+LPSVPE
Sbjct: 247 GFDKFLSDPPESWAVLSVWNCKDVFRLEVRGASRVKRTFAKTTRIVDKALPFLKLPSVPE 306

Query: 316 VFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIPHW 375
           +F PFG+ FLYGVGGEGP A K++KALC HAHNLAKERGCGVVATEVS+ EPLK  IP+W
Sbjct: 307 LFRPFGLHFLYGVGGEGPHAVKMVKALCAHAHNLAKERGCGVVATEVSSCEPLKLGIPYW 366

Query: 376 KKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           K LSC EDLWCIKRLGE Y+D S+GDWTKSPP  SIFVDPREF
Sbjct: 367 KMLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGVSIFVDPREF 403

BLAST of CmaCh20G009860 vs. TrEMBL
Match: M5W906_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021292mg PE=4 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 6.7e-164
Identity = 292/424 (68.87%), Positives = 332/424 (78.30%), Query Frame = 1

Query: 9   KRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFL 68
           +R NNI +++REFDP+KD   VE+VERRCEVG       SLFTDLLGDPICRVRHSPA+L
Sbjct: 7   RRVNNI-VVLREFDPSKDCEGVEEVERRCEVGP--GGELSLFTDLLGDPICRVRHSPAYL 66

Query: 69  MLVAEIAAHNE----IVGMIRGCIKTVTCCSKSTRNSGINAS----------DLPHLAPV 128
           MLVAE     +    +VGMIRGCIKTVTC  K +RN G N +          D     PV
Sbjct: 67  MLVAEQVGEEQEEKQVVGMIRGCIKTVTCGKKLSRN-GKNVTHHNKNDDVLDDTLKPLPV 126

Query: 129 YTKLAYILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKC 188
           YTKLAYILGLRVSP HRR+GIGL LV R+EEWFRE  AEYSY+AT+ DN  SI LFT KC
Sbjct: 127 YTKLAYILGLRVSPSHRRMGIGLKLVHRVEEWFRENGAEYSYMATDNDNKPSINLFTDKC 186

Query: 189 GYFKFRTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSN 248
           GY KFRTP+ILVNPVFAHR++LS  V +++L  + AE+LYR RFA  EFFPRDIDAVL+N
Sbjct: 187 GYSKFRTPAILVNPVFAHRVKLSSGVHVIKLSPSDAESLYRRRFATTEFFPRDIDAVLNN 246

Query: 249 RLSLGTFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAM 308
           RLSLGTFLAVPRG+FT G W  SD+FL+  PESWAVLS WNCKD Y LEVRG S VKR +
Sbjct: 247 RLSLGTFLAVPRGTFTAGNWPGSDQFLADPPESWAVLSVWNCKDAYTLEVRGASRVKRTL 306

Query: 309 AKFSRVIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERG 368
           AK +R++DR LPWLRLPSVPE+F PFG  FLYG+GG GP A K +KALC HAHNLAKERG
Sbjct: 307 AKTTRIVDRALPWLRLPSVPELFRPFGFHFLYGLGGSGPRAEKFVKALCDHAHNLAKERG 366

Query: 369 CGVVATEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVD 419
           CGVVATEVS+ EPL+  IPHWK+LSC EDLWCIKRLGE Y+D S+GDWTK+PP  SIFVD
Sbjct: 367 CGVVATEVSSREPLRLGIPHWKRLSCDEDLWCIKRLGEDYSDGSVGDWTKAPPGMSIFVD 426

BLAST of CmaCh20G009860 vs. TrEMBL
Match: A0A0D2NVN0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G029400 PE=4 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 1.5e-163
Identity = 280/403 (69.48%), Positives = 327/403 (81.14%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++RE+DP++D  +VE+VE+RCEVG +  K  SLFTDLLGDPICRVRHSPAFLMLVAE++
Sbjct: 8   IVVREYDPSRDITSVEEVEKRCEVGPSSGK-LSLFTDLLGDPICRVRHSPAFLMLVAELS 67

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGI 135
           +  EIVGMIRGCIKTVTC  K +RNS  N      L PVYTKLAYILGLRVSP HRR+GI
Sbjct: 68  STKEIVGMIRGCIKTVTCGKKISRNSKNNDPIATKLVPVYTKLAYILGLRVSPSHRRMGI 127

Query: 136 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 195
           GL LV  ME+WF +  AEYSYIATE DN ASI LFT KCGY +FRTP+ILVNPVFAHRL 
Sbjct: 128 GLKLVVTMEDWFTQNGAEYSYIATENDNRASINLFTDKCGYSRFRTPAILVNPVFAHRLT 187

Query: 196 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGLWV 255
           +S+QVT++ L L+ AE LYR RF+  EFFPRDID+VL+NRL+LGTFLAVPRG +T   W 
Sbjct: 188 VSNQVTVIELSLSDAELLYRHRFSTVEFFPRDIDSVLNNRLNLGTFLAVPRGFYTRESWS 247

Query: 256 ESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSVPE 315
            SD+FLS  PESWAVLS WNCKDV+ LEVRG S  ++ +AK +RV+D+ LP+LRLPS+PE
Sbjct: 248 GSDKFLSDPPESWAVLSVWNCKDVFRLEVRGASRTRKTLAKTTRVVDKLLPFLRLPSIPE 307

Query: 316 VFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIPHW 375
           VF PFG  FLYG+GGEGP A K + ALC HAHNLAKE+GC VVATEV+  EPLK  +PHW
Sbjct: 308 VFRPFGFHFLYGLGGEGPRAAKFVNALCAHAHNLAKEKGCSVVATEVAKHEPLKDGVPHW 367

Query: 376 KKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           K+LSC  DLWCIKRLGE Y+D S+GDWTKSPP  S+FVDPREF
Sbjct: 368 KRLSCDHDLWCIKRLGEDYSDGSVGDWTKSPPEPSLFVDPREF 409

BLAST of CmaCh20G009860 vs. TAIR10
Match: AT4G37580.1 (AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 546.2 bits (1406), Expect = 1.8e-155
Identity = 265/406 (65.27%), Positives = 318/406 (78.33%), Query Frame = 1

Query: 17  LIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIAA 76
           ++RE+DPT+D   VEDVERRCEVG +     SLFTDLLGDPICR+RHSP++LMLVAE+  
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGK--LSLFTDLLGDPICRIRHSPSYLMLVAEMGT 62

Query: 77  HN-EIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGI 136
              EIVGMIRGCIKTVTC  K   N   + S    + P+YTKLAY+LGLRVSP HRR GI
Sbjct: 63  EKKEIVGMIRGCIKTVTCGQKLDLN---HKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGI 122

Query: 137 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 196
           G  LV+ MEEWFR+  AEYSYIATE DN AS+ LFT KCGY +FRTPSILVNPV+AHR+ 
Sbjct: 123 GFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVN 182

Query: 197 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFT---DG 256
           +S +VT+++LE   AETLYR RF+  EFFPRDID+VL+N+LSLGTF+AVPRGS      G
Sbjct: 183 VSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSG 242

Query: 257 LWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPS 316
            W  S +FL   PESWAVLS WNCKD + LEVRG S ++R +AK +RV+D+ LP+L+LPS
Sbjct: 243 SWPGSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPS 302

Query: 317 VPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDI 376
           +P VF PFG+ F+YG+GGEGP A K++K+LC HAHNLAK  GCGVVA EV+ ++PL+  I
Sbjct: 303 IPSVFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGI 362

Query: 377 PHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           PHWK LSC EDLWCIKRLG+ Y+D  +GDWTKSPP  SIFVDPREF
Sbjct: 363 PHWKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPPGVSIFVDPREF 403

BLAST of CmaCh20G009860 vs. TAIR10
Match: AT2G23060.1 (AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 539.3 bits (1388), Expect = 2.1e-153
Identity = 263/413 (63.68%), Positives = 320/413 (77.48%), Query Frame = 1

Query: 14  IQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAE 73
           + + +RE+DP+KD   VEDVERRCEVG       SLFTDLLGDPICRVRHSP++LMLVAE
Sbjct: 3   VLVEVREYDPSKDLATVEDVERRCEVGPAGK--LSLFTDLLGDPICRVRHSPSYLMLVAE 62

Query: 74  IAAHN--EIVGMIRGCIKTVTCCSKSTR---NSGINASDLPHLAPVYTKLAYILGLRVSP 133
           I      E+VGMIRGCIKTVTC   + R       + +D+    P+YTKLAYILGLRVSP
Sbjct: 63  IGPKEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSP 122

Query: 134 DHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNP 193
            HRR GIG  LV+ ME+WF +  AEYSY ATE DN AS+ LFT KCGY +FRTPSILVNP
Sbjct: 123 THRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNP 182

Query: 194 VFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGS 253
           V+AHR+ +S +VT+++LE + AE LYR RF+  EFFPRDID+VL+N+LSLGTF+AVPRGS
Sbjct: 183 VYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGS 242

Query: 254 -FTDGL--WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGL 313
            +  G   W  S +FL   P+SWAVLS WNCKD + LEVRG S ++R ++K +R++D+ L
Sbjct: 243 CYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTL 302

Query: 314 PWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSND 373
           P+L++PS+P VF PFG+ F+YG+GGEGP A K++KALC HAHNLAKE GCGVVA EV+ +
Sbjct: 303 PFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGE 362

Query: 374 EPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           EPL+  IPHWK LSC EDLWCIKRLGE Y+D S+GDWTKSPP  SIFVDPREF
Sbjct: 363 EPLRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of CmaCh20G009860 vs. TAIR10
Match: AT5G67430.1 (AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 429.9 bits (1104), Expect = 1.8e-120
Identity = 223/408 (54.66%), Positives = 288/408 (70.59%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           +++RE+DP +D  +VE++E  CEVGS       L  DL+GDP+ R+R SP+F MLVAEI 
Sbjct: 8   VVVREYDPKRDLTSVEELEESCEVGS-------LLVDLMGDPLARIRQSPSFHMLVAEIG 67

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINA-SDLPHLAPVY--TKLAYILGLRVSPDHRR 135
             NEIVGMIRG IK VT         G+NA      ++P    TKLA++ GLRVSP +RR
Sbjct: 68  --NEIVGMIRGTIKMVT--------RGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRR 127

Query: 136 LGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAH 195
           +GIGL LV+R+EEWF    A YSY+ TE DN+AS+KLFT K GY KFRTP+ LVNPVF H
Sbjct: 128 MGIGLKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNH 187

Query: 196 RLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDG 255
           R+ +S +V I++L  + AE+LYR RF+  EFFP DI+++L+N+LSLGT+LAVPRG     
Sbjct: 188 RVTVSRRVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRG----- 247

Query: 256 LWVESDRFLSCLPE---SWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLR 315
                D     LP+   SWAV+S WN KDVY L+V+G S +KR +AK +RV D   P+L+
Sbjct: 248 ----GDNVSGSLPDQTGSWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLK 307

Query: 316 LPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLK 375
           +PS P +F  F + F+YG+GGEGP A ++++ALC HAHNLA++ GC VVA EV++ EPL+
Sbjct: 308 IPSFPNLFKSFAMHFMYGIGGEGPRAAEMVEALCSHAHNLARKSGCAVVAAEVASCEPLR 367

Query: 376 SDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPRE 418
             IPHWK LS PEDLWC+KRL   Y D+ + DWTKSPP  SIFVDPRE
Sbjct: 368 VGIPHWKVLS-PEDLWCLKRLR--YDDDGV-DWTKSPPGLSIFVDPRE 385

BLAST of CmaCh20G009860 vs. TAIR10
Match: AT2G30090.1 (AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 295.8 bits (756), Expect = 4.1e-80
Identity = 175/412 (42.48%), Positives = 239/412 (58.01%), Query Frame = 1

Query: 15  QILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEI 74
           +++IR +D  +DR  +  +E+ CE+G   +    LFTD LGDPICR+R+SP F+MLVA +
Sbjct: 12  EVVIRCYDDRRDRIQMGRMEKSCEIGH--DHQTLLFTDTLGDPICRIRNSPFFIMLVAGV 71

Query: 75  AAHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLG 134
              N++VG I+G +K V    KS R                  + Y+LGLRV P +RR G
Sbjct: 72  G--NKLVGSIQGSVKPVEFHDKSVR------------------VGYVLGLRVVPSYRRRG 131

Query: 135 IGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHR- 194
           IG  LVR++EEWF    A+Y+Y+ATE DN AS  LF  + GY  FR P+ILVNPV   R 
Sbjct: 132 IGSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRG 191

Query: 195 LRLSDQVTILRLELTVAETLYRCRFAA-AEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDG 254
           L+L   + I +L++  AE+LYR   AA  EFFP DI+ +L N+LS+GT++A         
Sbjct: 192 LKLPSDIGIRKLKVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNN----- 251

Query: 255 LWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPS 314
             V++ R       SWA+LS W+   V+ L +    +    + K S++    L  L L  
Sbjct: 252 --VDNTR-------SWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTV 311

Query: 315 VPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKER---GCGVVATEV----SND 374
           +P++FTPFG  FLYGV  EGP  GKL++ALC H HN+A       C VV  EV    + D
Sbjct: 312 LPDLFTPFGFYFLYGVHSEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGSNGD 371

Query: 375 EPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPRE 418
           + L+  IPHWK LSC +D+WCIK L +C   N      +S    S+FVDPRE
Sbjct: 372 DSLQRCIPHWKMLSCDDDMWCIKPL-KC-EKNKFDLSERSKSRSSLFVDPRE 385

BLAST of CmaCh20G009860 vs. NCBI nr
Match: gi|567866909|ref|XP_006426077.1| (hypothetical protein CICLE_v10025754mg [Citrus clementina])

HSP 1 Score: 599.7 bits (1545), Expect = 3.8e-168
Identity = 292/405 (72.10%), Positives = 330/405 (81.48%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP KD   VEDVERRCEVG +      LFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 6   IVVREFDPNKDCLGVEDVERRCEVGPSGK--LCLFTDLLGDPICRVRHSPAFLMLVAEVG 65

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDL--PHLAPVYTKLAYILGLRVSPDHRRL 135
             +EIVGMIRGCIKTVTC  + +RN+    +D+  P   PVYTKLAYILGLRVSP HRR+
Sbjct: 66  --DEIVGMIRGCIKTVTCGKRISRNTKYTTNDIEPPKPLPVYTKLAYILGLRVSPSHRRM 125

Query: 136 GIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHR 195
           GIGL LV+RMEEWFRE   EYSYIATE DN AS+KLFT KCGY KFRTPSILVNPVFAHR
Sbjct: 126 GIGLKLVKRMEEWFRESGVEYSYIATENDNYASVKLFTDKCGYSKFRTPSILVNPVFAHR 185

Query: 196 LRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGL 255
           L +  QVTI++L  + AE  YR +F+  EFFPRDID+VL+N+L+LGTFLAVPRG+++   
Sbjct: 186 LIVPKQVTIIQLNPSDAEAFYRRKFSTTEFFPRDIDSVLNNKLNLGTFLAVPRGTYSPDS 245

Query: 256 WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSV 315
           W  SD F SC PESWA+LS WN KDV+ LEVRG S VKR +AK +RV+DR LPWLR+PSV
Sbjct: 246 WAGSDSFFSCPPESWAILSVWNSKDVFKLEVRGASRVKRTLAKTTRVVDRVLPWLRIPSV 305

Query: 316 PEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIP 375
           PEVF+PFG+ FLYG+GGEGP A KL+KALC HAHNLAKERGCGVVATEVS+ EPLK  IP
Sbjct: 306 PEVFSPFGLHFLYGLGGEGPRAAKLVKALCGHAHNLAKERGCGVVATEVSSREPLKLGIP 365

Query: 376 HWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           HWK LSC EDLWCIKRLGE Y+D SLGDWTKSPP  SIFVDPREF
Sbjct: 366 HWKMLSCDEDLWCIKRLGEDYSDGSLGDWTKSPPGLSIFVDPREF 406

BLAST of CmaCh20G009860 vs. NCBI nr
Match: gi|641860259|gb|KDO78948.1| (hypothetical protein CISIN_1g014721mg [Citrus sinensis])

HSP 1 Score: 597.8 bits (1540), Expect = 1.4e-167
Identity = 290/405 (71.60%), Positives = 329/405 (81.23%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP KD   VEDVERRCEVG +      LFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 19  IVVREFDPNKDCLGVEDVERRCEVGPSGK--LCLFTDLLGDPICRVRHSPAFLMLVAEVG 78

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDL--PHLAPVYTKLAYILGLRVSPDHRRL 135
             +EIVGMIRGCIKTVTC  + +RN+    +D+  P   PVYTKLAYILGLRVSP HRR+
Sbjct: 79  --DEIVGMIRGCIKTVTCGKRISRNTKYTTNDIEPPKPLPVYTKLAYILGLRVSPSHRRM 138

Query: 136 GIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHR 195
           GIGL LV+RMEEWFRE   EYSYIATE DN AS+KLFT KCGY KFRTPSILVNPVFAHR
Sbjct: 139 GIGLKLVKRMEEWFRESGVEYSYIATENDNYASVKLFTDKCGYSKFRTPSILVNPVFAHR 198

Query: 196 LRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGL 255
           L +  QVTI++L  + AE  YR +F+  EFFPRDID+VL+N+L+LGTFLAVPRG+++   
Sbjct: 199 LIVPKQVTIIQLNPSDAEAFYRRKFSTTEFFPRDIDSVLNNKLNLGTFLAVPRGTYSPDS 258

Query: 256 WVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSV 315
           W  SD F SC PESWA+LS WN KDV+ LEVRG S VKR +AK +R++DR  PWLR+PSV
Sbjct: 259 WAGSDSFFSCPPESWAILSVWNSKDVFKLEVRGASRVKRTLAKTTRIVDRVFPWLRIPSV 318

Query: 316 PEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIP 375
           PEVF+PFG+ FLYG+GGEGP A KL+KALC HAHNLAKERGCGVVATEVS+ EPLK  IP
Sbjct: 319 PEVFSPFGLHFLYGLGGEGPRAAKLVKALCGHAHNLAKERGCGVVATEVSSREPLKLGIP 378

Query: 376 HWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           HWK LSC EDLWCIKRLGE Y+D SLGDWTKSPP  SIFVDPREF
Sbjct: 379 HWKMLSCDEDLWCIKRLGEDYSDGSLGDWTKSPPGLSIFVDPREF 419

BLAST of CmaCh20G009860 vs. NCBI nr
Match: gi|568824174|ref|XP_006466477.1| (PREDICTED: probable N-acetyltransferase HLS1 [Citrus sinensis])

HSP 1 Score: 595.5 bits (1534), Expect = 7.2e-167
Identity = 289/403 (71.71%), Positives = 327/403 (81.14%), Query Frame = 1

Query: 18  IREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIAAH 77
           +REFDP KD   VEDVERRCEVG +      LFTDLLGDPICRVRHSPAFLMLVAE+   
Sbjct: 21  VREFDPNKDCLGVEDVERRCEVGPSGK--LCLFTDLLGDPICRVRHSPAFLMLVAEVG-- 80

Query: 78  NEIVGMIRGCIKTVTCCSKSTRNSGINASDL--PHLAPVYTKLAYILGLRVSPDHRRLGI 137
           +EIVGMIRGCIKTVTC  + +RN+    +D+  P   PVYTKLAYILGLRVSP HRR+GI
Sbjct: 81  DEIVGMIRGCIKTVTCGKRISRNTKYTTNDIEPPKPLPVYTKLAYILGLRVSPSHRRMGI 140

Query: 138 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 197
           GL LV+RMEEWFRE   EYSYIATE DN AS+KLFT KCGY KFRTPSILVNPVFAHRL 
Sbjct: 141 GLKLVKRMEEWFRESGVEYSYIATENDNYASVKLFTDKCGYSKFRTPSILVNPVFAHRLI 200

Query: 198 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGLWV 257
           +  QVTI++L  + AE  YR +F+  EFFPRDID+VL+N+L+LGTFLAVPRG+++   W 
Sbjct: 201 VPKQVTIIQLNPSDAEAFYRRKFSTTEFFPRDIDSVLNNKLNLGTFLAVPRGTYSPDSWA 260

Query: 258 ESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSVPE 317
            SD F SC PESWA+LS WN KDV+ LEVRG S VKR +AK +R++DR  PWLR+PSVPE
Sbjct: 261 GSDSFFSCPPESWAILSVWNSKDVFKLEVRGASRVKRTLAKTTRIVDRVFPWLRIPSVPE 320

Query: 318 VFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIPHW 377
           VF+PFG+ FLYG+GGEGP A KL+KALC HAHNLAKERGCGVVATEVS+ EPLK  IPHW
Sbjct: 321 VFSPFGLHFLYGLGGEGPRAAKLVKALCGHAHNLAKERGCGVVATEVSSREPLKLGIPHW 380

Query: 378 KKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           K LSC EDLWCIKRLGE Y+D SLGDWTKSPP  SIFVDPREF
Sbjct: 381 KMLSCDEDLWCIKRLGEDYSDGSLGDWTKSPPGLSIFVDPREF 419

BLAST of CmaCh20G009860 vs. NCBI nr
Match: gi|255568571|ref|XP_002525259.1| (PREDICTED: probable N-acetyltransferase HLS1 [Ricinus communis])

HSP 1 Score: 591.3 bits (1523), Expect = 1.3e-165
Identity = 291/403 (72.21%), Positives = 331/403 (82.13%), Query Frame = 1

Query: 16  ILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFLMLVAEIA 75
           I++REFDP++DR  VE+VERRCEVG +     SLFTDLLGDPICRVRHSPAFLMLVAE+ 
Sbjct: 7   IVVREFDPSRDRVGVEEVERRCEVGPSGK--LSLFTDLLGDPICRVRHSPAFLMLVAELG 66

Query: 76  AHNEIVGMIRGCIKTVTCCSKSTRNSGINASDLPHLAPVYTKLAYILGLRVSPDHRRLGI 135
              EIVGMIRGCIKTVTC  K +R+  +  +D P   PVYTK+AYILGLRVSP HRR+GI
Sbjct: 67  --EEIVGMIRGCIKTVTCGRKLSRH--VKNNDPPKPLPVYTKVAYILGLRVSPSHRRMGI 126

Query: 136 GLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKCGYFKFRTPSILVNPVFAHRLR 195
           GL LVR +EEWFRE  AEYSY+ATE DN AS+KLFT KCGY KFRTPSILVNPVFAHRL 
Sbjct: 127 GLKLVRTIEEWFRENGAEYSYLATENDNHASVKLFTDKCGYTKFRTPSILVNPVFAHRLA 186

Query: 196 LSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSNRLSLGTFLAVPRGSFTDGLWV 255
           +S++VTI +L    AE LYR RFA  EFFPRDID+VL+N+LSLGTFLAVPRGS+T   W 
Sbjct: 187 VSNRVTIFKLPPNDAELLYRRRFATTEFFPRDIDSVLNNKLSLGTFLAVPRGSYTHNSWP 246

Query: 256 ESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAMAKFSRVIDRGLPWLRLPSVPE 315
             D+FLS  PESWAVLS WNCKDV+ LEVRG S VKR  AK +R++D+ LP+L+LPSVPE
Sbjct: 247 GFDKFLSDPPESWAVLSVWNCKDVFRLEVRGASRVKRTFAKTTRIVDKALPFLKLPSVPE 306

Query: 316 VFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERGCGVVATEVSNDEPLKSDIPHW 375
           +F PFG+ FLYGVGGEGP A K++KALC HAHNLAKERGCGVVATEVS+ EPLK  IP+W
Sbjct: 307 LFRPFGLHFLYGVGGEGPHAVKMVKALCAHAHNLAKERGCGVVATEVSSCEPLKLGIPYW 366

Query: 376 KKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVDPREF 419
           K LSC EDLWCIKRLGE Y+D S+GDWTKSPP  SIFVDPREF
Sbjct: 367 KMLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGVSIFVDPREF 403
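
The HSP summary lines in these alignments report identity and positives as a count over the alignment length, followed by a percentage. As a minimal sketch (the function names `parse_hsp_stats` and `recomputed_pct` are hypothetical helpers, not part of any BLAST tool), the following Python parses such a line and recomputes the percentages from the counts. The pattern also accepts the "Postives" spelling that appears in some exports of this report.

```python
import re

# Matches lines such as:
#   Identity = 291/403 (72.21%), Positives = 331/403 (82.13%), Query Frame = 1
# "Posi?tives" tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from one line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 291/403 (72.21%), Positives = 331/403 (82.13%), Query Frame = 1"
    stats = parse_hsp_stats(line)
    print(recomputed_pct(stats["identities"], stats["length"]))
    print(recomputed_pct(stats["positives"], stats["length"]))
```

Recomputing the percentage from the counts is a cheap consistency check when these values are later copied into summary tables.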

BLAST of CmaCh20G009860 vs. NCBI nr
Match: gi|595843428|ref|XP_007208612.1| (hypothetical protein PRUPE_ppa021292mg [Prunus persica])

HSP 1 Score: 585.1 bits (1507), Expect = 9.7e-164
Identity = 292/424 (68.87%), Positives = 332/424 (78.30%), Query Frame = 1

Query: 9   KRKNNIQILIREFDPTKDRPAVEDVERRCEVGSTKNKNFSLFTDLLGDPICRVRHSPAFL 68
           +R NNI +++REFDP+KD   VE+VERRCEVG       SLFTDLLGDPICRVRHSPA+L
Sbjct: 7   RRVNNI-VVLREFDPSKDCEGVEEVERRCEVGP--GGELSLFTDLLGDPICRVRHSPAYL 66

Query: 69  MLVAEIAAHNE----IVGMIRGCIKTVTCCSKSTRNSGINAS----------DLPHLAPV 128
           MLVAE     +    +VGMIRGCIKTVTC  K +RN G N +          D     PV
Sbjct: 67  MLVAEQVGEEQEEKQVVGMIRGCIKTVTCGKKLSRN-GKNVTHHNKNDDVLDDTLKPLPV 126

Query: 129 YTKLAYILGLRVSPDHRRLGIGLNLVRRMEEWFREKKAEYSYIATEIDNVASIKLFTHKC 188
           YTKLAYILGLRVSP HRR+GIGL LV R+EEWFRE  AEYSY+AT+ DN  SI LFT KC
Sbjct: 127 YTKLAYILGLRVSPSHRRMGIGLKLVHRVEEWFRENGAEYSYMATDNDNKPSINLFTDKC 186

Query: 189 GYFKFRTPSILVNPVFAHRLRLSDQVTILRLELTVAETLYRCRFAAAEFFPRDIDAVLSN 248
           GY KFRTP+ILVNPVFAHR++LS  V +++L  + AE+LYR RFA  EFFPRDIDAVL+N
Sbjct: 187 GYSKFRTPAILVNPVFAHRVKLSSGVHVIKLSPSDAESLYRRRFATTEFFPRDIDAVLNN 246

Query: 249 RLSLGTFLAVPRGSFTDGLWVESDRFLSCLPESWAVLSAWNCKDVYALEVRGVSVVKRAM 308
           RLSLGTFLAVPRG+FT G W  SD+FL+  PESWAVLS WNCKD Y LEVRG S VKR +
Sbjct: 247 RLSLGTFLAVPRGTFTAGNWPGSDQFLADPPESWAVLSVWNCKDAYTLEVRGASRVKRTL 306

Query: 309 AKFSRVIDRGLPWLRLPSVPEVFTPFGVLFLYGVGGEGPLAGKLMKALCHHAHNLAKERG 368
           AK +R++DR LPWLRLPSVPE+F PFG  FLYG+GG GP A K +KALC HAHNLAKERG
Sbjct: 307 AKTTRIVDRALPWLRLPSVPELFRPFGFHFLYGLGGSGPRAEKFVKALCDHAHNLAKERG 366

Query: 369 CGVVATEVSNDEPLKSDIPHWKKLSCPEDLWCIKRLGECYTDNSLGDWTKSPPSFSIFVD 419
           CGVVATEVS+ EPL+  IPHWK+LSC EDLWCIKRLGE Y+D S+GDWTK+PP  SIFVD
Sbjct: 367 CGVVATEVSSREPLRLGIPHWKRLSCDEDLWCIKRLGEDYSDGSVGDWTKAPPGMSIFVD 426

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
HLS1_ARATH   3.1e-154  65.27     Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1
HLS1L_ARATH  3.8e-152  63.68     Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2...

Match Name        E-value   Identity  Description
V4SFH0_9ROSI      2.6e-168  72.10     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025754mg PE=4 SV=1
A0A067GGV7_CITSI  1.0e-167  71.60     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g014721mg PE=4 SV=1
B9SGZ0_RICCO      9.4e-166  72.21     N-acetyltransferase, putative OS=Ricinus communis GN=RCOM_0578960 PE=4 SV=1
M5W906_PRUPE      6.7e-164  68.87     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021292mg PE=4 SV=1
A0A0D2NVN0_GOSRA  1.5e-163  69.48     Uncharacterized protein OS=Gossypium raimondii GN=B456_003G029400 PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G37580.1  1.8e-155  65.27     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1  2.1e-153  63.68     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1  1.8e-120  54.66     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G30090.1  4.1e-80   42.48     Acyl-CoA N-acyltransferases (NAT) superfamily protein

Match Name                        E-value   Identity  Description
gi|567866909|ref|XP_006426077.1|  3.8e-168  72.10     hypothetical protein CICLE_v10025754mg [Citrus clementina]
gi|641860259|gb|KDO78948.1|       1.4e-167  71.60     hypothetical protein CISIN_1g014721mg [Citrus sinensis]
gi|568824174|ref|XP_006466477.1|  7.2e-167  71.71     PREDICTED: probable N-acetyltransferase HLS1 [Citrus sinensis]
gi|255568571|ref|XP_002525259.1|  1.3e-165  72.21     PREDICTED: probable N-acetyltransferase HLS1 [Ricinus communis]
gi|595843428|ref|XP_007208612.1|  9.7e-164  68.87     hypothetical protein PRUPE_ppa021292mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000182  GNAT_dom
IPR016181  Acyl_CoA_acyltransferase
Vocabulary: Molecular Function
Term        Definition
GO:0008080  N-acetyltransferase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042967 acyl-carrier-protein biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008080 N-acetyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G009860.1  CmaCh20G009860.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description             Source   Source Term        Source Description                     Alignment
IPR000182  GNAT domain                 PFAM     PF00583            Acetyltransf_1                         coord: 80..176, score: 3.4
IPR000182  GNAT domain                 PROFILE  PS51186            GNAT                                   coord: 16..197, score: 14
IPR016181  Acyl-CoA N-acyltransferase  GENE3D   G3DSA:3.40.630.30                                         coord: 16..170, score: 1.8
IPR016181  Acyl-CoA N-acyltransferase  unknown  SSF55729           Acyl-CoA N-acyltransferases (Nat)      coord: 16..177, score: 1.17
None       No IPR available            PANTHER  PTHR23091          N-TERMINAL ACETYLTRANSFERASE           coord: 113..229, score: 1.3E-112; coord: 15..90, score: 1.3E
None       No IPR available            PANTHER  PTHR23091:SF211    N-ACETYLTRANSFERASE HLS1-LIKE-RELATED  coord: 113..229, score: 1.3E-112; coord: 15..90, score: 1.3E

The following gene(s) are paralogous to this gene:

None