Cp4.1LG03g08870 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g08870
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Acyl-CoA N-acyltransferase (NAT) superfamily protein
Location: Cp4.1LG03 : 5840225 .. 5842605 (-)
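
As a rough illustration of how the Location field can be read, the sketch below computes the span of the feature and shows a reverse-complement helper for minus-strand features. It assumes the coordinates are 1-based and inclusive (the usual GFF convention); this record does not state that explicitly. The function names are hypothetical and not part of any database tool.

```python
# Sketch only: assumes 1-based, inclusive coordinates (GFF-style).
# Function names are illustrative, not part of any database API.

def feature_length(start: int, end: int) -> int:
    """Number of bases covered by a feature with inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string, e.g. to read a (-) strand feature."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


# Coordinates copied from the Location line of this record.
print(feature_length(5840225, 5842605))
print(reverse_complement("ATGC"))
```

Under that assumption the feature spans 2381 bases on the reference. Whether the sequences listed in this record are already given in gene orientation is not stated here, so the reverse-complement step may or may not be needed.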
The following sequences are available for this feature:

Gene sequence (with intron)

TTGAAGACATGTCTGTGAAACAAAGTTGGTGACACAGACCCGAATCCACCAAAATTTCAAATCAAATTCTCTTTTCCCCTAAAAAGAAATGTTTTTCACAACTCGTCTTCCCATACACCGCCTTAATTTCCACGAACTTATCTCAACGCAACAAATCTAGTTGTTTATGTTGAGAGGGAGTCCCACGTCGGCCAATTAAGAGTTTGATCATAATACATATTCACTGTGGAGACCCATGATTCATTCCTTCGACACGATTGCCGAATGAAACAGCACCCGAACGAGATTGAGCGGATCATAATTGGCATCACCATGGGCATCATGAAGGTCTCATCATAGGCACCAAGTGGGGTTAAAATATATAAATGCGAGGGGCTTTTTGAAGTATCTCTCTGTTTTATAACCTTATATTCTTGTTGAATATAAAGGAGGTAATGAAACAATAATTGAAGAAATAATTAAATGATTCATAGTCATTAGGAGAACATGATTAATGAATTCAATCCATCTTAAAACGTGGATTAAGGTGAGACCCACTTAATTAGAGTTTCTAAAATGGTGTATAAATAAGGGAAATGGTGGGGTTGAAGCTCCAAACACAATCCCCTAAAACTAGAGCTTTCCTCCTCTGTCAAACAGAGCTATGGTGCAGAAGAAGATAGCTGATGAATATAGGCTGCAAATCATTACAGAGCAAAATAGTAATAATTTGATTGTAGTGAGAGAATACTGTGAAGAGAGGGATAAGGTGTCGGTGGAGAAGCTGGAGAGGCAGTGTGATGTCGGACAGAAAGGGAAGGCTTCTATTTTCACTGATCTTCTGGGTGATCCTATTTGCCGCATTCGCCACTTCCCTTTGCACCTCATGCTGGTAAATGCTCATTTATTTGTATGTGTTTTTAAAGTGGTACCTTTTAAATAGGAAGCTTTTTTGGGAATTGTGTCACAGGTTGCAGAGTATGGAGATGCTAGAGATATCGTTGGTGTTATAAGAGGATGTATCAAACATGTCACAACAGGCCATTCTCATCATCCCCTCGACCTCGCTTATATTTTGGGCTTAAGGGTCTCTACTACTCACAGGTCTGTCGATGTTTCTTACTATTTGGAATCACGAATCTTTACAATTGTATGATATTGTTAAACGTTATGCAATGCAGGAGGCTAGGAGTTGGCACAAAATTAGTTCAACATCTAGAGAAATGGTGCAAGCAAAAGGGTGCAGACTATGCATATATAGCAACAGAATGCGCTAACCAGCCCTGCATCAACATGTTTACACACAAGTTTTCATACACAAAATTCAGATCACCCACAGTGTTGGTGCAGCCTGTCCATGCTCACTACAAGCCAATAAGCTCAGGGGTTGCCATTGTGCGCATTCCCCCGCACGTTGTAGTTAAAATTTATCGCCAACTCTTTGCAAATGCGGAGCTCTTCCCCATGGACATTGATGCCATTTTGTCCAACAAGCTTAATTTGGGCACTTTCATGGCAGTTCCTAAGAAGCTGCTTCCTAATTGGAACCCTGAAACTGGGATTCTTCCTCAGAGCTTTGCAGTGTTGAGTGTGTGGAACACTAAAGAAGTTTTCAAGTTGCAGGTTAAGGGAGTGTCTAAGCTAACTTATGCATGTTGCATGGGGAGTAGAGTGTTGGATTCATGGGTACCATGGCTTAGATTACCTTCATTTCCAGATGTGTTCAGTCAATTTGGGGTGTATTTCTTGTATGGACTATCCATGAGAGGAAGTGATGGGCCACGGCTGATGAAGTTTCTATGTAGATTCGTGCATAACATGGCTAAGGATGATGTGGGATGTGGGGCGGTGGTAGCAGAGGTAGGCCAACAGGATCCTGTGAGAGCCGCCATACCCCATTGGAGAAGATTTTCATGGAGTGAAGATTTATGGTGCATCAAGAAGCTGAGAGATTTGGAAGGGGATGAGTGTAATTGGATCAAATCTCCACCTTCATCAGCAGGGATATTTGTTGATCCTAG
AGACATCTAACTATATCTTTCTTTCTCCCACATCAGTACACAAAATCATAAAAGGAATGTTCATAAATCTCTAGCTTACTCTCCTTTTCCTTTCTCTTTTTTCTTTTCAATGCTCTTTCACTCGGTGTCTCCGGTTAAAATCGCACGACAACGCACACATTCGATCAATATGGAGTATATATATATATGGTTTCAGTTTACTAGATATAGTCTTAACAGATTTAAGCACATCTCTGCTTGGGATTCCAGACCCACTTAGCTGATTAAAGCAGAGGCGGGGGAATTGGCAAGTGGCAATTATGGAAAGAATATGGAGCATTGACAGAATCAGAAGCTTGCTAGAGGCTATAGAATAGGCTTTTTAGTTCAGGCATCCCTAAG

mRNA sequence

TTGAAGACATGTCTGTGAAACAAAGTTGGTGACACAGACCCGAATCCACCAAAATTTCAAATCAAATTCTCTTTTCCCCTAAAAAGAAATGTTTTTCACAACTCGTCTTCCCATACACCGCCTTAATTTCCACGAACTTATCTCAACGCAACAAATCTAGTTGTTTATGTTGAGAGGGAGTCCCACGTCGGCCAATTAAGAGTTTGATCATAATACATATTCACTGTGGAGACCCATGATTCATTCCTTCGACACGATTGCCGAATGAAACAGCACCCGAACGAGATTGAGCGGATCATAATTGGCATCACCATGGGCATCATGAAGGTCTCATCATAGGCACCAAGTGGGGTTAAAATATATAAATGCGAGGGGCTTTTTGAAGTATCTCTCTGTTTTATAACCTTATATTCTTGTTGAATATAAAGGAGGTAATGAAACAATAATTGAAGAAATAATTAAATGATTCATAGTCATTAGGAGAACATGATTAATGAATTCAATCCATCTTAAAACGTGGATTAAGGTGAGACCCACTTAATTAGAGTTTCTAAAATGGTGTATAAATAAGGGAAATGGTGGGGTTGAAGCTCCAAACACAATCCCCTAAAACTAGAGCTTTCCTCCTCTGTCAAACAGAGCTATGGTGCAGAAGAAGATAGCTGATGAATATAGGCTGCAAATCATTACAGAGCAAAATAGTAATAATTTGATTGTAGTGAGAGAATACTGTGAAGAGAGGGATAAGGTGTCGGTGGAGAAGCTGGAGAGGCAGTGTGATGTCGGACAGAAAGGGAAGGCTTCTATTTTCACTGATCTTCTGGGTGATCCTATTTGCCGCATTCGCCACTTCCCTTTGCACCTCATGCTGGTTGCAGAGTATGGAGATGCTAGAGATATCGTTGGTGTTATAAGAGGATGTATCAAACATGTCACAACAGGCCATTCTCATCATCCCCTCGACCTCGCTTATATTTTGGGCTTAAGGGTCTCTACTACTCACAGGAGGCTAGGAGTTGGCACAAAATTAGTTCAACATCTAGAGAAATGGTGCAAGCAAAAGGGTGCAGACTATGCATATATAGCAACAGAATGCGCTAACCAGCCCTGCATCAACATGTTTACACACAAGTTTTCATACACAAAATTCAGATCACCCACAGTGTTGGTGCAGCCTGTCCATGCTCACTACAAGCCAATAAGCTCAGGGGTTGCCATTGTGCGCATTCCCCCGCACGTTGTAGTTAAAATTTATCGCCAACTCTTTGCAAATGCGGAGCTCTTCCCCATGGACATTGATGCCATTTTGTCCAACAAGCTTAATTTGGGCACTTTCATGGCAGTTCCTAAGAAGCTGCTTCCTAATTGGAACCCTGAAACTGGGATTCTTCCTCAGAGCTTTGCAGTGTTGAGTGTGTGGAACACTAAAGAAGTTTTCAAGTTGCAGGTTAAGGGAGTGTCTAAGCTAACTTATGCATGTTGCATGGGGAGTAGAGTGTTGGATTCATGGGTACCATGGCTTAGATTACCTTCATTTCCAGATGTGTTCAGTCAATTTGGGGTGTATTTCTTGTATGGACTATCCATGAGAGGAAGTGATGGGCCACGGCTGATGAAGTTTCTATGTAGATTCGTGCATAACATGGCTAAGGATGATGTGGGATGTGGGGCGGTGGTAGCAGAGGTAGGCCAACAGGATCCTGTGAGAGCCGCCATACCCCATTGGAGAAGATTTTCATGGAGTGAAGATTTATGGTGCATCAAGAAGCTGAGAGATTTGGAAGGGGATGAGTGTAATTGGATCAAATCTCCACCTTCATCAGCAGGGATATTTGTTGATCCTAGAGACATCTAACTATATCTTTCTTTCTCCCACATCAGTACACAAAATCATAAAAGGAATGTTCATAAATCTCTAGCTTACTCTCCTTTTCCTTTCTCTTTTTTCTTTTCAATGCTCTTTCACTCGGTGTCTCCGGTTAAAATCGCACGACAACGCA
CACATTCGATCAATATGGAGTATATATATATATGGTTTCAGTTTACTAGATATAGTCTTAACAGATTTAAGCACATCTCTGCTTGGGATTCCAGACCCACTTAGCTGATTAAAGCAGAGGCGGGGGAATTGGCAAGTGGCAATTATGGAAAGAATATGGAGCATTGACAGAATCAGAAGCTTGCTAGAGGCTATAGAATAGGCTTTTTAGTTCAGGCATCCCTAAG

Coding sequence (CDS)

ATGGTGCAGAAGAAGATAGCTGATGAATATAGGCTGCAAATCATTACAGAGCAAAATAGTAATAATTTGATTGTAGTGAGAGAATACTGTGAAGAGAGGGATAAGGTGTCGGTGGAGAAGCTGGAGAGGCAGTGTGATGTCGGACAGAAAGGGAAGGCTTCTATTTTCACTGATCTTCTGGGTGATCCTATTTGCCGCATTCGCCACTTCCCTTTGCACCTCATGCTGGTTGCAGAGTATGGAGATGCTAGAGATATCGTTGGTGTTATAAGAGGATGTATCAAACATGTCACAACAGGCCATTCTCATCATCCCCTCGACCTCGCTTATATTTTGGGCTTAAGGGTCTCTACTACTCACAGGAGGCTAGGAGTTGGCACAAAATTAGTTCAACATCTAGAGAAATGGTGCAAGCAAAAGGGTGCAGACTATGCATATATAGCAACAGAATGCGCTAACCAGCCCTGCATCAACATGTTTACACACAAGTTTTCATACACAAAATTCAGATCACCCACAGTGTTGGTGCAGCCTGTCCATGCTCACTACAAGCCAATAAGCTCAGGGGTTGCCATTGTGCGCATTCCCCCGCACGTTGTAGTTAAAATTTATCGCCAACTCTTTGCAAATGCGGAGCTCTTCCCCATGGACATTGATGCCATTTTGTCCAACAAGCTTAATTTGGGCACTTTCATGGCAGTTCCTAAGAAGCTGCTTCCTAATTGGAACCCTGAAACTGGGATTCTTCCTCAGAGCTTTGCAGTGTTGAGTGTGTGGAACACTAAAGAAGTTTTCAAGTTGCAGGTTAAGGGAGTGTCTAAGCTAACTTATGCATGTTGCATGGGGAGTAGAGTGTTGGATTCATGGGTACCATGGCTTAGATTACCTTCATTTCCAGATGTGTTCAGTCAATTTGGGGTGTATTTCTTGTATGGACTATCCATGAGAGGAAGTGATGGGCCACGGCTGATGAAGTTTCTATGTAGATTCGTGCATAACATGGCTAAGGATGATGTGGGATGTGGGGCGGTGGTAGCAGAGGTAGGCCAACAGGATCCTGTGAGAGCCGCCATACCCCATTGGAGAAGATTTTCATGGAGTGAAGATTTATGGTGCATCAAGAAGCTGAGAGATTTGGAAGGGGATGAGTGTAATTGGATCAAATCTCCACCTTCATCAGCAGGGATATTTGTTGATCCTAGAGACATCTAA

Protein sequence

MVQKKIADEYRLQIITEQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLGVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRFSWSEDLWCIKKLRDLEGDECNWIKSPPSSAGIFVDPRDI
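
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative only.

```python
# Sketch: translate a CDS with the standard genetic code (assumed, table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons enumerated in TCAG order line up with the amino-acid string.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGTGCAGAAGAAGATAGCTGATGAA"))
```

The first nine codons of the CDS give `MVQKKIADE`, matching the start of the protein sequence, and the final codons `AGA GAC ATC TAA` give `RDI` followed by a stop.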
BLAST of Cp4.1LG03g08870 vs. Swiss-Prot
Match: HLS1_ARATH (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 4.4e-105
Identity = 200/404 (49.50%), Positives = 267/404 (66.09%), Query Frame = 1

Query: 23  LIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG- 82
           + VVREY   RD V VE +ER+C+VG  GK S+FTDLLGDPICRIRH P +LMLVAE G 
Sbjct: 1   MTVVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 60

Query: 83  DARDIVGVIRGCIKHVTTGHS---HH--------PL--DLAYILGLRVSTTHRRLGVGTK 142
           + ++IVG+IRGCIK VT G     +H        PL   LAY+LGLRVS  HRR G+G K
Sbjct: 61  EKKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFK 120

Query: 143 LVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISS 202
           LV+ +E+W +Q GA+Y+YIATE  NQ  +N+FT K  Y++FR+P++LV PV+AH   +S 
Sbjct: 121 LVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNVSR 180

Query: 203 GVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLL-----PNWN 262
            V ++++ P     +YR  F+  E FP DID++L+NKL+LGTF+AVP+         +W 
Sbjct: 181 RVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWP 240

Query: 263 PETGIL---PQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPD 322
                L   P+S+AVLSVWN K+ F L+V+G S+L       +RV+D  +P+L+LPS P 
Sbjct: 241 GSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPS 300

Query: 323 VFSQFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPH 382
           VF  FG++F+YG+   G    +++K LC   HN+AK   GCG V AEV  +DP+R  IPH
Sbjct: 301 VFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAG-GCGVVAAEVAGEDPLRRGIPH 360

Query: 383 WRRFSWSEDLWCIKKLRD--LEGDECNWIKSPPSSAGIFVDPRD 403
           W+  S  EDLWCIK+L D   +G   +W KSPP    IFVDPR+
Sbjct: 361 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPP-GVSIFVDPRE 402

BLAST of Cp4.1LG03g08870 vs. Swiss-Prot
Match: HLS1L_ARATH (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 4.1e-103
Identity = 192/411 (46.72%), Positives = 265/411 (64.48%), Query Frame = 1

Query: 23  LIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG- 82
           L+ VREY   +D  +VE +ER+C+VG  GK S+FTDLLGDPICR+RH P +LMLVAE G 
Sbjct: 4   LVEVREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGP 63

Query: 83  -DARDIVGVIRGCIKHVTTGHSHHPLD-------------------LAYILGLRVSTTHR 142
            + +++VG+IRGCIK VT G +   LD                   LAYILGLRVS THR
Sbjct: 64  KEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHR 123

Query: 143 RLGVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHA 202
           R G+G KLV+ +E W  Q GA+Y+Y ATE  N   +N+FT K  Y +FR+P++LV PV+A
Sbjct: 124 RQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYA 183

Query: 203 HYKPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLL-- 262
           H   IS  V ++++ P     +YR  F+  E FP DID++L+NKL+LGTF+AVP+     
Sbjct: 184 HRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYG 243

Query: 263 ---PNWNPETGIL---PQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWL 322
               +W      L   P S+AVLSVWN K+ F+L+V+G S+L       +R++D  +P+L
Sbjct: 244 SGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFL 303

Query: 323 RLPSFPDVFSQFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDP 382
           ++PS P VF  FG++F+YG+   G    +++K LC   HN+AK+  GCG V AEV  ++P
Sbjct: 304 KIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEG-GCGVVAAEVAGEEP 363

Query: 383 VRAAIPHWRRFSWSEDLWCIKKLRD--LEGDECNWIKSPPSSAGIFVDPRD 403
           +R  IPHW+  S +EDLWCIK+L +   +G   +W KSPP  + IFVDPR+
Sbjct: 364 LRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDS-IFVDPRE 412

BLAST of Cp4.1LG03g08870 vs. TrEMBL
Match: A0A0A0L9Z5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G710850 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 9.6e-208
Identity = 343/406 (84.48%), Positives = 371/406 (91.38%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSN-NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDP 64
           KIADEYRLQ+ +    N NL+VVREYCEERDKVSVEK+ERQCDVGQKGK SIFTDLLGDP
Sbjct: 4   KIADEYRLQVESNTEENRNLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDP 63

Query: 65  ICRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRL 124
           ICR+RHFP H+MLVAEYG AR+IVGVIRGCIKHVTTGHSHH L LAYILGLRVSTTHRRL
Sbjct: 64  ICRVRHFPSHVMLVAEYGKAREIVGVIRGCIKHVTTGHSHHVLKLAYILGLRVSTTHRRL 123

Query: 125 GVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHY 184
           GVGTKLVQH+E+WCKQKGADYAYIAT+CANQP I++FT KF+YTKFRSPTVLVQPVHAHY
Sbjct: 124 GVGTKLVQHIEEWCKQKGADYAYIATDCANQPSISLFTQKFAYTKFRSPTVLVQPVHAHY 183

Query: 185 KPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWN 244
           KPI SG++IVR+PPHV VKIYR LFANAE F  DIDAIL NKLNLGTFMAVPKKLLP W+
Sbjct: 184 KPIGSGISIVRVPPHVAVKIYRHLFANAEFFAEDIDAILFNKLNLGTFMAVPKKLLPKWD 243

Query: 245 PETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFS 304
           PETGILPQSFAVLSVWNTKEVFKLQVKG+SKLTYACCMGSR+LDSW+PWLR+PSFPDVFS
Sbjct: 244 PETGILPQSFAVLSVWNTKEVFKLQVKGMSKLTYACCMGSRLLDSWLPWLRVPSFPDVFS 303

Query: 305 QFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRR 364
           QFGVYFLYGL+MRG++G RLMK LC FVHNMAKDDVGCGA+V EVGQQDPVR AIPHW+R
Sbjct: 304 QFGVYFLYGLTMRGTNGQRLMKSLCTFVHNMAKDDVGCGALVTEVGQQDPVRVAIPHWKR 363

Query: 365 FSWSEDLWCIKKLRDLEGDE------CNWIKSPPSSAGIFVDPRDI 404
            SW+EDLWCIKKL DLEGD       C+WIKSPPSSAGIFVDPRDI
Sbjct: 364 LSWNEDLWCIKKLTDLEGDNYEGSKTCDWIKSPPSSAGIFVDPRDI 409

BLAST of Cp4.1LG03g08870 vs. TrEMBL
Match: B9GMA0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03570g PE=4 SV=2)

HSP 1 Score: 574.7 bits (1480), Expect = 8.8e-161
Identity = 267/389 (68.64%), Positives = 318/389 (81.75%), Query Frame = 1

Query: 22  NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG 81
           N +VVREY E RDKV+VE++ER C+VGQ+GK S+ TDL+GDPICR+R FP H+MLVAE G
Sbjct: 20  NFVVVREYDEGRDKVAVEEMERSCEVGQRGKHSLVTDLMGDPICRVRRFPSHVMLVAECG 79

Query: 82  DARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLGVGTKLVQHLEKWCKQKG 141
           D  +IVGVIR C+  V T  S   + LAYILGLRVS +HRRLG+GTKLVQ +E+WCKQKG
Sbjct: 80  DGGEIVGVIRACVNTVRTRESSGYVKLAYILGLRVSPSHRRLGIGTKLVQEIEEWCKQKG 139

Query: 142 ADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISSGVAIVRIPPHVVV 201
           A+Y+Y+AT+C+N+P IN+FT K  YTKFR+ T+LVQPVHAHYKP+ SG+AI+++PP +  
Sbjct: 140 AEYSYMATDCSNEPSINLFTRKCFYTKFRTLTMLVQPVHAHYKPLGSGIAIIQLPPKLAE 199

Query: 202 KIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILPQSFAVLSVWNT 261
            IY ++FA+AE FP DI  ILS+KLNLGTFMAVPKK LP W+P+TGILP SFA+LSVWNT
Sbjct: 200 AIYCRVFADAEFFPKDIGTILSSKLNLGTFMAVPKKALPKWDPKTGILPSSFALLSVWNT 259

Query: 262 KEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGVYFLYGLSMRGSDGP 321
           KEVFKLQVKGVSKLTYACC G+R+LD+W+PWLRLPSFPDVF QFGVYFLYGL M G +  
Sbjct: 260 KEVFKLQVKGVSKLTYACCTGTRLLDAWMPWLRLPSFPDVFRQFGVYFLYGLHMEGKNAS 319

Query: 322 RLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRFSWSEDLWCIKKLRDLEG 381
           RLMK LC F HNMA+DD GCGAVVAEV Q+DPVR  IPHWRRFSW+EDLWCIKKL D + 
Sbjct: 320 RLMKALCAFAHNMARDDDGCGAVVAEVAQRDPVREVIPHWRRFSWAEDLWCIKKLADEKL 379

Query: 382 D-------ECNWIKSPPSSAGIFVDPRDI 404
           D       + +W+K   SS  IFVDPRDI
Sbjct: 380 DDVDRRCGQSDWMKHGSSSPVIFVDPRDI 408

BLAST of Cp4.1LG03g08870 vs. TrEMBL
Match: V4S0I4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025742mg PE=4 SV=1)

HSP 1 Score: 570.5 bits (1469), Expect = 1.7e-159
Identity = 262/405 (64.69%), Positives = 328/405 (80.99%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPI 64
           KIA E   +   +   N++++VREY EERDK+ VE++ER+C+ GQ+GK ++ TDL+GDP+
Sbjct: 4   KIAAENSPEFPMKTKVNSVVIVREYNEERDKLGVEEIERRCETGQRGKPTLVTDLMGDPV 63

Query: 65  CRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLG 124
           CR+RHFP H+ LVAEYG+ ++IVGVIRGC+K VTTG S+  + LAY+LGLRVS THRRLG
Sbjct: 64  CRVRHFPSHIALVAEYGEEKEIVGVIRGCVKTVTTGGSNF-VKLAYLLGLRVSPTHRRLG 123

Query: 125 VGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYK 184
           +GTKLVQ LE+WCKQ+GA+Y+Y+AT+C N+  IN+FT K SYTKFR+PT+LVQPVHAHYK
Sbjct: 124 IGTKLVQKLEEWCKQQGAEYSYMATDCGNEASINLFTRKCSYTKFRTPTMLVQPVHAHYK 183

Query: 185 PISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNP 244
           P+ +G++IVR+P      IYR++FAN+E FP DID ILS+ LNLGTFMAVPKK +P W+P
Sbjct: 184 PVGAGISIVRLPRKSAETIYRRVFANSEFFPKDIDLILSSNLNLGTFMAVPKKFVPRWDP 243

Query: 245 ETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQ 304
           +TGILP SFA+LSVWNTKEVFKLQ+KGVS L YA C+GSR+LD+W+PWLRLPSFPDVF Q
Sbjct: 244 KTGILPPSFAILSVWNTKEVFKLQLKGVSALKYAFCVGSRLLDAWMPWLRLPSFPDVFRQ 303

Query: 305 FGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRF 364
           FGVYFLYGL M G     LMK LC F HNMA+DD  CGA+VAEVG +DPVR  IPHWR+F
Sbjct: 304 FGVYFLYGLHMEGKHASLLMKSLCAFAHNMARDDGECGALVAEVGAKDPVRETIPHWRKF 363

Query: 365 SWSEDLWCIKKLRDLEGDE------CNWIKSPPSSAGIFVDPRDI 404
           SW+EDLWCIKK+  ++ D        +W+KS  S++ IFVDPRDI
Sbjct: 364 SWAEDLWCIKKIGAVDEDRNERCPPSDWMKSRSSTSVIFVDPRDI 407

BLAST of Cp4.1LG03g08870 vs. TrEMBL
Match: I1KVM7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G219000 PE=4 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 7.7e-157
Identity = 260/400 (65.00%), Positives = 321/400 (80.25%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPI 64
           KIA E   +  +E     L++V+EY E+R KV+VEKLER C+VGQ GK S+ TDL+GDPI
Sbjct: 4   KIAAEVYPKSPSETGMEPLVLVKEYEEDRHKVAVEKLERLCEVGQSGKPSLVTDLMGDPI 63

Query: 65  CRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLG 124
           CRIRHF LH MLVAEYG+  ++VGVIRGC+K VT G+S + ++LAYILGLRVS  HRR G
Sbjct: 64  CRIRHFQLHAMLVAEYGEEGEVVGVIRGCVKTVTRGNSVY-VELAYILGLRVSPRHRRFG 123

Query: 125 VGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYK 184
           +GTKLV+HLE+WCKQKG+ YAY+AT+C N+P +N+FT K  Y+KFR+ T+LVQPVHAHYK
Sbjct: 124 IGTKLVEHLEEWCKQKGSKYAYMATDCTNEPSVNLFTKKCGYSKFRTLTILVQPVHAHYK 183

Query: 185 PISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNP 244
           PISS VA++R+PP +   +Y  +FAN+E +P DI+ ILSNKLNLGTFMA+PKK L   +P
Sbjct: 184 PISSNVAVLRLPPRLAGSMYNHMFANSEFYPKDIELILSNKLNLGTFMAIPKKYLSKCDP 243

Query: 245 ETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQ 304
           + GILP S+A+LSVWNTK+VFKLQVKGVS   +ACC+G+R+LD W+PWLRLPSFPDVF  
Sbjct: 244 KRGILPPSYAILSVWNTKDVFKLQVKGVSPWAHACCVGTRLLDEWMPWLRLPSFPDVFRP 303

Query: 305 FGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRF 364
           FGVYFLYGL M G  G +LMK LC FVHNMA+DD GCGA+VAE+GQ+DPVR A+PHWR+F
Sbjct: 304 FGVYFLYGLHMEGKCGAQLMKSLCGFVHNMARDDGGCGAIVAELGQRDPVRDAVPHWRKF 363

Query: 365 SWSEDLWCIKKLRDLEGD--ECNWIKSPPSSAGIFVDPRD 403
           SW+ED+WCIK L D + D  E +W  S  SS  IFVDPRD
Sbjct: 364 SWAEDMWCIKNLEDTKKDIQESDWFTSRSSSPVIFVDPRD 402

BLAST of Cp4.1LG03g08870 vs. TrEMBL
Match: A0A0B2RYH5_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_007087 PE=4 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 7.7e-157
Identity = 260/400 (65.00%), Positives = 321/400 (80.25%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPI 64
           KIA E   +  +E     L++V+EY E+R KV+VEKLER C+VGQ GK S+ TDL+GDPI
Sbjct: 4   KIAAEVYPKSPSETGMEPLVLVKEYEEDRHKVAVEKLERLCEVGQSGKPSLVTDLMGDPI 63

Query: 65  CRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLG 124
           CRIRHF LH MLVAEYG+  ++VGVIRGC+K VT G+S + ++LAYILGLRVS  HRR G
Sbjct: 64  CRIRHFQLHAMLVAEYGEEGEVVGVIRGCVKTVTRGNSVY-VELAYILGLRVSPRHRRFG 123

Query: 125 VGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYK 184
           +GTKLV+HLE+WCKQKG+ YAY+AT+C N+P +N+FT K  Y+KFR+ T+LVQPVHAHYK
Sbjct: 124 IGTKLVEHLEEWCKQKGSKYAYMATDCTNEPSVNLFTKKCGYSKFRTLTILVQPVHAHYK 183

Query: 185 PISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNP 244
           PISS VA++R+PP +   +Y  +FAN+E +P DI+ ILSNKLNLGTFMA+PKK L   +P
Sbjct: 184 PISSNVAVLRLPPRLAGSMYNHMFANSEFYPKDIELILSNKLNLGTFMAIPKKYLSKCDP 243

Query: 245 ETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQ 304
           + GILP S+A+LSVWNTK+VFKLQVKGVS   +ACC+G+R+LD W+PWLRLPSFPDVF  
Sbjct: 244 KRGILPPSYAILSVWNTKDVFKLQVKGVSPWAHACCVGTRLLDEWMPWLRLPSFPDVFRP 303

Query: 305 FGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRF 364
           FGVYFLYGL M G  G +LMK LC FVHNMA+DD GCGA+VAE+GQ+DPVR A+PHWR+F
Sbjct: 304 FGVYFLYGLHMEGKCGAQLMKSLCGFVHNMARDDGGCGAIVAELGQRDPVRDAVPHWRKF 363

Query: 365 SWSEDLWCIKKLRDLEGD--ECNWIKSPPSSAGIFVDPRD 403
           SW+ED+WCIK L D + D  E +W  S  SS  IFVDPRD
Sbjct: 364 SWAEDMWCIKNLEDTKKDIQESDWFTSRSSSPVIFVDPRD 402

BLAST of Cp4.1LG03g08870 vs. TAIR10
Match: AT4G37580.1 (AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 382.9 bits (982), Expect = 2.5e-106
Identity = 200/404 (49.50%), Positives = 267/404 (66.09%), Query Frame = 1

Query: 23  LIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG- 82
           + VVREY   RD V VE +ER+C+VG  GK S+FTDLLGDPICRIRH P +LMLVAE G 
Sbjct: 1   MTVVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGT 60

Query: 83  DARDIVGVIRGCIKHVTTGHS---HH--------PL--DLAYILGLRVSTTHRRLGVGTK 142
           + ++IVG+IRGCIK VT G     +H        PL   LAY+LGLRVS  HRR G+G K
Sbjct: 61  EKKEIVGMIRGCIKTVTCGQKLDLNHKSQNDVVKPLYTKLAYVLGLRVSPFHRRQGIGFK 120

Query: 143 LVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISS 202
           LV+ +E+W +Q GA+Y+YIATE  NQ  +N+FT K  Y++FR+P++LV PV+AH   +S 
Sbjct: 121 LVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYAHRVNVSR 180

Query: 203 GVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLL-----PNWN 262
            V ++++ P     +YR  F+  E FP DID++L+NKL+LGTF+AVP+         +W 
Sbjct: 181 RVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYGSGSGSWP 240

Query: 263 PETGIL---PQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPD 322
                L   P+S+AVLSVWN K+ F L+V+G S+L       +RV+D  +P+L+LPS P 
Sbjct: 241 GSAKFLEYPPESWAVLSVWNCKDSFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPS 300

Query: 323 VFSQFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPH 382
           VF  FG++F+YG+   G    +++K LC   HN+AK   GCG V AEV  +DP+R  IPH
Sbjct: 301 VFEPFGLHFMYGIGGEGPRAVKMVKSLCAHAHNLAKAG-GCGVVAAEVAGEDPLRRGIPH 360

Query: 383 WRRFSWSEDLWCIKKLRD--LEGDECNWIKSPPSSAGIFVDPRD 403
           W+  S  EDLWCIK+L D   +G   +W KSPP    IFVDPR+
Sbjct: 361 WKVLSCDEDLWCIKRLGDDYSDGVVGDWTKSPP-GVSIFVDPRE 402

BLAST of Cp4.1LG03g08870 vs. TAIR10
Match: AT2G23060.1 (AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 376.3 bits (965), Expect = 2.3e-104
Identity = 192/411 (46.72%), Positives = 265/411 (64.48%), Query Frame = 1

Query: 23  LIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG- 82
           L+ VREY   +D  +VE +ER+C+VG  GK S+FTDLLGDPICR+RH P +LMLVAE G 
Sbjct: 4   LVEVREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGP 63

Query: 83  -DARDIVGVIRGCIKHVTTGHSHHPLD-------------------LAYILGLRVSTTHR 142
            + +++VG+IRGCIK VT G +   LD                   LAYILGLRVS THR
Sbjct: 64  KEKKELVGMIRGCIKTVTCGITTKRLDLTHNKSQNDVVITKPLYTKLAYILGLRVSPTHR 123

Query: 143 RLGVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHA 202
           R G+G KLV+ +E W  Q GA+Y+Y ATE  N   +N+FT K  Y +FR+P++LV PV+A
Sbjct: 124 RQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYA 183

Query: 203 HYKPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLL-- 262
           H   IS  V ++++ P     +YR  F+  E FP DID++L+NKL+LGTF+AVP+     
Sbjct: 184 HRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYG 243

Query: 263 ---PNWNPETGIL---PQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWL 322
               +W      L   P S+AVLSVWN K+ F+L+V+G S+L       +R++D  +P+L
Sbjct: 244 SGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFL 303

Query: 323 RLPSFPDVFSQFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDP 382
           ++PS P VF  FG++F+YG+   G    +++K LC   HN+AK+  GCG V AEV  ++P
Sbjct: 304 KIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEG-GCGVVAAEVAGEEP 363

Query: 383 VRAAIPHWRRFSWSEDLWCIKKLRD--LEGDECNWIKSPPSSAGIFVDPRD 403
           +R  IPHW+  S +EDLWCIK+L +   +G   +W KSPP  + IFVDPR+
Sbjct: 364 LRRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDS-IFVDPRE 412

BLAST of Cp4.1LG03g08870 vs. TAIR10
Match: AT5G67430.1 (AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 332.8 bits (852), Expect = 2.9e-91
Identity = 176/396 (44.44%), Positives = 248/396 (62.63%), Query Frame = 1

Query: 22  NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG 81
           N++VVREY  +RD  SVE+LE  C+VG     S+  DL+GDP+ RIR  P   MLVAE G
Sbjct: 6   NVVVVREYDPKRDLTSVEELEESCEVG-----SLLVDLMGDPLARIRQSPSFHMLVAEIG 65

Query: 82  DARDIVGVIRGCIKHVTTGHSH-----------HPLDLAYILGLRVSTTHRRLGVGTKLV 141
           +  +IVG+IRG IK VT G +            +   LA++ GLRVS  +RR+G+G KLV
Sbjct: 66  N--EIVGMIRGTIKMVTRGVNALRQADDVSPEINTTKLAFVSGLRVSPFYRRMGIGLKLV 125

Query: 142 QHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISSGV 201
           Q LE+W  +  A Y+Y+ TE  N   + +FT K  Y+KFR+PT LV PV  H   +S  V
Sbjct: 126 QRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNHRVTVSRRV 185

Query: 202 AIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILP 261
            I+++ P     +YR  F+  E FP DI++IL+NKL+LGT++AVP+      +  +G LP
Sbjct: 186 KIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPR----GGDNVSGSLP 245

Query: 262 Q---SFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGV 321
               S+AV+S+WN+K+V++LQVKG S+L       +RV D   P+L++PSFP++F  F +
Sbjct: 246 DQTGSWAVISIWNSKDVYRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSFPNLFKSFAM 305

Query: 322 YFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRFSWS 381
           +F+YG+   G     +++ LC   HN+A+   GC  V AEV   +P+R  IPHW+  S  
Sbjct: 306 HFMYGIGGEGPRAAEMVEALCSHAHNLARKS-GCAVVAAEVASCEPLRVGIPHWKVLS-P 365

Query: 382 EDLWCIKKLRDLEGDECNWIKSPPSSAGIFVDPRDI 404
           EDLWC+K+LR  + D  +W KSPP    IFVDPR+I
Sbjct: 366 EDLWCLKRLR-YDDDGVDWTKSPP-GLSIFVDPREI 386

BLAST of Cp4.1LG03g08870 vs. TAIR10
Match: AT2G30090.1 (AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein)

HSP 1 Score: 254.2 bits (648), Expect = 1.3e-67
Identity = 141/395 (35.70%), Positives = 227/395 (57.47%), Query Frame = 1

Query: 17  EQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLML 76
           E+  +  +V+R Y + RD++ + ++E+ C++G   +  +FTD LGDPICRIR+ P  +ML
Sbjct: 6   EEEVDEEVVIRCYDDRRDRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIML 65

Query: 77  VAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLGVGTKLVQHLEKW 136
           VA  G+   +VG I+G +K V        + + Y+LGLRV  ++RR G+G+ LV+ LE+W
Sbjct: 66  VAGVGNK--LVGSIQGSVKPVE--FHDKSVRVGYVLGLRVVPSYRRRGIGSILVRKLEEW 125

Query: 137 CKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVH-AHYKPISSGVAIVRI 196
            +   ADYAY+ATE  N+    +F  +  Y  FR+P +LV PV+      + S + I ++
Sbjct: 126 FESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNPGRGLKLPSDIGIRKL 185

Query: 197 PPHVVVKIYRQ-LFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILPQSFA 256
                  +YR+ + A  E FP DI+ IL NKL++GT++A    +            +S+A
Sbjct: 186 KVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNNVDNT---------RSWA 245

Query: 257 VLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGVYFLYGLS 316
           +LSVW++ +VFKL+++            S++  +++  L L   PD+F+ FG YFLYG+ 
Sbjct: 246 MLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLGLTVLPDLFTPFGFYFLYGVH 305

Query: 317 MRGSDGPRLMKFLCRFVHNMA--KDDVGCGAVVAEVGQ----QDPVRAAIPHWRRFSWSE 376
             G    +L++ LC  VHNMA   D   C  VV EV +     D ++  IPHW+  S  +
Sbjct: 306 SEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGSNGDDSLQRCIPHWKMLSCDD 365

Query: 377 DLWCIKKLRDLEGDECNWIKSPPSSAGIFVDPRDI 404
           D+WCIK L+  E ++ +  +   S + +FVDPR++
Sbjct: 366 DMWCIKPLK-CEKNKFDLSERSKSRSSLFVDPREV 386

BLAST of Cp4.1LG03g08870 vs. NCBI nr
Match: gi|449439765|ref|XP_004137656.1| (PREDICTED: probable N-acetyltransferase HLS1 [Cucumis sativus])

HSP 1 Score: 730.7 bits (1885), Expect = 1.4e-207
Identity = 343/406 (84.48%), Positives = 371/406 (91.38%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSN-NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDP 64
           KIADEYRLQ+ +    N NL+VVREYCEERDKVSVEK+ERQCDVGQKGK SIFTDLLGDP
Sbjct: 4   KIADEYRLQVESNTEENRNLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDP 63

Query: 65  ICRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRL 124
           ICR+RHFP H+MLVAEYG AR+IVGVIRGCIKHVTTGHSHH L LAYILGLRVSTTHRRL
Sbjct: 64  ICRVRHFPSHVMLVAEYGKAREIVGVIRGCIKHVTTGHSHHVLKLAYILGLRVSTTHRRL 123

Query: 125 GVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHY 184
           GVGTKLVQH+E+WCKQKGADYAYIAT+CANQP I++FT KF+YTKFRSPTVLVQPVHAHY
Sbjct: 124 GVGTKLVQHIEEWCKQKGADYAYIATDCANQPSISLFTQKFAYTKFRSPTVLVQPVHAHY 183

Query: 185 KPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWN 244
           KPI SG++IVR+PPHV VKIYR LFANAE F  DIDAIL NKLNLGTFMAVPKKLLP W+
Sbjct: 184 KPIGSGISIVRVPPHVAVKIYRHLFANAEFFAEDIDAILFNKLNLGTFMAVPKKLLPKWD 243

Query: 245 PETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFS 304
           PETGILPQSFAVLSVWNTKEVFKLQVKG+SKLTYACCMGSR+LDSW+PWLR+PSFPDVFS
Sbjct: 244 PETGILPQSFAVLSVWNTKEVFKLQVKGMSKLTYACCMGSRLLDSWLPWLRVPSFPDVFS 303

Query: 305 QFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRR 364
           QFGVYFLYGL+MRG++G RLMK LC FVHNMAKDDVGCGA+V EVGQQDPVR AIPHW+R
Sbjct: 304 QFGVYFLYGLTMRGTNGQRLMKSLCTFVHNMAKDDVGCGALVTEVGQQDPVRVAIPHWKR 363

Query: 365 FSWSEDLWCIKKLRDLEGDE------CNWIKSPPSSAGIFVDPRDI 404
            SW+EDLWCIKKL DLEGD       C+WIKSPPSSAGIFVDPRDI
Sbjct: 364 LSWNEDLWCIKKLTDLEGDNYEGSKTCDWIKSPPSSAGIFVDPRDI 409

BLAST of Cp4.1LG03g08870 vs. NCBI nr
Match: gi|659130398|ref|XP_008465148.1| (PREDICTED: probable N-acetyltransferase HLS1 [Cucumis melo])

HSP 1 Score: 727.2 bits (1876), Expect = 1.5e-206
Identity = 345/406 (84.98%), Positives = 368/406 (90.64%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSN-NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDP 64
           KIADEYRL + +    N NL+VVREYCEERDK SVEK+ERQCDVGQKGK SIFTDLLGDP
Sbjct: 29  KIADEYRLHVESNTEENRNLVVVREYCEERDKASVEKMERQCDVGQKGKPSIFTDLLGDP 88

Query: 65  ICRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRL 124
           ICR+RHFP H+MLVAEYG AR+IVGVIRGCIKHVTTGHSHH L LAYILGLRVSTTHRRL
Sbjct: 89  ICRVRHFPSHVMLVAEYGKAREIVGVIRGCIKHVTTGHSHHVLKLAYILGLRVSTTHRRL 148

Query: 125 GVGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHY 184
           GVGTKLVQHLE+WCKQKGADYAYIAT+CANQP I++FT KFSYTKFRSPTVLVQPVHAHY
Sbjct: 149 GVGTKLVQHLEEWCKQKGADYAYIATDCANQPSISLFTEKFSYTKFRSPTVLVQPVHAHY 208

Query: 185 KPISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWN 244
           KPI SG+AIVRIPPHV VKIYR LFANAE F  DIDAIL NKLNLGTFMA+PKKLLP W+
Sbjct: 209 KPIGSGIAIVRIPPHVAVKIYRYLFANAEFFAEDIDAILFNKLNLGTFMALPKKLLPKWD 268

Query: 245 PETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFS 304
           PETGILPQSFAVLSVWNTKEVFKLQVKG+SKLTYACCMGSR+LDSW+PWLR+PSFPDVFS
Sbjct: 269 PETGILPQSFAVLSVWNTKEVFKLQVKGMSKLTYACCMGSRLLDSWLPWLRVPSFPDVFS 328

Query: 305 QFGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRR 364
           QFGVYFLYGL+MRG++G RLMK LC FVHNMAKDDVGCGAVV EVGQQDPVR AIPHWRR
Sbjct: 329 QFGVYFLYGLTMRGTNGQRLMKSLCTFVHNMAKDDVGCGAVVTEVGQQDPVRVAIPHWRR 388

Query: 365 FSWSEDLWCIKKLRDLEGDEC------NWIKSPPSSAGIFVDPRDI 404
            SW+EDLWCIKKL DLEGD        +WIKSPPSSAGIFVDPRDI
Sbjct: 389 LSWNEDLWCIKKLTDLEGDNYEGSKTRDWIKSPPSSAGIFVDPRDI 434

BLAST of Cp4.1LG03g08870 vs. NCBI nr
Match: gi|566147028|ref|XP_002299247.2| (hypothetical protein POPTR_0001s03570g [Populus trichocarpa])

HSP 1 Score: 574.7 bits (1480), Expect = 1.3e-160
Identity = 267/389 (68.64%), Positives = 318/389 (81.75%), Query Frame = 1

Query: 22  NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG 81
           N +VVREY E RDKV+VE++ER C+VGQ+GK S+ TDL+GDPICR+R FP H+MLVAE G
Sbjct: 20  NFVVVREYDEGRDKVAVEEMERSCEVGQRGKHSLVTDLMGDPICRVRRFPSHVMLVAECG 79

Query: 82  DARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLGVGTKLVQHLEKWCKQKG 141
           D  +IVGVIR C+  V T  S   + LAYILGLRVS +HRRLG+GTKLVQ +E+WCKQKG
Sbjct: 80  DGGEIVGVIRACVNTVRTRESSGYVKLAYILGLRVSPSHRRLGIGTKLVQEIEEWCKQKG 139

Query: 142 ADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISSGVAIVRIPPHVVV 201
           A+Y+Y+AT+C+N+P IN+FT K  YTKFR+ T+LVQPVHAHYKP+ SG+AI+++PP +  
Sbjct: 140 AEYSYMATDCSNEPSINLFTRKCFYTKFRTLTMLVQPVHAHYKPLGSGIAIIQLPPKLAE 199

Query: 202 KIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILPQSFAVLSVWNT 261
            IY ++FA+AE FP DI  ILS+KLNLGTFMAVPKK LP W+P+TGILP SFA+LSVWNT
Sbjct: 200 AIYCRVFADAEFFPKDIGTILSSKLNLGTFMAVPKKALPKWDPKTGILPSSFALLSVWNT 259

Query: 262 KEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGVYFLYGLSMRGSDGP 321
           KEVFKLQVKGVSKLTYACC G+R+LD+W+PWLRLPSFPDVF QFGVYFLYGL M G +  
Sbjct: 260 KEVFKLQVKGVSKLTYACCTGTRLLDAWMPWLRLPSFPDVFRQFGVYFLYGLHMEGKNAS 319

Query: 322 RLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRFSWSEDLWCIKKLRDLEG 381
           RLMK LC F HNMA+DD GCGAVVAEV Q+DPVR  IPHWRRFSW+EDLWCIKKL D + 
Sbjct: 320 RLMKALCAFAHNMARDDDGCGAVVAEVAQRDPVREVIPHWRRFSWAEDLWCIKKLADEKL 379

Query: 382 D-------ECNWIKSPPSSAGIFVDPRDI 404
           D       + +W+K   SS  IFVDPRDI
Sbjct: 380 DDVDRRCGQSDWMKHGSSSPVIFVDPRDI 408

BLAST of Cp4.1LG03g08870 vs. NCBI nr
Match: gi|567870013|ref|XP_006427628.1| (hypothetical protein CICLE_v10025742mg [Citrus clementina])

HSP 1 Score: 570.5 bits (1469), Expect = 2.4e-159
Identity = 262/405 (64.69%), Positives = 328/405 (80.99%), Query Frame = 1

Query: 5   KIADEYRLQIITEQNSNNLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPI 64
           KIA E   +   +   N++++VREY EERDK+ VE++ER+C+ GQ+GK ++ TDL+GDP+
Sbjct: 4   KIAAENSPEFPMKTKVNSVVIVREYNEERDKLGVEEIERRCETGQRGKPTLVTDLMGDPV 63

Query: 65  CRIRHFPLHLMLVAEYGDARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLG 124
           CR+RHFP H+ LVAEYG+ ++IVGVIRGC+K VTTG S+  + LAY+LGLRVS THRRLG
Sbjct: 64  CRVRHFPSHIALVAEYGEEKEIVGVIRGCVKTVTTGGSNF-VKLAYLLGLRVSPTHRRLG 123

Query: 125 VGTKLVQHLEKWCKQKGADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYK 184
           +GTKLVQ LE+WCKQ+GA+Y+Y+AT+C N+  IN+FT K SYTKFR+PT+LVQPVHAHYK
Sbjct: 124 IGTKLVQKLEEWCKQQGAEYSYMATDCGNEASINLFTRKCSYTKFRTPTMLVQPVHAHYK 183

Query: 185 PISSGVAIVRIPPHVVVKIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNP 244
           P+ +G++IVR+P      IYR++FAN+E FP DID ILS+ LNLGTFMAVPKK +P W+P
Sbjct: 184 PVGAGISIVRLPRKSAETIYRRVFANSEFFPKDIDLILSSNLNLGTFMAVPKKFVPRWDP 243

Query: 245 ETGILPQSFAVLSVWNTKEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQ 304
           +TGILP SFA+LSVWNTKEVFKLQ+KGVS L YA C+GSR+LD+W+PWLRLPSFPDVF Q
Sbjct: 244 KTGILPPSFAILSVWNTKEVFKLQLKGVSALKYAFCVGSRLLDAWMPWLRLPSFPDVFRQ 303

Query: 305 FGVYFLYGLSMRGSDGPRLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRF 364
           FGVYFLYGL M G     LMK LC F HNMA+DD  CGA+VAEVG +DPVR  IPHWR+F
Sbjct: 304 FGVYFLYGLHMEGKHASLLMKSLCAFAHNMARDDGECGALVAEVGAKDPVRETIPHWRKF 363

Query: 365 SWSEDLWCIKKLRDLEGDE------CNWIKSPPSSAGIFVDPRDI 404
           SW+EDLWCIKK+  ++ D        +W+KS  S++ IFVDPRDI
Sbjct: 364 SWAEDLWCIKKIGAVDEDRNERCPPSDWMKSRSSTSVIFVDPRDI 407

BLAST of Cp4.1LG03g08870 vs. NCBI nr
Match: gi|743943983|ref|XP_011016503.1| (PREDICTED: probable N-acetyltransferase HLS1 [Populus euphratica])

HSP 1 Score: 568.9 bits (1465), Expect = 6.9e-159
Identity = 265/388 (68.30%), Positives = 318/388 (81.96%), Query Frame = 1

Query: 22  NLIVVREYCEERDKVSVEKLERQCDVGQKGKASIFTDLLGDPICRIRHFPLHLMLVAEYG 81
           N +VVREY E RDKV+VE++ER+C+VGQ+GK S+ TDL+GDPICR+RHFP H+MLVAE G
Sbjct: 20  NFVVVREYDEGRDKVAVEEMERRCEVGQRGKHSLVTDLMGDPICRVRHFPSHVMLVAECG 79

Query: 82  DARDIVGVIRGCIKHVTTGHSHHPLDLAYILGLRVSTTHRRLGVGTKLVQHLEKWCKQKG 141
           D  +IVGVIR C+  VTT  S   + LAYILGLRVS +HRRLG+GTKLVQ +E+WCKQKG
Sbjct: 80  DGGEIVGVIRACVNTVTTRESSGSVKLAYILGLRVSPSHRRLGIGTKLVQEIEEWCKQKG 139

Query: 142 ADYAYIATECANQPCINMFTHKFSYTKFRSPTVLVQPVHAHYKPISSGVAIVRIPPHVVV 201
           A+Y+Y+AT+ +N+P IN+FT K  YTKFR+ T+LVQPVHAHYKP+ SG+AI+++PP +  
Sbjct: 140 AEYSYMATDFSNEPSINLFTRKCFYTKFRTLTMLVQPVHAHYKPLGSGIAIIQLPPKLAE 199

Query: 202 KIYRQLFANAELFPMDIDAILSNKLNLGTFMAVPKKLLPNWNPETGILPQSFAVLSVWNT 261
            IY ++FA+AE FP DI  ILS+KLNLG+FMAVPKK LP W+P TGILP SFA+LSVWNT
Sbjct: 200 AIYYRVFADAEFFPKDIGTILSSKLNLGSFMAVPKKALPKWDPTTGILPSSFALLSVWNT 259

Query: 262 KEVFKLQVKGVSKLTYACCMGSRVLDSWVPWLRLPSFPDVFSQFGVYFLYGLSMRGSDGP 321
           KEVFKLQVKGVSKLTYACC G+R+LD+W+PWLRLPSFPDVF QFGVYFLYGL M G +  
Sbjct: 260 KEVFKLQVKGVSKLTYACCTGTRLLDAWMPWLRLPSFPDVFRQFGVYFLYGLHMEGKNAS 319

Query: 322 RLMKFLCRFVHNMAKDDVGCGAVVAEVGQQDPVRAAIPHWRRFSWSEDLWCIKKLRDLEG 381
           RLMK LC F HNMA+DD GCGAVVAE+ Q+DPVR  IPHWRRFSW+EDLWCIKKL D + 
Sbjct: 320 RLMKALCAFAHNMARDDDGCGAVVAELAQRDPVREVIPHWRRFSWAEDLWCIKKLADEKL 379

Query: 382 D------ECNWIKSPPSSAGIFVDPRDI 404
           D      + + +K   SS  IFVDPRDI
Sbjct: 380 DVDRRCGQSDRMKHGSSSPVIFVDPRDI 407
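
The percentages in each HSP summary line are the matched-column count divided by the alignment length. As a minimal sketch (the function names `parse_hsp_stats` and `percent` are hypothetical and not part of any BLAST tool), the summary line can be parsed and the reported percentages re-derived like this:

```python
import re

# Matches the summary line printed under each "HSP 1 Score" header.
# The label before the second ratio is matched loosely (assumption) so that
# spelling variants of the label are tolerated.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), \w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


line = "Identity = 265/388 (68.30%), Positives = 318/388 (81.96%), Query Frame = 1"
stats = parse_hsp_stats(line)
# Recompute the percentages from the raw counts and compare with the report.
print(percent(stats["identities"], stats["length"]))
print(percent(stats["positives"], stats["length"]))
```

The same check applies to the other two HSPs shown above (267/389 and 262/405 identities).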

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
HLS1_ARATH	4.4e-105	49.50	Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana GN=HLS1 PE=1 SV=1
HLS1L_ARATH	4.1e-103	46.72	Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana GN=At2g23060 PE=2...
Match Name	E-value	Identity	Description
A0A0A0L9Z5_CUCSA	9.6e-208	84.48	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G710850 PE=4 SV=1
B9GMA0_POPTR	8.8e-161	68.64	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03570g PE=4 SV=2
V4S0I4_9ROSI	1.7e-159	64.69	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025742mg PE=4 SV=1
I1KVM7_SOYBN	7.7e-157	65.00	Uncharacterized protein OS=Glycine max GN=GLYMA_08G219000 PE=4 SV=1
A0A0B2RYH5_GLYSO	7.7e-157	65.00	Uncharacterized protein OS=Glycine soja GN=glysoja_007087 PE=4 SV=1
Match Name	E-value	Identity	Description
AT4G37580.1	2.5e-106	49.50	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1	2.3e-104	46.72	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1	2.9e-91	44.44	Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G30090.1	1.3e-67	35.70	Acyl-CoA N-acyltransferases (NAT) superfamily protein
Match Name	E-value	Identity	Description
gi|449439765|ref|XP_004137656.1|	1.4e-207	84.48	PREDICTED: probable N-acetyltransferase HLS1 [Cucumis sativus]
gi|659130398|ref|XP_008465148.1|	1.5e-206	84.98	PREDICTED: probable N-acetyltransferase HLS1 [Cucumis melo]
gi|566147028|ref|XP_002299247.2|	1.3e-160	68.64	hypothetical protein POPTR_0001s03570g [Populus trichocarpa]
gi|567870013|ref|XP_006427628.1|	2.4e-159	64.69	hypothetical protein CICLE_v10025742mg [Citrus clementina]
gi|743943983|ref|XP_011016503.1|	6.9e-159	68.30	PREDICTED: probable N-acetyltransferase HLS1 [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008080	N-acetyltransferase activity
Vocabulary: INTERPRO
Term	Definition
IPR016181	Acyl_CoA_acyltransferase
IPR000182	GNAT_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0042967	acyl-carrier-protein biosynthetic process
biological_process	GO:0006473	protein acetylation
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0008080	N-acetyltransferase activity
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g08870.1	Cp4.1LG03g08870.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000182	GNAT domain	PFAM	PF00583	Acetyltransf_1	coord: 87..165; score: 7.8
IPR000182	GNAT domain	PROFILE	PS51186	GNAT	coord: 24..210; score: 13
IPR016181	Acyl-CoA N-acyltransferase	GENE3D	G3DSA:3.40.630.30	-	coord: 25..160; score: 3.9
IPR016181	Acyl-CoA N-acyltransferase	unknown	SSF55729	Acyl-CoA N-acyltransferases (Nat)	coord: 67..168; score: 1.69
None	No IPR available	PANTHER	PTHR23091	N-TERMINAL ACETYLTRANSFERASE	coord: 1..219; score: 2.6E
None	No IPR available	PANTHER	PTHR23091:SF195	SUBFAMILY NOT NAMED	coord: 1..219; score: 2.6E
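
The `coord: start..end` entries above give the span of each domain hit. As a rough sketch, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this), the length of each hit can be computed as follows; `span_length` is a hypothetical helper name:

```python
import re

# Pattern for the "coord: start..end" fragment in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return the number of residues covered by a 'coord: a..b' entry.

    Assumes 1-based inclusive coordinates, so the length is b - a + 1.
    """
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord entry found in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Coordinates copied from the InterPro annotation rows above.
hits = {
    "PF00583": "coord: 87..165",
    "PS51186": "coord: 24..210",
    "G3DSA:3.40.630.30": "coord: 25..160",
    "SSF55729": "coord: 67..168",
    "PTHR23091": "coord: 1..219",
}
for source_term, coord in hits.items():
    print(source_term, span_length(coord))
```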