Cp4.1LG14g06330 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG14g06330
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHXXXD-type acyl-transferase family protein
LocationCp4.1LG14 : 517412 .. 520438 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACTTTCAATGTGTACATTAATATGTCACAACTGCACCCATGATCCTCATAAATGCACTGCCAACCATATTAATCACTTTGCTTTTGTTTTGTTTCCTTATAAAATCTAACATCTTTGGCCTTCTCTCCATACCCACACCTCTAATTTTTCTTTTTCATCCTCAACCCATTTCGCTGCCTCTGCTTTTTGTTCCATATCCATGGCGGGCGATCTCCCTGAAGTTGTGAAGAAAGAGGAGACAGGAGGATTGAAGGTGAAAATCACAGGCAAAACCCATGTCAAACCTGTCAAGAAACTCGACAGAAAAGAGTGCCAATTGGTCACATTTGACCTCCCTTATCTCGCCTTCTACTACAACCAAAAGCTCCTCCTCTACGGCGACAATGGCGGCGAGGTTCGCTTCCCTGAGATGGTGGAGAAGCTCAAGGATGGCCTCGAGACGGTTCTCGAGCCGTTTCATCAACTTGCTGGGAGGCTAGGCAAGGACAAGGATGGCGTTTTTAGAGTTGAGTATGACAATGATATGGAAGGTGTCGAAGTGGCCGAAGCTGTCGCCGACGATGTCTGCATAGCCGATTTGGTGGCGGAGGAAGGCACCGCCACTCTCAAGGAGCTTATTCCTTACAACGGGATACTCAACTTGGAAGGTCTCCAGAGGCCATTGATGGCTGTTCAGGTCAGAATTAAATTACTTCCATGCAGCTACTAAAACGTTTTTTTTTTTTTATAAAATAAGTGGGCCAGGGCCCACTTGTTCAACGATCCAAGCCCAGAGAAAAATAGTTGTAATTGGGCCATGGATCAGTCTCTAACGGCCCATTAACACCACATTCTGTTTTCGTCCAGCTAACCAAGCTTAAAGACGGAGTCGCGATGGGGTGCGCGTTCAATCACGCGGTACTTGATGGAACTGCCACGTGGCATTTCATGAGTTCATGGGCTGAGATCTGCCGTGGGGCCCAAGACATATCCGTCGCGCCATTTCTCGAGCGTACCAAAGCGCGCAACACGCGGGTGAAACTCGACATAACGCTTCCGCCGGAACCCGCCGCTGCCAACGGCGACTCGAAGGAAGCGGCGGCGGCGGCGCCACCGCTAAGGGAGAAGGTCTTCAAATTTACAGAAGCAGCCATCGACAAGATCAAGACAAAAGTGAACTCAACAAACCCTCCATCGGACGGCTCAAATCCATTCTCCACTTTCCAATCGCTCTCCGTCCATATATGGCGCCACGTCACCCAAGCCCGAAACCTAAAGCCAGAGGACATCACCGTGTTCACCGTTTTCGCCGACTGCCGCAAACGCGTCGATCCGCCAATGCCGGAGAGCTACTTCGGAAACCTAATCCAAGCCATATTCACCGGCACCGCCGCCGGGCTACTACTAATGAATCCGCCGGAATTCGGCGCCGGCGTGATCCAAAAGGCAATCGTTTCACACAATACGGCCGCGATCGATCAAAGAAACAAAGAATGGGAGAGCGCGCCGAAGATCTTCCAGTTCAAGGACGCGGGAATGAACTGCGTGGCGGTGGGGAGTTCGCCGAGATTTAAAGTGTACGATGTGGATTTCGGATGGGGAAAACCGGAGAGCGTGAGAAGTGGATGTAACAACAGATTCGACGGAATGATGTATCTGTACCAAGGGAAGGACGGAGGGAGGAGCATTGATGTGGAGATAAGCTTGGAGGAAGAAGCTATGGCGAGGCTGGAGAAGGACAAGGAGTTTCTCATGGAGAATTAGGGTTTCGCCGTGGTTGAATAAGTGGAAGACAAAATGGCAGTGGATGAAAATGGTGGGATTTTTAATTTCTTGGGGAAATTGCTTTTATATTGTTATTATGGGACTTTTTTTACTGCAGGCGGGGGGATGAAATCAAACTAATTATCCTCTTCTTTTTCTTCTTTTCTTATTGTGCTCGTTTAACCTTGTAATATCTATGCATATTTTTAATTATGGCATATTTGTTATTAAACTTTTGGATTATATATATATATATAGTCTTTCAAACGGCACGCCTTTCGAATAGGTGTGCTCCTGCCTCTTCCCTCTTTAGGGCTCATGTTTAAATCTGAATAGGTACAATCCAGTTCATACGATTACTCAGGGATGAACCCAATCTGAAATATGAACGAAAAGGAAAAGATTCATTGAACCGATCATGAGAATACCTGTTACAATACCTATCAGCCAAAAAGTAATCCTTCAAGTAGTCTCGATCATTTACTCTATTTCATCATGACCAGGTGAATAAGCGAGTTAAGTTCCAAGACACCCATAGAATCATTTCTCTCCTATGAGCTTCAAACGCTAACACATAATTTTCTAAATCGACCACCTAAAAGAAATACCCCTCCATCCAAAATCGATTCTAATAACTTTATTTTTCAGCTCCAGGACTTGGAGGTCCAGGAGCTGAGGCAGTCGACTCCTCCTGTTACAAGTTCATACATGTTAGAGTGATAGCGCATTTTTAAAACAAAGCTTATAAACACTAGGTTCATCCAAAAAGCTACCAAATACGACCAAAGTGATTGGTTTTAGTAATCTTAAGGTACTCTTCAGTGGTGTATGCTTTGCTTGCTGATGAAGAATTATCAGCTGTTTATATACATAAGAAAACAAGCTTTCATCATCGTATAATAGTCGAGGCTGACAGAATGATGATTCTATATCCCTAATGATTGAACCCAATCGTCTCATTCGTAGACACGATGCCCGACGACGAACCAAGGCTGATTACCAATATTCTTTTCCCCTTTGGCATTCTACTTTTTCTATGTAGCTTTGAAATCTTTCCCTAGAATACTACAAGTGCTGATAATAAATAAAAATATGGATGACAAATAAGTTAACCACTCACTTGGTTCAGCCATTGCAAAAGACTCTTCTGCTTTAGGAACAAAAGGTGGGTTTGAGAAGGTATCTGATTTGCTTGAATTGTATGAGCTTTCGACTGGACTCGAGATTTCATCGAGACCAATTTCTCGCTCAAGTGTGGTCTTGAATTCTCGAGAAACGTCCTA

mRNA sequence

AACTTTCAATGTGTACATTAATATGTCACAACTGCACCCATGATCCTCATAAATGCACTGCCAACCATATTAATCACTTTGCTTTTGTTTTGTTTCCTTATAAAATCTAACATCTTTGGCCTTCTCTCCATACCCACACCTCTAATTTTTCTTTTTCATCCTCAACCCATTTCGCTGCCTCTGCTTTTTGTTCCATATCCATGGCGGGCGATCTCCCTGAAGTTGTGAAGAAAGAGGAGACAGGAGGATTGAAGGTGAAAATCACAGGCAAAACCCATGTCAAACCTGTCAAGAAACTCGACAGAAAAGAGTGCCAATTGGTCACATTTGACCTCCCTTATCTCGCCTTCTACTACAACCAAAAGCTCCTCCTCTACGGCGACAATGGCGGCGAGGTTCGCTTCCCTGAGATGGTGGAGAAGCTCAAGGATGGCCTCGAGACGGTTCTCGAGCCGTTTCATCAACTTGCTGGGAGGCTAGGCAAGGACAAGGATGGCGTTTTTAGAGTTGAGTATGACAATGATATGGAAGGTGTCGAAGTGGCCGAAGCTGTCGCCGACGATGTCTGCATAGCCGATTTGGTGGCGGAGGAAGGCACCGCCACTCTCAAGGAGCTTATTCCTTACAACGGGATACTCAACTTGGAAGGTCTCCAGAGGCCATTGATGGCTGTTCAGCTAACCAAGCTTAAAGACGGAGTCGCGATGGGGTGCGCGTTCAATCACGCGGTACTTGATGGAACTGCCACGTGGCATTTCATGAGTTCATGGGCTGAGATCTGCCGTGGGGCCCAAGACATATCCGTCGCGCCATTTCTCGAGCGTACCAAAGCGCGCAACACGCGGGTGAAACTCGACATAACGCTTCCGCCGGAACCCGCCGCTGCCAACGGCGACTCGAAGGAAGCGGCGGCGGCGGCGCCACCGCTAAGGGAGAAGGTCTTCAAATTTACAGAAGCAGCCATCGACAAGATCAAGACAAAAGTGAACTCAACAAACCCTCCATCGGACGGCTCAAATCCATTCTCCACTTTCCAATCGCTCTCCGTCCATATATGGCGCCACGTCACCCAAGCCCGAAACCTAAAGCCAGAGGACATCACCGTGTTCACCGTTTTCGCCGACTGCCGCAAACGCGTCGATCCGCCAATGCCGGAGAGCTACTTCGGAAACCTAATCCAAGCCATATTCACCGGCACCGCCGCCGGGCTACTACTAATGAATCCGCCGGAATTCGGCGCCGGCGTGATCCAAAAGGCAATCGTTTCACACAATACGGCCGCGATCGATCAAAGAAACAAAGAATGGGAGAGCGCGCCGAAGATCTTCCAGTTCAAGGACGCGGGAATGAACTGCGTGGCGGTGGGGAGTTCGCCGAGATTTAAAGTGTACGATGTGGATTTCGGATGGGGAAAACCGGAGAGCGTGAGAAGTGGATGTAACAACAGATTCGACGGAATGATGTATCTGTACCAAGGGAAGGACGGAGGGAGGAGCATTGATGTGGAGATAAGCTTGGAGGAAGAAGCTATGGCGAGGCTGGAGAAGGACAAGGAGTTTCTCATGGAGAATTAGGGTTTCGCCGTGGTTGAATAAGTGGAAGACAAAATGGCAGTGGATGAAAATGGTGGGATTTTTAATTTCTTGGGGAAATTGCTTTTATATTGTTATTATGGGACTTTTTTTACTGCAGGCGGGGGGATGAAATCAAACTAATTATCCTCTTCTTTTTCTTCTTTTCTTATTGTGCTCGTTTAACCTTGTAATATCTATGCATATTTTTAATTATGGCATATTTGTTATTAAACTTTTGGATTATATATATATATATAGTCTTTCAAACGGCACGCCTTTCGAATAGGTGTGCTCCTGCCTCTTCCCTCTTTAGGGCTCATGTTTAAATCTGAATAGGTACAATCCAGTTCATACGATTACTCAGGGATGAACCCAATCTGAAATATGAACGAAAAGGAAAAGATTCATTGAACCGATCATGAGAATACCTGTTACAATACCTATCAGCCAAAAAGTAATCCTTCAAGTAGTCTCGATCATTTACTCTATTTCATCATGACCAGGTGAATAAGCGAGTTAAGTTCCAAGACACCCATAGAATCATTTCTCTCCTATGAGCTTCAAACGCTAACACATAATTTTCTAAATCGACCACCTAAAAGAAATACCCCTCCATCCAAAATCGATTCTAATAACTTTATTTTTCAGCTCCAGGACTTGGAGGTCCAGGAGCTGAGGCAGTCGACTCCTCCTGTTACAAGTTCATACATGTTAGAGTGATAGCGCATTTTTAAAACAAAGCTTATAAACACTAGGTTCATCCAAAAAGCTACCAAATACGACCAAAGTGATTGGTTTTAGTAATCTTAAGGTACTCTTCAGTGGTGTATGCTTTGCTTGCTGATGAAGAATTATCAGCTGTTTATATACATAAGAAAACAAGCTTTCATCATCGTATAATAGTCGAGGCTGACAGAATGATGATTCTATATCCCTAATGATTGAACCCAATCGTCTCATTCGTAGACACGATGCCCGACGACGAACCAAGGCTGATTACCAATATTCTTTTCCCCTTTGGCATTCTACTTTTTCTATGTAGCTTTGAAATCTTTCCCTAGAATACTACAAGTGCTGATAATAAATAAAAATATGGATGACAAATAAGTTAACCACTCACTTGGTTCAGCCATTGCAAAAGACTCTTCTGCTTTAGGAACAAAAGGTGGGTTTGAGAAGGTATCTGATTTGCTTGAATTGTATGAGCTTTCGACTGGACTCGAGATTTCATCGAGACCAATTTCTCGCTCAAGTGTGGTCTTGAATTCTCGAGAAACGTCCTA

Coding sequence (CDS)

ATGGCGGGCGATCTCCCTGAAGTTGTGAAGAAAGAGGAGACAGGAGGATTGAAGGTGAAAATCACAGGCAAAACCCATGTCAAACCTGTCAAGAAACTCGACAGAAAAGAGTGCCAATTGGTCACATTTGACCTCCCTTATCTCGCCTTCTACTACAACCAAAAGCTCCTCCTCTACGGCGACAATGGCGGCGAGGTTCGCTTCCCTGAGATGGTGGAGAAGCTCAAGGATGGCCTCGAGACGGTTCTCGAGCCGTTTCATCAACTTGCTGGGAGGCTAGGCAAGGACAAGGATGGCGTTTTTAGAGTTGAGTATGACAATGATATGGAAGGTGTCGAAGTGGCCGAAGCTGTCGCCGACGATGTCTGCATAGCCGATTTGGTGGCGGAGGAAGGCACCGCCACTCTCAAGGAGCTTATTCCTTACAACGGGATACTCAACTTGGAAGGTCTCCAGAGGCCATTGATGGCTGTTCAGCTAACCAAGCTTAAAGACGGAGTCGCGATGGGGTGCGCGTTCAATCACGCGGTACTTGATGGAACTGCCACGTGGCATTTCATGAGTTCATGGGCTGAGATCTGCCGTGGGGCCCAAGACATATCCGTCGCGCCATTTCTCGAGCGTACCAAAGCGCGCAACACGCGGGTGAAACTCGACATAACGCTTCCGCCGGAACCCGCCGCTGCCAACGGCGACTCGAAGGAAGCGGCGGCGGCGGCGCCACCGCTAAGGGAGAAGGTCTTCAAATTTACAGAAGCAGCCATCGACAAGATCAAGACAAAAGTGAACTCAACAAACCCTCCATCGGACGGCTCAAATCCATTCTCCACTTTCCAATCGCTCTCCGTCCATATATGGCGCCACGTCACCCAAGCCCGAAACCTAAAGCCAGAGGACATCACCGTGTTCACCGTTTTCGCCGACTGCCGCAAACGCGTCGATCCGCCAATGCCGGAGAGCTACTTCGGAAACCTAATCCAAGCCATATTCACCGGCACCGCCGCCGGGCTACTACTAATGAATCCGCCGGAATTCGGCGCCGGCGTGATCCAAAAGGCAATCGTTTCACACAATACGGCCGCGATCGATCAAAGAAACAAAGAATGGGAGAGCGCGCCGAAGATCTTCCAGTTCAAGGACGCGGGAATGAACTGCGTGGCGGTGGGGAGTTCGCCGAGATTTAAAGTGTACGATGTGGATTTCGGATGGGGAAAACCGGAGAGCGTGAGAAGTGGATGTAACAACAGATTCGACGGAATGATGTATCTGTACCAAGGGAAGGACGGAGGGAGGAGCATTGATGTGGAGATAAGCTTGGAGGAAGAAGCTATGGCGAGGCTGGAGAAGGACAAGGAGTTTCTCATGGAGAATTAG

Protein sequence

MAGDLPEVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREKVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFLMEN
BLAST of Cp4.1LG14g06330 vs. Swiss-Prot
Match: DCR_ARATH (BAHD acyltransferase DCR OS=Arabidopsis thaliana GN=DCR PE=2 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 2.0e-170
Identity = 306/450 (68.00%), Postives = 358/450 (79.56%), Query Frame = 1

Query: 17  LKVKITGKTHVKPVKK-LDRKECQLVTFDLPYLAFYYNQKLLLYG-DNGGEVRFP----E 76
           +K+KI  KTHVKP K  L +K+  L TFDLPYLAFYYNQK LLY   N  ++  P    E
Sbjct: 1   MKIKIMSKTHVKPTKPVLGKKQFHLTTFDLPYLAFYYNQKFLLYKFQNLLDLEEPTFQNE 60

Query: 77  MVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYD---NDMEGVEVAEAVADDVCIADL 136
           +VE LKDGL  VLE F+QLAG+L KD +GVFRVEYD   +++ GVE + A A DV + DL
Sbjct: 61  VVENLKDGLGLVLEDFYQLAGKLAKDDEGVFRVEYDAEDSEINGVEFSVAHAADVTVDDL 120

Query: 137 VAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFM 196
            AE+GTA  KEL+PYNGILNLEGL RPL+AVQ+TKLKDG+AMG AFNHAVLDGT+TWHFM
Sbjct: 121 TAEDGTAKFKELVPYNGILNLEGLSRPLLAVQVTKLKDGLAMGLAFNHAVLDGTSTWHFM 180

Query: 197 SSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEP-AAANGDSKEAAAAAPP-LRE 256
           SSWAEICRGAQ IS  PFL+R+KAR+TRVKLD+T P +P   +NG+        PP L E
Sbjct: 181 SSWAEICRGAQSISTQPFLDRSKARDTRVKLDLTAPKDPNETSNGEDAANPTVEPPQLVE 240

Query: 257 KVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTV 316
           K+F+F++ A+  IK++ NS  P SD S PFSTFQSL+ HIWRHVT AR LKPEDIT+FTV
Sbjct: 241 KIFRFSDFAVHTIKSRANSVIP-SDSSKPFSTFQSLTSHIWRHVTLARGLKPEDITIFTV 300

Query: 317 FADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR 376
           FADCR+RVDPPMPE YFGNLIQAIFTGTAAGLL  + PEFGA VIQKAI +H+ + ID R
Sbjct: 301 FADCRRRVDPPMPEEYFGNLIQAIFTGTAAGLLAAHGPEFGASVIQKAIAAHDASVIDAR 360

Query: 377 NKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQ 436
           N EWE +PKIFQFKDAG+NCVAVGSSPRF+VY+VDFG+GKPE+VRSG NNRF+GMMYLYQ
Sbjct: 361 NDEWEKSPKIFQFKDAGVNCVAVGSSPRFRVYEVDFGFGKPETVRSGSNNRFNGMMYLYQ 420

Query: 437 GKDGGRSIDVEISLEEEAMARLEKDKEFLM 456
           GK GG SIDVEI+LE   M +L K KEFL+
Sbjct: 421 GKAGGISIDVEITLEASVMEKLVKSKEFLL 449

BLAST of Cp4.1LG14g06330 vs. Swiss-Prot
Match: Y3028_ARATH (Uncharacterized acetyltransferase At3g50280 OS=Arabidopsis thaliana GN=At3g50280 PE=3 SV=1)

HSP 1 Score: 198.7 bits (504), Expect = 1.3e-49
Identity = 135/431 (31.32%), Postives = 217/431 (50.35%), Query Frame = 1

Query: 35  RKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLG 94
           R++  L  FDL  L   Y Q+ LL+     E  F   + +L+  L + L+ +   AGRL 
Sbjct: 22  REKIHLTPFDLNLLYVDYTQRGLLFPKPDPETHF---ISRLRTSLSSALDIYFPFAGRLN 81

Query: 95  K---DKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEGTAT--LKELIPYNGILNLE 154
           K    +D       + D  G +   AV+D V ++DL+  +G+     +   P NG+ +++
Sbjct: 82  KVENHEDETVSFYINCDGSGAKFIHAVSDSVSVSDLLRPDGSVPDFFRIFYPMNGVKSID 141

Query: 155 GLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWAEICRGAQDISVAPFLERT 214
           GL  PL+A+Q+T+++DGV +G  +NH V DG + W+F  +W++IC   Q  ++ P   + 
Sbjct: 142 GLSEPLLALQVTEMRDGVFIGFGYNHMVADGASIWNFFRTWSKICSNGQRENLQPLALKG 201

Query: 215 KARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREKVFKFTEAAIDKIKTKVNSTNPPS 274
              +  +   I +P      +  S+E +   P  +E+VF FT+  I  +K KVN      
Sbjct: 202 LFVDG-MDFPIHIPVSDTETSPPSRELS---PTFKERVFHFTKRNISDLKAKVNGEIGLR 261

Query: 275 DGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADCRKRVDPPMPESYFGNLIQAI 334
           D  +  S+ Q++S H+WR + +   L  E+ T   V  D R+R++PP+ +  FG++I   
Sbjct: 262 D--HKVSSLQAVSAHMWRSIIRHSGLNQEEKTRCFVAVDLRQRLNPPLDKECFGHVIYNS 321

Query: 335 FTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR--NKEWESAPKIFQFKDAG----M 394
              T  G L  +    G   +Q   +  +    D R   + W    KI Q    G     
Sbjct: 322 VVTTTVGEL--HDQGLGWAFLQINNMLRSLTNEDYRIYAENWVRNMKI-QKSGLGSKMTR 381

Query: 395 NCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDGGRSIDVEISLEEEA 454
           + V V SSPRF+VYD DFGWGKP +VR+G +N   G +  ++G + G  IDV   L  + 
Sbjct: 382 DSVIVSSSPRFEVYDNDFGWGKPIAVRAGPSNSISGKLVFFRGIEEG-CIDVHAFLLPDV 439

BLAST of Cp4.1LG14g06330 vs. Swiss-Prot
Match: HST_TOBAC (Shikimate O-hydroxycinnamoyltransferase OS=Nicotiana tabacum GN=HST PE=1 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.1e-35
Identity = 122/445 (27.42%), Postives = 201/445 (45.17%), Query Frame = 1

Query: 17  LKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLK 76
           +K+++   T VKP  +  ++       DL  +  ++   +  Y   G    F   V  LK
Sbjct: 1   MKIEVKESTMVKPAAETPQQRLWNSNVDL-VVPNFHTPSVYFYRPTGSPNFFDGKV--LK 60

Query: 77  DGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEGTATL 136
           + L   L PF+ +AGRL +D+DG  R+E D   +GV   EA +D V + D      T  L
Sbjct: 61  EALSKALVPFYPMAGRLCRDEDG--RIEIDCKGQGVLFVEAESDGV-VDDFGDFAPTLEL 120

Query: 137 KELIPYNGILNLEGLQR-PLMAVQLTKLK-DGVAMGCAFNHAVLDGTATWHFMSSWAEIC 196
           ++LIP   +   +G+Q   L+ +Q+T  K  GV++G    H   DG +  HF+++W+++ 
Sbjct: 121 RQLIP--AVDYSQGIQSYALLVLQITHFKCGGVSLGVGMQHHAADGASGLHFINTWSDMA 180

Query: 197 RGAQDISVAPFLERTKAR-----NTRVKLDITLPPEPAAANGDSKEAAAAAPPLREKVFK 256
           RG  D+++ PF++RT  R       +       PP       ++   + A P     +FK
Sbjct: 181 RGL-DLTIPPFIDRTLLRARDPPQPQFPHVEYQPPPTLKVTPENTPISEAVPETSVSIFK 240

Query: 257 FTEAAIDKIKTKVNSTNPPSDGSNP-FSTFQSLSVHIWRHVTQARNLKPEDITVFTVFAD 316
            T   I+ +K K        DG+   +S+++ L+ H+WR    AR L  +  T   +  D
Sbjct: 241 LTRDQINTLKAKSKE-----DGNTVNYSSYEMLAGHVWRSTCMARGLAHDQETKLYIATD 300

Query: 317 CRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRNKE 376
            R R+ P +P  YFGN+I        AG +   P  + A  +  A+   +   +      
Sbjct: 301 GRSRLRPSLPPGYFGNVIFTTTPIAVAGDIQSKPIWYAASKLHDALARMDNDYLRSALDY 360

Query: 377 WESAPKIFQFKDAG--MNCVAVG--SSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLY 436
            E  P +           C  +G  S  R  ++D DFGWG+P  +  G    ++G+ ++ 
Sbjct: 361 LELQPDLKALVRGAHTFKCPNLGITSWSRLPIHDADFGWGRPIFMGPG-GIAYEGLSFIL 420

Query: 437 QGKDGGRSIDVEISLEEEAMARLEK 450
                  S  V ISL+ E M   EK
Sbjct: 421 PSPTNDGSQSVAISLQAEHMKLFEK 430

BLAST of Cp4.1LG14g06330 vs. Swiss-Prot
Match: HST_ARATH (Shikimate O-hydroxycinnamoyltransferase OS=Arabidopsis thaliana GN=HST PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 3.2e-35
Identity = 127/458 (27.73%), Postives = 209/458 (45.63%), Query Frame = 1

Query: 17  LKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLK 76
           +K+ I   T V+P  +           DL  +  ++   +  Y   G    F   V  +K
Sbjct: 1   MKINIRDSTMVRPATETPITNLWNSNVDL-VIPRFHTPSVYFYRPTGASNFFDPQV--MK 60

Query: 77  DGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEG-TAT 136
           + L   L PF+ +AGRL +D DG  R+E D    G  V   VAD   + D   +   T  
Sbjct: 61  EALSKALVPFYPMAGRLKRDDDG--RIEID--CNGAGVLFVVADTPSVIDDFGDFAPTLN 120

Query: 137 LKELIPYNGILNLEGLQR-PLMAVQLTKLK-DGVAMGCAFNHAVLDGTATWHFMSSWAEI 196
           L++LIP   + +  G+   PL+ +Q+T  K  G ++G    H   DG +  HF+++W+++
Sbjct: 121 LRQLIPE--VDHSAGIHSFPLLVLQVTFFKCGGASLGVGMQHHAADGFSGLHFINTWSDM 180

Query: 197 CRGAQDISVAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREK------- 256
            RG  D+++ PF++RT  R          PP+PA  + + + A +   PL          
Sbjct: 181 ARGL-DLTIPPFIDRTLLRARD-------PPQPAFHHVEYQPAPSMKIPLDPSKSGPENT 240

Query: 257 ---VFKFTEAAIDKIKTKVNSTNPPSDGSN-PFSTFQSLSVHIWRHVTQARNLKPEDITV 316
              +FK T   +  +K K        DG+   +S+++ L+ H+WR V +AR L  +  T 
Sbjct: 241 TVSIFKLTRDQLVALKAKSKE-----DGNTVSYSSYEMLAGHVWRSVGKARGLPNDQETK 300

Query: 317 FTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIV----SHN 376
             +  D R R+ P +P  YFGN+I        AG LL  P  + AG I   +V    ++ 
Sbjct: 301 LYIATDGRSRLRPQLPPGYFGNVIFTATPLAVAGDLLSKPTWYAAGQIHDFLVRMDDNYL 360

Query: 377 TAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFD 436
            +A+D    + + +  +          + + S  R  +YD DFGWG+P  +  G    ++
Sbjct: 361 RSALDYLEMQPDLSALVRGAHTYKCPNLGITSWVRLPIYDADFGWGRPIFMGPG-GIPYE 420

Query: 437 GMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           G+ ++        S+ V I+L+ E M   EK   FL E
Sbjct: 421 GLSFVLPSPTNDGSLSVAIALQSEHMKLFEK---FLFE 432

BLAST of Cp4.1LG14g06330 vs. Swiss-Prot
Match: SHT_ARATH (Spermidine hydroxycinnamoyl transferase OS=Arabidopsis thaliana GN=SHT PE=1 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 9.9e-29
Identity = 111/414 (26.81%), Postives = 183/414 (44.20%), Query Frame = 1

Query: 57  LLYGDNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAE 116
           L + D   E     +VE LK  L  VL  F+ +AGRL     G  R E + + EGVE  E
Sbjct: 40  LYFYDKPSESFQGNVVEILKTSLSRVLVHFYPMAGRLRWLPRG--RFELNCNAEGVEFIE 99

Query: 117 AVADDVCIADLVAEEGTATLKELIPYNGILN-LEGLQRPLMAVQLTKLK-DGVAMGCAFN 176
           A ++   ++D      T   + L+P     N +E +  PL   Q+TK K  G+++    +
Sbjct: 100 AESEGK-LSDFKDFSPTPEFENLMPQVNYKNPIETI--PLFLAQVTKFKCGGISLSVNVS 159

Query: 177 HAVLDGTATWHFMSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPP--------EP 236
           HA++DG +  H +S W  + RG + +   PFL+R              PP        +P
Sbjct: 160 HAIVDGQSALHLISEWGRLARG-EPLETVPFLDRKILWAGEPLPPFVSPPKFDHKEFDQP 219

Query: 237 AAANGDSKEAAAAAPPLREKVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIW 296
               G++             +   + + + K+++K N +   SD +  F+ +++++ H+W
Sbjct: 220 PFLIGETDNVEERKKKTIVVMLPLSTSQLQKLRSKANGSKH-SDPAKGFTRYETVTGHVW 279

Query: 297 RHVTQARNLKPEDITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFG 356
           R   +AR   PE  T   +  D R R++PP+P  YFGN    +   + +G L+ N   F 
Sbjct: 280 RCACKARGHSPEQPTALGICIDTRSRMEPPLPRGYFGNATLDVVAASTSGELISNELGFA 339

Query: 357 AGVIQKAI--VSHNTAAIDQRNKEWESAPKIFQFKDA---------GMNCVAVGSSPRFK 416
           A +I KAI  V++    I     + +   K FQ   A         G   + V S     
Sbjct: 340 ASLISKAIKNVTNEYVMIGIEYLKNQKDLKKFQDLHALGSTEGPFYGNPNLGVVSWLTLP 399

Query: 417 VYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDGGRSIDVEISLEEEAMARLEK 450
           +Y +DFGWGK      G ++ FDG   +   ++   S+ +   L+   M   +K
Sbjct: 400 MYGLDFGWGKEFYTGPGTHD-FDGDSLILPDQNEDGSVILATCLQVAHMEAFKK 445

BLAST of Cp4.1LG14g06330 vs. TrEMBL
Match: A0A0A0L8S0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G183330 PE=4 SV=1)

HSP 1 Score: 797.3 bits (2058), Expect = 9.4e-228
Identity = 401/458 (87.55%), Postives = 425/458 (92.79%), Query Frame = 1

Query: 1   MAGDLPEVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYG 60
           MAGDLPE VKKEE   +KVKI GKTHVKP KKL  K  QLVTFDLPYLAFYYNQKL+LYG
Sbjct: 1   MAGDLPEAVKKEEKE-VKVKIIGKTHVKPNKKLGTKHYQLVTFDLPYLAFYYNQKLILYG 60

Query: 61  DNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVAD 120
           DNGGEV+FPE VEKLKDGLE VLEPFHQLAGRLGKD+DG+FRVEYD+DMEGVEVAEAVA+
Sbjct: 61  DNGGEVKFPETVEKLKDGLEMVLEPFHQLAGRLGKDEDGIFRVEYDDDMEGVEVAEAVAE 120

Query: 121 DVCIADLVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDG 180
           DV +ADLVAEEGTATLKELIPYNGILNLEGLQRPL+AVQ+TKLKDG+AMGCAFNHAVLDG
Sbjct: 121 DVGLADLVAEEGTATLKELIPYNGILNLEGLQRPLLAVQITKLKDGIAMGCAFNHAVLDG 180

Query: 181 TATWHFMSSWAEICRGAQDISVAPFLERTKARNTRVKLDIT-LPPEPAAANGDSKEAAAA 240
           TATWHFMSSWAE+ RGAQDISV PFLERTKARNTRVKLDI+  PP+PA+ANGDS   A  
Sbjct: 181 TATWHFMSSWAEVSRGAQDISVPPFLERTKARNTRVKLDISPPPPQPASANGDS--TAPP 240

Query: 241 APPLREKVFKFTEAAIDKIKTKVNSTNPPS-DGSNPFSTFQSLSVHIWRHVTQARNLKPE 300
             PL+EKVFKFTE AI+KIK+KVNS NPP  DGS PFSTFQSLSVHIWRHVTQARNLKPE
Sbjct: 241 PKPLKEKVFKFTETAINKIKSKVNSANPPKPDGSTPFSTFQSLSVHIWRHVTQARNLKPE 300

Query: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHN 360
           DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNP EFGAGVIQKAIVSH+
Sbjct: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPAEFGAGVIQKAIVSHD 360

Query: 361 TAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFD 420
            AAIDQRNKEWESAPKIF+FKDAGMNCVAVGSSPRFKVY+VDFGWGKPESVRSGCNNRFD
Sbjct: 361 AAAIDQRNKEWESAPKIFEFKDAGMNCVAVGSSPRFKVYEVDFGWGKPESVRSGCNNRFD 420

Query: 421 GMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           GMMYLYQGK+GG  IDVEISLEEEAMARLEKDKEF++E
Sbjct: 421 GMMYLYQGKNGG--IDVEISLEEEAMARLEKDKEFVLE 453

BLAST of Cp4.1LG14g06330 vs. TrEMBL
Match: B9RIV2_RICCO (Anthranilate N-benzoyltransferase protein, putative OS=Ricinus communis GN=RCOM_1583010 PE=4 SV=1)

HSP 1 Score: 704.1 bits (1816), Expect = 1.1e-199
Identity = 350/450 (77.78%), Postives = 386/450 (85.78%), Query Frame = 1

Query: 10  KKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLY---GDNGGEV 69
           KK+E   LKV ITGKTHVKP KKL R+E QLVTFDLPYLAFYYNQKLLLY    D+G   
Sbjct: 8   KKKEEETLKVNITGKTHVKPNKKLGRREYQLVTFDLPYLAFYYNQKLLLYKGSADHG--- 67

Query: 70  RFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIAD 129
            F ++V KLKDGL  VLE FHQLAG+LGKD+DGVFRVEYD+DMEGVE+ EA A+ + + D
Sbjct: 68  -FEDIVGKLKDGLGVVLEDFHQLAGKLGKDEDGVFRVEYDDDMEGVEIVEATAEGISLDD 127

Query: 130 LVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHF 189
           L  EEGT + KELIPYNGILNLEGL RPL+AVQLTKLKDGV MGCAFNHA+LDGT+TWHF
Sbjct: 128 LTVEEGTTSFKELIPYNGILNLEGLHRPLLAVQLTKLKDGVVMGCAFNHAILDGTSTWHF 187

Query: 190 MSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREK 249
           MSSWAEIC GA  ISV+PFL+RTKAR+TRVKLD+TLPP+P  A+ ++   A   P LREK
Sbjct: 188 MSSWAEICNGATSISVSPFLQRTKARDTRVKLDVTLPPDPLDASSEAD--ARPVPQLREK 247

Query: 250 VFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVF 309
           VFKF+EAAID IK+KVN+ NPP DGS PFSTFQSL+VHIWRHVT AR LKPED TVFTVF
Sbjct: 248 VFKFSEAAIDMIKSKVNA-NPPLDGSKPFSTFQSLAVHIWRHVTHARELKPEDYTVFTVF 307

Query: 310 ADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRN 369
           ADCRKRVDPPMPESYFGNLIQAIFT TA GLL MNPPEFGA VIQKAI +H+  AI++RN
Sbjct: 308 ADCRKRVDPPMPESYFGNLIQAIFTATAVGLLTMNPPEFGAAVIQKAIEAHDAKAINERN 367

Query: 370 KEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQG 429
           KEWESAPKIFQFKDAG+NCVAVGSSPRF VYDVDFGWGKPESVRSGCNNRFDGM+YLYQG
Sbjct: 368 KEWESAPKIFQFKDAGVNCVAVGSSPRFPVYDVDFGWGKPESVRSGCNNRFDGMVYLYQG 427

Query: 430 KDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           K GGRSIDVEISLE   M RLEKDK+FL+E
Sbjct: 428 KSGGRSIDVEISLEAGVMERLEKDKDFLLE 450

BLAST of Cp4.1LG14g06330 vs. TrEMBL
Match: A0A061G131_THECC (HXXXD-type acyl-transferase family protein OS=Theobroma cacao GN=TCM_015221 PE=4 SV=1)

HSP 1 Score: 700.7 bits (1807), Expect = 1.2e-198
Identity = 347/451 (76.94%), Postives = 391/451 (86.70%), Query Frame = 1

Query: 7   EVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEV 66
           EV KKEE    KVKIT K HVKP K + RKECQLVTFDLPYLAFYYNQKLL Y   GGE 
Sbjct: 12  EVGKKEEMA--KVKITSKNHVKPCKIIGRKECQLVTFDLPYLAFYYNQKLLFY--KGGE- 71

Query: 67  RFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIAD 126
            F + VEKLKDGL  VLE F+QL G+LGKD++GVFRV+YD+DM+GVEV EA A+ + + +
Sbjct: 72  -FEDKVEKLKDGLRVVLEEFYQLGGKLGKDEEGVFRVDYDDDMDGVEVLEATAEGISVDE 131

Query: 127 LVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHF 186
           L A+EGT++LK+LIPYNG+LNLEG  RPL++VQLTKLKDG+AMGCAFNHA+LDGT+TWHF
Sbjct: 132 LAADEGTSSLKDLIPYNGVLNLEGQNRPLLSVQLTKLKDGLAMGCAFNHAILDGTSTWHF 191

Query: 187 MSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPA-AANGDSKEAAAAAPPLRE 246
           MSSWA+IC G+  ISV PFLERTKARNTRVKLD++LPP P  + NGD+ +     P LRE
Sbjct: 192 MSSWAQICSGSNSISVQPFLERTKARNTRVKLDLSLPPNPVESTNGDANQ----GPQLRE 251

Query: 247 KVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTV 306
           K+F+F+EAAIDKIK+KVNS NPPSDGS PFSTFQSLSVHIW HVTQARNLKPED TVFTV
Sbjct: 252 KLFRFSEAAIDKIKSKVNS-NPPSDGSKPFSTFQSLSVHIWHHVTQARNLKPEDYTVFTV 311

Query: 307 FADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR 366
           FADCRKRVDPPMPESYFGNLIQAIFT TAAGLLL NPPEFGA ++QKAI +HN+ AID+R
Sbjct: 312 FADCRKRVDPPMPESYFGNLIQAIFTVTAAGLLLANPPEFGASIVQKAIEAHNSKAIDER 371

Query: 367 NKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQ 426
           NKEWE+APKIFQFKDAG+NCVAVGSSPRFKVYDVDFGWGKPE VRSG NNRFDGM+YLYQ
Sbjct: 372 NKEWEAAPKIFQFKDAGVNCVAVGSSPRFKVYDVDFGWGKPEGVRSGSNNRFDGMVYLYQ 431

Query: 427 GKDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           GK GGRSIDVEI+LE  AM +LEKDKEFLME
Sbjct: 432 GKAGGRSIDVEITLEAGAMEKLEKDKEFLME 451

BLAST of Cp4.1LG14g06330 vs. TrEMBL
Match: A0A067JEG8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26077 PE=4 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 1.1e-196
Identity = 339/447 (75.84%), Postives = 386/447 (86.35%), Query Frame = 1

Query: 11  KEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPE 70
           KEET  +KVKITGK+HVKP KKL R+ECQL+TFDLPYLAFYYNQK LLY   G    F +
Sbjct: 10  KEET--VKVKITGKSHVKPNKKLGRRECQLITFDLPYLAFYYNQKFLLY--KGSNDSFAD 69

Query: 71  MVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAE 130
           MV KLKDGL  VLE FHQLAG++GKD+DGVFRVEYD+DMEGVE+ EA+A+   + DL AE
Sbjct: 70  MVGKLKDGLGVVLEDFHQLAGKIGKDEDGVFRVEYDDDMEGVELVEAIAEGTSVEDLTAE 129

Query: 131 EGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSW 190
           +GT TLK+ IPYNGILNLEGL RPL+AVQLTKLKDG+A+GCAFNHA+LDGT+TWHFMSSW
Sbjct: 130 DGTTTLKDFIPYNGILNLEGLHRPLLAVQLTKLKDGLAIGCAFNHAILDGTSTWHFMSSW 189

Query: 191 AEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPAA-ANGDSKEAAAAAPPLREKVFK 250
           AEIC G+  ISV PFLERTK RNTRVKLD++LPP+P + +NGD+K   +A P LREKVFK
Sbjct: 190 AEICNGSHSISVTPFLERTKTRNTRVKLDLSLPPDPLSISNGDAK---SAVPELREKVFK 249

Query: 251 FTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADC 310
           F+E+ IDKIK+ VN+ NPPSDG  P+STFQSL+ HIWRHV+ AR LKPED TVFTVFADC
Sbjct: 250 FSESTIDKIKSTVNA-NPPSDGLKPYSTFQSLAAHIWRHVSHARELKPEDFTVFTVFADC 309

Query: 311 RKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRNKEW 370
           RKRVDP MP++YFGNLIQAIFT TA GLL MNPPEFGA VIQ AI +HN  AID+RNKEW
Sbjct: 310 RKRVDPAMPDNYFGNLIQAIFTATAVGLLSMNPPEFGASVIQNAIEAHNAKAIDERNKEW 369

Query: 371 ESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDG 430
           ES+PKIFQFKDAG+NCVAVGSSPRF+VY+VDFGWGKPESVRSG NNRFDGM+YLYQGK G
Sbjct: 370 ESSPKIFQFKDAGVNCVAVGSSPRFRVYEVDFGWGKPESVRSGSNNRFDGMVYLYQGKSG 429

Query: 431 GRSIDVEISLEEEAMARLEKDKEFLME 457
           GRSIDVEI+LE  AM RLEKDK+FL+E
Sbjct: 430 GRSIDVEITLEAGAMERLEKDKDFLLE 448

BLAST of Cp4.1LG14g06330 vs. TrEMBL
Match: A0A067ELZ3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013527mg PE=4 SV=1)

HSP 1 Score: 681.4 bits (1757), Expect = 7.6e-193
Identity = 337/440 (76.59%), Postives = 382/440 (86.82%), Query Frame = 1

Query: 18  KVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLKD 77
           KVKIT KTHVKP K + R+ECQLVTFDLPYL FYYNQKLLLY    G+  F + VEKLKD
Sbjct: 6   KVKITSKTHVKPSKVIGRRECQLVTFDLPYLFFYYNQKLLLY--RNGDREFHDAVEKLKD 65

Query: 78  GLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEGTATLK 137
           GL+ VLE F+QLAG+LGKD++GVFRVEYD+DM+GVEVAEAVAD++ + DL AEEGT + K
Sbjct: 66  GLKHVLEEFYQLAGKLGKDEEGVFRVEYDDDMDGVEVAEAVADEIAVDDLTAEEGTKSFK 125

Query: 138 ELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWAEICRGA 197
           ELIPY G+LNLEG  RPL+A+QLTKLKDG+A+GCAFNHA+LDGT+TWHFMSSWA+ C GA
Sbjct: 126 ELIPYYGVLNLEGTHRPLLALQLTKLKDGMAIGCAFNHAILDGTSTWHFMSSWAQACNGA 185

Query: 198 QDISVAPFLERTKARNTRVKLDITLPPEP-AAANGDSKEAAAAAPPLREKVFKFTEAAID 257
             +SV PFLERTK+RNTRVKLD +LPP+P + ++GD+  AA   P LREKVFKF+EAAID
Sbjct: 186 TSVSVPPFLERTKSRNTRVKLDPSLPPDPHSQSSGDA--AAKQEPVLREKVFKFSEAAID 245

Query: 258 KIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADCRKRVDPP 317
           KIK+KVN  N PSDGS PFSTFQSL+VHIWRHVT AR+LKPED TVFTVFADCRKRVDPP
Sbjct: 246 KIKSKVNE-NGPSDGSKPFSTFQSLAVHIWRHVTHARSLKPEDYTVFTVFADCRKRVDPP 305

Query: 318 MPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRNKEWESAPKIF 377
           MP+SYFGNLIQAIFT TAAGLL  NPPEFGA +IQKAI +H   AID+RN+ WE+APKIF
Sbjct: 306 MPDSYFGNLIQAIFTVTAAGLLTGNPPEFGASMIQKAIEAHTAKAIDERNEGWENAPKIF 365

Query: 378 QFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDGGRSIDVE 437
           QFKDAG+NCVAVGSSPRFKVY+VDFGWGKPESVRSG NNRFDGM+YLYQGK GGRSIDVE
Sbjct: 366 QFKDAGVNCVAVGSSPRFKVYEVDFGWGKPESVRSGSNNRFDGMVYLYQGKSGGRSIDVE 425

Query: 438 ISLEEEAMARLEKDKEFLME 457
           I+LE  AM RLEKD EFLME
Sbjct: 426 ITLEPGAMERLEKDNEFLME 440

BLAST of Cp4.1LG14g06330 vs. TAIR10
Match: AT5G23940.1 (AT5G23940.1 HXXXD-type acyl-transferase family protein)

HSP 1 Score: 600.1 bits (1546), Expect = 1.1e-171
Identity = 306/450 (68.00%), Postives = 358/450 (79.56%), Query Frame = 1

Query: 17  LKVKITGKTHVKPVKK-LDRKECQLVTFDLPYLAFYYNQKLLLYG-DNGGEVRFP----E 76
           +K+KI  KTHVKP K  L +K+  L TFDLPYLAFYYNQK LLY   N  ++  P    E
Sbjct: 1   MKIKIMSKTHVKPTKPVLGKKQFHLTTFDLPYLAFYYNQKFLLYKFQNLLDLEEPTFQNE 60

Query: 77  MVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYD---NDMEGVEVAEAVADDVCIADL 136
           +VE LKDGL  VLE F+QLAG+L KD +GVFRVEYD   +++ GVE + A A DV + DL
Sbjct: 61  VVENLKDGLGLVLEDFYQLAGKLAKDDEGVFRVEYDAEDSEINGVEFSVAHAADVTVDDL 120

Query: 137 VAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFM 196
            AE+GTA  KEL+PYNGILNLEGL RPL+AVQ+TKLKDG+AMG AFNHAVLDGT+TWHFM
Sbjct: 121 TAEDGTAKFKELVPYNGILNLEGLSRPLLAVQVTKLKDGLAMGLAFNHAVLDGTSTWHFM 180

Query: 197 SSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEP-AAANGDSKEAAAAAPP-LRE 256
           SSWAEICRGAQ IS  PFL+R+KAR+TRVKLD+T P +P   +NG+        PP L E
Sbjct: 181 SSWAEICRGAQSISTQPFLDRSKARDTRVKLDLTAPKDPNETSNGEDAANPTVEPPQLVE 240

Query: 257 KVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTV 316
           K+F+F++ A+  IK++ NS  P SD S PFSTFQSL+ HIWRHVT AR LKPEDIT+FTV
Sbjct: 241 KIFRFSDFAVHTIKSRANSVIP-SDSSKPFSTFQSLTSHIWRHVTLARGLKPEDITIFTV 300

Query: 317 FADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR 376
           FADCR+RVDPPMPE YFGNLIQAIFTGTAAGLL  + PEFGA VIQKAI +H+ + ID R
Sbjct: 301 FADCRRRVDPPMPEEYFGNLIQAIFTGTAAGLLAAHGPEFGASVIQKAIAAHDASVIDAR 360

Query: 377 NKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQ 436
           N EWE +PKIFQFKDAG+NCVAVGSSPRF+VY+VDFG+GKPE+VRSG NNRF+GMMYLYQ
Sbjct: 361 NDEWEKSPKIFQFKDAGVNCVAVGSSPRFRVYEVDFGFGKPETVRSGSNNRFNGMMYLYQ 420

Query: 437 GKDGGRSIDVEISLEEEAMARLEKDKEFLM 456
           GK GG SIDVEI+LE   M +L K KEFL+
Sbjct: 421 GKAGGISIDVEITLEASVMEKLVKSKEFLL 449

BLAST of Cp4.1LG14g06330 vs. TAIR10
Match: AT5G01210.1 (AT5G01210.1 HXXXD-type acyl-transferase family protein)

HSP 1 Score: 259.2 bits (661), Expect = 4.7e-69
Identity = 159/465 (34.19%), Postives = 238/465 (51.18%), Query Frame = 1

Query: 24  KTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLKDGLETVL 83
           K  V P KK    + +L   DLP L+ +Y QK +L         F ++V  L+  L + L
Sbjct: 11  KCIVYPEKKSTVSDLRLSVSDLPMLSCHYIQKGVLLTSPPPSFSFDDLVSSLRRSLSSTL 70

Query: 84  EPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLV--AEEGTATLKELIP 143
             F  LAGR      G   +  ++   GV+   A A  V ++D++   E+     +E   
Sbjct: 71  SLFPALAGRFSTTPAGHISIVCND--AGVDFVAASAKHVKLSDVLLPGEDVPLLFREFFV 130

Query: 144 YNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWAEICRGAQDIS 203
           +  +++  G  +PL AVQ+T+L DGV +GC  NH+V DGT+ WHF +++A++  GA  I 
Sbjct: 131 FERLVSYNGHHKPLAAVQVTELHDGVFIGCTVNHSVTDGTSFWHFFNTFADVTSGACKIK 190

Query: 204 VAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREKVFKFTEAAIDKIKTK 263
             P      +R+T     + LP  P    G  +    A  PLRE++F F+  AI K+K +
Sbjct: 191 HLPDF----SRHTVFDSPVVLPVPP----GGPRVTFDADQPLRERIFHFSREAITKLKQR 250

Query: 264 VNS-------------------------------TNPPS-DGSNPFSTFQSLSVHIWRHV 323
            N+                                N  S D +   S+FQSLS  +WR V
Sbjct: 251 TNNRVNGIETAVNDGRKCNGEINGKITTVLDSFLNNKKSYDRTAEISSFQSLSAQLWRSV 310

Query: 324 TQARNLKPEDITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGV 383
           T+ARNL P   T F +  +CR R++P M   YFGN IQ+I T  +AG LL     + A  
Sbjct: 311 TRARNLDPSKTTTFRMAVNCRHRLEPKMDPYYFGNAIQSIPTLASAGDLLSKDLRWSAEQ 370

Query: 384 IQKAIVSHNTAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESV 443
           + + +V+H+ A + +    WES P++F   +     + +GSSPRF +YD DFGWGKP +V
Sbjct: 371 LHRNVVAHDDATVRRGIAAWESDPRLFPLGNPDGASITMGSSPRFPMYDNDFGWGKPLAV 430

Query: 444 RSGCNNRFDGMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFL 455
           RSG  N+FDG +  + G++G  S+D+E+ L  E M  +E D EF+
Sbjct: 431 RSGGANKFDGKISAFPGREGNGSVDLEVVLAPETMTGIENDAEFM 465

BLAST of Cp4.1LG14g06330 vs. TAIR10
Match: AT5G42830.1 (AT5G42830.1 HXXXD-type acyl-transferase family protein)

HSP 1 Score: 220.3 bits (560), Expect = 2.4e-57
Identity = 152/451 (33.70%), Postives = 235/451 (52.11%), Query Frame = 1

Query: 19  VKITGKTHVKP--VKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLK 78
           VKI  K+ VKP  + +  ++   L  +D   L+  Y QK LL+     +     ++EKLK
Sbjct: 7   VKIVSKSFVKPKTLPEESKQPYYLSPWDYAMLSVQYIQKGLLFHKPPLD-SIDTLLEKLK 66

Query: 79  DGLETVLEPFHQLAGRLGK---DKDGVFRVEYD-NDMEGVEVAEAVADDVCIADLV-AEE 138
           D L   L  F+ LAGRL     +K   + V  D ND  G     A +D +CI D+V A+ 
Sbjct: 67  DSLAVTLVHFYPLAGRLSSLTTEKPKSYSVFVDCNDSPGAGFIYATSD-LCIKDIVGAKY 126

Query: 139 GTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWA 198
             + ++    ++  +N +G    L++VQ+T+L DG+ +G + NHA+ DGTA W F ++W+
Sbjct: 127 VPSIVQSFFDHHKAVNHDGHTMSLLSVQVTELVDGIFIGLSMNHAMGDGTAFWKFFTAWS 186

Query: 199 EICRGAQD-------ISVAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLR 258
           EI +G +        +   P L+R           +        ++ D       +P L+
Sbjct: 187 EIFQGQESNQNDDLCLKNPPVLKRYIPEGYGPLFSLPY------SHPDEFIRTYESPILK 246

Query: 259 EKVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFT 318
           E++F F+   I  +KT+VN       G+   S+FQSL+  IWR +T+AR L  +  T   
Sbjct: 247 ERMFCFSSETIRMLKTRVNQIC----GTTSISSFQSLTAVIWRCITRARRLPLDRETSCR 306

Query: 319 VFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQ 378
           V AD R R+ PP+ + YFGN + A+ T   AG LL N   F A  + +A+  H +  + Q
Sbjct: 307 VAADNRGRMYPPLHKDYFGNCLSALRTAAKAGELLENDLGFAALKVHQAVAEHTSEKVSQ 366

Query: 379 RNKEWESAPKIFQF-KDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYL 438
              +W  +P I+   +      V +GSSPRF  Y  +FG GK  ++RSG  ++FDG +  
Sbjct: 367 MIDQWLKSPYIYHIDRLFEPMSVMMGSSPRFNKYGCEFGLGKGVTLRSGYAHKFDGKVSA 426

Query: 439 YQGKDGGRSIDVEISLEEEAMARLEKDKEFL 455
           Y G++GG SID+E+ L  E M  LE D+EF+
Sbjct: 427 YPGREGGGSIDLEVCLVPEFMEALESDEEFM 445

BLAST of Cp4.1LG14g06330 vs. TAIR10
Match: AT5G67150.1 (AT5G67150.1 HXXXD-type acyl-transferase family protein)

HSP 1 Score: 204.9 bits (520), Expect = 1.0e-52
Identity = 139/451 (30.82%), Postives = 219/451 (48.56%), Query Frame = 1

Query: 18  KVKITGKTHVKPV---KKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEK 77
           +V +  K+ V+P    ++ DR +  L  +DL +L   Y Q+ LL+     E     ++ +
Sbjct: 4   EVVVISKSIVRPESYNEESDRVKIHLTPWDLFFLRSEYPQRGLLFPQPDPETH---IISQ 63

Query: 78  LKDGLETVLEPFHQLAGRLGKDKD---GVFRVEYDNDMEGVEVAEAVADDVCIADLVAEE 137
           LK  L   L+ F+  AGRL K ++   G      D D  GV+   A A  V ++D++   
Sbjct: 64  LKSSLSVALKIFYPFAGRLVKVENEDAGTASFYVDCDGSGVKFIHASAKSVSVSDVLEPV 123

Query: 138 GTAT---LKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMS 197
            +     L    P NG+ + EG+   L+A Q+T+LKDGV +G  +NH V DG++ W F +
Sbjct: 124 DSNVPEFLNRFFPANGVRSCEGISESLIAFQVTELKDGVFIGFGYNHIVADGSSFWSFFN 183

Query: 198 SWAEICRGAQDI----SVAPFLERTKARNTRVKLDITLP-PEPAAANGDSKEAAAAAPPL 257
           +W+EIC    D        P L R    +  ++  I +P PE    N         +  L
Sbjct: 184 TWSEICFNGFDADHRRKFPPLLLRGWFLD-GIEYPIRIPLPETETPN----RVVVTSSLL 243

Query: 258 REKVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVF 317
           +EK+F+ T   I ++K K N      D     S+ Q++S ++WR + +   L PE++   
Sbjct: 244 QEKIFRVTSRNISELKAKANGEVDSDD--RKISSLQAVSAYMWRSIIRNSGLNPEEVIHC 303

Query: 318 TVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAID 377
            +  D R R++PP+ +  FGN++      T    +L N   + A  I K + S       
Sbjct: 304 KLLVDMRGRLNPPLEKECFGNVVGFATVTTTVAEMLHNGLGWAALQINKTVGSQTNEEFR 363

Query: 378 QRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYL 437
           +  + W   P I   K A  N + + SSPRF VY  DFGWGKP +VR+G  N  DG +  
Sbjct: 364 EFAENWVKKPSILNAK-AFSNSITIASSPRFNVYGNDFGWGKPIAVRAGPGNTTDGKLIA 423

Query: 438 YQGKDGGRSIDVEISLEEEAMARLEKDKEFL 455
           Y G + G +I+ +  L    + +L  D+EFL
Sbjct: 424 YPGIEEG-NIEFQTCLSSSVLEKLSTDEEFL 442

BLAST of Cp4.1LG14g06330 vs. TAIR10
Match: AT3G50280.1 (AT3G50280.1 HXXXD-type acyl-transferase family protein)

HSP 1 Score: 198.7 bits (504), Expect = 7.5e-51
Identity = 135/431 (31.32%), Postives = 217/431 (50.35%), Query Frame = 1

Query: 35  RKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLG 94
           R++  L  FDL  L   Y Q+ LL+     E  F   + +L+  L + L+ +   AGRL 
Sbjct: 22  REKIHLTPFDLNLLYVDYTQRGLLFPKPDPETHF---ISRLRTSLSSALDIYFPFAGRLN 81

Query: 95  K---DKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAEEGTAT--LKELIPYNGILNLE 154
           K    +D       + D  G +   AV+D V ++DL+  +G+     +   P NG+ +++
Sbjct: 82  KVENHEDETVSFYINCDGSGAKFIHAVSDSVSVSDLLRPDGSVPDFFRIFYPMNGVKSID 141

Query: 155 GLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSWAEICRGAQDISVAPFLERT 214
           GL  PL+A+Q+T+++DGV +G  +NH V DG + W+F  +W++IC   Q  ++ P   + 
Sbjct: 142 GLSEPLLALQVTEMRDGVFIGFGYNHMVADGASIWNFFRTWSKICSNGQRENLQPLALKG 201

Query: 215 KARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREKVFKFTEAAIDKIKTKVNSTNPPS 274
              +  +   I +P      +  S+E +   P  +E+VF FT+  I  +K KVN      
Sbjct: 202 LFVDG-MDFPIHIPVSDTETSPPSRELS---PTFKERVFHFTKRNISDLKAKVNGEIGLR 261

Query: 275 DGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADCRKRVDPPMPESYFGNLIQAI 334
           D  +  S+ Q++S H+WR + +   L  E+ T   V  D R+R++PP+ +  FG++I   
Sbjct: 262 D--HKVSSLQAVSAHMWRSIIRHSGLNQEEKTRCFVAVDLRQRLNPPLDKECFGHVIYNS 321

Query: 335 FTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR--NKEWESAPKIFQFKDAG----M 394
              T  G L  +    G   +Q   +  +    D R   + W    KI Q    G     
Sbjct: 322 VVTTTVGEL--HDQGLGWAFLQINNMLRSLTNEDYRIYAENWVRNMKI-QKSGLGSKMTR 381

Query: 395 NCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDGGRSIDVEISLEEEA 454
           + V V SSPRF+VYD DFGWGKP +VR+G +N   G +  ++G + G  IDV   L  + 
Sbjct: 382 DSVIVSSSPRFEVYDNDFGWGKPIAVRAGPSNSISGKLVFFRGIEEG-CIDVHAFLLPDV 439

BLAST of Cp4.1LG14g06330 vs. NCBI nr
Match: gi|659077561|ref|XP_008439271.1| (PREDICTED: BAHD acyltransferase DCR [Cucumis melo])

HSP 1 Score: 801.6 bits (2069), Expect = 7.2e-229
Identity = 401/459 (87.36%), Postives = 426/459 (92.81%), Query Frame = 1

Query: 1   MAGDLPEVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYG 60
           MAGDLPE VKKEE   LKVKI GKTHVKP KKL  K  QL+TFDLPYLAFYYNQKL+LYG
Sbjct: 1   MAGDLPEAVKKEEKE-LKVKIIGKTHVKPNKKLGTKHYQLITFDLPYLAFYYNQKLILYG 60

Query: 61  DNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVAD 120
           DNGGEV+FPE VEKLKDGLE VLEPFHQLAGRLGKD+DG+FRVEYD+DM+GVEVAEAVA+
Sbjct: 61  DNGGEVKFPETVEKLKDGLEMVLEPFHQLAGRLGKDEDGIFRVEYDDDMKGVEVAEAVAE 120

Query: 121 DVCIADLVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDG 180
           DV +ADLVAEEGTATLKELIPYNGILNLEGLQRPL+AVQ+TKLKDG+AMGCAFNHAVLDG
Sbjct: 121 DVALADLVAEEGTATLKELIPYNGILNLEGLQRPLLAVQITKLKDGIAMGCAFNHAVLDG 180

Query: 181 TATWHFMSSWAEICRGAQDISVAPFLERTKARNTRVKLDI-TLPPEPAAANGDSKEAAAA 240
           TATWHFM SWAEICRGAQDISV PFLERTKARNTRVKLDI   PP+P+AANGDS   A  
Sbjct: 181 TATWHFMRSWAEICRGAQDISVPPFLERTKARNTRVKLDIPPPPPQPSAANGDS--TAPP 240

Query: 241 APPLREKVFKFTEAAIDKIKTKVNSTNPP-SDGSNPFSTFQSLSVHIWRHVTQARNLKPE 300
             PL+EKVFKFTE AI+KIK+KVNS NPP +DGS PFSTFQSLSVH+WRHVTQARNLKPE
Sbjct: 241 PKPLKEKVFKFTETAINKIKSKVNSANPPTTDGSTPFSTFQSLSVHVWRHVTQARNLKPE 300

Query: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHN 360
           DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNP EFGAGVIQKAIVSH+
Sbjct: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPAEFGAGVIQKAIVSHD 360

Query: 361 TAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFD 420
            AAIDQRNKEWESAPKIF+FKDAGMNCVAVGSSPRFKVY+VDFGWGKPESVRSGCNNRFD
Sbjct: 361 AAAIDQRNKEWESAPKIFEFKDAGMNCVAVGSSPRFKVYEVDFGWGKPESVRSGCNNRFD 420

Query: 421 GMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFLMEN 458
           GMMYLYQGK+GG  IDVEISLEEEAMARLEKDKEF++EN
Sbjct: 421 GMMYLYQGKNGG--IDVEISLEEEAMARLEKDKEFVVEN 454

BLAST of Cp4.1LG14g06330 vs. NCBI nr
Match: gi|778679662|ref|XP_004140767.2| (PREDICTED: BAHD acyltransferase DCR [Cucumis sativus])

HSP 1 Score: 797.3 bits (2058), Expect = 1.4e-227
Identity = 401/458 (87.55%), Postives = 425/458 (92.79%), Query Frame = 1

Query: 1   MAGDLPEVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYG 60
           MAGDLPE VKKEE   +KVKI GKTHVKP KKL  K  QLVTFDLPYLAFYYNQKL+LYG
Sbjct: 1   MAGDLPEAVKKEEKE-VKVKIIGKTHVKPNKKLGTKHYQLVTFDLPYLAFYYNQKLILYG 60

Query: 61  DNGGEVRFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVAD 120
           DNGGEV+FPE VEKLKDGLE VLEPFHQLAGRLGKD+DG+FRVEYD+DMEGVEVAEAVA+
Sbjct: 61  DNGGEVKFPETVEKLKDGLEMVLEPFHQLAGRLGKDEDGIFRVEYDDDMEGVEVAEAVAE 120

Query: 121 DVCIADLVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDG 180
           DV +ADLVAEEGTATLKELIPYNGILNLEGLQRPL+AVQ+TKLKDG+AMGCAFNHAVLDG
Sbjct: 121 DVGLADLVAEEGTATLKELIPYNGILNLEGLQRPLLAVQITKLKDGIAMGCAFNHAVLDG 180

Query: 181 TATWHFMSSWAEICRGAQDISVAPFLERTKARNTRVKLDIT-LPPEPAAANGDSKEAAAA 240
           TATWHFMSSWAE+ RGAQDISV PFLERTKARNTRVKLDI+  PP+PA+ANGDS   A  
Sbjct: 181 TATWHFMSSWAEVSRGAQDISVPPFLERTKARNTRVKLDISPPPPQPASANGDS--TAPP 240

Query: 241 APPLREKVFKFTEAAIDKIKTKVNSTNPPS-DGSNPFSTFQSLSVHIWRHVTQARNLKPE 300
             PL+EKVFKFTE AI+KIK+KVNS NPP  DGS PFSTFQSLSVHIWRHVTQARNLKPE
Sbjct: 241 PKPLKEKVFKFTETAINKIKSKVNSANPPKPDGSTPFSTFQSLSVHIWRHVTQARNLKPE 300

Query: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHN 360
           DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNP EFGAGVIQKAIVSH+
Sbjct: 301 DITVFTVFADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPAEFGAGVIQKAIVSHD 360

Query: 361 TAAIDQRNKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFD 420
            AAIDQRNKEWESAPKIF+FKDAGMNCVAVGSSPRFKVY+VDFGWGKPESVRSGCNNRFD
Sbjct: 361 AAAIDQRNKEWESAPKIFEFKDAGMNCVAVGSSPRFKVYEVDFGWGKPESVRSGCNNRFD 420

Query: 421 GMMYLYQGKDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           GMMYLYQGK+GG  IDVEISLEEEAMARLEKDKEF++E
Sbjct: 421 GMMYLYQGKNGG--IDVEISLEEEAMARLEKDKEFVLE 453

BLAST of Cp4.1LG14g06330 vs. NCBI nr
Match: gi|255545220|ref|XP_002513671.1| (PREDICTED: BAHD acyltransferase DCR [Ricinus communis])

HSP 1 Score: 704.1 bits (1816), Expect = 1.6e-199
Identity = 350/450 (77.78%), Postives = 386/450 (85.78%), Query Frame = 1

Query: 10  KKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLY---GDNGGEV 69
           KK+E   LKV ITGKTHVKP KKL R+E QLVTFDLPYLAFYYNQKLLLY    D+G   
Sbjct: 8   KKKEEETLKVNITGKTHVKPNKKLGRREYQLVTFDLPYLAFYYNQKLLLYKGSADHG--- 67

Query: 70  RFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIAD 129
            F ++V KLKDGL  VLE FHQLAG+LGKD+DGVFRVEYD+DMEGVE+ EA A+ + + D
Sbjct: 68  -FEDIVGKLKDGLGVVLEDFHQLAGKLGKDEDGVFRVEYDDDMEGVEIVEATAEGISLDD 127

Query: 130 LVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHF 189
           L  EEGT + KELIPYNGILNLEGL RPL+AVQLTKLKDGV MGCAFNHA+LDGT+TWHF
Sbjct: 128 LTVEEGTTSFKELIPYNGILNLEGLHRPLLAVQLTKLKDGVVMGCAFNHAILDGTSTWHF 187

Query: 190 MSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPAAANGDSKEAAAAAPPLREK 249
           MSSWAEIC GA  ISV+PFL+RTKAR+TRVKLD+TLPP+P  A+ ++   A   P LREK
Sbjct: 188 MSSWAEICNGATSISVSPFLQRTKARDTRVKLDVTLPPDPLDASSEAD--ARPVPQLREK 247

Query: 250 VFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVF 309
           VFKF+EAAID IK+KVN+ NPP DGS PFSTFQSL+VHIWRHVT AR LKPED TVFTVF
Sbjct: 248 VFKFSEAAIDMIKSKVNA-NPPLDGSKPFSTFQSLAVHIWRHVTHARELKPEDYTVFTVF 307

Query: 310 ADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRN 369
           ADCRKRVDPPMPESYFGNLIQAIFT TA GLL MNPPEFGA VIQKAI +H+  AI++RN
Sbjct: 308 ADCRKRVDPPMPESYFGNLIQAIFTATAVGLLTMNPPEFGAAVIQKAIEAHDAKAINERN 367

Query: 370 KEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQG 429
           KEWESAPKIFQFKDAG+NCVAVGSSPRF VYDVDFGWGKPESVRSGCNNRFDGM+YLYQG
Sbjct: 368 KEWESAPKIFQFKDAGVNCVAVGSSPRFPVYDVDFGWGKPESVRSGCNNRFDGMVYLYQG 427

Query: 430 KDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           K GGRSIDVEISLE   M RLEKDK+FL+E
Sbjct: 428 KSGGRSIDVEISLEAGVMERLEKDKDFLLE 450

BLAST of Cp4.1LG14g06330 vs. NCBI nr
Match: gi|590673007|ref|XP_007038770.1| (HXXXD-type acyl-transferase family protein [Theobroma cacao])

HSP 1 Score: 700.7 bits (1807), Expect = 1.7e-198
Identity = 347/451 (76.94%), Postives = 391/451 (86.70%), Query Frame = 1

Query: 7   EVVKKEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEV 66
           EV KKEE    KVKIT K HVKP K + RKECQLVTFDLPYLAFYYNQKLL Y   GGE 
Sbjct: 12  EVGKKEEMA--KVKITSKNHVKPCKIIGRKECQLVTFDLPYLAFYYNQKLLFY--KGGE- 71

Query: 67  RFPEMVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIAD 126
            F + VEKLKDGL  VLE F+QL G+LGKD++GVFRV+YD+DM+GVEV EA A+ + + +
Sbjct: 72  -FEDKVEKLKDGLRVVLEEFYQLGGKLGKDEEGVFRVDYDDDMDGVEVLEATAEGISVDE 131

Query: 127 LVAEEGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHF 186
           L A+EGT++LK+LIPYNG+LNLEG  RPL++VQLTKLKDG+AMGCAFNHA+LDGT+TWHF
Sbjct: 132 LAADEGTSSLKDLIPYNGVLNLEGQNRPLLSVQLTKLKDGLAMGCAFNHAILDGTSTWHF 191

Query: 187 MSSWAEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPA-AANGDSKEAAAAAPPLRE 246
           MSSWA+IC G+  ISV PFLERTKARNTRVKLD++LPP P  + NGD+ +     P LRE
Sbjct: 192 MSSWAQICSGSNSISVQPFLERTKARNTRVKLDLSLPPNPVESTNGDANQ----GPQLRE 251

Query: 247 KVFKFTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTV 306
           K+F+F+EAAIDKIK+KVNS NPPSDGS PFSTFQSLSVHIW HVTQARNLKPED TVFTV
Sbjct: 252 KLFRFSEAAIDKIKSKVNS-NPPSDGSKPFSTFQSLSVHIWHHVTQARNLKPEDYTVFTV 311

Query: 307 FADCRKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQR 366
           FADCRKRVDPPMPESYFGNLIQAIFT TAAGLLL NPPEFGA ++QKAI +HN+ AID+R
Sbjct: 312 FADCRKRVDPPMPESYFGNLIQAIFTVTAAGLLLANPPEFGASIVQKAIEAHNSKAIDER 371

Query: 367 NKEWESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQ 426
           NKEWE+APKIFQFKDAG+NCVAVGSSPRFKVYDVDFGWGKPE VRSG NNRFDGM+YLYQ
Sbjct: 372 NKEWEAAPKIFQFKDAGVNCVAVGSSPRFKVYDVDFGWGKPEGVRSGSNNRFDGMVYLYQ 431

Query: 427 GKDGGRSIDVEISLEEEAMARLEKDKEFLME 457
           GK GGRSIDVEI+LE  AM +LEKDKEFLME
Sbjct: 432 GKAGGRSIDVEITLEAGAMEKLEKDKEFLME 451

BLAST of Cp4.1LG14g06330 vs. NCBI nr
Match: gi|802768910|ref|XP_012090211.1| (PREDICTED: BAHD acyltransferase DCR-like [Jatropha curcas])

HSP 1 Score: 694.1 bits (1790), Expect = 1.6e-196
Identity = 339/447 (75.84%), Postives = 386/447 (86.35%), Query Frame = 1

Query: 11  KEETGGLKVKITGKTHVKPVKKLDRKECQLVTFDLPYLAFYYNQKLLLYGDNGGEVRFPE 70
           KEET  +KVKITGK+HVKP KKL R+ECQL+TFDLPYLAFYYNQK LLY   G    F +
Sbjct: 10  KEET--VKVKITGKSHVKPNKKLGRRECQLITFDLPYLAFYYNQKFLLY--KGSNDSFAD 69

Query: 71  MVEKLKDGLETVLEPFHQLAGRLGKDKDGVFRVEYDNDMEGVEVAEAVADDVCIADLVAE 130
           MV KLKDGL  VLE FHQLAG++GKD+DGVFRVEYD+DMEGVE+ EA+A+   + DL AE
Sbjct: 70  MVGKLKDGLGVVLEDFHQLAGKIGKDEDGVFRVEYDDDMEGVELVEAIAEGTSVEDLTAE 129

Query: 131 EGTATLKELIPYNGILNLEGLQRPLMAVQLTKLKDGVAMGCAFNHAVLDGTATWHFMSSW 190
           +GT TLK+ IPYNGILNLEGL RPL+AVQLTKLKDG+A+GCAFNHA+LDGT+TWHFMSSW
Sbjct: 130 DGTTTLKDFIPYNGILNLEGLHRPLLAVQLTKLKDGLAIGCAFNHAILDGTSTWHFMSSW 189

Query: 191 AEICRGAQDISVAPFLERTKARNTRVKLDITLPPEPAA-ANGDSKEAAAAAPPLREKVFK 250
           AEIC G+  ISV PFLERTK RNTRVKLD++LPP+P + +NGD+K   +A P LREKVFK
Sbjct: 190 AEICNGSHSISVTPFLERTKTRNTRVKLDLSLPPDPLSISNGDAK---SAVPELREKVFK 249

Query: 251 FTEAAIDKIKTKVNSTNPPSDGSNPFSTFQSLSVHIWRHVTQARNLKPEDITVFTVFADC 310
           F+E+ IDKIK+ VN+ NPPSDG  P+STFQSL+ HIWRHV+ AR LKPED TVFTVFADC
Sbjct: 250 FSESTIDKIKSTVNA-NPPSDGLKPYSTFQSLAAHIWRHVSHARELKPEDFTVFTVFADC 309

Query: 311 RKRVDPPMPESYFGNLIQAIFTGTAAGLLLMNPPEFGAGVIQKAIVSHNTAAIDQRNKEW 370
           RKRVDP MP++YFGNLIQAIFT TA GLL MNPPEFGA VIQ AI +HN  AID+RNKEW
Sbjct: 310 RKRVDPAMPDNYFGNLIQAIFTATAVGLLSMNPPEFGASVIQNAIEAHNAKAIDERNKEW 369

Query: 371 ESAPKIFQFKDAGMNCVAVGSSPRFKVYDVDFGWGKPESVRSGCNNRFDGMMYLYQGKDG 430
           ES+PKIFQFKDAG+NCVAVGSSPRF+VY+VDFGWGKPESVRSG NNRFDGM+YLYQGK G
Sbjct: 370 ESSPKIFQFKDAGVNCVAVGSSPRFRVYEVDFGWGKPESVRSGSNNRFDGMVYLYQGKSG 429

Query: 431 GRSIDVEISLEEEAMARLEKDKEFLME 457
           GRSIDVEI+LE  AM RLEKDK+FL+E
Sbjct: 430 GRSIDVEITLEAGAMERLEKDKDFLLE 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DCR_ARATH2.0e-17068.00BAHD acyltransferase DCR OS=Arabidopsis thaliana GN=DCR PE=2 SV=1[more]
Y3028_ARATH1.3e-4931.32Uncharacterized acetyltransferase At3g50280 OS=Arabidopsis thaliana GN=At3g50280... [more]
HST_TOBAC1.1e-3527.42Shikimate O-hydroxycinnamoyltransferase OS=Nicotiana tabacum GN=HST PE=1 SV=1[more]
HST_ARATH3.2e-3527.73Shikimate O-hydroxycinnamoyltransferase OS=Arabidopsis thaliana GN=HST PE=2 SV=1[more]
SHT_ARATH9.9e-2926.81Spermidine hydroxycinnamoyl transferase OS=Arabidopsis thaliana GN=SHT PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L8S0_CUCSA9.4e-22887.55Uncharacterized protein OS=Cucumis sativus GN=Csa_3G183330 PE=4 SV=1[more]
B9RIV2_RICCO1.1e-19977.78Anthranilate N-benzoyltransferase protein, putative OS=Ricinus communis GN=RCOM_... [more]
A0A061G131_THECC1.2e-19876.94HXXXD-type acyl-transferase family protein OS=Theobroma cacao GN=TCM_015221 PE=4... [more]
A0A067JEG8_JATCU1.1e-19675.84Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26077 PE=4 SV=1[more]
A0A067ELZ3_CITSI7.6e-19376.59Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013527mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G23940.11.1e-17168.00 HXXXD-type acyl-transferase family protein[more]
AT5G01210.14.7e-6934.19 HXXXD-type acyl-transferase family protein[more]
AT5G42830.12.4e-5733.70 HXXXD-type acyl-transferase family protein[more]
AT5G67150.11.0e-5230.82 HXXXD-type acyl-transferase family protein[more]
AT3G50280.17.5e-5131.32 HXXXD-type acyl-transferase family protein[more]
Match NameE-valueIdentityDescription
gi|659077561|ref|XP_008439271.1|7.2e-22987.36PREDICTED: BAHD acyltransferase DCR [Cucumis melo][more]
gi|778679662|ref|XP_004140767.2|1.4e-22787.55PREDICTED: BAHD acyltransferase DCR [Cucumis sativus][more]
gi|255545220|ref|XP_002513671.1|1.6e-19977.78PREDICTED: BAHD acyltransferase DCR [Ricinus communis][more]
gi|590673007|ref|XP_007038770.1|1.7e-19876.94HXXXD-type acyl-transferase family protein [Theobroma cacao][more]
gi|802768910|ref|XP_012090211.1|1.6e-19675.84PREDICTED: BAHD acyltransferase DCR-like [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016747transferase activity, transferring acyl groups other than amino-acyl groups
Vocabulary: INTERPRO
TermDefinition
IPR023213CAT-like_dom_sf
IPR003480Transferase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010143 cutin biosynthetic process
biological_process GO:0051179 localization
biological_process GO:0010090 trichome morphogenesis
cellular_component GO:0005737 cytoplasm
molecular_function GO:0047672 anthranilate N-benzoyltransferase activity
molecular_function GO:0016747 transferase activity, transferring acyl groups other than amino-acyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g06330.1Cp4.1LG14g06330.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003480TransferasePFAMPF02458Transferasecoord: 17..453
score: 7.3
IPR023213Chloramphenicol acetyltransferase-like domainGENE3DG3DSA:3.30.559.10coord: 18..213
score: 4.3E-47coord: 241..454
score: 1.7
NoneNo IPR availableunknownCoilCoilcoord: 437..457
scor
NoneNo IPR availablePANTHERPTHR31896FAMILY NOT NAMEDcoord: 14..457
score: 3.1E
NoneNo IPR availablePANTHERPTHR31896:SF4BAHD ACYLTRANSFERASE DCRcoord: 14..457
score: 3.1E

The following gene(s) are paralogous to this gene:

None