Cp4.1LG20g04290.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g04290.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Alpha 1,4-glycosyltransferase family protein, putative
Location: Cp4.1LG20 : 2426446 .. 2427738 (-)
Sequence length: 1293
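The reported sequence length can be cross-checked against the location. The sketch below assumes the coordinates are 1-based and inclusive, so the span is end - start + 1. The helper names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG20 : 2426446 .. 2427738 (-)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the span agrees with the stated length of 1293, which is also a multiple of three, as expected for a feature whose gene, mRNA, and coding sequences coincide.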
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTGCTAATTTCCGGCCTTCTTTCTTCCTCCTCCTTCTACTCCGCCGCCTTCAACCTCTCAAGGAATCTGCTTATTACCATTTTCTCTCATCCCTTACCGCTTCTCTTCTTTATCTTCTCCTTCTCTTCCTTCTCGCTTTCAATGCCCTTTACGTTTTCTTTACTTACTCACCGTCGCCGGAGAAGACGATTCCTCAGTCTCCCCCTTTCTCGCCGGATAAACGATGCTCCAATCTTTCTTCTTCCGTTCCCGCTTCGACCCACGTTCTTTTTTCCATCGAAGAAAAACACCCACCGGTGATTTTTAAATCAAATTCCTCTGTTTTTCACAATCCCCATGTTAAAATCCGCGCCGATGACTCTGCTGAAATGGCGGCGAAGAGGGCCGGGAAACACAAGCGCCGCCTGAGAAGCCTCCGGCTAGAATCAAGAGAAAACAAATTTTCAGCAAGGATTGAGGAGTTCTTGGCTGCGAATTCATGTAAGCTTAGATTTTTCATGATTTGGATTTCACCATTGAATTCGTTTAGCGACAGAGAATTCTGGGCAATACAGAGCATATTCAACGCTCATGAAAATGGAAATCCGTGTTTGATCATTGTCTCGAATTCCCTCGATTCCACAAAAGGGAAACAAATTCTCAGCCCATTCTCGGAAAAAGGGTTCCCTTTGATCGCAATTTCTCCGGATTTCGATTCAATTTTCAGAAACACAGAAGCAGAGCCATGGTTCAATCAACTCCGGCGAGGAATCATAAAACCCGGCGAAATTTCTTTAGCTCAAAACCTCTCAAACTTGCTCCGATTGGCTCTGCTATACAAATTCGGCGGCATTTACCTCGACGCCGACGTGATAATTCTAAAGAACTTCACAAATCTCCGAAACGTAATTGGAGCTCAAACCATCGATTTGAAAACCGGAAATTGGAGCAGATTAAACAACGCAGTGATGATCTTCGACAAAAATCACCCACTTCTCCTTCAATTCATCCAAGAATTCGCCACAACCTTCGACGGAAACAAATGGGGTCACAACGGACCGTATTTAGTATCAAGAGTGGTGTCGAGATTGAACCAAAATCCTGGGTTTAATTTGACTGTTCTTCCTCCGTCGGCGTTTTACCCTGTTGTTTGGAGCAGAATCAGGGCTCTGTTCCAGAGTCCAAAAGATGCAGTTCATTTGAAATGGGTAAAGGCGAAACTGAAACACATTGAATCCCAGAGCTTGGCTCTCCATTTGTGGAACAGCCATAGCTTCAAGTTGAGAAGGGAAGCATTGTTGATATTATAG

mRNA sequence

ATGATTGCTAATTTCCGGCCTTCTTTCTTCCTCCTCCTTCTACTCCGCCGCCTTCAACCTCTCAAGGAATCTGCTTATTACCATTTTCTCTCATCCCTTACCGCTTCTCTTCTTTATCTTCTCCTTCTCTTCCTTCTCGCTTTCAATGCCCTTTACGTTTTCTTTACTTACTCACCGTCGCCGGAGAAGACGATTCCTCAGTCTCCCCCTTTCTCGCCGGATAAACGATGCTCCAATCTTTCTTCTTCCGTTCCCGCTTCGACCCACGTTCTTTTTTCCATCGAAGAAAAACACCCACCGGTGATTTTTAAATCAAATTCCTCTGTTTTTCACAATCCCCATGTTAAAATCCGCGCCGATGACTCTGCTGAAATGGCGGCGAAGAGGGCCGGGAAACACAAGCGCCGCCTGAGAAGCCTCCGGCTAGAATCAAGAGAAAACAAATTTTCAGCAAGGATTGAGGAGTTCTTGGCTGCGAATTCATGTAAGCTTAGATTTTTCATGATTTGGATTTCACCATTGAATTCGTTTAGCGACAGAGAATTCTGGGCAATACAGAGCATATTCAACGCTCATGAAAATGGAAATCCGTGTTTGATCATTGTCTCGAATTCCCTCGATTCCACAAAAGGGAAACAAATTCTCAGCCCATTCTCGGAAAAAGGGTTCCCTTTGATCGCAATTTCTCCGGATTTCGATTCAATTTTCAGAAACACAGAAGCAGAGCCATGGTTCAATCAACTCCGGCGAGGAATCATAAAACCCGGCGAAATTTCTTTAGCTCAAAACCTCTCAAACTTGCTCCGATTGGCTCTGCTATACAAATTCGGCGGCATTTACCTCGACGCCGACGTGATAATTCTAAAGAACTTCACAAATCTCCGAAACGTAATTGGAGCTCAAACCATCGATTTGAAAACCGGAAATTGGAGCAGATTAAACAACGCAGTGATGATCTTCGACAAAAATCACCCACTTCTCCTTCAATTCATCCAAGAATTCGCCACAACCTTCGACGGAAACAAATGGGGTCACAACGGACCGTATTTAGTATCAAGAGTGGTGTCGAGATTGAACCAAAATCCTGGGTTTAATTTGACTGTTCTTCCTCCGTCGGCGTTTTACCCTGTTGTTTGGAGCAGAATCAGGGCTCTGTTCCAGAGTCCAAAAGATGCAGTTCATTTGAAATGGGTAAAGGCGAAACTGAAACACATTGAATCCCAGAGCTTGGCTCTCCATTTGTGGAACAGCCATAGCTTCAAGTTGAGAAGGGAAGCATTGTTGATATTATAG

Coding sequence (CDS)

ATGATTGCTAATTTCCGGCCTTCTTTCTTCCTCCTCCTTCTACTCCGCCGCCTTCAACCTCTCAAGGAATCTGCTTATTACCATTTTCTCTCATCCCTTACCGCTTCTCTTCTTTATCTTCTCCTTCTCTTCCTTCTCGCTTTCAATGCCCTTTACGTTTTCTTTACTTACTCACCGTCGCCGGAGAAGACGATTCCTCAGTCTCCCCCTTTCTCGCCGGATAAACGATGCTCCAATCTTTCTTCTTCCGTTCCCGCTTCGACCCACGTTCTTTTTTCCATCGAAGAAAAACACCCACCGGTGATTTTTAAATCAAATTCCTCTGTTTTTCACAATCCCCATGTTAAAATCCGCGCCGATGACTCTGCTGAAATGGCGGCGAAGAGGGCCGGGAAACACAAGCGCCGCCTGAGAAGCCTCCGGCTAGAATCAAGAGAAAACAAATTTTCAGCAAGGATTGAGGAGTTCTTGGCTGCGAATTCATGTAAGCTTAGATTTTTCATGATTTGGATTTCACCATTGAATTCGTTTAGCGACAGAGAATTCTGGGCAATACAGAGCATATTCAACGCTCATGAAAATGGAAATCCGTGTTTGATCATTGTCTCGAATTCCCTCGATTCCACAAAAGGGAAACAAATTCTCAGCCCATTCTCGGAAAAAGGGTTCCCTTTGATCGCAATTTCTCCGGATTTCGATTCAATTTTCAGAAACACAGAAGCAGAGCCATGGTTCAATCAACTCCGGCGAGGAATCATAAAACCCGGCGAAATTTCTTTAGCTCAAAACCTCTCAAACTTGCTCCGATTGGCTCTGCTATACAAATTCGGCGGCATTTACCTCGACGCCGACGTGATAATTCTAAAGAACTTCACAAATCTCCGAAACGTAATTGGAGCTCAAACCATCGATTTGAAAACCGGAAATTGGAGCAGATTAAACAACGCAGTGATGATCTTCGACAAAAATCACCCACTTCTCCTTCAATTCATCCAAGAATTCGCCACAACCTTCGACGGAAACAAATGGGGTCACAACGGACCGTATTTAGTATCAAGAGTGGTGTCGAGATTGAACCAAAATCCTGGGTTTAATTTGACTGTTCTTCCTCCGTCGGCGTTTTACCCTGTTGTTTGGAGCAGAATCAGGGCTCTGTTCCAGAGTCCAAAAGATGCAGTTCATTTGAAATGGGTAAAGGCGAAACTGAAACACATTGAATCCCAGAGCTTGGCTCTCCATTTGTGGAACAGCCATAGCTTCAAGTTGAGAAGGGAAGCATTGTTGATATTATAG

Protein sequence

MIANFRPSFFLLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPPFSPDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKIRADDSAEMAAKRAGKHKRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRREALLIL
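The protein sequence follows from the coding sequence by codon-wise translation. The following is a minimal sketch using the standard genetic code. The function name `translate` is illustrative, and the use of the standard nuclear code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGATTGCTAATTTCCGGCCTTCTTTCTTC"))
```

Applied to the opening of the CDS, this gives the same first residues as the protein sequence listed above. The final codon of the CDS, TAG, is a stop codon and is not represented in the protein.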
BLAST of Cp4.1LG20g04290.1 vs. Swiss-Prot
Match: Y4990_ARATH (Uncharacterized protein At4g19900 OS=Arabidopsis thaliana GN=At4g19900 PE=2 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 6.1e-28
Identity = 87/278 (31.29%), Positives = 137/278 (49.28%), Query Frame = 1

Query: 149 FSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDS 208
           FS  ++ F     C +R FM+W SP   FS R    ++S+ + H +   C+++ S +++ 
Sbjct: 355 FSDFMDSFFRKEKCSMRVFMVWNSPGWMFSVRHQRGLESLLSQHRDA--CVVVFSETVEL 414

Query: 209 TKGKQILSPFSEKGFPLIAISPDFDSIFRNTE----AEPWFNQLRRGIIKPGEISLAQNL 268
              +   + F +  + +    P+ D + ++T     A  WF+  R+    P       + 
Sbjct: 415 DFFR---NSFVKDSYKVAVAMPNLDELLQDTPTHVFASVWFDW-RKTKFYP------THY 474

Query: 269 SNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNH 328
           S L+RLA LYK+GG+YLD+DVI+L + ++LRN IG +  D   G    LN AVM F+K  
Sbjct: 475 SELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIGME--DQVAG--ESLNGAVMSFEKKS 534

Query: 329 PLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRL--NQNPGFN---LTVLPPSAFYPVVW 388
           P LL+ + E+  T+D      NG  L++RV  R    +N   N   L + P S F+P+  
Sbjct: 535 PFLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNGKNRRMNQQELNIRPSSVFFPINS 594

Query: 389 SRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNS 418
            +I   F  P             K I ++SL  H WNS
Sbjct: 595 QQITNYFAYPAIEDERSQQDESFKKILNESLTFHFWNS 616
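The identity and positives percentages in each HSP summary are the counts divided by the alignment length. The sketch below parses one summary line and recomputes both values. The regular expression and the names `parse_hsp_stats` and `percent` are illustrative, written against the line format shown on this page rather than any BLAST library.

```python
import re

# Matches the 'Identity = a/b (x%), Positives = c/d (y%)' summary line.
# 'Pos\w*tives' tolerates spelling variants of the Positives label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, aligned, ident_pct, pos, _, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "aligned": int(aligned),
        "identity_pct": float(ident_pct),
        "positives": int(pos),
        "positives_pct": float(pos_pct),
    }

def percent(count, total):
    return round(100.0 * count / total, 2)

stats = parse_hsp_stats(
    "Identity = 87/278 (31.29%), Positives = 137/278 (49.28%), Query Frame = 1"
)
print(percent(stats["identical"], stats["aligned"]),
      percent(stats["positives"], stats["aligned"]))
```

Positives counts identical residues plus conservative substitutions (the `+` marks on the midline), so it is never smaller than the identity count.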

BLAST of Cp4.1LG20g04290.1 vs. Swiss-Prot
Match: A4GAT_MOUSE (Lactosylceramide 4-alpha-galactosyltransferase OS=Mus musculus GN=A4galt PE=2 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 2.8e-25
Identity = 76/253 (30.04%), Positives = 123/253 (48.62%), Query Frame = 1

Query: 184 AIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISP-DFDSIFRNTEAE 243
           +++S   AH      +++     D+T   + L       FP + I P D   +F +T   
Sbjct: 103 SVESAARAHPESQVVVLMKGLPRDTTAQPRNLGISLLSCFPNVWIRPLDLQELFEDTPLA 162

Query: 244 PWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQT 303
            W+++ R    +P ++ +   LS+  R+ALL+KFGGIYLD D I+LKN  NL N +G Q+
Sbjct: 163 AWYSEARHRW-EPYQLPV---LSDASRIALLWKFGGIYLDTDFIVLKNLLNLTNTLGIQS 222

Query: 304 IDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRL---- 363
             +       LN A + F++ H  L   + +F   ++G  WGH GP L++RV  +     
Sbjct: 223 RYV-------LNGAFLAFERKHEFLALCLHDFVANYNGWIWGHQGPQLLTRVFKKWCSIQ 282

Query: 364 ---NQNPGFNLTVLPPSAFYPVVWSRIRALFQ--SPKDAVHLKWVKAKLKHIESQSLALH 423
                +    +T LPP AFYP+ W   +  F+  SP++          L  + + + A+H
Sbjct: 283 SLEKSHACRGVTALPPEAFYPIPWQNWKKYFEDISPEE----------LTQLLNATYAVH 334

Query: 424 LWNSHSFKLRREA 427
           +WN  S     EA
Sbjct: 343 VWNKKSQGTHLEA 334

BLAST of Cp4.1LG20g04290.1 vs. Swiss-Prot
Match: A4GAT_RAT (Lactosylceramide 4-alpha-galactosyltransferase OS=Rattus norvegicus GN=A4galt PE=1 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 3.7e-25
Identity = 78/253 (30.83%), Positives = 123/253 (48.62%), Query Frame = 1

Query: 184 AIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISP-DFDSIFRNTEAE 243
           +++S   AH      +++     D+T   + L       FP + I P D   +F +T   
Sbjct: 104 SVESAARAHPESQVVVLMKGLPRDTTAWPRNLGISLLSCFPNVQIRPLDLQELFEDTPLA 163

Query: 244 PWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQT 303
            W+ + +       E  L   LS+  R+ALL+KFGGIYLD D I+LKN  NL N++G Q+
Sbjct: 164 AWYLEAQHR----WEPYLLPVLSDASRIALLWKFGGIYLDTDFIVLKNLRNLTNMLGIQS 223

Query: 304 IDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRV------VS 363
             +       LN A + F++ H  L   I++F   ++G  WGH GP L++RV      + 
Sbjct: 224 RYV-------LNGAFLAFERKHEFLALCIRDFVAHYNGWIWGHQGPQLLTRVFKKWCSIH 283

Query: 364 RLNQNPGF-NLTVLPPSAFYPVVWSRIRALFQ--SPKDAVHLKWVKAKLKHIESQSLALH 423
            L ++     +T LPP AFYP+ W   +  F+  SP++          L  + + + A+H
Sbjct: 284 SLKESRACRGVTALPPEAFYPIPWQNWKKYFEDVSPEE----------LAQLLNATYAVH 335

Query: 424 LWNSHSFKLRREA 427
           +WN  S     EA
Sbjct: 344 VWNKKSQGTHLEA 335

BLAST of Cp4.1LG20g04290.1 vs. Swiss-Prot
Match: A4GAT_PONPY (Lactosylceramide 4-alpha-galactosyltransferase (Fragment) OS=Pongo pygmaeus GN=A4GALT PE=3 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 2.0e-23
Identity = 72/214 (33.64%), Positives = 109/214 (50.93%), Query Frame = 1

Query: 223 FPLIAISP-DFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYL 282
           FP + + P D   +FR+T    W+  ++ G  +P    L   LS+  R+AL++KFGGIYL
Sbjct: 1   FPNVQMLPLDLRELFRDTPLADWYTAVQ-GRWEP---YLLPVLSDASRIALMWKFGGIYL 60

Query: 283 DADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGN 342
           D D I+LKN  NL NV+G Q+  +       LN A + F + H  +   +++F   ++G 
Sbjct: 61  DTDFIVLKNLRNLTNVLGTQSRYV-------LNGAFLAFQRRHEFMALCMRDFVDHYNGW 120

Query: 343 KWGHNGPYLVSRVV-------SRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQ--SPKDA 402
            WGH GP L++RV        S         +T LPP AFYP+ W   +  F+  SP++ 
Sbjct: 121 IWGHQGPQLLTRVFKKWCSIRSLAESRACRGVTTLPPEAFYPIPWQDWKKYFEDISPEE- 180

Query: 403 VHLKWVKAKLKHIESQSLALHLWNSHSFKLRREA 427
                    L  + + + A+H+WN  S   R EA
Sbjct: 181 ---------LPRLLNATYAVHVWNKKSQGTRFEA 193

BLAST of Cp4.1LG20g04290.1 vs. Swiss-Prot
Match: A4GAT_HUMAN (Lactosylceramide 4-alpha-galactosyltransferase OS=Homo sapiens GN=A4GALT PE=2 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.6e-23
Identity = 71/212 (33.49%), Positives = 107/212 (50.47%), Query Frame = 1

Query: 223 FPLIAISP-DFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYL 282
           FP + + P D   +FR+T    W+  ++ G  +P    L   LS+  R+AL++KFGGIYL
Sbjct: 136 FPNVQMLPLDLRELFRDTPLADWYAAVQ-GRWEP---YLLPVLSDASRIALMWKFGGIYL 195

Query: 283 DADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGN 342
           D D I+LKN  NL NV+G Q+  +       LN A + F++ H  +   +++F   ++G 
Sbjct: 196 DTDFIVLKNLRNLTNVLGTQSRYV-------LNGAFLAFERRHEFMALCMRDFVDHYNGW 255

Query: 343 KWGHNGPYLVSRVV-------SRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVH 402
            WGH GP L++RV        S         +T LPP AFYP+ W   +  F+       
Sbjct: 256 IWGHQGPQLLTRVFKKWCSIRSLAESRACRGVTTLPPEAFYPIPWQDWKKYFEDIN---- 315

Query: 403 LKWVKAKLKHIESQSLALHLWNSHSFKLRREA 427
                 +L  + S + A+H+WN  S   R EA
Sbjct: 316 ----PEELPRLLSATYAVHVWNKKSQGTRFEA 328
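The Swiss-Prot and TrEMBL match lines carry UniProt-style header fields: OS (organism), GN (gene name), PE (protein existence level), and SV (sequence version). A minimal sketch for splitting such a line into fields follows. The regular expression and the name `parse_match` are illustrative and assume all four fields are present in that order, as they are in the matches listed here.

```python
import re

MATCH_RE = re.compile(
    r"Match: (\S+) \((.+?) OS=(.+?) GN=(\S+) PE=(\d+) SV=(\d+)\)"
)

def parse_match(line):
    """Split a UniProt-style 'Match:' line into named fields."""
    m = MATCH_RE.fullmatch(line.strip())
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    entry, description, organism, gene, pe, sv = m.groups()
    return {
        "entry": entry,
        "description": description,
        "organism": organism,
        "gene": gene,
        "protein_existence": int(pe),
        "sequence_version": int(sv),
    }

hit = parse_match(
    "Match: A4GAT_PONPY (Lactosylceramide 4-alpha-galactosyltransferase "
    "(Fragment) OS=Pongo pygmaeus GN=A4GALT PE=3 SV=1)"
)
print(hit["entry"], hit["organism"], hit["gene"])
```

The lazy description group stops at the first ` OS=`, so parenthesised qualifiers such as `(Fragment)` stay in the description. The TAIR10 and NCBI nr match lines use different layouts and are not covered by this pattern.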

BLAST of Cp4.1LG20g04290.1 vs. TrEMBL
Match: M5WNF6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015051mg PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 3.4e-118
Identity = 241/433 (55.66%), Positives = 298/433 (68.82%), Query Frame = 1

Query: 11  LLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPP 70
           L++ + +LQ LK S     L SL  SLL LLLL LLA+N  YVF  Y P   K  P    
Sbjct: 13  LIVFINQLQYLKRSIL-SVLLSLPTSLLALLLLLLLAYNGFYVFCIYLPL-SKPSPDPAI 72

Query: 71  FSPDK------------RCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNP-HVKI 130
           FSP                S+ SSS   S+ V++ ++E++ P+  K++    HNP +  +
Sbjct: 73  FSPGNLAGDSVSNWVPAHVSSSSSSSKISSSVMYVVKEENAPMFLKTHLPPLHNPRNSMV 132

Query: 131 RADDSAEMAAKRAGKHKRRLRSLRLESRENKFSARIEEFLAANS--CKLRFFMIWISPLN 190
                +    +R  KHKR+L+SL  E + + FS R+ +F A NS  CK+RFFM WIS   
Sbjct: 133 PIPKFSLQRPRRIRKHKRKLKSLPPEPKLSLFSTRMRDFFAGNSSSCKVRFFMTWIS-FK 192

Query: 191 SFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSI 250
           +F +RE  A++S+F  H N   CL IVSNSLDS KG QIL PFSE  F ++AISPDFD +
Sbjct: 193 TFGNRELLAVESLFKFHPNA--CLAIVSNSLDSEKGSQILRPFSEMDFRVMAISPDFDYL 252

Query: 251 FRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLR 310
           F+NT AE W+++LR G + PG +SL QNLSNLLRLALLYKFGGIYLD DVI+LK+ + LR
Sbjct: 253 FKNTPAEAWYSELRTGKVNPGGVSLGQNLSNLLRLALLYKFGGIYLDTDVIVLKSLSKLR 312

Query: 311 NVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVV 370
           NVIGAQ ID +TGNWSRLNNAV++FDKNHPL+ +FIQEFA TFDGNKWGHNGPYLVSRVV
Sbjct: 313 NVIGAQAIDAQTGNWSRLNNAVLVFDKNHPLIFKFIQEFALTFDGNKWGHNGPYLVSRVV 372

Query: 371 SRLNQ---NPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLAL 426
           SR+ +   NPGFN TVL PSAFYP  WSRIR+LF+ PKD +H KW+ AKL+HI SQS AL
Sbjct: 373 SRVRENPKNPGFNFTVLTPSAFYPFNWSRIRSLFRGPKDELHSKWLLAKLRHICSQSFAL 432

BLAST of Cp4.1LG20g04290.1 vs. TrEMBL
Match: W9S212_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012116 PE=4 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.1e-116
Identity = 229/425 (53.88%), Positives = 299/425 (70.35%), Query Frame = 1

Query: 11  LLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPP 70
           +L+L+ +LQ LK S +  F    T SLL LLLLFLLA+N + VF+ + P   K  PQ   
Sbjct: 15  ILVLVHQLQDLKRSLFAFFFCVPT-SLLALLLLFLLAYNGVSVFYIHVPFLAKFPPQPAN 74

Query: 71  FS---PDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHN----PHVKIRADDSA 130
           FS   P  + S+ SSS   S+ V++ + E++ P+  K++   F N    P        S 
Sbjct: 75  FSRQNPVLKFSSSSSSSKLSSSVMYVVREENSPMFLKTHFPAFQNYNNFPLKPTPKRVSK 134

Query: 131 EMAAKRAGKHKRRLRSLRLESRENK--FSARIEEFL-AANSCKLRFFMIWISPLNSFSDR 190
              +KR  K KR++R L  E++     F+AR+ EF   +  CK RFFM WIS L+SF +R
Sbjct: 135 FRISKRVKKTKRKVRRLSSETQNQNQFFTARLREFFFGSRKCKPRFFMTWISSLDSFGER 194

Query: 191 EFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTE 250
           E +A++S+F +H N   CL IVS  +DS KG  +L PFS+ GF ++AISPD+DS+ +NT 
Sbjct: 195 ELFAVESLFKSHPNA--CLAIVSKIMDSEKGNVLLKPFSDSGFRVLAISPDYDSVLKNTP 254

Query: 251 AEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGA 310
           AE WF++LR+G + PGEISL QNLSNLLRLALLYKFGGIY D D+I + +F+ LRNVIGA
Sbjct: 255 AESWFSRLRKGNVNPGEISLCQNLSNLLRLALLYKFGGIYTDTDMIFVNSFSKLRNVIGA 314

Query: 311 QTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQ 370
           QT+DL+TG+WSRLNNAV++FDKNHPLLL FI+EFA TFDGNKWGHNGPYLVSRVV RL +
Sbjct: 315 QTVDLETGHWSRLNNAVLVFDKNHPLLLMFIKEFALTFDGNKWGHNGPYLVSRVVKRLRE 374

Query: 371 NPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSF 426
           NPGFN TVLPP AFYPV WSRIR+LF++  D +H KW+  K+KH+ ++S A+HLWN  S 
Sbjct: 375 NPGFNFTVLPPHAFYPVDWSRIRSLFRNAGDELHSKWMIGKMKHVRTKSFAVHLWNRQSK 434

BLAST of Cp4.1LG20g04290.1 vs. TrEMBL
Match: B9HYL1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s04750g PE=4 SV=2)

HSP 1 Score: 427.2 bits (1097), Expect = 2.4e-116
Identity = 219/416 (52.64%), Positives = 286/416 (68.75%), Query Frame = 1

Query: 12  LLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPPF 71
           L+ ++ LQ +K S +  FL  +  SLL L+L  LL +N   VF+ + P P   +P+   F
Sbjct: 16  LVYMQHLQNIKRSIFA-FLLCIPTSLLALILFLLLFYNGFTVFYFHLPFPSNPLPEPANF 75

Query: 72  SPDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVK-IRADDSAEMAAKRA 131
           S      N    +PAS  V+++++E  PPVI K+   +  NP +  I  + S      + 
Sbjct: 76  SQGNLAKNSFKKLPAS--VMYAVKEDTPPVILKTLLPLLQNPAITMIPINHSVVFKPNKT 135

Query: 132 GKHKRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFN 191
             ++   R LR      +FS R+ EF   + CK+RFFM WIS L SF DRE ++++S+F 
Sbjct: 136 HGYEAVKRMLRSADNLKRFSTRVREFFGNHGCKVRFFMTWISSLKSFGDRECFSVESLFR 195

Query: 192 AHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRR 251
           +H +   CL+IVSNS+DS  G  +L PF +K F LIAI PDFD +F++T AE WF  L++
Sbjct: 196 SHPDA--CLVIVSNSMDSESGSLVLKPFLDKRFKLIAIKPDFDYLFKDTHAEKWFKGLKK 255

Query: 252 GIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNW 311
           G + PGE+SL QN+SNLLRLALLYKFGGIY+D DVI+LK FT LRNVIGAQTIDL+T NW
Sbjct: 256 GNVSPGEVSLGQNMSNLLRLALLYKFGGIYMDTDVIVLKRFTKLRNVIGAQTIDLETRNW 315

Query: 312 SRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGFNLTVLP 371
           SRLNNAV+IFDK HPLL +FI+EFA TFDGNKWGHNGPYLVSRVVSR+N  PGFN TVLP
Sbjct: 316 SRLNNAVLIFDKKHPLLFKFIEEFALTFDGNKWGHNGPYLVSRVVSRVNGRPGFNFTVLP 375

Query: 372 PSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRREA 427
           P AFYPV WSRIR+ F+ P+D VH  W+  KL+ I+S+S A+HLWN  S +++ E+
Sbjct: 376 PPAFYPVDWSRIRSFFRGPRDKVHSTWLHEKLEQIKSESFAVHLWNKQSREIKVES 426

BLAST of Cp4.1LG20g04290.1 vs. TrEMBL
Match: A0A061GEM3_THECC (Alpha 1,4-glycosyltransferase family protein, putative OS=Theobroma cacao GN=TCM_029915 PE=4 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 1.3e-114
Identity = 221/421 (52.49%), Positives = 290/421 (68.88%), Query Frame = 1

Query: 9   FFLLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQS 68
           F +  L  R Q +K S Y  F+  L  S++ L L  LLA N   VF+   P P K+ P+ 
Sbjct: 2   FEMTRLHHRFQRIKSSVY-GFVFLLPTSIVALFLFILLACNGFSVFYINLPVPAKSSPEP 61

Query: 69  PPFSP-----DKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKI-RADDS 128
               P     DK+ + L+SSV      +++++E++PPVI K++  +   P+  +   D  
Sbjct: 62  ANVLPENLPGDKKVTKLASSV------MYAVKEENPPVISKTHLPLLQKPNFSVVPVDKP 121

Query: 129 AEMAAKRAGKHKRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREF 188
                K+A   ++ LR LR  ++   FSA++++F   + CK RFFM WIS + SF DRE 
Sbjct: 122 LVFRPKQARLSRQILRILRSGTKAKGFSAQVKDFFQNSKCKSRFFMTWISSVESFGDREL 181

Query: 189 WAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAE 248
            A++S+F +H     CL+IVSNSLDS +GK +L PF ++GF L+A  PDFD IF+NT AE
Sbjct: 182 LAVESVFRSHPEA--CLLIVSNSLDSKRGKVVLKPFLDRGFKLVAFDPDFDYIFKNTYAE 241

Query: 249 PWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQT 308
            WFN+L+RG + PGE+SL QN+SNLLRLALLYK+GGIY+D D+I+LK F NLRNVIGAQ+
Sbjct: 242 LWFNRLKRGNLNPGEVSLGQNISNLLRLALLYKYGGIYIDTDIIVLKRFNNLRNVIGAQS 301

Query: 309 IDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNP 368
           I+L+T NW+RLNNAV+IFDK+HPLL +FIQEFA TFDGNKWGHNGPYLVSRVV+R+   P
Sbjct: 302 INLETKNWTRLNNAVLIFDKHHPLLYKFIQEFALTFDGNKWGHNGPYLVSRVVARVTGRP 361

Query: 369 GFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKL 424
           GFN TVLPPSAFYPV WSRIR+LFQ P+  VH  W+  KL+ I  QS A+HLWN  S  +
Sbjct: 362 GFNFTVLPPSAFYPVDWSRIRSLFQGPRTKVHSNWLHNKLEQIRRQSFAVHLWNRQSRNV 413

BLAST of Cp4.1LG20g04290.1 vs. TrEMBL
Match: A0A0D2STE1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G152300 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 3.4e-110
Identity = 216/419 (51.55%), Positives = 286/419 (68.26%), Query Frame = 1

Query: 21  LKESAYYHFLSS-------LTASLLYLLLLFLLAFNALYVFFTY-------SPSPEKTIP 80
           +K   Y+H + S       L  + L+ L L LLA N   VF+ Y       SP P K +P
Sbjct: 2   IKHHPYFHHIKSSVFGFMFLLPTSLFALSLLLLACNGFSVFYIYLPGYINSSPEPIKLLP 61

Query: 81  QSPPFSPDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKIRADDSAEM- 140
           ++   S DK  + L SSV      +++++E++PP + K++ S+   P+  +     A + 
Sbjct: 62  EN--LSGDKTVTKLVSSV------MYAVKEENPPGVLKTHLSLVKKPNFSMVPIGKASVF 121

Query: 141 AAKRAGKHKRRLRSLRLESR-ENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWA 200
             K+A   ++ LR+L   ++ +  FS++++ F   + CK RFFM WISP+ S SDRE  A
Sbjct: 122 KPKQARLSRQILRTLGSGTKPKGFFSSKVKTFFRNSKCKSRFFMTWISPIESLSDRELLA 181

Query: 201 IQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPW 260
           I+S+F +H     CL+IVSNSLDS KG  +L PFS+KGF LIA+ PDFD IF+NT AE W
Sbjct: 182 IESVFKSHPKA--CLVIVSNSLDSKKGSVVLKPFSDKGFKLIAVHPDFDYIFKNTYAETW 241

Query: 261 FNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTID 320
           FN+L+ G I PGE+SL QNLSNLLRLALLYK+GG+YLD DV++LK+   LRNVI AQ+I+
Sbjct: 242 FNRLKNGNINPGEVSLGQNLSNLLRLALLYKYGGVYLDTDVLVLKSINRLRNVISAQSIN 301

Query: 321 LKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGF 380
            KT NWSRLNNAV+IFD+NHPLL +FIQEFA TFDGN+WGHNGPYLVSRVV R+   PG 
Sbjct: 302 PKTKNWSRLNNAVLIFDQNHPLLFKFIQEFALTFDGNRWGHNGPYLVSRVVERVTGRPGL 361

Query: 381 NLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLR 424
           N TVLPPSAFYPV W+RIR+LFQ P++  H  W+K KL  I+ QS A+HLWN  S +++
Sbjct: 362 NFTVLPPSAFYPVDWTRIRSLFQGPQNETHSNWLKLKLGQIQRQSYAIHLWNRQSKQVK 410

BLAST of Cp4.1LG20g04290.1 vs. TAIR10
Match: AT1G61050.1 (AT1G61050.1 alpha 1,4-glycosyltransferase family protein)

HSP 1 Score: 332.8 bits (852), Expect = 3.1e-91
Identity = 192/417 (46.04%), Positives = 262/417 (62.83%), Query Frame = 1

Query: 12  LLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYS-PSPEKTIPQSPP 71
           +LL++RL+ L  S    F+  L  SLL LLL+ LL +N+  VF  +  P+       SP 
Sbjct: 10  VLLVQRLKRLIVS----FVFCLPMSLLGLLLMLLLIYNSFSVFSLHLVPNQPIQSTLSPT 69

Query: 72  FSPDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKIRADDSAEMAAKRA 131
                     +SS  + + +L  ++E     I K N S         R   S E+     
Sbjct: 70  HLQILHHQTSTSSSVSDSSLLLVVKETSLGFIQKQNVSSTRIEKKTRRFKRSTELTPAIT 129

Query: 132 GKHKRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFN 191
            +       L+++SR+ +F  R++  L+ +SC+  FFM WIS + SF DRE + I+S+F 
Sbjct: 130 QR-------LQVKSRQ-RFQTRVKSLLSKSSCESLFFMTWISSIESFGDRERFTIESLFK 189

Query: 192 AHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRR 251
            H NG  CLI+VSNS D  +G  IL PF++KG  ++ I PDF  IF++T AE WF +L++
Sbjct: 190 FHPNG--CLILVSNSFDCDRGTLILKPFTDKGLKVLPIKPDFAYIFKDTSAEKWFERLKK 249

Query: 252 GIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNW 311
           G + PG I L QNLSNLLRL LLYK+GGIYLD DVIILK+ +NL NVIGAQT+D  T  W
Sbjct: 250 GTLSPGVIPLEQNLSNLLRLVLLYKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTKKW 309

Query: 312 SRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSR--LNQNPGFNLTV 371
           SRLNNAV+IFDKNHPLL +FI EF+ TF+GNKWGHNGPYLVSRV++R  ++ +     +V
Sbjct: 310 SRLNNAVLIFDKNHPLLKRFIDEFSRTFNGNKWGHNGPYLVSRVITRIKISSSSDLGFSV 369

Query: 372 LPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRRE 426
           LPPSAFYPV W+RI+  +++P +     W++ +L H+   + A+HLWN  S KLR E
Sbjct: 370 LPPSAFYPVDWTRIKGFYRAPTNESD-AWLRKRLTHLRKNTFAVHLWNRESKKLRIE 411

BLAST of Cp4.1LG20g04290.1 vs. TAIR10
Match: AT5G01250.1 (AT5G01250.1 alpha 1,4-glycosyltransferase family protein)

HSP 1 Score: 306.6 bits (784), Expect = 2.4e-83
Identity = 151/318 (47.48%), Positives = 215/318 (67.61%), Query Frame = 1

Query: 113 PHVKIRADDSAEMA--AKRAGKHKRRLRSLRLESREN---KFSARIEEFLAANSCKLRFF 172
           PH+ + ++   E +   K+  +   +L+ + + S +N   KF  R+ EF+  + C++ F 
Sbjct: 70  PHLPLSSEREGERSDLLKQQTQVNEKLQVIEVFSGDNLSDKFQKRVNEFVG-DGCEVNFV 129

Query: 173 MIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIA 232
           M WISP + F +RE  AI+S+F +H  G  CL+I+S ++DS +G   L PF ++G+ ++A
Sbjct: 130 MTWISPADFFGNREVLAIESVFKSHPYG--CLMILSATMDSPQGYATLKPFIDRGYKVLA 189

Query: 233 ISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVII 292
           ++PD   + + T  E W ++++ G   PG+ISLAQNLSNL+RLA LYK+GG+YLD D+I+
Sbjct: 190 VTPDLPFLLKGTAGELWLDEIKSGKRDPGKISLAQNLSNLMRLAYLYKYGGVYLDTDMIV 249

Query: 293 LKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNG 352
           LK+F  LRNVIGAQT+D  + NW+RLNNAV+IFDKNHPLLL+F++EFA TF+GN WG+NG
Sbjct: 250 LKSFKGLRNVIGAQTLDPSSTNWTRLNNAVLIFDKNHPLLLKFMEEFAKTFNGNIWGYNG 309

Query: 353 PYLVSRVVSRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIES 412
           PYLVSRV   +  + G+N TV+ PS FY V W  I+ LF+ PK     KWVK KL H++ 
Sbjct: 310 PYLVSRVARAVEGSSGYNFTVMRPSVFYSVNWLEIKKLFKVPKTEKDSKWVKTKLLHMQR 369

Query: 413 QSLALHLWNSHSFKLRRE 426
               LHLWN  S K   E
Sbjct: 370 NGYGLHLWNKFSRKYEIE 384

BLAST of Cp4.1LG20g04290.1 vs. TAIR10
Match: AT3G09020.1 (AT3G09020.1 alpha 1,4-glycosyltransferase family protein)

HSP 1 Score: 297.4 bits (760), Expect = 1.5e-80
Identity = 140/278 (50.36%), Positives = 194/278 (69.78%), Query Frame = 1

Query: 148 KFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLD 207
           KF  R  EFL  + C+++F M WISP   F  RE  +++S+F +H  G  CL+I+S+++D
Sbjct: 114 KFQQRATEFLR-DDCEVKFMMTWISPAELFGKREILSVESVFKSHARG--CLMILSSTMD 173

Query: 208 STKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNL 267
           S +G +IL PF ++G+ ++A++PD   + ++T  E W  +++ G   PG+ISLAQNLSNL
Sbjct: 174 SLQGFRILKPFLDRGYRVMAVTPDLPFLLKDTAGESWLEEIQTGKRDPGKISLAQNLSNL 233

Query: 268 LRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLL 327
           +RLA L+KFGG+YLD D+I+LK+F  LRNVIGAQT++  + NW+RLNNAV+IFDKNHP L
Sbjct: 234 MRLAYLFKFGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRNWTRLNNAVLIFDKNHPFL 293

Query: 328 LQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQ 387
           L+ I+EFA TF+GN WGHNGPYLVSRV   +    G+N T+L P AFYPV W  I  LF+
Sbjct: 294 LKSIEEFALTFNGNVWGHNGPYLVSRVARAVEGTDGYNFTILTPPAFYPVNWVEIEKLFK 353

Query: 388 SPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRRE 426
            P+     K V+ K+  ++ +S  LHLWN  S K   E
Sbjct: 354 VPRTEKDSKRVQVKVLEMQKRSYGLHLWNKFSRKFEIE 388

BLAST of Cp4.1LG20g04290.1 vs. TAIR10
Match: AT2G38150.1 (AT2G38150.1 alpha 1,4-glycosyltransferase family protein)

HSP 1 Score: 288.5 bits (737), Expect = 6.8e-78
Identity = 159/389 (40.87%), Positives = 225/389 (57.84%), Query Frame = 1

Query: 48  FNALYVFFTYSPSPEKTI-PQSPPFSPDKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSN 107
           F+ L ++  Y  SP+ T+   + PF P+         + +S ++                
Sbjct: 14  FSVLLLYLLYIESPKSTLYDNNLPFKPNVPLPRPYGPMSSSCNI---------------- 73

Query: 108 SSVFHNPHVKIRADDSAEMAAKRAGKHKR---------RLRSLRLESRENKFSARIEEFL 167
           +SV  + H +   D    +  ++A K++R          L  L+  ++   F  R+ +  
Sbjct: 74  NSVVDSEHKEKELDPL--LPPRKASKNQRIDWFRRKLPELEILKSTTKSKSFHTRVLDLY 133

Query: 168 AANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSP 227
             N C  +FFMIW+SP NSF  RE  AI ++F    N   CL I+SNSLDS  G  IL P
Sbjct: 134 NKN-CSAQFFMIWLSPANSFGPREMLAIDTLFTT--NPGACLAILSNSLDSPNGYTILKP 193

Query: 228 FSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFG 287
             ++GF LIA++ D   + +NT AE W  +L+ G + PG I L  NLS+L RLA+LYK+G
Sbjct: 194 LFDQGFNLIAVTIDIPFLVKNTPAEAWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYG 253

Query: 288 GIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATT 347
           G+YLD D+I L + T LRN IGAQ+ D  T  W+RLNNAVM+FD  HPL+ +F+QE+ATT
Sbjct: 254 GVYLDTDIIFLNDMTGLRNAIGAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATT 313

Query: 348 FDGNKWGHNGPYLVSRVVSRLNQNPGF-NLTVLPPSAFYPVVWSRIRALFQSPKDAVHLK 407
           FDGNKWG+N PYLVSRV+ RL   PG+ NLT+  P AFYPV W +I+ LF+ P      K
Sbjct: 314 FDGNKWGYNSPYLVSRVIKRLGNKPGYNNLTIFSPDAFYPVNWIKIQKLFKKPATTREAK 373

Query: 408 WVKAKLKHIESQSLALHLWNSHSFKLRRE 426
           WV+  ++ +   S  +HLWN  + K++ E
Sbjct: 374 WVEKTVQDMNKGSYMIHLWNKVTRKIKIE 381

BLAST of Cp4.1LG20g04290.1 vs. TAIR10
Match: AT2G38152.1 (AT2G38152.1 alpha 1,4-glycosyltransferase family protein)

HSP 1 Score: 277.7 bits (709), Expect = 1.2e-74
Identity = 133/262 (50.76%), Positives = 177/262 (67.56%), Query Frame = 1

Query: 163 KLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKG 222
           ++RFFM W SP   F  RE  A++S+F AH  G  CL+IVS SLDS +G  IL P +++G
Sbjct: 98  EVRFFMTWFSPAEYFGKREMLAVESVFKAHPQG--CLMIVSGSLDSLQGDSILKPLNDRG 157

Query: 223 FPLIAISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLD 282
           + + A +PD   +  NT A+ WF +++     PG I L QNLSNL RLA LYK+GG+YLD
Sbjct: 158 YKVFAATPDMSLLLENTPAKSWFQEMKSCKRDPGRIPLHQNLSNLARLAFLYKYGGVYLD 217

Query: 283 ADVIILKNFTNLRNVIGAQTI-DLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGN 342
            D I+ ++F  L+N IGAQT+ +  + NW+RLNNAV+IF+K+HPL+  FI+EFA+TFDGN
Sbjct: 218 TDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVYSFIEEFASTFDGN 277

Query: 343 KWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAK 402
           KWGHNGPYLV+RV  R  +  G N TVLPP AFYP  W  I  LFQ+P+ +     +K  
Sbjct: 278 KWGHNGPYLVTRVAQRARETIGDNFTVLPPVAFYPFNWLDIPRLFQTPRGSNDSTLLKTD 337

Query: 403 LKHIESQSLALHLWNSHSFKLR 424
           L  +  +S  LHLWN  + KL+
Sbjct: 338 LVKLNRESYGLHLWNKITRKLK 357

BLAST of Cp4.1LG20g04290.1 vs. NCBI nr
Match: gi|659068007|ref|XP_008442286.1| (PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Cucumis melo])

HSP 1 Score: 625.9 bits (1613), Expect = 5.1e-176
Identity = 320/412 (77.67%), Positives = 350/412 (84.95%), Query Frame = 1

Query: 14  LLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPPFSP 73
           ++    PLKES   H LSSL  SLL LLLLFLLA+N L+VF    PSPEKTI    PFSP
Sbjct: 1   MIGNFNPLKESTL-HLLSSLPTSLLSLLLLFLLAYNGLHVFSISPPSPEKTISHPAPFSP 60

Query: 74  DKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKIRADDSAEMAAKRAGKH 133
            KR +  S+S    TH+LFSI E HPP + KSNSSVFH+PH+K R        AKR GKH
Sbjct: 61  QKRPTADSTS----THLLFSIHENHPPPVLKSNSSVFHHPHLKSRP-------AKRVGKH 120

Query: 134 KRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHE 193
           KRRLRSLR E +EN+FSARIEEF AANSCKLRFFM WIS L+SFSDRE WAIQSIF  HE
Sbjct: 121 KRRLRSLRSELKENEFSARIEEFFAANSCKLRFFMTWISSLDSFSDRELWAIQSIFKVHE 180

Query: 194 NGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRRGII 253
           NGNPCLIIVSNSLDSTKGKQ+LSPFSE GF L+AISPDFDSIF+NT+AE WFNQLR+GII
Sbjct: 181 NGNPCLIIVSNSLDSTKGKQVLSPFSEMGFSLLAISPDFDSIFKNTDAESWFNQLRQGII 240

Query: 254 KPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRL 313
           KPGEISL QNLSNLLRL LLYKFGGIY+DADVIILK+FTNLRNVIGAQT+DLKTGNWSRL
Sbjct: 241 KPGEISLGQNLSNLLRLTLLYKFGGIYIDADVIILKSFTNLRNVIGAQTMDLKTGNWSRL 300

Query: 314 NNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSA 373
           NNAVMIFDKNHPLLLQFI+EFATTFDGNKWGHNGPYLVSRV+SRLNQNPGFNLT+LPPSA
Sbjct: 301 NNAVMIFDKNHPLLLQFIKEFATTFDGNKWGHNGPYLVSRVISRLNQNPGFNLTILPPSA 360

Query: 374 FYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRRE 426
           FYPVVW++I+ LFQ PKDAVHLKWV AKL+ I+S+SLALHLWNSHS KL  E
Sbjct: 361 FYPVVWNKIKTLFQGPKDAVHLKWVIAKLRQIQSKSLALHLWNSHSRKLEVE 400

BLAST of Cp4.1LG20g04290.1 vs. NCBI nr
Match: gi|778658692|ref|XP_004146645.2| (PREDICTED: uncharacterized protein At4g19900 [Cucumis sativus])

HSP 1 Score: 603.6 bits (1555), Expect = 2.7e-169
Identity = 307/412 (74.51%), Positives = 342/412 (83.01%), Query Frame = 1

Query: 14  LLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPPFSP 73
           ++    PLKE  + HFLSSL  S L LLLLFLLA+N L+VF    PSPEKTIP   PFSP
Sbjct: 1   MIGNFNPLKEFTF-HFLSSLPTSFLSLLLLFLLAYNGLHVFSISPPSPEKTIPHPAPFSP 60

Query: 74  DKRCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNPHVKIRADDSAEMAAKRAGKH 133
            KR +  S+S    TH+LFSI E HPP + KSN+SVFH+PH+K R        AKR GKH
Sbjct: 61  QKRPTADSTS----THLLFSIHENHPPPVLKSNTSVFHHPHLKSRP-------AKRVGKH 120

Query: 134 KRRLRSLRLESRENKFSARIEEFLAANSCKLRFFMIWISPLNSFSDREFWAIQSIFNAHE 193
           KRRLRSLR E +E+ FSARIEEF AANSCKLRFFM WIS L+SFSDRE WAIQSIF  HE
Sbjct: 121 KRRLRSLRSELKESDFSARIEEFFAANSCKLRFFMTWISSLDSFSDRELWAIQSIFKVHE 180

Query: 194 NGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIFRNTEAEPWFNQLRRGII 253
           N NPCLIIVSNSLDS KGKQILSPFSE GF L+AISPDFD IF+NTEAE WFNQL++GI+
Sbjct: 181 NENPCLIIVSNSLDSAKGKQILSPFSEMGFSLLAISPDFDVIFKNTEAELWFNQLQQGIV 240

Query: 254 KPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRNVIGAQTIDLKTGNWSRL 313
           K GEISL QNLSNLLRL LLYKFGGIY+D DVIIL+NFTNLRN IGAQT+DLKTGNWSRL
Sbjct: 241 KAGEISLGQNLSNLLRLTLLYKFGGIYIDTDVIILQNFTNLRNAIGAQTMDLKTGNWSRL 300

Query: 314 NNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSA 373
           NNAVMIFDKNHPLLLQFI+EFATTFDGNKWGHNGPYLVSRV+SRLNQN  FNLT+LPPSA
Sbjct: 301 NNAVMIFDKNHPLLLQFIKEFATTFDGNKWGHNGPYLVSRVISRLNQNSEFNLTILPPSA 360

Query: 374 FYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALHLWNSHSFKLRRE 426
           FYPVVW+RI+  FQ PKD VHLKW+ AKL+HI+++SLALHLWN+HS KL+ E
Sbjct: 361 FYPVVWNRIKTFFQGPKDGVHLKWIIAKLRHIQTKSLALHLWNNHSRKLQVE 400

BLAST of Cp4.1LG20g04290.1 vs. NCBI nr
Match: gi|645237069|ref|XP_008225035.1| (PREDICTED: uncharacterized protein At4g19900 [Prunus mume])

HSP 1 Score: 434.5 bits (1116), Expect = 2.2e-118
Identity = 241/432 (55.79%), Positives = 300/432 (69.44%), Query Frame = 1

Query: 11  LLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTY----SPSPEKTIP 70
           L++ + +LQ LK S     L SL  SLL LLLL LLA+N  YVF  Y     PSP+  I 
Sbjct: 13  LIVFIYQLQYLKRSIL-SVLLSLPTSLLALLLLLLLAYNGFYVFCIYLPLSKPSPDPAIF 72

Query: 71  QSPPFSPDK-------RCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNP-HVKIR 130
            +   + D          S+ SSS   S+ V++ ++E++ P+  K+N    HNP +  + 
Sbjct: 73  SAGNLAGDSVSNWVPAHVSSSSSSSKISSSVMYVVKEENAPMFLKTNLPPLHNPRNSMVP 132

Query: 131 ADDSAEMAAKRAGKHKRRLRSLRLESRENKFSARIEEFLAANS--CKLRFFMIWISPLNS 190
               +    +R  KHKR+L+SL  E + + FS R+ +F A NS  CK+RFFM WIS   +
Sbjct: 133 ISKFSLQRPRRIKKHKRKLKSLPPEPKLSLFSTRMRDFFAGNSSSCKVRFFMTWISS-KT 192

Query: 191 FSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSIF 250
           F +RE  A++S+F +H N   CL IVSNSLDS KG QIL PFSE  F ++AISPDFD +F
Sbjct: 193 FGNRELLAVESLFKSHPNA--CLAIVSNSLDSKKGSQILRPFSEMDFRVMAISPDFDYLF 252

Query: 251 RNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLRN 310
           +NT AE W+ +LR G + PG +SL QNLSNLLRLALLYKFGGIYLD DVI+LK+ + LRN
Sbjct: 253 KNTPAEAWYYELRTGKVNPGGVSLGQNLSNLLRLALLYKFGGIYLDTDVIVLKSLSKLRN 312

Query: 311 VIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVVS 370
           VIGAQ ID +TGNWSRLNNAV++FDKNHPL+ +FIQEFA TFDGNKWGHNGPYLVSRVVS
Sbjct: 313 VIGAQAIDAQTGNWSRLNNAVLVFDKNHPLIFKFIQEFALTFDGNKWGHNGPYLVSRVVS 372

Query: 371 RLNQ---NPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLALH 426
           R+ +   NPGFN TVL PSAFYP  WSRIR+LF+ PKD +H KW+ AKL+HI SQS ALH
Sbjct: 373 RVRENPKNPGFNFTVLTPSAFYPFNWSRIRSLFRGPKDELHSKWLLAKLRHICSQSFALH 432

BLAST of Cp4.1LG20g04290.1 vs. NCBI nr
Match: gi|595905306|ref|XP_007214045.1| (hypothetical protein PRUPE_ppa015051mg [Prunus persica])

HSP 1 Score: 433.3 bits (1113), Expect = 4.8e-118
Identity = 241/433 (55.66%), Positives = 298/433 (68.82%), Query Frame = 1

Query: 11  LLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPP 70
           L++ + +LQ LK S     L SL  SLL LLLL LLA+N  YVF  Y P   K  P    
Sbjct: 13  LIVFINQLQYLKRSIL-SVLLSLPTSLLALLLLLLLAYNGFYVFCIYLPL-SKPSPDPAI 72

Query: 71  FSPDK------------RCSNLSSSVPASTHVLFSIEEKHPPVIFKSNSSVFHNP-HVKI 130
           FSP                S+ SSS   S+ V++ ++E++ P+  K++    HNP +  +
Sbjct: 73  FSPGNLAGDSVSNWVPAHVSSSSSSSKISSSVMYVVKEENAPMFLKTHLPPLHNPRNSMV 132

Query: 131 RADDSAEMAAKRAGKHKRRLRSLRLESRENKFSARIEEFLAANS--CKLRFFMIWISPLN 190
                +    +R  KHKR+L+SL  E + + FS R+ +F A NS  CK+RFFM WIS   
Sbjct: 133 PIPKFSLQRPRRIRKHKRKLKSLPPEPKLSLFSTRMRDFFAGNSSSCKVRFFMTWIS-FK 192

Query: 191 SFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEKGFPLIAISPDFDSI 250
           +F +RE  A++S+F  H N   CL IVSNSLDS KG QIL PFSE  F ++AISPDFD +
Sbjct: 193 TFGNRELLAVESLFKFHPNA--CLAIVSNSLDSEKGSQILRPFSEMDFRVMAISPDFDYL 252

Query: 251 FRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYLDADVIILKNFTNLR 310
           F+NT AE W+++LR G + PG +SL QNLSNLLRLALLYKFGGIYLD DVI+LK+ + LR
Sbjct: 253 FKNTPAEAWYSELRTGKVNPGGVSLGQNLSNLLRLALLYKFGGIYLDTDVIVLKSLSKLR 312

Query: 311 NVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGNKWGHNGPYLVSRVV 370
           NVIGAQ ID +TGNWSRLNNAV++FDKNHPL+ +FIQEFA TFDGNKWGHNGPYLVSRVV
Sbjct: 313 NVIGAQAIDAQTGNWSRLNNAVLVFDKNHPLIFKFIQEFALTFDGNKWGHNGPYLVSRVV 372

Query: 371 SRLNQ---NPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAKLKHIESQSLAL 426
           SR+ +   NPGFN TVL PSAFYP  WSRIR+LF+ PKD +H KW+ AKL+HI SQS AL
Sbjct: 373 SRVRENPKNPGFNFTVLTPSAFYPFNWSRIRSLFRGPKDELHSKWLLAKLRHICSQSFAL 432

BLAST of Cp4.1LG20g04290.1 vs. NCBI nr
Match: gi|1009182116|ref|XP_015872535.1| (PREDICTED: uncharacterized protein At4g19900 [Ziziphus jujuba])

HSP 1 Score: 431.4 bits (1108), Expect = 1.8e-117
Identity = 232/444 (52.25%), Positives = 300/444 (67.57%), Query Frame = 1

Query: 11  LLLLLRRLQPLKESAYYHFLSSLTASLLYLLLLFLLAFNALYVFFTYSPSPEKTIPQSPP 70
           L+L L  LQ LK S    FL  L  S+L   LL LL +N   VF  + P   KT P+   
Sbjct: 13  LILFLHHLQELKRSVL-DFLLCLPTSILAFTLLLLLVYNGFSVFCIHLPFLAKTSPEPAN 72

Query: 71  FSPD----------------------------KRCSNLSSSVPASTHVLFSIEEKHPPVI 130
           FSP+                               S+LS+S  + +  L+ ++E++PP I
Sbjct: 73  FSPEYPAGKFPLLRLSSSYSSLRSLSSSSSSSSTSSSLSTSSSSFSSKLYVVKEENPPPI 132

Query: 131 FKSNSSVFHNPHVKIRADDSAEMAAKRAGKHKRRLRSLR-LESRENKFSARIEEFLAANS 190
            K++    ++    IR   S+  + +R  KHKR+L+ LR L  + ++FS RI EF AA +
Sbjct: 133 LKTHLQ--NHSFSVIRISTSSIRSHRRLKKHKRKLKILRSLPPQSSQFSTRIREFFAAGN 192

Query: 191 CKLRFFMIWISPLNSFSDREFWAIQSIFNAHENGNPCLIIVSNSLDSTKGKQILSPFSEK 250
           CK RFFM WIS L+ F DRE +AI+S+F +H N   CL++VSNS+DS KG QIL PFS+ 
Sbjct: 193 CKPRFFMTWISSLDFFGDRELFAIESLFKSHPNA--CLVMVSNSMDSKKGNQILKPFSDT 252

Query: 251 GFPLIAISPDFDSIFRNTEAEPWFNQLRRGIIKPGEISLAQNLSNLLRLALLYKFGGIYL 310
           GF ++ ISPDFD IF+NT A  WF +LR G + PGE++L QNLSNLLRLALLYKFGGIY+
Sbjct: 253 GFKVMTISPDFDFIFKNTHAGAWFCRLREGNVDPGEVALGQNLSNLLRLALLYKFGGIYV 312

Query: 311 DADVIILKNFTNLRNVIGAQTIDLKTGNWSRLNNAVMIFDKNHPLLLQFIQEFATTFDGN 370
           D D+IILK+F+ LRNVIGAQT+D++TGNWSRLNNAV+IFDKNHPLL +FIQEFA TF+GN
Sbjct: 313 DTDIIILKSFSKLRNVIGAQTVDIETGNWSRLNNAVLIFDKNHPLLFKFIQEFALTFNGN 372

Query: 371 KWGHNGPYLVSRVVSRLNQNPGFNLTVLPPSAFYPVVWSRIRALFQSPKDAVHLKWVKAK 426
           KWGHNGPYLVSRVVSR++   G+N T+LPPSA+YPV W+RIR+LFQ P+D +H  W+ +K
Sbjct: 373 KWGHNGPYLVSRVVSRVSGRTGYNFTILPPSAYYPVDWTRIRSLFQGPRDEIHSNWLISK 432
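
The Identity and Positives figures in the HSP headers above are simple ratios of matching (or similar) positions to the alignment length, expressed as percentages. As a minimal sketch, the reported percentages can be recomputed from the fractions in a header line. The function name `check_hsp_line` and the regular expression are illustrative assumptions, not part of any BLAST tooling; the pattern accepts both the "Positives" spelling and the "Postives" spelling that some BLAST report renderings emit.

```python
import re

# Matches e.g. "Identity = 241/432 (55.79%)" and
# "Positives = 300/432 (69.44%)" (also the "Postives" misspelling).
HSP_STATS = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def check_hsp_line(line):
    """Parse one HSP header line.

    Returns a dict mapping "Identity" / "Positives" to a tuple
    (matches, alignment_length, reported_pct, recomputed_pct).
    """
    stats = {}
    for label, num, den, pct in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        matches, length = int(num), int(den)
        recomputed = round(100.0 * matches / length, 2)
        stats[key] = (matches, length, float(pct), recomputed)
    return stats


if __name__ == "__main__":
    header = ("Identity = 241/432 (55.79%), "
              "Positives = 300/432 (69.44%), Query Frame = 1")
    for key, (m, n, reported, recomputed) in check_hsp_line(header).items():
        print(key, m, n, reported, recomputed)
```

Comparing the reported and recomputed values is a quick consistency check when HSP headers are copied or re-typed by hand.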

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
Y4990_ARATH  6.1e-28  31.29     Uncharacterized protein At4g19900 OS=Arabidopsis thaliana GN=At4g19900 PE=2 SV=1
A4GAT_MOUSE  2.8e-25  30.04     Lactosylceramide 4-alpha-galactosyltransferase OS=Mus musculus GN=A4galt PE=2 SV...
A4GAT_RAT    3.7e-25  30.83     Lactosylceramide 4-alpha-galactosyltransferase OS=Rattus norvegicus GN=A4galt PE...
A4GAT_PONPY  2.0e-23  33.64     Lactosylceramide 4-alpha-galactosyltransferase (Fragment) OS=Pongo pygmaeus GN=A...
A4GAT_HUMAN  2.6e-23  33.49     Lactosylceramide 4-alpha-galactosyltransferase OS=Homo sapiens GN=A4GALT PE=2 SV...
Match Name        E-value   Identity  Description
M5WNF6_PRUPE      3.4e-118  55.66     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015051mg PE=4 SV=1
W9S212_9ROSA      1.1e-116  53.88     Uncharacterized protein OS=Morus notabilis GN=L484_012116 PE=4 SV=1
B9HYL1_POPTR      2.4e-116  52.64     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s04750g PE=4 SV=2
A0A061GEM3_THECC  1.3e-114  52.49     Alpha 1,4-glycosyltransferase family protein, putative OS=Theobroma cacao GN=TCM...
A0A0D2STE1_GOSRA  3.4e-110  51.55     Uncharacterized protein OS=Gossypium raimondii GN=B456_010G152300 PE=4 SV=1
Match Name   E-value  Identity  Description
AT1G61050.1  3.1e-91  46.04     alpha 1,4-glycosyltransferase family protein
AT5G01250.1  2.4e-83  47.48     alpha 1,4-glycosyltransferase family protein
AT3G09020.1  1.5e-80  50.36     alpha 1,4-glycosyltransferase family protein
AT2G38150.1  6.8e-78  40.87     alpha 1,4-glycosyltransferase family protein
AT2G38152.1  1.2e-74  50.76     alpha 1,4-glycosyltransferase family protein
Match Name                         E-value   Identity  Description
gi|659068007|ref|XP_008442286.1|   5.1e-176  77.67     PREDICTED: lactosylceramide 4-alpha-galactosyltransferase [Cucumis melo]
gi|778658692|ref|XP_004146645.2|   2.7e-169  74.51     PREDICTED: uncharacterized protein At4g19900 [Cucumis sativus]
gi|645237069|ref|XP_008225035.1|   2.2e-118  55.79     PREDICTED: uncharacterized protein At4g19900 [Prunus mume]
gi|595905306|ref|XP_007214045.1|   4.8e-118  55.66     hypothetical protein PRUPE_ppa015051mg [Prunus persica]
gi|1009182116|ref|XP_015872535.1|  1.8e-117  52.25     PREDICTED: uncharacterized protein At4g19900 [Ziziphus jujuba]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR007652  A1-4-GlycosylTfrase_dom
IPR007577  GlycoTrfase_DXD_sugar-bd_CS
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cp4.1LG20g04290  Cp4.1LG20g04290  gene


The following CDS feature(s) are a part of this mRNA:

Feature Name               Unique Name                Type
Cp4.1LG20g04290.1:cds:001  Cp4.1LG20g04290.1:cds:001  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
Cp4.1LG20g04290.1  Cp4.1LG20g04290.1-protein  polypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                               Source   Source Term     Source Description                                                               Alignment
IPR007577  Glycosyltransferase, DXD sugar-binding motif  PFAM     PF04488         Gly_transf_sug                                                                   coord: 179..302; score: 5.5
IPR007652  Alpha 1,4-glycosyltransferase domain          PFAM     PF04572         Gb3_synth                                                                        coord: 320..419; score: 1.7
None       No IPR available                              PANTHER  PTHR12042       LACTOSYLCERAMIDE 4-ALPHA-GALACTOSYLTRANSFERASE ALPHA- 1,4-GALACTOSYLTRANSFERASE  coord: 1..425; score: 9.5E
None       No IPR available                              PANTHER  PTHR12042:SF18  ALPHA 1,4-GLYCOSYLTRANSFERASE-LIKE PROTEIN                                       coord: 1..425; score: 9.5E
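
The `coord: start..end` entries in the InterPro annotations give the span of each match. As a hedged sketch, the span lengths can be computed as below, assuming the coordinates are 1-based, inclusive residue positions on the polypeptide (an assumption about this page's convention, not something the page states). The helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: 179..302" fragments shown in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return (start, end, length) for the first 'coord: a..b' in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no 'coord: start..end' entry found")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return start, end, end - start + 1


if __name__ == "__main__":
    for entry in ("PF04488 Gly_transf_sug coord: 179..302",
                  "PF04572 Gb3_synth coord: 320..419"):
        print(entry, "->", span_length(entry))
```

Under that assumption the two PFAM matches do not overlap, since the first ends at 302 and the second starts at 320.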