Cp4.1LG00g09350 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG00g09350
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionO-fucosyltransferase family protein
LocationCp4.1LG00 : 29047721 .. 29050484 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTATTGTCTCAGTGAAGCCATTCTTTAGCGTTCTTGTTGTCGGTCTTTGTTTGACCTTGGCGGTGCTTCTCTTGTCGCCGTCCTCTTCGCTTTTCCCCGGCGCTTCAGTCTCCTCTGCTCATCAACGGTGCGCTTTACTTTCCCTTTTTCATTATGTTTATGTTCTGACTTCGAGTATATCATTGGATGATCATGAATTTGAAAGCTGCAATTAAAATGAAAAAATTGCTCGATATTGGCGTTTTTTTCCCGAAGAAATAGTGAAAATCCTGCTTGATCTTGCATCATTTGAGAATATGTACTTGTGTTTCTTTCTTAATTAGTTTATGTTGCTTGCTTTGTACTGTGTGGTTGCATTGAGAGTTGGAGAAATCATAAAGAAATTGGATCTTGAGTGGAATTTCACGAGATGAAGTTAAAATAAGGTTTTCTTCGTCTCAGATCTAGTAATAGGATAGAGGCTCTGTCGTGTATCGTCCCCCCTACCTCTCATTCCTCCTCAATGGTGCCATTTTTTTTAGTTCTTTTTGTGAAATCTAGAGTGTAATTAGCAGTTTCAGCTGAAAGACTTGACATAGTTTTTCTACACTACAGAAGATTAAATTGATCCAATTTTAATTGCAGAGGCTACACGGATATATGGAGTCTTCGAAGGATAGCAGAGTGGAGTCCTTGCAAGTGGTGGCTTCAAGGTCACCGACCTGGTAAACTGATTTGATTTCTGGCGACAACTTCTTGTTTACTCTCATGTTTTTCTGTCTTTTCAACTTTTTTTCTATAATGGAATAGCAAGTTTACTTAATTTGTAAGCCACAACCTTTGGACGAATTATGATGATTTGGGTACTGGGAAATTCTTTAGTAGGAATTCACAGCAGACAATATGCATGTAATAAGATATAAAAGGTTATATGGACTTTATGTTGTACTAGGAAAAATGGTTAGATTGTAACAAGGGTTGGGGGGATTCTGATAGGTTTTGATAGTCTGAGTCAGTTTTAATTGGCTTTCATTGCAGACATTTGGATTGGTTTTTTGTTGAAATGCTGCAGCTCTGCCTGCTGATAGCAATGGATATATACGCGTGGATTGCTATGGTGGGCTCAATCAGATGAGAAGAGATGTAATATTCAATGGTTTTTCTATATTTGTAATTTAACTTTGTTCATTACATGGTTAAAGCTTTCTAGCTAAAAGCTGACACTCTTTATCATGTGTGCAGTTGTGTGATGGAGTTGGCGTAGCACGGCTGTTGAATGCTACTCTGGTTCTGCCAAAGTTTGAAGTGGCTGCATATTGGAATGAATCAAGGTAATTAATGCCAATATCTTTTCACTTGCCTTCCATGTTCTATTTCATGCATAGACCTACCTGCAACTTTTTATTCACAAATGGTAATTTTATTTAACTTGGGTTTTTCCTTATGATGCTATCGAACATGTTCATTTACTGACTTCTGGGTGTCAAAATACTCCAATACATTTTCCACTCACTAAAGAAAATGTTATAAGCTGCTTGCATGTGGTTTAACCTTGCTGCTTTTTTTTCCAGTGGTTTTGCAGATGTATTTGATGTTGACTATTTTATTAGTCAGATGAATGGTTATGTAAAAGTTGTTAAAGAGTTGCCACCAGAGTTCGCGTCCAAAGAGCCTCATCATGTGGATTGTAGTAAACGCAAAGGACAGTTCGATTATATTGAAAGTGTCCTACCATCCCTGCTGGAACATCATTATATTTCAATAACTCCAGCAATGAGCCAGAGAAGGGATAGGTACTTCAGAGTTGTAGTATTCTTTATTATTCGTCCTTTTGGTTTGGTTTAGGTCAGTACCATGATAGAACAACGCCCCGATTGGCATCCAAAGCATTGACAGCTACCTCGTTTCAGTTTCAATTGTGTGTGCAATGTTGTACAGGAGTTCTTTTTTGCTATCTTCTTTGTCTTTGCTGTAGTTCTCTAATTTCAGTGATGCGCTTAGTCTGTGGCAATGCTCTTAAATATTTTCCTCCCAAAATTCTTTTACAGGTATCCTCAGTATGCAAAAGCTGCCCTATGTCAAGCTTGTTACAATGCTTTGCGCCTTGCTAAACCCGTGGAGGAGAAAGCCAAGAAACTTTTGGAAGCGATACCAAAGCCCTTTCTCTCTCTTCATCTTCGTTTCGAGCCAGATATGGTGGCTTACAGCCAATGTGAATATCAAGGACTTTCTACTGCTTCTCTAGAAGCAATTGAGGCCGCTCGGGTAGATAGGAAACCATGGACCGGAGAGTTAGCCAAAATATGGAGAAAGCGAGGGAAATGTCCTCTCACCCCTCGTGAGACAGCCCTCATATTTCAAGCACTCCATATTCCAACTAATACAAATATTTACTTGGCTGCTGGTGATGGTTTGATGGAAATGGAAGGCTTTACATCAGTGTACACCAACGTAGTAACCAAGTCTAGTTTCCTCAGTAGCGACGATTTCTCAAACATGCATGGTAACACAAAAGCTGCACTGGATTATTATGTGTCTATTAATAGTGATTCTTATGTAGCAACATTCTTTGGAAATATGGACAAGATGGTCGCAGCAATGCGCGCATTCAACGGAAAGCAGAGGACATTGTTTTTAAGTCGACGAGCTTTTGCAGAGTTCACTTACAATGGTCTGAAAGGGAAGGAGCTGAATCAAGCATTGTGGAAGGCTCATAGAGATGATTTCGTCATGGGCAGGGGATCTGCATTGTCCGACTGCTTTTGTGAGTTCAAACTCTGA

mRNA sequence

ATGTTTATTGTCTCAGTGAAGCCATTCTTTAGCGTTCTTGTTGTCGGTCTTTGTTTGACCTTGGCGGTGCTTCTCTTGTCGCCGTCCTCTTCGCTTTTCCCCGGCGCTTCAGTCTCCTCTGCTCATCAACGAGGCTACACGGATATATGGAGTCTTCGAAGGATAGCAGAGTGGAGTCCTTGCAAGTGGTGGCTTCAAGGTCACCGACCTGCTCTGCCTGCTGATAGCAATGGATATATACGCGTGGATTGCTATGGTGGGCTCAATCAGATGAGAAGAGATTTGTGTGATGGAGTTGGCGTAGCACGGCTGTTGAATGCTACTCTGGTTCTGCCAAAGTTTGAAGTGGCTGCATATTGGAATGAATCAAGTGGTTTTGCAGATGTATTTGATGTTGACTATTTTATTAGTCAGATGAATGGTTATGTAAAAGTTGTTAAAGAGTTGCCACCAGAGTTCGCGTCCAAAGAGCCTCATCATGTGGATTGTAGTAAACGCAAAGGACAGTTCGATTATATTGAAAGTGTCCTACCATCCCTGCTGGAACATCATTATATTTCAATAACTCCAGCAATGAGCCAGAGAAGGGATAGGTATCCTCAGTATGCAAAAGCTGCCCTATGTCAAGCTTGTTACAATGCTTTGCGCCTTGCTAAACCCGTGGAGGAGAAAGCCAAGAAACTTTTGGAAGCGATACCAAAGCCCTTTCTCTCTCTTCATCTTCGTTTCGAGCCAGATATGGTGGCTTACAGCCAATGTGAATATCAAGGACTTTCTACTGCTTCTCTAGAAGCAATTGAGGCCGCTCGGGTAGATAGGAAACCATGGACCGGAGAGTTAGCCAAAATATGGAGAAAGCGAGGGAAATGTCCTCTCACCCCTCGTGAGACAGCCCTCATATTTCAAGCACTCCATATTCCAACTAATACAAATATTTACTTGGCTGCTGGTGATGGTTTGATGGAAATGGAAGGCTTTACATCAGTGTACACCAACGTAGTAACCAAGTCTAGTTTCCTCAGTAGCGACGATTTCTCAAACATGCATGGTAACACAAAAGCTGCACTGGATTATTATGTGTCTATTAATAGTGATTCTTATGTAGCAACATTCTTTGGAAATATGGACAAGATGGTCGCAGCAATGCGCGCATTCAACGGAAAGCAGAGGACATTGTTTTTAAGTCGACGAGCTTTTGCAGAGTTCACTTACAATGGTCTGAAAGGGAAGGAGCTGAATCAAGCATTGTGGAAGGCTCATAGAGATGATTTCGTCATGGGCAGGGGATCTGCATTGTCCGACTGCTTTTGTGAGTTCAAACTCTGA

Coding sequence (CDS)

ATGTTTATTGTCTCAGTGAAGCCATTCTTTAGCGTTCTTGTTGTCGGTCTTTGTTTGACCTTGGCGGTGCTTCTCTTGTCGCCGTCCTCTTCGCTTTTCCCCGGCGCTTCAGTCTCCTCTGCTCATCAACGAGGCTACACGGATATATGGAGTCTTCGAAGGATAGCAGAGTGGAGTCCTTGCAAGTGGTGGCTTCAAGGTCACCGACCTGCTCTGCCTGCTGATAGCAATGGATATATACGCGTGGATTGCTATGGTGGGCTCAATCAGATGAGAAGAGATTTGTGTGATGGAGTTGGCGTAGCACGGCTGTTGAATGCTACTCTGGTTCTGCCAAAGTTTGAAGTGGCTGCATATTGGAATGAATCAAGTGGTTTTGCAGATGTATTTGATGTTGACTATTTTATTAGTCAGATGAATGGTTATGTAAAAGTTGTTAAAGAGTTGCCACCAGAGTTCGCGTCCAAAGAGCCTCATCATGTGGATTGTAGTAAACGCAAAGGACAGTTCGATTATATTGAAAGTGTCCTACCATCCCTGCTGGAACATCATTATATTTCAATAACTCCAGCAATGAGCCAGAGAAGGGATAGGTATCCTCAGTATGCAAAAGCTGCCCTATGTCAAGCTTGTTACAATGCTTTGCGCCTTGCTAAACCCGTGGAGGAGAAAGCCAAGAAACTTTTGGAAGCGATACCAAAGCCCTTTCTCTCTCTTCATCTTCGTTTCGAGCCAGATATGGTGGCTTACAGCCAATGTGAATATCAAGGACTTTCTACTGCTTCTCTAGAAGCAATTGAGGCCGCTCGGGTAGATAGGAAACCATGGACCGGAGAGTTAGCCAAAATATGGAGAAAGCGAGGGAAATGTCCTCTCACCCCTCGTGAGACAGCCCTCATATTTCAAGCACTCCATATTCCAACTAATACAAATATTTACTTGGCTGCTGGTGATGGTTTGATGGAAATGGAAGGCTTTACATCAGTGTACACCAACGTAGTAACCAAGTCTAGTTTCCTCAGTAGCGACGATTTCTCAAACATGCATGGTAACACAAAAGCTGCACTGGATTATTATGTGTCTATTAATAGTGATTCTTATGTAGCAACATTCTTTGGAAATATGGACAAGATGGTCGCAGCAATGCGCGCATTCAACGGAAAGCAGAGGACATTGTTTTTAAGTCGACGAGCTTTTGCAGAGTTCACTTACAATGGTCTGAAAGGGAAGGAGCTGAATCAAGCATTGTGGAAGGCTCATAGAGATGATTTCGTCATGGGCAGGGGATCTGCATTGTCCGACTGCTTTTGTGAGTTCAAACTCTGA

Protein sequence

MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSPCKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSLLEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLHLRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALIFQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAHRDDFVMGRGSALSDCFCEFKL
BLAST of Cp4.1LG00g09350 vs. Swiss-Prot
Match: Y1491_ARATH (Uncharacterized protein At1g04910 OS=Arabidopsis thaliana GN=At1g04910 PE=2 SV=1)

HSP 1 Score: 170.6 bits (431), Expect = 3.8e-41
Identity = 125/422 (29.62%), Postives = 213/422 (50.47%), Query Frame = 1

Query: 13  LVVGLCLTLAVLLLSPSSSLFPGASVSSAHQR------GYTDIWSLRRIAEWSPCKWWLQ 72
           +V  L + + VLL+   S LF  A++ S  +          ++W   +   W P     +
Sbjct: 21  MVAKLSIGVIVLLICTLSLLF-SANIGSNREPTRPSKINVEELWESAKSGGWRPSSA-PR 80

Query: 73  GHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNESSGF 132
              P    ++NGY+RV C GGLNQ R  +C+ V  AR++NATLVLP+ +  ++W++ SGF
Sbjct: 81  SDWPPPTKETNGYLRVRCNGGLNQQRSAICNAVLAARIMNATLVLPELDANSFWHDDSGF 140

Query: 133 ADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFD-----YIESVLPSLL 192
             ++DV++FI  +   VK+V ++P    + +   +   + +   D     Y+ + L ++ 
Sbjct: 141 QGIYDVEHFIETLKYDVKIVGKIPDVHKNGKTKKIKAFQIRPPRDAPIEWYLTTALKAMR 200

Query: 193 EHHYISITPAMSQRRDRY--PQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKP--FL 252
           EH  I +TP   +  +    P+Y +   C+  Y+ALR    + + ++ +++ +     F+
Sbjct: 201 EHSAIYLTPFSHRLAEEIDNPEYQRLR-CRVNYHALRFKPHIMKLSESIVDKLRSQGHFM 260

Query: 253 SLHLRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKR---GKCPLTP 312
           S+HLRFE DM+A++ C        + E  +  R  RK    +   I+ +R   GKCPLTP
Sbjct: 261 SIHLRFEMDMLAFAGC----FDIFNPEEQKILRKYRKENFADKRLIYNERRAIGKCPLTP 320

Query: 313 RETALIFQALHIPTNTNIYLAAGD---GLMEMEGFTSVYTNVVTKSSFLSSDDFS-NMHG 372
            E  LI +A+    +T IYLAAG+   G   M+ F +++  +   SS   S++ S    G
Sbjct: 321 EEVGLILRAMRFDNSTRIYLAAGELFGGEQFMKPFRTLFPRLDNHSSVDPSEELSATSQG 380

Query: 373 NTKAALDYYVSINSDSYVATFFG--NMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLK 411
              +A+DY V + SD ++ T+ G  N    +   R + G + T+   R+A A       K
Sbjct: 381 LIGSAVDYMVCLLSDIFMPTYDGPSNFANNLLGHRLYYGFRTTIRPDRKALAPIFIAREK 435

BLAST of Cp4.1LG00g09350 vs. TrEMBL
Match: A0A0A0KBV3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432630 PE=4 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 3.3e-230
Identity = 390/441 (88.44%), Postives = 414/441 (93.88%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           MF+VSVKPFFSVL+V L L+ AVLLLSP SS F   S S  ++RGYTDIWS+RRI EW P
Sbjct: 1   MFLVSVKPFFSVLLVTLSLSFAVLLLSPPSSFFSNTSFSFTNRRGYTDIWSVRRIVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWL+GH PALPAD+NGYIRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLRGHLPALPADTNGYIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVD+FI QMNGYVKV KELPPEFASKEP+HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDHFIGQMNGYVKVAKELPPEFASKEPYHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKAALCQ CYN LRLAK VE+KA++LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQVCYNGLRLAKSVEKKARELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY+GLS  SLEAIEA R DRKPWTG+LA+IWRKRGKCPLTPRETALI
Sbjct: 241 LRFEPDMVAYSQCEYKGLSPTSLEAIEATRGDRKPWTGQLAEIWRKRGKCPLTPRETALI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
           FQALHIPTNTNIYLAAGDGLME+EGFTSVYTNVVTKSSFLS++DFS+MHGNTKAALDYYV
Sbjct: 301 FQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSNNDFSSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSD YVATFFGNMDKMVAAMRAFNGKQ+TLFLSRRAFAEFTY GL+GKEL+QALWK H
Sbjct: 361 SINSDYYVATFFGNMDKMVAAMRAFNGKQKTLFLSRRAFAEFTYKGLEGKELDQALWKTH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSALSDCFCEFKL
Sbjct: 421 RDDFSMGRGSALSDCFCEFKL 441

BLAST of Cp4.1LG00g09350 vs. TrEMBL
Match: M5X4Y8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005861mg PE=4 SV=1)

HSP 1 Score: 721.5 bits (1861), Expect = 6.4e-205
Identity = 352/441 (79.82%), Postives = 388/441 (87.98%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           M ++SVKP F++ V+ L L L+VLLLSP+S L+  A +         DIWS+ R+ EW P
Sbjct: 1   MVVISVKPIFTIGVLTLSLILSVLLLSPTSPLYK-APLFLTSPMNKLDIWSVGRMVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWLQGH  ALP+ +NG+IRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLQGHLTALPSKNNGFIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVDYFI QMNG++KVVKELP + +SKEP HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDYFIQQMNGFIKVVKELPSDISSKEPFHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKA+LCQACY+ALRL K +E+KA +LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKASLCQACYSALRLTKSLEKKASELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY  LS+AS+EAIEAAR DRKPWTGE+A+IWRKRGKCPLTP ETA I
Sbjct: 241 LRFEPDMVAYSQCEYPDLSSASMEAIEAARGDRKPWTGEVAQIWRKRGKCPLTPNETAFI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
            Q L IPTNTNIYLAAGDGLME+EG TSVYTNV TKSS LS +DF +MHGNTKAALDYYV
Sbjct: 301 LQVLSIPTNTNIYLAAGDGLMEIEGLTSVYTNVFTKSSLLSGEDFKSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSDSY+AT+FGNMDKMVAAMRAFNG  +TLFLSRRAFAEFT+ GL+GKEL  AL KAH
Sbjct: 361 SINSDSYIATYFGNMDKMVAAMRAFNGLYKTLFLSRRAFAEFTFQGLRGKELMNALRKAH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSAL DCFCEFKL
Sbjct: 421 RDDFAMGRGSALPDCFCEFKL 440

BLAST of Cp4.1LG00g09350 vs. TrEMBL
Match: A0A067KXV6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06161 PE=4 SV=1)

HSP 1 Score: 716.8 bits (1849), Expect = 1.6e-203
Identity = 353/442 (79.86%), Postives = 383/442 (86.65%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGA-SVSSAHQRGYTDIWSLRRIAEWS 60
           M  VSVKP F+ +     L LAV+ LSPS++LF    S SS       DIWS+RRI EW 
Sbjct: 1   MIAVSVKPLFTFIFT-FSLVLAVVFLSPSTTLFSSTISSSSDSSTRKLDIWSVRRIVEWR 60

Query: 61  PCKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAY 120
           PCKWWLQGH  ALPA SNGYIRVDCYGGLNQMRRD CDGVG+ARLLNATLVLPKFE AAY
Sbjct: 61  PCKWWLQGHLTALPAKSNGYIRVDCYGGLNQMRRDFCDGVGIARLLNATLVLPKFEAAAY 120

Query: 121 WNESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPS 180
           WNESSGFADVFDVDYFI Q+ G+VKVVK+LPPE ASKEP HVDCSKRKGQFDYIESVLPS
Sbjct: 121 WNESSGFADVFDVDYFIQQVKGFVKVVKDLPPEIASKEPFHVDCSKRKGQFDYIESVLPS 180

Query: 181 LLEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSL 240
           LLEHHYISITPAMSQRRDRYP YAKAALCQACY+ALRL + +E+KA +LLEAIPKPFLSL
Sbjct: 181 LLEHHYISITPAMSQRRDRYPSYAKAALCQACYSALRLTRSLEKKASELLEAIPKPFLSL 240

Query: 241 HLRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETAL 300
           HLRFEPDMVAYSQCEY GLS AS+EAIEAAR  RKPWTGE A+IWRKRGKCPLTP ETAL
Sbjct: 241 HLRFEPDMVAYSQCEYLGLSPASMEAIEAARDYRKPWTGESARIWRKRGKCPLTPNETAL 300

Query: 301 IFQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYY 360
           I QAL IPTNTNIYLAAGDGLME+EG TS+YTNV  K++ LSS+DF++MHGNTKAALDYY
Sbjct: 301 ILQALSIPTNTNIYLAAGDGLMEIEGLTSIYTNVFNKATLLSSEDFTSMHGNTKAALDYY 360

Query: 361 VSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKA 420
           VSINSDSY+AT+FGNMDKMVAAMRA+ G  +TLFLSRRAFAEFT+  LKGKEL QALWK 
Sbjct: 361 VSINSDSYMATYFGNMDKMVAAMRAYKGLYKTLFLSRRAFAEFTFQSLKGKELMQALWKT 420

Query: 421 HRDDFVMGRGSALSDCFCEFKL 442
           H +DFVMGRGSAL DCFCEFKL
Sbjct: 421 HEEDFVMGRGSALPDCFCEFKL 441

BLAST of Cp4.1LG00g09350 vs. TrEMBL
Match: A0A059BSS4_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02536 PE=4 SV=1)

HSP 1 Score: 707.2 bits (1824), Expect = 1.2e-200
Identity = 342/438 (78.08%), Postives = 384/438 (87.67%), Query Frame = 1

Query: 4   VSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSPCKW 63
           VSVKP F VLVV L + LA+ +L+P S +F   S SS  + G+ DIWS+RR+ EW PCKW
Sbjct: 9   VSVKPLFGVLVVSLSVLLAISVLAPRS-IFSQTSFSSLSREGF-DIWSVRRMVEWRPCKW 68

Query: 64  WLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNES 123
           WLQGH  ALP  SNGYIR+DCYGGLNQMRRDLCDGVG+ARLLNATLV+PKFEVAAYWNES
Sbjct: 69  WLQGHLTALPEKSNGYIRIDCYGGLNQMRRDLCDGVGIARLLNATLVMPKFEVAAYWNES 128

Query: 124 SGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSLLEH 183
           SGFADVFDVDYFI QM G+V+V KELPPEFASKEP  VDCSKRKGQFDY+ESVLP LL+H
Sbjct: 129 SGFADVFDVDYFIQQMTGFVRVAKELPPEFASKEPFSVDCSKRKGQFDYLESVLPFLLKH 188

Query: 184 HYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLHLRF 243
           HYISITPAMSQRRDRYP+YAKAALCQ CY+ALRL K +E+KA +LL AIPKPFL+LHLRF
Sbjct: 189 HYISITPAMSQRRDRYPEYAKAALCQGCYSALRLTKSLEKKASELLLAIPKPFLALHLRF 248

Query: 244 EPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALIFQA 303
           EPDMVAYSQCEY GLS +S+EAIEAAR D+KPWT +LAKIWR+RGKCPLTPRETA I +A
Sbjct: 249 EPDMVAYSQCEYSGLSPSSMEAIEAARGDKKPWTADLAKIWRRRGKCPLTPRETAFILKA 308

Query: 304 LHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYVSIN 363
           L IPT+T+IYLAAGDG+ME+EG TSVYTNVVTKS  LS +DF +MHGNTKAALDYYVSIN
Sbjct: 309 LRIPTDTHIYLAAGDGVMEIEGLTSVYTNVVTKSVLLSGEDFKSMHGNTKAALDYYVSIN 368

Query: 364 SDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAHRDD 423
           SD+Y+AT+FGNMDKMVAAMRAFNG  +TLFLSRRAF+E  Y GL+GK L +ALW+AHRDD
Sbjct: 369 SDAYMATYFGNMDKMVAAMRAFNGLYKTLFLSRRAFSELIYKGLRGKRLMRALWEAHRDD 428

Query: 424 FVMGRGSALSDCFCEFKL 442
           FVMG GSAL DCFCEFKL
Sbjct: 429 FVMGIGSALPDCFCEFKL 444

BLAST of Cp4.1LG00g09350 vs. TrEMBL
Match: A0A061GHF3_THECC (O-fucosyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_028237 PE=4 SV=1)

HSP 1 Score: 702.2 bits (1811), Expect = 4.0e-199
Identity = 347/450 (77.11%), Postives = 387/450 (86.00%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLF-PGASVSSAHQ-------RGYTDIWSL 60
           M +VSVKP F VL+  L L LA++LLSPS     P  SV+S +        RG +DIWS+
Sbjct: 1   MIMVSVKPLF-VLISTLSLFLAIVLLSPSRPFSQPSQSVNSLNMLPIRSLSRGKSDIWSV 60

Query: 61  RRIAEWSPCKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLP 120
           +RI EW PCKWWLQ H   LPA SNGYIRV+CYGGLNQMRRD CDGVG+ARLLNATLVLP
Sbjct: 61  KRIVEWRPCKWWLQSHLTPLPAKSNGYIRVNCYGGLNQMRRDFCDGVGIARLLNATLVLP 120

Query: 121 KFEVAAYWNESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDY 180
           KFEVAAYWNESSGFADVFDV+YFI QM+G+VKVV+ELPPE +SKEP  VDCSKRKGQFDY
Sbjct: 121 KFEVAAYWNESSGFADVFDVNYFIKQMSGFVKVVRELPPEISSKEPFRVDCSKRKGQFDY 180

Query: 181 IESVLPSLLEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAI 240
           IESVLPSLL+HHYISITPAMSQRRDRYPQYAKAALCQ CY+ALRL K +E+KA +LLEAI
Sbjct: 181 IESVLPSLLKHHYISITPAMSQRRDRYPQYAKAALCQGCYSALRLTKSLEKKANELLEAI 240

Query: 241 PKPFLSLHLRFEPDMVAYSQCEYQGLSTASLEAIEAAR-VDRKPWTGELAKIWRKRGKCP 300
           PKPFL+LHLRFEPDMVAYSQC+Y GLS  S+EAIEAAR  DRKPWTGE A+IWRKRGKCP
Sbjct: 241 PKPFLALHLRFEPDMVAYSQCQYSGLSPTSIEAIEAARGDDRKPWTGEAARIWRKRGKCP 300

Query: 301 LTPRETALIFQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGN 360
           L P ETA I QA+ +PTNTNIYLAAGDGLME+EG TS+YTNVVTKS+ LS +DF +MHGN
Sbjct: 301 LIPNETAFILQAISVPTNTNIYLAAGDGLMEIEGLTSIYTNVVTKSALLSGEDFKSMHGN 360

Query: 361 TKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKE 420
           TKAALDYYVSINSDSYVAT+FGNMDKMVAAMRAF G  +TLFLSRRAF+EFT  GL+GK+
Sbjct: 361 TKAALDYYVSINSDSYVATYFGNMDKMVAAMRAFKGLYKTLFLSRRAFSEFTSEGLEGKQ 420

Query: 421 LNQALWKAHRDDFVMGRGSALSDCFCEFKL 442
           L +ALWK H++DFVMGRGSAL DCFCEFKL
Sbjct: 421 LMKALWKVHKEDFVMGRGSALPDCFCEFKL 449

BLAST of Cp4.1LG00g09350 vs. TAIR10
Match: AT1G52630.1 (AT1G52630.1 O-fucosyltransferase family protein)

HSP 1 Score: 664.1 bits (1712), Expect = 6.1e-191
Identity = 324/438 (73.97%), Postives = 367/438 (83.79%), Query Frame = 1

Query: 6   VKPFFSVLVVGLCLTLAVLLLSPSSSLF--PGASVSSAHQRGYTDIWSLRRIAEWSPCKW 65
           VKP F V V+   L L V+LLSPS  +   P  S SS    G +DIWS++RI EW PCKW
Sbjct: 6   VKPLF-VFVLTFSLLLVVILLSPSPHILQIPFPSGSSV---GSSDIWSVKRIMEWRPCKW 65

Query: 66  WLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNES 125
           WLQGH   LPA +NGYIRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYWNES
Sbjct: 66  WLQGHLTPLPAKTNGYIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYWNES 125

Query: 126 SGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSLLEH 185
           SGFADVFDVDYFI +M+GY++VVKELP + ASKEP  VDCSKRKGQFDYIESVLP LLEH
Sbjct: 126 SGFADVFDVDYFIQKMSGYIEVVKELPKDIASKEPFKVDCSKRKGQFDYIESVLPLLLEH 185

Query: 186 HYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLHLRF 245
           HYIS TPAMSQRRDRYP+YA+A LCQACY+A+ L   +E+KA +L +AIPKPFLSLHLRF
Sbjct: 186 HYISFTPAMSQRRDRYPEYARATLCQACYSAIHLTSSLEKKAVELFDAIPKPFLSLHLRF 245

Query: 246 EPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALIFQA 305
           EPDMVAYSQCEY  LS +S+ AIEAAR DRKPWTGELA+ WRKRGKCPLTP ET L+ Q+
Sbjct: 246 EPDMVAYSQCEYPNLSPSSIAAIEAARADRKPWTGELAQTWRKRGKCPLTPNETVLMLQS 305

Query: 306 LHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYVSIN 365
           L+IPT+TNIYLAAGDGLMEMEGFTSVYTNV TKS  L+ +DF+ MHGNTKAALDY+VSIN
Sbjct: 306 LNIPTSTNIYLAAGDGLMEMEGFTSVYTNVFTKSVLLNQEDFTRMHGNTKAALDYHVSIN 365

Query: 366 SDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAHRDD 425
           SD+YVAT+FGNMDK+VAAMR +     TLFLSR+AFAE T  GL+G EL +ALW+ H+ D
Sbjct: 366 SDAYVATYFGNMDKIVAAMRTYKQMHNTLFLSRKAFAELTSQGLEGAELKKALWEVHKSD 425

Query: 426 FVMGRGSALSDCFCEFKL 442
           F +GRG AL DCFCEF+L
Sbjct: 426 FAIGRGFALPDCFCEFEL 439

BLAST of Cp4.1LG00g09350 vs. TAIR10
Match: AT3G02250.1 (AT3G02250.1 O-fucosyltransferase family protein)

HSP 1 Score: 221.5 bits (563), Expect = 1.0e-57
Identity = 134/403 (33.25%), Postives = 212/403 (52.61%), Query Frame = 1

Query: 48  DIWSLRRIAEWSPC----KWWLQGHRPALPA----------DSNGYIRVDCYGGLNQMRR 107
           ++W  R +  W  C     + +     +LP            +NGY+ V C GGLNQMR 
Sbjct: 64  EMWGPRLLKGWPSCFNHHDFPIAAEMTSLPMKIALPPKRIYQNNGYLMVSCNGGLNQMRA 123

Query: 108 DLCDGVGVARLLNATLVLPKFEVAAYWNESSGFADVFDVDYFISQMNGYVKVVKELPPEF 167
            +CD V +AR +N TL++P+ +  ++WN+ S F D+FDVD+FIS +   V+++KELPP  
Sbjct: 124 AICDMVTIARYMNVTLIVPELDKTSFWNDPSEFKDIFDVDHFISSLRDEVRILKELPPRL 183

Query: 168 ASKEP----HHVDCSKRKGQFDYIESVLPSLLEHHYISITPAMSQ-RRDRYPQYAKAALC 227
             +      H +          Y + +LP + ++  + +    ++   +  P   +   C
Sbjct: 184 KRRVRLGLYHTMPPISWSNMSYYQDQILPLVKKYKVVHLNKTDTRLANNELPVEIQKLRC 243

Query: 228 QACYNALRLAKPVEEKAKKLLEAIPK--PFLSLHLRFEPDMVAYSQCEYQGLSTASLEAI 287
           +A +N LR    +EE  +++++ + +  PFL LHLR+E DM+A+S C + G +    E +
Sbjct: 244 RANFNGLRFTPKIEELGRRVVKILREKGPFLVLHLRYEMDMLAFSGCSH-GCNRYEEEEL 303

Query: 288 EAARVDRKPWTGEL--AKIWRKRGKCPLTPRETALIFQALHIPTNTNIYLAAGD---GLM 347
              R     W  ++  +++ RK G CPLTP ETAL   AL I  N  IY+AAG+   G  
Sbjct: 304 TRMRYAYPWWKEKVIDSELKRKEGLCPLTPEETALTLSALGIDRNVQIYIAAGEIYGGKR 363

Query: 348 EMEGFTSVYTNVVTKSSFLSSDD--FSNMHGNTKAALDYYVSINSDSYVATFFGNMDKMV 407
            ++  T V+ NVV K + L S D  F   H +  AALDY +S+ SD +V T++GNM K+V
Sbjct: 364 RLKALTDVFPNVVRKETLLDSSDLSFCKNHSSQMAALDYLISLESDIFVPTYYGNMAKVV 423

Query: 408 AAMRAFNGKQRTLFLSRRAFAEFT---YNGLKGKELNQALWKA 420
              R F G ++T+ L+R+   +     Y GL   E+     KA
Sbjct: 424 EGHRRFLGFKKTIELNRKLLVKLIDEYYEGLLSWEVFSTTVKA 465

BLAST of Cp4.1LG00g09350 vs. TAIR10
Match: AT5G15740.1 (AT5G15740.1 O-fucosyltransferase family protein)

HSP 1 Score: 210.7 bits (535), Expect = 1.8e-54
Identity = 122/336 (36.31%), Postives = 187/336 (55.65%), Query Frame = 1

Query: 76  SNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNESSGFADVFDVDYF 135
           +NGY+ V C GGLNQMR  +CD V VAR +N TL++P+ +  ++WN+ S F D+FDVD+F
Sbjct: 106 NNGYLMVSCNGGLNQMRAAICDMVTVARYMNVTLIVPELDKTSFWNDPSEFKDIFDVDHF 165

Query: 136 ISQMNGYVKVVKELPPEFASKEP----HHVDCSKRKGQFDYIESVLPSLLEHHYISITPA 195
           IS +   V+++KELPP    +      H +          Y   +LP + +H  + +   
Sbjct: 166 ISSLRDEVRILKELPPRLKKRVELGVYHEMPPISWSNMSYYQNQILPLVKKHKVLHLNRT 225

Query: 196 MSQ-RRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPK--PFLSLHLRFEPDMV 255
            ++   +  P   +   C+  +N L+    +EE  +++++ + +  PFL LHLR+E DM+
Sbjct: 226 DTRLANNGLPVEVQKLRCRVNFNGLKFTPQIEELGRRVVKILREKGPFLVLHLRYEMDML 285

Query: 256 AYSQCEYQGLSTASLEAIEAARVDRKPWTGEL--AKIWRKRGKCPLTPRETALIFQALHI 315
           A+S C + G +    E +   R     W  ++  +++ RK G CPLTP ETAL   AL I
Sbjct: 286 AFSGCSH-GCNPEEEEELTRMRYAYPWWKEKVINSELKRKDGLCPLTPEETALTLTALGI 345

Query: 316 PTNTNIYLAAGD---GLMEMEGFTSVYTNVVTKSSFLSSD--DFSNMHGNTKAALDYYVS 375
             N  IY+AAG+   G   M+  T  + NVV K + L S   DF   H +  AALDY V+
Sbjct: 346 DRNVQIYIAAGEIYGGQRRMKALTDAFPNVVRKETLLESSDLDFCRNHSSQMAALDYLVA 405

Query: 376 INSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRR 398
           + SD +V T  GNM ++V   R F G ++T+ L+RR
Sbjct: 406 LESDIFVPTNDGNMARVVEGHRRFLGFKKTIQLNRR 440

BLAST of Cp4.1LG00g09350 vs. TAIR10
Match: AT5G35570.1 (AT5G35570.1 O-fucosyltransferase family protein)

HSP 1 Score: 198.0 bits (502), Expect = 1.2e-50
Identity = 123/391 (31.46%), Postives = 206/391 (52.69%), Query Frame = 1

Query: 46  YTDIWSLRRIAEWSPCKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLL 105
           ++ +W+      +S C    +  R  L A++NGY+ ++  GGLNQMR  +CD V VA+++
Sbjct: 222 FSGVWAKPESGNFSRCIDSSRS-RKKLGANTNGYLLINANGGLNQMRFGICDMVAVAKIM 281

Query: 106 NATLVLPKFEVAAYWNESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSK 165
            ATLVLP  + ++YW + SGF D+FD  +FI ++   + +V+ LP E A  EP       
Sbjct: 282 KATLVLPSLDHSSYWADDSGFKDLFDWQHFIEELKDDIHIVEMLPSELAGIEPFVKTPIS 341

Query: 166 RKGQFDYIESVLPSLLEHHYISITPAMSQ-RRDRYPQYAKAALCQACYNALRLAKPVEEK 225
                 Y + VLP L +H  + +T   S+   +  P   +   C+  Y AL+ + P+EE 
Sbjct: 342 WSKVGYYKKEVLPLLKQHIVMYLTHTDSRLANNDLPDSVQKLRCRVNYRALKYSAPIEEL 401

Query: 226 AKKLLEAIPK---PFLSLHLRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELA 285
              L+  + +   P+L+LHLR+E DM+A++ C +  L+    E +   R +   W  +  
Sbjct: 402 GNVLVSRMRQDRGPYLALHLRYEKDMLAFTGCSH-SLTAEEDEELRQMRYEVSHWKEKEI 461

Query: 286 KIWRKR--GKCPLTPRETALIFQALHIPTNTNIYLAAGD--GLMEMEGFTSVYTNVVTKS 345
               +R  G CPLTPRET+L+ +AL  P+++ IYL AG+  G   M+   + + N+ + S
Sbjct: 462 NGTERRLQGGCPLTPRETSLLLRALEFPSSSRIYLVAGEAYGNGSMDPLNTDFPNIFSHS 521

Query: 346 SFLSSDDFS--NMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLS 405
              + ++ S  N H N  A LDY V++ S+ ++ T+ GNM K V   R F   ++T+   
Sbjct: 522 ILATKEELSPFNNHQNMLAGLDYIVALQSEVFLYTYDGNMAKAVQGHRRFEDFKKTINPD 581

Query: 406 RRAFAEFT----YNGLKGKELNQALWKAHRD 423
           +  F +         +  K+ +  + K H+D
Sbjct: 582 KMNFVKLVDALDEGRISWKKFSSKVKKLHKD 610

BLAST of Cp4.1LG00g09350 vs. TAIR10
Match: AT4G16650.1 (AT4G16650.1 O-fucosyltransferase family protein)

HSP 1 Score: 196.4 bits (498), Expect = 3.6e-50
Identity = 145/455 (31.87%), Postives = 225/455 (49.45%), Query Frame = 1

Query: 10  FSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSPCKWWLQGHR 69
           FS+ V+ L     V  L   S      S+    +R   D+W  +    +  C    +   
Sbjct: 55  FSLGVISLFTGHVVSHLEWYSQQLSKRSLLDMSRREPIDVWKSKYSKFFYGCSERGRNFL 114

Query: 70  PALPADS-NGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYWNESSGFAD 129
           PA+   S NGY+ +   GGLNQ R  + D V VAR+LNATLV+P+ +  +YW + S F+D
Sbjct: 115 PAVQEQSSNGYLLIAASGGLNQQRTGITDAVVVARILNATLVVPELDHHSYWKDDSDFSD 174

Query: 130 VFDVDYFISQMNGYVKVVKELPPEF--ASKEPHHVDCSKRKGQFD-YIESVLPSLLEHHY 189
           +FDV++FIS +   V +VK +P     A ++P +     RK   + Y++ VLP L   H 
Sbjct: 175 IFDVNWFISSLAKDVTIVKRVPDRVMRAMEKPPYTTRVPRKSTLEYYLDQVLPILTRRHV 234

Query: 190 ISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEE---KAKKLLEAIPKPFLSLHLR 249
           + +T    +  +   +  +   C+  Y+ALR  K ++    K  K +  + K F+++HLR
Sbjct: 235 LQLTKFDYRLANDLDEDMQKLRCRVNYHALRFTKRIQSVGMKVVKRMRKMAKRFIAVHLR 294

Query: 250 FEPDMVAYSQCEYQGLSTASLEAIE-AARVDRKPWTGELAKIWRKRGKCPLTPRETALIF 309
           FEPDM+A+S C++ G      E  E   R D  P    L +  RKRGKCPLTP E  L+ 
Sbjct: 295 FEPDMLAFSGCDFGGGEKERAELAEIRKRWDTLPDLDPLEE--RKRGKCPLTPHEVGLML 354

Query: 310 QALHIPTNTNIYLAAGD---GLMEMEGFTSVYTNVVTKSSFLSSDDFSNM--HGNTKAAL 369
           +AL    +T IY+A+G+   G   ++    ++ N  TK   L++D+   +  + +  AA+
Sbjct: 355 RALGFTNDTYIYVASGEIYGGEKTLKPLRELFPNFYTK-EMLANDELKPLLPYSSRLAAI 414

Query: 370 DYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRT----------LFLSRRAFAEFTYNG 429
           DY VS  SD ++    GNM K++A  R + G +RT          LF+ R      T+  
Sbjct: 415 DYIVSDESDVFITNNNGNMAKILAGRRRYMGHKRTIRPNAKKLSALFMDREKMEWQTF-- 474

Query: 430 LKGKELNQALWKAHRDDFVMGRG---SALSDCFCE 439
            K  +  Q  +    D+F  GRG        C C+
Sbjct: 475 AKKVKSCQRGFMGDPDEFKPGRGEFHEYPQSCICQ 504

BLAST of Cp4.1LG00g09350 vs. NCBI nr
Match: gi|659122364|ref|XP_008461103.1| (PREDICTED: uncharacterized protein At1g04910 isoform X1 [Cucumis melo])

HSP 1 Score: 813.9 bits (2101), Expect = 1.4e-232
Identity = 395/441 (89.57%), Postives = 413/441 (93.65%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           MF+VSVKPFFSVL+V L L+LAVLLLSP SS F   S S  ++R YTDIWS+RRI EW P
Sbjct: 1   MFLVSVKPFFSVLLVTLSLSLAVLLLSPPSSFFSATSFSFTNERSYTDIWSVRRIVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWL+GH PALPADSNGYIRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLRGHLPALPADSNGYIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVDYFI QMNGYVKV KELPPE ASKEP+HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDYFIGQMNGYVKVAKELPPELASKEPYHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKAALCQ CYN LRLAKPVE+KA++LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQVCYNGLRLAKPVEKKARELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY+GLS  SLEAIE  R DRKPWTGELA+IWRKRGKCPLTPRETALI
Sbjct: 241 LRFEPDMVAYSQCEYKGLSPTSLEAIEETRGDRKPWTGELAEIWRKRGKCPLTPRETALI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
           FQALHIPTNTNIYLAAGDGLME+EGFTSVYTNVVTKSSFL+SDDFS+MHGNTKAALDYYV
Sbjct: 301 FQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLTSDDFSSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSD YVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTY GL+GKELNQALWK H
Sbjct: 361 SINSDYYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYKGLEGKELNQALWKTH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSALSDCFCEFKL
Sbjct: 421 RDDFAMGRGSALSDCFCEFKL 441

BLAST of Cp4.1LG00g09350 vs. NCBI nr
Match: gi|449436132|ref|XP_004135848.1| (PREDICTED: uncharacterized protein At1g04910 isoform X1 [Cucumis sativus])

HSP 1 Score: 805.4 bits (2079), Expect = 4.8e-230
Identity = 390/441 (88.44%), Postives = 414/441 (93.88%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           MF+VSVKPFFSVL+V L L+ AVLLLSP SS F   S S  ++RGYTDIWS+RRI EW P
Sbjct: 1   MFLVSVKPFFSVLLVTLSLSFAVLLLSPPSSFFSNTSFSFTNRRGYTDIWSVRRIVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWL+GH PALPAD+NGYIRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLRGHLPALPADTNGYIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVD+FI QMNGYVKV KELPPEFASKEP+HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDHFIGQMNGYVKVAKELPPEFASKEPYHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKAALCQ CYN LRLAK VE+KA++LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQVCYNGLRLAKSVEKKARELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY+GLS  SLEAIEA R DRKPWTG+LA+IWRKRGKCPLTPRETALI
Sbjct: 241 LRFEPDMVAYSQCEYKGLSPTSLEAIEATRGDRKPWTGQLAEIWRKRGKCPLTPRETALI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
           FQALHIPTNTNIYLAAGDGLME+EGFTSVYTNVVTKSSFLS++DFS+MHGNTKAALDYYV
Sbjct: 301 FQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSNNDFSSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSD YVATFFGNMDKMVAAMRAFNGKQ+TLFLSRRAFAEFTY GL+GKEL+QALWK H
Sbjct: 361 SINSDYYVATFFGNMDKMVAAMRAFNGKQKTLFLSRRAFAEFTYKGLEGKELDQALWKTH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSALSDCFCEFKL
Sbjct: 421 RDDFSMGRGSALSDCFCEFKL 441

BLAST of Cp4.1LG00g09350 vs. NCBI nr
Match: gi|659122366|ref|XP_008461105.1| (PREDICTED: uncharacterized protein At1g04910 isoform X2 [Cucumis melo])

HSP 1 Score: 768.8 bits (1984), Expect = 5.0e-219
Identity = 366/396 (92.42%), Postives = 379/396 (95.71%), Query Frame = 1

Query: 46  YTDIWSLRRIAEWSPCKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLL 105
           YTDIWS+RRI EW PCKWWL+GH PALPADSNGYIRVDCYGGLNQMRRDLCDGVG+ARLL
Sbjct: 14  YTDIWSVRRIVEWRPCKWWLRGHLPALPADSNGYIRVDCYGGLNQMRRDLCDGVGIARLL 73

Query: 106 NATLVLPKFEVAAYWNESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSK 165
           NATLVLPKFEVAAYWNESSGFADVFDVDYFI QMNGYVKV KELPPE ASKEP+HVDCSK
Sbjct: 74  NATLVLPKFEVAAYWNESSGFADVFDVDYFIGQMNGYVKVAKELPPELASKEPYHVDCSK 133

Query: 166 RKGQFDYIESVLPSLLEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKA 225
           RKGQFDYIESVLPSLLEHHYISITPAMSQRRDRYPQYAKAALCQ CYN LRLAKPVE+KA
Sbjct: 134 RKGQFDYIESVLPSLLEHHYISITPAMSQRRDRYPQYAKAALCQVCYNGLRLAKPVEKKA 193

Query: 226 KKLLEAIPKPFLSLHLRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWR 285
           ++LLEAIPKPFLSLHLRFEPDMVAYSQCEY+GLS  SLEAIE  R DRKPWTGELA+IWR
Sbjct: 194 RELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPTSLEAIEETRGDRKPWTGELAEIWR 253

Query: 286 KRGKCPLTPRETALIFQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDF 345
           KRGKCPLTPRETALIFQALHIPTNTNIYLAAGDGLME+EGFTSVYTNVVTKSSFL+SDDF
Sbjct: 254 KRGKCPLTPRETALIFQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLTSDDF 313

Query: 346 SNMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYN 405
           S+MHGNTKAALDYYVSINSD YVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTY 
Sbjct: 314 SSMHGNTKAALDYYVSINSDYYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYK 373

Query: 406 GLKGKELNQALWKAHRDDFVMGRGSALSDCFCEFKL 442
           GL+GKELNQALWK HRDDF MGRGSALSDCFCEFKL
Sbjct: 374 GLEGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL 409

BLAST of Cp4.1LG00g09350 vs. NCBI nr
Match: gi|645277327|ref|XP_008243720.1| (PREDICTED: uncharacterized protein At1g04910 [Prunus mume])

HSP 1 Score: 723.0 bits (1865), Expect = 3.1e-205
Identity = 350/441 (79.37%), Postives = 388/441 (87.98%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           M ++SVKP F++ V+ L L LAVLLLSP+S L+  A +         D+WS+ R+ EW P
Sbjct: 1   MVVISVKPIFTIGVLTLSLILAVLLLSPTSPLYK-APLFLTSPMNKLDVWSVERMVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWL+GH  ALP+ SNG+IRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLKGHLTALPSKSNGFIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVDYFI QMNG++KVVKELP + +SKEP HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDYFIQQMNGFIKVVKELPSDISSKEPFHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKA+LCQACY+ALRL K +E+KA +LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKASLCQACYSALRLTKSLEKKASELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY  LS+ S+EAIEAAR DRKPWTGE+A+IWRKRGKCPLTP ETA I
Sbjct: 241 LRFEPDMVAYSQCEYPDLSSVSMEAIEAARGDRKPWTGEVAQIWRKRGKCPLTPNETAFI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
            + L IPT+TNIYLAAGDGLME+EG TSVYTNV TKSS LS +DF +MHGNTKAALDYYV
Sbjct: 301 LKVLSIPTSTNIYLAAGDGLMEIEGLTSVYTNVFTKSSLLSGEDFKSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSDSY+AT+FGNMDKMVAAMRAFNG  +TLFLSRRAFAEFT+ GL+GKEL  ALWKAH
Sbjct: 361 SINSDSYIATYFGNMDKMVAAMRAFNGLYKTLFLSRRAFAEFTFQGLRGKELMNALWKAH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSAL DCFCEFKL
Sbjct: 421 RDDFAMGRGSALPDCFCEFKL 440

BLAST of Cp4.1LG00g09350 vs. NCBI nr
Match: gi|595932837|ref|XP_007215394.1| (hypothetical protein PRUPE_ppa005861mg [Prunus persica])

HSP 1 Score: 721.5 bits (1861), Expect = 9.1e-205
Identity = 352/441 (79.82%), Postives = 388/441 (87.98%), Query Frame = 1

Query: 1   MFIVSVKPFFSVLVVGLCLTLAVLLLSPSSSLFPGASVSSAHQRGYTDIWSLRRIAEWSP 60
           M ++SVKP F++ V+ L L L+VLLLSP+S L+  A +         DIWS+ R+ EW P
Sbjct: 1   MVVISVKPIFTIGVLTLSLILSVLLLSPTSPLYK-APLFLTSPMNKLDIWSVGRMVEWRP 60

Query: 61  CKWWLQGHRPALPADSNGYIRVDCYGGLNQMRRDLCDGVGVARLLNATLVLPKFEVAAYW 120
           CKWWLQGH  ALP+ +NG+IRVDCYGGLNQMRRDLCDGVG+ARLLNATLVLPKFEVAAYW
Sbjct: 61  CKWWLQGHLTALPSKNNGFIRVDCYGGLNQMRRDLCDGVGIARLLNATLVLPKFEVAAYW 120

Query: 121 NESSGFADVFDVDYFISQMNGYVKVVKELPPEFASKEPHHVDCSKRKGQFDYIESVLPSL 180
           NESSGFADVFDVDYFI QMNG++KVVKELP + +SKEP HVDCSKRKGQFDYIESVLPSL
Sbjct: 121 NESSGFADVFDVDYFIQQMNGFIKVVKELPSDISSKEPFHVDCSKRKGQFDYIESVLPSL 180

Query: 181 LEHHYISITPAMSQRRDRYPQYAKAALCQACYNALRLAKPVEEKAKKLLEAIPKPFLSLH 240
           LEHHYISITPAMSQRRDRYPQYAKA+LCQACY+ALRL K +E+KA +LLEAIPKPFLSLH
Sbjct: 181 LEHHYISITPAMSQRRDRYPQYAKASLCQACYSALRLTKSLEKKASELLEAIPKPFLSLH 240

Query: 241 LRFEPDMVAYSQCEYQGLSTASLEAIEAARVDRKPWTGELAKIWRKRGKCPLTPRETALI 300
           LRFEPDMVAYSQCEY  LS+AS+EAIEAAR DRKPWTGE+A+IWRKRGKCPLTP ETA I
Sbjct: 241 LRFEPDMVAYSQCEYPDLSSASMEAIEAARGDRKPWTGEVAQIWRKRGKCPLTPNETAFI 300

Query: 301 FQALHIPTNTNIYLAAGDGLMEMEGFTSVYTNVVTKSSFLSSDDFSNMHGNTKAALDYYV 360
            Q L IPTNTNIYLAAGDGLME+EG TSVYTNV TKSS LS +DF +MHGNTKAALDYYV
Sbjct: 301 LQVLSIPTNTNIYLAAGDGLMEIEGLTSVYTNVFTKSSLLSGEDFKSMHGNTKAALDYYV 360

Query: 361 SINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFAEFTYNGLKGKELNQALWKAH 420
           SINSDSY+AT+FGNMDKMVAAMRAFNG  +TLFLSRRAFAEFT+ GL+GKEL  AL KAH
Sbjct: 361 SINSDSYIATYFGNMDKMVAAMRAFNGLYKTLFLSRRAFAEFTFQGLRGKELMNALRKAH 420

Query: 421 RDDFVMGRGSALSDCFCEFKL 442
           RDDF MGRGSAL DCFCEFKL
Sbjct: 421 RDDFAMGRGSALPDCFCEFKL 440

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1491_ARATH3.8e-4129.62Uncharacterized protein At1g04910 OS=Arabidopsis thaliana GN=At1g04910 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KBV3_CUCSA3.3e-23088.44Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432630 PE=4 SV=1[more]
M5X4Y8_PRUPE6.4e-20579.82Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005861mg PE=4 SV=1[more]
A0A067KXV6_JATCU1.6e-20379.86Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06161 PE=4 SV=1[more]
A0A059BSS4_EUCGR1.2e-20078.08Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02536 PE=4 SV=1[more]
A0A061GHF3_THECC4.0e-19977.11O-fucosyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_028237 P... [more]
Match NameE-valueIdentityDescription
AT1G52630.16.1e-19173.97 O-fucosyltransferase family protein[more]
AT3G02250.11.0e-5733.25 O-fucosyltransferase family protein[more]
AT5G15740.11.8e-5436.31 O-fucosyltransferase family protein[more]
AT5G35570.11.2e-5031.46 O-fucosyltransferase family protein[more]
AT4G16650.13.6e-5031.87 O-fucosyltransferase family protein[more]
Match NameE-valueIdentityDescription
gi|659122364|ref|XP_008461103.1|1.4e-23289.57PREDICTED: uncharacterized protein At1g04910 isoform X1 [Cucumis melo][more]
gi|449436132|ref|XP_004135848.1|4.8e-23088.44PREDICTED: uncharacterized protein At1g04910 isoform X1 [Cucumis sativus][more]
gi|659122366|ref|XP_008461105.1|5.0e-21992.42PREDICTED: uncharacterized protein At1g04910 isoform X2 [Cucumis melo][more]
gi|645277327|ref|XP_008243720.1|3.1e-20579.37PREDICTED: uncharacterized protein At1g04910 [Prunus mume][more]
gi|595932837|ref|XP_007215394.1|9.1e-20579.82hypothetical protein PRUPE_ppa005861mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR024709FucosylTrfase_pln
IPR019378GDP-Fuc_O-FucTrfase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG00g09350.1Cp4.1LG00g09350.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019378GDP-fucose protein O-fucosyltransferasePFAMPF10250O-FucTcoord: 79..385
score: 7.5
IPR024709O-fucosyltransferase, plantPIRPIRSF009360UCP009360coord: 9..439
score: 1.1E
NoneNo IPR availablePANTHERPTHR31741FAMILY NOT NAMEDcoord: 1..439
score: 9.2E
NoneNo IPR availablePANTHERPTHR31741:SF2O-FUCOSYLTRANSFERASE FAMILY PROTEINcoord: 1..439
score: 9.2E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG00g09350Cucumber (Chinese Long) v3cpecucB0010
Cp4.1LG00g09350Cucurbita pepo (Zucchini)cpecpeB001