Cp4.1LG03g06530 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g06530
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Sialyltransferase-like protein
Location: Cp4.1LG03 : 3934033 .. 3938365 (-)
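The location field above packs the sequence ID, start, end, and strand into one string. A minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record); `parse_location` is a hypothetical helper name:

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates (not stated in the record).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span on the reference, introns included.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG03 : 3934033 .. 3938365 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 4333 bases on the minus strand of Cp4.1LG03.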
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGAAGTACTGAAAAATGAGCAAATATAGATAATTTAGATCAATTAAATCCAAAGAATTAGGATGAAAATAGGTTTTTTTTATTTTTTATTTCTTTTAATTACGGAGGGCTCGAACAGACGAACAGAGGGAAACTGAGAATTGATATAAAAACCCGATTCTTGAATTCCATACGGAGATCGGATTTGCAGAGTGATTCGCTTTCAGTCTCAGTGCCATTCATCCACTCTCTGGCATTCCAGAAATCTGGTCCTCGATGAGATCTCTCAAAGCTCCCTTCTCCAACAATGGTACCAGCGCCAGAAGGCTTACGCTTCTTCATCTTGTTTGCGCTGCTGCTCTCTTCTCTTTTCTTGTTTTTGTCATTCAATCCTCCTTCTTTGCAGGTTTCTCGTCCCACAGTTCCTTATGTCTCCCTTTACCATTCATCTCCTTCATCATTTCCTTCATGCTTTTCTCTTTTCTTCCTCAAATTCTCAGGTTATCAGCAGCCTATTGCAGATCTTAACAGGGAGGAGGTTCGGATTCTATCGGATTTCCAGTCCAGCGTTCAGCAATGCGTGGTTTGTATTTTTCTTAGCTTTACCTTAATTGCGTTCATACTCTGTTCCCTTTTTCGATTAAACATTTGCTGGAGGAGGGGAAAAGGCTCAGCCTATGTGATAATTTGAAATGTATTTGATTTCCTTTGATTAGTTCTGATTTCTAGATGAAGTGGATTTATTCATCTCAATGATATGGATTACGATGTTGTTTGTTGCATGTCCAATGCATCCTAATATTTCTAGGAGTTTAAAGAGAGCAGTATTTTCTAACCACTTTTTGGAGCATAGTTATTTGTGCCGTCTGATCTCCACTTCTAGTTATTTGAAGCCGCACATTTGAGATTTTCAATAACCTTGGGTATCAACCCAGTCATGACAAAGCTCACGGCCTACTGTTTCTACATATCCTGAAAATGTCTATTGTCTATTAGAAGACTGCATCTTAGTTGTTTTAATATTTTAAACCTATGCATGCATTGTGTATGTGGATTAGTTTCTGCAGGTTCATTAGCCTTTTGAACTTCTACTGATCATGAAATTGGGTACAGACAAAGAGGGGGCTTGGACTCACTGCACATATTATTGACCACTGCAAGTTGGTTCTCAAGTTCCCAGAAGGCACAAATAGTACATGGGTCTCTCTCTCTCTCACATCTGTGTGTATGTGTGCACGTTGTTCTTCTTTGTTGCTGATTTTTTAATGCTCTGTATTGGCAGTATAATGAGCAATTCAAGATTTATGAGCCTTTGGAGTATCCGTATGATGTGTGTGAGGCAATATTATTGTGGGAACAGGTTGGTAGTAGCCATCTTTAATTTGTGCATTCTCTTAAAGAGCATACATACCCATTCTAATGAATGTTGAGCAATACAGTATCGCAACATGACTACTGTCTTGACGAGAGAGTATTTGGATGCACGGCCTGATGGGTGGTTTGATTATGCTGCAAAGAGAATCGCCCAATTGTAGGGTTACTCTCCTTCAAGTTAATTTCATCTGGTAGATTTATTTGATCTAGAGACTGCAATTGTTTTTGGTGTAGTGAAATCTATTTACCTTTGTGTAGGGGAGCAGATAAATGTTACAATAGAACTCTTTGTGAGGAACACCTAAATTTGATCCTACCATCAAAACCTCCTTTTCACCCCAAGCAGTTTAAAACCTGTGCAGTTGTGGGAAACTCCGGAGATCTTCTAAAGACTGAGTTTGGTGAAGAAATTGATAGTCATGATGCTGTTATACGTGATAATGAGGCACCTGTAAATGAAGTAAGCAAGCTGTACTAGCCAATTCTTTCGATTACTTACTCAAGAATCTTTTCCGTCGTTTATTGATTTATTAAACGCCTTATTCAGGTTAGTTAGAGTCTTTTACTCAGTAGTTTGTCCTGTAAATAGTAATTTAATAACTCTTGCTGTTGTCATTTTTTCCTTGTGCATTTGTTTTGTCTCAAGT
ATCTAATTTCCTTTTAGATATAAAGTGTTCTTGAATGCCTACAAATAAATTCAGTATCAATGGTTGTTTCCCTTTCCATTTTCCCAAGTTATCCCCAGCTGATTTATGTCTCATAGAAACCATACTCATCAAGAGATCATTGAAAAAACCAAAATGTCCTTGTTAAAAAAAGATTGAGTTGTGAACTCGTAGATTAGAAGAAGATCTCTGCAGCATGCAATTTTGGACTTTTGCCATCTTCATTCAAATTGTGGAAAATGGATAGTATCAGGAACTTCAATAGGGAGACTTTCTGATGCTGAAAGTTGAATCTGAATCCTTGTGGTCTGTAACACCTAGGGAACTCTGGTTTATTGTCTTTGAACATTAAATTTAATGCTTCACATTTTGTGATGTTTTGTGACATTGATTTATTACATCAAGTACTCCAGAAGATATTTAGACCCTCATGTTGACCCATATGATCCCCTGTAGAAATATGCCAAATATGTTGGCCTGAAGAGGGATTTTCGTCTTGTTGTAAGAGGTGCTGCTGGCAATATGATTGCAATTCTAAATGGGTCTGGTAAGGTGTCTTATTTCTGTTGTTAAATATTAACTGGACTCTGTTTAAGATGTCAGGTCTGAGAATTTCTTTCTTTCTTCAGATGATGAAGTACTTGTTATAAAGAGCGTGATTCACAGAGATTTTAATGCAATGATTAAGGTAAAAGCTACTTCATATTCTCATGTGGCAAATTTTCTTATTACTGTACAGTAGCGCTGAAAGGCGAGGAAATTTAAAAATGGTATGAGGCTAGGAGATGATCTAGTTTGGAACTGTAACAATTTGTCTGAAAGGGAAAGGCTTGGAATTGGCGATTGTTTCCCTTTCTGTATGCTTTGGAAACTTATTACCTTTTGTAAAAGATTTGTATGTTTTAATTTGCAGCTTATTCCCAATCCAGTTTATCTCTTCCAAGGTATTGTTCTACGTAGAGGTGCCAAGGGAACTGGAATGAAATCTATTGAATTAGCTCTTTCTATGTGTGATATTGTTGACATATATGGTTTCACTGTTGACCCTGGCTACACTGAATGGTAAGTGTAACTTGCTGTAGCACTCAGTTAAATTGGCCTTAAGTCATTTTTTTAGCTTATTTGCGCGTTAAAGGAGTAAGAAACTTGGTTGTTTGGAACTTTTATGGCTTCATTGATTACCACACGCATCAAATGCGACTCCGTTGTCACCAACTTCTGTTTATCTTTTATCTCACCGTCATCGTCGCTTACTTGCAGCTGGTCTCTTGTGTTTTCTCTGTTCCAGGACAAGGTACTTCTCCACACCCAGGAAAGGCCATAATCCACTTCAAGGAAGGGCATACTACCAGCTATTAGAGTGTCTTGGTGTAAGTTTTTGCCCCCCCCNCCCCCCCCCCCCCCCCCCCAAAAAAAAGGCGTAATCCATGCCTTGATACTCATATGGCTTAGGACTTTTCAGGTTATAAGGATACACTCTCCCATGAGATCTAAGAGGAAGCAAGATTGGTCTGATGTTCCAGATCGAAAAACGATTAGGAGGGCTCACACTGCAGCCTTGAGCTTGAAGAAGAGTCAATCAGGTCAAGCGGGTGATTTGGGACAGTTTGGCAGCTGTAAAGTGTGGGGCAATGTAGACCCTGGGACCGAAGGTCCTATCTCAGGATCCCCCGATATGAGCGATACAAGGAAACACTCCGGTTATAGTAAGTGGGAACTTACACCCTTCAACAGTTTGAGAAAAGAAGCACAAGATCATTATAAGCAGATGGAAGGGGTCTCCCTGTACAAAATGGACGGCAATAAGCTGGATGATCTTGTTTGTGTGAGACATTCTTTCGATTCTAGCGCATAACAACGTTCTCGACGATATCCTATGCACCAATACCTTCTTTCCAATTAGAATTCACCCATTCAACATGTCTGAACTGGCTCTCCTTGCTAATTGTATTATGGTACACACCGTGGCTAGGCACAAGCAA
TAGAGGCATCACTGAAACCCCATTCCTTTCCGAGAGAGGGCGGTCGAGACATCGTGTGCCATTGACAAAAGATGATTTATCCTACTACTTGGGTCTCCATGCCTGGACTTCAGAGGTCTGTGGATTGACAAAATTAGGTTGTGATACAATGTTTAGTAGGATGTTAGCAAGTACGAAAATTGAATCACTTTATTCTTTGGCTTTGTTCCTATGTAGGTTGTAAAGCTTGTATTTTATCCTCTCAAACAAAAGTGTATCATTAGTTGTTTTCGCCATTATTTGTAAACACTCTCAAGGTAAAAACGTCCATTTTCTTTTTCTTTACGAGGACTT

mRNA sequence

ATGAATCTCAGTGCCATTCATCCACTCTCTGGCATTCCAGAAATCTGGTCCTCGATGAGATCTCTCAAAGCTCCCTTCTCCAACAATGGTACCAGCGCCAGAAGGCTTACGCTTCTTCATCTTGTTTGCGCTGCTGCTCTCTTCTCTTTTCTTGTTTTTGTCATTCAATCCTCCTTCTTTGCAGGTTATCAGCAGCCTATTGCAGATCTTAACAGGGAGGAGGTTCGGATTCTATCGGATTTCCAGTCCAGCGTTCAGCAATGCGTGACAAAGAGGGGGCTTGGACTCACTGCACATATTATTGACCACTGCAAGTTGGTTCTCAAGTTCCCAGAAGGCACAAATAGTACATGGTATAATGAGCAATTCAAGATTTATGAGCCTTTGGAGTATCCGTATGATGTGTGTGAGGCAATATTATTGTGGGAACAGTATCGCAACATGACTACTGTCTTGACGAGAGAGTATTTGGATGCACGGCCTGATGGGTGGTTTGATTATGCTGCAAAGAGAATCGCCCAATTGGGAGCAGATAAATGTTACAATAGAACTCTTTGTGAGGAACACCTAAATTTGATCCTACCATCAAAACCTCCTTTTCACCCCAAGCAGTTTAAAACCTGTGCAGTTGTGGGAAACTCCGGAGATCTTCTAAAGACTGAGTTTGGTGAAGAAATTGATAGTCATGATGCTGTTATACGTGATAATGAGGCACCTGTAAATGAAAAATATGCCAAATATGTTGGCCTGAAGAGGGATTTTCGTCTTGTTGTAAGAGGTGCTGCTGGCAATATGATTGCAATTCTAAATGGGTCTGATGATGAAGTACTTGTTATAAAGAGCGTGATTCACAGAGATTTTAATGCAATGATTAAGCTTATTCCCAATCCAGTTTATCTCTTCCAAGGTATTGTTCTACGTAGAGGTGCCAAGGGAACTGGAATGAAATCTATTGAATTAGCTCTTTCTATGTGTGATATTGTTGACATATATGGTTTCACTGTTGACCCTGGCTACACTGAATGGTTATAAGGATACACTCTCCCATGAGATCTAAGAGGAAGCAAGATTGGTCTGATGTTCCAGATCGAAAAACGATTAGGAGGGCTCACACTGCAGCCTTGAGCTTGAAGAAGAGTCAATCAGGTCAAGCGGGTGATTTGGGACAGTTTGGCAGCTGTAAAGTGTGGGGCAATGTAGACCCTGGGACCGAAGGTCCTATCTCAGGATCCCCCGATATGAGCGATACAAGGAAACACTCCGGTTATAGTAAGTGGGAACTTACACCCTTCAACAGTTTGAGAAAAGAAGCACAAGATCATTATAAGCAGATGGAAGGGGTCTCCCTGTACAAAATGGACGGCAATAAGCTGGATGATCTTGTTTGTGTGAGACATTCTTTCGATTCTAGCGCATAACAACGTTCTCGACGATATCCTATGCACCAATACCTTCTTTCCAATTAGAATTCACCCATTCAACATGTCTGAACTGGCTCTCCTTGCTAATTGTATTATGGTACACACCGTGGCTAGGCACAAGCAATAGAGGCATCACTGAAACCCCATTCCTTTCCGAGAGAGGGCGGTCGAGACATCGTGTGCCATTGACAAAAGATGATTTATCCTACTACTTGGGTCTCCATGCCTGGACTTCAGAGGTCTGTGGATTGACAAAATTAGGTTGTGATACAATGTTTAGTAGGATGTTAGCAAGTACGAAAATTGAATCACTTTATTCTTTGGCTTTGTTCCTATGTAGGTTGTAAAGCTTGTATTTTATCCTCTCAAACAAAAGTGTATCATTAGTTGTTTTCGCCATTATTTGTAAACACTCTCAAGGTAAAAACGTCCATTTTCTTTTTCTTTACGAGGACTT

Coding sequence (CDS)

ATGAATCTCAGTGCCATTCATCCACTCTCTGGCATTCCAGAAATCTGGTCCTCGATGAGATCTCTCAAAGCTCCCTTCTCCAACAATGGTACCAGCGCCAGAAGGCTTACGCTTCTTCATCTTGTTTGCGCTGCTGCTCTCTTCTCTTTTCTTGTTTTTGTCATTCAATCCTCCTTCTTTGCAGGTTATCAGCAGCCTATTGCAGATCTTAACAGGGAGGAGGTTCGGATTCTATCGGATTTCCAGTCCAGCGTTCAGCAATGCGTGACAAAGAGGGGGCTTGGACTCACTGCACATATTATTGACCACTGCAAGTTGGTTCTCAAGTTCCCAGAAGGCACAAATAGTACATGGTATAATGAGCAATTCAAGATTTATGAGCCTTTGGAGTATCCGTATGATGTGTGTGAGGCAATATTATTGTGGGAACAGTATCGCAACATGACTACTGTCTTGACGAGAGAGTATTTGGATGCACGGCCTGATGGGTGGTTTGATTATGCTGCAAAGAGAATCGCCCAATTGGGAGCAGATAAATGTTACAATAGAACTCTTTGTGAGGAACACCTAAATTTGATCCTACCATCAAAACCTCCTTTTCACCCCAAGCAGTTTAAAACCTGTGCAGTTGTGGGAAACTCCGGAGATCTTCTAAAGACTGAGTTTGGTGAAGAAATTGATAGTCATGATGCTGTTATACGTGATAATGAGGCACCTGTAAATGAAAAATATGCCAAATATGTTGGCCTGAAGAGGGATTTTCGTCTTGTTGTAAGAGGTGCTGCTGGCAATATGATTGCAATTCTAAATGGGTCTGATGATGAAGTACTTGTTATAAAGAGCGTGATTCACAGAGATTTTAATGCAATGATTAAGCTTATTCCCAATCCAGTTTATCTCTTCCAAGGTATTGTTCTACGTAGAGGTGCCAAGGGAACTGGAATGAAATCTATTGAATTAGCTCTTTCTATGTGTGATATTGTTGACATATATGGTTTCACTGTTGACCCTGGCTACACTGAATGGTTATAA

Protein sequence

MNLSAIHPLSGIPEIWSSMRSLKAPFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIYGFTVDPGYTEWL
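
The protein sequence above corresponds to a translation of the coding sequence. A minimal sketch of that step, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and the two inputs are the first 24 and last 21 bases of the CDS listed above:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown codons (e.g. containing N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAATCTCAGTGCCATTCATCCA"))  # start of the CDS
print(translate("GGCTACACTGAATGGTTATAA"))     # end of the CDS, including the stop codon
```

The two calls print `MNLSAIHP` and `GYTEWL`, which agree with the first and last residues of the protein sequence.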
BLAST of Cp4.1LG03g06530 vs. Swiss-Prot
Match: SIA1_ARATH (Sialyltransferase-like protein 1 OS=Arabidopsis thaliana GN=SIA1 PE=2 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 3.1e-152
Identity = 256/311 (82.32%), Positives = 278/311 (89.39%), Query Frame = 1

Query: 32  SARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTK 91
           + R+L LL L+   A+FS  VF IQSSFFA   + + DL  E+++ILSDFQSSVQQCV  
Sbjct: 6   AGRKLPLLQLLGCVAVFSVFVFTIQSSFFADNNRKL-DLQPEDIQILSDFQSSVQQCVAN 65

Query: 92  RGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTV 151
           RGLGL+AHIIDHC L+LKFPEGTNSTWYN QFK++E LE+ Y+VCEA+LLWEQYRNMTTV
Sbjct: 66  RGLGLSAHIIDHCNLILKFPEGTNSTWYNAQFKVFEALEFKYNVCEAVLLWEQYRNMTTV 125

Query: 152 LTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVV 211
           LTREYLD RPDGW DYAA RIAQLGADKCYNRTLCEEHLN+ILP+KPPFHP+QF  CAVV
Sbjct: 126 LTREYLDVRPDGWLDYAAMRIAQLGADKCYNRTLCEEHLNVILPAKPPFHPRQFHKCAVV 185

Query: 212 GNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNG 271
           GNSGDLLKTEFGEEIDSHDAV RDNEAPVNEKYAKYVG+KRDFRLVVRGAA NMI ILNG
Sbjct: 186 GNSGDLLKTEFGEEIDSHDAVFRDNEAPVNEKYAKYVGVKRDFRLVVRGAARNMIKILNG 245

Query: 272 SDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY 331
           SD+EVL+IKSV HRDFN MIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY
Sbjct: 246 SDNEVLIIKSVTHRDFNEMIKRIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY 305

Query: 332 GFTVDPGYTEW 343
           GFTVDPGYTEW
Sbjct: 306 GFTVDPGYTEW 315
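
The percentages on each HSP statistics line are the match counts divided by the alignment length. A minimal sketch that recomputes them from the `matches/length` pairs, using the counts of the alignment above; `hsp_percentages` is a hypothetical helper name:

```python
import re

def hsp_percentages(stats_line):
    """Recompute a percentage for every 'matches/length' pair on an HSP line."""
    pairs = re.findall(r"(\d+)/(\d+)", stats_line)
    return [round(100 * int(matches) / int(length), 2) for matches, length in pairs]

line = "Identity = 256/311 (82.32%), Positives = 278/311 (89.39%)"
print(hsp_percentages(line))
```

For this alignment the recomputed values are 82.32 and 89.39, matching the figures reported in parentheses.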

BLAST of Cp4.1LG03g06530 vs. Swiss-Prot
Match: STLP5_ORYSJ (Sialyltransferase-like protein 5 OS=Oryza sativa subsp. japonica GN=STLP5 PE=2 SV=1)

HSP 1 Score: 505.0 bits (1299), Expect = 6.5e-142
Identity = 243/319 (76.18%), Positives = 270/319 (84.64%), Query Frame = 1

Query: 25  PFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIA-DLNREEVRILSDFQS 84
           P S+     RR T++ L+  A  F   V  IQSSFF   +     DL+ +EVR LS FQS
Sbjct: 7   PLSSLPPPPRRPTVVLLLGLALAFCLAVLSIQSSFFTAPRLASRLDLDSDEVRALSGFQS 66

Query: 85  SVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWE 144
            VQQCV +RGLGLTA IIDHCKLVL+FP+GTNSTWYN QFK +EPLEY YDVCE ILLWE
Sbjct: 67  RVQQCVARRGLGLTADIIDHCKLVLRFPKGTNSTWYNTQFKYFEPLEYNYDVCETILLWE 126

Query: 145 QYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPK 204
           QYRNMTTVLTREYLD RPDGW DYAAKRIAQLGADKCYNRTLCEE L+++LP+KPPFHP+
Sbjct: 127 QYRNMTTVLTREYLDVRPDGWLDYAAKRIAQLGADKCYNRTLCEELLSVLLPAKPPFHPR 186

Query: 205 QFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAG 264
           QF TCAVVGNSGDLLKTEFG+EID+HDAV RDNEAPVN+KYAKYVGLKRDFRLVVRGAA 
Sbjct: 187 QFATCAVVGNSGDLLKTEFGQEIDAHDAVFRDNEAPVNKKYAKYVGLKRDFRLVVRGAAR 246

Query: 265 NMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALS 324
           NM  IL GS DEVL+IKS+ H++ NA+IK +PNPVYLFQGIVLRRGAKGTGMKSIELALS
Sbjct: 247 NMAPILKGSSDEVLIIKSLTHKEINAVIKELPNPVYLFQGIVLRRGAKGTGMKSIELALS 306

Query: 325 MCDIVDIYGFTVDPGYTEW 343
           MCDI+D+YGFTVDP YTEW
Sbjct: 307 MCDIIDMYGFTVDPNYTEW 325

BLAST of Cp4.1LG03g06530 vs. Swiss-Prot
Match: STLP4_ORYSJ (Sialyltransferase-like protein 4 OS=Oryza sativa subsp. japonica GN=STLP4 PE=2 SV=1)

HSP 1 Score: 289.3 bits (739), Expect = 5.6e-77
Identity = 153/317 (48.26%), Positives = 209/317 (65.93%), Query Frame = 1

Query: 36  LTLLHLVCAAALFS----FLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTK 95
           + +L L  AAA+FS     LV++   S + G +   ADL       L   QS   +CV  
Sbjct: 1   MRVLPLALAAAIFSGVTAILVYLSGLSSYGGARVSDADL-----AALGALQSGFSKCVDA 60

Query: 96  RGLGLTAHI-IDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTT 155
            GLGL A    D+C++V+++P  T+S W + +    E L + +++CEA+  WEQ RN TT
Sbjct: 61  NGLGLKAIPGEDYCRVVIQYPSDTDSKWKDPKTGEPEGLSFEFNLCEAVASWEQVRNSTT 120

Query: 156 VLTREYLDARPDGWFDYAAKRIAQ-LGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCA 215
           +LT+EY+DA P+GW +YA +RI + +  +KC NRTLC E L+L+LP  PP+ P+QF  CA
Sbjct: 121 ILTKEYIDALPNGWEEYAWRRINKGIHLNKCQNRTLCMEKLSLVLPETPPYVPRQFGRCA 180

Query: 216 VVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAG--NMIA 275
           VVGNSGDLLKT+FG+EIDS+D VIR+N AP+ + Y +YVG K  FRL+ RG+A   + + 
Sbjct: 181 VVGNSGDLLKTKFGDEIDSYDVVIRENGAPI-QNYTEYVGTKSTFRLLNRGSAKALDKVV 240

Query: 276 ILNGSDDEVLVIKSVIHRDFNAMIKLIP--NPVYLFQGIVLRRGAKGTGMKSIELALSMC 335
            L+ +  E L++K+ IH   N MI+ IP  NPVYL  G      AKGTG+K++E ALSMC
Sbjct: 241 ELDETKKEALIVKTTIHDIMNQMIREIPITNPVYLMLGTSFGSSAKGTGLKALEFALSMC 300

Query: 336 DIVDIYGFTVDPGYTEW 343
           D VD+YGFTVDPGY EW
Sbjct: 301 DSVDMYGFTVDPGYKEW 311

BLAST of Cp4.1LG03g06530 vs. Swiss-Prot
Match: SIA2_ARATH (Sialyltransferase-like protein 2 OS=Arabidopsis thaliana GN=SIA2 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 6.9e-75
Identity = 142/313 (45.37%), Positives = 202/313 (64.54%), Query Frame = 1

Query: 36  LTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKRGLG 95
           + LLHL+   AL + +  V+                 E++  L   Q+  Q+CV+  GLG
Sbjct: 1   MKLLHLIFLLALTTGISAVLIYIIGVSNLYESNRFTNEDLEALQSLQNGFQKCVSANGLG 60

Query: 96  LTAHI-IDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVLTR 155
           L A +  D+CK+ + FP+ T   W + +    E L Y +D+CEA+  WEQ RN +T+LT+
Sbjct: 61  LQAAMGRDYCKVSINFPKDTVPKWKDPKSGELEGLSYEFDLCEAVATWEQVRNSSTILTK 120

Query: 156 EYLDARPDGWFDYAAKRIAQ-LGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVGN 215
           EY+DA P+GW DYA +RI + +  ++C N++LC E L+L+LP  PP+ P+QF  CAV+GN
Sbjct: 121 EYIDALPNGWEDYAWRRINKGIQLNRCQNKSLCIEKLSLVLPETPPYFPRQFGRCAVIGN 180

Query: 216 SGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAG--NMIAILNG 275
           SGDLLKT+FG+EID++D V+R+N AP+ + Y +YVG K  FRL+ RG+A   + +  L+ 
Sbjct: 181 SGDLLKTKFGKEIDTYDTVLRENGAPI-QNYKEYVGEKSTFRLLNRGSAKALDKVVELDE 240

Query: 276 SDDEVLVIKSVIHRDFNAMIKLIP--NPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVD 335
              EVL++K+ IH   N MI+ +P  NPVYL  G      AKGTG+K++E ALS CD VD
Sbjct: 241 KKQEVLLVKTTIHDIMNKMIREVPIKNPVYLMLGASFGSAAKGTGLKALEFALSTCDSVD 300

Query: 336 IYGFTVDPGYTEW 343
           +YGFTVDPGY EW
Sbjct: 301 MYGFTVDPGYKEW 312

BLAST of Cp4.1LG03g06530 vs. Swiss-Prot
Match: SIA8D_HUMAN (CMP-N-acetylneuraminate-poly-alpha-2,8-sialyltransferase OS=Homo sapiens GN=ST8SIA4 PE=1 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 6.3e-12
Identity = 42/73 (57.53%), Positives = 50/73 (68.49%), Query Frame = 1

Query: 183 RTLCEEH-LNLILPSKPPFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVN 242
           RTL   H L+ +LP   P   ++FKTCAVVGNSG LL +E G+EIDSH+ VIR N APV 
Sbjct: 116 RTLNISHDLHSLLPEVSPMKNRRFKTCAVVGNSGILLDSECGKEIDSHNFVIRCNLAPVV 175

Query: 243 EKYAKYVGLKRDF 255
           E +A  VG K DF
Sbjct: 176 E-FAADVGTKSDF 187

BLAST of Cp4.1LG03g06530 vs. TrEMBL
Match: A0A0A0L8S8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G599430 PE=3 SV=1)

HSP 1 Score: 636.3 bits (1640), Expect = 2.1e-179
Identity = 307/324 (94.75%), Positives = 313/324 (96.60%), Query Frame = 1

Query: 19  MRSLKAPFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRIL 78
           MRSLK   SNNG   RRLTLLHLVCAAALFSFLVFVIQSSFFAGY QP+ DLNREEVRIL
Sbjct: 1   MRSLKPASSNNGVG-RRLTLLHLVCAAALFSFLVFVIQSSFFAGYHQPLVDLNREEVRIL 60

Query: 79  SDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 138
           SDFQS+VQQCV  RGLGLTAHIIDHCKL+LKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA
Sbjct: 61  SDFQSNVQQCVANRGLGLTAHIIDHCKLILKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 120

Query: 139 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKP 198
           ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNR+LCEEHLNLILPSKP
Sbjct: 121 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRSLCEEHLNLILPSKP 180

Query: 199 PFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 258
           PFHP+QFKTCAVVGNSGDLLKTEFG+EIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV
Sbjct: 181 PFHPRQFKTCAVVGNSGDLLKTEFGDEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 240

Query: 259 RGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 318
           RGAA NMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI
Sbjct: 241 RGAARNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 300

Query: 319 ELALSMCDIVDIYGFTVDPGYTEW 343
           ELALSMCDIVDIYGFTVDPGYTEW
Sbjct: 301 ELALSMCDIVDIYGFTVDPGYTEW 323

BLAST of Cp4.1LG03g06530 vs. TrEMBL
Match: A0A059AP51_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I01604 PE=3 SV=1)

HSP 1 Score: 568.9 bits (1465), Expect = 4.1e-159
Identity = 269/309 (87.06%), Positives = 290/309 (93.85%), Query Frame = 1

Query: 34  RRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKRG 93
           R LT+LHLVCAAALFS LVFVIQSS+FAG +Q    ++ E+VRILSDFQSSV+QCV  RG
Sbjct: 13  RNLTILHLVCAAALFSLLVFVIQSSYFAGSRQ--VPISGEDVRILSDFQSSVRQCVANRG 72

Query: 94  LGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVLT 153
           LGLTAHIIDHCKL+LKFPEGTNSTWYNEQFKI+EPLEY YDVCEAILLWEQYRNMTTVLT
Sbjct: 73  LGLTAHIIDHCKLILKFPEGTNSTWYNEQFKIFEPLEYSYDVCEAILLWEQYRNMTTVLT 132

Query: 154 REYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVGN 213
           REYLDARPDGW +YAAKRIAQLGADKCYNRTLCE+HLN+ILP+KPPFHP+QF++CAVVGN
Sbjct: 133 REYLDARPDGWLEYAAKRIAQLGADKCYNRTLCEDHLNIILPAKPPFHPRQFRSCAVVGN 192

Query: 214 SGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNGSD 273
           SGDLLKTEFG+EID HDAVIRDNEAPVNEKYAK+VGLKRDFRLVVRGAA NM+ ILNGS 
Sbjct: 193 SGDLLKTEFGKEIDGHDAVIRDNEAPVNEKYAKHVGLKRDFRLVVRGAARNMVKILNGSA 252

Query: 274 DEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIYGF 333
           DEVL+IKSV HRDFN MIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSMCD+VDIYGF
Sbjct: 253 DEVLIIKSVTHRDFNTMIKSIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDVVDIYGF 312

Query: 334 TVDPGYTEW 343
           TVDPGYTEW
Sbjct: 313 TVDPGYTEW 319

BLAST of Cp4.1LG03g06530 vs. TrEMBL
Match: A0A067L0A8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26870 PE=3 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 2.7e-158
Identity = 268/318 (84.28%), Positives = 287/318 (90.25%), Query Frame = 1

Query: 25  PFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSS 84
           P+ N+  S R   +LHL+C AA FS +VF IQSSFFAG +   +DLN+EE+  LS+FQ S
Sbjct: 3   PYKNSNNSRRPAAVLHLLCVAAFFSIIVFAIQSSFFAGNRN--SDLNKEEIHTLSEFQFS 62

Query: 85  VQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQ 144
           VQQCV  RGLGLTAHI+DHCKL LKFPEGTNSTWYN QFKIYEPLEY YDVC+AILLWEQ
Sbjct: 63  VQQCVANRGLGLTAHIVDHCKLTLKFPEGTNSTWYNAQFKIYEPLEYHYDVCDAILLWEQ 122

Query: 145 YRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQ 204
           YRNMTTVLTREYLDARPDGW DYAAKRIAQLG DKCYNRTLCEEHLNLILP+KPPF P+Q
Sbjct: 123 YRNMTTVLTREYLDARPDGWLDYAAKRIAQLGDDKCYNRTLCEEHLNLILPAKPPFRPRQ 182

Query: 205 FKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGN 264
           F+TCAVVGNSGDLLKTEFG+EIDSHDAVIRDNEAPVNEKYAK+VGLKRDFRLVVRGAA N
Sbjct: 183 FQTCAVVGNSGDLLKTEFGKEIDSHDAVIRDNEAPVNEKYAKHVGLKRDFRLVVRGAARN 242

Query: 265 MIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSM 324
           M+ ILNGS DEVL+IKSV HRDFNAMIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSM
Sbjct: 243 MVTILNGSTDEVLIIKSVTHRDFNAMIKSIPNPVYLFQGIVLRRGAKGTGMKSIELALSM 302

Query: 325 CDIVDIYGFTVDPGYTEW 343
           CDIVDIYGFTVDPGYTEW
Sbjct: 303 CDIVDIYGFTVDPGYTEW 318

BLAST of Cp4.1LG03g06530 vs. TrEMBL
Match: A0A068V7Y6_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00019215001 PE=3 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 6.5e-157
Identity = 277/329 (84.19%), Positives = 293/329 (89.06%), Query Frame = 1

Query: 19  MRSLKAPFSNN-GTSARRLTLLHLVCAAALFSFLVFVIQSSFF-AGYQQPIA---DLNRE 78
           + S  A  +NN    +RR  LLHLVCAAA+FS +VF+IQSSFF AG Q+  A     N E
Sbjct: 9   LSSASASAANNLALISRRPALLHLVCAAAVFSLIVFLIQSSFFTAGNQKQRAVNIHNNEE 68

Query: 79  EVRILSDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPY 138
           E RILSDFQSSVQQCV  RGLGLTA IIDHCKLVLKFP+GTNSTWYNEQFKI+EPLEY Y
Sbjct: 69  EFRILSDFQSSVQQCVANRGLGLTAIIIDHCKLVLKFPQGTNSTWYNEQFKIFEPLEYTY 128

Query: 139 DVCEAILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLI 198
           D CEA+LLWEQYRNMTTVLTREYLDARPDGW DYAAKRIAQLGADKCYNRTLCEEHLNL+
Sbjct: 129 DTCEALLLWEQYRNMTTVLTREYLDARPDGWLDYAAKRIAQLGADKCYNRTLCEEHLNLL 188

Query: 199 LPSKPPFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRD 258
           LP+KPPFHP+QF TCAVVGNSGDLLKTEFGEEID+HDAVIRDNEAPVNEKYAKYVGLKRD
Sbjct: 189 LPAKPPFHPRQFATCAVVGNSGDLLKTEFGEEIDTHDAVIRDNEAPVNEKYAKYVGLKRD 248

Query: 259 FRLVVRGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGT 318
           FRLVVRGAA NM+ ILNGS DEVL+IKSV HRDFNAMIK IPNPVYLFQGIVLRRGAKGT
Sbjct: 249 FRLVVRGAARNMVTILNGSVDEVLIIKSVTHRDFNAMIKGIPNPVYLFQGIVLRRGAKGT 308

Query: 319 GMKSIELALSMCDIVDIYGFTVDPGYTEW 343
           GMKSIELALSMCD VDIYGFTVDPGYTEW
Sbjct: 309 GMKSIELALSMCDDVDIYGFTVDPGYTEW 337

BLAST of Cp4.1LG03g06530 vs. TrEMBL
Match: A0A076KX85_TOBAC (Sialyltransferase-like protein OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 3.2e-156
Identity = 263/310 (84.84%), Positives = 288/310 (92.90%), Query Frame = 1

Query: 33  ARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKR 92
           +R+  L+ L+C AALFS ++  IQSSFF G    + +++REE+RILSDFQS++QQCV  R
Sbjct: 11  SRKPALVRLLCVAALFSIILLAIQSSFFTGSWNAV-NISREEIRILSDFQSNLQQCVANR 70

Query: 93  GLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVL 152
           GLGLTAHIIDHC ++LKFPEGTNSTWYNEQFKI+EPLEY YDVCEAILLWEQYRNMTTVL
Sbjct: 71  GLGLTAHIIDHCNVILKFPEGTNSTWYNEQFKIFEPLEYKYDVCEAILLWEQYRNMTTVL 130

Query: 153 TREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVG 212
           TREYLD+RPDGWFDYAAKRIAQLGADKCYN+TLCEEHLNLILP+KPPFHP+QF+ CAVVG
Sbjct: 131 TREYLDSRPDGWFDYAAKRIAQLGADKCYNQTLCEEHLNLILPAKPPFHPRQFRKCAVVG 190

Query: 213 NSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNGS 272
           NSGDLLKT+FGEEIDSHDAVIRDNEAPVNEKYAK+VGLKRDFRLVVRGAA NMI ILNGS
Sbjct: 191 NSGDLLKTQFGEEIDSHDAVIRDNEAPVNEKYAKHVGLKRDFRLVVRGAARNMIKILNGS 250

Query: 273 DDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIYG 332
           DDEVL+IKSVIHRDFNAMIK I NPVYLFQGIVLRRGAKGTGMKSIELALSMCD+VDIYG
Sbjct: 251 DDEVLIIKSVIHRDFNAMIKKIRNPVYLFQGIVLRRGAKGTGMKSIELALSMCDVVDIYG 310

Query: 333 FTVDPGYTEW 343
           FTVDPGYTEW
Sbjct: 311 FTVDPGYTEW 319

BLAST of Cp4.1LG03g06530 vs. TAIR10
Match: AT1G08660.1 (AT1G08660.1 MALE GAMETOPHYTE DEFECTIVE 2)

HSP 1 Score: 539.3 bits (1388), Expect = 1.8e-153
Identity = 256/311 (82.32%), Positives = 278/311 (89.39%), Query Frame = 1

Query: 32  SARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTK 91
           + R+L LL L+   A+FS  VF IQSSFFA   + + DL  E+++ILSDFQSSVQQCV  
Sbjct: 6   AGRKLPLLQLLGCVAVFSVFVFTIQSSFFADNNRKL-DLQPEDIQILSDFQSSVQQCVAN 65

Query: 92  RGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTV 151
           RGLGL+AHIIDHC L+LKFPEGTNSTWYN QFK++E LE+ Y+VCEA+LLWEQYRNMTTV
Sbjct: 66  RGLGLSAHIIDHCNLILKFPEGTNSTWYNAQFKVFEALEFKYNVCEAVLLWEQYRNMTTV 125

Query: 152 LTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVV 211
           LTREYLD RPDGW DYAA RIAQLGADKCYNRTLCEEHLN+ILP+KPPFHP+QF  CAVV
Sbjct: 126 LTREYLDVRPDGWLDYAAMRIAQLGADKCYNRTLCEEHLNVILPAKPPFHPRQFHKCAVV 185

Query: 212 GNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNG 271
           GNSGDLLKTEFGEEIDSHDAV RDNEAPVNEKYAKYVG+KRDFRLVVRGAA NMI ILNG
Sbjct: 186 GNSGDLLKTEFGEEIDSHDAVFRDNEAPVNEKYAKYVGVKRDFRLVVRGAARNMIKILNG 245

Query: 272 SDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY 331
           SD+EVL+IKSV HRDFN MIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY
Sbjct: 246 SDNEVLIIKSVTHRDFNEMIKRIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIY 305

Query: 332 GFTVDPGYTEW 343
           GFTVDPGYTEW
Sbjct: 306 GFTVDPGYTEW 315

BLAST of Cp4.1LG03g06530 vs. TAIR10
Match: AT3G48820.1 (AT3G48820.1 Glycosyltransferase family 29 (sialyltransferase) family protein)

HSP 1 Score: 282.3 bits (721), Expect = 3.9e-76
Identity = 142/313 (45.37%), Positives = 202/313 (64.54%), Query Frame = 1

Query: 36  LTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKRGLG 95
           + LLHL+   AL + +  V+                 E++  L   Q+  Q+CV+  GLG
Sbjct: 1   MKLLHLIFLLALTTGISAVLIYIIGVSNLYESNRFTNEDLEALQSLQNGFQKCVSANGLG 60

Query: 96  LTAHI-IDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVLTR 155
           L A +  D+CK+ + FP+ T   W + +    E L Y +D+CEA+  WEQ RN +T+LT+
Sbjct: 61  LQAAMGRDYCKVSINFPKDTVPKWKDPKSGELEGLSYEFDLCEAVATWEQVRNSSTILTK 120

Query: 156 EYLDARPDGWFDYAAKRIAQ-LGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVGN 215
           EY+DA P+GW DYA +RI + +  ++C N++LC E L+L+LP  PP+ P+QF  CAV+GN
Sbjct: 121 EYIDALPNGWEDYAWRRINKGIQLNRCQNKSLCIEKLSLVLPETPPYFPRQFGRCAVIGN 180

Query: 216 SGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAG--NMIAILNG 275
           SGDLLKT+FG+EID++D V+R+N AP+ + Y +YVG K  FRL+ RG+A   + +  L+ 
Sbjct: 181 SGDLLKTKFGKEIDTYDTVLRENGAPI-QNYKEYVGEKSTFRLLNRGSAKALDKVVELDE 240

Query: 276 SDDEVLVIKSVIHRDFNAMIKLIP--NPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVD 335
              EVL++K+ IH   N MI+ +P  NPVYL  G      AKGTG+K++E ALS CD VD
Sbjct: 241 KKQEVLLVKTTIHDIMNKMIREVPIKNPVYLMLGASFGSAAKGTGLKALEFALSTCDSVD 300

Query: 336 IYGFTVDPGYTEW 343
           +YGFTVDPGY EW
Sbjct: 301 MYGFTVDPGYKEW 312

BLAST of Cp4.1LG03g06530 vs. TAIR10
Match: AT1G08280.1 (AT1G08280.1 Glycosyltransferase family 29 (sialyltransferase) family protein)

HSP 1 Score: 51.6 bits (122), Expect = 1.1e-06
Identity = 24/55 (43.64%), Positives = 37/55 (67.27%), Query Frame = 1

Query: 203 KQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLV 258
           +++ +CAVVGNSG LL +++G+ ID H+ VIR N A   E++ K VG K +   +
Sbjct: 172 ERYLSCAVVGNSGTLLNSQYGDLIDKHEIVIRLNNAK-TERFEKKVGSKTNISFI 225

BLAST of Cp4.1LG03g06530 vs. NCBI nr
Match: gi|449455926|ref|XP_004145701.1| (PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1 [Cucumis sativus])

HSP 1 Score: 636.3 bits (1640), Expect = 3.0e-179
Identity = 307/324 (94.75%), Positives = 313/324 (96.60%), Query Frame = 1

Query: 19  MRSLKAPFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRIL 78
           MRSLK   SNNG   RRLTLLHLVCAAALFSFLVFVIQSSFFAGY QP+ DLNREEVRIL
Sbjct: 1   MRSLKPASSNNGVG-RRLTLLHLVCAAALFSFLVFVIQSSFFAGYHQPLVDLNREEVRIL 60

Query: 79  SDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 138
           SDFQS+VQQCV  RGLGLTAHIIDHCKL+LKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA
Sbjct: 61  SDFQSNVQQCVANRGLGLTAHIIDHCKLILKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 120

Query: 139 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKP 198
           ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNR+LCEEHLNLILPSKP
Sbjct: 121 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRSLCEEHLNLILPSKP 180

Query: 199 PFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 258
           PFHP+QFKTCAVVGNSGDLLKTEFG+EIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV
Sbjct: 181 PFHPRQFKTCAVVGNSGDLLKTEFGDEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 240

Query: 259 RGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 318
           RGAA NMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI
Sbjct: 241 RGAARNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 300

Query: 319 ELALSMCDIVDIYGFTVDPGYTEW 343
           ELALSMCDIVDIYGFTVDPGYTEW
Sbjct: 301 ELALSMCDIVDIYGFTVDPGYTEW 323

BLAST of Cp4.1LG03g06530 vs. NCBI nr
Match: gi|659098197|ref|XP_008450023.1| (PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1 [Cucumis melo])

HSP 1 Score: 634.4 bits (1635), Expect = 1.1e-178
Identity = 306/324 (94.44%), Positives = 313/324 (96.60%), Query Frame = 1

Query: 19  MRSLKAPFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRIL 78
           MRSLK   S+NG   RRLTLLHLVCAAALFSFLVFVIQSSFFAGY QP+ DLNREEVRIL
Sbjct: 1   MRSLKPASSSNGVG-RRLTLLHLVCAAALFSFLVFVIQSSFFAGYHQPLVDLNREEVRIL 60

Query: 79  SDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 138
           SDFQS+VQQCV  RGLGLTAHIIDHCKL+LKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA
Sbjct: 61  SDFQSNVQQCVANRGLGLTAHIIDHCKLILKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 120

Query: 139 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKP 198
           ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNR+LCEEHLNLILPSKP
Sbjct: 121 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRSLCEEHLNLILPSKP 180

Query: 199 PFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 258
           PFHP+QFKTCAVVGNSGDLLKTEFG+EIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV
Sbjct: 181 PFHPRQFKTCAVVGNSGDLLKTEFGDEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 240

Query: 259 RGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 318
           RGAA NMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI
Sbjct: 241 RGAARNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 300

Query: 319 ELALSMCDIVDIYGFTVDPGYTEW 343
           ELALSMCDIVDIYGFTVDPGYTEW
Sbjct: 301 ELALSMCDIVDIYGFTVDPGYTEW 323

BLAST of Cp4.1LG03g06530 vs. NCBI nr
Match: gi|702464600|ref|XP_010028948.1| (PREDICTED: uncharacterized protein LOC104419104 [Eucalyptus grandis])

HSP 1 Score: 568.9 bits (1465), Expect = 5.9e-159
Identity = 269/309 (87.06%), Positives = 290/309 (93.85%), Query Frame = 1

Query: 34  RRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSSVQQCVTKRG 93
           R LT+LHLVCAAALFS LVFVIQSS+FAG +Q    ++ E+VRILSDFQSSV+QCV  RG
Sbjct: 13  RNLTILHLVCAAALFSLLVFVIQSSYFAGSRQ--VPISGEDVRILSDFQSSVRQCVANRG 72

Query: 94  LGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQYRNMTTVLT 153
           LGLTAHIIDHCKL+LKFPEGTNSTWYNEQFKI+EPLEY YDVCEAILLWEQYRNMTTVLT
Sbjct: 73  LGLTAHIIDHCKLILKFPEGTNSTWYNEQFKIFEPLEYSYDVCEAILLWEQYRNMTTVLT 132

Query: 154 REYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQFKTCAVVGN 213
           REYLDARPDGW +YAAKRIAQLGADKCYNRTLCE+HLN+ILP+KPPFHP+QF++CAVVGN
Sbjct: 133 REYLDARPDGWLEYAAKRIAQLGADKCYNRTLCEDHLNIILPAKPPFHPRQFRSCAVVGN 192

Query: 214 SGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGNMIAILNGSD 273
           SGDLLKTEFG+EID HDAVIRDNEAPVNEKYAK+VGLKRDFRLVVRGAA NM+ ILNGS 
Sbjct: 193 SGDLLKTEFGKEIDGHDAVIRDNEAPVNEKYAKHVGLKRDFRLVVRGAARNMVKILNGSA 252

Query: 274 DEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDIVDIYGF 333
           DEVL+IKSV HRDFN MIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSMCD+VDIYGF
Sbjct: 253 DEVLIIKSVTHRDFNTMIKSIPNPVYLFQGIVLRRGAKGTGMKSIELALSMCDVVDIYGF 312

Query: 334 TVDPGYTEW 343
           TVDPGYTEW
Sbjct: 313 TVDPGYTEW 319

BLAST of Cp4.1LG03g06530 vs. NCBI nr
Match: gi|802564549|ref|XP_012067346.1| (PREDICTED: uncharacterized protein LOC105630208 [Jatropha curcas])

HSP 1 Score: 566.2 bits (1458), Expect = 3.8e-158
Identity = 268/318 (84.28%), Positives = 287/318 (90.25%), Query Frame = 1

Query: 25  PFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRILSDFQSS 84
           P+ N+  S R   +LHL+C AA FS +VF IQSSFFAG +   +DLN+EE+  LS+FQ S
Sbjct: 3   PYKNSNNSRRPAAVLHLLCVAAFFSIIVFAIQSSFFAGNRN--SDLNKEEIHTLSEFQFS 62

Query: 85  VQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEAILLWEQ 144
           VQQCV  RGLGLTAHI+DHCKL LKFPEGTNSTWYN QFKIYEPLEY YDVC+AILLWEQ
Sbjct: 63  VQQCVANRGLGLTAHIVDHCKLTLKFPEGTNSTWYNAQFKIYEPLEYHYDVCDAILLWEQ 122

Query: 145 YRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKPPFHPKQ 204
           YRNMTTVLTREYLDARPDGW DYAAKRIAQLG DKCYNRTLCEEHLNLILP+KPPF P+Q
Sbjct: 123 YRNMTTVLTREYLDARPDGWLDYAAKRIAQLGDDKCYNRTLCEEHLNLILPAKPPFRPRQ 182

Query: 205 FKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVVRGAAGN 264
           F+TCAVVGNSGDLLKTEFG+EIDSHDAVIRDNEAPVNEKYAK+VGLKRDFRLVVRGAA N
Sbjct: 183 FQTCAVVGNSGDLLKTEFGKEIDSHDAVIRDNEAPVNEKYAKHVGLKRDFRLVVRGAARN 242

Query: 265 MIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSIELALSM 324
           M+ ILNGS DEVL+IKSV HRDFNAMIK IPNPVYLFQGIVLRRGAKGTGMKSIELALSM
Sbjct: 243 MVTILNGSTDEVLIIKSVTHRDFNAMIKSIPNPVYLFQGIVLRRGAKGTGMKSIELALSM 302

Query: 325 CDIVDIYGFTVDPGYTEW 343
           CDIVDIYGFTVDPGYTEW
Sbjct: 303 CDIVDIYGFTVDPGYTEW 318

BLAST of Cp4.1LG03g06530 vs. NCBI nr
Match: gi|719978626|ref|XP_010249221.1| (PREDICTED: uncharacterized protein LOC104591853 [Nelumbo nucifera])

HSP 1 Score: 565.5 bits (1456), Expect = 6.5e-158
Identity = 274/324 (84.57%), Positives = 292/324 (90.12%), Query Frame = 1

Query: 19  MRSLKAPFSNNGTSARRLTLLHLVCAAALFSFLVFVIQSSFFAGYQQPIADLNREEVRIL 78
           MR LKA  SN     RR T L L+CAA  FS LV  IQ+SFF+G ++   DLNREE+R L
Sbjct: 1   MRPLKASSSNY----RRPTFLLLICAAIAFSLLVLAIQTSFFSGNRK--YDLNREEIRTL 60

Query: 79  SDFQSSVQQCVTKRGLGLTAHIIDHCKLVLKFPEGTNSTWYNEQFKIYEPLEYPYDVCEA 138
           +DFQ+SV QCV  RGLGLTA I+DHCKLVLKFPEGTNSTWYNEQFKI+EPLEY YDVCEA
Sbjct: 61  TDFQTSVLQCVANRGLGLTAQIVDHCKLVLKFPEGTNSTWYNEQFKIFEPLEYNYDVCEA 120

Query: 139 ILLWEQYRNMTTVLTREYLDARPDGWFDYAAKRIAQLGADKCYNRTLCEEHLNLILPSKP 198
           ILLWEQYRNMTTVLTREYLDARPDGW +YAAKRIAQLGADKCYN +LCEEHLNLILPSKP
Sbjct: 121 ILLWEQYRNMTTVLTREYLDARPDGWLEYAAKRIAQLGADKCYNHSLCEEHLNLILPSKP 180

Query: 199 PFHPKQFKTCAVVGNSGDLLKTEFGEEIDSHDAVIRDNEAPVNEKYAKYVGLKRDFRLVV 258
           PFHP+QF+TCAVVGNSGDLLKTEFGEEID HDAVIRDNEAPVNEKYAK+VGLKRDFRLVV
Sbjct: 181 PFHPRQFRTCAVVGNSGDLLKTEFGEEIDEHDAVIRDNEAPVNEKYAKHVGLKRDFRLVV 240

Query: 259 RGAAGNMIAILNGSDDEVLVIKSVIHRDFNAMIKLIPNPVYLFQGIVLRRGAKGTGMKSI 318
           RGAA NM+AILNGSDDEVL+IKS+ HRDFNAMIK IPNPVYLFQGIVLRRGAKGTGMKSI
Sbjct: 241 RGAARNMVAILNGSDDEVLIIKSLTHRDFNAMIKTIPNPVYLFQGIVLRRGAKGTGMKSI 300

Query: 319 ELALSMCDIVDIYGFTVDPGYTEW 343
           ELALSMCDIVDIYGFTVDPGYTEW
Sbjct: 301 ELALSMCDIVDIYGFTVDPGYTEW 318

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
SIA1_ARATH	3.1e-152	82.32	Sialyltransferase-like protein 1 OS=Arabidopsis thaliana GN=SIA1 PE=2 SV=1
STLP5_ORYSJ	6.5e-142	76.18	Sialyltransferase-like protein 5 OS=Oryza sativa subsp. japonica GN=STLP5 PE=2 S...
STLP4_ORYSJ	5.6e-77	48.26	Sialyltransferase-like protein 4 OS=Oryza sativa subsp. japonica GN=STLP4 PE=2 S...
SIA2_ARATH	6.9e-75	45.37	Sialyltransferase-like protein 2 OS=Arabidopsis thaliana GN=SIA2 PE=2 SV=1
SIA8D_HUMAN	6.3e-12	57.53	CMP-N-acetylneuraminate-poly-alpha-2,8-sialyltransferase OS=Homo sapiens GN=ST8S...
Match Name	E-value	Identity	Description
A0A0A0L8S8_CUCSA	2.1e-179	94.75	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G599430 PE=3 SV=1
A0A059AP51_EUCGR	4.1e-159	87.06	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I01604 PE=3 SV=1
A0A067L0A8_JATCU	2.7e-158	84.28	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26870 PE=3 SV=1
A0A068V7Y6_COFCA	6.5e-157	84.19	Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00019215001 PE=3 SV=1
A0A076KX85_TOBAC	3.2e-156	84.84	Sialyltransferase-like protein OS=Nicotiana tabacum PE=2 SV=1
Match Name	E-value	Identity	Description
AT1G08660.1	1.8e-153	82.32	MALE GAMETOPHYTE DEFECTIVE 2
AT3G48820.1	3.9e-76	45.37	Glycosyltransferase family 29 (sialyltransferase) family protein
AT1G08280.1	1.1e-06	43.64	Glycosyltransferase family 29 (sialyltransferase) family protein
Match Name	E-value	Identity	Description
gi|449455926|ref|XP_004145701.1|	3.0e-179	94.75	PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferas...
gi|659098197|ref|XP_008450023.1|	1.1e-178	94.44	PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferas...
gi|702464600|ref|XP_010028948.1|	5.9e-159	87.06	PREDICTED: uncharacterized protein LOC104419104 [Eucalyptus grandis]
gi|802564549|ref|XP_012067346.1|	3.8e-158	84.28	PREDICTED: uncharacterized protein LOC105630208 [Jatropha curcas]
gi|719978626|ref|XP_010249221.1|	6.5e-158	84.57	PREDICTED: uncharacterized protein LOC104591853 [Nelumbo nucifera]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008373	sialyltransferase activity
Vocabulary: Biological Process
Term	Definition
GO:0006486	protein glycosylation
Vocabulary: INTERPRO
Term	Definition
IPR001675	Glyco_trans_29
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009846	pollen germination
biological_process	GO:0009860	pollen tube growth
biological_process	GO:0006486	protein glycosylation
biological_process	GO:0097503	sialylation
biological_process	GO:0007020	microtubule nucleation
cellular_component	GO:0005768	endosome
cellular_component	GO:0005802	trans-Golgi network
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0008373	sialyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g06530.1	Cp4.1LG03g06530.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001675	Glycosyl transferase family 29	PFAM	PF00777	Glyco_transf_29	coord: 179..290, score: 1.1
None	No IPR available	PANTHER	PTHR13713	SIALYLTRANSFERASE	coord: 12..342, score: 7.1E
None	No IPR available	PANTHER	PTHR13713:SF31	SUBFAMILY NOT NAMED	coord: 12..342, score: 7.1E