CmaCh20G001660 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G001660
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Beta-galactoside alpha-2,3-sialyltransferase (Sialyltransferase 4A)
Location: Cma_Chr20 : 806346 .. 807780 (+)
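The location string can be split into its parts and turned into a feature length. The sketch below is only illustrative: the helper names `parse_location` and `span_length` are hypothetical, and the arithmetic assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr20 : 806346 .. 807780 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 1435 bp on the plus strand of Cma_Chr20.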
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGTGGTTCTTGTTTCTTACCTACTCTTTCAAATCTCGTCACCGACGAGATTGCATTCATGTTCACTGCTTGTTGCTTTTCAAAATCTCTAAGCAATCACAGAAATCTCGCCTTTTCCGAGTTCAAATTCGAAGCATCTTAACTGTTCCCCTATTATCTCTTTGGATGCCGAACTTCCTGGGAGTCGATCTGCGACATATTGGTTCAAGATCTGTCCAGCCAATGCTAGGTTTCATTTAGCATTGCCCCACAATCGGTCCGTCGCGATGAAGCGCACGGTCCGTCCAGTGTTCAGCGTGCTGCTGTTCCTTACCTTTGCCGTCACTCTCATCTGCCGCCTCATCTTCCGCCGCGGCCTCAGTTTCTTTGAGATGGAAACTAATGTAATCTCTCCAAGATCTCCAGCATTCATGTTCAATTCCACACTGCTGAAATTTGCTTCGGTTGATTTGGGAGAGGCTCAGTCGAAGCGAGAGATAGAGCAGTTATTGGAGGGGAAATTTGGTGGTCCGAGGACGTACAAGACTTTCGCTACTTGGCGAAGATTCAATCACTATGACATGAAAGCGAGGCCCTCGAATAGCTTTCCAGTGACATTCCGCTCTCCAGCTTTCTACCGACATTGGTTGGATTTCAGGCGGGCATTGAGTGGCTGGGTGAGAAGAAAAGGGTTTACAACGGATATAATGCCAGAACTGGTAAGGCTTATCAAAGCCCCACTGGACAGGCACAACGGGTTGGTGGGTTCAGATCAACGGTATTCATCATGTGCCGTCGTGGGAAACAGTGGAATCCTATTGAACAGTGATTATGGGAAGCTAATTGACAGCCATGAGATTGTTATTCGATTGAACAATGCAAAAACAAACAAGTATGAGAGCAAAGTTGGGTCCAAAACCAGCGTTTCTTTCATCAATAGCAACATCTTGCACCTTTGTGCCAGAAGAGAAGGGTGTTTTTGCCATCCTTATGGGCCAAACGTGCCGACAATCATGTACATTTGTCAACCTGTACATTTCATGGACTACACTGTCTGCAACAGTTCCCACAAATCCCCGCTGCTGATAACAGATCCAAGTTTTGATGCTCTGTGCTCTAGGATTGTAAAGTATTACTCAATCAAACGCTTTGTGGAGGTGACAGGGAAATCGTTGGAAGAATGGAGCTCAGCCCACGAAGGTCCCTTATTCCACTATTCTTCTGGCATGCAAGCCGTTATGTTGGCCGTAGGAATTTGTGACAAAGTAAGCATATTTGGGTTTGGGAAATTGGCTTCAGCAAGGCACCATTATCACACAAACCAGAAGGCCGAACTGGGTTTACACGATTATGAGGCAGAGTATGCTTTCTATTACGATTTGATTGCTAGGCCACAGAGAATACCTTTCTTGTCGGACAAGTTCAAGATACCTTCTACGGTTTTATATCGATGA

mRNA sequence

ATGCGTGCAATCACAGAAATCTCGCCTTTTCCGAGTTCAAATTCGAAGCATCTTAACTGTTCCCCTATTATCTCTTTGGATGCCGAACTTCCTGGGAGTCGATCTGCGACATATTGGTTCAAGATCTGTCCAGCCAATGCTAGGTTTCATTTAGCATTGCCCCACAATCGGTCCGTCGCGATGAAGCGCACGGTCCGTCCAGTGTTCAGCGTGCTGCTGTTCCTTACCTTTGCCGTCACTCTCATCTGCCGCCTCATCTTCCGCCGCGGCCTCAGTTTCTTTGAGATGGAAACTAATGTAATCTCTCCAAGATCTCCAGCATTCATGTTCAATTCCACACTGCTGAAATTTGCTTCGGTTGATTTGGGAGAGGCTCAGTCGAAGCGAGAGATAGAGCAGTTATTGGAGGGGAAATTTGGTGGTCCGAGGACGTACAAGACTTTCGCTACTTGGCGAAGATTCAATCACTATGACATGAAAGCGAGGCCCTCGAATAGCTTTCCAGTGACATTCCGCTCTCCAGCTTTCTACCGACATTGGTTGGATTTCAGGCGGGCATTGAGTGGCTGGGTGAGAAGAAAAGGGTTTACAACGGATATAATGCCAGAACTGGTAAGGCTTATCAAAGCCCCACTGGACAGGCACAACGGGTTGGTGGGTTCAGATCAACGGTATTCATCATGTGCCGTCGTGGGAAACAGTGGAATCCTATTGAACAGTGATTATGGGAAGCTAATTGACAGCCATGAGATTGTTATTCGATTGAACAATGCAAAAACAAACAAGTATGAGAGCAAAGTTGGGTCCAAAACCAGCGTTTCTTTCATCAATAGCAACATCTTGCACCTTTGTGCCAGAAGAGAAGGGTGTTTTTGCCATCCTTATGGGCCAAACGTGCCGACAATCATGTACATTTGTCAACCTGTACATTTCATGGACTACACTGTCTGCAACAGTTCCCACAAATCCCCGCTGCTGATAACAGATCCAAGTTTTGATGCTCTGTGCTCTAGGATTGTAAAGTATTACTCAATCAAACGCTTTGTGGAGGTGACAGGGAAATCGTTGGAAGAATGGAGCTCAGCCCACGAAGGTCCCTTATTCCACTATTCTTCTGGCATGCAAGCCGTTATGTTGGCCGTAGGAATTTGTGACAAAGTAAGCATATTTGGGTTTGGGAAATTGGCTTCAGCAAGGCACCATTATCACACAAACCAGAAGGCCGAACTGGGTTTACACGATTATGAGGCAGAGTATGCTTTCTATTACGATTTGATTGCTAGGCCACAGAGAATACCTTTCTTGTCGGACAAGTTCAAGATACCTTCTACGGTTTTATATCGATGA
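The gene sequence and the mRNA sequence share their first and last bases and differ by one internal stretch, which is consistent with a single intron. A minimal sketch of recovering that stretch from the two strings is shown here; `infer_single_intron` is a hypothetical helper, it assumes exactly one contiguous gap, and the short sequences in the example are a made-up miniature rather than the full record.

```python
def infer_single_intron(gene, mrna):
    """Return 0-based, half-open (start, end) of the one segment of `gene`
    that is missing from `mrna`. Raises if the two cannot be reconciled
    with a single contiguous gap."""
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")
    # Length of the shared prefix (first exon side).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Length of the shared suffix, never overlapping the prefix in the mRNA.
    suffix = 0
    while suffix < len(mrna) - prefix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences differ by more than one contiguous gap")
    return prefix, len(gene) - suffix

# Toy example: two short exons around an invented GT...AG intron.
toy_gene = "ATGCGTG" + "GTTCTTAAG" + "CAATCACAG"
toy_mrna = "ATGCGTG" + "CAATCACAG"
start, end = infer_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

When an intron's first base happens to match the next exonic base, the reported boundary can shift by a position or two, so the result is best read as an estimate to check against the annotation.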

Coding sequence (CDS)

ATGCGTGCAATCACAGAAATCTCGCCTTTTCCGAGTTCAAATTCGAAGCATCTTAACTGTTCCCCTATTATCTCTTTGGATGCCGAACTTCCTGGGAGTCGATCTGCGACATATTGGTTCAAGATCTGTCCAGCCAATGCTAGGTTTCATTTAGCATTGCCCCACAATCGGTCCGTCGCGATGAAGCGCACGGTCCGTCCAGTGTTCAGCGTGCTGCTGTTCCTTACCTTTGCCGTCACTCTCATCTGCCGCCTCATCTTCCGCCGCGGCCTCAGTTTCTTTGAGATGGAAACTAATGTAATCTCTCCAAGATCTCCAGCATTCATGTTCAATTCCACACTGCTGAAATTTGCTTCGGTTGATTTGGGAGAGGCTCAGTCGAAGCGAGAGATAGAGCAGTTATTGGAGGGGAAATTTGGTGGTCCGAGGACGTACAAGACTTTCGCTACTTGGCGAAGATTCAATCACTATGACATGAAAGCGAGGCCCTCGAATAGCTTTCCAGTGACATTCCGCTCTCCAGCTTTCTACCGACATTGGTTGGATTTCAGGCGGGCATTGAGTGGCTGGGTGAGAAGAAAAGGGTTTACAACGGATATAATGCCAGAACTGGTAAGGCTTATCAAAGCCCCACTGGACAGGCACAACGGGTTGGTGGGTTCAGATCAACGGTATTCATCATGTGCCGTCGTGGGAAACAGTGGAATCCTATTGAACAGTGATTATGGGAAGCTAATTGACAGCCATGAGATTGTTATTCGATTGAACAATGCAAAAACAAACAAGTATGAGAGCAAAGTTGGGTCCAAAACCAGCGTTTCTTTCATCAATAGCAACATCTTGCACCTTTGTGCCAGAAGAGAAGGGTGTTTTTGCCATCCTTATGGGCCAAACGTGCCGACAATCATGTACATTTGTCAACCTGTACATTTCATGGACTACACTGTCTGCAACAGTTCCCACAAATCCCCGCTGCTGATAACAGATCCAAGTTTTGATGCTCTGTGCTCTAGGATTGTAAAGTATTACTCAATCAAACGCTTTGTGGAGGTGACAGGGAAATCGTTGGAAGAATGGAGCTCAGCCCACGAAGGTCCCTTATTCCACTATTCTTCTGGCATGCAAGCCGTTATGTTGGCCGTAGGAATTTGTGACAAAGTAAGCATATTTGGGTTTGGGAAATTGGCTTCAGCAAGGCACCATTATCACACAAACCAGAAGGCCGAACTGGGTTTACACGATTATGAGGCAGAGTATGCTTTCTATTACGATTTGATTGCTAGGCCACAGAGAATACCTTTCTTGTCGGACAAGTTCAAGATACCTTCTACGGTTTTATATCGATGA

Protein sequence

MRAITEISPFPSSNSKHLNCSPIISLDAELPGSRSATYWFKICPANARFHLALPHNRSVAMKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFMFNSTLLKFASVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKFKIPSTVLYR
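The protein sequence should follow from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code (the page does not name the table used), and `translate` is a hypothetical helper written for illustration; it is applied only to the first 21 bases of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGCGTGCAATCACAGAAATC"))
```

The seven residues produced, MRAITEI, match the start of the listed protein sequence.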
BLAST of CmaCh20G001660 vs. Swiss-Prot
Match: GT29A_ARATH (Beta-1,6-galactosyltransferase GALT29A OS=Arabidopsis thaliana GN=GALT29A PE=1 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 1.9e-125
Identity = 229/407 (56.27%), Positives = 279/407 (68.55%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFM-----FNSTLL 120
           MKR+VRP+FS LLF  FA TLICR+  RR  S F   + +    S   M     FN TLL
Sbjct: 1   MKRSVRPLFSALLFAFFAATLICRVAIRR--SSFSFASAIAELGSSGLMTEDIVFNETLL 60

Query: 121 KFASVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPS----------- 180
           +FA++D GE   K+E++ + +        Y       R +   M  RPS           
Sbjct: 61  EFAAIDPGEPNFKQEVDLISD--------YDHTRRSHRRHFSSMSIRPSEQQRRVSRDIA 120

Query: 181 --NSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVG-S 240
             + FPVT RS   YR+W +F+R L  W RR+ +  +IM +L+RL+K P+D HNG+V  S
Sbjct: 121 SSSKFPVTLRSSQAYRYWSEFKRNLRLWARRRAYEPNIMLDLIRLVKNPIDVHNGVVSIS 180

Query: 241 DQRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNIL 300
            +RY SCAVVGNSG LLNS YG LID HEIVIRLNNAKT ++E KVGSKT++SFINSNIL
Sbjct: 181 SERYLSCAVVGNSGTLLNSQYGDLIDKHEIVIRLNNAKTERFEKKVGSKTNISFINSNIL 240

Query: 301 HLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVK 360
           H C RRE C+CHPYG  VP +MYICQP+H +DYT+C  SH++PLLITDP FD +C+RIVK
Sbjct: 241 HQCGRRESCYCHPYGETVPIVMYICQPIHVLDYTLCKPSHRAPLLITDPRFDVMCARIVK 300

Query: 361 YYSIKRFV-EVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARH 420
           YYS+K+F+ E   K   +WS  HEG LFHYSSGMQAVMLAVGIC+KVS+FGFGKL S +H
Sbjct: 301 YYSVKKFLEEKKAKGFVDWSKDHEGSLFHYSSGMQAVMLAVGICEKVSVFGFGKLNSTKH 360

Query: 421 HYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKFKIPSTVLY 448
           HYHTNQKAEL LHDYEAEY  Y DL   P+ IPFL  +FKIP   +Y
Sbjct: 361 HYHTNQKAELKLHDYEAEYRLYRDLENSPRAIPFLPKEFKIPLVQVY 397
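The percentages in each HSP header are simply the identical (or positive-scoring) column counts divided by the alignment length. A small sketch that re-derives them from a header line is given below; `hsp_percentages` and the regular expression are hypothetical helpers, not part of any BLAST tooling, and the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Matches the two count pairs in an HSP header line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found in line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

header = "Identity = 229/407 (56.27%), Postives = 279/407 (68.55%), Query Frame = 1"
print(hsp_percentages(header))
```

For the GT29A_ARATH hit this reproduces the reported 56.27% identity and 68.55% positives over 407 aligned columns.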

BLAST of CmaCh20G001660 vs. Swiss-Prot
Match: STLP1_ORYSI (Sialyltransferase-like protein 1 OS=Oryza sativa subsp. indica GN=STLP1 PE=3 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 4.3e-93
Identity = 184/409 (44.99%), Positives = 242/409 (59.17%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPA--FMFNSTLLKFA 120
           MKR +R  F+VLLF+          + RR +        V++P  P      N+TLL+ A
Sbjct: 1   MKRPLRRPFAVLLFVVLCAAASFPSVLRRSVG----PAPVLAPLPPLDPARLNATLLRLA 60

Query: 121 SVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWR--------RFNHYDMKARPSNSFPVT 180
           + D  EA  +R+++ LLEG+   P +      WR           H+          P  
Sbjct: 61  AADPSEAPLRRDVDDLLEGRL--PASSARARAWRLRGDRLHLHLRHHQFPVYRRGHHPDH 120

Query: 181 FRSPAFY---RHWL----DFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQ 240
              P  +   R  L      RRAL  W R +     ++  L  L+  P            
Sbjct: 121 DHDPLLHPLPRQELLLDPSLRRALRSWHRLRRHDPGVLRNLPSLLSLP-----------G 180

Query: 241 RYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHL 300
           R  SCAVVGNSGILL + +G LIDSH  V RLNNA+ + + + VG+KT++SFINSN+LHL
Sbjct: 181 RIPSCAVVGNSGILLGASHGALIDSHAAVFRLNNARISGFAANVGAKTNLSFINSNVLHL 240

Query: 301 CARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSS----HKSPLLITDPSFDALCSRI 360
           CARR  CFCHPYG  VP ++YICQ  HF+D   CN+S    H + + +TDP  D LC+RI
Sbjct: 241 CARRPNCFCHPYGDGVPILLYICQAAHFLDVASCNASSRSLHAASISVTDPRLDVLCARI 300

Query: 361 VKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASAR 420
           VKYYS++RFV  TG++ EEWSS  +  +FHYSSGMQA+M+AVG+CD+VS+FGFGK A A+
Sbjct: 301 VKYYSLRRFVAETGRAAEEWSSTRDAAMFHYSSGMQAIMVAVGVCDRVSVFGFGKAADAK 360

Query: 421 HHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDK-FKIPSTVLY 448
           HHYH+NQKAEL LHDY+AEYAFY DL  RP+ +PFL+D    +P  V Y
Sbjct: 361 HHYHSNQKAELDLHDYKAEYAFYRDLADRPEVVPFLNDAGIAVPPVVFY 392

BLAST of CmaCh20G001660 vs. Swiss-Prot
Match: STLP1_ORYSJ (Sialyltransferase-like protein 1 OS=Oryza sativa subsp. japonica GN=STLP1 PE=2 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 4.3e-93
Identity = 184/409 (44.99%), Positives = 242/409 (59.17%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPA--FMFNSTLLKFA 120
           MKR +R  F+VLLF+          + RR +        V++P  P      N+TLL+ A
Sbjct: 1   MKRPLRRPFAVLLFVVLCAAASFPSVLRRSVG----PAPVLAPLPPLDPARLNATLLRLA 60

Query: 121 SVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWR--------RFNHYDMKARPSNSFPVT 180
           + D  EA  +R+++ LLEG+   P +      WR           H+          P  
Sbjct: 61  AADPSEAPLRRDVDDLLEGRL--PASSARARAWRLRGDRLHLHLRHHQFPVYRRGHHPDH 120

Query: 181 FRSPAFY---RHWL----DFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQ 240
              P  +   R  L      RRAL  W R +     ++  L  L+  P            
Sbjct: 121 DHDPLLHPLPRQELHLDPSLRRALRSWHRLRRHDPGVLRNLPSLLSLP-----------G 180

Query: 241 RYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHL 300
           R  SCAVVGNSGILL + +G LIDSH  V RLNNA+ + + + VG+KT++SFINSN+LHL
Sbjct: 181 RIPSCAVVGNSGILLGASHGALIDSHAAVFRLNNARISGFAANVGAKTNLSFINSNVLHL 240

Query: 301 CARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSS----HKSPLLITDPSFDALCSRI 360
           CARR  CFCHPYG  VP ++YICQ  HF+D   CN+S    H + + +TDP  D LC+RI
Sbjct: 241 CARRPNCFCHPYGDGVPILLYICQAAHFLDVASCNASSRSLHAASISVTDPRLDVLCARI 300

Query: 361 VKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASAR 420
           VKYYS++RFV  TG++ EEWSS  +  +FHYSSGMQA+M+AVG+CD+VS+FGFGK A A+
Sbjct: 301 VKYYSLRRFVAETGRAAEEWSSTRDAAMFHYSSGMQAIMVAVGVCDRVSVFGFGKAADAK 360

Query: 421 HHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDK-FKIPSTVLY 448
           HHYH+NQKAEL LHDY+AEYAFY DL  RP+ +PFL+D    +P  V Y
Sbjct: 361 HHYHSNQKAELDLHDYKAEYAFYRDLADRPEVVPFLNDAGIAVPPVVFY 392

BLAST of CmaCh20G001660 vs. Swiss-Prot
Match: STLP3_ORYSJ (Sialyltransferase-like protein 3 OS=Oryza sativa subsp. japonica GN=STLP3 PE=2 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 1.2e-82
Identity = 151/288 (52.43%), Positives = 198/288 (68.75%), Query Frame = 1

Query: 185 RALSGWV-RRKGFTTDIMPELVRLIKAPLDRHNGL-----VGSDQRYSSCAVVGNSGILL 244
           R L  WV +++ F   +M ELV LIK P+DR+NG       G  +RY+SCAVVGNSGILL
Sbjct: 97  RGLREWVGKQERFDPGVMSELVELIKRPIDRYNGDGGGGGEGEGRRYASCAVVGNSGILL 156

Query: 245 NSDYGKLIDSHEIVIRLNNAKTN--KYESKVGSKTSVSFINSNILHLCA--RREGCFCHP 304
            +++G+LID HE+V+RLNNA     +Y   VG++T ++F+NSN+L  CA  RR  CFC  
Sbjct: 157 AAEHGELIDGHELVVRLNNAPAGDGRYARHVGARTGLAFLNSNVLSQCAVPRRGACFCRA 216

Query: 305 YGPNVPTIMYICQPVHFMDYTVCNSSHKS-----------PLLITDPSFDALCSRIVKYY 364
           YG  VP + Y+C   HF+++ VCN++  S           P+++TDP  DALC+RIVKYY
Sbjct: 217 YGEGVPILTYMCNAAHFVEHAVCNNASSSSSGAADATAAAPVIVTDPRLDALCARIVKYY 276

Query: 365 SIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYH 424
           S++RF   TG+  EEW+  HE  +FHYSSGMQAV+ A G+CD+VS+FGFGK ASARHHYH
Sbjct: 277 SLRRFARETGRPAEEWARRHEEGMFHYSSGMQAVVAAAGVCDRVSVFGFGKDASARHHYH 336

Query: 425 TNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDK---FKIPSTVLYR 449
           T Q+ EL LHDYEAEY FY DL +RP+ IPFL  +   F++P    YR
Sbjct: 337 TLQRRELDLHDYEAEYEFYRDLESRPEAIPFLRQRNSGFRLPPVSFYR 384

BLAST of CmaCh20G001660 vs. Swiss-Prot
Match: STLP3_ORYSI (Sialyltransferase-like protein 3 OS=Oryza sativa subsp. indica GN=STLP3 PE=3 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 1.2e-82
Identity = 151/288 (52.43%), Positives = 198/288 (68.75%), Query Frame = 1

Query: 185 RALSGWV-RRKGFTTDIMPELVRLIKAPLDRHNGL-----VGSDQRYSSCAVVGNSGILL 244
           R L  WV +++ F   +M ELV LIK P+DR+NG       G  +RY+SCAVVGNSGILL
Sbjct: 97  RGLREWVGKQERFDPGVMSELVELIKRPIDRYNGDGGGGGEGEGRRYASCAVVGNSGILL 156

Query: 245 NSDYGKLIDSHEIVIRLNNAKTN--KYESKVGSKTSVSFINSNILHLCA--RREGCFCHP 304
            +++G+LID HE+V+RLNNA     +Y   VG++T ++F+NSN+L  CA  RR  CFC  
Sbjct: 157 AAEHGELIDGHELVVRLNNAPAGDGRYARHVGARTGLAFLNSNVLSQCAVPRRGACFCRA 216

Query: 305 YGPNVPTIMYICQPVHFMDYTVCNSSHKS-----------PLLITDPSFDALCSRIVKYY 364
           YG  VP + Y+C   HF+++ VCN++  S           P+++TDP  DALC+RIVKYY
Sbjct: 217 YGEGVPILTYMCNAAHFVEHAVCNNASSSSSGAADATAAAPVIVTDPRLDALCARIVKYY 276

Query: 365 SIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYH 424
           S++RF   TG+  EEW+  HE  +FHYSSGMQAV+ A G+CD+VS+FGFGK ASARHHYH
Sbjct: 277 SLRRFARETGRPAEEWARRHEEGMFHYSSGMQAVVAAAGVCDRVSVFGFGKDASARHHYH 336

Query: 425 TNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDK---FKIPSTVLYR 449
           T Q+ EL LHDYEAEY FY DL +RP+ IPFL  +   F++P    YR
Sbjct: 337 TLQRRELDLHDYEAEYEFYRDLESRPEAIPFLRQRDSGFRLPPVSFYR 384

BLAST of CmaCh20G001660 vs. TrEMBL
Match: A0A0A0KDP9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G152940 PE=3 SV=1)

HSP 1 Score: 718.0 bits (1852), Expect = 7.1e-204
Identity = 338/388 (87.11%), Positives = 366/388 (94.33%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFMFNSTLLKFASV 120
           MKRTVRPVFSVLLF+TF+VTLI RLIFRRGL+ F++ETNVI+PR P F+FNSTLLKFASV
Sbjct: 1   MKRTVRPVFSVLLFITFSVTLIFRLIFRRGLTSFDLETNVITPRPPPFVFNSTLLKFASV 60

Query: 121 DLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRHW 180
           DL EAQ KREIEQLLE  FGGPRTYKT+ATWR+FNHY  KARPSNSFPVTFRSPAFYRHW
Sbjct: 61  DLAEAQLKREIEQLLEANFGGPRTYKTYATWRKFNHYSKKARPSNSFPVTFRSPAFYRHW 120

Query: 181 LDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLNS 240
           LDFRRALSGW RRKG+ TDIMPELVRLIK PLD+H+ LVGSDQRY SCAVVGNSGILLNS
Sbjct: 121 LDFRRALSGWARRKGYATDIMPELVRLIKHPLDKHSELVGSDQRYPSCAVVGNSGILLNS 180

Query: 241 DYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNVP 300
            YG+LIDSH++VIRLNNAKT+ YE+KVGSKT++SFINSNILHLCARREGCFCHPYGPNVP
Sbjct: 181 GYGRLIDSHDVVIRLNNAKTDNYENKVGSKTNISFINSNILHLCARREGCFCHPYGPNVP 240

Query: 301 TIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEWS 360
           T+MYICQPVHFMDYT+CN+SHKSPLL+TDPSFDALCS+IVKYYSIKRFVEVTGKS EEWS
Sbjct: 241 TVMYICQPVHFMDYTICNTSHKSPLLVTDPSFDALCSKIVKYYSIKRFVEVTGKSSEEWS 300

Query: 361 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEYA 420
           SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGK  SA+HHYHTNQKAEL LHDYEAEYA
Sbjct: 301 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKSVSAKHHYHTNQKAELSLHDYEAEYA 360

Query: 421 FYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           FYYDLI+RPQRIPFLSDKFK+P TVLY+
Sbjct: 361 FYYDLISRPQRIPFLSDKFKVPPTVLYQ 388

BLAST of CmaCh20G001660 vs. TrEMBL
Match: B9H1R5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s19490g PE=3 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 6.3e-160
Identity = 258/390 (66.15%), Positives = 320/390 (82.05%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRG--LSFFEMETNVISPRSPAFMFNSTLLKFA 120
           MKR+VRP+FS+LL + FA+TL CR++  RG  + F E E   +  +    +FNSTLLK++
Sbjct: 1   MKRSVRPLFSILLLVVFALTLSCRILIPRGDGVGFIEFEKPKLILQKKVPVFNSTLLKYS 60

Query: 121 SVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYR 180
           ++D+GE Q+K EIE+LLEG F     Y++FATWRRFNH+D++AR S   P+  RSP FYR
Sbjct: 61  AIDIGEEQAKHEIEELLEGNFDSRGRYRSFATWRRFNHHDVRARSSRGIPLMLRSPQFYR 120

Query: 181 HWLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILL 240
           +WLDFRRAL  W R+K +  +IM EL+ L+K P+DRHNGLVGS++RY SCAVVGNSGIL+
Sbjct: 121 YWLDFRRALHDWARKKRYQPEIMDELIGLLKGPIDRHNGLVGSERRYGSCAVVGNSGILM 180

Query: 241 NSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPN 300
             +YG+LID HE+VIRLNNA+T +YE  VG+KT++SF+NSNILHLC RR+GCFCHPYG N
Sbjct: 181 QKEYGELIDRHEVVIRLNNARTERYERNVGAKTNISFVNSNILHLCGRRQGCFCHPYGAN 240

Query: 301 VPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEE 360
           VP +MYICQP HF+DYTVCNSSH +PL++TDP FD LC+RIVKYYS+KRFVE TGKSL+E
Sbjct: 241 VPMVMYICQPAHFLDYTVCNSSHDAPLIVTDPRFDLLCARIVKYYSLKRFVEETGKSLDE 300

Query: 361 WSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAE 420
           W SAH+G +FHYSSGMQAVMLAVGICDKVSIFGFGK A ARHHYHTNQKAEL LHDYEAE
Sbjct: 301 WGSAHDGSMFHYSSGMQAVMLAVGICDKVSIFGFGKSALARHHYHTNQKAELKLHDYEAE 360

Query: 421 YAFYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           Y  Y+DL+  PQ +PF++DKFK P+ V+Y+
Sbjct: 361 YDLYHDLVNNPQAVPFITDKFKFPAAVIYQ 390

BLAST of CmaCh20G001660 vs. TrEMBL
Match: A0A061GVE5_THECC (Sialyltransferase, putative OS=Theobroma cacao GN=TCM_041444 PE=3 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 2.4e-159
Identity = 278/429 (64.80%), Positives = 326/429 (75.99%), Query Frame = 1

Query: 32  GSRSATYWFKICPANARFHLALPH--------NRSVAMKRTVRPVFSVLLFLTFAVTLIC 91
           GS S T  FKI      F+L  P+        N   +MKR VRP+ S+L+ +  A TL C
Sbjct: 10  GSESNTDTFKI----HTFYLFWPYLAPIYRSTNLIASMKRPVRPLISILMLVALAATLSC 69

Query: 92  RLIFRRGLSFF---EME-TNVISPRSPAFMFNSTLLKFASVDLGEAQSKREIEQLLEGKF 151
           R+  RR   F    E+E T VI    P  +FNS+LLKFA+ D+GE +SK EIEQLLEG F
Sbjct: 70  RIAIRRRGVFTVSTELESTRVIIQPPPMQIFNSSLLKFAATDIGEEKSKHEIEQLLEGNF 129

Query: 152 GGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTD 211
                Y+TFATW RFN +D+KAR SN   V  RSP FYR+WLDFRR L  W R+K F  +
Sbjct: 130 ASQGRYRTFATWNRFNRHDIKARNSNGLSVMLRSPKFYRYWLDFRRNLQDWARKKMFQPE 189

Query: 212 IMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAK 271
           IM +LVRL+K P+D HNGL+ SD+ Y SCAVVGNSGILL+SD+G LID HE+VIRLNNA+
Sbjct: 190 IMMDLVRLVKVPIDNHNGLISSDKAYKSCAVVGNSGILLSSDHGALIDGHEVVIRLNNAR 249

Query: 272 TNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNS 331
           T ++E  VGSKTS+SF+NSNILHLCARREGCFCHPYG NVP +MYICQPVHFMDY VCNS
Sbjct: 250 TERFEKNVGSKTSISFVNSNILHLCARREGCFCHPYGGNVPMVMYICQPVHFMDYLVCNS 309

Query: 332 SHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVML 391
           SHK+PLLITDP FD LC+RIVKYYS+KRFV+ TGK L EW S H+G +FHYSSGMQAVML
Sbjct: 310 SHKAPLLITDPRFDMLCARIVKYYSVKRFVQETGKPLGEWGSTHDGSMFHYSSGMQAVML 369

Query: 392 AVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKF 449
           A+GICDKVSIFGFGK  SA+HHYHTNQKAEL LHDYEAEYAFY+DL+  PQ IPF+SDKF
Sbjct: 370 ALGICDKVSIFGFGKSTSAKHHYHTNQKAELRLHDYEAEYAFYHDLVNNPQAIPFISDKF 429

BLAST of CmaCh20G001660 vs. TrEMBL
Match: M5VYM2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006961mg PE=3 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 1.0e-157
Identity = 267/390 (68.46%), Positives = 317/390 (81.28%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNV-ISPRSPAFMFNSTLLKFAS 120
           MKR+VRP+FS+L+ + FA TL CR   R  LS  E+E  V I P  P  +FN+TLLK+A+
Sbjct: 1   MKRSVRPLFSLLILIVFAATLSCRTAVRHSLSSIELEKKVLIQPPKP--VFNATLLKYAA 60

Query: 121 VDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRH 180
           VD  EAQ+K+EIEQLLEG F     Y+TFA+WRRFNH+D++AR S   PV  RSP FYR+
Sbjct: 61  VDASEAQAKQEIEQLLEGNFASLGRYRTFASWRRFNHHDIRARTSVGLPVMLRSPQFYRY 120

Query: 181 WLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQ-RYSSCAVVGNSGILL 240
           WLDFRR LS W R K F  D+M +LVRL++ P+DRHNGLV S+Q RYSSCAVVGNSGILL
Sbjct: 121 WLDFRRVLSDWSRNKRFHPDVMLDLVRLVRYPIDRHNGLVDSEQRRYSSCAVVGNSGILL 180

Query: 241 NSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPN 300
            S++G LIDSHE+VIRLNNA+   +E KVGSKT++SF+NSNILHLCARR+GCFCHPYG N
Sbjct: 181 KSNHGALIDSHEVVIRLNNARIQGFEGKVGSKTNISFVNSNILHLCARRDGCFCHPYGLN 240

Query: 301 VPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEE 360
           VP IMYICQP+HF DYTVCN SHK PLL+TDP FD LC+RIVKYYS+KRFVE  GKS ++
Sbjct: 241 VPMIMYICQPLHFFDYTVCNISHKVPLLVTDPRFDVLCARIVKYYSLKRFVEEAGKSFDQ 300

Query: 361 WSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAE 420
           W + H+G +FHYSSGMQA+MLA+GICDKVS+FGFGK  SA+HHYHTNQKAEL LHDY+AE
Sbjct: 301 WGAVHDGAMFHYSSGMQAIMLALGICDKVSVFGFGKSDSAKHHYHTNQKAELRLHDYQAE 360

Query: 421 YAFYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           Y FY DL+ RPQ IPF+SDKFKIP  VLY+
Sbjct: 361 YDFYRDLVERPQVIPFISDKFKIPPVVLYQ 388

BLAST of CmaCh20G001660 vs. TrEMBL
Match: F6HHG9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0080g00250 PE=3 SV=1)

HSP 1 Score: 562.8 bits (1449), Expect = 3.8e-157
Identity = 266/383 (69.45%), Positives = 314/383 (81.98%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVIS-PRSPAFMFNSTLLKFAS 120
           MKR+VRP+FS+LL +  A TL  R+  RRG + F +ET+ ++ PR    +FNSTLL+ A+
Sbjct: 1   MKRSVRPLFSILLLIAAAGTLTFRVALRRGFASFHLETDDLAGPRRVVPVFNSTLLRIAA 60

Query: 121 VDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRH 180
           VDL EAQSK EIEQ+LEG F G   Y+TFA+WRRFNH D  AR S   P+  RSP FYR+
Sbjct: 61  VDLSEAQSKHEIEQMLEGNFAGQGRYRTFASWRRFNHIDDHARSSRGVPIQLRSPKFYRY 120

Query: 181 WLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLN 240
           WL+FRR L+ W R K F  +IM +L+RL+K P+D HNGLVG  +RYSSCAVVGNSGILL 
Sbjct: 121 WLEFRRNLNDWYRNKRFHPEIMSDLIRLVKHPIDEHNGLVGLGKRYSSCAVVGNSGILLR 180

Query: 241 SDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNV 300
           SDYG++IDSHE VIRLNNA+ +++E  VGSKTS+SFINSNILHLCARREGCFCHPYG NV
Sbjct: 181 SDYGEMIDSHEAVIRLNNARLDRFEHNVGSKTSISFINSNILHLCARREGCFCHPYGVNV 240

Query: 301 PTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEW 360
           P IMYICQP+HF+DY +CNSSHK+PLLITDP FD LC+RIVKYYS+KRFVEV  K ++EW
Sbjct: 241 PMIMYICQPLHFLDYALCNSSHKAPLLITDPRFDMLCARIVKYYSLKRFVEVLKKPVDEW 300

Query: 361 SSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEY 420
           SSAH+G +FHYSSGMQAVMLA+GICDKVSIFGFGK ASA+HHYHT QKAEL LHDYEAEY
Sbjct: 301 SSAHDGAMFHYSSGMQAVMLALGICDKVSIFGFGKSASAKHHYHTTQKAELPLHDYEAEY 360

Query: 421 AFYYDLIARPQRIPFLSDKFKIP 443
            FY+DL+ RPQ IPF+S KF IP
Sbjct: 361 DFYHDLVERPQVIPFISSKFNIP 383

BLAST of CmaCh20G001660 vs. TAIR10
Match: AT1G08280.1 (AT1G08280.1 Glycosyltransferase family 29 (sialyltransferase) family protein)

HSP 1 Score: 450.7 bits (1158), Expect = 1.1e-126
Identity = 229/407 (56.27%), Positives = 279/407 (68.55%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFM-----FNSTLL 120
           MKR+VRP+FS LLF  FA TLICR+  RR  S F   + +    S   M     FN TLL
Sbjct: 1   MKRSVRPLFSALLFAFFAATLICRVAIRR--SSFSFASAIAELGSSGLMTEDIVFNETLL 60

Query: 121 KFASVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPS----------- 180
           +FA++D GE   K+E++ + +        Y       R +   M  RPS           
Sbjct: 61  EFAAIDPGEPNFKQEVDLISD--------YDHTRRSHRRHFSSMSIRPSEQQRRVSRDIA 120

Query: 181 --NSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVG-S 240
             + FPVT RS   YR+W +F+R L  W RR+ +  +IM +L+RL+K P+D HNG+V  S
Sbjct: 121 SSSKFPVTLRSSQAYRYWSEFKRNLRLWARRRAYEPNIMLDLIRLVKNPIDVHNGVVSIS 180

Query: 241 DQRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNIL 300
            +RY SCAVVGNSG LLNS YG LID HEIVIRLNNAKT ++E KVGSKT++SFINSNIL
Sbjct: 181 SERYLSCAVVGNSGTLLNSQYGDLIDKHEIVIRLNNAKTERFEKKVGSKTNISFINSNIL 240

Query: 301 HLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVK 360
           H C RRE C+CHPYG  VP +MYICQP+H +DYT+C  SH++PLLITDP FD +C+RIVK
Sbjct: 241 HQCGRRESCYCHPYGETVPIVMYICQPIHVLDYTLCKPSHRAPLLITDPRFDVMCARIVK 300

Query: 361 YYSIKRFV-EVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARH 420
           YYS+K+F+ E   K   +WS  HEG LFHYSSGMQAVMLAVGIC+KVS+FGFGKL S +H
Sbjct: 301 YYSVKKFLEEKKAKGFVDWSKDHEGSLFHYSSGMQAVMLAVGICEKVSVFGFGKLNSTKH 360

Query: 421 HYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKFKIPSTVLY 448
           HYHTNQKAEL LHDYEAEY  Y DL   P+ IPFL  +FKIP   +Y
Sbjct: 361 HYHTNQKAELKLHDYEAEYRLYRDLENSPRAIPFLPKEFKIPLVQVY 397

BLAST of CmaCh20G001660 vs. TAIR10
Match: AT3G48820.1 (AT3G48820.1 Glycosyltransferase family 29 (sialyltransferase) family protein)

HSP 1 Score: 51.2 bits (121), Expect = 1.9e-06
Identity = 22/55 (40.00%), Positives = 36/55 (65.45%), Query Frame = 1

Query: 223 QRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFIN 278
           +++  CAV+GNSG LL + +GK ID+++ V+R N A    Y+  VG K++   +N
Sbjct: 170 RQFGRCAVIGNSGDLLKTKFGKEIDTYDTVLRENGAPIQNYKEYVGEKSTFRLLN 224

BLAST of CmaCh20G001660 vs. TAIR10
Match: AT1G08660.1 (AT1G08660.1 MALE GAMETOPHYTE DEFECTIVE 2)

HSP 1 Score: 48.9 bits (115), Expect = 9.4e-06
Identity = 25/49 (51.02%), Positives = 33/49 (67.35%), Query Frame = 1

Query: 223 QRYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTN-KYESKVGSK 271
           +++  CAVVGNSG LL +++G+ IDSH+ V R N A  N KY   VG K
Sbjct: 176 RQFHKCAVVGNSGDLLKTEFGEEIDSHDAVFRDNEAPVNEKYAKYVGVK 224

BLAST of CmaCh20G001660 vs. NCBI nr
Match: gi|659117388|ref|XP_008458576.1| (PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1 [Cucumis melo])

HSP 1 Score: 730.7 bits (1885), Expect = 1.5e-207
Identity = 343/388 (88.40%), Positives = 371/388 (95.62%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFMFNSTLLKFASV 120
           MKRTVRPVFSVLLF+TF +TLI RLIFRRGLS F++ETNVI+PR PAF+FNSTLL+FASV
Sbjct: 1   MKRTVRPVFSVLLFITFGITLIFRLIFRRGLSSFDLETNVITPRPPAFVFNSTLLEFASV 60

Query: 121 DLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRHW 180
           DLGEAQSKREIEQLLE  FGGPRTYKT+ATWRRFNHY+ KARPSNSFPVTFRSPAFYRHW
Sbjct: 61  DLGEAQSKREIEQLLEANFGGPRTYKTYATWRRFNHYNKKARPSNSFPVTFRSPAFYRHW 120

Query: 181 LDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLNS 240
           LDFRRALSGWVRRKG+ TDIMP+LVRLIK PLD+HNGLVGSDQRY SCAVVGNSGILLNS
Sbjct: 121 LDFRRALSGWVRRKGYATDIMPKLVRLIKHPLDKHNGLVGSDQRYPSCAVVGNSGILLNS 180

Query: 241 DYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNVP 300
           DYG+LIDSH++VIRLNNAKT+ YE+KVGSKT++SFINSNILHLCARREGCFCHPYGPNVP
Sbjct: 181 DYGRLIDSHDVVIRLNNAKTDNYENKVGSKTNISFINSNILHLCARREGCFCHPYGPNVP 240

Query: 301 TIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEWS 360
           T+MYICQPVHFMDYT+CN+SHKSPLL+TDPSFDALCS+IVKYYSIKRFVE TGKS EEWS
Sbjct: 241 TVMYICQPVHFMDYTICNTSHKSPLLVTDPSFDALCSKIVKYYSIKRFVEATGKSPEEWS 300

Query: 361 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEYA 420
           SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGK  SA+HHYHTNQKAEL LHDYEAEYA
Sbjct: 301 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKSVSAKHHYHTNQKAELSLHDYEAEYA 360

Query: 421 FYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           FYYDLI+RPQRIPFLSDKFK+P TVLY+
Sbjct: 361 FYYDLISRPQRIPFLSDKFKVPPTVLYQ 388

BLAST of CmaCh20G001660 vs. NCBI nr
Match: gi|449463074|ref|XP_004149259.1| (PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1 [Cucumis sativus])

HSP 1 Score: 718.0 bits (1852), Expect = 1.0e-203
Identity = 338/388 (87.11%), Positives = 366/388 (94.33%), Query Frame = 1

Query: 61  MKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEMETNVISPRSPAFMFNSTLLKFASV 120
           MKRTVRPVFSVLLF+TF+VTLI RLIFRRGL+ F++ETNVI+PR P F+FNSTLLKFASV
Sbjct: 1   MKRTVRPVFSVLLFITFSVTLIFRLIFRRGLTSFDLETNVITPRPPPFVFNSTLLKFASV 60

Query: 121 DLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAFYRHW 180
           DL EAQ KREIEQLLE  FGGPRTYKT+ATWR+FNHY  KARPSNSFPVTFRSPAFYRHW
Sbjct: 61  DLAEAQLKREIEQLLEANFGGPRTYKTYATWRKFNHYSKKARPSNSFPVTFRSPAFYRHW 120

Query: 181 LDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQRYSSCAVVGNSGILLNS 240
           LDFRRALSGW RRKG+ TDIMPELVRLIK PLD+H+ LVGSDQRY SCAVVGNSGILLNS
Sbjct: 121 LDFRRALSGWARRKGYATDIMPELVRLIKHPLDKHSELVGSDQRYPSCAVVGNSGILLNS 180

Query: 241 DYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPYGPNVP 300
            YG+LIDSH++VIRLNNAKT+ YE+KVGSKT++SFINSNILHLCARREGCFCHPYGPNVP
Sbjct: 181 GYGRLIDSHDVVIRLNNAKTDNYENKVGSKTNISFINSNILHLCARREGCFCHPYGPNVP 240

Query: 301 TIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKSLEEWS 360
           T+MYICQPVHFMDYT+CN+SHKSPLL+TDPSFDALCS+IVKYYSIKRFVEVTGKS EEWS
Sbjct: 241 TVMYICQPVHFMDYTICNTSHKSPLLVTDPSFDALCSKIVKYYSIKRFVEVTGKSSEEWS 300

Query: 361 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDYEAEYA 420
           SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGK  SA+HHYHTNQKAEL LHDYEAEYA
Sbjct: 301 SAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKSVSAKHHYHTNQKAELSLHDYEAEYA 360

Query: 421 FYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           FYYDLI+RPQRIPFLSDKFK+P TVLY+
Sbjct: 361 FYYDLISRPQRIPFLSDKFKVPPTVLYQ 388

BLAST of CmaCh20G001660 vs. NCBI nr
Match: gi|1009113279|ref|XP_015872656.1| (PREDICTED: beta-1,6-galactosyltransferase GALT29A [Ziziphus jujuba])

HSP 1 Score: 575.5 bits (1482), Expect = 8.2e-161
Identity = 269/393 (68.45%), Positives = 321/393 (81.68%), Query Frame = 1

Query: 58  SVAMKRTVRPVFSVLLFLTFAVTLICRLIFRRGLSFFEME-TNVISPRSPAFMFNSTLLK 117
           S  MKR+VRP+FSVLL + FA TL  R + RR L   E++  NV+   +P  + N TLLK
Sbjct: 46  SPTMKRSVRPLFSVLLLIIFAATLTTRNVVRRSLGSTELDDNNVVVQITPRPVLNGTLLK 105

Query: 118 FASVDLGEAQSKREIEQLLEGKFGGPRTYKTFATWRRFNHYDMKARPSNSFPVTFRSPAF 177
           +A++D+GEA+SK+EIEQLLEG FG    Y+TFA WRRFNH D +AR S   P+  RSP F
Sbjct: 106 YAAIDIGEARSKQEIEQLLEGNFGSLGRYRTFAAWRRFNHRDFRARTSMGTPLLLRSPNF 165

Query: 178 YRHWLDFRRALSGWVRRKGFTTDIMPELVRLIKAPLDRHNGLVGSDQ-RYSSCAVVGNSG 237
           YR+WLDFRR L  WVR K F  DIM ELV+L+K P+D+HNGL G D  R+SSCAVVGNSG
Sbjct: 166 YRYWLDFRRTLQNWVRNKRFQPDIMSELVKLVKLPIDKHNGLAGLDNNRFSSCAVVGNSG 225

Query: 238 ILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVGSKTSVSFINSNILHLCARREGCFCHPY 297
           ILL +DYG LID+H++VIRLNNAKT  YE KVGSKT++SF+NSNILHLCARREGCFCHPY
Sbjct: 226 ILLKNDYGDLIDNHQVVIRLNNAKTEGYERKVGSKTTISFVNSNILHLCARREGCFCHPY 285

Query: 298 GPNVPTIMYICQPVHFMDYTVCNSSHKSPLLITDPSFDALCSRIVKYYSIKRFVEVTGKS 357
           G N+P +MYICQP+HF+DYT+CNSSHK+PLL+TD  FD LC+RIVKYYS+KRFVEVTGKS
Sbjct: 286 GVNIPIVMYICQPLHFLDYTICNSSHKAPLLVTDARFDVLCARIVKYYSLKRFVEVTGKS 345

Query: 358 LEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVSIFGFGKLASARHHYHTNQKAELGLHDY 417
           LEEW S H+G +FHYSSGMQA+MLA+GICD+VS+FGFGK  SA+HHYHTNQKAEL LHDY
Sbjct: 346 LEEWGSEHDGSMFHYSSGMQAIMLALGICDRVSVFGFGKSDSAKHHYHTNQKAELHLHDY 405

Query: 418 EAEYAFYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           EAEYAFY+DL+ RPQ IPF+S  FKIP  V+Y+
Sbjct: 406 EAEYAFYHDLVQRPQVIPFISGDFKIPPVVIYQ 438

BLAST of CmaCh20G001660 vs. NCBI nr
Match: gi|645218098|ref|XP_008228970.1| (PREDICTED: uncharacterized protein LOC103328352 [Prunus mume])

HSP 1 Score: 573.5 bits (1477), Expect = 3.1e-160
Identity = 279/420 (66.43%), Positives = 332/420 (79.05%), Query Frame = 1

Query: 37  TYWFKICPANARF-HLALPHNRS-----VAMKRTVRPVFSVLLFLTFAVTLICRLIFRRG 96
           T  FKIC     + HLA P N       V MKR+VRP+FS+LL + FA TL CR + R  
Sbjct: 13  TQSFKICTRLYLWPHLATPINPFRDFVIVPMKRSVRPLFSLLLLIVFAATLSCRTVVRHS 72

Query: 97  LSFFEMETNV-ISPRSPAFMFNSTLLKFASVDLGEAQSKREIEQLLEGKFGGPRTYKTFA 156
           LS  E+E  V I P  P  +FN+TLLK+A+VD  EA++K+EIEQLLEG F     Y+TFA
Sbjct: 73  LSSIELEKKVLIQPPKP--VFNATLLKYAAVDASEAKAKQEIEQLLEGNFASLGRYRTFA 132

Query: 157 TWRRFNHYDMKARPSNSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTDIMPELVRLIK 216
           +WRRFNH+D++AR S   PV  RSP FYR+WLDFRR LS W R K F  D+M +LVRL++
Sbjct: 133 SWRRFNHHDIRARTSVGLPVMLRSPQFYRYWLDFRRVLSDWSRIKRFQPDVMLDLVRLVR 192

Query: 217 APLDRHNGLVGSDQ-RYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVG 276
            P+DRH+GLV S+Q RYSSCAVVGNSGILL S+YG LIDSHE+VIRLNNA+   +E KVG
Sbjct: 193 YPIDRHSGLVDSEQRRYSSCAVVGNSGILLKSNYGALIDSHEVVIRLNNARIQGFEGKVG 252

Query: 277 SKTSVSFINSNILHLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSSHKSPLLIT 336
           SKT++SF+NSNILHLCARR+GCFCHPYG NVP IMYICQP+HF DYTVCN SHK+PLL+T
Sbjct: 253 SKTNISFVNSNILHLCARRDGCFCHPYGLNVPMIMYICQPLHFFDYTVCNISHKAPLLVT 312

Query: 337 DPSFDALCSRIVKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVS 396
           DP FD LC+RIVKYYS+KRFVE TGKS ++W + H+G +FHYSSGMQA+MLA+GICDKVS
Sbjct: 313 DPRFDVLCARIVKYYSLKRFVEETGKSFDQWGAVHDGAMFHYSSGMQAIMLALGICDKVS 372

Query: 397 IFGFGKLASARHHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           +FGFGK  SA+HHYHTNQKAEL LHDY+AEY FY DL+ RPQ IPF+SDKFKIP  VLY+
Sbjct: 373 VFGFGKSDSAKHHYHTNQKAELRLHDYQAEYDFYRDLVERPQVIPFISDKFKIPPVVLYQ 430

BLAST of CmaCh20G001660 vs. NCBI nr
Match: gi|694375097|ref|XP_009364302.1| (PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1-like [Pyrus x bretschneideri])

HSP 1 Score: 573.2 bits (1476), Expect = 4.1e-160
Identity = 280/420 (66.67%), Positives = 329/420 (78.33%), Query Frame = 1

Query: 37  TYWFKIC-PANARFHLALPHNR-----SVAMKRTVRPVFSVLLFLTFAVTLICRLIFRRG 96
           T   KIC P +   HLA P N      ++ MKR+VRP+FS+LL + FA TL CR   R  
Sbjct: 13  TQSIKICTPIHFWPHLATPINPFNDLFTLPMKRSVRPLFSILLLIVFAATLSCRNAVRHS 72

Query: 97  LSFFEMETNV-ISPRSPAFMFNSTLLKFASVDLGEAQSKREIEQLLEGKFGGPRTYKTFA 156
              FE+E  V I P  P  +FN+TLL+FA+VD GEAQ+K+EIEQLLEG F     Y+TFA
Sbjct: 73  ---FELENKVLIQPSRP--VFNATLLRFAAVDAGEAQAKKEIEQLLEGNFASLGKYRTFA 132

Query: 157 TWRRFNHYDMKARPSNSFPVTFRSPAFYRHWLDFRRALSGWVRRKGFTTDIMPELVRLIK 216
           +WRRFNH+D++A+ S   PV  RSP FYR+WLDFRR LS W R K F  D+M +LVRL++
Sbjct: 133 SWRRFNHHDIRAKTSTGLPVMLRSPRFYRYWLDFRRVLSDWSRNKRFHADVMLDLVRLVR 192

Query: 217 APLDRHNGLVGSDQ-RYSSCAVVGNSGILLNSDYGKLIDSHEIVIRLNNAKTNKYESKVG 276
            P+DRHNGLVGS+Q RYSSCAVVGNSGILL S++G LIDSHE+VIRLNNA+   +  KVG
Sbjct: 193 NPIDRHNGLVGSEQRRYSSCAVVGNSGILLKSNHGALIDSHEVVIRLNNARIQSFSEKVG 252

Query: 277 SKTSVSFINSNILHLCARREGCFCHPYGPNVPTIMYICQPVHFMDYTVCNSSHKSPLLIT 336
           SKTS+SF+NSNILHLCARR+GCFCHPYG NVP IMYICQPVH  DYT+CN SHK+PLL+T
Sbjct: 253 SKTSISFVNSNILHLCARRDGCFCHPYGLNVPMIMYICQPVHLFDYTICNMSHKAPLLVT 312

Query: 337 DPSFDALCSRIVKYYSIKRFVEVTGKSLEEWSSAHEGPLFHYSSGMQAVMLAVGICDKVS 396
           D  FD LC+RIVKYYS+KRFVE TGKS EEW + H+G +FHYSSGMQA+MLA+GICDKVS
Sbjct: 313 DSRFDMLCARIVKYYSLKRFVEETGKSFEEWGAVHDGSMFHYSSGMQAIMLALGICDKVS 372

Query: 397 IFGFGKLASARHHYHTNQKAELGLHDYEAEYAFYYDLIARPQRIPFLSDKFKIPSTVLYR 449
           +FGFGK  SA+HHYHTNQKAEL LHDY AEYAFY+DL  RPQ IPFLSDKF IP  VLY+
Sbjct: 373 VFGFGKSDSAKHHYHTNQKAELRLHDYPAEYAFYHDLAERPQVIPFLSDKFNIPPVVLYQ 427
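
The percentages in each HSP header are the match counts divided by the alignment length (for example, 279 identical positions over 420 aligned columns gives 66.43%). The following is a minimal sketch of how those figures could be recomputed from a header line; the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Hypothetical helper, for illustration only: it matches every
# "<label> = <matches>/<length>" fraction found on an HSP header line.
FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return {label: percent} recomputed from the match/length fractions."""
    return {
        label: round(100.0 * int(matches) / int(length), 2)
        for label, matches, length in FRACTION.findall(line)
    }

header = "Identity = 279/420 (66.43%), Positives = 332/420 (79.05%), Query Frame = 1"
print(hsp_percentages(header))  # {'Identity': 66.43, 'Positives': 79.05}
```

Fields without a fraction, such as the query frame, are ignored, and each label is taken exactly as it is printed on the line.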

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
GT29A_ARATH  1.9e-125  56.27     Beta-1,6-galactosyltransferase GALT29A OS=Arabidopsis thaliana GN=GALT29A PE=1 S...
STLP1_ORYSI  4.3e-93   44.99     Sialyltransferase-like protein 1 OS=Oryza sativa subsp. indica GN=STLP1 PE=3 SV=...
STLP1_ORYSJ  4.3e-93   44.99     Sialyltransferase-like protein 1 OS=Oryza sativa subsp. japonica GN=STLP1 PE=2 S...
STLP3_ORYSJ  1.2e-82   52.43     Sialyltransferase-like protein 3 OS=Oryza sativa subsp. japonica GN=STLP3 PE=2 S...
STLP3_ORYSI  1.2e-82   52.43     Sialyltransferase-like protein 3 OS=Oryza sativa subsp. indica GN=STLP3 PE=3 SV=...

Match Name        E-value   Identity  Description
A0A0A0KDP9_CUCSA  7.1e-204  87.11     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G152940 PE=3 SV=1
B9H1R5_POPTR      6.3e-160  66.15     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s19490g PE=3 SV=1
A0A061GVE5_THECC  2.4e-159  64.80     Sialyltransferase, putative OS=Theobroma cacao GN=TCM_041444 PE=3 SV=1
M5VYM2_PRUPE      1.0e-157  68.46     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006961mg PE=3 SV=1
F6HHG9_VITVI      3.8e-157  69.45     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0080g00250 PE=3 SV=...

Match Name   E-value   Identity  Description
AT1G08280.1  1.1e-126  56.27     Glycosyltransferase family 29 (sialyltransferase) family protein
AT3G48820.1  1.9e-06   40.00     Glycosyltransferase family 29 (sialyltransferase) family protein
AT1G08660.1  9.4e-06   51.02     MALE GAMETOPHYTE DEFECTIVE 2

Match Name                        E-value   Identity  Description
gi|659117388|ref|XP_008458576.1|  1.5e-207  88.40     PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferas...
gi|449463074|ref|XP_004149259.1|  1.0e-203  87.11     PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferas...
gi|1009113279|ref|XP_015872656.1| 8.2e-161  68.45     PREDICTED: beta-1,6-galactosyltransferase GALT29A [Ziziphus jujuba]
gi|645218098|ref|XP_008228970.1|  3.1e-160  66.43     PREDICTED: uncharacterized protein LOC103328352 [Prunus mume]
gi|694375097|ref|XP_009364302.1|  4.1e-160  66.67     PREDICTED: CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferas...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001675   Glyco_trans_29
Vocabulary: Biological Process
Term        Definition
GO:0006486  protein glycosylation
Vocabulary: Molecular Function
Term        Definition
GO:0008373  sialyltransferase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0001574 ganglioside biosynthetic process
biological_process GO:0001575 globoside metabolic process
biological_process GO:0018146 keratan sulfate biosynthetic process
biological_process GO:0016266 O-glycan processing
biological_process GO:0097503 sialylation
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003836 beta-galactoside (CMP) alpha-2,3-sialyltransferase activity
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0008373 sialyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G001660.1  CmaCh20G001660.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                 Source   Source Term     Source Description   Alignment
IPR001675  Glycosyl transferase family 29  PFAM     PF00777         Glyco_transf_29      coord: 198..426; score: 4.8
None       No IPR available                PANTHER  PTHR13713       SIALYLTRANSFERASE    coord: 172..446; score: 9.7E
None       No IPR available                PANTHER  PTHR13713:SF54  SUBFAMILY NOT NAMED  coord: 172..446; score: 9.7E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh20G001660  CmaCh02G013270  Cucurbita maxima (Rimu)  cmacmaB470