CmoCh04G022600 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G022600
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Nucleotide-diphospho-sugar transferase family protein
Location: Cmo_Chr04 : 16895850 .. 16899451 (+)
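The location string above can be turned into a feature length. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for genome browser records but is not stated on this page; the helper names `parse_location` and `span_length` are illustrative only.

```python
import re


def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Cmo_Chr04 : 16895850 .. 16899451 (+)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr04 + 3602
```

Under that assumption the gene spans 3602 bp on the forward strand of Cmo_Chr04.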
The following sequences are available for this feature:

Gene sequence (with intron)

ATTTGCAGAAGTTTCAGTACGAAATGCTCCTCCATTAATGGCGGCTGTGGCTGTGGCTCCCACAAGCTTTTCAATTCTCTGTATTTGGCATCTCTCTCTCTCTAGAACGCGTATGAACGGTTGATGCTGCTAATTTTTCTGCTCTTTTCTTTGTAATGTCTATGATCGGTGCATGTAATGTCTATGATCGGCACTTGTATTGTCTATGATCGTTTCCGCCATGCTTGGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACTACAGCGAGCCTTTTGCGGTGAGTTAACTCGGTTTCCTTATTTTTACCTTCCGTTCGTGCTTTGGTTTATTAATTCATTAGTTTTTGTGTTTTTATGTTATTTTGGTTTGTGGCTATCATTCCAGTTTGCTTAATCGTGTTTTTTGCTTCTCGTTCGTTTCGAATCGCGAGTTTGAATTTCATTGATGTGAAAGTGGATTACAGTGAATAATCCTTGATGTTTAGGGATGGGGTGTAATGAACACGAATATGGAAATAATTGTTCCTCGTTGATGATGCCCGAAAATATAGAAACGATTATGGCATACCTTTCTAGCTGAAGTTTTGATTATCTTTTATACGGCTTGCATGTCTGCACTCTGTGTGTTCATCTATTCCTCATCCTCTCTGTCTTTGAAGCATGAAAGCAGAGTTGATTTTGTTTTCTTCTTTGAATTTGAACCTTCAAATCTTTGCTCCATTCTGCTAATTTTGCTTTTCTCTGGGTCTGTTAGGGAATGTGCTAATGCTATTTAGGTTGTCTAATATTGCTATCTTCGAAGCTTTCTTTGTTTTCCTAGTTAAAATTCTAGCATTTTAGTTTGAAATAGTCTACTTCTTGTCCATTTTCTTCCTTATTTTACTTAAGAGTTGCGTGATTTTTCATGAAACCAAGGAAATGATGTTATGTTGTGAATCTCTGAAGAATAAGAAGCTTAATGTTATTTTGGAAGATGAATTTCCTTGCCAGAAGAACATGCATATAGTAGCACAGTGTGGCTGATATGAACAATGTACAGAAATGTAGATAGACTTTCTCTACTTGTATATGTTGAGGATTGTTGGGAGAGAGTCCCACGTTGGCTAATTTAGGGAATAATCATGGGTTTATAAGTAAGGAAAACATCTCTATTGGTAGAGGCCTTTTGGGGAAGCCCATAGCAAAGCCATAAAACTTATGCTCAAAGTGGACAATATCATATGATTGTGGAGAGTCGTGATTCCTAACAATATACGCTTCTTATTTAATTACAACTTGCTTGTAAACTATTATGCTCCGAGTATTGAACTGTTATGGATTGGGGTTTTCTTTCATGCTTAAATTTATACTTTTAGTACCCCTTTTGTCTGGTTATAGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAGGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCCTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATTCATATTCATTGCTTTGCTCTTGTCACTGAAGGAGTTGATTTTCATTCAGAGGCACAGTTTATGACACCCGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTTGTTTTTACGGTACCATACTCTCTCCTTTCACCTTGAGATTGTTGCTTCCTTAGTTTTTAATAACATCAATACTTCGTTTTTCAATTTCCTTTTAAAGGTTGAGATAAGTATTAT
CGTGTTGAGGTAAATCATTTACTACAAGGCAACTGCATATGGACGTTGACCTTCATGACCGATATGGACAACATGTATCAGGGTGTCTTCTGTCTTATTTGGGTATCCAACATTTACCTGGTTGTCTCGAGTCTAAAGTGTTTGACGACATACACAATATCGAAACCTAGTGTTTGTGCTTCATAACTATATGACATGACAAATATTGAATAGGTATCAAACTGGATGACAATGTATCTTTTTCTGATTAAGAAGTCCATGTTCCAATGGATGGACCTATTCTCTATGGGAGTTCAGGGAGAGGCTTCCAGGCAAGGAAAGATTCCTTTATTGATTATATCTCACCTCTGCTCCACAAGCTCATTCTAAATTAGGAATTCAAGGTCACTCCAGTGACCACTTTGATGTCTTCCTCATTCGCCGTGATAGTGCGACAACACAAGCTTCGAAAGGGAATTAGGCTAAAATTCTTGAACGTGCTTGGGTGGTTTCTCGAGGGGAAATATCTTTTTTGGATGGGGGAATTTCCTTCATCAATCAAATGTCACCTCTCCTCCCCAGGTTGATTAAGGAATTAGGGTCAATCCATTGCCCACTTGGTTGTCTTCCGCCTCCACTATGAACAGTAACTCCCATATTGCAAAACATTGAAGCCTAACCATGTTCTCCCTGGCCTTTCATTTCAGTATCAAATTTCTGCTTACGGTAGCAAGACGTGCCTTTCCTAAATTTTTGTGGTTTGAAATCTGCTATTTAGCTGATCAACATCTCTTATACTTACACAACTCTTGGCTATTTACCAATTACTTTAGGATGTATCAGTTTTTGCTTCCATGTATGCTGAAGGATGGCAAGGATACTTAGATCAAGATATAATATTTTTAGTTTTTATTTCTGTATTTCTTCTAATTCCGCTACTCTTGTTTCTTCCTTCAGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAAGATTTAAGCAACAGACCAAATGGAGGGTTTAACTATGTGAAATCCAATAATCGTTCAATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAACAAGATCAAATTCGAACCTTTCATCGATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAAGTAAGCTTCATGATCTTAGAATTATGCTTGAGGATTGGAAACGATACATGTCTATGCCACCATATGTTAAGGGATCAACAAGTTCAGTTTGGAGAGTTCCTCAGTACTGCAGGTTTGGTAATATTTCTAGTATTAGGAGTTCCTAGTTAAATATAGTTCTGGTTTAGGGAGTCCTATTCTTTAGGACTTATTAGTCTATTCTTTGCTGCTTAAGTTTTTTTTTTCTGATGATATTTTTAGGGTACATTGTAAAGGCTATATATAGCCTGCTTACCATTAATAAAACTGAATAAGAT

mRNA sequence

ATTTGCAGAAGTTTCAGTACGAAATGCTCCTCCATTAATGGCGGCTGTGGCTGTGGCTCCCACAAGCTTTTCAATTCTCTGTATTTGGCATCTCTCTCTCTCTAGAACGCGTATGAACGGTTGATGCTGCTAATTTTTCTGCTCTTTTCTTTGTAATGTCTATGATCGGTGCATGTAATGTCTATGATCGGCACTTGTATTGTCTATGATCGTTTCCGCCATGCTTGGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACTACAGCGAGCCTTTTGCGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAGGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCCTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATTCATATTCATTGCTTTGCTCTTGTCACTGAAGGAGTTGATTTTCATTCAGAGGCACAGTTTATGACACCCGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTTGTTTTTACGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAAGATTTAAGCAACAGACCAAATGGAGGGTTTAACTATGTGAAATCCAATAATCGTTCAATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAACAAGATCAAATTCGAACCTTTCATCGATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAAGTAAGCTTCATGATCTTAGAATTATGCTTGAGGATTGGAAACGATACATGTCTATGCCACCATATGTTAAGGGATCAACAAGTTCAGTTTGGAGAGTTCCTCAGTACTGCAGGTTTGGTAATATTTCTAGTATTAGGAGTTCCTAGTTAAATATAGTTCTGGTTTAGGGAGTCCTATTCTTTAGGACTTATTAGTCTATTCTTTGCTGCTTAAGTTTTTTTTTTCTGATGATATTTTTAGGGTACATTGTAAAGGCTATATATAGCCTGCTTACCATTAATAAAACTGAATAAGAT

Coding sequence (CDS)

ATGATCGTTTCCGCCATGCTTGGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCCTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACTACAGCGAGCCTTTTGCGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAGGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCCTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATTCATATTCATTGCTTTGCTCTTGTCACTGAAGGAGTTGATTTTCATTCAGAGGCACAGTTTATGACACCCGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTTGTTTTTACGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAAGATTTAAGCAACAGACCAAATGGAGGGTTTAACTATGTGAAATCCAATAATCGTTCAATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAACAAGATCAAATTCGAACCTTTCATCGATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAAGTAAGCTTCATGATCTTAGAATTATGCTTGAGGATTGGAAACGATACATGTCTATGCCACCATATGTTAAGGGATCAACAAGTTCAGTTTGGAGAGTTCCTCAGTACTGCAGGTTTGGTAATATTTCTAGTATTAGGAGTTCCTAG
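As a quick consistency check between the CDS and the protein used as the BLAST query, the first 60 nucleotides of the CDS can be translated with the standard genetic code. This is a minimal sketch: the `translate` helper is illustrative, it assumes the standard (nuclear) codon table, and it simply ignores any trailing partial codon.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop an incomplete final codon, if any
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nt of the CDS shown above.
cds_start = "ATGATCGTTTCCGCCATGCTTGGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAG"
print(translate(cds_start))  # MIVSAMLGYSSSFSFRRTLQ
```

The result agrees with the query residues shown in the BLAST alignments on this page (for example, the TAIR10 AT4G19970.1 alignment starts at query position 2 with `IVSAMLGYSSSFSFRRTLQ`).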
BLAST of CmoCh04G022600 vs. Swiss-Prot
Match: Y4597_ARATH (Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 2.1e-85
Identity = 150/277 (54.15%), Positives = 195/277 (70.40%), Query Frame = 1

Query: 82  LDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHLVIIALDKKA 141
           L  +L +AATED+TVI+TTLN+AW+ PNS  DLFL S  +G  T  LL HLV+  LD++A
Sbjct: 42  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 101

Query: 142 FVRCLDIHIH-CFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDAD 201
           + RC ++H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L++ YNF+FT   
Sbjct: 102 YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 161

Query: 202 VMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSRE 261
                 PFP      DFQIACD+Y G  +D+ N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 162 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFAFVKANQRTIDFYNYWYMSRL 221

Query: 262 TYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLK 321
            YP  HDQDVL++IK   +   IGLK+RFLDT YFGGFCEPS+DL++V TMHANCCVGL+
Sbjct: 222 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 281

Query: 322 SKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           +K+ DLR ++ DW+ Y+S      G   + WR P+ C
Sbjct: 282 NKIKDLRQVIVDWENYVSAAKTTDGQIMT-WRDPENC 309
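The percentages on the HSP statistics line are the match counts divided by the alignment length. The sketch below recomputes them from the `matches/length (percent%)` pairs on that line; the function name `check_percentages` and the regular expression are illustrative and are not part of any BLAST tool.

```python
import re

# Matches fragments such as "150/277 (54.15%)".
STAT = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")


def check_percentages(line):
    """Return (matches, length, reported %, recomputed %) for each pair."""
    rows = []
    for matches, length, reported in STAT.findall(line):
        matches, length = int(matches), int(length)
        rows.append((matches, length, float(reported), 100.0 * matches / length))
    return rows


line = "Identity = 150/277 (54.15%), Positives = 195/277 (70.40%), Query Frame = 1"
for matches, length, reported, recomputed in check_percentages(line):
    print(f"{matches}/{length}: reported {reported:.2f}%, recomputed {recomputed:.2f}%")
```

For this HSP, 150 identical positions and 195 positive-scoring positions over an alignment length of 277 reproduce the reported 54.15% and 70.40%.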

BLAST of CmoCh04G022600 vs. Swiss-Prot
Match: Y1869_ARATH (Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1)

HSP 1 Score: 203.8 bits (517), Expect = 3.3e-51
Identity = 109/279 (39.07%), Positives = 163/279 (58.42%), Query Frame = 1

Query: 89  AATEDRTVILTTLNQAWASP----NSVIDLFLESLRIGNRTHQLLNHLVIIALDKKAFVR 148
           AA  ++TVI+T +N+A+       ++++DLFLES   G  T  LL+HL+++A+D+ A+ R
Sbjct: 53  AAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAVDQTAYDR 112

Query: 149 CLDIHIHCFALVTE-GVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADVMW 208
           C    +HC+ + TE GVD   E  FM+ D+++MMWRR   +  VL  GYN +FTD DVMW
Sbjct: 113 CRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIFTDTDVMW 172

Query: 209 FRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSRETYP 268
            R P    +M+ D QI+ D+ + +   L N    GF +V+SNN++I  ++ WY  R    
Sbjct: 173 LRSPLSRLNMSLDMQISVDR-INVGGQLINT---GFYHVRSNNKTISLFQKWYDMRLNST 232

Query: 269 KYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLKSKL 328
              +QDVL  +    F + +GL + FL T  F GFC+ S  +  V T+HANCC+ + +K+
Sbjct: 233 GMKEQDVLKNLLDSGFFNQLGLNVGFLSTTEFSGFCQDSPHMGVVTTVHANCCLHIPAKV 292

Query: 329 HDLRIMLEDWKRYMS------MPPYVKGSTSSVWRVPQY 357
            DL  +L DWKRY +        P++K S S  W    Y
Sbjct: 293 FDLTRVLRDWKRYKASHVNSKWSPHLKCSRS--WNDTHY 325

BLAST of CmoCh04G022600 vs. TrEMBL
Match: A0A0A0KPV2_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 1.8e-160
Identity = 278/345 (80.58%), Positives = 308/345 (89.28%), Query Frame = 1

Query: 14  SFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDD-DYSE 73
           SFR +  I LLF AISLSCLV+LREL+SLR FPLFS +T S   P   FL SL   D+  
Sbjct: 6   SFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPHHDHLS 65

Query: 74  PFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHL 133
           P  +ADE+GLD VL+DAATED+TVILTTLN+AWASPN+VIDLFL+S RIGNRTHQLL+HL
Sbjct: 66  P--EADEYGLDKVLKDAATEDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHL 125

Query: 134 VIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGY 193
           VIIALDKKAF+RCLDIHIHC +LVTEGVDF SEA FM+PDYLKMMWRRIDFLRTVLE+GY
Sbjct: 126 VIIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGY 185

Query: 194 NFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFY 253
           NFVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+DL NRPNGGFNYVKSNNRSIEFY
Sbjct: 186 NFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFY 245

Query: 254 KYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 313
           KYWYS+RETYP YHDQDVLN+IK++ FI++IGLKIRFLDTAYFGGFCEPSKDLNRVLTMH
Sbjct: 246 KYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 305

Query: 314 ANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           ANCC+G+ SKLHDLRI+LEDWK YMSMPPY+K S+   WRVPQ C
Sbjct: 306 ANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSIQSWRVPQNC 348

BLAST of CmoCh04G022600 vs. TrEMBL
Match: A0A0B2RYA6_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 1.0e-131
Identity = 233/352 (66.19%), Positives = 274/352 (77.84%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLQESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+PP +K  +   WRVPQ C
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRLSVVSWRVPQKC 347

BLAST of CmoCh04G022600 vs. TrEMBL
Match: K7LLW7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 1.0e-131
Identity = 233/352 (66.19%), Positives = 274/352 (77.84%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+PP +K  +   WRVPQ C
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRLSVVSWRVPQKC 347

BLAST of CmoCh04G022600 vs. TrEMBL
Match: A0A0R0I0C9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1)

HSP 1 Score: 475.3 bits (1222), Expect = 6.6e-131
Identity = 232/350 (66.29%), Positives = 273/350 (78.00%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQ 356
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+PP +K  +   WRVPQ
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRLSVVSWRVPQ 345

BLAST of CmoCh04G022600 vs. TrEMBL
Match: I1NFB3_SOYBN (Glycosyltransferase OS=Glycine max GN=GLYMA_20G109100 PE=3 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 2.4e-128
Identity = 227/352 (64.49%), Positives = 272/352 (77.27%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  + +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRKAKTIPLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L +AA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNEAAMKDRTVILTTLNEAWAAPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              L+HLVIIALD+KAF RC  IH +CF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLDHLVIIALDQKAFARCQVIHTYCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP F ++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPLFHLDADFQIACDHFTGRFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+P  +K  +   WRVPQ C
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPTSLKRLSVVSWRVPQKC 347

BLAST of CmoCh04G022600 vs. TAIR10
Match: AT1G14590.1 (AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 434.9 bits (1117), Expect = 5.0e-122
Identity = 214/341 (62.76%), Positives = 257/341 (75.37%), Query Frame = 1

Query: 23  LLFAAISLSCLVVLRELDSLR-DFPLFSLTTFSDSSPASLFLPSLDDDYSEPFADADEFG 82
           L  AAIS+SC V+ R  DSL    P+F L+++ D+                     +E  
Sbjct: 46  LFLAAISISCFVLYRAADSLSFSPPIFDLSSYLDN---------------------EEPK 105

Query: 83  LDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHLVIIALDKKA 142
           L++VL  AAT DRTV+LTTLN AWA+P SVIDLF ES RIG  T Q+L+HLVI+ALD KA
Sbjct: 106 LEDVLSKAATRDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALDAKA 165

Query: 143 FVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADV 202
           + RCL++H HCF+LVTEGVDF  EA FMT  YLKMMWRRID LR+VLE+GYNFVFTDADV
Sbjct: 166 YSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDADV 225

Query: 203 MWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSRET 262
           MWFR+PFP F M ADFQIACD YLG   DL NRPNGGFN+V+SNNR+I FYKYWY+SR  
Sbjct: 226 MWFRNPFPRFYMYADFQIACDHYLGRSNDLHNRPNGGFNFVRSNNRTILFYKYWYASRLR 285

Query: 263 YPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLKS 322
           +P YHDQDVLN +K EPF+  IGLK+RFL+TAYFGG CEPS+DLN V TMHANCC G++S
Sbjct: 286 FPGYHDQDVLNFLKAEPFVFRIGLKMRFLNTAYFGGLCEPSRDLNLVRTMHANCCYGMES 345

Query: 323 KLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNI 363
           KLHDLRIML+DWK +MS+P ++K S+   W+VPQ C   ++
Sbjct: 346 KLHDLRIMLQDWKDFMSLPLHLKQSSGFSWKVPQNCSLDSL 365

BLAST of CmoCh04G022600 vs. TAIR10
Match: AT2G02061.1 (AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 401.0 bits (1029), Expect = 8.0e-112
Identity = 198/320 (61.88%), Positives = 233/320 (72.81%), Query Frame = 1

Query: 56  SSPASLFLPSLDDDYSEPFA-------DADEFGLDNVLRDAATEDRTVILTTLNQAWASP 115
           SS  S   PS++D  S P         + +E  L+ VLR AAT+D TVILTTLN+AWA+P
Sbjct: 75  SSTLSRIFPSVNDSSSSPSPSPSLSPEEIEEPKLEEVLRRAATKDGTVILTTLNEAWAAP 134

Query: 116 NSVIDLFLESLRIGNRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHS-EAQ 175
            SVIDLF ES RIG  T +LL HLVIIALD KA+ RC ++H HCF L TEGVDF   EA 
Sbjct: 135 GSVIDLFFESFRIGKGTRRLLKHLVIIALDAKAYSRCQELHKHCFRLETEGVDFSGGEAY 194

Query: 176 FMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGI 235
           FMTP YL MMWRRI FLR+VLE GYNFVFTDADVMWFR+PF  F  + DFQIACD Y+G 
Sbjct: 195 FMTPSYLTMMWRRISFLRSVLEKGYNFVFTDADVMWFRNPFRRFYEDGDFQIACDHYIGR 254

Query: 236 PEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKI 295
           P D  NRPNGGF +V++NNRSI FYK+WY SR  YPK HDQDVLN IK +PF+  + ++I
Sbjct: 255 PNDFRNRPNGGFTFVRANNRSIGFYKFWYDSRTKYPKNHDQDVLNFIKTDPFLWKLRIRI 314

Query: 296 RFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGST 355
           RFL+T YFGGFCEPSKDLN V TMHANCC GL SKLHDLRIML+DW+ + S+P +   S+
Sbjct: 315 RFLNTVYFGGFCEPSKDLNLVCTMHANCCFGLDSKLHDLRIMLQDWRDFKSLPLHSNQSS 374

Query: 356 SSVWRVPQYCRFGNISSIRS 368
              W VPQ C   ++  + S
Sbjct: 375 GFTWSVPQNCSLDSLRPVDS 394

BLAST of CmoCh04G022600 vs. TAIR10
Match: AT5G44820.1 (AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 374.8 bits (961), Expect = 6.2e-104
Identity = 177/338 (52.37%), Positives = 236/338 (69.82%), Query Frame = 1

Query: 20  QIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDYSEPFADADE 79
           +I +LF  ++ SCLV+ +    L+   + +LT+   +SP+ L LP+L+     P     +
Sbjct: 32  RILILFLGLTASCLVLYKTAYPLQRLNVSNLTSLQ-ASPSPL-LPNLNSSEISPETTKPK 91

Query: 80  FGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHLVIIALDK 139
                +L +A+T++ TVI+TTLNQAWA PNS+ DLFLES RIG  T QLL H+V++ LD 
Sbjct: 92  LSFKEILENASTKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGTQQLLKHVVVVCLDI 151

Query: 140 KAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDA 199
           KAF RC  +H +C+ + T   DF  E  + TPDYLKMMW RID L  VLE+G+NF+FTDA
Sbjct: 152 KAFERCSQLHTNCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLTQVLEMGFNFIFTDA 211

Query: 200 DVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSR 259
           D+MW RDPFP    + DFQ+ACD++ G P D  N  NGGF YV+SNNRSIEFYK+W+ SR
Sbjct: 212 DIMWLRDPFPRLYPDGDFQMACDRFFGNPYDSDNWVNGGFTYVRSNNRSIEFYKFWHKSR 271

Query: 260 ETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGL 319
             YP  HDQDV N+IK EPFI +IG+++RF DT YFGGFC+ S+D+N V TMHANCC+GL
Sbjct: 272 LDYPDLHDQDVFNRIKHEPFISEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCIGL 331

Query: 320 KSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
             KLHDL ++L+DW++Y+S+   V+ +T   W VP  C
Sbjct: 332 DKKLHDLNLVLDDWRKYLSLSEPVQNTT---WSVPMKC 364

BLAST of CmoCh04G022600 vs. TAIR10
Match: AT4G19970.1 (AT4G19970.1 Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069))

HSP 1 Score: 366.3 bits (939), Expect = 2.2e-101
Identity = 182/357 (50.98%), Positives = 237/357 (66.39%), Query Frame = 1

Query: 2   IVSAMLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASL 61
           I  + L Y S+   +   +I +L   ++ +CL++ +       +PL      ++ S    
Sbjct: 371 IPPSFLDYGSAIGQKEVKKILVLVLGLA-ACLLLYKTA-----YPLHQELDVNNLSSR-- 430

Query: 62  FLPSLDD-DYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLR 121
             P LD    S P   +       VL +A+TE+RTVI+TTLNQAWA PNS+ DLFLES R
Sbjct: 431 --PLLDHTSSSSPLTRSKSISFREVLENASTENRTVIVTTLNQAWAEPNSLFDLFLESFR 490

Query: 122 IGNRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRR 181
           IG  T +LL H+V++ LD KAF RC  +H +C+ L T G DF  E  F TPDYLKMMWRR
Sbjct: 491 IGQGTKKLLQHVVVVCLDSKAFARCSQLHPNCYYLKTTGTDFSGEKLFATPDYLKMMWRR 550

Query: 182 IDFLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFN 241
           I+ L  VLE+GYNF+FTDAD+MW RDPFP    + DFQ+ACD++ G P D  N  NGGF 
Sbjct: 551 IELLTQVLEMGYNFIFTDADIMWLRDPFPRLYPDGDFQMACDRFFGDPHDSDNWVNGGFT 610

Query: 242 YVKSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCE 301
           YVKSN+RSIEFYK+WY+SR  YPK HDQDV N+IK +  + +IG+++RF DT YFGGFC+
Sbjct: 611 YVKSNHRSIEFYKFWYNSRLDYPKMHDQDVFNQIKHKALVSEIGIQMRFFDTVYFGGFCQ 670

Query: 302 PSKDLNRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
            S+D+N V TMHANCCVGL  KLHDL ++L+DW+ Y+S+   VK +T   W VP  C
Sbjct: 671 TSRDINLVCTMHANCCVGLAKKLHDLNLVLDDWRNYLSLSEPVKNTT---WSVPMKC 714

BLAST of CmoCh04G022600 vs. TAIR10
Match: AT4G15970.1 (AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 318.2 bits (814), Expect = 6.8e-87
Identity = 150/277 (54.15%), Positives = 195/277 (70.40%), Query Frame = 1

Query: 82  LDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHLVIIALDKKA 141
           L  +L +AATED+TVI+TTLN+AW+ PNS  DLFL S  +G  T  LL HLV+  LD++A
Sbjct: 33  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 92

Query: 142 FVRCLDIHIH-CFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDAD 201
           + RC ++H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L++ YNF+FT   
Sbjct: 93  YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 152

Query: 202 VMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSSRE 261
                 PFP      DFQIACD+Y G  +D+ N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 153 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFTFVKANQRTIDFYNYWYMSRL 212

Query: 262 TYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLK 321
            YP  HDQDVL++IK   +   IGLK+RFLDT YFGGFCEPS+DL++V TMHANCCVGL+
Sbjct: 213 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 272

Query: 322 SKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           +K+ DLR ++ DW+ Y+S      G   + WR P+ C
Sbjct: 273 NKIKDLRQVIVDWENYVSAAKTTDGQIMT-WRDPENC 300

BLAST of CmoCh04G022600 vs. NCBI nr
Match: gi|659072690|ref|XP_008466761.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis melo])

HSP 1 Score: 574.3 bits (1479), Expect = 1.5e-160
Identity = 278/347 (80.12%), Positives = 309/347 (89.05%), Query Frame = 1

Query: 13  FSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSL--DDDY 72
           +SFR +L I LLF AISLSCLV+LREL+SLR FPLFS +T S   P   F  SL  DDD 
Sbjct: 5   YSFRCSLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTPSGPPPVPPFFLSLPHDDDL 64

Query: 73  SEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLN 132
           S     ADE+GLD VL+DAATED+TVILTTLN+AWA+PN+VIDLFL+S RIGN+THQLL+
Sbjct: 65  SL----ADEYGLDKVLKDAATEDKTVILTTLNEAWAAPNAVIDLFLQSFRIGNQTHQLLD 124

Query: 133 HLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEI 192
           HLVIIALDKKAF+RCLDIH+HC ALVTEGVDF SEA FM+PDYLKMMWRRIDFLRTVLE+
Sbjct: 125 HLVIIALDKKAFMRCLDIHVHCVALVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEM 184

Query: 193 GYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIE 252
           GYNFVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+DL NRPNGGFNYVKSNNRSIE
Sbjct: 185 GYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIE 244

Query: 253 FYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLT 312
           FYKYWYS+RETYP YHDQDVLN+IK++ FI++IGLKIRFLDTAYFGGFCEPSKDLNRVLT
Sbjct: 245 FYKYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLT 304

Query: 313 MHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           MHANCC+G+ SKLHDLRI+LEDWK YMSMPPY+K S+   WRVPQ C
Sbjct: 305 MHANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSIQSWRVPQNC 347

BLAST of CmoCh04G022600 vs. NCBI nr
Match: gi|449463499|ref|XP_004149471.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus])

HSP 1 Score: 573.5 bits (1477), Expect = 2.6e-160
Identity = 278/345 (80.58%), Positives = 308/345 (89.28%), Query Frame = 1

Query: 14  SFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDD-DYSE 73
           SFR +  I LLF AISLSCLV+LREL+SLR FPLFS +T S   P   FL SL   D+  
Sbjct: 6   SFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPHHDHLS 65

Query: 74  PFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHL 133
           P  +ADE+GLD VL+DAATED+TVILTTLN+AWASPN+VIDLFL+S RIGNRTHQLL+HL
Sbjct: 66  P--EADEYGLDKVLKDAATEDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHL 125

Query: 134 VIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGY 193
           VIIALDKKAF+RCLDIHIHC +LVTEGVDF SEA FM+PDYLKMMWRRIDFLRTVLE+GY
Sbjct: 126 VIIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGY 185

Query: 194 NFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFY 253
           NFVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+DL NRPNGGFNYVKSNNRSIEFY
Sbjct: 186 NFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFY 245

Query: 254 KYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 313
           KYWYS+RETYP YHDQDVLN+IK++ FI++IGLKIRFLDTAYFGGFCEPSKDLNRVLTMH
Sbjct: 246 KYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 305

Query: 314 ANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           ANCC+G+ SKLHDLRI+LEDWK YMSMPPY+K S+   WRVPQ C
Sbjct: 306 ANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSIQSWRVPQNC 348

BLAST of CmoCh04G022600 vs. NCBI nr
Match: gi|955347631|ref|XP_014618915.1| (PREDICTED: uncharacterized protein At4g15970-like isoform X1 [Glycine max])

HSP 1 Score: 482.6 bits (1241), Expect = 6.0e-133
Identity = 237/362 (65.47%), Positives = 279/362 (77.07%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNISSI 365
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+PP +K  +   WRVPQ CR   + SI
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRLSVVSWRVPQKCRLKFLLSI 357

Query: 366 RS 368
            S
Sbjct: 361 ES 357

BLAST of CmoCh04G022600 vs. NCBI nr
Match: gi|1009152804|ref|XP_015894295.1| (PREDICTED: uncharacterized protein At4g15970 [Ziziphus jujuba])

HSP 1 Score: 478.4 bits (1230), Expect = 1.1e-131
Identity = 230/344 (66.86%), Positives = 275/344 (79.94%), Query Frame = 1

Query: 19  LQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDYSEPFADAD 78
           L+  L  A + L+ LV+LR+ +SLR    F+       SP++ F PS    Y       +
Sbjct: 45  LRRLLFLAVVLLAGLVLLRDTESLRFLSRFT------PSPSAYFFPS--SSYLPQSVSEE 104

Query: 79  EFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRTHQLLNHLVIIALD 138
           E+ L+ VL+DAA  D+TVILTTLN+AWA+PNSV+DLFLES RIG+RT +LLNHLVIIALD
Sbjct: 105 EYTLEKVLKDAAMADKTVILTTLNEAWAAPNSVVDLFLESFRIGDRTSRLLNHLVIIALD 164

Query: 139 KKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTD 198
            KAF RCLD+H HCF LV+EG+DFH EA FMTP YLKMMWRRIDFLR+VLE+GYNFVFTD
Sbjct: 165 MKAFKRCLDVHRHCFFLVSEGIDFHQEAYFMTPAYLKMMWRRIDFLRSVLEMGYNFVFTD 224

Query: 199 ADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSNNRSIEFYKYWYSS 258
           AD+MWFRDPFP F ++ADFQIACD +LG P+D++N PNGGFNYVKSNN+SIEFYK+WY+S
Sbjct: 225 ADIMWFRDPFPRFYLDADFQIACDHFLGSPDDVNNIPNGGFNYVKSNNQSIEFYKFWYAS 284

Query: 259 RETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVG 318
           + TYP YHDQDVLN IKF PFID+IGLK+RFLDTAYFGG CEPSKDLN V TMHANCC G
Sbjct: 285 QHTYPGYHDQDVLNIIKFHPFIDEIGLKMRFLDTAYFGGLCEPSKDLNEVCTMHANCCFG 344

Query: 319 LKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYCRFGNI 363
           L SKLHDLRIML+DWK++MS+ P +K S+   WRVPQ C   ++
Sbjct: 345 LNSKLHDLRIMLQDWKKFMSLQPSLKRSSFISWRVPQNCSLDSL 380

BLAST of CmoCh04G022600 vs. NCBI nr
Match: gi|734414577|gb|KHN37408.1| (Hypothetical protein glysoja_013623 [Glycine soja])

HSP 1 Score: 478.0 bits (1229), Expect = 1.5e-131
Identity = 233/352 (66.19%), Positives = 274/352 (77.84%), Query Frame = 1

Query: 6   MLGYSSSFSFRRTLQIFLLFAAISLSCLVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSCLV+L ++DS R      L++F  S   S F   
Sbjct: 1   MLQESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDYSEPFADADEFGLDNVLRDAATEDRTVILTTLNQAWASPNSVIDLFLESLRIGNRT 125
               Y++P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLES RIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAQFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEDLSNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  +D+ NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKFEPFIDDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGLKSKLHDLRIMLEDWKRYMSMPPYVKGSTSSVWRVPQYC 358
           N+V TMHANCC+G+ SKLHDLRIML+DWK Y+S+PP +K  +   WRVPQ C
Sbjct: 301 NQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRLSVVSWRVPQKC 347

The following BLAST results are available for this feature:

Swiss-Prot
Match Name        E-value    Identity (%)  Description
Y4597_ARATH       2.1e-85    54.15         Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1
Y1869_ARATH       3.3e-51    39.07         Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1

TrEMBL
Match Name        E-value    Identity (%)  Description
A0A0A0KPV2_CUCSA  1.8e-160   80.58         Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1
A0A0B2RYA6_GLYSO  1.0e-131   66.19         Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1
K7LLW7_SOYBN      1.0e-131   66.19         Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1
A0A0R0I0C9_SOYBN  6.6e-131   66.29         Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1
I1NFB3_SOYBN      2.4e-128   64.49         Glycosyltransferase OS=Glycine max GN=GLYMA_20G109100 PE=3 SV=1

TAIR10
Match Name        E-value    Identity (%)  Description
AT1G14590.1       5.0e-122   62.76         Nucleotide-diphospho-sugar transferase family protein
AT2G02061.1       8.0e-112   61.88         Nucleotide-diphospho-sugar transferase family protein
AT5G44820.1       6.2e-104   52.37         Nucleotide-diphospho-sugar transferase family protein
AT4G19970.1       2.2e-101   50.98         Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR00506...
AT4G15970.1       6.8e-87    54.15         Nucleotide-diphospho-sugar transferase family protein

NCBI nr
Match Name                         E-value    Identity (%)  Description
gi|659072690|ref|XP_008466761.1|   1.5e-160   80.12         PREDICTED: uncharacterized protein At4g15970 [Cucumis melo]
gi|449463499|ref|XP_004149471.1|   2.6e-160   80.58         PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus]
gi|955347631|ref|XP_014618915.1|   6.0e-133   65.47         PREDICTED: uncharacterized protein At4g15970-like isoform X1 [Glycine max]
gi|1009152804|ref|XP_015894295.1|  1.1e-131   66.86         PREDICTED: uncharacterized protein At4g15970 [Ziziphus jujuba]
gi|734414577|gb|KHN37408.1|        1.5e-131   66.19         Hypothetical protein glysoja_013623 [Glycine soja]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005069  Nucl-diP-sugar_transferase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016310 phosphorylation
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0071555 cell wall organization
cellular_component GO:0005575 cellular_component
cellular_component GO:0000139 Golgi membrane
molecular_function GO:0016301 kinase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G022600.1  CmoCh04G022600.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                         Source   Source Term      Source Description  Alignment
IPR005069  Nucleotide-diphospho-sugar transferase  PFAM     PF03407          Nucleotid_trans     coord: 128..326, score: 1.3
None       No IPR available                        PANTHER  PTHR24015        FAMILY NOT NAMED    coord: 66..168, score: 1.4E-43; coord: 206..229, score: 1.4
None       No IPR available                        PANTHER  PTHR24015:SF419  NUCLEOTIDE-DIPHOSPHO-SUGAR TRANSFERASE DOMAIN-CONTAINING PROTEIN-RELATED  coord: 66..168, score: 1.4E-43; coord: 206..229, score: 1.4