Cp4.1LG01g18710 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g18710
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGlycosyltransferase
LocationCp4.1LG01 : 16081061 .. 16089588 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TATTGCAAAAGGTGGGTCCAACTTAGATTTAGTAGGTGACAGATGAAGTGGGTTGCAAATTTGAAACAGTAATCTGTTCCATTCTGATTAGCGGACAGAGAAGGAAAATGGAGAAGATAAGTGCGTTCGTTTATGGTAGTAATGCTAACAATTTGCTGAAGTTTCAGTACAAAATCCTCCTCCATTAATGGCGGCTGTGGCTGTGGCTCCCACAAGCTTTACAGTTCTCTGTATTTGGCATCTCTCTCTCTCTAGAACGCGTATGAACGGTTGATGCTGCTAATTTTTCTGCTCTTTTCTTTGTAAATGTCTATGATCGGTGCATGTAATGTCTATGATCGGTACTTGTATTGTCTATGATCGTTTCCGCCATGCTTAGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCGTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACGACAACGAGCCTTTTGCGGTGAGTTAACTCGGTTTCCTTATTTTTACCTTCCGTTCGTGCTTTGGTTTATTAATTCGTTAGTTTTTGTGTTTTTTTTGTTATTTTGGTTTGTGGCTATCATTCCAGTTTGCTTAATCGTGTTTTTTGCTTCTCGTTCGTTTCGAATCGCGAGTTTGAATTTCATTGATGTGAAAGTGGATTACAATGAATAATCCTTGATGTTGAGGGATGGGGTGTAATGAACACGAATATGGAAATAATTGTTCCTCGTTGATGATGCCCGAAAATATAGAAACGATTGTGACATACCTGTCCAGCTGAAGTTTTGATTATCTTTTATACGGCTTGCATGTCTGCACTCTGTGTGTTCATCTATTCCTCATCCTCTCTGTCTTTCAAGCATGAAAGCAGAGTTGATTTTGTTTTCTTCTTTGAATTTGAACCTTCAAATCTTTGCTACATTCTGCTAATTTTGCTTTTCTCTGGGTCTGTTAGGGAATGTGCTAATGCTATTTAGGTTGTCTAATATTGCTATCTTCGAAGCTTTCTTTGTTTTCCTAGTTAAAATTCTAGCATTTTGGTTTGAAATAATCTACTTCTTGTCCATTTTCTTCCTTATTTGACATAAGAGTTGCGTGATTTTTCATGAAACCAAGGAAATGATGTTATGTTGTGAATCTCTGAAGAATAAGAAGCTTAGTGTTGTTTTGGAAGATGAATTTCCCTGCCAGAAGAACACGCATATAGTAGCACAGTGTGGCTGATATGAATAATGTATAGGAATGTAGACAGACTTTCTCTACTTGTATATGTTGAGGATTGTTGGGAGAGAGTCCCCACGTTGGCTAATTTAGGGAATAATCATGGGTTTATAAGTAAGGAATACATCTCTATTGGTACGAGGCTTTTGGGGGAGCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACTATTCTGGAGAGTCGTGATTCCGAATAATATACTCTTCTTGTTGCTTGTAAACTATTATGCTCTGAGTATTGAACTGTTATGGATTGGGGTTTTCTTTCATGCTTAAATTTATACTTTTAGTACCCCTTTTGTCTGATTATAGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAAGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATCCATATTCATTGCTTTGCTCTTGTCACCGAAGGAGTTGATTTTCATTCAGAGGCACATTTTATGACACCTGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTCGTTTTTACGGTACCATACTCTCTCCTTTCACCTTGAGATTGTTGCTTCCTTAGTTTTTATTAACATCAATTATTCGTTTTTCGATTTCCTTTTAAAGGTTGGGATAAGTATTATCGTGTTGAGGTAAATCATTTACTACAAGGCAACTGCATATGGACGTTGACCTTCATGACCGATATGGACAACATGTATCAGGGTGTCTTCTGTCTTATTTGGGTATCCAACATTTACCTGGTTGTCTCGAGTCTAAAGTGTTTGACGACATACACAATTTCGAAACCTAGTGTTTGTGCTTCATAACTATATGACATGACAAATATTGAATAGGTATCAAACTGGATGACAATGTATCTTTTTCTGATTAAGAAGTCCATGTTCCAATGGATGGACCTATTCTCTATGGGAGTTCAGGCAGAGGCTTCCAGGCAAGGAAAGATTCCTTTATTGATTATATCTCACCTCTGCTCCACAAGCTCATTCTAAATTAGGAATTCAAGGTCACTCCAGTGACCACTTTGATGTCTTCCTCATTCACCGTGATAGTGCAACAACACGAGCTTCGAAAGGGAGTCAGGCTAAAATTCTTGAACATGCTTGGGTGGTTTCTCGAGGGGAAATATCTTTTTTGGATAGGGGAAATTCCTTTATCAATCAAATGTCACCTCTCCTCCCCAGGTTGATTAAGGAATTAGGGTCAATCCATTGCCCACTTGGTTGTCTTCCGCCTCCACTATGAACAGTAACTCCCATATTGCAAAACATTGAAGCCTCGCCATGTTCTCCCTGGCCTTTCATTTCAGTATCAAATTTCTGCTTACGGTAGCAAGACGTGCCTTTCTTAAAATTTTGTGGTTTGAAATCTGCTACTTAGCTGATCAACATCTCTTATACTTTCACAACTCTTGGCTGTTTACCAATTACTTTAGGATGTATCAGTTTTTTCTTCCATGTATGCTGAAGGATGGCAAGGATACATAGATCAAGATATTATATTTTTAGTTTTTATTTCTGTATTTCTTCTAATTCCGCTACGCTTGTTTCTTCCTTCAGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAATATTTAGGCAACAGACCGAATGGAGGGTTTAACTATGTGAAATCCAATAACCGTTCGATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAATAAAATCAAATACGAACCTTTCATCAATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAAGTAAGCTTCATGATCTTAGAATTATGCTTGAGGATTGGAAACGATACATGTCTATGCCATCATATGTTAAGGGACCATCAATTTCAGTTTGGAGAGTTCCTCAGCACTGCAGGATTGGTAACATTTCTAGTATTAGGAGTTACTAGTCTACTCTTTTGTTTTAAATATACTTCTGGTTCAGGGAGTCCTATTCTTTAGGATTTATTAGTCTATTCTTTGTGACTTTTAAGTTTTTTTTTTTTTTTTTTTTNTTCCTTCCTGATGATATTTTTAGGTTCATCGGTTGTTGTAAAGGCTATATAATTACATAATTACATGGTTAGAATTAATAAAATTGAATTTAAGATCCAATTGTTTATATTATCTGCTCTAACAATCTGTTGCTGAACTTATCAATTGGTATCAGAGCCCGTGGAAAGAAGTTGCAATCTTCAAGTGATCAAAATGAGTGATGACAAAACATTAAGGCAAATTCTTCACTTGAATGGTCACTATGATCATGGGAGTGAATTGATGGAGAACTTGCTTAAAGCAAAAGGCTTATGGAGTGAATTGATGGAGAACTTGCTTAAAGCAAAAGGCTTATGGAGCTTGGTGGAGAATGGTTTTTCAGGACTGGTAGAAGGGACATAATTGACTGGACATAATTGACTGCTGCACAACAAGAACATCTTGACGATGTTAGAGTCAAGGATCATCAGGTAAAACAAGAACATCTTGACGATGTTAGAGTCAAGGATCATCAGGTAAAACATTACCTGTACCAAGCAATTGATCGAACTGTCTTTGAACAAATCTTGGACCGTCGCACATCTAAGATTGTTTGGGACTCAATGAAGAGAAAGTTTGGTGGAAACCAAAGAGTGAAGAAATCACTCCTTAATGCATTAAGAAGAGAGTTTGGAGTATTTTGCAAGAGTGATGCAAGTTGCTAATAAGATGAGAAGCAATGGAGAAGAAATGCCAGAAAAGAAAATTGTAGAAAAAATTCTGCGCACCCTAACTGACAAATTTACATACGCTGTATTATCAATTGAAGAATCAAAAGATACTGATACTATGTCCATTGATGAGTTAAAAAGTTCGTTAATGGTACATGATCAAAAATTTCGACGTCCCAGCAACAATGATGAAGAACAGGTGCTGAATGTTGTAGGTCGATCTAGTACAAACAACAGAGGGAGAGGAACGTACAAAGGAAGAAGGCGTGGAAGAGGGAGAATAAATTTCAACAAAGCAACTGTTGAATGTCCAAAGTGGAAGAATGAAGCAAATTATGCTGAGCTTGACGAAGAAGATGAGATGCTACTGATAGCCTATATAGAACTCCATGGATCAAAGAGAAGTGACGCCTGGTTCTTAGACTAGGGTTGCTCAAATCACGTGTGGTAATGAAAACATGTTTTCAAGCTTAAATAAAACTTTTACTCATAGTGTCAAATTTGGAAATAATACCAAAATGAAAGTATTTTGAAAGGCTTTGTAATTTTTTTTTTTTTTTTGCAAGAAAATCGTTGTACTATTGGAGAAGTATATTGGGCACTTGAACTCAAAAACAATCTTTTGAGCGTTGGACAACTTCAAGAAAAGGGAGTAGATGTACGGTTAAAAAAATGGAATATGCAGTATTTATCATCCACAGAAAGGAAAAATAGCAGAATCAATTATGAGTGCAAACCAAATATTCATTTTGCTTACAAAGTCATCAATCACAACAATTGAAGGGAGTTGCCTCCAAGTCTCTAATACAAGTCTCTAATATAGATCAATCAACACTTTGACACTATCGTTATAGTCATCTCAGATACAAAAACCTTGGTATTTTGAAACATAAAAAACATGGTGGAAGGTTTGCCACAAATTGTTGATCCAAGCATCACTTGCGAAACATGTATAAAGGGCAAACAACATCGGACTCCAATTCCAAAACACAGTGGAGAGCAACTGAAAAATTGGGACTTGTTCATGCTGATTTGTGTGGTCCGATTACTCCGTCTTTAAGCAGCGGAAAAAGGTATGTGCTATGTTTTACAGATGATTTTTCTAGAAAGGCATGAATGTACTTTCTCCCAGAAAAAATCAGAAACATTTTACCATTTTAAATGTTTCAAAATGTTCGTGGAAAAAGAAGTTGGAATGCCCATAAAATGCCTGATAGAGGAAGAGAATTCAATTCAGCAGAATTTAATGACTTTTGCAAGCCGCATGGGGTGAAGAGACAACTGACCACAACTTATACTCCACAACAAAATGGAGTAGCTGAACCCAAAAATCGTACAGTGATGAACTTGGTAAGAGCTGTGTTAACAGAAAGGAAAGTACCAAAGAGAGCTGTTATGTGGGTGAATCATATCTTAAATCGATCACCAACTCTTGCAATGAAAGATGTAACACCAGAAGAGGCTTGGAGTGGAGTAAAACCATCCGTTGACTATTTTCGAGTTTTCAAATGTGTGAGATATGTTCATATTTCAGATGCTAAGAGGAAAAAGCTTGAAGATAAAAGTGTAAGTTGTGTTTCATTTGGAATCAGTGGTGAATTCAAAGGGTACAGAATGTTTGATCCTGTAGCAAACAAAATCATAGTTAGCTGCGATGTAATTTTTGATGAGGATCGTGAATGAGATTGGAAAAAATCCTGAAAATGAAACTGATTTAGATTGGGGAGAAAGCATTGAAATTTCAACCGCAACGAAAGAAGACGCAGAACAACACACAGGTTCAACTTCATCGATTAACTCAGATAGTGAAATTGAACTTGTTGTTGCACCTGCGACTAAACCTATTGCTGAAATCGAACCAAGTTTGGCTATAAGGGAGGGCAGAAGTCGACATCCACCTATATGGTCAATAGATTATGTTTCCAGTGAAGGTTTGTCCAAAGAAGATGATGTTAACATGACATTTTTTTACAAATTTAGATCTTTTAAAGTATGAGGAAGCAGTCAAAAACTCAAAGTGGAGGATTGCCATGGACGAGGAAATTAAATCCATTGATGGCACTTAGTAGCTCTACCTGCCGGTGTTAAGAAAATTAGAGTAAAATGGATTTATAAAACTAAGTTAAATGAGCTTGGAGAGGTTGACAAATACAAGGCAAAATTGGTGGTAAAAGGATACACACAAGAGCATGGAATTGATTACACTAAAGTTTTTGCGCCGGTAGCTAGAATGGATATTGTGTGGATGATCATAGGTTTTGCAGCTCAGAAAGGCTGGAAACTTCATCAGTTAGATGTAAAATCAGCTTTTTTTACATGGAGAATTGAAAGAAGATATCTTTCTAGAACAACCAAGAGGCTATAAAAAAAGGGAAGTGAACATATTGTGTCTAAGTTCAACAAAGCACTCTACGGGTTAAAACAGGCACCTAGAGCATGGTCCAGCCGAATCGAATCCTACTTCATCAAAGAAGGTTTTGAAAGCAGTTCTAGTGAGCATACACTCTTCGTTAAAAGGAAGAAAGGTAACATTTTGATTGTAAACATATATGTTGATGATCTTTTGTTTACTAGCAATGACGAATCATTATTGGAAGAATTTAAGTGTTCTATGAAAAAGGAGTTTGAGACTGATCTAGGACAGATGAGGTTCCTTCTAGGGATTGAAGTAATACAACGATCAAGTGACATTTTTATGTCAAAGGAAGTATACAGCTAGGGTTTTGAAGCAATTTGAGATAGAAAATTACAACTCTGTCTGTAATCCCATTGTTCCAGGACAAAAAAATTGGTAAAGATGAAAATGACATTTTCAAACAGATAGTAGGAAGTTTGATGTATCTCATTGCCACTCGTTCCGATCTTATGTTTGTTATCAGTCTAATTAGTCGTTTTATGGCTTGTCCAACGCAACAACATTTTGCAGTAGCAAAAAGAGCTTTGAGGTACTTGAGAGGTACAAGTAACTATGGTGTATTCTACAAAAGGGGAGGAGTGAGTGAGTTGATCGGTTTTACTGATAGTGACTATGTTGGCGCCATAGAAGACCGCAAAAGCACTTTAGACTATGTGTTTATGATGAGTGAAGGAGCAGTGACTTGGTTATTTAGAAAGCAACCTATAGTCACTTTATCAACCACAGAGGCAGAACTTGTTGTTGCTGCTGCATGTGCATGTTAAGCTGTTTGGATGAGAAGAATACTCAAGGAGATCGGTTTTTTTAGAGACAGAAGGAACCAAACTTATGTGTGATAATGCTTCAACCATTAAGTTGTCAAAAAACCCAGTTCTACATTCTCGATTTTGTGTTTATATTTGGTTCTCTTAGTCCTTACTGTTCTAGTGTCTAATTTTTGTCACTACAATTCTGATGAATTCTATCGAACTTAGAGTGATACATATTGAGCATCAACTTTGCATTTCTCATTACTCATTTTGTGGGCGTTAAAATATAGTATCTAAGTACTCATTTTGCAGCCGTTTCTGTAAAATGTAAGCCAAAAGAACATAGCCTACCTAAGATTTATATTAAGGGGTGGACAGGGTGGAGTCTGAGAATATAATCTCATCCTTAACCCCATTTGAAAAATGGTTCCCAAACACTTTTTAATTTCTCGTCGAGGAATCCCCTCTCTATTCAAGACATGTTCCCACGGGACGAGATATCTTTGGTTACAACCAAAACGTAGATGGAGGGAGTTAATATTAAGGAGTGGATGGAGTTGGATGCTTAAACCCCATGGGCCAAATGCTTTAGTGAACCTCTTTGTTAAAAGCCTTACAGAAACGAAGTATCAAATACACCAAAGCAAATGAAACAAAGCGCTTACACAAAACAACAATGGCAACTTTTATTGTATATCACCACTCCCATAAAGCTCTCATCAGTGTCTACTTCCACTGATAGACAACTGATTTGAAACAACAAAGACCATTCAAACACAGTGCTTTGTGCAGAAGCAGCGGCGGCGGAAGCCTCTGCAATGCCCACCATGGAAGCCCTCTGTTTTGCAGACATGGCCGCAGTTGTTCTTACTGAAGCACAACCCCCTGAAGTGGTGGCTCGGAGACTCGCATGTCCTTGCCTCTGTAGCCATTGGCGCCATCCCTGGACCAAATCAACCATTTTAATCATCTAA

mRNA sequence

TATTGCAAAAGGTGGGTCCAACTTAGATTTAGTAGGTGACAGATGAAGTGGGTTGCAAATTTGAAACAGTAATCTGTTCCATTCTGATTAGCGGACAGAGAAGGAAAATGGAGAAGATAAGTGCGTTCGTTTATGGTAGTAATGCTAACAATTTGCTGAAGTTTCAGTACAAAATCCTCCTCCATTAATGGCGGCTGTGGCTGTGGCTCCCACAAGCTTTACAGTTCTCTGTATTTGGCATCTCTCTCTCTCTAGAACGCGTATGAACGGTTGATGCTGCTAATTTTTCTGCTCTTTTCTTTGTAAATGTCTATGATCGGTGCATGTAATGTCTATGATCGGTACTTGTATTGTCTATGATCGTTTCCGCCATGCTTAGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCGTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACGACAACGAGCCTTTTGCGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAAGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATCCATATTCATTGCTTTGCTCTTGTCACCGAAGGAGTTGATTTTCATTCAGAGGCACATTTTATGACACCTGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTCGTTTTTACGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAATATTTAGGCAACAGACCGAATGGAGGGTTTAACTATGTGAAATCCAATAACCGTTCGATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAATAAAATCAAATACGAACCTTTCATCAATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAATGCTTTGTGCAGAAGCAGCGGCGGCGGAAGCCTCTGCAATGCCCACCATGGAAGCCCTCTGTTTTGCAGACATGGCCGCAGTTGTTCTTACTGAAGCACAACCCCCTGAAGTGGTGGCTCGGAGACTCGCATGTCCTTGCCTCTGTAGCCATTGGCGCCATCCCTGGACCAAATCAACCATTTTAATCATCTAA

Coding sequence (CDS)

ATGATCGTTTCCGCCATGCTTAGCTACTCCTCTTCCTTCTCCTTCCGTCGTACACTTCAGATTTTTCTCCTCTTCGCTGCCATTTCTCTCTCGTGCGTTGTTGTTCTAAGAGAACTCGACTCCCTTCGCGACTTCCCTCTCTTCTCCTTGACTACTTTCTCTGATTCTTCTCCTGCTTCGTTATTCCTCCCTTCCCTCGATGATGACGACAACGAGCCTTTTGCGGACGCTGATGAGTTTGGACTGGACAATGTCTTAAAGGATGCTGCGACAGAAGACAGAACTGTTATTTTAACCACTTTAAATCAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTCGAAAGCTTTAGAATTGGAAATCGAACTCACCAACTGTTAAACCACTTGGTTATTATTGCATTGGATAAAAAGGCATTTGTTCGCTGTTTGGATATCCATATTCATTGCTTTGCTCTTGTCACCGAAGGAGTTGATTTTCATTCAGAGGCACATTTTATGACACCTGACTATTTGAAGATGATGTGGAGAAGGATTGATTTTCTGCGAACTGTTCTTGAGATAGGGTACAATTTCGTTTTTACGGATGCTGATGTTATGTGGTTCAGAGATCCATTCCCATTCTTTGATATGAATGCAGATTTCCAGATTGCTTGTGATCAATACCTAGGCATCCCTGAATATTTAGGCAACAGACCGAATGGAGGGTTTAACTATGTGAAATCCAATAACCGTTCGATTGAGTTTTACAAGTACTGGTACTCATCGCGGGAAACTTATCCGAAATACCATGATCAGGATGTTCTTAATAAAATCAAATACGAACCTTTCATCAATGACATTGGGCTGAAGATTAGATTCTTGGATACTGCTTATTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAACCGTGTATTAACCATGCATGCAAACTGCTGTGTTGGACTGAAAATGCTTTGTGCAGAAGCAGCGGCGGCGGAAGCCTCTGCAATGCCCACCATGGAAGCCCTCTGTTTTGCAGACATGGCCGCAGTTGTTCTTACTGAAGCACAACCCCCTGAAGTGGTGGCTCGGAGACTCGCATGTCCTTGCCTCTGTAGCCATTGGCGCCATCCCTGGACCAAATCAACCATTTTAATCATCTAA

Protein sequence

MIVSAMLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLKMLCAEAAAAEASAMPTMEALCFADMAAVVLTEAQPPEVVARRLACPCLCSHWRHPWTKSTILII
BLAST of Cp4.1LG01g18710 vs. Swiss-Prot
Match: Y4597_ARATH (Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 8.2e-77
Identity = 137/240 (57.08%), Postives = 174/240 (72.50%), Query Frame = 1

Query: 82  LDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDKKA 141
           L  +L +AATED+TVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A
Sbjct: 42  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 101

Query: 142 FVRCLDIHIH-CFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDAD 201
           + RC ++H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L++ YNF+FT   
Sbjct: 102 YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 161

Query: 202 VMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRE 261
                 PFP      DFQIACD+Y G  + + N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 162 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFAFVKANQRTIDFYNYWYMSRL 221

Query: 262 TYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLK 321
            YP  HDQDVL++IK   +   IGLK+RFLDT YFGGFCEPS+DL++V TMHANCCVGL+
Sbjct: 222 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 273

BLAST of Cp4.1LG01g18710 vs. Swiss-Prot
Match: Y1869_ARATH (Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 2.6e-46
Identity = 96/234 (41.03%), Postives = 140/234 (59.83%), Query Frame = 1

Query: 89  AATEDRTVILTTLNQAWASP----NSVIDLFLESFRIGNRTHQLLNHLVIIALDKKAFVR 148
           AA  ++TVI+T +N+A+       ++++DLFLESF  G  T  LL+HL+++A+D+ A+ R
Sbjct: 53  AAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAVDQTAYDR 112

Query: 149 CLDIHIHCFALVTE-GVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADVMW 208
           C    +HC+ + TE GVD   E  FM+ D+++MMWRR   +  VL  GYN +FTD DVMW
Sbjct: 113 CRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIFTDTDVMW 172

Query: 209 FRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRETYP 268
            R P    +M+ D QI+ D+        G   N GF +V+SNN++I  ++ WY  R    
Sbjct: 173 LRSPLSRLNMSLDMQISVDRI----NVGGQLINTGFYHVRSNNKTISLFQKWYDMRLNST 232

Query: 269 KYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCV 318
              +QDVL  +    F N +GL + FL T  F GFC+ S  +  V T+HANCC+
Sbjct: 233 GMKEQDVLKNLLDSGFFNQLGLNVGFLSTTEFSGFCQDSPHMGVVTTVHANCCL 282

BLAST of Cp4.1LG01g18710 vs. TrEMBL
Match: A0A0A0KPV2_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 6.7e-142
Identity = 250/306 (81.70%), Postives = 276/306 (90.20%), Query Frame = 1

Query: 14  SFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNEP 73
           SFR +  I LLF AISLSC+V+LREL+SLR FPLFS +T S   P   FL SL   D+  
Sbjct: 6   SFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPHHDHLS 65

Query: 74  FADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLV 133
             +ADE+GLD VLKDAATED+TVILTTLN+AWASPN+VIDLFL+SFRIGNRTHQLL+HLV
Sbjct: 66  -PEADEYGLDKVLKDAATEDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLV 125

Query: 134 IIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYN 193
           IIALDKKAF+RCLDIHIHC +LVTEGVDF SEA+FM+PDYLKMMWRRIDFLRTVLE+GYN
Sbjct: 126 IIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGYN 185

Query: 194 FVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYK 253
           FVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+ L NRPNGGFNYVKSNNRSIEFYK
Sbjct: 186 FVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYK 245

Query: 254 YWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA 313
           YWYS+RETYP YHDQDVLN+IKY+ FI +IGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA
Sbjct: 246 YWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA 305

Query: 314 NCCVGL 320
           NCC+G+
Sbjct: 306 NCCIGM 310

BLAST of Cp4.1LG01g18710 vs. TrEMBL
Match: A0A0B2RYA6_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 1.5e-114
Identity = 208/314 (66.24%), Postives = 245/314 (78.03%), Query Frame = 1

Query: 6   MLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSC+V+L ++DS R      L++F  S   S F   
Sbjct: 1   MLQESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125
                N+P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLESFRIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA+FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  + + NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGL 320
           N+V TMHANCC+G+
Sbjct: 301 NQVCTMHANCCLGM 309

BLAST of Cp4.1LG01g18710 vs. TrEMBL
Match: K7LLW7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1)

HSP 1 Score: 420.6 bits (1080), Expect = 2.0e-114
Identity = 208/314 (66.24%), Postives = 245/314 (78.03%), Query Frame = 1

Query: 6   MLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSC+V+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125
                N+P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLESFRIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA+FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  + + NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGL 320
           N+V TMHANCC+G+
Sbjct: 301 NQVCTMHANCCLGM 309

BLAST of Cp4.1LG01g18710 vs. TrEMBL
Match: A0A0R0I0C9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1)

HSP 1 Score: 420.6 bits (1080), Expect = 2.0e-114
Identity = 208/314 (66.24%), Postives = 245/314 (78.03%), Query Frame = 1

Query: 6   MLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSC+V+L ++DS R      L++F  S   S F   
Sbjct: 1   MLRESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125
                N+P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLESFRIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA+FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  + + NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGL 320
           N+V TMHANCC+G+
Sbjct: 301 NQVCTMHANCCLGM 309

BLAST of Cp4.1LG01g18710 vs. TrEMBL
Match: A0A067KPV4_JATCU (Glycosyltransferase OS=Jatropha curcas GN=JCGZ_04849 PE=3 SV=1)

HSP 1 Score: 414.8 bits (1065), Expect = 1.1e-112
Identity = 199/299 (66.56%), Postives = 241/299 (80.60%), Query Frame = 1

Query: 23  LLFAAISLSCVVVLRELDSLR--DFPLFSLTTFSDSSPASLFLPSLDDDDNEPFADADEF 82
           L+FA + +SC+++ R  DSLR   FP  S  +F    P+ +F       DN+     +E 
Sbjct: 46  LVFAVLCISCLLLYRAADSLRFLSFPSGSSVSFPHIFPSLVF-------DNDSVPVRNEQ 105

Query: 83  GLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDKK 142
            L+NVL+DAA ED+TVILTTLN+AWA+PNS++DLFL SFRIG  T +LLNHLVIIALD+K
Sbjct: 106 KLENVLEDAAMEDKTVILTTLNEAWAAPNSIVDLFLASFRIGVHTRRLLNHLVIIALDQK 165

Query: 143 AFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDAD 202
           A+ RC+++HIHCFALVTEG+DF  EA+FMTP Y+KMMWRRIDFLR+VLE+GYNFVFTDAD
Sbjct: 166 AYTRCMELHIHCFALVTEGIDFRKEAYFMTPAYVKMMWRRIDFLRSVLELGYNFVFTDAD 225

Query: 203 VMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRE 262
           VMWFRDPFP F  +ADFQIACD + G    + N+PNGGFN+V+SNNRSIEFYK+WYSSRE
Sbjct: 226 VMWFRDPFPRFYSDADFQIACDHFTGSSVNIENKPNGGFNFVRSNNRSIEFYKFWYSSRE 285

Query: 263 TYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGL 320
           TYP YHDQDVLN IK++ F+ D+GLK+RFLDTAYFGG CEPSKDL+ V TMHANCC GL
Sbjct: 286 TYPGYHDQDVLNFIKFDSFVEDLGLKMRFLDTAYFGGLCEPSKDLSLVCTMHANCCFGL 337

BLAST of Cp4.1LG01g18710 vs. TAIR10
Match: AT1G14590.1 (AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 381.3 bits (978), Expect = 6.9e-106
Identity = 196/318 (61.64%), Postives = 232/318 (72.96%), Query Frame = 1

Query: 4   SAMLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLR-DFPLFSLTTFSDSSPASLF 63
           S  +S   S   RR     L  AAIS+SC V+ R  DSL    P+F L+++ D+      
Sbjct: 30  SGEMSPGPSIPLRRAA---LFLAAISISCFVLYRAADSLSFSPPIFDLSSYLDN------ 89

Query: 64  LPSLDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIG 123
                          +E  L++VL  AAT DRTV+LTTLN AWA+P SVIDLF ESFRIG
Sbjct: 90  ---------------EEPKLEDVLSKAATRDRTVVLTTLNAAWAAPGSVIDLFFESFRIG 149

Query: 124 NRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRID 183
             T Q+L+HLVI+ALD KA+ RCL++H HCF+LVTEGVDF  EA+FMT  YLKMMWRRID
Sbjct: 150 EETSQILDHLVIVALDAKAYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRID 209

Query: 184 FLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYV 243
            LR+VLE+GYNFVFTDADVMWFR+PFP F M ADFQIACD YLG    L NRPNGGFN+V
Sbjct: 210 LLRSVLEMGYNFVFTDADVMWFRNPFPRFYMYADFQIACDHYLGRSNDLHNRPNGGFNFV 269

Query: 244 KSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPS 303
           +SNNR+I FYKYWY+SR  +P YHDQDVLN +K EPF+  IGLK+RFL+TAYFGG CEPS
Sbjct: 270 RSNNRTILFYKYWYASRLRFPGYHDQDVLNFLKAEPFVFRIGLKMRFLNTAYFGGLCEPS 323

Query: 304 KDLNRVLTMHANCCVGLK 321
           +DLN V TMHANCC G++
Sbjct: 330 RDLNLVRTMHANCCYGME 323

BLAST of Cp4.1LG01g18710 vs. TAIR10
Match: AT2G02061.1 (AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 352.4 bits (903), Expect = 3.4e-97
Identity = 175/272 (64.34%), Postives = 204/272 (75.00%), Query Frame = 1

Query: 56  SSPASLFLPSLDDDDNEPFA-------DADEFGLDNVLKDAATEDRTVILTTLNQAWASP 115
           SS  S   PS++D  + P         + +E  L+ VL+ AAT+D TVILTTLN+AWA+P
Sbjct: 75  SSTLSRIFPSVNDSSSSPSPSPSLSPEEIEEPKLEEVLRRAATKDGTVILTTLNEAWAAP 134

Query: 116 NSVIDLFLESFRIGNRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHS-EAH 175
            SVIDLF ESFRIG  T +LL HLVIIALD KA+ RC ++H HCF L TEGVDF   EA+
Sbjct: 135 GSVIDLFFESFRIGKGTRRLLKHLVIIALDAKAYSRCQELHKHCFRLETEGVDFSGGEAY 194

Query: 176 FMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGI 235
           FMTP YL MMWRRI FLR+VLE GYNFVFTDADVMWFR+PF  F  + DFQIACD Y+G 
Sbjct: 195 FMTPSYLTMMWRRISFLRSVLEKGYNFVFTDADVMWFRNPFRRFYEDGDFQIACDHYIGR 254

Query: 236 PEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKI 295
           P    NRPNGGF +V++NNRSI FYK+WY SR  YPK HDQDVLN IK +PF+  + ++I
Sbjct: 255 PNDFRNRPNGGFTFVRANNRSIGFYKFWYDSRTKYPKNHDQDVLNFIKTDPFLWKLRIRI 314

Query: 296 RFLDTAYFGGFCEPSKDLNRVLTMHANCCVGL 320
           RFL+T YFGGFCEPSKDLN V TMHANCC GL
Sbjct: 315 RFLNTVYFGGFCEPSKDLNLVCTMHANCCFGL 346

BLAST of Cp4.1LG01g18710 vs. TAIR10
Match: AT5G44820.1 (AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 338.2 bits (866), Expect = 6.7e-93
Identity = 160/300 (53.33%), Postives = 215/300 (71.67%), Query Frame = 1

Query: 20  QIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNEPFADADE 79
           +I +LF  ++ SC+V+ +    L+   + +LT+   +SP+ L LP+L+  +  P     +
Sbjct: 32  RILILFLGLTASCLVLYKTAYPLQRLNVSNLTSLQ-ASPSPL-LPNLNSSEISPETTKPK 91

Query: 80  FGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDK 139
                +L++A+T++ TVI+TTLNQAWA PNS+ DLFLESFRIG  T QLL H+V++ LD 
Sbjct: 92  LSFKEILENASTKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGTQQLLKHVVVVCLDI 151

Query: 140 KAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDA 199
           KAF RC  +H +C+ + T   DF  E  + TPDYLKMMW RID L  VLE+G+NF+FTDA
Sbjct: 152 KAFERCSQLHTNCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLTQVLEMGFNFIFTDA 211

Query: 200 DVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSR 259
           D+MW RDPFP    + DFQ+ACD++ G P    N  NGGF YV+SNNRSIEFYK+W+ SR
Sbjct: 212 DIMWLRDPFPRLYPDGDFQMACDRFFGNPYDSDNWVNGGFTYVRSNNRSIEFYKFWHKSR 271

Query: 260 ETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGL 319
             YP  HDQDV N+IK+EPFI++IG+++RF DT YFGGFC+ S+D+N V TMHANCC+GL
Sbjct: 272 LDYPDLHDQDVFNRIKHEPFISEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCIGL 329

BLAST of Cp4.1LG01g18710 vs. TAIR10
Match: AT4G19970.1 (AT4G19970.1 Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069))

HSP 1 Score: 326.6 bits (836), Expect = 2.0e-89
Identity = 163/319 (51.10%), Postives = 216/319 (67.71%), Query Frame = 1

Query: 2   IVSAMLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASL 61
           I  + L Y S+   +   +I +L   ++ +C+++ +       +PL      ++ S    
Sbjct: 371 IPPSFLDYGSAIGQKEVKKILVLVLGLA-ACLLLYKTA-----YPLHQELDVNNLSSR-- 430

Query: 62  FLPSLDD-DDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFR 121
             P LD    + P   +       VL++A+TE+RTVI+TTLNQAWA PNS+ DLFLESFR
Sbjct: 431 --PLLDHTSSSSPLTRSKSISFREVLENASTENRTVIVTTLNQAWAEPNSLFDLFLESFR 490

Query: 122 IGNRTHQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRR 181
           IG  T +LL H+V++ LD KAF RC  +H +C+ L T G DF  E  F TPDYLKMMWRR
Sbjct: 491 IGQGTKKLLQHVVVVCLDSKAFARCSQLHPNCYYLKTTGTDFSGEKLFATPDYLKMMWRR 550

Query: 182 IDFLRTVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFN 241
           I+ L  VLE+GYNF+FTDAD+MW RDPFP    + DFQ+ACD++ G P    N  NGGF 
Sbjct: 551 IELLTQVLEMGYNFIFTDADIMWLRDPFPRLYPDGDFQMACDRFFGDPHDSDNWVNGGFT 610

Query: 242 YVKSNNRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCE 301
           YVKSN+RSIEFYK+WY+SR  YPK HDQDV N+IK++  +++IG+++RF DT YFGGFC+
Sbjct: 611 YVKSNHRSIEFYKFWYNSRLDYPKMHDQDVFNQIKHKALVSEIGIQMRFFDTVYFGGFCQ 670

Query: 302 PSKDLNRVLTMHANCCVGL 320
            S+D+N V TMHANCCVGL
Sbjct: 671 TSRDINLVCTMHANCCVGL 679

BLAST of Cp4.1LG01g18710 vs. TAIR10
Match: AT4G15970.1 (AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 289.7 bits (740), Expect = 2.7e-78
Identity = 137/240 (57.08%), Postives = 174/240 (72.50%), Query Frame = 1

Query: 82  LDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDKKA 141
           L  +L +AATED+TVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A
Sbjct: 33  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 92

Query: 142 FVRCLDIHIH-CFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDAD 201
           + RC ++H H C+ + T G+DF  +  FMTPDYLKMMWRRI+FL T+L++ YNF+FT   
Sbjct: 93  YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 152

Query: 202 VMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRE 261
                 PFP      DFQIACD+Y G  + + N  NGGF +VK+N R+I+FY YWY SR 
Sbjct: 153 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFTFVKANQRTIDFYNYWYMSRL 212

Query: 262 TYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGLK 321
            YP  HDQDVL++IK   +   IGLK+RFLDT YFGGFCEPS+DL++V TMHANCCVGL+
Sbjct: 213 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 264

BLAST of Cp4.1LG01g18710 vs. NCBI nr
Match: gi|659072690|ref|XP_008466761.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis melo])

HSP 1 Score: 514.2 bits (1323), Expect = 1.9e-142
Identity = 249/307 (81.11%), Postives = 278/307 (90.55%), Query Frame = 1

Query: 13  FSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNE 72
           +SFR +L I LLF AISLSC+V+LREL+SLR FPLFS +T S   P   F  SL  DD+ 
Sbjct: 5   YSFRCSLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTPSGPPPVPPFFLSLPHDDD- 64

Query: 73  PFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHL 132
             + ADE+GLD VLKDAATED+TVILTTLN+AWA+PN+VIDLFL+SFRIGN+THQLL+HL
Sbjct: 65  -LSLADEYGLDKVLKDAATEDKTVILTTLNEAWAAPNAVIDLFLQSFRIGNQTHQLLDHL 124

Query: 133 VIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGY 192
           VIIALDKKAF+RCLDIH+HC ALVTEGVDF SEA+FM+PDYLKMMWRRIDFLRTVLE+GY
Sbjct: 125 VIIALDKKAFMRCLDIHVHCVALVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGY 184

Query: 193 NFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFY 252
           NFVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+ L NRPNGGFNYVKSNNRSIEFY
Sbjct: 185 NFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFY 244

Query: 253 KYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 312
           KYWYS+RETYP YHDQDVLN+IKY+ FI +IGLKIRFLDTAYFGGFCEPSKDLNRVLTMH
Sbjct: 245 KYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMH 304

Query: 313 ANCCVGL 320
           ANCC+G+
Sbjct: 305 ANCCIGM 309

BLAST of Cp4.1LG01g18710 vs. NCBI nr
Match: gi|449463499|ref|XP_004149471.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus])

HSP 1 Score: 511.9 bits (1317), Expect = 9.6e-142
Identity = 250/306 (81.70%), Postives = 276/306 (90.20%), Query Frame = 1

Query: 14  SFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNEP 73
           SFR +  I LLF AISLSC+V+LREL+SLR FPLFS +T S   P   FL SL   D+  
Sbjct: 6   SFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPHHDHLS 65

Query: 74  FADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLV 133
             +ADE+GLD VLKDAATED+TVILTTLN+AWASPN+VIDLFL+SFRIGNRTHQLL+HLV
Sbjct: 66  -PEADEYGLDKVLKDAATEDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLV 125

Query: 134 IIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYN 193
           IIALDKKAF+RCLDIHIHC +LVTEGVDF SEA+FM+PDYLKMMWRRIDFLRTVLE+GYN
Sbjct: 126 IIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGYN 185

Query: 194 FVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYK 253
           FVFTDADVMWFRDPFPFFD+NADFQIACDQYLGIP+ L NRPNGGFNYVKSNNRSIEFYK
Sbjct: 186 FVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYK 245

Query: 254 YWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA 313
           YWYS+RETYP YHDQDVLN+IKY+ FI +IGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA
Sbjct: 246 YWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHA 305

Query: 314 NCCVGL 320
           NCC+G+
Sbjct: 306 NCCIGM 310

BLAST of Cp4.1LG01g18710 vs. NCBI nr
Match: gi|645228399|ref|XP_008220977.1| (PREDICTED: uncharacterized protein At4g15970 [Prunus mume])

HSP 1 Score: 422.2 bits (1084), Expect = 1.0e-114
Identity = 208/298 (69.80%), Postives = 241/298 (80.87%), Query Frame = 1

Query: 22  FLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPSLDDDDNEPFADADEFG 81
           FLLF A+ LSC+++  + ++LR  P FS      SSP+ LF       DN P     E  
Sbjct: 36  FLLFVAVCLSCLLLYNDSNALRFLPRFS------SSPSVLF-----SADNSPLVSG-EHR 95

Query: 82  LDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRTHQLLNHLVIIALDKKA 141
           L  VLKDAA ED TVILTTLN+AWA+PNS+IDLFLESF+IG  TH+LLNHLVIIALD+KA
Sbjct: 96  LQKVLKDAAMEDGTVILTTLNEAWAAPNSIIDLFLESFKIGVGTHRLLNHLVIIALDQKA 155

Query: 142 FVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLRTVLEIGYNFVFTDADV 201
           F RCL++H HCFAL+T+GVDF  EA+FMTP YLKMMW RIDFLR+VLE+GYNFVFTDADV
Sbjct: 156 FQRCLELHTHCFALITQGVDFRREAYFMTPHYLKMMWARIDFLRSVLEMGYNFVFTDADV 215

Query: 202 MWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSNNRSIEFYKYWYSSRET 261
           MWFRDPFP F M+ADFQIACD +LG  + L N+PNGGFNYVKSNNRSIEFYK+WYSSRET
Sbjct: 216 MWFRDPFPQFYMDADFQIACDHFLGSSDDLENKPNGGFNYVKSNNRSIEFYKFWYSSRET 275

Query: 262 YPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCVGL 320
           YP +HDQDVLN IK+ P  + IGL+++FLDTAYFGGFCEPSKDLN+V TMHANCC GL
Sbjct: 276 YPGFHDQDVLNIIKFHPSTSTIGLRMKFLDTAYFGGFCEPSKDLNQVCTMHANCCYGL 321

BLAST of Cp4.1LG01g18710 vs. NCBI nr
Match: gi|502121038|ref|XP_004497167.1| (PREDICTED: uncharacterized protein At4g15970 [Cicer arietinum])

HSP 1 Score: 422.2 bits (1084), Expect = 1.0e-114
Identity = 207/314 (65.92%), Postives = 244/314 (77.71%), Query Frame = 1

Query: 6   MLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S     RR L   LLFA++SLSC+++ R++DS R F     + F  S P   F   
Sbjct: 1   MLPESKVLHIRRALATALLFASVSLSCLILFRDVDSYRFF-----SRFPSSYPLPRFSSF 60

Query: 66  LDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125
                N+P A ++E+ L+ +L +AA EDRTVILTTLN+AWA+PNSVIDLFL+SFRIG+RT
Sbjct: 61  FPLVSNDPTATSNEYPLEKILNEAAMEDRTVILTTLNEAWAAPNSVIDLFLQSFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185
            +LLNHLVIIALD+KAF RC  IH HCF L +E  DFH EA+FMTP YL MMWRRIDFLR
Sbjct: 121 RRLLNHLVIIALDQKAFARCKVIHAHCFLLASEEADFHEEAYFMTPSYLMMMWRRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSN 245
           +VLE+GYNFVFTDAD+MWFRDPFP F ++ADFQIACD + G  + + NRPNGGFN+VKSN
Sbjct: 181 SVLEMGYNFVFTDADIMWFRDPFPQFHLDADFQIACDHFTGSFDDVQNRPNGGFNFVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IKY PFI DIGLK+ FLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNIIKYHPFIADIGLKMTFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGL 320
           N+V TMHANCC G+
Sbjct: 301 NQVCTMHANCCFGM 309

BLAST of Cp4.1LG01g18710 vs. NCBI nr
Match: gi|734414577|gb|KHN37408.1| (Hypothetical protein glysoja_013623 [Glycine soja])

HSP 1 Score: 421.0 bits (1081), Expect = 2.2e-114
Identity = 208/314 (66.24%), Postives = 245/314 (78.03%), Query Frame = 1

Query: 6   MLSYSSSFSFRRTLQIFLLFAAISLSCVVVLRELDSLRDFPLFSLTTFSDSSPASLFLPS 65
           ML  S +   RR +     FA +SLSC+V+L ++DS R      L++F  S   S F   
Sbjct: 1   MLQESKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHR-----FLSSFHSSYSLSGFTRI 60

Query: 66  LDDDDNEPFADADEFGLDNVLKDAATEDRTVILTTLNQAWASPNSVIDLFLESFRIGNRT 125
                N+P A ++E+ L+ +L DAA +DRTVILTTLN+AWA+PNSVIDLFLESFRIG+RT
Sbjct: 61  FPSVYNDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRT 120

Query: 126 HQLLNHLVIIALDKKAFVRCLDIHIHCFALVTEGVDFHSEAHFMTPDYLKMMWRRIDFLR 185
              LNHLVIIALD+KAF RC  IH HCF+LV+E  DFH EA+FMTP YL MMW+RIDFLR
Sbjct: 121 STFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLR 180

Query: 186 TVLEIGYNFVFTDADVMWFRDPFPFFDMNADFQIACDQYLGIPEYLGNRPNGGFNYVKSN 245
           TVLE+GYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G  + + NRPNGGFNYVKSN
Sbjct: 181 TVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTGGFDDVQNRPNGGFNYVKSN 240

Query: 246 NRSIEFYKYWYSSRETYPKYHDQDVLNKIKYEPFINDIGLKIRFLDTAYFGGFCEPSKDL 305
           NRSIEFYK+WYSSRETYP YHDQDVLN IK  PFI DIGLK+RFLDT  FGG CEPS+DL
Sbjct: 241 NRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLKMRFLDTTNFGGLCEPSRDL 300

Query: 306 NRVLTMHANCCVGL 320
           N+V TMHANCC+G+
Sbjct: 301 NQVCTMHANCCLGM 309

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y4597_ARATH8.2e-7757.08Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1[more]
Y1869_ARATH2.6e-4641.03Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KPV2_CUCSA6.7e-14281.70Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1[more]
A0A0B2RYA6_GLYSO1.5e-11466.24Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1[more]
K7LLW7_SOYBN2.0e-11466.24Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1[more]
A0A0R0I0C9_SOYBN2.0e-11466.24Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1[more]
A0A067KPV4_JATCU1.1e-11266.56Glycosyltransferase OS=Jatropha curcas GN=JCGZ_04849 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G14590.16.9e-10661.64 Nucleotide-diphospho-sugar transferase family protein[more]
AT2G02061.13.4e-9764.34 Nucleotide-diphospho-sugar transferase family protein[more]
AT5G44820.16.7e-9353.33 Nucleotide-diphospho-sugar transferase family protein[more]
AT4G19970.12.0e-8951.10 Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR00506... [more]
AT4G15970.12.7e-7857.08 Nucleotide-diphospho-sugar transferase family protein[more]
Match NameE-valueIdentityDescription
gi|659072690|ref|XP_008466761.1|1.9e-14281.11PREDICTED: uncharacterized protein At4g15970 [Cucumis melo][more]
gi|449463499|ref|XP_004149471.1|9.6e-14281.70PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus][more]
gi|645228399|ref|XP_008220977.1|1.0e-11469.80PREDICTED: uncharacterized protein At4g15970 [Prunus mume][more]
gi|502121038|ref|XP_004497167.1|1.0e-11465.92PREDICTED: uncharacterized protein At4g15970 [Cicer arietinum][more]
gi|734414577|gb|KHN37408.1|2.2e-11466.24Hypothetical protein glysoja_013623 [Glycine soja][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005069Nucl-diP-sugar_transferase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
biological_process GO:0016310 phosphorylation
biological_process GO:0071555 cell wall organization
cellular_component GO:0005575 cellular_component
cellular_component GO:0000139 Golgi membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0016740 transferase activity
molecular_function GO:0016301 kinase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g18710.1Cp4.1LG01g18710.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005069Nucleotide-diphospho-sugar transferasePFAMPF03407Nucleotid_transcoord: 128..319
score: 2.4
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 66..165
score: 1.6E-44coord: 206..229
score: 1.6
NoneNo IPR availablePANTHERPTHR24015:SF419NUCLEOTIDE-DIPHOSPHO-SUGAR TRANSFERASE DOMAIN-CONTAINING PROTEIN-RELATEDcoord: 206..229
score: 1.6E-44coord: 66..165
score: 1.6