Cp4.1LG01g14540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g14540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: UDP-Glycosyltransferase superfamily protein isoform 1
Location: Cp4.1LG01 : 7577346 .. 7586886 (+)
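The location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a 'SEQID : START .. END (STRAND)' string into a dict.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG01 : 7577346 .. 7586886 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 9541 positions on the plus strand of Cp4.1LG01.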
The following sequences are available for this feature:

Gene sequence (with intron)

AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGGTGAGTGGTTAAGTTTCTATTCGTTCTTACTTTCTCGAGTTATGGTTTAATTAGCCACTCGAACGAACTTATCTGACAATGAGTTAAGTTACGTCATCAAACATCCATTTCAATGCCTTTCTGCCTTAGAAGATGAACTCAAACCATTTTCACTGGTGCGCAATAATAGATTGCGTTGATTAGTTTTTTCGCTTTCTGAAATAAATTGTTTATACGAATTAGTGCAACACCTGCTTTCTCTCTAGAGGTCAGCAACCAGAAGTTGAACGTTCACACGACTAATGTACAGTGCACACTGTTTTAAAACTTCCAAGCAACTTTAGATTGAGCTCCATGGCATTACGGATGGTCTTGATTTTTGAAGTGACTAAGCCCGGAGTTACGACCACCTAAAGAAAAATAACGTGTATCTTTTCGAGGTAGTGATTATGCGTCACTGTAAAACTACAGACGAGTATACACTGTATTTTTACTGATAAAAATATTATATCACGTTTTTTTTGGCAGGAAAGTTGTGTAAAAAATGAAATCATGTTTTATGAATCTATAGTCATTACTTTAGACCTTTTAGGGATGAAAAAACTTTAGGGGAAGAACTTCGAATGGCAAGATAGATACAGTTGATGGAAATGAACAGTGGATTCCGATATGAGCCTACTCCACTATACCTACTGTGATCATATGCCAATAATATCATCAATTTGGTATTAGCATGTTTTGGCGTCTAAGAAATATTGTAAGCTGACGGGATATCATAGTGTATTGCTCTTGAATAATTTTTCACTGTCGACTTCATGCTGATGGTTAATTGCTTTCCCTACAGCGATCATTATTAAGTTTTCACGATCCGTAAGATTCTGTCATAGTGACCGAATATTCTTGGCTTTAGAGCTTAGTAGCCACTACCAATTTTCCTGTCACTCTGAGTTCTAAAGTTAAACTCCATTCTTTTCGATGCAGAATCTTGGTTACTTTTGAGAATTTGTTGTATTAATACTCAGCAACTCTTATTACTAGCTTTTACATGTTTATATATGCCGTGATGCACATAACAATGTGTTAGTGAATGTTGTATCATCATGTGTAGACTCGATAATGTACGAGAGTCCTAAAATGCACGGCTATTTGAGCAGCCACATCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCNTTTTATAAACGAACAAACTATTCCTAATTTTTTTGTTGCCTAAATTATCAGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGA
TATGTGTTTGAGGTGAGGTCTTTGACATTGCTTTTGTTACTCGTATTGCATAATTGATTTTTATTTGCTGAAATAGCCGTGTAAGAAGTCACACGATGCATGGTTTTCTAGGCATATTCTGTGAGATCGGGCATTGATGTGTCGTTGAAACCTTCATGCTTGAAGGGTAATATTATTGGTTAGTTTTTAAATGATTATATTCTTATGGAAGAGGCTCAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTGTAAGTGCATATATATCTATACACACGCATACATCGGTTGTGCATTCGTACATAAAAGTAGACTAATGCCGTTCTTGTTTCCTATCTGTATGAGCACAAAATGATCTCTGTTAGTAGTCAGAACTGCATGAATGGTTAACTTTTGTCATTGCAGTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGGTTGGTTTATTGATACTATATCTTTTAAAAAATGTTAGTGATGGAGTTTTCTGGAGTCTGAGGCCACGGTAAGTCATATGTTTTATCTGCTTGTAGGTGAAAATAATCACTGCTCAACTTCAGAAAACTTTTTAAATAATGTGGAATAATTATATTTACTCTATTTTTCACTTTTTTAACAAAATATTATATTAATATGTAAAGATAATTACGAAATAGGAGAATAAGGAATCCTCCAATTTAAAATACAGTGAAGTAAAAGAAATACAGTTAACCTTGCACGTTATGAACTTCTTACTATTAATGTTTACGAACACAGTTATCTTGTTAATATTTTTTTCTTTTATTTCAAATTGTAAAAACTGATGCAATGAGAATTGCATTTTTCTGGAAGATGTTGAAATTAGACTACTATAGTATTAATGATAGCAAATTTGTTTGGAGAATGTTGATACAGAATTATGAATATCTAAATGCATTTTCCTGGTGAATGATTGGAGGGACCTTTCCTTTAGTACGGTTGATGTGAAGAGATTTGAGACTTTTTTCCCCTTTTCATGGTATATAACTAATGAGTTTTCATGTATGCAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTGTAAGTTCGTGAACTTGGCTGTTTATATTATGTGGCTTTTAGACGCATGGTTAGCCTCAACTTTGACCTTTTTCTTTCCATAAAAAAAGTAAAAAAGACATTGACGTTTCTCTTAGAAACTGCTCATCTTTTATTTCCTTTTATATCAGATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGGTAGTCTTTTGTTATGCTTGTCACATTTTTAGCTTTATTTGCGTGCCAATTTGCCTCGTAATGCTACTTTCTCCTTCTCAAATTATATAGTGTGTAATTATTGGATATGTGAAATTGAGAAAATATTGATTGAATTGAAGTAGGTTTTATTTTCCTAAAATTTTAGTATAACTAGCAAATGGGATTTTATCTTCTTCTGTTCCAACTCTTGCTGGGAAATTTAAATCAAAGTTCTTCACCTCTGT
ACGATTTTTGTACTAAGCTCGACAGAGAAATCCTATGAATAATTGAATCCACAAATTTTCTGTTTTAGTTGTTGCAATATTTCTTAAGTGAAAATTTATAAGGTGAATGTAACTTGAAGCGGAAGAATTAGATTACCTTAAGAAATTATGATTATAATGGTAATGGACTGATAGTTCCAAATAGCATATGGGATGAGCATTTGGAGAAAACCAACTTCAAATTTATATTCATGATGGGGATTGAGTTAGTTCTGGCTTGGCAAGTGTCTATGTGGATATATGCCGTATGTATGTTTTTTGCCTCCTCAATTATCTCAAATGGCTGAAAGAAAGGATGTTTATCCTGCCCTTGCTTAGGTGGTTGGGAGTTGTGACGTCCAATGGATGGGTCCGTCACTTAACCGAAGAATCTTGCATCTTCATAGCATAATAGTTCGTTTTTTTAAATTTAGAATTGAAGAGGAAGGAGGGTTTGACAATAGCTTTTATTAATTCTTGTTGTGGAGTTTGATCCTTTCCTTTGGCCTTGTTTGGTTTAGATGTATCATGTAAAGTTTCTTATACTATAGCAAGACAACGCTTATATCTAGTGAGAATAAATGCTATTGAGATAGGAGGATACAGAAGATGGTCTTAAAATAAAATACTACTACCCATGTAATTAGCAGCTGATACTTGCATGAAATATCTGAGACTGAGTAAGTTTGAGAACAACATGCTAGCTTTAGAAAAAGGCAGTAAACTTTTGGATCACTTCTTCTGTATCGTATGGCCACTACAACTTTCTTGTAATTTGATTTCACAGTGACTGGAAATGAACGACACATCAAACAAGGTGAAGTCATAAAATATTATTTTTGTGTTTAATATGATTAGAAAGTAGTGGAAATGGGAAGGTTGATTTTTGGAGTTCGGAGTTTCTGTGGATTCCTGGTTCCATTCTCCATGTATGTCACTTATATGCTTTAATGATAGGGATCTAAGAATGGTACAAGGATCTGCCTTAACTATCGGGGTTTTAAATTCCTTGTCTATAGGGGCTTGAACATTCTAATAATGGTATAGGTGTCACTTGTAATCTTTGGTTTTCCAAACTTTGCTAATGAAGAAAGTAAGTGGACAGGACGTAGGCTAGGTACAAAAGAATGGCGTTGTCTTGTTATATGCAATGAATACATGAATACCTTTAAGATAAGACTGCTGTGTATGAGTTGGAACGGTCGGTGGATGCACCACCTTGTTTCTTTTCTTGATTTATTTTCTGCATACTTTTAAACGAGACCAAGTGTTGCACTTGTCTGACTCTAGATAAGGATGAATGGCGAATGTTTCTATTTGTACTTTTTCTTATGCATATGGTCTGACTCTAGATAAGTACTTTTTCTGCAGTATTTCTTTGGATTAATGATTAATCGTTCAACTTATGTAGGAAACTGCTTCACGTTTAAGACTTCCTCGTGGTTATTTAAGCCATTATAGCTTTGATCAAGACGTAAATGGTATTTTGTACGTGGCCGATATTGTTCTTTATGAATCTTCCCAAAATGTACAAGATTTTCCTCCCTTGCTCATTCGGGCGATGACCTTTGGAGTCCCAATAGTGGCACCTGATATGCCCATTATTAACCAATATGTGAGTTCATTCTACTTCTCTCCCCTCTTCCTCCACTAGAAAGGAAAAAATATAATAATAGTGTCTTGAAACATGCAGGTTGTTGGGGGGGTCCATGGATTACTTGTTACTAAATTTAGTTCAGATGCTTTGATAAGAGCTCTCTCTAATCTTTGTTTTGATGGAAGGCTCGCTAGAATTGCTAACAATCTTGCTTCATCTGGAAAATTACTTGCCAAAAATCTTCTTGCTTTAGAGTGCATTACTGGATATGCAAATCTGTTGGAGGAAGTCCTCAATTTCCCATCAGACGTTATACTGCCAGGTTCCATTACCCAGCTTCCAGAAGCAGCGTGGGAATGGGATCTCTTTTGGAAGGAAATAATAC
AGGGATCTTCCAATGAGCAACGCGATAAGAATGTTAAAAAGAAATCTAGTGTGGTGATTAAACTCGAAGAGGAGTTCTCTGACCTTGTTAGTCCCTTGAACATCTCCAGTCCTAGAAAGGAGATTTTGGTGCATGATATCCCAACTCAACAAGATTGGGATATTATCGGGGAAATAGATCGTACTGAAGAATATGACAGAGTGGAAATGGAGGAGGTATGTTGTTATTTTTCCATTATGGCTTGTCTGCAATTTTTATGTCACTCACTTTCCTGAGAGCGAGGCCATATGTGATTTGTTATGTAGCTTCAAGAAAGAACAGAAAGAATATTAGGTTCATGGGAAAAAATATATCGTAGCGCACGGAAGTCCGAAAAGATGAAGCTTGAAAATGAGAATGACGAGGAAGATCTCGAAAGGGCAGGGCAAGCAGTATGCATTTATGAGATATACAGCGGACCTGGAGCTTGGTCATTTTTGCATCATGGTTCTATGTTTCGTGGACTTAGTCTTGTGAGCTTCTTCCATCCAAAACTATCTGCTGATATTTTTCTGTTTGTTTTATTTTGTCTCACTTTTCTTTGTTATTATCTTCTGTGTCATTCATTCGAAAATGCAAGTAAGATTTTAGTCCATCAATCGTGCACTCTTGCTAGATCCTCAGTCTTGAATGTTAAACATGTATAAGAGAAGATAGTTTTGCTACATATCCAGCCATCGCTTCCTTAAAAAAACATCAACTAGATTGTTGAATTTAAAACTTGTGAATTTTATGGTGTCAAACATTAAACTCGATCTTGGCCCATTTTTTAACTTTTTGAACTTGTTATTCTTAAATGTGGATGTATTCCTTGCTTCCAAGATACTCATTTTACATTCTTTATTGCTTGTAATTTACATTGTACCCTACAGTCTTCGAGAGCACTGAGGTTGGAATCAGATGATGTCAATGCTCCCAAGCGTCTTCCTCTTTTGGAAGACAGATTCTATCAGGACATTCTTTGTGAGATGGGAGGAATGTTTGCTGTTGCAAATGAGATTGATACAATTCACAGAAGACCTTGGATTGGTTTCCAATCGTGGCAAGCTGACGGTAGGAAGGTAATCCGTACTTGCTAACATTCTTTCTTGCTTAGTTGTACTATTAGGCTGTAAGGCTTCAAGAGAAATGTAAATGGTGGATAGAAATTTTTGGAGGTTCTAGTTTAGACAAGGAATACCTTTACCCCCTTTTCTTAACGGCCGCCCAATCTATTTGGTTTGCAAATGAAAGATTTTCATAACCATCTGGAATCTGGTAGTCAAAGATTTTGAGACTATATCTTCTTATGTAACAATCTTAGGCTTTATTAAACCACTGGCCTCCAGGTGTAATGAACGGGTTTTAAGCTCTATGCTATAATATATCTATGCTGCTAAGTAGATTTTTTTTAATGTATTTATTTATTTACTTCTATTCCTCTGATTCGTAATCAGCCTCTTCTGTTAAAAAGCAGGAGTCATTATCTAAAAAGGCTGGAAAGGTCTTGGAAGAAGCAATTCAGAATAATACTAGAGGGGAAGTTATTTACTTTTGGGCGTACATGGACGTGGATTCTGAAGTCACGGACAGCGCTGATGGTCCTTTTTGGCACACATGTGACATCTTCAATCGGGGACATTGCAGGTATATCAGTCATTCAACATATTCTTGTATGATAAGTTAGTTGTTCCATGTTCATAGGTCAAGTCTTCCATTGTTTTTTTAAAAACAAATAATTCTCATTAAAGGGTATGAAAGGTTTAAAAAGAAGAATTCCAAAGAAATCAGAGGAGCTTACAACAAAAGCATCATTCCAATTGGCATAAATGGATAATCTATTGACATTAACGAATGCATCCCTTTAGTCTTTTGATATTAATTGAAAGTTGACCCAGTACTTCGTATTTTATGATTGGAAATATAGAGAGAGTTTAATTGATTCTGCATCAACTTCCAAGTCGTGCGTGTTAACCAACTTT
ATCATCACTAATTAACATCAAATACGAAATGAAGAACTAAAGTATAATACATTAGAATTTTACATTAAACAGATGCAATTTGAACATGTTCTCACTTAAAATTACATCCATCCAGTTCTACGTTTAAAGATGCCTTTAGGCAGATGTATGGACTACATCCATCACATTCGGAAGCTCTTCCTCCAATGCCTAATGATGGCGGTCTCTGGTCTTATCTGCATAGCTGGGTGATGCCAACCCCTACATTTGTGGAGTTCATAATGTTTTCCCGGTAAGCACATTATATATATATCAGAAGTAGAACCCCATTTATTTCGCACATTCCCTCTATTTTTTTGTTCTTCTTCAAAATTTCCTCCCTCACCACTGTGGATGTAATTTTGGCTTGAATAGTGACTGAAATTTTCGTATGTGAGCTGCAGGATGTTTGTTGATTCCGTAGATGCCGTGAACAGAAAGCTTGACAATAGCAGCAAGTGTTTGCTGGCTTCCACTGGACTGGAGGTAAATATCCTACTTCTCTCCCTTGAGGAGATTACTTCTATGGTGAAATTAATTTCTTTATAATGTTCGAGTTAACGCAATTCTTCCCTAAATACTTCAAAATTTTTGGGAAAAACACTAATTAAATGGTCCCAAAACCACTCCCAATATGTGTTTGAAATGCCTTCTAAAAATATATTTTAAATAAAACACTTATTACATCAGCACCGCAAAAGAAAATTTATTAAGTGTTTCTCCTAAATGCACTTTAAGATTTTTTATTAGGTTCTTAAAATTTTTTAGTCTTCTGTTTTTGATGGGCAGAGAAGGCAGTGTTATTGCCGGCTGTTGGATATCCTGATAAACGTGTGGGCGTACCACAGTGGGCGGAGAATGGTTTATTTAACCCCACGTTCAGGCTCGCTAGTGGAGCAGCATCCCCTTGAAGAACGTCAGGACTTCATGTGGTCCAAATTCTTCAACATCACATTATTGAAAGCCATGGATGCAGACTTGGCCGAAGCTGCCGATGATGGCGATCACCCGAGAACCAAATGGTTATGGCCATTAACAGGAGACGTATTCTGGGAAGGGATGTATGCAAGGAAAAGCAAAGAAAGGCACAGGCACAGGCACAAAGTTGAAAAGAGGACAAAACCCCGACATAAAAAATCAGGCAACCGCCGTAATCATGAACACAAGCAAAAACCACTTGGAAAATAGCTGACAACAAACTAATAGTCTATTTGCAGCAAATGGTAAGTTTAACTCATTTATTTATTTTTGTCTTCTCATCGAAAATTGGGATTTTCTTATTGACTAGATATTTGGTTCTCTTCTTTCAAACGGTTAGCAGCAGATTGTAGATAAGATAGATCGAAATGGTGATGATGCTTTACGAGTACAGAAGACTATCGTCAATTCAAGTACGTCACTTTCTTTCAACCTCTTTTCCATAATACGTGGAGTCTAGCAATTAGCAATTAGATGATGTATATAATATCTGTAATCAACATTTTTTGCATTCAATTTAGAACTTAGCGAAATTGACGAAATGA

mRNA sequence

AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGATATGTGTTTGAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGAACTTAGCGAAATTGACGAAATGA

Coding sequence (CDS)

AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGATATGTGTTTGAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGAACTTAGCGAAATTGACGAAATGA

Protein sequence

KGNSWTSKQKFSRLPGGYKSETERRKNELEKKTRNESLSLHLVLYSRSLFPLKIKLQLPTLLISNQYRFPPFLSRRRFRTIFFMVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRYGWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLAKLTK
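The protein sequence above corresponds to a frame-1 translation of the listed coding sequence. As a hedged illustration, the sketch below translates a nucleotide string with the standard genetic code; the helper name `translate` is hypothetical, the choice of the standard code table is an assumption, and codons containing ambiguous bases are reported as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides):
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored, and codons
    with non-ACGT characters are translated as 'X'.
    """
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)

# First 27 bases of the coding sequence listed in this record.
print(translate("AAGGGAAATTCTTGGACTTCAAAACAG"))  # KGNSWTSKQ
```

The printed peptide matches the first nine residues of the protein sequence shown above.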
BLAST of Cp4.1LG01g14540 vs. TrEMBL
Match: W9QYJ0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005213 PE=4 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 7.7e-103
Identity = 202/391 (51.66%), Positives = 279/391 (71.36%), Query Frame = 1

Query: 96  GACDLGFLSSKERSLSRRNLKQHQEQDN--VSSDRSVCRFRSNLDRR-DRYGWFPFRRRS 155
           G  DLGF S ++R   +RN     ++D   V +DR+  R RS+ + R +R G+  F+ +S
Sbjct: 21  GGNDLGFHSIRDRLRFKRNPNPSHDRDRTKVFADRAPVRGRSHYNSRFNRKGFLWFKGKS 80

Query: 156 FIVLAF-FVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFI 215
            + L   F +F   M  + L+SS+ SVF + S++       LK G TL+FVP RI R+  
Sbjct: 81  TLYLVIIFAVFLFGMASMVLQSSIMSVFKQGSERGRLLREGLKFGTTLRFVPGRISRRLA 140

Query: 216 EGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGE 275
           + N +DRL +E  +  RKPRLAL+L NM+K+S SL LIT++KN+++LGY  +IFAV NG 
Sbjct: 141 DANGLDRLRNEPRIAVRKPRLALVLGNMKKNSESLMLITIVKNIQKLGYALKIFAVENGN 200

Query: 276 ARQMWLKLG-RVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQ 335
           AR MW +LG ++ +L  + +G ++W +FEG+IVDS   KEAI+S+MQEPFC++PLIWI+Q
Sbjct: 201 ARTMWEQLGGQISILGFESYGHMDWSIFEGVIVDSLGAKEAISSLMQEPFCTVPLIWIVQ 260

Query: 336 DDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPV 395
           +D LA RL +Y++ GW +L+SHWRS FSRA+VIVFP+F+LPMLYS LD+GNF VI GSPV
Sbjct: 261 EDTLASRLPVYEEMGWMHLISHWRSAFSRANVIVFPDFSLPMLYSVLDSGNFFVIPGSPV 320

Query: 396 DVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAALYRMGPLLTEF 455
           DVW AE Y  +H K +L    GFG ED +VL+VG+S FYNEL+ +YA A++ +GPLL ++
Sbjct: 321 DVWAAESYVKTHSKTQLRMDYGFGKEDLLVLIVGSSTFYNELAWDYAVAMHSVGPLLIKY 380

Query: 456 ARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           ARRK+  GSFKFVFL GNS+DG ND L+ +A
Sbjct: 381 ARRKDSGGSFKFVFLCGNSTDGYNDVLKEVA 411
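The percentages in an HSP summary line are the listed counts divided by the alignment length. A minimal sketch for recomputing them is shown below; the function name `parse_hsp_stats` is hypothetical, and the regular expression simply accepts whatever label precedes each `count/length` pair.

```python
import re

def parse_hsp_stats(line):
    """Return {label: percent} for every 'Label = N/M' pair in an HSP line.

    Percentages are recomputed from the counts and rounded to two
    decimals; fields without a N/M fraction are skipped.
    """
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        stats[label] = round(100 * int(num) / int(den), 2)
    return stats

hsp = "Identity = 202/391 (51.66%), Positives = 279/391 (71.36%), Query Frame = 1"
stats = parse_hsp_stats(hsp)
print(stats)
```

For the Morus notabilis hit above, this reproduces the reported 51.66% identity and 71.36% positives over a 391-column alignment.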

BLAST of Cp4.1LG01g14540 vs. TrEMBL
Match: A0A072VPQ5_MEDTR (UDP-glycosyltransferase family protein OS=Medicago truncatula GN=MTR_1g090860 PE=4 SV=1)

HSP 1 Score: 377.5 bits (968), Expect = 2.5e-101
Identity = 207/401 (51.62%), Positives = 272/401 (67.83%), Query Frame = 1

Query: 88  SSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRF--RSNL----DRRD 147
           S P + D G  D+ F S + R   +RN   H  Q ++SSDR + R   RS+L     R+ 
Sbjct: 9   SQPEIDDTGGTDVAFSSIRGRFPFKRN-PSHHRQKSLSSDRQLPRSSTRSHLHNRFSRKS 68

Query: 148 RYGWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKF 207
               FP  +  F  L F V+F      + ++SS+TSVF +R+++       L+ G TLKF
Sbjct: 69  LLSLFP--KSGFYALIFAVVFLFAFASMVMQSSITSVFRQRNERGRNLREGLEFGSTLKF 128

Query: 208 VPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYV 267
           VP ++ ++F+  + +DRL  +  +G R PR+ALIL +M  D  SL L+TV++N+++LGYV
Sbjct: 129 VPGKVSQRFLSWDALDRLRFQPRIGVRAPRIALILGHMTVDPQSLMLVTVIQNLQKLGYV 188

Query: 268 FEIFAVGNGEARQMWLKLGRVVL-LSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPF 327
           F+IF VG G AR +W  +G  +   S  Q GQI+W  FEGIIVDS E KEAI+S+MQEPF
Sbjct: 189 FKIFGVGRGNARSIWENIGGGLSPFSTDQQGQIDWSNFEGIIVDSLEAKEAISSLMQEPF 248

Query: 328 CSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTG 387
           CS+PLIWIIQ+D L+ RL +YK  GW++L+SHWRS FSRASVIVFP+F  PMLYS LDTG
Sbjct: 249 CSVPLIWIIQEDSLSNRLPVYKQMGWQHLISHWRSAFSRASVIVFPDFTYPMLYSELDTG 308

Query: 388 NFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAAL 447
           NF VI GSPVDVW AE Y  +H K +L E  GFG  D VVLVVG+S FY++LS EYA A+
Sbjct: 309 NFFVIPGSPVDVWAAESYSKTHTKDQLRELSGFGKNDMVVLVVGSSIFYDDLSWEYAVAM 368

Query: 448 YRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
             +GPLLT++ARR +   SFKFVFL GNS+DG +DALQ +A
Sbjct: 369 NSIGPLLTKYARRNDAAESFKFVFLCGNSTDGYDDALQEVA 406

BLAST of Cp4.1LG01g14540 vs. TrEMBL
Match: A0A0L9TE45_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan588s001600 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 6.3e-97
Identity = 199/410 (48.54%), Positives = 275/410 (67.07%), Query Frame = 1

Query: 88  SSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQ----------DNVSSDRSVCR--FRS 147
           +S P +DD   D+GF + +     +RN   ++ +           N SS  S  R    S
Sbjct: 8   ASQPEIDDAGGDIGFHAIRGGFPFKRNPSHYRHRGSFDRQLPRTSNSSSSNSSSRSHLHS 67

Query: 148 NLDRRDRYGW-FPFRRRS--FIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAE 207
            L R+    W FPF +       L   V+F      + +++S+TSVF +R+++   R   
Sbjct: 68  RLTRKGLLLWLFPFSKSKSGLYALIIAVVFLFAFASVVMQNSITSVFRQRAERGRYRLEG 127

Query: 208 LKPGRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVM 267
           L+ G TL+FVP R+ + F+ GN +DR+ S+  +G R PR+ALIL +M  D  SL L+TV+
Sbjct: 128 LRFGTTLRFVPGRVSQGFLSGNGLDRIRSQPRLGVRPPRIALILGHMTIDPQSLMLVTVI 187

Query: 268 KNMRELGYVFEIFAVGNGEARQMWLKLGR-VVLLSPKQFGQINWLLFEGIIVDSFEGKEA 327
           +N+++LGYVF+IFAVG+G+A  +W  +G  +  L+ ++ G I+W +FEGIIV S E KEA
Sbjct: 188 RNLQKLGYVFKIFAVGHGKAHSIWESIGGGISRLNIEKQGLIDWSIFEGIIVGSLEAKEA 247

Query: 328 ITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALP 387
           ++S+MQEPFCSIPLIWIIQ+D L+ RL +Y+  GWE+LVSHWR+ FSRASV+VFP+F  P
Sbjct: 248 VSSLMQEPFCSIPLIWIIQEDRLSSRLPVYEQMGWEHLVSHWRNAFSRASVVVFPDFTYP 307

Query: 388 MLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNE 447
           MLYS LDTGNF VI GSPVDVW AE Y+ +H K +L E  GF   D VVLVVG+S FY++
Sbjct: 308 MLYSGLDTGNFFVIPGSPVDVWAAERYRETHGKDQLRELSGFDKYDMVVLVVGSSVFYDD 367

Query: 448 LSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           LS +YA A++ +GPLLT++ARR +   SFKFVFL GNS+DG +DALQ +A
Sbjct: 368 LSWDYAVAMHSIGPLLTKYARRNDATESFKFVFLCGNSTDGSDDALQEVA 417

BLAST of Cp4.1LG01g14540 vs. TrEMBL
Match: A0A0S3SP93_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G121500 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 6.3e-97
Identity = 199/410 (48.54%), Positives = 275/410 (67.07%), Query Frame = 1

Query: 88  SSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQ----------DNVSSDRSVCR--FRS 147
           +S P +DD   D+GF + +     +RN   ++ +           N SS  S  R    S
Sbjct: 8   ASQPEIDDAGGDIGFHAIRGGFPFKRNPSHYRHRGSFDRQLPRTSNSSSSNSSSRSHLHS 67

Query: 148 NLDRRDRYGW-FPFRRRS--FIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAE 207
            L R+    W FPF +       L   V+F      + +++S+TSVF +R+++   R   
Sbjct: 68  RLTRKGLLLWLFPFSKSKSGLYALIIAVVFLFAFASVVMQNSITSVFRQRAERGRYRLEG 127

Query: 208 LKPGRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVM 267
           L+ G TL+FVP R+ + F+ GN +DR+ S+  +G R PR+ALIL +M  D  SL L+TV+
Sbjct: 128 LRFGTTLRFVPGRVSQGFLSGNGLDRIRSQPRLGVRPPRIALILGHMTIDPQSLMLVTVI 187

Query: 268 KNMRELGYVFEIFAVGNGEARQMWLKLGR-VVLLSPKQFGQINWLLFEGIIVDSFEGKEA 327
           +N+++LGYVF+IFAVG+G+A  +W  +G  +  L+ ++ G I+W +FEGIIV S E KEA
Sbjct: 188 RNLQKLGYVFKIFAVGHGKAHSIWESIGGGISRLNIEKQGLIDWSIFEGIIVGSLEAKEA 247

Query: 328 ITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALP 387
           ++S+MQEPFCSIPLIWIIQ+D L+ RL +Y+  GWE+LVSHWR+ FSRASV+VFP+F  P
Sbjct: 248 VSSLMQEPFCSIPLIWIIQEDRLSSRLPVYEQMGWEHLVSHWRNAFSRASVVVFPDFTYP 307

Query: 388 MLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNE 447
           MLYS LDTGNF VI GSPVDVW AE Y+ +H K +L E  GF   D VVLVVG+S FY++
Sbjct: 308 MLYSGLDTGNFFVIPGSPVDVWAAERYRETHGKDQLRELSGFDKYDMVVLVVGSSVFYDD 367

Query: 448 LSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           LS +YA A++ +GPLLT++ARR +   SFKFVFL GNS+DG +DALQ +A
Sbjct: 368 LSWDYAVAMHSIGPLLTKYARRNDATESFKFVFLCGNSTDGSDDALQEVA 417

BLAST of Cp4.1LG01g14540 vs. TrEMBL
Match: A0A0R0I366_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G200000 PE=4 SV=1)

HSP 1 Score: 360.9 bits (925), Expect = 2.4e-96
Identity = 196/412 (47.57%), Positives = 272/412 (66.02%), Query Frame = 1

Query: 88  SSPPVVDDGAC--DLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRYG- 147
           +S P +DDG    D+GF + +     +RN   H+ +   S DR + R  +N +  +    
Sbjct: 8   ASQPEIDDGGGGGDIGFGAIRGGFPFKRNPSHHRHRG--SFDRQLPRSNNNSNSNNNINR 67

Query: 148 -----------W---FPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPRE 207
                      W   FP  +  F      V+F   +  L ++SS+TSVF +R+++A    
Sbjct: 68  SHLHKRKGLLLWLFPFPKSKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAERASYIR 127

Query: 208 AELKPGRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLIT 267
             ++ G  L+FVP +I ++F+ G+ +D + S+  +G R PR+ALIL +M  D  SL L+T
Sbjct: 128 GGIRFGSALRFVPGKISQRFLSGDGLDPVRSQPRIGVRAPRIALILGHMTIDPQSLMLVT 187

Query: 268 VMKNMRELGYVFEIFAVGNGEARQMWLKLGRVVL-LSPKQFGQINWLLFEGIIVDSFEGK 327
           V++N+++LGYVF+IFAVG+G+AR +W  +G  +  LS K  G I+W +FEGIIVDS E K
Sbjct: 188 VIRNLQKLGYVFKIFAVGHGKARSIWENIGGGISPLSAKHQGLIDWSIFEGIIVDSLEAK 247

Query: 328 EAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFA 387
            AI+S+MQ+PFCS+PLIWIIQ+D L+ RL +Y+  GWE++VSHWRS FSRA V+VFP+F 
Sbjct: 248 VAISSVMQDPFCSVPLIWIIQEDSLSSRLPVYEQMGWEHIVSHWRSAFSRAGVVVFPDFT 307

Query: 388 LPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FY 447
            PMLYS LDTGNF VI GSPVDVW AE Y  +H K +L E  GFG  D +VLVVG+S FY
Sbjct: 308 YPMLYSELDTGNFFVIPGSPVDVWAAESYSKTHAKDQLRELSGFGKNDMLVLVVGSSVFY 367

Query: 448 NELSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           + LS +YA A++ +GPLLT++ARR     SFKFVFL GNS+DG +DALQ +A
Sbjct: 368 DNLSWDYAVAMHSVGPLLTKYARRNGATDSFKFVFLCGNSTDGYDDALQGVA 417

BLAST of Cp4.1LG01g14540 vs. TAIR10
Match: AT5G04480.1 (AT5G04480.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 332.0 bits (850), Expect = 6.0e-91
Identity = 184/390 (47.18%), Positives = 251/390 (64.36%), Query Frame = 1

Query: 96  GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRR--DRYGWFPFRR-RS 155
           G  D  F S ++R   +RN    +++ +   DR   R R +   R  +R G     + R 
Sbjct: 29  GNGDTSFHSIRDRLRLKRNSSDRRDRSHSGLDRPSLRTRPHHIGRSLNRKGLLSLLKPRG 88

Query: 156 FIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFIE 215
             +L F V FT+  F +       S+  + + K     +++  G TLK+VP  I R  IE
Sbjct: 89  TCLLYFLVAFTVCAFVMSSLLLQNSITWQGNVKGGQVRSQIGLGSTLKYVPGGIARTLIE 148

Query: 216 GNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGEA 275
           G  +D L S   +G R PRLAL+L NM+KD  +L L+TVMKN+++LGYVF++FAV NGEA
Sbjct: 149 GKGLDPLRSAVRIGVRPPRLALVLGNMKKDPRTLMLVTVMKNLQKLGYVFKVFAVENGEA 208

Query: 276 RQMWLKL-GRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQD 335
           R +W +L G V +L  +Q G  +W +FEG+I DS E KEAI+S+MQEPF S+PLIWI+ +
Sbjct: 209 RSLWEQLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFRSVPLIWIVHE 268

Query: 336 DILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPVD 395
           DILA RL +Y+  G  +L+SHWRS F+RA V+VFP F LPML+S LD GNF VI  S VD
Sbjct: 269 DILANRLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVD 328

Query: 396 VWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAALYRMGPLLTEFA 455
           VW AE Y  +H K  L E   FG +D ++LV+G+S FY+E S + A A++ +GPLLT + 
Sbjct: 329 VWAAESYSETHTKQNLREINEFGEDDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYG 388

Query: 456 RRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           RRK+  GSFKFVFLYGNS+ G +DA+Q +A
Sbjct: 389 RRKDTSGSFKFVFLYGNSTKGQSDAVQEVA 418

BLAST of Cp4.1LG01g14540 vs. TAIR10
Match: AT4G01210.1 (AT4G01210.1 glycosyl transferase family 1 protein)

HSP 1 Score: 166.4 bits (420), Expect = 4.4e-41
Identity = 119/409 (29.10%), Positives = 200/409 (48.90%), Query Frame = 1

Query: 96  GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRYGWFPFRRRSFIV 155
           G+ + G  + ++    R   +Q Q+Q        + R RS L R      F +     I+
Sbjct: 2   GSLESGIPTKRDNGGVRGGRQQQQQQQQ--QQFFLQRNRSRLSRFFLLKSFNYLLWISII 61

Query: 156 LAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKP-------------GRTLKFV 215
             FF  F   +FQ+FL      + + +S K W  +  L P             G  ++  
Sbjct: 62  CVFF--FFAVLFQMFLPG----LVIDKSDKPWISKEILPPDLVGFREKGFLDFGDDVRIE 121

Query: 216 PQRIPRKFIEGNEVDRLHSED------HVGFRKPRLALILRNMEKDSLSLFLITVMKNMR 275
           P ++  KF          S          GFRKP+LAL+  ++  D   + ++++ K ++
Sbjct: 122 PTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDLLADPEQVLMVSLSKALQ 181

Query: 276 ELGYVFEIFAVGNGEARQMWLKLG-RVVLLSPKQFGQ--INWLLFEGIIVDSFEGKEAIT 335
           E+GY  E++++ +G    +W K+G  V +L P Q     I+WL ++GIIV+S   +   T
Sbjct: 182 EVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSYDGIIVNSLRARSMFT 241

Query: 336 SIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPML 395
             MQEPF S+PLIW+I ++ LA R + Y   G   L++ W+  FSRASV+VF N+ LP+L
Sbjct: 242 CFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLLPIL 301

Query: 396 YSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSF-YNELS 455
           Y+  D GNF+VI GSP      E+ K+ + +F   +      +D V+ +VG+ F Y    
Sbjct: 302 YTEFDAGNFYVIPGSP-----EEVCKAKNLEFPPQK------DDVVISIVGSQFLYKGQW 361

Query: 456 PEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLAK 482
            E+A  L  + PL +     ++     K + L G ++   + A++ +++
Sbjct: 362 LEHALLLQALRPLFSG-NYLESDNSHLKIIVLGGETASNYSVAIETISQ 390

BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match: gi|659092332|ref|XP_008447017.1| (PREDICTED: uncharacterized protein LOC103489564 [Cucumis melo])

HSP 1 Score: 612.8 bits (1579), Expect = 5.0e-172
Identity = 307/397 (77.33%), Positives = 339/397 (85.39%), Query Frame = 1

Query: 84  MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
           M+ +S PP  DDG   +GFLS +ERSLS+RNLKQHQEQDNVSSDR V R RSNL R D  
Sbjct: 1   MMQESFPPSDDDGDGGIGFLSYRERSLSKRNLKQHQEQDNVSSDRPVTRSRSNLGRSDTR 60

Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
            WF F RRS    A F L  +F+   +LES MTSVFLKRS+KAW R+AELK G TLKF P
Sbjct: 61  RWFAFSRRSIFAFAGFSLLLLFVVTFYLESLMTSVFLKRSEKAWSRDAELKLGMTLKFAP 120

Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
           QRIPRKFIEGNEVDRLHS++  GFRKPRLALILR+MEKDS SLFLITVMKNM+ELGY FE
Sbjct: 121 QRIPRKFIEGNEVDRLHSDNRFGFRKPRLALILRSMEKDSQSLFLITVMKNMKELGYAFE 180

Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
           IFAV NGEARQMW +LGR+VLLSPKQFGQI+WLLFEGIIVDSFEGKEAITSIM EPFCS+
Sbjct: 181 IFAVANGEARQMWQELGRLVLLSPKQFGQIDWLLFEGIIVDSFEGKEAITSIMVEPFCSV 240

Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
           PLIWIIQDDIL+KRL MYKD+GWENLVSHWRSTFSRASV+VFPNFALPM YSALDTGNFH
Sbjct: 241 PLIWIIQDDILSKRLNMYKDRGWENLVSHWRSTFSRASVVVFPNFALPMFYSALDTGNFH 300

Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
           VI GSPVDVW+AEIYK +HFK++LG+KLGF +ED VVLVVG+SFYNELS EYA AL RMG
Sbjct: 301 VIQGSPVDVWSAEIYKKTHFKYELGKKLGFDVEDIVVLVVGSSFYNELSSEYAVALNRMG 360

Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           P+LT+   RKNP  SFKFVFL GNS++GCNDALQ  A
Sbjct: 361 PVLTKLP-RKNPEVSFKFVFLCGNSTNGCNDALQETA 396

BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match: gi|1009119918|ref|XP_015876641.1| (PREDICTED: uncharacterized protein LOC107413250 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 394.0 bits (1011), Expect = 3.7e-106
Identity = 206/402 (51.24%), Positives = 287/402 (71.39%), Query Frame = 1

Query: 88  SSPP-VVDDGACDLGFLSSKERSLSRRNLK--QHQEQDNVSSDRSVCRFRSNLDRRDRYG 147
           SSPP ++DD   DLGF S ++R   RRN    Q++ +  +  DR   R+RS+  R +R G
Sbjct: 8   SSPPGILDDNGNDLGFHSIRDRFRFRRNSNPSQNRGRGRIFPDRLSSRYRSHHGRFNRKG 67

Query: 148 W---FPFRRRSFIVLAFFVLFTMF-MFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLK 207
           +   FPF+ +  + L   +   +F M  + L+SS+T VF + S++       LK G TL+
Sbjct: 68  FLLLFPFKGKLALYLVIMLALVLFAMASMVLQSSITLVFRQGSERGRLFRYGLKFGSTLR 127

Query: 208 FVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGY 267
           FVP RI R+ +EG  VDR  ++  +G R PRLALIL +M KD+ SL L+TV+KN+++LGY
Sbjct: 128 FVPGRISRRIMEGGGVDRFRNQARIGVRPPRLALILGHMTKDAQSLMLVTVIKNIKKLGY 187

Query: 268 VFEIFAVGNGEARQMWLKLG-RVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEP 327
           V +IFAV NG A  MW ++G ++ +L P+ FG I+W +F+GI+VDSFE K A++S+MQEP
Sbjct: 188 VLKIFAVQNGNAHSMWEQVGGQISILDPEHFGHIDWTIFDGIVVDSFEAKAALSSLMQEP 247

Query: 328 FCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDT 387
           F SIPLIWIIQ+D LAKRL +Y++ GW++L+SHW++   RA++IVFP+F LPMLYS LDT
Sbjct: 248 FSSIPLIWIIQEDTLAKRLPVYEEMGWKHLISHWKNALGRANLIVFPDFTLPMLYSVLDT 307

Query: 388 GNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAA 447
           GNF V+ GSPVD+W AE Y  +H K +L    GF  ED +VLVVG+S F++ELS +YA A
Sbjct: 308 GNFFVVPGSPVDIWAAESYSKTHSKIQLRNDSGFSEEDLLVLVVGSSLFFDELSWDYAVA 367

Query: 448 LYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           ++ +GPLLT++A+RK+P GSFKFVFL GNS+DG +DALQ +A
Sbjct: 368 MHAIGPLLTKYAKRKDPGGSFKFVFLCGNSTDGHDDALQEVA 409

BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match: gi|1009119916|ref|XP_015876640.1| (PREDICTED: uncharacterized protein LOC107413250 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 394.0 bits (1011), Expect = 3.7e-106
Identity = 206/402 (51.24%), Positives = 287/402 (71.39%), Query Frame = 1

Query: 88  SSPP-VVDDGACDLGFLSSKERSLSRRNLK--QHQEQDNVSSDRSVCRFRSNLDRRDRYG 147
           SSPP ++DD   DLGF S ++R   RRN    Q++ +  +  DR   R+RS+  R +R G
Sbjct: 8   SSPPGILDDNGNDLGFHSIRDRFRFRRNSNPSQNRGRGRIFPDRLSSRYRSHHGRFNRKG 67

Query: 148 W---FPFRRRSFIVLAFFVLFTMF-MFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLK 207
           +   FPF+ +  + L   +   +F M  + L+SS+T VF + S++       LK G TL+
Sbjct: 68  FLLLFPFKGKLALYLVIMLALVLFAMASMVLQSSITLVFRQGSERGRLFRYGLKFGSTLR 127

Query: 208 FVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGY 267
           FVP RI R+ +EG  VDR  ++  +G R PRLALIL +M KD+ SL L+TV+KN+++LGY
Sbjct: 128 FVPGRISRRIMEGGGVDRFRNQARIGVRPPRLALILGHMTKDAQSLMLVTVIKNIKKLGY 187

Query: 268 VFEIFAVGNGEARQMWLKLG-RVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEP 327
           V +IFAV NG A  MW ++G ++ +L P+ FG I+W +F+GI+VDSFE K A++S+MQEP
Sbjct: 188 VLKIFAVQNGNAHSMWEQVGGQISILDPEHFGHIDWTIFDGIVVDSFEAKAALSSLMQEP 247

Query: 328 FCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDT 387
           F SIPLIWIIQ+D LAKRL +Y++ GW++L+SHW++   RA++IVFP+F LPMLYS LDT
Sbjct: 248 FSSIPLIWIIQEDTLAKRLPVYEEMGWKHLISHWKNALGRANLIVFPDFTLPMLYSVLDT 307

Query: 388 GNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAA 447
           GNF V+ GSPVD+W AE Y  +H K +L    GF  ED +VLVVG+S F++ELS +YA A
Sbjct: 308 GNFFVVPGSPVDIWAAESYSKTHSKIQLRNDSGFSEEDLLVLVVGSSLFFDELSWDYAVA 367

Query: 448 LYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           ++ +GPLLT++A+RK+P GSFKFVFL GNS+DG +DALQ +A
Sbjct: 368 MHAIGPLLTKYAKRKDPGGSFKFVFLCGNSTDGHDDALQEVA 409

BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match: gi|703094213|ref|XP_010095179.1| (hypothetical protein L484_005213 [Morus notabilis])

HSP 1 Score: 382.5 bits (981), Expect = 1.1e-102
Identity = 202/391 (51.66%), Positives = 279/391 (71.36%), Query Frame = 1

Query: 96  GACDLGFLSSKERSLSRRNLKQHQEQDN--VSSDRSVCRFRSNLDRR-DRYGWFPFRRRS 155
           G  DLGF S ++R   +RN     ++D   V +DR+  R RS+ + R +R G+  F+ +S
Sbjct: 21  GGNDLGFHSIRDRLRFKRNPNPSHDRDRTKVFADRAPVRGRSHYNSRFNRKGFLWFKGKS 80

Query: 156 FIVLAF-FVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFI 215
            + L   F +F   M  + L+SS+ SVF + S++       LK G TL+FVP RI R+  
Sbjct: 81  TLYLVIIFAVFLFGMASMVLQSSIMSVFKQGSERGRLLREGLKFGTTLRFVPGRISRRLA 140

Query: 216 EGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGE 275
           + N +DRL +E  +  RKPRLAL+L NM+K+S SL LIT++KN+++LGY  +IFAV NG 
Sbjct: 141 DANGLDRLRNEPRIAVRKPRLALVLGNMKKNSESLMLITIVKNIQKLGYALKIFAVENGN 200

Query: 276 ARQMWLKLG-RVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQ 335
           AR MW +LG ++ +L  + +G ++W +FEG+IVDS   KEAI+S+MQEPFC++PLIWI+Q
Sbjct: 201 ARTMWEQLGGQISILGFESYGHMDWSIFEGVIVDSLGAKEAISSLMQEPFCTVPLIWIVQ 260

Query: 336 DDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPV 395
           +D LA RL +Y++ GW +L+SHWRS FSRA+VIVFP+F+LPMLYS LD+GNF VI GSPV
Sbjct: 261 EDTLASRLPVYEEMGWMHLISHWRSAFSRANVIVFPDFSLPMLYSVLDSGNFFVIPGSPV 320

Query: 396 DVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAALYRMGPLLTEF 455
           DVW AE Y  +H K +L    GFG ED +VL+VG+S FYNEL+ +YA A++ +GPLL ++
Sbjct: 321 DVWAAESYVKTHSKTQLRMDYGFGKEDLLVLIVGSSTFYNELAWDYAVAMHSVGPLLIKY 380

Query: 456 ARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           ARRK+  GSFKFVFL GNS+DG ND L+ +A
Sbjct: 381 ARRKDSGGSFKFVFLCGNSTDGYNDVLKEVA 411

BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match: gi|502118210|ref|XP_004496154.1| (PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum])

HSP 1 Score: 378.6 bits (971), Expect = 1.6e-101
Identity = 210/407 (51.60%), Positives = 283/407 (69.53%), Query Frame = 1

Query: 88  SSPPVVDD--GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRF----RSNLDRR- 147
           SS P +DD  G  D+GF S + R   +RN   ++++   SSDR + R     RS+L  R 
Sbjct: 8   SSQPEIDDAGGGSDVGFSSIRGRFPFKRNPNLNRDRHRSSSDRQLPRSANSSRSHLHNRF 67

Query: 148 DRYGW---FPF--RRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKP 207
            R G+   FPF   +     L F V+F   +  + +++S+TSVF +R++ +      LK 
Sbjct: 68  TRKGFLSLFPFFKGKSGLYALIFVVVFLFALASMVMQNSITSVFRQRNEGSRYLREGLKF 127

Query: 208 GRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM 267
           G T+KFVP ++ +KF+ G+ +DRL S+  +G R PR+ALIL +M  D  SL L+TV++N+
Sbjct: 128 GSTIKFVPGKVSQKFLSGDGLDRLRSQPRIGVRSPRIALILGHMSVDPQSLMLVTVIQNL 187

Query: 268 RELGYVFEIFAVGNGEARQMWLKLGR-VVLLSPKQFGQINWLLFEGIIVDSFEGKEAITS 327
           ++LGYVF+IF VG+ +AR +W  +G  +  LS +Q GQI+W  +  IIVDS E KEAI+S
Sbjct: 188 QKLGYVFKIFVVGHRKARSIWENVGGGLSSLSTEQQGQIDWSTYXXIIVDSLEAKEAISS 247

Query: 328 IMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLY 387
           +MQEPFCSIPLIWIIQ+D L+ RL +Y+  GW++LVSHWRS FSRASVIVFP+F  PMLY
Sbjct: 248 LMQEPFCSIPLIWIIQEDSLSSRLPVYEQMGWQHLVSHWRSAFSRASVIVFPDFTYPMLY 307

Query: 388 SALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSP 447
           S LDTGNF VI GSPVDVW AE Y+ +H K +L E  GFG  D VVLVVG+S FY++LS 
Sbjct: 308 SELDTGNFFVIPGSPVDVWAAESYRKTHSKDQLRELSGFGKNDMVVLVVGSSIFYDDLSW 367

Query: 448 EYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
           EYA A++ +GPLLT++ARR +   SFKFVFL GNS+DG +DALQ +A
Sbjct: 368 EYAVAMHSIGPLLTKYARRSDAAESFKFVFLCGNSTDGYDDALQEVA 414
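
The percentages in the HSP header lines above are the matched counts divided by the alignment length (for example 202/391 for identity). The following is a minimal sketch, not part of the original report, that recomputes them from a header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, and the pattern also tolerates the label spelling "Postives" that appears in some exports of this report.

```python
import re

# Matches fields such as "Identity = 202/391"; the optional "i" lets the
# pattern accept both "Positives" and the misspelled label "Postives".
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(header: str) -> dict:
    """Recompute percent identity/positives from matched/aligned counts."""
    result = {}
    for label, matched, aligned in FIELD.findall(header):
        name = "Identity" if label == "Identity" else "Positives"
        result[name] = round(100 * int(matched) / int(aligned), 2)
    return result


header = "Identity = 202/391 (51.66%), Positives = 279/391 (71.36%), Query Frame = 1"
print(hsp_percentages(header))
```

For the Morus notabilis HSP this gives 51.66 and 71.36, which agrees with the values printed in the report.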

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
W9QYJ0_9ROSA      7.7e-103  51.66     Uncharacterized protein OS=Morus notabilis GN=L484_005213 PE=4 SV=1
A0A072VPQ5_MEDTR  2.5e-101  51.62     UDP-glycosyltransferase family protein OS=Medicago truncatula GN=MTR_1g090860 PE...
A0A0L9TE45_PHAAN  6.3e-97   48.54     Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan588s001600 PE=4 SV=1
A0A0S3SP93_PHAAN  6.3e-97   48.54     Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G121500 PE=...
A0A0R0I366_SOYBN  2.4e-96   47.57     Uncharacterized protein OS=Glycine max GN=GLYMA_10G200000 PE=4 SV=1

Match Name   E-value  Identity  Description
AT5G04480.1  6.0e-91  47.18     UDP-Glycosyltransferase superfamily protein
AT4G01210.1  4.4e-41  29.10     glycosyl transferase family 1 protein

Match Name                         E-value   Identity  Description
gi|659092332|ref|XP_008447017.1|   5.0e-172  77.33     PREDICTED: uncharacterized protein LOC103489564 [Cucumis melo]
gi|1009119918|ref|XP_015876641.1|  3.7e-106  51.24     PREDICTED: uncharacterized protein LOC107413250 isoform X2 [Ziziphus jujuba]
gi|1009119916|ref|XP_015876640.1|  3.7e-106  51.24     PREDICTED: uncharacterized protein LOC107413250 isoform X1 [Ziziphus jujuba]
gi|703094213|ref|XP_010095179.1|   1.1e-102  51.66     hypothetical protein L484_005213 [Morus notabilis]
gi|502118210|ref|XP_004496154.1|   1.6e-101  51.60     PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009058      biosynthetic process
biological_process  GO:0008150      biological_process
cellular_component  GO:0044444      cytoplasmic part
cellular_component  GO:0012505      endomembrane system
cellular_component  GO:0043231      intracellular membrane-bounded organelle
cellular_component  GO:0005794      Golgi apparatus
cellular_component  GO:0016020      membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g14540.1  Cp4.1LG01g14540.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term      Source Description   Alignment
None      No IPR available  PANTHER  PTHR12526        GLYCOSYLTRANSFERASE  coord: 211..480, score: 4.2
None      No IPR available  PANTHER  PTHR12526:SF354  SUBFAMILY NOT NAMED  coord: 211..480, score: 4.2

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                      Block
Cp4.1LG01g14540  Cucsa.308850    Cucumber (Gy14) v1            cgycpeB0806
Cp4.1LG01g14540  CmaCh15G004910  Cucurbita maxima (Rimu)       cmacpeB313
Cp4.1LG01g14540  CmaCh04G014030  Cucurbita maxima (Rimu)       cmacpeB720
Cp4.1LG01g14540  CmaCh04G025390  Cucurbita maxima (Rimu)       cmacpeB733
Cp4.1LG01g14540  CmoCh04G014770  Cucurbita moschata (Rifu)     cmocpeB673
Cp4.1LG01g14540  CmoCh15G005000  Cucurbita moschata (Rifu)     cmocpeB275
Cp4.1LG01g14540  CmoCh04G026580  Cucurbita moschata (Rifu)     cmocpeB689
Cp4.1LG01g14540  Cla013723       Watermelon (97103) v1         cpewmB394
Cp4.1LG01g14540  Cla020519       Watermelon (97103) v1         cpewmB427
Cp4.1LG01g14540  Csa5G616900     Cucumber (Chinese Long) v2    cpecuB439
Cp4.1LG01g14540  MELO3C012380    Melon (DHL92) v3.5.1          cpemeB363
Cp4.1LG01g14540  ClCG05G021470   Watermelon (Charleston Gray)  cpewcgB388
Cp4.1LG01g14540  ClCG08G007970   Watermelon (Charleston Gray)  cpewcgB416
Cp4.1LG01g14540  CSPI05G26640    Wild cucumber (PI 183967)     cpecpiB439
Cp4.1LG01g14540  Lsi08G006720    Bottle gourd (USVL1VR-Ls)     cpelsiB384
Cp4.1LG01g14540  Lsi04G001200    Bottle gourd (USVL1VR-Ls)     cpelsiB352
Cp4.1LG01g14540  MELO3C019854.2  Melon (DHL92) v3.6.1          cpemedB469
Cp4.1LG01g14540  MELO3C012379.2  Melon (DHL92) v3.6.1          cpemedB423
Cp4.1LG01g14540  CsaV3_5G035780  Cucumber (Chinese Long) v3    cpecucB0543
Cp4.1LG01g14540  Bhi07G000674    Wax gourd                     cpewgoB0569
Cp4.1LG01g14540  Bhi04G001038    Wax gourd                     cpewgoB0535
Cp4.1LG01g14540  CsGy5G026080    Cucumber (Gy14) v2            cgybcpeB648
Cp4.1LG01g14540  Carg01926       Silver-seed gourd             carcpeB1177
Cp4.1LG01g14540  Carg02434       Silver-seed gourd             carcpeB0369
Cp4.1LG01g14540  Carg04596       Silver-seed gourd             carcpeB0963

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG01g14540  Cp4.1LG01g24720  Cucurbita pepo (Zucchini)  cpecpeB379

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG01g14540  Cucurbita pepo (Zucchini)  cpecpeB040
Cp4.1LG01g14540  Cucurbita maxima (Rimu)    cmacpeB429
Cp4.1LG01g14540  Cucurbita moschata (Rifu)  cmocpeB392
Cp4.1LG01g14540  Silver-seed gourd          carcpeB0678