CmaCh13G000650 (gene) Cucurbita maxima (Rimu)

Name: CmaCh13G000650
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: O-glucosyltransferase rumi
Location: Cma_Chr13 : 536460 .. 540565 (-)
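
The location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed follows. The helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cma_Chr13 : 536460 .. 540565 (-)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Cma_Chr13 : 536460 .. 540565 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 4106 bp on the minus strand of Cma_Chr13.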
The following sequences are available for this feature:

Gene sequence (with intron)

TTGAGTTTCCAAATAAGGCATTTTCCAATTCCCATTCCATTACCTTATCACACAATGCAGATCCTCACATCTCCGCCATGGCTCCGCTACATAGACCTCCCACATCCCGCGCTCTCTCCTCCCGCACTCCCTCCAACATTCTCCCCTCCGTCGTCGCCCTCTCCTTCCTTGCCCTTACTTTCCTCGTCTGCTACAAGGTCTGCCACTCCATTGTTTTATTGCAGCATTTCTTCGAACACCTTGTGTGCTGATTCCATTTCTGATTTCTTTGCTGTGTTTCAGGTGGATGACTTTGCTGCTCAAACCAAGACTGTTGCTGGCCACAACTTGGATCCAACCCCATGGCATTTGTTCCCGCCAAAAATCTTCAGCGAAGATACTCGCCATGCCAGAACTGTCAAGATCATCCACTGCTCTTACCTCGCTTGCCGCTACGCCAACAACACTGCTACTCGATTACCTTTGCATTCCGCTGTTTCGACTCACCAATGCCCTGAACTCTTCCGCTGGATTCATCACGATCTGGATCCGTGGGCTCGAAGCCGAATATCGATGAAACACTTGGATGAATCTATGAAATTTGCGGCGTTTCGTGTTGTGATCGTGGAGGGTAGGCTTTATGTGGATATGTATTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAATTGCTGAGAAGGTTCCCTGGAATGGTGCCGGATGTGGACATGATGTTTGATTGTATGGACAGGCCGACTATCAATCGGACTGAGAACAAAGACATGCCGCTGCCTCTGTTTCGGTATTGCACGACCGACGCTCACTTTGACATTCCATTTCCCGATTGGTCTTTCTGGGGATGGTATGTTATGCGTATGACATAATATCCTTTCACTGCTTTGGTCATTTGATTTGTCTCATGCTTCAATTTTAACTTTTTGTCTGAAAACTTTTGAACCATAAAAAATACTAACTCTGCCTTTTATTTAAAAAATATTCATTCATTCTTTCTTTTAGCGACTATGGTAAACTTTTTAAAATTACTTTTGAAATAGATCTTAAAAAAAATAGTGGATTAAAAATCGACATGGACTTAATTAGGACTCTTGTTTGGATTAAGTTTGAAAGAATAGTATTTAGATTTATTGACCTTTCGTCCAATTACTAAAGATCAAAGGTAAAAAAATACACCGAATAGCATGCTGCATAGAGTTAATATGAATTGTTAAACATAAAATTGAAAGTTTATAAACTTATTTGAAAGTTTATAAGATATAGAGACTAGATAGACTCGAATTTATTATAAACTTTTTAAAATATAAGATTTTATTGAAGCAATTTGATCATATTACTTTTAGAATTCTTGTCTTGGTTGGTGTTTTTCTCAAACAATTCTATGTTATGTGATGGGGAAACTTATGCTATATTTCATCAATGGATGTGGGAATCTGATGCAGGCCAGAAGTGAACATAAGATCATGGGGGGAAGAGTTTAAAGATATCAAGAAAAGTTCAAAAAGCTCGAATTGGTCGAGCAAGTTACCTCGAGCATATTGGAAGGGAAATCCAGATGTTGCTTCCCCTGTTCGTACCGAGTTGCTGACTTGCAATCACTCAATAAAGTGGGGTGCTCAGATCATGCGTCAGGTTCAAACTGGAACCCTCAATTTCTCTTGCTGCATTTTTGTATTATAAGTGAACAATTAATTGTTGAGAACTGAGATAACCATTCTGTTGATGGAAGGACTGGGATCAAGAAGCAAGAGATGGTTTTGAGCAGTCCAAGCTATCTAAACAATGCAACCACCGGTGAAGATTCTTTTGAATAGTTAACCAAATTCAGCTATTAGGAGACGTAGACATGTCAGTTTTGTTGTCTTAGCTGAGCCCACCATGTGTTTGTCTGCTTCTGCAGGTATAAAATCTATGCTGAAGGGTTCGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCCCT
ATATCAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTACTGGCCCATCCCCTTCGATAATATGTGCGAGTCTATTAAGCATGCTGTTGACTGGGGAAATGATCATTTATCTGAGGTATTCATTATCTTCTTTTTTTGGTTCTTGGTTCTTGGTTATCACTATATCTTTCTTCTTCCCGCAATTGTACTATATAGAGAAGTATTGGCTTGGTTTTATCTGTGAATTTGAAACTTTAAAAAGCTAGACTAACCATTGATACAAATTTGAGGTAGATTTGTAGTTTTAAGTCAACCTCTCCCTATCCGACACGTTTTAAAAATTAAGGAGAAGTTTGAAAGGAAAACCCAAAGAAGACAATATCTGCTAGCGGTGGGCTTGGACTGTTACAAATGGTATTAGAGCCGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGGACACCCGGAGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGTGGACACCCGGCGGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGTGGACACCCGGCGGTGTGCCAACAAGGACGTTGAGCACCGAAAGGGAGTGTGGACACCCGGCAGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGTGGACACCCGACAGTGTGCCAACGAGGACGTTGAGCACCGAAAGGGAGTGTGGACACCCGACGGTGTGTAAGGATGTTGGGCCTTGAAAGGGAGTGGACACTAGGTGGTGTGCCAGTAAGGATGTTGAGCCTTGAAAGGGAGTGGACACTAGGTGGTGTGCCAGTAAGGATGTTGAGCCTTAAAAGGGAGTGGACACTAGGTGGTGTGCCAGTAAGGATGTTGGGCCTTGAAAGGGAGTGGACACTGAACGGTGTGCCAACAAGGATGTTGGCCCCAAAGGGGGTGGAATGTAAGATCTCATATCAATTGGAGAGAGGAACGTGTGACAGTGAGGACATTGAGCCCCAAACTTATTCGAATTAGAATCATGAACCAAATCTGCCCGTAGAGTATGTTATTTTTTCTCTATCATATGTTCAAAAGAATCTCCACTCGTTTTGATCAGGAAGTATGGAGTGTATTGAGCGATATGAGGCTATGTTTAAGAATGATACGTATAAAATGTATGTCCAAAACAGGCCGAGGCCATAGGACAACAGGGACAGAATTTCATGGAGAGCTTGAGCATGGACACCGTCTATGCTTACATGTTTCAGCTAATCACAGAGTACTCAAAGCTTCTGGACTTCAAGCCGACCCCGCCCCCATCAGCTTTAGAAGTGTGTTCTGAGTCCTTGCTTTGCATTGCTGATGAGAAACAGAGGCAGTTCCTCGAGAAGTCAGCCACCTCGGCTTCTTTGGTCCCTCCATGCTCGCTCAACCGTGCCGGTAGCGATAGCGTTTACAGTTGGTTGCAGCAAGAAGAGACGAGGAAGGCGATGTAGGTGGAAGAAATGGCTGCAGGGAAAGCCTCAGAGTTGAAGTTTGTATGTTTTTTCTTCCATTTTAGAGTAATGTTTATATTATAGTGCTCTATAGAGGATTTATTGTGAGCTTGTGTAAAAGATGTATCTGAGACTCTGATCTCAGTCTATTTAGAGAGCCTGATTTGGCTTGTACCTGCCTTGCTTGATACTGATTCTATTATTTAGGTATACAATGCAAGCGCGCAACAC
TCTCGTTCAACAAATGAGAGTTTTTTTTGGTGATCTTCACCCCTTTTGTAAAACCAACATGAAAGGAACCAATCGGCAAATGGGTATCAATCAGATCCGCCTCTCT

mRNA sequence

TTGAGTTTCCAAATAAGGCATTTTCCAATTCCCATTCCATTACCTTATCACACAATGCAGATCCTCACATCTCCGCCATGGCTCCGCTACATAGACCTCCCACATCCCGCGCTCTCTCCTCCCGCACTCCCTCCAACATTCTCCCCTCCGTCGTCGCCCTCTCCTTCCTTGCCCTTACTTTCCTCGTCTGCTACAAGGTGGATGACTTTGCTGCTCAAACCAAGACTGTTGCTGGCCACAACTTGGATCCAACCCCATGGCATTTGTTCCCGCCAAAAATCTTCAGCGAAGATACTCGCCATGCCAGAACTGTCAAGATCATCCACTGCTCTTACCTCGCTTGCCGCTACGCCAACAACACTGCTACTCGATTACCTTTGCATTCCGCTGTTTCGACTCACCAATGCCCTGAACTCTTCCGCTGGATTCATCACGATCTGGATCCGTGGGCTCGAAGCCGAATATCGATGAAACACTTGGATGAATCTATGAAATTTGCGGCGTTTCGTGTTGTGATCGTGGAGGGTAGGCTTTATGTGGATATGTATTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAATTGCTGAGAAGGTTCCCTGGAATGGTGCCGGATGTGGACATGATGTTTGATTGTATGGACAGGCCGACTATCAATCGGACTGAGAACAAAGACATGCCGCTGCCTCTGTTTCGGTATTGCACGACCGACGCTCACTTTGACATTCCATTTCCCGATTGGTCTTTCTGGGGATGGCCAGAAGTGAACATAAGATCATGGGGGGAAGAGTTTAAAGATATCAAGAAAAGTTCAAAAAGCTCGAATTGGTCGAGCAAGTTACCTCGAGCATATTGGAAGGGAAATCCAGATGTTGCTTCCCCTGTTCGTACCGAGTTGCTGACTTGCAATCACTCAATAAAGTGGGGTGCTCAGATCATGCGTCAGGACTGGGATCAAGAAGCAAGAGATGGTTTTGAGCAGTCCAAGCTATCTAAACAATGCAACCACCGGTATAAAATCTATGCTGAAGGGTTCGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCCCTATATCAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTACTGGCCCATCCCCTTCGATAATATGTGCGAGTCTATTAAGCATGCTGTTGACTGGGGAAATGATCATTTATCTGAGGCCGAGGCCATAGGACAACAGGGACAGAATTTCATGGAGAGCTTGAGCATGGACACCGTCTATGCTTACATGTTTCAGCTAATCACAGAGTACTCAAAGCTTCTGGACTTCAAGCCGACCCCGCCCCCATCAGCTTTAGAAGTGTGTTCTGAGTCCTTGCTTTGCATTGCTGATGAGAAACAGAGGCAGTTCCTCGAGAAGTCAGCCACCTCGGCTTCTTTGGTCCCTCCATGCTCGCTCAACCGTGCCGGTAGCGATAGCGTTTACAGTTGGTTGCAGCAAGAAGAGACGAGGAAGGCGATGTAGGTGGAAGAAATGGCTGCAGGGAAAGCCTCAGAGTTGAAGTTTGTATGTTTTTTCTTCCATTTTAGAGTAATGTTTATATTATAGTGCTCTATAGAGGATTTATTGTGAGCTTGTGTAAAAGATGTATCTGAGACTCTGATCTCAGTCTATTTAGAGAGCCTGATTTGGCTTGTACCTGCCTTGCTTGATACTGATTCTATTATTTAGGTATACAATGCAAGCGCGCAACACTCTCGTTCAACAAATGAGAGTTTTTTTTGGTGATCTTCACCCCTTTTGTAAAACCAACATGAAAGGAACCAATCGGCAAATGGGTATCAATCAGATCCGCCTCTCT

Coding sequence (CDS)

ATGGCTCCGCTACATAGACCTCCCACATCCCGCGCTCTCTCCTCCCGCACTCCCTCCAACATTCTCCCCTCCGTCGTCGCCCTCTCCTTCCTTGCCCTTACTTTCCTCGTCTGCTACAAGGTGGATGACTTTGCTGCTCAAACCAAGACTGTTGCTGGCCACAACTTGGATCCAACCCCATGGCATTTGTTCCCGCCAAAAATCTTCAGCGAAGATACTCGCCATGCCAGAACTGTCAAGATCATCCACTGCTCTTACCTCGCTTGCCGCTACGCCAACAACACTGCTACTCGATTACCTTTGCATTCCGCTGTTTCGACTCACCAATGCCCTGAACTCTTCCGCTGGATTCATCACGATCTGGATCCGTGGGCTCGAAGCCGAATATCGATGAAACACTTGGATGAATCTATGAAATTTGCGGCGTTTCGTGTTGTGATCGTGGAGGGTAGGCTTTATGTGGATATGTATTATGCTTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAATTGCTGAGAAGGTTCCCTGGAATGGTGCCGGATGTGGACATGATGTTTGATTGTATGGACAGGCCGACTATCAATCGGACTGAGAACAAAGACATGCCGCTGCCTCTGTTTCGGTATTGCACGACCGACGCTCACTTTGACATTCCATTTCCCGATTGGTCTTTCTGGGGATGGCCAGAAGTGAACATAAGATCATGGGGGGAAGAGTTTAAAGATATCAAGAAAAGTTCAAAAAGCTCGAATTGGTCGAGCAAGTTACCTCGAGCATATTGGAAGGGAAATCCAGATGTTGCTTCCCCTGTTCGTACCGAGTTGCTGACTTGCAATCACTCAATAAAGTGGGGTGCTCAGATCATGCGTCAGGACTGGGATCAAGAAGCAAGAGATGGTTTTGAGCAGTCCAAGCTATCTAAACAATGCAACCACCGGTATAAAATCTATGCTGAAGGGTTCGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTTTGATTATTTCACCCCTATATCAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTACTGGCCCATCCCCTTCGATAATATGTGCGAGTCTATTAAGCATGCTGTTGACTGGGGAAATGATCATTTATCTGAGGCCGAGGCCATAGGACAACAGGGACAGAATTTCATGGAGAGCTTGAGCATGGACACCGTCTATGCTTACATGTTTCAGCTAATCACAGAGTACTCAAAGCTTCTGGACTTCAAGCCGACCCCGCCCCCATCAGCTTTAGAAGTGTGTTCTGAGTCCTTGCTTTGCATTGCTGATGAGAAACAGAGGCAGTTCCTCGAGAAGTCAGCCACCTCGGCTTCTTTGGTCCCTCCATGCTCGCTCAACCGTGCCGGTAGCGATAGCGTTTACAGTTGGTTGCAGCAAGAAGAGACGAGGAAGGCGATGTAG

Protein sequence

MAPLHRPPTSRALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEETRKAM
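
The protein sequence should be the in-frame translation of the coding sequence above. The following sketch spot-checks that relationship on the first 18 and last 12 nucleotides of the CDS. The `translate` helper is hypothetical, and its codon table is deliberately partial: it holds only the standard-genetic-code entries needed for these two fragments.

```python
# Partial standard genetic code: ONLY the codons needed for this spot check.
CODONS = {
    "ATG": "M", "GCT": "A", "CCG": "P", "CTA": "L", "CAT": "H",
    "AGA": "R", "AAG": "K", "GCG": "A", "TAG": "*",
}

def translate(dna):
    """Translate an in-frame DNA fragment codon by codon ('*' marks a stop)."""
    if len(dna) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if codon not in CODONS:
            raise KeyError(f"codon {codon} is not in the partial table")
        residues.append(CODONS[codon])
    return "".join(residues)

cds_start = "ATGGCTCCGCTACATAGA"  # first 18 nt of the CDS listed above
cds_end = "AAGGCGATGTAG"          # last 12 nt of the CDS listed above

print(translate(cds_start))  # compare with the first six residues of the protein
print(translate(cds_end))    # compare with the last three residues plus the stop
```

A full check would need the complete 64-codon table, or a library routine such as Biopython's `Seq.translate`.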

BLAST of CmaCh13G000650 vs. Swiss-Prot
Match: RUMI_DROME (O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 3.8e-18
Identity = 76/330 (23.03%), Positives = 143/330 (43.33%), Query Frame = 1

Query: 117 IHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVD---MYYACVQSRAIFTIWGLV 176
           +  DL P+  + ++ + ++ S ++   +  I   RLY D   M+ A  +        G+ 
Sbjct: 80  LKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDANCMFPARCE--------GIE 139

Query: 177 QLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFW-GW 236
             L      +PD+D++ +  D P +N         P+F +  T  + DI +P W+FW G 
Sbjct: 140 HFLLPLVATLPDMDLIINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGG 199

Query: 237 PEV-----NIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPD--------VASPVRTEL 296
           P        I  W +  + ++K + +  WS K    +++G+          + S    EL
Sbjct: 200 PATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPEL 259

Query: 297 LTCNHSIKWGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGS 356
           +   ++   G +  +   D  A D   +      C ++Y     G A S  LK++  C S
Sbjct: 260 VEAQYTKNQGWKSPKDTLDAPAAD---EVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKS 319

Query: 357 MSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNFM 416
           +   +   +Q+FF   L P  +Y P+      +  +H + +   + + A+ I Q+G +F+
Sbjct: 320 LVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFI 379

Query: 417 -ESLSMDTVYAYMFQLITEYSKLLDFKPTP 429
            E L M  +  Y  +L+  Y KLL ++  P
Sbjct: 380 WEHLRMKDIKCYWRKLLKRYVKLLQYEVKP 397

BLAST of CmaCh13G000650 vs. Swiss-Prot
Match: PGLT1_BOVIN (Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 3.8e-18
Identity = 82/327 (25.08%), Positives = 141/327 (43.12%), Query Frame = 1

Query: 117 IHHDLDPWARSRISMKHLDESMKFA-AFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQL 176
           I  DL P+ R  IS K + E ++        I++ RLY +       SR      G+   
Sbjct: 61  IEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRESD-CMFPSRCS----GVEHF 120

Query: 177 LRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWG---- 236
           +    G +PD++M+ +  D P + +    +  +P+F +  T  + DI +P W+FW     
Sbjct: 121 ILEVIGRLPDMEMVINVRDYPQVPKW--MEPAIPIFSFSKTLEYHDIMYPAWTFWEGGPA 180

Query: 237 -WP--EVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTC---NHS 296
            WP   + +  W    +D+ +S+    W  K   AY++G+    SP R  L+     N  
Sbjct: 181 VWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSPERDPLILLSRKNPK 240

Query: 297 IKWGAQIMRQDW----DQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMS 356
           +        Q W    D   +   +   L   C ++Y     G A S   K++  CGS+ 
Sbjct: 241 LVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLV 300

Query: 357 LIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNF-ME 416
             +   + +FF   L P  +Y P+  D    +++  + +   +   A+ I ++G  F + 
Sbjct: 301 FHVGDEWLEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKANDDVAQEIAERGSQFILN 360

Query: 417 SLSMDTVYAYMFQLITEYSKLLDFKPT 428
            L MD +  Y   L+TEYSK L +  T
Sbjct: 361 HLKMDDITCYWENLLTEYSKFLSYNVT 375

BLAST of CmaCh13G000650 vs. Swiss-Prot
Match: PGLT1_HUMAN (Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 6.5e-18
Identity = 83/338 (24.56%), Positives = 143/338 (42.31%), Query Frame = 1

Query: 106 STHQCPELFRWIHHDLDPWARSRISMKHLDESMKFA-AFRVVIVEGRLYVDMYYACVQSR 165
           S+  C      I  DL P+ R  IS K + E ++        I + RLY +       SR
Sbjct: 50  SSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQITKNRLYREND-CMFPSR 109

Query: 166 AIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPF 225
                 G+   +    G +PD++M+ +  D P + +    +  +P+F +  T  + DI +
Sbjct: 110 CS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKW--MEPAIPVFSFSKTSEYHDIMY 169

Query: 226 PDWSFWG-----WP--EVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRT 285
           P W+FW      WP     +  W    +D+ +S+    W  K   AY++G+    SP R 
Sbjct: 170 PAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSPERD 229

Query: 286 ELLTC---NHSIKWGAQIMRQDW----DQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVS 345
            L+     N  +        Q W    D   +   +   L   C ++Y     G A S  
Sbjct: 230 PLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFR 289

Query: 346 LKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEA 405
            K++  CGS+   +   + +FF   L P  +Y P+  D    +++  + +   +   A+ 
Sbjct: 290 FKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKANDDVAQE 349

Query: 406 IGQQGQNFMES-LSMDTVYAYMFQLITEYSKLLDFKPT 428
           I ++G  F+ + L MD +  Y   L++EYSK L +  T
Sbjct: 350 IAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSYNVT 375

BLAST of CmaCh13G000650 vs. Swiss-Prot
Match: PGLT1_RAT (Protein O-glucosyltransferase 1 OS=Rattus norvegicus GN=Poglut1 PE=3 SV=1)

HSP 1 Score: 91.7 bits (226), Expect = 2.5e-17
Identity = 83/338 (24.56%), Positives = 145/338 (42.90%), Query Frame = 1

Query: 106 STHQCPELFRWIHHDLDPWARSRISMKHLDESMKFA-AFRVVIVEGRLYVDMYYACVQSR 165
           S+  C      I  DL P+ R  IS K + E ++        I++ RL+ +       SR
Sbjct: 50  SSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRRLGTHYQIIKHRLFREDD-CMFPSR 109

Query: 166 AIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPF 225
                  +++++RR    +PD++M+ +  D P + +    +  +P+F +  T  + DI +
Sbjct: 110 CSGVEHFILEVIRR----LPDMEMVINVRDYPQVPKW--MEPTIPVFSFSKTSEYHDIMY 169

Query: 226 PDWSFWG-----WP--EVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRT 285
           P W+FW      WP     +  W    +D+ +S+    W  K   AY++G+    SP R 
Sbjct: 170 PAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSR--TSPERD 229

Query: 286 ELLTC---NHSIKWGAQIMRQDW----DQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVS 345
            L+     N  +        Q W    D   +   +   L   C ++Y     G A S  
Sbjct: 230 PLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYKYLFNFRGVAASFR 289

Query: 346 LKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEA 405
            K++  CGS+   +   + +FF   L P  +Y P+  D     ++  + +   +   A+ 
Sbjct: 290 FKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKTD--LSDVQELLQFVKANDDLAQE 349

Query: 406 IGQQGQNF-MESLSMDTVYAYMFQLITEYSKLLDFKPT 428
           I ++G  F +  L MD +  Y   L+TEYSK L +  T
Sbjct: 350 IAKRGSQFIINHLQMDDITCYWENLLTEYSKFLSYNVT 375

BLAST of CmaCh13G000650 vs. Swiss-Prot
Match: PGLT1_MOUSE (Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=1 SV=2)

HSP 1 Score: 90.9 bits (224), Expect = 4.2e-17
Identity = 82/338 (24.26%), Positives = 145/338 (42.90%), Query Frame = 1

Query: 106 STHQCPELFRWIHHDLDPWARSRISMKHLDESMKFA-AFRVVIVEGRLYVDMYYACVQSR 165
           S+  C      I  DL P+ R  IS K + E ++        I++ RL+ +       SR
Sbjct: 50  SSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFREDD-CMFPSR 109

Query: 166 AIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPF 225
                  +++++ R    +PD++M+ +  D P + +    +  +P+F +  T  + DI +
Sbjct: 110 CSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKW--MEPTIPVFSFSKTSEYHDIMY 169

Query: 226 PDWSFWG-----WP--EVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRT 285
           P W+FW      WP     +  W    +D+ +S+    W  K   AY++G+    SP R 
Sbjct: 170 PAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSR--TSPERD 229

Query: 286 ELLTC---NHSIKWGAQIMRQDW----DQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVS 345
            L+     N  +        Q W    D   +   +   L   C +RY     G A S  
Sbjct: 230 PLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAASFR 289

Query: 346 LKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEA 405
            K++  CGS+   +   + +FF   L P  +Y P+  D    +++  + +   +   A+ 
Sbjct: 290 FKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKTD--LSNVQELLQFVKANDDIAQE 349

Query: 406 IGQQGQNF-MESLSMDTVYAYMFQLITEYSKLLDFKPT 428
           I ++G  F +  L MD +  Y   L+T+YSK L +  T
Sbjct: 350 IAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSYNVT 375

BLAST of CmaCh13G000650 vs. TrEMBL
Match: A0A0A0LY89_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G531170 PE=4 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 5.3e-245
Identity = 395/471 (83.86%), Positives = 425/471 (90.23%), Query Frame = 1

Query: 12  ALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71
           A + R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61

Query: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121

Query: 132 KHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191
             L+ES KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181

Query: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241

Query: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEARDG+EQSKLS
Sbjct: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLS 301

Query: 312 KQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371
            QCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361

Query: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431
           ESIKHAVDWGN H  EAE IG+QGQ FMESLSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMESLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421

Query: 432 ALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 483
           ALEVC++SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472

BLAST of CmaCh13G000650 vs. TrEMBL
Match: B9SI30_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0612530 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 3.5e-196
Identity = 321/478 (67.15%), Positives = 384/478 (80.33%), Query Frame = 1

Query: 7   PPTSRALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPP 66
           PP   A  +R PS + P ++ L  ++LT L  Y+VD+FA++TKTVAGHNLDPTPWH+FPP
Sbjct: 4   PPPKAA--ARVPSYLFPCLLGL--VSLTLLFFYQVDNFASRTKTVAGHNLDPTPWHIFPP 63

Query: 67  KIFSEDTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTH-QCPELFRWIHHDLDPWA 126
           + F E+TR AR  KII CSYL C Y N T TR    S+   + +CPE FR+IHHDL PWA
Sbjct: 64  RTFDEETRQARAYKIIQCSYLTCPYTNTTTTRRRSQSSSQANAKCPEFFRFIHHDLQPWA 123

Query: 127 RSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPD 186
           R+ I+ KH+ E+ KFAAFRVVI EGRLY+D+YYACVQSR +FT+WGL+QLL R+PGMVPD
Sbjct: 124 RTGITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMFTVWGLLQLLNRYPGMVPD 183

Query: 187 VDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFK 246
           VD+MFDCMDRP IN+TE+   PLP+FRYCTT  HFDIPFPDWSFWGWPE+NIRSW EEF+
Sbjct: 184 VDIMFDCMDRPVINKTEHISFPLPIFRYCTTQNHFDIPFPDWSFWGWPEINIRSWNEEFR 243

Query: 247 DIKKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGF 306
           DIK+ S+S +WS K PRAYWKGNPDV SP+RTEL+ CNHS KWGA IMRQDW +EAR GF
Sbjct: 244 DIKRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCNHSRKWGAHIMRQDWGEEARAGF 303

Query: 307 EQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPI 366
           E+SKLS QCN+RYKIYAEGFAWSVSLKYI+SCGS++LIISP Y+DFFSRGL P  NYWP+
Sbjct: 304 ERSKLSNQCNYRYKIYAEGFAWSVSLKYIISCGSLALIISPQYEDFFSRGLVPASNYWPV 363

Query: 367 PFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFK 426
             D +C SIK AVDWGN + SEAE+IG+ GQ+FME+LSM+ VY YMF LITEYSKL  FK
Sbjct: 364 ASDELCRSIKFAVDWGNANPSEAESIGKAGQDFMETLSMEGVYDYMFHLITEYSKLQVFK 423

Query: 427 PTPPPSALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           P  P SALEVC++SLLC AD KQ+QFLE+SA   S  P CSL  A  +++ SWLQ+++
Sbjct: 424 PVLPSSALEVCADSLLCFADPKQKQFLERSAAFPSPKPACSLQPADGNAIKSWLQEKQ 477

BLAST of CmaCh13G000650 vs. TrEMBL
Match: A0A059AF90_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J01883 PE=4 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 1.2e-193
Identity = 315/468 (67.31%), Positives = 372/468 (79.49%), Query Frame = 1

Query: 16  RTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDTRH 75
           R PSN+LP ++ALS  +++ L+ YKVDDFA+QTKTVAGHNLDPTPWHLFPPK F+E TR+
Sbjct: 10  RRPSNLLPCLIALSLFSISALLLYKVDDFASQTKTVAGHNLDPTPWHLFPPKTFNEKTRY 69

Query: 76  ARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISMKHLD 135
           AR  KII CSYL C YA  +     L  + S   CP  F WI  DL+PW R+ IS  HL 
Sbjct: 70  ARASKIIQCSYLTCPYATGSIRGQDLSRSRSARACPAFFAWIRRDLEPWVRTGISPAHLM 129

Query: 136 ESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDR 195
           E+ +FA+FRVVI EG+LYVD YYACVQSRA+FTIWGL+QLLRR+PGMVPDVD+MFDCMD+
Sbjct: 130 EAKRFASFRVVIFEGKLYVDFYYACVQSRAMFTIWGLLQLLRRYPGMVPDVDLMFDCMDK 189

Query: 196 PTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSN 255
           P+INRTE+  MPLPLFRYCTT  HFDIPFPDWSFWGWPE N++ W EEF+DIK+ S+   
Sbjct: 190 PSINRTEHASMPLPLFRYCTTPGHFDIPFPDWSFWGWPETNLKPWDEEFRDIKQGSQVLR 249

Query: 256 WSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLSKQCN 315
           WS K P AYWKGNPDV SPVRTELL CNHS  W AQ+MRQDW +EAR G+EQSKLS QCN
Sbjct: 250 WSKKSPYAYWKGNPDVESPVRTELLKCNHSRMWNAQVMRQDWAEEARAGYEQSKLSNQCN 309

Query: 316 HRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIK 375
           HRYKIYAEG+AWSVSLKYI++CGS +LIISP Y+DFFSRGL P++NYWPI   N+C SIK
Sbjct: 310 HRYKIYAEGYAWSVSLKYIIACGSPALIISPEYEDFFSRGLFPMRNYWPISSTNLCPSIK 369

Query: 376 HAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEV 435
           +AV+WGN + SEAEAIG++GQ+FME LSMD +Y YM+ LI EYSKL +FKP P  SA EV
Sbjct: 370 YAVNWGNANPSEAEAIGKRGQDFMEDLSMDRIYDYMYHLIMEYSKLQNFKPIPSSSAREV 429

Query: 436 CSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           C +SLLC AD KQRQFLE+S   AS   PC+   A   +V SW++Q+E
Sbjct: 430 CVDSLLCFADPKQRQFLERSTALASEEAPCTFKAARGITVTSWIKQKE 477

BLAST of CmaCh13G000650 vs. TrEMBL
Match: A0A067LDK6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08464 PE=4 SV=1)

HSP 1 Score: 676.4 bits (1744), Expect = 2.6e-191
Identity = 309/476 (64.92%), Positives = 381/476 (80.04%), Query Frame = 1

Query: 11  RALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFS 70
           R+   R PS +LP ++    ++LT L  Y+VD+FA++TKTVAGHNLDPTPWH+FP K F 
Sbjct: 6   RSTIPRYPSYLLPCLIGC--VSLTLLFVYQVDNFASRTKTVAGHNLDPTPWHIFPLKNFD 65

Query: 71  EDTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTH---QCPELFRWIHHDLDPWARS 130
           E+TR AR  KII C +L C Y N+  T  P  S  S+    +CPE FR+IH DL+PW+R+
Sbjct: 66  EETRQARAYKIIQCQFLTCPYTNDNTTAQPRRSKSSSKLSAECPEFFRYIHRDLEPWSRT 125

Query: 131 RISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD 190
            I+  H+ E+  FAAFRVVI  GRLY+D+YYACVQSR +FTIWGL+Q+LRR+PGMVPDVD
Sbjct: 126 GITKNHIMEAKNFAAFRVVIFGGRLYLDLYYACVQSRLMFTIWGLLQMLRRYPGMVPDVD 185

Query: 191 MMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDI 250
            MFDCMDRP IN+TE+  MPLPLFRY TT+ HFDIPFPDWSFWGWPE+NIRSWGEEF+DI
Sbjct: 186 FMFDCMDRPIINKTEHSSMPLPLFRYDTTEDHFDIPFPDWSFWGWPEINIRSWGEEFQDI 245

Query: 251 KKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQ 310
           K+ S+S +WS K PRAYWKGNPDV SP+RTEL+ CNHS KWGAQIMRQ+WD+EAR GFE 
Sbjct: 246 KRGSQSKSWSKKWPRAYWKGNPDVLSPLRTELMQCNHSRKWGAQIMRQNWDEEARAGFEG 305

Query: 311 SKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPF 370
           SKLS QC++RYKIYAEGFAWSVSLKYI+SCGS++LIISP Y+DFFSRGL P +NYWP+  
Sbjct: 306 SKLSNQCDYRYKIYAEGFAWSVSLKYIVSCGSLALIISPQYEDFFSRGLIPKENYWPVSA 365

Query: 371 DNMCESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPT 430
           + +C SIK AVDWGN + S+A+AIG+ GQ+FME+LSMD VY YMF LI+EYSKL DFKP 
Sbjct: 366 NELCRSIKFAVDWGNANPSKAKAIGKAGQDFMEALSMDRVYDYMFHLISEYSKLQDFKPV 425

Query: 431 PPPSALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           PPP ALE C +S+LC A++K+++FL++S    S  PPC+L  A  + + SWLQQ++
Sbjct: 426 PPPDALEACMDSILCFAEQKEKEFLKRSTVFPSATPPCTLQPADGNLIKSWLQQKQ 479

BLAST of CmaCh13G000650 vs. TrEMBL
Match: W9RZY2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022004 PE=4 SV=1)

HSP 1 Score: 660.2 bits (1702), Expect = 1.9e-186
Identity = 305/446 (68.39%), Positives = 356/446 (79.82%), Query Frame = 1

Query: 40  KVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDTRHARTVKIIHCSYLACRYA----NNT 99
           +VDD AAQTKTVAG NLDPTPWHLFPPK FS +TRH+R  KI+HCSYLAC ++    N +
Sbjct: 95  QVDDIAAQTKTVAGDNLDPTPWHLFPPKTFSGETRHSRLYKILHCSYLACSHSAYKYNPS 154

Query: 100 ATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVD 159
             R       +  +CPE FRWIH DL+PWAR+ IS  HL+E+ +FAAFR VIV GRL+VD
Sbjct: 155 VKRRRSDPDSAARKCPEFFRWIHQDLEPWARTGISAGHLEEAREFAAFRAVIVGGRLFVD 214

Query: 160 MYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTENKDMPLPLFRYCT 219
           +YYACVQSR +FTIWGL+QLLRR+PGMVPDVDM+FDCMD+P+IN TE+   PLPLFRYCT
Sbjct: 215 LYYACVQSRTMFTIWGLLQLLRRYPGMVPDVDMVFDCMDKPSINGTEHGSFPLPLFRYCT 274

Query: 220 TDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPV 279
           T AHFDIPFPDWSFWGWPE N+  W EEF+DIK+ S+ ++W+ K PRAYWKGNPDV SPV
Sbjct: 275 TQAHFDIPFPDWSFWGWPETNLNPWDEEFRDIKRGSERTSWTKKHPRAYWKGNPDVDSPV 334

Query: 280 RTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYIL 339
           RTELL CNHS  WGAQI RQDW +EA+ G+E+S+LS QCN+RYKIYAEG+AWSVSLKYIL
Sbjct: 335 RTELLNCNHSRTWGAQIWRQDWTEEAKGGYEKSRLSNQCNNRYKIYAEGYAWSVSLKYIL 394

Query: 340 SCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQG 399
           SCGS++LIISP Y+DFF RGL P+KNYWPI   ++C SIK+ V+WGN H SEA+AIG+ G
Sbjct: 395 SCGSLALIISPQYEDFFIRGLIPMKNYWPISSTDLCPSIKYGVEWGNAHPSEAKAIGKGG 454

Query: 400 QNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIADEKQRQFLEKS 459
           Q FMESLSM+ VY YMF LI EYSKL  FKP  P SALEVC ESLLC AD KQR+ LEKS
Sbjct: 455 QEFMESLSMNRVYDYMFHLINEYSKLQTFKPVRPSSALEVCPESLLCHADSKQRKLLEKS 514

Query: 460 ATSASLVPPCSLNRAGSDSVYSWLQQ 482
               S  PPCSL    SD + SW+QQ
Sbjct: 515 TAHPSPNPPCSLQPPDSDIIKSWVQQ 540

BLAST of CmaCh13G000650 vs. TAIR10
Match: AT1G07220.1 (AT1G07220.1 Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 634.8 bits (1636), Expect = 4.4e-182
Identity = 298/478 (62.34%), Positives = 367/478 (76.78%), Query Frame = 1

Query: 14  SSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDT 73
           S R+PS +L  V+ALSF + T L+ YKVDDF AQTKT+AGHNL+PTPWH+FP K FS  T
Sbjct: 14  SPRSPSYLLLCVLALSFFSFTALLFYKVDDFIAQTKTLAGHNLEPTPWHIFPRKSFSAAT 73

Query: 74  RHARTVKIIHCSYLACRYANNTATRLPLHSAVS----TH--QCPELFRWIHHDLDPWARS 133
           +H++  +I+ CSY +C Y      +  LHS       TH  QCP+ FRWIH DL+PWA++
Sbjct: 74  KHSQAYRILQCSYFSCPYKAVVQPK-SLHSESGSGRQTHQPQCPDFFRWIHRDLEPWAKT 133

Query: 134 RISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD 193
            ++ +H+  +   AAFRVVI+ G+LYVD+YYACVQSR +FTIWG++QLL ++PGMVPDVD
Sbjct: 134 GVTKEHVKRAKANAAFRVVILSGKLYVDLYYACVQSRMMFTIWGILQLLTKYPGMVPDVD 193

Query: 194 MMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDI 253
           MMFDCMD+P IN+TE +  P+PLFRYCT +AH DIPFPDWSFWGW E N+R W EEF DI
Sbjct: 194 MMFDCMDKPIINQTEYQSFPVPLFRYCTNEAHLDIPFPDWSFWGWSETNLRPWEEEFGDI 253

Query: 254 KKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQ 313
           K+ S+  +W +K PRAYWKGNPDV SP+R EL+ CNHS  WGAQIMRQDW +EA+ GFEQ
Sbjct: 254 KQGSRRRSWYNKQPRAYWKGNPDVVSPIRLELMKCNHSRLWGAQIMRQDWAEEAKGGFEQ 313

Query: 314 SKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPF 373
           SKLS QCNHRYKIYAEG+AWSVSLKYILSCGSM+LIISP Y+DFFSRGL P +NYWPI  
Sbjct: 314 SKLSNQCNHRYKIYAEGYAWSVSLKYILSCGSMTLIISPEYEDFFSRGLLPKENYWPISP 373

Query: 374 DNMCESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPT 433
            ++C SIK+AVDWGN + SEAE IG++GQ +MESLSM+ VY YMF LITEYSKL  FKP 
Sbjct: 374 TDLCRSIKYAVDWGNSNPSEAETIGKRGQGYMESLSMNRVYDYMFHLITEYSKLQKFKPE 433

Query: 434 PPPSALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEETR 486
            P SA EVC+ SLLCIA++K+R+ LE+S    SL  PC       + +  WL Q++ +
Sbjct: 434 KPASANEVCAGSLLCIAEQKERELLERSRVVPSLDQPCKFPVEDRNRL-EWLIQQKNK 489

BLAST of CmaCh13G000650 vs. TAIR10
Match: AT5G23850.1 (AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 395.6 bits (1015), Expect = 4.5e-110
Identity = 190/431 (44.08%), Positives = 273/431 (63.34%), Query Frame = 1

Query: 45  AAQTKTVAGHNLDPTPWHLFPPKIFSEDTRHARTVKIIHCSYLACRYANNTATRLPLHSA 104
           AA T T        TP +  P  + ++  +   T   +HCS      AN T    P +  
Sbjct: 70  AATTTTTKTQTQTITPKYPRPTTVITQSPKPEFT---LHCS------ANETTASCPSNKY 129

Query: 105 VSTHQ-------------CPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGR 164
            +T               CP+ FRWIH DL PW+R+ I+ + L+ + K A FR+ IV G+
Sbjct: 130 PTTTSFEDDDTNHPPTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGK 189

Query: 165 LYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTE----NKDMP 224
           +YV+ +    Q+R +FTIWG +QLLR++PG +PD+++MFDC+D P +  TE    N   P
Sbjct: 190 IYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSP 249

Query: 225 LPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKG 284
            PLFRYC  +   DI FPDWSFWGW EVNI+ W    K++++ ++ + W ++ P AYWKG
Sbjct: 250 PPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKG 309

Query: 285 NPDVASPVRTELLTCNHSIK--WGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGF 344
           NP VA   R +L+ CN S +  W A++  QDW +E+++G++QS L+ QC+HRYKIY EG 
Sbjct: 310 NPMVAE-TRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGS 369

Query: 345 AWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHL 404
           AWSVS KYIL+C S++L++ P Y DFF+RGL P  +YWP+   + C SIK AVDWGN H+
Sbjct: 370 AWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHI 429

Query: 405 SEAEAIGQQGQNFM-ESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIA 456
            +A+ IG+   +F+ + L MD VY YM+ L+TEYSKLL FKP  P +A+E+CSE++ C+ 
Sbjct: 430 QKAQDIGKAASDFIQQDLKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACLR 489

BLAST of CmaCh13G000650 vs. TAIR10
Match: AT3G48980.1 (AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 381.7 bits (979), Expect = 6.7e-106
Identity = 173/366 (47.27%), Positives = 250/366 (68.31%), Query Frame = 1

Query: 110 CPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTI 169
           CP+ FRWIH DL PW ++ I+ + L+ +   A FR+ I+ GR+YV+ +    Q+R +FTI
Sbjct: 136 CPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTI 195

Query: 170 WGLVQLLRRFPGMVPDVDMMFDCMDRPTINRTE----NKDMPLPLFRYCTTDAHFDIPFP 229
           WG VQLLRR+PG +PD+++MFDC+D P +   E    ++  P PLFRYC  D   DI FP
Sbjct: 196 WGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFP 255

Query: 230 DWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHS 289
           DWS+WGW EVNI+ W    K++++ ++ + W  + P AYWKGNP VA   R +L+ CN S
Sbjct: 256 DWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAE-TRLDLMKCNLS 315

Query: 290 --IKWGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLI 349
               W A++ +QDW +E+++G++QS L+ QC+HRYKIY EG AWSVS KYIL+C S++L+
Sbjct: 316 EVYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLM 375

Query: 350 ISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNFM-ESL 409
           + P Y DFF+RG+ P  +YWP+  D+ C SIK AVDWGN H+ +A+ IG++   F+ + L
Sbjct: 376 VKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQEL 435

Query: 410 SMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIADEKQRQFLEKSATS-ASL 468
            MD VY YMF L+ +YSKLL FKP  P ++ E+CSE++ C  D  +R+F+ +S     + 
Sbjct: 436 KMDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMACPRDGNERKFMMESLVKRPAE 495

BLAST of CmaCh13G000650 vs. TAIR10
Match: AT2G45830.1 (AT2G45830.1 downstream target of AGL15 2)

HSP 1 Score: 380.9 bits (977), Expect = 1.1e-105
Identity = 179/374 (47.86%), Positives = 250/374 (66.84%), Query Frame = 1

Query: 102 HSAVSTHQCPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACV 161
           HS +ST  CP  FRWIH DL PW  + ++   L+++ + A FRVVI++GR+YV  Y   +
Sbjct: 113 HSRIST--CPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSI 172

Query: 162 QSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTIN----RTENKDMPLPLFRYCTTD 221
           Q+R +FT+WG+VQLLR +PG +PD+++MFD  DRPT+     + +    P PLFRYC+ D
Sbjct: 173 QTRDVFTLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQGQQHPAPPPLFRYCSDD 232

Query: 222 AHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRT 281
           A  DI FPDWSFWGW EVNI+ W +    I++ +K + W  ++  AYW+GNP+VA P R 
Sbjct: 233 ASLDIVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVA-PTRR 292

Query: 282 ELLTCNHSIK--WGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYIL 341
           +LL CN S +  W  ++  QDWD+E+R+GF+ S L  QC HRYKIY EG+AWSVS KYI+
Sbjct: 293 DLLRCNVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 352

Query: 342 SCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQG 401
           +C SM+L + P++ DF+ RG+ PL++YWPI   + C S+K AV WGN HL +A  IG++G
Sbjct: 353 ACDSMTLYVRPMFYDFYVRGMMPLQHYWPIRDTSKCTSLKFAVHWGNTHLDQASKIGEEG 412

Query: 402 QNFM-ESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIADEKQRQFLEK 461
             F+ E + M+ VY YMF L+ EY+KLL FKP  P  A E+  + + C A  + R F+E+
Sbjct: 413 SRFIREEVKMEYVYDYMFHLMNEYAKLLKFKPEIPWGATEITPDIMGCSATGRWRDFMEE 472

Query: 462 SATS-ASLVPPCSL 468
           S     S   PC +
Sbjct: 473 SMVMFPSEESPCEM 483

BLAST of CmaCh13G000650 vs. TAIR10
Match: AT1G63420.1 (AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 379.4 bits (973), Expect = 3.3e-105
Identity = 182/379 (48.02%), Positives = 250/379 (65.96%), Query Frame = 1

Query: 106 STHQCPELFRWIHHDLDPWARSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRA 165
           S   CP+ F+WIH DL PW  + I+ + ++     A FR+VI+ G+++V+ Y   +Q+R 
Sbjct: 166 SNRSCPDYFKWIHEDLKPWRETGITKEMVERGKTTAHFRLVILNGKVFVENYKKSIQTRD 225

Query: 166 IFTIWGLVQLLRRFPGMVPDVDMMFDCMDRPTI--------NRTENKDMPLPLFRYCTTD 225
            FT+WG++QLLR++PG +PDVD+MFDC DRP I        NRT  ++ P PLFRYC   
Sbjct: 226 AFTLWGILQLLRKYPGKLPDVDLMFDCDDRPVIRSDGYNILNRTV-ENAPPPLFRYCGDR 285

Query: 226 AHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSNWSSKLPRAYWKGNPDVASPVRT 285
              DI FPDWSFWGW E+NIR W +  K++++  K   +  +   AYWKGNP VASP R 
Sbjct: 286 WTVDIVFPDWSFWGWQEINIREWSKVLKEMEEGKKKKKFMERDAYAYWKGNPFVASPSRE 345

Query: 286 ELLTCNHSI--KWGAQIMRQDWDQEARDGFEQSKLSKQCNHRYKIYAEGFAWSVSLKYIL 345
           +LLTCN S    W A+I  QDW  E + GFE S ++ QC +RYKIY EG+AWSVS KYIL
Sbjct: 346 DLLTCNLSSLHDWNARIFIQDWISEGQRGFENSNVANQCTYRYKIYIEGYAWSVSEKYIL 405

Query: 346 SCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIKHAVDWGNDHLSEAEAIGQQG 405
           +C S++L++ P Y DFFSR L PL++YWPI   + C SIK AVDW N+H  +A+ IG++ 
Sbjct: 406 ACDSVTLMVKPYYYDFFSRTLQPLQHYWPIRDKDKCRSIKFAVDWLNNHTQKAQEIGREA 465

Query: 406 QNFME-SLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEVCSESLLCIADEKQRQFLEK 465
             FM+  LSM+ VY YMF L+ EYSKLL +KP  P +++E+C+E+L+C ++ +    ++K
Sbjct: 466 SEFMQRDLSMENVYDYMFHLLNEYSKLLKYKPQVPKNSVELCTEALVCPSEGEDVNGVDK 525

Query: 466 SATSASLVP------PCSL 468
                SLV       PCSL
Sbjct: 526 KFMIGSLVSRPHASGPCSL 543

BLAST of CmaCh13G000650 vs. NCBI nr
Match: gi|659115070|ref|XP_008457372.1| (PREDICTED: uncharacterized protein LOC103497080 [Cucumis melo])

HSP 1 Score: 869.4 bits (2245), Expect = 3.0e-249
Identity = 402/472 (85.17%), Positives = 431/472 (91.31%), Query Frame = 1

Query: 12  ALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71
           A + R PS++LP+VVA+SFL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK F++
Sbjct: 2   APAPRPPSHLLPAVVAISFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFND 61

Query: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131
           +TRHAR VKIIHCSYL CRY  N AT+ P HSAVS  +CPE FRWIHHDLDPWA++RISM
Sbjct: 62  ETRHARAVKIIHCSYLTCRYVTNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWAQTRISM 121

Query: 132 KHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191
             L+ES KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD
Sbjct: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 181

Query: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241

Query: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311
           K+ +W +K PRAYWKGNPDV SP RTELL CNHS KWGAQIMRQDW QEARDG+EQSKLS
Sbjct: 242 KNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRKWGAQIMRQDWAQEARDGYEQSKLS 301

Query: 312 KQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371
            QCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMC 361

Query: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431
           ESIKHAVDWGN H  EAE IGQQGQNFMESLSMDTVY+YMF LITEYSKLLDFKPTPPPS
Sbjct: 362 ESIKHAVDWGNTHFPEAETIGQQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPPS 421

Query: 432 ALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           ALEVC++SLLCIADEKQRQFLEKSA S S VPPCSLNRAGSD +YSWLQQ E
Sbjct: 422 ALEVCADSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQTE 473

BLAST of CmaCh13G000650 vs. NCBI nr
Match: gi|449455154|ref|XP_004145318.1| (PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus])

HSP 1 Score: 854.7 bits (2207), Expect = 7.6e-245
Identity = 395/471 (83.86%), Positives = 425/471 (90.23%), Query Frame = 1

Query: 12  ALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSE 71
           A + R PS++LPSVVA+ FL+LTFL+CYKVDDFAAQTKTVAGHNLDPTPWHLFPPK FS+
Sbjct: 2   APAPRPPSHLLPSVVAICFLSLTFLLCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKTFSD 61

Query: 72  DTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISM 131
           +TRHAR VKIIHCSYL CRYA N AT+ P HSAVS  +CPE FRWIHHDLDPWAR+RISM
Sbjct: 62  ETRHARAVKIIHCSYLTCRYATNNATKFPFHSAVSAPKCPEFFRWIHHDLDPWARTRISM 121

Query: 132 KHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 191
             L+ES KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVDMMFD
Sbjct: 122 TQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQMLRRYPGMVPDVDMMFD 181

Query: 192 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 251
           CMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFWGWPEVN+RSW EEF+DIKK S
Sbjct: 182 CMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSWREEFEDIKKGS 241

Query: 252 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 311
           K+ +W +K PRAYWKGNPDV SP R ELL CNHS  WGAQIMRQDW QEARDG+EQSKLS
Sbjct: 242 KNLSWFNKFPRAYWKGNPDVDSPAREELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLS 301

Query: 312 KQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 371
            QCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMC
Sbjct: 302 NQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFTNMC 361

Query: 372 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 431
           ESIKHAVDWGN H  EAE IG+QGQ FMESLSMDTVY+YMF LITEYSKL DFKPTPPPS
Sbjct: 362 ESIKHAVDWGNTHFPEAETIGRQGQKFMESLSMDTVYSYMFHLITEYSKLQDFKPTPPPS 421

Query: 432 ALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQE 483
           ALEVC++SLLCIADEKQ QFLEKSA S S VPPCSLNR GSD +YSWLQQ+
Sbjct: 422 ALEVCTDSLLCIADEKQMQFLEKSAASVSSVPPCSLNRGGSDIIYSWLQQK 472

BLAST of CmaCh13G000650 vs. NCBI nr
Match: gi|1009114526|ref|XP_015873736.1| (PREDICTED: O-glucosyltransferase rumi homolog [Ziziphus jujuba])

HSP 1 Score: 701.0 bits (1808), Expect = 1.4e-198
Identity = 321/472 (68.01%), Positives = 385/472 (81.57%), Query Frame = 1

Query: 17  TPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDTRHA 76
           +PS+ILPS+ A + L++TF++ YKVD+FAAQTKTVAGHNL+PTPWHLFPPK F+E TR  
Sbjct: 11  SPSHILPSIAAFASLSITFVIIYKVDNFAAQTKTVAGHNLEPTPWHLFPPKTFNEQTRQV 70

Query: 77  RTVKIIHCSYLACRYANNTATRLPLHSA-----VSTHQCPELFRWIHHDLDPWARSRISM 136
           R  KI+HCSYL C Y++N      L S       S  +CP+ +RWIHHDL+PW+R+RIS 
Sbjct: 71  RNYKILHCSYLTCGYSSNDDESDLLLSRSTKAKASGKKCPDFYRWIHHDLEPWSRTRIST 130

Query: 137 KHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFD 196
            HL E+ +FAAFR VIV G+LYVD+YYACVQSRA+FTIWGL+QLLRR+PGMVPDVDMMFD
Sbjct: 131 THLKEAREFAAFRAVIVGGKLYVDLYYACVQSRAMFTIWGLLQLLRRYPGMVPDVDMMFD 190

Query: 197 CMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSS 256
           CMD+P+INRTE++ MPLPLFRYCTT+ HFDIPFPDWSFWGWPE N+  W EEFK IK  S
Sbjct: 191 CMDKPSINRTEHRSMPLPLFRYCTTEDHFDIPFPDWSFWGWPETNLNPWDEEFKSIKYGS 250

Query: 257 KSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLS 316
           + ++WS K PRAYWKGNPDV SP+RTELL CNHS KWGAQIMRQDW +EA+ G+E+SKLS
Sbjct: 251 QETSWSKKAPRAYWKGNPDVGSPIRTELLNCNHSRKWGAQIMRQDWAEEAKGGYEKSKLS 310

Query: 317 KQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMC 376
            QC++RYKIYAEG+AWSVS+KYILSCGS+SLIISP Y+DFFSRGL P KNYWPI   ++C
Sbjct: 311 SQCDYRYKIYAEGYAWSVSMKYILSCGSLSLIISPKYEDFFSRGLFPGKNYWPISDTDLC 370

Query: 377 ESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPS 436
            SIK+AVDWGN H SEAEAIG+ G++FM SLSMD VY YMF LI EYSKLLDFKP  P S
Sbjct: 371 PSIKYAVDWGNAHPSEAEAIGRGGRDFMGSLSMDRVYDYMFHLINEYSKLLDFKPVRPSS 430

Query: 437 ALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           +LEVC ESLLC+AD KQR+ LE+S    S  PPC L    S+ + +W+Q+++
Sbjct: 431 SLEVCKESLLCLADAKQRELLERSTAYTSPSPPCFLQPPDSNFINNWIQKKK 482

BLAST of CmaCh13G000650 vs. NCBI nr
Match: gi|255569363|ref|XP_002525649.1| (PREDICTED: protein O-glucosyltransferase 1 [Ricinus communis])

HSP 1 Score: 692.6 bits (1786), Expect = 5.0e-196
Identity = 321/478 (67.15%), Positives = 384/478 (80.33%), Query Frame = 1

Query: 7   PPTSRALSSRTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPP 66
           PP   A  +R PS + P ++ L  ++LT L  Y+VD+FA++TKTVAGHNLDPTPWH+FPP
Sbjct: 4   PPPKAA--ARVPSYLFPCLLGL--VSLTLLFFYQVDNFASRTKTVAGHNLDPTPWHIFPP 63

Query: 67  KIFSEDTRHARTVKIIHCSYLACRYANNTATRLPLHSAVSTH-QCPELFRWIHHDLDPWA 126
           + F E+TR AR  KII CSYL C Y N T TR    S+   + +CPE FR+IHHDL PWA
Sbjct: 64  RTFDEETRQARAYKIIQCSYLTCPYTNTTTTRRRSQSSSQANAKCPEFFRFIHHDLQPWA 123

Query: 127 RSRISMKHLDESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPD 186
           R+ I+ KH+ E+ KFAAFRVVI EGRLY+D+YYACVQSR +FT+WGL+QLL R+PGMVPD
Sbjct: 124 RTGITKKHIAEAKKFAAFRVVIFEGRLYLDLYYACVQSRMMFTVWGLLQLLNRYPGMVPD 183

Query: 187 VDMMFDCMDRPTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFK 246
           VD+MFDCMDRP IN+TE+   PLP+FRYCTT  HFDIPFPDWSFWGWPE+NIRSW EEF+
Sbjct: 184 VDIMFDCMDRPVINKTEHISFPLPIFRYCTTQNHFDIPFPDWSFWGWPEINIRSWNEEFR 243

Query: 247 DIKKSSKSSNWSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGF 306
           DIK+ S+S +WS K PRAYWKGNPDV SP+RTEL+ CNHS KWGA IMRQDW +EAR GF
Sbjct: 244 DIKRGSQSKSWSKKWPRAYWKGNPDVLSPIRTELMQCNHSRKWGAHIMRQDWGEEARAGF 303

Query: 307 EQSKLSKQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPI 366
           E+SKLS QCN+RYKIYAEGFAWSVSLKYI+SCGS++LIISP Y+DFFSRGL P  NYWP+
Sbjct: 304 ERSKLSNQCNYRYKIYAEGFAWSVSLKYIISCGSLALIISPQYEDFFSRGLVPASNYWPV 363

Query: 367 PFDNMCESIKHAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFK 426
             D +C SIK AVDWGN + SEAE+IG+ GQ+FME+LSM+ VY YMF LITEYSKL  FK
Sbjct: 364 ASDELCRSIKFAVDWGNANPSEAESIGKAGQDFMETLSMEGVYDYMFHLITEYSKLQVFK 423

Query: 427 PTPPPSALEVCSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           P  P SALEVC++SLLC AD KQ+QFLE+SA   S  P CSL  A  +++ SWLQ+++
Sbjct: 424 PVLPSSALEVCADSLLCFADPKQKQFLERSAAFPSPKPACSLQPADGNAIKSWLQEKQ 477

BLAST of CmaCh13G000650 vs. NCBI nr
Match: gi|702480170|ref|XP_010032976.1| (PREDICTED: O-glucosyltransferase rumi [Eucalyptus grandis])

HSP 1 Score: 684.1 bits (1764), Expect = 1.8e-193
Identity = 315/468 (67.31%), Positives = 372/468 (79.49%), Query Frame = 1

Query: 16  RTPSNILPSVVALSFLALTFLVCYKVDDFAAQTKTVAGHNLDPTPWHLFPPKIFSEDTRH 75
           R PSN+LP ++ALS  +++ L+ YKVDDFA+QTKTVAGHNLDPTPWHLFPPK F+E TR+
Sbjct: 10  RRPSNLLPCLIALSLFSISALLLYKVDDFASQTKTVAGHNLDPTPWHLFPPKTFNEKTRY 69

Query: 76  ARTVKIIHCSYLACRYANNTATRLPLHSAVSTHQCPELFRWIHHDLDPWARSRISMKHLD 135
           AR  KII CSYL C YA  +     L  + S   CP  F WI  DL+PW R+ IS  HL 
Sbjct: 70  ARASKIIQCSYLTCPYATGSIRGQDLSRSRSARACPAFFAWIRRDLEPWVRTGISPAHLM 129

Query: 136 ESMKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDMMFDCMDR 195
           E+ +FA+FRVVI EG+LYVD YYACVQSRA+FTIWGL+QLLRR+PGMVPDVD+MFDCMD+
Sbjct: 130 EAKRFASFRVVIFEGKLYVDFYYACVQSRAMFTIWGLLQLLRRYPGMVPDVDLMFDCMDK 189

Query: 196 PTINRTENKDMPLPLFRYCTTDAHFDIPFPDWSFWGWPEVNIRSWGEEFKDIKKSSKSSN 255
           P+INRTE+  MPLPLFRYCTT  HFDIPFPDWSFWGWPE N++ W EEF+DIK+ S+   
Sbjct: 190 PSINRTEHASMPLPLFRYCTTPGHFDIPFPDWSFWGWPETNLKPWDEEFRDIKQGSQVLR 249

Query: 256 WSSKLPRAYWKGNPDVASPVRTELLTCNHSIKWGAQIMRQDWDQEARDGFEQSKLSKQCN 315
           WS K P AYWKGNPDV SPVRTELL CNHS  W AQ+MRQDW +EAR G+EQSKLS QCN
Sbjct: 250 WSKKSPYAYWKGNPDVESPVRTELLKCNHSRMWNAQVMRQDWAEEARAGYEQSKLSNQCN 309

Query: 316 HRYKIYAEGFAWSVSLKYILSCGSMSLIISPLYQDFFSRGLDPLKNYWPIPFDNMCESIK 375
           HRYKIYAEG+AWSVSLKYI++CGS +LIISP Y+DFFSRGL P++NYWPI   N+C SIK
Sbjct: 310 HRYKIYAEGYAWSVSLKYIIACGSPALIISPEYEDFFSRGLFPMRNYWPISSTNLCPSIK 369

Query: 376 HAVDWGNDHLSEAEAIGQQGQNFMESLSMDTVYAYMFQLITEYSKLLDFKPTPPPSALEV 435
           +AV+WGN + SEAEAIG++GQ+FME LSMD +Y YM+ LI EYSKL +FKP P  SA EV
Sbjct: 370 YAVNWGNANPSEAEAIGKRGQDFMEDLSMDRIYDYMYHLIMEYSKLQNFKPIPSSSAREV 429

Query: 436 CSESLLCIADEKQRQFLEKSATSASLVPPCSLNRAGSDSVYSWLQQEE 484
           C +SLLC AD KQRQFLE+S   AS   PC+   A   +V SW++Q+E
Sbjct: 430 CVDSLLCFADPKQRQFLERSTALASEEAPCTFKAARGITVTSWIKQKE 477
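
The percentages in each HSP header follow directly from the counts printed beside them: identical (or positive-scoring) positions divided by the alignment length. The Python sketch below re-derives them from a header line. The function name parse_hsp_stats and the returned dictionary keys are illustrative assumptions and are not part of any BLAST tool; the pattern also accepts the "Postives" misspelling that some report generators emit.

```python
import re

# Matches the statistics line of an HSP header.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP statistics line.

    Hypothetical helper written for this record; not a BLAST API.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed from the raw counts, for cross-checking the report.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
    }


if __name__ == "__main__":
    header = "Identity = 173/366 (47.27%), Positives = 250/366 (68.31%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

Comparing the recomputed values with the reported ones is a quick way to catch rows damaged during copy-and-paste or text extraction.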

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
RUMI_DROME	3.8e-18	23.03	O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1 SV=1
PGLT1_BOVIN	3.8e-18	25.08	Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1
PGLT1_HUMAN	6.5e-18	24.56	Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1 SV=1
PGLT1_RAT	2.5e-17	24.56	Protein O-glucosyltransferase 1 OS=Rattus norvegicus GN=Poglut1 PE=3 SV=1
PGLT1_MOUSE	4.2e-17	24.26	Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=1 SV=2

Match Name	E-value	Identity (%)	Description
A0A0A0LY89_CUCSA	5.3e-245	83.86	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G531170 PE=4 SV=1
B9SI30_RICCO	3.5e-196	67.15	Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0612530 PE=4 SV=1
A0A059AF90_EUCGR	1.2e-193	67.31	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J01883 PE=4 SV=1
A0A067LDK6_JATCU	2.6e-191	64.92	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08464 PE=4 SV=1
W9RZY2_9ROSA	1.9e-186	68.39	Uncharacterized protein OS=Morus notabilis GN=L484_022004 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT1G07220.1	4.4e-182	62.34	Arabidopsis thaliana protein of unknown function (DUF821)
AT5G23850.1	4.5e-110	44.08	Arabidopsis thaliana protein of unknown function (DUF821)
AT3G48980.1	6.7e-106	47.27	Arabidopsis thaliana protein of unknown function (DUF821)
AT2G45830.1	1.1e-105	47.86	downstream target of AGL15 2
AT1G63420.1	3.3e-105	48.02	Arabidopsis thaliana protein of unknown function (DUF821)

Match Name	E-value	Identity (%)	Description
gi|659115070|ref|XP_008457372.1|	3.0e-249	85.17	PREDICTED: uncharacterized protein LOC103497080 [Cucumis melo]
gi|449455154|ref|XP_004145318.1|	7.6e-245	83.86	PREDICTED: O-glucosyltransferase rumi homolog [Cucumis sativus]
gi|1009114526|ref|XP_015873736.1|	1.4e-198	68.01	PREDICTED: O-glucosyltransferase rumi homolog [Ziziphus jujuba]
gi|255569363|ref|XP_002525649.1|	5.0e-196	67.15	PREDICTED: protein O-glucosyltransferase 1 [Ricinus communis]
gi|702480170|ref|XP_010032976.1|	1.8e-193	67.31	PREDICTED: O-glucosyltransferase rumi [Eucalyptus grandis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR006598	LipoPS_modifying
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008152	metabolic process
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0016740	transferase activity
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh13G000650.1	CmaCh13G000650.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR006598	Lipopolysaccharide-modifying protein	PFAM	PF05686	Glyco_transf_90	coord: 107..481; score: 2.3E
IPR006598	Lipopolysaccharide-modifying protein	SMART	SM00672	cap10	coord: 182..425; score: 1.0E
None	No IPR available	PANTHER	PTHR12203	KDEL LYS-ASP-GLU-LEU CONTAINING - RELATED	coord: 44..485; score: 5.8E
None	No IPR available	PANTHER	PTHR12203:SF13	F10K1.7 PROTEIN	coord: 44..485; score: 5.8E
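
The "coord" entries above give the start and end of each domain match. Assuming they are 1-based, inclusive residue positions on the protein (the usual convention for InterPro output, but not stated in this record), the length of a match is end - start + 1. A minimal Python sketch follows; the helper name domain_span is hypothetical.

```python
import re

# Matches an alignment cell such as "coord: 107..481".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_span(text):
    """Return (start, end, length) for a 'coord: START..END' entry.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinate pair found in {text!r}")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError(f"end precedes start in {text!r}")
    return start, end, end - start + 1


if __name__ == "__main__":
    # Source terms and coordinates copied from the annotation table.
    for source_term, cell in [
        ("PF05686", "coord: 107..481"),
        ("SM00672", "coord: 182..425"),
        ("PTHR12203", "coord: 44..485"),
    ]:
        start, end, length = domain_span(cell)
        print(source_term, start, end, length)
```

Under that assumption the Pfam match spans 375 residues and the SMART match 244, with the SMART region lying entirely inside the Pfam region.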

The following gene(s) are paralogous to this gene:

None