CmoCh07G000780 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh07G000780
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: hydroxyproline O-galactosyltransferase GALT2-like
Location: Cmo_Chr07: 483453 .. 488590 (-)
RNA-Seq Expression: CmoCh07G000780
Synteny: CmoCh07G000780
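The Location field packs a sequence ID, start, end, and strand into one string. The sketch below splits it and derives the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this), and the function names `parse_location` and `span_length` are illustrative, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr07: 483453 .. 488590 (-)")
length = span_length(start, end)
print(seqid, start, end, strand, length)
```

Under the inclusive-coordinate assumption the gene spans 5138 bp on the minus strand of Cmo_Chr07.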
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGTCCTTGAAGGGTGTTCTTATTTTCATTTCATGGGTGACTTCAATCTACCACGACCACGACAGCCTCGTTTTCGGGCCTTTCCGATTTCACCATGTTTATGGGGCTTTCCATTTATGTCTTCAATCTAAACTTATGGTCAATTTCTTTCCGCCAGGTTGTTCCCCCATTCACTTGATCTTCGGCGATTGATTGGTGTTGCCCAGCCGTGTTCATTGGTTTCTTTTGTTTTCTCTTTCCGTCTAGCATTTCCTAAATAGGGTTTGTTCGGACAACATTCAACTAAGATCATAAAAGGAAACGAACCCCGCTGATTTTGGGGTTCTACACCAGTTCTTCATGTTTCCTATTGCGATGTATTGCTTATTCTGAAATTTGGTGATTCCTAAACCCTGTCTTTCACATTCATTCGATCTCCTGGTTATTGCGGTATGCGCTCTTGCTTGTATATAGATTTTTGTTATTGCATTGTATTGATACACGTACGGATGCTGGGCTGAACATTTGGTGGGTCAGATGAATTGGTGTGTGTAATTTATGGTTGTTGTAGAGTACGTTTGAGCCTCTTGTTAGGTAATTTGTTTTTGTAGATGAAGAAGCTTAAGACAGAACCTCCTGTTGCGAGGAGGTTCAGGTTGTCGCATTTTCTTCTCGTAATTGGATTGTTGTATTTAGTTTTCATATCGTTTAAGTTTCCACGTTTTTTGGGAATTGCTACAACGTTGAGTGGGGATGAAAGTAATATTGGGTTGGATGGGACAGTCGGAGTGGATGGGGAAGGGGTGGATTTTAGCAAAGCATCGTTGGCTTCCGTTTATAAGGATACATTTCATCGGAAATTGGAAGATAACCAGCACTTAGAAGCACCATTGATGCCTAAAACAGAGCCACTCGAAGAGGTGAATAATGTTACCGGACCTATTAAGCCGATTCAGCATAAATATGGTCGGATTACTGGTAAAGTTTCGATTCAGCAGAATCATACTAACGACTTTTCCATCCTCGAGAGAATGGCAGATGAAGCTTGGACATTAGGCTCGAAGGCTTGGGAAGAAGTGGATAAATTTGGATTAAATGAGACTACTGAAAGTTCTATGCTCGAGGGAAAGCCTGAGTCGTGTCCTTCATGGATATCTACGGAAGGGAAGGAACTGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGGCTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCGCATCACGCTCATGTGGAGTATGTACCCCAACTTTTGAAGCTGGGAGGTGATCCTACGGTTTTGGTTTCACAGTTCATGGTTGAATTGCAAGGATTGAAATCAGTTGATGGTGAAGACCCACCAAAGATCCTTCACTTGAATCCACGACTGAAAGGCGATTGGAGTAAACGGCCGGTCATTGAACATAATACATGTTACAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGCGACGATGAAATGCTTGGTGAGATTAATCTCCTCCTTTCTGTTTTCATGTGTTTTTCATCACTAGCATTCAATCACTCCCGTGTGACACTCATAACCGGTGATGGTGTATGATGGCTGCAAATTATTGTTCTTCTCGACAATAGTGTAAACTAGTAAATGAGGAATATTTTCAATTTGGTCCAGCTAAAAGAAAGAATTTGTTCAGCATGCGTTTCACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCCCTCCCCCACCCCCCTTATAGTAGGAATTCTTAAGTTGCTTAGAGTCTTAGCTCGTATTTGATAGTCTAGATGTGCATTGGAGTTCTGTTTCTAGACGGATTTCCTG
TCCTCATTGAAATCTAGCCAGGAGTCCTAATTTTCTCAATCTATGGTGAGGAAGGTAAAGGTTATGAAATCTCTGATCTTGATTTATTTATTTTCCATTAATTTGAATCTACTCACCAACGATTTCAATTCCCAACTCAATTTTTTCCACTCTTTTAGCCGTGCTATAGTGCCAACGCCATCTTACATTTTGTTCCCTTTCAATCCTCTTATTTGACAGTTGATGGAAATCTTCGTTGTGATAAATGGCTGAGGAGTGATATTGAAGACTCAAAAGAAACAAAAACATCCTCATGGTTCAAGAGATTCATAGGGAGGGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCCTTTGTGGAGGGTAGATTGTTTATCCTCACACTACGTGCAGGCGTTGATGGATACCATATTAATGTTGGGGGTCGGCACTTGACTTCTTTTGCCTATCGCCCTGTAAGTATGTGGAGAAGAGGAGGATGCAGATAGCTATTTCATATTCATTCGTGTTTGATGTTTTTCTTACTGCTGCAGTTGGTATAGATCATCTCTCCCATGCTGTTTTAACATTAATTTTCTTACATCTTTGTTCGACTTTCATTTAGGGATTTACGCTTGAAGATGCAACTGGATTAGCTGTAAAAGGAGATGTAGACATTCATTCTACATATGCTACATCTCTTCCTACCTCTCATCCAAGCTTCTCTCCCCATCGAGTTCTTGAGATGTCAGAGAAATGGAAATCTCAGCCTTTACCAAAGAAATCGGTCTATCTTTTTATTGGTGTTCTATCTGCTACTAATCACTTTGCGGAGCGCATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTGAAGTCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGGTATGACTCTTGTGTTCATCCCTCTAAGGTATACGTGTTCCATTTTTCTGGCAACAAAATCTATATATGCTATTTTCCACGGACAATGGAATAGGTTTACATATGAAAGGAAAACAAGAAAGCATGCAATGGTTAGCCATGCTGGTTCACATTTTTTTAAAATAAGAGTTTATGAACTGCGCTGCAATGTTGTATTGTACTCCTGTGAGTCGGATGTGTTCTTGTATCAACGCCATTGAAGCGGCGGTTCATATTTCTTTTCATTATTGTTATTTTGTATTTCTTCTCATGTATAAACGCGCCGTGTAGAATCCGAGGAATGAGGTCAATGCTGTGCTGAAGAAGGAAGCCGCATACTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTAGTTGTTCTCAAGACTATTGCTATATGTGAGTTTGGGGTGAGTTTGCGTTTCCCTTCTTCTGTTTATCAGTATTTGAATCTTCTTTGTAGAACTTTGTTGTATAAACTGACTAGTGTTTGACTTTAGTGTATGCTGTTTTTCATTATGTGTTAAGTACTCCACTTTTCTCCTTGTAGGCTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGACGACACCTTTGTGAGGGTGGAAACTGTTATAAAACAGATTGAAGGCATTTCATCCAAGAAGTCCCTGTACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATTTGGTACATTGAATAAAATTTGGCTATTTTGTGCTTTTATCCTACTTGTTGCATCCTTGAGTTACAACGGCTTGGCTAATCCACCAGTACTTCTAGATCACTTTCGTCAATCAAGAAATGTTTAGCATAGCATTGCTGCTTGAAAACTAAGATTGGCCATTGACATAACTCCTCCTAGGCTAGCACAAAGGAGGGCTTTTTCTGATGGAAAGAACTCGGTATTGTTATTCATCAACAAGAGGGTATAAATCTATGGAGGACCGGGTAGCTTATTCGCTCTCGACCTGATTATCCAAATTCTCATCATAAAAATTGTGAGGTTCCACATCGATTGAAGAGGGGAACGAAACATTCCTTAT
AAGGGTGTGGAAACCTCTCCCTAACATACGCATTTTAAAACCTTGAGGGGAAGCCCTAGAAGGGAAACCCAAAGATGACAATACCGGCTAGCGGTGGGCTTGGGCTGTTACAAATGGTATTAGAGCGGGGGTGGATTGTAAGGTCACCATCAGTTGGAGAGTAGAACGAAGCATTCCTTATAAGAGTATGGAAACCTCTCCCTAGCATACGCGTTTCAAAACTTTTAGGGGAAGCCTTTGAAGAGAAAGCCCAAAAAAAGATAATACCTGCCAGCAGGCAGGCTTGAACTGTTACAAAAATATACCCCCTCACAACACTTGCATTCTCAAACATTTCCTTCCCGACTCTCTTTCATTCCTATTTCTCATCACTTGATTCAATTACAAAAGTCTGCCCACCTAACCCCCTTTCCTTTAATATACATTTATCACTGATGGCCTATCACTCTCTTTACACCTTTTAAACAATCCTTTTTGTTTCCTTTCTCATGCGAATGATTTATTTCCATGTACGGTAACAGGAATGGCCAGAAGAAGTGTATCCTCCATACGCCAATGGCCCAGGATATATCATTTCCATTGACATTGCCAAATACATTGTGGCTCAACATGAAAGCAGGAGCTTGAGGGTCAGTTTGAGAATACAAGCCTTGATGTCTCTGTTTGTTTGTTTTATACCGCTTTTGTGTATTAACTTGATGTTTTTGACATGAAAGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACAGTACGGTGGCGATCGTTCAGTACTCGCACAGCTGGAAATTCTGCCAGTATGGGTGCATGGAGGACTATTTTACAGCACACTATCAATCTCCAAGGCAGATAATTTGCCTGTGGGATAAATTAACGCAGGGCCAAGCTCACTGTTGCAACTTCAGGTGACAATTTTCCACGATTTTTACCCGCTCTATTTCTACCTGTGTTTATTGTTCAGTGTTAGCGTTACAAACTGTGATGCATGATAACTGTTAATTTTTTTTTCTTCTTTTCTTTTTATGTTTCATCAACAACAATTTTCCTGTATAGATTATAATGTGGAATGGGAATTTTGATATGAGGTAGCATGATTACTCCCCTTGTTATTTCACTTTCAC

mRNA sequence

AGTCCTTGAAGGGTGTTCTTATTTTCATTTCATGGGTGACTTCAATCTACCACGACCACGACAGCCTCGTTTTCGGGCCTTTCCGATTTCACCATGTTTATGGGGCTTTCCATTTATGTCTTCAATCTAAACTTATGGTCAATTTCTTTCCGCCAGGTTGTTCCCCCATTCACTTGATCTTCGGCGATTGATTGGTGTTGCCCAGCCGTGTTCATTGGTTTCTTTTGTTTTCTCTTTCCGTCTAGCATTTCCTAAATAGGGTTTGTTCGGACAACATTCAACTAAGATCATAAAAGGAAACGAACCCCGCTGATTTTGGGGTTCTACACCAGTTCTTCATGTTTCCTATTGCGATGTATTGCTTATTCTGAAATTTGGTGATTCCTAAACCCTGTCTTTCACATTCATTCGATCTCCTGGTTATTGCGGTATGCGCTCTTGCTTGTATATAGATTTTTGTTATTGCATTGTATTGATACACGTACGGATGCTGGGCTGAACATTTGGTGGGTCAGATGAATTGGTGTGTGTAATTTATGGTTGTTGTAGAGTACGTTTGAGCCTCTTGTTAGGTAATTTGTTTTTGTAGATGAAGAAGCTTAAGACAGAACCTCCTGTTGCGAGGAGGTTCAGGTTGTCGCATTTTCTTCTCGTAATTGGATTGTTGTATTTAGTTTTCATATCGTTTAAGTTTCCACGTTTTTTGGGAATTGCTACAACGTTGAGTGGGGATGAAAGTAATATTGGGTTGGATGGGACAGTCGGAGTGGATGGGGAAGGGGTGGATTTTAGCAAAGCATCGTTGGCTTCCGTTTATAAGGATACATTTCATCGGAAATTGGAAGATAACCAGCACTTAGAAGCACCATTGATGCCTAAAACAGAGCCACTCGAAGAGGTGAATAATGTTACCGGACCTATTAAGCCGATTCAGCATAAATATGGTCGGATTACTGGTAAAGTTTCGATTCAGCAGAATCATACTAACGACTTTTCCATCCTCGAGAGAATGGCAGATGAAGCTTGGACATTAGGCTCGAAGGCTTGGGAAGAAGTGGATAAATTTGGATTAAATGAGACTACTGAAAGTTCTATGCTCGAGGGAAAGCCTGAGTCGTGTCCTTCATGGATATCTACGGAAGGGAAGGAACTGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGGCTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCGCATCACGCTCATGTGGAGTATGTACCCCAACTTTTGAAGCTGGGAGGTGATCCTACGGTTTTGGTTTCACAGTTCATGGTTGAATTGCAAGGATTGAAATCAGTTGATGGTGAAGACCCACCAAAGATCCTTCACTTGAATCCACGACTGAAAGGCGATTGGAGTAAACGGCCGGTCATTGAACATAATACATGTTACAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGCGACGATGAAATGCTTGTTGATGGAAATCTTCGTTGTGATAAATGGCTGAGGAGTGATATTGAAGACTCAAAAGAAACAAAAACATCCTCATGGTTCAAGAGATTCATAGGGAGGGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCCTTTGTGGAGGGTAGATTGTTTATCCTCACACTACGTGCAGGCGTTGATGGATACCATATTAATGTTGGGGGTCGGCACTTGACTTCTTTTGCCTATCGCCCTGGATTTACGCTTGAAGATGCAACTGGATTAGCTGTAAAAGGAGATGTAGACATTCATTCTACATATGCTACATCTCTTCCTACCTCTCATCCAAGCTTCTCTCCCCATCGAGTTCTTGAGATGTCAGAGAAATGGAAATCTCAGCCTTTACCAAAGAAATCGGTCTATCTTTTTATTGGTGTTCTATCTGCTACTAATCACTTTGCGGAGCGCATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTGAAGTCATCAAATGTAGTTGTTC
GCTTCTTTGTTGCACTGTTTATGAACTGCGCTGCAATGTTGTATTGTACTCCTGTGAGTCGGATGTGTTCTTGTATCAACGCCATTGAAGCGGCGAATCCGAGGAATGAGGTCAATGCTGTGCTGAAGAAGGAAGCCGCATACTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTAGTTGTTCTCAAGACTATTGCTATATGTGAGTTTGGGGTGAGTTTGCGTTTCCCTTCTTCTGTTTATCAGTATTTGAATCTTCTTTGTAGAACTTTGTTGTATAAACTGACTAGTGCTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGACGACACCTTTGTGAGGGTGGAAACTGTTATAAAACAGATTGAAGGCATTTCATCCAAGAAGTCCCTGTACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATTTGGAATGGCCAGAAGAAGTGTATCCTCCATACGCCAATGGCCCAGGATATATCATTTCCATTGACATTGCCAAATACATTGTGGCTCAACATGAAAGCAGGAGCTTGAGGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACAGTACGGTGGCGATCGTTCAGTACTCGCACAGCTGGAAATTCTGCCAGTATGGGTGCATGGAGGACTATTTTACAGCACACTATCAATCTCCAAGGCAGATAATTTGCCTGTGGGATAAATTAACGCAGGGCCAAGCTCACTGTTGCAACTTCAGGTGACAATTTTCCACGATTTTTACCCGCTCTATTTCTACCTGTGTTTATTGTTCAGTGTTAGCGTTACAAACTGTGATGCATGATAACTGTTAATTTTTTTTTCTTCTTTTCTTTTTATGTTTCATCAACAACAATTTTCCTGTATAGATTATAATGTGGAATGGGAATTTTGATATGAGGTAGCATGATTACTCCCCTTGTTATTTCACTTTCAC

Coding sequence (CDS)

ATGAAGAAGCTTAAGACAGAACCTCCTGTTGCGAGGAGGTTCAGGTTGTCGCATTTTCTTCTCGTAATTGGATTGTTGTATTTAGTTTTCATATCGTTTAAGTTTCCACGTTTTTTGGGAATTGCTACAACGTTGAGTGGGGATGAAAGTAATATTGGGTTGGATGGGACAGTCGGAGTGGATGGGGAAGGGGTGGATTTTAGCAAAGCATCGTTGGCTTCCGTTTATAAGGATACATTTCATCGGAAATTGGAAGATAACCAGCACTTAGAAGCACCATTGATGCCTAAAACAGAGCCACTCGAAGAGGTGAATAATGTTACCGGACCTATTAAGCCGATTCAGCATAAATATGGTCGGATTACTGGTAAAGTTTCGATTCAGCAGAATCATACTAACGACTTTTCCATCCTCGAGAGAATGGCAGATGAAGCTTGGACATTAGGCTCGAAGGCTTGGGAAGAAGTGGATAAATTTGGATTAAATGAGACTACTGAAAGTTCTATGCTCGAGGGAAAGCCTGAGTCGTGTCCTTCATGGATATCTACGGAAGGGAAGGAACTGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGGCTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCGCATCACGCTCATGTGGAGTATGTACCCCAACTTTTGAAGCTGGGAGGTGATCCTACGGTTTTGGTTTCACAGTTCATGGTTGAATTGCAAGGATTGAAATCAGTTGATGGTGAAGACCCACCAAAGATCCTTCACTTGAATCCACGACTGAAAGGCGATTGGAGTAAACGGCCGGTCATTGAACATAATACATGTTACAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGCGACGATGAAATGCTTGTTGATGGAAATCTTCGTTGTGATAAATGGCTGAGGAGTGATATTGAAGACTCAAAAGAAACAAAAACATCCTCATGGTTCAAGAGATTCATAGGGAGGGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCCTTTGTGGAGGGTAGATTGTTTATCCTCACACTACGTGCAGGCGTTGATGGATACCATATTAATGTTGGGGGTCGGCACTTGACTTCTTTTGCCTATCGCCCTGGATTTACGCTTGAAGATGCAACTGGATTAGCTGTAAAAGGAGATGTAGACATTCATTCTACATATGCTACATCTCTTCCTACCTCTCATCCAAGCTTCTCTCCCCATCGAGTTCTTGAGATGTCAGAGAAATGGAAATCTCAGCCTTTACCAAAGAAATCGGTCTATCTTTTTATTGGTGTTCTATCTGCTACTAATCACTTTGCGGAGCGCATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTGAAGTCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGTTTATGAACTGCGCTGCAATGTTGTATTGTACTCCTGTGAGTCGGATGTGTTCTTGTATCAACGCCATTGAAGCGGCGAATCCGAGGAATGAGGTCAATGCTGTGCTGAAGAAGGAAGCCGCATACTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTAGTTGTTCTCAAGACTATTGCTATATGTGAGTTTGGGGTGAGTTTGCGTTTCCCTTCTTCTGTTTATCAGTATTTGAATCTTCTTTGTAGAACTTTGTTGTATAAACTGACTAGTGCTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGACGACACCTTTGTGAGGGTGGAAACTGTTATAAAACAGATTGAAGGCATTTCATCCAAGAAGTCCCTGTACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATTTGGAATGGCCAGAAGAAGTGTATCCTCCATACGCCAATGGCCCAGGATATATCATTTCCATTGACATTGCCAAATACATTGTGGCTCAACATGAAAGCAGGAGCTTGAGGATATT
CAAGATGGAGGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACAGTACGGTGGCGATCGTTCAGTACTCGCACAGCTGGAAATTCTGCCAGTATGGGTGCATGGAGGACTATTTTACAGCACACTATCAATCTCCAAGGCAGATAATTTGCCTGTGGGATAAATTAACGCAGGGCCAAGCTCACTGTTGCAACTTCAGGTGA

Protein sequence

MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGRITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSWISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEMLVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEMSEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNCAAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHESRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLWDKLTQGQAHCCNFR
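The protein sequence is the conceptual translation of the CDS. As a quick consistency check, the sketch below translates the first 30 and last 24 nucleotides of the CDS with the standard genetic code and compares them with the ends of the protein. The use of the standard nuclear code is an assumption (reasonable for a plant nuclear gene), and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna, to_stop=True):
    """Translate a DNA string in frame 0; optionally stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*" and to_stop:
            break
        protein.append(amino)
    return "".join(protein)

cds_start = "ATGAAGAAGCTTAAGACAGAACCTCCTGTT"   # first 30 nt of the CDS
cds_end = "GCTCACTGTTGCAACTTCAGGTGA"           # last 24 nt, ending in the TGA stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MKKLKTEPPV and AHCCNFR, which match the first ten and last seven residues of the protein sequence listed above.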
Homology
BLAST of CmoCh07G000780 vs. ExPASy Swiss-Prot
Match: A7XDQ9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=GALT2 PE=1 SV=1)

HSP 1 Score: 898.7 bits (2321), Expect = 4.5e-260
Identity = 456/743 (61.37%), Positives = 542/743 (72.95%), Query Frame = 0

Query: 1   MKKLKTEP----PVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDG 60
           MK++K+E       +RRF+LSHFLL I   YLVF++FKFP F+ +   LSGD    GLDG
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGD---TGLDG 60

Query: 61  TVGVDGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQH 120
            +      V  S     S+  D  +RKLED  H   P   +    EE  N +  I+P+  
Sbjct: 61  ALSDTSLDVSLS----GSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLF 120

Query: 121 KYGRITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTES-SMLEGKPE 180
           +YGRI+G+V  ++N T   S  ERMADEAW LGSKAWE+VDKF +++  ES S+ EGK E
Sbjct: 121 RYGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVE 180

Query: 181 SCPSWISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGD-PTVL 240
           SCPS IS  G +L + + IM LPCGLAAGSSITI+GTP +AH E VPQ  +L      VL
Sbjct: 181 SCPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVL 240

Query: 241 VSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPS 300
           VSQFMVELQGLK+ DGE PPKILHLNPR+KGDW+ RPVIEHNTCYRMQWG AQRCDG PS
Sbjct: 241 VSQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPS 300

Query: 301 SSDDEMLVDGNLRCDKWLRSDI---EDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRL 360
             D ++LVDG  RC+KW ++DI    DSKE+KT+SWFKRFIGREQKPEVTW FPF EG++
Sbjct: 301 KKDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKV 360

Query: 361 FILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPS 420
           F+LTLRAG+DG+HINVGGRH++SF YRPGFT+EDATGLAV GDVDIHS +ATSL TSHPS
Sbjct: 361 FVLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPS 420

Query: 421 FSPHRVLEMSEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVR 480
           FSP + +E S +WK+ PLP     LF+GVLSATNHF+ERMAVRKTWMQ  ++KSS+VV R
Sbjct: 421 FSPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVAR 480

Query: 481 FFVALFMNCAAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRY 540
           FFVAL                          NPR EVNA+LKKEA YFGDIVILPFMDRY
Sbjct: 481 FFVAL--------------------------NPRKEVNAMLKKEAEYFGDIVILPFMDRY 540

Query: 541 ELVVLKTIAICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRV 600
           ELVVLKTIAICEFGV                           N+TA Y+MKCDDDTF+RV
Sbjct: 541 ELVVLKTIAICEFGVQ--------------------------NVTAPYIMKCDDDTFIRV 600

Query: 601 ETVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDI 660
           E+++KQI+G+S +KSLYMGNLNL HRPLR GKW VT+ EWPE VYPPYANGPGYIIS +I
Sbjct: 601 ESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNI 660

Query: 661 AKYIVAQHESRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQ 720
           AKYIV+Q+    LR+FKMEDVSMG+WVEQFN+++  V+YSHSWKFCQYGC  +Y+TAHYQ
Sbjct: 661 AKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQ 684

Query: 721 SPRQIICLWDKLTQGQAHCCNFR 735
           SP Q++CLWD L +G+  CCNFR
Sbjct: 721 SPSQMMCLWDNLLKGRPQCCNFR 684

BLAST of CmoCh07G000780 vs. ExPASy Swiss-Prot
Match: Q8GXG6 (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=GALT4 PE=2 SV=2)

HSP 1 Score: 669.8 bits (1727), Expect = 3.4e-191
Identity = 367/751 (48.87%), Positives = 472/751 (62.85%), Query Frame = 0

Query: 1   MKKLKTEPPVAR-RFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDE--SNIGLDGT 60
           MKK K +   ++ RF L  FLLV+ L Y + +SF+ P      +    D+  S+   D  
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  VG--VDGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQ 120
               V G G   +   +    +   HR  +D   ++  L  +   + E  +V+       
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPER--KMREFKSVS------- 120

Query: 121 HKYGRITGKVSIQQN--HTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGK 180
                I    S   N   +++FSI  + A  A ++G K W+ +D  GL +  + + ++ +
Sbjct: 121 ----EIFVNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIK-PDKAPVKTR 180

Query: 181 PESCPSWISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTV 240
            E CP  +S    E +    I+ LPCGL  GS IT++ TPH AHVE         GD T 
Sbjct: 181 IEKCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKTA 240

Query: 241 LVSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLP 300
           +VSQFM+ELQGLK+VDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG  
Sbjct: 241 MVSQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDG-R 300

Query: 301 SSSDDEMLVDGNLRCDKWLRSDI------EDSKETKTSSWFKRFIGREQKPEV-TWPFPF 360
            SSDDE  VDG ++C++W R D       +D  E+K + W  R +GR +K     W +PF
Sbjct: 301 ESSDDEEYVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPF 360

Query: 361 VEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLP 420
            EG+LF+LTLRAG++GYHI+V GRH+TSF YR GF LEDATGLAVKG++D+HS YA SLP
Sbjct: 361 AEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLP 420

Query: 421 TSHPSFSPHRVLEMSEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSS 480
           +++PSF+P + LEM   WK+  LP+K V LFIG+LSA NHFAERMAVRK+WMQ   V+SS
Sbjct: 421 STNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSS 480

Query: 481 NVVVRFFVALFMNCAAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILP 540
            VV RFFVAL                          + R EVN  LKKEA YFGDIVI+P
Sbjct: 481 KVVARFFVAL--------------------------HARKEVNVDLKKEAEYFGDIVIVP 540

Query: 541 FMDRYELVVLKTIAICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDD 600
           +MD Y+LVVLKT+AICE+GV+                           + A YVMKCDDD
Sbjct: 541 YMDHYDLVVLKTVAICEYGVN--------------------------TVAAKYVMKCDDD 600

Query: 601 TFVRVETVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYI 660
           TFVRV+ VI++ E +  ++SLY+GN+N  H+PLR GKWAVT+ EWPEE YPPYANGPGYI
Sbjct: 601 TFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPGYI 660

Query: 661 ISIDIAKYIVAQHESRSLRIFKMEDVSMGMWVEQFNST--VAIVQYSHSWKFCQYGCMED 720
           +S D+AK+IV   E + LR+FKMEDVSMGMWVE+FN T  VA+V   HS KFCQ+GC+ED
Sbjct: 661 LSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVV---HSLKFCQFGCIED 673

Query: 721 YFTAHYQSPRQIICLWDKLTQ-GQAHCCNFR 735
           YFTAHYQSPRQ+IC+WDKL + G+  CCN R
Sbjct: 721 YFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CmoCh07G000780 vs. ExPASy Swiss-Prot
Match: Q9LV16 (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=GALT6 PE=2 SV=2)

HSP 1 Score: 664.1 bits (1712), Expect = 1.9e-189
Identity = 354/735 (48.16%), Positives = 455/735 (61.90%), Query Frame = 0

Query: 15  RLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLAS 74
           R    L+ +GLLY++ I+F+ P      T LS                     S+  L  
Sbjct: 25  RSVQILMAVGLLYMLLITFEIP--FVFKTGLS-------------------SLSQDPLTR 84

Query: 75  VYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGRITGKVSIQQNHTN- 134
             K    R+L++ +   AP  P    L + +    P + ++ +  RI   +       N 
Sbjct: 85  PEKHNSQRELQERR---APTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 ---DFSI-LERMADEAWTLGSKAWEEVDK----FGLNETTESSMLEGKPESCPSWISTEG 194
              D S+ L + A  AW +G K WEE++       L +  +  + E    SC   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKL-GGDPTVLVSQFMVELQG 254
            +LL+   IM LPCGL  GS IT++G P  AH E  P++  L  GD  V VSQF +ELQG
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQG 264

Query: 255 LKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEMLVDG 314
           LK+V+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S DDE  VDG
Sbjct: 265 LKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVDG 324

Query: 315 NLRCDKWLRSDIEDSKETKTSS----WFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 374
            ++C+KW R D   SKE ++S     W  R IGR +K  V WPFPF   +LF+LTL AG+
Sbjct: 325 QVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGL 384

Query: 375 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 434
           +GYH++V G+H+TSF YR GFTLEDATGL + GD+D+HS +A SLPTSHPSFSP R LE+
Sbjct: 385 EGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLEL 444

Query: 435 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 494
           S  W++  LP + V +FIG+LSA NHFAERMAVR++WMQ   VKSS VV RFFVAL    
Sbjct: 445 SSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVAL---- 504

Query: 495 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 554
                                 + R EVN  LKKEA +FGDIVI+P+MD Y+LVVLKT+A
Sbjct: 505 ----------------------HSRKEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVA 564

Query: 555 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 614
           ICE+G                          A  L A ++MKCDDDTFV+V+ V+ + + 
Sbjct: 565 ICEYG--------------------------AHQLAAKFIMKCDDDTFVQVDAVLSEAKK 624

Query: 615 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 674
             + +SLY+GN+N  H+PLR GKW+VTY EWPEE YPPYANGPGYI+S DI+++IV + E
Sbjct: 625 TPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFIVKEFE 681

Query: 675 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 734
              LR+FKMEDVS+GMWVEQFN+    V Y HS +FCQ+GC+E+Y TAHYQSPRQ+ICLW
Sbjct: 685 KHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQMICLW 681

BLAST of CmoCh07G000780 vs. ExPASy Swiss-Prot
Match: Q8RX55 (Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=GALT5 PE=1 SV=1)

HSP 1 Score: 659.8 bits (1701), Expect = 3.5e-188
Identity = 349/734 (47.55%), Positives = 457/734 (62.26%), Query Frame = 0

Query: 20  LLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLASVYKDT 79
           ++ IG LYLV +S + P                            + F   S +SV  D 
Sbjct: 30  IMAIGFLYLVIVSVEIP----------------------------LVFKSWSSSSVPLDA 89

Query: 80  FHR--KLEDNQHLEAPLMPKTEPLEEVN-NVTGPI----------KPIQHKYGRITGKVS 139
             R  KL + Q  +  ++P   PLE V+  V+ P           K  +H  G ++    
Sbjct: 90  LSRLEKLNNEQEPQVEIIP-NPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRF 149

Query: 140 IQQN---HTNDFSI-LERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSWIS 199
             +     + D S+ L + A EAW LG K W+E++   L +  E    + KP+SCP  +S
Sbjct: 150 DSETFDPSSKDGSVELHKSAKEAWQLGRKLWKELESGRLEKLVEKPE-KNKPDSCPHSVS 209

Query: 200 TEGKELLEGDG-IMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMVE 259
             G E +  +  +M LPCGL  GS IT++G P  AH +         GD + LVSQF++E
Sbjct: 210 LTGSEFMNRENKLMELPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVSQFVIE 269

Query: 260 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEML 319
           LQGLK+V+GEDPP+ILH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S DDE  
Sbjct: 270 LQGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSRDDEET 329

Query: 320 VDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGVD 379
           VD +++C+KW+R D   S+ ++   W  R IGR ++ +V WPFPFVE +LF+LTL AG++
Sbjct: 330 VDSHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLE 389

Query: 380 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEMS 439
           GYHINV G+H+TSF YR GFTLEDATGL V GD+D+HS +  SLPTSHPSF+P R LE+S
Sbjct: 390 GYHINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELS 449

Query: 440 EKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNCA 499
           ++W++  +P   V +FIG+LSA NHF+ERMAVRK+WMQ   + S+ VV RFFVAL     
Sbjct: 450 KRWQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVAL----- 509

Query: 500 AMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAI 559
                                + R EVN  LKKEA YFGDIV++P+MD Y+LVVLKT+AI
Sbjct: 510 ---------------------HGRKEVNVELKKEAEYFGDIVLVPYMDSYDLVVLKTVAI 569

Query: 560 CEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEGI 619
           CE G                          A+  +A Y+MKCDDDTFV++  VI +++ +
Sbjct: 570 CEHG--------------------------ALAFSAKYIMKCDDDTFVKLGAVINEVKKV 629

Query: 620 SSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHES 679
              +SLY+GN+N  H+PLR GKWAVTY EWPEE YPPYANGPGY++S DIA++IV + E 
Sbjct: 630 PEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFER 672

Query: 680 RSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLWD 735
             LR+FKMEDVS+GMWVE F +T   V Y HS +FCQ+GC+E+Y+TAHYQSPRQ+ICLWD
Sbjct: 690 HKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWD 672

BLAST of CmoCh07G000780 vs. ExPASy Swiss-Prot
Match: Q8L7F9 (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE=1 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 7.1e-80
Identity = 199/591 (33.67%), Positives = 289/591 (48.90%), Query Frame = 0

Query: 153 WEE----VDKFGLNETTESSMLEGKPESCPSWISTEGKELLEGDGI-MFLPCGLAAGSSI 212
           WE     V+   L +  E+   +GK E CP ++S       +G  + + +PCGL  GSSI
Sbjct: 126 WESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPCGLTQGSSI 185

Query: 213 TIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMVELQGLKSVDGEDPPKILHLNPRLKGDW 272
           T+IG P                    LV  F ++L G       DPP I+H N RL GD 
Sbjct: 186 TVIGIPDG------------------LVGSFRIDLTGQPLPGEPDPPIIVHYNVRLLGDK 245

Query: 273 S-KRPVIEHNTCYRMQ-WGTAQRCDGLPSSSDDEMLVDGNLRCDKWLRSDIEDSKETKTS 332
           S + PVI  N+    Q WG  +RC       D    VD    C+K +  +I  +  T   
Sbjct: 246 STEDPVIVQNSWTASQDWGAEERCPKF--DPDMNKKVDDLDECNKMVGGEINRTSSTSLQ 305

Query: 333 SWFKRF--IGREQKPEVTWPFPFVEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTL 392
           S   R   + RE      + FPF +G L + TLR G +G  + V G+H+TSFA+R     
Sbjct: 306 SNTSRGVPVAREASKHEKY-FPFKQGFLSVATLRVGTEGMQMTVDGKHITSFAFRDTLEP 365

Query: 393 EDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEMSEKWKSQPL-PKKSVYLFIGVLS 452
              + + + GD  + S  A+ LPTS  S     V+++ E  KS  L P + + L IGV S
Sbjct: 366 WLVSEIRITGDFRLISILASGLPTSEES---EHVVDL-EALKSPTLSPLRPLDLVIGVFS 425

Query: 453 ATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNCAAMLYCTPVSRMCSCINAIEAA 512
             N+F  RMAVR+TWMQ   V+S  V VRFFV         L+ +P+             
Sbjct: 426 TANNFKRRMAVRRTWMQYDDVRSGRVAVRFFVG--------LHKSPL------------- 485

Query: 513 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVSLRFPSSVYQYLNLLC 572
                VN  L  EA  +GD+ ++PF+D Y L+  KT+AIC FG  +              
Sbjct: 486 -----VNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEVD------------- 545

Query: 573 RTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEGISSKKSLYMGNLNLLHRPLRH- 632
                        +A ++MK DDD FVRV+ V+  +   ++ + L  G +N   +P+R+ 
Sbjct: 546 -------------SAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNP 605

Query: 633 -GKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHESRSLRIFKMEDVSMGMWVEQ 692
             KW ++Y EWPEE YPP+A+GPGYI+S DIA+ +    +  +L++FK+EDV+MG+W+ +
Sbjct: 606 DSKWYISYEEWPEEKYPPWAHGPGYIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAE 639

Query: 693 FNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLWDKLTQGQAHCC 732
                    Y +  +    GC + Y  AHYQSP ++ CLW K  + +   C
Sbjct: 666 LTKHGLEPHYENDGRIISDGCKDGYVVAHYQSPAEMTCLWRKYQETKRSLC 639

BLAST of CmoCh07G000780 vs. ExPASy TrEMBL
Match: A0A6J1EEU7 (hydroxyproline O-galactosyltransferase GALT2-like OS=Cucurbita moschata OX=3662 GN=LOC111433689 PE=3 SV=1)

HSP 1 Score: 1375.5 bits (3559), Expect = 0.0e+00
Identity = 682/734 (92.92%), Positives = 682/734 (92.92%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFG                          AVNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFG--------------------------AVNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQGQAHCCNFR
Sbjct: 721 DKLTQGQAHCCNFR 682

BLAST of CmoCh07G000780 vs. ExPASy TrEMBL
Match: A0A6J1KWJ5 (hydroxyproline O-galactosyltransferase GALT2-like OS=Cucurbita maxima OX=3661 GN=LOC111497063 PE=3 SV=1)

HSP 1 Score: 1343.6 bits (3476), Expect = 0.0e+00
Identity = 667/734 (90.87%), Positives = 671/734 (91.42%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRF  IATTLSGDESNIGLDGT+GV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFFEIATTLSGDESNIGLDGTIGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           D EGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DREGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVS QQNHTNDFSILE+MADEAWTLG KAWEEVDKFGLNETTESSMLEGKPE CPSW
Sbjct: 121 ITGKVSSQQNHTNDFSILEKMADEAWTLGLKAWEEVDKFGLNETTESSMLEGKPELCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQF V
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFTV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDI DSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIVDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFA+RPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAHRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SE WKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEIWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPR EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFGV                          VNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFGV--------------------------VNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTY+EWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYVEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQG AHCCNFR
Sbjct: 721 DKLTQGHAHCCNFR 682

BLAST of CmoCh07G000780 vs. ExPASy TrEMBL
Match: A0A5D3CNA3 (Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009340 PE=3 SV=1)

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 620/734 (84.47%), Positives = 652/734 (88.83%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKK+KTEPPVARR RLSH LLVIG+LYLVFISFKFPRFL IA TLSGDESN GLD   GV
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           D EG+DFSKASL+SVYKDTFHRKLEDN+HLEAPL PK EPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITG +S   NHTNDFS+LE+MADEAWTLG  AWEE+DKFGLNET ESS+LEGKPESCPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           IST+GK+L+EGDG+MFLPCGLAAGSSITIIGTPH AH EYVPQLLK+GGDP V+VSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGN RC+KWLRSD+ D+KE+KT+SWFKRFIGREQKPEVTWPFPF+EGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSP RVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPK SV+LFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPR EVNAVLK+EAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRKEVNAVLKREAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFGV                          VNLTASY+MKCDDDTFVRVETV+KQIEG
Sbjct: 541 ICEFGV--------------------------VNLTASYIMKCDDDTFVRVETVLKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTY EWPEEVYPPYANGPGYI+SIDIAKYIV+QHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           +RSLRIFKMEDVSMGMWVEQFNSTVA VQYSH+WKFCQYGCMEDYFTAHYQSPRQI+CLW
Sbjct: 661 NRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLW 681

Query: 721 DKLTQGQAHCCNFR 735
           DKL +G AHCCNFR
Sbjct: 721 DKLARGHAHCCNFR 681

BLAST of CmoCh07G000780 vs. ExPASy TrEMBL
Match: A0A1S3AZQ8 (hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC103484337 PE=3 SV=1)

HSP 1 Score: 1266.9 bits (3277), Expect = 0.0e+00
Identity = 620/734 (84.47%), Positives = 652/734 (88.83%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKK+KTEPPVARR RLSH LLVIG+LYLVFISFKFPRFL IA TLSGDESN GLD   GV
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           D EG+DFSKASL+SVYKDTFHRKLEDN+HLEAPL PK EPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITG +S   NHTNDFS+LE+MADEAWTLG  AWEE+DKFGLNET ESS+LEGKPESCPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           IST+GK+L+EGDG+MFLPCGLAAGSSITIIGTPH AH EYVPQLLK+GGDP V+VSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGN RC+KWLRSD+ D+KE+KT+SWFKRFIGREQKPEVTWPFPF+EGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSP RVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPK SV+LFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPR EVNAVLK+EAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRKEVNAVLKREAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFGV                          VNLTASY+MKCDDDTFVRVETV+KQIEG
Sbjct: 541 ICEFGV--------------------------VNLTASYIMKCDDDTFVRVETVLKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTY EWPEEVYPPYANGPGYI+SIDIAKYIV+QHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           +RSLRIFKMEDVSMGMWVEQFNSTVA VQYSH+WKFCQYGCMEDYFTAHYQSPRQI+CLW
Sbjct: 661 NRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLW 681

Query: 721 DKLTQGQAHCCNFR 735
           DKL +G AHCCNFR
Sbjct: 721 DKLARGHAHCCNFR 681

BLAST of CmoCh07G000780 vs. ExPASy TrEMBL
Match: A0A5A7UAW9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G001140 PE=3 SV=1)

HSP 1 Score: 1265.8 bits (3274), Expect = 0.0e+00
Identity = 619/734 (84.33%), Positives = 652/734 (88.83%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKK+KTEPPVARR RLSH LLVIG+LYLVFISFKFPRFL IA TLSGDESN GLD   GV
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           D EG+DFSKASL+SVYKDTFHRKLEDN+HLEAPL PK EPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITG +S   NHTNDFS+LE+MADEAWTLG  AWEE+DKFGLNET ESS+LEGKPESCPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           IST+GK+L+EGDG+MFLPCGLAAGSSITIIGTPH AH EYVPQLLK+GGDP V+VSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGN RC+KWLRSD+ D+KE+KT+SWFKRFIGREQKPEVTWPFPF+EGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSP RVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPK SV+LFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPR EVNAVLK+EAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRKEVNAVLKREAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFGV                          +NLTASY+MKCDDDTFVRVETV+KQIEG
Sbjct: 541 ICEFGV--------------------------MNLTASYIMKCDDDTFVRVETVLKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTY EWPEEVYPPYANGPGYI+SIDIAKYIV+QHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           +RSLRIFKMEDVSMGMWVEQFNSTVA VQYSH+WKFCQYGCMEDYFTAHYQSPRQI+CLW
Sbjct: 661 NRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLW 681

Query: 721 DKLTQGQAHCCNFR 735
           DKL +G AHCCNFR
Sbjct: 721 DKLARGHAHCCNFR 681

BLAST of CmoCh07G000780 vs. NCBI nr
Match: XP_022926607.1 (hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita moschata])

HSP 1 Score: 1375.5 bits (3559), Expect = 0.0e+00
Identity = 682/734 (92.92%), Positives = 682/734 (92.92%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFG                          AVNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFG--------------------------AVNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQGQAHCCNFR
Sbjct: 721 DKLTQGQAHCCNFR 682

BLAST of CmoCh07G000780 vs. NCBI nr
Match: KAG7026442.1 (Hydroxyproline O-galactosyltransferase GALT2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1369.8 bits (3544), Expect = 0.0e+00
Identity = 680/734 (92.64%), Positives = 680/734 (92.64%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFL IATTLSGDESNIGLDGTVGV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLEIATTLSGDESNIGLDGTVGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           DGEG DFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DGEGGDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFG                          AVNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFG--------------------------AVNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQGQAHCCNFR
Sbjct: 721 DKLTQGQAHCCNFR 682

BLAST of CmoCh07G000780 vs. NCBI nr
Match: XP_023517655.1 (hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1360.1 bits (3519), Expect = 0.0e+00
Identity = 675/734 (91.96%), Positives = 675/734 (91.96%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFL IATTLSGDESN GLDG VGV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLEIATTLSGDESNTGLDGAVGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           DGEGVDFSK SLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DGEGVDFSKPSLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVSIQQNHTNDFSILERMADEAWTLG KAWEEVDKFGLNETTESSMLEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGLKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDI DSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIVDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFG                          AVNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFG--------------------------AVNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDI KYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIVKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQGQAHCCNFR
Sbjct: 721 DKLTQGQAHCCNFR 682

BLAST of CmoCh07G000780 vs. NCBI nr
Match: XP_023003453.1 (hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita maxima])

HSP 1 Score: 1343.6 bits (3476), Expect = 0.0e+00
Identity = 667/734 (90.87%), Positives = 671/734 (91.42%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRF  IATTLSGDESNIGLDGT+GV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFFEIATTLSGDESNIGLDGTIGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           D EGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DREGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVS QQNHTNDFSILE+MADEAWTLG KAWEEVDKFGLNETTESSMLEGKPE CPSW
Sbjct: 121 ITGKVSSQQNHTNDFSILEKMADEAWTLGLKAWEEVDKFGLNETTESSMLEGKPELCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQF V
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFTV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDI DSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIVDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFA+RPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAHRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SE WKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEIWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPR EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFGV                          VNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFGV--------------------------VNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTY+EWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYVEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 720
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 682

Query: 721 DKLTQGQAHCCNFR 735
           DKLTQG AHCCNFR
Sbjct: 721 DKLTQGHAHCCNFR 682

BLAST of CmoCh07G000780 vs. NCBI nr
Match: KAG6594436.1 (CSC1-like protein ERD4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1285.4 bits (3325), Expect = 0.0e+00
Identity = 644/698 (92.26%), Positives = 644/698 (92.26%), Query Frame = 0

Query: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60
           MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFL IATTLSGDESNIGLDGTVGV
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLEIATTLSGDESNIGLDGTVGV 60

Query: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120
           DGEG DFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR
Sbjct: 61  DGEGGDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180
           ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240
           ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360
           LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 480
           SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL    
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVAL---- 480

Query: 481 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540
                                 NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA
Sbjct: 481 ----------------------NPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 540

Query: 541 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 600
           ICEFG                          AVNLTASYVMKCDDDTFVRVETVIKQIEG
Sbjct: 541 ICEFG--------------------------AVNLTASYVMKCDDDTFVRVETVIKQIEG 600

Query: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 660
           ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE
Sbjct: 601 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 646

Query: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQ 699
           SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQ
Sbjct: 661 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQ 646

BLAST of CmoCh07G000780 vs. TAIR 10
Match: AT4G21060.2 (Galactosyltransferase family protein)

HSP 1 Score: 898.7 bits (2321), Expect = 3.2e-261
Identity = 456/743 (61.37%), Positives = 542/743 (72.95%), Query Frame = 0

Query: 1   MKKLKTEP----PVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDG 60
           MK++K+E       +RRF+LSHFLL I   YLVF++FKFP F+ +   LSGD    GLDG
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGD---TGLDG 60

Query: 61  TVGVDGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQH 120
            +      V  S     S+  D  +RKLED  H   P   +    EE  N +  I+P+  
Sbjct: 61  ALSDTSLDVSLS----GSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLF 120

Query: 121 KYGRITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTES-SMLEGKPE 180
           +YGRI+G+V  ++N T   S  ERMADEAW LGSKAWE+VDKF +++  ES S+ EGK E
Sbjct: 121 RYGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVE 180

Query: 181 SCPSWISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGD-PTVL 240
           SCPS IS  G +L + + IM LPCGLAAGSSITI+GTP +AH E VPQ  +L      VL
Sbjct: 181 SCPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVL 240

Query: 241 VSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPS 300
           VSQFMVELQGLK+ DGE PPKILHLNPR+KGDW+ RPVIEHNTCYRMQWG AQRCDG PS
Sbjct: 241 VSQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPS 300

Query: 301 SSDDEMLVDGNLRCDKWLRSDI---EDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRL 360
             D ++LVDG  RC+KW ++DI    DSKE+KT+SWFKRFIGREQKPEVTW FPF EG++
Sbjct: 301 KKDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKV 360

Query: 361 FILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPS 420
           F+LTLRAG+DG+HINVGGRH++SF YRPGFT+EDATGLAV GDVDIHS +ATSL TSHPS
Sbjct: 361 FVLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPS 420

Query: 421 FSPHRVLEMSEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVR 480
           FSP + +E S +WK+ PLP     LF+GVLSATNHF+ERMAVRKTWMQ  ++KSS+VV R
Sbjct: 421 FSPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVAR 480

Query: 481 FFVALFMNCAAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRY 540
           FFVAL                          NPR EVNA+LKKEA YFGDIVILPFMDRY
Sbjct: 481 FFVAL--------------------------NPRKEVNAMLKKEAEYFGDIVILPFMDRY 540

Query: 541 ELVVLKTIAICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRV 600
           ELVVLKTIAICEFGV                           N+TA Y+MKCDDDTF+RV
Sbjct: 541 ELVVLKTIAICEFGVQ--------------------------NVTAPYIMKCDDDTFIRV 600

Query: 601 ETVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDI 660
           E+++KQI+G+S +KSLYMGNLNL HRPLR GKW VT+ EWPE VYPPYANGPGYIIS +I
Sbjct: 601 ESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNI 660

Query: 661 AKYIVAQHESRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQ 720
           AKYIV+Q+    LR+FKMEDVSMG+WVEQFN+++  V+YSHSWKFCQYGC  +Y+TAHYQ
Sbjct: 661 AKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQ 684

Query: 721 SPRQIICLWDKLTQGQAHCCNFR 735
           SP Q++CLWD L +G+  CCNFR
Sbjct: 721 SPSQMMCLWDNLLKGRPQCCNFR 684

BLAST of CmoCh07G000780 vs. TAIR 10
Match: AT4G21060.1 (Galactosyltransferase family protein)

HSP 1 Score: 879.8 bits (2272), Expect = 1.5e-255
Identity = 442/713 (61.99%), Positives = 523/713 (73.35%), Query Frame = 0

Query: 27  YLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLASVYKDTFHRKLED 86
           YLVF++FKFP F+ +   LSGD    GLDG +      V  S     S+  D  +RKLED
Sbjct: 88  YLVFLAFKFPHFIEMVAMLSGD---TGLDGALSDTSLDVSLS----GSLRNDMLNRKLED 147

Query: 87  NQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGRITGKVSIQQNHTNDFSILERMADEAW 146
             H   P   +    EE  N +  I+P+  +YGRI+G+V  ++N T   S  ERMADEAW
Sbjct: 148 EDHQSGPSTTQKVSPEEKINGSKQIQPLLFRYGRISGEVMRRRNRTIHMSPFERMADEAW 207

Query: 147 TLGSKAWEEVDKFGLNETTES-SMLEGKPESCPSWISTEGKELLEGDGIMFLPCGLAAGS 206
            LGSKAWE+VDKF +++  ES S+ EGK ESCPS IS  G +L + + IM LPCGLAAGS
Sbjct: 208 ILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDLNKANRIMLLPCGLAAGS 267

Query: 207 SITIIGTPHHAHVEYVPQLLKLGGD-PTVLVSQFMVELQGLKSVDGEDPPKILHLNPRLK 266
           SITI+GTP +AH E VPQ  +L      VLVSQFMVELQGLK+ DGE PPKILHLNPR+K
Sbjct: 268 SITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEYPPKILHLNPRIK 327

Query: 267 GDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEMLVDGNLRCDKWLRSDI---EDSKE 326
           GDW+ RPVIEHNTCYRMQWG AQRCDG PS  D ++LVDG  RC+KW ++DI    DSKE
Sbjct: 328 GDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWTQNDIIDMVDSKE 387

Query: 327 TKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGF 386
           +KT+SWFKRFIGREQKPEVTW FPF EG++F+LTLRAG+DG+HINVGGRH++SF YRPGF
Sbjct: 388 SKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGGRHVSSFPYRPGF 447

Query: 387 TLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEMSEKWKSQPLPKKSVYLFIGVL 446
           T+EDATGLAV GDVDIHS +ATSL TSHPSFSP + +E S +WK+ PLP     LF+GVL
Sbjct: 448 TIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPPLPGTPFRLFMGVL 507

Query: 447 SATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNCAAMLYCTPVSRMCSCINAIEA 506
           SATNHF+ERMAVRKTWMQ  ++KSS+VV RFFVAL                         
Sbjct: 508 SATNHFSERMAVRKTWMQHPSIKSSDVVARFFVAL------------------------- 567

Query: 507 ANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVSLRFPSSVYQYLNLL 566
            NPR EVNA+LKKEA YFGDIVILPFMDRYELVVLKTIAICEFGV               
Sbjct: 568 -NPRKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQ-------------- 627

Query: 567 CRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEGISSKKSLYMGNLNLLHRPLRH 626
                       N+TA Y+MKCDDDTF+RVE+++KQI+G+S +KSLYMGNLNL HRPLR 
Sbjct: 628 ------------NVTAPYIMKCDDDTFIRVESILKQIDGVSPEKSLYMGNLNLRHRPLRT 687

Query: 627 GKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHESRSLRIFKMEDVSMGMWVEQF 686
           GKW VT+ EWPE VYPPYANGPGYIIS +IAKYIV+Q+    LR+FKMEDVSMG+WVEQF
Sbjct: 688 GKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQNSRHKLRLFKMEDVSMGLWVEQF 741

Query: 687 NSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLWDKLTQGQAHCCNFR 735
           N+++  V+YSHSWKFCQYGC  +Y+TAHYQSP Q++CLWD L +G+  CCNFR
Sbjct: 748 NASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCLWDNLLKGRPQCCNFR 741

BLAST of CmoCh07G000780 vs. TAIR 10
Match: AT1G27120.1 (Galactosyltransferase family protein)

HSP 1 Score: 669.8 bits (1727), Expect = 2.4e-192
Identity = 367/751 (48.87%), Positives = 472/751 (62.85%), Query Frame = 0

Query: 1   MKKLKTEPPVAR-RFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDE--SNIGLDGT 60
           MKK K +   ++ RF L  FLLV+ L Y + +SF+ P      +    D+  S+   D  
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  VG--VDGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQ 120
               V G G   +   +    +   HR  +D   ++  L  +   + E  +V+       
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPER--KMREFKSVS------- 120

Query: 121 HKYGRITGKVSIQQN--HTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGK 180
                I    S   N   +++FSI  + A  A ++G K W+ +D  GL +  + + ++ +
Sbjct: 121 ----EIFVNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIK-PDKAPVKTR 180

Query: 181 PESCPSWISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTV 240
            E CP  +S    E +    I+ LPCGL  GS IT++ TPH AHVE         GD T 
Sbjct: 181 IEKCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKTA 240

Query: 241 LVSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLP 300
           +VSQFM+ELQGLK+VDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG  
Sbjct: 241 MVSQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDG-R 300

Query: 301 SSSDDEMLVDGNLRCDKWLRSDI------EDSKETKTSSWFKRFIGREQKPEV-TWPFPF 360
            SSDDE  VDG ++C++W R D       +D  E+K + W  R +GR +K     W +PF
Sbjct: 301 ESSDDEEYVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPF 360

Query: 361 VEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLP 420
            EG+LF+LTLRAG++GYHI+V GRH+TSF YR GF LEDATGLAVKG++D+HS YA SLP
Sbjct: 361 AEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLP 420

Query: 421 TSHPSFSPHRVLEMSEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSS 480
           +++PSF+P + LEM   WK+  LP+K V LFIG+LSA NHFAERMAVRK+WMQ   V+SS
Sbjct: 421 STNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSS 480

Query: 481 NVVVRFFVALFMNCAAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILP 540
            VV RFFVAL                          + R EVN  LKKEA YFGDIVI+P
Sbjct: 481 KVVARFFVAL--------------------------HARKEVNVDLKKEAEYFGDIVIVP 540

Query: 541 FMDRYELVVLKTIAICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDD 600
           +MD Y+LVVLKT+AICE+GV+                           + A YVMKCDDD
Sbjct: 541 YMDHYDLVVLKTVAICEYGVN--------------------------TVAAKYVMKCDDD 600

Query: 601 TFVRVETVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYI 660
           TFVRV+ VI++ E +  ++SLY+GN+N  H+PLR GKWAVT+ EWPEE YPPYANGPGYI
Sbjct: 601 TFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPGYI 660

Query: 661 ISIDIAKYIVAQHESRSLRIFKMEDVSMGMWVEQFNST--VAIVQYSHSWKFCQYGCMED 720
           +S D+AK+IV   E + LR+FKMEDVSMGMWVE+FN T  VA+V   HS KFCQ+GC+ED
Sbjct: 661 LSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVV---HSLKFCQFGCIED 673

Query: 721 YFTAHYQSPRQIICLWDKLTQ-GQAHCCNFR 735
           YFTAHYQSPRQ+IC+WDKL + G+  CCN R
Sbjct: 721 YFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CmoCh07G000780 vs. TAIR 10
Match: AT5G62620.1 (Galactosyltransferase family protein)

HSP 1 Score: 664.1 bits (1712), Expect = 1.3e-190
Identity = 354/735 (48.16%), Positives = 455/735 (61.90%), Query Frame = 0

Query: 15  RLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLAS 74
           R    L+ +GLLY++ I+F+ P      T LS                     S+  L  
Sbjct: 25  RSVQILMAVGLLYMLLITFEIP--FVFKTGLS-------------------SLSQDPLTR 84

Query: 75  VYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGRITGKVSIQQNHTN- 134
             K    R+L++ +   AP  P    L + +    P + ++ +  RI   +       N 
Sbjct: 85  PEKHNSQRELQERR---APTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 ---DFSI-LERMADEAWTLGSKAWEEVDK----FGLNETTESSMLEGKPESCPSWISTEG 194
              D S+ L + A  AW +G K WEE++       L +  +  + E    SC   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKL-GGDPTVLVSQFMVELQG 254
            +LL+   IM LPCGL  GS IT++G P  AH E  P++  L  GD  V VSQF +ELQG
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQG 264

Query: 255 LKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEMLVDG 314
           LK+V+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S DDE  VDG
Sbjct: 265 LKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVDG 324

Query: 315 NLRCDKWLRSDIEDSKETKTSS----WFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 374
            ++C+KW R D   SKE ++S     W  R IGR +K  V WPFPF   +LF+LTL AG+
Sbjct: 325 QVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGL 384

Query: 375 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 434
           +GYH++V G+H+TSF YR GFTLEDATGL + GD+D+HS +A SLPTSHPSFSP R LE+
Sbjct: 385 EGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLEL 444

Query: 435 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNC 494
           S  W++  LP + V +FIG+LSA NHFAERMAVR++WMQ   VKSS VV RFFVAL    
Sbjct: 445 SSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVAL---- 504

Query: 495 AAMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIA 554
                                 + R EVN  LKKEA +FGDIVI+P+MD Y+LVVLKT+A
Sbjct: 505 ----------------------HSRKEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVA 564

Query: 555 ICEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEG 614
           ICE+G                          A  L A ++MKCDDDTFV+V+ V+ + + 
Sbjct: 565 ICEYG--------------------------AHQLAAKFIMKCDDDTFVQVDAVLSEAKK 624

Query: 615 ISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHE 674
             + +SLY+GN+N  H+PLR GKW+VTY EWPEE YPPYANGPGYI+S DI+++IV + E
Sbjct: 625 TPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFIVKEFE 681

Query: 675 SRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLW 734
              LR+FKMEDVS+GMWVEQFN+    V Y HS +FCQ+GC+E+Y TAHYQSPRQ+ICLW
Sbjct: 685 KHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQMICLW 681

BLAST of CmoCh07G000780 vs. TAIR 10
Match: AT1G74800.1 (Galactosyltransferase family protein)

HSP 1 Score: 659.8 bits (1701), Expect = 2.5e-189
Identity = 349/734 (47.55%), Positives = 457/734 (62.26%), Query Frame = 0

Query: 20  LLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGVDGEGVDFSKASLASVYKDT 79
           ++ IG LYLV +S + P                            + F   S +SV  D 
Sbjct: 30  IMAIGFLYLVIVSVEIP----------------------------LVFKSWSSSSVPLDA 89

Query: 80  FHR--KLEDNQHLEAPLMPKTEPLEEVN-NVTGPI----------KPIQHKYGRITGKVS 139
             R  KL + Q  +  ++P   PLE V+  V+ P           K  +H  G ++    
Sbjct: 90  LSRLEKLNNEQEPQVEIIP-NPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRF 149

Query: 140 IQQN---HTNDFSI-LERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSWIS 199
             +     + D S+ L + A EAW LG K W+E++   L +  E    + KP+SCP  +S
Sbjct: 150 DSETFDPSSKDGSVELHKSAKEAWQLGRKLWKELESGRLEKLVEKPE-KNKPDSCPHSVS 209

Query: 200 TEGKELLEGDG-IMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMVE 259
             G E +  +  +M LPCGL  GS IT++G P  AH +         GD + LVSQF++E
Sbjct: 210 LTGSEFMNRENKLMELPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVSQFVIE 269

Query: 260 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEML 319
           LQGLK+V+GEDPP+ILH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S DDE  
Sbjct: 270 LQGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSRDDEET 329

Query: 320 VDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGVD 379
           VD +++C+KW+R D   S+ ++   W  R IGR ++ +V WPFPFVE +LF+LTL AG++
Sbjct: 330 VDSHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLE 389

Query: 380 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEMS 439
           GYHINV G+H+TSF YR GFTLEDATGL V GD+D+HS +  SLPTSHPSF+P R LE+S
Sbjct: 390 GYHINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELS 449

Query: 440 EKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALFMNCA 499
           ++W++  +P   V +FIG+LSA NHF+ERMAVRK+WMQ   + S+ VV RFFVAL     
Sbjct: 450 KRWQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVAL----- 509

Query: 500 AMLYCTPVSRMCSCINAIEAANPRNEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAI 559
                                + R EVN  LKKEA YFGDIV++P+MD Y+LVVLKT+AI
Sbjct: 510 ---------------------HGRKEVNVELKKEAEYFGDIVLVPYMDSYDLVVLKTVAI 569

Query: 560 CEFGVSLRFPSSVYQYLNLLCRTLLYKLTSAVNLTASYVMKCDDDTFVRVETVIKQIEGI 619
           CE G                          A+  +A Y+MKCDDDTFV++  VI +++ +
Sbjct: 570 CEHG--------------------------ALAFSAKYIMKCDDDTFVKLGAVINEVKKV 629

Query: 620 SSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIAKYIVAQHES 679
              +SLY+GN+N  H+PLR GKWAVTY EWPEE YPPYANGPGY++S DIA++IV + E 
Sbjct: 630 PEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFER 672

Query: 680 RSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQSPRQIICLWD 735
             LR+FKMEDVS+GMWVE F +T   V Y HS +FCQ+GC+E+Y+TAHYQSPRQ+ICLWD
Sbjct: 690 HKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWD 672

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
A7XDQ9      4.5e-260  61.37         Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=...
Q8GXG6      3.4e-191  48.87         Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=...
Q9LV16      1.9e-189  48.16         Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=...
Q8RX55      3.5e-188  47.55         Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=...
Q8L7F9      7.1e-80   33.67         Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE...

Match Name  E-value   Identity (%)  Description
A0A6J1EEU7  0.0e+00   92.92         hydroxyproline O-galactosyltransferase GALT2-like OS=Cucurbita moschata OX=3662 ...
A0A6J1KWJ5  0.0e+00   90.87         hydroxyproline O-galactosyltransferase GALT2-like OS=Cucurbita maxima OX=3661 GN...
A0A5D3CNA3  0.0e+00   84.47         Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3AZQ8  0.0e+00   84.47         hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC10348...
A0A5A7UAW9  0.0e+00   84.33         Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194...

Match Name      E-value   Identity (%)  Description
XP_022926607.1  0.0e+00   92.92         hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita moschata]
KAG7026442.1    0.0e+00   92.64         Hydroxyproline O-galactosyltransferase GALT2 [Cucurbita argyrosperma subsp. argy...
XP_023517655.1  0.0e+00   91.96         hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita pepo subsp. pepo]
XP_023003453.1  0.0e+00   90.87         hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita maxima]
KAG6594436.1    0.0e+00   92.26         CSC1-like protein ERD4, partial [Cucurbita argyrosperma subsp. sororia]

Match Name   E-value   Identity (%)  Description
AT4G21060.2  3.2e-261  61.37         Galactosyltransferase family protein
AT4G21060.1  1.5e-255  61.99         Galactosyltransferase family protein
AT1G27120.1  2.4e-192  48.87         Galactosyltransferase family protein
AT5G62620.1  1.3e-190  48.16         Galactosyltransferase family protein
AT1G74800.1  2.5e-189  47.55         Galactosyltransferase family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                          Source       Source Term      Source Description                            Alignment
IPR001079  Galectin, carbohydrate recognition domain                SMART        SM00908          Gal_bind_lectin_2                             coord: 197..402; e-value: 1.7E-24; score: 97.4
IPR001079  Galectin, carbohydrate recognition domain                PFAM         PF00337          Gal-bind_lectin                               coord: 194..401; e-value: 4.1E-48; score: 162.3
IPR001079  Galectin, carbohydrate recognition domain                PROSITE      PS51304          GALECTIN                                      coord: 193..403; score: 30.539888
IPR001079  Galectin, carbohydrate recognition domain                CDD          cd00070          GLECT                                         coord: 196..399; e-value: 7.97673E-23; score: 92.3119
None       No IPR available                                         GENE3D       2.60.120.200     -                                             coord: 299..401; e-value: 7.5E-11; score: 43.6
None       No IPR available                                         GENE3D       3.90.550.50      -                                             coord: 425..730; e-value: 3.1E-18; score: 67.9
None       No IPR available                                         GENE3D       2.60.120.200     -                                             coord: 194..298; e-value: 1.0E-12; score: 49.6
None       No IPR available                                         PANTHER      PTHR11214:SF212  HYDROXYPROLINE O-GALACTOSYLTRANSFERASE GALT2  coord: 8..546; coord: 574..734
IPR002659  Glycosyl transferase, family 31                          PFAM         PF01762          Galactosyl_T                                  coord: 556..681; e-value: 5.4E-17; score: 62.2
IPR002659  Glycosyl transferase, family 31                          PANTHER      PTHR11214        BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASE      coord: 8..546; coord: 574..734
IPR013320  Concanavalin A-like lectin/glucanase domain superfamily  SUPERFAMILY  49899            Concanavalin A-like lectins/glucanases        coord: 195..400

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh07G000780.1  CmoCh07G000780.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006486 protein glycosylation
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0016758 hexosyltransferase activity