Csor.00g078610 (gene) Silver-seed gourd (wild; sororia) v1

Overview
NameCsor.00g078610
Typegene
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase-like
LocationCsor_Chr14: 7213875 .. 7218694 (-)
RNA-Seq ExpressionCsor.00g078610
SyntenyCsor.00g078610
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSinitialstart_codonintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCACTCCGGAAGGATATGGGGAAGTATGACTCCATCAAACCACTCCATGACTGTGATCTCGCCAATGAGACTGCTGTTCTCATAAATCCAGATTCCCTGACTCTCTTCTCTGTTTCTCACCACTGTAATCACTCTCATCAAGATGTTGAGATGGCTCTTCTTGATTCTCATTCCAGATCTCCTCTTCCTCTTCACAATGCCAATCCCTTAACTCCTCCTGCTTCTTCCAAACTCGATGACGCCCAATTTTCCTCTCCGGCTAGGCCCATTCTCCGATCTTCTCCCCAAGGCCAAGCCCAACGTCTTGCTTCGCTCGATGTATTTCGCGGCATCACTGTTGCGGTAATCATTTTGTTTCATTTTCTTTCTTGCTTTCTTTCTTTCCTTTTCCCCTTATTGCTTTCGTTGTTGAACTTATTTGTTGGATTTTGTTGATGTTCTTCGATTGGATGGGATGATCTTATTTCTTGTTTGTATTATCGGCCATTGAGATTTCTCGTAGCCGATGGACTGGGGGAGAAAGAACATTGGTCTCATTGTGGTCTTGTGTTTTACTATTTTTTTAAAACAAACAAAAATATCAGGATTTGACTTCCAGAATTTTACCGGCCATCGGTGGAGATTGGAAGAAATAATATCTATTTTGACTTTTAAAATTGAAGTTTCTGATTGGGGGAAGAGGGTTTCAGTATTAATTGTAGGTTTTCTGATCCTCGTATCATGAAATGAGTTCCTTCATTTGAGGCGATTTTAGGGGAAGGGATGAAACCTAATTCTGACCATGTTTCCCAACTTTTTGTTTTGGAAATGAATTATTCCCCGATTGGTTTTTCTTTTAAAAAAGAAAAAAGTTCTTACCATAATTTCAAGTTATCCCCTCCCTCTCCTTATTTGTTTTAAATAAAGATATATCATGTTCTTCAATCTATCACCATCCACTTGTCAACTCTTTTGAATAAACAAACGGGATGCTGATCGGTGAAGCATATAGTTTGGTGGTGGCAGAAACAGGAGGGGTCGTTAATTGGGTTTAAAAATTATATTATACCAAACCGCGTGCAATGAAGCAGGCACATTATTTGTGTAGGACATGCATGATGACGCAGCAAGCAAGCTTCATGGTCTTGCTGGTCGTCTTCAATGGCGACCGGTTTAGAACATTGCTTGAAGAAGAAAAAGGAGAAGAACCCTTTAACCCTTCTTTTTCATCTGTTTTCTCTCTGGCTTTGCCCTGAAGAAACCATGAACAAAACACTTCCTTGCGGCAACTTTACTCATTTTATGCACAGATGTTAGACGGATACGCAAAGATCCATTTGCATCTTTTGACCCGTTCCTTTGATGTGATTTTTCACCCGTGTTTATTTAGGTGTAGCTGTTCAGCATAATCTTTCTGGTGCTTTTAATTATTTTAAACAATTTGTTGGCTTCCGATTTGGCAACCTTTCTTTTTATCGGTTAGGTTACTGTCCAAAGTTGCTTTGAGGACTGACATGCTTGGATACTTCACATGTCGCCATGCCCAACCATGGGGCGCTGGATTCAATTTTTGGAGTTCTCTGGAAGGAAATAGTTAGAAAGTTCGCAACTTTAGAGCCAGTAATTCTAATTTACTATATAATTTTGGGGCATGATAGAGAGAACTTTAAATTCTGACCTTTTTTTTCTCATAATTTATTTTAATGTAAGGTCAACCTGACCCATCCCTTGTTTTTTTTTTTTCTCTTTTCATAAGTTATGCTGCCTCTACACAGAAATCTTCATGTAGTTTGATTGTGGCGAGCATGTTGAAAAGCTCAAGTTCAATAAGTTGACTTTGTTTTAATTATCTATTGAATGAGGTATGGAAATGATCAATGGGAAAGAAAACAAAACAATAATAGTTTCTTGATCTGGTCTACATTACGTGATCAGTGGATTCATACATCTGAATATAAGACGCTTCTTCTTTCTTCTGACAATTGGGTATCAATTATTATGAGCGCATTTTGTTTGGCTTTGATGTTGTATTTAGAAGTGAAGTTTCTATTGTGTAACGTCTCTCAAACGCGTTACGGGGTTAATGACTATTCATTTGGCGTGGAAAATAATGCAATAGCGTTACAGAGACTACAAAAGACTAAACGTTATGAATGATTATTATGTGAACTTAATACAATGTATTAATTGCACCCTTTTATCATTATTTGTTTGCAGCTAATGATAGTGGTGGACTATGGTGGTGGGGTTATGCCTGCAATAAACCATTCACCATGGGATGGGTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCGCTTGCCCTTGCTTACAAGGTGATTCTTTATTCATTTGCCCTTTAACTTCCAAGCAATTGAAAACTTCAAAAACACTAATGTCTTTATTTTTAAAATTTGAATTATGTTCTTTTTACTTGGCATGTCATTGCCATTCATTTACTAAGCTTGAGAATTTCAAAAATCAAATTTGAATAAGTTCCGTAGGATGTATTATATTCCACTATTCCAACTACTTTTTACCCCAACATTAAGAAGATTATTATTGTTGTTGACCATTTCCCCCACGTGAAGAGTTCGCTTTTATTTTCCCTTTCCCTTTACACAGAAAATTCCAAGCCGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAGCTCTTGTTCTTAGGCCTCTTCCTTCAAGGTAAGTTACGAGAAATGTAATATTTATTCAATGTAAAATGTAATATTTATTCAATGTAAAACTTAGAACGTGTTCTCTCATCCGGAAGGGAATTAATCAAAACTAGTGAACTTTCTAAATTTCAATGCATAGGAGATTGTTTTAAATATTTGAGCGAGTTAGGTACAGTGCATTAATTTGTGAGGTCCATGCTGAAAAAGTAATCTTCTCTGTTCAGGAGGCTTTCTCCATGGCATAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGGTAACATTTTTCCTCTACAGTGACGCTGTACCGTTTGTTTCCTCAATTGTTTGCTATTTACTCGTTACTCTTTCCAGTTATTTTATGGTTTTCTTCCCTTCTCTACAGAGAATTGCAATAGCATATTTTCTGGCAGCTGTGTGTGAGATATGGCTAAAGGGAAGTGATTATGTGAATTCAGAAACTGCATTGCGGAGAAAGTATCAATTGCAGCTGTAAGATTCTTGAATGTCTTTTTCTTCTCTTTTTCCTCTTTAAATTTGTATCTCTTCAATCTAACCTTTCGATTTTCATTGGAACAATATTGGGCTTCTTTAACTCATTTTTCATGGTTGTTTTTATATTTATTCTAAATATTGCCAACTTCCTCTCAACAGGGTTGTTGCTGTCATCCTCACTACGTTATATCTTGTCCTGTCATATGGACTGTACGTTCCTGATTGGGAGTATCAAGTTCCAAGTCAATCTACTTCCAATATGGCCTCTCCAAAGACATTTTCTGTGAGCATTGAATCTTTTAAGTCATTATTTTCATTGATGGGTTAAGAATACCGGATTAGTTAGTTATAATTGGTTGCTTGAATGGCTCTCTCACTTAATGCCCAATTTCAGGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGGTGAATATTTGAGGACACTTTGTTTACTCCCTATAGGTTGTGCTTATCCTGATACTTCATTCTTTTCTTTTTTTGTTCTGTCTGTTCACAGCAATGCAGCATTAATTCACCAGACTATGGTCCATTGCCTCCTAATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGAATTTTAAGGTATCCATCTTTATTTTGATTCTATCTATATGCAATAAATTGGAGGCAATGCACAAAATTTACTGCAGTTTCAGTTTGTTATTCTTATCGTATGTCGTTGAATTTTGTAGCACAGTGATGGCTGTCGTGACCTGCTTGGTTGGCTTGCATTACGGGCATATCATTGTACATTTCAAAGTGAGTGAGACACTGCATCGTACATTGTACTTTTGCATACACTCGTGTGTGCATGAGTAAATTCTTTTTCGTCTGTCCTATGTAGAAAACAACAATTTCTAATCTATCTTATCAACTTGCAGGATCACCGAGACAGAATGCTTCATTGGATTATTCCTTCTTCTTGTCTAATTGTGTTGGCCCTTGGGTTAGACTTCGTAGGTGAGTGAGATTCGGTCAATTTGATGATATGATAGAAAGATTGAAGGGACAAGGACAAATAGTAACTCAAATTCTCTTGCAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTACTGGTGCAGCCGGACTTCTCTTTACCGGGATATACTTGATGGTATAAAACAACTTCATTCTCTGATCTATATCATGGTTATCGACAAGCACGGTTAGCGACTGATTGTGTACGGTTGATGCAGGTTGATGTGTACAGATGGAGGCGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATATGTTCTGGCTGCCTGTAACGTGCTGCCTGTGGTTCTCCAAGGCTTCTACTCAGGGCAGCCACAGAACAACATCGTAAGTTTGCTTCCCTTAATGAAATGGACTGACACGATCGTATCCACATTACTAGAGTGGTTGATTCTGAGATGTTATATATGTTTTTATTTAATTGGCAGCTGAGACTAATTGGAGTTACAACGTGA

mRNA sequence

ATGCCACTCCGGAAGGATATGGGGAAGTATGACTCCATCAAACCACTCCATGACTGTGATCTCGCCAATGAGACTGCTGTTCTCATAAATCCAGATTCCCTGACTCTCTTCTCTGTTTCTCACCACTGTAATCACTCTCATCAAGATGTTGAGATGGCTCTTCTTGATTCTCATTCCAGATCTCCTCTTCCTCTTCACAATGCCAATCCCTTAACTCCTCCTGCTTCTTCCAAACTCGATGACGCCCAATTTTCCTCTCCGGCTAGGCCCATTCTCCGATCTTCTCCCCAAGGCCAAGCCCAACGTCTTGCTTCGCTCGATGTATTTCGCGGCATCACTGTTGCGCTAATGATAGTGGTGGACTATGGTGGTGGGGTTATGCCTGCAATAAACCATTCACCATGGGATGGGTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCGCTTGCCCTTGCTTACAAGAAAATTCCAAGCCGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAGCTCTTGTTCTTAGGCCTCTTCCTTCAAGGAGGCTTTCTCCATGGCATAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATTGCAATAGCATATTTTCTGGCAGCTGTGTGTGAGATATGGCTAAAGGGAAGTGATTATGTGAATTCAGAAACTGCATTGCGGAGAAAGTATCAATTGCAGCTGGTTGTTGCTGTCATCCTCACTACGTTATATCTTGTCCTGTCATATGGACTGTACGTTCCTGATTGGGAGTATCAAGTTCCAAGTCAATCTACTTCCAATATGGCCTCTCCAAAGACATTTTCTGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATTCACCAGACTATGGTCCATTGCCTCCTAATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGAATTTTAAGCACAGTGATGGCTGTCGTGACCTGCTTGGTTGGCTTGCATTACGGGCATATCATTGTACATTTCAAAGATCACCGAGACAGAATGCTTCATTGGATTATTCCTTCTTCTTGTCTAATTGTGTTGGCCCTTGGGTTAGACTTCGTAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTACTGGTGCAGCCGGACTTCTCTTTACCGGGATATACTTGATGGTTGATGTGTACAGATGGAGGCGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATATGTTCTGGCTGCCTGTAACGTGCTGCCTGTGGTTCTCCAAGGCTTCTACTCAGGGCAGCCACAGAACAACATCCTGAGACTAATTGGAGTTACAACGTGA

Coding sequence (CDS)

ATGCCACTCCGGAAGGATATGGGGAAGTATGACTCCATCAAACCACTCCATGACTGTGATCTCGCCAATGAGACTGCTGTTCTCATAAATCCAGATTCCCTGACTCTCTTCTCTGTTTCTCACCACTGTAATCACTCTCATCAAGATGTTGAGATGGCTCTTCTTGATTCTCATTCCAGATCTCCTCTTCCTCTTCACAATGCCAATCCCTTAACTCCTCCTGCTTCTTCCAAACTCGATGACGCCCAATTTTCCTCTCCGGCTAGGCCCATTCTCCGATCTTCTCCCCAAGGCCAAGCCCAACGTCTTGCTTCGCTCGATGTATTTCGCGGCATCACTGTTGCGCTAATGATAGTGGTGGACTATGGTGGTGGGGTTATGCCTGCAATAAACCATTCACCATGGGATGGGTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCGCTTGCCCTTGCTTACAAGAAAATTCCAAGCCGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAGCTCTTGTTCTTAGGCCTCTTCCTTCAAGGAGGCTTTCTCCATGGCATAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATTGCAATAGCATATTTTCTGGCAGCTGTGTGTGAGATATGGCTAAAGGGAAGTGATTATGTGAATTCAGAAACTGCATTGCGGAGAAAGTATCAATTGCAGCTGGTTGTTGCTGTCATCCTCACTACGTTATATCTTGTCCTGTCATATGGACTGTACGTTCCTGATTGGGAGTATCAAGTTCCAAGTCAATCTACTTCCAATATGGCCTCTCCAAAGACATTTTCTGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATTCACCAGACTATGGTCCATTGCCTCCTAATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGAATTTTAAGCACAGTGATGGCTGTCGTGACCTGCTTGGTTGGCTTGCATTACGGGCATATCATTGTACATTTCAAAGATCACCGAGACAGAATGCTTCATTGGATTATTCCTTCTTCTTGTCTAATTGTGTTGGCCCTTGGGTTAGACTTCGTAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTACTGGTGCAGCCGGACTTCTCTTTACCGGGATATACTTGATGGTTGATGTGTACAGATGGAGGCGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATATGTTCTGGCTGCCTGTAACGTGCTGCCTGTGGTTCTCCAAGGCTTCTACTCAGGGCAGCCACAGAACAACATCCTGAGACTAATTGGAGTTACAACGTGA

Protein sequence

MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSRSPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTVSYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYSGQPQNNILRLIGVTT
Homology
BLAST of Csor.00g078610 vs. ExPASy Swiss-Prot
Match: Q3UDW8 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus OX=10090 GN=Hgsnat PE=1 SV=2)

HSP 1 Score: 129.4 bits (324), Expect = 1.1e-28
Identity = 115/420 (27.38%), Postives = 194/420 (46.19%), Query Frame = 0

Query: 76  SSKLDDAQFSSPARPILRSSP------QGQAQRLASLDVFRGITVALMIVVDYGGGVMPA 135
           + +L +++  SP+R    S+       +  A RL  +D FRG+ + LM+ V+YGGG    
Sbjct: 228 TDRLINSELGSPSRADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWY 287

Query: 136 INHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQG 195
             HS W+GLT+ADLV P+F+FI+G S+ L+   I  RG +  K + + +   FL L   G
Sbjct: 288 FKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFL-LICIG 347

Query: 196 GFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCE--IWLKGSDYVNSETALRRKYQ 255
             +   N     +   ++R  G+LQR+ + YF+ AV E   W    D    E++      
Sbjct: 348 VIIVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPVPDSCTLESSCFSLRD 407

Query: 256 L-----QLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDTG- 315
           +     Q +  + L +++L L++ L VP                  T  +  G  GD G 
Sbjct: 408 ITSSWPQWLTILTLESIWLALTFFLPVP---------------GCPTGYLGPGGIGDLGK 467

Query: 316 -PAC--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEG 375
            P C   A G IDR + G  HLY+ P                          +  +DPEG
Sbjct: 468 YPHCTGGAAGYIDRLLLGDNHLYQHP------------------SSTVLYHTEVAYDPEG 527

Query: 376 ILSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLI-VLALGLDFVGMH----- 435
           +L T+ ++V   +G+  G I+V++KD    +L       C++ ++++ L  V  +     
Sbjct: 528 VLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIP 587

Query: 436 INKVLYTVSYMSVTTGAAGLLFTGIYLMVDV--------YRWRRMNVVMEWMGKHALVIY 465
           INK L+++SY++  +  A  +   +Y +VDV        + +  MN ++ ++G   L  Y
Sbjct: 588 INKNLWSISYVTTLSCFAFFILLILYPVVDVKGLWTGTPFFYPGMNSILVYVGHEVLENY 613

BLAST of Csor.00g078610 vs. ExPASy Swiss-Prot
Match: Q68CP4 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens OX=9606 GN=HGSNAT PE=1 SV=2)

HSP 1 Score: 120.9 bits (302), Expect = 4.0e-26
Identity = 106/385 (27.53%), Postives = 182/385 (47.27%), Query Frame = 0

Query: 102 RLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYK 161
           RL S+D FRGI + LM+ V+YGGG      H+ W+GLT+ADLV P+F+FI+G S+ L+  
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326

Query: 162 KIPSRGIA----TQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIA 221
            I  RG +      K   R+  L+ +G+ +        N     +   ++R  G+LQR+ 
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLG 386

Query: 222 IAYFLAAVCEIWLKG--SDYVNSETALRRKYQL-----QLVVAVILTTLYLVLSYGLYVP 281
           + YF+ AV E+       ++  SE +      +     Q ++ ++L  L+L L++ L VP
Sbjct: 387 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 446

Query: 282 DWEYQVPSQSTSNMASPKTFSVKCGTRGDTG--PAC--NAVGMIDRKIFGIQHLYKRPIY 341
                             T  +  G  GD G  P C   A G IDR + G  HLY+ P  
Sbjct: 447 ---------------GCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSS 506

Query: 342 ARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFKDHR 401
           A      +                   +DPEGIL T+ ++V   +G+  G I++++K   
Sbjct: 507 AVLYHTEV------------------AYDPEGILGTINSIVMAFLGVQAGKILLYYKART 566

Query: 402 DRMLHWIIPSSCLI-VLALGLDFVG-----MHINKVLYTVSYMSVTTGAAGLLFTGIYLM 461
             +L       C++ ++++ L  V      + +NK L+++SY++  +  A  +   +Y +
Sbjct: 567 KDILIRFTAWCCILGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPV 612

Query: 462 VDVYRWRRMNVVMEWMGKHALVIYV 466
           VDV +         + G +++++YV
Sbjct: 627 VDV-KGLWTGTPFFYPGMNSILVYV 612

BLAST of Csor.00g078610 vs. NCBI nr
Match: KAG6581643.1 (Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 997 bits (2577), Expect = 0.0
Identity = 496/496 (100.00%), Postives = 496/496 (100.00%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR
Sbjct: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIGVTT
Sbjct: 481 SGQPQNNILRLIGVTT 496

BLAST of Csor.00g078610 vs. NCBI nr
Match: KAG7018137.1 (Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 996 bits (2575), Expect = 0.0
Identity = 495/496 (99.80%), Postives = 496/496 (100.00%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           MP+RKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR
Sbjct: 1   MPIRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIGVTT
Sbjct: 481 SGQPQNNILRLIGVTT 496

BLAST of Csor.00g078610 vs. NCBI nr
Match: XP_022935357.1 (heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita moschata])

HSP 1 Score: 993 bits (2566), Expect = 0.0
Identity = 493/496 (99.40%), Postives = 495/496 (99.80%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           MP+RKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 1   MPIRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIGVTT
Sbjct: 481 SGQPQNNILRLIGVTT 496

BLAST of Csor.00g078610 vs. NCBI nr
Match: XP_023528239.1 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 978 bits (2529), Expect = 0.0
Identity = 488/496 (98.39%), Postives = 491/496 (98.99%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           MP+RKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 92  MPIRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 151

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSS ARPILRSSP GQ  RLASLDVFRGITVALMIVV
Sbjct: 152 SPLPLHNANPLTPPASSKLDDAQFSSSARPILRSSPPGQ--RLASLDVFRGITVALMIVV 211

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 212 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 271

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 272 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 331

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 332 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 391

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 392 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 451

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV
Sbjct: 452 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 511

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 512 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 571

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIG+TT
Sbjct: 572 SGQPQNNILRLIGITT 585

BLAST of Csor.00g078610 vs. NCBI nr
Match: XP_022983292.1 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 973 bits (2516), Expect = 0.0
Identity = 484/496 (97.58%), Postives = 490/496 (98.79%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           M +RKDMGKYD IKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSS ARP+LRSSPQGQ  RLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQ--RLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLA+GLDF+GMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIG+TT
Sbjct: 481 SGQPQNNILRLIGITT 494

BLAST of Csor.00g078610 vs. ExPASy TrEMBL
Match: A0A6J1F5B2 (heparan-alpha-glucosaminide N-acetyltransferase-like OS=Cucurbita moschata OX=3662 GN=LOC111442267 PE=4 SV=1)

HSP 1 Score: 993 bits (2566), Expect = 0.0
Identity = 493/496 (99.40%), Postives = 495/496 (99.80%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           MP+RKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 1   MPIRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIGVTT
Sbjct: 481 SGQPQNNILRLIGVTT 496

BLAST of Csor.00g078610 vs. ExPASy TrEMBL
Match: A0A6J1IYW6 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481916 PE=4 SV=1)

HSP 1 Score: 973 bits (2516), Expect = 0.0
Identity = 484/496 (97.58%), Postives = 490/496 (98.79%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           M +RKDMGKYD IKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSS ARP+LRSSPQGQ  RLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQ--RLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLA+GLDF+GMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLIGVTT 496
           SGQPQNNILRLIG+TT
Sbjct: 481 SGQPQNNILRLIGITT 494

BLAST of Csor.00g078610 vs. ExPASy TrEMBL
Match: A0A6J1J7D2 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111481916 PE=4 SV=1)

HSP 1 Score: 962 bits (2486), Expect = 0.0
Identity = 478/492 (97.15%), Postives = 485/492 (98.58%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           M +RKDMGKYD IKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSH+DVEMALLDSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLPLHNANPLTPPASSKLDDAQFSS ARP+LRSSPQGQ  RLASLDVFRGITVALMIVV
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQ--RLASLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLA+GLDF+GMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY
Sbjct: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480

Query: 481 SGQPQNNILRLI 492
           SGQPQNNI+ L+
Sbjct: 481 SGQPQNNIVSLL 490

BLAST of Csor.00g078610 vs. ExPASy TrEMBL
Match: A0A5A7T699 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00540 PE=4 SV=1)

HSP 1 Score: 891 bits (2302), Expect = 0.0
Identity = 442/494 (89.47%), Postives = 462/494 (93.52%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           M +RKDMG Y+ IK   DCDL NETA+LINPDS+TL SVS HCN S +DVEMAL  SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLP+HNANPLT P SSK+D+ QFSS  RPILRSS Q    RL SLDVFRGITVALMIVV
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQ--CHRLVSLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DY GGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHG+NNLTYGVDIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAV+LT LYLVLSYGLYVPDWEYQVPS + SN+ASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVVLTLLYLVLSYGLYVPDWEYQVPSLTPSNVASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSIN+PDYGPLPP+APSWCQAPFDPEG+L
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLA+GLDF+GMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVT GAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY
Sbjct: 421 SYMSVTAGAAGLLFTGIYLMVDVYSWRRMNVVMEWMGKHALVIYVLAACNVLPVILQGFY 480

Query: 481 SGQPQNNILRLIGV 494
            GQPQNNILRLIGV
Sbjct: 481 LGQPQNNILRLIGV 492

BLAST of Csor.00g078610 vs. ExPASy TrEMBL
Match: A0A1S3CNA5 (heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo OX=3656 GN=LOC103502834 PE=4 SV=1)

HSP 1 Score: 891 bits (2302), Expect = 0.0
Identity = 442/494 (89.47%), Postives = 462/494 (93.52%), Query Frame = 0

Query: 1   MPLRKDMGKYDSIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHQDVEMALLDSHSR 60
           M +RKDMG Y+ IK   DCDL NETA+LINPDS+TL SVS HCN S +DVEMAL  SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTPPASSKLDDAQFSSPARPILRSSPQGQAQRLASLDVFRGITVALMIVV 120
           SPLP+HNANPLT P SSK+D+ QFSS  RPILRSS Q    RL SLDVFRGITVALMIVV
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQ--CHRLVSLDVFRGITVALMIVV 120

Query: 121 DYGGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180
           DY GGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL
Sbjct: 121 DYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKL 180

Query: 181 LFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSET 240
           LFLGLFLQGGFLHG+NNLTYGVDIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSET
Sbjct: 181 LFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSET 240

Query: 241 ALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKTFSVKCGTRGDT 300
           ALRRKYQLQLVVAV+LT LYLVLSYGLYVPDWEYQVPS + SN+ASPK FSVKCGTRGDT
Sbjct: 241 ALRRKYQLQLVVAVVLTLLYLVLSYGLYVPDWEYQVPSLTPSNVASPKIFSVKCGTRGDT 300

Query: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGIL 360
           GPACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSIN+PDYGPLPP+APSWCQAPFDPEG+L
Sbjct: 301 GPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLL 360

Query: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLALGLDFVGMHINKVLYTV 420
           STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLA+GLDF+GMHINKVLYTV
Sbjct: 361 STVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTV 420

Query: 421 SYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFY 480
           SYMSVT GAAGLLFTGIYLMVDVY WRRMNVVMEWMGKHALVIYVLAACNVLPV+LQGFY
Sbjct: 421 SYMSVTAGAAGLLFTGIYLMVDVYSWRRMNVVMEWMGKHALVIYVLAACNVLPVILQGFY 480

Query: 481 SGQPQNNILRLIGV 494
            GQPQNNILRLIGV
Sbjct: 481 LGQPQNNILRLIGV 492

BLAST of Csor.00g078610 vs. TAIR 10
Match: AT5G47900.1 (Protein of unknown function (DUF1624) )

HSP 1 Score: 547.7 bits (1410), Expect = 9.3e-156
Identity = 257/402 (63.93%), Postives = 326/402 (81.09%), Query Frame = 0

Query: 94  SSPQGQAQRLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVG 153
           SS     +RL SLDVFRG+TVA MI+VD  GG++P+INHSPWDG+TLAD VMPFFLFIVG
Sbjct: 37  SSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVG 96

Query: 154 VSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGIL 213
           VSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGIL
Sbjct: 97  VSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGIL 156

Query: 214 QRIAIAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWE 273
           QRIAIAY + A+CEIWLKG+  V+SE ++ +KY+   VVA ++TT+YL L YGLYVPDWE
Sbjct: 157 QRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWE 216

Query: 274 YQVPSQST-SNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQC 333
           YQ+  +   S + +     VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QC
Sbjct: 217 YQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQC 276

Query: 334 SINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHW 393
           SIN P+ GPLPP+APSWCQAPFDPEG+LS++MA VTCLVGLHYGHII+HFKDH+ R+  W
Sbjct: 277 SINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQW 336

Query: 394 IIPSSCLIVLALGLDFVGMHINKVLYTVSYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVV 453
           I+ S CL++L L L+  GMH+NK LYT+SYM VT+GA+G L + IYLMVDVY ++R ++V
Sbjct: 337 ILRSFCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLV 396

Query: 454 MEWMGKHALVIYVLAACNVLPVVLQGFYSGQPQNNILRLIGV 495
           +EWMG HAL IYVL ACN++ +++ GFY   P NN+L LIG+
Sbjct: 397 LEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLLHLIGI 438

BLAST of Csor.00g078610 vs. TAIR 10
Match: AT5G47900.7 (Protein of unknown function (DUF1624) )

HSP 1 Score: 466.5 bits (1199), Expect = 2.7e-131
Identity = 228/386 (59.07%), Postives = 285/386 (73.83%), Query Frame = 0

Query: 94  SSPQGQAQRLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVG 153
           SS     +RL SLDVFRG+TVA MI+VD  GG++P+INHSPWDG+TLAD VMPFFLFIVG
Sbjct: 37  SSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVG 96

Query: 154 VSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGIL 213
           VSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGIL
Sbjct: 97  VSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGIL 156

Query: 214 QRIAIAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWE 273
           QRIAIAY + A+CEIWLKG+  V+SE ++ +KY+   VVA ++TT+YL L YGLYVPDWE
Sbjct: 157 QRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWE 216

Query: 274 YQVPSQST-SNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQC 333
           YQ+  +   S + +     VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QC
Sbjct: 217 YQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQC 276

Query: 334 SINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFK--------- 393
           SIN P+ GPLPP+APSWCQAPFDPEG+LS++MA VTCLVGLHYGHII+HFK         
Sbjct: 277 SINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVY 336

Query: 394 ----------------------------DHRDRMLHWIIPSSCLIVLALGLDFVGMHINK 442
                                       DH+ R+  WI+ S CL++L L L+  GMH+NK
Sbjct: 337 NEPSISIRPFFFILSETYLLLYVINFLQDHKKRLNQWILRSFCLLMLGLALNLFGMHLNK 396

BLAST of Csor.00g078610 vs. TAIR 10
Match: AT5G47900.4 (Protein of unknown function (DUF1624) )

HSP 1 Score: 465.7 bits (1197), Expect = 4.7e-131
Identity = 234/409 (57.21%), Postives = 302/409 (73.84%), Query Frame = 0

Query: 94  SSPQGQAQRLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVG 153
           SS     +RL SLDVFRG+TVA MI+VD  GG++P+INHSPWDG+TLAD VMPFFLFIVG
Sbjct: 31  SSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVG 90

Query: 154 VSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGIL 213
           VSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGIL
Sbjct: 91  VSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGIL 150

Query: 214 QRIAIAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWE 273
           QRIAIAY + A+CEIWLKG+  V+SE ++ +KY+   VVA ++TT+YL L YGLYVPDWE
Sbjct: 151 QRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWE 210

Query: 274 YQVPSQST-SNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQC 333
           YQ+  +   S + +     VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QC
Sbjct: 211 YQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQC 270

Query: 334 SINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHW 393
           SIN P+ GPLPP+APSWCQAPFDPEG+LS++MA VTCLVGLHYGHII+HFK +  +   +
Sbjct: 271 SINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVY 330

Query: 394 IIPSSCLIVLALGLDFVGMHINKVLYTVSYMSVTTGAAGLLFTGIYL-------MVDVYR 453
             PS  + +      F  M     L +     V +    L   GI++       +VDVY 
Sbjct: 331 NEPS--ISIRRSQKAFESMDFTFFLSS----DVRSRTEPLWGLGIFVIRDIPNGLVDVYG 390

Query: 454 WRRMNVVMEWMGKHALVIYVLAACNVLPVVLQGFYSGQPQNNILRLIGV 495
           ++R ++V+EWMG HAL IYVL ACN++ +++ GFY   P NN+L LIG+
Sbjct: 391 YKRASLVLEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLLHLIGI 433

BLAST of Csor.00g078610 vs. TAIR 10
Match: AT5G47900.6 (Protein of unknown function (DUF1624) )

HSP 1 Score: 454.1 bits (1167), Expect = 1.4e-127
Identity = 208/338 (61.54%), Postives = 274/338 (81.07%), Query Frame = 0

Query: 158 LAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIA 217
           +++  +PS+ +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGILQRIA
Sbjct: 1   MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60

Query: 218 IAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVP 277
           IAY + A+CEIWLKG+  V+SE ++ +KY+   VVA ++TT+YL L YGLYVPDWEYQ+ 
Sbjct: 61  IAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 120

Query: 278 SQST-SNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINS 337
            +   S + +     VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QCSIN 
Sbjct: 121 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 180

Query: 338 PDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPS 397
           P+ GPLPP+APSWCQAPFDPEG+LS++MA VTCLVGLHYGHII+HFKDH+ R+  WI+ S
Sbjct: 181 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 240

Query: 398 SCLIVLALGLDFVGMHINKVLYTVSYMSVTTGAAGLLFTGIYLMVDVYRWRRMNVVMEWM 457
            CL++L L L+  GMH+NK LYT+SYM VT+GA+G L + IYLMVDVY ++R ++V+EWM
Sbjct: 241 FCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLVLEWM 300

Query: 458 GKHALVIYVLAACNVLPVVLQGFYSGQPQNNILRLIGV 495
           G HAL IYVL ACN++ +++ GFY   P NN+L LIG+
Sbjct: 301 GIHALPIYVLIACNLVFLIIHGFYWKNPINNLLHLIGI 338

BLAST of Csor.00g078610 vs. TAIR 10
Match: AT5G47900.2 (Protein of unknown function (DUF1624) )

HSP 1 Score: 416.4 bits (1069), Expect = 3.2e-116
Identity = 195/291 (67.01%), Postives = 241/291 (82.82%), Query Frame = 0

Query: 94  SSPQGQAQRLASLDVFRGITVALMIVVDYGGGVMPAINHSPWDGLTLADLVMPFFLFIVG 153
           SS     +RL SLDVFRG+TVA MI+VD  GG++P+INHSPWDG+TLAD VMPFFLFIVG
Sbjct: 37  SSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVG 96

Query: 154 VSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGINNLTYGVDIQQIRWMGIL 213
           VSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGIL
Sbjct: 97  VSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGIL 156

Query: 214 QRIAIAYFLAAVCEIWLKGSDYVNSETALRRKYQLQLVVAVILTTLYLVLSYGLYVPDWE 273
           QRIAIAY + A+CEIWLKG+  V+SE ++ +KY+   VVA ++TT+YL L YGLYVPDWE
Sbjct: 157 QRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWE 216

Query: 274 YQVPSQST-SNMASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQC 333
           YQ+  +   S + +     VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QC
Sbjct: 217 YQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQC 276

Query: 334 SINSPDYGPLPPNAPSWCQAPFDPEGILSTVMAVVTCLVGLHYGHIIVHFK 384
           SIN P+ GPLPP+APSWCQAPFDPEG+LS++MA VTCLVGLHYGHII+HFK
Sbjct: 277 SINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 327

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3UDW81.1e-2827.38Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus OX=10090 GN=Hgsn... [more]
Q68CP44.0e-2627.53Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens OX=9606 GN=HGSNA... [more]
Match NameE-valueIdentityDescription
KAG6581643.10.0100.00Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma... [more]
KAG7018137.10.099.80Heparan-alpha-glucosaminide N-acetyltransferase, partial [Cucurbita argyrosperma... [more]
XP_022935357.10.099.40heparan-alpha-glucosaminide N-acetyltransferase-like [Cucurbita moschata][more]
XP_023528239.10.098.39heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Cucurbita pepo ... [more]
XP_022983292.10.097.58heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 [Cucurbita maxim... [more]
Match NameE-valueIdentityDescription
A0A6J1F5B20.099.40heparan-alpha-glucosaminide N-acetyltransferase-like OS=Cucurbita moschata OX=36... [more]
A0A6J1IYW60.097.58heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 OS=Cucurbita max... [more]
A0A6J1J7D20.097.15heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 OS=Cucurbita max... [more]
A0A5A7T6990.089.47Heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo var. makuwa OX=1... [more]
A0A1S3CNA50.089.47heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT5G47900.19.3e-15663.93Protein of unknown function (DUF1624) [more]
AT5G47900.72.7e-13159.07Protein of unknown function (DUF1624) [more]
AT5G47900.44.7e-13157.21Protein of unknown function (DUF1624) [more]
AT5G47900.61.4e-12761.54Protein of unknown function (DUF1624) [more]
AT5G47900.23.2e-11667.01Protein of unknown function (DUF1624) [more]
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012429Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domainPFAMPF07786DUF1624coord: 102..226
e-value: 4.2E-7
score: 29.8
NoneNo IPR availablePANTHERPTHR31061LD22376Pcoord: 80..495
NoneNo IPR availablePANTHERPTHR31061:SF31HEPARAN-ALPHA-GLUCOSAMINIDE N-ACETYLTRANSFERASE-LIKEcoord: 80..495

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csor.00g078610.m01Csor.00g078610.m01mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity