CaUC10G191580 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC10G191580
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: heparan-alpha-glucosaminide N-acetyltransferase
Location: Ciama_Chr10: 25897182 .. 25903420 (+)
RNA-Seq Expression: CaUC10G191580
Synteny: CaUC10G191580
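The Location field above can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Ciama_Chr10: 25897182 .. 25903420 (+)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(seqid, span, strand)  # Ciama_Chr10 6239 +
```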
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTATTTTTTCTGGGTTGGTTGGTTGGAACTATAAATAAGCAAGTAAGAAGCAGAGGGGCCAATGGAAGTGATTGAGAGAAAGAGAAAAGGGGTTGCAGAAGAAGGAAAGCAAAGAATTTAGGGGAAAACTGGGAAGCCATTCCAACCCAGCAGGGAAGATTTTCTTTCCTTCTTCCCTTCTTCCCTTCTTCCCTTCTTCCTTTTCCCTTAACCCTTCTTCTTCTTCTTCTTCCTCTTTTTTAAGTTCATTCTCTTGCTCGATCTCTCTGTCCTTCATCATCCTCCAACTTCACTTCCCACTTAGTTTCGGTATATGGTGTTGTTCTTGTTGTAATGGCAATCCGTAAAGATATGGGTAACTACGAGCCTATCAAAGGAGTCGATGACTGTGATCTCGTCAATGAGACTGCTATTCTCATAAATCCAGATTCCTTTACTCTAGTCTCTGTTTCTAAGCACTGTAATCAGACTGATGAAGATGTTGAGATGGCTCTTCGTGATTCACATTCCAGATCTCCTCTTCCTCTCCACAATGCCAATCCCTTAACCACTACTGTTTCTTCCAAAATCGATGACCCTCAATTTTCTTCTTCTGCTAGACCCATTCTCCGATCTTCCGACCAACGCCAACGCCTTGTTTCGCTTGATGTATTTCGCGGTCTCACTGTTGCGGTAATCATTTTCTTTCACTTCCTTTCTTTCTTTATTCTTACCCAGGATTGCTTTCGTTTAACTTCTTGTTTGGATTTTGATGATTCTTTTCTCCCCATGTTGGTGTTTTTTTAATTGGATGGAATGATTTTATTTGTTGTTTGTGTTATCACCGATTGAGATTTCTCGTAATCGCATGGATTGGAGGAAAGAACATTAGTCTTATTGTGGACTTGTTTTACTATTTTTTAAAAAAGGGTTTATAAAAAAAAATATCAGGATTTGACTTGCAGAATTTTACCAGCCGTCGGTGGAGATTGGAAGAAACAATATCTATTTTGACTTTTCAAACTGAAGTATTTAATAGAGGGGGTTTCAGTTTTTAGAAAACTACCCATGCCAGTATTAATCGGAGTTTTGATCCTCGAATGATGATACGAGTTCCTTCATTTAGGGAGATTTTAGGGGAAGAGATGAAACCTAATTTTGACCATGTTTCCGAAATTTTAGATTGGGAAATAGAAAAAGAAAAAAAGAAGTTTTTACCATTATTTCAAGTTGTTTTAAAATAAAATATATCATGTTCTACAATCTGTCAACCATTCATTTGTCATCCCCTTTTGAATAAACAAAGGGTATTGATCGGTGAAGAATATAAGTTTGATAGTGTAGAATCAGGGGGGCCACTAATTGGGTTTAAAGATAATATTAAACAAAACCGCGTGCAATGAAGGAGACGTATTATTTCTGTAGGCATGCATGATTCAAGCTGTGCATGATGACGCAGCAAGCAAGCTTCATAGTTTTGCTGGTTGTCTTTAATGGCTATTTGGTTTAGAAGGTGCTTGAAGAAGAAGAAGAAGAGGAACACTTTAACCCTTCTTTTTCATTTGTTTTCTGTAAGGCTTTGCCCTGAAGAAACCATGAACAAAACACTGCCTTGCGGCAACTTTACTCTTTCAAGAACAGCTGGTAGATGGACACGTAAAGACCCATTTGCCATCTTTTGACCCGTTCCTTTGATGTGATCTTTCACCAGTGTCTAATTTAGGTGTAGCTGTTGAGCAAAATCTTTCTGGTGCTTTTAATTACTTTAACAATTGTGGGTTTCCGATTTGGCAAGCTTTCTTTTTATCGGTTAGGTTATGGCCTAAAGTTGCGTTTAGGATTGACATGCTTGGATATTTTACATGTTGCCATACCCAACTATGGGGGTGCTTCGTACAGTTCTTGATGGAAGGAAATGGTTGGACAGTTTGTAATTTAAGAGCTAGTAATTTTGATAGAGTAATCTAATTTTTCTATACATATAATTTCATAGTTTTGATGGAACCGAAGTGACATGA
CACTTTGTTTCTTTTTAACTTTTTTCTTAATATTATCTTCTATACCAAGTCAAAAGTGAAAGTTGAGAGATCGAGCTCCAAAATGTTGTAAGATTAGATGGATTGTTACAAGAGAATAGTTGCGCTAAGCACAACCTTGTCTAGATACTTATGAAAAGAGAAATAAGAAAATAGAAAAAGAAAAGAGATCTGGGGTTAGATGGCAACTGACTTTTGTTTGTTAAGAAGCATATTAATTCCTAGTAGTTATTAGCTTAGGTTCTAACCATAGATCGTGATGAACACATTGAATAGCCTAAGTTCAACAAGTTGACTTTGTTTTCATTTATGTCTGTTGAATGAGGTATGGAATTGTTCAGTGGGAAAGAAAACAAAATAATGGTCCTTATTGTGGTTTATTACGCGATCAATGGACTTATAAATCTTAGTATAAGACCATATTAGACCTTCTTCTCTCAGTGCTGACGATAATTGGAAGTTTTTGATTTGCAATTTGTAGCTCTGAGTGGTTGTGAGATGAGATCATAATGTGTGCATTTTGTTTGGTTTCAATGTTGTATTGAGGTGTGATGTTATTATTGTAATAATGTGTCTCAAACAGTTACGGGGTTAATGCCTCTGTTCAGTTGGCCTTGGAAAATAATGGGATGACTGAATAGAGACTACAAAAGACAAAATGTTATGAATGATTATTATGTGAACTTAATTCAATATATTAATTGTCCTTAAAAAAAAAAAAAATCAATATATTAATTGCGGCTCAACCACTATCCAAAAAAGGGAAATTAATTTCTTCATTTTATCCTTATTTATTTGCAGCTAATGATAGTCGTGGACTATGCTGGTGGTGTTATGCCTGCAATAAATCATTCACCATGGAATGGTTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCACTTGCCCTTGCTTACAAGGTAGTTCTTTATCCATTGCCCTTTTAACTCACAAGCAACTGAAAACCGCACAATACAAATGCCTGTATGTTTAAAAAATAAAATTTAAATTTAAATTAGGTTCTTTTTACTTTAAAATTGGGCCTTGGCATGTCACGGACATTCCTTTACTAAGCTAGAAAAATTTCAAAAATCAAATTTGAACGCTTTGGAGTTTTCTAGGAGGCAATCCGCCTGCTACGTTCTTCCATTATTCCAACTACTTTTTACCCCGAAATTAAGAAGATTATTGTTGTTGTTGTTCTATTTTCCCCACGTGAAATAAGTTTGTTTTTATTTCCCCTCCCCCTTAACCCAGAAAATTCCAAGCAGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAACTTTTGTTCTTAGGCCTCTTTCTTCAAGGTAACTTGCAAGAAATATAATACTAATTCAACTGTTATTTATGGATTTTTCCACTGCTGAAGTTAGAATCTGTTCTCTCATCTAGAAGGGAATTGATTAGAATCAGTAAACTTCCGAGATTATTTTAAATATTCGAGGGAGTTAGGTAGACGACATTATTTTGTGAGGTCTATGCTGAAAAAGTAACCTTCTCTGTTCAGGTGGTTTTCTCCATGGCGTAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGGTAACATTTTTCCTCTACATTGACGGTGGACCATTTTTTTTCTTCAATTTTTACTATTTCTCCTTACCCTTTTTTAATTATTTGATTTGATGGTTCTTTTTCATTTTTCTTCCCTTTTACTACAGAGAATAGCAATTGCATATTTCCTGGCAGCTCTGTGTGAGATATGGCTAAAGGGCAGTGATTATGTGAATTCAGAAACTGCACTGCGGAGAAAGTATCAATTACAGCTGTAAGATTCTTTTACCTTCCTTTTCTTCTCTAACTTTGTATCTCTTCTCTTATCCAAAAAAAACTTTGTATCTCTTCAATCTGATCTTTTAATTTTCATTTTTTTAATTGCATTG
CGTATAGGCGCAAATTAGGTTAGAAGAATATTGGGTTTCTTTAAATCACTCTTATCCTTCAGTGTCTGCTGGCCAATTTTCACAAACAAATTACTTTGATCGCTTAATAGTTGTGTGATATATATTTCATTAAAATCATGTCAGTTGGTACTCCGTGATATATGGGGTTAGAATTGTTTGTTTATAAAAATGTTCCGTTGACAAGAATGAAGTATTTGAGTTCACATTAGAAATTACTATTTTTTTTTTCTTTTTTCGTCTTATATTTATTTTAAATATTTCCAACTTCCTCTCAACAGGGTTGTCGCTGTTGTCCTCACAACGTTATATCTTGTCCTGTCATATGGATTGTATGTTCCTGATTGGGAGTACCAAGTTCCAAGTCTAACTACATCCAATGTGGCTTCTCCAAAGACATTTTCTGTGAGCTTTGAATCCTTAATTCATATTATTTTCATTCAGGGGTTAAGAATACCAGAATAGTAATAATTGGTTAGCTGAATGGCTGTCTCACTTAATGCCCCAATTTCAGGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGGTAAAAATTTGAGAAGACTTTTTTTTTACCTCCTAAAGGGTGTGCTTATCTTGATAATTCATGTTTCTTTTTTTGTCTGTTCTCAGCAATGCAGCATTAATGCACCAGACTATGGTCCATTGCCTCCTGATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGACTTTTAAGGTATCCATGTTTCTTCTGATTCTATCTATAGAAAATCATACTAATGCGTTATGTTAGTGCACAAAATTTATTGCAGCGTTTCAGTTTGTTATTCTTATCGCATCTCGTTGCATTTTGCAGCACAGTGATGGCTGTTGTGACCTGCTTGGTTGGCTTGCATTATGGGCATATCATTGTACATTTCAAAGTGAGTGAGACACTCTAGCTTACATTGTATTTTTACATGCATTTGTGCGTCTGTTCTATACAGAAAAAACAGTTTCTTATCTTTCTTATCTACTTTCAGGATCACCGAGACAGAATGCTTCATTGGATTATACCTTCTTCTTGTCTAATTGTGTTGGCCATTGGCTTGGATTTCTTAGGTGAGTGAGATTCGTTAAAAGTGATGATTTGATAGAGAGACTGAAGGGACTAACAAATCGTAACTCAAACTCTTTTGCAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTGCTGGTGCAGCTGGACTGCTCTTTACCGGGATATACTTGATGGTATTAAAACTTCTTTCTCTGATCTCTATTCATGATTCATTTGGTAACTAATACTGACGAGCACGGTTAACGGTTGGTTGTGTATGGTTGTGTGTAGGTTGATGTGTACAGATGGAGACGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATACGTTCTCGCTGCCTGCAACGTTCTGCCTGTGATTCTCCAAGGCTTCTATTCGGGGCAGCCTCAGAACAACATCGTAAGTTTGCTTCCCTAAATGATATGACACAACCTAATCTCCATTATTATAGTGATTGATTCTGAGTTGTTATATTATGCAATTGCCAGCTGAGGCTAATTGGAGTTCCAGCGTGAAGATGGTGCTGCCAAGGTCTGTCCCCTGGAATATATATAGACTAGACCACATAGCGTCGTTATGTCGATTGGTGGTTGATTGACATTTTCTATTAAAAGAGGAACAATAAGATAGAGAGGTATATGTTCGTTTGCACACTTGGCTCGTGTTTAATAGAAAAGATTAGTTTAGTTTAGTTTTGTGTAAAATATTTAGAAGGGAATGAAATGGAAGTTATATATTATTTTGGTGCTCTGCCTCTACACTCATTGTATACCAAGTTTCCTACATCTGCCCGT
ATAATTTACAAAAGCAACAGTGTGTGCTAGTTGTAGAAGATGATGATGACCTTAAATCTTTCCTTACTTTATGAAGGAAAGTTGTATTGATATCATTTGATTGATTGAATGAATGAATGAATGAATTCTCTACTTTTGTATTCATATACATATATAACTTTCTTGCTAATGTTGACAAAATATGTTGCAGTGAAAATGATGTGAGGTTTTCCAACATAGTTTATACAGTCGGTGAATGA

mRNA sequence

TTTTTATTTTTTCTGGGTTGGTTGGTTGGAACTATAAATAAGCAAGTAAGAAGCAGAGGGGCCAATGGAAGTGATTGAGAGAAAGAGAAAAGGGGTTGCAGAAGAAGGAAAGCAAAGAATTTAGGGGAAAACTGGGAAGCCATTCCAACCCAGCAGGGAAGATTTTCTTTCCTTCTTCCCTTCTTCCCTTCTTCCCTTCTTCCTTTTCCCTTAACCCTTCTTCTTCTTCTTCTTCCTCTTTTTTAAGTTCATTCTCTTGCTCGATCTCTCTGTCCTTCATCATCCTCCAACTTCACTTCCCACTTAGTTTCGGTATATGGTGTTGTTCTTGTTGTAATGGCAATCCGTAAAGATATGGGTAACTACGAGCCTATCAAAGGAGTCGATGACTGTGATCTCGTCAATGAGACTGCTATTCTCATAAATCCAGATTCCTTTACTCTAGTCTCTGTTTCTAAGCACTGTAATCAGACTGATGAAGATGTTGAGATGGCTCTTCGTGATTCACATTCCAGATCTCCTCTTCCTCTCCACAATGCCAATCCCTTAACCACTACTGTTTCTTCCAAAATCGATGACCCTCAATTTTCTTCTTCTGCTAGACCCATTCTCCGATCTTCCGACCAACGCCAACGCCTTGTTTCGCTTGATGTATTTCGCGGTCTCACTGTTGCGCTAATGATAGTCGTGGACTATGCTGGTGGTGTTATGCCTGCAATAAATCATTCACCATGGAATGGTTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCACTTGCCCTTGCTTACAAGAAAATTCCAAGCAGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAACTTTTGTTCTTAGGCCTCTTTCTTCAAGGTGGTTTTCTCCATGGCGTAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATAGCAATTGCATATTTCCTGGCAGCTCTGTGTGAGATATGGCTAAAGGGCAGTGATTATGTGAATTCAGAAACTGCACTGCGGAGAAAGTATCAATTACAGCTGGTTGTCGCTGTTGTCCTCACAACGTTATATCTTGTCCTGTCATATGGATTGTATGTTCCTGATTGGGAGTACCAAGTTCCAAGTCTAACTACATCCAATGTGGCTTCTCCAAAGACATTTTCTGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATGCACCAGACTATGGTCCATTGCCTCCTGATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGACTTTTAAGCACAGTGATGGCTGTTGTGACCTGCTTGGTTGGCTTGCATTATGGGCATATCATTGTACATTTCAAAGATCACCGAGACAGAATGCTTCATTGGATTATACCTTCTTCTTGTCTAATTGTGTTGGCCATTGGCTTGGATTTCTTAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTGCTGGTGCAGCTGGACTGCTCTTTACCGGGATATACTTGATGATGGAGACGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATACGTTCTCGCTGCCTGCAACGTTCTGCCTGTGATTCTCCAAGGCTTCTATTCGGGGCAGCCTCAGAACAACATCCTGAGGCTAATTGGAGTTCCAGCGTGAAGATGGTGCTGCCAAGTGAAAATGATGTGAGGTTTTCCAACATAGTTTATACAGTCGGTGAATGA

Coding sequence (CDS)

ATGGCAATCCGTAAAGATATGGGTAACTACGAGCCTATCAAAGGAGTCGATGACTGTGATCTCGTCAATGAGACTGCTATTCTCATAAATCCAGATTCCTTTACTCTAGTCTCTGTTTCTAAGCACTGTAATCAGACTGATGAAGATGTTGAGATGGCTCTTCGTGATTCACATTCCAGATCTCCTCTTCCTCTCCACAATGCCAATCCCTTAACCACTACTGTTTCTTCCAAAATCGATGACCCTCAATTTTCTTCTTCTGCTAGACCCATTCTCCGATCTTCCGACCAACGCCAACGCCTTGTTTCGCTTGATGTATTTCGCGGTCTCACTGTTGCGCTAATGATAGTCGTGGACTATGCTGGTGGTGTTATGCCTGCAATAAATCATTCACCATGGAATGGTTTAACCCTGGCAGATCTTGTAATGCCATTTTTTCTATTCATTGTTGGAGTTTCACTTGCCCTTGCTTACAAGAAAATTCCAAGCAGAGGCATTGCAACTCAGAAGGCTGTGTTACGGACGTTGAAACTTTTGTTCTTAGGCCTCTTTCTTCAAGGTGGTTTTCTCCATGGCGTAAACAATCTAACTTATGGAGTGGATATCCAGCAAATTAGATGGATGGGAATCTTACAGAGAATAGCAATTGCATATTTCCTGGCAGCTCTGTGTGAGATATGGCTAAAGGGCAGTGATTATGTGAATTCAGAAACTGCACTGCGGAGAAAGTATCAATTACAGCTGGTTGTCGCTGTTGTCCTCACAACGTTATATCTTGTCCTGTCATATGGATTGTATGTTCCTGATTGGGAGTACCAAGTTCCAAGTCTAACTACATCCAATGTGGCTTCTCCAAAGACATTTTCTGTGAAATGTGGCACACGTGGTGACACTGGACCAGCCTGCAATGCTGTGGGAATGATAGATCGTAAGATATTTGGTATTCAACATCTGTATAAAAGACCTATTTATGCACGGTCTGAGCAATGCAGCATTAATGCACCAGACTATGGTCCATTGCCTCCTGATGCTCCTTCTTGGTGTCAAGCTCCTTTTGATCCCGAAGGACTTTTAAGCACAGTGATGGCTGTTGTGACCTGCTTGGTTGGCTTGCATTATGGGCATATCATTGTACATTTCAAAGATCACCGAGACAGAATGCTTCATTGGATTATACCTTCTTCTTGTCTAATTGTGTTGGCCATTGGCTTGGATTTCTTAGGGATGCATATAAATAAGGTTCTTTATACAGTTAGTTACATGAGTGTCACTGCTGGTGCAGCTGGACTGCTCTTTACCGGGATATACTTGATGATGGAGACGCATGAATGTGGTGATGGAGTGGATGGGAAAGCATGCATTGGTGATATACGTTCTCGCTGCCTGCAACGTTCTGCCTGTGATTCTCCAAGGCTTCTATTCGGGGCAGCCTCAGAACAACATCCTGAGGCTAATTGGAGTTCCAGCGTGAAGATGGTGCTGCCAAGTGAAAATGATGTGAGGTTTTCCAACATAGTTTATACAGTCGGTGAATGA

Protein sequence

MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSRSPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMMETHECGDGVDGKACIGDIRSRCLQRSACDSPRLLFGAASEQHPEANWSSSVKMVLPSENDVRFSNIVYTVGE
Homology
BLAST of CaUC10G191580 vs. NCBI nr
Match: XP_038905626.1 (heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Benincasa hispida])

HSP 1 Score: 863.2 bits (2229), Expect = 1.1e-246
Identity = 428/442 (96.83%), Positives = 434/442 (98.19%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDL NETAILINPDS TLVSVSKHCNQTDEDVEMALRDSHSR
Sbjct: 1   MAIRKDMGNYEPIKGTDDCDLANETAILINPDSLTLVSVSKHCNQTDEDVEMALRDSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLPLHNANPLTT VSSKIDDPQFSSSARPILRSS+QRQRLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPLHNANPLTTPVSSKIDDPQFSSSARPILRSSEQRQRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQ+IRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQEIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVA VLTTLYLVLSYGLYV DWEYQVPSLTTSNVASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAAVLTTLYLVLSYGLYVSDWEYQVPSLTTSNVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442
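The percentages in each HSP header are ratios over the alignment length: identical positions (or identical plus conservatively substituted positions) divided by the number of aligned columns. A minimal sketch using the counts from the hit above; the helper name `percent` is illustrative.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the XP_038905626.1 HSP header.
print(percent(428, 442))  # identity: 96.83
print(percent(434, 442))  # identical + similar positions: 98.19
```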

BLAST of CaUC10G191580 vs. NCBI nr
Match: XP_008465168.1 (PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo] >KAA0037686.1 heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo var. makuwa] >TYK10687.1 heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo var. makuwa])

HSP 1 Score: 850.9 bits (2197), Expect = 5.7e-243
Identity = 422/442 (95.48%), Positives = 431/442 (97.51%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAVVLT LYLVLSYGLYVPDWEYQVPSLT SNVASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVVLTLLYLVLSYGLYVPDWEYQVPSLTPSNVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. NCBI nr
Match: XP_004141153.1 (heparan-alpha-glucosaminide N-acetyltransferase [Cucumis sativus])

HSP 1 Score: 848.2 bits (2190), Expect = 3.7e-242
Identity = 420/442 (95.02%), Positives = 430/442 (97.29%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLV AVVLT LYL LSYGLYVPDWEYQVPSLTTS+VASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. NCBI nr
Match: KAE8651279.1 (hypothetical protein Csa_000910 [Cucumis sativus])

HSP 1 Score: 848.2 bits (2190), Expect = 3.7e-242
Identity = 420/442 (95.02%), Positives = 430/442 (97.29%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLV AVVLT LYL LSYGLYVPDWEYQVPSLTTS+VASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. NCBI nr
Match: XP_022983293.1 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 816.2 bits (2107), Expect = 1.6e-232
Identity = 400/442 (90.50%), Positives = 420/442 (95.02%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           M+IRKDMG Y+PIK + DCDL NETA+LINPDS TL SVS HCN + EDVEMAL DSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLPLHNANPLT   SSK+DD QFSSSARP+LRSS Q QRL SLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQRLASLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
            GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 GGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHG+NNLTYGVDIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAV+LTTLYLVLSYGLYVPDWEYQVPS +TSN+ASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPP+APSWCQAPFDPEG+LST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGILST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVT GAAGLLFTGIYLM++ +
Sbjct: 421 MSVTTGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. ExPASy Swiss-Prot
Match: Q3UDW8 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus OX=10090 GN=Hgsnat PE=1 SV=2)

HSP 1 Score: 120.2 bits (300), Expect = 7.0e-26
Identity = 107/387 (27.65%), Positives = 179/387 (46.25%), Query Frame = 0

Query: 75  VSSKIDDPQ----FSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINH 134
           ++S++  P      S+  +P  R S    RL  +D FRGL + LM+ V+Y GG      H
Sbjct: 232 INSELGSPSRADPLSADYQPETRRS-SANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKH 291

Query: 135 SPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFL 194
           S WNGLT+ADLV P+F+FI+G S+ L+   I  RG +  K + + +   FL L   G  +
Sbjct: 292 SSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFL-LICIGVII 351

Query: 195 HGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCE--IWLKGSDYVNSETALRRKYQL-- 254
              N     +   ++R  G+LQR+ + YF+ A+ E   W    D    E++      +  
Sbjct: 352 VNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPVPDSCTLESSCFSLRDITS 411

Query: 255 ---QLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTG--PA 314
              Q +  + L +++L L++ L VP                  T  +  G  GD G  P 
Sbjct: 412 SWPQWLTILTLESIWLALTFFLPVP---------------GCPTGYLGPGGIGDLGKYPH 471

Query: 315 C--NAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLS 374
           C   A G IDR + G  HLY+ P                          +  +DPEG+L 
Sbjct: 472 CTGGAAGYIDRLLLGDNHLYQHP------------------SSTVLYHTEVAYDPEGVLG 531

Query: 375 TVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLI-VLAIGLDFLGMH-----INK 434
           T+ ++V   +G+  G I+V++KD    +L       C++ +++I L  +  +     INK
Sbjct: 532 TINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINK 583

Query: 435 VLYTVSYMSVTAGAAGLLFTGIYLMME 441
            L+++SY++  +  A  +   +Y +++
Sbjct: 592 NLWSISYVTTLSCFAFFILLILYPVVD 583

BLAST of CaUC10G191580 vs. ExPASy Swiss-Prot
Match: Q68CP4 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens OX=9606 GN=HGSNAT PE=1 SV=2)

HSP 1 Score: 113.2 bits (282), Expect = 8.5e-24
Identity = 97/358 (27.09%), Positives = 169/358 (47.21%), Query Frame = 0

Query: 100 RLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYK 159
           RL S+D FRG+ + LM+ V+Y GG      H+ WNGLT+ADLV P+F+FI+G S+ L+  
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326

Query: 160 KIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYF 219
            I  RG +  + + +     FL L   G  +   N     +   ++R  G+LQR+ + YF
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFL-LICIGIIIVNPNYCLGPLSWDKVRIPGVLQRLGVTYF 386

Query: 220 LAALCEIWLKG--SDYVNSETALRRKYQL-----QLVVAVVLTTLYLVLSYGLYVPDWEY 279
           + A+ E+       ++  SE +      +     Q ++ +VL  L+L L++ L VP    
Sbjct: 387 VVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP---- 446

Query: 280 QVPSLTTSNVASPKTFSVKCGTRGDTG--PAC--NAVGMIDRKIFGIQHLYKRPIYARSE 339
                         T  +  G  GD G  P C   A G IDR + G  HLY+ P  A   
Sbjct: 447 -----------GCPTGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLY 506

Query: 340 QCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRML 399
              +                   +DPEG+L T+ ++V   +G+  G I++++K     +L
Sbjct: 507 HTEV------------------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDIL 566

Query: 400 HWIIPSSCLI-VLAIGLDFLG-----MHINKVLYTVSYMSVTAGAAGLLFTGIYLMME 441
                  C++ ++++ L  +      + +NK L+++SY++  +  A  +   +Y +++
Sbjct: 567 IRFTAWCCILGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVD 590

BLAST of CaUC10G191580 vs. ExPASy TrEMBL
Match: A0A5A7T699 (Heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00540 PE=4 SV=1)

HSP 1 Score: 850.9 bits (2197), Expect = 2.8e-243
Identity = 422/442 (95.48%), Positives = 431/442 (97.51%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAVVLT LYLVLSYGLYVPDWEYQVPSLT SNVASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVVLTLLYLVLSYGLYVPDWEYQVPSLTPSNVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. ExPASy TrEMBL
Match: A0A1S3CNA5 (heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo OX=3656 GN=LOC103502834 PE=4 SV=1)

HSP 1 Score: 850.9 bits (2197), Expect = 2.8e-243
Identity = 422/442 (95.48%), Positives = 431/442 (97.51%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAVVLT LYLVLSYGLYVPDWEYQVPSLT SNVASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVVLTLLYLVLSYGLYVPDWEYQVPSLTPSNVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. ExPASy TrEMBL
Match: A0A0A0LFP0 (DUF1624 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G845460 PE=4 SV=1)

HSP 1 Score: 848.2 bits (2190), Expect = 1.8e-242
Identity = 420/442 (95.02%), Positives = 430/442 (97.29%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           MAIRKDMGNYEPIKG DDCDLVNETAILINPDS TLVSVSKHCNQ+DEDVEMALR SHSR
Sbjct: 1   MAIRKDMGNYEPIKGADDCDLVNETAILINPDSVTLVSVSKHCNQSDEDVEMALRGSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLP+HNANPLTT VSSKID+PQFSSS RPILRSSDQ  RLVSLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPIHNANPLTTPVSSKIDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
           AGGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 AGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLV AVVLT LYL LSYGLYVPDWEYQVPSLTTS+VASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYAR+EQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVTAGAAGLLFTGIYLM++ +
Sbjct: 421 MSVTAGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. ExPASy TrEMBL
Match: A0A6J1IYW6 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481916 PE=4 SV=1)

HSP 1 Score: 816.2 bits (2107), Expect = 7.5e-233
Identity = 400/442 (90.50%), Positives = 420/442 (95.02%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           M+IRKDMG Y+PIK + DCDL NETA+LINPDS TL SVS HCN + EDVEMAL DSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLPLHNANPLT   SSK+DD QFSSSARP+LRSS Q QRL SLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQRLASLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
            GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 GGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHG+NNLTYGVDIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAV+LTTLYLVLSYGLYVPDWEYQVPS +TSN+ASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPP+APSWCQAPFDPEG+LST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGILST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVT GAAGLLFTGIYLM++ +
Sbjct: 421 MSVTTGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. ExPASy TrEMBL
Match: A0A6J1J7D2 (heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111481916 PE=4 SV=1)

HSP 1 Score: 816.2 bits (2107), Expect = 7.5e-233
Identity = 400/442 (90.50%), Positives = 420/442 (95.02%), Query Frame = 0

Query: 1   MAIRKDMGNYEPIKGVDDCDLVNETAILINPDSFTLVSVSKHCNQTDEDVEMALRDSHSR 60
           M+IRKDMG Y+PIK + DCDL NETA+LINPDS TL SVS HCN + EDVEMAL DSHSR
Sbjct: 1   MSIRKDMGKYDPIKPLHDCDLANETAVLINPDSLTLFSVSHHCNHSHEDVEMALLDSHSR 60

Query: 61  SPLPLHNANPLTTTVSSKIDDPQFSSSARPILRSSDQRQRLVSLDVFRGLTVALMIVVDY 120
           SPLPLHNANPLT   SSK+DD QFSSSARP+LRSS Q QRL SLDVFRG+TVALMIVVDY
Sbjct: 61  SPLPLHNANPLTPPASSKLDDAQFSSSARPVLRSSPQGQRLASLDVFRGITVALMIVVDY 120

Query: 121 AGGVMPAINHSPWNGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180
            GGVMPAINHSPW+GLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF
Sbjct: 121 GGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLF 180

Query: 181 LGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETAL 240
           LGLFLQGGFLHG+NNLTYGVDIQQIRWMGILQRIAIAYFLAA+CEIWLKGSDYVNSETAL
Sbjct: 181 LGLFLQGGFLHGINNLTYGVDIQQIRWMGILQRIAIAYFLAAVCEIWLKGSDYVNSETAL 240

Query: 241 RRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVPSLTTSNVASPKTFSVKCGTRGDTGP 300
           RRKYQLQLVVAV+LTTLYLVLSYGLYVPDWEYQVPS +TSN+ASPK FSVKCGTRGDTGP
Sbjct: 241 RRKYQLQLVVAVILTTLYLVLSYGLYVPDWEYQVPSQSTSNMASPKIFSVKCGTRGDTGP 300

Query: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLST 360
           ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSIN+PDYGPLPP+APSWCQAPFDPEG+LST
Sbjct: 301 ACNAVGMIDRKIFGIQHLYKRPIYARSEQCSINSPDYGPLPPNAPSWCQAPFDPEGILST 360

Query: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420
           VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY
Sbjct: 361 VMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSY 420

Query: 421 MSVTAGAAGLLFTGIYLMMETH 443
           MSVT GAAGLLFTGIYLM++ +
Sbjct: 421 MSVTTGAAGLLFTGIYLMVDVY 442

BLAST of CaUC10G191580 vs. TAIR 10
Match: AT5G47900.1 (Protein of unknown function (DUF1624))

HSP 1 Score: 491.5 bits (1264), Expect = 8.2e-139
Identity = 237/362 (65.47%), Positives = 294/362 (81.22%), Query Frame = 0

Query: 87  SARPILRSSD---QRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVM 146
           SA  I RSS     ++RLVSLDVFRGLTVA MI+VD  GG++P+INHSPW+G+TLAD VM
Sbjct: 29  SALQISRSSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVM 88

Query: 147 PFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQ 206
           PFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D++
Sbjct: 89  PFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVE 148

Query: 207 QIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSY 266
           +IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   VVA V+TT+YL L Y
Sbjct: 149 KIRLMGILQRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLY 208

Query: 267 GLYVPDWEYQVPSLTTSNVASPKTF---SVKCGTRGDTGPACNAVGMIDRKIFGIQHLYK 326
           GLYVPDWEYQ+  L     ++  TF    VKCG RG TGP CNAVGM+DR   GIQHLY+
Sbjct: 209 GLYVPDWEYQI--LKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYR 268

Query: 327 RPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHF 386
           +P+YAR++QCSIN P+ GPLPPDAPSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HF
Sbjct: 269 KPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHF 328

Query: 387 KDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMME 443
           KDH+ R+  WI+ S CL++L + L+  GMH+NK LYT+SYM VT+GA+G L + IYLM++
Sbjct: 329 KDHKKRLNQWILRSFCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVD 388

BLAST of CaUC10G191580 vs. TAIR 10
Match: AT5G47900.7 (Protein of unknown function (DUF1624))

HSP 1 Score: 472.2 bits (1214), Expect = 5.1e-133
Identity = 240/405 (59.26%), Positives = 296/405 (73.09%), Query Frame = 0

Query: 87  SARPILRSSD---QRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVM 146
           SA  I RSS     ++RLVSLDVFRGLTVA MI+VD  GG++P+INHSPW+G+TLAD VM
Sbjct: 29  SALQISRSSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVM 88

Query: 147 PFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQ 206
           PFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D++
Sbjct: 89  PFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVE 148

Query: 207 QIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSY 266
           +IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   VVA V+TT+YL L Y
Sbjct: 149 KIRLMGILQRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLY 208

Query: 267 GLYVPDWEYQVPSLTTSNVASPKTF---SVKCGTRGDTGPACNAVGMIDRKIFGIQHLYK 326
           GLYVPDWEYQ+  L     ++  TF    VKCG RG TGP CNAVGM+DR   GIQHLY+
Sbjct: 209 GLYVPDWEYQI--LKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYR 268

Query: 327 RPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHF 386
           +P+YAR++QCSIN P+ GPLPPDAPSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HF
Sbjct: 269 KPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHF 328

Query: 387 K-------------------------------------DHRDRMLHWIIPSSCLIVLAIG 444
           K                                     DH+ R+  WI+ S CL++L + 
Sbjct: 329 KRNGSKGQVYNEPSISIRPFFFILSETYLLLYVINFLQDHKKRLNQWILRSFCLLMLGLA 388

BLAST of CaUC10G191580 vs. TAIR 10
Match: AT5G47900.2 (Protein of unknown function (DUF1624))

HSP 1 Score: 421.4 bits (1082), Expect = 1.0e-117
Identity = 206/301 (68.44%), Positives = 248/301 (82.39%), Query Frame = 0

Query: 87  SARPILRSSD---QRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVM 146
           SA  I RSS     ++RLVSLDVFRGLTVA MI+VD  GG++P+INHSPW+G+TLAD VM
Sbjct: 29  SALQISRSSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVM 88

Query: 147 PFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQ 206
           PFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D++
Sbjct: 89  PFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVE 148

Query: 207 QIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSY 266
           +IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   VVA V+TT+YL L Y
Sbjct: 149 KIRLMGILQRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLY 208

Query: 267 GLYVPDWEYQVPSLTTSNVASPKTF---SVKCGTRGDTGPACNAVGMIDRKIFGIQHLYK 326
           GLYVPDWEYQ+  L     ++  TF    VKCG RG TGP CNAVGM+DR   GIQHLY+
Sbjct: 209 GLYVPDWEYQI--LKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYR 268

Query: 327 RPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHF 382
           +P+YAR++QCSIN P+ GPLPPDAPSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HF
Sbjct: 269 KPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHF 327

BLAST of CaUC10G191580 vs. TAIR 10
Match: AT5G47900.4 (Protein of unknown function (DUF1624))

HSP 1 Score: 421.4 bits (1082), Expect = 1.0e-117
Identity = 206/301 (68.44%), Positives = 248/301 (82.39%), Query Frame = 0

Query: 87  SARPILRSSD---QRQRLVSLDVFRGLTVALMIVVDYAGGVMPAINHSPWNGLTLADLVM 146
           SA  I RSS     ++RLVSLDVFRGLTVA MI+VD  GG++P+INHSPW+G+TLAD VM
Sbjct: 23  SALQISRSSSLPPDKERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVM 82

Query: 147 PFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQ 206
           PFFLFIVGVSLA AYK +  R +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D++
Sbjct: 83  PFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVE 142

Query: 207 QIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSY 266
           +IR MGILQRIAIAY + ALCEIWLKG+  V+SE ++ +KY+   VVA V+TT+YL L Y
Sbjct: 143 KIRLMGILQRIAIAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLY 202

Query: 267 GLYVPDWEYQVPSLTTSNVASPKTF---SVKCGTRGDTGPACNAVGMIDRKIFGIQHLYK 326
           GLYVPDWEYQ+  L     ++  TF    VKCG RG TGP CNAVGM+DR   GIQHLY+
Sbjct: 203 GLYVPDWEYQI--LKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYR 262

Query: 327 RPIYARSEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHF 382
           +P+YAR++QCSIN P+ GPLPPDAPSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HF
Sbjct: 263 KPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHF 321

BLAST of CaUC10G191580 vs. TAIR 10
Match: AT5G47900.6 (Protein of unknown function (DUF1624))

HSP 1 Score: 394.0 bits (1011), Expect = 1.8e-109
Identity = 183/290 (63.10%), Positives = 236/290 (81.38%), Query Frame = 0

Query: 156 LAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIA 215
           +++  +PS+ +AT+KA++R+LKLL LGLFLQGGF+HG+NNLTYG+D+++IR MGILQRIA
Sbjct: 1   MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60

Query: 216 IAYFLAALCEIWLKGSDYVNSETALRRKYQLQLVVAVVLTTLYLVLSYGLYVPDWEYQVP 275
           IAY + ALCEIWLKG+  V+SE ++ +KY+   VVA V+TT+YL L YGLYVPDWEYQ+ 
Sbjct: 61  IAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQI- 120

Query: 276 SLTTSNVASPKTF---SVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARSEQCSI 335
            L     ++  TF    VKCG RG TGP CNAVGM+DR   GIQHLY++P+YAR++QCSI
Sbjct: 121 -LKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSI 180

Query: 336 NAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWII 395
           N P+ GPLPPDAPSWCQAPFDPEGLLS++MA VTCLVGLHYGHII+HFKDH+ R+  WI+
Sbjct: 181 NYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWIL 240

Query: 396 PSSCLIVLAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMMETH 443
            S CL++L + L+  GMH+NK LYT+SYM VT+GA+G L + IYLM++ +
Sbjct: 241 RSFCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVY 288
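
The Identity and Positives percentages reported for each HSP above are the matching counts divided by the alignment length (for example, 400/442 gives 90.50%). The following is a minimal Python sketch that recomputes them from a summary line, assuming the lines keep the layout shown on this page. The function name `parse_hsp_stats` and the returned keys are illustrative only and are not part of any BLAST tool.

```python
import re

# Matches lines such as:
#   Identity = 400/442 (90.50%), Positives = 420/442 (95.02%), Query Frame = 0
# The optional "i" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Returns None when the line does not look like an HSP summary line.
    """
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    example = "Identity = 237/362 (65.47%), Positives = 294/362 (81.22%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Comparing the recomputed and reported percentages is a quick way to catch copy or extraction errors in the alignment summaries.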

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038905626.1  1.1e-246  96.83         heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Benincasa hispida]
XP_008465168.1  5.7e-243  95.48         PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Cucumis melo] >KAA00...
XP_004141153.1  3.7e-242  95.02         heparan-alpha-glucosaminide N-acetyltransferase [Cucumis sativus]
KAE8651279.1    3.7e-242  95.02         hypothetical protein Csa_000910 [Cucumis sativus]
XP_022983293.1  1.6e-232  90.50         heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 [Cucurbita maxim...

Match Name      E-value   Identity (%)  Description
Q3UDW8          7.0e-26   27.65         Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus OX=10090 GN=Hgsn...
Q68CP4          8.5e-24   27.09         Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens OX=9606 GN=HGSNA...

Match Name      E-value   Identity (%)  Description
A0A5A7T699      2.8e-243  95.48         Heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo var. makuwa OX=1...
A0A1S3CNA5      2.8e-243  95.48         heparan-alpha-glucosaminide N-acetyltransferase OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0LFP0      1.8e-242  95.02         DUF1624 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G845460 PE=...
A0A6J1IYW6      7.5e-233  90.50         heparan-alpha-glucosaminide N-acetyltransferase-like isoform X1 OS=Cucurbita max...
A0A6J1J7D2      7.5e-233  90.50         heparan-alpha-glucosaminide N-acetyltransferase-like isoform X2 OS=Cucurbita max...

Match Name      E-value   Identity (%)  Description
AT5G47900.1     8.2e-139  65.47         Protein of unknown function (DUF1624)
AT5G47900.7     5.1e-133  59.26         Protein of unknown function (DUF1624)
AT5G47900.2     1.0e-117  68.44         Protein of unknown function (DUF1624)
AT5G47900.4     1.0e-117  68.44         Protein of unknown function (DUF1624)
AT5G47900.6     1.8e-109  63.10         Protein of unknown function (DUF1624)

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                                    Source   Source Term     Source Description                                    Alignment
IPR012429  Heparan-alpha-glucosaminide N-acetyltransferase, catalytic domain  PFAM     PF07786         DUF1624                                               coord: 100..224; e-value: 4.2E-8; score: 33.0
None       No IPR available                                                   PANTHER  PTHR31061       LD22376P                                              coord: 40..440
None       No IPR available                                                   PANTHER  PTHR31061:SF31  HEPARAN-ALPHA-GLUCOSAMINIDE N-ACETYLTRANSFERASE-LIKE  coord: 40..440

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC10G191580.1  CaUC10G191580.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0016740      transferase activity