Cla97C04G072200 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C04G072200
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: sulfoquinovosidase-like isoform X1
Location: Cla97Chr04: 18940491 .. 18946576 (-)
RNA-Seq Expression: Cla97C04G072200
Synteny: Cla97C04G072200
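
The Location field packs the chromosome, the start and end coordinates, and the strand into one string. The sketch below splits it apart and derives the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention for genome browsers but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "Cla97Chr04: 18940491 .. 18946576 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand, span).

    The span assumes 1-based inclusive coordinates (an assumption,
    not something the page confirms).
    """
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return seqid, start, end, strand, end - start + 1


seqid, start, end, strand, span = parse_location(
    "Cla97Chr04: 18940491 .. 18946576 (-)"
)
print(seqid, start, end, strand, span)
```

Under that assumption the gene spans 6086 bp on the minus strand of Cla97Chr04.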
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAACAACGACTTTTCTTTCAAAAGCTTTCCAGTGTTTATATGAACTTCCGGAACTTCCACAGCCTTCTTCTCTTTGTTCCAACACAACACAAGCATGAAAAACCTCAAAATTACCAAAAAACATCATATACACCTCAACAATCCTTTCCCTTCACCTCCAACTTCGTTTCCTTTACTTCAGGGGGACCTCTCTGCAAATTTTCAAGCACTTCCTCCCTACAAAGCCTTCTCAATCGGAAAGGATTTTCATCTTCTATGGAGGTCCGAAAATGGTGGGTCTCTCTCAATTTATCATCTCTCTCAGCCAACCAGATCAATCTGGTCAACAATCCCAGGTCAAGCTTTTGTATCTGCAGCAATGGTGGAAACTGAGGTGGAAGAAAGTCGAGGTTCGTTTGCTGTGAAAGATGGGGCTGTTCATTTGGTTTGTAATCATCAAACAATTGATGATATCAAGGAGATCAATGGTTGCGATCATGAGTTGGAGGTTAAAGATCATCATTTTCCATCTGGGTATTTGGGATTGGACAAGAAAGACCATCAGAAAGATGCTCAGTTCCCTATGTTGCTCATTAATGGCAGAATATTCAATACCAAGAAGAAGAAGATGAAGAAGAACAGGCTTCAAGAAACTGGTTTCAATGGTGATGTAAAATGTAACTCGAAAGTTCCTCCTGCTTCTGCAAGGTATTGGTTATTGTTTGAACAGAAAAACGGCAGCCAAATTGGATTTCAAGTGATGCTTGGCCAACCCAGCTATGAATATCGCCAAATGGCTCATTCAAGAGGGGGATTCAGCAGGCTTAAGTTTGGATTGCATCGGCTGAGGAAACGGAAAGTTGAATGGTATTGGTCTTTAGCAAAACTGAAAGGGTGTGTTAGAGTTTCTTCTTCAGAAGAGGAAATGGAAGTCTTCAGAGCAGCTGGGGAATTTGAATCATTCAACAGGGTATGCTTGACCTATTCAAGTGATGAGAAGGAAAGATTCTTTGGTTTTGGAGAGCAATTCTCTCATATGGACTTCAAGGGAAAAAGAGTACCAATCTTTGTTCAAGAACAGGGTATTGGTAGAGGAGATCAACCTATCACTTTTGCAGCTAATCTGGTTAGTTACAGGTAAGTTAAACAAACAGTACACTTATTCATCTCTTAATTCTAAAACTAAGTTTGAAAATTTTTATCACTTCAGGTCTGGAGGTGATTGGAGTACTACATATGCTCCATCTCCATTTTACATGACATCCAAAATGAGGTCTCTGTACCTTGAAGGATATGAGTACTCTGTATTTGATCTGACAAAGAATGATAGAGTTCAGATTCAGGTAGCCTTTCAATAACTATAGCACTTCACTTTCTTTCATCAACCTAAAAAGCCATGAAATGTTTGAATAGAGGTTTGGTTTAAACTAAAAGAGAAAGTAAAAAAGAAACATAACAATGCCAAAATGTAAAGATAGAGCATGTGAGGAATAAAAAACTAAAACATTGAGATTTGGTTTAACACAAAACTAGAAAGTAAAGAAAAAAGATCTGAACTATTCATCTTAGGATACCCCTTGAAACCGTAAGGCACTCATATACGAATTTTTAGCTTACAGTATTCCCATTCTCTGTGAACTGGAGATTGCAGATTCATGGCAATTCCATTGAAGGAAGGATATTGCACGGGAACTCACCTTCAGAGCTTATTGAACGTTTCACTGAAACCATTGGGAGGCCTACAGAGCTTCCTGGATGGATAATATCAGGTGCTGTGGTAGGGATGCAGGGTGGCACTGACGCTGCACGCAAAATTTGGGATGAGCTAAAAGCTCGTGAAGTTCCCATTTCTGCATTTTGGCTGCAGGTAGTACAAATTTGTCATTTCCAAAAGTTAGTTATAGATTTCTGAATCAAACCAAAGGCAGAAACTGACGCACAGTTGTATAGTATTTATTACATGCTTTTGTAATGGTATAGTTTTTCTATAATGTATAGAGATTCTTTTAGATTCATTCCT
AATCTAAAAGAAGAATGTTATGTAGGACTGGGTGGGGCAGAGAGAAACAGTAATTGGATCACAACTGTGGTGGAACTGGGAAGTTGACACAACAAGGTACTATGGATGGAGACAATTAATTAAGGATCTCGGTGCTCGGCATATAAAAGTAATGACATACTGCAATCCCTGCCTAGCTCCGGTATTTTCTTCTTCCACACTTTCAATTCTTCTGCAATCACCAAATCAGAAATATTGTTACTAAACTTGCATTATGTTAAGAGCAAGGATATTTATCGACCTTTCTTCTTTTCCCCTTTATGTTGTAAAATATTGTTTTCCCACCCGATAGACAAATACACTGCCTTCTTAATGGCTACCTAAATGGACAGAAATTGAGATAAAATAGTAAAGACTAATCATTTCCGTTTCAAATTTGTACATGATTTCAGACTGATGAGAAGCAGAACAGAAGGAGAAACCTTTATGAGGAAGCAAAGGAGTTGGGGATCTTGGTAAAGAAGAAGAATGGAGAACCCTATATGGTTCCGAACACAGCTTTTGATGTGGGAATGTTGGATCTTACACACCCAAATACTTCCAGTTGGTTCAAGGAAATTCTACAAGAAATGGTGGATGATGGTGTGAGAGGATGGATGGCTGATTTCGGCGAAGGCTTGCCCGTAGATGCCACCCTCTATTCAGGTTATGCTAAGAGAAGATGACAGTTCAATGCTAATATTTTGTACTCAATTTCAAAGTAACTACCTAAGAATAGTATAATATCATATAATTAAGAATGAATTTTTACATTCATTTTCTTTAGAGCAATGATTAGGTACAATCTAGGCTACACCTAATCCATACTCCTATTTCCTTTACACACTAACAAATCATTTTAAAAAAAAAAATTATCACATGACAGCTTTTTACTTGAGCTGAGAGAGCACCTAAATTTTTCTCCTTTCTTAATATTCATTGCCAAAAGATTCTTATTTTATATTTTTTTCAAAAAAGCTTTATCACTGAAAATTTCAACTGCTTCATAGAAATCGTGAAGCCCTTTCCTTGGTTAGAGATGGATTGGAAAGGGAGTTATATGCATTGATCAATCTTACGTGGTTCTTTGAACATAAACCTTCCAAGTCGAGATGAACGTAAAATGTCATAATTCCCACAAAGCAGGGGTACGAAATGAGTTCAAGATTCCTTATTCTTTTCAACATTTTTGTGTTTGCGTCCTGAACAAGAAGCTGATTAAGATCTGTGTGCAAATCAATTTCACTAAAACATTTTAGATTTTACCTTAACCAGAACTGCATACTTCCCCGCTGTGGTATATTAATATTACTCACAGATTTAGAACCATAATTTCAGGTGAAGATCCTATTACTGCACATAACAGATACCCAGAAATATGGGCGCAGATTAACAGAGAATTTGCAGATGAATGGAAAAGTAATCTTGTTGGTAAGGAGAAAGAAGACCCGCAAGAGGCCTTAGTTTTCTTCATGAGAGCTGGTTTTAGAAACAGTCCTAAATGGGGGATGCTATTTTGGGAAGGAGACCAAATGGTAAGTTGGCAAGCTAATGATGGGATAAAGAGTGCTGTCACTGGCCTTTTGAGCAGTGGACTTTCAGGGTATGCTTTTAATCACAGTGATATAGGAGGGTACTGTGCAGTAGATTTACCTTTCATCAAGTACCGAAGAAGCGAGGAACTGCTTCTACGGTGGATGGAGTTAAATGCCTTTACCACTGTTTTCAGAACCCATGAAGTAAGTTAAAAACCAACTTTACTTCTTTTCCCTATTCTGTTTGCTCTGTCCCTTTCTTCTCTGACAATAACCATAACCTGAAAAAGAACATTATGTATGACATCAAATTTACTAAACGCATGAAATGGGAATGCATAAGAAGCAAGCTGTCGAGATTGCATTATCTTTCTTTCACTCCCTTTCTTCAGATTTATCTTTAGTAAATTCCCGAAACATATGGAACCTTATAACCTGCACTGAGCT
TTATATACAAATCAACTATGGTTTGCATCCAATGAACGCCTCATGTTGGGAAGATTGAATTATCATATATACTATACTTCTACAACCATATAAAGTTAGCACCAGAATATAATTATCCTCTACAAATTTTCAACATACTCAGTCTTGTTAAAGCACTGTTTCCAACTGTCATCTGCATGCAGGGAAACAAACCATCCTGCAACAGCCAATTCTACTCAAACGATCGAACATTATCACAGTTTGCTCGCTTTGCCAAGGTCTATAGTGCTTGGAAGTTTTACAGAATCCAACTTGTGAAGGTAAATATGCAACCTTTTACATTCCATCTGCAACTTCTATATTTTCAGAGCTGATGACCTTTCTCTTCTGTGGTTTGGATCACAAAATAATACAGGAAGCAGCTCAAAGGGGCCTACCCGTATGCCGCCACCTATTTGTTCACTACCCAGATGACGAGTATGTGCTTACATTGGGTCACCAGCAGTTTTTGGTTGGTTCTGAGATCCTAGTTGTGCCTGTCCTTGACAAAGGCAAGAACAATGTCCAAGCCTACTTTCCTCTGGGTGAAAGTTCTTCATGGCAACATATATGGACAGGGGAGGTATATGCTAAACCAGGCTGCGAAATTAAAGTAGACACTCCAGTTGGCTATCCAGCTGTATTTGTCAAGGTTGGCTCCATTGTAGGGGAAACTTTCATTAAAAACCTGAAAATTTTCAACATTCTTTAAGAAAGTGACCCTTGAAACTGGGGGTTGCTTACTGTTCATTGAATTTTAATCTAGATTTACATTTTTTCCCCCCTATACTTTGTTACTTTTCATGTTTTAGTTCATCAATGATCCAATTAAATGAAAACTAAATACTGGGAATGAATCTAAATGCCTGCTTGATACGTAATCCCAGAGCTAAAATGAGACTATAATAAGGATGGTGATGACTGATGGATTTACGATGTTACGAGGAACTGAGATCTAATGAACAAAAGCATAGGACGTACAAGTGACTGTAAAAGAACTAAAATTATGTGCTTATTTTAATCAAGGTTTAAAAAACAAGTTTAGTAATTGTACTTCCAATGAGTTAATAGTTCTATTCTTGGACTTCTGTATTTCAAGATTTAGTCAGTTTAGTCAATATTTTGTGATGATTTAGCTTCAAAACCGTTCCCTTTACATTAAAAATAAAAAATAAATAAGCTACTAAGGTGTAATGCAGGTTTGAGTAGCAACCACCTGGATCTCACGTATAAAAAGAATAAAGATGATGATTGTTAGTATCTAGATGGTTGTCACATCAAACTCGTACTATATCTTAAATACTGATTTTTATTCAGAATAGGAATTAAATCGAAGAACTATCTTTACCTACTAGAATATGGGACTAAATGATTATTTCAAAGAAAGTGCAGTGCAGCCTGCAGGATAGGTCTGCTTTCAATAATAATTTAGGGACTATTATTTCACTATACTGTAAAATCATAACATAAGAACAGAAAACATGAGAACTTCTGCTACATACTTTTTGAGCAAAATCACAAGAAGTCTTTTCGTCCAAGAATAAAAATCAGTAACGCAAAACATATCTTACCAGTCAATCCAAATTCAAAACAGTCTTAATTTACGAACCACTCCCCACCATTTTTCAAAAGAAATCAGAACACCGAACCTTACTATCATCCCCTACATCAGACACGCTCTTCCGTCAATCACAGTGCACAGAATAAAACAACAACAATACAGAAATACATCAAACATATCTCTAGTGCATGCTTCATTACGAAAACAGAGCCTCGCAGCAATGGGGATTGCAATGCTAACAAAAACCTAATAGAATTAAAACCGTGCATTGGAAGTCGCATACAACATTATTCTGCAAAAATCTGAGCGTTTAGACTGTAGAGAGTAACAGAACTGCCAATGGGATGATCATTGCAGAGTTTAAAGACTGAAAATACAAAAGAGATGATTGAAACTCACTTCGATGATGAGCGATTCTGGT
GCCATGGCGGAGGATCGCCATGGAAAATTTGGGTCCGGACTCCGGAGACTTGGGATTATAGGTTGCATTTGCATGTGGTTGGTTAG

mRNA sequence

CAAACAACGACTTTTCTTTCAAAAGCTTTCCAGTGTTTATATGAACTTCCGGAACTTCCACAGCCTTCTTCTCTTTGTTCCAACACAACACAAGCATGAAAAACCTCAAAATTACCAAAAAACATCATATACACCTCAACAATCCTTTCCCTTCACCTCCAACTTCGTTTCCTTTACTTCAGGGGGACCTCTCTGCAAATTTTCAAGCACTTCCTCCCTACAAAGCCTTCTCAATCGGAAAGGATTTTCATCTTCTATGGAGGTCCGAAAATGGTGGGTCTCTCTCAATTTATCATCTCTCTCAGCCAACCAGATCAATCTGGTCAACAATCCCAGGTCAAGCTTTTGTATCTGCAGCAATGGTGGAAACTGAGGTGGAAGAAAGTCGAGGTTCGTTTGCTGTGAAAGATGGGGCTGTTCATTTGGTTTGTAATCATCAAACAATTGATGATATCAAGGAGATCAATGGTTGCGATCATGAGTTGGAGGTTAAAGATCATCATTTTCCATCTGGGTATTTGGGATTGGACAAGAAAGACCATCAGAAAGATGCTCAGTTCCCTATGTTGCTCATTAATGGCAGAATATTCAATACCAAGAAGAAGAAGATGAAGAAGAACAGGCTTCAAGAAACTGGTTTCAATGGTGATGTAAAATGTAACTCGAAAGTTCCTCCTGCTTCTGCAAGGTATTGGTTATTGTTTGAACAGAAAAACGGCAGCCAAATTGGATTTCAAGTGATGCTTGGCCAACCCAGCTATGAATATCGCCAAATGGCTCATTCAAGAGGGGGATTCAGCAGGCTTAAGTTTGGATTGCATCGGCTGAGGAAACGGAAAGTTGAATGGTATTGGTCTTTAGCAAAACTGAAAGGGTGTGTTAGAGTTTCTTCTTCAGAAGAGGAAATGGAAGTCTTCAGAGCAGCTGGGGAATTTGAATCATTCAACAGGGTATGCTTGACCTATTCAAGTGATGAGAAGGAAAGATTCTTTGGTTTTGGAGAGCAATTCTCTCATATGGACTTCAAGGGAAAAAGAGTACCAATCTTTGTTCAAGAACAGGGTATTGGTAGAGGAGATCAACCTATCACTTTTGCAGCTAATCTGGTTAGTTACAGGTCTGGAGGTGATTGGAGTACTACATATGCTCCATCTCCATTTTACATGACATCCAAAATGAGGTCTCTGTACCTTGAAGGATATGAGTACTCTGTATTTGATCTGACAAAGAATGATAGAGTTCAGATTCAGATTCATGGCAATTCCATTGAAGGAAGGATATTGCACGGGAACTCACCTTCAGAGCTTATTGAACGTTTCACTGAAACCATTGGGAGGCCTACAGAGCTTCCTGGATGGATAATATCAGGTGCTGTGGTAGGGATGCAGGGTGGCACTGACGCTGCACGCAAAATTTGGGATGAGCTAAAAGCTCGTGAAGTTCCCATTTCTGCATTTTGGCTGCAGGACTGGGTGGGGCAGAGAGAAACAGTAATTGGATCACAACTGTGGTGGAACTGGGAAGTTGACACAACAAGGTACTATGGATGGAGACAATTAATTAAGGATCTCGGTGCTCGGCATATAAAAGTAATGACATACTGCAATCCCTGCCTAGCTCCGACTGATGAGAAGCAGAACAGAAGGAGAAACCTTTATGAGGAAGCAAAGGAGTTGGGGATCTTGGTAAAGAAGAAGAATGGAGAACCCTATATGGTTCCGAACACAGCTTTTGATGTGGGAATGTTGGATCTTACACACCCAAATACTTCCAGTTGGTTCAAGGAAATTCTACAAGAAATGGTGGATGATGGTGTGAGAGGATGGATGGCTGATTTCGGCGAAGGCTTGCCCGTAGATGCCACCCTCTATTCAGGTGAAGATCCTATTACTGCACATAACAGATACCCAGAAATATGGGCGCAGATTAACAGAGAATTTGCAGATGAATGGAAAAGTAATCTTGTTGGTAAGGAGAAAGAAGACCCGCAAGAGGCCTTA
GTTTTCTTCATGAGAGCTGGTTTTAGAAACAGTCCTAAATGGGGGATGCTATTTTGGGAAGGAGACCAAATGGTAAGTTGGCAAGCTAATGATGGGATAAAGAGTGCTGTCACTGGCCTTTTGAGCAGTGGACTTTCAGGGTATGCTTTTAATCACAGTGATATAGGAGGGTACTGTGCAGTAGATTTACCTTTCATCAAGTACCGAAGAAGCGAGGAACTGCTTCTACGGTGGATGGAGTTAAATGCCTTTACCACTGTTTTCAGAACCCATGAAGGAAACAAACCATCCTGCAACAGCCAATTCTACTCAAACGATCGAACATTATCACAGTTTGCTCGCTTTGCCAAGGTCTATAGTGCTTGGAAGTTTTACAGAATCCAACTTGTGAAGGAAGCAGCTCAAAGGGGCCTACCCGTATGCCGCCACCTATTTGTTCACTACCCAGATGACGAGTATGTGCTTACATTGGGTCACCAGCAGTTTTTGGTTGGTTCTGAGATCCTAGTTGTGCCTGTCCTTGACAAAGGCAAGAACAATGTCCAAGCCTACTTTCCTCTGGGTGAAAGTTCTTCATGGCAACATATATGGACAGGGGAGGTATATGCTAAACCAGGCTGCGAAATTAAAGTAGACACTCCAGTTGGCTATCCAGCTGTATTTGTCAAGAGTTTAAAGACTGAAAATACAAAAGAGATGATTGAAACTCACTTCGATGATGAGCGATTCTGGTGCCATGGCGGAGGATCGCCATGGAAAATTTGGGTCCGGACTCCGGAGACTTGGGATTATAGGTTGCATTTGCATGTGGTTGGTTAG

Coding sequence (CDS)

ATGAAAAACCTCAAAATTACCAAAAAACATCATATACACCTCAACAATCCTTTCCCTTCACCTCCAACTTCGTTTCCTTTACTTCAGGGGGACCTCTCTGCAAATTTTCAAGCACTTCCTCCCTACAAAGCCTTCTCAATCGGAAAGGATTTTCATCTTCTATGGAGGTCCGAAAATGGTGGGTCTCTCTCAATTTATCATCTCTCTCAGCCAACCAGATCAATCTGGTCAACAATCCCAGGTCAAGCTTTTGTATCTGCAGCAATGGTGGAAACTGAGGTGGAAGAAAGTCGAGGTTCGTTTGCTGTGAAAGATGGGGCTGTTCATTTGGTTTGTAATCATCAAACAATTGATGATATCAAGGAGATCAATGGTTGCGATCATGAGTTGGAGGTTAAAGATCATCATTTTCCATCTGGGTATTTGGGATTGGACAAGAAAGACCATCAGAAAGATGCTCAGTTCCCTATGTTGCTCATTAATGGCAGAATATTCAATACCAAGAAGAAGAAGATGAAGAAGAACAGGCTTCAAGAAACTGGTTTCAATGGTGATGTAAAATGTAACTCGAAAGTTCCTCCTGCTTCTGCAAGGTATTGGTTATTGTTTGAACAGAAAAACGGCAGCCAAATTGGATTTCAAGTGATGCTTGGCCAACCCAGCTATGAATATCGCCAAATGGCTCATTCAAGAGGGGGATTCAGCAGGCTTAAGTTTGGATTGCATCGGCTGAGGAAACGGAAAGTTGAATGGTATTGGTCTTTAGCAAAACTGAAAGGGTGTGTTAGAGTTTCTTCTTCAGAAGAGGAAATGGAAGTCTTCAGAGCAGCTGGGGAATTTGAATCATTCAACAGGGTATGCTTGACCTATTCAAGTGATGAGAAGGAAAGATTCTTTGGTTTTGGAGAGCAATTCTCTCATATGGACTTCAAGGGAAAAAGAGTACCAATCTTTGTTCAAGAACAGGGTATTGGTAGAGGAGATCAACCTATCACTTTTGCAGCTAATCTGGTTAGTTACAGGTCTGGAGGTGATTGGAGTACTACATATGCTCCATCTCCATTTTACATGACATCCAAAATGAGGTCTCTGTACCTTGAAGGATATGAGTACTCTGTATTTGATCTGACAAAGAATGATAGAGTTCAGATTCAGATTCATGGCAATTCCATTGAAGGAAGGATATTGCACGGGAACTCACCTTCAGAGCTTATTGAACGTTTCACTGAAACCATTGGGAGGCCTACAGAGCTTCCTGGATGGATAATATCAGGTGCTGTGGTAGGGATGCAGGGTGGCACTGACGCTGCACGCAAAATTTGGGATGAGCTAAAAGCTCGTGAAGTTCCCATTTCTGCATTTTGGCTGCAGGACTGGGTGGGGCAGAGAGAAACAGTAATTGGATCACAACTGTGGTGGAACTGGGAAGTTGACACAACAAGGTACTATGGATGGAGACAATTAATTAAGGATCTCGGTGCTCGGCATATAAAAGTAATGACATACTGCAATCCCTGCCTAGCTCCGACTGATGAGAAGCAGAACAGAAGGAGAAACCTTTATGAGGAAGCAAAGGAGTTGGGGATCTTGGTAAAGAAGAAGAATGGAGAACCCTATATGGTTCCGAACACAGCTTTTGATGTGGGAATGTTGGATCTTACACACCCAAATACTTCCAGTTGGTTCAAGGAAATTCTACAAGAAATGGTGGATGATGGTGTGAGAGGATGGATGGCTGATTTCGGCGAAGGCTTGCCCGTAGATGCCACCCTCTATTCAGGTGAAGATCCTATTACTGCACATAACAGATACCCAGAAATATGGGCGCAGATTAACAGAGAATTTGCAGATGAATGGAAAAGTAATCTTGTTGGTAAGGAGAAAGAAGACCCGCAAGAGGCCTTAGTTTTCTTCATGAGAGCTGGTTTTAGAAACAGTCCTAAATGGGGGATGCTATTTTGGGAAGGAGACCAAATGGTAAGTTGGCAAGCTAATGATGG
GATAAAGAGTGCTGTCACTGGCCTTTTGAGCAGTGGACTTTCAGGGTATGCTTTTAATCACAGTGATATAGGAGGGTACTGTGCAGTAGATTTACCTTTCATCAAGTACCGAAGAAGCGAGGAACTGCTTCTACGGTGGATGGAGTTAAATGCCTTTACCACTGTTTTCAGAACCCATGAAGGAAACAAACCATCCTGCAACAGCCAATTCTACTCAAACGATCGAACATTATCACAGTTTGCTCGCTTTGCCAAGGTCTATAGTGCTTGGAAGTTTTACAGAATCCAACTTGTGAAGGAAGCAGCTCAAAGGGGCCTACCCGTATGCCGCCACCTATTTGTTCACTACCCAGATGACGAGTATGTGCTTACATTGGGTCACCAGCAGTTTTTGGTTGGTTCTGAGATCCTAGTTGTGCCTGTCCTTGACAAAGGCAAGAACAATGTCCAAGCCTACTTTCCTCTGGGTGAAAGTTCTTCATGGCAACATATATGGACAGGGGAGGTATATGCTAAACCAGGCTGCGAAATTAAAGTAGACACTCCAGTTGGCTATCCAGCTGTATTTGTCAAGAGTTTAAAGACTGAAAATACAAAAGAGATGATTGAAACTCACTTCGATGATGAGCGATTCTGGTGCCATGGCGGAGGATCGCCATGGAAAATTTGGGTCCGGACTCCGGAGACTTGGGATTATAGGTTGCATTTGCATGTGGTTGGTTAG

Protein sequence

MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENGGSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDIKEINGCDHELEVKDHHFPSGYLGLDKKDHQKDAQFPMLLINGRIFNTKKKKMKKNRLQETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRLKFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKERFFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYMTSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTELPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDPITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCRHLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVYAKPGCEIKVDTPVGYPAVFVKSLKTENTKEMIETHFDDERFWCHGGGSPWKIWVRTPETWDYRLHLHVVG
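
The protein sequence above should follow from the CDS by translation. As a rough check, the sketch below translates the first and last few codons of the CDS with the standard genetic code; it assumes the standard code applies to this nuclear gene, and the helper name `translate` is illustrative only. Only short excerpts of the CDS are used, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGAAAAACCTCAAAATTACC"
cds_end = "GTGGTTGGTTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The first excerpt gives the opening residues of the protein (MKNLKIT), and the second gives its final three residues before the TAG stop codon.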
Homology
BLAST of Cla97C04G072200 vs. NCBI nr
Match: XP_038882634.1 (sulfoquinovosidase-like [Benincasa hispida])

HSP 1 Score: 1699.1 bits (4399), Expect = 0.0e+00
Identity = 809/861 (93.96%), Positives = 830/861 (96.40%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHH+HLNNPFPSPPTSFPLLQGDLSANFQ LPPYKAFSIG+DF LLWRSENG
Sbjct: 52  MTNLKITKKHHMHLNNPFPSPPTSFPLLQGDLSANFQTLPPYKAFSIGRDFQLLWRSENG 111

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI
Sbjct: 112 GSLSIYHLSHPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 171

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQKDAQFPMLLINGRIFNTKKK---KMKKNRL 180
           KEING DHELEVKDHHFPSGYLGLD+K HQKD +FPMLLINGRIFNTKKK   K KKNRL
Sbjct: 172 KEINGWDHELEVKDHHFPSGYLGLDQKSHQKDTKFPMLLINGRIFNTKKKMMMKKKKNRL 231

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QETGFNGD+KCNSKV PASARYW+LFEQKN SQIGFQVMLGQPSYEYRQM HSRGGFSRL
Sbjct: 232 QETGFNGDIKCNSKVLPASARYWVLFEQKNSSQIGFQVMLGQPSYEYRQMTHSRGGFSRL 291

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF LHRLRKRK EWYWSL KLKG VRVSSSEEEMEV RAA EF  FNRV LTYSS+EKER
Sbjct: 292 KFRLHRLRKRKFEWYWSLPKLKGFVRVSSSEEEMEVLRAAEEFGEFNRVFLTYSSEEKER 351

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 352 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 411

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSI+GRILHGNSPSELIERFTETIGRP E
Sbjct: 412 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIERFTETIGRPPE 471

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGTDA RKIW++LKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 472 LPGWIISGAVVGMQGGTDAVRKIWNDLKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 531

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           DTTRY+GW+QLIKDLGA HIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 532 DTTRYFGWKQLIKDLGAWHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 591

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLP+DATLYSG+DP
Sbjct: 592 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPIDATLYSGDDP 651

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREFADEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 652 ITAHNRYPEIWAQINREFADEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 711

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 712 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 771

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLP+CR
Sbjct: 772 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPICR 831

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKNNV+AYFPL ESSSWQHIWTGEV+
Sbjct: 832 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVKAYFPLNESSSWQHIWTGEVH 891

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AKPGCEIKVD PVGYPAVF+K
Sbjct: 892 AKPGCEIKVDAPVGYPAVFIK 912
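
The identity and positives percentages in each HSP summary are simple ratios of matching positions to the alignment length. A minimal sketch for recomputing them from the summary line of the hit above follows; the function name `hsp_percentages` is hypothetical, and the pattern matches the label for positives loosely so that spelling variants of that word are accepted.

```python
import re

# Matches e.g. "Identity = 809/861 (93.96%), Positives = 830/861 (96.40%)".
# "Pos\w*" tolerates spelling variants of the positives label.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, length, _, pos, pos_length, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(length), 2)
    positives_pct = round(100 * int(pos) / int(pos_length), 2)
    return identity_pct, positives_pct


line = "Identity = 809/861 (93.96%), Positives = 830/861 (96.40%), Query Frame = 0"
print(hsp_percentages(line))
```

Both recomputed values agree with the percentages reported for the Benincasa hispida hit.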

BLAST of Cla97C04G072200 vs. NCBI nr
Match: XP_008455717.1 (PREDICTED: sulfoquinovosidase-like isoform X1 [Cucumis melo] >KAA0025932.1 sulfoquinovosidase-like isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1657.5 bits (4291), Expect = 0.0e+00
Identity = 793/861 (92.10%), Positives = 821/861 (95.35%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIHLNNPFPSPPTSFPLLQG+LSANFQ L  YK FSIGKDF LLWRS+NG
Sbjct: 1   MTNLKITKKHHIHLNNPFPSPPTSFPLLQGELSANFQVLSSYKFFSIGKDFQLLWRSDNG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 61  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHELEVK+HHFPSGYLGLD K+++K DA+FPMLLI+GRIFNT++KKM  KKN+L
Sbjct: 121 KEINGCDHELEVKEHHFPSGYLGLDLKNYEKEDARFPMLLISGRIFNTERKKMMKKKNKL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYWLLFEQK+ SQIGFQVMLGQPSYEYRQ+AHSRGGF+RL
Sbjct: 181 QETSFNGDVKCNSKVLSASARYWLLFEQKSSSQIGFQVMLGQPSYEYRQIAHSRGGFNRL 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF  HRLRKRK EW WSL KLKG VRV SSEEEMEV RAA EFE+FNR CLTYSS+EKER
Sbjct: 241 KFRWHRLRKRKFEWRWSLTKLKGFVRVCSSEEEMEVLRAAEEFEAFNRACLTYSSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS+FDLTKNDRVQIQIHGNSI+GRILHGNSPSELIE FTETIGRP E
Sbjct: 361 TSKMRSLYLEGYEYSIFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIECFTETIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWIISGAVVGMQGGTNIVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 481 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSN+RTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR
Sbjct: 721 AFTTVFRTHEGNKPSCNSQFYSNNRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKN V+AYFPL +SSSWQHIWTGEVY
Sbjct: 781 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNYVKAYFPLDDSSSWQHIWTGEVY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 841 AKLGCEIKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. NCBI nr
Match: XP_004144332.2 (uncharacterized protein LOC101219337 [Cucumis sativus] >KGN54706.1 hypothetical protein Csa_012207 [Cucumis sativus])

HSP 1 Score: 1653.6 bits (4281), Expect = 0.0e+00
Identity = 783/861 (90.94%), Positives = 824/861 (95.70%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLK+TKKHHIHLNNPFPSPP SFPLLQG+LSAN+QAL  YK FSIGKDF LLWRS+NG
Sbjct: 21  MTNLKVTKKHHIHLNNPFPSPPPSFPLLQGELSANYQALSSYKFFSIGKDFQLLWRSDNG 80

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 81  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 140

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHE EVK+HHFPSGYLGLD K+++K DAQFPMLLI+GRIFNT+KK+M  KKN+L
Sbjct: 141 KEINGCDHEFEVKEHHFPSGYLGLDLKNYEKEDAQFPMLLISGRIFNTEKKRMMKKKNKL 200

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYW+ FEQK+ SQIGFQVMLGQPSYE+RQ+AHSRGGF+RL
Sbjct: 201 QETSFNGDVKCNSKVLSASARYWVFFEQKSSSQIGFQVMLGQPSYEHRQIAHSRGGFNRL 260

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF LHRLRKRK EW+WSL KLKG VRV SSE+E+EV RAA EFE+FNRVCLTYSS+EKER
Sbjct: 261 KFRLHRLRKRKFEWHWSLTKLKGFVRVPSSEKEVEVLRAAEEFEAFNRVCLTYSSEEKER 320

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANL+SYR+GGDWSTTYAPSPFYM
Sbjct: 321 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLISYRAGGDWSTTYAPSPFYM 380

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS+FDLTKNDRVQIQIHGNS++GRILHGNSPSELIERFTETIGRP E
Sbjct: 381 TSKMRSLYLEGYEYSIFDLTKNDRVQIQIHGNSVQGRILHGNSPSELIERFTETIGRPPE 440

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 441 LPGWIISGAVVGMQGGTNVVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 500

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGIL+KKKNGE
Sbjct: 501 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILIKKKNGE 560

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFK+ILQEMV+DGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 561 PYMVPNTAFDVGMLDLTHPNTSSWFKKILQEMVNDGVRGWMADFGEGLPVDATLYSGEDP 620

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDP+EALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 621 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPEEALVFFMRAGFRNSPKWGMLFWEGD 680

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 681 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 740

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYS+DRTLSQFARFAKVYSAWKFYRIQLVKEAA+RGLPVCR
Sbjct: 741 AFTTVFRTHEGNKPSCNSQFYSSDRTLSQFARFAKVYSAWKFYRIQLVKEAAERGLPVCR 800

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKNNV AYFPLG++SSWQHIWTGEVY
Sbjct: 801 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVNAYFPLGDNSSWQHIWTGEVY 860

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 861 AKLGCEIKVDAPVGYPAVFIK 881

BLAST of Cla97C04G072200 vs. NCBI nr
Match: XP_022153908.1 (uncharacterized protein LOC111021314 isoform X1 [Momordica charantia])

HSP 1 Score: 1650.6 bits (4273), Expect = 0.0e+00
Identity = 780/861 (90.59%), Positives = 816/861 (94.77%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIH NNPFPS PTS P ++GDLSANFQALP  K  SIG+DF LLWR ENG
Sbjct: 1   MTNLKITKKHHIHFNNPFPSAPTSLPSVEGDLSANFQALPAIKVLSIGQDFQLLWRFENG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQT+DDI
Sbjct: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTVDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQKDAQFPMLLINGRIFNTKKKKM---KKNRL 180
           + ING DHELEVKDHHFPSGYLGLD+K H KDAQFPMLLINGRIFNTKKK M   KKNRL
Sbjct: 121 RVINGWDHELEVKDHHFPSGYLGLDQKMHLKDAQFPMLLINGRIFNTKKKMMRRKKKNRL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QETGFNGD+K N + PPASARYW+LFEQKN SQIGFQVMLGQPSYE RQMAHSRG F R 
Sbjct: 181 QETGFNGDLKYNPRAPPASARYWVLFEQKNSSQIGFQVMLGQPSYECRQMAHSRGRFDRF 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF LHRL+KRKVEWYWSLAKLKGCVRVSSSEEEME  R+A EFE FNRVC TY+S+EKER
Sbjct: 241 KFRLHRLKKRKVEWYWSLAKLKGCVRVSSSEEEMEGLRSAEEFEGFNRVCFTYTSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TS+MRSLYLEGYEYSVFDLTKNDRVQIQIHGNSI+G ILHGNSPSELIERFT+TIGRP E
Sbjct: 361 TSRMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIQGWILHGNSPSELIERFTDTIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGW+ISGAVVGMQGGTDA R+IWD+LK  +VPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWMISGAVVGMQGGTDAVRQIWDDLKVYKVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           DTTRY GW+QLIKDLGA+HIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGIL+KKKNGE
Sbjct: 481 DTTRYCGWKQLIKDLGAQHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILIKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPN+SSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 PYMVPNTAFDVGMLDLTHPNSSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           I AHNRYPEIWAQ+NREFADEWKS LVGKEKEDP EALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 IAAHNRYPEIWAQMNREFADEWKSKLVGKEKEDPHEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTT+FRTHEGNKPSCNSQFYSNDRTL+QFARFAKVYSAWKFYR+QLVKEAAQ+GLPVCR
Sbjct: 721 AFTTIFRTHEGNKPSCNSQFYSNDRTLTQFARFAKVYSAWKFYRVQLVKEAAQKGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTL HQQFLVGSEILVVPVLDKGKNNV+AYFPLGESSSWQHIWTGE+Y
Sbjct: 781 HLFVHYPEDEYVLTLSHQQFLVGSEILVVPVLDKGKNNVKAYFPLGESSSWQHIWTGELY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
            KPGCE+KVD PVGYPAVF+K
Sbjct: 841 TKPGCEVKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. NCBI nr
Match: TYK01674.1 (sulfoquinovosidase-like isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1648.3 bits (4267), Expect = 0.0e+00
Identity = 790/861 (91.75%), Positives = 817/861 (94.89%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIHLNNPFPSPPTSFPLLQG+LSANFQ L  YK FSIGKDF LLWRS+NG
Sbjct: 1   MTNLKITKKHHIHLNNPFPSPPTSFPLLQGELSANFQVLSSYKFFSIGKDFQLLWRSDNG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 61  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHELEVK+HHFPSGYLGLD K+++K DAQFPMLLI+GRIFNT++KKM  KKN+L
Sbjct: 121 KEINGCDHELEVKEHHFPSGYLGLDLKNYEKEDAQFPMLLISGRIFNTERKKMMKKKNKL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYW+LFEQK+ SQIGFQVMLGQPSYEYRQ+AHS GGF+R 
Sbjct: 181 QETSFNGDVKCNSKVLSASARYWVLFEQKSSSQIGFQVMLGQPSYEYRQIAHSSGGFNRP 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF  HRLRKRK EW WSL KLKG VRV SSEEEMEV RAA EFE+FNR CLTYSS+EKER
Sbjct: 241 KFRWHRLRKRKFEWRWSLTKLKGFVRVCSSEEEMEVLRAAEEFEAFNRACLTYSSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS FDLTKNDRVQIQIHGNSI+GRILHGNSPSELIE FTETIGRP E
Sbjct: 361 TSKMRSLYLEGYEYSTFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIECFTETIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWIISGAVVGMQGGTNIVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 481 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
            YMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 HYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSN+RTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR
Sbjct: 721 AFTTVFRTHEGNKPSCNSQFYSNNRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKN V+AYFPL +SSSWQHIWTGEVY
Sbjct: 781 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNYVKAYFPLDDSSSWQHIWTGEVY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 841 AKLGCEIKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. ExPASy Swiss-Prot
Match: P32138 (Sulfoquinovosidase OS=Escherichia coli (strain K12) OX=83333 GN=yihQ PE=1 SV=3)

HSP 1 Score: 400.6 bits (1028), Expect = 4.7e-110
Identity = 211/604 (34.93%), Positives = 337/604 (55.79%), Query Frame = 0

Query: 258 LKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKERFFGFGEQFSHMDFKGKRVPI 317
           +   + +S+ ++   +     +  + NR+ L  ++  ++  +G GEQFS+ D +GK  P+
Sbjct: 89  ISATLNISADDQGRLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPL 148

Query: 318 FVQEQGIGRGDQP-ITFAANLVSYRSGGDWSTTYAPSPFYMTSKMRSLYLEGYEYSVFDL 377
           +  EQG+GR  Q  +T+ A+     +GGD+  T+ P P +++++    +++   Y  FD 
Sbjct: 149 WTSEQGVGRNKQTYVTWQAD-CKENAGGDYYWTFFPQPTFVSTQKYYCHVDNSCYMNFDF 208

Query: 378 TKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTELPGWIISGAVVGMQGGTDA 437
           +  +  ++ +  +    R    ++   L+E+ T  +GR  ELP WI  G  +G+QGGT+ 
Sbjct: 209 SAPEYHELALWEDKATLRFECADTYISLLEKLTALLGRQPELPDWIYDGVTLGIQGGTEV 268

Query: 438 ARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARH 497
            +K  D ++   V ++  W QDW G R T  G ++ WNW+ ++  Y      IK      
Sbjct: 269 CQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGKRVMWNWKWNSENYPQLDSRIKQWNQEG 328

Query: 498 IKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHP 557
           ++ + Y NP +A         ++L EEA + G L K  +G  Y+V    F  G++DLT+P
Sbjct: 329 VQFLAYINPYVASD-------KDLCEEAAQHGYLAKDASGGDYLVEFGEFYGGVVDLTNP 388

Query: 558 NTSSWFKEILQE-MVDDGVRGWMADFGEGLPVDATLYSGEDPITAHNRYPEIWAQINREF 617
              +WFKE++++ M++ G  GWMADFGE LP D  L++G      HN +P +WA+ N E 
Sbjct: 389 EAYAWFKEVIKKNMIELGCGGWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEA 448

Query: 618 ADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGL 677
            +E      GK  E     ++FFMRAG   S K+  + W GDQ V W  +DG+ S V   
Sbjct: 449 LEE-----TGKLGE-----ILFFMRAGSTGSQKYSTMMWAGDQNVDWSLDDGLASVVPAA 508

Query: 678 LSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNS 737
           LS  ++G+  +HSDIGGY  +     + +RS+ELLLRW + +AFT + RTHEGN+P  N 
Sbjct: 509 LSLAMTGHGLHHSDIGGYTTL----FEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNW 568

Query: 738 QFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCRHLFVHYPDDEYVLTLGHQ 797
           QF  +  T++ FAR   V++  K Y  + V   A+ GLPV R LF+HY DD +  TL + 
Sbjct: 569 QFDGDAETIAHFARMTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHTYTLKY- 628

Query: 798 QFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVYAKPGCEIKVDTPVGYPAV 857
           Q+L+G +ILV PV ++G+++   Y P     +W H WTGE +   G E+ V+ P+G P V
Sbjct: 629 QYLLGRDILVAPVHEEGRSDWTLYLP---EDNWVHAWTGEAFR--GGEVTVNAPIGKPPV 664

Query: 858 FVKS 860
           F ++
Sbjct: 689 FYRA 664
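
The percentages on the statistics line of the P32138 hit above follow from the raw counts: matches divided by alignment length, times 100. The sketch below is a minimal check of that arithmetic, not part of the database output. The function name `check_stats` and the regular expression are illustrative assumptions, and the input string is copied from the statistics line of this hit.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
# "Query Frame = 0" has no slash, so it is skipped. (Hypothetical helper pattern.)
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_stats(line):
    """Return {label: (computed_percent, reported_percent)} for one statistics line."""
    results = {}
    for label, matches, length, reported in STAT.findall(line):
        # Percent = matched positions / alignment length * 100, rounded to 2 decimals.
        computed = round(100.0 * int(matches) / int(length), 2)
        results[label] = (computed, float(reported))
    return results


line = "Identity = 211/604 (34.93%), Positives = 337/604 (55.79%), Query Frame = 0"
stats = check_stats(line)
for label, (computed, reported) in stats.items():
    print(label, computed, reported)
```

The same check can be run against the statistics line of any other hit in this report by replacing `line`.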

BLAST of Cla97C04G072200 vs. ExPASy Swiss-Prot
Match: Q9F234 (Alpha-glucosidase 2 OS=Bacillus thermoamyloliquefaciens OX=1425 PE=3 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 7.0e-37
Identity = 140/626 (22.36%), Positives = 252/626 (40.26%), Query Frame = 0

Query: 286 VCLTYSSDEKERFFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGD 345
           VC     DE + F+GFGE+   +D +G+ + ++  +                V      +
Sbjct: 137 VCCFKMMDEADHFYGFGEKTGFLDKRGETMTMWNTD----------------VYAPHNPE 196

Query: 346 WSTTYAPSPFYMTSKMRS---LYLEGYEYSVFDL-TKNDRVQIQIHGNSIEGRILHGNSP 405
               Y   P++MT +  S   ++ +    + FD  T  D       G +I+  +  G +P
Sbjct: 197 TDPLYQSHPYFMTVRNGSAHGIFFDNTYKTTFDFQTATDEYCFSAEGGAIDYYVFAGPTP 256

Query: 406 SELIERFTETIGRPTELPGWIISGAVVGMQGGTD-AARKIWDELKAREVPISAFWLQDWV 465
            +++E++T+  GR    P W +          T+   R+I      +++P+   +L    
Sbjct: 257 KDVLEQYTDLTGRMPLPPKWALGYHQSRYSYETEQEVREIAQTFIEKDIPLDVIYLDIHY 316

Query: 466 GQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNL 525
                V        +  D  R+   +QLI DL  + I+V+   +P +     K++    +
Sbjct: 317 MNGYRV--------FTFDRNRFPNLKQLIADLKQKGIRVVPIVDPGV-----KEDPEYVI 376

Query: 526 YEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMAD 585
           Y+E        K   G  Y            D T+     W+ E  Q   D G+ G   D
Sbjct: 377 YQEGIRHDYFCKYIEGNVYFGEVWPGKSAFPDFTNKKVRKWWGEKHQFYTDLGIEGIWND 436

Query: 586 FGEGLPVDATLYSGEDPITAHNRYPEIWAQINREFADEW-KSNLVGKEKEDPQEALVFFM 645
             E    + T       I  ++  P+   +++  +     ++   G +K    +      
Sbjct: 437 MNEPSVFNETKTMDVKVIHDNDGDPKTHRELHNVYGFMMGEATYKGMKKLLNGKRPFLLT 496

Query: 646 RAGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLP 705
           RAGF    ++  + W GD    W   + ++ ++   ++ GLSG AF   D+GG+      
Sbjct: 497 RAGFSGIQRYAAV-WTGDNRSFW---EHLQMSLPMCMNLGLSGVAFCGPDVGGFA----- 556

Query: 706 FIKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKF 765
              +  + ELL RWM++ AFT  FR H          +   ++      ++ ++   W  
Sbjct: 557 ---HNTNGELLTRWMQVGAFTPYFRNHCAIGFRRQEPWAFGEKYERIIKKYIRLRYQWLP 616

Query: 766 YRIQLVKEAAQRGLPVCRHLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAY 825
           +   L  EA + G PV R LF  YPDDE    L + +FLVG+ +L+ P++        AY
Sbjct: 617 HLYTLFAEAHETGAPVMRPLFFEYPDDENTYNL-YDEFLVGANVLIAPIMTPSTTRRVAY 676

Query: 826 FPLGESSSWQHIWTGEVYAKPGCEIKVDTPVGYPAVFVKSLK------TENTKEMIETHF 885
           FP G   +W   WTGEV  + G    +   +    +F+K          + + EM + H 
Sbjct: 677 FPKG---NWVDYWTGEV-LEGGQYHLISADLETLPIFIKQGSAIALGDVKRSTEMPDEHR 716

Query: 886 DDERFWCHGGGSPWKIWVRTPETWDY 900
               +  +GG + + ++    +T+ Y
Sbjct: 737 TVHIYKANGGKATYVLYDDDGQTFSY 716

BLAST of Cla97C04G072200 vs. ExPASy Swiss-Prot
Match: P96793 (Alpha-xylosidase XylQ OS=Lactiplantibacillus pentosus OX=1589 GN=xylQ PE=1 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 5.9e-36
Identity = 139/553 (25.14%), Positives = 243/553 (43.94%), Query Frame = 0

Query: 296 ERFFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPF 355
           E+ +G GE+F++    G+ V  + Q+ G G                        Y   PF
Sbjct: 158 EKIYGLGERFTNFVKNGQVVDTWNQDGGTGS--------------------EQAYKNIPF 217

Query: 356 YMTSKMRSLYLEGYEYSVFDLTKN--DRVQIQIHGNSIEGRILHGNSPSELIERFTETIG 415
           Y++S    ++++  +   F++     DRVQ    G S++  +++G +P E++ R+T+  G
Sbjct: 218 YISSNGYGVFVDESQRVSFEIGSENVDRVQFSTEGQSLQYYVIYGPTPKEVLHRYTQLTG 277

Query: 416 RPTELPGWIISGAVVGMQGGTDAAR----KIWDELKAREVPISAFWLQDWVGQRETVIGS 475
                P W   G  +     TD +     K  D ++   +P+  F   D   Q+    G 
Sbjct: 278 AIKLPPAWSF-GLWLTTSFTTDYSEETVLKFIDGMQEHHIPLDVFHF-DCFWQK----GF 337

Query: 476 QLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGI 535
           + W   E D  ++     L+K +  R IKV  + NP +A       ++  L++EAK+ G 
Sbjct: 338 E-WCTLEWDKEQFPDPEGLLKKIHDRGIKVCVWLNPYIA-------QKSPLFKEAKDKGY 397

Query: 536 LVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPV-D 595
           L+ ++NG+ +         G +D T+P    W+++ L+ ++D GV  +  DFGE +P  D
Sbjct: 398 LLTRENGDIWQWDLWQAGNGFVDFTNPAAVKWYQDKLKVLLDMGVDSFKTDFGERIPAED 457

Query: 596 ATLYSGEDPITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPK 655
              + G +P   HN Y     Q NR   +      V ++++   EA++F       ++P 
Sbjct: 458 VKFFDGSNPQQEHNYYT---LQYNRAVYE------VIQQEKGADEAVLFARSQRLVHNP- 517

Query: 656 WGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEE 715
              + + G   +S ++       + G LS  LSG+ F   DIGG+   D P      + +
Sbjct: 518 ---IQYTGAATIS-RSTAQCVIQLRGGLSFLLSGFGFWSHDIGGF--EDGPGTP---TAD 577

Query: 716 LLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEA 775
           L  RW +    ++  R H  +       F  +D  +    ++    S   +   +    A
Sbjct: 578 LYKRWSQFGLLSSHSRYHGSDVYRVPWNF--DDEAVENTRKYVNKLSLMPYIYTEAAHAA 637

Query: 776 AQRGLPVCRHLFVHYPDDEYVLTLGHQQFLVGSEILVVPVL-DKGKNNVQAYFPLGESSS 835
           A  G P+ R +F+ + DD+ V      Q++ GS+ILV P+  D+GK     Y P G+   
Sbjct: 638 AAYGNPLMRPMFLEFGDDDNVYD-NATQYMFGSKILVAPIFNDQGK--AHFYLPSGK--- 649

Query: 836 WQHIWTGEVYAKP 841
           W  I  G+VY  P
Sbjct: 698 WTSILDGKVYQAP 649

BLAST of Cla97C04G072200 vs. ExPASy Swiss-Prot
Match: P31434 (Alpha-xylosidase OS=Escherichia coli (strain K12) OX=83333 GN=yicI PE=1 SV=2)

HSP 1 Score: 148.3 bits (373), Expect = 4.2e-34
Identity = 141/612 (23.04%), Positives = 254/612 (41.50%), Query Frame = 0

Query: 234 FSRLKFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFE---SFNRVCLTY 293
           ++  K G    R  K E +WSL  L+   R++ S+ +   +      +    F R+ L  
Sbjct: 100 YAEFKSGNLSARVSKGE-FWSLDFLRNGERITGSQVKNNGYVQDTNNQRNYMFERLDLGV 159

Query: 294 SSDEKERFFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTY 353
                E  +G GE+F+ +   G+ V  + ++                     G      Y
Sbjct: 160 G----ETVYGLGERFTALVRNGQTVETWNRD--------------------GGTSTEQAY 219

Query: 354 APSPFYMTSKMRSLYLEGYEYSVFDL--TKNDRVQIQIHGNSIEGRILHGNSPSELIERF 413
              PFYMT++   + +   +   F++   K  +VQ  +    +E  ++ G +P  +++R+
Sbjct: 220 KNIPFYMTNRGYGVLVNHPQCVSFEVGSEKVSKVQFSVESEYLEYFVIDGPTPKAVLDRY 279

Query: 414 TETIGRPTELPGWIIS-GAVVGMQGGTDAA--RKIWDELKAREVPI-----SAFWLQDWV 473
           T   GRP   P W              D A      D +  R +P+       FW++ + 
Sbjct: 280 TRFTGRPALPPAWSFGLWLTTSFTTNYDEATVNSFIDGMAERNLPLHVFHFDCFWMKAF- 339

Query: 474 GQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNL 533
                      W ++E D   +     +I+ L A+ +K+  + NP +        ++  +
Sbjct: 340 ----------QWCDFEWDPLTFPDPEGMIRRLKAKGLKICVWINPYI-------GQKSPV 399

Query: 534 YEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMAD 593
           ++E +E G L+K+ +G  +        + + D T+P+   W+ + L+ +V  GV  +  D
Sbjct: 400 FKELQEKGYLLKRPDGSLWQWDKWQPGLAIYDFTNPDACKWYADKLKGLVAMGVDCFKTD 459

Query: 594 FGEGLPVDATLYSGEDPITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMR 653
           FGE +P D   + G DP   HN Y    A I  E       + VG+E+       V F R
Sbjct: 460 FGERIPTDVQWFDGSDPQKMHNHY----AYIYNELVWNVLKDTVGEEE------AVLFAR 519

Query: 654 AGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPF 713
           +    + K+  + W GD   ++++   +  ++ G LS GLSG+ F   DIGG+       
Sbjct: 520 SASVGAQKF-PVHWGGDCYANYES---MAESLRGGLSIGLSGFGFWSHDIGGF------- 579

Query: 714 IKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFY 773
            +      +  RW      ++  R H G+K S    +  +D +      F ++      Y
Sbjct: 580 -ENTAPAHVYKRWCAFGLLSSHSRLH-GSK-SYRVPWAYDDESCDVVRFFTQLKCRMMPY 639

Query: 774 RIQLVKEAAQRGLPVCRHLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYF 833
             +    A  RG P+ R + + +PDD     L  +Q+++G  ++V PV  +   +VQ Y 
Sbjct: 640 LYREAARANARGTPMMRAMMMEFPDDPACDYL-DRQYMLGDNVMVAPVFTEA-GDVQFYL 639

BLAST of Cla97C04G072200 vs. ExPASy Swiss-Prot
Match: B3PEE6 (Oligosaccharide 4-alpha-D-glucosyltransferase OS=Cellvibrio japonicus (strain Ueda107) OX=498211 GN=agd31B PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 5.5e-34
Identity = 130/571 (22.77%), Positives = 227/571 (39.75%), Query Frame = 0

Query: 296 ERFFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPF 355
           E+  G G++   MD +G+R P++                 N   Y         Y   P 
Sbjct: 152 EKILGGGQRILGMDRRGQRFPLY-----------------NRAHYGYSDHSGQMYFGLPA 211

Query: 356 YMTSKMRSLYLEGYEYSVFDL--TKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIG 415
            M+SK   L  +       D+  T++D +Q++         ++ GNS   LIE FT+  G
Sbjct: 212 IMSSKQYILVFDNSASGAMDIGKTESDILQLEAKSGRSAYILVAGNSYPSLIENFTQVTG 271

Query: 416 RPTELPGWIISGAVVGMQGGTDA-ARKIWDELKAREVPISAFWLQ-DWVGQRETVIGSQL 475
           R    P W +          ++A  R    + K  + P+    L   W G+        L
Sbjct: 272 RQPLPPRWALGSFASRFGYRSEAETRATVQKYKTEDFPLDTIVLDLYWFGKDIKGHMGNL 331

Query: 476 WWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILV 535
            W+ E   T       ++ D   + +K +    P +  + ++       +++A +   L 
Sbjct: 332 DWDKENFPTPL----DMMADFKQQGVKTVLITEPFVLTSSKR-------WDDAVKAKALA 391

Query: 536 KKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGE-GLPVDAT 595
           K   G+P        + G++D+     S WF  I +++   GV GW  D GE  +  + T
Sbjct: 392 KDPQGQPKAFELYFGNGGIIDVFSKEGSRWFSSIYKDLSKQGVAGWWGDLGEPEMHPEDT 451

Query: 596 LYSGEDPITAHNRYPEIWAQ-INREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKW 655
            ++  D  T HN Y   WA+ + ++  D++           P+      MRAGF  S ++
Sbjct: 452 QHAIGDADTVHNAYGHRWAEMLYQQQLDQF-----------PELRPFIMMRAGFVGSQRY 511

Query: 656 GMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEEL 715
           GM+ W GD   +W    G+ S V   L   L G+ + HSD+GG+   +         +E+
Sbjct: 512 GMIPWTGDVSRTW---GGLASQVELALQMSLLGFGYIHSDLGGFADGE------TLDKEM 571

Query: 716 LLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAA 775
            +RW++   F  V+R H G     +   + ++ T +      K+      Y      +  
Sbjct: 572 YIRWLQYGVFQPVYRPH-GQDHIPSEPVFQDEETKAILRPLVKLRYRMLPYIYTAAYQNT 631

Query: 776 QRGLPVCRHLFVHYPDDEYVLTLGHQ-QFLVGSEILVVPVLDKGKNNVQAYFPLGESSSW 835
             G+P+ R LF  + D++    + ++  +  G  +LV P+   G  +V    P G    W
Sbjct: 632 LTGMPLMRPLF--FSDEKNPALIDNKTSYFWGDSLLVTPITQAGVESVSIPAPKG---VW 668

Query: 836 QHIWTGEVYAKPGCEIKVDTPVGYPAVFVKS 860
              W    Y   G  + + T +    V VK+
Sbjct: 692 FDFWKDTRYQTDGAPLTLPTDLHTIPVLVKA 668

BLAST of Cla97C04G072200 vs. ExPASy TrEMBL
Match: A0A5A7SLB8 (Sulfoquinovosidase-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold34G002350 PE=3 SV=1)

HSP 1 Score: 1657.5 bits (4291), Expect = 0.0e+00
Identity = 793/861 (92.10%), Positives = 821/861 (95.35%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIHLNNPFPSPPTSFPLLQG+LSANFQ L  YK FSIGKDF LLWRS+NG
Sbjct: 1   MTNLKITKKHHIHLNNPFPSPPTSFPLLQGELSANFQVLSSYKFFSIGKDFQLLWRSDNG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 61  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHELEVK+HHFPSGYLGLD K+++K DA+FPMLLI+GRIFNT++KKM  KKN+L
Sbjct: 121 KEINGCDHELEVKEHHFPSGYLGLDLKNYEKEDARFPMLLISGRIFNTERKKMMKKKNKL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYWLLFEQK+ SQIGFQVMLGQPSYEYRQ+AHSRGGF+RL
Sbjct: 181 QETSFNGDVKCNSKVLSASARYWLLFEQKSSSQIGFQVMLGQPSYEYRQIAHSRGGFNRL 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF  HRLRKRK EW WSL KLKG VRV SSEEEMEV RAA EFE+FNR CLTYSS+EKER
Sbjct: 241 KFRWHRLRKRKFEWRWSLTKLKGFVRVCSSEEEMEVLRAAEEFEAFNRACLTYSSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS+FDLTKNDRVQIQIHGNSI+GRILHGNSPSELIE FTETIGRP E
Sbjct: 361 TSKMRSLYLEGYEYSIFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIECFTETIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWIISGAVVGMQGGTNIVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 481 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSN+RTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR
Sbjct: 721 AFTTVFRTHEGNKPSCNSQFYSNNRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKN V+AYFPL +SSSWQHIWTGEVY
Sbjct: 781 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNYVKAYFPLDDSSSWQHIWTGEVY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 841 AKLGCEIKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. ExPASy TrEMBL
Match: A0A1S3C1N4 (sulfoquinovosidase-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495822 PE=3 SV=1)

HSP 1 Score: 1657.5 bits (4291), Expect = 0.0e+00
Identity = 793/861 (92.10%), Positives = 821/861 (95.35%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIHLNNPFPSPPTSFPLLQG+LSANFQ L  YK FSIGKDF LLWRS+NG
Sbjct: 1   MTNLKITKKHHIHLNNPFPSPPTSFPLLQGELSANFQVLSSYKFFSIGKDFQLLWRSDNG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 61  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHELEVK+HHFPSGYLGLD K+++K DA+FPMLLI+GRIFNT++KKM  KKN+L
Sbjct: 121 KEINGCDHELEVKEHHFPSGYLGLDLKNYEKEDARFPMLLISGRIFNTERKKMMKKKNKL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYWLLFEQK+ SQIGFQVMLGQPSYEYRQ+AHSRGGF+RL
Sbjct: 181 QETSFNGDVKCNSKVLSASARYWLLFEQKSSSQIGFQVMLGQPSYEYRQIAHSRGGFNRL 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF  HRLRKRK EW WSL KLKG VRV SSEEEMEV RAA EFE+FNR CLTYSS+EKER
Sbjct: 241 KFRWHRLRKRKFEWRWSLTKLKGFVRVCSSEEEMEVLRAAEEFEAFNRACLTYSSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS+FDLTKNDRVQIQIHGNSI+GRILHGNSPSELIE FTETIGRP E
Sbjct: 361 TSKMRSLYLEGYEYSIFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIECFTETIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWIISGAVVGMQGGTNIVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 481 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSN+RTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR
Sbjct: 721 AFTTVFRTHEGNKPSCNSQFYSNNRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKN V+AYFPL +SSSWQHIWTGEVY
Sbjct: 781 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNYVKAYFPLDDSSSWQHIWTGEVY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 841 AKLGCEIKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. ExPASy TrEMBL
Match: A0A0A0L3I8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430880 PE=3 SV=1)

HSP 1 Score: 1653.6 bits (4281), Expect = 0.0e+00
Identity = 783/861 (90.94%), Positives = 824/861 (95.70%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLK+TKKHHIHLNNPFPSPP SFPLLQG+LSAN+QAL  YK FSIGKDF LLWRS+NG
Sbjct: 21  MTNLKVTKKHHIHLNNPFPSPPPSFPLLQGELSANYQALSSYKFFSIGKDFQLLWRSDNG 80

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 81  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 140

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHE EVK+HHFPSGYLGLD K+++K DAQFPMLLI+GRIFNT+KK+M  KKN+L
Sbjct: 141 KEINGCDHEFEVKEHHFPSGYLGLDLKNYEKEDAQFPMLLISGRIFNTEKKRMMKKKNKL 200

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYW+ FEQK+ SQIGFQVMLGQPSYE+RQ+AHSRGGF+RL
Sbjct: 201 QETSFNGDVKCNSKVLSASARYWVFFEQKSSSQIGFQVMLGQPSYEHRQIAHSRGGFNRL 260

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF LHRLRKRK EW+WSL KLKG VRV SSE+E+EV RAA EFE+FNRVCLTYSS+EKER
Sbjct: 261 KFRLHRLRKRKFEWHWSLTKLKGFVRVPSSEKEVEVLRAAEEFEAFNRVCLTYSSEEKER 320

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANL+SYR+GGDWSTTYAPSPFYM
Sbjct: 321 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLISYRAGGDWSTTYAPSPFYM 380

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS+FDLTKNDRVQIQIHGNS++GRILHGNSPSELIERFTETIGRP E
Sbjct: 381 TSKMRSLYLEGYEYSIFDLTKNDRVQIQIHGNSVQGRILHGNSPSELIERFTETIGRPPE 440

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 441 LPGWIISGAVVGMQGGTNVVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 500

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGIL+KKKNGE
Sbjct: 501 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILIKKKNGE 560

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPNTSSWFK+ILQEMV+DGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 561 PYMVPNTAFDVGMLDLTHPNTSSWFKKILQEMVNDGVRGWMADFGEGLPVDATLYSGEDP 620

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDP+EALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 621 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPEEALVFFMRAGFRNSPKWGMLFWEGD 680

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 681 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 740

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYS+DRTLSQFARFAKVYSAWKFYRIQLVKEAA+RGLPVCR
Sbjct: 741 AFTTVFRTHEGNKPSCNSQFYSSDRTLSQFARFAKVYSAWKFYRIQLVKEAAERGLPVCR 800

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKNNV AYFPLG++SSWQHIWTGEVY
Sbjct: 801 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVNAYFPLGDNSSWQHIWTGEVY 860

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 861 AKLGCEIKVDAPVGYPAVFIK 881

BLAST of Cla97C04G072200 vs. ExPASy TrEMBL
Match: A0A6J1DIT6 (uncharacterized protein LOC111021314 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021314 PE=3 SV=1)

HSP 1 Score: 1650.6 bits (4273), Expect = 0.0e+00
Identity = 780/861 (90.59%), Positives = 816/861 (94.77%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIH NNPFPS PTS P ++GDLSANFQALP  K  SIG+DF LLWR ENG
Sbjct: 1   MTNLKITKKHHIHFNNPFPSAPTSLPSVEGDLSANFQALPAIKVLSIGQDFQLLWRFENG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQT+DDI
Sbjct: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTVDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQKDAQFPMLLINGRIFNTKKKKM---KKNRL 180
           + ING DHELEVKDHHFPSGYLGLD+K H KDAQFPMLLINGRIFNTKKK M   KKNRL
Sbjct: 121 RVINGWDHELEVKDHHFPSGYLGLDQKMHLKDAQFPMLLINGRIFNTKKKMMRRKKKNRL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QETGFNGD+K N + PPASARYW+LFEQKN SQIGFQVMLGQPSYE RQMAHSRG F R 
Sbjct: 181 QETGFNGDLKYNPRAPPASARYWVLFEQKNSSQIGFQVMLGQPSYECRQMAHSRGRFDRF 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF LHRL+KRKVEWYWSLAKLKGCVRVSSSEEEME  R+A EFE FNRVC TY+S+EKER
Sbjct: 241 KFRLHRLKKRKVEWYWSLAKLKGCVRVSSSEEEMEGLRSAEEFEGFNRVCFTYTSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TS+MRSLYLEGYEYSVFDLTKNDRVQIQIHGNSI+G ILHGNSPSELIERFT+TIGRP E
Sbjct: 361 TSRMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIQGWILHGNSPSELIERFTDTIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGW+ISGAVVGMQGGTDA R+IWD+LK  +VPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWMISGAVVGMQGGTDAVRQIWDDLKVYKVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           DTTRY GW+QLIKDLGA+HIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGIL+KKKNGE
Sbjct: 481 DTTRYCGWKQLIKDLGAQHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILIKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
           PYMVPNTAFDVGMLDLTHPN+SSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 PYMVPNTAFDVGMLDLTHPNSSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           I AHNRYPEIWAQ+NREFADEWKS LVGKEKEDP EALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 IAAHNRYPEIWAQMNREFADEWKSKLVGKEKEDPHEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTT+FRTHEGNKPSCNSQFYSNDRTL+QFARFAKVYSAWKFYR+QLVKEAAQ+GLPVCR
Sbjct: 721 AFTTIFRTHEGNKPSCNSQFYSNDRTLTQFARFAKVYSAWKFYRVQLVKEAAQKGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTL HQQFLVGSEILVVPVLDKGKNNV+AYFPLGESSSWQHIWTGE+Y
Sbjct: 781 HLFVHYPEDEYVLTLSHQQFLVGSEILVVPVLDKGKNNVKAYFPLGESSSWQHIWTGELY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
            KPGCE+KVD PVGYPAVF+K
Sbjct: 841 TKPGCEVKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. ExPASy TrEMBL
Match: A0A5D3BR81 (Sulfoquinovosidase-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold517G00070 PE=3 SV=1)

HSP 1 Score: 1648.3 bits (4267), Expect = 0.0e+00
Identity = 790/861 (91.75%), Positives = 817/861 (94.89%), Query Frame = 0

Query: 1   MKNLKITKKHHIHLNNPFPSPPTSFPLLQGDLSANFQALPPYKAFSIGKDFHLLWRSENG 60
           M NLKITKKHHIHLNNPFPSPPTSFPLLQG+LSANFQ L  YK FSIGKDF LLWRS+NG
Sbjct: 1   MTNLKITKKHHIHLNNPFPSPPTSFPLLQGELSANFQVLSSYKFFSIGKDFQLLWRSDNG 60

Query: 61  GSLSIYHLSQPTRSIWSTIPGQAFVSAAMVETEVEESRGSFAVKDGAVHLVCNHQTIDDI 120
           GSLSIYHLS PTRSIWSTI GQAFVSAAMVETEVEESRGSFAVKDGAVHL+CNHQTIDDI
Sbjct: 61  GSLSIYHLSDPTRSIWSTISGQAFVSAAMVETEVEESRGSFAVKDGAVHLICNHQTIDDI 120

Query: 121 KEINGCDHELEVKDHHFPSGYLGLDKKDHQK-DAQFPMLLINGRIFNTKKKKM--KKNRL 180
           KEINGCDHELEVK+HHFPSGYLGLD K+++K DAQFPMLLI+GRIFNT++KKM  KKN+L
Sbjct: 121 KEINGCDHELEVKEHHFPSGYLGLDLKNYEKEDAQFPMLLISGRIFNTERKKMMKKKNKL 180

Query: 181 QETGFNGDVKCNSKVPPASARYWLLFEQKNGSQIGFQVMLGQPSYEYRQMAHSRGGFSRL 240
           QET FNGDVKCNSKV  ASARYW+LFEQK+ SQIGFQVMLGQPSYEYRQ+AHS GGF+R 
Sbjct: 181 QETSFNGDVKCNSKVLSASARYWVLFEQKSSSQIGFQVMLGQPSYEYRQIAHSSGGFNRP 240

Query: 241 KFGLHRLRKRKVEWYWSLAKLKGCVRVSSSEEEMEVFRAAGEFESFNRVCLTYSSDEKER 300
           KF  HRLRKRK EW WSL KLKG VRV SSEEEMEV RAA EFE+FNR CLTYSS+EKER
Sbjct: 241 KFRWHRLRKRKFEWRWSLTKLKGFVRVCSSEEEMEVLRAAEEFEAFNRACLTYSSEEKER 300

Query: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRSGGDWSTTYAPSPFYM 360
           FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYR+GGDWSTTYAPSPFYM
Sbjct: 301 FFGFGEQFSHMDFKGKRVPIFVQEQGIGRGDQPITFAANLVSYRAGGDWSTTYAPSPFYM 360

Query: 361 TSKMRSLYLEGYEYSVFDLTKNDRVQIQIHGNSIEGRILHGNSPSELIERFTETIGRPTE 420
           TSKMRSLYLEGYEYS FDLTKNDRVQIQIHGNSI+GRILHGNSPSELIE FTETIGRP E
Sbjct: 361 TSKMRSLYLEGYEYSTFDLTKNDRVQIQIHGNSIQGRILHGNSPSELIECFTETIGRPPE 420

Query: 421 LPGWIISGAVVGMQGGTDAARKIWDELKAREVPISAFWLQDWVGQRETVIGSQLWWNWEV 480
           LPGWIISGAVVGMQGGT+  RKIWDELKA EVPISAFWLQDWVGQRETVIGSQLWWNWEV
Sbjct: 421 LPGWIISGAVVGMQGGTNIVRKIWDELKAHEVPISAFWLQDWVGQRETVIGSQLWWNWEV 480

Query: 481 DTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKELGILVKKKNGE 540
           D TRY GW+QLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAK LGILVKKKNGE
Sbjct: 481 DATRYSGWKQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLYEEAKALGILVKKKNGE 540

Query: 541 PYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600
            YMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP
Sbjct: 541 HYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVDDGVRGWMADFGEGLPVDATLYSGEDP 600

Query: 601 ITAHNRYPEIWAQINREFADEWKSNLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660
           ITAHNRYPEIWAQINREF DEWKS LVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD
Sbjct: 601 ITAHNRYPEIWAQINREFVDEWKSKLVGKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGD 660

Query: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELN 720
           QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAV+LPFIKYRRSEELLLRWMELN
Sbjct: 661 QMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVNLPFIKYRRSEELLLRWMELN 720

Query: 721 AFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780
           AFTTVFRTHEGNKPSCNSQFYSN+RTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR
Sbjct: 721 AFTTVFRTHEGNKPSCNSQFYSNNRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCR 780

Query: 781 HLFVHYPDDEYVLTLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIWTGEVY 840
           HLFVHYP+DEYVLTLGHQQFLVGSEILVVPVLDKGKN V+AYFPL +SSSWQHIWTGEVY
Sbjct: 781 HLFVHYPEDEYVLTLGHQQFLVGSEILVVPVLDKGKNYVKAYFPLDDSSSWQHIWTGEVY 840

Query: 841 AKPGCEIKVDTPVGYPAVFVK 859
           AK GCEIKVD PVGYPAVF+K
Sbjct: 841 AKLGCEIKVDAPVGYPAVFIK 861

BLAST of Cla97C04G072200 vs. TAIR 10
Match: AT1G68560.1 (alpha-xylosidase 1)

HSP 1 Score: 106.7 bits (265), Expect = 1.0e-22
Identity = 128/567 (22.57%), Positives = 227/567 (40.04%), Query Frame = 0

Query: 350 YAPSPFYMTSKMRSLYLEGYEYSVFDLTKN--------DRVQIQIHGNSIEGRILHGNSP 409
           Y   P YM   +R++  + Y ++V  L  N        D +  ++ G   +   + G SP
Sbjct: 214 YGSHPMYM--DLRNVGGKAYAHAVLLLNSNGMDVFYRGDSLTYKVIGGVFDFYFIAGPSP 273

Query: 410 SELIERFTETIGRPTELPGWIISGAVVGM-QGGTDAARKIWDELKAREVPISAFWLQD-- 469
             +++++T+ IGRP  +P W +                 + D  K  ++P+   W  D  
Sbjct: 274 LNVVDQYTQLIGRPAPMPYWSLGFHQCRWGYHNLSVVEDVVDNYKKAKIPLDVIWNDDDH 333

Query: 470 WVGQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRR 529
             G ++  +    +        +   +   I  +G ++I +    +P +       N   
Sbjct: 334 MDGHKDFTLNPVAY-----PRAKLLAFLDKIHKIGMKYIVIN---DPGIG-----VNASY 393

Query: 530 NLYEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSWFKEILQEMVD----DGV 589
             ++ A    + +K + G+P++       V   D  +P T SW+ + ++   D    DG+
Sbjct: 394 GTFQRAMAADVFIKYE-GKPFLAQVWPGPVYFPDFLNPKTVSWWGDEIKRFHDLVPIDGL 453

Query: 590 ---RGWMADFGEGL---PVDATLYSGEDP-----ITAHNRYPEIW-------------AQ 649
                 +++F  GL   P      SGE P     +   N     W             A 
Sbjct: 454 WIDMNEVSNFCSGLCTIPEGKQCPSGEGPGWVCCLDCKNITKTRWDDPPYKINATGVVAP 513

Query: 650 INREFADEWKSNLVGKEKEDPQEALVF--------------------FMRAGFRNSPKWG 709
           +  +      ++  G  + D      F                      R+ F  S ++ 
Sbjct: 514 VGFKTIATSATHYNGVREYDAHSIYGFSETIATHKGLLNVQGKRPFILSRSTFVGSGQYA 573

Query: 710 MLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELL 769
              W GD   +WQ+   ++ +++ +L+ G+ G     SDI G+          + +EEL 
Sbjct: 574 -AHWTGDNQGTWQS---LQVSISTMLNFGIFGVPMVGSDICGFYP--------QPTEELC 633

Query: 770 LRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQFARFAKVYSAWKF--YRIQLVKEA 829
            RW+E+ AF    R H  N  S   + Y  D T++  AR A +   +K   +   L  EA
Sbjct: 634 NRWIEVGAFYPFSRDH-ANYYSPRQELYQWD-TVADSARNA-LGMRYKILPFLYTLNYEA 693

Query: 830 AQRGLPVCRHLFVHYPDDEYVLTLGH-QQFLVGSEILVVPVLDKGKNNVQAYFPLGESSS 853
              G P+ R LF  +P  EY    G+ +QFL+GS  ++ PVL++GK  V+A FP G   S
Sbjct: 694 HMTGAPIARPLFFSFP--EYTECYGNSRQFLLGSSFMISPVLEQGKTEVEALFPPG---S 744

BLAST of Cla97C04G072200 vs. TAIR 10
Match: AT3G45940.1 (Glycosyl hydrolases family 31 protein)

HSP 1 Score: 105.1 bits (261), Expect = 2.9e-22
Identity = 121/526 (23.00%), Positives = 211/526 (40.11%), Query Frame = 0

Query: 350 YAPSPFYMTSKMRSLYLEGYEYSVFDLT--------KNDRVQIQIHGNSIEGRILHGNSP 409
           Y   P YM   +R++  + Y +SV  L         + D +  ++ G   +     G SP
Sbjct: 211 YGSHPVYM--DLRNVSGKAYAHSVLLLNSHGMDVFYRGDSLTYKVIGGVFDFYFFAGPSP 270

Query: 410 SELIERFTETIGRPTELPGWIISGAVVGM-QGGTDAARKIWDELKAREVPISAFWLQDWV 469
             +++++T  IGRP  +P W +               + + D  +  ++P+   W     
Sbjct: 271 LNVVDQYTSLIGRPAPMPYWSLGFHQCRWGYRNVSVVKDVVDNYQKAKIPLDVIW----- 330

Query: 470 GQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNL 529
              + + G   + ++ +D    +   +L+  L   H   M Y    +       N    +
Sbjct: 331 NDADYMDG---YKDFTLDLVN-FPHAKLLSFLDRIHKMGMKYV--VIKDPGIGVNASYGV 390

Query: 530 YEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSW-------FKEILQ------ 589
           Y+      + +K + G+P++       V   D  +P T SW       F E++       
Sbjct: 391 YQRGMASDVFIKYE-GKPFLAQVWPGPVYFPDFLNPKTVSWWGDEIRRFHELVPIDGLWI 450

Query: 590 EMVDDGVRGWMADFG-EGLPVDATLYSGEDPITAHNRY--PEIWAQINREFADEWKSNLV 649
           +M +    G  A  G + +P  A  Y+G     AH+ Y   E  A      A + K   +
Sbjct: 451 DMNEINATGHKASLGFKTIPTSAYHYNGVREYDAHSIYGFSEAIATHKALLAVQGKRPFI 510

Query: 650 GKEKEDPQEALVFFMRAGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYA 709
                          R+ F  S ++    W GD   +WQ+   ++ +++ +L+ G+ G  
Sbjct: 511 -------------LSRSTFVGSGQYA-AHWTGDNQGTWQS---LQVSISTMLNFGIFGVP 570

Query: 710 FNHSDIGGYCAVDLPFIKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTL 769
              SDI G+             EEL  RW+E+ AF    R H        + +Y+  + L
Sbjct: 571 MVGSDICGFFP--------PTPEELCNRWIEVGAFYPFSRDH--------ADYYAPRKEL 630

Query: 770 SQFARFAKVYSAWKFYRIQLVK-------EAAQRGLPVCRHLFVHYPDDEYVLTLGHQQF 829
            Q+   A+        R +L+        EA   G P+ R LF  +P+      L  +QF
Sbjct: 631 YQWGTVAESARNALGMRYKLLPFLYTLNYEAHMSGAPIARPLFFSFPEFTECYGLS-KQF 685

Query: 830 LVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHIW--TGEVYAKPG 842
           L+GS +++ PVL++GK  V+A FP G   SW H++  T  V +K G
Sbjct: 691 LLGSSLMISPVLEQGKTQVEALFPPG---SWYHMFDMTQVVVSKNG 685

BLAST of Cla97C04G072200 vs. TAIR 10
Match: AT5G11720.1 (Glycosyl hydrolases family 31 protein )

HSP 1 Score: 100.1 bits (248), Expect = 9.4e-21
Identity = 128/550 (23.27%), Positives = 217/550 (39.45%), Query Frame = 0

Query: 350 YAPSPFYMTSK-MRSLYLEGYEYSVFDLTKN--------DRVQIQIHGNSIEGRILHGNS 409
           Y   PFYM  +  +     G  + V  L  N         R+   + G  I+  +  G S
Sbjct: 230 YGSHPFYMDVRGSKGNEEAGTTHGVLLLNSNGMDVKYEGHRITYNVIGGVIDLYVFAGPS 289

Query: 410 PSELIERFTETIGRPTELPGWIIS--GAVVGMQGGTDAARKIWDELKAREVPISAFWLQ- 469
           P  ++ ++TE IGRP  +P W         G +  +D    +    KA  +P+   W   
Sbjct: 290 PEMVMNQYTELIGRPAPMPYWSFGFHQCRYGYKNVSDLEYVVDGYAKA-GIPLEVMWTDI 349

Query: 470 DWV-GQRETVIGSQLWWNWEVDTTRYYGWRQLIKDLGARHIKVMTYCNPCLAPTDEKQNR 529
           D++ G ++  +      N+  D  + +     +  L     K +   +P +       + 
Sbjct: 350 DYMDGYKDFTLDPV---NFPEDKMQSF-----VDTLHKNGQKYVLILDPGIG-----VDS 409

Query: 530 RRNLYEEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSS-WFKEI--LQEMVD-D 589
               Y    E  + + K+NGEPY+       V   D  +P  ++ W  EI   QE++  D
Sbjct: 410 SYGTYNRGMEADVFI-KRNGEPYLGEVWPGKVYFPDFLNPAAATFWSNEIKMFQEILPLD 469

Query: 590 GVRGWM----------ADFGEGLPVDATLY---SGEDPITAHNRYPEIWAQINREFADEW 649
           G+  W+          +    G  +D   Y   +  D    +N+     +      ++  
Sbjct: 470 GL--WIDMNELSNFITSPLSSGSSLDDPPYKINNSGDKRPINNKTVPATSIHFGNISEYD 529

Query: 650 KSNLVG-KEKEDPQEALV--------FFMRAGFRNSPKWGMLFWEGDQMVSWQANDGIKS 709
             NL G  E +   +A+V           R+ F +S K+    W GD    W   + +  
Sbjct: 530 AHNLYGLLEAKATHQAVVDITGKRPFILSRSTFVSSGKY-TAHWTGDNAAKW---EDLAY 589

Query: 710 AVTGLLSSGLSGYAFNHSDIGGYCAVDLPFIKYRRSEELLLRWMELNAFTTVFRTHEGNK 769
           ++ G+L+ GL G     +DI G+         +  +EEL  RW++L AF    R H  + 
Sbjct: 590 SIPGILNFGLFGIPMVGADICGF--------SHDTTEELCRRWIQLGAFYPFARDH-SSL 649

Query: 770 PSCNSQFYSNDRTLSQFARFAKVYSAWKFYRIQLVKEAAQRGLPVCRHLFVHYPDDEYVL 829
            +   + Y  D   S   +   +      +   L+ EA   G P+ R LF  +P D    
Sbjct: 650 GTARQELYLWDSVASSARKVLGLRMRLLPHLYTLMYEAHVSGNPIARPLFFSFPQDTKTY 709

Query: 830 TLGHQQFLVGSEILVVPVLDKGKNNVQAYFPLGESSSWQHI--WTGEVYAKPGCEIKVDT 859
            +   QFL+G  I+V P L +G   V AYFP G   +W  +  ++  V    G  +++DT
Sbjct: 710 EI-DSQFLIGKSIMVSPALKQGAVAVDAYFPAG---NWFDLFNYSFAVGGDSGKHVRLDT 745

BLAST of Cla97C04G072200 vs. TAIR 10
Match: AT5G63840.1 (Glycosyl hydrolases family 31 protein )

HSP 1 Score: 97.8 bits (242), Expect = 4.7e-20
Identity = 99/403 (24.57%), Positives = 162/403 (40.20%), Query Frame = 0

Query: 471 LWWNWE-VDTTRYYGW--------RQLIKDLGARHIKVMTYCNPCLAPTDEKQNRRRNLY 530
           LW + E  D  RY+ W         ++ K L A+  K++T  +P +     K++    L+
Sbjct: 396 LWLDIEHTDGKRYFTWDSVLFPHPEEMQKKLAAKGRKMVTIVDPHI-----KRDDSYFLH 455

Query: 531 EEAKELGILVKKKNGEPYMVPNTAFDVGMLDLTHPNTSSW------FKEILQEMVDDGVR 590
           +EA ++G  VK  +G+ +           +D+  P    W      +K  +       + 
Sbjct: 456 KEATQMGYYVKDSSGKDFDGWCWPGSSSYIDMLSPEIRKWWGGRFSYKNYVGS--TPSLY 515

Query: 591 GWMADFGE---------GLPVDATLYSGEDPITAHNRYPEIWAQINREFADEWKSNLVGK 650
            W  D  E          +P DA    G +    HN Y          F       LV  
Sbjct: 516 TW-NDMNEPSVFNGPEVTMPRDALHVGGVEHREVHNAY-------GYYFHMATSDGLV-- 575

Query: 651 EKEDPQEALVFFMRAGFRNSPKWGMLFWEGDQMVSWQANDGIKSAVTGLLSSGLSGYAFN 710
            +E+ ++      RA F  + ++G + W GD    W   + ++ ++  +L+ GL+G  F+
Sbjct: 576 MREEGKDRPFVLSRAIFPGTQRYGAI-WTGDNTAEW---EHLRVSIPMILTLGLTGITFS 635

Query: 711 HSDIGGYCAVDLPFIKYRRSEELLLRWMELNAFTTVFRTHEGNKPSCNSQFYSNDRTLSQ 770
            +DIGG+     P        ELL+RW ++ A+   FR H  +       +   +R    
Sbjct: 636 GADIGGFFGNPEP--------ELLVRWYQVGAYYPFFRGHAHHDTKRREPWLFGERNTEL 695

Query: 771 FARFAKVYSAWKFYRIQLVKEAAQRGLPVCRHLFVHYPDDEYVLTLGHQQFLVGSEILVV 830
                        Y   L +EA   G+PV R L++ +P DE   +   + F+VGS +LV 
Sbjct: 696 MRDAIHTRYTLLPYFYTLFREANVTGVPVVRPLWMEFPQDEATFS-NDEAFMVGSGLLVQ 755

Query: 831 PVLDKGKNNVQAYFPLGESSSWQHIWTGEVYAKPGCEIKVDTP 850
            V  KG      Y P  E  SW  +  G+ Y   G   K+D P
Sbjct: 756 GVYTKGTTQASVYLPGKE--SWYDLRNGKTYV-GGKTHKMDAP 765
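
The identity and positives percentages in the HSP summary lines above are plain ratios of matching columns to alignment length. As a minimal sketch (the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool), they can be recomputed from a summary line like this:

```python
import re

# Hypothetical pattern for the HSP summary lines shown in this record.
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, pos, pos_len = (int(x) for x in m.groups())
    # Each percentage is count / alignment length, rounded to two decimals.
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

if __name__ == "__main__":
    line = "Identity = 99/403 (24.57%), Positives = 162/403 (40.20%), Query Frame = 0"
    print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check between the summary lines and the identity column of the hit tables.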

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038882634.1	0.0e+00	93.96	sulfoquinovosidase-like [Benincasa hispida]
XP_008455717.1	0.0e+00	92.10	PREDICTED: sulfoquinovosidase-like isoform X1 [Cucumis melo] >KAA0025932.1 sulfo...
XP_004144332.2	0.0e+00	90.94	uncharacterized protein LOC101219337 [Cucumis sativus] >KGN54706.1 hypothetical ...
XP_022153908.1	0.0e+00	90.59	uncharacterized protein LOC111021314 isoform X1 [Momordica charantia]
TYK01674.1	0.0e+00	91.75	sulfoquinovosidase-like isoform X1 [Cucumis melo var. makuwa]

Match Name	E-value	Identity (%)	Description
P32138	4.7e-110	34.93	Sulfoquinovosidase OS=Escherichia coli (strain K12) OX=83333 GN=yihQ PE=1 SV=3
Q9F234	7.0e-37	22.36	Alpha-glucosidase 2 OS=Bacillus thermoamyloliquefaciens OX=1425 PE=3 SV=1
P96793	5.9e-36	25.14	Alpha-xylosidase XylQ OS=Lactiplantibacillus pentosus OX=1589 GN=xylQ PE=1 SV=1
P31434	4.2e-34	23.04	Alpha-xylosidase OS=Escherichia coli (strain K12) OX=83333 GN=yicI PE=1 SV=2
B3PEE6	5.5e-34	22.77	Oligosaccharide 4-alpha-D-glucosyltransferase OS=Cellvibrio japonicus (strain Ue...

Match Name	E-value	Identity (%)	Description
A0A5A7SLB8	0.0e+00	92.10	Sulfoquinovosidase-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C...
A0A1S3C1N4	0.0e+00	92.10	sulfoquinovosidase-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495822 PE=3 ...
A0A0A0L3I8	0.0e+00	90.94	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G430880 PE=3 SV=1
A0A6J1DIT6	0.0e+00	90.59	uncharacterized protein LOC111021314 isoform X1 OS=Momordica charantia OX=3673 G...
A0A5D3BR81	0.0e+00	91.75	Sulfoquinovosidase-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E56...

Match Name	E-value	Identity (%)	Description
AT1G68560.1	1.0e-22	22.57	alpha-xylosidase 1
AT3G45940.1	2.9e-22	23.00	Glycosyl hydrolases family 31 protein
AT5G11720.1	9.4e-21	23.27	Glycosyl hydrolases family 31 protein
AT5G63840.1	4.7e-20	24.57	Glycosyl hydrolases family 31 protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000322	Glycoside hydrolase family 31	PFAM	PF01055	Glyco_hydro_31	coord: 396..858; e-value: 7.0E-63; score: 213.3
IPR013780	Glycosyl hydrolase, all-beta	GENE3D	2.60.40.1180	-	coord: 806..859; e-value: 4.1E-6; score: 28.7
None	No IPR available	GENE3D	3.20.20.80	Glycosidases	coord: 400..805; e-value: 4.3E-84; score: 284.7
None	No IPR available	GENE3D	2.60.40.1760	glycosyl hydrolase (family 31)	coord: 221..398; e-value: 2.6E-20; score: 74.9
None	No IPR available	PANTHER	PTHR46959	SULFOQUINOVOSIDASE	coord: 33..864
None	No IPR available	CDD	cd14752	GH31_N	coord: 282..413; e-value: 5.11967E-22; score: 90.3228
None	No IPR available	SUPERFAMILY	51011	Glycosyl hydrolase domain	coord: 774..858
IPR044112	Sulfoquinovosidase YihQ-like	CDD	cd06594	GH31_glucosidase_YihQ	coord: 413..751; e-value: 0.0; score: 524.07
IPR017853	Glycoside hydrolase superfamily	SUPERFAMILY	51445	(Trans)glycosidases	coord: 403..772
IPR011013	Galactose mutarotase-like domain superfamily	SUPERFAMILY	74650	Galactose mutarotase-like	coord: 259..401
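
The alignment coordinates reported for each InterPro match can be compared directly to see how the predicted domains line up. The following is a minimal sketch, assuming the `coord: start..end` values are 1-based, inclusive residue positions on the predicted protein (an assumption; the record does not state the convention). The function names are hypothetical.

```python
import re

def parse_coord(text):
    """Parse a string such as 'coord: 396..858' into a (start, end) tuple."""
    m = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", text.strip())
    if m is None:
        raise ValueError("unrecognised coordinate string: %r" % text)
    return int(m.group(1)), int(m.group(2))

def span_length(span):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1

def overlap(a, b):
    """Number of residues shared by two spans (0 if they do not overlap)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)

if __name__ == "__main__":
    pfam = parse_coord("coord: 396..858")   # PF01055, Glyco_hydro_31
    yihq = parse_coord("coord: 413..751")   # cd06594, GH31_glucosidase_YihQ
    print(span_length(pfam), span_length(yihq), overlap(pfam, yihq))
```

Under that assumption, the cd06594 match lies entirely inside the PF01055 match, so the overlap equals the full length of the cd06594 span.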

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C04G072200.2	Cla97C04G072200.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0005975	carbohydrate metabolic process
molecular_function	GO:0030246	carbohydrate binding
molecular_function	GO:0004553	hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function	GO:0003824	catalytic activity