Cp4.1LG17g08300 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG17g08300
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionBeta-glucosidase 12
LocationCp4.1LG17: 5377093 .. 5390586 (+)
RNA-Seq ExpressionCp4.1LG17g08300
SyntenyCp4.1LG17g08300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTATTGAATTAAATGATCAGAAAAAATATGGGACCATAGCAGTGGAGAAGTGGCTACCGATTTCTACCATCGTTATAAGGTAACATTTTTACATTCTTATTTTTCACTTTATCGTAACCTAAAGACCTTTTTATATTTAATCACGTCCATATCTACAACTACATTATTTTCGTTAATTTACAATATTAGTATTTAAATTATTTAGATGTGTCAAACACGTCTCTTAACTATTAATTATATCACTCTGAAAATTCAACTAACAGACTATAGATAATTGTGAAGTCTAATTAAATTAGCAATTAAATTCATCCAAATTTTCGGGTTATTCTAATATTTTTTAATATTTAAATATTTAGGAACTTATTATATATATTTGGTTATTATTAAAAGTAAATTAATAATATTGGATCATAATTATAATTATATATATATTAAAAATGTGTGATTTAATTTGTAGGAGGACGTGCAAATAATGAAGAAGATGGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATTCTCCCCAGTAATTTTTTTTTTACCCAAAATTAATTATTTCCATTTCCATCGGTAAATTTTTGTTTTTTTTACCGTTTTATTTTTCCTTCGAAAAAAAAAATGACAGAGGGAACCGTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTTTACAACAATCTCATCAATGAGCTCCTAGCCAATGGTAAAAAATTTAAAAAGAGGTTAATTTTATGATTAAATTAAGGTGAGAATTATATTTTTATTTATTTTAAATAATTACTTATCGTGAAAAAAAAAATACAGGAATAATACCTTACGTCACTCTTTTTCACTGGGACCTTCCTCAAGCACTTGAAGATAAGTACGATGGATTTCGAAACGTCAAAATTGTGTAAGTAAAATAATTAATTTTATTATATAAAATTAAAAGTTTACTATGCTTAATTTTATATTATAGGAGTCAAATTTAAACATTAATTGTAATTAATTAAAAATTAATTAATGTAATTTTGGTGATTAAACAGGAATGATTTTCGGAATTACGCGGACTTATGCTTCAAATTATTCGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCATTTAGCGCTTATGGGTACAACAGCGGCACTTTCGCTCCTGGAAGATGTTCCAACTACGTCGGGAATTGTACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTTGCTCACAATCTCCTCCTCTCCCATGCCGCTGCCGTCAAAGTCTACAGGACAAGGTACCAGGTCAGTATTCAAGAACAAATCCCTAAATCAATCTCTTCATTGTTAATCAGATTCTTGTTTTGGACACCGATTTTGCTAATTCGGTTTCATTTGAGAGCTACAGGCAAAGCAGAAGGGGAAGATTGGAATCACATTGGTGACTCACTGGTTTAGGCCCAAACGCAACACGCCAGCTTCGCGAGCGGCGGCGAATCGAGCTCTTGATTTCTTCTTGGGATGGTGAGTGTCTTGATCGTAATCGTTAGTGGAGAAATGTTGGTGAAAATTAAGTAACTGAGTTGAAATGAATTTTAGGTTTCTGCATCCGATTACTTATGGCACCTACCCTAAATCGATGCGTCAGTACGTTGGAGATAGACTGCCGAAATTCTCCGCGGCAGAGTCAAAAAGTGTTAAAGGATCAATGGATTTTCTTGGAATGAATTACTACACTGGCAATTTTGCTGACAATGTGCCTTTCTCGAATTCGCCTAACAAAAGCTATAGTTCTGATTCACACGTCTCATTCTCTAGTATGTCTTCTTTCTTTTCTTTGACGAGATAATTAAATTTTAGGGCGAATTTCGATCTTTACACCTAAATTCGAATAAGCCATGCAATTTCTATGTTTGGAATTCATGAAATGTGATATTAGACGAGAGCTGAATTTTGATTGAGATTTGTGAAGTTTTAGAATTTTATTGAGTGAGTTCTTGAAGCTTGGTAGATGGTAGGTTGTTTTTTGTTTGAAAGTTTAGACTAAAATATGATTCAATTGTCAAGTTTAAGTGATTTTTCATTCGAACCAAACCTTTTTCTTATCCTTAACATGAATTTTTTCTTCTACAGCGGAGAAAGATGGTGTTCTAATTGGACCAGCGGTTCGTACTCTCTCTATTTATCCATCTAGAGCTGTTATTTTCATTACTTTAATTAGCAGCTGAGCGATTTATTTATGCTTCTGCAGACTGGTTTGAACTGGCTTTACATCTACCCAGAGGGCATCCGTCTACTTTTGAAATACGTTAAAGCAGAGTACAAAGATCCAGTTATTTACATCACCGAGAATGGTACCTCTCTTTCTCTCCAATCAATGGCTATACAACGATGCATTTAGTTAGTCATATGAAAATTGAAATTTGAAGACAAAAGATCGAATAAAAAGTTTGTATTTCAAATATGAGATCCCACGTCGGTTGGAGAGAGGAATGGAACATTTCTTATAAGGGTGTGGAAACCTCTCCTTGGTAGAGGCGTTTCAAAAACCGTGAGGGTAATGGCGATATGTAATGGGTAAAAGTGAACAATATTTGTTAGTGGTGTACTTGGGCTGTTACAAATGGTATCAAAGCCAAACACCAGACGGTGTGCCAGCGAGGACGCTAGGCTCTCGAAGAGGGTGGATTGTGAGATCTCACGTCGGTTGGAGAGGGGAAAGAAACATTTCTTATAAGGGTGTGGAAACATCTCCCTAGTAGATGCATTTTAAAAACAGTGAGATTGATAGCGATACGAAACGGACCAAAGCGGACAATATTTGCTAGCGGTAGACTTGGGCTGTTACAAATGATATCAAAGTCAGACACCGAACAGTATGCCAGTGAGGACACGCTAGGCTCCAAAGGGGGTGGATTATGAGATCCCACATCGGCGGTTGGAGAGGGGAAGGAAACATTTTTTATAAGGGCGTGGAAACCTCTCCCTGGTAGACACGTTTTAAAAACTGTGAGGCTGACGGTGATACGTAACGGACCAAAGCGGACAATATTTGTTAGTTGTGGACTTAGACTATTACAATCATATTTACCGTGTTTTTGGATGTATTTGTAAATAGGTTTAGAAATGATTAGAAATGTGTTTGGTTGAATCTCATTGTTTATCAATTTATGAACATGTTTTCTGCTAAATGGATACTAGACAAGCCCTAAACTATTGACTTGTTGTGGTGTACAGGTATGGCTTATTCAGACAATATGACACTGCCAATTAAGGAAGCTCTAAAAGATGGAACAAGGATCAAATACCACCACGCCCATCTTGCATCTCTTCTTCAAGCTATCAAGTACGAATGATCTCATAAAACATCAAACTCCATTGATAAGAACAGATGAACTCAGAACTTCTCTAATGGTTGAGCTTTTGAAATGACAGGGAAGGAGTGAATGTGAAGGGATACTACGCCTGGACTTTTCTGGACGACTTTGAATGGGACGCAGGCTACACGGTGCGGTTCGGCCTCGTCTACATCGATTTCAGGCACAAATTGGGAAGGTACCTCAAGTATTCTGCTTACTGGTTAAAGAGGTTCCTACTTCATTGAGCTTAGCAGTAGAATCTCTCCCACATGCCGCCGTTTCTTGTCCAATGGAACCACTATTGAATAAGGAGAGGGCGTTGGAGTGGGTAACATGAGAAACTTTTGTGTCAAGAACTTCTTTCCTTCTGCAATTTGAGTTTGTTGATTGATATTTCTCAATAAAAGGTTTAGAATCTTGAGTGTAAAAGTGTTCATGAATCGAGTCGAACTAGTTCAAACGGATCGACCCGGTACAATAAGTCAAGTTATTATGTAGCTTTTTTAAGTTGATTCCGCTTAAATATTTATGTAATTGTTGATCATTGTAATCGCCCTTGTTTTACTATTAATAATATGGCTTCTTTATGAACAATTTTGCATTGCATTGAACCAAGAATGAACCCTTTTAGAGATTTTTTTTTTCTACACTTTTTGGAGGCAGAATTAGATTTCAAGGGCAAACTTTAAGTTAGCTAGGGGGTACATGGTCCGGGTTGAGGGATTTTTTTGACCCAACTCAAAAGTTCGCGTTGGTTGGATTGGTAACCCAACCCGAAATTTTTCACGACCCAACCCAACCCAACTCTCCATTTTCGGGTTGGGTTCGGGTTGGGTTGTTAATCTTTTAAAAAAAAGTTTTATTTATCAACAATTTATAATTATTCGGATACAATCTTATATAAAATATATTAAAAATTTATAAACAAACACAAATCAATCCATAATTATTAAAAATAAAAATAAAAACAAATTCAATAAAAAAACAAGAAGCAAAAATAGTAACATATTAATATAGTTCGGGTGAAAATTGTGACATATTAACATAGTTTAACATGTTATAAAACTAAAGAATGTTAAGTACGAACATTTCTTGAAAATGGGGTTAATAGAAAAAATATGAAAATATATGTTTATTATACGTGATTTGATAAAGATAGAGAGTCCTAAACTATTAGAAAAATAATATATATATAAATTGTGATATAATAATATTACCACAAATAAAAAATAAATTCTAAATATATTTAAACTAAAATTTAGAATTGCTTGTCTAGGAACATTTGTCTAGGAACATATATTTGTTAGTAAGGTATTTTTTTTGAAATATATATATATATATATAATTTATGCTGTATAATTTATGAAGATATAATTGTTTGTTAGTGAGTTATTTTTAAAATATATATATAATATAATATAATAGGAAAAAAAAAACATGTATTTGGATATATATAATTTATGAACGTATAATTGTTTGTTAGTAAGGTATTTTTAAAAATATATAGTTTTCTTAGACCGGCTAAAGTACAACTCTGGCAAACCTATTATAGATAAGATTTCTTAATATATTTTATATATTTGAATTTGTCTCGGGTCAACTCGAGAGTTTTATCCACGAACCCGAAAAGAAACCGATTCACTTCGAGTTCAGAAAAATGAATCCAACCTAACTCTTGTAGTTCGGGTTGGGTTGATCCGGGTTGTCGGGTTGAATGTACACCCCTAACATGATAAGTGGCTCTATTGTTGATTTGAGTGGACAAATTGTAAGATGTATGATGGCGTGTGCCATGACCCCTGCACATATCATAACTATGATTGTTATGTTGTATGTTGACTTTAAGCATACCATGTTATGTCATATTACAAGCCACCGTACCACTTTGTTATGTCATGATATACCATGTTATGCTATGGGGTGTTATTTATGTTCATAAAGCTATGTTATAGAGTGATTTTTCATGTTGTACAATTATATTACGATGTTGATAAACTATTATACATAAACCTCCATACACCACGCCATGACGACTATATCATATTAAAAAATCTGACAGGAAGTATGTATGCATGATTCATAGACTCATTACATTATAAATGAACGAAGGTTTGACTTGCATCATAAATGCGTACTTGATGTTATGTTGGCCCTTTCTCTCTTGGTGTTCAGTAGCCCATTAGTGAGTGCTAGCCCCGACGAACTAGGGTCATGGGGTACATGAGCTTACCCAGTGGGTCCATATGCACGTAAAATGTTAATTCCATGACACATTTTATAAAATATTTTAAATAGATAGTTTTTAACACGCATTATAAAACATTAATTTCATCCTCGTCAAAATTCAAACCTCCTCGGACATAATAAATAAATAAATAAATTCCAAATCAACATTAAATCTTGAATGAATAATTGGAAACAAGATAAGGGCGTCCAACTCATTTTAGCAATAAGATTATGATATTGATAATGAGTCACGTTTATATTTAAATTATAAAATAATAATGTACTTTTGACAAAATTTCTTTAAAATGGAGGGGCCAAATTATTAAAATAAGGATCCATTATTAGTCATTTCTATATATAAAATTAATTTTAAATTCATATGATAATACAATTTTGCTTAGATTTTGTTATAATTTTAAATTAAAATTGCATGATTTCAATTTTGTATTATTAGTTTTGTCATTAGAACTAAATTAATAATTACGTTATTTTTTTTAATAATTACGACCTATTTGAATAATAGAGATTAATGATTACAATATTGACTTCAACAATAATTACAGACAAACAGTGTAAAAAGACTAATGATTACAACAATAATTATGATAATTTTATTAGCAGTTGGTCGGATGAAAAACAACATTCAAACAAAATGTCAAAAATCAAATAAATAAATATATAATTTTTTAAATAAAAAAGAGTTTTAAACTCTTAATATCACAAAAAAAAAAAAAAGTGTATTTATTCATTTATTTATTCTGAAAATTTAAAATGTGTAATTATATTAAAAAACAATAATAACTTTAATAAATATAATATTTCATATTAGTAATTTAATTATTTTTGTTAAATTAAATTATGCATTATAATTATATGATAGTGTAAAATAATAATTATATAAATTAAAATATGTATTCAACCTAATATTTAACTATTTTTATAATTTATATAAAATACAACTTTACATCCATTTTATCTAAAAAAAATACTTCCAAACTTTCAATGGTCGCAAAAATACCCCAAACTTTTAAAAAATCATTAATATCATTCAACAGTTTTAAAAAGTTCAAAAACATCCCGGTAAAATTTTAAAATTAATATTAACACCCGTACCCGTTGTCTCTGTAAAAAAAATGTTTGATTTTTTTAACTATTAATGTATGGACGAAAATTGTCTATTGACACCTTATCAATTATATCTGAACTTTTTAATATTAATTTTAAAAAATTTATGGATATTTCTAAAACGTGATGGTAAAAATATTTTTTTAATTTTTTTTTAGTTTAAGTTTAAAGAGTATTTTTTAAAATTTTAAAATTTAATTAAAACTGGGATAAGGTGCTAAATTAATAAAAATATTAATTTAGCGATAATTGAAAGGTAAAATTAATACTATTCATTTAATTCCCTCTCTGACAGGTAAAGCAGCTTTGGTCCACCAAATAATGCAAGGCGACATGTCGGCAATCATCTTTATTTATTTATTTATTTATAATAACCATCCATTTAGGGAAGGTTTCCACACCTTCACTCTTTCCTTCCTCCAATTGATATGGGACCACCCTCAAATCCACCCCCCTTGGGGACCCAGCGTCCTTACTGGCACATCGCCTAATCCACCCCCCTTGGGGGCCAGCGTCCTTACTGGCACATCGCCTCGTGTCTACCCCCTTCGGGGAACAGTGAGAAGGCTGACACATCATCCGGTGTCTGACTCTGATACCATTTGTAACGACCCAGGTCCACCGCTAGAAGATATTGTCCTCTTTAGGTTTTTCCTTTCGGACTCCCCTCAAGGCTTTAAAACACGTCTGCTAGGGGAAGGTTTCCACACCCTTATAAATGGTGATTTCTTCTCCTTCCCAACCAATGTGGGACATCACACTTAATTATAAGAACTCCATAATTAAACATGTTTTACCCCGAATAATCTATTGGATAATGTTCACATTTACAAGCGAGAAGTAAATAAAGTGACCATGTTGCTAATTTTAACTGACGAAAGAAATATAGAGGACAACTATATGGCCCGTAGGAATTACGACTCTCCACAATGGTATAATATTGTCCACTTTGAACATTAAAAAAAAAAAAAAAACTCATTCTACAGTAGAAAGTGTTACTTACTTATAAATTCATAATATTTTTCTTAATTTGTCACTGTGGGACTACTTTATCAATAATTCTCAATAATTCTCAATAATTATCACGTGTTGTAATTCCAAATTTGTCATGAAACACGTATTTCAAACCGACATAATCTTTTATATATATATATATATATAGAGACCCAATTTATTATTATTATTATTATTATTATTATTTAAATAATTTGTGGCAGTCTTTTTTTTTTAGAATAAATTTTTTAAAAAAAGTGATGTGGTCGTATAACTAGACAATGAAAGTACATAAATGCATTAGATTTTTTTGAATTGAGTGATGGGGAGAGTGAGTACTGAAATATCCATTTTTGAATTTGTATATTATCCAGTTATTAAATAAATTAATTTATGTCATTTTTTTCATACTTTTAAAAAAATATTAAAAAAAATAAATGAATTAAATCATGTCCTCCCCCACCGCTCCATTATTTAAACGCGAGATGGGGTTGAGGATCTACACAAGATCTGACGACGAGTAGCAGCATGGCGGCGGCTAGTAGTGCTCCGGTGCTTCTGATAATGCTCATCGCCGCCGCAACTGCCGGCAGCGGTTTGGCTGATGGTGTGGAGCCGAGTCATAGTTCGGTTCCGTTCAACCGGAGCAGTTTCCCGCCGGGTTTTGTGTTCGGAGCTGGCTCTGCCGCTTACCAGGTCCTCCTCCCTCCCCCTCTCGCCATTTTTAGGGTTCCGCCGTACCGGCGCTGCCATTGCCATTTTCGNAAATAAAAAAAAAAAAAAAAAAAAAAATTGCAGTTGGAAGGAGCAGCAAGTATAGATGGAAGAGGTCCAAGTATTTGGGATACTTTCATTAAGAACCACCCAGGTCTCTTTCTTCATTATCCAATTATTTTCATTTTTGCCTCATTAAATAAAAAAACATTGAAAATTTATAAATATTATACTTTTTAAATATTAAAAATGTTAAACGAACACGACTCTCCACAATGGTACGATACTATCCACGGCTTTGCTTTGGGTTTCCCCGAAAGACCTCATACCAATGGAGTTAGTATTCCTCACTTATAAACCCATGATCATTCCCTAAATTAGACAAATTAGCCGATGTGGGACTTTCATCATCCAACACCTCCCCTTAATCGAGGCTAGACTCCTTTTCTTTTGGAGTTCTTCTTAGGCCTTCGAGGAGGCTTGACTCATTTTCTTTTGGAGTTCTTCTTAGTCATTTTTTACTGCCTTCGAGTAGGCTTGACTCCTTTTCTTTTGGAGTTATTTGTTCGATATTTGAGGATTTACCAATCTACTGGCACGACTAAGTTTAGGGCATGGCTCTGATACCATGTTAGACGAACACTACTCTCCACAATGGTATAATATTGTCCACTTTAAGCATAAGCTCTCATGGCTTTGCGTTGGGCTTCCCCAAAAAAGCCTCATACGAAAGTTAGTATTCCTCATCTATAAACCCACGATTATTCCCTAAATTAGCCGATGTGAGACTTTCATCATCCAACAAAAAAAAACCAACAATGGTAAGTTATTGTCCACTTTGAGTGTCCCAAAAAGCCTCGTAACAATGTAGAGTGTATTATTTGTTTATAAACTCATGATCGTTTCCTAAATTAGATGACAGAGACTTTCATCTCCAACAACTCTAACCCTGTTTTTTTTTTTTTTTTTTTACTTTTTAGAAAGTATATTTTTTTATTTTTTTAGTTAAAGGGTATTATTGAAATTTTAAAAAAAGTTTAAATTTATTTTTAAGATAAAAGGATAAAGTTTAGCGATATTTTTTTAAATAATTTATTGTAATTATTATTATTATTTTTTACTAATATAAAAGTTAGTAGTTGGAAAAACTAAATAAGAAGAATTGATTGAAATTTTAAAATTTATATGAAATATATTAAATGATTAAATTTAATGGATGTAAATATCTAATTTTTATTAATTTAGTGTTTTTTTTTATTTTATTTTTTAAAGTATAAAAATTATTCATGCTTAGTAAATTGAAAACATAGTAATAATAATGATATTAATTATTTATAATATAACGCGCTCATATGGTTGATGGGTTATTGAATTATATGATCAGAAAAAATATGGGACCATAAAAATGGAGAAGTGGCTACCGATTTTTACCATCGTTATAAGGTAAAATTTGATATTTTAAAAATTTAGAGATTTATTAGACATATTTTATTATATATTGTGAATATTAATTAATTATATATATATATATATTTTCAGGAGGATATACAATTGATGAAAAAGATTGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATCCTGCCCAGTAAGTTTTTTTTTACCCAAAATTAATTATTTCCATTTCCATGGGTAAAAAATTGTTTTTTTTACCGTTTTATTTTTCCTTGTAAAAAAATGACAGAGGGAAACCTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTCTACAATAATGTCATCAATGAGCTCCTCGCCAATGGTAAAAAAATTAATGCCTTGTTTTGTGAAATTTCTTCAAATAATAATAATAATAATAATAATAATAATAAATCAATTATAATATAACATTAAATCACCCCTAAATATTAATTCTTAGAATTATACTAATTATTTATTGAATTATTGAATCACGCTCGCAAATAAATAAATACAGGAATAATACCATACGTCACTCTCTTTCATTGGGATCTTCCTCAAGCACTCGAAGATGAGTATAATGGATTTCGAAGCGCTAAAGTTGTGTAAGTAATTAATTAGTTTTATTAGACATAAAAATAAAAATTTATAATTAATTTTATATTTAATTTTAAAATATTGAATGAATAATTCTAATATTTAAAATTATTTATTTATTTTTTTTTGAAGACGAAGAGAGTAAATTTATAATTAGTAACTAATCAAAATTAATTAATGTAATTTTGGTGATTAAAACAGGAATGATTTTCGGGAATACGCGGACTTATGCTTCAAATTATTTGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCGTTTACCGTTTTTGGGTACAACGGCGGCACTTTCGCTCCTGGAAGATGCTCCAACTACGTCGGAAATTGCACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTCGCCCACAATCTCCTCCTCTCCCACGCCGCTGCCGTCAAAGTCTACAGGACAAAGTACCAGGTTAGTATTCAAGAACAAAACCCTAAATTAATCTCTTCGTAACATAATCGGAATCTTGTTTTGGACACTGATTTTGCTAATTCGAATTCACTTGAGAGCTACAGGCAAAGCAGAAGGGGCAGATTGGAATTACATTGGTGACTCACTGGTTTAGGGCCAAACGCAACACGGCAGCTTCCCAAGCGGCGGTGTATCGAGCTCTTGATTTCTTCTTGGGATGGTGAGTGTTTCTATATTAATCGTTAGTTGAGAAATGTTGGTGAGAATTGAACTGAATCGAAATGAATTTTAGGTTTCTGCATCCGATTACTTATGGCGACTACCCGAAATCGATGCACCAGTACGTCGGAAATAGGCTGCCGAAATTCTCCGTGGCAGAGTCACAAAGCATTAAAGGATCAATGGATTTTCTTGGAATGAATTACTACACTGGAAATTTCGTTGACGATATACCTTTCTCGAATTCGCCTAACATAAGCTACAGTTCTGATATGCACATCTCCCTCTCGAGTATGTCTTCTTTCTTTTCTTTGACAAGATAATTAAACTTGAATAAGCTACAATTCGATTTCTATGTTAGCAATTTATGAAGATATGATGTTGGACCAGAGTTGAATTTTGATTGAAATTTGTGAATTCTAAGAACTCTATTAAATGGTCAATTTGGCTTGAAATGAGTGAGTTCTTTCTTGTTCCAAAGTTTAGAGTGAAAATTAATACAGTTTTTGAAGTTTATGTGATTTTTCATTTGAATCAAAGCTTTTCTTTATCCTTTTAACATAATTCTTTTCCTCTGCTACAGCGGATAATGATGGTGTTCTAATTGGACCTGCGGTTCGAACTCTCTCTATCTTTACTTTAATTAGCAGCCGAGCCATTTGTTAATGAACTGTAAATGGGTTCTTTTGCTTCTGCAGACTAGTTTGAACTGGCTTTACATCTACCCAGAAGGCATTCATCTACTTTTGAAATACATTAAAGAAGAATACAAAGATCCAGTTATTTACATCACCGAGAATGGTACCTTTCTTTCTCTCCAATCAATGGCTATACAACGTTGCATTTAGTTAGTCATATGAAAGATGTGTTAGTCCACATCGGCTAATTTAGGGAATGATCATGGGTTTATAATCCAGGTATGAGACCTTTTGGGGAAGCCCAAAACAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCGTTTAACATGGTATCAGAGCCATGCCCTAAAGTTAGTCGTGCCAATAGATTGGTAAATCCTCAAATATCGAACAAAGGACTCCAAAAGAAAAGGAGTCAAGCATCCTCGAAGGCAGTAAAAAATGACTAAGACTCCAAAGGAGTCGAGCCTTGGTTAAGGGGAGGCGTACTTTGTTCGAGGGGAGGTGTACTTTGTTCGAGGGGAGGTGTTGGATGATTAAAGTCCCACATCGGCTAATTTAGGGAATGACCATGAGTTTATAATCAAAGAATACTCTCTCCATTGGTATGAGGTCTTTTTGGGAAGCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACTATTGTGGAGAGTCGTGTTCGTTTAACAAGATGAAATTTGAAGACAAAAGATAGAATAAAAAGTTTGTATTTCATATTTCCAGTGTTTTTGAATGTATTTGGAAACAGATATAGAAATGATTAGAAACGTGTTTGGTAATCTGACAATAAAAATAGTTGAATCTCATTTTGGATACCAAACAAGCCCTGAACTATTGGCTTGTTGTGGTGTAAAGGTATGGCTTATTCAGACAATACGACACTGCCAATTAAGGAAGCTCTGAAAGATGGAACAAGGATCAGATACCACTACGCCCATCTTGAAGCTATTCTCCAAGCTATTAAGTATGAATGATCTGATAAACATCAAACTCCATTGATATGAACAGATGAAATCAGAACTTTTCTAATGGTTGTGCTTTTGAAATGACAGGGAAGGAGTGAATGTGAAGGGATACTACGCCTGGACTCTTATGGATGACTTTGAATGGGACGCAGGCTACACGGTGCGATTCGGCCTCATCTACGTCGATTTCAGGCACAAATTGGGAAGGTACCTCAAGTATTCTGCTTACTGGTTGAAGAGGTTCCTTCTTCATTGAGCTTTGCTGTAGAATCTCTCCCACTTGCTGTAGCTTCTTCCCCAATGGAACCACTAATGAATAAGGAGGGGAGGGGAGTGGAGTGGAGTGGGTGTCATGAGAAACGTTTGTGTTTCATTTGTGTCGGGAATCTCTTTCCTTTTGCAGTTTGTGTGATATTTCTCAATAAAAGGTTTAGAATGTTGAATGTAGAAGTGTTTATGAATCGAGTTGAAGTAGTTCAAACCATATTCATGGGCTCTTTAGTTCTTTTAAGCCAGCTTTGGAAGAAAGAGACCAATGTGGTGAGAGAGATACCACTTTTCCAATTATGATTAAAGTGGGTGAACCAATTCTGTTGCTTTGATGATTTCATCTGAAAGGTCCTTCAGTTCTGCAAAAACCTGCATAATTGTAACACATCCAAGCAAGATTTGACATGACCCCTCGATA

mRNA sequence

ATGGAAAAAATATGGGACCATAGCAGTGGAGAAGTGGCTACCGATTTCTACCATCGTTATAAGGAGGACGTGCAAATAATGAAGAAGATGGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATTCTCCCCAAGGGAACCGTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTTTACAACAATCTCATCAATGAGCTCCTAGCCAATGGAATAATACCTTACGTCACTCTTTTTCACTGGGACCTTCCTCAAGCACTTGAAGATAAGTACGATGGATTTCGAAACGTCAAAATTGTGAATGATTTTCGGAATTACGCGGACTTATGCTTCAAATTATTCGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCATTTAGCGCTTATGGGTACAACAGCGGCACTTTCGCTCCTGGAAGATGTTCCAACTACGTCGGGAATTGTACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTTGCTCACAATCTCCTCCTCTCCCATGCCGCTGCCGTCAAAGTCTACAGGACAAGGTACCAGGTCAGTATTCAAGAACAAATCCCTAAATCAATCTCTTCATTAGCTACAGGCAAAGCAGAAGGGGAAGATTGGAATCACATTGCGGAGAAAGATGGTGTTCTAATTGGACCAGCGACTGGTTTGAACTGGCTTTACATCTACCCAGAGGGCATCCGTCTACTTTTGAAATACGTTAAAGCAGAGTACAAAGATCCAGTTATTTACATCACCGAGAATGGAAGCTCTAAAAGATGGAACAAGGATCAAATACCACCACGCCCATCTTGCATCTCTTCTTCAAGCTATCAAAATCTTGAGTGTAAAAGTGTTCATGAATCGAGTCGAACTAGTTCAAACGGATCGACCCGGTACAATAAGTCAAGATCTACACAAGATCTGACGACGAGTAGCAGCATGGCGGCGGCTAGTAGTGCTCCGGTGCTTCTGATAATGCTCATCGCCGCCGCAACTGCCGGCAGCGGTTTGGCTGATGGTGTGGAGCCGAGTCATAGTTCGGTTCCGTTCAACCGGAGCAGTTTCCCGCCGGGTTTTGTGTTCGGAGCTGGCTCTGCCGCTTACCAGTTGGAAGGAGCAGCAAGTATAGATGGAAGAGGTCCAAGTATTTGGGATACTTTCATTAAGAACCACCCAGAAAAAATATGGGACCATAAAAATGGAGAAGTGGCTACCGATTTTTACCATCGTTATAAGGAGGATATACAATTGATGAAAAAGATTGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATCCTGCCCAAGGGAAACCTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTCTACAATAATGTCATCAATGAGCTCCTCGCCAATGGAATAATACCATACGTCACTCTCTTTCATTGGGATCTTCCTCAAGCACTCGAAGATGAGTATAATGGATTTCGAAGCGCTAAAGTTGTGAATGATTTTCGGGAATACGCGGACTTATGCTTCAAATTATTTGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCGTTTACCGTTTTTGGGTACAACGGCGGCACTTTCGCTCCTGGAAGATGCTCCAACTACGTCGGAAATTGCACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTCGCCCACAATCTCCTCCTCTCCCACGCCGCTGCCGTCAAAGTCTACAGGACAAAGTACCAGGCAAAGCAGAAGGGGCAGATTGGAATTACATTGGTGACTCACTGGTTTAGGGCCAAACGCAACACGGCAGCTTCCCAAGCGGCGACTAGTTTGAACTGGCTTTACATCTACCCAGAAGGCATTCATCTACTTTTGAAATACATTAAAGAAGAATACAAAGATCCAGTTATTTACATCACCGAGAATGGTATGGCTTATTCAGACAATACGACACTGCCAATTAAGGAAGCTCTGAAAGATGGAACAAGGATCAGATACCACTACGCCCATCTTGAAGCTATTCTCCAAGCTATTAAGGAAGGAGTGAATGTGAAGGGATACTACGCCTGGACTCTTATGGATGACTTTGAATGGGACGCAGGCTACACGGTGCGATTCGGCCTCATCTACGTCGATTTCAGGCACAAATTGGGAAGGTACCTCAAGTATTCTGCTTACTGGTTGAAGAGGTTCCTTCTTCATTGAGCTTTGCTGTAGAATCTCTCCCACTTGCTGTAGCTTCTTCCCCAATGGAACCACTAATGAATAAGGAGGGGAGGGGAGTGGAGTGGAGTGGGTGTCATGAGAAACGTTTGTGTTTCATTTGTGTCGGGAATCTCTTTCCTTTTGCAGTTTGTGTGATATTTCTCAATAAAAGGTTTAGAATGTTGAATGTAGAAGTGTTTATGAATCGAGTTGAAGTAGTTCAAACCATATTCATGGGCTCTTTAGTTCTTTTAAGCCAGCTTTGGAAGAAAGAGACCAATGTGGTGAGAGAGATACCACTTTTCCAATTATGATTAAAGTGGGTGAACCAATTCTGTTGCTTTGATGATTTCATCTGAAAGGTCCTTCAGTTCTGCAAAAACCTGCATAATTGTAACACATCCAAGCAAGATTTGACATGACCCCTCGATA

Coding sequence (CDS)

ATGGAAAAAATATGGGACCATAGCAGTGGAGAAGTGGCTACCGATTTCTACCATCGTTATAAGGAGGACGTGCAAATAATGAAGAAGATGGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATTCTCCCCAAGGGAACCGTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTTTACAACAATCTCATCAATGAGCTCCTAGCCAATGGAATAATACCTTACGTCACTCTTTTTCACTGGGACCTTCCTCAAGCACTTGAAGATAAGTACGATGGATTTCGAAACGTCAAAATTGTGAATGATTTTCGGAATTACGCGGACTTATGCTTCAAATTATTCGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCATTTAGCGCTTATGGGTACAACAGCGGCACTTTCGCTCCTGGAAGATGTTCCAACTACGTCGGGAATTGTACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTTGCTCACAATCTCCTCCTCTCCCATGCCGCTGCCGTCAAAGTCTACAGGACAAGGTACCAGGTCAGTATTCAAGAACAAATCCCTAAATCAATCTCTTCATTAGCTACAGGCAAAGCAGAAGGGGAAGATTGGAATCACATTGCGGAGAAAGATGGTGTTCTAATTGGACCAGCGACTGGTTTGAACTGGCTTTACATCTACCCAGAGGGCATCCGTCTACTTTTGAAATACGTTAAAGCAGAGTACAAAGATCCAGTTATTTACATCACCGAGAATGGAAGCTCTAAAAGATGGAACAAGGATCAAATACCACCACGCCCATCTTGCATCTCTTCTTCAAGCTATCAAAATCTTGAGTGTAAAAGTGTTCATGAATCGAGTCGAACTAGTTCAAACGGATCGACCCGGTACAATAAGTCAAGATCTACACAAGATCTGACGACGAGTAGCAGCATGGCGGCGGCTAGTAGTGCTCCGGTGCTTCTGATAATGCTCATCGCCGCCGCAACTGCCGGCAGCGGTTTGGCTGATGGTGTGGAGCCGAGTCATAGTTCGGTTCCGTTCAACCGGAGCAGTTTCCCGCCGGGTTTTGTGTTCGGAGCTGGCTCTGCCGCTTACCAGTTGGAAGGAGCAGCAAGTATAGATGGAAGAGGTCCAAGTATTTGGGATACTTTCATTAAGAACCACCCAGAAAAAATATGGGACCATAAAAATGGAGAAGTGGCTACCGATTTTTACCATCGTTATAAGGAGGATATACAATTGATGAAAAAGATTGGGTTGGACTCTTTCAGATTCTCCATCTCTTGGTCAAGGATCCTGCCCAAGGGAAACCTTCGTGGAGGAGTGAATCCACTTGGCGTCAAATTCTACAATAATGTCATCAATGAGCTCCTCGCCAATGGAATAATACCATACGTCACTCTCTTTCATTGGGATCTTCCTCAAGCACTCGAAGATGAGTATAATGGATTTCGAAGCGCTAAAGTTGTGAATGATTTTCGGGAATACGCGGACTTATGCTTCAAATTATTTGGCGATCGGGTCAAGTATTGGACAACCCTTAACGAACCATATTCGTTTACCGTTTTTGGGTACAACGGCGGCACTTTCGCTCCTGGAAGATGCTCCAACTACGTCGGAAATTGCACCGCCGGCAACTCCGGCACCGAGCCCTACATCGTCGCCCACAATCTCCTCCTCTCCCACGCCGCTGCCGTCAAAGTCTACAGGACAAAGTACCAGGCAAAGCAGAAGGGGCAGATTGGAATTACATTGGTGACTCACTGGTTTAGGGCCAAACGCAACACGGCAGCTTCCCAAGCGGCGACTAGTTTGAACTGGCTTTACATCTACCCAGAAGGCATTCATCTACTTTTGAAATACATTAAAGAAGAATACAAAGATCCAGTTATTTACATCACCGAGAATGGTATGGCTTATTCAGACAATACGACACTGCCAATTAAGGAAGCTCTGAAAGATGGAACAAGGATCAGATACCACTACGCCCATCTTGAAGCTATTCTCCAAGCTATTAAGGAAGGAGTGAATGTGAAGGGATACTACGCCTGGACTCTTATGGATGACTTTGAATGGGACGCAGGCTACACGGTGCGATTCGGCCTCATCTACGTCGATTTCAGGCACAAATTGGGAAGGTACCTCAAGTATTCTGCTTACTGGTTGAAGAGGTTCCTTCTTCATTGA

Protein sequence

MEKIWDHSSGEVATDFYHRYKEDVQIMKKMGLDSFRFSISWSRILPKGTVRGGVNPLGVKFYNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDFRNYADLCFKLFGDRVKYWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVYRTRYQVSIQEQIPKSISSLATGKAEGEDWNHIAEKDGVLIGPATGLNWLYIYPEGIRLLLKYVKAEYKDPVIYITENGSSKRWNKDQIPPRPSCISSSSYQNLECKSVHESSRTSSNGSTRYNKSRSTQDLTTSSSMAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAATSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSDNTTLPIKEALKDGTRIRYHYAHLEAILQAIKEGVNVKGYYAWTLMDDFEWDAGYTVRFGLIYVDFRHKLGRYLKYSAYWLKRFLLH
Homology
BLAST of Cp4.1LG17g08300 vs. ExPASy Swiss-Prot
Match: A2SY66 (Vicianin hydrolase (Fragment) OS=Vicia sativa subsp. nigra OX=3909 PE=1 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 2.1e-156
Identity = 271/500 (54.20%), Postives = 328/500 (65.60%), Query Frame = 0

Query: 328 LIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASIDGRGPSI 387
           L  L+A  T     +  V PSH +  FN+S FP  F+FG GS+AYQ+EGA++IDGRGPSI
Sbjct: 11  LATLLAVVTGTGTPSQEVHPSHYATTFNKSLFPKDFLFGIGSSAYQVEGASNIDGRGPSI 70

Query: 388 WDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNLRG 447
           WDTF K HPEKIWDH +G +  DFYHRYK DI+++K+IGLDS+RFSISWSRI PKG  +G
Sbjct: 71  WDTFTKQHPEKIWDHSSGNIGADFYHRYKSDIKIVKEIGLDSYRFSISWSRIFPKG--KG 130

Query: 448 GVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYADLC 507
            VNPLGVKFYNNVINE+LANG+IP+VTLFHWDLPQ+LEDEY GF S+KVV DF  YAD  
Sbjct: 131 EVNPLGVKFYNNVINEILANGLIPFVTLFHWDLPQSLEDEYKGFLSSKVVKDFENYADFV 190

Query: 508 FKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLL 567
           FK +GDRVK+W TLNEP+S+ ++GYNGGTFAPGRCS Y GNC  G+S TEPYIVAHNL+L
Sbjct: 191 FKTYGDRVKHWVTLNEPFSYALYGYNGGTFAPGRCSKYAGNCEYGDSSTEPYIVAHNLIL 250

Query: 568 SHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQA------------------- 627
           SHAAA K+Y+TKYQA QKG IG TLVTH+F    N+AA +                    
Sbjct: 251 SHAAAAKLYKTKYQAHQKGNIGATLVTHYFEPHSNSAADRVAASRALDFFFGWFAHPLTY 310

Query: 628 ------------------------------------------------------------ 687
                                                                       
Sbjct: 311 GHYPQSMISSLGNRLPKFSKEEVELTKGSYDFLGVNYYSTYYAQSAPLTTVNRTFYTDIQ 370

Query: 688 --------------ATSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSDNTTLP 735
                         AT LNWLY+YP+GIH L+ ++K+ YK+P++YITENG+A S N ++P
Sbjct: 371 ANVSPLKNGAPIGPATDLNWLYVYPKGIHSLVTHMKDVYKNPIVYITENGVAQSRNDSIP 430

BLAST of Cp4.1LG17g08300 vs. ExPASy Swiss-Prot
Match: B8AVF0 (Beta-glucosidase 12 OS=Oryza sativa subsp. indica OX=39946 GN=BGLU12 PE=3 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 8.8e-131
Identity = 242/514 (47.08%), Postives = 313/514 (60.89%), Query Frame = 0

Query: 318 MAAASSAP--VLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLE 377
           MAAA + P  +LL  L+ A  A        EP     P +R SFP GF+FG  S++YQ E
Sbjct: 1   MAAAGAMPGGLLLTFLLLAVVASGAYNSAGEP-----PVSRRSFPKGFIFGTASSSYQYE 60

Query: 378 GAASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSIS 437
           G A+  GRGPSIWDTF   HPEKI D  NG+VA+D YH YKED++LMK +G+D++RFSIS
Sbjct: 61  GGAAEGGRGPSIWDTFTHQHPEKIADRSNGDVASDSYHLYKEDVRLMKDMGMDAYRFSIS 120

Query: 438 WSRILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAK 497
           W+RILP G+LRGGVN  G+K+YNN+INELL+ G+ P++TLFHWD PQALED+YNGF S  
Sbjct: 121 WTRILPNGSLRGGVNKEGIKYYNNLINELLSKGVQPFITLFHWDSPQALEDKYNGFLSPN 180

Query: 498 VVNDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNY-VGNCTAGNS 557
           ++NDF++YA++CFK FGDRVK W T NEP++F   GY  G FAPGRCS +  GNC+ G+S
Sbjct: 181 IINDFKDYAEICFKEFGDRVKNWITFNEPWTFCSNGYATGLFAPGRCSPWEKGNCSVGDS 240

Query: 558 GTEPYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWF----RAKRNTAASQAA- 617
           G EPY   H+ LL+HA  V++Y+ KYQA QKG+IGITLV+HWF    R+K N  A++ A 
Sbjct: 241 GREPYTACHHQLLAHAETVRLYKAKYQALQKGKIGITLVSHWFVPFSRSKSNNDAAKRAI 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 DFMFGWFMDPLIRGDYPLSMRGLVGNRLPQFTKEQSKLVKGAFDFIGLNYYTANYADNLP 360

Query: 678 --TSLN---------------------------WLYIYPEGIHLLLKYIKEEYKDPVIYI 735
               LN                           WLY+YP+G   LL Y+KE Y +P +YI
Sbjct: 361 PSNGLNNSYTTDSRANLTGVRNGIPIGPQAASPWLYVYPQGFRDLLLYVKENYGNPTVYI 420

BLAST of Cp4.1LG17g08300 vs. ExPASy Swiss-Prot
Match: Q7XKV4 (Beta-glucosidase 12 OS=Oryza sativa subsp. japonica OX=39947 GN=BGLU12 PE=1 SV=2)

HSP 1 Score: 468.4 bits (1204), Expect = 1.5e-130
Identity = 242/514 (47.08%), Postives = 313/514 (60.89%), Query Frame = 0

Query: 318 MAAASSAP--VLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLE 377
           MAAA + P  +LL  L+ A  A        EP     P +R SFP GF+FG  S++YQ E
Sbjct: 1   MAAAGAMPGGLLLTFLLLAVVASGAYNGAGEP-----PVSRRSFPKGFIFGTASSSYQYE 60

Query: 378 GAASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSIS 437
           G A+  GRGPSIWDTF   HPEKI D  NG+VA+D YH YKED++LMK +G+D++RFSIS
Sbjct: 61  GGAAEGGRGPSIWDTFTHQHPEKIADRSNGDVASDSYHLYKEDVRLMKDMGMDAYRFSIS 120

Query: 438 WSRILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAK 497
           W+RILP G+LRGGVN  G+K+YNN+INELL+ G+ P++TLFHWD PQALED+YNGF S  
Sbjct: 121 WTRILPNGSLRGGVNKEGIKYYNNLINELLSKGVQPFITLFHWDSPQALEDKYNGFLSPN 180

Query: 498 VVNDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNY-VGNCTAGNS 557
           ++NDF++YA++CFK FGDRVK W T NEP++F   GY  G FAPGRCS +  GNC+ G+S
Sbjct: 181 IINDFKDYAEICFKEFGDRVKNWITFNEPWTFCSNGYATGLFAPGRCSPWEKGNCSVGDS 240

Query: 558 GTEPYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWF----RAKRNTAASQAA- 617
           G EPY   H+ LL+HA  V++Y+ KYQA QKG+IGITLV+HWF    R+K N  A++ A 
Sbjct: 241 GREPYTACHHQLLAHAETVRLYKAKYQALQKGKIGITLVSHWFVPFSRSKSNDDAAKRAI 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 DFMFGWFMDPLIRGDYPLSMRGLVGNRLPQFTKEQSKLVKGAFDFIGLNYYTANYADNLP 360

Query: 678 --TSLN---------------------------WLYIYPEGIHLLLKYIKEEYKDPVIYI 735
               LN                           WLY+YP+G   LL Y+KE Y +P +YI
Sbjct: 361 PSNGLNNSYTTDSRANLTGVRNGIPIGPQAASPWLYVYPQGFRDLLLYVKENYGNPTVYI 420

BLAST of Cp4.1LG17g08300 vs. ExPASy Swiss-Prot
Match: Q7XKV2 (Beta-glucosidase 13 OS=Oryza sativa subsp. japonica OX=39947 GN=BGLU13 PE=2 SV=2)

HSP 1 Score: 459.5 bits (1181), Expect = 7.0e-128
Identity = 236/503 (46.92%), Postives = 305/503 (60.64%), Query Frame = 0

Query: 326 VLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASIDGRGP 385
           +LL +L+  A +G       EP     P +R SFP GF+FG  S++YQ EG A   GRGP
Sbjct: 13  ILLPLLLVVAVSG-------EPP----PISRRSFPEGFIFGTASSSYQYEGGAREGGRGP 72

Query: 386 SIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNL 445
           SIWDTF   HP+KI D  NG+VA D YH YKED+++MK +G+D++RFSISW+RILP G+L
Sbjct: 73  SIWDTFTHQHPDKIADKSNGDVAADSYHLYKEDVRIMKDMGVDAYRFSISWTRILPNGSL 132

Query: 446 RGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYAD 505
            GG+N  G+ +YNN+INELL  G+ P+VTLFHWD PQALED+YNGF S  ++ND++EYA+
Sbjct: 133 SGGINREGISYYNNLINELLLKGVQPFVTLFHWDSPQALEDKYNGFLSPNIINDYKEYAE 192

Query: 506 LCFKLFGDRVKYWTTLNEPYSFTVFGY-NGGTFAPGRCSNYVGNCTAGNSGTEPYIVAHN 565
            CFK FGDRVK+W T NEP SF V GY +GG FAPGRCS + GNC+AG+SG EPY   H+
Sbjct: 193 TCFKEFGDRVKHWITFNEPLSFCVAGYASGGMFAPGRCSPWEGNCSAGDSGREPYTACHH 252

Query: 566 LLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWF----RAKRNTAASQAA----------- 625
            LL+HA  V++Y+ KYQ  QKG+IGITLV++WF    R+K N  A++ A           
Sbjct: 253 QLLAHAETVRLYKEKYQVLQKGKIGITLVSNWFVPFSRSKSNIDAARRALDFMLGWFMDP 312

Query: 626 ----------------------------------------------------TSLN---- 685
                                                                 LN    
Sbjct: 313 LIRGEYPLSMRELVRNRLPQFTKEQSELIKGSFDFIGLNYYTSNYAGSLPPSNGLNNSYS 372

Query: 686 -----------------------WLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSDN 734
                                  WLYIYP+G   L+ Y+KE Y +P IYITENG+   +N
Sbjct: 373 TDARANLTAVRNGIPIGPQAASPWLYIYPQGFRELVLYVKENYGNPTIYITENGVDEFNN 432

BLAST of Cp4.1LG17g08300 vs. ExPASy Swiss-Prot
Match: O64882 (Beta-glucosidase 17 OS=Arabidopsis thaliana OX=3702 GN=BGLU17 PE=2 SV=1)

HSP 1 Score: 456.1 bits (1172), Expect = 7.7e-127
Identity = 237/504 (47.02%), Postives = 303/504 (60.12%), Query Frame = 0

Query: 326 VLLIMLIAAATAGSGLADGVEPS--HSSVPFNRSSFPPGFVFGAGSAAYQLEGAASIDGR 385
           + +I++I+  T+ S L   ++PS    S    RSSFP  F FGA S+AYQ EGAA++DGR
Sbjct: 6   IFIIIIISIITSISELY-ALDPSFLRLSTSLQRSSFPQDFRFGAASSAYQSEGAANVDGR 65

Query: 386 GPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKG 445
            PSIWDTF K +PEKI D  NG+VA +FY+R+KED+  MK+IGLDSFRFSISWSRILP+G
Sbjct: 66  EPSIWDTFTKQYPEKISDGSNGDVADEFYYRFKEDVAHMKEIGLDSFRFSISWSRILPRG 125

Query: 446 NLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREY 505
            + GGVN  G+ FYN++INEL++NGI P VTLFHWD PQALEDEY GF + ++V DF EY
Sbjct: 126 TVAGGVNQAGINFYNHLINELISNGIRPLVTLFHWDTPQALEDEYGGFLNPQIVKDFVEY 185

Query: 506 ADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYIVAH 565
            D+CFK FGDRVK W T+NEP  F V GYN G  APGRCS+YV NCT GNS TEPY+VAH
Sbjct: 186 VDICFKEFGDRVKEWITINEPNMFAVLGYNVGNIAPGRCSSYVQNCTVGNSATEPYLVAH 245

Query: 566 NLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------------- 625
            L+LSHAA V++YR KYQ+   G IG+T+ T+W   K NT A + A              
Sbjct: 246 YLILSHAATVQLYREKYQSFHGGTIGMTIQTYWMIPKYNTPACREAAKRALDFFFGWFAD 305

Query: 626 ------------------------------------------------------------ 685
                                                                       
Sbjct: 306 PITYGDYPKTMRELVGNRLPKFTKKQSKMVRGSFDFFGLNYYTSRYVEDVMFYANTNLSY 365

Query: 686 --------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSD 734
                               TS +WL+I PEG   +L YIK ++++PVI +TENGM   +
Sbjct: 366 TTDSRVNQTTEKNGVPVGEPTSADWLFICPEGFQDVLLYIKSKFQNPVILVTENGMPSEN 425

BLAST of Cp4.1LG17g08300 vs. NCBI nr
Match: TXG55760.1 (hypothetical protein EZV62_017073 [Acer yangbiense])

HSP 1 Score: 846 bits (2185), Expect = 1.79e-294
Identity = 459/923 (49.73%), Postives = 563/923 (61.00%), Query Frame = 0

Query: 2   EKIWDHSSGEVATDFYHRYKEDVQIMKKMGLDSFRFSISWSRILPKGTVRGGVNPLGVKF 61
           +KI + SSG VA D YHRYKEDV +MK +G D++RFSISWSRILP G + GGVNP G+ +
Sbjct: 80  DKIKNGSSGVVAVDSYHRYKEDVALMKDIGFDAYRFSISWSRILPYGNLSGGVNPQGITY 139

Query: 62  YNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDFRNYADLCFKLFGDRVK 121
           YNNLIN+LL+NG+ P+VTLFHWDLPQALED+Y GF + +IVNDF++Y +LC++ FGDRVK
Sbjct: 140 YNNLINQLLSNGLQPFVTLFHWDLPQALEDEYGGFLHPQIVNDFQDYVELCYRNFGDRVK 199

Query: 122 YWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVY 181
            W T+NEP +F++ GY +G FAPGRCSN    C+ GNSGTEPYIV+H+LLL+HAAAVK+Y
Sbjct: 200 QWITVNEPLTFASLGYANGAFAPGRCSN----CSGGNSGTEPYIVSHHLLLAHAAAVKLY 259

Query: 182 RTRYQVSIQEQI----------PKSISSLATGKAEGE------DW--------------- 241
           R +YQ+S + QI          P +I S    +A         DW               
Sbjct: 260 RDKYQMSQKGQIGIALNCAWIVPLNIESETDHQAASRSLSFSYDWFLEPLKSGSYPADMV 319

Query: 242 NHIAEK------------------------------------------------------ 301
           N++ E+                                                      
Sbjct: 320 NNVGERLPRFNKEQSSMVQGSFDFLGLNYYTSNYATNINTPCKNNQNPSYLTDSCVNLTT 379

Query: 302 --DGVLIGPATGLNWLYIYPEGIRLLLKYVKAEYKDPVIYITENGSSK----------RW 361
             +GV IGP    +WLY+YPEGIR LL Y K +  +PVIYITENG  +            
Sbjct: 380 KRNGVAIGPKAASDWLYVYPEGIRHLLLYTKNKLNNPVIYITENGVDEINDGKLSLEDNM 439

Query: 362 NKDQIPPRPSCISSSSYQNLECKSVHESSRTS----SNGSTRYNKSRSTQDLTTSSSMAA 421
             +      S + S   + +  +     S       SNG   Y     T  +        
Sbjct: 440 RIEYYKSHLSYVQSGIMEGVNVRGYFAWSFLDNFEWSNG---YTVRFGTVYVDYEDGF-- 499

Query: 422 ASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASI 481
             SA  L+   I+   A    +D V+P H S+PFNRS FP GF FGAGSAAYQ EGAASI
Sbjct: 500 -KSALYLIQKFISWLKAILTRSDAVKPMHYSMPFNRSLFPAGFTFGAGSAAYQSEGAASI 559

Query: 482 DGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRIL 541
           DGRGPSIWDTF KNHPEKIWD K+GEVA DFYHRYK+DIQ+MK++GL+SFRFS+SWSR+L
Sbjct: 560 DGRGPSIWDTFTKNHPEKIWDRKSGEVADDFYHRYKDDIQIMKRLGLNSFRFSMSWSRLL 619

Query: 542 PKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDF 601
           PKG L GGVNPLGVKFYNNVINELLANG+ P+VTLFHWDLPQALEDEY GF S+K+V DF
Sbjct: 620 PKGKLSGGVNPLGVKFYNNVINELLANGMTPFVTLFHWDLPQALEDEYGGFLSSKIVKDF 679

Query: 602 REYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYI 661
           ++Y+D CFK FGDRVK+W T+NEPYS++  GYNGGTFAPGRCS+Y+GNCTAG+S TEPYI
Sbjct: 680 QDYSDFCFKTFGDRVKHWATMNEPYSYSNNGYNGGTFAPGRCSSYMGNCTAGDSSTEPYI 739

Query: 662 VAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKR-NTAASQAA---------- 721
           VAHNLLLSHA AVK+Y+TKYQA QKGQIGIT+VT+WF  K  N+ A Q A          
Sbjct: 740 VAHNLLLSHAVAVKIYKTKYQAYQKGQIGITIVTNWFIPKSANSVADQKAVYRMLDFLFG 799

Query: 722 ------------------------------------------------------------ 728
                                                                       
Sbjct: 800 WFADPIIHGDYPKTMRTLVGNRLPKFTAAQSNMLKGSIDFLGVNYYTTNYAAHTLHAVKV 859

BLAST of Cp4.1LG17g08300 vs. NCBI nr
Match: XP_023513422.1 (vicianin hydrolase-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 821 bits (2120), Expect = 6.94e-293
Identity = 418/512 (81.64%), Postives = 418/512 (81.64%), Query Frame = 0

Query: 318 MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 377
           MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA
Sbjct: 1   MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 60

Query: 378 ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 437
           ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS
Sbjct: 61  ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 120

Query: 438 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 497
           RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV
Sbjct: 121 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 180

Query: 498 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 557
           NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE
Sbjct: 181 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 240

Query: 558 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------- 617
           PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA        
Sbjct: 241 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAAVYRALDFF 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 LGWFLHPITYGDYPKSMHQYVGNRLPKFSVAESQSIKGSMDFLGMNYYTGNFVDDIPFSN 360

Query: 678 --------------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 735
                                     TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN
Sbjct: 361 SPNISYSSDMHISLSTDNDGVLIGPATSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 420

BLAST of Cp4.1LG17g08300 vs. NCBI nr
Match: XP_023004523.1 (vicianin hydrolase-like [Cucurbita maxima])

HSP 1 Score: 808 bits (2087), Expect = 7.16e-288
Identity = 409/512 (79.88%), Postives = 414/512 (80.86%), Query Frame = 0

Query: 318 MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 377
           MAAASSAPVLLIMLIAAA AGSGL DGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA
Sbjct: 1   MAAASSAPVLLIMLIAAAIAGSGLTDGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 60

Query: 378 ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 437
           ASIDGRGPSIWDTFIKNHPEKIWDHKNGE+ATDFYHRYKEDIQLMKKIGLDSFRFSISWS
Sbjct: 61  ASIDGRGPSIWDTFIKNHPEKIWDHKNGEMATDFYHRYKEDIQLMKKIGLDSFRFSISWS 120

Query: 438 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 497
           RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRS KVV
Sbjct: 121 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSVKVV 180

Query: 498 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 557
           NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE
Sbjct: 181 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 240

Query: 558 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------- 617
           PYIVAHNLLLSHAAAV+VYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA        
Sbjct: 241 PYIVAHNLLLSHAAAVEVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAAVYRALDFF 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 LGWFLHPLTYGDYPKSMHQYVGNRLPKFSVAESQSIKGSMDFLGMNYYTGNFVDDIPFSN 360

Query: 678 --------------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 735
                                     TSLNWLYIYPEGIHLLLKYIK EYKDPVIYITEN
Sbjct: 361 SPNISYSSDMHISLSTDNDGVLIGPATSLNWLYIYPEGIHLLLKYIKAEYKDPVIYITEN 420

BLAST of Cp4.1LG17g08300 vs. NCBI nr
Match: XP_022960188.1 (vicianin hydrolase-like [Cucurbita moschata])

HSP 1 Score: 803 bits (2075), Expect = 4.76e-286
Identity = 407/512 (79.49%), Postives = 414/512 (80.86%), Query Frame = 0

Query: 318 MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 377
           MAAASSAPVLLIMLIAAA AGSG ADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA
Sbjct: 1   MAAASSAPVLLIMLIAAAIAGSGSADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 60

Query: 378 ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 437
           ASIDGRGPSIWDTFIKNHPEKIWDHKNGE+ATDFYHRYKEDIQLMKKIGLDSFRFSISWS
Sbjct: 61  ASIDGRGPSIWDTFIKNHPEKIWDHKNGEMATDFYHRYKEDIQLMKKIGLDSFRFSISWS 120

Query: 438 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 497
           RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEY+GFRSAKVV
Sbjct: 121 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYDGFRSAKVV 180

Query: 498 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 557
           NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE
Sbjct: 181 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 240

Query: 558 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------- 617
           PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA        
Sbjct: 241 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAAVYRALDFF 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 LGWFLHPITYGDYPKSMHQYVGNRLPKFSVAESQSIKGSMDFLGMNYYTGNFVDDIPFSN 360

Query: 678 --------------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 735
                                     TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN
Sbjct: 361 SPNISYSSDMHISLSTDNDGVLIGPATSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 420

BLAST of Cp4.1LG17g08300 vs. NCBI nr
Match: KAB5520230.1 (hypothetical protein DKX38_024549 [Salix brachista])

HSP 1 Score: 781 bits (2016), Expect = 2.48e-269
Identity = 438/956 (45.82%), Postives = 531/956 (55.54%), Query Frame = 0

Query: 46   PKGTVRGGVNPLGVKFYNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDF 105
            P+G +RGGVNPLGV+FYNNLINELLANGI P+VTLFHWDLPQALED+Y GF + K V+D+
Sbjct: 76   PEGKIRGGVNPLGVRFYNNLINELLANGITPFVTLFHWDLPQALEDEYSGFLSSKAVDDY 135

Query: 106  RNYADLCFKLFGDRVKYWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYI 165
             +Y D CFK FGDRVK+W T NEPYSFS  GYN+GTFAPGRCSNYVGNCT GNSGTEPYI
Sbjct: 136  VDYVDFCFKTFGDRVKHWCTFNEPYSFSNNGYNTGTFAPGRCSNYVGNCTHGNSGTEPYI 195

Query: 166  VAHNLLLSHAAAVKVYRTRYQVS------------------------------------- 225
            VAHNL+L HAAAVK+YR +YQ S                                     
Sbjct: 196  VAHNLILGHAAAVKLYREKYQASQKGIIGITIVTHWFIPKSPKSEEDIKAAYRILDFLFS 255

Query: 226  --------------------------------------------IQEQIPKSI---SSLA 285
                                                        +  ++PK     S+L 
Sbjct: 256  LENPRRPSRFGYEDSNSYTLIVTKLCRFANPLTYGDYPEIMKAIVGHRLPKFTKEQSALV 315

Query: 286  TGKAEGEDWNHIA---------------------------EKDGVLIGPATGLNWLYIYP 345
             G  +    N+                              K G  IG  T LNWL+IYP
Sbjct: 316  KGSIDFLGVNYYTTNYAANNPAPNKVNFSYSGDSQTILSTSKGGHPIGTPTALNWLFIYP 375

Query: 346  EGIRLLLKYVKAEYKDPVIYITENGSSKRWNKDQIPPRPSC---------------ISSS 405
            +G+  L+ Y+K +YK+P IYITENG +   N   +P + +                +S +
Sbjct: 376  KGMYDLMLYIKDKYKNPPIYITENGLADA-NNASLPVKEALKDGLRIRYLANHLQYLSKA 435

Query: 406  SYQNLECKSVHESS-----RTSSNGSTRYNKSRSTQDLTTSSS--------------MAA 465
              + +  K  ++ +        +  + R+ K      +    +              MA 
Sbjct: 436  IQEGVNVKGYYQWAFWDDFEWDAGYTVRFGKPTCHAIVLAPKNLRYIIVDGIRFYQVMAT 495

Query: 466  ASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASI 525
              +A  L  +++A+  A +    G +PS  S+PFNR+SFP  F FGAG+AAYQ EGAA I
Sbjct: 496  VQAASFLHFVIVASLLAST---HGAKPSRYSMPFNRTSFPKDFTFGAGTAAYQSEGAAYI 555

Query: 526  DGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRIL 585
            DG+GPSIWDTF K HPEKIWD   G  A DFYHRYKED+QLMKKIGLDSFRFSISWSR+L
Sbjct: 556  DGKGPSIWDTFTKQHPEKIWDQSTGNEAIDFYHRYKEDVQLMKKIGLDSFRFSISWSRVL 615

Query: 586  PKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDF 645
            PKG + GGVNPLGV+FYNN+INELLANGI P+VTLF WDLPQALEDEY GF S+K V+D+
Sbjct: 616  PKGKISGGVNPLGVRFYNNLINELLANGITPFVTLFQWDLPQALEDEYYGFLSSKAVDDY 675

Query: 646  REYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYI 705
             +Y D CFK FGDRVK+W T NEPYSF+  GYN GTFAPGRCSNYVGNCT GNSGTEPY 
Sbjct: 676  VDYVDFCFKTFGDRVKHWCTFNEPYSFSSNGYNSGTFAPGRCSNYVGNCTHGNSGTEPYT 735

Query: 706  VAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKR------------------- 735
            VAHNL+L HAAAVK+YR KYQA Q G IGIT+VTHWF  K                    
Sbjct: 736  VAHNLILGHAAAVKLYRQKYQASQNGIIGITIVTHWFIPKSPKSEEDIKAAYRILDFLFA 795

BLAST of Cp4.1LG17g08300 vs. ExPASy TrEMBL
Match: A0A5C7HFE8 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_017073 PE=3 SV=1)

HSP 1 Score: 846 bits (2185), Expect = 8.67e-295
Identity = 459/923 (49.73%), Postives = 563/923 (61.00%), Query Frame = 0

Query: 2   EKIWDHSSGEVATDFYHRYKEDVQIMKKMGLDSFRFSISWSRILPKGTVRGGVNPLGVKF 61
           +KI + SSG VA D YHRYKEDV +MK +G D++RFSISWSRILP G + GGVNP G+ +
Sbjct: 80  DKIKNGSSGVVAVDSYHRYKEDVALMKDIGFDAYRFSISWSRILPYGNLSGGVNPQGITY 139

Query: 62  YNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDFRNYADLCFKLFGDRVK 121
           YNNLIN+LL+NG+ P+VTLFHWDLPQALED+Y GF + +IVNDF++Y +LC++ FGDRVK
Sbjct: 140 YNNLINQLLSNGLQPFVTLFHWDLPQALEDEYGGFLHPQIVNDFQDYVELCYRNFGDRVK 199

Query: 122 YWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVY 181
            W T+NEP +F++ GY +G FAPGRCSN    C+ GNSGTEPYIV+H+LLL+HAAAVK+Y
Sbjct: 200 QWITVNEPLTFASLGYANGAFAPGRCSN----CSGGNSGTEPYIVSHHLLLAHAAAVKLY 259

Query: 182 RTRYQVSIQEQI----------PKSISSLATGKAEGE------DW--------------- 241
           R +YQ+S + QI          P +I S    +A         DW               
Sbjct: 260 RDKYQMSQKGQIGIALNCAWIVPLNIESETDHQAASRSLSFSYDWFLEPLKSGSYPADMV 319

Query: 242 NHIAEK------------------------------------------------------ 301
           N++ E+                                                      
Sbjct: 320 NNVGERLPRFNKEQSSMVQGSFDFLGLNYYTSNYATNINTPCKNNQNPSYLTDSCVNLTT 379

Query: 302 --DGVLIGPATGLNWLYIYPEGIRLLLKYVKAEYKDPVIYITENGSSK----------RW 361
             +GV IGP    +WLY+YPEGIR LL Y K +  +PVIYITENG  +            
Sbjct: 380 KRNGVAIGPKAASDWLYVYPEGIRHLLLYTKNKLNNPVIYITENGVDEINDGKLSLEDNM 439

Query: 362 NKDQIPPRPSCISSSSYQNLECKSVHESSRTS----SNGSTRYNKSRSTQDLTTSSSMAA 421
             +      S + S   + +  +     S       SNG   Y     T  +        
Sbjct: 440 RIEYYKSHLSYVQSGIMEGVNVRGYFAWSFLDNFEWSNG---YTVRFGTVYVDYEDGF-- 499

Query: 422 ASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASI 481
             SA  L+   I+   A    +D V+P H S+PFNRS FP GF FGAGSAAYQ EGAASI
Sbjct: 500 -KSALYLIQKFISWLKAILTRSDAVKPMHYSMPFNRSLFPAGFTFGAGSAAYQSEGAASI 559

Query: 482 DGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRIL 541
           DGRGPSIWDTF KNHPEKIWD K+GEVA DFYHRYK+DIQ+MK++GL+SFRFS+SWSR+L
Sbjct: 560 DGRGPSIWDTFTKNHPEKIWDRKSGEVADDFYHRYKDDIQIMKRLGLNSFRFSMSWSRLL 619

Query: 542 PKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDF 601
           PKG L GGVNPLGVKFYNNVINELLANG+ P+VTLFHWDLPQALEDEY GF S+K+V DF
Sbjct: 620 PKGKLSGGVNPLGVKFYNNVINELLANGMTPFVTLFHWDLPQALEDEYGGFLSSKIVKDF 679

Query: 602 REYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYI 661
           ++Y+D CFK FGDRVK+W T+NEPYS++  GYNGGTFAPGRCS+Y+GNCTAG+S TEPYI
Sbjct: 680 QDYSDFCFKTFGDRVKHWATMNEPYSYSNNGYNGGTFAPGRCSSYMGNCTAGDSSTEPYI 739

Query: 662 VAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKR-NTAASQAA---------- 721
           VAHNLLLSHA AVK+Y+TKYQA QKGQIGIT+VT+WF  K  N+ A Q A          
Sbjct: 740 VAHNLLLSHAVAVKIYKTKYQAYQKGQIGITIVTNWFIPKSANSVADQKAVYRMLDFLFG 799

Query: 722 ------------------------------------------------------------ 728
                                                                       
Sbjct: 800 WFADPIIHGDYPKTMRTLVGNRLPKFTAAQSNMLKGSIDFLGVNYYTTNYAAHTLHAVKV 859

BLAST of Cp4.1LG17g08300 vs. ExPASy TrEMBL
Match: A0A6N2KL69 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS96664 PE=3 SV=1)

HSP 1 Score: 825 bits (2130), Expect = 4.60e-291
Identity = 429/825 (52.00%), Postives = 507/825 (61.45%), Query Frame = 0

Query: 2   EKIWDHSSGEVATDFYHRYKEDVQIMKKMGLDSFRFSISWSRILPKGTVRGGVNPLGVKF 61
           EKIWD S+G VA DFYHRYKEDVQ+MKK+GLDSFRFSISWSR+LPKG + GGVNPLGV+F
Sbjct: 39  EKIWDKSTGNVAIDFYHRYKEDVQLMKKIGLDSFRFSISWSRVLPKGKISGGVNPLGVRF 98

Query: 62  YNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDFRNYADLCFKLFGDRVK 121
           YNNLINELLANGI P+VTLFHWDLPQALED+Y GF + K V+D+ +Y D CFK FGDRVK
Sbjct: 99  YNNLINELLANGITPFVTLFHWDLPQALEDEYSGFLSSKAVDDYVDYVDFCFKTFGDRVK 158

Query: 122 YWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVY 181
           +W T NEPYSFS  GYN+GTFAPGRCS+YVGNCT GNSGTEPY VAHN++L HAAAVK+Y
Sbjct: 159 HWCTFNEPYSFSNNGYNTGTFAPGRCSSYVGNCTHGNSGTEPYRVAHNMILGHAAAVKLY 218

Query: 182 RTRYQVSIQEQIPKSISS--LATGKAEGEDWNHIAEKDGVLIGPATGLNWLYIYPEGIRL 241
           R +YQ S + +I  +I +        + E+ N  A ++   +     LNWL+IYP+G+  
Sbjct: 219 REKYQASQKGKIGITIVTNWFIPKSPKSEEDNKAAYRE---LDFLFALNWLFIYPKGMYD 278

Query: 242 LLKYVKAEYKDPVIYITENGSSKRWNKDQIPPRPSCISSSSYQNLECKSVHESSRTSSNG 301
           L+ Y+K +YK+P                                                
Sbjct: 279 LMLYIKDKYKNP------------------------------------------------ 338

Query: 302 STRYNKSRSTQDLTTSSSMAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSF 361
                                                                P  R S 
Sbjct: 339 -----------------------------------------------------PIMRMS- 398

Query: 362 PPGFVFGAGSAAYQLEGAASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDI 421
                          EGAA IDG+GPSIWDTF K HPEKIWD   G V  DFYHRYKED+
Sbjct: 399 ---------------EGAAYIDGKGPSIWDTFTKQHPEKIWDQSTGNVGIDFYHRYKEDV 458

Query: 422 QLMKKIGLDSFRFSISWSRILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWD 481
           QLMKKIGLDSFRFSISWSR+LPKG + GGVNPLGV+FYNN+INELLANGI P+VTLFHWD
Sbjct: 459 QLMKKIGLDSFRFSISWSRVLPKGKISGGVNPLGVRFYNNLINELLANGITPFVTLFHWD 518

Query: 482 LPQALEDEYNGFRSAKVVNDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAP 541
           LPQALEDEY GF S+K V+D+ +Y D CFK FGDRVK+W T NEPYSF+  GYN GTFAP
Sbjct: 519 LPQALEDEYYGFLSSKAVDDYVDYVDFCFKTFGDRVKHWCTFNEPYSFSNNGYNTGTFAP 578

Query: 542 GRCSNYVGNCTAGNSGTEPYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRA 601
           GRCSNYVGNCT GNSGTEPY VAHNL+L HAAA K+YR KYQ+ QKG+IGIT+VT+WF  
Sbjct: 579 GRCSNYVGNCTHGNSGTEPYTVAHNLILGHAAAAKLYREKYQSSQKGKIGITIVTNWFIP 638

Query: 602 KR---------------------------------------------------------- 661
           K                                                           
Sbjct: 639 KSPKSKEDIKAAYRELDFLFGWFVNPLTYGDYPEVMKAIVGHRLPKFTKEQSALILGVNY 698

Query: 662 ---NTAASQAA----------------------------TSLNWLYIYPEGIHLLLKYIK 721
              N AA+  A                            T+LNWL+IYP+G++ L+ YIK
Sbjct: 699 YTTNYAANNPAPNKVNFSYSGDSQTILSTSKGGHPIGTPTALNWLFIYPKGMYDLMLYIK 743

Query: 722 EEYKDPVIYITENGMAYSDNTTLPIKEALKDGTRIRYHYAHLEAILQAIKEGVNVKGYYA 735
           ++YK+P IYITENG+A ++N +LP++EA+KDG RIRY   HL+ + +AIKEGVNVKGYY 
Sbjct: 759 DKYKNPPIYITENGLADANNASLPVEEAVKDGLRIRYLANHLQYLSKAIKEGVNVKGYYQ 743

BLAST of Cp4.1LG17g08300 vs. ExPASy TrEMBL
Match: A0A6J1KZS8 (vicianin hydrolase-like OS=Cucurbita maxima OX=3661 GN=LOC111497800 PE=3 SV=1)

HSP 1 Score: 808 bits (2087), Expect = 3.47e-288
Identity = 409/512 (79.88%), Postives = 414/512 (80.86%), Query Frame = 0

Query: 318 MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 377
           MAAASSAPVLLIMLIAAA AGSGL DGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA
Sbjct: 1   MAAASSAPVLLIMLIAAAIAGSGLTDGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 60

Query: 378 ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 437
           ASIDGRGPSIWDTFIKNHPEKIWDHKNGE+ATDFYHRYKEDIQLMKKIGLDSFRFSISWS
Sbjct: 61  ASIDGRGPSIWDTFIKNHPEKIWDHKNGEMATDFYHRYKEDIQLMKKIGLDSFRFSISWS 120

Query: 438 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 497
           RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRS KVV
Sbjct: 121 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSVKVV 180

Query: 498 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 557
           NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE
Sbjct: 181 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 240

Query: 558 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------- 617
           PYIVAHNLLLSHAAAV+VYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA        
Sbjct: 241 PYIVAHNLLLSHAAAVEVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAAVYRALDFF 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 LGWFLHPLTYGDYPKSMHQYVGNRLPKFSVAESQSIKGSMDFLGMNYYTGNFVDDIPFSN 360

Query: 678 --------------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 735
                                     TSLNWLYIYPEGIHLLLKYIK EYKDPVIYITEN
Sbjct: 361 SPNISYSSDMHISLSTDNDGVLIGPATSLNWLYIYPEGIHLLLKYIKAEYKDPVIYITEN 420

BLAST of Cp4.1LG17g08300 vs. ExPASy TrEMBL
Match: A0A6J1H846 (vicianin hydrolase-like OS=Cucurbita moschata OX=3662 GN=LOC111461000 PE=3 SV=1)

HSP 1 Score: 803 bits (2075), Expect = 2.30e-286
Identity = 407/512 (79.49%), Postives = 414/512 (80.86%), Query Frame = 0

Query: 318 MAAASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 377
           MAAASSAPVLLIMLIAAA AGSG ADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA
Sbjct: 1   MAAASSAPVLLIMLIAAAIAGSGSADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGA 60

Query: 378 ASIDGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWS 437
           ASIDGRGPSIWDTFIKNHPEKIWDHKNGE+ATDFYHRYKEDIQLMKKIGLDSFRFSISWS
Sbjct: 61  ASIDGRGPSIWDTFIKNHPEKIWDHKNGEMATDFYHRYKEDIQLMKKIGLDSFRFSISWS 120

Query: 438 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVV 497
           RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEY+GFRSAKVV
Sbjct: 121 RILPKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYDGFRSAKVV 180

Query: 498 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 557
           NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE
Sbjct: 181 NDFREYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTE 240

Query: 558 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------- 617
           PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA        
Sbjct: 241 PYIVAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAAVYRALDFF 300

Query: 618 ------------------------------------------------------------ 677
                                                                       
Sbjct: 301 LGWFLHPITYGDYPKSMHQYVGNRLPKFSVAESQSIKGSMDFLGMNYYTGNFVDDIPFSN 360

Query: 678 --------------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 735
                                     TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN
Sbjct: 361 SPNISYSSDMHISLSTDNDGVLIGPATSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITEN 420

BLAST of Cp4.1LG17g08300 vs. ExPASy TrEMBL
Match: A0A5N5JS02 (Uncharacterized protein OS=Salix brachista OX=2182728 GN=DKX38_024549 PE=3 SV=1)

HSP 1 Score: 781 bits (2016), Expect = 1.20e-269
Identity = 438/956 (45.82%), Postives = 531/956 (55.54%), Query Frame = 0

Query: 46   PKGTVRGGVNPLGVKFYNNLINELLANGIIPYVTLFHWDLPQALEDKYDGFRNVKIVNDF 105
            P+G +RGGVNPLGV+FYNNLINELLANGI P+VTLFHWDLPQALED+Y GF + K V+D+
Sbjct: 76   PEGKIRGGVNPLGVRFYNNLINELLANGITPFVTLFHWDLPQALEDEYSGFLSSKAVDDY 135

Query: 106  RNYADLCFKLFGDRVKYWTTLNEPYSFSAYGYNSGTFAPGRCSNYVGNCTAGNSGTEPYI 165
             +Y D CFK FGDRVK+W T NEPYSFS  GYN+GTFAPGRCSNYVGNCT GNSGTEPYI
Sbjct: 136  VDYVDFCFKTFGDRVKHWCTFNEPYSFSNNGYNTGTFAPGRCSNYVGNCTHGNSGTEPYI 195

Query: 166  VAHNLLLSHAAAVKVYRTRYQVS------------------------------------- 225
            VAHNL+L HAAAVK+YR +YQ S                                     
Sbjct: 196  VAHNLILGHAAAVKLYREKYQASQKGIIGITIVTHWFIPKSPKSEEDIKAAYRILDFLFS 255

Query: 226  --------------------------------------------IQEQIPKSI---SSLA 285
                                                        +  ++PK     S+L 
Sbjct: 256  LENPRRPSRFGYEDSNSYTLIVTKLCRFANPLTYGDYPEIMKAIVGHRLPKFTKEQSALV 315

Query: 286  TGKAEGEDWNHIA---------------------------EKDGVLIGPATGLNWLYIYP 345
             G  +    N+                              K G  IG  T LNWL+IYP
Sbjct: 316  KGSIDFLGVNYYTTNYAANNPAPNKVNFSYSGDSQTILSTSKGGHPIGTPTALNWLFIYP 375

Query: 346  EGIRLLLKYVKAEYKDPVIYITENGSSKRWNKDQIPPRPSC---------------ISSS 405
            +G+  L+ Y+K +YK+P IYITENG +   N   +P + +                +S +
Sbjct: 376  KGMYDLMLYIKDKYKNPPIYITENGLADA-NNASLPVKEALKDGLRIRYLANHLQYLSKA 435

Query: 406  SYQNLECKSVHESS-----RTSSNGSTRYNKSRSTQDLTTSSS--------------MAA 465
              + +  K  ++ +        +  + R+ K      +    +              MA 
Sbjct: 436  IQEGVNVKGYYQWAFWDDFEWDAGYTVRFGKPTCHAIVLAPKNLRYIIVDGIRFYQVMAT 495

Query: 466  ASSAPVLLIMLIAAATAGSGLADGVEPSHSSVPFNRSSFPPGFVFGAGSAAYQLEGAASI 525
              +A  L  +++A+  A +    G +PS  S+PFNR+SFP  F FGAG+AAYQ EGAA I
Sbjct: 496  VQAASFLHFVIVASLLAST---HGAKPSRYSMPFNRTSFPKDFTFGAGTAAYQSEGAAYI 555

Query: 526  DGRGPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRIL 585
            DG+GPSIWDTF K HPEKIWD   G  A DFYHRYKED+QLMKKIGLDSFRFSISWSR+L
Sbjct: 556  DGKGPSIWDTFTKQHPEKIWDQSTGNEAIDFYHRYKEDVQLMKKIGLDSFRFSISWSRVL 615

Query: 586  PKGNLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDF 645
            PKG + GGVNPLGV+FYNN+INELLANGI P+VTLF WDLPQALEDEY GF S+K V+D+
Sbjct: 616  PKGKISGGVNPLGVRFYNNLINELLANGITPFVTLFQWDLPQALEDEYYGFLSSKAVDDY 675

Query: 646  REYADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYI 705
             +Y D CFK FGDRVK+W T NEPYSF+  GYN GTFAPGRCSNYVGNCT GNSGTEPY 
Sbjct: 676  VDYVDFCFKTFGDRVKHWCTFNEPYSFSSNGYNSGTFAPGRCSNYVGNCTHGNSGTEPYT 735

Query: 706  VAHNLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKR------------------- 735
            VAHNL+L HAAAVK+YR KYQA Q G IGIT+VTHWF  K                    
Sbjct: 736  VAHNLILGHAAAVKLYRQKYQASQNGIIGITIVTHWFIPKSPKSEEDIKAAYRILDFLFA 795

BLAST of Cp4.1LG17g08300 vs. TAIR 10
Match: AT2G44480.1 (beta glucosidase 17 )

HSP 1 Score: 456.1 bits (1172), Expect = 5.5e-128
Identity = 237/504 (47.02%), Postives = 303/504 (60.12%), Query Frame = 0

Query: 326 VLLIMLIAAATAGSGLADGVEPS--HSSVPFNRSSFPPGFVFGAGSAAYQLEGAASIDGR 385
           + +I++I+  T+ S L   ++PS    S    RSSFP  F FGA S+AYQ EGAA++DGR
Sbjct: 6   IFIIIIISIITSISELY-ALDPSFLRLSTSLQRSSFPQDFRFGAASSAYQSEGAANVDGR 65

Query: 386 GPSIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKG 445
            PSIWDTF K +PEKI D  NG+VA +FY+R+KED+  MK+IGLDSFRFSISWSRILP+G
Sbjct: 66  EPSIWDTFTKQYPEKISDGSNGDVADEFYYRFKEDVAHMKEIGLDSFRFSISWSRILPRG 125

Query: 446 NLRGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREY 505
            + GGVN  G+ FYN++INEL++NGI P VTLFHWD PQALEDEY GF + ++V DF EY
Sbjct: 126 TVAGGVNQAGINFYNHLINELISNGIRPLVTLFHWDTPQALEDEYGGFLNPQIVKDFVEY 185

Query: 506 ADLCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVGNCTAGNSGTEPYIVAH 565
            D+CFK FGDRVK W T+NEP  F V GYN G  APGRCS+YV NCT GNS TEPY+VAH
Sbjct: 186 VDICFKEFGDRVKEWITINEPNMFAVLGYNVGNIAPGRCSSYVQNCTVGNSATEPYLVAH 245

Query: 566 NLLLSHAAAVKVYRTKYQAKQKGQIGITLVTHWFRAKRNTAASQAA-------------- 625
            L+LSHAA V++YR KYQ+   G IG+T+ T+W   K NT A + A              
Sbjct: 246 YLILSHAATVQLYREKYQSFHGGTIGMTIQTYWMIPKYNTPACREAAKRALDFFFGWFAD 305

Query: 626 ------------------------------------------------------------ 685
                                                                       
Sbjct: 306 PITYGDYPKTMRELVGNRLPKFTKKQSKMVRGSFDFFGLNYYTSRYVEDVMFYANTNLSY 365

Query: 686 --------------------TSLNWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSD 734
                               TS +WL+I PEG   +L YIK ++++PVI +TENGM   +
Sbjct: 366 TTDSRVNQTTEKNGVPVGEPTSADWLFICPEGFQDVLLYIKSKFQNPVILVTENGMPSEN 425

BLAST of Cp4.1LG17g08300 vs. TAIR 10
Match: AT2G44450.1 (beta glucosidase 15 )

HSP 1 Score: 428.3 bits (1100), Expect = 1.2e-119
Identity = 232/503 (46.12%), Postives = 295/503 (58.65%), Query Frame = 0

Query: 327 LLIMLIAAATAGSGLADGVEPSHSSVP-FNRSSFPPGFVFGAGSAAYQLEGAASIDGRGP 386
           LL++LI  A+      D +  ++SS P   RS FP  F+FG+ ++AYQ+EG A  DGRGP
Sbjct: 8   LLVVLIVLAS-----NDVLANNNSSTPKLRRSDFPEDFIFGSATSAYQVEGGAHEDGRGP 67

Query: 387 SIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNL 446
           SIWDTF + +PEKI D  NG VA + YH YKED+ L+ +IG +++RFSISWSRILP+GNL
Sbjct: 68  SIWDTFSEKYPEKIKDGSNGSVADNSYHLYKEDVALLHQIGFNAYRFSISWSRILPRGNL 127

Query: 447 RGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYAD 506
           +GG+N  G+ +YNN+INELL+ GI P+ T+FHWD PQALED Y GFR A++VNDFR+YAD
Sbjct: 128 KGGINQAGIDYYNNLINELLSKGIKPFATMFHWDTPQALEDAYGGFRGAEIVNDFRDYAD 187

Query: 507 LCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVG-NCTAGNSGTEPYIVAHN 566
           +CFK FGDRVK+W TLNEP +    GY  G  APGRCS +   NCT GN  TEPYIV HN
Sbjct: 188 ICFKNFGDRVKHWMTLNEPLTVVQQGYVAGVMAPGRCSKFTNPNCTDGNGATEPYIVGHN 247

Query: 567 LLLSHAAAVKVYRTKYQAKQKGQIGITLVTHW---------------------------- 626
           L+LSH AAV+VYR KY+A Q+GQ+GI L   W                            
Sbjct: 248 LILSHGAAVQVYREKYKASQQGQVGIALNAGWNLPYTESPKDRLAAARAMAFTFDYFMEP 307

Query: 627 --------------------FRAKRN------------------------------TAAS 686
                               F A+++                              T  S
Sbjct: 308 LVTGKYPVDMVNNVKGRLPIFTAQQSKMLKGSYDFIGINYYSSTYAKDVPCSTKDVTMFS 367

Query: 687 QAATSL---------------NWLYIYPEGIHLLLKYIKEEYKDPVIYITENGM-AYSDN 734
               S+               +WL IYP+GI  L+ Y K ++KDPV+YITENG   +S N
Sbjct: 368 DPCASVTGERDGVPIGPKAASDWLLIYPKGIRDLVLYAKYKFKDPVMYITENGRDEFSTN 427

BLAST of Cp4.1LG17g08300 vs. TAIR 10
Match: AT5G42260.1 (beta glucosidase 12 )

HSP 1 Score: 427.2 bits (1097), Expect = 2.7e-119
Identity = 232/503 (46.12%), Postives = 288/503 (57.26%), Query Frame = 0

Query: 327 LLIMLIAAATAGSGLADGVEPSHSSVP-FNRSSFPPGFVFGAGSAAYQLEGAASIDGRGP 386
           LL+ +I  A     L + +   HSS P   RS FP  F+FGA ++AYQ+EGAA  DGRGP
Sbjct: 8   LLVFIIVLA-----LNEVMAKKHSSTPKLRRSDFPEDFIFGAATSAYQVEGAAHEDGRGP 67

Query: 387 SIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNL 446
           SIWDTF + +PEKI D  NG +A+D YH YKED+ L+ +IG D++RFSISWSRILP+ NL
Sbjct: 68  SIWDTFSEKYPEKIKDGSNGSIASDSYHLYKEDVGLLHQIGFDAYRFSISWSRILPRENL 127

Query: 447 RGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYAD 506
           +GG+N  G+ +YNN+INELL+ GI P+ T+FHWD PQ+LED Y GF  A++VNDFR+YAD
Sbjct: 128 KGGINQAGIDYYNNLINELLSKGIKPFATIFHWDTPQSLEDAYGGFLGAEIVNDFRDYAD 187

Query: 507 LCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVG-NCTAGNSGTEPYIVAHN 566
           +CFK FGDRVK+W TLNEP +    GY  G  APGRCS +   NCTAGN  TEPYIV HN
Sbjct: 188 ICFKNFGDRVKHWMTLNEPLTVVQQGYVAGVMAPGRCSKFTNPNCTAGNGATEPYIVGHN 247

Query: 567 LLLSHAAAVKVYRTKYQAKQKGQIGITLVTHW---------------------------- 626
           L+L+H  AVKVYR KY+A QKGQ+GI L   W                            
Sbjct: 248 LILAHGEAVKVYREKYKASQKGQVGIALNAGWNLPYSESAEDRLAAARAMAFTFDYFMEP 307

Query: 627 ---------------------FRAK-------------RNTAASQAATSL---------- 686
                                F AK             RN  +S  A  +          
Sbjct: 308 LVTGKYPIDMVNYVKGGRLPTFTAKQSKMLKGSYDFIGRNYYSSSYAKDVPCSSENVTLF 367

Query: 687 ----------------------NWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSDN 734
                                 +WL IYP+GI  LL Y K ++KDPV+YITENG   +  
Sbjct: 368 SDPCASVTGEREGVPIGPKAASDWLLIYPKGIRDLLLYAKYKFKDPVMYITENGRDEAST 427

BLAST of Cp4.1LG17g08300 vs. TAIR 10
Match: AT2G25630.1 (beta glucosidase 14 )

HSP 1 Score: 425.6 bits (1093), Expect = 7.9e-119
Identity = 220/468 (47.01%), Postives = 279/468 (59.62%), Query Frame = 0

Query: 345 VEPSHSSVP-FNRSSFPPGFVFGAGSAAYQLEGAASIDGRGPSIWDTFIKNHPEKIWDHK 404
           V   HSS P   ++ FP  F+FGA ++AYQ+EGAA  DGRGPSIWDTF + +PEKI D  
Sbjct: 20  VAKRHSSTPKLRKTDFPEDFIFGAATSAYQVEGAAQEDGRGPSIWDTFSEKYPEKIKDGS 79

Query: 405 NGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNLRGGVNPLGVKFYNNVINE 464
           NG +A D YH YKED+ L+ +IG +++RFSISWSRILP+GNL+GG+N  G+ +YNN+INE
Sbjct: 80  NGSIADDSYHLYKEDVGLLHQIGFNAYRFSISWSRILPRGNLKGGINQAGIDYYNNLINE 139

Query: 465 LLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYADLCFKLFGDRVKYWTTLNE 524
           LL+ GI P+ T+FHWD PQ LED Y GFR A++VNDFR+YAD+CFK FGDRVK+W TLNE
Sbjct: 140 LLSKGIKPFATIFHWDTPQDLEDAYGGFRGAEIVNDFRDYADICFKSFGDRVKHWITLNE 199

Query: 525 PYSFTVFGYNGGTFAPGRCSNYVG-NCTAGNSGTEPYIVAHNLLLSHAAAVKVYRTKYQA 584
           P +    GY  G  APGRCS +   NCTAGN  TEPYIV HNL+L+H  A+KVYR KY+A
Sbjct: 200 PLTVVQQGYVAGVMAPGRCSKFTNPNCTAGNGATEPYIVGHNLILAHGEAIKVYRKKYKA 259

Query: 585 KQKGQIGITLVTHW----FRAKRNTAASQAATSLNWLYI--------YP----------- 644
            QKGQ+GI L   W      +  +  A+  A +  + Y         YP           
Sbjct: 260 SQKGQVGIALNAGWNLPYTESAEDRLAAARAMAFTFDYFMEPLVTGKYPVDMVNNVKGGR 319

Query: 645 ------------------------------------------------------EGIHLL 704
                                                                  GI  L
Sbjct: 320 LPTFTSKQSNMLKGSYDFIGINYYSSSYAKDVPCSSENVTMFSDPCASVTGERDGGIRDL 379

Query: 705 LKYIKEEYKDPVIYITENGMAYSDNTTLPIKEALKDGTRIRYHYAHLEAILQAIKEGVNV 734
           + Y K ++KDPV+YITENG   +       K  LKDG RI Y+  HL+ +  AI  G NV
Sbjct: 380 ILYAKYKFKDPVMYITENGRDEASTG----KILLKDGDRIDYYARHLKMVQDAILIGANV 439

BLAST of Cp4.1LG17g08300 vs. TAIR 10
Match: AT5G44640.1 (beta glucosidase 13 )

HSP 1 Score: 425.2 bits (1092), Expect = 1.0e-118
Identity = 229/503 (45.53%), Postives = 289/503 (57.46%), Query Frame = 0

Query: 327 LLIMLIAAATAGSGLADGVEPSHSSVP-FNRSSFPPGFVFGAGSAAYQLEGAASIDGRGP 386
           LL+ +I  A+      + +   HSS P   RS FP  F+FGA ++AYQ+EGAA  DGRGP
Sbjct: 8   LLVFIIVLAS-----NEVIAKKHSSTPKLRRSDFPKDFIFGAATSAYQVEGAAHEDGRGP 67

Query: 387 SIWDTFIKNHPEKIWDHKNGEVATDFYHRYKEDIQLMKKIGLDSFRFSISWSRILPKGNL 446
           SIWDTF + +PEKI D  NG +A+D YH YKED+ L+ +IG  ++RFSISWSRILP+GNL
Sbjct: 68  SIWDTFSEKYPEKIKDGTNGSIASDSYHLYKEDVGLLHQIGFGAYRFSISWSRILPRGNL 127

Query: 447 RGGVNPLGVKFYNNVINELLANGIIPYVTLFHWDLPQALEDEYNGFRSAKVVNDFREYAD 506
           +GG+N  G+ +YNN+INELL+ GI P+ T+FHWD PQ+LED Y GF  A++VNDFR+YAD
Sbjct: 128 KGGINQAGIDYYNNLINELLSKGIKPFATIFHWDTPQSLEDAYGGFFGAEIVNDFRDYAD 187

Query: 507 LCFKLFGDRVKYWTTLNEPYSFTVFGYNGGTFAPGRCSNYVG-NCTAGNSGTEPYIVAHN 566
           +CFK FGDRVK+W TLNEP +    GY  G  APGRCS +   NCTAGN  TEPYIV HN
Sbjct: 188 ICFKNFGDRVKHWMTLNEPLTVVQQGYVAGVMAPGRCSKFTNPNCTAGNGATEPYIVGHN 247

Query: 567 LLLSHAAAVKVYRTKYQAKQKGQIGITLVTHW---------------------------- 626
           L+L+H  AVKVYR KY+A QKGQ+GI L   W                            
Sbjct: 248 LILAHGEAVKVYREKYKASQKGQVGIALNAGWNLPYTESAEDRLAAARAMAFTFDYFMEP 307

Query: 627 ---------------------FRAKRN------------------------------TAA 686
                                F AK++                              T  
Sbjct: 308 LVTGKYPVDMVNNVKDGRLPTFTAKQSKMLKGSYDFIGINYYSSSYAKDVPCSSENVTLF 367

Query: 687 SQAATSL---------------NWLYIYPEGIHLLLKYIKEEYKDPVIYITENGMAYSDN 734
           S    S+               +WL IYP+GI  LL Y K ++KDPV+YITENG   +  
Sbjct: 368 SDPCASVTGEREGVPIGPKAASDWLLIYPKGIRDLLLYAKYKFKDPVMYITENGRDEAST 427

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A2SY662.1e-15654.20Vicianin hydrolase (Fragment) OS=Vicia sativa subsp. nigra OX=3909 PE=1 SV=1[more]
B8AVF08.8e-13147.08Beta-glucosidase 12 OS=Oryza sativa subsp. indica OX=39946 GN=BGLU12 PE=3 SV=1[more]
Q7XKV41.5e-13047.08Beta-glucosidase 12 OS=Oryza sativa subsp. japonica OX=39947 GN=BGLU12 PE=1 SV=2[more]
Q7XKV27.0e-12846.92Beta-glucosidase 13 OS=Oryza sativa subsp. japonica OX=39947 GN=BGLU13 PE=2 SV=2[more]
O648827.7e-12747.02Beta-glucosidase 17 OS=Arabidopsis thaliana OX=3702 GN=BGLU17 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
TXG55760.11.79e-29449.73hypothetical protein EZV62_017073 [Acer yangbiense][more]
XP_023513422.16.94e-29381.64vicianin hydrolase-like [Cucurbita pepo subsp. pepo][more]
XP_023004523.17.16e-28879.88vicianin hydrolase-like [Cucurbita maxima][more]
XP_022960188.14.76e-28679.49vicianin hydrolase-like [Cucurbita moschata][more]
KAB5520230.12.48e-26945.82hypothetical protein DKX38_024549 [Salix brachista][more]
Match NameE-valueIdentityDescription
A0A5C7HFE88.67e-29549.73Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_017073 PE=3 SV=1[more]
A0A6N2KL694.60e-29152.00Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS96664 PE=3 SV=1[more]
A0A6J1KZS83.47e-28879.88vicianin hydrolase-like OS=Cucurbita maxima OX=3661 GN=LOC111497800 PE=3 SV=1[more]
A0A6J1H8462.30e-28679.49vicianin hydrolase-like OS=Cucurbita moschata OX=3662 GN=LOC111461000 PE=3 SV=1[more]
A0A5N5JS021.20e-26945.82Uncharacterized protein OS=Salix brachista OX=2182728 GN=DKX38_024549 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G44480.15.5e-12847.02beta glucosidase 17 [more]
AT2G44450.11.2e-11946.12beta glucosidase 15 [more]
AT5G42260.12.7e-11946.12beta glucosidase 12 [more]
AT2G25630.17.9e-11947.01beta glucosidase 14 [more]
AT5G44640.11.0e-11845.53beta glucosidase 13 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001360Glycoside hydrolase family 1PRINTSPR00131GLHYDRLASE1coord: 661..672
score: 44.22
coord: 682..699
score: 55.56
coord: 706..718
score: 49.13
coord: 638..646
score: 63.7
IPR001360Glycoside hydrolase family 1PFAMPF00232Glyco_hydro_1coord: 220..263
e-value: 1.6E-6
score: 26.8
coord: 3..186
e-value: 6.8E-69
score: 232.4
coord: 357..612
e-value: 8.9E-92
score: 307.9
IPR001360Glycoside hydrolase family 1PANTHERPTHR10353GLYCOSYL HYDROLASEcoord: 209..266
IPR001360Glycoside hydrolase family 1PANTHERPTHR10353GLYCOSYL HYDROLASEcoord: 609..734
IPR001360Glycoside hydrolase family 1PANTHERPTHR10353GLYCOSYL HYDROLASEcoord: 2..198
IPR001360Glycoside hydrolase family 1PANTHERPTHR10353GLYCOSYL HYDROLASEcoord: 327..611
NoneNo IPR availableGENE3D3.20.20.80Glycosidasescoord: 200..272
e-value: 1.4E-11
score: 45.5
NoneNo IPR availableGENE3D3.20.20.80Glycosidasescoord: 610..734
e-value: 8.1E-49
score: 168.3
NoneNo IPR availableGENE3D3.20.20.80Glycosidasescoord: 339..609
e-value: 5.1E-122
score: 409.7
coord: 1..199
e-value: 4.6E-91
score: 307.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 287..315
NoneNo IPR availablePANTHERPTHR10353:SF261BETA-GLUCOSIDASE 17coord: 327..611
NoneNo IPR availablePANTHERPTHR10353:SF261BETA-GLUCOSIDASE 17coord: 609..734
NoneNo IPR availablePANTHERPTHR10353:SF261BETA-GLUCOSIDASE 17coord: 209..266
NoneNo IPR availablePANTHERPTHR10353:SF261BETA-GLUCOSIDASE 17coord: 2..198
IPR018120Glycoside hydrolase family 1, active sitePROSITEPS00572GLYCOSYL_HYDROL_F1_1coord: 638..646
IPR033132Glycosyl hydrolases family 1, N-terminal conserved sitePROSITEPS00653GLYCOSYL_HYDROL_F1_2coord: 363..377
IPR017853Glycoside hydrolase superfamilySUPERFAMILY51445(Trans)glycosidasescoord: 350..732
IPR017853Glycoside hydrolase superfamilySUPERFAMILY51445(Trans)glycosidasescoord: 2..262

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG17g08300.1Cp4.1LG17g08300.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
molecular_function GO:0008422 beta-glucosidase activity
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds