Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGGTCTGACGTGGTGGAAGAGGCCCCACATCCCTATTTATGTATATATATTATATTACACTGATCAGTTGCAGTTTCGCTTTCGCGCGCTCTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCCGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCAATTTTCTCTACAAATTTTCATTTCTACTTTCCACTTCCACTTCCGATCTCTCTTTTTTCTCTTTTCCTCTTCTTATGGCGTCCTTTTTTTCTTCCACTTTCCTTATCTTCGTTACAATTTTCGCCGCGTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATATATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCTGCGTATGTCCAAGTTGCTGCAGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCATCTCCCTAGCTTTGACTTTCAGATTGTTTCTAAGGTACTTGGCTGATTTTCTTTACAAGTTCCAGTGTTATCTATTTGACTGATGAAAGAAGTATAGACTACAGAGTAGTTTGTTAGAGAGATTTTGATTTGTTTTCCTAGGCACTCAAACATTTAGGAAAATGAAAGAATTATTGTACATTTGAAGTTTCGTTTGCTTATAGTGTCTGTTTTTCATGTTCATATTTCTCGTGGTCTTTCTATGTGTTGAATTTTCTCAGGACAAATGTGGTGGAGAATCTTGCTTCGTGATCAGGAACCATCGTGCGTTCAGGAAACCCGGGGATCCTGAGATTTTGTACGTAGAAACTATCTGCTAATCTTTTCATTCAGTTTGCCTTGATCTAACTGGCCTTTAAAATTGGATTAAGCTACTATGTTGCTTCTTGTTATTTAATAAGTGAAGGATCGGGAGTTCACATCACGAAAAATATTAATTCGTATTGCACAGTGTTCTTTGTATATAGAAATTATAATCGATCTCTTACTTTTCAACATGAACACGATTTCCAATATTTTTCTAGAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTGTTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAAACGAGGTTGTGGTTCAGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTGTAAGGTTTTCTTTTTGGATTTTTCATTACATCCATCTCGTTGCAATGCACAATTTTCATAACTGAAATTGAATTACAAAATTGTGTTTCTCCAGTAACAGTTATTCGTTATTGTTTCAATACATATTTTTCTATTTTCACTTTCTTTGCTTCTGGTAGACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCGGGTATGTTTTCCTCTATTTATAAGTGTTATGTTATTCATACTGTCTTCAACATAATGATATTTTCAGCAAGTGTGAACTGTCATCTTACCCTACATATGCTATAATCTCTGACTTATCTTAATATTTTTGATGATTGAAACAATGTTACTAATTGTGCATCATGGGACCATGGATGCAATAGTCTACATCCTTTGCACTATAGCAATACAAGTATAATATTACTCTGCTGCTTCCTGCTATTGGAAATTGTTTCTGGCATTATCTCAACTTTTCCTTACATTAGAAGCTTCTTTTGTTCATTACTTTCTTCATCATTGCTATGGTAGCTTTCATGTCCTTTTCTTCAAAAGAAAAAAAAGAAAAAGAAAAAAACAGCTTCCCTTGAGTTATAGGCATGCTCGTTTCCTACATTTTAATTTATGCAACAAAGTGGCTAAAGCGAAAAAGCTATCTCTAGTAATTTTTTTTGGTTAGATCCTATTACAATTTACTTAGTTCAAAATATAGATCGATCTCATTCCACACTATATTTGCAGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCTGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAGTGAGTAGTTGCTTTGTAACTGCGTTGAAATTAGATCATTGTTGGATCGACATTCTACTCCGCTGGGACTAAAATGTAAGATAGAATGATAAATATATAATCATCGTCACAATCAACTACATAATCTTCCTGCTTTAATCAGATATATGTAGAGAATGGCTTAAATGTGGCATGTATTTTGACTCTTACATTTCAGTTCTAGTTATAAAAAACTCACTTATAATTTCTTATCATTGCGTTTTTAATATCAACACTCTTATCACAGATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGGTATTTTTTAAGCACTGTCAGATAATTTTTGTAGTACCTCATGTTGTAATGGAGTTCATAATTCTTTGGTCTTGACCCAGTAGGAAACTTAAGATTCTGTTTCATTTAAATATTTTCACTTTCATGCTGGTTTTCTTTTTGAATTGCTGTTTTAAAGCTTCAGTAATGTTGCTATAGTTCGTACAGCCTATTTACTGGACAAAGTTCTTGAAAGCTTTTGTGCTGGGAAGAATTTATGGACCATGGTTTTCTTTTTTCTCCTTTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCAGCAGCAAAAATAACACGCTTGGGAAATTGGTAACTCAACACTCCCTTATGGTCTTATCTTATATAAATTTCATATTTGGTATGGTTAATGATGTTTTTATTGTGTTGTTTTGGCATTTTCTTAGAATATATTTTGAAACATCCTGGAACCAGTGTTGGTCAAATCCATGATACTATAAAAGAACAAAAGTAGTCCATGATACGAAGAATGTGATTCACTTTTGAAAATAATCACTTTAGTGAACTGTTTGATGATATTTTTTTATTTTGTTTTTGGTTTTAATATTTTTCAAGGTTTTTTAAGTTTGTCTAGACGTGCTTACTATTTTAAAACAAAGTGTAATTTGAATGACTACTATTAAAATTTGAGGATTTTTAGGATAAAAACATTACCTAAAGAAAAATATTTAAAATGGAAAAAAAAAAAAGTGATCGAAGGATTAGTGGAAGGAAGCAGGCAAGTGAAGAGAATCTAGTGGAAAACTGTATACACAGGGAGAGTAGTAGGAGTGAATGTCTAAATAGGAGAAAGGCATGTGAATCGAGAGAAGTTAGAAATTTTATTTTTAGATGGAGTCGACTAGAAAATAATATATAGAGGGAGATCTCGAGAGGGATTAATGCCAGAAAGTATTGGGATAGAAGTTAATGAAATTATTTTGATTGATGAGTGAGAAAGTATATCTAACATGATTAATAAGAGAAACAATGGAGAGGAATTAGTAAGCTAAGTTATATAGAGAGAAGTTATTAAGTTGGAGCTAACATGAGTTCCATTCATTAGAAATTTACTGCCAAAATATGAAATGATCTCTTCATAGAGTAGTCTGACCAGAAAACCTCTTACTTTTTATTTGAAGCAACAACCTGCAGTAATCCTTCAAATCACGAGACCTTACAATGATGCTTGAGCAACAAGAAATTTTGCAGAGTACTTGTTTAACTAATGACAATGTCAATTGTCTCTATGCTACCATTGTCCTGCAGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGACGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGGTATTTTTAGACCTACGCGAATGAAATGTATGAGAACATATTTCATGCCTTTTCTTAAATGATGCTTGATGATTCTAATAGTCTGAAATTGCATTTTTTGGCCAGAATATGGAAGAACATCCCATGTATACAATTGGTATGGTGCTTGTTTCCTCAACATATTTTGTATTGTGTGCGGTTCATTTTTCTTTTAATCTTCACTGTAAGATGTGTATGGTCAGCTTTGTTACATTATAATAGCAGAAAAGCTAGCGAGAAACAAGCATTTAAAAATGGTGCAATACTGGTTTCAACAGTCTTTAAATTGTATTTTCTTCAAACATGAGAACTATCCGCCATCAATTTTTCACATCATGAATATCTTTTCACCTTACAGTGATACCTTTGACGAGAACACTCCACCTGTTGATGATGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGTAATTGTTAGTTCGTTTTTCCATGTAAGGGGCTGTTAGGCCAATTCACTTAATTCTCTGTATAGTTCCAATTTTTATGGATGTCCCTGAAGGGACCCAATTCACTAAGTTTTCTAAGTCATATCACAAAGCAAGTGGAACATGATATGGATTACTCTAGTGAATGTGCATTTCATCTCTTCTTGTTCTGTTTTGGTGAAATGTGAACTTTTTTCTCGATCTTCTTAATGAAATTTGTTCATGTACAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGTTATAATCGTGCTTTATGTCACCAATTGCTCATGCAATCATACACGAATATATCTTCCTACTTTCAATCATCATTAGATCATTGCATGCATGAAGAATTTTATCAGGGAATCTTTGACTCAGATCCGTTGTGTTATTTGTAAATGCTAATTCATTTACTTGGGCTCGTTGTATGGTGTACCTTCTTTTATCTACTCATCTTTAGGGTTGTCATTGTGCATTACTTTCACTTCTGGTTGAAATTGGTGAGATTTTTCAGGCCCTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTTCTTGATCTGTATGCTGAAGTGAAGCCTATCTGGATATCCTCTGAGCAATTTTATGGCATTCCTTACATCTGGAAAGTCTCTATTTCCATTCTTTTGCTTAATCTTAATGCTCATATCTAATAAATGGTCAAGGCTTAGGCTTACGGAAGGCTTTTTCCATAGCATATCAGCAATGTTGATGAGTATGTTATAATTATCAGTCAAGAATCGACATTCAAAGTTCATTAGTTTTAAGTTTTCTTTATTTGTTACATTCATGAATGTCTAAAGTTATTTTTATCCGGCCTTGACCTTTTTTTAGTTGCTATACATTGGTTGATTATGTTGACAATCTAAGAACTAACCATTTAAAAGCAGTGGTGTTGAGAATGGCATGTTATATATCTTTTTACAAGCCCATTTTTATAAGTTAATAATAATTTGAAGAGAGAATCTCAAACCTACGGTTGATTTATTAAAGTTTCCTTGATTAAGGATTACATTATCTAATTTTTTTTGGCTAAATTTTATATCCTTTTCAATGTAATTTCAGTCCAATGTTATCTTCGTGTTTTTACTTTATAAGAAAGATGTTTCTGGCAAGGCAATTTTAATACCCCATTTCATTTAGTTTGCAGGTTAAAACAGCAGTTATCTGACTTTAGCCGTTTCAAGAAACTTTTTGTAACAAAACTGCTAATCTCGTTAATTCTGTTGTCTTGAGCTAAAACTAGATTGGTTTATTGTTTCATTAGCCATATGCTATATTTCAATGAATGTGTAATACTCGTGAATTTTAGCTTCAATAACATGAATTTCTATTCACAGGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTAAGATTTTCATCTACTATTATTTGGTGTCTAGATGTTGTCATAATGTTTTGGCAGCATATTGAACTCCTATGGTTATACAAATTTTTTATTCGAGGAGGAAACAAGTCTCTTTACTAATAAAAATGAGACTACCGCTCAAAATACAAGATGATTATACAAAGAACAGAAAATGTATAATTTAGGGATCGGTAGGCACACCTAGGCGTCTCAACTAGGTTGACACCCCTTTAGCACCCTCATCATCTCCAAAATAAATACCCAACTCTAGGTATACTAATTTTGATAAATACTCCAATAGTTTTATAAAAGTAAATTTGGTGCTTTCTTTCTCTCGTCCATAAGTGCAGCTCCAGGTATATATGAGTATTAATGTAACTTTGAGGAGATGAAGTGATTCTCTTTTTTTCTTTTCATTATTGAAAAAACGGCTTGAAATATTATTTTGGCATTTAATAATACTCAATTCTTTTATTTCTTTTTATTATGTAGTGAAGTGACAAATGGCTTAATCATAGTGGATTGTGGCAGCACAGTCACTCTGCCAATTACAATAAAAATTCTACCCATAACATATCATCTCTCATTAACATGTAAAAGCTTTATATTTAACACATCTATGAAGATATTACTAATTCCAATCTTCCTCTACCCTTCTCTTTTTCTCATATATCTGAGGTGTATCATTTATTTTGCAATGTTTAAGTTCGTTTTTAGCTTTGTCAACGGTGTTTTCATTTGAAGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGGTACAAATGTTTTACAATTATTTTTAAAATATAATGGCCCATTTGCTTCATTGCCAGTTGTATTCATGCTCCATGTGTGGAAAGCTGACAAGTGACATTGCATAGATTTAATCGAATATACGGAAGGGTCCATCATATGAAAAGGATAGTGAGATCATTTCCTTTCTGCCGTGCATATGTTCACATCCTGGCTAATGCCAATGTGGTTTTATTTTCCAAGTTGATTTCACAACTAGTTCTGTTGATGTCTTGCATGCAATGCATTATGAGAAGGAACTGACAGATGCGAAAGATAGAAAATGTTTCATAATAAATTCCATACAATTCAAGAACAATCACCACAAGTTCTATGTAAAGTATCTAAGCATGGAAGAGAGCTGAGATCTATATGGAGATTAGAAATACCTGGAGCTATTATATAGATCAACTAACTTGCGAAGTTACATGAATTAAGCAATCTTCCTCAACTAACCAAATATAGGCAATCTTCTTTGCCAACAAATATAGGCAACTTCCTAAGCTATGCTCTACTATAATAGTAAGTATATGTTTTTTCTCCACCGACCATACTGATGTAGACGAAGCCCATAGGTCAATAATTAAGGCCATTTGCTTATTAAATAGCTAAGATAAAATTTTCGTTTGATTCGTTACCCAAATATGAAATAATTGTTAACTTCATTGCAAATCATTCCAACAATAGAAAATTTCTTTTCTAGTTCAACAATTTCAGGTTTGTTCATAACATCTTGGCCAACATACTGCACACTTTCCCCATGATTTCTGAATTTTAGCTCTATCACTTTCTATTTTAAAGTTTATTTTTCTCCAATAGAATTTATTTTCATGATAGATTTTGCGCAATTGTGTTTATAGAAAATTTTGTTCCCATCAATTAAAATTTGTAAAATACTATCCTTTCATTTTCAGAAATGGCTTCCTCAGTATTCAGTCAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCGTCTACAACTGCACTGACGGTGCCAATGTAAGTATGAAAAACACTGTGATGGCCATTCAACCTTTTCTTCTCTTTTGGTACTTCTACGACTAGTGTTTCCCGTCTTGTTGCTTTCAAATGGTCATCGCCGTTCTCCCTTCGTCTATTGTGTTAAAAAATGTTATGCTAACATATGTCTGTCACTATGGAAAGGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGGCAATCTTAGTATTACCCGAGGGGTCCAACCGACATGGGAACTTGGACTCAAGCGTGGATCGCCTCCAGGATGCAACGTTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTAGTGCACTCAAGCTTTTCATTGCTGGTGGCGATCAACTCTCTAGTAGCAACACTTACAGGTTGACTCACGAATTTAAAATATCTCATCATTTACTTGAAACCTATAGAGATATCGAATATCGTCCTTTTCTCTATGTTGAAGTCTAGATTATTGAATTGGCATTTGTCACCACTCTTAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCGAATGAACTGTTCTTTAGAATTGTCAAGGCATATCAGTTACATGATGTACAGACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCGAAGTGAAGAGGAGGAAAAACAGGTTCAATTCAAGTCCATTCTACCAATTATGAGAAAATTATACTAATAACATCCGAATTCTGACTCATCTCAAATTACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAAGAAGCAAGTTTGCTTCGTGATTATGGTAGTGATTATTATCATTTATCATTGAGATTAGGCAATTGTACATTCAAATTTGACTTATTTAACCGCGACCCAGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTATTACTGTCCTCGAGCTGCAATATACTTGAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCACTGGCTATACAACAAGTACTTGCAAATACCTGAGAGCTCTGACCAATGAATCAGGCAAGTTATCCTGCTAGCAACTAAGATTTGAATGTTGCATAATGTTTATTTTGCATATTTTAATTCCTCTCCCGTTTTCTGTTTCGTGTTTGAACTAGTATTATTATTTCGCAGGTTTACAACCTCCTTTTCCTTCTACACAAATGGCCAACGACACAATTTATCTCCCGGTAATGCAAAACCATTTCTTTGTCTTTTAAATAGTATCATTGAATTTCTTGTGAACGTTAAGCTTATTGTGTCATTTCCAATCCGAACATAATTTAACAGATAAGACATCAACTATCATGAATCTGTTGACCTAAACAAAATATATTTAATTATGATCCCAGC
mRNA sequence
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGTTGCAGTTTCGCTTTCGCGCGCTCTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCCGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCAATTTTCTCTACAAATTTTCATTTCTACTTTCCACTTCCACTTCCGATCTCTCTTTTTTCTCTTTTCCTCTTCTTATGGCGTCCTTTTTTTCTTCCACTTTCCTTATCTTCGTTACAATTTTCGCCGCGTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATATATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCTGCGTATGTCCAAGTTGCTGCAGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCATCTCCCTAGCTTTGACTTTCAGATTGTTTCTAAGGACAAATGTGGTGGAGAATCTTGCTTCGTGATCAGGAACCATCGTGCGTTCAGGAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTGTTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAAACGAGGTTGTGGTTCAGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCGGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCTGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCAGCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGACGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACATCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGATGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTTCTTGATCTGTATGCTGAAGTGAAGCCTATCTGGATATCCTCTGAGCAATTTTATGGCATTCCTTACATCTGGAAACAGTGGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTCAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCGTCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGGCAATCTTAGTATTACCCGAGGGGTCCAACCGACATGGGAACTTGGACTCAAGCGTGGATCGCCTCCAGGATGCAACGTTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTAGTGCACTCAAGCTTTTCATTGCTGGTGGCGATCAACTCTCTAGTAGCAACACTTACAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCGAATGAACTGTTCTTTAGAATTGTCAAGGCATATCAGTTACATGATGTACAGACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCGAAGTGAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAAGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTATTACTGTCCTCGAGCTGCAATATACTTGAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCACTGGCTATACAACAAGTACTTGCAAATACCTGAGAGCTCTGACCAATGAATCAGGTTTACAACCTCCTTTTCCTTCTACACAAATGGCCAACGACACAATTTATCTCCCGGTAATGCAAAACCATTTCTTTGTCTTTTAAATAGTATCATTGAATTTCTTGTGAACGTTAAGCTTATTGTGTCATTTCCAATCCGAACATAATTTAACAGATAAGACATCAACTATCATGAATCTGTTGACCTAAACAAAATATATTTAATTATGATCCCAGC
Coding sequence (CDS)
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGTTGCAGTTTCGCTTTCGCGCGCTCTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCCGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCAATTTTCTCTACAAATTTTCATTTCTACTTTCCACTTCCACTTCCGATCTCTCTTTTTTCTCTTTTCCTCTTCTTATGGCGTCCTTTTTTTCTTCCACTTTCCTTATCTTCGTTACAATTTTCGCCGCGTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATATATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCTGCGTATGTCCAAGTTGCTGCAGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCATCTCCCTAGCTTTGACTTTCAGATTGTTTCTAAGGACAAATGTGGTGGAGAATCTTGCTTCGTGATCAGGAACCATCGTGCGTTCAGGAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTGTTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAAACGAGGTTGTGGTTCAGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCGGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCTGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCAGCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGACGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACATCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGATGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTTCTTGATCTGTATGCTGAAGTGAAGCCTATCTGGATATCCTCTGAGCAATTTTATGGCATTCCTTACATCTGGAAACAGTGGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTCAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCGTCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGGCAATCTTAGTATTACCCGAGGGGTCCAACCGACATGGGAACTTGGACTCAAGCGTGGATCGCCTCCAGGATGCAACGTTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTAGTGCACTCAAGCTTTTCATTGCTGGTGGCGATCAACTCTCTAGTAGCAACACTTACAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCGAATGAACTGTTCTTTAGAATTGTCAAGGCATATCAGTTACATGATGTACAGACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCGAAGTGAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAAGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTATTACTGTCCTCGAGCTGCAATATACTTGAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCACTGGCTATACAACAAGTACTTGCAAATACCTGAGAGCTCTGACCAATGA
Protein sequence
MKINGRKLVKRSCSFAFARSPLSSGHRRPLPLSGSHWNFSKPIAIAGFLDVIHHHANFLYKFSFLLSTSTSDLSFFSFPLLMASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ*
Homology
BLAST of CSPI01G01750 vs. ExPASy Swiss-Prot
Match:
Q9FNA3 (Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1)
HSP 1 Score: 996.5 bits (2575), Expect = 1.9e-289
Identity = 467/782 (59.72%), Postives = 585/782 (74.81%), Query Frame = 0
Query: 113 ISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLPSFDFQIVSKDKCGGESCFVIRNHRA 172
I LL+ D + VQ +AA+G+L+RLLP+H SF+ +I+SKD CGG SCFVI N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 173 FRKPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTN 232
+ G PEILI G TGVEI +GLHWYLK+ C AH+SWDKTGG Q+ SVP+ G LPRI +
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 233 EVVVQRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVF 292
+ ++RP+P NYYQN VTSSYS+ WW W+RWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 293 RKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 352
++FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 353 PVLPAFSGNIPAAFKQIYPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIE 412
PVLP+FSGN+P+A ++IYP A ITRL NW TV D RWCCTYLL+ DPLF+EIG+AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 413 QQQKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSY 472
QQ +EYG +++YNCDTF+ENTPP + EYISSLG+A++ M G+ NAVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 473 D-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGN 532
D FW+P Q+KALLHSVP G+++VLDLYAEVKPIW S QFYG PYI WCMLHNF GN
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYI---WCMLHNFGGN 447
Query: 533 VEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKW 592
+EMYG LDSI+SGP++AR S STMVGVGM MEGIEQNPVVY+L SEMAF+ KVDV+KW
Sbjct: 448 IEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKW 507
Query: 593 LPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGS 652
L Y+ RRY I+ AW++LYHTVYNCTDG D N D IV PD DPS+ V +
Sbjct: 508 LKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLK 567
Query: 653 NRHGNLDSSVD-------RLQDATFDRP--HLWYPTSEVISALKLFIAGGDQLSSSNTYR 712
+ + S+ QD T D P HLWY T EVI ALKLF+ GD LS S TYR
Sbjct: 568 QKDSYMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYR 627
Query: 713 YDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLL 772
YD+VDLTRQ L+K +N+++ V A+ D+ ++ LS++FLEL+ D+D LLA + LL
Sbjct: 628 YDMVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLL 687
Query: 773 GPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCP 832
G WL+SAK+LA++ +E KQYEWNARTQ+TMW+D+ + S L DY NK+WSGLL DYY P
Sbjct: 688 GTWLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLP 747
Query: 833 RAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDW-QSSRKIYPVESNGDALDTSHWLYN 884
RA +Y + +S + F + WRREWI +++ W QSS ++YPV++ GDAL S L +
Sbjct: 748 RARLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLS 804
BLAST of CSPI01G01750 vs. ExPASy Swiss-Prot
Match:
P54802 (Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2)
HSP 1 Score: 552.0 bits (1421), Expect = 1.3e-155
Identity = 292/758 (38.52%), Postives = 437/758 (57.65%), Query Frame = 0
Query: 130 QVAAARGVLRRLLPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGV 189
+ AA R ++ RLL P+ DF + + + + + G + + G TGV
Sbjct: 28 EAAAVRALVARLLGPG-PAADFSVSVERALAAKPGL---DTYSLGGGGAARVRVRGSTGV 87
Query: 190 EILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAV 249
AGLH YL+ +CG H++W GSQL +P+ LP + E+ P YYQN
Sbjct: 88 AAAAGLHRYLRDFCGCHVAW---SGSQL-RLPRP--LPAV-PGELTEATPNRYRYYQNVC 147
Query: 250 TSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGP 309
T SYSF WWDW RWE+EIDWMAL GIN+ LA++GQEAIW++V+ ++ +++++FF GP
Sbjct: 148 TQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGP 207
Query: 310 AFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQI 369
AFLAW RMGNLH W GPLP SW +QL LQ +V+ +M GMTPVLPAF+G++P A ++
Sbjct: 208 AFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRV 267
Query: 370 YPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDT 429
+P +T++G+W H + + C++LL DP+F IG F+ + KE+G T H+Y DT
Sbjct: 268 FPQVNVTKMGSW--GHFNCSYSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADT 327
Query: 430 FDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDP-FWRPQQMKALLHSV 489
F+E PP + Y+++ +A++ M A D+ AVWL+QGW+F + P FW P Q++A+L +V
Sbjct: 328 FNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAV 387
Query: 490 PLGRLVVLDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEA 549
P GRL+VLDL+AE +P++ + F G P+I WCMLHNF GN ++G L+++ GP A
Sbjct: 388 PRGRLLVLDLFAESQPVYTRTASFQGQPFI---WCMLHNFGGNHGLFGALEAVNGGPEAA 447
Query: 550 RSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV-DVKKWLPQYSVRRYGHLVPSI 609
R P STMVG GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++ RRYG P
Sbjct: 448 RLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDA 507
Query: 610 QDAWDVLYHTVYNCT-DGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQD 669
AW +L +VYNC+ + NR +V P + +
Sbjct: 508 GAAWRLLLRSVYNCSGEACRGHNRSPLVRRPSLQMNT----------------------- 567
Query: 670 ATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVK 729
+WY S+V A +L + L++S +RYDL+DLTRQA+ + + +
Sbjct: 568 ------SIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARS 627
Query: 730 AYQLHDVQTMASLSQEF-LELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWN 789
AY ++ ++ EL+ +D +LA FLLG WL+ A+ A SE E YE N
Sbjct: 628 AYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQN 687
Query: 790 ARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSN 849
+R Q+T+W E ++L DY NK +GL+ +YY PR ++L+ L +S G F
Sbjct: 688 SRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQ 734
Query: 850 WRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKY 884
+ + +L + S++ YP + GD +D + ++ KY
Sbjct: 748 FDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
BLAST of CSPI01G01750 vs. ExPASy TrEMBL
Match:
A0A1S3BVG2 (alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 SV=1)
HSP 1 Score: 1637.5 bits (4239), Expect = 0.0e+00
Identity = 780/811 (96.18%), Postives = 795/811 (98.03%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKPIWISSEQFYG PYI WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM
Sbjct: 421 VKPIWISSEQFYGTPYI---WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLWYPTS+V
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
ISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSNELFFR VKAYQL+D QTMASLSQ
Sbjct: 601 ISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
EFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 861
SLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRFPLSNWRREWIKLTNDWQSSR
Sbjct: 721 SLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 780
Query: 862 KIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
KIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 KIYPVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of CSPI01G01750 vs. ExPASy TrEMBL
Match:
A0A5D3BH46 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G002030 PE=4 SV=1)
HSP 1 Score: 1544.6 bits (3998), Expect = 0.0e+00
Identity = 751/847 (88.67%), Postives = 768/847 (90.67%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLI VTIFAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQI DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
CMLHNFAGNVEMYGILDSIASGPIEARSS YSTMVGVGM
Sbjct: 421 ---------------------CMLHNFAGNVEMYGILDSIASGPIEARSSQYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLWYPTS+V
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
ISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSNELFFR VKAYQL+D QTMASLSQ
Sbjct: 601 ISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
EFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDY------------------------------------GNKYWSGLLGDYYCPRAA 861
SLLRDY GNKYWSGLLGDYY PRAA
Sbjct: 721 SLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAA 780
Query: 862 IYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 893
IY KFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDAL TSHWLYNKYLQ
Sbjct: 781 IYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQ 820
BLAST of CSPI01G01750 vs. ExPASy TrEMBL
Match:
A0A6J1C176 (alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC111007441 PE=4 SV=1)
HSP 1 Score: 1528.1 bits (3955), Expect = 0.0e+00
Identity = 726/812 (89.41%), Postives = 766/812 (94.33%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MAS F + FLIFV++FAAFSTSR STIGV YISRLLEIQDRER PA+VQVAAARGVLRRL
Sbjct: 1 MASPFPAIFLIFVSLFAAFSTSRFSTIGVGYISRLLEIQDRERAPAHVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQIVSKDKCG ESCFVIRNHR+FR+PGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGRESCFVIRNHRSFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQ+NE++VQRP+PLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQSNEIIVQRPVPLNYYQNAVTSSYSFAWWDWE 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RW+KEIDWMALQGINMPLAFTGQEAIW+KVF+KFNISN+DLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWKKEIDWMALQGINMPLAFTGQEAIWQKVFQKFNISNTDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGG LPQSWFDQQLILQKKV+ RMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGSLPQSWFDQQLILQKKVLARMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
F+VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVD E
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQLKEYGRTSHLYNCDTFDENTPPVDAAE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKPIWISSEQFYG PYI WCMLHNFAGNVEMYGILDSIASGPIEAR+SPYSTMVGVGM
Sbjct: 421 VKPIWISSEQFYGTPYI---WCMLHNFAGNVEMYGILDSIASGPIEARNSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYG LVPSIQDAWDVLYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLDQYSIRRYGQLVPSIQDAWDVLYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGS--NRHGNLDSSVDRLQDATFDRPHLWYPTS 681
TDGA DKNRDVIVAFPDVDPS+IL LPEGS +R+ N +SSV L ATFDRPHLWY TS
Sbjct: 541 TDGAYDKNRDVIVAFPDVDPSSILELPEGSDRDRYRNFNSSVGSLLHATFDRPHLWYSTS 600
Query: 682 EVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASL 741
EVI ALKLFIAG DQLS SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL+D Q MASL
Sbjct: 601 EVIRALKLFIAGSDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQKMASL 660
Query: 742 SQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEE 801
SQ+FLELV DIDTLLACHEGFLLGPWL+SAKQLA+ EE+EKQYEWNARTQITMWFDNTE+
Sbjct: 661 SQQFLELVKDIDTLLACHEGFLLGPWLESAKQLAQDEEQEKQYEWNARTQITMWFDNTED 720
Query: 802 EASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQS 861
EASLLRDYGNKYWSGLLGDYY PRAAIY KFLKES ENGY FPLSNWRREWIKLTNDWQ+
Sbjct: 721 EASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESLENGYGFPLSNWRREWIKLTNDWQN 780
Query: 862 SRKIYPVESNGDALDTSHWLYNKYLQIPESSD 892
SRK++PVE +GDA+DTS WLY KY+QI ES D
Sbjct: 781 SRKVFPVEISGDAIDTSRWLYRKYMQILESYD 809
BLAST of CSPI01G01750 vs. ExPASy TrEMBL
Match:
A0A6J1ECY3 (alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041 PE=4 SV=1)
HSP 1 Score: 1523.1 bits (3942), Expect = 0.0e+00
Identity = 722/811 (89.03%), Postives = 760/811 (93.71%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MA F++ LIF++IF FSTS SSTIG YISRLL+IQDRER P+ VQVAAARGVLRRL
Sbjct: 1 MAPPFAAVCLIFLSIFTTFSTSFSSTIGFVYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQI+SKD CGGESCF+IRNHRAFR+PGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPK G LP IQ++E++V+RPIPLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVRRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
F+VHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQ KEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKPIWI+SEQFYG+PYI WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM
Sbjct: 421 VKPIWIASEQFYGVPYI---WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGA DKNRDVIVAFPDVDPS+I V+PEGS+RH LQDA F+RPHLWYPTSEV
Sbjct: 541 TDGAYDKNRDVIVAFPDVDPSSISVIPEGSDRH-----DTGSLQDAIFERPHLWYPTSEV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
I ALKLFIA GDQLS SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL DVQT SLSQ
Sbjct: 601 IRALKLFIASGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
+FLELVNDIDTL+ACHEGFLLGPWLQSAKQLA+ E++EKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 QFLELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 861
SLLRDYGNKYWSGLL DYY PRAAIY KFLKES ENGY FPLSNWRREWIKLTNDWQSSR
Sbjct: 721 SLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSR 780
Query: 862 KIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
K+YPV+SNGDA+DTS WLYNKY Q+ ES DQ
Sbjct: 781 KVYPVKSNGDAVDTSRWLYNKYFQVLESYDQ 803
BLAST of CSPI01G01750 vs. ExPASy TrEMBL
Match:
A0A5A7UWC6 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G001950 PE=4 SV=1)
HSP 1 Score: 1517.7 bits (3928), Expect = 0.0e+00
Identity = 743/853 (87.10%), Postives = 762/853 (89.33%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLIFVTIFAAFSTSRSST GVEYISRLLE+QDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTTGVEYISRLLEVQDRERAPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTGEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKE-YGR-----TSHVYNCDTFDENTP 441
F VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQK +G T + DTFDENTP
Sbjct: 301 FAVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKGIFGPMLMKCTRTYFMPDTFDENTP 360
Query: 442 PVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVV 501
PVD+VEYISSLGSAIFGGMQ GDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVV
Sbjct: 361 PVDEVEYISSLGSAIFGGMQTGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVV 420
Query: 502 LDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYST 561
LDL CMLHNFAGNVEMYGILDSIASGPIEARSS YST
Sbjct: 421 LDL------------------------CMLHNFAGNVEMYGILDSIASGPIEARSSQYST 480
Query: 562 MVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLY 621
MVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LY
Sbjct: 481 MVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILY 540
Query: 622 HTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLW 681
HT+YNCTDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLW
Sbjct: 541 HTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLW 600
Query: 682 YPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQT 741
YPTS+VISALKLFI GGDQL SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL+D QT
Sbjct: 601 YPTSKVISALKLFIVGGDQLFGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQT 660
Query: 742 MASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFD 801
MASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFD
Sbjct: 661 MASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFD 720
Query: 802 NTEEEASLLRDY------------------------------------GNKYWSGLLGDY 861
NTEEEASLLRDY GNKYWSGLLGDY
Sbjct: 721 NTEEEASLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDY 780
Query: 862 YCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWL 893
Y PRAAIY KFLKESS+NGYRFPLSNWRREWIKLTN WQSSRKIYPVESNGDAL TSHWL
Sbjct: 781 YGPRAAIYFKFLKESSKNGYRFPLSNWRREWIKLTNAWQSSRKIYPVESNGDALHTSHWL 829
BLAST of CSPI01G01750 vs. NCBI nr
Match:
XP_011658935.1 (alpha-N-acetylglucosaminidase [Cucumis sativus])
HSP 1 Score: 1685.2 bits (4363), Expect = 0.0e+00
Identity = 807/811 (99.51%), Postives = 807/811 (99.51%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRK GDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKPIWISSEQFYGIPYI WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM
Sbjct: 421 VKPIWISSEQFYGIPYI---WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ
Sbjct: 601 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 861
SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR
Sbjct: 721 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 780
Query: 862 KIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
KIYPVESNGDALDTSHWLYNKYLQIPESSDQ
Sbjct: 781 KIYPVESNGDALDTSHWLYNKYLQIPESSDQ 808
BLAST of CSPI01G01750 vs. NCBI nr
Match:
XP_008453133.1 (PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo])
HSP 1 Score: 1637.5 bits (4239), Expect = 0.0e+00
Identity = 780/811 (96.18%), Postives = 795/811 (98.03%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKPIWISSEQFYG PYI WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM
Sbjct: 421 VKPIWISSEQFYGTPYI---WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLWYPTS+V
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
ISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSNELFFR VKAYQL+D QTMASLSQ
Sbjct: 601 ISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
EFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 861
SLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRFPLSNWRREWIKLTNDWQSSR
Sbjct: 721 SLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSR 780
Query: 862 KIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
KIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 KIYPVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of CSPI01G01750 vs. NCBI nr
Match:
KGN63620.2 (hypothetical protein Csa_013990 [Cucumis sativus])
HSP 1 Score: 1619.0 bits (4191), Expect = 0.0e+00
Identity = 786/829 (94.81%), Postives = 786/829 (94.81%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRK GDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMK------------------A 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMK A
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKGCHCALLSLLVEIGEIFQA 420
Query: 502 LLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIAS 561
LLHSVPLGRLVVLDL CMLHNFAGNVEMYGILDSIAS
Sbjct: 421 LLHSVPLGRLVVLDL------------------------CMLHNFAGNVEMYGILDSIAS 480
Query: 562 GPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHL 621
GPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHL
Sbjct: 481 GPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHL 540
Query: 622 VPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDR 681
VPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDR
Sbjct: 541 VPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDR 600
Query: 682 LQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFR 741
LQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFR
Sbjct: 601 LQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFR 660
Query: 742 IVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYE 801
IVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYE
Sbjct: 661 IVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYE 720
Query: 802 WNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPL 861
WNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPL
Sbjct: 721 WNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPL 780
Query: 862 SNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
SNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ
Sbjct: 781 SNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 805
BLAST of CSPI01G01750 vs. NCBI nr
Match:
XP_038880130.1 (alpha-N-acetylglucosaminidase-like [Benincasa hispida])
HSP 1 Score: 1586.2 bits (4106), Expect = 0.0e+00
Identity = 759/813 (93.36%), Postives = 778/813 (95.69%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MAS FSS FLIFV+IFAAFSTSRSSTIGV YISRLLEIQDRER PAYVQVAAARGVL RL
Sbjct: 1 MASPFSSIFLIFVSIFAAFSTSRSSTIGVGYISRLLEIQDRERAPAYVQVAAARGVLHRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLK+
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKN 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+E+V+QRP+PLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEIVIQRPVPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFHKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFK IYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKHIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
F+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQQKEYGRTSH+YNCDTFDENTPPVD+VE
Sbjct: 301 FSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQQKEYGRTSHIYNCDTFDENTPPVDEVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLG+AIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
VKP+WISSEQFYG PYI WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM
Sbjct: 421 VKPVWISSEQFYGTPYI---WCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVD--RLQDATFDRPHLWYPTS 681
TDGANDKNRDVIVAFPDVDPS+ILVLPEGS RHGNLDS VD RL DA FDRPHLWYPTS
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSSILVLPEGSERHGNLDSRVDSLRLGDAMFDRPHLWYPTS 600
Query: 682 EVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASL 741
EV ALKLFIAGGDQLS SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL+D QTMA+L
Sbjct: 601 EVTRALKLFIAGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMANL 660
Query: 742 SQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEE 801
SQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+ EEEEKQYEWNARTQITMWFDNTEE
Sbjct: 661 SQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQIEEEEKQYEWNARTQITMWFDNTEE 720
Query: 802 EASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQS 861
EASLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRF LSNWRREWIKLTNDWQS
Sbjct: 721 EASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFQLSNWRREWIKLTNDWQS 780
Query: 862 SRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 893
SRK+YPVESNGDALDTSH LY KYLQ ES DQ
Sbjct: 781 SRKVYPVESNGDALDTSHCLYYKYLQRLESFDQ 810
BLAST of CSPI01G01750 vs. NCBI nr
Match:
TYJ98583.1 (alpha-N-acetylglucosaminidase-like [Cucumis melo var. makuwa])
HSP 1 Score: 1544.6 bits (3998), Expect = 0.0e+00
Identity = 751/847 (88.67%), Postives = 768/847 (90.67%), Query Frame = 0
Query: 82 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 141
MASFFSSTFLI VTIFAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 142 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 201
LPSHL SFDFQI DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 202 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 261
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 262 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 321
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 322 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 381
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 382 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 441
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 442 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 501
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 502 VKPIWISSEQFYGIPYIWKQWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM 561
CMLHNFAGNVEMYGILDSIASGPIEARSS YSTMVGVGM
Sbjct: 421 ---------------------CMLHNFAGNVEMYGILDSIASGPIEARSSQYSTMVGVGM 480
Query: 562 SMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNC 621
SMEGIEQNPVVYDLMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYHT+YNC
Sbjct: 481 SMEGIEQNPVVYDLMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNC 540
Query: 622 TDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEV 681
TDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLWYPTS+V
Sbjct: 541 TDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKV 600
Query: 682 ISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQ 741
ISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSNELFFR VKAYQL+D QTMASLSQ
Sbjct: 601 ISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQ 660
Query: 742 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEA 801
EFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFDNTEEEA
Sbjct: 661 EFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEA 720
Query: 802 SLLRDY------------------------------------GNKYWSGLLGDYYCPRAA 861
SLLRDY GNKYWSGLLGDYY PRAA
Sbjct: 721 SLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAA 780
Query: 862 IYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQ 893
IY KFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDAL TSHWLYNKYLQ
Sbjct: 781 IYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQ 820
BLAST of CSPI01G01750 vs. TAIR 10
Match:
AT5G13690.1 (alpha-N-acetylglucosaminidase family / NAGLU family )
HSP 1 Score: 996.5 bits (2575), Expect = 1.4e-290
Identity = 467/782 (59.72%), Postives = 585/782 (74.81%), Query Frame = 0
Query: 113 ISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLPSFDFQIVSKDKCGGESCFVIRNHRA 172
I LL+ D + VQ +AA+G+L+RLLP+H SF+ +I+SKD CGG SCFVI N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 173 FRKPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTN 232
+ G PEILI G TGVEI +GLHWYLK+ C AH+SWDKTGG Q+ SVP+ G LPRI +
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 233 EVVVQRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVF 292
+ ++RP+P NYYQN VTSSYS+ WW W+RWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 293 RKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 352
++FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 353 PVLPAFSGNIPAAFKQIYPAAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIE 412
PVLP+FSGN+P+A ++IYP A ITRL NW TV D RWCCTYLL+ DPLF+EIG+AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 413 QQQKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSY 472
QQ +EYG +++YNCDTF+ENTPP + EYISSLG+A++ M G+ NAVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 473 D-PFWRPQQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGIPYIWKQWCMLHNFAGN 532
D FW+P Q+KALLHSVP G+++VLDLYAEVKPIW S QFYG PYI WCMLHNF GN
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYI---WCMLHNFGGN 447
Query: 533 VEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKW 592
+EMYG LDSI+SGP++AR S STMVGVGM MEGIEQNPVVY+L SEMAF+ KVDV+KW
Sbjct: 448 IEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKW 507
Query: 593 LPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGS 652
L Y+ RRY I+ AW++LYHTVYNCTDG D N D IV PD DPS+ V +
Sbjct: 508 LKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLK 567
Query: 653 NRHGNLDSSVD-------RLQDATFDRP--HLWYPTSEVISALKLFIAGGDQLSSSNTYR 712
+ + S+ QD T D P HLWY T EVI ALKLF+ GD LS S TYR
Sbjct: 568 QKDSYMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYR 627
Query: 713 YDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLL 772
YD+VDLTRQ L+K +N+++ V A+ D+ ++ LS++FLEL+ D+D LLA + LL
Sbjct: 628 YDMVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLL 687
Query: 773 GPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCP 832
G WL+SAK+LA++ +E KQYEWNARTQ+TMW+D+ + S L DY NK+WSGLL DYY P
Sbjct: 688 GTWLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLP 747
Query: 833 RAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDW-QSSRKIYPVESNGDALDTSHWLYN 884
RA +Y + +S + F + WRREWI +++ W QSS ++YPV++ GDAL S L +
Sbjct: 748 RARLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLS 804
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FNA3 | 1.9e-289 | 59.72 | Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1 | [more] |
P54802 | 1.3e-155 | 38.52 | Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2 | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3BVG2 | 0.0e+00 | 96.18 | alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 ... | [more] |
A0A5D3BH46 | 0.0e+00 | 88.67 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E56... | [more] |
A0A6J1C176 | 0.0e+00 | 89.41 | alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC11100744... | [more] |
A0A6J1ECY3 | 0.0e+00 | 89.03 | alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041... | [more] |
A0A5A7UWC6 | 0.0e+00 | 87.10 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C... | [more] |
Match Name | E-value | Identity | Description | |
XP_011658935.1 | 0.0e+00 | 99.51 | alpha-N-acetylglucosaminidase [Cucumis sativus] | [more] |
XP_008453133.1 | 0.0e+00 | 96.18 | PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo] | [more] |
KGN63620.2 | 0.0e+00 | 94.81 | hypothetical protein Csa_013990 [Cucumis sativus] | [more] |
XP_038880130.1 | 0.0e+00 | 93.36 | alpha-N-acetylglucosaminidase-like [Benincasa hispida] | [more] |
TYJ98583.1 | 0.0e+00 | 88.67 | alpha-N-acetylglucosaminidase-like [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT5G13690.1 | 1.4e-290 | 59.72 | alpha-N-acetylglucosaminidase family / NAGLU family | [more] |