Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCCTCTTTTTCTTCCACTTTCCTTATTTTCGTTTCACTTTTCGCCGCCTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGCACCTGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGTACTTGGCCGGTTTTCTTTACAAGTTTCTGTGTTAACTATTTGACTGATGAAAGAACTATAGACTACAAAGTAGTTTGTTAGAGAGATTTTAATTTGTTTTCTAAGGCACTTGGACATTTAGGAAAATGAAAGAATTATTGTGTACTTGAAGTTCAGTCTGCTTCTAGTGTCTGTTTTTCATGTTCATATTTCTCGTGGTCTTTCTATATGTTGATTTTTCTCAGGACAAATGTGGTGGAGAATTTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGAAAACCCGGGGATCCTGAGATTTTGTACGTAGAAACTATTTGCTAATCTTTTTCATTCGGTTCGCCTTGATCTAATTGGCCTTTAAAATTGGATTAAGCTACTATGTTGGTTCTTGTTATTTAATAAGTGAAGGATCGGGAGTTCACATCACGAAAAATATGAATTCACATTGCACAGTGTTCTTTGTATATCAAGACTAAAATCGATCTCTTACTTTTGGACATGAATACGATTTCCAATCTTTTTCTAGAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTATTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAGACGAAGTTGTGATCCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTGTAAGGTTTTCTTTTTGGATTTTTCATTACATCCATCTCGCTGCAATGCACAATTTACATAACAGAAATCGAATTACAATATTTTGTGTTCCTCCAATAACAGTTATTTGTTTACTGTTTCGATACATGTTTTCTATTTTCACTTTCTTTGCTTCTGGTAGACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGGTATGTTTTCCTCTATTTATAAGTGTTCTGTTATTCATACTGTCTTCAACATAATGATATTTTCAACAAGTGCGTGCTGTCATCTTACCCTACATATGCTATAATCTCTGACTTATCTTAATATTTTTTATGATTCATCATGGGACCATGGATGCAATAATCTACATCATTTGCACTATAGTAATACAAGTATAATATTACTCTATGCTGCTTCCTGCTATTGGAAATTGTTTCTGGCATTATCTCAACTTTTGCTTACATTAGAAGCTTCTTTTGTTCAGTACTTTCTTCATCATTGCTATGGTAGCTTTCATGTCCTTTTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAGTTCTCCCTTGAGTTATAGATATGCTCGTTTCCTACATTTTAATTTATGCAACAAAGTGGCTAAAGCGAAAAAGCTATCTCTAATAATTTCTTTAGGCTAGATCCTACTACAATTTACTTGGTTCAAAGTATAGATTGATTTCATTCCACACTATATTTGCAGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAGTGAGTACTTGCTTTGTAACTGCGTTGAAATTAGATCATTGTTGGATGGATATTCTATTCCGCTGGGACTAAATGAAAGATAGAATGATAAATATACAATCATTGTCACAATCAGCTACATAATCTTCCTGCTTTAATCAGATTTATGTAGAGAATGGCTTAAATGTGGCATGTTTTTTTACTCTTACATTTCAGTTCTAGTAATAAAAAGCTCACTGATAATTTCTTATCATTGCCTTTTTAATATCAACACTCTTATCACAGATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTAGGAATGAATCCAGGTTTTTTTTAAGCACTGTCAGTAAATTTATGTAGTACCCCATGTTGTAATAGAGTTCATAGTTCTTTGGCCTTGACCCAGTAGGAAACTTAAGATTCTGTTTCATTAAAATATTTTCACTTTCATGCTGGCTTTCTTTTTGAATTGCTGTTTTAAAGCTTCAGTGATGTTGCTATAGTTCGTACAGCCTATTTACTGGACAAAGTTCTAGAAAGCTTTTGTGCTGGGAACAATTTATGGACCATGTTTTTCTTTTTTCTTCTTTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATTTCCATCAGCAAAAATAACACGCTTGGGAAATTGGTAACTCAACACTCCCTTATGGTCTTATCTTATATAAATTTCATATTTGGTATGGTTAATAATGTTTTTATTGTGTGTTTTGGCATTTTCTTAGAATATATTTTGAAACATTCTGGAACCAGTGTTGGTCAAATCCATGATACTAAAAAAAGAACAGAAATAGTTCATGATACGAAGAATGTGATGCACTTTTGAAAATAATTACTTTAGTGAAGTGTTTGATGATAATTTTTTTGTTATTGGTCTTAATATTTTAGAAGTTTTTTTAAATTTGTCTAGAAGTGCTTACTATTTTAAAACGAAGTGTAATTTGAATGACTAATATTAAAATTTGAAGGATTTTTAGGATAAAAACATTACCTAAAGAAAAATATTTAAAATGGAAAAAAAAGTGATCGAAAAAATTAGTGGAAGGAAGCAGGCAAGTGAAGAAAATCTAGTGGAAAACTGTATACACAGGGAGAGTAGTAGAAGTGAATGTCTAAATAGGAGAAAGGCATGTGAATGGAGAGATTTTGTTAAAAGAAGTATGTCTGCAGAGAGAAGTTAGAAATTTTATTTTTAGATGGAGTCAACTAGAAAATTATATATAGAGGAAGATCTCGAGAGGGATAATGCAAGAAAGTGTTGGGATAGAAGTTAATGAAATTAGTTTGATTGGTGAGCGAGAAAGTATATCTAACATGACTAGTAAGAAAGACAATGGAGAGCAATTAGTAAGATAAATTATATAGAGAGAAGTTATTAAGTTGGAGCTAACATGAGTTCCATTCATTAGAAATTTACTACCAAAATATGAAATGATCTCTTCATGGAGTAGTCTGACCAGAAAACCTCTAACTTTTTATTTGAAGCAACAACCTTGGCGGTAATCCTTTAAGTCACGAGACTCTACAATGATGCTTGAGCAACAAGAAACTTTTGCAAAGTACTTGTTTAACCAATGACAATGTCAATTGTCTCTTTACTACCATTGTCCTGCAGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGGTATTTTTGGACCTATGCTAATGAAATGTATGACAACATATTTCACGCCTTTTTTCTTAAATGATGCTTGATGATTCTAATAGTCTGAAATTGCATTTTTTGGCCAGAATATGGAAGAACTTCCCATGTATACAATTGGTATGGTGCTTGTTTCCTCAACCGACTTTGTATTGTGTGCGGTTCATTTTTCTTTTAATTTTCACTGTAAGATGTGTATGGTCAGCTTTGTTACAATATAATGACAGAAAAGCTAGCGAGAAACAAGCATTTAAAATTGGTGCAATATTGGTTTCGACAGTCTTTAAATTGTATTTTCTTCAAACATGAGAAATATTTGCCATCAATTTACATCATGAATAATCTTTTCACCCTACAGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTGATGCTGTCTGGCTAATGCAGGTAATTGTTGGTTTGTTTTTCTATGTAAGGGACTGTTAGGCCAATTCACTTAATTCTCTGTGTAGTTCCAATTTCAATGGATGTCCCTGAAGGGACCCAATTCGCTAAGTTTTCTAAGTCATATCACAAAGCAAGTGGAACATGATATGGAACTCTAGTGAATGTGCACTGCATCTCTTCTCGTTCTGTTTTGGTGAAATGTCAACTTTTTTCTCGATCTTGTTAATGAAATTGTTCATGTACAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGTTATAATCATGCTTTATGTCACCACTTGCTCGTGCAATCATATCACGAATATATCTTTCTACTTTCAATCATCATTAGATCATTGCATGTATGAAGGATCTTATCAGAGAATCTTTGACTCAGATCTGTCGTGTTATTTGTAAATGCTAATTTATTTACTTGGGCTAGTTGTATGGTGTACCTTCTTTTATCTACTCAATCTTTAGGGTTGTCATTGTGCATTACTTTCACTTCTGGTTGAAATTGGTGAGCATTTTTCAGGCCCTTTTACATTCTGTCCCTCTGGGAAAGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCGATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAGTCTCTATTTCCATTCTTTTGCTTAATCTTAATGTTCATATCTAATAAATGATCAAGATTTAGGCTTACAGAAGGGCTTTTTCATAGCATATCAGCAATGTTGATGAGTGTGTTATAATTATCGGTCAAGAATCGACATTCAAAGTTCATTAGTTTAAAGTTTTCTTCATTTGTTATACTTTTTGATATCCACGAGTGTCTTAAGTTATTTTTATCTGGCTATTAACCTTTTTTTAGTTGCTATACGTAGGTTGATTATATTGACAATCTGAGAACTAACCATTTAAAAGCAGTGCTGTCAAGAATGACATATTATATATCTTTTTACAAGCCCATTTTTATAAGTTAATAATAATTTGAAGAGACCTTGATTAAGCATTACATTATCTAGTTTTTTTGGCTAAAAATTATATCCCTTTCAATGTAATTTCAGTCCAATGTTATCTTCGTGTTTTTACTTTATAAGAAAGATGTTTCTGGCAAGGCAATTTTGATACCCCATTTCATTTAGTTTGCAGGTTAAAACAGCAGTTATCTGACTTTAGCCGTTTTAAGAAACTTTTTTTTAACAAAACTGCTTATCTCGTTAATTCTGTTGTCTTAAGCTAAAACTAGATTGGTTTATTGTTTCATTAGCTTTATGCTATATTTCAATGAATGTGTAATGCTCTTAAATTTTAGCTTCAATAATATGAATTGCTATCCACAGGTGCATGCTCCATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCGTACTCGACAATGGTAAGTTTTTCATCTTCTATTATTTGGTGTCTAGATGTTGTCATAATGTTTTGGCAGCATATAGAATTCCCATGGGTATACTAATTTTTTATTTGAAAGGAAACAAGTCTCTTTATTAATAAAAATGAGACTACTTCTCAAAGTACAAGAACAGAAAATGTAGAATTTAGGGATCGGTAGGCACACCTTGGCATCTCAACTAGGTTGCACCCCTTTAGCGCCCTCATCATCTCCAAAATAAATTCCCAGCTCTGGGTATGTATGGGTATAATACTAATTTTGATATTTAACTCCATTAGTTTGGTAGAAGTAAATTTGGTGCTTTCTTTCTCTCGTCCATAAGTACAGCTCCGGGTATATATATGAGTATTAATGTAATTTTGAAGAGATGAACTGATTCTCTTTTTTCTTTTCATTATTGAAAAGACGGCTTGAAAGGATTTTTTGGCATTTATATTGCTCAATTCTTTTATTTTTTTTATTTATGTATTGAAGTGACAAATGGCTTAATCATAGTGGATCGTGGCAGCACAGTCACTCTTCCAATTACAATAAAAAATTTACCCATAACATATCATCGCTCATTAAAATGTAAAAGCTTTATATTTAACATATTTATGAAGATATTACTAATTCCAATCTCGCTCTAACCTTCTCTTTTTCTCATATGTCTGAGGTGTATCATTTATTTTGCAATGTTTAAATTCGTTTTTAGCTTTGTCAACGATGTTTTCATTTGAAGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGGTACAAATGTCTGACAATTATTTTTAAAATATAATGGCCCATTTACTTCGTTGCCAGTTGTATTCATGTTCCATGTGCGGAAAGCTGACAAGTGACATTACATAGATTTAATCGAATATACGGAAGGGCCAATCATATGAAAAGGATAGTGAGATCATTTCTTTTCTACCGTGCATAGGTTCACATCCTGGTTAATGCCAATGTGGTTTTATTTTCCAAGTTGATTCCACAACTAGTTCTGTTGATGTCTTGCATGCAATGCATTATGAGAAGGAACCGATAGATGCGAAAGATATTAGAAAATATTTCATAATAAATTCCATACAATTCAAGAACAATCACGACAAGTTCTATGTAAAATATCTAAGCATGGAAGAAAGCTGAGATCTATATGGAGATTAGAAATACCTGGAGCTATTAGATAGATCAACTAACTTGCGAAGTTACATGAATTAAGCAATCTTCCTCAACTAGACCAAATATAGGCAATCTTCTTTGCCGATAAGTATAGACAACTTCCTAAACTATGCTCTACTAGAACAGTAAGTATACGTTTTTTCTCCACCGACCATACTAATGTAGACGAAGTCCATAGGTCAATAATTAAGGCCATTTGCTCATTAAATAGCTAAGATATAATTTTCATTCGATTAGTTACCCAAATATGAAATAATTGTTAACTTCATTGCAATTCTTTCCAACAATAGAAATTTTCTTTTCTAGTTCAACAATTTCAGGTTGTTCATAACCTTTGGCCAACATACTGCACACTTTTCCCATGATTTCTGAATTTTAGCCCTAGCACTTTCTATTTTAAAGTTTATTTTTTCCAATAGAATTTATTTTCATGATAGATTTTGCGCAATTGTGTTTATAGAAAATATTGTTCCCATCAGTCAGAATTTGTAAAATACTATCCTTTCATTTCCAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGTAAGTATGAAAAAACCTGTAATGTCCATTCAACCTTTTCTTCTCTTTTGGTACTTCTACGACTGGTGTTTCCAGTCTGTTGCTTTCGAATGGTTATCACCGTTCTCCCTCTGTCTATTTTCCTTCCTTGCTCATGCAAAGTCTGAATTATGTTCAAACATGACATGCTAACATATGTCGGTAACTATGGAAAGGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGTGTCTGACCGACATGGGAACTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACATTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCACTCAAGCTTTTCATTGCTGGTGGCAATCAACTCTCTGGTAGCAACACTTACAGGTTGACTCACTAATTTAAAATATCTCATCATTTACTTGAAACCTATAGAGATATCAAATATCGTCCTTTTCTCTGTTCAAGTCTAGATTATTGAATTGGCATTCGTTGCCACTCTTAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTATATGATGCACAAACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCACAGTAAAGAGGAGGAAAAACAGGTTCAATTCAAGTCCAGTCTATCAATTATGAGAAAATTATAATAATAACATCCGAATTATGACTCATCTCAAATTACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGACGAAGCAAGTTTGCTTCGTGATTATGGTAATGATACTTTGGACCTGGACTCAACTCTATATTGATTGATTGTCATTTATCATCGAGATTAGGCAATTGTACATTCAAATTTGACTTATTTAAACTCGACTCAGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTACTATGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCATTGGCTCTACAACAAATACTTGCAAATACCTGAAAGCTCTGATCAATGAATCAGGCAAGTTATCTGCTAGTAACTAAGATTTGAATGTTGCATATTTTAATAGTTCTCTCATTTTCTATTCCGTGTTTAAACTAGTATTATTTCGCAGGTTAACGACCTCCTTTTCCTTATACACAAATGGTCTACAATACTATATCTCCCGGTAATGCAAAACCATTTCTTTTTGTCTTTTAAATAGTATCGTCGAG
mRNA sequence
ATGGCGTCCTCTTTTTCTTCCACTTTCCTTATTTTCGTTTCACTTTTCGCCGCCTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGCACCTGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGACAAATGTGGTGGAGAATTTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGAAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTATTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAGACGAAGTTGTGATCCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTAGGAATGAATCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATTTCCATCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTGATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTCCCTCTGGGAAAGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCGATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCGTACTCGACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGTGTCTGACCGACATGGGAACTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACATTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCACTCAAGCTTTTCATTGCTGGTGGCAATCAACTCTCTGGTAGCAACACTTACAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTATATGATGCACAAACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCACAGTAAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGACGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTACTATGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCATTGGCTCTACAACAAATACTTGCAAATACCTGAAAGCTCTGATCAATGAATCAGGTTAACGACCTCCTTTTCCTTATACACAAATGGTCTACAATACTATATCTCCCGGTAATGCAAAACCATTTCTTTTTGTCTTTTAAATAGTATCGTCGAG
Coding sequence (CDS)
ATGGCGTCCTCTTTTTCTTCCACTTTCCTTATTTTCGTTTCACTTTTCGCCGCCTTCTCCACTTCTCGGTCGTCGACGATCGGAGTCGAATACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGCACCTGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGACAAATGTGGTGGAGAATTTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGAAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGATTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCACAACTATTTTCTGTACCTAAGGCAGGCCTGTTACCTCGTATTCAAACAGACGAAGTTGTGATCCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGAAATTTAATATAAGCAACTCAGATTTGGATGATTTCTTTGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTAGGAATGAATCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATTTCCATCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTGATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTCCCTCTGGGAAAGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCGATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCGTACTCGACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGTGTCTGACCGACATGGGAACTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACATTTGATCGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCACTCAAGCTTTTCATTGCTGGTGGCAATCAACTCTCTGGTAGCAACACTTACAGGTATGACCTTGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTATATGATGCACAAACAATGGCCAGCTTAAGCCAAGAGTTCCTTGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAGGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCACAGTAAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGACGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGCGATTACTATGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTGACACATCCCATTGGCTCTACAACAAATACTTGCAAATACCTGAAAGCTCTGATCAATGA
Protein sequence
MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRLLPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAEVKPIWISSEQFYGTPYIWKMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ
Homology
BLAST of PI0023410 vs. ExPASy Swiss-Prot
Match:
Q9FNA3 (Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1)
HSP 1 Score: 966.8 bits (2498), Expect = 1.5e-280
Identity = 468/811 (57.71%), Postives = 585/811 (72.13%), Query Frame = 0
Query: 6 SSTFLIFVSLFAAFSTSRSSTIGVEY--ISRLLEIQDRERAPAYVQVAAARGVLRRLLPS 65
S ++ V L +F S T+ + I LL+ D + VQ +AA+G+L+RLLP+
Sbjct: 3 SIKLVLLVLLIISF---HSQTVSKHHPTIDGLLDRLDSLLPTSSVQESAAKGLLQRLLPT 62
Query: 66 HLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKHWCG 125
H SF+ +I+SKD CGG CFVI N+ + G PEILI G TGVEI +GLHWYLK+ C
Sbjct: 63 HSQSFELRIISKDACGGTSCFVIENYDGPGRIG-PEILIKGTTGVEIASGLHWYLKYKCN 122
Query: 126 AHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWE 185
AH+SWDKTGG Q+ SVP+ G LPRI + + IRRP+P NYYQN VTSSYS+ WW W+RWE
Sbjct: 123 AHVSWDKTGGIQVASVPQPGHLPRIDSKRIFIRRPVPWNYYQNVVTSSYSYVWWGWERWE 182
Query: 186 KEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLHKWG 245
+EIDWMALQGIN+PLAFTGQEAIW+KVF++FNIS DLDD+FGGPAFLAW+RMGNLH WG
Sbjct: 183 REIDWMALQGINLPLAFTGQEAIWQKVFKRFNISKEDLDDYFGGPAFLAWARMGNLHAWG 242
Query: 246 GPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNWFTV 305
GPL ++W D QL+LQK+++ RM + GM PVLP+FSGN+P+A ++I+P A ITRL NW TV
Sbjct: 243 GPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANITRLDNWNTV 302
Query: 306 HSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVEYIS 365
D RWCCTYLL+ DPLF+EIG+AFI+QQ +EYG +++YNCDTF+ENTPP E EYIS
Sbjct: 303 DGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPPTSEPEYIS 362
Query: 366 SLGSAIFGGMQAGDSDAVWLMQGWMFSYD-PFWRPQQMKALLHSVPLGKLVVLDLYAEVK 425
SLG+A++ M G+ +AVWLMQGW+FS D FW+P Q+KALLHSVP GK++VLDLYAEVK
Sbjct: 363 SLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIVLDLYAEVK 422
Query: 426 PIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSMEGI 485
PIW S QFYGTPYIW +MYG LDSI+SGP++AR S STMVGVGM MEGI
Sbjct: 423 PIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGI 482
Query: 486 EQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGAN 545
EQNPVVY+L SEMAF+ KVDV+KWL Y+ RRY I+ AW++LYHT+YNCTDG
Sbjct: 483 EQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIA 542
Query: 546 DKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDG-----------LQDATFDRP--H 605
D N D IV PD DPSS V D DS M QD T D P H
Sbjct: 543 DHNTDFIVKLPDWDPSS-----SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADLPKAH 602
Query: 606 LWYPTSEVIRALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDA 665
LWY T EVI+ALKLF+ G+ LS S TYRYD+VDLTRQ L+K +N+++ V A+ D
Sbjct: 603 LWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFVKKDI 662
Query: 666 QTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMW 725
++ LS++FLEL+ D+D LLA + LLG WL+SAK+LA + +E KQYEWNARTQ+TMW
Sbjct: 663 GSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQVTMW 722
Query: 726 FDNTEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKL 785
+D+ + S L DY NK+WSGLL DYY PRA +YF + +S + F + WRREWI +
Sbjct: 723 YDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRREWIMM 782
Query: 786 TNDW-QSSRKIYPVESNGDALDTSHWLYNKY 790
++ W QSS ++YPV++ GDAL S L +KY
Sbjct: 783 SHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
BLAST of PI0023410 vs. ExPASy Swiss-Prot
Match:
P54802 (Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2)
HSP 1 Score: 525.4 bits (1352), Expect = 1.1e-147
Identity = 277/711 (38.96%), Postives = 416/711 (58.51%), Query Frame = 0
Query: 96 GDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVI 155
G + + G TGV AGLH YL+ +CG H++W GSQL +P+ LP + E+
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAW---SGSQL-RLPRP--LPAV-PGELTE 130
Query: 156 RRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFN 215
P YYQN T SYSF WWDW RWE+EIDWMAL GIN+ LA++GQEAIW++V+
Sbjct: 131 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVYLALG 190
Query: 216 ISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLP 275
++ +++++FF GPAFLAW RMGNLH W GPLP SW +QL LQ +V+ +M GM PVLP
Sbjct: 191 LTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLP 250
Query: 276 AFSGNIPAAFKQIFPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQK 335
AF+G++P A ++FP +T++G+W H + + C++LL DP+F IG F+ + K
Sbjct: 251 AFAGHVPEAVTRVFPQVNVTKMGSW--GHFNCSYSCSFLLAPEDPIFPIIGSLFLRELIK 310
Query: 336 EYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDP-F 395
E+G T H+Y DTF+E PP E Y+++ +A++ M A D++AVWL+QGW+F + P F
Sbjct: 311 EFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQF 370
Query: 396 WRPQQMKALLHSVPLGKLVVLDLYAEVKPIWISSEQFYGTPYIWKM----------YGIL 455
W P Q++A+L +VP G+L+VLDL+AE +P++ + F G P+IW M +G L
Sbjct: 371 WGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGAL 430
Query: 456 DSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV-DVKKWLPQYSV 515
+++ GP AR P STMVG GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++
Sbjct: 431 EAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAA 490
Query: 516 RRYGHLVPSIQDAWDVLYHTIYNCT-DGANDKNRDVIVAFPDVDPSSILVLPEVSDRHGN 575
RRYG P AW +L ++YNC+ + NR +V P + ++
Sbjct: 491 RRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPSLQMNT------------- 550
Query: 576 LDSSMDGLQDATFDRPHLWYPTSEVIRALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKY 635
+WY S+V A +L + L+ S +RYDL+DLTRQA+ +
Sbjct: 551 ----------------SIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQEL 610
Query: 636 SNELFFRIVKAYQLYDAQTMASLSQE----FLELVNDIDTLLACHEGFLLGPWLQSAKQL 695
+ L++ +A Y ++ +ASL + EL+ +D +LA FLLG WL+ A+
Sbjct: 611 VS-LYYE--EARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAA 670
Query: 696 AHSKEEEKQYEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLK 755
A S+ E YE N+R Q+T+W E ++L DY NK +GL+ +YY PR ++ + L
Sbjct: 671 AVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALV 730
Query: 756 ESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKY 790
+S G F + + +L + S++ YP + GD +D + ++ KY
Sbjct: 731 DSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
BLAST of PI0023410 vs. ExPASy TrEMBL
Match:
A0A1S3BVG2 (alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 SV=1)
HSP 1 Score: 1615.1 bits (4181), Expect = 0.0e+00
Identity = 773/808 (95.67%), Postives = 785/808 (97.15%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLIFV++FAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQIVSKDKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSVPLG+LVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKPIWISSEQFYGTPYIW +MYGILDSIASGPIEARSSPYSTMVGVGMSME
Sbjct: 421 VKPIWISSEQFYGTPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRA 600
ANDKNRDVIVAFPDVDPSSILVLPE SD+HG LDSSMDGLQDATFDRPHLWYPTS+VI A
Sbjct: 541 ANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISA 600
Query: 601 LKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFL 660
LKLFI GG+QLSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQTMASLSQEFL
Sbjct: 601 LKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFL 660
Query: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLL 720
ELVNDIDTLLACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLL
Sbjct: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLL 720
Query: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY
Sbjct: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
Query: 781 PVESNGDALDTSHWLYNKYLQIPESSDQ 799
PVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 PVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of PI0023410 vs. ExPASy TrEMBL
Match:
A0A5D3BH46 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G002030 PE=4 SV=1)
HSP 1 Score: 1528.8 bits (3957), Expect = 0.0e+00
Identity = 747/834 (89.57%), Postives = 761/834 (91.25%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLI V++FAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI DKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSVPLG+LVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 421 VKPIWISSEQFYGTPYIWKMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYD 480
F G +MYGILDSIASGPIEARSS YSTMVGVGMSMEGIEQNPVVYD
Sbjct: 421 -----CMLHNFAGNV---EMYGILDSIASGPIEARSSQYSTMVGVGMSMEGIEQNPVVYD 480
Query: 481 LMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIV 540
LMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYHTIYNCTDGANDKNRDVIV
Sbjct: 481 LMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNCTDGANDKNRDVIV 540
Query: 541 AFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRALKLFIAGGNQ 600
AFPDVDPSSILVLPE SD+HG LDSSMDGLQDATFDRPHLWYPTS+VI ALKLFI GG+Q
Sbjct: 541 AFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQ 600
Query: 601 LSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVNDIDTLL 660
LSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQTMASLSQEFLELVNDIDTLL
Sbjct: 601 LSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLL 660
Query: 661 ACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLLRDY------- 720
ACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLLRDY
Sbjct: 661 ACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNDNSGP 720
Query: 721 -----------------------------GNKYWSGLLGDYYGPRAAIYFKFLKESSENG 780
GNKYWSGLLGDYYGPRAAIYFKFLKESSENG
Sbjct: 721 GLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAAIYFKFLKESSENG 780
Query: 781 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 799
YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 820
BLAST of PI0023410 vs. ExPASy TrEMBL
Match:
A0A6J1C176 (alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC111007441 PE=4 SV=1)
HSP 1 Score: 1514.2 bits (3919), Expect = 0.0e+00
Identity = 722/809 (89.25%), Postives = 760/809 (93.94%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS F + FLIFVSLFAAFSTSR STIGV YISRLLEIQDRERAPA+VQVAAARGVLRRL
Sbjct: 1 MASPFPAIFLIFVSLFAAFSTSRFSTIGVGYISRLLEIQDRERAPAHVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQIVSKDKCG E CFVIRNHR+FR+PGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGRESCFVIRNHRSFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQ++E++++RP+PLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQSNEIIVQRPVPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RW+KEIDWMALQGINMPLAFTGQEAIW+KVFQKFNISN+DLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWKKEIDWMALQGINMPLAFTGQEAIWQKVFQKFNISNTDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGG LPQSWFDQQLILQKKV+ RMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGSLPQSWFDQQLILQKKVLARMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
F+VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVD E
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQLKEYGRTSHLYNCDTFDENTPPVDAAE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLG+AIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLG+LVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKPIWISSEQFYGTPYIW +MYGILDSIASGPIEAR+SPYSTMVGVGMSME
Sbjct: 421 VKPIWISSEQFYGTPYIWCMLHNFAGNVEMYGILDSIASGPIEARNSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYG LVPSIQDAWDVLYHTIYNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLDQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVS--DRHGNLDSSMDGLQDATFDRPHLWYPTSEVI 600
A DKNRDVIVAFPDVDPSSIL LPE S DR+ N +SS+ L ATFDRPHLWY TSEVI
Sbjct: 541 AYDKNRDVIVAFPDVDPSSILELPEGSDRDRYRNFNSSVGSLLHATFDRPHLWYSTSEVI 600
Query: 601 RALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQE 660
RALKLFIAG +QLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQ MASLSQ+
Sbjct: 601 RALKLFIAGSDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQKMASLSQQ 660
Query: 661 FLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEAS 720
FLELV DIDTLLACHEGFLLGPWL+SAKQLA +E+EKQYEWNARTQITMWFDNTEDEAS
Sbjct: 661 FLELVKDIDTLLACHEGFLLGPWLESAKQLAQDEEQEKQYEWNARTQITMWFDNTEDEAS 720
Query: 721 LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRK 780
LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKES ENGY FPLSNWRREWIKLTNDWQ+SRK
Sbjct: 721 LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESLENGYGFPLSNWRREWIKLTNDWQNSRK 780
Query: 781 IYPVESNGDALDTSHWLYNKYLQIPESSD 798
++PVE +GDA+DTS WLY KY+QI ES D
Sbjct: 781 VFPVEISGDAIDTSRWLYRKYMQILESYD 809
BLAST of PI0023410 vs. ExPASy TrEMBL
Match:
A0A5A7UWC6 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G001950 PE=4 SV=1)
HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 738/840 (87.86%), Postives = 755/840 (89.88%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLIFV++FAAFSTSRSST GVEYISRLLE+QDRERAPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTTGVEYISRLLEVQDRERAPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQIVSKDKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT EVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTGEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKE-YGR-----TSHVYNCDTFDENTP 360
F VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQK +G T + DTFDENTP
Sbjct: 301 FAVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKGIFGPMLMKCTRTYFMPDTFDENTP 360
Query: 361 PVDEVEYISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVV 420
PVDEVEYISSLGSAIFGGMQ GDS+AVWLMQGWMFSYDPFWRPQQMKALLHSVPLG+LVV
Sbjct: 361 PVDEVEYISSLGSAIFGGMQTGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVV 420
Query: 421 LDLYAEVKPIWISSEQFYGTPYIWKMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQ 480
LDL F G +MYGILDSIASGPIEARSS YSTMVGVGMSMEGIEQ
Sbjct: 421 LDL--------CMLHNFAGNV---EMYGILDSIASGPIEARSSQYSTMVGVGMSMEGIEQ 480
Query: 481 NPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDK 540
NPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYHTIYNCTDGANDK
Sbjct: 481 NPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNCTDGANDK 540
Query: 541 NRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRALKLF 600
NRDVIVAFPDVDPSSILVLPE SD+HG LDSSMDGLQDATFDRPHLWYPTS+VI ALKLF
Sbjct: 541 NRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLF 600
Query: 601 IAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVN 660
I GG+QL GSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVN
Sbjct: 601 IVGGDQLFGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVN 660
Query: 661 DIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLLRDY- 720
DIDTLLACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLLRDY
Sbjct: 661 DIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYG 720
Query: 721 -----------------------------------GNKYWSGLLGDYYGPRAAIYFKFLK 780
GNKYWSGLLGDYYGPRAAIYFKFLK
Sbjct: 721 NDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAAIYFKFLK 780
Query: 781 ESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 799
ESS+NGYRFPLSNWRREWIKLTN WQSSRKIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 ESSKNGYRFPLSNWRREWIKLTNAWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 829
BLAST of PI0023410 vs. ExPASy TrEMBL
Match:
A0A6J1ECY3 (alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041 PE=4 SV=1)
HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 713/808 (88.24%), Postives = 749/808 (92.70%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MA F++ LIF+S+F FSTS SSTIG YISRLL+IQDRERAP+ VQVAAARGVLRRL
Sbjct: 1 MAPPFAAVCLIFLSIFTTFSTSFSSTIGFVYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI+SKD CGGE CF+IRNHRAFR+PGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPK G LP IQ+DE+++RRPIPLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVRRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKV GRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
F+VHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQ KEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLG+AIFGGMQAGDS AVWLMQGWMFSYDPFWRPQQMKALLHSV LG+LVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKPIWI+SEQFYG PYIW +MYGILDSIASGPIEARSSPYSTMVGVGMSME
Sbjct: 421 VKPIWIASEQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYHTIYNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRA 600
A DKNRDVIVAFPDVDPSSI V+PE SDRH LQDA F+RPHLWYPTSEVIRA
Sbjct: 541 AYDKNRDVIVAFPDVDPSSISVIPEGSDRH-----DTGSLQDAIFERPHLWYPTSEVIRA 600
Query: 601 LKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFL 660
LKLFIA G+QLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL D QT SLSQ+FL
Sbjct: 601 LKLFIASGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFL 660
Query: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLL 720
ELVNDIDTL+ACHEGFLLGPWLQSAKQLA +++EKQYEWNARTQITMWFDNTE+EASLL
Sbjct: 661 ELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLL 720
Query: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
RDYGNKYWSGLL DYYGPRAAIYFKFLKES ENGY FPLSNWRREWIKLTNDWQSSRK+Y
Sbjct: 721 RDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVY 780
Query: 781 PVESNGDALDTSHWLYNKYLQIPESSDQ 799
PV+SNGDA+DTS WLYNKY Q+ ES DQ
Sbjct: 781 PVKSNGDAVDTSRWLYNKYFQVLESYDQ 803
BLAST of PI0023410 vs. NCBI nr
Match:
XP_008453133.1 (PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo])
HSP 1 Score: 1615.1 bits (4181), Expect = 0.0e+00
Identity = 773/808 (95.67%), Postives = 785/808 (97.15%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLIFV++FAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQIVSKDKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSVPLG+LVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKPIWISSEQFYGTPYIW +MYGILDSIASGPIEARSSPYSTMVGVGMSME
Sbjct: 421 VKPIWISSEQFYGTPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRA 600
ANDKNRDVIVAFPDVDPSSILVLPE SD+HG LDSSMDGLQDATFDRPHLWYPTS+VI A
Sbjct: 541 ANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISA 600
Query: 601 LKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFL 660
LKLFI GG+QLSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQTMASLSQEFL
Sbjct: 601 LKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFL 660
Query: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLL 720
ELVNDIDTLLACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLL
Sbjct: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLL 720
Query: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY
Sbjct: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
Query: 781 PVESNGDALDTSHWLYNKYLQIPESSDQ 799
PVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 PVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of PI0023410 vs. NCBI nr
Match:
XP_011658935.1 (alpha-N-acetylglucosaminidase [Cucumis sativus])
HSP 1 Score: 1597.8 bits (4136), Expect = 0.0e+00
Identity = 763/808 (94.43%), Postives = 783/808 (96.91%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLIFV++FAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHL SFDFQIVSKDKCGGE CFVIRNHRAFRK GDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+P+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSVPLG+LVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKPIWISSEQFYG PYIW +MYGILDSIASGPIEARSSPYSTMVGVGMSME
Sbjct: 421 VKPIWISSEQFYGIPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHT+YNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTVYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRA 600
ANDKNRDVIVAFPDVDPS+ILVLPE S+RHGNLDSS+D LQDATFDRPHLWYPTSEVI A
Sbjct: 541 ANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWYPTSEVISA 600
Query: 601 LKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFL 660
LKLFIAGG+QLS SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL+D QTMASLSQEFL
Sbjct: 601 LKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTMASLSQEFL 660
Query: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLL 720
ELVNDIDTLLACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLL
Sbjct: 661 ELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDNTEEEASLL 720
Query: 721 RDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
RDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY
Sbjct: 721 RDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIY 780
Query: 781 PVESNGDALDTSHWLYNKYLQIPESSDQ 799
PVESNGDALDTSHWLYNKYLQIPESSDQ
Sbjct: 781 PVESNGDALDTSHWLYNKYLQIPESSDQ 808
BLAST of PI0023410 vs. NCBI nr
Match:
XP_038880130.1 (alpha-N-acetylglucosaminidase-like [Benincasa hispida])
HSP 1 Score: 1565.8 bits (4053), Expect = 0.0e+00
Identity = 751/810 (92.72%), Postives = 773/810 (95.43%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSS FLIFVS+FAAFSTSRSSTIGV YISRLLEIQDRERAPAYVQVAAARGVL RL
Sbjct: 1 MASPFSSIFLIFVSIFAAFSTSRSSTIGVGYISRLLEIQDRERAPAYVQVAAARGVLHRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQIVSKDKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLK+
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKN 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDE+VI+RP+PLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEIVIQRPVPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFHKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKV GRMFELGM PVLPAFSGNIPAAFK I+PSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKHIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
F+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQQKEYGRTSH+YNCDTFDENTPPVDEVE
Sbjct: 301 FSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQQKEYGRTSHIYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSVPLG+LVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
VKP+WISSEQFYGTPYIW +MYGILDSIASGPIEARSSPYSTMVGVGMSME
Sbjct: 421 VKPVWISSEQFYGTPYIWCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSME 480
Query: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
GIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYHTIYNCTDG
Sbjct: 481 GIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDG 540
Query: 541 ANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQ--DATFDRPHLWYPTSEVI 600
ANDKNRDVIVAFPDVDPSSILVLPE S+RHGNLDS +D L+ DA FDRPHLWYPTSEV
Sbjct: 541 ANDKNRDVIVAFPDVDPSSILVLPEGSERHGNLDSRVDSLRLGDAMFDRPHLWYPTSEVT 600
Query: 601 RALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQE 660
RALKLFIAGG+QLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMA+LSQE
Sbjct: 601 RALKLFIAGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMANLSQE 660
Query: 661 FLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEAS 720
FLELVNDIDTLLACHEGFLLGPWLQSAKQLA +EEEKQYEWNARTQITMWFDNTE+EAS
Sbjct: 661 FLELVNDIDTLLACHEGFLLGPWLQSAKQLAQIEEEEKQYEWNARTQITMWFDNTEEEAS 720
Query: 721 LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRK 780
LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRF LSNWRREWIKLTNDWQSSRK
Sbjct: 721 LLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFQLSNWRREWIKLTNDWQSSRK 780
Query: 781 IYPVESNGDALDTSHWLYNKYLQIPESSDQ 799
+YPVESNGDALDTSH LY KYLQ ES DQ
Sbjct: 781 VYPVESNGDALDTSHCLYYKYLQRLESFDQ 810
BLAST of PI0023410 vs. NCBI nr
Match:
KGN63620.2 (hypothetical protein Csa_013990 [Cucumis sativus])
HSP 1 Score: 1539.2 bits (3984), Expect = 0.0e+00
Identity = 745/816 (91.30%), Postives = 765/816 (93.75%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLIFV++FAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHL SFDFQIVSKDKCGGE CFVIRNHRAFRK GDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+P+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMK------------------A 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMK A
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKGCHCALLSLLVEIGEIFQA 420
Query: 421 LLHSVPLGKLVVLDLYAEVKPIWISSEQFYGTPYIWKMYGILDSIASGPIEARSSPYSTM 480
LLHSVPLG+LVVLDL F G +MYGILDSIASGPIEARSSPYSTM
Sbjct: 421 LLHSVPLGRLVVLDL--------CMLHNFAGNV---EMYGILDSIASGPIEARSSPYSTM 480
Query: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 540
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 540
Query: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWY 600
T+YNCTDGANDKNRDVIVAFPDVDPS+ILVLPE S+RHGNLDSS+D LQDATFDRPHLWY
Sbjct: 541 TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWY 600
Query: 601 PTSEVIRALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTM 660
PTSEVI ALKLFIAGG+QLS SNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL+D QTM
Sbjct: 601 PTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTM 660
Query: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDN 720
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDN 720
Query: 721 TEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 780
TE+EASLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRFPLSNWRREWIKLTND
Sbjct: 721 TEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTND 780
Query: 781 WQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 799
WQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ
Sbjct: 781 WQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 805
BLAST of PI0023410 vs. NCBI nr
Match:
TYJ98583.1 (alpha-N-acetylglucosaminidase-like [Cucumis melo var. makuwa])
HSP 1 Score: 1528.8 bits (3957), Expect = 0.0e+00
Identity = 747/834 (89.57%), Postives = 761/834 (91.25%), Query Frame = 0
Query: 1 MASSFSSTFLIFVSLFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
MAS FSSTFLI V++FAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI DKCGGE CFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGM PVLPAFSGNIPAAFKQI+PSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGSAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGKLVVLDLYAE 420
YISSLGSAIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSVPLG+LVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 421 VKPIWISSEQFYGTPYIWKMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYD 480
F G +MYGILDSIASGPIEARSS YSTMVGVGMSMEGIEQNPVVYD
Sbjct: 421 -----CMLHNFAGNV---EMYGILDSIASGPIEARSSQYSTMVGVGMSMEGIEQNPVVYD 480
Query: 481 LMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIV 540
LMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYHTIYNCTDGANDKNRDVIV
Sbjct: 481 LMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNCTDGANDKNRDVIV 540
Query: 541 AFPDVDPSSILVLPEVSDRHGNLDSSMDGLQDATFDRPHLWYPTSEVIRALKLFIAGGNQ 600
AFPDVDPSSILVLPE SD+HG LDSSMDGLQDATFDRPHLWYPTS+VI ALKLFI GG+Q
Sbjct: 541 AFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQ 600
Query: 601 LSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMASLSQEFLELVNDIDTLL 660
LSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQTMASLSQEFLELVNDIDTLL
Sbjct: 601 LSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLL 660
Query: 661 ACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMWFDNTEDEASLLRDY------- 720
ACHEGFLLGPWLQSAKQLA S+EEEKQYEWNARTQITMWFDNTE+EASLLRDY
Sbjct: 661 ACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNDNSGP 720
Query: 721 -----------------------------GNKYWSGLLGDYYGPRAAIYFKFLKESSENG 780
GNKYWSGLLGDYYGPRAAIYFKFLKESSENG
Sbjct: 721 GLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAAIYFKFLKESSENG 780
Query: 781 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 799
YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 820
BLAST of PI0023410 vs. TAIR 10
Match:
AT5G13690.1 (alpha-N-acetylglucosaminidase family / NAGLU family )
HSP 1 Score: 966.8 bits (2498), Expect = 1.0e-281
Identity = 468/811 (57.71%), Postives = 585/811 (72.13%), Query Frame = 0
Query: 6 SSTFLIFVSLFAAFSTSRSSTIGVEY--ISRLLEIQDRERAPAYVQVAAARGVLRRLLPS 65
S ++ V L +F S T+ + I LL+ D + VQ +AA+G+L+RLLP+
Sbjct: 3 SIKLVLLVLLIISF---HSQTVSKHHPTIDGLLDRLDSLLPTSSVQESAAKGLLQRLLPT 62
Query: 66 HLSSFDFQIVSKDKCGGEFCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKHWCG 125
H SF+ +I+SKD CGG CFVI N+ + G PEILI G TGVEI +GLHWYLK+ C
Sbjct: 63 HSQSFELRIISKDACGGTSCFVIENYDGPGRIG-PEILIKGTTGVEIASGLHWYLKYKCN 122
Query: 126 AHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWE 185
AH+SWDKTGG Q+ SVP+ G LPRI + + IRRP+P NYYQN VTSSYS+ WW W+RWE
Sbjct: 123 AHVSWDKTGGIQVASVPQPGHLPRIDSKRIFIRRPVPWNYYQNVVTSSYSYVWWGWERWE 182
Query: 186 KEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLHKWG 245
+EIDWMALQGIN+PLAFTGQEAIW+KVF++FNIS DLDD+FGGPAFLAW+RMGNLH WG
Sbjct: 183 REIDWMALQGINLPLAFTGQEAIWQKVFKRFNISKEDLDDYFGGPAFLAWARMGNLHAWG 242
Query: 246 GPLPQSWFDQQLILQKKVIGRMFELGMNPVLPAFSGNIPAAFKQIFPSAKITRLGNWFTV 305
GPL ++W D QL+LQK+++ RM + GM PVLP+FSGN+P+A ++I+P A ITRL NW TV
Sbjct: 243 GPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANITRLDNWNTV 302
Query: 306 HSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVEYIS 365
D RWCCTYLL+ DPLF+EIG+AFI+QQ +EYG +++YNCDTF+ENTPP E EYIS
Sbjct: 303 DGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPPTSEPEYIS 362
Query: 366 SLGSAIFGGMQAGDSDAVWLMQGWMFSYD-PFWRPQQMKALLHSVPLGKLVVLDLYAEVK 425
SLG+A++ M G+ +AVWLMQGW+FS D FW+P Q+KALLHSVP GK++VLDLYAEVK
Sbjct: 363 SLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIVLDLYAEVK 422
Query: 426 PIWISSEQFYGTPYIW----------KMYGILDSIASGPIEARSSPYSTMVGVGMSMEGI 485
PIW S QFYGTPYIW +MYG LDSI+SGP++AR S STMVGVGM MEGI
Sbjct: 423 PIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGI 482
Query: 486 EQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGAN 545
EQNPVVY+L SEMAF+ KVDV+KWL Y+ RRY I+ AW++LYHT+YNCTDG
Sbjct: 483 EQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIA 542
Query: 546 DKNRDVIVAFPDVDPSSILVLPEVSDRHGNLDSSMDG-----------LQDATFDRP--H 605
D N D IV PD DPSS V D DS M QD T D P H
Sbjct: 543 DHNTDFIVKLPDWDPSS-----SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADLPKAH 602
Query: 606 LWYPTSEVIRALKLFIAGGNQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDA 665
LWY T EVI+ALKLF+ G+ LS S TYRYD+VDLTRQ L+K +N+++ V A+ D
Sbjct: 603 LWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFVKKDI 662
Query: 666 QTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAHSKEEEKQYEWNARTQITMW 725
++ LS++FLEL+ D+D LLA + LLG WL+SAK+LA + +E KQYEWNARTQ+TMW
Sbjct: 663 GSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQVTMW 722
Query: 726 FDNTEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKL 785
+D+ + S L DY NK+WSGLL DYY PRA +YF + +S + F + WRREWI +
Sbjct: 723 YDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRREWIMM 782
Query: 786 TNDW-QSSRKIYPVESNGDALDTSHWLYNKY 790
++ W QSS ++YPV++ GDAL S L +KY
Sbjct: 783 SHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FNA3 | 1.5e-280 | 57.71 | Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1 | [more] |
P54802 | 1.1e-147 | 38.96 | Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2 | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3BVG2 | 0.0e+00 | 95.67 | alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 ... | [more] |
A0A5D3BH46 | 0.0e+00 | 89.57 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E56... | [more] |
A0A6J1C176 | 0.0e+00 | 89.25 | alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC11100744... | [more] |
A0A5A7UWC6 | 0.0e+00 | 87.86 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C... | [more] |
A0A6J1ECY3 | 0.0e+00 | 88.24 | alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041... | [more] |
Match Name | E-value | Identity | Description | |
XP_008453133.1 | 0.0e+00 | 95.67 | PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo] | [more] |
XP_011658935.1 | 0.0e+00 | 94.43 | alpha-N-acetylglucosaminidase [Cucumis sativus] | [more] |
XP_038880130.1 | 0.0e+00 | 92.72 | alpha-N-acetylglucosaminidase-like [Benincasa hispida] | [more] |
KGN63620.2 | 0.0e+00 | 91.30 | hypothetical protein Csa_013990 [Cucumis sativus] | [more] |
TYJ98583.1 | 0.0e+00 | 89.57 | alpha-N-acetylglucosaminidase-like [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT5G13690.1 | 1.0e-281 | 57.71 | alpha-N-acetylglucosaminidase family / NAGLU family | [more] |