Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGGTCCGATGTGGATGAAGAGGCCCCACATCCCTGTTTATATATTATATTACACTAATCAGTTGCAGTTTCGCTTTCGCGCGCTCTTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCTGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCGATTTTCTCTACAAATTTTCGTTTCTACTTTCTACTTCCACTTCTGATCTCTCTTTTTTCTCTGTTCCTCTTCTTATGGCGTCCTTTTTCTCTTCCACTTTTCTTATCTTCGTTACAATTTTCGCCGCCTTCTCAACTTCTCGGTCGTCGACGATCGGAGTGGAGTACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCAGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGTACTTGGATGATTTTCTTTACAAGTTGCTGTGTTATCTATTTGACTGATGAAAGAAGTATAGACTACAGAGTAGTTTGTTAGAGAGATTTTAATTTGTTTTCTAAGGTACTCGGACATTTAGGAAAATGAAAGAATTATTGTGTACTTGAAGTTTCGTTTGCTTCTAGTGTCTGTTTTACATGTTTATATTTCTCGTGGCCTTCTTATATGTTGATTTTTCTCAGGACAAATGTGGTGGAGAATCTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGGAAACCCGGGGATCCTGAGATTTTGTACGTCGAAACTATTTGCTAATCTTTTCATTCAGTTCGCCTTGATCTAATTGGCCTTTAAAATTGGATTAAGCTACTGTGTTGCTTCTTGTTATTTAATAATTGTAGGATCGGGAGTTCACATCACGAAAAATATGAATTCCTATTGCACATTGTTCTTTGTATATTGAAACTAAAATCAATCTCTTACTTTTCAACATGAACACGATTTCCAATCTTTTTTTAGAATCGCTGGGGTCACTGGAGTGGAGGTTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCGCAACTGTTTTCTGTACCTAAGGCAGGACTGTTACCTCGTATTCAAACAGACGAAGTTGTGATTCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTGTAAGGTTTTCTTTTTGGATTTTTCATTACATCCATCTTGTTGCAATGCACAATTTACATAACAGAAATCGAATTACAATATTTGGTGTTCCTCCAATAACAGTTATTTTTTCACTGTTTTGATACATGTTTTTCTATTTTCACTTTCTTTGCTTCTGGTAGACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGGTATGTTTTCCTCTATTTATAAGTGTTATGTTATTCATACTGTCTTCAACATAATGAAATTTTCAGCAAGTGTAACTGTCATCTTACCCTACATATACTATAATCTCTGACTTATCTTAATATATTTGATGATTGAAACAATGTTACTAATTGTGCATCATGGGACCATGGATGCAATAGTCTACATCATTTGCAGTATAGGAATAGAAGTATAATATTACTCTCTGCTGCTTCCTGCTATTAGAAATTGTTTCTGGCATTATCTCAACTCTTCCTTACATTAGAAGCTTCTTTTGATCAGTACTTTCTTCATCATTGCTATGGTAGCTTTCATTTCCTTTTCTTAAAAAAAAAAGAAAAAGTTATAGGCATGCTCGTTTCCTACATTTTAATTTATGCAACAAAGTGGCTAAATCGAAAAAGCTATCTCTAGTAATTTCTTTTGGC
TATATCCTATTACAGTTTACTTGGTTCAAAGTATAGATCGATCTCATTCCACACTATATTTGCAGAACTTTAATATAAGCAACTCAGATTTGGATGATTTCTTCGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAGTGAGTAGTTGCTTTGTAACTGCGTTGAAATTAGATGATTGGTGGATGGATATTCTATTCCACTGGGACTAAAATGAAAGAATGATAAATATACAATCATCGTCACAGTCAGCTACATAATCTTCCTGCTTTAATCAGATATATGTAGAGAATGGCTTAAATGTGGCATGTTTTTTGACTCTTATATTTCACTTCTAGTAATAAAAAGATCGCTGGTAATTCTTATCTTTGCCTTTTTAATATCAACACTCTTATCACAGATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGGTTTTTTTAAGCACTGTCATTAAATTTATGTAGTATCTCATGTTGTAAAGAGTTCATAATTCTTTGGACTTGACCCAGTAGGAAACTTAAGATTCTGTTTCATTTAAATATTTTCACTTTCATGCTGGCTTTCTTTTTGAATTGCTGTTTTAAAGCTTCAGTAATGTTGCTATAGTTCGTACAGCCTATTTACTGGACAAAGTTCTAGAAAGCTTTTGTTCTGGGAACAATTTATGGACCATGGTTTTCTTTTTTCTCCTTTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCAAAAATAACACGCTTGGGAAATTGGTAACTCAACACTCCCTTATGGTCTTATCGTATATAAATTTCATATTTGATATGGTTAATAATGTTTTTATTGTATTGTTTTGACATTTTCTTAGAATATATTTTGACACATTCTGGAACCAGTGTTGGTCAAATCCATGATAGTCCACGATACGAAGAATGTGATGCACTTTTGAAAATAATCACTTTAGTGTACTGTTTGATAATATTTTTTTTGTTTTTGGTCTTAATATTTTAGAAGTTTTTTTAAATTTGTCTAAATATGCTTATTATTTTAAAACAAAGTGTAATTCGAATGACTAAAGAAAAACATTTTTAGGATAAAACATTACCTAAAGAAAAATATTTAAAATGGAAAAAAAAAAGATCGAAGGATTAGTGGAAGGAAGCAGGCAAATGAAGAGAATCTAGTGGAAAACTTTATACACAGGGAGAGTAGTAAAGGTGAATGTCTAAATAGGAGAAAGGCATGTGAATTGAGAGAAGTTAGAAATTTTATTTTTAGATGGAGTCAACTAGAAAATTATATACAGAGGAAGATCTCGATAGGGATTAATGCCAGAAAGTATTGGGATAGAAGTTAATGAAATTAGTTTGATTGGTGAGCGAGAAAGTATATCTAACATGACTAGTAAGAGAAACAATGGAGAGCAATTAGTAAGACAAGTTATATAGAGAGAAGTTATTAAATTCGAGCTAACATGAGTTCCATTCATTAGAAATTTACTACCAAAATATGAAATGATCTCTTCATAGAGTAGTCTGTAGAAAACCTCCAACTTTTTATTTGAAGCAACAAGCTGCGGTAATCCTTTAAGTCGTGATGCTTGAGCATCAAGAATTTTTTGCAGAGTGGTTTAACTAATGACAATGTCAATTGTCTCTATGGTACCATTGTCCTGCAGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGGTATTTTTGGACGTATGCTAATGAAATGTACGAGAACATATTTCATGCCTTTTTCTTAAATGATGCTTGATGATTCTCATAGTCTGAAATTACATTTTTTGGCCAGAATATGGAAGAACTTCCCATGTATACAA
TTGGTATGGTGCTTGTTTCCTCAACAAACTTTGTATTGTGTGTGGTTCATTTTTCTTTTGATTTTCACTGCAAGATGTGTATGGTCAGCTTTGTTACATTATAATGGCAGAAAAGCTAGCGAGAAACAAGCATTTAAAATTGGTGTAATATTGGTTTCAACAGTCTTTAAATTGTATTTTCTTCAAACATGAGAAATATCCGCAATCAATTTTTGACATCATGAATATCTTTTCACCCTACAGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGTAATTGTTGGTTCGCTTTTCCATCTAAGGGGCTGTTGGGCCAATTCACTTAATTCTCTGTATAGTTCCAATTTTAATGGATGACCCCGAAGGGATCCAATTCACTGAGTTTTCTAAGTCATATCACAAGCAAGTGGAACATGATATGGATTACTCTAGTGAATGTGCGCTGCATCTCTTCTTGTTCTGTTTTGTTGAAATGTGAACTTTTTTCTCGATCTTGCTAATGAAATTTGTTCATGCACAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCCACAAATGAAGGTTATAATCATGCTTTATGTCACCAATTGCTCATGCAATCATACACGAATATATCTTCCTACTTTCAATCATCATTAGATCATTTCATGTATGAAGGATTTTATCAGGGAATCTTTGACTCAGATCCGTTGTGTTATCTGTAATTGCTAATTCATTTACTTGGGCTCGTTGTATGGTGTACCTTTTTTTATCTACTCAATCTTTAGGGTTGTCATTGTGCATTACTTTCACTTCTGGTTGAAATTGGTGAGCATTTTCAGGCACTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCAATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAGTCTCTATTTCCATTCTTTTGTTTAATCTTAATGTTCATATCTAATAGGTCAAGATTTAGGCTTACAGAAGGCTTTTTTCACAACATATCAGCGATGATGAGAATGTTATAATTATCGGTCAAGAATCGATATTCAAAATTCATTAGTTTTAAGTTTTCTTTATTTGTTATACTTTTTGATATTCACAAGTGTCTAAAGTTATTTTTCATCCGACTATTAACCTTTTTTTAGTTGCTATACATAGGTTGATTATATCGACAATCTAAGAACTAACCATTTAAAAGCAGTGGGGTTGAGAATGGCGTATTATATATCTTTTTGCAAGCCCATTTTTATAAGTTAATAATAATTTGAAGAGAGATTCTCAAACCCATTGTTGATTGATTAAAGTTTCCTTGCTTAAGCATTACATTATCTAATTTTTTGGCTAAAATTATATCCTTTTCAATGTAATTTCAGTCCAATGTTTTCTTCGTGTTTTTACTTTCTAAGAAAGATGTTTCTGCCAAGGCAATTTTGATTCCCCATTTCATTTAGTTTTCAGGTTAAAACAGCAGTTATCTGACTTTAGCCGTTTCAAGAAACTTTTTTTTTATAACAAAACTGCTTATCTCGTTAATTCTGTTGTCTTAAGCTAAAACTAGATTGGTTTATTTTTGCATTAGCCATATGCTATATTTCAATGAATGTGTAAATTTTAGCTTCAATAATATGAATTGCTATTCACAGGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCCGGACCAATTGAAGCTCGTAGTAGTCCGTACTCAACAATGGTAAGTTCTTCATCTACTATTATTTGGTGTCTAGATGTTGTCATAATGTTTTGGCAGCATATTGAATTCCTATGGGTATACTGAGTTTTTATTTGAAAAGGAAACAATTCTCTTTATTAATAAAAATGAGACTACTG
CTCAAAACACAAGAGGATTATACAAAGAACAGAAAATGTAGAATTTAGGGATCGGAAGGCACACTTAGGTGTCTCAACTAGGTTGACACCCCTTTAGCACCGTCATCATCTCCAAAATAAATACCCAACTGTGGGTATATATGGGTATACTAATTTTGATAAATGCTCCAATAGTTTTATAAAAGTAAATTTGGTGCTTTCTTTCTCTCGTCCATAAGGTGCAGCTCTGGGAATATATATGAGTATTAATGTAATTTTGAAGAGATGAATTGATTATCTTTTTTCTTTTCATTATTGAAAAGACGGCTTGAAAGATTATTTTGGCATTTAATATTGCTCAATTCATTTATTTTTTTATTATGTAGTGAAGTGACAAATGGCTTAATCATAGTGGATCGTGGCAGCACAGTCACACTGCCAAAGCTTTATATTTAACATATCTATGAAGATATTACTAATTCCAATCTTCCTCTAACCTTCTCTTTTTCTCATATGCCTGTGGTGTATCATTTATTTTGCAATGTTTAAATTCGTTTTAACTTTGTCAACGGTGTTTTCATTTGAAGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGGTACAAATGTTTTACAATTATTTTTAAAATATAATGACCCATTTACTTCATTGCCAGTTGTATTCATGTTCCATGTGCGGAAAGCTGACAAGTGACTTTACATAGATTTCATCGAATATACTGAAGGGCCCATCATATGAAAAGGATAGTGAAATCATTTCTTTTCTGCCGTGCATATGTTCACATCCTGGCTAATGCCAATGTGGTTTTATTTTCCAAGTTGGTTTCACAATTAGTTCTGTTGATGTCTTGCATGCAATGCATTATGAGAAGGAACTGATAGATGCGAAAGATGTTAGAAAATGTTTCATAGTGAATTTCATAGAATTTAAGAACAATCACGACAAGTTCTATGTAAAATATCTAAGCATGGAAGAAATATGAGATCTATATGGAGATTAGAAAAACTGGAGCTCTTAGATAGATCAACTAACTTGCAAAGTTACTTGAATTAAGCAATCTTCCTCAACTAACCAAATATAGGCAATCTTCTTTGCCGATAAATATAGGCAACTTCAAGTATGCTATACTATAACATAAGTATATGTTTTTTCTCCACCGACCATACTAATGTAGACGAAGTCCATAGGTCAATAATTAAGGCCATTTGCTTATTAAATAGCTAAGATATAATTTTCGTTTGATTCGTTCCCAAATATGAAATAATTGTTAACTTCAACAATAGAAATTTTCTTTTCTAGTTCAACAAATTTCAGGTTTGCTCATTACCTTTGGCCAACATACTGCACACTTTTCCCATGATTTCTGAATTTTATCTCTAGCACTCTCTATTTTAAAGTTTATTTTCTCCAATAGAATTTATTTTCATGATAGATTTTGCGCAATTGTGTTTATAGAAAATTTTGTTCTCATCAATCAGAATTTGTAAAATACTATCCTTTCATTTCCAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGTAAGTACGAAAAAAACTGTAATGTCCATTCAACCTTTTCTTCTCTTTTGGTATTCTACGACTAGTGTTTCCAGTCTTGTTGCTTTCAAATGGTCATGGCTCGTCTAATTTCCTTTCTTGCTCGTGCAAAGTCTGAATTGTGTTCAAACATGTTATGCTAACGTATGTTGGTCACTATGGAAAGGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGGGTCCGACCAACATGGGATCTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACGTTT
GATCGACCTCATCTTTGGTATCCTACTTCTAAAGTAATTAGTGCACTCAAGCTTTTCATTGTTGGTGGCGATCAACTCTCTGGTAGCAACACTTACAGGTGGACTCACGAATTTAAAATATCTTATCACTTACTTGAAATCTATAGAGATATAAAAAATCGTCCTTTTCCCTCTGTTGAAGTCTAGATTATTGAATTGGCATTTGTTACCACTCTTAGGTATGACCTCGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAACTGTCAAGGCATATCAGTTATATGATGCACAGACAATGGCCAGCTTAAGCCAAGAATTCCTCGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAAGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCAAAGTGAAGAGGAGGAAAAACAGGTTCACATTCTACCAATTATGAGAAAATTATACCAATAACATCCAAATTTTAACTCATCTCAAATTACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTGCTTCGTGATTATGGTAATGATAACTCTGGACCTGGACTCAACTCTATATCAATAGATTGTCATTTATCATCGAGATTAGGCAATTGTACATTCAAATTTGACTTATTTAATCTCGACCCAGGAAACAAGTACTGGAGTGGACTCTTGGGAGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTCACACATCCCACTGGCTCTACAACAAATACTTGCAAATACCTGAGAGCTCCGATCAATGA
mRNA sequence
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGGTCCGATGTGGATGAAGAGGCCCCACATCCCTTTTCGCTTTCGCGCGCTCTTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCTGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCGATTTTCTCTACAAATTTTCGTTTCTACTTTCTACTTCCACTTCTGATCTCTCTTTTTTCTCTGTTCCTCTTCTTATGGCGTCCTTTTTCTCTTCCACTTTTCTTATCTTCGTTACAATTTTCGCCGCCTTCTCAACTTCTCGGTCGTCGACGATCGGAGTGGAGTACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCAGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGACAAATGTGGTGGAGAATCTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGGAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGGTTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCGCAACTGTTTTCTGTACCTAAGGCAGGACTGTTACCTCGTATTCAAACAGACGAAGTTGTGATTCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGAACTTTAATATAAGCAACTCAGATTTGGATGATTTCTTCGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCCACAAATGAAGGCACTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCAATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAGTCTCTATTTCCATTCTTTTGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCCGGACCAATTGAAGCTCGTAGTAGTCCGTACTCAACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGGGTCCGACCA
ACATGGGATCTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACGTTTGATCGACCTCATCTTTGGTATCCTACTTCTAAAGTAATTAGTGCACTCAAGCTTTTCATTGTTGGTGGCGATCAACTCTCTGGTAGCAACACTTACAGGTATGACCTCGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAACTGTCAAGGCATATCAGTTATATGATGCACAGACAATGGCCAGCTTAAGCCAAGAATTCCTCGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAAGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCAAAGTGAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGAGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTCACACATCCCACTGGCTCTACAACAAATACTTGCAAATACCTGAGAGCTCCGATCAATGA
Coding sequence (CDS)
ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGGTCCGATGTGGATGAAGAGGCCCCACATCCCTTTTCGCTTTCGCGCGCTCTTCCACTCTCGTCGGGACACCGCCGGCCACTGCCATTGTCTGGTAGCCACTGGAACTTTTCAAAACCAATTGCCATTGCTGGATTCCTCGACGTCATCCACCACCATGCCGATTTTCTCTACAAATTTTCGTTTCTACTTTCTACTTCCACTTCTGATCTCTCTTTTTTCTCTGTTCCTCTTCTTATGGCGTCCTTTTTCTCTTCCACTTTTCTTATCTTCGTTACAATTTTCGCCGCCTTCTCAACTTCTCGGTCGTCGACGATCGGAGTGGAGTACATTTCGCGGCTTCTTGAGATTCAGGATCGCGAGAGAGTGCCAGCGTATGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGGCTTCTTCCTTCTCACCTCTCTAGCTTTGACTTTCAGATTGTCTCTAAGGACAAATGTGGTGGAGAATCTTGCTTTGTGATCAGGAACCATCGCGCGTTCAGGAAACCCGGGGATCCTGAGATTTTAATCGCTGGGGTCACTGGAGTGGAGGTTTTAGCTGGCTTGCACTGGTATCTTAAGCACTGGTGTGGTGCACACATATCTTGGGATAAAACAGGTGGCTCGCAACTGTTTTCTGTACCTAAGGCAGGACTGTTACCTCGTATTCAAACAGACGAAGTTGTGATTCGGAGGCCTATTCCTTTGAACTACTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGAAAAGGTGGGAAAAGGAAATAGACTGGATGGCTCTTCAGGGTATCAATATGCCTCTAGCATTTACTGGGCAGGAGGCTATTTGGCGAAAAGTATTTCAGAACTTTAATATAAGCAACTCAGATTTGGATGATTTCTTCGGAGGTCCAGCTTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCACTGCCGCAAAGTTGGTTTGATCAACAACTTATTCTGCAGAAAAAGGTTATTGGCAGAATGTTTGAGCTGGGAATGACTCCAGTTTTGCCAGCCTTTTCGGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCAAAAATAACACGCTTGGGAAATTGGTTTACTGTTCATAGTGACCCTAGATGGTGCTGCACTTACCTTCTTGATGCCATGGACCCTTTATTTGTCGAGATTGGTAAAGCATTTATTGAGCAACAACAGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTTGATGAGGTGGAATACATCTCTTCATTAGGTTCAGCTATTTTTGGAGGAATGCAGGCAGGAGATTCTAATGCTGTCTGGCTAATGCAGGGGTGGATGTTTTCATATGATCCATTCTGGAGGCCTCCACAAATGAAGGCACTTTTACATTCTGTCCCTCTGGGAAGGCTGGTAGTCCTCGATCTGTATGCTGAAGTGAAGCCAATCTGGATATCCTCTGAGCAATTTTATGGCACTCCTTACATCTGGAAAGTCTCTATTTCCATTCTTTTGTGCATGCTACATAATTTTGCTGGAAATGTTGAGATGTATGGCATTTTAGATTCAATAGCATCCGGACCAATTGAAGCTCGTAGTAGTCCGTACTCAACAATGGTCGGTGTAGGAATGTCCATGGAAGGAATTGAACAGAATCCTGTTGTCTACGATCTTATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTTCCTCAGTATTCAGTAAGGCGTTATGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGACGGTGCCAATGACAAAAACAGGGATGTAATTGTGGCATTTCCTGATGTTGATCCATCGTCAATCTTAGTATTACCTGAGGGGTCCGACCA
ACATGGGATCTTGGACTCAAGCATGGATGGCCTCCAGGATGCAACGTTTGATCGACCTCATCTTTGGTATCCTACTTCTAAAGTAATTAGTGCACTCAAGCTTTTCATTGTTGGTGGCGATCAACTCTCTGGTAGCAACACTTACAGGTATGACCTCGTAGATTTGACCAGGCAAGCTCTAGCCAAATACTCAAATGAACTGTTCTTTAGAACTGTCAAGGCATATCAGTTATATGATGCACAGACAATGGCCAGCTTAAGCCAAGAATTCCTCGAACTTGTCAATGATATTGACACATTATTGGCTTGTCACGAAGGATTTCTTTTGGGACCTTGGCTACAAAGCGCCAAACAACTCGCCCAAAGTGAAGAGGAGGAAAAACAGTATGAATGGAATGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTGCTTCGTGATTATGGAAACAAGTACTGGAGTGGACTCTTGGGAGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGTTCGGAGAATGGATACAGATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAACTAACAAATGATTGGCAAAGCAGCAGAAAGATTTACCCTGTGGAAAGCAATGGAGATGCACTTCACACATCCCACTGGCTCTACAACAAATACTTGCAAATACCTGAGAGCTCCGATCAATGA
Protein sequence
MKINGRKLVKRRSDVDEEAPHPFSLSRALPLSSGHRRPLPLSGSHWNFSKPIAIAGFLDVIHHHADFLYKFSFLLSTSTSDLSFFSVPLLMASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ
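The protein sequence above should correspond to a frame-0 translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code and using only the first 36 nucleotides of the CDS as listed; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace (e.g. line wraps in the listing) is ignored. Ambiguous
    bases such as N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nt of the CDS listed above.
cds_start = "ATGAAGATCAACGGCCGTAAATTAGTCAAGAGGAGG"
print(translate(cds_start))  # MKINGRKLVKRR
```

The output matches the first twelve residues of the protein sequence. Passing the full CDS (joined across its wrapped lines) to the same function would be one way to compare the whole translation against the listed protein.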
Homology
BLAST of IVF0012391 vs. ExPASy Swiss-Prot
Match:
Q9FNA3 (Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1)
HSP 1 Score: 998.8 bits (2581), Expect = 3.9e-290
Identity = 471/786 (59.92%), Positives = 586/786 (74.55%), Query Frame = 0
Query: 122 ISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLSSFDFQIVSKDKCGGESCFVIRNHRA 181
I LL+ D + VQ +AA+G+L+RLLP+H SF+ +I+SKD CGG SCFVI N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 182 FRKPGDPEILIAGVTGVEVLAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTD 241
+ G PEILI G TGVE+ +GLHWYLK+ C AH+SWDKTGG Q+ SVP+ G LPRI +
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 242 EVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVF 301
+ IRRP+P NYYQN VTSSYS+ WW W+RWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 302 QNFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 361
+ FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 362 PVLPAFSGNIPAAFKQIYPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIE 421
PVLP+FSGN+P+A ++IYP A ITRL NW TV D RWCCTYLL+ DPLF+EIG+AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 422 QQQKEYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSY 481
QQ +EYG +++YNCDTF+ENTPP E EYISSLG+A++ M G+ NAVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 482 D-PFWRPPQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLH 541
D FW+PPQ+KALLHSVP G+++VLDLYAEVKPIW S QFYGTPYIW CMLH
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIW--------CMLH 447
Query: 542 NFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV 601
NF GN+EMYG LDSI+SGP++AR S STMVGVGM MEGIEQNPVVY+L SEMAF+ KV
Sbjct: 448 NFGGNIEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKV 507
Query: 602 DVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILV 661
DV+KWL Y+ RRY I+ AW++LYHT+YNCTDG D N D IV PD DPSS +
Sbjct: 508 DVQKWLKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQ 567
Query: 662 LP-EGSDQHGILDSSMDG-----LQDATFDRP--HLWYPTSKVISALKLFIVGGDQLSGS 721
+ D + I + QD T D P HLWY T +VI ALKLF+ GD LS S
Sbjct: 568 DDLKQKDSYMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRS 627
Query: 722 NTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHE 781
TYRYD+VDLTRQ L+K +N+++ V A+ D ++ LS++FLEL+ D+D LLA +
Sbjct: 628 LTYRYDMVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDD 687
Query: 782 GFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGD 841
LLG WL+SAK+LA++ +E KQYEWNARTQ+TMW+D+ + S L DY NK+WSGLL D
Sbjct: 688 NCLLGTWLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLED 747
Query: 842 YYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDW-QSSRKIYPVESNGDALHTSH 898
YY PRA +YF + +S + F + WRREWI +++ W QSS ++YPV++ GDAL S
Sbjct: 748 YYLPRARLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISR 804
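The percentages on each HSP statistics line are the identical or positive column counts divided by the alignment length. A small sketch that recomputes them from such a line is given below; the regular expression and the helper name `hsp_percentages` are assumptions made for illustration, not part of any BLAST tooling.

```python
import re

# Matches "Identity = 471/786 ... Positives = 586/786"; "Pos\w*" also
# tolerates misspelled variants of "Positives" in scraped reports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP stats line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, positives, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positives / pos_len, 2),
    )


line = "Identity = 471/786 (59.92%), Positives = 586/786 (74.55%), Query Frame = 0"
print(hsp_percentages(line))  # (59.92, 74.55)
```

The recomputed values agree with the percentages reported for the Q9FNA3 hit.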
BLAST of IVF0012391 vs. ExPASy Swiss-Prot
Match:
P54802 (Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2)
HSP 1 Score: 541.2 bits (1393), Expect = 2.2e-152
Identity = 285/719 (39.64%), Positives = 422/719 (58.69%), Query Frame = 0
Query: 186 GDPEILIAGVTGVEVLAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVI 245
G + + G TGV AGLH YL+ +CG H++W GSQL +P+ LP + E+
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAW---SGSQL-RLPRP--LPAV-PGELTE 130
Query: 246 RRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFN 305
P YYQN T SYSF WWDW RWE+EIDWMAL GIN+ LA++GQEAIW++V+
Sbjct: 131 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVYLALG 190
Query: 306 ISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLP 365
++ +++++FF GPAFLAW RMGNLH W GPLP SW +QL LQ +V+ +M GMTPVLP
Sbjct: 191 LTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLP 250
Query: 366 AFSGNIPAAFKQIYPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQK 425
AF+G++P A +++P +T++G+W H + + C++LL DP+F IG F+ + K
Sbjct: 251 AFAGHVPEAVTRVFPQVNVTKMGSW--GHFNCSYSCSFLLAPEDPIFPIIGSLFLRELIK 310
Query: 426 EYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDP-F 485
E+G T H+Y DTF+E PP E Y+++ +A++ M A D+ AVWL+QGW+F + P F
Sbjct: 311 EFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQF 370
Query: 486 WRPPQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLHNFAG 545
W P Q++A+L +VP GRL+VLDL+AE +P++ + F G P+IW CMLHNF G
Sbjct: 371 WGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIW--------CMLHNFGG 430
Query: 546 NVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV-DVK 605
N ++G L+++ GP AR P STMVG GM+ EGI QN VVY LM+E+ ++ + V D+
Sbjct: 431 NHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLA 490
Query: 606 KWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCT-DGANDKNRDVIVAFPDVDPSSILVLP 665
W+ ++ RRYG P AW +L ++YNC+ + NR +V P + ++
Sbjct: 491 AWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEACRGHNRSPLVRRPSLQMNT----- 550
Query: 666 EGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDL 725
+WY S V A +L + L+ S +RYDL+DL
Sbjct: 551 ------------------------SIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDL 610
Query: 726 TRQALAKYSNELFFRTVKAYQLYDAQTMASLSQE----FLELVNDIDTLLACHEGFLLGP 785
TRQA+ + + L++ +A Y ++ +ASL + EL+ +D +LA FLLG
Sbjct: 611 TRQAVQELVS-LYYE--EARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGS 670
Query: 786 WLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYGPRA 845
WL+ A+ A SE E YE N+R Q+T+W E ++L DY NK +GL+ +YY PR
Sbjct: 671 WLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYTPRW 730
Query: 846 AIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKY 898
++ + L +S G F + + +L + S++ YP + GD + + ++ KY
Sbjct: 731 RLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734
BLAST of IVF0012391 vs. ExPASy TrEMBL
Match:
A0A1S3BVG2 (alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 SV=1)
HSP 1 Score: 1685.2 bits (4363), Expect = 0.0e+00
Identity = 808/816 (99.02%), Positives = 808/816 (99.02%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKPIWISSEQFYGTPYIW CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM
Sbjct: 421 VKPIWISSEQFYGTPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY
Sbjct: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM
Sbjct: 601 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 870
TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND
Sbjct: 721 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 780
Query: 871 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 907
WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ
Sbjct: 781 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of IVF0012391 vs. ExPASy TrEMBL
Match:
A0A5D3BH46 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G002030 PE=4 SV=1)
HSP 1 Score: 1580.5 bits (4091), Expect = 0.0e+00
Identity = 774/852 (90.85%), Positives = 776/852 (91.08%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLI VTIFAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQI DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLD
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLD---- 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
LCMLHNFAGNVEMYGILDSIASGPIEARSS YSTM
Sbjct: 421 -------------------------LCMLHNFAGNVEMYGILDSIASGPIEARSSQYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY
Sbjct: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM
Sbjct: 601 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDY------------------------------------GNKYWSGLLGDYY 870
TEEEASLLRDY GNKYWSGLLGDYY
Sbjct: 721 TEEEASLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYY 780
Query: 871 GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY 907
GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY
Sbjct: 781 GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY 820
BLAST of IVF0012391 vs. ExPASy TrEMBL
Match:
A0A5A7UWC6 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G001950 PE=4 SV=1)
HSP 1 Score: 1546.6 bits (4003), Expect = 0.0e+00
Identity = 763/858 (88.93%), Positives = 768/858 (89.51%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLIFVTIFAAFSTSRSST GVEYISRLLE+QDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTTGVEYISRLLEVQDRERAPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT EVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTGEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKE-YGR-----TSHVYNCDTFDENTP 450
F VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQK +G T + DTFDENTP
Sbjct: 301 FAVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKGIFGPMLMKCTRTYFMPDTFDENTP 360
Query: 451 PVDEVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVV 510
PVDEVEYISSLGSAIFGGMQ GDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVV
Sbjct: 361 PVDEVEYISSLGSAIFGGMQTGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVV 420
Query: 511 LDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARS 570
LD LCMLHNFAGNVEMYGILDSIASGPIEARS
Sbjct: 421 LD-----------------------------LCMLHNFAGNVEMYGILDSIASGPIEARS 480
Query: 571 SPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDA 630
S YSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDA
Sbjct: 481 SQYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDA 540
Query: 631 WDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFD 690
WD+LYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFD
Sbjct: 541 WDILYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFD 600
Query: 691 RPHLWYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQL 750
RPHLWYPTSKVISALKLFIVGGDQL GSNTYRYDLVDLTRQALAKYSNELFFR VKAYQL
Sbjct: 601 RPHLWYPTSKVISALKLFIVGGDQLFGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQL 660
Query: 751 YDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQI 810
YDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQI
Sbjct: 661 YDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQI 720
Query: 811 TMWFDNTEEEASLLRDY------------------------------------GNKYWSG 870
TMWFDNTEEEASLLRDY GNKYWSG
Sbjct: 721 TMWFDNTEEEASLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSG 780
Query: 871 LLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALH 907
LLGDYYGPRAAIYFKFLKESS+NGYRFPLSNWRREWIKLTN WQSSRKIYPVESNGDALH
Sbjct: 781 LLGDYYGPRAAIYFKFLKESSKNGYRFPLSNWRREWIKLTNAWQSSRKIYPVESNGDALH 829
BLAST of IVF0012391 vs. ExPASy TrEMBL
Match:
A0A6J1C176 (alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC111007441 PE=4 SV=1)
HSP 1 Score: 1525.8 bits (3949), Expect = 0.0e+00
Identity = 726/817 (88.86%), Positives = 766/817 (93.76%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MAS F + FLIFV++FAAFSTSR STIGV YISRLLEIQDRER PA+VQVAAARGVLRRL
Sbjct: 1 MASPFPAIFLIFVSLFAAFSTSRFSTIGVGYISRLLEIQDRERAPAHVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQIVSKDKCG ESCFVIRNHR+FR+PGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGRESCFVIRNHRSFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQ++E++++RP+PLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQSNEIIVQRPVPLNYYQNAVTSSYSFAWWDWE 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RW+KEIDWMALQGINMPLAFTGQEAIW+KVFQ FNISN+DLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWKKEIDWMALQGINMPLAFTGQEAIWQKVFQKFNISNTDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGG LPQSWFDQQLILQKKV+ RMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGSLPQSWFDQQLILQKKVLARMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
F+VHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQ KEYGRTSH+YNCDTFDENTPPVD E
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQLKEYGRTSHLYNCDTFDENTPPVDAAE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKPIWISSEQFYGTPYIW CMLHNFAGNVEMYGILDSIASGPIEAR+SPYSTM
Sbjct: 421 VKPIWISSEQFYGTPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARNSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYG LVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLDQYSIRRYGQLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGS--DQHGILDSSMDGLQDATFDRPHL 690
TIYNCTDGA DKNRDVIVAFPDVDPSSIL LPEGS D++ +SS+ L ATFDRPHL
Sbjct: 541 TIYNCTDGAYDKNRDVIVAFPDVDPSSILELPEGSDRDRYRNFNSSVGSLLHATFDRPHL 600
Query: 691 WYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQ 750
WY TS+VI ALKLFI G DQLSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQ
Sbjct: 601 WYSTSEVIRALKLFIAGSDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQ 660
Query: 751 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWF 810
MASLSQ+FLELV DIDTLLACHEGFLLGPWL+SAKQLAQ EE+EKQYEWNARTQITMWF
Sbjct: 661 KMASLSQQFLELVKDIDTLLACHEGFLLGPWLESAKQLAQDEEQEKQYEWNARTQITMWF 720
Query: 811 DNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLT 870
DNTE+EASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKES ENGY FPLSNWRREWIKLT
Sbjct: 721 DNTEDEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESLENGYGFPLSNWRREWIKLT 780
Query: 871 NDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSD 906
NDWQ+SRK++PVE +GDA+ TS WLY KY+QI ES D
Sbjct: 781 NDWQNSRKVFPVEISGDAIDTSRWLYRKYMQILESYD 809
BLAST of IVF0012391 vs. ExPASy TrEMBL
Match:
A0A6J1ECY3 (alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041 PE=4 SV=1)
HSP 1 Score: 1521.1 bits (3937), Expect = 0.0e+00
Identity = 723/816 (88.60%), Positives = 757/816 (92.77%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MA F++ LIF++IF FSTS SSTIG YISRLL+IQDRER P+ VQVAAARGVLRRL
Sbjct: 1 MAPPFAAVCLIFLSIFTTFSTSFSSTIGFVYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQI+SKD CGGESCF+IRNHRAFR+PGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPK G LP IQ+DE+++RRPIPLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVRRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
F+VHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQ KEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSV LGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKPIWI+SEQFYG PYIW CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM
Sbjct: 421 VKPIWIASEQFYGVPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
TIYNCTDGA DKNRDVIVAFPDVDPSSI V+PEGSD+H LQDA F+RPHLWY
Sbjct: 541 TIYNCTDGAYDKNRDVIVAFPDVDPSSISVIPEGSDRH-----DTGSLQDAIFERPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTS+VI ALKLFI GDQLSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQL D QT
Sbjct: 601 PTSEVIRALKLFIASGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTT 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
SLSQ+FLELVNDIDTL+ACHEGFLLGPWLQSAKQLAQ E++EKQYEWNARTQITMWFDN
Sbjct: 661 VSLSQQFLELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 870
TEEEASLLRDYGNKYWSGLL DYYGPRAAIYFKFLKES ENGY FPLSNWRREWIKLTND
Sbjct: 721 TEEEASLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTND 780
Query: 871 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 907
WQSSRK+YPV+SNGDA+ TS WLYNKY Q+ ES DQ
Sbjct: 781 WQSSRKVYPVKSNGDAVDTSRWLYNKYFQVLESYDQ 803
BLAST of IVF0012391 vs. NCBI nr
Match:
XP_008453133.1 (PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis melo])
HSP 1 Score: 1677 bits (4344), Expect = 0.0
Identity = 808/816 (99.02%), Positives = 808/816 (99.02%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKPIWISSEQFYGTPYIW CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM
Sbjct: 421 VKPIWISSEQFYGTPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY
Sbjct: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM
Sbjct: 601 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 870
TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND
Sbjct: 721 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 780
Query: 871 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 906
WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ
Sbjct: 781 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 808
BLAST of IVF0012391 vs. NCBI nr
Match:
XP_011658935.1 (alpha-N-acetylglucosaminidase [Cucumis sativus])
HSP 1 Score: 1622 bits (4201), Expect = 0.0
Identity = 779/816 (95.47%), Positives = 794/816 (97.30%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRK GDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKPIWISSEQFYG PYIW CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM
Sbjct: 421 VKPIWISSEQFYGIPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
T+YNCTDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LDSS+D LQDATFDRPHLWY
Sbjct: 541 TVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLDSSVDRLQDATFDRPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTS+VISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSNELFFR VKAYQL+D QTM
Sbjct: 601 PTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLHDVQTM 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEEEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTND 870
TEEEASLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENGYRFPLSNWRREWIKLTND
Sbjct: 721 TEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENGYRFPLSNWRREWIKLTND 780
Query: 871 WQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 906
WQSSRKIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 WQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 808
BLAST of IVF0012391 vs. NCBI nr
Match:
XP_038880130.1 (alpha-N-acetylglucosaminidase-like [Benincasa hispida])
HSP 1 Score: 1583 bits (4099), Expect = 0.0
Identity = 763/818 (93.28%), Positives = 781/818 (95.48%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MAS FSS FLIFV+IFAAFSTSRSSTIGV YISRLLEIQDRER PAYVQVAAARGVL RL
Sbjct: 1 MASPFSSIFLIFVSIFAAFSTSRSSTIGVGYISRLLEIQDRERAPAYVQVAAARGVLHRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVE+LAGLHWYLK+
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKN 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDE+VI+RP+PLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEIVIQRPVPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFHKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFK IYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKHIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
F+VHSDPRWCCTYLLDA DPLFVEIGKAFIEQQQKEYGRTSH+YNCDTFDENTPPVDEVE
Sbjct: 301 FSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQQKEYGRTSHIYNCDTFDENTPPVDEVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLG+AIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
VKP+WISSEQFYGTPYIW CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM
Sbjct: 421 VKPVWISSEQFYGTPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQ--DATFDRPHL 690
TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGS++HG LDS +D L+ DA FDRPHL
Sbjct: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSERHGNLDSRVDSLRLGDAMFDRPHL 600
Query: 691 WYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQ 750
WYPTS+V ALKLFI GGDQLSGSNTYRYDLVDLTRQALAKYSNELFFR VKAYQLYDAQ
Sbjct: 601 WYPTSEVTRALKLFIAGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQ 660
Query: 751 TMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWF 810
TMA+LSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQ EEEEKQYEWNARTQITMWF
Sbjct: 661 TMANLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQIEEEEKQYEWNARTQITMWF 720
Query: 811 DNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLT 870
DNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRF LSNWRREWIKLT
Sbjct: 721 DNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFQLSNWRREWIKLT 780
Query: 871 NDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 906
NDWQSSRK+YPVESNGDAL TSH LY KYLQ ES DQ
Sbjct: 781 NDWQSSRKVYPVESNGDALDTSHCLYYKYLQRLESFDQ 810
BLAST of IVF0012391 vs. NCBI nr
Match:
TYJ98583.1 (alpha-N-acetylglucosaminidase-like [Cucumis melo var. makuwa])
HSP 1 Score: 1572 bits (4070), Expect = 0.0
Identity = 774/852 (90.85%), Positives = 776/852 (91.08%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLI VTIFAAFSTSRSSTIGVEYISRLLEIQDRER PAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHLSSFDFQI DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYG+TSHVYNCDTFDENTPPVDEVE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMKALLHSVPLGRLVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 511 VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM 570
CMLHNFAGNVEMYGILDSIASGPIEARSS YSTM
Sbjct: 421 --------------------------CMLHNFAGNVEMYGILDSIASGPIEARSSQYSTM 480
Query: 571 VGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYH 630
VGVGMSMEGIEQNPVVYDLMSEM FQ NKVDVKKWLPQYSVRRYGHLVPSIQDAWD+LYH
Sbjct: 481 VGVGMSMEGIEQNPVVYDLMSEMGFQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYH 540
Query: 631 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 690
TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY
Sbjct: 541 TIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWY 600
Query: 691 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 750
PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM
Sbjct: 601 PTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTM 660
Query: 751 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 810
ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN
Sbjct: 661 ASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDN 720
Query: 811 TEEEASLLRDYGN------------------------------------KYWSGLLGDYY 870
TEEEASLLRDYGN KYWSGLLGDYY
Sbjct: 721 TEEEASLLRDYGNDNSGPGLNSISIDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYY 780
Query: 871 GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY 906
GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY
Sbjct: 781 GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY 820
BLAST of IVF0012391 vs. NCBI nr
Match:
KGN63620.2 (hypothetical protein Csa_013990 [Cucumis sativus])
HSP 1 Score: 1557 bits (4031), Expect = 0.0
Identity = 759/834 (91.01%), Positives = 774/834 (92.81%), Query Frame = 0
Query: 91 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 150
MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 151 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 210
LPSHL SFDFQIVSKDKCGGESCFVIRNHRAFRK GDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLPSFDFQIVSKDKCGGESCFVIRNHRAFRKSGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 211 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 270
WCGAHISWDKTGGSQLFSVPKAGLLPRIQT+EVV++RPIPLNYYQNAVTSSYSFAWWDWK
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTNEVVVQRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 271 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 330
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF+ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFRKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 331 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 390
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYP+AKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPAAKITRLGNW 300
Query: 391 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 450
FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 451 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMK------------------A 510
YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRP QMK A
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKGCHCALLSLLVEIGEIFQA 420
Query: 511 LLHSVPLGRLVVLDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGIL 570
LLHSVPLGRLVVLDL CMLHNFAGNVEMYGIL
Sbjct: 421 LLHSVPLGRLVVLDL-----------------------------CMLHNFAGNVEMYGIL 480
Query: 571 DSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVR 630
DSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVR
Sbjct: 481 DSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLPQYSVR 540
Query: 631 RYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILVLPEGSDQHGILD 690
RYGHLVPSIQDAWDVLYHT+YNCTDGANDKNRDVIVAFPDVDPS+ILVLPEGS++HG LD
Sbjct: 541 RYGHLVPSIQDAWDVLYHTVYNCTDGANDKNRDVIVAFPDVDPSAILVLPEGSNRHGNLD 600
Query: 691 SSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQLSGSNTYRYDLVDLTRQALAKYSN 750
SS+D LQDATFDRPHLWYPTS+VISALKLFI GGDQLS SNTYRYDLVDLTRQALAKYSN
Sbjct: 601 SSVDRLQDATFDRPHLWYPTSEVISALKLFIAGGDQLSSSNTYRYDLVDLTRQALAKYSN 660
Query: 751 ELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLAQSEEE 810
ELFFR VKAYQL+D QTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLA+SEEE
Sbjct: 661 ELFFRIVKAYQLHDVQTMASLSQEFLELVNDIDTLLACHEGFLLGPWLQSAKQLARSEEE 720
Query: 811 EKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYGPRAAIYFKFLKESSENG 870
EKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYY PRAAIY KFLKESSENG
Sbjct: 721 EKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYYCPRAAIYLKFLKESSENG 780
Query: 871 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 906
YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDAL TSHWLYNKYLQIPESSDQ
Sbjct: 781 YRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALDTSHWLYNKYLQIPESSDQ 805
BLAST of IVF0012391 vs. TAIR 10
Match:
AT5G13690.1 (alpha-N-acetylglucosaminidase family / NAGLU family)
HSP 1 Score: 998.8 bits (2581), Expect = 2.8e-291
Identity = 471/786 (59.92%), Positives = 586/786 (74.55%), Query Frame = 0
Query: 122 ISRLLEIQDRERVPAYVQVAAARGVLRRLLPSHLSSFDFQIVSKDKCGGESCFVIRNHRA 181
I LL+ D + VQ +AA+G+L+RLLP+H SF+ +I+SKD CGG SCFVI N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 182 FRKPGDPEILIAGVTGVEVLAGLHWYLKHWCGAHISWDKTGGSQLFSVPKAGLLPRIQTD 241
+ G PEILI G TGVE+ +GLHWYLK+ C AH+SWDKTGG Q+ SVP+ G LPRI +
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 242 EVVIRRPIPLNYYQNAVTSSYSFAWWDWKRWEKEIDWMALQGINMPLAFTGQEAIWRKVF 301
+ IRRP+P NYYQN VTSSYS+ WW W+RWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 302 QNFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 361
+ FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 362 PVLPAFSGNIPAAFKQIYPSAKITRLGNWFTVHSDPRWCCTYLLDAMDPLFVEIGKAFIE 421
PVLP+FSGN+P+A ++IYP A ITRL NW TV D RWCCTYLL+ DPLF+EIG+AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 422 QQQKEYGRTSHVYNCDTFDENTPPVDEVEYISSLGSAIFGGMQAGDSNAVWLMQGWMFSY 481
QQ +EYG +++YNCDTF+ENTPP E EYISSLG+A++ M G+ NAVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 482 D-PFWRPPQMKALLHSVPLGRLVVLDLYAEVKPIWISSEQFYGTPYIWKVSISILLCMLH 541
D FW+PPQ+KALLHSVP G+++VLDLYAEVKPIW S QFYGTPYIW CMLH
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIW--------CMLH 447
Query: 542 NFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV 601
NF GN+EMYG LDSI+SGP++AR S STMVGVGM MEGIEQNPVVY+L SEMAF+ KV
Sbjct: 448 NFGGNIEMYGALDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKV 507
Query: 602 DVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVDPSSILV 661
DV+KWL Y+ RRY I+ AW++LYHT+YNCTDG D N D IV PD DPSS +
Sbjct: 508 DVQKWLKSYARRRYMKENHQIEAAWEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQ 567
Query: 662 LP-EGSDQHGILDSSMDG-----LQDATFDRP--HLWYPTSKVISALKLFIVGGDQLSGS 721
+ D + I + QD T D P HLWY T +VI ALKLF+ GD LS S
Sbjct: 568 DDLKQKDSYMISTGPYETKRRVLFQDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRS 627
Query: 722 NTYRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHE 781
TYRYD+VDLTRQ L+K +N+++ V A+ D ++ LS++FLEL+ D+D LLA +
Sbjct: 628 LTYRYDMVDLTRQVLSKLANQVYTEAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDD 687
Query: 782 GFLLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGD 841
LLG WL+SAK+LA++ +E KQYEWNARTQ+TMW+D+ + S L DY NK+WSGLL D
Sbjct: 688 NCLLGTWLESAKKLAKNGDERKQYEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLED 747
Query: 842 YYGPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDW-QSSRKIYPVESNGDALHTSH 898
YY PRA +YF + +S + F + WRREWI +++ W QSS ++YPV++ GDAL S
Sbjct: 748 YYLPRARLYFNEMLKSLRDKKIFKVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISR 804
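The identity counts reported for each HSP can be checked against the alignment rows themselves. The following is a minimal sketch, not part of any BLAST tool: `column_stats` is a hypothetical helper that counts identical, mismatched, and gapped columns for one Query/Sbjct row pair. The example rows are copied from the XP_008453133.1 alignment above (query positions 511 to 570).

```python
def column_stats(query_row, sbjct_row):
    """Count identical, mismatched and gapped columns in one aligned row pair."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gaps = 0
    for q, s in zip(query_row, sbjct_row):
        if q == "-" or s == "-":
            gaps += 1          # a gap in either sequence
        elif q == s:
            identical += 1     # same residue in both sequences
    mismatches = len(query_row) - identical - gaps
    return {
        "columns": len(query_row),
        "identical": identical,
        "mismatches": mismatches,
        "gaps": gaps,
    }


# Row pair taken from the XP_008453133.1 hit (Query 511-570).
query = "VKPIWISSEQFYGTPYIWKVSISILLCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM"
sbjct = "VKPIWISSEQFYGTPYIW--------CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTM"

print(column_stats(query, sbjct))
# {'columns': 60, 'identical': 52, 'mismatches': 0, 'gaps': 8}
```

As displayed, these 8 gap columns appear to be the only non-identical positions in that hit, which would be consistent with its reported identity of 808/816.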
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9FNA3 | 3.9e-290 | 59.92 | Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1
P54802 | 2.2e-152 | 39.64 | Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2

Match Name | E-value | Identity (%) | Description
A0A1S3BVG2 | 0.0e+00 | 99.02 | alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 ...
A0A5D3BH46 | 0.0e+00 | 90.85 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E56...
A0A5A7UWC6 | 0.0e+00 | 88.93 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C...
A0A6J1C176 | 0.0e+00 | 88.86 | alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC11100744...
A0A6J1ECY3 | 0.0e+00 | 88.60 | alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041...

Match Name | E-value | Identity (%) | Description
AT5G13690.1 | 2.8e-291 | 59.92 | alpha-N-acetylglucosaminidase family / NAGLU family
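The Identity percentages in these tables are the matched-column counts from each HSP summary line divided by the alignment length. As a rough sketch of that arithmetic, the snippet below parses one summary line and recomputes the percentages. The names `HSP_RE`, `parse_hsp_summary`, and `recomputed_pct` are illustrative assumptions, and the pattern only targets the line layout seen in this report.

```python
import re

# Matches summary lines of the form used in this report, e.g.
# "Identity = 726/817 (88.86%), Positives = 766/817 (93.76%), Query Frame = 0".
# "Posi?tives" also accepts the "Postives" spelling found in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts and reported percentages, or None if the line does not match."""
    m = HSP_RE.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of alignment columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


line = "Identity = 726/817 (88.86%), Positives = 766/817 (93.76%), Query Frame = 0"
hsp = parse_hsp_summary(line)
print(hsp["identity_pct"], recomputed_pct(hsp["identical"], hsp["length"]))
# 88.86 88.86
```

For the A0A6J1C176 hit, the recomputed value agrees with the 88.86 shown in the TrEMBL table row.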