Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCGCCTCCTTTTGCTGCCGTTTTTCTCATCTTTCTTTCCATATTCATCACCTTCTCCACTTCCTTTTCCTCCACCATTGGAGTCGGATACATTTCGCGGCTTCTTGATATCCAGGATCGCGAGAGGGCGCCTTCGTCTGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGCCTTCTTCCTTCGCACCTCTCCAGCTTTGACTTTCAAATTCTCTCTAAGGTACTTGGAATTTTAGGAGATTTTAATTTGTTTTGTTCGAGGACGGCGCGGGAGTATAGCTTTATCCGATTCTCGGAAATTTGGGAAATGTAAAAACTCTCTTCGTCTAGTCTGCTCCTACTATCTGTTTCTTATGTTCATTCTTCTCACGGTTTATGTATTCCTCATTTTTCCCAGGACGCATGTGGTGGGGAATCTTGCTTTCTGATCAGGAACCATCGTGCGTTCAGGAGACCGGGGGATCCTGAAATTTTGTAAGTAAAATGGATTCATACACATCATTCATCGCACAATGTTCCTTGTAAATTAAAATGGAGATTGAAGTGGATACTTATAACACGAACACAACTTCCAATCTTTCCTAGAATCGCTGGTGTCACTGGAGTGGAGATTTTAGCTGGTTTGCATTGGTATTTAAAGCACTGGTGTGGTGCACACATATCATGGGATAAAACTGGTGGCTCACAGCTATTTTCTGTACCTAAGCCTGGCTCATTGCCTCTTATTAAAAGTGACGAAATTATTGTTCAGAGGCCTATTCCCTTAAACTATTATCAAAATGCAGTTACATCAAGCTGTAAGGTTTCTCTGAAATTGCATTAGAATACTTCGTGTTCCTCCAATAACAGTGATTTGGTTACTGTTTCTATACATATGTTTCTATTTTAATTTCCTCTGCTTTTCAAAGACTCTTTTGCCTGGTGGGACTGGGAGAGATGGGAGAAGGAAATAGATTGGATGGCTCTTCAGGGTATCAATATGCCTCTGGCATTTACTGGGCAGGAGGCTATCTGGCGGAAAGTATTTCAAGTATGTTTTCCTCTTCTTATCTGCATAATTTTTCCCCTACTGTCGTTCGACATAATGATATATTGAGTAGTACCATAGAAAATATAACTGCGAATGCCATTCTTTCTGCGGTAGAAGATGTTTAAGAAGTCAATTTATCCTGCAAATGCTATAACCCCTGACTTGTCTTCATATTTACGACGATTGGAACAATATTATTGTGAGATCTCACATCAGTCAGAGAGGGGAATAAAGCATTCTTTATAAAGGTGTGGAAACCTGTCCCTAACAGACGTGTTTTAGAAATCTTGAGGGGAAATCCAAAAGTATAATATCATTTCAAGCTTCTTTTGTTCACTGATTTCTTCATCATGCTAGCGTAGCTTTCCTCTCTTTTCTGGCTTTATCTGAATTGTAGTTTACTTGTTTCCTGCATTTTAATTTAATGTCATAAAGTGGCCAAAGCTAAGAAGTTGTCTCTAATAATTTTTTGAGGTTCTTATCTTATTGCAACTTACTTGCTTCAAAGTCTAGATTGATCTCACTCCAGACTATATTTGCAGAAATTTAATATAAGCAACTCAGATTTGGACGATTTCTTTGGAGGTCCAGCCTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAGTGAGTAGTTGTTTTGTAACTGCATTGAAATCAAATCATACTGTCAGATGGTTGCCTAAAATGAAAGATGGAATGATAATATAGAATCATCATCATAACTAGCTACTCGAATATGCAGAGAGTGGCTTAAAAGTATCATGTTTATTAGCTCCACTATTAAAAATCTTCATATAATTTATTATAGTTTCCTTTCGAATATCAACATTCTGATAACAGATGGGGTGGGCCGCTGCCGCAAAGTTGGTTTGATCAACAGCTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTTGGAATGACTCCAGGTATCGAATGTCAGTCAATAGATTGA
TTATGACATGTTCTTGCAGTGCGCAAGAGTTGAAAATATTCAAACTTGATATTCTGTTCCATTAAATATTTTCACTTTGATATTGACCTTTTCTTTTGGAATTGCTGTTTTAAAGCTTCAGTGCTGTTGAAGACTATTTTTTTGTTTTTTACTATAGTTTGACTTCTAATTGCGTTGAAAAGAATCCACAACTTAGAAATATCTTGTGCTGGGAATAATCTGAGGACCATGGTTTTCTTTTTCCTCCCTTTCAGTTCTCCCAGCCTTTTCAGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCGAAGATAACACGCTTAGGAAATTGGTAACTCCCCACGTCCTTATCTGATATCAAATTCATATTTGTTATGGTTAATACTGTTTCTTTACTCTTGGATCTCTTTGACATCTGCTTAGAATATATTCTGATACACTCTGAAACCAGAGTTAGTCAAGTCCATGGATACCAAGGATGTTTTAATTATTGACAATATCAATACGCTATGTCATACAATTGCTCTGCAGGTTTTCTGTTCACAGTGACCCTAGATGGTGCTGCACGTACCTCCTTGATGCCATGGACCCTTTATTTGTTGAAATCGGTAGAGCATTTATTGAGCAACAACTGAAAGGTATTTTGGACCTATGCTAGTGAACTGTATGAGAATATGATATTTCATGCTTTTTCGTTAAATGATACCTGACTGTTCTAATTGTGTCTGAAATTACCTTTTTTTTCGTCAGAATATGGAAGAACTTCCCATGTATACAATTGGTATGCTACTCTTTTCCTCTGATTCCTCGATGGTTAGCTTACTTACATTAGAATGACAGAAAAGCAAACGAGAACAAACGTTTAAAATTGGTGCAGTATTGGTTTGGACAGTCTTTGAATTGCGTCTTTTCTTCAATTTTCAATCATCATTAGAATATCTTTTCACCCTACAGTGATACCTTTGACGAGAACACTCCACCTGTGGATGATGTAGAATACATATCTTCATTAGGTGCAGCTATTTTTGGAGGAATGCAGGCGGGAGATTCTAGTGCTGTCTGGCTTATGCAGGTGATTGTTGGTTTGTTTTGCCATATAAGGGTTTTTATATCAATTCATTAAGGTTCTCATATCACTAAACTAGTGTAACTTGATAATGAAATTTGTTCATCTACAGGGTTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGTTACAAGAATTCTTTCATACCCTCCAATTTTCAATCATCATTAGATCATTGCCATATCAAGGCTTTACTTGTGGAATCCTTGGCTCAGATCCTTTGTTAGTTATAAATGCTAATTCATTATATACTACTACACCTTCTTCTATCTATTCAATCTCAAGGCTTTCTTTCACTCTTGGTTGACGTTGGTGACGCATTTTTCAGGCCCTTTTACATTCTGTGTCTTTGGGAAGGCTAGTAGTCCTTGATTTGTATGCTGAAGTGAAACCGATCTGGATAGCTTCTGAGCAATTTTATGGAGTCCCTTACATCTGGAAAGTCACTATTCCTTTCTTTTGCTCGATTTTAATGTTGAGGTCTATAAACTGGAAGAAATTTTTCTTGATTAAGCATAAAGTTGGCTGATCTTTTTGCTATTTTTATGAATGTCTTAAACTTTTGTTAAATTGTATCAGTTTACAATGTCTCAATAATTCTGCTGTCTTGTCCCAATGTCCTGGACCATATATAGTGCCTTTGGTGTCAGCTCAGGGTTCCAAGACTAAGTTGATTTAGTGTTTCTTTAGCTGTGTGCTATATTTCAATGAATTTATGATGCTCGAAAATTTAGCTATAATGATATCAATTGCCTATTGCAGGTGCATGCTACATAACTTTGCTGGAAATGTCGAAATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTAAGTTTATTATCTACCATTATTTTAAAT
TTAGGTGTTATTATAATGTTTTGGCAGCATTATTTAATTGCAACAACTATACTAATTTCATATTAACTCCAAATTTTTAAAAAATAAATATTGGGTGCTTTTTCTCTCTACCCACAAGTACTATATTTTACATTTCGAGTATATGTATACATATATTTGTAACTCTGAAGAGATGAACTGATTCTCTTTCTTTCGTTTTTCACTGGAAGACTATTATGGAATTTAATATTGATCATTAGATTATGAAAGCAGTATAGCTGATATTGTTGGGGATCATTAGAGGTTGATCATGAGTTTATAAGTAACAAATATATCTCCATTGGTATAAGGCCTTTTGGGAAAACCAAAAGTAAAGCCATGGGAGATTATGCTCAAAGTGGACAATATTATACCATTGTGGAGGTTTGTGATTCCTAATAGACATATCTATGAAGATATTGCATATTTCAATCTCCCTCTCACGTTCTCTTTTTCTAAACGTCTGTTAGGTGTATCATTTATTTGCAATGTTCAAATTTGTGTTCCTTTTTGTCACCTATGCTTTCATTTGAAGGTCGGGGTAGGAATGTCCATGGAAGGAATAGAGCAGAACCCTGTTGTGTATGATCTCATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGGTACATTTTGTGTATTATTCATGTTCCGCTTTTGGAAAGCTGACAAATAACCTCGCATAGAATAATATGGTCCTTTCATTTCCAGAAATGGCTGTATCAATATTCAATAAGACGCTACGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGATGGTGCCTATGTAAGTACAAAAGAACTGTTGTCTGTGCTTACAGTTTCGTTGCTTTAGAATGGTCATTTCTTTTCTTTTCTTTCTGATGCGAATCTGTGAATCATATGTTCAAACATGCCATGCTAACACATCCTGGTCAACATGGAAAAGGACAAAAACAGGGACGTAATTGTGGCATTTCCTGATGTTGATCCATCCTCAATCTCAGTAATACCCGAGGGGTCGGACCGACACGATGCAGGTAGTCTCCAGGATGCAATATTTGAACGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCATTAAAGCTTTTCGTTGCCAGCGGTGATCAACTTTCCGGCAGTAACACTTACAGGTTGACCCACGAATTCAAAATATCCTATTATTGAAGTTTGGATCACTGAATATGCATTCGTTGTTACTCTTAGGTATGACCTTGTGGACTTGACTAGACAAGCTCTAGCCAAATACTCGAACGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTGGATGATGTGCAAACAACGGTGAGCTTAAGCCAGCAGTTCCTTGAACTTGTAAATGATATAGACACATTAGTGGCTTGTCATGAGGGATTTCTTCTAGGACCTTGGCTACAAAGCGCCAAGCAACTCGCCCAAGATGAACAGCAGGAAAAACAGGTTCAGTTTTGAGTCCACTCTACCAATTATGGGACAATTATATTAACTGACATTCGGTATCTGATTAATCTCAAATTACAGTATGAGTGGAACGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTACTTCGTGATTATGGTAATGATCATTACTCTGTGTGGATATATAGTCACTGAGATGTTGAATTGTATTGTATTTGTACATTCAAATTCAGGAAACAAGTACTGGAGTGGTCTGTTGAGCGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGCTTAGAGAATGGGTATGCATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAGCTTACAAATGACTGGCAAAGCAGCAGAAAGGTTTACCCTGTGAAAAGCAATGGTGATGCTGTGGACACATCCCGGTGGCTTTACAACAAATACTTGCAAATGCTTGAAAGCTACGATC
AATGA
mRNA sequence
ATGGCGCCTCCTTTTGCTGCCGTTTTTCTCATCTTTCTTTCCATATTCATCACCTTCTCCACTTCCTTTTCCTCCACCATTGGAGTCGGATACATTTCGCGGCTTCTTGATATCCAGGATCGCGAGAGGGCGCCTTCGTCTGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGCCTTCTTCCTTCGCACCTCTCCAGCTTTGACTTTCAAATTCTCTCTAAGGACGCATGTGGTGGGGAATCTTGCTTTCTGATCAGGAACCATCGTGCGTTCAGGAGACCGGGGGATCCTGAAATTTTAATCGCTGGTGTCACTGGAGTGGAGATTTTAGCTGGTTTGCATTGGTATTTAAAGCACTGGTGTGGTGCACACATATCATGGGATAAAACTGGTGGCTCACAGCTATTTTCTGTACCTAAGCCTGGCTCATTGCCTCTTATTAAAAGTGACGAAATTATTGTTCAGAGGCCTATTCCCTTAAACTATTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGGAGAGATGGGAGAAGGAAATAGATTGGATGGCTCTTCAGGGTATCAATATGCCTCTGGCATTTACTGGGCAGGAGGCTATCTGGCGGAAAGTATTTCAAAAATTTAATATAAGCAACTCAGATTTGGACGATTTCTTTGGAGGTCCAGCCTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCGCTGCCGCAAAGTTGGTTTGATCAACAGCTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTTGGAATGACTCCAGTTCTCCCAGCCTTTTCAGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCGAAGATAACACGCTTAGGAAATTGGTTTTCTGTTCACAGTGACCCTAGATGGTGCTGCACGTACCTCCTTGATGCCATGGACCCTTTATTTGTTGAAATCGGTAGAGCATTTATTGAGCAACAACTGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTGGATGATGTAGAATACATATCTTCATTAGGTGCAGCTATTTTTGGAGGAATGCAGGCGGGAGATTCTAGTGCTGTCTGGCTTATGCAGGGTTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTGTCTTTGGGAAGGCTAGTAGTCCTTGATTTGTATGCTGAAGTGAAACCGATCTGGATAGCTTCTGAGCAATTTTATGGAGTCCCTTACATCTGGAAAGTCACTATTCCTTTCTTTTGCTCGATTTTAATGTTGAGTTTACAATGTCTCAATAATTCTGCTGTCTTGTCCCAATGTCCTGGACCATATATAGTGCCTTTGGTGTCAGCTCAGGGTTCCAAGACTAAGTGCATGCTACATAACTTTGCTGGAAATGTCGAAATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTCGGGGTAGGAATGTCCATGGAAGGAATAGAGCAGAACCCTGTTGTGTATGATCTCATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTGTATCAATATTCAATAAGACGCTACGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGATGGTGCCTATGACAAAAACAGGGACGTAATTGTGGCATTTCCTGATGTTGATCCATCCTCAATCTCAGTAATACCCGAGGGGTCGGACCGACACGATGCAGGTAGTCTCCAGGATGCAATATTTGAACGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCATTAAAGCTTTTCGTTGCCAGCGGTGATCAACTTTCCGGCAGTAACACTTACAGGTATGACCTTGTGGACTTGACTAGACAAGC
TCTAGCCAAATACTCGAACGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTGGATGATGTGCAAACAACGGTGAGCTTAAGCCAGCAGTTCCTTGAACTTGTAAATGATATAGACACATTAGTGGCTTGTCATGAGGGATTTCTTCTAGGACCTTGGCTACAAAGCGCCAAGCAACTCGCCCAAGATGAACAGCAGGAAAAACAGTATGAGTGGAACGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTACTTCGTGATTATGGAAACAAGTACTGGAGTGGTCTGTTGAGCGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGCTTAGAGAATGGGTATGCATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAGCTTACAAATGACTGGCAAAGCAGCAGAAAGGTTTACCCTGTGAAAAGCAATGGTGATGCTGTGGACACATCCCGGTGGCTTTACAACAAATACTTGCAAATGCTTGAAAGCTACGATCAATGA
Coding sequence (CDS)
ATGGCGCCTCCTTTTGCTGCCGTTTTTCTCATCTTTCTTTCCATATTCATCACCTTCTCCACTTCCTTTTCCTCCACCATTGGAGTCGGATACATTTCGCGGCTTCTTGATATCCAGGATCGCGAGAGGGCGCCTTCGTCTGTCCAAGTTGCTGCGGCTCGTGGAGTTCTTCGTCGCCTTCTTCCTTCGCACCTCTCCAGCTTTGACTTTCAAATTCTCTCTAAGGACGCATGTGGTGGGGAATCTTGCTTTCTGATCAGGAACCATCGTGCGTTCAGGAGACCGGGGGATCCTGAAATTTTAATCGCTGGTGTCACTGGAGTGGAGATTTTAGCTGGTTTGCATTGGTATTTAAAGCACTGGTGTGGTGCACACATATCATGGGATAAAACTGGTGGCTCACAGCTATTTTCTGTACCTAAGCCTGGCTCATTGCCTCTTATTAAAAGTGACGAAATTATTGTTCAGAGGCCTATTCCCTTAAACTATTATCAAAATGCAGTTACATCAAGCTACTCTTTTGCCTGGTGGGACTGGGAGAGATGGGAGAAGGAAATAGATTGGATGGCTCTTCAGGGTATCAATATGCCTCTGGCATTTACTGGGCAGGAGGCTATCTGGCGGAAAGTATTTCAAAAATTTAATATAAGCAACTCAGATTTGGACGATTTCTTTGGAGGTCCAGCCTTTCTTGCATGGTCACGTATGGGAAATTTGCACAAATGGGGTGGGCCGCTGCCGCAAAGTTGGTTTGATCAACAGCTTATTCTGCAGAAAAAAGTTATTGGCAGAATGTTTGAGCTTGGAATGACTCCAGTTCTCCCAGCCTTTTCAGGTAATATTCCTGCTGCTTTCAAACAAATATATCCATCAGCGAAGATAACACGCTTAGGAAATTGGTTTTCTGTTCACAGTGACCCTAGATGGTGCTGCACGTACCTCCTTGATGCCATGGACCCTTTATTTGTTGAAATCGGTAGAGCATTTATTGAGCAACAACTGAAAGAATATGGAAGAACTTCCCATGTATACAATTGTGATACCTTTGACGAGAACACTCCACCTGTGGATGATGTAGAATACATATCTTCATTAGGTGCAGCTATTTTTGGAGGAATGCAGGCGGGAGATTCTAGTGCTGTCTGGCTTATGCAGGGTTGGATGTTTTCATATGATCCATTCTGGAGGCCTCAACAAATGAAGGCCCTTTTACATTCTGTGTCTTTGGGAAGGCTAGTAGTCCTTGATTTGTATGCTGAAGTGAAACCGATCTGGATAGCTTCTGAGCAATTTTATGGAGTCCCTTACATCTGGAAAGTCACTATTCCTTTCTTTTGCTCGATTTTAATGTTGAGTTTACAATGTCTCAATAATTCTGCTGTCTTGTCCCAATGTCCTGGACCATATATAGTGCCTTTGGTGTCAGCTCAGGGTTCCAAGACTAAGTGCATGCTACATAACTTTGCTGGAAATGTCGAAATGTATGGCATTTTAGATTCAATAGCATCTGGACCAATTGAAGCTCGTAGTAGTCCATACTCGACAATGGTCGGGGTAGGAATGTCCATGGAAGGAATAGAGCAGAACCCTGTTGTGTATGATCTCATGTCTGAAATGGCTTTTCAACACAACAAAGTTGATGTCAAGAAATGGCTGTATCAATATTCAATAAGACGCTACGGTCATTTAGTCCCTTCAATACAAGATGCCTGGGATGTATTATATCACACCATCTACAACTGCACTGATGGTGCCTATGACAAAAACAGGGACGTAATTGTGGCATTTCCTGATGTTGATCCATCCTCAATCTCAGTAATACCCGAGGGGTCGGACCGACACGATGCAGGTAGTCTCCAGGATGCAATATTTGAACGACCTCATCTTTGGTATCCTACTTCTGAAGTAATTCGTGCATTAAAGCTTTTCGTTGCCAGCGGTGATCAACTTTCCGGCAGTAACACTTACAGGTATGACCTTGTGGACTTGACTAGACAAGC
TCTAGCCAAATACTCGAACGAACTGTTCTTTAGAATTGTCAAAGCATATCAGTTGGATGATGTGCAAACAACGGTGAGCTTAAGCCAGCAGTTCCTTGAACTTGTAAATGATATAGACACATTAGTGGCTTGTCATGAGGGATTTCTTCTAGGACCTTGGCTACAAAGCGCCAAGCAACTCGCCCAAGATGAACAGCAGGAAAAACAGTATGAGTGGAACGCAAGAACTCAAATAACCATGTGGTTTGACAACACAGAGGAGGAAGCAAGTTTACTTCGTGATTATGGAAACAAGTACTGGAGTGGTCTGTTGAGCGATTACTACGGTCCTCGAGCTGCAATATACTTCAAGTTCTTGAAAGAAAGCTTAGAGAATGGGTATGCATTTCCATTGAGTAATTGGAGGAGAGAGTGGATAAAGCTTACAAATGACTGGCAAAGCAGCAGAAAGGTTTACCCTGTGAAAAGCAATGGTGATGCTGTGGACACATCCCGGTGGCTTTACAACAAATACTTGCAAATGCTTGAAAGCTACGATCAATGA
Protein sequence
MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRLLPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAEVKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSKTKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVDPSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQMLESYDQ
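The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation sketch. This is only an illustration, assuming the standard nuclear genetic code and reading frame 0; the names `CODON_TABLE` and `translate` are hypothetical helpers and are not part of the source database.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the CDS listed above.
print(translate("ATGGCGCCTCCTTTTGCTGCC"))
```

Applied to the first seven codons of the CDS, this yields `MAPPFAA`, which matches the start of the protein sequence. Pasting the full CDS (with the line break removed) should reproduce the complete polypeptide, provided the standard code assumption holds for this gene.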
Homology
BLAST of Csor.00g156120 vs. ExPASy Swiss-Prot
Match:
Q9FNA3 (Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1)
HSP 1 Score: 984.9 bits (2545), Expect = 5.5e-286
Identity = 472/822 (57.42%), Positives = 585/822 (71.17%), Query Frame = 0
Query: 32 ISRLLDIQDRERAPSSVQVAAARGVLRRLLPSHLSSFDFQILSKDACGGESCFLIRNHRA 91
I LLD D SSVQ +AA+G+L+RLLP+H SF+ +I+SKDACGG SCF+I N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 92 FRRPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKPGSLPLIKSD 151
R G PEILI G TGVEI +GLHWYLK+ C AH+SWDKTGG Q+ SVP+PG LP I S
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 152 EIIVQRPIPLNYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINMPLAFTGQEAIWRKVF 211
I ++RP+P NYYQN VTSSYS+ WW WERWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 212 QKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 271
++FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 272 PVLPAFSGNIPAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDAMDPLFVEIGRAFIE 331
PVLP+FSGN+P+A ++IYP A ITRL NW +V D RWCCTYLL+ DPLF+EIG AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 332 QQLKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGAAIFGGMQAGDSSAVWLMQGWMFSY 391
QQ +EYG +++YNCDTF+ENTPP + EYISSLGAA++ M G+ +AVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 392 D-PFWRPQQMKALLHSVSLGRLVVLDLYAEVKPIWIASEQFYGVPYIWKVTIPFFCSILM 451
D FW+P Q+KALLHSV G+++VLDLYAEVKPIW S QFYG PYIW
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIW------------ 447
Query: 452 LSLQCLNNSAVLSQCPGPYIVPLVSAQGSKTKCMLHNFAGNVEMYGILDSIASGPIEARS 511
CMLHNF GN+EMYG LDSI+SGP++AR
Sbjct: 448 --------------------------------CMLHNFGGNIEMYGALDSISSGPVDARV 507
Query: 512 SPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDA 571
S STMVGVGM MEGIEQNPVVY+L SEMAF+ KVDV+KWL Y+ RRY I+ A
Sbjct: 508 SKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAA 567
Query: 572 WDVLYHTIYNCTDGAYDKNRDVIVAFPDVDPSS-------------ISVIPEGSDRHDAG 631
W++LYHT+YNCTDG D N D IV PD DPSS IS P + R
Sbjct: 568 WEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQDDLKQKDSYMISTGPYETKRRVLF 627
Query: 632 SLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDLVDLTRQALAKYSNELFF 691
+ A + HLWY T EVI+ALKLF+ +GD LS S TYRYD+VDLTRQ L+K +N+++
Sbjct: 628 QDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYT 687
Query: 692 RIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQY 751
V A+ D+ + LS++FLEL+ D+D L+A + LLG WL+SAK+LA++ + KQY
Sbjct: 688 EAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQY 747
Query: 752 EWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFP 811
EWNARTQ+TMW+D+ + S L DY NK+WSGLL DYY PRA +YF + +SL + F
Sbjct: 748 EWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFK 804
Query: 812 LSNWRREWIKLTNDW-QSSRKVYPVKSNGDAVDTSRWLYNKY 839
+ WRREWI +++ W QSS +VYPVK+ GDA+ SR L +KY
Sbjct: 808 VEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
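The percentages in an HSP summary line follow directly from the counts beside them: matches divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the counts reported for the Q9FNA3 hit above (`percent` is a hypothetical helper name, not part of BLAST):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the Q9FNA3 HSP summary line.
print(percent(472, 822))  # identity
print(percent(585, 822))  # positives
```

These evaluate to 57.42 and 71.17, in agreement with the reported values. The same calculation applies to the other hits in this report.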
BLAST of Csor.00g156120 vs. ExPASy Swiss-Prot
Match:
P54802 (Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2)
HSP 1 Score: 529.6 bits (1363), Expect = 6.3e-149
Identity = 280/747 (37.48%), Positives = 421/747 (56.36%), Query Frame = 0
Query: 96 GDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIV 155
G + + G TGV AGLH YL+ +CG H++W GSQL +P+P LP + E+
Sbjct: 71 GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAW---SGSQL-RLPRP--LPAVPG-ELTE 130
Query: 156 QRPIPLNYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFN 215
P YYQN T SYSF WWDW RWE+EIDWMAL GIN+ LA++GQEAIW++V+
Sbjct: 131 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVYLALG 190
Query: 216 ISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLP 275
++ +++++FF GPAFLAW RMGNLH W GPLP SW +QL LQ +V+ +M GMTPVLP
Sbjct: 191 LTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLP 250
Query: 276 AFSGNIPAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLK 335
AF+G++P A +++P +T++G+W H + + C++LL DP+F IG F+ + +K
Sbjct: 251 AFAGHVPEAVTRVFPQVNVTKMGSW--GHFNCSYSCSFLLAPEDPIFPIIGSLFLRELIK 310
Query: 336 EYGRTSHVYNCDTFDENTPPVDDVEYISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDP-F 395
E+G T H+Y DTF+E PP + Y+++ A++ M A D+ AVWL+QGW+F + P F
Sbjct: 311 EFG-TDHIYGADTFNEMQPPSSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQF 370
Query: 396 WRPQQMKALLHSVSLGRLVVLDLYAEVKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQ 455
W P Q++A+L +V GRL+VLDL+AE +P++ + F G P+IW
Sbjct: 371 WGPAQIRAVLGAVPRGRLLVLDLFAESQPVYTRTASFQGQPFIW---------------- 430
Query: 456 CLNNSAVLSQCPGPYIVPLVSAQGSKTKCMLHNFAGNVEMYGILDSIASGPIEARSSPYS 515
CMLHNF GN ++G L+++ GP AR P S
Sbjct: 431 ----------------------------CMLHNFGGNHGLFGALEAVNGGPEAARLFPNS 490
Query: 516 TMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKV-DVKKWLYQYSIRRYGHLVPSIQDAWDV 575
TMVG GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++ RRYG P AW +
Sbjct: 491 TMVGTGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRL 550
Query: 576 LYHTIYNCT-DGAYDKNRDVIVAFPDVDPSSISVIPEGSDRHDAGSLQDAIFERPHLWYP 635
L ++YNC+ + NR +V P + ++ +WY
Sbjct: 551 LLRSVYNCSGEACRGHNRSPLVRRPSLQMNT------------------------SIWYN 610
Query: 636 TSEVIRALKLFVASGDQLSGSNTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTV 695
S+V A +L + S L+ S +RYDL+DLTRQA+ + + + AY ++ + +
Sbjct: 611 RSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELASLL 670
Query: 696 SLSQQF-LELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDN 755
EL+ +D ++A FLLG WL+ A+ A E + YE N+R Q+T+W
Sbjct: 671 RAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW--- 730
Query: 756 TEEEASLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTND 815
E ++L DY NK +GL+++YY PR ++ + L +S+ G F + + +L
Sbjct: 731 -GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQA 734
Query: 816 WQSSRKVYPVKSNGDAVDTSRWLYNKY 839
+ S++ YP + GD VD ++ ++ KY
Sbjct: 791 FVLSKQRYPSQPRGDTVDLAKKIFLKY 734
BLAST of Csor.00g156120 vs. NCBI nr
Match:
KAG6587494.1 (Alpha-N-acetylglucosaminidase, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1746 bits (4521), Expect = 0.0
Identity = 847/847 (100.00%), Positives = 847/847 (100.00%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK
Sbjct: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL
Sbjct: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
Query: 841 MLESYDQ 847
MLESYDQ
Sbjct: 841 MLESYDQ 847
BLAST of Csor.00g156120 vs. NCBI nr
Match:
XP_023529905.1 (alpha-N-acetylglucosaminidase-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1628 bits (4215), Expect = 0.0
Identity = 796/847 (93.98%), Positives = 800/847 (94.45%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAVFLIFLSIF TFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVFLIFLSIFTTFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPKPGSLPLI+SDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPPSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIW
Sbjct: 421 VKPIWIASEQFYGVPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGM MEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMCMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLF+ASGDQLSGSNTYRYDL
Sbjct: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFIASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTE+EASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEQEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 803
Query: 841 MLESYDQ 847
+LESYDQ
Sbjct: 841 VLESYDQ 803
BLAST of Csor.00g156120 vs. NCBI nr
Match:
XP_022924603.1 (alpha-N-acetylglucosaminidase-like [Cucurbita moschata])
HSP 1 Score: 1618 bits (4190), Expect = 0.0
Identity = 792/847 (93.51%), Positives = 796/847 (93.98%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAV LIFLSIF TFSTSFSSTIG YISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVCLIFLSIFTTFSTSFSSTIGFVYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPKPGSLPLI+SDEIIV+RPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVRRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIW
Sbjct: 421 VKPIWIASEQFYGVPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSISVIPEGSDRHD GSLQDAIFERPHLWYPTSEVIRALKLF+ASGDQLSGSNTYRYDL
Sbjct: 601 PSSISVIPEGSDRHDTGSLQDAIFERPHLWYPTSEVIRALKLFIASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKY Q
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYFQ 803
Query: 841 MLESYDQ 847
+LESYDQ
Sbjct: 841 VLESYDQ 803
BLAST of Csor.00g156120 vs. NCBI nr
Match:
XP_022972296.1 (alpha-N-acetylglucosaminidase-like [Cucurbita maxima])
HSP 1 Score: 1601 bits (4146), Expect = 0.0
Identity = 786/847 (92.80%), Positives = 790/847 (93.27%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAVFLIFLSIF TFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVFLIFLSIFTTFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFS PKPGSLPLIKSDEIIV+RPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSAPKPGSLPLIKSDEIIVKRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSV SDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVQSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIW
Sbjct: 421 VKPIWIASEQFYGVPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGH VPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHSVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSI EGSDRHDAG LQDAIFERPHLWYPTSEVIRALKLF+ASGDQLSGSNTYRYDL
Sbjct: 601 PSSI----EGSDRHDAGRLQDAIFERPHLWYPTSEVIRALKLFIASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDD+ TTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDLNTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWR WIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRSGWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 799
Query: 841 MLESYDQ 847
+LESYDQ
Sbjct: 841 VLESYDQ 799
BLAST of Csor.00g156120 vs. NCBI nr
Match:
XP_038880130.1 (alpha-N-acetylglucosaminidase-like [Benincasa hispida])
HSP 1 Score: 1500 bits (3883), Expect = 0.0
Identity = 729/854 (85.36%), Positives = 765/854 (89.58%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MA PF+++FLIF+SIF FSTS SSTIGVGYISRLL+IQDRERAP+ VQVAAARGVL RL
Sbjct: 1 MASPFSSIFLIFVSIFAAFSTSRSSTIGVGYISRLLEIQDRERAPAYVQVAAARGVLHRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI+SKD CGGESCF+IRNHRAFR+PGDPEILIAGVTGVEILAGLHWYLK+
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEILAGLHWYLKN 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPK G LP I++DEI++QRP+PLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEIVIQRPVPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVF KFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFHKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFK IYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKHIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDA DPLFVEIG+AFIEQQ KEYGRTSH+YNCDTFDENTPPVD+VE
Sbjct: 301 FSVHSDPRWCCTYLLDATDPLFVEIGKAFIEQQQKEYGRTSHIYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSV LGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKP+WI+SEQFYG PYIW
Sbjct: 421 VKPVWISSEQFYGTPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGA DKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRH-------DAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGS 660
PSSI V+PEGS+RH D+ L DA+F+RPHLWYPTSEV RALKLF+A GDQLSGS
Sbjct: 601 PSSILVLPEGSERHGNLDSRVDSLRLGDAMFDRPHLWYPTSEVTRALKLFIAGGDQLSGS 660
Query: 661 NTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHE 720
NTYRYDLVDLTRQALAKYSNELFFRIVKAYQL D QT +LSQ+FLELVNDIDTL+ACHE
Sbjct: 661 NTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQTMANLSQEFLELVNDIDTLLACHE 720
Query: 721 GFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSD 780
GFLLGPWLQSAKQLAQ E++EKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLL D
Sbjct: 721 GFLLGPWLQSAKQLAQIEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGD 780
Query: 781 YYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRW 840
YYGPRAAIYFKFLKES ENGY F LSNWRREWIKLTNDWQSSRKVYPV+SNGDA+DTS
Sbjct: 781 YYGPRAAIYFKFLKESSENGYRFQLSNWRREWIKLTNDWQSSRKVYPVESNGDALDTSHC 810
Query: 841 LYNKYLQMLESYDQ 847
LY KYLQ LES+DQ
Sbjct: 841 LYYKYLQRLESFDQ 810
BLAST of Csor.00g156120 vs. ExPASy TrEMBL
Match:
A0A6J1ECY3 (alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041 PE=4 SV=1)
HSP 1 Score: 1618 bits (4190), Expect = 0.0
Identity = 792/847 (93.51%), Positives = 796/847 (93.98%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAV LIFLSIF TFSTSFSSTIG YISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVCLIFLSIFTTFSTSFSSTIGFVYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPKPGSLPLI+SDEIIV+RPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIQSDEIIVRRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKV GRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVTGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIW
Sbjct: 421 VKPIWIASEQFYGVPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSISVIPEGSDRHD GSLQDAIFERPHLWYPTSEVIRALKLF+ASGDQLSGSNTYRYDL
Sbjct: 601 PSSISVIPEGSDRHDTGSLQDAIFERPHLWYPTSEVIRALKLFIASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKY Q
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYFQ 803
Query: 841 MLESYDQ 847
+LESYDQ
Sbjct: 841 VLESYDQ 803
BLAST of Csor.00g156120 vs. ExPASy TrEMBL
Match:
A0A6J1I5L2 (alpha-N-acetylglucosaminidase-like OS=Cucurbita maxima OX=3661 GN=LOC111470873 PE=4 SV=1)
HSP 1 Score: 1601 bits (4146), Expect = 0.0
Identity = 786/847 (92.80%), Positives = 790/847 (93.27%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MAPPFAAVFLIFLSIF TFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL
Sbjct: 1 MAPPFAAVFLIFLSIFTTFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFS PKPGSLPLIKSDEIIV+RPIPLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSAPKPGSLPLIKSDEIIVKRPIPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSV SDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE
Sbjct: 301 FSVQSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWIASEQFYGVPYIW
Sbjct: 421 VKPIWIASEQFYGVPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWLYQYSIRRYGH VPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLYQYSIRRYGHSVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRHDAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDL 660
PSSI EGSDRHDAG LQDAIFERPHLWYPTSEVIRALKLF+ASGDQLSGSNTYRYDL
Sbjct: 601 PSSI----EGSDRHDAGRLQDAIFERPHLWYPTSEVIRALKLFIASGDQLSGSNTYRYDL 660
Query: 661 VDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
VDLTRQALAKYSNELFFRIVKAYQLDD+ TTVSLSQQFLELVNDIDTLVACHEGFLLGPW
Sbjct: 661 VDLTRQALAKYSNELFFRIVKAYQLDDLNTTVSLSQQFLELVNDIDTLVACHEGFLLGPW 720
Query: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA
Sbjct: 721 LQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAA 780
Query: 781 IYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 840
IYFKFLKESLENGYAFPLSNWR WIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ
Sbjct: 781 IYFKFLKESLENGYAFPLSNWRSGWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQ 799
Query: 841 MLESYDQ 847
+LESYDQ
Sbjct: 841 VLESYDQ 799
BLAST of Csor.00g156120 vs. ExPASy TrEMBL
Match:
A0A1S3BVG2 (alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 SV=1)
HSP 1 Score: 1494 bits (3868), Expect = 0.0
Identity = 724/852 (84.98%), Positives = 761/852 (89.32%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MA F++ FLIF++IF FSTS SSTIGV YISRLL+IQDRER P+ VQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIFVTIFAAFSTSRSSTIGVEYISRLLEIQDRERVPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI+SKD CGGESCF+IRNHRAFR+PGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPK G LP I++DE++++RPIPLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQ FNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQNFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
F+VHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQ KEYGRTSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGRTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRP QMKALLHSV LGRLVVLDLYAE
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPPQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWI+SEQFYG PYIW
Sbjct: 421 VKPIWISSEQFYGTPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWL QYS+RRYGHLVPSIQDAWDVLYHTIYNCTDGA DKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLPQYSVRRYGHLVPSIQDAWDVLYHTIYNCTDGANDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRH-----DAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNT 660
PSSI V+PEGSD+H LQDA F+RPHLWYPTS+VI ALKLF+ GDQLSGSNT
Sbjct: 601 PSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQLSGSNT 660
Query: 661 YRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGF 720
YRYDLVDLTRQALAKYSNELFFR VKAYQL D QT SLSQ+FLELVNDIDTL+ACHEGF
Sbjct: 661 YRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHEGF 720
Query: 721 LLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYY 780
LLGPWLQSAKQLAQ E++EKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLL DYY
Sbjct: 721 LLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLGDYY 780
Query: 781 GPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLY 840
GPRAAIYFKFLKES ENGY FPLSNWRREWIKLTNDWQSSRK+YPV+SNGDA+ TS WLY
Sbjct: 781 GPRAAIYFKFLKESSENGYRFPLSNWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLY 808
Query: 841 NKYLQMLESYDQ 847
NKYLQ+ ES DQ
Sbjct: 841 NKYLQIPESSDQ 808
BLAST of Csor.00g156120 vs. ExPASy TrEMBL
Match:
A0A6J1C176 (alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC111007441 PE=4 SV=1)
HSP 1 Score: 1493 bits (3866), Expect = 0.0
Identity = 727/853 (85.23%), Positives = 760/853 (89.10%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MA PF A+FLIF+S+F FSTS STIGVGYISRLL+IQDRERAP+ VQVAAARGVLRRL
Sbjct: 1 MASPFPAIFLIFVSLFAAFSTSRFSTIGVGYISRLLEIQDRERAPAHVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI+SKD CG ESCF+IRNHR+FRRPGDPEILIAGVTGVEILAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQIVSKDKCGRESCFVIRNHRSFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPK G LP I+S+EIIVQRP+PLNYYQNAVTSSYSFAWWDWE
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQSNEIIVQRPVPLNYYQNAVTSSYSFAWWDWE 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RW+KEIDWMALQGINMPLAFTGQEAIW+KVFQKFNISN+DLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWKKEIDWMALQGINMPLAFTGQEAIWQKVFQKFNISNTDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGG LPQSWFDQQLILQKKV+ RMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGSLPQSWFDQQLILQKKVLARMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
FSVHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQLKEYGRTSH+YNCDTFDENTPPVD E
Sbjct: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQLKEYGRTSHLYNCDTFDENTPPVDAAE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLGAAIFGGMQAGDS AVWLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDLYAE
Sbjct: 361 YISSLGAAIFGGMQAGDSDAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDLYAE 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
VKPIWI+SEQFYG PYIW
Sbjct: 421 VKPIWISSEQFYGTPYIW------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEAR+SPYSTMVGVGMSMEGIEQNPVVYDLMSEMA
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARNSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQHNKVDVKKWL QYSIRRYG LVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD
Sbjct: 541 FQHNKVDVKKWLDQYSIRRYGQLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRH-------DAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGS 660
PSSI +PEGSDR GSL A F+RPHLWY TSEVIRALKLF+A DQLSGS
Sbjct: 601 PSSILELPEGSDRDRYRNFNSSVGSLLHATFDRPHLWYSTSEVIRALKLFIAGSDQLSGS 660
Query: 661 NTYRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHE 720
NTYRYDLVDLTRQALAKYSNELFFRIVKAYQL D Q SLSQQFLELV DIDTL+ACHE
Sbjct: 661 NTYRYDLVDLTRQALAKYSNELFFRIVKAYQLYDAQKMASLSQQFLELVKDIDTLLACHE 720
Query: 721 GFLLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSD 780
GFLLGPWL+SAKQLAQDE+QEKQYEWNARTQITMWFDNTE+EASLLRDYGNKYWSGLL D
Sbjct: 721 GFLLGPWLESAKQLAQDEEQEKQYEWNARTQITMWFDNTEDEASLLRDYGNKYWSGLLGD 780
Query: 781 YYGPRAAIYFKFLKESLENGYAFPLSNWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRW 840
YYGPRAAIYFKFLKESLENGY FPLSNWRREWIKLTNDWQ+SRKV+PV+ +GDA+DTSRW
Sbjct: 781 YYGPRAAIYFKFLKESLENGYGFPLSNWRREWIKLTNDWQNSRKVFPVEISGDAIDTSRW 809
Query: 841 LYNKYLQMLESYD 846
LY KY+Q+LESYD
Sbjct: 841 LYRKYMQILESYD 809
BLAST of Csor.00g156120 vs. ExPASy TrEMBL
Match:
A0A5D3BH46 (Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G002030 PE=4 SV=1)
HSP 1 Score: 1408 bits (3645), Expect = 0.0
Identity = 699/888 (78.72%), Positives = 736/888 (82.88%), Query Frame = 0
Query: 1 MAPPFAAVFLIFLSIFITFSTSFSSTIGVGYISRLLDIQDRERAPSSVQVAAARGVLRRL 60
MA F++ FLI ++IF FSTS SSTIGV YISRLL+IQDRERAP+ VQVAAARGVLRRL
Sbjct: 1 MASFFSSTFLIIVTIFAAFSTSRSSTIGVEYISRLLEIQDRERAPAYVQVAAARGVLRRL 60
Query: 61 LPSHLSSFDFQILSKDACGGESCFLIRNHRAFRRPGDPEILIAGVTGVEILAGLHWYLKH 120
LPSHLSSFDFQI D CGGESCF+IRNHRAFR+PGDPEILIAGVTGVE+LAGLHWYLKH
Sbjct: 61 LPSHLSSFDFQI---DKCGGESCFVIRNHRAFRKPGDPEILIAGVTGVEVLAGLHWYLKH 120
Query: 121 WCGAHISWDKTGGSQLFSVPKPGSLPLIKSDEIIVQRPIPLNYYQNAVTSSYSFAWWDWE 180
WCGAHISWDKTGGSQLFSVPK G LP I++DE++++RPIPLNYYQNAVTSSYSFAWWDW+
Sbjct: 121 WCGAHISWDKTGGSQLFSVPKAGLLPRIQTDEVVIRRPIPLNYYQNAVTSSYSFAWWDWK 180
Query: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH
Sbjct: 181 RWEKEIDWMALQGINMPLAFTGQEAIWRKVFQKFNISNSDLDDFFGGPAFLAWSRMGNLH 240
Query: 241 KWGGPLPQSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
KWGGPLP SWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW
Sbjct: 241 KWGGPLPHSWFDQQLILQKKVIGRMFELGMTPVLPAFSGNIPAAFKQIYPSAKITRLGNW 300
Query: 301 FSVHSDPRWCCTYLLDAMDPLFVEIGRAFIEQQLKEYGRTSHVYNCDTFDENTPPVDDVE 360
F+VHSDPRWCCTYLLDAMDPLFVEIG+AFIEQQ KEYG+TSHVYNCDTFDENTPPVD+VE
Sbjct: 301 FTVHSDPRWCCTYLLDAMDPLFVEIGKAFIEQQQKEYGKTSHVYNCDTFDENTPPVDEVE 360
Query: 361 YISSLGAAIFGGMQAGDSSAVWLMQGWMFSYDPFWRPQQMKALLHSVSLGRLVVLDLYAE 420
YISSLG+AIFGGMQAGDS+AVWLMQGWMFSYDPFWRPQQMKALLHSV LGRLVVLDL
Sbjct: 361 YISSLGSAIFGGMQAGDSNAVWLMQGWMFSYDPFWRPQQMKALLHSVPLGRLVVLDL--- 420
Query: 421 VKPIWIASEQFYGVPYIWKVTIPFFCSILMLSLQCLNNSAVLSQCPGPYIVPLVSAQGSK 480
Sbjct: 421 ------------------------------------------------------------ 480
Query: 481 TKCMLHNFAGNVEMYGILDSIASGPIEARSSPYSTMVGVGMSMEGIEQNPVVYDLMSEMA 540
CMLHNFAGNVEMYGILDSIASGPIEARSS YSTMVGVGMSMEGIEQNPVVYDLMSEM
Sbjct: 481 --CMLHNFAGNVEMYGILDSIASGPIEARSSQYSTMVGVGMSMEGIEQNPVVYDLMSEMG 540
Query: 541 FQHNKVDVKKWLYQYSIRRYGHLVPSIQDAWDVLYHTIYNCTDGAYDKNRDVIVAFPDVD 600
FQ NKVDVKKWL QYS+RRYGHLVPSIQDAWD+LYHTIYNCTDGA DKNRDVIVAFPDVD
Sbjct: 541 FQRNKVDVKKWLPQYSVRRYGHLVPSIQDAWDILYHTIYNCTDGANDKNRDVIVAFPDVD 600
Query: 601 PSSISVIPEGSDRH-----DAGSLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNT 660
PSSI V+PEGSD+H LQDA F+RPHLWYPTS+VI ALKLF+ GDQLSGSNT
Sbjct: 601 PSSILVLPEGSDQHGILDSSMDGLQDATFDRPHLWYPTSKVISALKLFIVGGDQLSGSNT 660
Query: 661 YRYDLVDLTRQALAKYSNELFFRIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGF 720
YRYDLVDLTRQALAKYSNELFFR VKAYQL D QT SLSQ+FLELVNDIDTL+ACHEGF
Sbjct: 661 YRYDLVDLTRQALAKYSNELFFRTVKAYQLYDAQTMASLSQEFLELVNDIDTLLACHEGF 720
Query: 721 LLGPWLQSAKQLAQDEQQEKQYEWNARTQITMWFDNTEEEASLLRDYGN----------- 780
LLGPWLQSAKQLAQ E++EKQYEWNARTQITMWFDNTEEEASLLRDYGN
Sbjct: 721 LLGPWLQSAKQLAQSEEEEKQYEWNARTQITMWFDNTEEEASLLRDYGNDNSGPGLNSIS 780
Query: 781 -------------------------KYWSGLLSDYYGPRAAIYFKFLKESLENGYAFPLS 840
KYWSGLL DYYGPRAAIYFKFLKES ENGY FPLS
Sbjct: 781 IDCHLSSRLGNCTFKFDLFNLDPGNKYWSGLLGDYYGPRAAIYFKFLKESSENGYRFPLS 820
Query: 841 NWRREWIKLTNDWQSSRKVYPVKSNGDAVDTSRWLYNKYLQMLESYDQ 847
NWRREWIKLTNDWQSSRK+YPV+SNGDA+ TS WLYNKYLQ+ ES DQ
Sbjct: 841 NWRREWIKLTNDWQSSRKIYPVESNGDALHTSHWLYNKYLQIPESSDQ 820
BLAST of Csor.00g156120 vs. TAIR 10
Match:
AT5G13690.1 (alpha-N-acetylglucosaminidase family / NAGLU family)
HSP 1 Score: 984.9 bits (2545), Expect = 3.9e-287
Identity = 472/822 (57.42%), Positives = 585/822 (71.17%), Query Frame = 0
Query: 32 ISRLLDIQDRERAPSSVQVAAARGVLRRLLPSHLSSFDFQILSKDACGGESCFLIRNHRA 91
I LLD D SSVQ +AA+G+L+RLLP+H SF+ +I+SKDACGG SCF+I N+
Sbjct: 28 IDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGTSCFVIENYDG 87
Query: 92 FRRPGDPEILIAGVTGVEILAGLHWYLKHWCGAHISWDKTGGSQLFSVPKPGSLPLIKSD 151
R G PEILI G TGVEI +GLHWYLK+ C AH+SWDKTGG Q+ SVP+PG LP I S
Sbjct: 88 PGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQPGHLPRIDSK 147
Query: 152 EIIVQRPIPLNYYQNAVTSSYSFAWWDWERWEKEIDWMALQGINMPLAFTGQEAIWRKVF 211
I ++RP+P NYYQN VTSSYS+ WW WERWE+EIDWMALQGIN+PLAFTGQEAIW+KVF
Sbjct: 148 RIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFTGQEAIWQKVF 207
Query: 212 QKFNISNSDLDDFFGGPAFLAWSRMGNLHKWGGPLPQSWFDQQLILQKKVIGRMFELGMT 271
++FNIS DLDD+FGGPAFLAW+RMGNLH WGGPL ++W D QL+LQK+++ RM + GMT
Sbjct: 208 KRFNISKEDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMT 267
Query: 272 PVLPAFSGNIPAAFKQIYPSAKITRLGNWFSVHSDPRWCCTYLLDAMDPLFVEIGRAFIE 331
PVLP+FSGN+P+A ++IYP A ITRL NW +V D RWCCTYLL+ DPLF+EIG AFI+
Sbjct: 268 PVLPSFSGNVPSALRKIYPEANITRLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIK 327
Query: 332 QQLKEYGRTSHVYNCDTFDENTPPVDDVEYISSLGAAIFGGMQAGDSSAVWLMQGWMFSY 391
QQ +EYG +++YNCDTF+ENTPP + EYISSLGAA++ M G+ +AVWLMQGW+FS
Sbjct: 328 QQTEEYGEITNIYNCDTFNENTPPTSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSS 387
Query: 392 D-PFWRPQQMKALLHSVSLGRLVVLDLYAEVKPIWIASEQFYGVPYIWKVTIPFFCSILM 451
D FW+P Q+KALLHSV G+++VLDLYAEVKPIW S QFYG PYIW
Sbjct: 388 DSKFWKPPQLKALLHSVPFGKMIVLDLYAEVKPIWNKSAQFYGTPYIW------------ 447
Query: 452 LSLQCLNNSAVLSQCPGPYIVPLVSAQGSKTKCMLHNFAGNVEMYGILDSIASGPIEARS 511
CMLHNF GN+EMYG LDSI+SGP++AR
Sbjct: 448 --------------------------------CMLHNFGGNIEMYGALDSISSGPVDARV 507
Query: 512 SPYSTMVGVGMSMEGIEQNPVVYDLMSEMAFQHNKVDVKKWLYQYSIRRYGHLVPSIQDA 571
S STMVGVGM MEGIEQNPVVY+L SEMAF+ KVDV+KWL Y+ RRY I+ A
Sbjct: 508 SKNSTMVGVGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAA 567
Query: 572 WDVLYHTIYNCTDGAYDKNRDVIVAFPDVDPSS-------------ISVIPEGSDRHDAG 631
W++LYHT+YNCTDG D N D IV PD DPSS IS P + R
Sbjct: 568 WEILYHTVYNCTDGIADHNTDFIVKLPDWDPSSSVQDDLKQKDSYMISTGPYETKRRVLF 627
Query: 632 SLQDAIFERPHLWYPTSEVIRALKLFVASGDQLSGSNTYRYDLVDLTRQALAKYSNELFF 691
+ A + HLWY T EVI+ALKLF+ +GD LS S TYRYD+VDLTRQ L+K +N+++
Sbjct: 628 QDKTADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYT 687
Query: 692 RIVKAYQLDDVQTTVSLSQQFLELVNDIDTLVACHEGFLLGPWLQSAKQLAQDEQQEKQY 751
V A+ D+ + LS++FLEL+ D+D L+A + LLG WL+SAK+LA++ + KQY
Sbjct: 688 EAVTAFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQY 747
Query: 752 EWNARTQITMWFDNTEEEASLLRDYGNKYWSGLLSDYYGPRAAIYFKFLKESLENGYAFP 811
EWNARTQ+TMW+D+ + S L DY NK+WSGLL DYY PRA +YF + +SL + F
Sbjct: 748 EWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFK 804
Query: 812 LSNWRREWIKLTNDW-QSSRKVYPVKSNGDAVDTSRWLYNKY 839
+ WRREWI +++ W QSS +VYPVK+ GDA+ SR L +KY
Sbjct: 808 VEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9FNA3 | 5.5e-286 | 57.42 | Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1
P54802 | 6.3e-149 | 37.48 | Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2
Match Name | E-value | Identity (%) | Description
A0A6J1ECY3 | 0.0 | 93.51 | alpha-N-acetylglucosaminidase-like OS=Cucurbita moschata OX=3662 GN=LOC111432041...
A0A6J1I5L2 | 0.0 | 92.80 | alpha-N-acetylglucosaminidase-like OS=Cucurbita maxima OX=3661 GN=LOC111470873 P...
A0A1S3BVG2 | 0.0 | 84.98 | alpha-N-acetylglucosaminidase-like OS=Cucumis melo OX=3656 GN=LOC103493939 PE=4 ...
A0A6J1C176 | 0.0 | 85.23 | alpha-N-acetylglucosaminidase-like OS=Momordica charantia OX=3673 GN=LOC11100744...
A0A5D3BH46 | 0.0 | 78.72 | Alpha-N-acetylglucosaminidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E56...
Match Name | E-value | Identity (%) | Description
AT5G13690.1 | 3.9e-287 | 57.42 | alpha-N-acetylglucosaminidase family / NAGLU family
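The Identity column in the tables above appears to be the identity count of each HSP divided by its alignment length, as given in the corresponding "Identity = n/m" summary line. The following is a minimal Python sketch of that arithmetic. The function name `parse_hsp_summary` and the returned field names are hypothetical and are not part of any BLAST tool or of this database; the pattern accepts either spelling of the Positives label.

```python
import re

# Pattern for an HSP summary line such as
# "Identity = 729/854 (85.36%), Positives = 765/854 (89.58%), Query Frame = 0".
# "Posi?tives" also tolerates the variant spelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration only.
    """
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, positives, positives_length = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / positives_length, 2),
    }


if __name__ == "__main__":
    summary = "Identity = 729/854 (85.36%), Positives = 765/854 (89.58%), Query Frame = 0"
    print(parse_hsp_summary(summary))
```

Recomputing the percentage from the counts, rather than reading the printed value, makes it easy to cross-check a summary table row against its alignment header.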