Sgr019477 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr019477
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Endoglucanase
Location: tig00153347: 1020063 .. 1023575 (+)
RNA-Seq Expression: Sgr019477
Synteny: Sgr019477
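
The Location field above can be read as contig, start, end, and strand. The following is a minimal sketch of parsing it and computing the span length, assuming the coordinates are 1-based and inclusive (the usual convention for such records, but not stated on this page). The function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return contig, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end, strand = parse_location("tig00153347: 1020063 .. 1023575 (+)")
print(contig, strand, span_length(start, end))
```

Under that assumption the gene spans 3513 bp on the plus strand of tig00153347.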
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCCCCGACTGATAAAAAGCTTTTCTCAGCTGGTAGGTACCCAAAAAATAGTCCAGTAAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGATTTCAAGCAATAAACCGGATGGTCTTATTGGTGGTTTCTATGATTCAGGAAACAACATTAAGTTCACTTTCCCCACAGCTTATACCATTACTCTTCTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCAAGGACATCATCAGATGGGGAACTGATTATTTGCTCAAAGTTTTGGTGGCCCCAAATGCCACTTCTGATCAAACCATAATATATTCTCAGGTAAGTCATATGACCTCTCAATGAGAAAAATTGCGATACTCACAATGACATTATTCCACAACACTGAATCTCCCATGTCATGTCTATGAGTCTATCACGTAACTATCTAATGTGTGATGCTAGGTAGGAAGTGCCAGTAATGATAGCAATGTTCAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGAGGTACCCAAGACCTGTTTCAAAATGTGATACCCGGGCTTCGGATCTTGCTGGAGAGATTATTGCAGCATTATCAGCTGCATCATTAGTGTTCAAAGAAGATAACAATTATTCAAGAGAATTAGCAAAAGCTGCAGAGAAATTGTTTGAGCAAGTGACTAAGTTAGACCCTAGTAAGCAAGGAACTTACACCATGGTTGATTCATGTGGAGGAGAAGCAAGAAACTTCTACAACTCATCAAGTTACATGGATGAATTGATATGGGCAGGAACTTGGTTGTTCTTTGCTACCGGAAACACTTCATATCTTGCCTATGCCACTGATGCTGTCAGATTTCAATTAGCACAGAGCGAGGAAGCAATATTGACAGAGGGATTTTCTATTGGAACAATAAGTTCAGTGCAACTGCGGTAACTCCAATCAAAGTCTTATTTGAATTTTGGATAAAAGAGGAAGAATCTATAGCTGATGAATGTTTTACATTACCACTCTTTCAATAGGTACTATTGACACGTCTTCTCTACTTTCATGATATTGGCTACCCATATGAATATGCCTTAGGAGCATCATCAAACATGACAGACATCCTCATGTGTTCTTATCTCATTGATCAACACTTCGATAGGACACCTGGTAAACATCTGTTCAGATCCTATAGATCAAATAAACTTAATATCTATGGAATTGCAAGTACACTTGGACAAATCGACTACCCCACTGATTTTTCTGTCAACAACAAGAAGACAAGTAAGTTTGTTTCCAGTAATCAGCTATTCTTTCAATTGTTAATGTAGGTGGACTGATCCTCCTAAGGCCTGATGATGGAGCACCACTCCAGTTTGCTGCAACAGCCTCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCTTTTGGGAGCATCATACATAAGTTGCATTTTTGCCTATCCAGGCTTTTCTCTGGAAAAGTTGCGGACCTTCTCCAGGTCTCAGGTAAGTGCAATATACTAAGAACTATTTTCTCTGTCTTCTATTTCCAACAACAGAAAATGTAGAACATATTAACATTCAACAAAATCACCTCAAGTTAAGTTGATTTTAGAGAGCAATAAGACAATATTTTCAAATGCCTTTTTTTAGTGAAGGAAAAACATACTGTTCTGCTTTCTGAGAAAAGTAAAACTATTCAAATACCGTATTTTGAGATCTTTTATCTTTCTGCTTTTAGAGGCAGTTTTGAAACTTGTTAGTTGGAAACAAAACCAAACAAATGTTAGAAACATCAATACACATTCTAGACTGAAGTCTAAAAATCCTTCAGTTAGATATAAGATTTTAATTATATTAAACTTTTTCAGCTCAACTACATACTTGGAGATAATCCTATGAAAATGAGCTACGTCGTTAGCTTCGGAAACAATTTTCCCACCCATGTCCACCACAGGAGTGCCTCAATTCCTTGGGATGGTCAATTCCATTCATGTGC
TGAAGGAGATAGATGGTTGTTATCTAAGGCTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCAGGACCAGACAAGTTAGACCACTTCTTGGATGATAGGGAAAAACCTTGGTTTACTGAACCAAGTATAGCAAGCAATGCAGGTTTAGTTGCAGCGCTCATTGCTCTACATGACTATCCAGGTGATACTTCCGGTTTTAATGGGAAAAATTTAGGCATAGATCAGATGTCAATCTTTGATAGAATCCATGTGGCTTCTGTAGCTCCTTGATATTCCTAAGATATATTTTTAACTTCTCATAAGAGCAAAAATGTGACTTCAGAAATCTTCACTCTGTAATGTCCCACAATCTTTCTTACATGCAAGAATATGAACAAATTGTTTCTTCCCAATCTGCATGACTTTTTTGAGTAAATAACTATGCAATCTCATTGTTAATTCTTTGCTTTTACACTTTTTATCATTGCTTGTGTTCTCTCCCTACAAGCTAATATAAAATTCTCAGTTTTTGACACCTCTACTAATTAATAGATCTGTAGGAAACAAAATTTCATCCCAAAGAATTATCCAAAGCATAATATCAGCCTCTCCAAACCAAAGACTGTTTTTAAATGTGAAAGTTAAGAGTTAAGGCTTGAGAAGATATCTTTTATTTAACCAATTCAGTAGTACATGAGAACAGAATTTTTTCACTTTGATAGTATTTAGTCATTTGTTGTAATTTAATTACAGGCCTATTAGAATTGAGACTTGAGAATTAAAAAAAACTGTCTGTTGATGAAGAAAACTCGTTTGGTACGATATGATGACAGCATGAACAGAGATTTATAGTATTTGATTTAAATTTTCCAATACACTCTGTAACCAAAGTAATTGATGATACATATTACATATTACATATTACAAACCCAACCAATCCGCAAGTTAGATTAAAGTACACTCAACTAAGTTTAAAGACAATCGGCACTGTATTGATCTTTCAAATTTTAGAATCCCAATGTCGCAAAAGGATGAGAAGAAAAGACCTACCGCAACAGTCGCGTACACTCAAAAGACATGAAGTAAATCAGTGACCCACTGCTGGAAAGCAATAGATTTTAAGAAAAAAGGAGAGACGCAGAGAGGAATTTCAATCACAGGAAAAGACCTGTTGGGGCAATCTCAAAGCCTCGGCTTTGGTGAAGCAGGTGTCGTCGATGGCATCGCTCCGGCACCACCGGCATTTGGGGCTCTGCGAACACTGTAACGCCGAAACCTTCTCACCGCAATTTGGGATCTGCTGATGTCTCCTTCTCTGTAAGCTCAATAAACTACGATCCGAACTGGCAAACGTCATTTTGCGATCTCCAGATAACCCATTACATGAGACGCAGAGGAATAGTAAGACGAGGAAGATGGAGACCGGCGTTGGAAGATCAGAGCTCCTTCCCATGGAAATGGAAGACAAACCTATGGGAACCTAAGTCCCACAGCTACACTTCAGAACTCCACTGCAGGCAGGTGTCTTAG

mRNA sequence

ATGCCCCCGACTGATAAAAAGCTTTTCTCAGCTGGTAGGTACCCAAAAAATAGTCCAGTAAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGATTTCAAGCAATAAACCGGATGGTCTTATTGGTGGTTTCTATGATTCAGGAAACAACATTAAGTTCACTTTCCCCACAGCTTATACCATTACTCTTCTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCAAGGACATCATCAGATGGGGAACTGATTATTTGCTCAAAGTTTTGGTGGCCCCAAATGCCACTTCTGATCAAACCATAATATATTCTCAGGTAGGAAGTGCCAGTAATGATAGCAATGTTCAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGAGGTACCCAAGACCTGTTTCAAAATGTGATACCCGGGCTTCGGATCTTGCTGGAGAGATTATTGCAGCATTATCAGCTGCATCATTAGTGTTCAAAGAAGATAACAATTATTCAAGAGAATTAGCAAAAGCTGCAGAGAAATTGTTTGAGCAAGTGACTAAGTTAGACCCTAGTAAGCAAGGAACTTACACCATGGTTGATTCATGTGGAGGAGAAGCAAGAAACTTCTACAACTCATCAAGTTACATGGATGAATTGATATGGGCAGGAACTTGGTTGTTCTTTGCTACCGGAAACACTTCATATCTTGCCTATGCCACTGATGCTGTCAGATTTCAATTAGCACAGAGCGAGGAAGCAATATTGACAGAGGGATTTTCTATTGGAACAATAAGTTCAGTGCAACTGCGGACACCTGGTAAACATCTGTTCAGATCCTATAGATCAAATAAACTTAATATCTATGGAATTGCAAGTACACTTGGACAAATCGACTACCCCACTGATTTTTCTGTCAACAACAAGAAGACAAGTGGACTGATCCTCCTAAGGCCTGATGATGGAGCACCACTCCAGTTTGCTGCAACAGCCTCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCTTTTGGGAGCATCATACATAAGTTGCATTTTTGCCTATCCAGGCTTTTCTCTGGAAAAGTTGCGGACCTTCTCCAGGTCTCAGCTCAACTACATACTTGGAGATAATCCTATGAAAATGAGCTACGTCGTTAGCTTCGGAAACAATTTTCCCACCCATGTCCACCACAGGAGTGCCTCAATTCCTTGGGATGGTCAATTCCATTCATGTGCTGAAGGAGATAGATGGTTGTTATCTAAGGCTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCAGGACCAGACAAGTTAGACCACTTCTTGGATGATAGGGAAAAACCTTGGTTTACTGAACCAAGTATAGCAAGCAATGCAGGTTTAGTTGCAGCGCTCATTGCTCTACATGACTATCCAGGTGATACTTCCGGTTTTAATGGGAAAAATTTAGGCATAGATCAGATGTGTCGTCGATGGCATCGCTCCGGCACCACCGGCATTTGGGGCTCTGCGAACACTGTAACGCCGAAACCTTCTCACCGCAATTTGGGATCTGCTGATGTCTCCTTCTCTGTAAGCTCAATAAACTACGATCCGAACTGGCAAACGTCATTTTGCGATCTCCAGATAACCCATTACATGAGACGCAGAGGAATAGTAAGACGAGGAAGATGGAGACCGGCGTTGGAAGATCAGAGCTCCTTCCCATGGAAATGGAAGACAAACCTATGGGAACCTAAGTCCCACAGCTACACTTCAGAACTCCACTGCAGGCAGGTGTCTTAG

Coding sequence (CDS)

ATGCCCCCGACTGATAAAAAGCTTTTCTCAGCTGGTAGGTACCCAAAAAATAGTCCAGTAAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGATTTCAAGCAATAAACCGGATGGTCTTATTGGTGGTTTCTATGATTCAGGAAACAACATTAAGTTCACTTTCCCCACAGCTTATACCATTACTCTTCTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCAAGGACATCATCAGATGGGGAACTGATTATTTGCTCAAAGTTTTGGTGGCCCCAAATGCCACTTCTGATCAAACCATAATATATTCTCAGGTAGGAAGTGCCAGTAATGATAGCAATGTTCAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGAGGTACCCAAGACCTGTTTCAAAATGTGATACCCGGGCTTCGGATCTTGCTGGAGAGATTATTGCAGCATTATCAGCTGCATCATTAGTGTTCAAAGAAGATAACAATTATTCAAGAGAATTAGCAAAAGCTGCAGAGAAATTGTTTGAGCAAGTGACTAAGTTAGACCCTAGTAAGCAAGGAACTTACACCATGGTTGATTCATGTGGAGGAGAAGCAAGAAACTTCTACAACTCATCAAGTTACATGGATGAATTGATATGGGCAGGAACTTGGTTGTTCTTTGCTACCGGAAACACTTCATATCTTGCCTATGCCACTGATGCTGTCAGATTTCAATTAGCACAGAGCGAGGAAGCAATATTGACAGAGGGATTTTCTATTGGAACAATAAGTTCAGTGCAACTGCGGACACCTGGTAAACATCTGTTCAGATCCTATAGATCAAATAAACTTAATATCTATGGAATTGCAAGTACACTTGGACAAATCGACTACCCCACTGATTTTTCTGTCAACAACAAGAAGACAAGTGGACTGATCCTCCTAAGGCCTGATGATGGAGCACCACTCCAGTTTGCTGCAACAGCCTCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCTTTTGGGAGCATCATACATAAGTTGCATTTTTGCCTATCCAGGCTTTTCTCTGGAAAAGTTGCGGACCTTCTCCAGGTCTCAGCTCAACTACATACTTGGAGATAATCCTATGAAAATGAGCTACGTCGTTAGCTTCGGAAACAATTTTCCCACCCATGTCCACCACAGGAGTGCCTCAATTCCTTGGGATGGTCAATTCCATTCATGTGCTGAAGGAGATAGATGGTTGTTATCTAAGGCTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCAGGACCAGACAAGTTAGACCACTTCTTGGATGATAGGGAAAAACCTTGGTTTACTGAACCAAGTATAGCAAGCAATGCAGGTTTAGTTGCAGCGCTCATTGCTCTACATGACTATCCAGGTGATACTTCCGGTTTTAATGGGAAAAATTTAGGCATAGATCAGATGTGTCGTCGATGGCATCGCTCCGGCACCACCGGCATTTGGGGCTCTGCGAACACTGTAACGCCGAAACCTTCTCACCGCAATTTGGGATCTGCTGATGTCTCCTTCTCTGTAAGCTCAATAAACTACGATCCGAACTGGCAAACGTCATTTTGCGATCTCCAGATAACCCATTACATGAGACGCAGAGGAATAGTAAGACGAGGAAGATGGAGACCGGCGTTGGAAGATCAGAGCTCCTTCCCATGGAAATGGAAGACAAACCTATGGGAACCTAAGTCCCACAGCTACACTTCAGAACTCCACTGCAGGCAGGTGTCTTAG

Protein sequence

MPPTDKKLFSAGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEAILTEGFSIGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGFNGKNLGIDQMCRRWHRSGTTGIWGSANTVTPKPSHRNLGSADVSFSVSSINYDPNWQTSFCDLQITHYMRRRGIVRRGRWRPALEDQSSFPWKWKTNLWEPKSHSYTSELHCRQVS
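
As a quick consistency check between the coding sequence and the protein sequence above, the CDS can be translated codon by codon. This is a minimal sketch that assumes the standard nuclear genetic code; the `translate` helper is hypothetical and is not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCCCCCGACTGATAAAAAG"))
```

The first seven codons give MPPTDKK, which matches the start of the protein sequence shown above.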
Homology
BLAST of Sgr019477 vs. NCBI nr
Match: XP_031745535.1 (endoglucanase 9-like [Cucumis sativus])

HSP 1 Score: 812.8 bits (2098), Expect = 2.0e-231
Identity = 402/497 (80.89%), Positives = 434/497 (87.32%), Query Frame = 0

Query: 3   PTDKKLFSAGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTIT 62
           PTDKKLFSAGRYPK+SPVKFRGDSGLEDG+SSNKPDGLIGGFYDSGNNIKFTFPTAYTIT
Sbjct: 7   PTDKKLFSAGRYPKSSPVKFRGDSGLEDGVSSNKPDGLIGGFYDSGNNIKFTFPTAYTIT 66

Query: 63  LLSWSVIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDS 122
           LLSWSVIEYHPKYADMNELDHVKDIIRWGT+YLLK+ VAPNATSDQTIIYSQVGS+SNDS
Sbjct: 67  LLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKIFVAPNATSDQTIIYSQVGSSSNDS 126

Query: 123 NVQTNDNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAA 182
           N QTNDNCWQRPEDM YPRP+S CD RASDLAGEI+AALSA+SLVF+ED NYSRELAKAA
Sbjct: 127 NAQTNDNCWQRPEDMMYPRPISTCDARASDLAGEIVAALSASSLVFREDTNYSRELAKAA 186

Query: 183 EKLFEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAY 242
           EKLF+QVTKLDP +QGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLFFATGNTSYL+Y
Sbjct: 187 EKLFQQVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSY 246

Query: 243 ATDAVRFQLAQSEEAILTEGFS--IGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQ 302
           ATDAVRFQLAQSEEA +  G        S+  +       F           G++S + +
Sbjct: 247 ATDAVRFQLAQSEEASIGRGIFNWNNKFSATAVLLTRLLYFHDTGYPYEYALGVSSNMTE 306

Query: 303 I---DYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIF 362
           I    Y  D    ++   GLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASY+SCIF
Sbjct: 307 ILMCSYLID-QHYDRTPGGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYMSCIF 366

Query: 363 AYPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCA 422
           A PGFSLEKLR+FS SQLNYILGDNP+KMSYVV +GNNFPTHVHHR+ASIPWDGQF+SCA
Sbjct: 367 ANPGFSLEKLRSFSNSQLNYILGDNPLKMSYVVGYGNNFPTHVHHRAASIPWDGQFYSCA 426

Query: 423 EGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDY 482
           EGDRWLLSKASNPNILSGAMVAGPD  DHF DDREKPWFTEPSIASNAGLVAAL+AL+DY
Sbjct: 427 EGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDY 486

Query: 483 PGDTSGFNGKNLGIDQM 495
           PGDTS FNGK+LGID+M
Sbjct: 487 PGDTSDFNGKDLGIDKM 502
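
The percentages in the HSP summary for this hit are the match counts divided by the alignment length, which includes gap columns. A small sketch of that arithmetic, using the counts reported above (the `percent` helper is hypothetical):

```python
def percent(count, alignment_length):
    """Format count / alignment_length as a percentage with two decimals."""
    return f"{100 * count / alignment_length:.2f}%"


# Counts taken from the HSP summary of the XP_031745535.1 hit.
print("Identity:", percent(402, 497))
print("Positives:", percent(434, 497))
```

This reproduces the 80.89% and 87.32% figures in the summary line.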

BLAST of Sgr019477 vs. NCBI nr
Match: KAE8653204.1 (hypothetical protein Csa_019838 [Cucumis sativus])

HSP 1 Score: 797.0 bits (2057), Expect = 1.1e-226
Identity = 397/495 (80.20%), Positives = 429/495 (86.67%), Query Frame = 0

Query: 8   LFSAGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWS 67
           LFSAGRYPK+SPVKFRGDSGLEDG+SSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWS
Sbjct: 99  LFSAGRYPKSSPVKFRGDSGLEDGVSSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWS 158

Query: 68  VIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTN 127
           VIEYHPKYADMNELDHVKDIIRWGT+YLLK+ VAPNATSDQTIIYSQVGS+SNDSN QTN
Sbjct: 159 VIEYHPKYADMNELDHVKDIIRWGTEYLLKIFVAPNATSDQTIIYSQVGSSSNDSNAQTN 218

Query: 128 DNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFE 187
           DNCWQRPEDM YPRP+S CD RASDLAGEI+AALSA+SLVF+ED NYSRELAKAAEKLF+
Sbjct: 219 DNCWQRPEDMMYPRPISTCDARASDLAGEIVAALSASSLVFREDTNYSRELAKAAEKLFQ 278

Query: 188 QVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 247
           QVTKLDP +QGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLFFATGNTSYL+YATDAV
Sbjct: 279 QVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSYATDAV 338

Query: 248 RFQLAQSEEAILTEGFS--IGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQI---D 307
           RFQLAQSEEA +  G        S+  +       F           G++S + +I    
Sbjct: 339 RFQLAQSEEASIGRGIFNWNNKFSATAVLLTRLLYFHDTGYPYEYALGVSSNMTEILMCS 398

Query: 308 YPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGF 367
           Y  D    ++   GLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASY+SCIFA PGF
Sbjct: 399 YLID-QHYDRTPGGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYMSCIFANPGF 458

Query: 368 SLEKLRTFSRSQ---LNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEG 427
           SLEKLR+FS SQ   LNYILGDNP+KMSYVV +GNNFPTHVHHR+ASIPWDGQF+SCAEG
Sbjct: 459 SLEKLRSFSNSQASALNYILGDNPLKMSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEG 518

Query: 428 DRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYPG 487
           DRWLLSKASNPNILSGAMVAGPD  DHF DDREKPWFTEPSIASNAGLVAAL+AL+DYPG
Sbjct: 519 DRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG 578

Query: 488 DTSGFNGKNLGIDQM 495
           DTS FNGK+LGID+M
Sbjct: 579 DTSDFNGKDLGIDKM 592

BLAST of Sgr019477 vs. NCBI nr
Match: XP_022140170.1 (endoglucanase 25-like [Momordica charantia])

HSP 1 Score: 791.2 bits (2042), Expect = 6.3e-225
Identity = 402/510 (78.82%), Positives = 428/510 (83.92%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +GRYPKNSPVKFRGDSGLEDG+  NK DGL+GGFYDSGNNIKFTFPTAYTITLLSWSVIE
Sbjct: 111 SGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIE 170

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNC 130
           YHPKYADMNELDHV+DIIRWGTDYLLKV VAPN TSDQ IIYSQVGSASNDSNVQTNDNC
Sbjct: 171 YHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAIIYSQVGSASNDSNVQTNDNC 230

Query: 131 WQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQVT 190
           WQRPED RYPRPVSKCDTRASDLAGEI+AALSAASLVFKEDNNYS ELAKAAEKLFE+VT
Sbjct: 231 WQRPEDTRYPRPVSKCDTRASDLAGEIVAALSAASLVFKEDNNYSGELAKAAEKLFEEVT 290

Query: 191 KLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRFQ 250
           KLDPS+QGTYT+VDSCGGEARNFYNSSSYMDELIWAGTWLF+ATGNTSYLAYATDAVRFQ
Sbjct: 291 KLDPSEQGTYTLVDSCGGEARNFYNSSSYMDELIWAGTWLFYATGNTSYLAYATDAVRFQ 350

Query: 251 LAQSEEAILTEG-------FSIGTISSVQLR------TPGKHLFRSYRSNKLNIYGIAST 310
           LAQS+E+ +  G       FS   +   +L        P +H   +  SNK +I   +  
Sbjct: 351 LAQSKESSIDRGIFDWNNKFSATAVLLTRLLYFHDIVYPYEHALGA-SSNKTDILMCSYL 410

Query: 311 LGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIF 370
           + Q          N+   GLI+LRPD GAPLQFAATASFLSKLYSDYLDLLGASY+SCIF
Sbjct: 411 IDQ--------HFNRTPGGLIILRPDGGAPLQFAATASFLSKLYSDYLDLLGASYMSCIF 470

Query: 371 AYPGFSLEKLRTFSRS-------------QLNYILGDNPMKMSYVVSFGNNFPTHVHHRS 430
           A PGFSLEKLRTFSRS             QLNYILGDNPMKMSYVV FG NFPTHVHHR 
Sbjct: 471 ANPGFSLEKLRTFSRSQASAIDFNGIELFQLNYILGDNPMKMSYVVGFGTNFPTHVHHRG 530

Query: 431 ASIPWDGQFHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASN 490
           ASIP DGQF+SCAEGDRWLLSKASNPNILSGA+V GPDK DHF DDR KPWFTEPSIASN
Sbjct: 531 ASIPRDGQFYSCAEGDRWLLSKASNPNILSGALVTGPDKFDHFSDDRGKPWFTEPSIASN 590

Query: 491 AGLVAALIALHDYPGDTSGFNGKNLGIDQM 495
           AGLVAAL+ALHDYPGDTS FNGK+LGIDQM
Sbjct: 591 AGLVAALVALHDYPGDTSDFNGKDLGIDQM 611

BLAST of Sgr019477 vs. NCBI nr
Match: KAA0057069.1 (endoglucanase 25 [Cucumis melo var. makuwa])

HSP 1 Score: 788.1 bits (2034), Expect = 5.3e-224
Identity = 395/495 (79.80%), Positives = 426/495 (86.06%), Query Frame = 0

Query: 8   LFSAGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWS 67
           LFSAGRYPK+SPVKFRGDSGL+DG+SSNKPDGLIGGFYDSGNN+KFTFPTAYTITLLSWS
Sbjct: 80  LFSAGRYPKSSPVKFRGDSGLKDGVSSNKPDGLIGGFYDSGNNMKFTFPTAYTITLLSWS 139

Query: 68  VIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTN 127
           VIEYHPKYADMNELDHVKDIIRWGT+YLLKV VAPNATSDQTIIYSQVGS+SN+S  QTN
Sbjct: 140 VIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQTIIYSQVGSSSNESKAQTN 199

Query: 128 DNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFE 187
           DNCWQRPEDM YPRPVS CD RASDLAGEI+AALSA+SLVF+ED NYS ELAKAAEKLF+
Sbjct: 200 DNCWQRPEDMMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQ 259

Query: 188 QVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 247
           QVTKLDP +QGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLFFATGNTSYL+YATDAV
Sbjct: 260 QVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSYATDAV 319

Query: 248 RFQLAQSEEAILTEGFS--IGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQI---D 307
           RFQLAQSEEA +  G        S+  +       F           G++S + +I    
Sbjct: 320 RFQLAQSEEASIGRGIFNWNNKFSATAVLLTRLLYFHDTGYPYEYALGVSSNMTEILMCS 379

Query: 308 YPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGF 367
           Y  D    N+  SGLILL PDD APLQFAATASFLSKLYSDYLDLLGASY+SCIFA P F
Sbjct: 380 YLIDQHF-NRTPSGLILLSPDDKAPLQFAATASFLSKLYSDYLDLLGASYMSCIFANPRF 439

Query: 368 SLEKLRTFSRSQ---LNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEG 427
           SLEKLR+FS+SQ   LNYILGDNP+KMSYVV +GNNFPTHVHHR+ASIPWDGQF+SCAEG
Sbjct: 440 SLEKLRSFSKSQASALNYILGDNPLKMSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEG 499

Query: 428 DRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYPG 487
           DRWLLSKASNPNILSGAMVAGPDK DHF DDREKPWFTEPSIASNAGLVAAL+AL+DYPG
Sbjct: 500 DRWLLSKASNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG 559

Query: 488 DTSGFNGKNLGIDQM 495
           DT  FNGKNLGIDQM
Sbjct: 560 DTPDFNGKNLGIDQM 573

BLAST of Sgr019477 vs. NCBI nr
Match: KAG6573332.1 (Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 696.0 bits (1795), Expect = 2.8e-196
Identity = 354/497 (71.23%), Positives = 390/497 (78.47%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +GRYP+NSPV FRGDSGL+DG+SS+KPDGL+GGFYDSGNNIKFTFPTAYTITLL WSVIE
Sbjct: 112 SGRYPENSPVDFRGDSGLDDGVSSSKPDGLVGGFYDSGNNIKFTFPTAYTITLLGWSVIE 171

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNC 130
           YHPKYADMNELDHVKDII+WGTDYLLKV VAPN+TSD+TIIYSQVGS SNDS  Q NDNC
Sbjct: 172 YHPKYADMNELDHVKDIIKWGTDYLLKVFVAPNSTSDRTIIYSQVGSVSNDSKAQNNDNC 231

Query: 131 WQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQVT 190
           WQRPEDMRY RPVS+CD RASDLAGE++AALSAASLVFKEDNNYS ELAKA EKLFEQVT
Sbjct: 232 WQRPEDMRYTRPVSECDARASDLAGEVVAALSAASLVFKEDNNYSGELAKAVEKLFEQVT 291

Query: 191 KLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRFQ 250
           KLDPS+QGTYT+VD CGGEARNFYNSS +MDELIWAG+WLFFATGN SYLAY+TDAVRFQ
Sbjct: 292 KLDPSEQGTYTLVDLCGGEARNFYNSSGFMDELIWAGSWLFFATGNASYLAYSTDAVRFQ 351

Query: 251 LAQSEEAILTEG-------FSIGTISSVQL------RTPGKHLFRSYRSNKLNIYGIAST 310
           LA++E A + +G       FS   +   +L        P +H+ R+  SN   I   +  
Sbjct: 352 LARTEVASIDQGIFDWNNKFSATAVLLTRLLYFHDIGYPYEHVLRA-SSNMTEILMCSYL 411

Query: 311 LGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIF 370
           + Q          N+   GLILLRP DGAPLQFAATASFLSKLYSDYLDLLGASY+SCIF
Sbjct: 412 IEQ--------HFNRTPGGLILLRPADGAPLQFAATASFLSKLYSDYLDLLGASYMSCIF 471

Query: 371 AYPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCA 430
           A PGFSLEKL+ FS                               R ASIPWDGQF+SC 
Sbjct: 472 ANPGFSLEKLKAFS-------------------------------RGASIPWDGQFYSCT 531

Query: 431 EGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDY 490
           EGDRWLLSK  NPN+L GAMVAGPDK DHF DDREKPWFTEP+IASNAGLVAALIALHDY
Sbjct: 532 EGDRWLLSKGPNPNLLIGAMVAGPDKFDHFSDDREKPWFTEPTIASNAGLVAALIALHDY 568

Query: 491 PGDTSGFNGKNLGIDQM 495
           PGD S +NGKN+GIDQM
Sbjct: 592 PGDVSVYNGKNIGIDQM 568

BLAST of Sgr019477 vs. ExPASy Swiss-Prot
Match: Q84R49 (Endoglucanase 10 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU2 PE=2 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 2.6e-96
Identity = 204/480 (42.50%), Positives = 286/480 (59.58%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGIS-SNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVI 70
           +G  PK++ V +RG+S ++DG+S S     L+GGFYD+G+ IKF +P A+++T+LSWSVI
Sbjct: 126 SGPLPKHNGVSWRGNSCMKDGLSDSTVRKSLVGGFYDAGDAIKFNYPMAWSMTMLSWSVI 185

Query: 71  EYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDN 130
           EY  KY  + ELDHVK++I+WGTDYLLK   +   T D+ +    VG  S     Q ND+
Sbjct: 186 EYKAKYEAIGELDHVKELIKWGTDYLLKTFNSSADTIDRIVAQVGVGDTSK-GGAQPNDH 245

Query: 131 -CWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQ 190
            CW RPED+ YPRPV++C +  SDLA E+ AAL+AAS+VFK+   YS +L + A+ L+  
Sbjct: 246 YCWMRPEDIDYPRPVTECHS-CSDLASEMAAALAAASIVFKDSKTYSDKLVRGAKALY-- 305

Query: 191 VTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYAT---- 250
             K    ++G Y+     G +   FYNS+SY DE +W G W++FATGN +YL+ AT    
Sbjct: 306 --KFGRLQRGRYS---PNGSDQAIFYNSTSYWDEFVWGGAWMYFATGNNTYLSVATAPGM 365

Query: 251 --DAVRFQLAQSEEAILT--EGFSIGTISSVQLRT------PGKHLFRSYRSNKLNIYGI 310
              A  + L      + T  +      +   +LR       P + + R++ +   N+   
Sbjct: 366 AKHAGAYWLDSPNYGVFTWDDKLPGAQVLLSRLRLFLSPGYPYEEILRTFHNQTDNV--- 425

Query: 311 ASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYIS 370
                   Y   ++  N    G+I L      PLQ+   A+FL+ LYSDYLD        
Sbjct: 426 -----MCSYLPMYNSFNFTKGGMIQLNHGRPQPLQYVVNAAFLASLYSDYLDAADTPGWY 485

Query: 371 CIFAYPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFH 430
           C   +  ++ E LR F+RSQL+Y+LG NP+KMSYVV FGN +P   HHR ASIP +G  +
Sbjct: 486 CGPTF--YTTEVLRKFARSQLDYVLGKNPLKMSYVVGFGNKYPKRAHHRGASIPHNGVKY 545

Query: 431 SCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIAL 475
            C  G +W  +K  NPNIL GA+VAGPD+ D F D R    +TEP++A+NAGLVAALI+L
Sbjct: 546 GCKGGFKWRETKKPNPNILIGALVAGPDRHDGFKDVRTNYNYTEPTLAANAGLVAALISL 586

BLAST of Sgr019477 vs. ExPASy Swiss-Prot
Match: Q38890 (Endoglucanase 25 OS=Arabidopsis thaliana OX=3702 GN=KOR PE=1 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 4.4e-96
Identity = 213/482 (44.19%), Positives = 290/482 (60.17%), Query Frame = 0

Query: 7   KLFSA---GRYPKNSPVKFRGDSGLEDGISSNKP--DGLIGGFYDSGNNIKFTFPTAYTI 66
           K F+A   G+ PK++ V +RG+SGL+DG          L+GG+YD+G+ IKF FP AY +
Sbjct: 118 KFFNAQKSGKLPKHNNVSWRGNSGLQDGKGETGSFYKDLVGGYYDAGDAIKFNFPMAYAM 177

Query: 67  TLLSWSVIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQ-TIIYSQVGSA-S 126
           T+LSWSVIEY  KY    EL HVK++I+WGTDY LK     N+T+D    + SQVGS  +
Sbjct: 178 TMLSWSVIEYSAKYEAAGELTHVKELIKWGTDYFLKTF---NSTADSIDDLVSQVGSGNT 237

Query: 127 NDSNVQTNDN-CWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSREL 186
           +D N   ND+ CW RPEDM Y RPV+ C+   SDLA E+ AAL++AS+VFK++  YS++L
Sbjct: 238 DDGNTDPNDHYCWMRPEDMDYKRPVTTCNGGCSDLAAEMAAALASASIVFKDNKEYSKKL 297

Query: 187 AKAAEKLFEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTS 246
              A+ +++       +++G Y+   +   E+  FYNSS Y DE IW G W+++ATGN +
Sbjct: 298 VHGAKVVYQ----FGRTRRGRYS---AGTAESSKFYNSSMYWDEFIWGGAWMYYATGNVT 357

Query: 247 YLAYATDAVRFQLAQSEEAILTEG-FS-IGTISSVQLRTPGKHLFRS--YRSNK-LNIYG 306
           YL   T     + A +       G FS    ++  QL      LF S  Y   + L  + 
Sbjct: 358 YLNLITQPTMAKHAGAFWGGPYYGVFSWDNKLAGAQLLLSRLRLFLSPGYPYEEILRTFH 417

Query: 307 IASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYI 366
             +++    Y   F+  N+   GLI L      PLQ++  A+FL+ LYSDYLD   A+  
Sbjct: 418 NQTSIVMCSYLPIFNKFNRTNGGLIELNHGAPQPLQYSVNAAFLATLYSDYLD---AADT 477

Query: 367 SCIFAYPGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQ 426
              +  P F S   LR F+RSQ++YILG NP KMSYVV FG  +P HVHHR ASIP +  
Sbjct: 478 PGWYCGPNFYSTSVLRDFARSQIDYILGKNPRKMSYVVGFGTKYPRHVHHRGASIPKNKV 537

Query: 427 FHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALI 475
            ++C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEP++A NAGLVAAL+
Sbjct: 538 KYNCKGGWKWRDSKKPNPNTIEGAMVAGPDKRDGYRDVRMNYNYTEPTLAGNAGLVAALV 586

BLAST of Sgr019477 vs. ExPASy Swiss-Prot
Match: Q7XUK4 (Endoglucanase 12 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU3 PE=2 SV=2)

HSP 1 Score: 350.1 bits (897), Expect = 4.9e-95
Identity = 199/476 (41.81%), Positives = 281/476 (59.03%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGIS-SNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVI 70
           +GR PKN+ +K+RG+SGL DG   ++   GL+GG+YD+G+NIKF FP A+++T+LSWSVI
Sbjct: 127 SGRLPKNNGIKWRGNSGLSDGSDLTDVKGGLVGGYYDAGDNIKFHFPLAFSMTMLSWSVI 186

Query: 71  EYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDN 130
           EY  KY  + E DHV+++I+WGTDYLL    +  +T D+  +YSQVG A  +     +  
Sbjct: 187 EYSAKYKAVGEYDHVRELIKWGTDYLLLTFNSSASTIDK--VYSQVGIAKINGTQPDDHY 246

Query: 131 CWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQV 190
           CW RPEDM YPRPV    + A DL GE+ AAL+AAS+VF+++  YS++L   A  +++  
Sbjct: 247 CWNRPEDMAYPRPVQTAGS-APDLGGEMAAALAAASIVFRDNAAYSKKLVNGAAAVYKFA 306

Query: 191 TKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRF 250
                   G  T           +YNS+SY DE +W+  W+++ATGN +Y+ +ATD    
Sbjct: 307 -----RSSGRRTPYSRGNQYIEYYYNSTSYWDEYMWSAAWMYYATGNNTYITFATDPRLP 366

Query: 251 QLAQSEEAILTEGFSIGTISSVQLRTPGKHLFRSYRSNKLNI----------YGIASTLG 310
           + A++  +IL   FS   + S   + PG  L  S     LN           Y   +++ 
Sbjct: 367 KNAKAFYSIL--DFS---VFSWDNKLPGAELLLSRLRMFLNPGYPYEESLIGYHNTTSMN 426

Query: 311 QIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAY 370
              Y   F   N    GL       G PLQ+    SFL+ LY+DY++ +      C    
Sbjct: 427 MCTYFPRFGAFNFTKGGLAQFNHGKGQPLQYTVANSFLAALYADYMESVNVPGWYC---G 486

Query: 371 PGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAE 430
           P F +++ LR+F+RSQ+NYILGDNP KMSYVV +G  +P  +HHR AS P +G  +SC  
Sbjct: 487 PYFMTVDDLRSFARSQVNYILGDNPKKMSYVVGYGKKYPRRLHHRGASTPHNGIKYSCTG 546

Query: 431 GDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIAL 475
           G +W  +K ++PN+L GAMV GPDK D F D R      EP++  NAGLVAAL+AL
Sbjct: 547 GYKWRDTKGADPNVLVGAMVGGPDKNDQFKDARLTYAQNEPTLVGNAGLVAALVAL 586

BLAST of Sgr019477 vs. ExPASy Swiss-Prot
Match: P0C1U4 (Endoglucanase 9 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU1 PE=2 SV=1)

HSP 1 Score: 349.7 bits (896), Expect = 6.4e-95
Identity = 203/476 (42.65%), Positives = 283/476 (59.45%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPD-GLIGGFYDSGNNIKFTFPTAYTITLLSWSVI 70
           +G+ PKN+ V +RG+S ++DG+S       L+GG+YD+G+ +KF FP A+++TLLSWSVI
Sbjct: 128 SGKLPKNNNVHWRGNSCMKDGLSDPAVGRSLVGGYYDAGDAVKFNFPAAFSMTLLSWSVI 187

Query: 71  EYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGS-ASNDSNVQTND 130
           EY  KY  + EL H++D I+WG DY LK   +   T D+ ++  QVGS A++  + Q ND
Sbjct: 188 EYSAKYEAVGELGHIRDTIKWGADYFLKTFNSTADTIDRVVM--QVGSGATSPGSTQPND 247

Query: 131 N-CWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFE 190
           + CW RPED+ YPRPV +C    SDLA E+ A+L+AAS+VFK++  YS++L   A  LF 
Sbjct: 248 HYCWMRPEDIDYPRPVVECHA-CSDLAAEMAASLAAASIVFKDNKAYSQKLVHGATTLF- 307

Query: 191 QVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 250
              K     +G Y+   + G +A  FYNS+SY DE +W G+W++ ATGN+SYL  AT   
Sbjct: 308 ---KFARQNRGRYS---AGGSDAAKFYNSTSYWDEFVWGGSWMYLATGNSSYLQLATHP- 367

Query: 251 RFQLAQSEEAILTEGFSIGTISSVQLRTPGKHLFRSYR---------SNKLNIYGIASTL 310
             +LA+   A    G   G  S     T  + L    R            L  +   +++
Sbjct: 368 --KLAKHAGA-YWGGPDYGVFSWDNKLTGAQVLLSRLRLFLSPGYPYEEILRTFHNQTSI 427

Query: 311 GQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFA 370
               Y   F   N+   GLI L      PLQ+   A+FL+ LY DYL+        C   
Sbjct: 428 IMCSYLPIFKSFNRTKGGLIQLNHGRPQPLQYVVNAAFLASLYGDYLEAADTPGWYCGPH 487

Query: 371 YPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAE 430
           +  + +E LR F+R+Q+ YILG NP+KMSYVV +GN +P  VHHR ASIP +G  + C  
Sbjct: 488 F--YPIETLRNFARTQIEYILGKNPLKMSYVVGYGNRYPKRVHHRGASIPKNGVHYGCKG 547

Query: 431 GDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIAL 475
           G +W  +K  NPNI+ GAMVAGPD+ D F D R+   +TE ++A NAGLVAAL+AL
Sbjct: 548 GWKWRETKKPNPNIIVGAMVAGPDRHDGFKDVRKNYNYTEATLAGNAGLVAALVAL 587

BLAST of Sgr019477 vs. ExPASy Swiss-Prot
Match: O04478 (Endoglucanase 7 OS=Arabidopsis thaliana OX=3702 GN=KOR2 PE=2 SV=1)

HSP 1 Score: 345.9 bits (886), Expect = 9.2e-94
Identity = 202/481 (42.00%), Positives = 288/481 (59.88%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPD---GLIGGFYDSGNNIKFTFPTAYTITLLSWS 70
           +G+ PK + V +RGDSG +DG+    PD   GL+GG+YD G+N+KF FP A+++T+LSWS
Sbjct: 133 SGKLPKKNKVSWRGDSGTKDGL----PDVVGGLVGGYYDGGSNVKFHFPMAFSMTMLSWS 192

Query: 71  VIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTN 130
           +IEY  KY  ++E DH++D+++WGTDYLL  L   N+ +    IY+QVG    DS    +
Sbjct: 193 LIEYSHKYKAIDEYDHMRDVLKWGTDYLL--LTFNNSATRLDHIYTQVGGGLRDSESPDD 252

Query: 131 DNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKL-- 190
             CWQ+PEDM Y RPV    T A+DL  E+ AAL+AAS+VF +  +Y+++L K AE L  
Sbjct: 253 IYCWQKPEDMSYDRPVLS-STSAADLGAEVSAALAAASIVFTDKPDYAKKLKKGAETLYP 312

Query: 191 -FEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYAT 250
            F   ++      G  T        A+ FYNS+S  DE +WAG WL++ATGN +Y+ +AT
Sbjct: 313 FFRSKSRRKRYSDGQPT--------AQAFYNSTSMFDEFMWAGAWLYYATGNKTYIQFAT 372

Query: 251 DAVRFQLAQSEEAILTEGFSIGTISSVQLRTPGKHLFRS-YR---------SNKLNIYGI 310
                 + Q+ +A       +  + S   + PG  L  + YR          N LN Y  
Sbjct: 373 TP---SVPQTAKAFANRPELM--VPSWNNKLPGAMLLMTRYRLFLNPGFPYENMLNRYHN 432

Query: 311 ASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYIS 370
           A+ +    Y   ++V N+ + GL+ L      PL++ A ASFL+ L++DYL+  G     
Sbjct: 433 ATGITMCAYLKQYNVFNRTSGGLMQLNLGKPRPLEYVAHASFLASLFADYLNSTGVPGWY 492

Query: 371 CIFAYPGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQF 430
           C    P F     L+ F++SQ++YILGDNP+KMSYVV FG  FP  VHHR A+IP D + 
Sbjct: 493 C---GPTFVENHVLKDFAQSQIDYILGDNPLKMSYVVGFGKKFPRRVHHRGATIPNDKKR 552

Query: 431 HSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIA 475
            SC EG ++  +K  NPN ++GAMV GP+K D F D R     +EP+++ NAGLVAAL++
Sbjct: 553 RSCREGLKYRDTKNPNPNNITGAMVGGPNKFDEFHDLRNNYNASEPTLSGNAGLVAALVS 590

BLAST of Sgr019477 vs. ExPASy TrEMBL
Match: A0A6J1CED2 (Endoglucanase OS=Momordica charantia OX=3673 GN=LOC111010902 PE=3 SV=1)

HSP 1 Score: 791.2 bits (2042), Expect = 3.0e-225
Identity = 402/510 (78.82%), Positives = 428/510 (83.92%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +GRYPKNSPVKFRGDSGLEDG+  NK DGL+GGFYDSGNNIKFTFPTAYTITLLSWSVIE
Sbjct: 111 SGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIE 170

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNC 130
           YHPKYADMNELDHV+DIIRWGTDYLLKV VAPN TSDQ IIYSQVGSASNDSNVQTNDNC
Sbjct: 171 YHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAIIYSQVGSASNDSNVQTNDNC 230

Query: 131 WQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQVT 190
           WQRPED RYPRPVSKCDTRASDLAGEI+AALSAASLVFKEDNNYS ELAKAAEKLFE+VT
Sbjct: 231 WQRPEDTRYPRPVSKCDTRASDLAGEIVAALSAASLVFKEDNNYSGELAKAAEKLFEEVT 290

Query: 191 KLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRFQ 250
           KLDPS+QGTYT+VDSCGGEARNFYNSSSYMDELIWAGTWLF+ATGNTSYLAYATDAVRFQ
Sbjct: 291 KLDPSEQGTYTLVDSCGGEARNFYNSSSYMDELIWAGTWLFYATGNTSYLAYATDAVRFQ 350

Query: 251 LAQSEEAILTEG-------FSIGTISSVQLR------TPGKHLFRSYRSNKLNIYGIAST 310
           LAQS+E+ +  G       FS   +   +L        P +H   +  SNK +I   +  
Sbjct: 351 LAQSKESSIDRGIFDWNNKFSATAVLLTRLLYFHDIVYPYEHALGA-SSNKTDILMCSYL 410

Query: 311 LGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIF 370
           + Q          N+   GLI+LRPD GAPLQFAATASFLSKLYSDYLDLLGASY+SCIF
Sbjct: 411 IDQ--------HFNRTPGGLIILRPDGGAPLQFAATASFLSKLYSDYLDLLGASYMSCIF 470

Query: 371 AYPGFSLEKLRTFSRS-------------QLNYILGDNPMKMSYVVSFGNNFPTHVHHRS 430
           A PGFSLEKLRTFSRS             QLNYILGDNPMKMSYVV FG NFPTHVHHR 
Sbjct: 471 ANPGFSLEKLRTFSRSQASAIDFNGIELFQLNYILGDNPMKMSYVVGFGTNFPTHVHHRG 530

Query: 431 ASIPWDGQFHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASN 490
           ASIP DGQF+SCAEGDRWLLSKASNPNILSGA+V GPDK DHF DDR KPWFTEPSIASN
Sbjct: 531 ASIPRDGQFYSCAEGDRWLLSKASNPNILSGALVTGPDKFDHFSDDRGKPWFTEPSIASN 590

Query: 491 AGLVAALIALHDYPGDTSGFNGKNLGIDQM 495
           AGLVAAL+ALHDYPGDTS FNGK+LGIDQM
Sbjct: 591 AGLVAALVALHDYPGDTSDFNGKDLGIDQM 611

BLAST of Sgr019477 vs. ExPASy TrEMBL
Match: A0A5A7UU46 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold96G002590 PE=3 SV=1)

HSP 1 Score: 788.1 bits (2034), Expect = 2.6e-224
Identity = 395/495 (79.80%), Positives = 426/495 (86.06%), Query Frame = 0

Query: 8   LFSAGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWS 67
           LFSAGRYPK+SPVKFRGDSGL+DG+SSNKPDGLIGGFYDSGNN+KFTFPTAYTITLLSWS
Sbjct: 80  LFSAGRYPKSSPVKFRGDSGLKDGVSSNKPDGLIGGFYDSGNNMKFTFPTAYTITLLSWS 139

Query: 68  VIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTN 127
           VIEYHPKYADMNELDHVKDIIRWGT+YLLKV VAPNATSDQTIIYSQVGS+SN+S  QTN
Sbjct: 140 VIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQTIIYSQVGSSSNESKAQTN 199

Query: 128 DNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFE 187
           DNCWQRPEDM YPRPVS CD RASDLAGEI+AALSA+SLVF+ED NYS ELAKAAEKLF+
Sbjct: 200 DNCWQRPEDMMYPRPVSTCDARASDLAGEIVAALSASSLVFREDTNYSGELAKAAEKLFQ 259

Query: 188 QVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 247
           QVTKLDP +QGTY+ VDSCGGEAR FYNSSSY DELIWAGTWLFFATGNTSYL+YATDAV
Sbjct: 260 QVTKLDPIEQGTYSSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLSYATDAV 319

Query: 248 RFQLAQSEEAILTEGFS--IGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQI---D 307
           RFQLAQSEEA +  G        S+  +       F           G++S + +I    
Sbjct: 320 RFQLAQSEEASIGRGIFNWNNKFSATAVLLTRLLYFHDTGYPYEYALGVSSNMTEILMCS 379

Query: 308 YPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGF 367
           Y  D    N+  SGLILL PDD APLQFAATASFLSKLYSDYLDLLGASY+SCIFA P F
Sbjct: 380 YLIDQHF-NRTPSGLILLSPDDKAPLQFAATASFLSKLYSDYLDLLGASYMSCIFANPRF 439

Query: 368 SLEKLRTFSRSQ---LNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEG 427
           SLEKLR+FS+SQ   LNYILGDNP+KMSYVV +GNNFPTHVHHR+ASIPWDGQF+SCAEG
Sbjct: 440 SLEKLRSFSKSQASALNYILGDNPLKMSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEG 499

Query: 428 DRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYPG 487
           DRWLLSKASNPNILSGAMVAGPDK DHF DDREKPWFTEPSIASNAGLVAAL+AL+DYPG
Sbjct: 500 DRWLLSKASNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG 559

Query: 488 DTSGFNGKNLGIDQM 495
           DT  FNGKNLGIDQM
Sbjct: 560 DTPDFNGKNLGIDQM 573

BLAST of Sgr019477 vs. ExPASy TrEMBL
Match: A0A6J1ACP6 (Endoglucanase OS=Herrania umbratica OX=108875 GN=LOC110416795 PE=3 SV=1)

HSP 1 Score: 585.9 bits (1509), Expect = 1.9e-163
Identity = 310/492 (63.01%), Positives = 356/492 (72.36%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +G YP  SP+KFRG SGL DG + N    L+GGFYDSGNNIKFTFPTAYTITLLSWSVIE
Sbjct: 128 SGNYPSKSPIKFRGSSGLRDGNTRNTRADLVGGFYDSGNNIKFTFPTAYTITLLSWSVIE 187

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDS-NVQTND- 130
           YH KYAD+ EL+H+KDIIRWG+DYLLKV VAPNATS+ TI+YSQVGSA ND+ N  +ND 
Sbjct: 188 YHQKYADIGELEHIKDIIRWGSDYLLKVFVAPNATSEPTILYSQVGSAGNDTRNPGSNDI 247

Query: 131 NCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQ 190
           NCWQRPEDM Y RPVS CD  ASDLAGEI+AALSAAS+VFKE+N +S+ L KAAEKL+  
Sbjct: 248 NCWQRPEDMSYYRPVSVCDETASDLAGEIVAALSAASIVFKEENEHSQRLTKAAEKLYGI 307

Query: 191 VTKLDP-SKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 250
             K D   K  TYT +D+CGGEAR FYNSS Y DEL+W GTWLFFATGN +YL YAT   
Sbjct: 308 AEKKDKIHKAVTYTTIDACGGEARKFYNSSGYKDELVWGGTWLFFATGNYTYLDYAT--T 367

Query: 251 RFQLAQSEEAILTEGF-----SIGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQID 310
            F  A + E I  +G       +   +++  R    H         L +    +      
Sbjct: 368 NFAAASNNETIADKGIFYWNNKLTATAALLTRLRFFHDLGFPYEKALGLSSEMTDQLMCS 427

Query: 311 YPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGF 370
           Y ++ +  N+   GLILLRPD G PLQFAATASFLSKLY DYL LLG S  +C   Y GF
Sbjct: 428 YLSEQNF-NRTPGGLILLRPDYGEPLQFAATASFLSKLYKDYLTLLGRSGGNCT-KYDGF 487

Query: 371 SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEGDRW 430
           SLE L++FS SQ+NYILGDNP KMSY+V FG+++PT VHHRSASIPWDGQFHSCAEGDRW
Sbjct: 488 SLEMLQSFSISQVNYILGDNPRKMSYMVGFGDHYPTKVHHRSASIPWDGQFHSCAEGDRW 547

Query: 431 LLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYPGDTS 490
           L S+  NPNIL GAMVAGPD  D F D+R+KPWFTEPSIASNAGLVAALIA H       
Sbjct: 548 LHSQDRNPNILWGAMVAGPDHFDGFSDERDKPWFTEPSIASNAGLVAALIANH------- 605

Query: 491 GFNGKNLGIDQM 495
                NLG+DQM
Sbjct: 608 ---APNLGLDQM 605

BLAST of Sgr019477 vs. ExPASy TrEMBL
Match: A0A6P6ANB4 (Endoglucanase OS=Durio zibethinus OX=66656 GN=LOC111311196 PE=3 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 1.3e-162
Identity = 306/496 (61.69%), Positives = 359/496 (72.38%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +G YP NSP++FRG SGL+DG  SN P  L+GGFYDSGNNIKFTFPTAYTITLLSWSVIE
Sbjct: 125 SGNYPSNSPIRFRGRSGLQDGNLSNTPADLVGGFYDSGNNIKFTFPTAYTITLLSWSVIE 184

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDS--NVQTND 130
           YH KYAD+ EL H+KD+I+WG+DYLLKV +APNATSD TI+YSQVGSA NDS  +V  + 
Sbjct: 185 YHQKYADIGELGHIKDVIKWGSDYLLKVFIAPNATSDPTILYSQVGSAGNDSQNSVPNDI 244

Query: 131 NCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQ 190
           NCWQRPE+M Y RPVS CD  ASDLAGEI+AAL+AAS+VFKE+N YS+ L KAA+KL+E 
Sbjct: 245 NCWQRPEEMSYKRPVSVCDATASDLAGEIVAALAAASIVFKEENEYSQGLIKAAKKLYEI 304

Query: 191 VTKLDP-SKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 250
             K D   K  TYT +D+CGGEAR FYNSS Y DEL+W  TWLFFATGN +YL YAT   
Sbjct: 305 TEKEDQIHKAATYTTIDACGGEARKFYNSSGYKDELVWGETWLFFATGNYTYLDYAT--T 364

Query: 251 RFQLAQSEEAILTEG-------FSIGTISSVQLRTPGKHLFRSYRSNKLNIYGIASTLGQ 310
            F  A + E I  +G        +  T+   +LR      FR          GI+S +  
Sbjct: 365 NFASAANNETIDDKGIFYWNNKLTANTVLLTRLR-----FFRDLGFPYEEALGISSNMTD 424

Query: 311 IDYPTDFSVNN--KKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFA 370
               +  S  N  +   GLILLRPD   PLQFAATASFLSKLY+DYL LL  S  +C   
Sbjct: 425 HIMCSYLSEQNFYRTPGGLILLRPDYSGPLQFAATASFLSKLYNDYLTLLHRSGWNC--T 484

Query: 371 YPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAE 430
              FSLE L++FS SQ+NYILGDNP KMSY+V FG+++PT VHHRSASIPWDGQ++SC E
Sbjct: 485 NDAFSLEMLQSFSTSQVNYILGDNPRKMSYMVGFGDHYPTQVHHRSASIPWDGQYYSCDE 544

Query: 431 GDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHDYP 490
           GDRWL S+  NPNIL GAMVAGPD+ D F D+R+KPWFTEPSIASNAGLVAALIA  D P
Sbjct: 545 GDRWLHSQDRNPNILLGAMVAGPDQFDDFSDERDKPWFTEPSIASNAGLVAALIAHLDPP 604

Query: 491 GDTSGFNGKNLGIDQM 495
             ++   G NLG+D M
Sbjct: 605 RISAASKGPNLGLDLM 611

BLAST of Sgr019477 vs. ExPASy TrEMBL
Match: A0A061FQ48 (Endoglucanase OS=Theobroma cacao OX=3641 GN=TCM_035466 PE=3 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 2.2e-162
Identity = 311/498 (62.45%), Positives = 359/498 (72.09%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +G YP  SP+KFRG SGL DG + N    L+GGFYDSGNNIKFTFP AYTITLLSWSVIE
Sbjct: 128 SGNYPSKSPIKFRGSSGLRDGNTGNTRADLVGGFYDSGNNIKFTFPAAYTITLLSWSVIE 187

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDS-NVQTND- 130
           YH KY D+ EL+H+KD+IRWG+DYLLKV VAPNATS+ TI+YSQVGSA ND+ N  +ND 
Sbjct: 188 YHQKYEDIGELEHIKDVIRWGSDYLLKVFVAPNATSEPTILYSQVGSAGNDTQNPGSNDI 247

Query: 131 NCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKLFEQ 190
           NCWQRPEDM Y RPVS CD  ASDLAGEI+AALSAAS+VFKE+N YS+ L KAAEKL+  
Sbjct: 248 NCWQRPEDMNYERPVSVCDETASDLAGEIVAALSAASIVFKEENEYSQRLTKAAEKLYGI 307

Query: 191 VTKLDP-SKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAV 250
             K D   K  TYT +D+CGGEAR FYNSS Y DEL+W GTWLFFATGN +YL YAT   
Sbjct: 308 TEKKDKIHKAVTYTTIDACGGEARKFYNSSGYKDELVWGGTWLFFATGNYTYLDYAT--T 367

Query: 251 RFQLAQSEEAILTEGF-----SIGTISSVQLRTPGKH-LFRSYR-----SNKLNIYGIAS 310
            F  A + E I  +G       +   +++  R    H L   Y      S+K+    + S
Sbjct: 368 NFAAASNNETIADKGIFYWNNKLTATAALLTRLRFFHDLGFPYEKALGLSSKMTDQLMCS 427

Query: 311 TLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCI 370
            L + ++       N+   GLILLRPD G PLQFAATASFLSKLY DYL LLG S  +C 
Sbjct: 428 YLSKQNF-------NRTPGGLILLRPDYGEPLQFAATASFLSKLYKDYLTLLGRSGGNCT 487

Query: 371 FAYPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSC 430
               GFSLE L++FS SQ+NYILGDNP KMSY+V FG+++PT VHHRSASIPWDGQFHSC
Sbjct: 488 -KCDGFSLEMLQSFSISQVNYILGDNPRKMSYMVGFGDHYPTKVHHRSASIPWDGQFHSC 547

Query: 431 AEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIALHD 490
           AEG+RWL S+  NPNIL GAMVAGPD  D F D+R+KPWFTEPSIASNAGLVAALIA H 
Sbjct: 548 AEGNRWLRSQDRNPNILLGAMVAGPDHFDGFSDERDKPWFTEPSIASNAGLVAALIANH- 605

Query: 491 YPGDTSGFNGKNLGIDQM 495
                    G NLG+DQM
Sbjct: 608 ---------GPNLGLDQM 605

BLAST of Sgr019477 vs. TAIR 10
Match: AT5G49720.1 (glycosyl hydrolase 9A1 )

HSP 1 Score: 353.6 bits (906), Expect = 3.1e-97
Identity = 213/482 (44.19%), Positives = 290/482 (60.17%), Query Frame = 0

Query: 7   KLFSA---GRYPKNSPVKFRGDSGLEDGISSNKP--DGLIGGFYDSGNNIKFTFPTAYTI 66
           K F+A   G+ PK++ V +RG+SGL+DG          L+GG+YD+G+ IKF FP AY +
Sbjct: 118 KFFNAQKSGKLPKHNNVSWRGNSGLQDGKGETGSFYKDLVGGYYDAGDAIKFNFPMAYAM 177

Query: 67  TLLSWSVIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQ-TIIYSQVGSA-S 126
           T+LSWSVIEY  KY    EL HVK++I+WGTDY LK     N+T+D    + SQVGS  +
Sbjct: 178 TMLSWSVIEYSAKYEAAGELTHVKELIKWGTDYFLKTF---NSTADSIDDLVSQVGSGNT 237

Query: 127 NDSNVQTNDN-CWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSREL 186
           +D N   ND+ CW RPEDM Y RPV+ C+   SDLA E+ AAL++AS+VFK++  YS++L
Sbjct: 238 DDGNTDPNDHYCWMRPEDMDYKRPVTTCNGGCSDLAAEMAAALASASIVFKDNKEYSKKL 297

Query: 187 AKAAEKLFEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTS 246
              A+ +++       +++G Y+   +   E+  FYNSS Y DE IW G W+++ATGN +
Sbjct: 298 VHGAKVVYQ----FGRTRRGRYS---AGTAESSKFYNSSMYWDEFIWGGAWMYYATGNVT 357

Query: 247 YLAYATDAVRFQLAQSEEAILTEG-FS-IGTISSVQLRTPGKHLFRS--YRSNK-LNIYG 306
           YL   T     + A +       G FS    ++  QL      LF S  Y   + L  + 
Sbjct: 358 YLNLITQPTMAKHAGAFWGGPYYGVFSWDNKLAGAQLLLSRLRLFLSPGYPYEEILRTFH 417

Query: 307 IASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYI 366
             +++    Y   F+  N+   GLI L      PLQ++  A+FL+ LYSDYLD   A+  
Sbjct: 418 NQTSIVMCSYLPIFNKFNRTNGGLIELNHGAPQPLQYSVNAAFLATLYSDYLD---AADT 477

Query: 367 SCIFAYPGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQ 426
              +  P F S   LR F+RSQ++YILG NP KMSYVV FG  +P HVHHR ASIP +  
Sbjct: 478 PGWYCGPNFYSTSVLRDFARSQIDYILGKNPRKMSYVVGFGTKYPRHVHHRGASIPKNKV 537

Query: 427 FHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALI 475
            ++C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEP++A NAGLVAAL+
Sbjct: 538 KYNCKGGWKWRDSKKPNPNTIEGAMVAGPDKRDGYRDVRMNYNYTEPTLAGNAGLVAALV 586

BLAST of Sgr019477 vs. TAIR 10
Match: AT1G65610.1 (Six-hairpin glycosidases superfamily protein )

HSP 1 Score: 345.9 bits (886), Expect = 6.5e-95
Identity = 202/481 (42.00%), Positives = 288/481 (59.88%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPD---GLIGGFYDSGNNIKFTFPTAYTITLLSWS 70
           +G+ PK + V +RGDSG +DG+    PD   GL+GG+YD G+N+KF FP A+++T+LSWS
Sbjct: 133 SGKLPKKNKVSWRGDSGTKDGL----PDVVGGLVGGYYDGGSNVKFHFPMAFSMTMLSWS 192

Query: 71  VIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTN 130
           +IEY  KY  ++E DH++D+++WGTDYLL  L   N+ +    IY+QVG    DS    +
Sbjct: 193 LIEYSHKYKAIDEYDHMRDVLKWGTDYLL--LTFNNSATRLDHIYTQVGGGLRDSESPDD 252

Query: 131 DNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNYSRELAKAAEKL-- 190
             CWQ+PEDM Y RPV    T A+DL  E+ AAL+AAS+VF +  +Y+++L K AE L  
Sbjct: 253 IYCWQKPEDMSYDRPVLS-STSAADLGAEVSAALAAASIVFTDKPDYAKKLKKGAETLYP 312

Query: 191 -FEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYAT 250
            F   ++      G  T        A+ FYNS+S  DE +WAG WL++ATGN +Y+ +AT
Sbjct: 313 FFRSKSRRKRYSDGQPT--------AQAFYNSTSMFDEFMWAGAWLYYATGNKTYIQFAT 372

Query: 251 DAVRFQLAQSEEAILTEGFSIGTISSVQLRTPGKHLFRS-YR---------SNKLNIYGI 310
                 + Q+ +A       +  + S   + PG  L  + YR          N LN Y  
Sbjct: 373 TP---SVPQTAKAFANRPELM--VPSWNNKLPGAMLLMTRYRLFLNPGFPYENMLNRYHN 432

Query: 311 ASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYIS 370
           A+ +    Y   ++V N+ + GL+ L      PL++ A ASFL+ L++DYL+  G     
Sbjct: 433 ATGITMCAYLKQYNVFNRTSGGLMQLNLGKPRPLEYVAHASFLASLFADYLNSTGVPGWY 492

Query: 371 CIFAYPGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQF 430
           C    P F     L+ F++SQ++YILGDNP+KMSYVV FG  FP  VHHR A+IP D + 
Sbjct: 493 C---GPTFVENHVLKDFAQSQIDYILGDNPLKMSYVVGFGKKFPRRVHHRGATIPNDKKR 552

Query: 431 HSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIA 475
            SC EG ++  +K  NPN ++GAMV GP+K D F D R     +EP+++ NAGLVAAL++
Sbjct: 553 RSCREGLKYRDTKNPNPNNITGAMVGGPNKFDEFHDLRNNYNASEPTLSGNAGLVAALVS 590

BLAST of Sgr019477 vs. TAIR 10
Match: AT4G24260.1 (glycosyl hydrolase 9A3 )

HSP 1 Score: 330.9 bits (847), Expect = 2.2e-90
Identity = 206/490 (42.04%), Positives = 281/490 (57.35%), Query Frame = 0

Query: 4   TDKKLFSA---GRYPKN-SPVKFRGDSGLEDGISSNKP--DGLIGGFYDSGNNIKFTFPT 63
           T  K F+A   G+ PKN   V +R DS L+DG          L+GG+YD+G++IKF FP 
Sbjct: 115 TALKFFNAQQSGKLPKNIYNVSWRHDSCLQDGKGDPGQCYKDLVGGYYDAGDSIKFNFPM 174

Query: 64  AYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQT-IIYSQVG 123
           +Y +T+LSWSVIEY  KY    EL+HVK++I+WGTDY LK     N+++D   ++  QVG
Sbjct: 175 SYAMTMLSWSVIEYSAKYQAAGELEHVKELIKWGTDYFLKTF---NSSADNIYVMVEQVG 234

Query: 124 S--ASNDSNVQTNDNCWQRPEDMRYPRPVSKCDTRASDLAGEIIAALSAASLVFKEDNNY 183
           S  +   S +  +  CW RPED+ Y R VS+C +  SDLA E+ AAL++AS+VFK++  Y
Sbjct: 235 SGVSGRGSELHNDHYCWMRPEDIHYKRTVSQCYSSCSDLAAEMAAALASASIVFKDNRLY 294

Query: 184 SRELAKAAEKLFEQVTKLDPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFAT 243
           S+ L   A+ L+   T    + +  Y+     G E+  FYNSS + DEL+W G WL++AT
Sbjct: 295 SKNLVHGAKTLYRFAT----TSRNRYS---QNGKESSKFYNSSMFEDELLWGGAWLYYAT 354

Query: 244 GNTSYLAYATDAVRFQLAQSEEAILTEGFSIGTIS------SVQLRTPGKHLFRS---YR 303
           GN +YL   T      +A+   A     +  G  S        QL      LF S     
Sbjct: 355 GNVTYLERVTS---HHMAEKAGAFGNSPY-YGVFSWDNKLPGAQLLLTRMRLFLSPGYPY 414

Query: 304 SNKLNIYGIASTLGQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYL 363
            + L+ +   +      Y   +   N+   GLI L      PLQ+ A A+FL+ L+SDYL
Sbjct: 415 EDMLSEFHNQTGRVMCSYLPYYKKFNRTNGGLIQLNHGAPQPLQYVANAAFLAALFSDYL 474

Query: 364 DLLGASYISCIFAYPGF-SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRS 423
           +   A+     +  P F + E LR FSRSQ++YILG NP KMSYVV +G  +P  VHHR 
Sbjct: 475 E---AADTPGWYCGPNFYTTEFLRNFSRSQIDYILGKNPRKMSYVVGYGQRYPKQVHHRG 534

Query: 424 ASIPWDGQFHSCAEGDRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASN 475
           ASIP      +C  G +W  SK +NPN ++GAMVAGPDK D F D R    +TEP++A N
Sbjct: 535 ASIP-KNMKETCTGGFKWKKSKKNNPNAINGAMVAGPDKHDGFHDIRTNYNYTEPTLAGN 586

BLAST of Sgr019477 vs. TAIR 10
Match: AT1G19940.1 (glycosyl hydrolase 9B5 )

HSP 1 Score: 275.4 bits (703), Expect = 1.1e-73
Identity = 173/472 (36.65%), Positives = 258/472 (54.66%), Query Frame = 0

Query: 16  KNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKY 75
           +N+ + +RGDSGL+DG  S     L  G YD+G+++KF FP A+T T+LSWS++EY  + 
Sbjct: 69  ENNEISWRGDSGLKDG--SEASIDLSKGLYDAGDHMKFGFPMAFTATVLSWSILEYGDQM 128

Query: 76  ADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNCWQRPE 135
           A +N LDH KD ++W TD+L+    +PN      ++Y QVG      +  T+  CW RPE
Sbjct: 129 ASLNLLDHAKDSLKWTTDFLINAHPSPN------VLYIQVG------DPVTDHKCWDRPE 188

Query: 136 DMRYPRPVSKCDTR--ASDLAGEIIAALSAASLVFKE-DNNYSRELAKAAEKLFEQVTKL 195
            M   R ++K DT+   +++A E  AA++AASLVFKE D  YS  L K A++LF+     
Sbjct: 189 TMTRKRTLTKIDTKTPGTEVAAETAAAMAAASLVFKESDTKYSSTLLKHAKQLFD----F 248

Query: 196 DPSKQGTYTMVDSCGGEARNFYNSSSYMDELIWAGTWLFFATGNTSYLAYATDAVRFQLA 255
             + +G+Y++      E +++YNS+ Y DEL+WA +WL+ AT + +YL +          
Sbjct: 249 ADNNRGSYSVNIP---EVQSYYNSTGYGDELLWAASWLYHATEDQTYLDFV--------- 308

Query: 256 QSEEAILTEGFSIGTISSVQLRTPGKHL-------FRSYRSNKLNIYGIASTLGQID--- 315
            SE       F   +  S   + PG H+       F+   S    + G   T   +    
Sbjct: 309 -SENGEEFGNFGSPSWFSWDNKLPGTHILLSRLTFFKKGLSGSKGLQGFKETAEAVMCGL 368

Query: 316 YPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFAYPGF 375
            P+  +  + +T G ++   +  A LQ   +++FL+ LYSDY+   G   +SC  +   F
Sbjct: 369 IPSSPTATSSRTDGGLIWVSEWNA-LQHPVSSAFLATLYSDYMLTSGVKELSC--SDQSF 428

Query: 376 SLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASIPWDGQFHSCAEGDRW 435
               LR F+RSQ +Y+LG NP KMSY+V +G  +P  VHHR ASIP D     C +G +W
Sbjct: 429 KPSDLRKFARSQADYMLGKNPEKMSYLVGYGEKYPEFVHHRGASIPADAT-TGCKDGFKW 488

Query: 436 LLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIAL 475
           L S   NPN+  GA+V GP   D F+D R      EPS  ++A +V  L +L
Sbjct: 489 LNSDEPNPNVAYGALVGGPFLNDTFIDARNNSMQNEPSTYNSALVVGLLSSL 505

BLAST of Sgr019477 vs. TAIR 10
Match: AT4G11050.1 (glycosyl hydrolase 9C3 )

HSP 1 Score: 270.0 bits (689), Expect = 4.5e-72
Identity = 179/480 (37.29%), Positives = 251/480 (52.29%), Query Frame = 0

Query: 11  AGRYPKNSPVKFRGDSGLEDGISSNKPDGLIGGFYDSGNNIKFTFPTAYTITLLSWSVIE 70
           +G  P N  V +R  SGL DG SS     L+GG+YD+G+N+KF  P A+T+T + WS+IE
Sbjct: 43  SGHLPPNQRVSWRSHSGLYDGKSSGV--DLVGGYYDAGDNVKFGLPMAFTVTTMCWSIIE 102

Query: 71  YHPKYADMNELDHVKDIIRWGTDYLLKVLVAPNATSDQTIIYSQVGSASNDSNVQTNDNC 130
           Y  +     EL H  D ++WGTDY +K    PN      ++Y +VG   +D        C
Sbjct: 103 YGGQLESNGELGHAIDAVKWGTDYFIKAHPEPN------VLYGEVGDGKSD------HYC 162

Query: 131 WQRPEDMRYPRPVSKCDTR--ASDLAGEIIAALSAASLVF-KEDNNYSRELAKAAEKLFE 190
           WQRPE+M   R   K D     SDLAGE  AA++AAS+VF + D +YS EL + A +LFE
Sbjct: 163 WQRPEEMTTDRRAYKIDRNNPGSDLAGETAAAMAAASIVFRRSDPSYSAELLRHAHQLFE 222

Query: 191 QVTKLDPSKQGTYTMVDSCGGEARNFYNS-SSYMDELIWAGTWLFFATGNTSYLAY---- 250
              K     +G Y   DS    A+ +Y S S Y DEL+WA  WL+ AT +  YL Y    
Sbjct: 223 FADKY----RGKY---DSSITVAQKYYRSVSGYNDELLWAAAWLYQATNDKYYLDYLGKN 282

Query: 251 --ATDAVRFQLAQSEEAILTEGFSIGTISSVQLRTPGKH--LFRSYRSNKLNIYGIASTL 310
             +     + + +    +   G        +     G+H  +F  Y+        + S L
Sbjct: 283 GDSMGGTGWSMTEFGWDVKYAGVQTLVAKVLMQGKGGEHTAVFERYQQKAEQF--MCSLL 342

Query: 311 GQIDYPTDFSVNNKKTSGLILLRPDDGAPLQFAATASFLSKLYSDYLDLLGASYISCIFA 370
           G+       + N KKT G ++ R      +QF  +ASFL+ +YSDYL     S    + +
Sbjct: 343 GK------STKNIKKTPGGLIFR-QSWNNMQFVTSASFLATVYSDYLSY---SKRDLLCS 402

Query: 371 YPGFSLEKLRTFSRSQLNYILGDNPMKMSYVVSFGNNFPTHVHHRSASI---PWDGQFHS 430
               S  +L  FS+SQ++YILGDNP   SY+V +G N+P  VHHR +SI     D +F +
Sbjct: 403 QGNISPSQLLEFSKSQVDYILGDNPRATSYMVGYGENYPRQVHHRGSSIVSFNVDQKFVT 462

Query: 431 CAEG-DRWLLSKASNPNILSGAMVAGPDKLDHFLDDREKPWFTEPSIASNAGLVAALIAL 475
           C  G   W   K S+PN+L+GA+V GPD  D+F D R+    TEP+  +NA L+  L  L
Sbjct: 463 CRGGYATWFSRKGSDPNVLTGALVGGPDAYDNFADQRDNYEQTEPATYNNAPLLGVLARL 489
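
The Identity and Positives figures in each HSP header above are a count of matching alignment columns divided by the alignment length, gaps included. The sketch below recomputes those percentages from the printed fractions as a sanity check. The names `parse_hsp_stats` and `percent` are hypothetical helpers written for this illustration and are not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header. "Posi?tives" also accepts
# the misspelling "Postives" in case the line has not been corrected.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, length: int) -> float:
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positives counts plus the reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    i_n, i_len, i_pct, p_n, p_len, p_pct = m.groups()
    return {
        "identity": (int(i_n), int(i_len), float(i_pct)),
        "positives": (int(p_n), int(p_len), float(p_pct)),
    }


line = "Identity = 395/495 (79.80%), Positives = 426/495 (86.06%), Query Frame = 0"
stats = parse_hsp_stats(line)
for key, (n, length, reported) in stats.items():
    # Compare the recomputed percentage with the one printed in the report.
    print(key, percent(n, length), reported)
```

The same check can be run on any of the HSP header lines in this record by passing the line text to `parse_hsp_stats`.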

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_031745535.1  2.0e-231  80.89     endoglucanase 9-like [Cucumis sativus]
KAE8653204.1    1.1e-226  80.20     hypothetical protein Csa_019838 [Cucumis sativus]
XP_022140170.1  6.3e-225  78.82     endoglucanase 25-like [Momordica charantia]
KAA0057069.1    5.3e-224  79.80     endoglucanase 25 [Cucumis melo var. makuwa]
KAG6573332.1    2.8e-196  71.23     Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia]

Match Name      E-value   Identity  Description
Q84R49          2.6e-96   42.50     Endoglucanase 10 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU2 PE=2 SV=1
Q38890          4.4e-96   44.19     Endoglucanase 25 OS=Arabidopsis thaliana OX=3702 GN=KOR PE=1 SV=1
Q7XUK4          4.9e-95   41.81     Endoglucanase 12 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU3 PE=2 SV=2
P0C1U4          6.4e-95   42.65     Endoglucanase 9 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU1 PE=2 SV=1
O04478          9.2e-94   42.00     Endoglucanase 7 OS=Arabidopsis thaliana OX=3702 GN=KOR2 PE=2 SV=1

Match Name      E-value   Identity  Description
A0A6J1CED2      3.0e-225  78.82     Endoglucanase OS=Momordica charantia OX=3673 GN=LOC111010902 PE=3 SV=1
A0A5A7UU46      2.6e-224  79.80     Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold96G002590 ...
A0A6J1ACP6      1.9e-163  63.01     Endoglucanase OS=Herrania umbratica OX=108875 GN=LOC110416795 PE=3 SV=1
A0A6P6ANB4      1.3e-162  61.69     Endoglucanase OS=Durio zibethinus OX=66656 GN=LOC111311196 PE=3 SV=1
A0A061FQ48      2.2e-162  62.45     Endoglucanase OS=Theobroma cacao OX=3641 GN=TCM_035466 PE=3 SV=1

Match Name      E-value   Identity  Description
AT5G49720.1     3.1e-97   44.19     glycosyl hydrolase 9A1
AT1G65610.1     6.5e-95   42.00     Six-hairpin glycosidases superfamily protein
AT4G24260.1     2.2e-90   42.04     glycosyl hydrolase 9A3
AT1G19940.1     1.1e-73   36.65     glycosyl hydrolase 9B5
AT4G11050.1     4.5e-72   37.29     glycosyl hydrolase 9C3
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                           Source       Source Term      Source Description        Alignment
IPR001701  Glycoside hydrolase family 9              PFAM         PF00759          Glyco_hydro_9             coord: 11..471; e-value: 3.1E-94; score: 316.9
IPR012341  Six-hairpin glycosidase-like superfamily  GENE3D       1.50.10.10                                 coord: 8..477; e-value: 1.2E-110; score: 372.5
None       No IPR available                          PANTHER      PTHR22298        ENDO-1,4-BETA-GLUCANASE   coord: 10..494
None       No IPR available                          PANTHER      PTHR22298:SF109  ENDOGLUCANASE             coord: 10..494
IPR008928  Six-hairpin glycosidase superfamily       SUPERFAMILY  48208            Six-hairpin glycosidases  coord: 11..481

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr019477.1   Sgr019477.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030245      cellulose catabolic process
biological_process  GO:0005975      carbohydrate metabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0008810      cellulase activity
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds