CsaV3_5G025530 (gene) Cucumber (Chinese Long) v3

NameCsaV3_5G025530
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionCellulase (glycosyl hydrolase family 5) protein
Locationchr5 : 20666575 .. 20668603 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAGGACAACCGTGGAAAAATCTCGCTTTAGCTTGTGTTTTCGTGTTGTTAACTTTTAAGGCCTATTCATTGCCTCTCTCGACAAATGGAAGATGGATTGTTGAGGCAACAACCGGCCAACGTGTGAAGTTGATGTGTGTAAATTGGCCGGGACACATGCAAGCCATGGTGGCAGAGGGTCTTCATCTCAGACCGCTTGACGACATTGCAACCATGGTGGCGAACTTGCGGTTTAATTGTGTGCGTTTGACATATTCAATTCACATGTTCACACGCTATGCTAATTTGACTGTGAAGCAATCCTTTGAGAATTTTGATATGAAAGAAGCCATAGCAGGTATAGCCCAAAACAATCCTACTATATTGAACATGAAGGTCGTTGAGGCTTATGAAGCAGTAGTTGATTCACTTGGTGCACATGGAGTTATGGTAGTTTCTGATAATCATATAAGCCAACCAAGATGGTGTTGTAGCAATGACGATGGCAATGGCTTTTTTGGAGATCGCTATTTCAATTCTCAAGAATGGCTAGAAGGACTTAGTTTGGCAGCTCAAAGCCTAAAAAACAAACCTCAGGTCTGGTTTATTTATTTACTTTGTTTTCATTAAACTACTTCTAAAAGAACTATAAAATAGGCTGAAGCATATAATAAAAAGACAACAGAAAAAGCCTTAATTTAACACATTTTAGACAATAAATTAACTACATCTAAATTATTAAAGAAGATACAAAAAAAAACAATTTAATCCCTCCAATTCATATATAACACAACTCTTAAAATAGAGGGGAAGTATTTGTACTTCAACCTATAAGTTATCATTAATTGTTTTATATGACGTTTTGACAAATTAGATGATAATTTATATGTAAATTTAAAGGTGGTGGCAATGAGTTTGAGAAACGAACCACGAGGACCAAATCAAAATGTGGAGATGTGGTTTCAATATATGAGCCAAGGAGCTAAACTTGTTCACCAAATTAACCCAAACGCTTTAGTAGTGGTTTCTGGACTAAGTTATGACACCGATCTAAGCTTCTTGAAGAATAGGTCAATGGGCTTCAATTTGGACAACAAGCTTGTATTTGAAGCTCACTTGTACTCCTTCACAAACAACATGAGTGATTATTGGACGTCAAAGCCATTGAACACGTTTTGTGCTAATGTCAATCAAGGATTTGAAGACCGTGCTGGATTTCTTGTGAGAGGACAAAACCCAATACCTCTCTTTGTGAGTGAGTTTGGGATCAACCAAATGGGAGCAAATGAAGGACAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACTGAGAATGATTTTGATTGGGGCCTATGGGCGTTGCAAGGTAGCTACTACTATAGAGAAGGTGTGAAAAACGATGAAGAAACCTTTGGCGTCTTAGATGCCAAGTTTACAAATGTCAAAAACCCGAAATTCCTTCAGAAGTTTCAGCTTATGCAAACCAAGCTTCAAGGTATATAACGTTTAAAAAGTGATACATACATTTGAATCATGTTTTAGTTAAGAATAATGTTAGTCTCTATTCATTTTTTCTTCTCACCGATGAATCACACAATTGCTCTTTAACACAGATCCAAGCTCAAATCTCACAACCTCATTCATAATGTACCATCCTCTTAGTGGTGAGTGCGTGCAAGTGAACAAAAAGTACCAACTAGGAGTTAGTAGCTGCAAGACATCAAATCGTTGGAGTCACGAACAAGATGGCACTCCAATTAAGTTGGTAGGATCTATATTGTGTTTGCAAGCTGTAGGAGATGGACTCCCTCCAATTCTATCTAAAGACTGCTCAAGCCAACAAAGTGCATGGAAATATGCTTCAAATGCTAAGCTTCAACTGGCCACTGTTGATGAAGAAGGACAAGCCTTGTGTTTGCAAAGGGCTTCTCATTCCCATCAAATTTTGACTAACAAATGCATGTGTCCTAATGATTCTGAATGCCAAGGAGATCCACAAAGTCAATGGTTTACACTTGTACCATCAAATGTACATCTCAATTAA

mRNA sequence

ATGACAGGACAACCGTGGAAAAATCTCGCTTTAGCTTGTGTTTTCGTGTTGTTAACTTTTAAGGCCTATTCATTGCCTCTCTCGACAAATGGAAGATGGATTGTTGAGGCAACAACCGGCCAACGTGTGAAGTTGATGTGTGTAAATTGGCCGGGACACATGCAAGCCATGGTGGCAGAGGGTCTTCATCTCAGACCGCTTGACGACATTGCAACCATGGTGGCGAACTTGCGGTTTAATTGTGTGCGTTTGACATATTCAATTCACATGTTCACACGCTATGCTAATTTGACTGTGAAGCAATCCTTTGAGAATTTTGATATGAAAGAAGCCATAGCAGGTATAGCCCAAAACAATCCTACTATATTGAACATGAAGGTCGTTGAGGCTTATGAAGCAGTAGTTGATTCACTTGGTGCACATGGAGTTATGGTAGTTTCTGATAATCATATAAGCCAACCAAGATGGTGTTGTAGCAATGACGATGGCAATGGCTTTTTTGGAGATCGCTATTTCAATTCTCAAGAATGGCTAGAAGGACTTAGTTTGGCAGCTCAAAGCCTAAAAAACAAACCTCAGGTGGTGGCAATGAGTTTGAGAAACGAACCACGAGGACCAAATCAAAATGTGGAGATGTGGTTTCAATATATGAGCCAAGGAGCTAAACTTGTTCACCAAATTAACCCAAACGCTTTAGTAGTGGTTTCTGGACTAAGTTATGACACCGATCTAAGCTTCTTGAAGAATAGGTCAATGGGCTTCAATTTGGACAACAAGCTTGTATTTGAAGCTCACTTGTACTCCTTCACAAACAACATGAGTGATTATTGGACGTCAAAGCCATTGAACACGTTTTGTGCTAATGTCAATCAAGGATTTGAAGACCGTGCTGGATTTCTTGTGAGAGGACAAAACCCAATACCTCTCTTTGTGAGTGAGTTTGGGATCAACCAAATGGGAGCAAATGAAGGACAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACTGAGAATGATTTTGATTGGGGCCTATGGGCGTTGCAAGGTAGCTACTACTATAGAGAAGGTGTGAAAAACGATGAAGAAACCTTTGGCGTCTTAGATGCCAAGTTTACAAATGTCAAAAACCCGAAATTCCTTCAGAAGTTTCAGCTTATGCAAACCAAGCTTCAAGATCCAAGCTCAAATCTCACAACCTCATTCATAATGTACCATCCTCTTAGTGGTGAGTGCGTGCAAGTGAACAAAAAGTACCAACTAGGAGTTAGTAGCTGCAAGACATCAAATCGTTGGAGTCACGAACAAGATGGCACTCCAATTAAGTTGGTAGGATCTATATTGTGTTTGCAAGCTGTAGGAGATGGACTCCCTCCAATTCTATCTAAAGACTGCTCAAGCCAACAAAGTGCATGGAAATATGCTTCAAATGCTAAGCTTCAACTGGCCACTGTTGATGAAGAAGGACAAGCCTTGTGTTTGCAAAGGGCTTCTCATTCCCATCAAATTTTGACTAACAAATGCATGTGTCCTAATGATTCTGAATGCCAAGGAGATCCACAAAGTCAATGGTTTACACTTGTACCATCAAATGTACATCTCAATTAA

Coding sequence (CDS)

ATGACAGGACAACCGTGGAAAAATCTCGCTTTAGCTTGTGTTTTCGTGTTGTTAACTTTTAAGGCCTATTCATTGCCTCTCTCGACAAATGGAAGATGGATTGTTGAGGCAACAACCGGCCAACGTGTGAAGTTGATGTGTGTAAATTGGCCGGGACACATGCAAGCCATGGTGGCAGAGGGTCTTCATCTCAGACCGCTTGACGACATTGCAACCATGGTGGCGAACTTGCGGTTTAATTGTGTGCGTTTGACATATTCAATTCACATGTTCACACGCTATGCTAATTTGACTGTGAAGCAATCCTTTGAGAATTTTGATATGAAAGAAGCCATAGCAGGTATAGCCCAAAACAATCCTACTATATTGAACATGAAGGTCGTTGAGGCTTATGAAGCAGTAGTTGATTCACTTGGTGCACATGGAGTTATGGTAGTTTCTGATAATCATATAAGCCAACCAAGATGGTGTTGTAGCAATGACGATGGCAATGGCTTTTTTGGAGATCGCTATTTCAATTCTCAAGAATGGCTAGAAGGACTTAGTTTGGCAGCTCAAAGCCTAAAAAACAAACCTCAGGTGGTGGCAATGAGTTTGAGAAACGAACCACGAGGACCAAATCAAAATGTGGAGATGTGGTTTCAATATATGAGCCAAGGAGCTAAACTTGTTCACCAAATTAACCCAAACGCTTTAGTAGTGGTTTCTGGACTAAGTTATGACACCGATCTAAGCTTCTTGAAGAATAGGTCAATGGGCTTCAATTTGGACAACAAGCTTGTATTTGAAGCTCACTTGTACTCCTTCACAAACAACATGAGTGATTATTGGACGTCAAAGCCATTGAACACGTTTTGTGCTAATGTCAATCAAGGATTTGAAGACCGTGCTGGATTTCTTGTGAGAGGACAAAACCCAATACCTCTCTTTGTGAGTGAGTTTGGGATCAACCAAATGGGAGCAAATGAAGGACAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACTGAGAATGATTTTGATTGGGGCCTATGGGCGTTGCAAGGTAGCTACTACTATAGAGAAGGTGTGAAAAACGATGAAGAAACCTTTGGCGTCTTAGATGCCAAGTTTACAAATGTCAAAAACCCGAAATTCCTTCAGAAGTTTCAGCTTATGCAAACCAAGCTTCAAGATCCAAGCTCAAATCTCACAACCTCATTCATAATGTACCATCCTCTTAGTGGTGAGTGCGTGCAAGTGAACAAAAAGTACCAACTAGGAGTTAGTAGCTGCAAGACATCAAATCGTTGGAGTCACGAACAAGATGGCACTCCAATTAAGTTGGTAGGATCTATATTGTGTTTGCAAGCTGTAGGAGATGGACTCCCTCCAATTCTATCTAAAGACTGCTCAAGCCAACAAAGTGCATGGAAATATGCTTCAAATGCTAAGCTTCAACTGGCCACTGTTGATGAAGAAGGACAAGCCTTGTGTTTGCAAAGGGCTTCTCATTCCCATCAAATTTTGACTAACAAATGCATGTGTCCTAATGATTCTGAATGCCAAGGAGATCCACAAAGTCAATGGTTTACACTTGTACCATCAAATGTACATCTCAATTAA

Protein sequence

MTGQPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPKFLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVSSCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEEGQALCLQRASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHLN
BLAST of CsaV3_5G025530 vs. NCBI nr
Match: XP_011652778.1 (PREDICTED: uncharacterized protein LOC105435091 [Cucumis sativus])

HSP 1 Score: 997.3 bits (2577), Expect = 2.0e-287
Identity = 494/609 (81.12%), Postives = 508/609 (83.42%), Query Frame = 0

Query: 1   MTGQPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAE 60
           M+GQ WKN++LACVFVLLTF+AYSLPLSTNGRWIVEATTGQRVKL+CVNWPGHMQAMVAE
Sbjct: 1   MSGQQWKNVSLACVFVLLTFEAYSLPLSTNGRWIVEATTGQRVKLICVNWPGHMQAMVAE 60

Query: 61  GLHLRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVK-------------------- 120
           GLHL+PLDDIA MV  LRFNCVRLTYSIHMFTRYANLTVK                    
Sbjct: 61  GLHLKPLDDIAAMVVKLRFNCVRLTYSIHMFTRYANLTVKQSFENFDLKEAIVGIAQNNP 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 TILNMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLQG 180

Query: 181 -------------QSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVS 240
                        QSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVS
Sbjct: 181 LSLATQSLKTKPQQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVS 240

Query: 241 DNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLRNEPRGPN 300
           DNHISQPRWCCSNDDGNGFFGDRYFNSQEWL+GLSLA QSLK KPQVVAMSLRNEPRGPN
Sbjct: 241 DNHISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPN 300

Query: 301 QNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLY 360
           QNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLY
Sbjct: 301 QNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLY 360

Query: 361 SFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNR 420
           SFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNR
Sbjct: 361 SFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNR 420

Query: 421 FLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPKFLQKFQLM 480
           FLSCFFTYLT+NDFDWGLWALQGSYYYREGVKNDEETFGVLD+KFTNVKNPKFLQKFQLM
Sbjct: 421 FLSCFFTYLTKNDFDWGLWALQGSYYYREGVKNDEETFGVLDSKFTNVKNPKFLQKFQLM 480

Query: 481 QTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVSSCKTSNRWSHEQDGTPIKLVGSI 517
           QTKLQDPSSNLTTSFIMYHPLSGECV++NKKYQLGVSSCKTSNRWSHEQD TPIKL GSI
Sbjct: 481 QTKLQDPSSNLTTSFIMYHPLSGECVRMNKKYQLGVSSCKTSNRWSHEQDDTPIKLAGSI 540

BLAST of CsaV3_5G025530 vs. NCBI nr
Match: KGN50395.1 (hypothetical protein Csa_5G171770 [Cucumis sativus])

HSP 1 Score: 963.8 bits (2490), Expect = 2.5e-277
Identity = 452/533 (84.80%), Postives = 496/533 (93.06%), Query Frame = 0

Query: 4   QPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLH 63
           Q WKN+AL CVFVLLTFKAYSLPLSTNGRWIV+ATTGQRVKLMCVNWPGHMQ M+AEGLH
Sbjct: 5   QQWKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLH 64

Query: 64  LRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTIL 123
            RPLDDI ++VA LRFNCVRLTYSIHMFTR+ANLTV+QSFENFDMK+A+AGIAQNNP+++
Sbjct: 65  RRPLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLV 124

Query: 124 NMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSL 183
           N+ +VEAY AVVDSL AHGVMVVSDNHISQPRWCC+NDDGNGFFGDRYF+ +EWL+G+SL
Sbjct: 125 NLTLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISL 184

Query: 184 AAQSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTD 243
           AAQSLK+K +VVAMS+RNEPRGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTD
Sbjct: 185 AAQSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTD 244

Query: 244 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRG 303
           LSFLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA+VNQGFEDRAGFLVRG
Sbjct: 245 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRG 304

Query: 304 QNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEE 363
           QNP+PLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYYREGVKN EE
Sbjct: 305 QNPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEE 364

Query: 364 TFGVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLG 423
            FGVLD+ F   KN K FLQ+FQLMQTKLQDPSSN TTS IMYHPLSG CV++NKKYQLG
Sbjct: 365 NFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRMNKKYQLG 424

Query: 424 VSSCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQL 483
           +SSCKTSNRW HEQD +PIKL GS+LCL+A+G GLPPILS+DCSSQQS WKY S+AKLQL
Sbjct: 425 ISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSSAKLQL 484

Query: 484 ATVDEEGQALCLQR-ASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           ATVDE+GQALCLQR ASHSHQI+TNKC+C NDS+CQ DPQSQWFTLVPSN+ L
Sbjct: 485 ATVDEQGQALCLQRAASHSHQIVTNKCLCSNDSQCQEDPQSQWFTLVPSNLRL 537

BLAST of CsaV3_5G025530 vs. NCBI nr
Match: XP_008467257.1 (PREDICTED: endoglucanase-like [Cucumis melo])

HSP 1 Score: 951.4 bits (2458), Expect = 1.3e-273
Identity = 450/528 (85.23%), Postives = 491/528 (92.99%), Query Frame = 0

Query: 7   KNLALACVFV-LLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLR 66
           KN+AL CVFV LLTFKA+SLPLSTNGRWI++ATTG+RVKLMCVNW GHMQ M+ EGLH R
Sbjct: 8   KNIALVCVFVLLLTFKAFSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRR 67

Query: 67  PLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNM 126
           PLDDIA +VA LRFNCVRLTYSIHMFTR+ANLTVKQSFENFDMK+A+AGIAQNNP+ILN+
Sbjct: 68  PLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIAQNNPSILNL 127

Query: 127 KVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAA 186
            +VEAY AVVDSL AHG+MVVSDNHISQPRWCC N+DGNGFFGDRYF+ QEWL+G+SLAA
Sbjct: 128 TLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQEWLQGISLAA 187

Query: 187 QSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLS 246
           QSLK+K QVVAMSLRNE RGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTDLS
Sbjct: 188 QSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLS 247

Query: 247 FLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQN 306
           FLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA++NQGFEDRAGFLVRGQN
Sbjct: 248 FLKNRSMGFNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQN 307

Query: 307 PIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETF 366
           PIPLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYY+ GVKN EE F
Sbjct: 308 PIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYKVGVKNAEENF 367

Query: 367 GVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVS 426
           GVLD+ FT  KN K FLQ+FQLMQTKLQDPSSN TT+FIMYHPLSG CV++NKKYQLG+S
Sbjct: 368 GVLDSNFTKAKNSKLFLQRFQLMQTKLQDPSSNFTTTFIMYHPLSGGCVRMNKKYQLGIS 427

Query: 427 SCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLAT 486
           SCKTSNRWSHEQDG PIKL GSILCL+A+G GLPPILS+DCSSQQS W+YASNAKLQLAT
Sbjct: 428 SCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWRYASNAKLQLAT 487

Query: 487 VDEEGQALCLQRASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNV 533
           VDE+GQALCLQRASHSHQI+TNKC+C  DS+CQ DPQSQWFTLVPSN+
Sbjct: 488 VDEQGQALCLQRASHSHQIVTNKCLCTIDSQCQEDPQSQWFTLVPSNL 535

BLAST of CsaV3_5G025530 vs. NCBI nr
Match: XP_011656162.1 (PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus] >KGN50394.1 hypothetical protein Csa_5G171760 [Cucumis sativus])

HSP 1 Score: 951.4 bits (2458), Expect = 1.3e-273
Identity = 448/533 (84.05%), Postives = 489/533 (91.74%), Query Frame = 0

Query: 4   QPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLH 63
           Q W+N+AL CVFV L  KA SLPLSTNGRWIV+ATTG RVKLMCVNW GHMQ M+AEGLH
Sbjct: 5   QQWRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLH 64

Query: 64  LRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTIL 123
           LRPLDDIA +V   RFNCVRLTYSIHMFTR+ANLTV+QSFENFDMK+A+AGIAQNNP+IL
Sbjct: 65  LRPLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSIL 124

Query: 124 NMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSL 183
           NM VV+AY AV+DSL AH VMVVSDNHISQPRWCC+NDDGNGFFGDRYF+ QEWL+G+SL
Sbjct: 125 NMTVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISL 184

Query: 184 AAQSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTD 243
           AAQ+LK+K QVVAMSLRNEPRGPNQNVEMWFQYMSQGAKL+HQINPNALVVVSGLSYDTD
Sbjct: 185 AAQNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPNALVVVSGLSYDTD 244

Query: 244 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRG 303
           LSFLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA+VNQGFEDRAGFLVRG
Sbjct: 245 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRG 304

Query: 304 QNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEE 363
           QNPIPLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYYREGVKN EE
Sbjct: 305 QNPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEE 364

Query: 364 TFGVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLG 423
            FGVLD+ F   KN K FLQ+FQLMQTKLQDPSSN TTS IMYHPLSG CV++NKKYQLG
Sbjct: 365 NFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRMNKKYQLG 424

Query: 424 VSSCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQL 483
           +SSCKTSNRW HEQD +PIKL GS+LCL+A+G GLPPILS+DCSSQQS WKY SNAKLQL
Sbjct: 425 ISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSNAKLQL 484

Query: 484 ATVDEEGQALCLQR-ASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           AT+DE+GQALCLQR ASHSHQ++TNKC+C +DS+CQ DPQSQWFTLVPSN+ L
Sbjct: 485 ATIDEQGQALCLQRAASHSHQLVTNKCLCSSDSQCQEDPQSQWFTLVPSNLRL 537

BLAST of CsaV3_5G025530 vs. NCBI nr
Match: XP_008459280.1 (PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo])

HSP 1 Score: 928.3 bits (2398), Expect = 1.2e-266
Identity = 438/530 (82.64%), Postives = 480/530 (90.57%), Query Frame = 0

Query: 15  FVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMV 74
           ++ L  +AYSLPLSTNGRWI++ATTG+RVKLMCVNW GHMQ M+ EGLH RPLDDIA +V
Sbjct: 188 YIDLEPEAYSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRRPLDDIAALV 247

Query: 75  ANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAV 134
           A LRFNCVRLTYSIHMFTR+AN+TVKQSFENFDMK+A+ GIAQNNP+ILN+ +VEAY AV
Sbjct: 248 AKLRFNCVRLTYSIHMFTRHANVTVKQSFENFDMKDALVGIAQNNPSILNLTLVEAYGAV 307

Query: 135 VDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQV 194
           VDSL AHG+MVVSDNHISQPRWCC NDDGNGFFGDRYF+ QEW +G+SLAAQSLK+K QV
Sbjct: 308 VDSLAAHGIMVVSDNHISQPRWCCDNDDGNGFFGDRYFDPQEWFQGISLAAQSLKSKAQV 367

Query: 195 VAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGF 254
           VAMSLRNEPRGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTDLSFL+NRSMGF
Sbjct: 368 VAMSLRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLRNRSMGF 427

Query: 255 NLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEF 314
           NLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA++NQGFEDRAGFLVRGQNPIPLFVSEF
Sbjct: 428 NLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQNPIPLFVSEF 487

Query: 315 GINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTN 374
           GI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYY+ GVKN  E FGVLD+ FT 
Sbjct: 488 GIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYKVGVKNAGENFGVLDSNFTK 547

Query: 375 VKNPK-FLQKFQLMQTKLQ---------DPSSNLTTSFIMYHPLSGECVQVNKKYQLGVS 434
            KN K FLQ+FQLMQTKLQ         DPSSN TTSFIMYHPLSG C+++NKKYQLG+S
Sbjct: 548 AKNSKLFLQRFQLMQTKLQESQRLLFYTDPSSNFTTSFIMYHPLSGGCMRMNKKYQLGIS 607

Query: 435 SCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLAT 494
           SCKTSNRWSHEQDG PIKL GSILCL+A+G GLPPILS+DCSSQQS WKY SNAKLQLAT
Sbjct: 608 SCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWKYGSNAKLQLAT 667

Query: 495 VDEEGQALCLQRASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           VDE+GQALCLQRASHSHQI+TNKC+C NDS+CQ DPQSQWFTLVPSN+ L
Sbjct: 668 VDEQGQALCLQRASHSHQIVTNKCLCSNDSQCQEDPQSQWFTLVPSNLRL 717

BLAST of CsaV3_5G025530 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 433.7 bits (1114), Expect = 1.6e-121
Identity = 216/510 (42.35%), Postives = 312/510 (61.18%), Query Frame = 0

Query: 26  PLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMVANLRFNCVRLT 85
           PLSTN RWI++   GQRVKL CVNWP H+Q +VAEGL  + +DD+A  +  + FNCVR T
Sbjct: 4   PLSTNSRWIID-EKGQRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFNCVRFT 63

Query: 86  YSIHMFTRYA---NLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHG 145
           + + + T      N+TV+QSF++  + + I+G    NP+++++ ++EAY+ VV  LG + 
Sbjct: 64  WPLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAKLGNNN 123

Query: 146 VMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLRNE 205
           VMV+ DNH+++P WCC  +DGNGFFGD +F+   W+ GL+  A + K    VV MSLRNE
Sbjct: 124 VMVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNE 183

Query: 206 PRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVF 265
            RGP QNV+ WF+YM QGA+ VH+ NPN LV++SGLSYDTDLSF+++R +      KLVF
Sbjct: 184 LRGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFTRKLVF 243

Query: 266 EAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGAN 325
           E H YSFTN  ++ W+SK  N  C  + +  E+  GF +R     P+F+SEFGI+  G N
Sbjct: 244 ELHRYSFTN--TNTWSSKNPNEACGEILKSIENGGGFNLR---DFPVFLSEFGIDLRGKN 303

Query: 326 EGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPKFLQ 385
              NR++ C   +  END DW +W LQGSYY REGV    E +G+LD+ +  V++  FLQ
Sbjct: 304 VNDNRYIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFLQ 363

Query: 386 KFQLMQTKLQDPSSNLTTSFIMYHPLSGECV--QVNKKYQLGVSSCKTSNRWSHEQDGTP 445
           +  L+ + LQ P S      +++HPL+G C+   +    ++ +  C  S  WS+    T 
Sbjct: 364 RLSLILSPLQGPGSQSKVYNLVFHPLTGLCMLQSILDPTKVTLGLCNESQPWSYTPQNT- 423

Query: 446 IKLVGSILCLQAVGDGLPPILSK-DCSSQQ-SAWKYASNAKLQLATVDEEGQALCLQRAS 505
           + L    LCL++ G   P  LS+  CSS   S W+  S + + LA       +LCL    
Sbjct: 424 LTLKDKSLCLESTGPNAPVKLSETSCSSPNLSEWETISASNMLLA-AKSTNNSLCLD-VD 483

Query: 506 HSHQILTNKCMCPNDSECQGDPQSQWFTLV 529
            ++ ++ + C C    +   DP SQWF +V
Sbjct: 484 ETNNLMASNCKCVKGEDSSCDPISQWFKIV 504

BLAST of CsaV3_5G025530 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 427.9 bits (1099), Expect = 8.9e-120
Identity = 227/523 (43.40%), Postives = 312/523 (59.66%), Query Frame = 0

Query: 15  FVLLTFKAYSLPLSTNGRWIV-EATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATM 74
           +V+ TF   + P ST+ RWIV +   G+RVKL CVNWP H++  VAEGL  +PLD IA  
Sbjct: 14  YVITTF---AFPPSTDSRWIVDDGNKGRRVKLTCVNWPSHLETAVAEGLSKQPLDAIAEK 73

Query: 75  VANLRFNCVRLTYSIHMFTR---YANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEA 134
           + ++ FNCVRLT+ +++ T     A +TV+QS   F + EA++G   +NPTIL++ +++A
Sbjct: 74  IVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNPTILDLPLIKA 133

Query: 135 YEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKN 194
           ++ VV  L  H VMV+ DNHISQP WCCS++DGNGFFGD++ N Q W++GL   A    N
Sbjct: 134 FQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFAN 193

Query: 195 -KPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKN 254
               VV MSLRNE RGP QN++ W++YM +GA+ VH +NPN LV+VSGL+Y TDLSFL+ 
Sbjct: 194 VSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRE 253

Query: 255 RSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPL 314
           R    +   K+VFE H Y F N     W    LN  C    +     +GFL+  +  IPL
Sbjct: 254 RPFEVSFRRKVVFEIHWYGFWNT----WEGDNLNKICGKETEKMMKMSGFLL--EKGIPL 313

Query: 315 FVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLD 374
           FVSEFGI+Q G N   N+FLSCF     + D DW LW L GSYY RE     +E++GVLD
Sbjct: 314 FVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLD 373

Query: 375 AKFTNVKNPKFLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVSSCKTS 434
             +++++N   LQ    +QT             IM+HP +G C+     +QL + SC  S
Sbjct: 374 FNWSSIRNSTILQMISAIQTPFIGLMETQPKK-IMFHPSTGLCIVRKSLFQLKLGSCNRS 433

Query: 435 NRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCS-SQQSAWKYASNAKLQLATVDEE 494
             W              ILCL+A   G    L    S S  S WK  S++K+QL+++ + 
Sbjct: 434 ESWRLSSHRVLSLAEEQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKN 493

Query: 495 GQALCLQRASHSHQILTNKCMC-PNDSECQGDPQSQWFTLVPS 531
           G ++CL   + ++ I+TN C C   +S C  DP+SQWF LV S
Sbjct: 494 GFSVCLDVDTENNNIVTNSCKCLRGNSSC--DPRSQWFKLVTS 524

BLAST of CsaV3_5G025530 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 424.9 bits (1091), Expect = 7.6e-119
Identity = 212/512 (41.41%), Postives = 308/512 (60.16%), Query Frame = 0

Query: 24  SLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMVANLRFNCVR 83
           S PLST+ RWIV+   G RVKL+C NWP H+Q +VAEGL  +P+D +A  +  + FNCVR
Sbjct: 32  SYPLSTSSRWIVD-ENGLRVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEMGFNCVR 91

Query: 84  LTYSIHMFTRYA---NLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGA 143
           LT+ + + T      N+TV+QSF++  + + I G   NNP+I+++ ++EAY+ VV +LG 
Sbjct: 92  LTWPLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTVVTTLGN 151

Query: 144 HGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLR 203
           + VMV+ DNH+++P WCC+NDDGNGFFGD++F+   W+  L   A +      VV MSLR
Sbjct: 152 NDVMVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLR 211

Query: 204 NEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKL 263
           NE RGP QNV  WF+YM QGA+ VH  N   LV++SGLS+D DLSF+++R +  +   KL
Sbjct: 212 NELRGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKL 271

Query: 264 VFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMG 323
           VFE H YSF++  S  W +   N  C  V     +  G+L+      PLF+SEFGI++ G
Sbjct: 272 VFELHWYSFSDGNS--WAANNPNDICGRVLNRIGNGGGYLL--NQGFPLFLSEFGIDERG 331

Query: 324 ANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPKF 383
            N   NR+  C   +  END DW LWAL GSYY R+G     E +GVLD+ + +V+N  F
Sbjct: 332 VNTNDNRYFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSF 391

Query: 384 LQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECV--QVNKKYQLGVSSCKTSNRWSHEQDG 443
           LQK   +Q+ LQ P        +++HPL+G C+   ++    L +  C +S  WS+ +  
Sbjct: 392 LQKISFLQSPLQGPGPRTDAYNLVFHPLTGLCIVRSLDDPKMLTLGPCNSSEPWSYTKKA 451

Query: 444 TPIKLVGSILCLQAVGDGLPPILSK-DCSSQQSAWKYASNAKLQLATVDEEGQALCLQRA 503
             IK     LCLQ+ G   P  +++  CS+  S W+  S +++ LA+      +LCL   
Sbjct: 452 LRIK--DQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLD-V 511

Query: 504 SHSHQILTNKCMC-PNDSECQGDPQSQWFTLV 529
             ++ ++ N C C   D  C  +P SQWF ++
Sbjct: 512 DTANNVVANACKCLSKDKSC--EPMSQWFKII 533

BLAST of CsaV3_5G025530 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 368.6 bits (945), Expect = 6.4e-102
Identity = 201/517 (38.88%), Postives = 288/517 (55.71%), Query Frame = 0

Query: 22  AYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMVANLRFNC 81
           A   PL T  RWIV    G RVKL C NWP H++ +VAEGL  +P+D I+  + ++ FNC
Sbjct: 23  ATDYPLFTKSRWIVN-NKGHRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGFNC 82

Query: 82  VRLTYSIHMF---TRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSL 141
           VRLT+ + +    T   N+TVKQSFE + +   + GI  +NP I+N  ++  ++AVV SL
Sbjct: 83  VRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVYSL 142

Query: 142 GAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMS 201
           G H VMV+ DNH + P WCCSNDD + FFGD  FN   W+ GL   A    N   VV MS
Sbjct: 143 GRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMS 202

Query: 202 LRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDN 261
           LRNE RG N   + W++YM +GA+ VH  NPN LV++SGL++D DLSFLK+R +  +   
Sbjct: 203 LRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSFKK 262

Query: 262 KLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQ 321
           KLV E H YSFT+  +  W S  +N FC+ +        GF++      PLF+SEFG +Q
Sbjct: 263 KLVLELHWYSFTDG-TGQWKSHNVNDFCSQMFSKERRTGGFVL--DQGFPLFLSEFGTDQ 322

Query: 322 MGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNP 381
            G +   NR+++C   +  E D DW +WA+ G YY+REG +   E +G+LDA + NV N 
Sbjct: 323 RGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNY 382

Query: 382 KFLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKY----QLGVSSCKTSNRWSH 441
            +L++  ++Q     P         ++HPL+G C+ V K +    +L +  C     WS+
Sbjct: 383 TYLRRLSVIQPPHTGPGVKHNHHKKIFHPLTGLCL-VRKSHCHESELTLGPCTKDEPWSY 442

Query: 442 EQDGTPIKLVGSILCLQ---AVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEEGQA 501
              G      G   CL+   AVG  +   L + C+  +      S  K+ L+    +G  
Sbjct: 443 SHGGILEIRRGHKSCLEGETAVGKSVK--LGRICTKIEQ----ISATKMHLSFNTSDGSL 502

Query: 502 LCLQRASHSHQILTNKCMC-PNDSECQGDPQSQWFTL 528
           +CL      + ++ N C C   D+ C  +P SQWF +
Sbjct: 503 VCLD-VDSDNNVVANSCNCLTGDTTC--EPASQWFKI 525

BLAST of CsaV3_5G025530 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 332.8 bits (852), Expect = 3.9e-91
Identity = 189/516 (36.63%), Postives = 275/516 (53.29%), Query Frame = 0

Query: 24  SLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMVANLRFNCVR 83
           S PLST  RWIV+   GQRVKL CVNWP H+Q  VAEGL  +PLD I+  + ++ FNCVR
Sbjct: 23  SYPLSTKSRWIVD-EKGQRVKLACVNWPAHLQPTVAEGLSKQPLDSISKKIVSMGFNCVR 82

Query: 84  LTYSIHMFTR---YANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGA 143
           LT+ + + T       +TVKQSFE+  + E + GI  +NP +L++ +  A++ VV +LG 
Sbjct: 83  LTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLLHLPLFNAFQEVVSNLGE 142

Query: 144 HGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLR 203
           +GVMV+ DNH++ P WCC ++D + FFG  +F+   W +GL   A   +N   V+ MSLR
Sbjct: 143 NGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNFTHVIGMSLR 202

Query: 204 NEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKL 263
           NEPRG     ++WF++M QGA+ VH  NP  LV++SG+ +DT+LSFL++RS+  +  +KL
Sbjct: 203 NEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRSVNVSFTDKL 262

Query: 264 VFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLV-RGQNPIPLFVSEFGINQM 323
           VFE H YSF++   D W     N FC  + +      GFL+ RG    PL +SEFG +Q 
Sbjct: 263 VFELHWYSFSDG-RDSWRKHNSNDFCVKIIEKVTHNGGFLLGRG---FPLILSEFGTDQR 322

Query: 324 GANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPK 383
           G +   NR+++C   +  END DW +WAL G YY R G                      
Sbjct: 323 GGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYYLRTGPG-------------------- 382

Query: 384 FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQ---LGVSSCKTSNRWSHEQ 443
                         P+ NL     ++HP +G CV  N       L +  C  S+ W+   
Sbjct: 383 ------------LRPNKNL-----LFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPWTFNP 442

Query: 444 DGTPIKLVGSILCLQA---VGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEEGQALC 503
               + +  + +C++A   VG  +   +   CS         S  K+ L+     G  LC
Sbjct: 443 SEGILWI--NKMCVEAPNVVGQKVKLGVGTKCSKLGQ----ISATKMHLSFKTSNGLLLC 488

Query: 504 LQRASHSHQILTNKC-MCPNDSECQGDPQSQWFTLV 529
           L      + ++ N+C     D+ C  DP SQWF ++
Sbjct: 503 LDVDERDNSVVANRCKFLTMDASC--DPASQWFKVL 488

BLAST of CsaV3_5G025530 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 446.4 bits (1147), Expect = 4.4e-124
Identity = 236/543 (43.46%), Postives = 329/543 (60.59%), Query Frame = 0

Query: 9   LALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLD 68
           L  A + +L+   ++SLPL T GRWIV+  TG RVKL CVNW GH++  + EGL+  P+ 
Sbjct: 13  LLTALLLLLVAAPSHSLPLLTRGRWIVDEATGLRVKLACVNWVGHLEPGLPEGLNRLPVA 72

Query: 69  DIATMVANLRFNCVRLTYSIHMFTR--YANLTVKQSFENFDMKEAIAGIAQNNPTILNMK 128
            +A  +++L FNCVRLTYSIHM TR  Y N TV Q+F   ++ EA +GI  NNP +L++ 
Sbjct: 73  TVAHTISSLGFNCVRLTYSIHMLTRTSYTNATVAQTFARLNLTEAASGIEHNNPELLDLG 132

Query: 129 VVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQ 188
            V AY  VV +L   GVMV+ DNH+S+P+WCC+ DDGNGFFGDRYFN   W+EGL L A 
Sbjct: 133 HVAAYHHVVAALSEAGVMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPNTWVEGLGLMAT 192

Query: 189 SLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSF 248
              N P VVAMSLRNE RG       W ++M  GA  VH+ NP  LV++SGL +DTDLSF
Sbjct: 193 YFNNTPNVVAMSLRNELRGNRSTPISWSRHMQWGAATVHKANPKVLVILSGLQFDTDLSF 252

Query: 249 LKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQN- 308
           L    +      K+V+E H YSF       W +   N  C N    F+   GF+    N 
Sbjct: 253 LPVLPVTLPFKEKIVYEGHWYSF----GVPWRTGLPNDVCKNETGRFKSNVGFVTSSANA 312

Query: 309 -PIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYR---EGVKND 368
              PLF+SEFGI+Q   N+  NR+L+C   YL E D DW LW + GSYYYR   + VK+ 
Sbjct: 313 TAAPLFMSEFGIDQRYVNDNDNRYLNCILAYLAEEDLDWALWTMGGSYYYRSDKQPVKDF 372

Query: 369 EETFGVLDAKFTNVKNPKFLQKFQLMQTKLQDPSSNLTTSF-IMYHPLSGECVQVNKKYQ 428
           EET+G  +  ++ ++NP F+ + + +Q  +QDP       + I+YHP SG CV+      
Sbjct: 373 EETYGFFNHDWSRIRNPDFISRLKEIQQPIQDPYLAPGPYYQIIYHPASGLCVESGIGNT 432

Query: 429 LGVSSCKT-SNRWSHEQD-GTPIKLVGSILCLQAVGDGLPPILSKDCSS-QQSAWKYASN 488
           + + SC++  +RW+++     PI L+GS  C+   G+GLP I++++CS+   + W   S+
Sbjct: 433 VHLGSCQSVRSRWNYDASVKGPIGLMGSSSCISTQGNGLPAIMTENCSAPNNTLWSTVSS 492

Query: 489 AKLQLAT----VDEEGQALCLQRASHSHQILTNKCMCPNDSEC--QGDPQSQWFTLVPSN 535
           A+LQL T     D + + +CL   S S  I TN+C+C  DS C  + +P+ QWF ++ +N
Sbjct: 493 AQLQLGTRVLGKDGKEKWMCLD-GSKSPLISTNECICITDSHCYPKLNPEKQWFKVITTN 550

BLAST of CsaV3_5G025530 vs. Swiss-Prot
Match: sp|P23548|GUN_PAEPO (Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2)

HSP 1 Score: 90.9 bits (224), Expect = 4.7e-17
Identity = 81/373 (21.72%), Postives = 153/373 (41.02%), Query Frame = 0

Query: 29  TNGRWIVEATTGQRVKLMCVNWPG-HMQAMVAEGLHLRPLDDIATMVANLRFNCVRLTYS 88
           T G  IV+  +G+      +NW G         GL  R +DD+   V    +N +RL YS
Sbjct: 41  TQGNKIVD-ESGKEAAFNGLNWFGLETPNYTLHGLWSRSMDDMLDQVKKEGYNLIRLPYS 100

Query: 89  IHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVS 148
                        Q F++    ++I      NP ++ +  ++  + +++  G  G+ ++ 
Sbjct: 101 ------------NQLFDSSSRPDSID--YHKNPDLVGLNPIQIMDKLIEKAGQRGIQIIL 160

Query: 149 DNHISQPRWCCSNDDGNGFFGDRYFNSQ----EWLEGLSLAAQSLKNKPQVVAMSLRNEP 208
           D H            G+G   + ++ SQ     W+    + A   KN P V+   L NEP
Sbjct: 161 DRH----------RPGSGGQSELWYTSQYPESRWISDWKMLADRYKNNPTVIGADLHNEP 220

Query: 209 RGP------NQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDT-----------DLSF 268
            G       N + + W     +    +  +NPN L++V G+ ++            +L+ 
Sbjct: 221 HGQASWGTGNASTD-WRLAAQRAGNAILSVNPNWLILVEGVDHNVQGNNSQYWWGGNLTG 280

Query: 269 LKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNP 328
           + N  +  ++ N++V+  H Y      S  W + P   F +N+   ++   G++ + QN 
Sbjct: 281 VANYPVVLDVPNRVVYSPHDYG-PGVSSQPWFNDP--AFPSNLPAIWDQTWGYISK-QNI 340

Query: 329 IPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFG 380
            P+ V EFG   +  +  + ++ +    Y+  N+  +  W+L           N  +T G
Sbjct: 341 APVLVGEFGGRNVDLSCPEGKWQNALVHYIGANNLYFTYWSLN---------PNSGDTGG 374

BLAST of CsaV3_5G025530 vs. Swiss-Prot
Match: sp|P19487|GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=engXCA PE=1 SV=2)

HSP 1 Score: 60.5 bits (145), Expect = 6.8e-08
Identity = 87/391 (22.25%), Postives = 145/391 (37.08%), Query Frame = 0

Query: 9   LALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQA-MVAEGLHLRPL 68
           LALA    L    A+S  ++ N R IV+  +G+ V+L  VN  G      V  GL  R  
Sbjct: 10  LALATALALAAGPAFSYSIN-NSRQIVD-DSGKVVQLKGVNVFGFETGNHVMHGLWARNW 69

Query: 69  DDIATMVANLRFNCVRLTY---SIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILN 128
            D+   +  L FN VRL +   ++   T  A++   +                 N  +  
Sbjct: 70  KDMIVQMQGLGFNAVRLPFCPATLRSDTMPASIDYSR-----------------NADLQG 129

Query: 129 MKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLA 188
           +  ++  + V+    A G+ V+ D+H       C+      + G   +   +WL  L   
Sbjct: 130 LTSLQILDKVIAEFNARGMYVLLDHHTPD----CAGISELWYTGS--YTEAQWLADLRFV 189

Query: 189 AQSLKNKPQVVAMSLRNEPR-----GPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLS 248
           A   KN P V+ + L+NEP      G       W +   +G+  V  + P  L+ V G++
Sbjct: 190 ANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWLIAVEGIT 249

Query: 249 YDTDLSFLKNRSMGFNLD-----------NKLVFEAHLYSFTNNMSDYWTSKPLNTFCAN 308
            +   S       G NL            N+L+   H+Y     +  Y+     + F  N
Sbjct: 250 DNPVCSTNGGIFWGGNLQPLACTPLNIPANRLLLAPHVYGPDVFVQSYFND---SNFPNN 309

Query: 309 VNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWAL 368
           +   +E   G   +      L + EFG      +     +      YL     + G    
Sbjct: 310 MPAIWERHFG---QFAGTHALLLGEFGGKYGEGDARDKTWQDALVKYLRSKGINQG---- 361

Query: 369 QGSYYYREGVKNDEETFGVLDAKFTNVKNPK 380
               +Y     N  +T G+L   +T+V+  K
Sbjct: 370 ----FYWSWNPNSGDTGGILRDDWTSVRQDK 361

BLAST of CsaV3_5G025530 vs. TrEMBL
Match: tr|A0A0A0KL32|A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 963.8 bits (2490), Expect = 1.6e-277
Identity = 452/533 (84.80%), Postives = 496/533 (93.06%), Query Frame = 0

Query: 4   QPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLH 63
           Q WKN+AL CVFVLLTFKAYSLPLSTNGRWIV+ATTGQRVKLMCVNWPGHMQ M+AEGLH
Sbjct: 5   QQWKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLH 64

Query: 64  LRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTIL 123
            RPLDDI ++VA LRFNCVRLTYSIHMFTR+ANLTV+QSFENFDMK+A+AGIAQNNP+++
Sbjct: 65  RRPLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLV 124

Query: 124 NMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSL 183
           N+ +VEAY AVVDSL AHGVMVVSDNHISQPRWCC+NDDGNGFFGDRYF+ +EWL+G+SL
Sbjct: 125 NLTLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISL 184

Query: 184 AAQSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTD 243
           AAQSLK+K +VVAMS+RNEPRGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTD
Sbjct: 185 AAQSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTD 244

Query: 244 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRG 303
           LSFLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA+VNQGFEDRAGFLVRG
Sbjct: 245 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRG 304

Query: 304 QNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEE 363
           QNP+PLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYYREGVKN EE
Sbjct: 305 QNPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEE 364

Query: 364 TFGVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLG 423
            FGVLD+ F   KN K FLQ+FQLMQTKLQDPSSN TTS IMYHPLSG CV++NKKYQLG
Sbjct: 365 NFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRMNKKYQLG 424

Query: 424 VSSCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQL 483
           +SSCKTSNRW HEQD +PIKL GS+LCL+A+G GLPPILS+DCSSQQS WKY S+AKLQL
Sbjct: 425 ISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSSAKLQL 484

Query: 484 ATVDEEGQALCLQR-ASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           ATVDE+GQALCLQR ASHSHQI+TNKC+C NDS+CQ DPQSQWFTLVPSN+ L
Sbjct: 485 ATVDEQGQALCLQRAASHSHQIVTNKCLCSNDSQCQEDPQSQWFTLVPSNLRL 537

BLAST of CsaV3_5G025530 vs. TrEMBL
Match: tr|A0A1S3CT43|A0A1S3CT43_CUCME (endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1)

HSP 1 Score: 951.4 bits (2458), Expect = 8.5e-274
Identity = 450/528 (85.23%), Postives = 491/528 (92.99%), Query Frame = 0

Query: 7   KNLALACVFV-LLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLR 66
           KN+AL CVFV LLTFKA+SLPLSTNGRWI++ATTG+RVKLMCVNW GHMQ M+ EGLH R
Sbjct: 8   KNIALVCVFVLLLTFKAFSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRR 67

Query: 67  PLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNM 126
           PLDDIA +VA LRFNCVRLTYSIHMFTR+ANLTVKQSFENFDMK+A+AGIAQNNP+ILN+
Sbjct: 68  PLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIAQNNPSILNL 127

Query: 127 KVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAA 186
            +VEAY AVVDSL AHG+MVVSDNHISQPRWCC N+DGNGFFGDRYF+ QEWL+G+SLAA
Sbjct: 128 TLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQEWLQGISLAA 187

Query: 187 QSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLS 246
           QSLK+K QVVAMSLRNE RGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTDLS
Sbjct: 188 QSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLS 247

Query: 247 FLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQN 306
           FLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA++NQGFEDRAGFLVRGQN
Sbjct: 248 FLKNRSMGFNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQN 307

Query: 307 PIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETF 366
           PIPLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYY+ GVKN EE F
Sbjct: 308 PIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYKVGVKNAEENF 367

Query: 367 GVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVS 426
           GVLD+ FT  KN K FLQ+FQLMQTKLQDPSSN TT+FIMYHPLSG CV++NKKYQLG+S
Sbjct: 368 GVLDSNFTKAKNSKLFLQRFQLMQTKLQDPSSNFTTTFIMYHPLSGGCVRMNKKYQLGIS 427

Query: 427 SCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLAT 486
           SCKTSNRWSHEQDG PIKL GSILCL+A+G GLPPILS+DCSSQQS W+YASNAKLQLAT
Sbjct: 428 SCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWRYASNAKLQLAT 487

Query: 487 VDEEGQALCLQRASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNV 533
           VDE+GQALCLQRASHSHQI+TNKC+C  DS+CQ DPQSQWFTLVPSN+
Sbjct: 488 VDEQGQALCLQRASHSHQIVTNKCLCTIDSQCQEDPQSQWFTLVPSNL 535

BLAST of CsaV3_5G025530 vs. TrEMBL
Match: tr|A0A0A0KNB6|A0A0A0KNB6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171760 PE=3 SV=1)

HSP 1 Score: 951.4 bits (2458), Expect = 8.5e-274
Identity = 448/533 (84.05%), Postives = 489/533 (91.74%), Query Frame = 0

Query: 4   QPWKNLALACVFVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLH 63
           Q W+N+AL CVFV L  KA SLPLSTNGRWIV+ATTG RVKLMCVNW GHMQ M+AEGLH
Sbjct: 5   QQWRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLH 64

Query: 64  LRPLDDIATMVANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTIL 123
           LRPLDDIA +V   RFNCVRLTYSIHMFTR+ANLTV+QSFENFDMK+A+AGIAQNNP+IL
Sbjct: 65  LRPLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSIL 124

Query: 124 NMKVVEAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSL 183
           NM VV+AY AV+DSL AH VMVVSDNHISQPRWCC+NDDGNGFFGDRYF+ QEWL+G+SL
Sbjct: 125 NMTVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISL 184

Query: 184 AAQSLKNKPQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTD 243
           AAQ+LK+K QVVAMSLRNEPRGPNQNVEMWFQYMSQGAKL+HQINPNALVVVSGLSYDTD
Sbjct: 185 AAQNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPNALVVVSGLSYDTD 244

Query: 244 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRG 303
           LSFLKNRSMGFNLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA+VNQGFEDRAGFLVRG
Sbjct: 245 LSFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRG 304

Query: 304 QNPIPLFVSEFGINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEE 363
           QNPIPLFVSEFGI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYYREGVKN EE
Sbjct: 305 QNPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEE 364

Query: 364 TFGVLDAKFTNVKNPK-FLQKFQLMQTKLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLG 423
            FGVLD+ F   KN K FLQ+FQLMQTKLQDPSSN TTS IMYHPLSG CV++NKKYQLG
Sbjct: 365 NFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRMNKKYQLG 424

Query: 424 VSSCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQL 483
           +SSCKTSNRW HEQD +PIKL GS+LCL+A+G GLPPILS+DCSSQQS WKY SNAKLQL
Sbjct: 425 ISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSNAKLQL 484

Query: 484 ATVDEEGQALCLQR-ASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           AT+DE+GQALCLQR ASHSHQ++TNKC+C +DS+CQ DPQSQWFTLVPSN+ L
Sbjct: 485 ATIDEQGQALCLQRAASHSHQLVTNKCLCSSDSQCQEDPQSQWFTLVPSNLRL 537

BLAST of CsaV3_5G025530 vs. TrEMBL
Match: tr|A0A1S3C9U1|A0A1S3C9U1_CUCME (uncharacterized protein LOC103498458 OS=Cucumis melo OX=3656 GN=LOC103498458 PE=4 SV=1)

HSP 1 Score: 928.3 bits (2398), Expect = 7.7e-267
Identity = 438/530 (82.64%), Postives = 480/530 (90.57%), Query Frame = 0

Query: 15  FVLLTFKAYSLPLSTNGRWIVEATTGQRVKLMCVNWPGHMQAMVAEGLHLRPLDDIATMV 74
           ++ L  +AYSLPLSTNGRWI++ATTG+RVKLMCVNW GHMQ M+ EGLH RPLDDIA +V
Sbjct: 188 YIDLEPEAYSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRRPLDDIAALV 247

Query: 75  ANLRFNCVRLTYSIHMFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAV 134
           A LRFNCVRLTYSIHMFTR+AN+TVKQSFENFDMK+A+ GIAQNNP+ILN+ +VEAY AV
Sbjct: 248 AKLRFNCVRLTYSIHMFTRHANVTVKQSFENFDMKDALVGIAQNNPSILNLTLVEAYGAV 307

Query: 135 VDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQV 194
           VDSL AHG+MVVSDNHISQPRWCC NDDGNGFFGDRYF+ QEW +G+SLAAQSLK+K QV
Sbjct: 308 VDSLAAHGIMVVSDNHISQPRWCCDNDDGNGFFGDRYFDPQEWFQGISLAAQSLKSKAQV 367

Query: 195 VAMSLRNEPRGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGF 254
           VAMSLRNEPRGPNQNVE WFQYMSQGAKL+HQINPNALVVVSGLSYDTDLSFL+NRSMGF
Sbjct: 368 VAMSLRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLRNRSMGF 427

Query: 255 NLDNKLVFEAHLYSFTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEF 314
           NLDNKLVFEAHLYSFTNNM D+W SKPLNTFCA++NQGFEDRAGFLVRGQNPIPLFVSEF
Sbjct: 428 NLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQNPIPLFVSEF 487

Query: 315 GINQMGANEGQNRFLSCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTN 374
           GI+Q G NEGQNRFLSCFF+YLTENDFDWGLWALQGSYYY+ GVKN  E FGVLD+ FT 
Sbjct: 488 GIDQTGTNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYKVGVKNAGENFGVLDSNFTK 547

Query: 375 VKNPK-FLQKFQLMQTKLQ---------DPSSNLTTSFIMYHPLSGECVQVNKKYQLGVS 434
            KN K FLQ+FQLMQTKLQ         DPSSN TTSFIMYHPLSG C+++NKKYQLG+S
Sbjct: 548 AKNSKLFLQRFQLMQTKLQESQRLLFYTDPSSNFTTSFIMYHPLSGGCMRMNKKYQLGIS 607

Query: 435 SCKTSNRWSHEQDGTPIKLVGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLAT 494
           SCKTSNRWSHEQDG PIKL GSILCL+A+G GLPPILS+DCSSQQS WKY SNAKLQLAT
Sbjct: 608 SCKTSNRWSHEQDGAPIKLAGSILCLKAIGVGLPPILSQDCSSQQSIWKYGSNAKLQLAT 667

Query: 495 VDEEGQALCLQRASHSHQILTNKCMCPNDSECQGDPQSQWFTLVPSNVHL 535
           VDE+GQALCLQRASHSHQI+TNKC+C NDS+CQ DPQSQWFTLVPSN+ L
Sbjct: 668 VDEQGQALCLQRASHSHQIVTNKCLCSNDSQCQEDPQSQWFTLVPSNLRL 717

BLAST of CsaV3_5G025530 vs. TrEMBL
Match: tr|A0A0A0L6S7|A0A0A0L6S7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G185670 PE=3 SV=1)

HSP 1 Score: 869.0 bits (2244), Expect = 5.5e-249
Identity = 415/427 (97.19%), Postives = 423/427 (99.06%), Query Frame = 0

Query: 90  MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 149
           MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN
Sbjct: 1   MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 60

Query: 150 HISQPRWCCSNDDGNGFFGDRYFNSQEWLEGLSLAAQSLKNKPQVVAMSLRNEPRGPNQN 209
           HISQPRWCCSNDDGNGFFGDRYFNSQEWL+GLSLA QSLK KPQVVAMSLRNEPRGPNQN
Sbjct: 61  HISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPNQN 120

Query: 210 VEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYSF 269
           VEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYSF
Sbjct: 121 VEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYSF 180

Query: 270 TNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRFL 329
           TNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRFL
Sbjct: 181 TNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRFL 240

Query: 330 SCFFTYLTENDFDWGLWALQGSYYYREGVKNDEETFGVLDAKFTNVKNPKFLQKFQLMQT 389
           SCFFTYLT+NDFDWGLWALQGSYYYREGVKNDEETFGVLD+KFTNVKNPKFLQKFQLMQT
Sbjct: 241 SCFFTYLTKNDFDWGLWALQGSYYYREGVKNDEETFGVLDSKFTNVKNPKFLQKFQLMQT 300

Query: 390 KLQDPSSNLTTSFIMYHPLSGECVQVNKKYQLGVSSCKTSNRWSHEQDGTPIKLVGSILC 449
           KLQDPSSNLTTSFIMYHPLSGECV++NKKYQLGVSSCKTSNRWSHEQD TPIKL GSILC
Sbjct: 301 KLQDPSSNLTTSFIMYHPLSGECVRMNKKYQLGVSSCKTSNRWSHEQDDTPIKLAGSILC 360

Query: 450 LQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEEGQALCLQRASHSHQILTNKCM 509
           LQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDE+GQALCLQRASHSHQILTNKC+
Sbjct: 361 LQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEQGQALCLQRASHSHQILTNKCI 420

Query: 510 CPNDSEC 517
           CPNDS+C
Sbjct: 421 CPNDSDC 427

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011652778.12.0e-28781.12PREDICTED: uncharacterized protein LOC105435091 [Cucumis sativus][more]
KGN50395.12.5e-27784.80hypothetical protein Csa_5G171770 [Cucumis sativus][more]
XP_008467257.11.3e-27385.23PREDICTED: endoglucanase-like [Cucumis melo][more]
XP_011656162.11.3e-27384.05PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus] >KGN50394.1 hy... [more]
XP_008459280.11.2e-26682.64PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo][more]
Match NameE-valueIdentityDescription
AT3G26140.11.6e-12142.35Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.18.9e-12043.40Cellulase (glycosyl hydrolase family 5) protein[more]
AT1G13130.17.6e-11941.41Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.16.4e-10238.88Glycosyl hydrolase superfamily protein[more]
AT5G16700.13.9e-9136.63Glycosyl hydrolase superfamily protein[more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB4.4e-12443.46Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
sp|P23548|GUN_PAEPO4.7e-1721.72Endoglucanase OS=Paenibacillus polymyxa OX=1406 PE=3 SV=2[more]
sp|P19487|GUNA_XANCP6.8e-0822.25Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KL32|A0A0A0KL32_CUCSA1.6e-27784.80Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171770 PE=3 SV=1[more]
tr|A0A1S3CT43|A0A1S3CT43_CUCME8.5e-27485.23endoglucanase-like OS=Cucumis melo OX=3656 GN=LOC103504654 PE=3 SV=1[more]
tr|A0A0A0KNB6|A0A0A0KNB6_CUCSA8.5e-27484.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G171760 PE=3 SV=1[more]
tr|A0A1S3C9U1|A0A1S3C9U1_CUCME7.7e-26782.64uncharacterized protein LOC103498458 OS=Cucumis melo OX=3656 GN=LOC103498458 PE=... [more]
tr|A0A0A0L6S7|A0A0A0L6S7_CUCSA5.5e-24997.19Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G185670 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR035992Ricin_B-like_lectins
IPR017853Glycoside_hydrolase_SF
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_5G025530.1CsaV3_5G025530.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 25..393
e-value: 5.7E-74
score: 251.4
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 15..530
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 15..530
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 66..349
e-value: 1.3E-24
score: 87.1
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 26..381
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 399..508

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_5G025530CsGy5G016210Cucumber (Gy14) v2cgybcucB232
The following gene(s) are paralogous to this gene:

None