Cla97C05G108280 (gene) Watermelon (97103) v2

NameCla97C05G108280
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionCellulase (Glycosyl hydrolase family 5)
LocationCla97Chr05 : 34979328 .. 34982903 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGGAAACAAGTAGATCGATGAAGGGGTTAATTTTTTTGTACTGTGTCGCCGCCGTGTGGCTTCAGGCGGCGGTGGGGCTGCCGTTACACACCGATTCACGGTGGATCGTAGACGAAGAAGGGCAAAGAGTAAAACTAAGATGCGTCAATTGGGTGTCGCATTTGGAGGCGGTGGTGGCGGAGGGACTTTCAAAGCAGCCCATCGACGCGATTTCGAACAGAATTGAGTCATTAGGATTCAACTGCGTGAGGCTGACTTGGCCACTGTTTTTAGCCACAAACGAGTCTTTGAATTCCCTCACCGTCCGACAGTCTTTCCAGCGGCTGGGACTTGCCGAAGCCATCGCCGGAGTCCAAGCCAACAACCCCTTCATTATCGATCTTCCTCTCATGAAAGCTTTTGAGGTAAAATTGTTAAAAGTGTTTTTGGATACGGCGGTGAATTGTGGGTTTGATTTTGAGTTTGGTTATTTTGACAGGCGGTGGTGGGGAGATTGGGGAATGGGAAATTAATGGTGATATTGGATAATCACATAAGCAAGCCAGGGTGGTGCTGCAGTAACTTCGACGGAAATGGGTTTTTTGGGGATCAGTATTTCAATCCGGAGTTATGGATTAAAGGCCTCACTCGAATGGCCGCCATGTTTAACGGCATGGCCCATGTTGTTGGAATGAGCCTTAGGAATGAACTTCGTGGCCCAAAACAGAATGTCAACGACTGGTACAGGTAATTTTACACTTCTATTTCTTACTTCTGATTTCTGAATATAACTGCTCTTTCCCACTAATTCTAACCCTAAATTTCTAGTTTCTCGCTCTTAGTTACTCGTTTTTCAATTTTGTATCTAATTACTCCTTTAAAGTTTAGATTTTCTTTTATAATTAATGGTTGGTTAAATTCCAAACTTGGTCACTCTATCTAATTTAGAGAAAGTTGGAGTTTAGTCTCTCGATTTATAATTAGAGTTCACTCTCTATAATTTGATAAAATGTTTATAAATAGTTTATATGTTAAGGACTATTTTCGAGGATTTTATTAAATTATAGCGACTAATTTCTGACTTTTAAAACTATTGAAACTAAACTCTAGAGAACCAAAATTTTAATTTAACTTTATTGATTTTAATTTTTTCTTTTATAAAATATTAAATGTAAATTTTAATCTTAGGTTCAATGACACTCCTTCAACTTTTGTCCAATAAATCAGTACACTAAATAGACACAAATTTAAATTTAAAGTTAAGAGATCAATTTTGTAATTTAACATAATTATAAATAAAAAATATCGTTAATTATAGTGGTTAATGAAGATACTAATTGGCTATAATTGTAGCAAACTTTGTTGTTATAATTTAATCTTAATAGTTCTATTAACTTTATGTTCAATAAAATTCTAATTTTCAACTTTTTGTTTAATAGGTTTGTGAATTTTAAAAAAATTGAAAGTTCAGACACAAATATTAAAGTTAAGGGACTAAATTTGTTGTTTTAACATAATTATAAATAAAAAACATAATTAATTATAGTAGTTAATTAAGATACTAATTGCTATAATTGTAGCAAACTTTGCTATTATATTCAACAAATATAGAATTAAAAGAAAAATAAATTTATCTATTCTAAGTATATAAATGAGACAATTATTATTTCGAAAGTTGATGGTTTGATTGTTGCCTACATAATTGTCGAACCAAACTATAAAAATAAACACAAACAAATATAGAACAAACATTTTAACCTTCAACCCATAAGGGAGAAAGTATAAGTGTCTATGTTTAAATGGACATAATCTGTTGCAATTGTTTACATCTCTAGGGTGATTATAGACATGTTTGTCATTGACAACATACATATTATAATCTTATGGACATATGATATTATATGCATTAATGATTGTAAAAATATATTCTACTCGTGTGTGTTTGGGACAATTTAAAAAAATGTTTTTTAATAAATAAAAAACACTTCCCTATAAGCATCTTTAGAAAAATTACTAAATTGGATTGTGTTTTGGAAGAACTTCTAGAGTGATATGGAAGAACTTCTAGAGTGATAAGATCATTATTATTTTTTTTTAAATGGTAAAATATACTCTTGTCAATTGTCATGTCTCTAAGACTTGAACCTAATTCTACTTTATTTTTGGATGCTCAAAATTAGCACTTTTGATAATTGAGGTTCGAATTTAGTTTTTATTTGGTTATTAGGATTGTTTCAAAAGTGACATGTTTAATCCTTGAAAAGTTTGAATTTGATTCATAATTAGTTTCTAAATGACTAGAAGTAGAGTTGATTTTTAGTTGACCCAATGGACATGAGAATTAAAATATGCTTACTAAACTTTATACACACACATATATTAAATGAATTGGAAATTCTTTTATTCTTTTGTTTCATTGTACTAAAAAACCAATTAGAAACTTAATGATGGTCCAAAATTGATTTTCTACTCTCTTGTGCTTCGATTTACAAATCATATTAAACTCAAAATTCATTCTTATTTCATATAAATGTTATGAATATGAATATTACATAAGGTACATGCAAAGAGGAGCAGAGGCAGTTCATTCGGCAAACCCAGACATTCTCATAATCCTCTCAGGACTTAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAACCCATAAACCTCACATTCACAGCCAAATTAGTCTATGAGGTTCATTGGTATGCTTTTTCAGACGGATCTTCATGGGAATCAAGCAACTCAAACCAAGTATGTGGAAGAGTAGTTAACAACTTAATGAAAATGTCTGGATTTGTATTAGAACAAGGATTTCCTCTGTTTATAAGTGAATTTGGAGTTGACCAAAGAGGCACTAATGTAAATGACAACAGATATTTAAGCTGCTTCTTGTCTGTAGCAGCTGAATTTGACCTTGATTGGGCACTTTGGACACTTGTTGGAAGCTACTATTTAAGAGAAGGGGTTGTTGGTTTGAATGAATTTTATGGAATTCTTGATTGGAATTGGTGTGGTCCTAGAAATTCTACTTTCCTTCAAAGGATTTCTGCTCTTCAATCTCCATTTCAAGGCCCAGGACTAGAAGAAAGAAGGCAATATAATGTAATTTTCCACCCATCAACTGGCTTATGTGTGGTGAGAAAATCATTGTTAGATCCATTAAGATTAGGCCCATGTGTTGATTCAGATCCTTGGTACTATACTCCACAGAAGTTTTTGACTCTCAAAGGCACTTATTTTTGTATACAAGCAGAGGAAATGGGGAAGCAAGCAAAATTGGGGATAATATGTACCGTCTCTGATGCTAAATGGGATATGATTTCTGATTCAAAAATGCATCTTTCTTCCAAAACTAGTAATGGGTCCATGGTTTGCTTGGATGTGGATTCAAGTACGAATGAAATTGTGACCAATTCTTGTAAGTGTTTGAGTCGGGATTCGACGTGCGACCCAAGTAGCCAGTGGTTCAAGCTTGTTAATAGTACGAGAAGTTTGGGCAGGGCGGGGTCGATGATTAGTATGGTTGGTTCGTCTTCGGCGAATGTTGTGCCCAAGTTTGTGGAGTTGAGTTATGGCAGTATGTGA

mRNA sequence

ATGAAGGAAACAAGTAGATCGATGAAGGGGTTAATTTTTTTGTACTGTGTCGCCGCCGTGTGGCTTCAGGCGGCGGTGGGGCTGCCGTTACACACCGATTCACGGTGGATCGTAGACGAAGAAGGGCAAAGAGTAAAACTAAGATGCGTCAATTGGGTGTCGCATTTGGAGGCGGTGGTGGCGGAGGGACTTTCAAAGCAGCCCATCGACGCGATTTCGAACAGAATTGAGTCATTAGGATTCAACTGCGTGAGGCTGACTTGGCCACTGTTTTTAGCCACAAACGAGTCTTTGAATTCCCTCACCGTCCGACAGTCTTTCCAGCGGCTGGGACTTGCCGAAGCCATCGCCGGAGTCCAAGCCAACAACCCCTTCATTATCGATCTTCCTCTCATGAAAGCTTTTGAGGCGGTGGTGGGGAGATTGGGGAATGGGAAATTAATGGTGATATTGGATAATCACATAAGCAAGCCAGGGTGGTGCTGCAGTAACTTCGACGGAAATGGGTTTTTTGGGGATCAGTATTTCAATCCGGAGTTATGGATTAAAGGCCTCACTCGAATGGCCGCCATGTTTAACGGCATGGCCCATGTTGTTGGAATGAGCCTTAGGAATGAACTTCGTGGCCCAAAACAGAATGTCAACGACTGGTACAGGTACATGCAAAGAGGAGCAGAGGCAGTTCATTCGGCAAACCCAGACATTCTCATAATCCTCTCAGGACTTAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAACCCATAAACCTCACATTCACAGCCAAATTAGTCTATGAGGTTCATTGGTATGCTTTTTCAGACGGATCTTCATGGGAATCAAGCAACTCAAACCAAGTATGTGGAAGAGTAGTTAACAACTTAATGAAAATGTCTGGATTTGTATTAGAACAAGGATTTCCTCTGTTTATAAGTGAATTTGGAGTTGACCAAAGAGGCACTAATGTAAATGACAACAGATATTTAAGCTGCTTCTTGTCTGTAGCAGCTGAATTTGACCTTGATTGGGCACTTTGGACACTTGTTGGAAGCTACTATTTAAGAGAAGGGGTTGTTGGTTTGAATGAATTTTATGGAATTCTTGATTGGAATTGGTGTGGTCCTAGAAATTCTACTTTCCTTCAAAGGATTTCTGCTCTTCAATCTCCATTTCAAGGCCCAGGACTAGAAGAAAGAAGGCAATATAATGTAATTTTCCACCCATCAACTGGCTTATGTGTGGTGAGAAAATCATTGTTAGATCCATTAAGATTAGGCCCATGTGTTGATTCAGATCCTTGGTACTATACTCCACAGAAGTTTTTGACTCTCAAAGGCACTTATTTTTGTATACAAGCAGAGGAAATGGGGAAGCAAGCAAAATTGGGGATAATATGTACCGTCTCTGATGCTAAATGGGATATGATTTCTGATTCAAAAATGCATCTTTCTTCCAAAACTAGTAATGGGTCCATGGTTTGCTTGGATGTGGATTCAAGTACGAATGAAATTGTGACCAATTCTTGTAAGTGTTTGAGTCGGGATTCGACGTGCGACCCAAGTAGCCAGTGGTTCAAGCTTGTTAATAGTACGAGAAGTTTGGGCAGGGCGGGGTCGATGATTAGTATGGTTGGTTCGTCTTCGGCGAATGTTGTGCCCAAGTTTGTGGAGTTGAGTTATGGCAGTATGTGA

Coding sequence (CDS)

ATGAAGGAAACAAGTAGATCGATGAAGGGGTTAATTTTTTTGTACTGTGTCGCCGCCGTGTGGCTTCAGGCGGCGGTGGGGCTGCCGTTACACACCGATTCACGGTGGATCGTAGACGAAGAAGGGCAAAGAGTAAAACTAAGATGCGTCAATTGGGTGTCGCATTTGGAGGCGGTGGTGGCGGAGGGACTTTCAAAGCAGCCCATCGACGCGATTTCGAACAGAATTGAGTCATTAGGATTCAACTGCGTGAGGCTGACTTGGCCACTGTTTTTAGCCACAAACGAGTCTTTGAATTCCCTCACCGTCCGACAGTCTTTCCAGCGGCTGGGACTTGCCGAAGCCATCGCCGGAGTCCAAGCCAACAACCCCTTCATTATCGATCTTCCTCTCATGAAAGCTTTTGAGGCGGTGGTGGGGAGATTGGGGAATGGGAAATTAATGGTGATATTGGATAATCACATAAGCAAGCCAGGGTGGTGCTGCAGTAACTTCGACGGAAATGGGTTTTTTGGGGATCAGTATTTCAATCCGGAGTTATGGATTAAAGGCCTCACTCGAATGGCCGCCATGTTTAACGGCATGGCCCATGTTGTTGGAATGAGCCTTAGGAATGAACTTCGTGGCCCAAAACAGAATGTCAACGACTGGTACAGGTACATGCAAAGAGGAGCAGAGGCAGTTCATTCGGCAAACCCAGACATTCTCATAATCCTCTCAGGACTTAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAACCCATAAACCTCACATTCACAGCCAAATTAGTCTATGAGGTTCATTGGTATGCTTTTTCAGACGGATCTTCATGGGAATCAAGCAACTCAAACCAAGTATGTGGAAGAGTAGTTAACAACTTAATGAAAATGTCTGGATTTGTATTAGAACAAGGATTTCCTCTGTTTATAAGTGAATTTGGAGTTGACCAAAGAGGCACTAATGTAAATGACAACAGATATTTAAGCTGCTTCTTGTCTGTAGCAGCTGAATTTGACCTTGATTGGGCACTTTGGACACTTGTTGGAAGCTACTATTTAAGAGAAGGGGTTGTTGGTTTGAATGAATTTTATGGAATTCTTGATTGGAATTGGTGTGGTCCTAGAAATTCTACTTTCCTTCAAAGGATTTCTGCTCTTCAATCTCCATTTCAAGGCCCAGGACTAGAAGAAAGAAGGCAATATAATGTAATTTTCCACCCATCAACTGGCTTATGTGTGGTGAGAAAATCATTGTTAGATCCATTAAGATTAGGCCCATGTGTTGATTCAGATCCTTGGTACTATACTCCACAGAAGTTTTTGACTCTCAAAGGCACTTATTTTTGTATACAAGCAGAGGAAATGGGGAAGCAAGCAAAATTGGGGATAATATGTACCGTCTCTGATGCTAAATGGGATATGATTTCTGATTCAAAAATGCATCTTTCTTCCAAAACTAGTAATGGGTCCATGGTTTGCTTGGATGTGGATTCAAGTACGAATGAAATTGTGACCAATTCTTGTAAGTGTTTGAGTCGGGATTCGACGTGCGACCCAAGTAGCCAGTGGTTCAAGCTTGTTAATAGTACGAGAAGTTTGGGCAGGGCGGGGTCGATGATTAGTATGGTTGGTTCGTCTTCGGCGAATGTTGTGCCCAAGTTTGTGGAGTTGAGTTATGGCAGTATGTGA

Protein sequence

MKETSRSMKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGFPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGPCVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKTSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSSSANVVPKFVELSYGSM
BLAST of Cla97C05G108280 vs. NCBI nr
Match: XP_008446127.1 (PREDICTED: uncharacterized protein LOC103488945 [Cucumis melo])

HSP 1 Score: 1031.9 bits (2667), Expect = 7.8e-298
Identity = 488/551 (88.57%), Postives = 518/551 (94.01%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQ 67
           MKGL+ L  V  V   AAVGLPLHTD+RWIVD EG+RVKLRCVNWVSHLEAVVAEGLSKQ
Sbjct: 2   MKGLLILLAVWCVAASAAVGLPLHTDTRWIVDGEGERVKLRCVNWVSHLEAVVAEGLSKQ 61

Query: 68  PIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFII 127
           PI+ ISNRIE LGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGL EAIAG+QANNPFII
Sbjct: 62  PIEEISNRIEGLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLTEAIAGIQANNPFII 121

Query: 128 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 187
           DLPL+KAFEAVVG+LG GKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNP+LWIKGLTR
Sbjct: 122 DLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLTR 181

Query: 188 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 247
           MA MFNG+ HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFD+D
Sbjct: 182 MATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDRD 241

Query: 248 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGF 307
           LSFLKNQPINLTFT+K VYEVHWYAFSDGSSWES NSNQVCGR  NNLMKMSGF+L+QGF
Sbjct: 242 LSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQGF 301

Query: 308 PLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 367
           PLFISEFG+DQRGTNVNDNRYLSCFL+VAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI
Sbjct: 302 PLFISEFGIDQRGTNVNDNRYLSCFLAVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 361

Query: 368 LDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGP 427
           LDWNWC  RNSTFLQRIS LQ+P QGPGL ERR+YN+IFHP +GLCVVRKSLLDPLRLGP
Sbjct: 362 LDWNWCNLRNSTFLQRISVLQTPLQGPGLAERREYNLIFHPLSGLCVVRKSLLDPLRLGP 421

Query: 428 CVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKT 487
           CVDSD WYYTPQKFLTLKGTYFCIQA+E+GKQAKLGIICTV++AKWDMISDSK+HLSSK+
Sbjct: 422 CVDSDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSKS 481

Query: 488 SNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSS 547
           SNGS+VCLDVD++TNEIVTNSCKCLSRDS+CDPSSQWFKLVNSTRSLG   SMI+M GSS
Sbjct: 482 SNGSLVCLDVDANTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGGGRSMINMAGSS 541

Query: 548 SANVVPKFVEL 559
            ANVV KFVEL
Sbjct: 542 LANVVTKFVEL 552

BLAST of Cla97C05G108280 vs. NCBI nr
Match: XP_023514558.1 (uncharacterized protein LOC111778811 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1029.6 bits (2661), Expect = 3.9e-297
Identity = 484/548 (88.32%), Postives = 521/548 (95.07%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQ 67
           MKGL+ L  VAA+WLQAAVGLPLHTDSRWIVDE+GQRVKLRCVNWVSHLEAVVAEGLSKQ
Sbjct: 1   MKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQ 60

Query: 68  PIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFII 127
           PID I+NRI SLGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAGVQANNPFII
Sbjct: 61  PIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFII 120

Query: 128 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 187
           DLPL+KAFEAVVGRLG  +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNPE WI+GLTR
Sbjct: 121 DLPLIKAFEAVVGRLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTR 180

Query: 188 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 247
           MA MFNG+AHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLSFDKD
Sbjct: 181 MATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKD 240

Query: 248 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGF 307
           LSFLK QPINLTFT+KLVYEVHWYAFSDGSSW+S N NQVCGRVVNNLMKMSGF+LEQG 
Sbjct: 241 LSFLKTQPINLTFTSKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGM 300

Query: 308 PLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 367
           PLF++EFGVDQRGTNVNDNRYLSCFLSVAAE+DLDWA+WTLVGSYYLREGVVGLNEFYG+
Sbjct: 301 PLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGL 360

Query: 368 LDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGP 427
           LDWNWC PRNS+FLQRISALQ+PFQGPGL ERR Y+V+FHPSTGLCV RKSLLDPLRLGP
Sbjct: 361 LDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRKSLLDPLRLGP 420

Query: 428 CVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKT 487
           C DSDPWYYTPQKFLTLKGTYFCIQA++MGKQ +LGIICTVS+A+WDM+SDSKMHLSSK 
Sbjct: 421 CADSDPWYYTPQKFLTLKGTYFCIQAQDMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKL 480

Query: 488 SNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSS 547
            +GS+VCLDVD STNEIVTN+CKCLSRDS+CDPSSQWFKLVNSTRSLG   SMISMVGSS
Sbjct: 481 DDGSVVCLDVDPSTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSLGATRSMISMVGSS 540

Query: 548 -SANVVPK 555
            S+N+VPK
Sbjct: 541 LSSNLVPK 548

BLAST of Cla97C05G108280 vs. NCBI nr
Match: XP_004135502.2 (PREDICTED: uncharacterized protein LOC101217177 [Cucumis sativus] >KGN51727.1 hypothetical protein Csa_5G593400 [Cucumis sativus])

HSP 1 Score: 1026.9 bits (2654), Expect = 2.5e-296
Identity = 489/552 (88.59%), Postives = 521/552 (94.38%), Query Frame = 0

Query: 8   MKGLIFL-YCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSK 67
           MKGLI L +CVAA    AAVGLPLHTD+RWIVD  G+RVKLRCVNWVSHLEAVVAEGLSK
Sbjct: 2   MKGLILLVWCVAA---SAAVGLPLHTDTRWIVDGAGERVKLRCVNWVSHLEAVVAEGLSK 61

Query: 68  QPIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFI 127
           QPI+ ISNRI+ LGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAG+QANNPFI
Sbjct: 62  QPIEEISNRIQWLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGIQANNPFI 121

Query: 128 IDLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLT 187
           IDLPL+KAFEAVVG+LG GKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNP+LWIKGLT
Sbjct: 122 IDLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLT 181

Query: 188 RMAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDK 247
           R+A MFNG+ HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLS+D+
Sbjct: 182 RIATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDR 241

Query: 248 DLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQG 307
           DLSFLKNQPINLTFT+K VYEVHWYAFSDGSSWES NSNQVCGR  NNLMKMSGF+L+QG
Sbjct: 242 DLSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQG 301

Query: 308 FPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYG 367
           FPLFISEFG+DQRGTNVNDNRYLSCFL+VAAE DLDWA+WTLVGSYYLREGVVGLNEFYG
Sbjct: 302 FPLFISEFGIDQRGTNVNDNRYLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYG 361

Query: 368 ILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLG 427
           ILDWNWC  RNSTFLQRISALQSPFQGPGL ERR+YNVIFHP +GLCVVRKSLLDPL LG
Sbjct: 362 ILDWNWCNLRNSTFLQRISALQSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLG 421

Query: 428 PCVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSK 487
           PCVD+D WYYTPQKFLTLKGTYFCIQA+E+GKQAKLGIICTV++AKWDMISDSK+HLSSK
Sbjct: 422 PCVDTDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSK 481

Query: 488 TSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGS 547
           +SNGS+VCLDVDSSTNEIVTNSCKCLSRDS+CDPSSQWFKLVNSTRSLGR  SMI+MVGS
Sbjct: 482 SSNGSLVCLDVDSSTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINMVGS 541

Query: 548 SSANVVPKFVEL 559
           S  NV  KFV+L
Sbjct: 542 SLPNVATKFVDL 550

BLAST of Cla97C05G108280 vs. NCBI nr
Match: XP_022957392.1 (uncharacterized protein LOC111458804 [Cucurbita moschata])

HSP 1 Score: 1026.2 bits (2652), Expect = 4.3e-296
Identity = 482/548 (87.96%), Postives = 521/548 (95.07%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQ 67
           MKGL+ L  VAA+WLQAAVGLPLHTDSRWIVDE+GQRVKLRCVNWVSHLEAVVAEGLSKQ
Sbjct: 1   MKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQ 60

Query: 68  PIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFII 127
           PID I+NRI  LGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAG+QANNPFII
Sbjct: 61  PIDEITNRIGLLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGIQANNPFII 120

Query: 128 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 187
           DLPL+KAFEAVVGRLG  +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNPE WI+GLTR
Sbjct: 121 DLPLIKAFEAVVGRLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTR 180

Query: 188 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 247
           MA MFNG+AHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLSFDKD
Sbjct: 181 MATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKD 240

Query: 248 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGF 307
           LSFLK QPINLTFTAKLVYEVHWYAFSDGSSW+S N NQVCGRVVNNLMKMSGF+LEQG 
Sbjct: 241 LSFLKTQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGM 300

Query: 308 PLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 367
           PLF++EFGVDQRGTNVNDNRYLSCFLSVAAE+DLDWA+WTLVGSYYLREGVVGLNEFYG+
Sbjct: 301 PLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGL 360

Query: 368 LDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGP 427
           LDWNWC PRNS+FLQRISALQ+PFQGPGL ERR Y+V+FHPSTGLCV RKSLLDPLRLGP
Sbjct: 361 LDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRKSLLDPLRLGP 420

Query: 428 CVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKT 487
           C DSDPWYYTPQKFLTLKGTYFCIQA++MGKQ +LGIICTVS+A+WDM+SDSKMHLSSK 
Sbjct: 421 CADSDPWYYTPQKFLTLKGTYFCIQAQDMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKL 480

Query: 488 SNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSS 547
            +GS+VCLDVD S+NEIVTN+CKCLSRDS+C+PSSQWFKLVNSTRSLG A SMISMVGSS
Sbjct: 481 DDGSVVCLDVDPSSNEIVTNACKCLSRDSSCNPSSQWFKLVNSTRSLGAARSMISMVGSS 540

Query: 548 -SANVVPK 555
            S+N+VPK
Sbjct: 541 LSSNLVPK 548

BLAST of Cla97C05G108280 vs. NCBI nr
Match: XP_022997703.1 (uncharacterized protein LOC111492584 [Cucurbita maxima])

HSP 1 Score: 1025.8 bits (2651), Expect = 5.6e-296
Identity = 486/552 (88.04%), Postives = 520/552 (94.20%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQ 67
           MKGL+ L  VAA+WLQAAVGLPLHTDSRWIVDE+GQRVKLRCVNWVSHLEAVVAEGLSKQ
Sbjct: 2   MKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQ 61

Query: 68  PIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFII 127
           PID I+NRI SLGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAGVQANNPFII
Sbjct: 62  PIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFII 121

Query: 128 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 187
           DLPL+KAFEAVVG LG  +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNPE WI+GLTR
Sbjct: 122 DLPLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTR 181

Query: 188 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 247
           MA MFNG+AHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLSFDKD
Sbjct: 182 MATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKD 241

Query: 248 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGF 307
           LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSW+S N NQVCGRVVNNLMKMSGF+LEQG 
Sbjct: 242 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGM 301

Query: 308 PLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 367
           PLF++EFGVDQRGTNVNDNRYLSCFLSVAAE+DLDWA+WTLVGSYYLREGVVGLNEFYG+
Sbjct: 302 PLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGL 361

Query: 368 LDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGP 427
           LDWNWC PRNS+FLQRISALQ+PFQGPGL ERR Y+V+FHPSTGLCV R SLLDPLRLGP
Sbjct: 362 LDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGP 421

Query: 428 CVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKT 487
           C DSDPWYYTPQKFLTLKGTYFCIQA+EMGKQ +LGIICTVS+A+WDM+SDSKMHLSSK 
Sbjct: 422 CADSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKL 481

Query: 488 SNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSS 547
            NGS+VCLDVD +TNEIVTN+CKCLSRDS+CDPSSQWFKLVNSTRS     SMISMVGSS
Sbjct: 482 DNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGSS 541

Query: 548 -SANVVPK-FVE 558
            S N+VPK FVE
Sbjct: 542 LSFNLVPKLFVE 553

BLAST of Cla97C05G108280 vs. TrEMBL
Match: tr|A0A1S3BF06|A0A1S3BF06_CUCME (uncharacterized protein LOC103488945 OS=Cucumis melo OX=3656 GN=LOC103488945 PE=3 SV=1)

HSP 1 Score: 1031.9 bits (2667), Expect = 5.2e-298
Identity = 488/551 (88.57%), Postives = 518/551 (94.01%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQ 67
           MKGL+ L  V  V   AAVGLPLHTD+RWIVD EG+RVKLRCVNWVSHLEAVVAEGLSKQ
Sbjct: 2   MKGLLILLAVWCVAASAAVGLPLHTDTRWIVDGEGERVKLRCVNWVSHLEAVVAEGLSKQ 61

Query: 68  PIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFII 127
           PI+ ISNRIE LGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGL EAIAG+QANNPFII
Sbjct: 62  PIEEISNRIEGLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLTEAIAGIQANNPFII 121

Query: 128 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 187
           DLPL+KAFEAVVG+LG GKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNP+LWIKGLTR
Sbjct: 122 DLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLTR 181

Query: 188 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 247
           MA MFNG+ HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFD+D
Sbjct: 182 MATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDRD 241

Query: 248 LSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGF 307
           LSFLKNQPINLTFT+K VYEVHWYAFSDGSSWES NSNQVCGR  NNLMKMSGF+L+QGF
Sbjct: 242 LSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQGF 301

Query: 308 PLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 367
           PLFISEFG+DQRGTNVNDNRYLSCFL+VAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI
Sbjct: 302 PLFISEFGIDQRGTNVNDNRYLSCFLAVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGI 361

Query: 368 LDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGP 427
           LDWNWC  RNSTFLQRIS LQ+P QGPGL ERR+YN+IFHP +GLCVVRKSLLDPLRLGP
Sbjct: 362 LDWNWCNLRNSTFLQRISVLQTPLQGPGLAERREYNLIFHPLSGLCVVRKSLLDPLRLGP 421

Query: 428 CVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKT 487
           CVDSD WYYTPQKFLTLKGTYFCIQA+E+GKQAKLGIICTV++AKWDMISDSK+HLSSK+
Sbjct: 422 CVDSDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSKS 481

Query: 488 SNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGSS 547
           SNGS+VCLDVD++TNEIVTNSCKCLSRDS+CDPSSQWFKLVNSTRSLG   SMI+M GSS
Sbjct: 482 SNGSLVCLDVDANTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGGGRSMINMAGSS 541

Query: 548 SANVVPKFVEL 559
            ANVV KFVEL
Sbjct: 542 LANVVTKFVEL 552

BLAST of Cla97C05G108280 vs. TrEMBL
Match: tr|A0A0A0KTH1|A0A0A0KTH1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G593400 PE=3 SV=1)

HSP 1 Score: 1026.9 bits (2654), Expect = 1.7e-296
Identity = 489/552 (88.59%), Postives = 521/552 (94.38%), Query Frame = 0

Query: 8   MKGLIFL-YCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSK 67
           MKGLI L +CVAA    AAVGLPLHTD+RWIVD  G+RVKLRCVNWVSHLEAVVAEGLSK
Sbjct: 2   MKGLILLVWCVAA---SAAVGLPLHTDTRWIVDGAGERVKLRCVNWVSHLEAVVAEGLSK 61

Query: 68  QPIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFI 127
           QPI+ ISNRI+ LGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAG+QANNPFI
Sbjct: 62  QPIEEISNRIQWLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGIQANNPFI 121

Query: 128 IDLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLT 187
           IDLPL+KAFEAVVG+LG GKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNP+LWIKGLT
Sbjct: 122 IDLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLT 181

Query: 188 RMAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDK 247
           R+A MFNG+ HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLS+D+
Sbjct: 182 RIATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDR 241

Query: 248 DLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQG 307
           DLSFLKNQPINLTFT+K VYEVHWYAFSDGSSWES NSNQVCGR  NNLMKMSGF+L+QG
Sbjct: 242 DLSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQG 301

Query: 308 FPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYG 367
           FPLFISEFG+DQRGTNVNDNRYLSCFL+VAAE DLDWA+WTLVGSYYLREGVVGLNEFYG
Sbjct: 302 FPLFISEFGIDQRGTNVNDNRYLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYG 361

Query: 368 ILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLG 427
           ILDWNWC  RNSTFLQRISALQSPFQGPGL ERR+YNVIFHP +GLCVVRKSLLDPL LG
Sbjct: 362 ILDWNWCNLRNSTFLQRISALQSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLG 421

Query: 428 PCVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSK 487
           PCVD+D WYYTPQKFLTLKGTYFCIQA+E+GKQAKLGIICTV++AKWDMISDSK+HLSSK
Sbjct: 422 PCVDTDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSK 481

Query: 488 TSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRAGSMISMVGS 547
           +SNGS+VCLDVDSSTNEIVTNSCKCLSRDS+CDPSSQWFKLVNSTRSLGR  SMI+MVGS
Sbjct: 482 SSNGSLVCLDVDSSTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINMVGS 541

Query: 548 SSANVVPKFVEL 559
           S  NV  KFV+L
Sbjct: 542 SLPNVATKFVDL 550

BLAST of Cla97C05G108280 vs. TrEMBL
Match: tr|A0A067K4F9|A0A067K4F9_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_17827 PE=3 SV=1)

HSP 1 Score: 855.1 bits (2208), Expect = 8.7e-245
Identity = 382/506 (75.49%), Postives = 454/506 (89.72%), Query Frame = 0

Query: 28  LPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGFNCVRLT 87
           +PL TDSRWIVDE+GQRVKL CVNWVSHLEAVVAEGLS++P+D I+ +I S+GFNCVRLT
Sbjct: 31  VPLSTDSRWIVDEKGQRVKLACVNWVSHLEAVVAEGLSREPMDLIAKKIVSMGFNCVRLT 90

Query: 88  WPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVGRLGNGKL 147
           WPL+L TNE+  SLTV+QSFQ LGL E+I+G+QANNP IIDLPL+KA++AVV  LG+  +
Sbjct: 91  WPLYLVTNETYASLTVKQSFQNLGLLESISGIQANNPAIIDLPLIKAYQAVVSSLGDNNV 150

Query: 148 MVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVGMSLRNEL 207
           +VILDNHISKPGWCCSNFDGNGFFGD YFNP+LWI GLT+MA +FNG+ +VVG+SLRNEL
Sbjct: 151 LVILDNHISKPGWCCSNFDGNGFFGDSYFNPDLWINGLTQMATIFNGVPNVVGLSLRNEL 210

Query: 208 RGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTFTAKLVYE 267
           RG KQNVNDWYRYM++GAEAVHSANPD+L+ILSGL++DKDLSFL+N+P+NLTFT KLV+E
Sbjct: 211 RGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDLSFLRNRPVNLTFTGKLVFE 270

Query: 268 VHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGFPLFISEFGVDQRGTNVNDNR 327
           VHWY FSDG +W++ N NQVCGRV +N+M+MSG++L+QG+PLF+SEFG+DQRGTNVNDNR
Sbjct: 271 VHWYGFSDGQAWKNGNPNQVCGRVTSNMMRMSGYLLDQGWPLFVSEFGIDQRGTNVNDNR 330

Query: 328 YLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILDWNWCGPRNSTFLQRISAL 387
           YL CFL  AAE DLDWALWTLVGSYYLREGV+GLNE+YG+L+WNWC  RNS+FLQ+ISAL
Sbjct: 331 YLGCFLGWAAELDLDWALWTLVGSYYLREGVLGLNEYYGVLNWNWCDIRNSSFLQQISAL 390

Query: 388 QSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGPCVDSDPWYYTPQKFLTLKGT 447
           QSPFQGPG+ E   + VIFHP TGLCV RKS+L+PL+LGPC DS+ W YTPQK L+LKGT
Sbjct: 391 QSPFQGPGVSESNLHKVIFHPLTGLCVQRKSMLEPLKLGPCTDSEAWRYTPQKILSLKGT 450

Query: 448 YFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKTSNGSMVCLDVDSSTNEIVTN 507
           YFC+QA+E+GK AKLG+ICT SD+KWD+ISDS MHLSSK SNG+ +CLDVDS+ N IV N
Sbjct: 451 YFCLQADELGKPAKLGVICTDSDSKWDVISDSNMHLSSKISNGTTICLDVDSN-NTIVIN 510

Query: 508 SCKCLSRDSTCDPSSQWFKLVNSTRS 534
           +CKCLSRD+TCDP SQWFKLVNSTRS
Sbjct: 511 TCKCLSRDNTCDPGSQWFKLVNSTRS 535

BLAST of Cla97C05G108280 vs. TrEMBL
Match: tr|B9RMT6|B9RMT6_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis OX=3988 GN=RCOM_1083710 PE=3 SV=1)

HSP 1 Score: 854.4 bits (2206), Expect = 1.5e-244
Identity = 389/528 (73.67%), Postives = 462/528 (87.50%), Query Frame = 0

Query: 11  LIFLYCVAAVWLQAAV-GLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPI 70
           + F   ++A+  Q+ V  LPL T+SRWIVDE GQRVKL CVNWVSHLEAVVAEGLSKQP+
Sbjct: 14  ITFFIAISAIIPQSQVTALPLSTNSRWIVDENGQRVKLACVNWVSHLEAVVAEGLSKQPM 73

Query: 71  DAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFIIDL 130
           D I+ +I S+GFNCVRLTWPL+L TN++L SL+VRQSFQ LGL E+I+G+QANNP IIDL
Sbjct: 74  DMIAKKIVSMGFNCVRLTWPLYLVTNDTLASLSVRQSFQGLGLLESISGIQANNPSIIDL 133

Query: 131 PLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMA 190
           PL+KA++AVV  LG+  +MVILDNHISKPGWCCSNFDGNGFFGD YFNP+LWIKGLT+MA
Sbjct: 134 PLIKAYQAVVSSLGDNNVMVILDNHISKPGWCCSNFDGNGFFGDTYFNPDLWIKGLTQMA 193

Query: 191 AMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLS 250
            +FNG+ +V+GMSLRNELRG KQNVNDWYRYM++GAEAVHSANPD+L+ILSGL++DKD S
Sbjct: 194 TLFNGVTNVIGMSLRNELRGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDFS 253

Query: 251 FLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGFPL 310
           FL+N+P+NL+FT K+V+EVHWY FSDG +W S N NQVCGRVV+NLM++SGF+LEQG+P+
Sbjct: 254 FLRNRPVNLSFTGKVVFEVHWYGFSDGQAWRSGNPNQVCGRVVDNLMRISGFLLEQGWPM 313

Query: 311 FISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILD 370
           F+SEFGVDQRGTNVNDNRYL CF+ VAAE D DWALWTLVGSYYLR+GV+GLNE+YG+L+
Sbjct: 314 FVSEFGVDQRGTNVNDNRYLGCFIGVAAELDWDWALWTLVGSYYLRQGVIGLNEYYGVLN 373

Query: 371 WNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGPCV 430
           WNWC  RNS+FLQ+ISALQSPFQGPGL E   + VIFHPSTGLCV RKS+L+PLRLG C 
Sbjct: 374 WNWCDVRNSSFLQQISALQSPFQGPGLSETNPHKVIFHPSTGLCVQRKSMLEPLRLGSCT 433

Query: 431 DSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHLSSKTSN 490
           DS+ W YT +  LTL+GTYFC+QA+E+GK AKLGIICT S +KWD+ISDSKMHLSSK +N
Sbjct: 434 DSEAWRYTSENTLTLRGTYFCLQADELGKPAKLGIICTDSTSKWDVISDSKMHLSSKITN 493

Query: 491 GSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGRA 538
           G+ VCLDVDS+ N IV ++CKCLSRD+TCDP SQWFKLVNSTRS   A
Sbjct: 494 GTAVCLDVDSN-NTIVISTCKCLSRDNTCDPESQWFKLVNSTRSSATA 540

BLAST of Cla97C05G108280 vs. TrEMBL
Match: tr|A0A061EEN1|A0A061EEN1_THECC (Cellulase protein OS=Theobroma cacao OX=3641 GN=TCM_010637 PE=3 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 2.8e-243
Identity = 390/530 (73.58%), Postives = 457/530 (86.23%), Query Frame = 0

Query: 4   TSRSMKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEG 63
           TS S+  L  ++ +     + A+ LPL T+SRWIVDE+GQRVKL CVNWVSHLE +VAEG
Sbjct: 5   TSLSILPLFIVFHIIIQDAKPAMSLPLSTNSRWIVDEKGQRVKLACVNWVSHLEPMVAEG 64

Query: 64  LSKQPIDAISNRIESLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANN 123
           LSK P+D I+ RI S GFNCVRLTWPLFL TN+SL SLTVRQSFQRLGL E+IAG+Q NN
Sbjct: 65  LSKLPMDVIAKRIVSTGFNCVRLTWPLFLVTNDSLASLTVRQSFQRLGLLESIAGIQTNN 124

Query: 124 PFIIDLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIK 183
           P IID+ L+KA++AVV  LG   +MVILDNHISKPGWCCSNFDGNGFFGDQYFNP++WI 
Sbjct: 125 PSIIDVSLLKAYQAVVCSLGENNVMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDIWIT 184

Query: 184 GLTRMAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLS 243
           GLTRMA + N + +VVGMSLRNELRGPKQ VNDWYRYMQ+GAEAVHSANPD+L+ILSGL+
Sbjct: 185 GLTRMATLVNAVTNVVGMSLRNELRGPKQTVNDWYRYMQKGAEAVHSANPDVLVILSGLN 244

Query: 244 FDKDLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVL 303
           +DKDLSF++N+P NLTFT KLV+EVHWY F+DG +W + N NQVCGRV N++M+ SGF++
Sbjct: 245 YDKDLSFIRNRPANLTFTGKLVFEVHWYGFTDGQTWVTGNPNQVCGRVANDMMRTSGFLV 304

Query: 304 EQGFPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNE 363
           +QG+PLF+SEFGVDQRGTNVNDNRYL+CFL VAAE DLDWALWTLVGSYYLREGVVGLNE
Sbjct: 305 DQGYPLFVSEFGVDQRGTNVNDNRYLNCFLGVAAELDLDWALWTLVGSYYLREGVVGLNE 364

Query: 364 FYGILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPL 423
           +YGIL+WNWC  RNS+FL+RISALQSPF+GPGL E + + VIFHPSTGLCV+RKSLLDPL
Sbjct: 365 YYGILNWNWCEIRNSSFLERISALQSPFRGPGLSETKLHKVIFHPSTGLCVLRKSLLDPL 424

Query: 424 RLGPCVDSDPWYYTPQKFLTLKGTYFCIQAEEMGKQAKLGIICTVSDAKWDMISDSKMHL 483
           RLGPC DS+ W Y+PQ  L +KGTYFC+QA+E G  A+LGIIC+ S++KW+MISDSKMHL
Sbjct: 425 RLGPCTDSEAWSYSPQNTLVVKGTYFCLQADESGTLARLGIICSESNSKWEMISDSKMHL 484

Query: 484 SSKTSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRS 534
           SSK  NG+ +CLDVD STN IVTNSCKCLS D+ CDP SQWFKLV+STRS
Sbjct: 485 SSKLRNGTSICLDVD-STNTIVTNSCKCLSNDNMCDPESQWFKLVDSTRS 533

BLAST of Cla97C05G108280 vs. Swiss-Prot
Match: sp|C0HLA0|GH5FP_CHAOB (Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 3.0e-123
Identity = 235/527 (44.59%), Postives = 310/527 (58.82%), Query Frame = 0

Query: 28  LPLHTDSRWIVDE-EGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGFNCVRL 87
           LPL T  RWIVDE  G RVKL CVNWV HLE  + EGL++ P+  +++ I SLGFNCVRL
Sbjct: 29  LPLLTRGRWIVDEATGLRVKLACVNWVGHLEPGLPEGLNRLPVATVAHTISSLGFNCVRL 88

Query: 88  TWPLFLATNESLNSLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVGRLGNGK 147
           T+ + + T  S  + TV Q+F RL L EA +G++ NNP ++DL  + A+  VV  L    
Sbjct: 89  TYSIHMLTRTSYTNATVAQTFARLNLTEAASGIEHNNPELLDLGHVAAYHHVVAALSEAG 148

Query: 148 LMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVGMSLRNE 207
           +MVILDNH+SKP WCC+  DGNGFFGD+YFNP  W++GL  MA  FN   +VV MSLRNE
Sbjct: 149 VMVILDNHVSKPKWCCAVDDGNGFFGDRYFNPNTWVEGLGLMATYFNNTPNVVAMSLRNE 208

Query: 208 LRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTFTAKLVY 267
           LRG +     W R+MQ GA  VH ANP +L+ILSGL FD DLSFL   P+ L F  K+VY
Sbjct: 209 LRGNRSTPISWSRHMQWGAATVHKANPKVLVILSGLQFDTDLSFLPVLPVTLPFKEKIVY 268

Query: 268 EVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQ----GFPLFISEFGVDQRGTN 327
           E HWY+F  G  W +   N VC           GFV         PLF+SEFG+DQR  N
Sbjct: 269 EGHWYSF--GVPWRTGLPNDVCKNETGRFKSNVGFVTSSANATAAPLFMSEFGIDQRYVN 328

Query: 328 VNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLR---EGVVGLNEFYGILDWNWCGPRNST 387
            NDNRYL+C L+  AE DLDWALWT+ GSYY R   + V    E YG  + +W   RN  
Sbjct: 329 DNDNRYLNCILAYLAEEDLDWALWTMGGSYYYRSDKQPVKDFEETYGFFNHDWSRIRNPD 388

Query: 388 FLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLRLGPCVD-SDPWYY-- 447
           F+ R+  +Q P Q P L     Y +I+HP++GLC V   + + + LG C      W Y  
Sbjct: 389 FISRLKEIQQPIQDPYLAPGPYYQIIYHPASGLC-VESGIGNTVHLGSCQSVRSRWNYDA 448

Query: 448 TPQKFLTLKGTYFCIQAEEMGKQAKLGIICTV-SDAKWDMISDSKMHLSS----KTSNGS 507
           + +  + L G+  CI  +  G  A +   C+  ++  W  +S +++ L +    K     
Sbjct: 449 SVKGPIGLMGSSSCISTQGNGLPAIMTENCSAPNNTLWSTVSSAQLQLGTRVLGKDGKEK 508

Query: 508 MVCLDVDSSTNEIVTNSCKCLSRDSTC----DPSSQWFKLVNSTRSL 535
            +CLD  S +  I TN C C++ DS C    +P  QWFK++ + + L
Sbjct: 509 WMCLD-GSKSPLISTNECICIT-DSHCYPKLNPEKQWFKVITTNKQL 550

BLAST of Cla97C05G108280 vs. TAIR10
Match: AT1G13130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 676.0 bits (1743), Expect = 2.0e-194
Identity = 312/510 (61.18%), Postives = 397/510 (77.84%), Query Frame = 0

Query: 29  PLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGFNCVRLTW 88
           PL T SRWIVDE G RVKL C NW SHL+ VVAEGLSKQP+DA++ +I  +GFNCVRLTW
Sbjct: 34  PLSTSSRWIVDENGLRVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEMGFNCVRLTW 93

Query: 89  PLFLATNESL-NSLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVGRLGNGKL 148
           PL L TNE+L N++TVRQSFQ LGL + I G Q NNP IIDLPL++A++ VV  LGN  +
Sbjct: 94  PLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTVVTTLGNNDV 153

Query: 149 MVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVGMSLRNEL 208
           MVILDNH++KPGWCC+N DGNGFFGDQ+F+P +W+  L +MAA FNG+++VVGMSLRNEL
Sbjct: 154 MVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLRNEL 213

Query: 209 RGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTFTAKLVYE 268
           RGPKQNVNDW++YMQ+GAEAVHSAN  +L+ILSGLSFD DLSF++++P+ L+FT KLV+E
Sbjct: 214 RGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKLVFE 273

Query: 269 VHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGFPLFISEFGVDQRGTNVNDNR 328
           +HWY+FSDG+SW ++N N +CGRV+N +    G++L QGFPLF+SEFG+D+RG N NDNR
Sbjct: 274 LHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNR 333

Query: 329 YLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILDWNWCGPRNSTFLQRISAL 388
           Y  C    AAE D+DW+LW L GSYYLR+G VG+NE+YG+LD +W   RNS+FLQ+IS L
Sbjct: 334 YFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFL 393

Query: 389 QSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDP--LRLGPCVDSDPWYYTPQKFLTLK 448
           QSP QGPG      YN++FHP TGLC+VR SL DP  L LGPC  S+PW YT +K L +K
Sbjct: 394 QSPLQGPG-PRTDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIK 453

Query: 449 GTYFCIQAEEMGKQAKL-GIICTVSDAKWDMISDSKMHLSSKTSNGSMVCLDVDSSTNEI 508
               C+Q+        +    C+ S +KW  IS S+MHL+S TSN + +CLDVD++ N +
Sbjct: 454 DQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLDVDTA-NNV 513

Query: 509 VTNSCKCLSRDSTCDPSSQWFKLVNSTRSL 535
           V N+CKCLS+D +C+P SQWFK++ +TR L
Sbjct: 514 VANACKCLSKDKSCEPMSQWFKIIKATRPL 539

BLAST of Cla97C05G108280 vs. TAIR10
Match: AT3G26130.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 644.4 bits (1661), Expect = 6.4e-185
Identity = 307/535 (57.38%), Postives = 402/535 (75.14%), Query Frame = 0

Query: 8   MKGLIFLYCVAAVWLQAAVGLPLHTDSRWIVDE--EGQRVKLRCVNWVSHLEAVVAEGLS 67
           M+   F+      ++      P  TDSRWIVD+  +G+RVKL CVNW SHLE  VAEGLS
Sbjct: 1   MEKFFFISVFLLPYVITTFAFPPSTDSRWIVDDGNKGRRVKLTCVNWPSHLETAVAEGLS 60

Query: 68  KQPIDAISNRIESLGFNCVRLTWPLFLATNESLNS-LTVRQSFQRLGLAEAIAGVQANNP 127
           KQP+DAI+ +I S+GFNCVRLTWPL+LAT+ES ++ +TVRQS ++  L EA++G Q +NP
Sbjct: 61  KQPLDAIAEKIVSMGFNCVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNP 120

Query: 128 FIIDLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKG 187
            I+DLPL+KAF+ VV  L   ++MVILDNHIS+PGWCCS+ DGNGFFGD++ NP++WIKG
Sbjct: 121 TILDLPLIKAFQEVVYCLEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKG 180

Query: 188 LTRMAAMF-NGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLS 247
           L +MA+MF N  ++VVGMSLRNELRGPKQN+ DWY+YM+ GAEAVHS NP++L+I+SGL+
Sbjct: 181 LKKMASMFANVSSNVVGMSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLN 240

Query: 248 FDKDLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVL 307
           +  DLSFL+ +P  ++F  K+V+E+HWY F   ++WE  N N++CG+    +MKMSGF+L
Sbjct: 241 YATDLSFLRERPFEVSFRRKVVFEIHWYGF--WNTWEGDNLNKICGKETEKMMKMSGFLL 300

Query: 308 EQGFPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNE 367
           E+G PLF+SEFG+DQRG N NDN++LSCF+++AA+ DLDW+LWTL GSYY+RE  +G +E
Sbjct: 301 EKGIPLFVSEFGIDQRGNNANDNKFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDE 360

Query: 368 FYGILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPL 427
            YG+LD+NW   RNST LQ ISA+Q+PF   GL E +   ++FHPSTGLC+VRKSL   L
Sbjct: 361 SYGVLDFNWSSIRNSTILQMISAIQTPF--IGLMETQPKKIMFHPSTGLCIVRKSLFQ-L 420

Query: 428 RLGPCVDSDPWYYTPQKFLTL-KGTYFCIQAEEMGKQAKLGIICTVS-DAKWDMISDSKM 487
           +LG C  S+ W  +  + L+L +    C++A E GK  KL +  + S  +KW + SDSKM
Sbjct: 421 KLGSCNRSESWRLSSHRVLSLAEEQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKM 480

Query: 488 HLSSKTSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLVNSTRSLGR 537
            LSS T NG  VCLDVD+  N IVTNSCKCL  +S+CDP SQWFKLV STR   R
Sbjct: 481 QLSSITKNGFSVCLDVDTENNNIVTNSCKCLRGNSSCDPRSQWFKLVTSTRRRSR 530

BLAST of Cla97C05G108280 vs. TAIR10
Match: AT3G26140.1 (Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 619.4 bits (1596), Expect = 2.2e-177
Identity = 290/510 (56.86%), Postives = 390/510 (76.47%), Query Frame = 0

Query: 29  PLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGFNCVRLTW 88
           PL T+SRWI+DE+GQRVKL CVNW SHL+ VVAEGLSKQ +D ++ +I ++GFNCVR TW
Sbjct: 4   PLSTNSRWIIDEKGQRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFNCVRFTW 63

Query: 89  PLFLATNESL-NSLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVGRLGNGKL 148
           PL LATNE+L N++TVRQSFQ LGL + I+G +  NP +IDLPL++A++ VV +LGN  +
Sbjct: 64  PLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAKLGNNNV 123

Query: 149 MVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVGMSLRNEL 208
           MVILDNH++KPGWCC   DGNGFFGD +F+P  WI GLT++A  F G  +VVGMSLRNEL
Sbjct: 124 MVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNEL 183

Query: 209 RGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTFTAKLVYE 268
           RGPKQNV+DW++YMQ+GAEAVH ANP++L+ILSGLS+D DLSF++++ +NLTFT KLV+E
Sbjct: 184 RGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFTRKLVFE 243

Query: 269 VHWYAFSDGSSWESSNSNQVCGRVVNNLMKMSGFVLEQGFPLFISEFGVDQRGTNVNDNR 328
           +H Y+F++ ++W S N N+ CG ++ ++    GF L + FP+F+SEFG+D RG NVNDNR
Sbjct: 244 LHRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNL-RDFPVFLSEFGIDLRGKNVNDNR 303

Query: 329 YLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILDWNWCGPRNSTFLQRISAL 388
           Y+ C L  AAE D+DW++WTL GSYYLREGVVG++EFYGILD +W   R+ +FLQR+S +
Sbjct: 304 YIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLI 363

Query: 389 QSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLLDPLR--LGPCVDSDPWYYTPQKFLTLK 448
            SP QGPG  + + YN++FHP TGLC++ +S+LDP +  LG C +S PW YTPQ  LTLK
Sbjct: 364 LSPLQGPG-SQSKVYNLVFHPLTGLCML-QSILDPTKVTLGLCNESQPWSYTPQNTLTLK 423

Query: 449 GTYFCIQAEEMGKQAKLGIICTVSD--AKWDMISDSKMHLSSKTSNGSMVCLDVDSSTNE 508
               C+++       KL      S   ++W+ IS S M L++K++N S+ CLDVD  TN 
Sbjct: 424 DKSLCLESTGPNAPVKLSETSCSSPNLSEWETISASNMLLAAKSTNNSL-CLDVD-ETNN 483

Query: 509 IVTNSCKCL-SRDSTCDPSSQWFKLVNSTR 533
           ++ ++CKC+   DS+CDP SQWFK+V  ++
Sbjct: 484 LMASNCKCVKGEDSSCDPISQWFKIVKVSK 508

BLAST of Cla97C05G108280 vs. TAIR10
Match: AT5G17500.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 584.7 bits (1506), Expect = 6.0e-167
Identity = 279/512 (54.49%), Postives = 367/512 (71.68%), Query Frame = 0

Query: 22  LQAAVGLPLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQPIDAISNRIESLGF 81
           L  A   PL T SRWIV+ +G RVKL C NW SHL+ VVAEGLS QP+D+IS +I+ +GF
Sbjct: 20  LTLATDYPLFTKSRWIVNNKGHRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGF 79

Query: 82  NCVRLTWPLFLATNESLN-SLTVRQSFQRLGLAEAIAGVQANNPFIIDLPLMKAFEAVVG 141
           NCVRLTWPL L  N++L  ++TV+QSF+R GL   + G+  +NP+I++ PL+  F+AVV 
Sbjct: 80  NCVRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVY 139

Query: 142 RLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTRMAAMFNGMAHVVG 201
            LG   +MVILDNH + PGWCCSN D + FFGD  FNP+LW+ GL +MA +F  + +VVG
Sbjct: 140 SLGRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVG 199

Query: 202 MSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKDLSFLKNQPINLTF 261
           MSLRNELRG      DWY+YMQ+GAEAVH++NP++L+ILSGL+FD DLSFLK++P+NL+F
Sbjct: 200 MSLRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSF 259

Query: 262 TAKLVYEVHWYAFSDGS-SWESSNSNQVCGRVVNNLMKMSGFVLEQGFPLFISEFGVDQR 321
             KLV E+HWY+F+DG+  W+S N N  C ++ +   +  GFVL+QGFPLF+SEFG DQR
Sbjct: 260 KKKLVLELHWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFPLFLSEFGTDQR 319

Query: 322 GTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYGILDWNWCGPRNST 381
           G ++  NRY++C L+ AAE DLDWA+W + G YY REG  G+ E YG+LD NW    N T
Sbjct: 320 GGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNYT 379

Query: 382 FLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVRKSLL--DPLRLGPCVDSDPWYYT 441
           +L+R+S +Q P  GPG+ +   +  IFHP TGLC+VRKS      L LGPC   +PW Y+
Sbjct: 380 YLRRLSVIQPPHTGPGV-KHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSYS 439

Query: 442 PQKFLTL-KGTYFCIQAE-EMGKQAKLGIICTVSDAKWDMISDSKMHLSSKTSNGSMVCL 501
               L + +G   C++ E  +GK  KLG ICT    K + IS +KMHLS  TS+GS+VCL
Sbjct: 440 HGGILEIRRGHKSCLEGETAVGKSVKLGRICT----KIEQISATKMHLSFNTSDGSLVCL 499

Query: 502 DVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKL 528
           DVDS  N +V NSC CL+ D+TC+P+SQWFK+
Sbjct: 500 DVDSD-NNVVANSCNCLTGDTTCEPASQWFKI 525

BLAST of Cla97C05G108280 vs. TAIR10
Match: AT5G16700.1 (Glycosyl hydrolase superfamily protein)

HSP 1 Score: 516.2 bits (1328), Expect = 2.6e-146
Identity = 255/525 (48.57%), Postives = 346/525 (65.90%), Query Frame = 0

Query: 13  FLYCVAAVWLQAAVGL----PLHTDSRWIVDEEGQRVKLRCVNWVSHLEAVVAEGLSKQP 72
           F +C+  +++ +   L    PL T SRWIVDE+GQRVKL CVNW +HL+  VAEGLSKQP
Sbjct: 5   FYFCLFFLFISSTSKLTTSYPLSTKSRWIVDEKGQRVKLACVNWPAHLQPTVAEGLSKQP 64

Query: 73  IDAISNRIESLGFNCVRLTWPLFLATNESLN-SLTVRQSFQRLGLAEAIAGVQANNPFII 132
           +D+IS +I S+GFNCVRLTWPL L TN++L   +TV+QSF+ L L E + G+Q +NP ++
Sbjct: 65  LDSISKKIVSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLL 124

Query: 133 DLPLMKAFEAVVGRLGNGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPELWIKGLTR 192
            LPL  AF+ VV  LG   +MVILDNH++ PGWCC + D + FFG  +F+P +W KGL +
Sbjct: 125 HLPLFNAFQEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRK 184

Query: 193 MAAMFNGMAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDKD 252
           MA +F    HV+GMSLRNE RG +   + W+R+M +GAEAVH+ANP +L+ILSG+ FD +
Sbjct: 185 MATLFRNFTHVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTN 244

Query: 253 LSFLKNQPINLTFTAKLVYEVHWYAFSDG-SSWESSNSNQVCGRVVNNLMKMSGFVLEQG 312
           LSFL+++ +N++FT KLV+E+HWY+FSDG  SW   NSN  C +++  +    GF+L +G
Sbjct: 245 LSFLRDRSVNVSFTDKLVFELHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRG 304

Query: 313 FPLFISEFGVDQRGTNVNDNRYLSCFLSVAAEFDLDWALWTLVGSYYLREGVVGLNEFYG 372
           FPL +SEFG DQRG +++ NRY++C ++ AAE DLDWA+W L G YYLR           
Sbjct: 305 FPLILSEFGTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYYLR----------- 364

Query: 373 ILDWNWCGPRNSTFLQRISALQSPFQGPGLEERRQYNVIFHPSTGLCVVR--KSLLDPLR 432
                                     GPGL  R   N++FHPSTGLCV       +  LR
Sbjct: 365 -------------------------TGPGL--RPNKNLLFHPSTGLCVTNNPSDNIPTLR 424

Query: 433 LGPCVDSDPWYYTPQKFLTLKGTYFCIQAEE-MGKQAKLGIICTVSDAKWDMISDSKMHL 492
           LGPC  SDPW + P + + L     C++A   +G++ KLG+    S  K   IS +KMHL
Sbjct: 425 LGPCPKSDPWTFNPSEGI-LWINKMCVEAPNVVGQKVKLGVGTKCS--KLGQISATKMHL 484

Query: 493 SSKTSNGSMVCLDVDSSTNEIVTNSCKCLSRDSTCDPSSQWFKLV 529
           S KTSNG ++CLDVD   N +V N CK L+ D++CDP+SQWFK++
Sbjct: 485 SFKTSNGLLLCLDVDERDNSVVANRCKFLTMDASCDPASQWFKVL 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008446127.17.8e-29888.57PREDICTED: uncharacterized protein LOC103488945 [Cucumis melo][more]
XP_023514558.13.9e-29788.32uncharacterized protein LOC111778811 [Cucurbita pepo subsp. pepo][more]
XP_004135502.22.5e-29688.59PREDICTED: uncharacterized protein LOC101217177 [Cucumis sativus] >KGN51727.1 hy... [more]
XP_022957392.14.3e-29687.96uncharacterized protein LOC111458804 [Cucurbita moschata][more]
XP_022997703.15.6e-29688.04uncharacterized protein LOC111492584 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3BF06|A0A1S3BF06_CUCME5.2e-29888.57uncharacterized protein LOC103488945 OS=Cucumis melo OX=3656 GN=LOC103488945 PE=... [more]
tr|A0A0A0KTH1|A0A0A0KTH1_CUCSA1.7e-29688.59Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G593400 PE=3 SV=1[more]
tr|A0A067K4F9|A0A067K4F9_JATCU8.7e-24575.49Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_17827 PE=3 SV=1[more]
tr|B9RMT6|B9RMT6_RICCO1.5e-24473.67Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis OX=398... [more]
tr|A0A061EEN1|A0A061EEN1_THECC2.8e-24373.58Cellulase protein OS=Theobroma cacao OX=3641 GN=TCM_010637 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
sp|C0HLA0|GH5FP_CHAOB3.0e-12344.59Glycosyl hydrolase 5 family protein OS=Chamaecyparis obtusa OX=13415 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
AT1G13130.12.0e-19461.18Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.16.4e-18557.38Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.12.2e-17756.86Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.16.0e-16754.49Glycosyl hydrolase superfamily protein[more]
AT5G16700.12.6e-14648.57Glycosyl hydrolase superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR035992Ricin_B-like_lectins
IPR017853Glycoside_hydrolase_SF
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G108280.1Cla97C05G108280.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 66..348
e-value: 1.4E-25
score: 90.2
NoneNo IPR availableGENE3DG3DSA:3.20.20.80coord: 28..389
e-value: 6.3E-77
score: 261.2
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 18..537
NoneNo IPR availablePANTHERPTHR31263:SF28SUBFAMILY NOT NAMEDcoord: 18..537
IPR000772Ricin B, lectin domainCDDcd00161RICINcoord: 402..509
e-value: 0.00218657
score: 36.3295
IPR017853Glycoside hydrolase superfamilySUPERFAMILYSSF51445(Trans)glycosidasescoord: 29..382
IPR035992Ricin B-like lectinsSUPERFAMILYSSF50370Ricin B-like lectinscoord: 400..527

The following gene(s) are paralogous to this gene:

None