CmaCh04G017890 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G017890
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionGlycosyl hydrolase family 5 protein
LocationCma_Chr04 : 9026796 .. 9029657 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAAAGGGTTCTGTAATTTATTGCATTGCCTTGGAAGTCCACCAAATGCTCAATAGGCCTGTACCTTTCAGACGCCTCGTCATCAAAGCTTTAGAAAAAACAACAAAAAGGGGCTTACAACACTTCTCCAATTATAAAATAATCACACAGTAGTTGAATTGAAGATGGGGGAAGAAACTGGATAGGATCAAGATCGATGATGAAGGGGCTGGTCTTGTTGTGCTCTGTCGCCGCCCTGTGGCTTCAGGCGGCAGTGGGGCTACCGTTGCACACCGATTCACGGTGGATCGTGGACGAACAAGGGCAGAGAGTGAAACTAAGATGCGTAAATTGGGTGTCGCATTTGGAGGCGGTGGTAGCGGAGGGGCTTTCGAAGCAGCCCATCGACGAGATTACAAACCGTATTGGGTCATTAGGATTCAACTGCGTGAGGCTCACTTGGCCACTGTTTTTAGCCACGAACGAGTCTTTGAGTTCCCTCACCGTCCGGCAGTCTTTCCAGCGGCTGGGTCTGAGGGAAGCCATCGCTGGAGTCCAAGCCAATAACCCCTTCATCATCGATCTTCCGCTCATAAAGGCTTTTGAGGTAAATCGTAAATGGGGTTCTCTGTAAATGTTTGGATATGGTGAATTATGGGTTTGATTTTGAAATTGGATTGCAGGCGGTGGTGGGGTGGTTGGGGGAAGCGGAGTTGATGGTGGTATTGGACAATCACATCAGCAAGCCAGGGTGGTGCTGCAGCAACTTCGACGGAAATGGGTTCTTTGGGGATCAGTACTTCAACCCGGAGAAGTGGATCGAAGGGCTCACTAGAATGGCCACCATGTTTAACGGCGTGGCCCATGTTGTTGGTATGAGCCTTAGGAATGAGCTTCGTGGCCCAAAGCAGAATGTCAACGATTGGTACAGGTACAATATACTTCAAATTTCATTTTTAAATATATAGGTTATAATATCAATTATCTAAATTTGATTTTTAGGGTCAAAATTAGATAAAAATATGTTTATTAGTAAGGGTATTCATGTCATTTCAGAATCTAGCTAATCCAAATTATCAACCCTGAATATATTTTGGTCACCTAAACCCTAACCAACCTGAAAATAAGGTTACAATCCAACCTAACTCAACCCTAGTTTAAGATTAAAAATATTAATAAAGTTTTTTAAATAATTAGAAAGAAAACCATTTGACTACTCAATCTAACCCAACCCAACTTGAATTGAGTTGTAAACTCTGTTCGGGTTATTCTAATCACCAATTCCTCACCCAATCTAAAATTTCAAATTGGTCTAAAAGATTTTCTCAACTCAACCCAGTGTATGTCTTATTTATTAGACATGGCTGATATTTAAATTAGAAGTTTAGGGGTTAATTCACGTGTTTAGGTCTCTAAACTTTCAAAATTACAATTTAGTCCCTAAATTTTTTACTTTTTTTTTGAAAAGGTAGGATTCCCATTTATACACTAAACTTATAATTTAAGTTTTTTTTTTTTTAACTTTAGAAATGAAATTAAAAACAAAGATAAACTCGAAAGTAAAAGGAATGGGTTTGTACTAAAATCATATGAACACAAGATTTGATCATAATTTCATATGAATGTGATGAACCAATGTTGGAAGGTACATGCAAAGAGGAGCCGAGGCAGTTCACGCAGCAAACCCAGACGTTCTTGTAATTCTCTCAGGACTAAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAGCCCATAAACCTCACATTCACAGCCAAATTAGTGTACGAGGTTCATTGGTATGCATTTTCAGATGGCTCCTCATGGCAATCAGGCAACCCAAATCAAGTCTGCGGAAGAGTAGTAAACAACTTAATGAAAATGTCCGGATTTCTCTTAGAACAGGGGATGCCTCTGTTTCTGACCGAATTCGGAGTCGACCAAAGAGGCACCAACGTGAACGACAACAGGTATCTAAGCTGCTTCCTGAGCGTCGCAGCCGAGTACGACCTCGATTGGGCGGTGTGGACGCTCGTTGGAAGCTACTATTTGAGAGAAGGGGTTGTTGGGTTGAACGAGTTCTACGGGTTGCTTGATTGGAATTGGTGTAATCCTAGAAATTCTAGCTTCCTCCAAAGGATCTCCGCCCTGCAGACACCCTTCCAAGGCCCAGGGCTAGCCGAAAGAAGGCCTTATAGTGTAATGTTCCACCCATCAACAGGGCTATGCGTTGGGAGAGCGTCGCTATTGGACCCATTAAGATTAGGCCCATGCGCCGATTCTGATCCTTGGTACTACACGCCGCAGAAGTTTTTAACTCTCAAAGGCACATACTTTTGCATACAAGCGCAGGAAATGGGGAAGCAGCCGAGATTGGGGATAATATGTACTGTCTCAAATGCTCAATGGGATATGGTTTCTGATTCAAAGATGCATCTTTCTTCGAAACTCGACAATGGCTCCGTGGTTTGCTTGGACGTGGACCCAAACACTAATGAAATTGTCACCAACGCTTGTAAATGCTTGAGTCGGGACTCGTCGTGTGACCCAAGTAGTCAATGGTTCAAGCTCGTTAATAGTACTAGAAGCTTTGACACAACAAGGTCGATGATTAGTATGGTGGGTTCTTCTTTGTCGTTCAATCTTGTGCCTAAGTTGTTTGTAGAGTAATTGTTTGGGATTTGCTCAATTTATGTGAATAAATCTAATATGCCCCATGCGGGATCGAACTACAGTCCTTTAGTTTACAAGACTGAGCTATAGGGGCGGTTGATGATAATCAAGATTTAAAATTGTTAATTATTATAGAAATCATTACATGCAATTTCATGTCAAGAAGAAGAAGACGACGTGTCAATTATTATTGAGTTATACACGTATTGGC

mRNA sequence

TGAAAGGGTTCTGTAATTTATTGCATTGCCTTGGAAGTCCACCAAATGCTCAATAGGCCTGTACCTTTCAGACGCCTCGTCATCAAAGCTTTAGAAAAAACAACAAAAAGGGGCTTACAACACTTCTCCAATTATAAAATAATCACACAGTAGTTGAATTGAAGATGGGGGAAGAAACTGGATAGGATCAAGATCGATGATGAAGGGGCTGGTCTTGTTGTGCTCTGTCGCCGCCCTGTGGCTTCAGGCGGCAGTGGGGCTACCGTTGCACACCGATTCACGGTGGATCGTGGACGAACAAGGGCAGAGAGTGAAACTAAGATGCGTAAATTGGGTGTCGCATTTGGAGGCGGTGGTAGCGGAGGGGCTTTCGAAGCAGCCCATCGACGAGATTACAAACCGTATTGGGTCATTAGGATTCAACTGCGTGAGGCTCACTTGGCCACTGTTTTTAGCCACGAACGAGTCTTTGAGTTCCCTCACCGTCCGGCAGTCTTTCCAGCGGCTGGGTCTGAGGGAAGCCATCGCTGGAGTCCAAGCCAATAACCCCTTCATCATCGATCTTCCGCTCATAAAGGCTTTTGAGGCGGTGGTGGGGTGGTTGGGGGAAGCGGAGTTGATGGTGGTATTGGACAATCACATCAGCAAGCCAGGGTGGTGCTGCAGCAACTTCGACGGAAATGGGTTCTTTGGGGATCAGTACTTCAACCCGGAGAAGTGGATCGAAGGGCTCACTAGAATGGCCACCATGTTTAACGGCGTGGCCCATGTTGTTGGTATGAGCCTTAGGAATGAGCTTCGTGGCCCAAAGCAGAATGTCAACGATTGGTACAGGTACATGCAAAGAGGAGCCGAGGCAGTTCACGCAGCAAACCCAGACGTTCTTGTAATTCTCTCAGGACTAAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAGCCCATAAACCTCACATTCACAGCCAAATTAGTGTACGAGGTTCATTGGTATGCATTTTCAGATGGCTCCTCATGGCAATCAGGCAACCCAAATCAAGTCTGCGGAAGAGTAGTAAACAACTTAATGAAAATGTCCGGATTTCTCTTAGAACAGGGGATGCCTCTGTTTCTGACCGAATTCGGAGTCGACCAAAGAGGCACCAACGTGAACGACAACAGGTATCTAAGCTGCTTCCTGAGCGTCGCAGCCGAGTACGACCTCGATTGGGCGGTGTGGACGCTCGTTGGAAGCTACTATTTGAGAGAAGGGGTTGTTGGGTTGAACGAGTTCTACGGGTTGCTTGATTGGAATTGGTGTAATCCTAGAAATTCTAGCTTCCTCCAAAGGATCTCCGCCCTGCAGACACCCTTCCAAGGCCCAGGGCTAGCCGAAAGAAGGCCTTATAGTGTAATGTTCCACCCATCAACAGGGCTATGCGTTGGGAGAGCGTCGCTATTGGACCCATTAAGATTAGGCCCATGCGCCGATTCTGATCCTTGGTACTACACGCCGCAGAAGTTTTTAACTCTCAAAGGCACATACTTTTGCATACAAGCGCAGGAAATGGGGAAGCAGCCGAGATTGGGGATAATATGTACTGTCTCAAATGCTCAATGGGATATGGTTTCTGATTCAAAGATGCATCTTTCTTCGAAACTCGACAATGGCTCCGTGGTTTGCTTGGACGTGGACCCAAACACTAATGAAATTGTCACCAACGCTTGTAAATGCTTGAGTCGGGACTCGTCGTGTGACCCAAGTAGTCAATGGTTCAAGCTCGTTAATAGTACTAGAAGCTTTGACACAACAAGGTCGATGATTAGTATGGTGGGTTCTTCTTTGTCGTTCAATCTTGTGCCTAAGTTGTTTGTAGAGTAATTGTTTGGGATTTGCTCAATTTATGTGAATAAATCTAATATGCCCCATGCGGGATCGAACTACAGTCCTTTAGTTTACAAGACTGAGCTATAGGGGCGGTTGATGATAATCAAGATTTAAAATTGTTAATTATTATAGAAATCATTACATGCAATTTCATGTCAAGAAGAAGAAGACGACGTGTCAATTATTATTGAGTTATACACGTATTGGC

Coding sequence (CDS)

ATGATGAAGGGGCTGGTCTTGTTGTGCTCTGTCGCCGCCCTGTGGCTTCAGGCGGCAGTGGGGCTACCGTTGCACACCGATTCACGGTGGATCGTGGACGAACAAGGGCAGAGAGTGAAACTAAGATGCGTAAATTGGGTGTCGCATTTGGAGGCGGTGGTAGCGGAGGGGCTTTCGAAGCAGCCCATCGACGAGATTACAAACCGTATTGGGTCATTAGGATTCAACTGCGTGAGGCTCACTTGGCCACTGTTTTTAGCCACGAACGAGTCTTTGAGTTCCCTCACCGTCCGGCAGTCTTTCCAGCGGCTGGGTCTGAGGGAAGCCATCGCTGGAGTCCAAGCCAATAACCCCTTCATCATCGATCTTCCGCTCATAAAGGCTTTTGAGGCGGTGGTGGGGTGGTTGGGGGAAGCGGAGTTGATGGTGGTATTGGACAATCACATCAGCAAGCCAGGGTGGTGCTGCAGCAACTTCGACGGAAATGGGTTCTTTGGGGATCAGTACTTCAACCCGGAGAAGTGGATCGAAGGGCTCACTAGAATGGCCACCATGTTTAACGGCGTGGCCCATGTTGTTGGTATGAGCCTTAGGAATGAGCTTCGTGGCCCAAAGCAGAATGTCAACGATTGGTACAGGTACATGCAAAGAGGAGCCGAGGCAGTTCACGCAGCAAACCCAGACGTTCTTGTAATTCTCTCAGGACTAAGCTTCGACAAGGACTTATCTTTTCTCAAAAACCAGCCCATAAACCTCACATTCACAGCCAAATTAGTGTACGAGGTTCATTGGTATGCATTTTCAGATGGCTCCTCATGGCAATCAGGCAACCCAAATCAAGTCTGCGGAAGAGTAGTAAACAACTTAATGAAAATGTCCGGATTTCTCTTAGAACAGGGGATGCCTCTGTTTCTGACCGAATTCGGAGTCGACCAAAGAGGCACCAACGTGAACGACAACAGGTATCTAAGCTGCTTCCTGAGCGTCGCAGCCGAGTACGACCTCGATTGGGCGGTGTGGACGCTCGTTGGAAGCTACTATTTGAGAGAAGGGGTTGTTGGGTTGAACGAGTTCTACGGGTTGCTTGATTGGAATTGGTGTAATCCTAGAAATTCTAGCTTCCTCCAAAGGATCTCCGCCCTGCAGACACCCTTCCAAGGCCCAGGGCTAGCCGAAAGAAGGCCTTATAGTGTAATGTTCCACCCATCAACAGGGCTATGCGTTGGGAGAGCGTCGCTATTGGACCCATTAAGATTAGGCCCATGCGCCGATTCTGATCCTTGGTACTACACGCCGCAGAAGTTTTTAACTCTCAAAGGCACATACTTTTGCATACAAGCGCAGGAAATGGGGAAGCAGCCGAGATTGGGGATAATATGTACTGTCTCAAATGCTCAATGGGATATGGTTTCTGATTCAAAGATGCATCTTTCTTCGAAACTCGACAATGGCTCCGTGGTTTGCTTGGACGTGGACCCAAACACTAATGAAATTGTCACCAACGCTTGTAAATGCTTGAGTCGGGACTCGTCGTGTGACCCAAGTAGTCAATGGTTCAAGCTCGTTAATAGTACTAGAAGCTTTGACACAACAAGGTCGATGATTAGTATGGTGGGTTCTTCTTTGTCGTTCAATCTTGTGCCTAAGTTGTTTGTAGAGTAA

Protein sequence

MMKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGSSLSFNLVPKLFVE
BLAST of CmaCh04G017890 vs. TrEMBL
Match: A0A0A0KTH1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G593400 PE=3 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 1.6e-282
Identity = 459/542 (84.69%), Postives = 501/542 (92.44%), Query Frame = 1

Query: 1   MMKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSK 60
           MMKGL+LL    A    AAVGLPLHTD+RWIVD  G+RVKLRCVNWVSHLEAVVAEGLSK
Sbjct: 1   MMKGLILLVWCVAA--SAAVGLPLHTDTRWIVDGAGERVKLRCVNWVSHLEAVVAEGLSK 60

Query: 61  QPIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFI 120
           QPI+EI+NRI  LGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAG+QANNPFI
Sbjct: 61  QPIEEISNRIQWLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGIQANNPFI 120

Query: 121 IDLPLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLT 180
           IDLPL+KAFEAVVG LGE +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNP+ WI+GLT
Sbjct: 121 IDLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLT 180

Query: 181 RMATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDK 240
           R+ATMFNGV HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLS+D+
Sbjct: 181 RIATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDR 240

Query: 241 DLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQG 300
           DLSFLKNQPINLTFT+K VYEVHWYAFSDGSSW+SGN NQVCGR  NNLMKMSGFLL+QG
Sbjct: 241 DLSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQG 300

Query: 301 MPLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYG 360
            PLF++EFG+DQRGTNVNDNRYLSCFL+VAAE DLDWAVWTLVGSYYLREGVVGLNEFYG
Sbjct: 301 FPLFISEFGIDQRGTNVNDNRYLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYG 360

Query: 361 LLDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLG 420
           +LDWNWCN RNS+FLQRISALQ+PFQGPGLAERR Y+V+FHP +GLCV R SLLDPL LG
Sbjct: 361 ILDWNWCNLRNSTFLQRISALQSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLG 420

Query: 421 PCADSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSK 480
           PC D+D WYYTPQKFLTLKGTYFCIQA E+GKQ +LGIICTV+NA+WDM+SDSK+HLSSK
Sbjct: 421 PCVDTDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSK 480

Query: 481 LDNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGS 540
             NGS+VCLDVD +TNEIVTN+CKCLSRDSSCDPSSQWFKLVNSTRS    RSMI+MVGS
Sbjct: 481 SSNGSLVCLDVDSSTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINMVGS 540

Query: 541 SL 543
           SL
Sbjct: 541 SL 540

BLAST of CmaCh04G017890 vs. TrEMBL
Match: B9RMT6_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1083710 PE=3 SV=1)

HSP 1 Score: 848.2 bits (2190), Expect = 5.6e-243
Identity = 388/548 (70.80%), Postives = 472/548 (86.13%), Query Frame = 1

Query: 5   LVLLCSVAALWLQAAV-GLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPI 64
           +    +++A+  Q+ V  LPL T+SRWIVDE GQRVKL CVNWVSHLEAVVAEGLSKQP+
Sbjct: 14  ITFFIAISAIIPQSQVTALPLSTNSRWIVDENGQRVKLACVNWVSHLEAVVAEGLSKQPM 73

Query: 65  DEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDL 124
           D I  +I S+GFNCVRLTWPL+L TN++L+SL+VRQSFQ LGL E+I+G+QANNP IIDL
Sbjct: 74  DMIAKKIVSMGFNCVRLTWPLYLVTNDTLASLSVRQSFQGLGLLESISGIQANNPSIIDL 133

Query: 125 PLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMA 184
           PLIKA++AVV  LG+  +MV+LDNHISKPGWCCSNFDGNGFFGD YFNP+ WI+GLT+MA
Sbjct: 134 PLIKAYQAVVSSLGDNNVMVILDNHISKPGWCCSNFDGNGFFGDTYFNPDLWIKGLTQMA 193

Query: 185 TMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLS 244
           T+FNGV +V+GMSLRNELRG KQNVNDWYRYM++GAEAVH+ANPDVLVILSGL++DKD S
Sbjct: 194 TLFNGVTNVIGMSLRNELRGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDFS 253

Query: 245 FLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPL 304
           FL+N+P+NL+FT K+V+EVHWY FSDG +W+SGNPNQVCGRVV+NLM++SGFLLEQG P+
Sbjct: 254 FLRNRPVNLSFTGKVVFEVHWYGFSDGQAWRSGNPNQVCGRVVDNLMRISGFLLEQGWPM 313

Query: 305 FLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLD 364
           F++EFGVDQRGTNVNDNRYL CF+ VAAE D DWA+WTLVGSYYLR+GV+GLNE+YG+L+
Sbjct: 314 FVSEFGVDQRGTNVNDNRYLGCFIGVAAELDWDWALWTLVGSYYLRQGVIGLNEYYGVLN 373

Query: 365 WNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCA 424
           WNWC+ RNSSFLQ+ISALQ+PFQGPGL+E  P+ V+FHPSTGLCV R S+L+PLRLG C 
Sbjct: 374 WNWCDVRNSSFLQQISALQSPFQGPGLSETNPHKVIFHPSTGLCVQRKSMLEPLRLGSCT 433

Query: 425 DSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDN 484
           DS+ W YT +  LTL+GTYFC+QA E+GK  +LGIICT S ++WD++SDSKMHLSSK+ N
Sbjct: 434 DSEAWRYTSENTLTLRGTYFCLQADELGKPAKLGIICTDSTSKWDVISDSKMHLSSKITN 493

Query: 485 GSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGSSLS 544
           G+ VCLDVD N N IV + CKCLSRD++CDP SQWFKLVNSTRS  T +  + +  +S+ 
Sbjct: 494 GTAVCLDVDSN-NTIVISTCKCLSRDNTCDPESQWFKLVNSTRSSATAKPSLRI--NSIL 553

Query: 545 FNLVPKLF 552
            +L  K F
Sbjct: 554 LDLPAKEF 558

BLAST of CmaCh04G017890 vs. TrEMBL
Match: A0A067K4F9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17827 PE=3 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 1.6e-242
Identity = 376/506 (74.31%), Postives = 452/506 (89.33%), Query Frame = 1

Query: 22  LPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLT 81
           +PL TDSRWIVDE+GQRVKL CVNWVSHLEAVVAEGLS++P+D I  +I S+GFNCVRLT
Sbjct: 31  VPLSTDSRWIVDEKGQRVKLACVNWVSHLEAVVAEGLSREPMDLIAKKIVSMGFNCVRLT 90

Query: 82  WPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAEL 141
           WPL+L TNE+ +SLTV+QSFQ LGL E+I+G+QANNP IIDLPLIKA++AVV  LG+  +
Sbjct: 91  WPLYLVTNETYASLTVKQSFQNLGLLESISGIQANNPAIIDLPLIKAYQAVVSSLGDNNV 150

Query: 142 MVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNEL 201
           +V+LDNHISKPGWCCSNFDGNGFFGD YFNP+ WI GLT+MAT+FNGV +VVG+SLRNEL
Sbjct: 151 LVILDNHISKPGWCCSNFDGNGFFGDSYFNPDLWINGLTQMATIFNGVPNVVGLSLRNEL 210

Query: 202 RGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYE 261
           RG KQNVNDWYRYM++GAEAVH+ANPDVLVILSGL++DKDLSFL+N+P+NLTFT KLV+E
Sbjct: 211 RGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDLSFLRNRPVNLTFTGKLVFE 270

Query: 262 VHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDNR 321
           VHWY FSDG +W++GNPNQVCGRV +N+M+MSG+LL+QG PLF++EFG+DQRGTNVNDNR
Sbjct: 271 VHWYGFSDGQAWKNGNPNQVCGRVTSNMMRMSGYLLDQGWPLFVSEFGIDQRGTNVNDNR 330

Query: 322 YLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISAL 381
           YL CFL  AAE DLDWA+WTLVGSYYLREGV+GLNE+YG+L+WNWC+ RNSSFLQ+ISAL
Sbjct: 331 YLGCFLGWAAELDLDWALWTLVGSYYLREGVLGLNEYYGVLNWNWCDIRNSSFLQQISAL 390

Query: 382 QTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTLKGT 441
           Q+PFQGPG++E   + V+FHP TGLCV R S+L+PL+LGPC DS+ W YTPQK L+LKGT
Sbjct: 391 QSPFQGPGVSESNLHKVIFHPLTGLCVQRKSMLEPLKLGPCTDSEAWRYTPQKILSLKGT 450

Query: 442 YFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNEIVTN 501
           YFC+QA E+GK  +LG+ICT S+++WD++SDS MHLSSK+ NG+ +CLDVD N N IV N
Sbjct: 451 YFCLQADELGKPAKLGVICTDSDSKWDVISDSNMHLSSKISNGTTICLDVDSN-NTIVIN 510

Query: 502 ACKCLSRDSSCDPSSQWFKLVNSTRS 528
            CKCLSRD++CDP SQWFKLVNSTRS
Sbjct: 511 TCKCLSRDNTCDPGSQWFKLVNSTRS 535

BLAST of CmaCh04G017890 vs. TrEMBL
Match: A0A061EEN1_THECC (Cellulase protein OS=Theobroma cacao GN=TCM_010637 PE=3 SV=1)

HSP 1 Score: 844.7 bits (2181), Expect = 6.2e-242
Identity = 386/517 (74.66%), Postives = 451/517 (87.23%), Query Frame = 1

Query: 19  AVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCV 78
           A+ LPL T+SRWIVDE+GQRVKL CVNWVSHLE +VAEGLSK P+D I  RI S GFNCV
Sbjct: 26  AMSLPLSTNSRWIVDEKGQRVKLACVNWVSHLEPMVAEGLSKLPMDVIAKRIVSTGFNCV 85

Query: 79  RLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGE 138
           RLTWPLFL TN+SL+SLTVRQSFQRLGL E+IAG+Q NNP IID+ L+KA++AVV  LGE
Sbjct: 86  RLTWPLFLVTNDSLASLTVRQSFQRLGLLESIAGIQTNNPSIIDVSLLKAYQAVVCSLGE 145

Query: 139 AELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLR 198
             +MV+LDNHISKPGWCCSNFDGNGFFGDQYFNP+ WI GLTRMAT+ N V +VVGMSLR
Sbjct: 146 NNVMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDIWITGLTRMATLVNAVTNVVGMSLR 205

Query: 199 NELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKL 258
           NELRGPKQ VNDWYRYMQ+GAEAVH+ANPDVLVILSGL++DKDLSF++N+P NLTFT KL
Sbjct: 206 NELRGPKQTVNDWYRYMQKGAEAVHSANPDVLVILSGLNYDKDLSFIRNRPANLTFTGKL 265

Query: 259 VYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVN 318
           V+EVHWY F+DG +W +GNPNQVCGRV N++M+ SGFL++QG PLF++EFGVDQRGTNVN
Sbjct: 266 VFEVHWYGFTDGQTWVTGNPNQVCGRVANDMMRTSGFLVDQGYPLFVSEFGVDQRGTNVN 325

Query: 319 DNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRI 378
           DNRYL+CFL VAAE DLDWA+WTLVGSYYLREGVVGLNE+YG+L+WNWC  RNSSFL+RI
Sbjct: 326 DNRYLNCFLGVAAELDLDWALWTLVGSYYLREGVVGLNEYYGILNWNWCEIRNSSFLERI 385

Query: 379 SALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTL 438
           SALQ+PF+GPGL+E + + V+FHPSTGLCV R SLLDPLRLGPC DS+ W Y+PQ  L +
Sbjct: 386 SALQSPFRGPGLSETKLHKVIFHPSTGLCVLRKSLLDPLRLGPCTDSEAWSYSPQNTLVV 445

Query: 439 KGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNEI 498
           KGTYFC+QA E G   RLGIIC+ SN++W+M+SDSKMHLSSKL NG+ +CLDVD +TN I
Sbjct: 446 KGTYFCLQADESGTLARLGIICSESNSKWEMISDSKMHLSSKLRNGTSICLDVD-STNTI 505

Query: 499 VTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMI 536
           VTN+CKCLS D+ CDP SQWFKLV+STRS    +S +
Sbjct: 506 VTNSCKCLSNDNMCDPESQWFKLVDSTRSRSGVKSFL 541

BLAST of CmaCh04G017890 vs. TrEMBL
Match: V4TIH1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031098mg PE=3 SV=1)

HSP 1 Score: 837.4 bits (2162), Expect = 1.0e-239
Identity = 378/515 (73.40%), Postives = 446/515 (86.60%), Query Frame = 1

Query: 19  AVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCV 78
           A+GLPL T+SRWIVDE G RVKL CVNWVSHLE VVAEGLSKQP+D ++ RI  +GFNCV
Sbjct: 34  AIGLPLSTNSRWIVDENGHRVKLACVNWVSHLEPVVAEGLSKQPMDMLSKRIVDMGFNCV 93

Query: 79  RLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGE 138
           RLTWPL+LATN+SL+SLTVRQSFQ+LGL EAI G+Q+NNP I+DLPLIKAF+AVV  LG 
Sbjct: 94  RLTWPLYLATNDSLASLTVRQSFQKLGLLEAIGGIQSNNPSIVDLPLIKAFQAVVASLGN 153

Query: 139 AELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLR 198
             +MV+LDNHISKPGWCCSN DGNGFFGDQYFNP+ WI+GLT+MAT+FNGV +VVGMSLR
Sbjct: 154 NNVMVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLR 213

Query: 199 NELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKL 258
           NELRGPKQNV DWYRYMQ GAEAVHAANP+VLVILSGL+FDKDLSF++NQ +NLTFT KL
Sbjct: 214 NELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKL 273

Query: 259 VYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVN 318
           V+E HWY F+DG +W  GNPNQVCGRVV+N+M++SGFLLEQG PLF++EFG D RG NVN
Sbjct: 274 VFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVN 333

Query: 319 DNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRI 378
           DNRYL+CF  VAAE D DWA+WTLVGSYYLREGV+GLNE+YGL DWNWC+ RNSSFL+RI
Sbjct: 334 DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERI 393

Query: 379 SALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTL 438
           S+LQ+PF+GPG+ E   + V++HP+TGLCV R S LDPL LGPC +S+ W YTP K ++L
Sbjct: 394 SSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISL 453

Query: 439 KGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNEI 498
           KG YFC+QA+ +GK  +LGIICT   + W+++SDSKMHLSSK DNG+ VCLDVD ++N I
Sbjct: 454 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVD-SSNTI 513

Query: 499 VTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRS 534
           VTN CKCLSRD +CDP+SQWFKLV+STRS  TT+S
Sbjct: 514 VTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKS 547

BLAST of CmaCh04G017890 vs. TAIR10
Match: AT1G13130.1 (AT1G13130.1 Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 662.5 bits (1708), Expect = 2.2e-190
Identity = 308/515 (59.81%), Postives = 391/515 (75.92%), Query Frame = 1

Query: 23  PLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLTW 82
           PL T SRWIVDE G RVKL C NW SHL+ VVAEGLSKQP+D +  +I  +GFNCVRLTW
Sbjct: 34  PLSTSSRWIVDENGLRVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEMGFNCVRLTW 93

Query: 83  PLFLATNESLSS-LTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAEL 142
           PL L TNE+L++ +TVRQSFQ LGL + I G Q NNP IIDLPLI+A++ VV  LG  ++
Sbjct: 94  PLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTVVTTLGNNDV 153

Query: 143 MVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNEL 202
           MV+LDNH++KPGWCC+N DGNGFFGDQ+F+P  W+  L +MA  FNGV++VVGMSLRNEL
Sbjct: 154 MVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLRNEL 213

Query: 203 RGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYE 262
           RGPKQNVNDW++YMQ+GAEAVH+AN  VLVILSGLSFD DLSF++++P+ L+FT KLV+E
Sbjct: 214 RGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKLVFE 273

Query: 263 VHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDNR 322
           +HWY+FSDG+SW + NPN +CGRV+N +    G+LL QG PLFL+EFG+D+RG N NDNR
Sbjct: 274 LHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNR 333

Query: 323 YLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISAL 382
           Y  C    AAE D+DW++W L GSYYLR+G VG+NE+YG+LD +W + RNSSFLQ+IS L
Sbjct: 334 YFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFL 393

Query: 383 QTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDP--LRLGPCADSDPWYYTPQKFLTLK 442
           Q+P QGPG      Y+++FHP TGLC+ R SL DP  L LGPC  S+PW YT +K L +K
Sbjct: 394 QSPLQGPG-PRTDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIK 453

Query: 443 GTYFCIQAQEMGKQP--RLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNE 502
               C+Q+    K P       C+ S ++W  +S S+MHL+S   N + +CLDVD   N 
Sbjct: 454 DQQLCLQSNG-PKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLDVD-TANN 513

Query: 503 IVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTR 533
           +V NACKCLS+D SC+P SQWFK++ +TR   ++R
Sbjct: 514 VVANACKCLSKDKSCEPMSQWFKIIKATRPLKSSR 543

BLAST of CmaCh04G017890 vs. TAIR10
Match: AT3G26130.1 (AT3G26130.1 Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 632.5 bits (1630), Expect = 2.5e-181
Identity = 299/510 (58.63%), Postives = 391/510 (76.67%), Query Frame = 1

Query: 23  PLHTDSRWIVDE--QGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRL 82
           P  TDSRWIVD+  +G+RVKL CVNW SHLE  VAEGLSKQP+D I  +I S+GFNCVRL
Sbjct: 22  PPSTDSRWIVDDGNKGRRVKLTCVNWPSHLETAVAEGLSKQPLDAIAEKIVSMGFNCVRL 81

Query: 83  TWPLFLATNESLSS-LTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEA 142
           TWPL+LAT+ES S+ +TVRQS ++  L EA++G Q +NP I+DLPLIKAF+ VV  L + 
Sbjct: 82  TWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNPTILDLPLIKAFQEVVYCLEKH 141

Query: 143 ELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVA-HVVGMSLR 202
            +MV+LDNHIS+PGWCCS+ DGNGFFGD++ NP+ WI+GL +MA+MF  V+ +VVGMSLR
Sbjct: 142 RVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVGMSLR 201

Query: 203 NELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKL 262
           NELRGPKQN+ DWY+YM+ GAEAVH+ NP+VLVI+SGL++  DLSFL+ +P  ++F  K+
Sbjct: 202 NELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSFRRKV 261

Query: 263 VYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVN 322
           V+E+HWY F   ++W+  N N++CG+    +MKMSGFLLE+G+PLF++EFG+DQRG N N
Sbjct: 262 VFEIHWYGF--WNTWEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRGNNAN 321

Query: 323 DNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRI 382
           DN++LSCF+++AA+ DLDW++WTL GSYY+RE  +G +E YG+LD+NW + RNS+ LQ I
Sbjct: 322 DNKFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTILQMI 381

Query: 383 SALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTL 442
           SA+QTPF   GL E +P  +MFHPSTGLC+ R SL   L+LG C  S+ W  +  + L+L
Sbjct: 382 SAIQTPF--IGLMETQPKKIMFHPSTGLCIVRKSLFQ-LKLGSCNRSESWRLSSHRVLSL 441

Query: 443 -KGTYFCIQAQEMGKQPRLGIICTVSN-AQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTN 502
            +    C++A E GK  +L +  + S  ++W + SDSKM LSS   NG  VCLDVD   N
Sbjct: 442 AEEQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENN 501

Query: 503 EIVTNACKCLSRDSSCDPSSQWFKLVNSTR 527
            IVTN+CKCL  +SSCDP SQWFKLV STR
Sbjct: 502 NIVTNSCKCLRGNSSCDPRSQWFKLVTSTR 526

BLAST of CmaCh04G017890 vs. TAIR10
Match: AT3G26140.1 (AT3G26140.1 Cellulase (glycosyl hydrolase family 5) protein)

HSP 1 Score: 612.5 bits (1578), Expect = 2.6e-175
Identity = 288/510 (56.47%), Postives = 386/510 (75.69%), Query Frame = 1

Query: 23  PLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLTW 82
           PL T+SRWI+DE+GQRVKL CVNW SHL+ VVAEGLSKQ +D++  +I ++GFNCVR TW
Sbjct: 4   PLSTNSRWIIDEKGQRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFNCVRFTW 63

Query: 83  PLFLATNESLSS-LTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAEL 142
           PL LATNE+L++ +TVRQSFQ LGL + I+G +  NP +IDLPLI+A++ VV  LG   +
Sbjct: 64  PLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAKLGNNNV 123

Query: 143 MVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNEL 202
           MV+LDNH++KPGWCC   DGNGFFGD +F+P  WI GLT++A  F G  +VVGMSLRNEL
Sbjct: 124 MVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNEL 183

Query: 203 RGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYE 262
           RGPKQNV+DW++YMQ+GAEAVH ANP+VLVILSGLS+D DLSF++++ +NLTFT KLV+E
Sbjct: 184 RGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFTRKLVFE 243

Query: 263 VHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDNR 322
           +H Y+F++ ++W S NPN+ CG ++ ++    GF L +  P+FL+EFG+D RG NVNDNR
Sbjct: 244 LHRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNL-RDFPVFLSEFGIDLRGKNVNDNR 303

Query: 323 YLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISAL 382
           Y+ C L  AAE D+DW++WTL GSYYLREGVVG++EFYG+LD +W   R+ SFLQR+S +
Sbjct: 304 YIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLI 363

Query: 383 QTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLR--LGPCADSDPWYYTPQKFLTLK 442
            +P QGPG ++ + Y+++FHP TGLC+   S+LDP +  LG C +S PW YTPQ  LTLK
Sbjct: 364 LSPLQGPG-SQSKVYNLVFHPLTGLCM-LQSILDPTKVTLGLCNESQPWSYTPQNTLTLK 423

Query: 443 GTYFCIQAQEMGKQPRLG-IICTVSN-AQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNE 502
               C+++       +L    C+  N ++W+ +S S M L++K  N S +CLDVD  TN 
Sbjct: 424 DKSLCLESTGPNAPVKLSETSCSSPNLSEWETISASNMLLAAKSTNNS-LCLDVD-ETNN 483

Query: 503 IVTNACKCL-SRDSSCDPSSQWFKLVNSTR 527
           ++ + CKC+   DSSCDP SQWFK+V  ++
Sbjct: 484 LMASNCKCVKGEDSSCDPISQWFKIVKVSK 508

BLAST of CmaCh04G017890 vs. TAIR10
Match: AT5G17500.1 (AT5G17500.1 Glycosyl hydrolase superfamily protein)

HSP 1 Score: 564.3 bits (1453), Expect = 8.2e-161
Identity = 272/523 (52.01%), Postives = 368/523 (70.36%), Query Frame = 1

Query: 5   LVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPID 64
           L L  S+ +L L  A   PL T SRWIV+ +G RVKL C NW SHL+ VVAEGLS QP+D
Sbjct: 11  LFLFLSLISLTL--ATDYPLFTKSRWIVNNKGHRVKLACANWPSHLKPVVAEGLSSQPMD 70

Query: 65  EITNRIGSLGFNCVRLTWPLFLATNESLS-SLTVRQSFQRLGLREAIAGVQANNPFIIDL 124
            I+ +I  +GFNCVRLTWPL L  N++L+ ++TV+QSF+R GL   + G+  +NP+I++ 
Sbjct: 71  SISKKIKDMGFNCVRLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNT 130

Query: 125 PLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMA 184
           PLI  F+AVV  LG  ++MV+LDNH + PGWCCSN D + FFGD  FNP+ W+ GL +MA
Sbjct: 131 PLINVFQAVVYSLGRHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMA 190

Query: 185 TMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLS 244
           T+F  V +VVGMSLRNELRG      DWY+YMQ+GAEAVH +NP+VLVILSGL+FD DLS
Sbjct: 191 TIFMNVKNVVGMSLRNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLS 250

Query: 245 FLKNQPINLTFTAKLVYEVHWYAFSDGS-SWQSGNPNQVCGRVVNNLMKMSGFLLEQGMP 304
           FLK++P+NL+F  KLV E+HWY+F+DG+  W+S N N  C ++ +   +  GF+L+QG P
Sbjct: 251 FLKDRPVNLSFKKKLVLELHWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFP 310

Query: 305 LFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLL 364
           LFL+EFG DQRG ++  NRY++C L+ AAE DLDWAVW + G YY REG  G+ E YG+L
Sbjct: 311 LFLSEFGTDQRGGDLEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGML 370

Query: 365 DWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLL--DPLRLG 424
           D NW N  N ++L+R+S +Q P  GPG+ +   +  +FHP TGLC+ R S      L LG
Sbjct: 371 DANWHNVHNYTYLRRLSVIQPPHTGPGV-KHNHHKKIFHPLTGLCLVRKSHCHESELTLG 430

Query: 425 PCADSDPWYYTPQKFLTL-KGTYFCIQAQ-EMGKQPRLGIICTVSNAQWDMVSDSKMHLS 484
           PC   +PW Y+    L + +G   C++ +  +GK  +LG ICT    + + +S +KMHLS
Sbjct: 431 PCTKDEPWSYSHGGILEIRRGHKSCLEGETAVGKSVKLGRICT----KIEQISATKMHLS 490

Query: 485 SKLDNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKL 522
               +GS+VCLDVD + N +V N+C CL+ D++C+P+SQWFK+
Sbjct: 491 FNTSDGSLVCLDVD-SDNNVVANSCNCLTGDTTCEPASQWFKI 525

BLAST of CmaCh04G017890 vs. TAIR10
Match: AT5G16700.1 (AT5G16700.1 Glycosyl hydrolase superfamily protein)

HSP 1 Score: 479.6 bits (1233), Expect = 2.7e-135
Identity = 239/505 (47.33%), Postives = 329/505 (65.15%), Query Frame = 1

Query: 23  PLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLTW 82
           PL T SRWIVDE+GQRVKL CVNW +HL+  VAEGLSKQP+D I+ +I S+GFNCVRLTW
Sbjct: 25  PLSTKSRWIVDEKGQRVKLACVNWPAHLQPTVAEGLSKQPLDSISKKIVSMGFNCVRLTW 84

Query: 83  PLFLATNESLS-SLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAEL 142
           PL L TN++L+  +TV+QSF+ L L E + G+Q +NP ++ LPL  AF+ VV  LGE  +
Sbjct: 85  PLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLLHLPLFNAFQEVVSNLGENGV 144

Query: 143 MVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNEL 202
           MV+LDNH++ PGWCC + D + FFG  +F+P  W +GL +MAT+F    HV+GMSLRNE 
Sbjct: 145 MVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNFTHVIGMSLRNEP 204

Query: 203 RGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYE 262
           RG +   + W+R+M +GAEAVHAANP +LVILSG+ FD +LSFL+++ +N++FT KLV+E
Sbjct: 205 RGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRSVNVSFTDKLVFE 264

Query: 263 VHWYAFSDG-SSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDN 322
           +HWY+FSDG  SW+  N N  C +++  +    GFLL +G PL L+EFG DQRG +++ N
Sbjct: 265 LHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRGFPLILSEFGTDQRGGDMSGN 324

Query: 323 RYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISA 382
           RY++C ++ AAE D                           LDW         +L+    
Sbjct: 325 RYMNCLVAWAAEND---------------------------LDWAVWALTGDYYLRT--- 384

Query: 383 LQTPFQGPGLAERRPYSVMFHPSTGLCVGR--ASLLDPLRLGPCADSDPWYYTPQKFLTL 442
                 GPGL   +  +++FHPSTGLCV    +  +  LRLGPC  SDPW + P + + L
Sbjct: 385 ------GPGLRPNK--NLLFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPWTFNPSEGI-L 444

Query: 443 KGTYFCIQAQEM-GKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNE 502
                C++A  + G++ +LG+    S      +S +KMHLS K  NG ++CLDVD   N 
Sbjct: 445 WINKMCVEAPNVVGQKVKLGVGTKCSKL--GQISATKMHLSFKTSNGLLLCLDVDERDNS 488

Query: 503 IVTNACKCLSRDSSCDPSSQWFKLV 523
           +V N CK L+ D+SCDP+SQWFK++
Sbjct: 505 VVANRCKFLTMDASCDPASQWFKVL 488

BLAST of CmaCh04G017890 vs. NCBI nr
Match: gi|659090649|ref|XP_008446127.1| (PREDICTED: uncharacterized protein LOC103488945 [Cucumis melo])

HSP 1 Score: 990.7 bits (2560), Expect = 1.0e-285
Identity = 465/553 (84.09%), Postives = 511/553 (92.41%), Query Frame = 1

Query: 1   MMKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSK 60
           MMKGL++L +V  +   AAVGLPLHTD+RWIVD +G+RVKLRCVNWVSHLEAVVAEGLSK
Sbjct: 1   MMKGLLILLAVWCVAASAAVGLPLHTDTRWIVDGEGERVKLRCVNWVSHLEAVVAEGLSK 60

Query: 61  QPIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFI 120
           QPI+EI+NRI  LGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAG+QANNPFI
Sbjct: 61  QPIEEISNRIEGLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLTEAIAGIQANNPFI 120

Query: 121 IDLPLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLT 180
           IDLPL+KAFEAVVG LGE +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNP+ WI+GLT
Sbjct: 121 IDLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLT 180

Query: 181 RMATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDK 240
           RMATMFNGV HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLSFD+
Sbjct: 181 RMATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSFDR 240

Query: 241 DLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQG 300
           DLSFLKNQPINLTFT+K VYEVHWYAFSDGSSW+SGN NQVCGR  NNLMKMSGFLL+QG
Sbjct: 241 DLSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQG 300

Query: 301 MPLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYG 360
            PLF++EFG+DQRGTNVNDNRYLSCFL+VAAE+DLDWA+WTLVGSYYLREGVVGLNEFYG
Sbjct: 301 FPLFISEFGIDQRGTNVNDNRYLSCFLAVAAEFDLDWALWTLVGSYYLREGVVGLNEFYG 360

Query: 361 LLDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLG 420
           +LDWNWCN RNS+FLQRIS LQTP QGPGLAERR Y+++FHP +GLCV R SLLDPLRLG
Sbjct: 361 ILDWNWCNLRNSTFLQRISVLQTPLQGPGLAERREYNLIFHPLSGLCVVRKSLLDPLRLG 420

Query: 421 PCADSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSK 480
           PC DSD WYYTPQKFLTLKGTYFCIQA E+GKQ +LGIICTV+NA+WDM+SDSK+HLSSK
Sbjct: 421 PCVDSDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSK 480

Query: 481 LDNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGS 540
             NGS+VCLDVD NTNEIVTN+CKCLSRDSSCDPSSQWFKLVNSTRS    RSMI+M GS
Sbjct: 481 SSNGSLVCLDVDANTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGGGRSMINMAGS 540

Query: 541 SLSFNLVPKLFVE 554
           SL+ N+V K FVE
Sbjct: 541 SLA-NVVTK-FVE 551

BLAST of CmaCh04G017890 vs. NCBI nr
Match: gi|778704830|ref|XP_004135502.2| (PREDICTED: uncharacterized protein LOC101217177 [Cucumis sativus])

HSP 1 Score: 979.5 bits (2531), Expect = 2.3e-282
Identity = 459/542 (84.69%), Postives = 501/542 (92.44%), Query Frame = 1

Query: 1   MMKGLVLLCSVAALWLQAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSK 60
           MMKGL+LL    A    AAVGLPLHTD+RWIVD  G+RVKLRCVNWVSHLEAVVAEGLSK
Sbjct: 1   MMKGLILLVWCVAA--SAAVGLPLHTDTRWIVDGAGERVKLRCVNWVSHLEAVVAEGLSK 60

Query: 61  QPIDEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFI 120
           QPI+EI+NRI  LGFNCVRLTWPLFLATNESL+SLTVRQSFQRLGL EAIAG+QANNPFI
Sbjct: 61  QPIEEISNRIQWLGFNCVRLTWPLFLATNESLNSLTVRQSFQRLGLAEAIAGIQANNPFI 120

Query: 121 IDLPLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLT 180
           IDLPL+KAFEAVVG LGE +LMV+LDNHISKPGWCCSNFDGNGFFGDQYFNP+ WI+GLT
Sbjct: 121 IDLPLLKAFEAVVGKLGEGKLMVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLT 180

Query: 181 RMATMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDK 240
           R+ATMFNGV HVV MSLRNELRGPKQNVNDWYRYMQRGAEAVH+ANPD+L+ILSGLS+D+
Sbjct: 181 RIATMFNGVNHVVAMSLRNELRGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDR 240

Query: 241 DLSFLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQG 300
           DLSFLKNQPINLTFT+K VYEVHWYAFSDGSSW+SGN NQVCGR  NNLMKMSGFLL+QG
Sbjct: 241 DLSFLKNQPINLTFTSKTVYEVHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQG 300

Query: 301 MPLFLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYG 360
            PLF++EFG+DQRGTNVNDNRYLSCFL+VAAE DLDWAVWTLVGSYYLREGVVGLNEFYG
Sbjct: 301 FPLFISEFGIDQRGTNVNDNRYLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYG 360

Query: 361 LLDWNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLG 420
           +LDWNWCN RNS+FLQRISALQ+PFQGPGLAERR Y+V+FHP +GLCV R SLLDPL LG
Sbjct: 361 ILDWNWCNLRNSTFLQRISALQSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLG 420

Query: 421 PCADSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSK 480
           PC D+D WYYTPQKFLTLKGTYFCIQA E+GKQ +LGIICTV+NA+WDM+SDSK+HLSSK
Sbjct: 421 PCVDTDAWYYTPQKFLTLKGTYFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSK 480

Query: 481 LDNGSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGS 540
             NGS+VCLDVD +TNEIVTN+CKCLSRDSSCDPSSQWFKLVNSTRS    RSMI+MVGS
Sbjct: 481 SSNGSLVCLDVDSSTNEIVTNSCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINMVGS 540

Query: 541 SL 543
           SL
Sbjct: 541 SL 540

BLAST of CmaCh04G017890 vs. NCBI nr
Match: gi|1009153113|ref|XP_015894462.1| (PREDICTED: uncharacterized protein LOC107428448 [Ziziphus jujuba])

HSP 1 Score: 861.3 bits (2224), Expect = 9.2e-247
Identity = 386/521 (74.09%), Postives = 460/521 (88.29%), Query Frame = 1

Query: 17  QAAVGLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFN 76
           Q  + LPL+T+SRWIVDE GQRVKL C+NWVSHLEAVVAEGLSKQP+D I+ RI S+GFN
Sbjct: 28  QPVLALPLYTNSRWIVDEGGQRVKLACLNWVSHLEAVVAEGLSKQPLDTISKRIVSMGFN 87

Query: 77  CVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWL 136
           CVRLTWPLFLATN+SL+S+TVRQSFQ LGL E+IAG+QANNP IIDLP+I A++AVV  L
Sbjct: 88  CVRLTWPLFLATNDSLASITVRQSFQSLGLLESIAGIQANNPSIIDLPIINAYQAVVSNL 147

Query: 137 GEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMS 196
            +  +MV+LDNHIS PGWCCSNFDGNGFFGDQYF P+ WI+GLTRMAT+FNGV +VVGMS
Sbjct: 148 RDNNVMVILDNHISNPGWCCSNFDGNGFFGDQYFKPDLWIKGLTRMATLFNGVTNVVGMS 207

Query: 197 LRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTA 256
           LRNELRG KQNV DWYRYMQRGAEAVH+ANPDVLVILSGL++DKD SFLKN P+NLTF+ 
Sbjct: 208 LRNELRGSKQNVKDWYRYMQRGAEAVHSANPDVLVILSGLNYDKDFSFLKNSPVNLTFSG 267

Query: 257 KLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTN 316
           KLV+EVHWY FSDG +W++GNPNQVCGRVV+N+M++SGFLL QG PLF++EFGVDQRGTN
Sbjct: 268 KLVFEVHWYGFSDGKAWETGNPNQVCGRVVDNMMRVSGFLLGQGWPLFVSEFGVDQRGTN 327

Query: 317 VNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQ 376
           VNDNRYLSCFL+ AAE DLDWA+WTLVGSYY R+GV+G+NE+YG+L+WNWC  RNSSFLQ
Sbjct: 328 VNDNRYLSCFLATAAELDLDWALWTLVGSYYFRQGVIGMNEYYGVLNWNWCETRNSSFLQ 387

Query: 377 RISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFL 436
           RISA+Q+PFQGPG+A+RRPY ++FHP TGLCV R SLLDPL+LGPC++++ W YTPQK L
Sbjct: 388 RISAIQSPFQGPGIAQRRPYKIIFHPLTGLCVLRKSLLDPLKLGPCSEAEAWKYTPQKTL 447

Query: 437 TLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTN 496
           T+KGTYFC+QA E+GK  +LGIICT SN++W+++SDSK+H+SSK  +G VVCLDVD N N
Sbjct: 448 TVKGTYFCLQADELGKPAQLGIICTESNSKWEIISDSKLHISSKTSSGDVVCLDVDSN-N 507

Query: 497 EIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISM 538
            IVTN CKCLSR++ CDP SQWFKLV+STRS    +S + +
Sbjct: 508 VIVTNTCKCLSRENMCDPGSQWFKLVDSTRSLHAPKSNVKI 547

BLAST of CmaCh04G017890 vs. NCBI nr
Match: gi|255547996|ref|XP_002515055.1| (PREDICTED: uncharacterized protein LOC8285469 [Ricinus communis])

HSP 1 Score: 848.2 bits (2190), Expect = 8.1e-243
Identity = 388/548 (70.80%), Postives = 472/548 (86.13%), Query Frame = 1

Query: 5   LVLLCSVAALWLQAAV-GLPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPI 64
           +    +++A+  Q+ V  LPL T+SRWIVDE GQRVKL CVNWVSHLEAVVAEGLSKQP+
Sbjct: 14  ITFFIAISAIIPQSQVTALPLSTNSRWIVDENGQRVKLACVNWVSHLEAVVAEGLSKQPM 73

Query: 65  DEITNRIGSLGFNCVRLTWPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDL 124
           D I  +I S+GFNCVRLTWPL+L TN++L+SL+VRQSFQ LGL E+I+G+QANNP IIDL
Sbjct: 74  DMIAKKIVSMGFNCVRLTWPLYLVTNDTLASLSVRQSFQGLGLLESISGIQANNPSIIDL 133

Query: 125 PLIKAFEAVVGWLGEAELMVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMA 184
           PLIKA++AVV  LG+  +MV+LDNHISKPGWCCSNFDGNGFFGD YFNP+ WI+GLT+MA
Sbjct: 134 PLIKAYQAVVSSLGDNNVMVILDNHISKPGWCCSNFDGNGFFGDTYFNPDLWIKGLTQMA 193

Query: 185 TMFNGVAHVVGMSLRNELRGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLS 244
           T+FNGV +V+GMSLRNELRG KQNVNDWYRYM++GAEAVH+ANPDVLVILSGL++DKD S
Sbjct: 194 TLFNGVTNVIGMSLRNELRGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDFS 253

Query: 245 FLKNQPINLTFTAKLVYEVHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPL 304
           FL+N+P+NL+FT K+V+EVHWY FSDG +W+SGNPNQVCGRVV+NLM++SGFLLEQG P+
Sbjct: 254 FLRNRPVNLSFTGKVVFEVHWYGFSDGQAWRSGNPNQVCGRVVDNLMRISGFLLEQGWPM 313

Query: 305 FLTEFGVDQRGTNVNDNRYLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLD 364
           F++EFGVDQRGTNVNDNRYL CF+ VAAE D DWA+WTLVGSYYLR+GV+GLNE+YG+L+
Sbjct: 314 FVSEFGVDQRGTNVNDNRYLGCFIGVAAELDWDWALWTLVGSYYLRQGVIGLNEYYGVLN 373

Query: 365 WNWCNPRNSSFLQRISALQTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCA 424
           WNWC+ RNSSFLQ+ISALQ+PFQGPGL+E  P+ V+FHPSTGLCV R S+L+PLRLG C 
Sbjct: 374 WNWCDVRNSSFLQQISALQSPFQGPGLSETNPHKVIFHPSTGLCVQRKSMLEPLRLGSCT 433

Query: 425 DSDPWYYTPQKFLTLKGTYFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDN 484
           DS+ W YT +  LTL+GTYFC+QA E+GK  +LGIICT S ++WD++SDSKMHLSSK+ N
Sbjct: 434 DSEAWRYTSENTLTLRGTYFCLQADELGKPAKLGIICTDSTSKWDVISDSKMHLSSKITN 493

Query: 485 GSVVCLDVDPNTNEIVTNACKCLSRDSSCDPSSQWFKLVNSTRSFDTTRSMISMVGSSLS 544
           G+ VCLDVD N N IV + CKCLSRD++CDP SQWFKLVNSTRS  T +  + +  +S+ 
Sbjct: 494 GTAVCLDVDSN-NTIVISTCKCLSRDNTCDPESQWFKLVNSTRSSATAKPSLRI--NSIL 553

Query: 545 FNLVPKLF 552
            +L  K F
Sbjct: 554 LDLPAKEF 558

BLAST of CmaCh04G017890 vs. NCBI nr
Match: gi|643714004|gb|KDP26669.1| (hypothetical protein JCGZ_17827 [Jatropha curcas])

HSP 1 Score: 846.7 bits (2186), Expect = 2.4e-242
Identity = 376/506 (74.31%), Postives = 452/506 (89.33%), Query Frame = 1

Query: 22  LPLHTDSRWIVDEQGQRVKLRCVNWVSHLEAVVAEGLSKQPIDEITNRIGSLGFNCVRLT 81
           +PL TDSRWIVDE+GQRVKL CVNWVSHLEAVVAEGLS++P+D I  +I S+GFNCVRLT
Sbjct: 31  VPLSTDSRWIVDEKGQRVKLACVNWVSHLEAVVAEGLSREPMDLIAKKIVSMGFNCVRLT 90

Query: 82  WPLFLATNESLSSLTVRQSFQRLGLREAIAGVQANNPFIIDLPLIKAFEAVVGWLGEAEL 141
           WPL+L TNE+ +SLTV+QSFQ LGL E+I+G+QANNP IIDLPLIKA++AVV  LG+  +
Sbjct: 91  WPLYLVTNETYASLTVKQSFQNLGLLESISGIQANNPAIIDLPLIKAYQAVVSSLGDNNV 150

Query: 142 MVVLDNHISKPGWCCSNFDGNGFFGDQYFNPEKWIEGLTRMATMFNGVAHVVGMSLRNEL 201
           +V+LDNHISKPGWCCSNFDGNGFFGD YFNP+ WI GLT+MAT+FNGV +VVG+SLRNEL
Sbjct: 151 LVILDNHISKPGWCCSNFDGNGFFGDSYFNPDLWINGLTQMATIFNGVPNVVGLSLRNEL 210

Query: 202 RGPKQNVNDWYRYMQRGAEAVHAANPDVLVILSGLSFDKDLSFLKNQPINLTFTAKLVYE 261
           RG KQNVNDWYRYM++GAEAVH+ANPDVLVILSGL++DKDLSFL+N+P+NLTFT KLV+E
Sbjct: 211 RGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDLSFLRNRPVNLTFTGKLVFE 270

Query: 262 VHWYAFSDGSSWQSGNPNQVCGRVVNNLMKMSGFLLEQGMPLFLTEFGVDQRGTNVNDNR 321
           VHWY FSDG +W++GNPNQVCGRV +N+M+MSG+LL+QG PLF++EFG+DQRGTNVNDNR
Sbjct: 271 VHWYGFSDGQAWKNGNPNQVCGRVTSNMMRMSGYLLDQGWPLFVSEFGIDQRGTNVNDNR 330

Query: 322 YLSCFLSVAAEYDLDWAVWTLVGSYYLREGVVGLNEFYGLLDWNWCNPRNSSFLQRISAL 381
           YL CFL  AAE DLDWA+WTLVGSYYLREGV+GLNE+YG+L+WNWC+ RNSSFLQ+ISAL
Sbjct: 331 YLGCFLGWAAELDLDWALWTLVGSYYLREGVLGLNEYYGVLNWNWCDIRNSSFLQQISAL 390

Query: 382 QTPFQGPGLAERRPYSVMFHPSTGLCVGRASLLDPLRLGPCADSDPWYYTPQKFLTLKGT 441
           Q+PFQGPG++E   + V+FHP TGLCV R S+L+PL+LGPC DS+ W YTPQK L+LKGT
Sbjct: 391 QSPFQGPGVSESNLHKVIFHPLTGLCVQRKSMLEPLKLGPCTDSEAWRYTPQKILSLKGT 450

Query: 442 YFCIQAQEMGKQPRLGIICTVSNAQWDMVSDSKMHLSSKLDNGSVVCLDVDPNTNEIVTN 501
           YFC+QA E+GK  +LG+ICT S+++WD++SDS MHLSSK+ NG+ +CLDVD N N IV N
Sbjct: 451 YFCLQADELGKPAKLGVICTDSDSKWDVISDSNMHLSSKISNGTTICLDVDSN-NTIVIN 510

Query: 502 ACKCLSRDSSCDPSSQWFKLVNSTRS 528
            CKCLSRD++CDP SQWFKLVNSTRS
Sbjct: 511 TCKCLSRDNTCDPGSQWFKLVNSTRS 535

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KTH1_CUCSA1.6e-28284.69Uncharacterized protein OS=Cucumis sativus GN=Csa_5G593400 PE=3 SV=1[more]
B9RMT6_RICCO5.6e-24370.80Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO... [more]
A0A067K4F9_JATCU1.6e-24274.31Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17827 PE=3 SV=1[more]
A0A061EEN1_THECC6.2e-24274.66Cellulase protein OS=Theobroma cacao GN=TCM_010637 PE=3 SV=1[more]
V4TIH1_9ROSI1.0e-23973.40Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031098mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G13130.12.2e-19059.81 Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26130.12.5e-18158.63 Cellulase (glycosyl hydrolase family 5) protein[more]
AT3G26140.12.6e-17556.47 Cellulase (glycosyl hydrolase family 5) protein[more]
AT5G17500.18.2e-16152.01 Glycosyl hydrolase superfamily protein[more]
AT5G16700.12.7e-13547.33 Glycosyl hydrolase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659090649|ref|XP_008446127.1|1.0e-28584.09PREDICTED: uncharacterized protein LOC103488945 [Cucumis melo][more]
gi|778704830|ref|XP_004135502.2|2.3e-28284.69PREDICTED: uncharacterized protein LOC101217177 [Cucumis sativus][more]
gi|1009153113|ref|XP_015894462.1|9.2e-24774.09PREDICTED: uncharacterized protein LOC107428448 [Ziziphus jujuba][more]
gi|255547996|ref|XP_002515055.1|8.1e-24370.80PREDICTED: uncharacterized protein LOC8285469 [Ricinus communis][more]
gi|643714004|gb|KDP26669.1|2.4e-24274.31hypothetical protein JCGZ_17827 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
IPR013781Glycoside hydrolase, catalytic domain
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G017890.1CmaCh04G017890.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000772Ricin B, lectin domainunknownSSF50370Ricin B-like lectinscoord: 395..521
score: 2.3
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 61..342
score: 1.8
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 22..376
score: 3.7
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 23..379
score: 1.23
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 445..527
score: 1.7E-239coord: 1..399
score: 1.7E
NoneNo IPR availablePANTHERPTHR31263:SF0CELLULASE (GLYCOSYL HYDROLASE FAMILY 5) PROTEIN-RELATEDcoord: 1..399
score: 1.7E-239coord: 445..527
score: 1.7E

The following gene(s) are paralogous to this gene:

None