Cla007754 (gene) Watermelon (97103) v1

NameCla007754
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCellulase (Glycosyl hydrolase family 5) protein (AHRD V1 **-- F4JBE4_ARATH); contains Interpro domain(s) IPR001547 Glycoside hydrolase, family 5
LocationChr2 : 917534 .. 919397 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTATGTACTGTTTAAAAGAGTTTTTTGATATTCTAAATGAAACCAATATCAAACATCTGAAAATTAGAAAATAAATCATTTCATTAACAAGTTTTTTCATTATGGCATATAGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGGTAAGTAAATTAACAAATCTAAGTAGTTTTAAGCATTTGTAAAGAGGAAAAATTGTTATAAAAAATTATAAGCAGCCCACACAATTAAACCAGTTTATGGGATCAAAATATAATGTGTAACCTTTTATTGATTTGCAGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

mRNA sequence

ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

Coding sequence (CDS)

ATGGGAAGAAATTTACAAACCATTCTCTTTTTGGCAATAGTCTTTCAGTTTTTATCTTTCTCGGCTTATTCCTTGCCTTTGTCAACAAAAGGAAGATGGATCGTTGATTCGAAGACTGGGCATCGTGTCAAACTTGTTTGTGTAAATTGGCCTTCTCACACTCAAAGCATGCTGGCAGAAGGCCTCAACCATCGACCATTGAAAGAACTTGCTGATGAGGCAATCAAGTTGAGGTTCAATTGTGTGCGTCTCACATATGCAACTCACATGTTCACTCGCTATGCTAATAGGACAATTGAAGAGAACTTTGACCTTCTTGATTTAAAGCAAGCCAAAGCTGGATTGGCTCAACATAACCCTTTTGTACTGAATAAGACCATTGTTGAAGCCTATGAAGCTGTTGTTGATGTGCTTGGGGCAAGTGGGTTAATGGTTATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGCTCTCTTGATGATGGTAATGGCTTCTTTGGAAATCGAAATTTTGATCCTCAAGAATGGTTACAAGGTCTTCGCTTAGTCGCTCAACGTTTCATAAACAAGTCAACGGTGGTAGCAATGAGCCTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACAACATAAATCCAGAAGTTGTAGTGATTGTTTCGGGGCTAAATTATGATAATGATCTTCGATGCTTAAAAGAAAAGCCTTTGAACGTTAACACCTTAGACAATAAGCTAGTTTTTGAAGTCCACTTATATTCTTTTAGTGGAGATTCTGAGAGCAAATTCGTAACACAACCATTAAATGATATTTGTGCAACTATCATGAATGGGTTTATAGATCATGCTGAGTTTGTAATTGAAGGATCAAATCCATTTCCTTTGTTTGTTAGCGAATATGGGTATGATCAAAGGGAGGTTGATGAAGCTGAAAATCGGTTCATGAGTTGCTTTACCGCTCATCTTGTTCAGAAAGACTTGGATTGGGCATTATGGACTTGGCAAGGTAGTTATTATTATAGGGAAGGTCAAGCAGAGCCTGCAGAAACATTTGGTGTTCTCAACTCTAATTGGACTCAGATTAAGAACCCTAACTTTCCTAAAAGGTTCCAACTATTGCAGACAATGTTGCAGGATCCAAATTCCAATGCTTCTAACTCGTATGTTATGTATCATCCACAAAGTGGGCAATGTGCCCAAGTGTCAAATGACAAGACAGGAATATTTTTGAACAATTGCTCTACCTCAAGCCGTTGGAGTCATAATGGTGATGGTACCCCAATTAGGTTGACATCCACTGGTTTGTGTTTGAAGTCCAATGGAGAAGGCCTTGGGGCATCCCTTTCAAATGATTGTTTGAGTCAACAGAGCGCTTGGAGAGCCATTTCTAACTCTAAGCTTCACCTTGCCACCTTCACTCAAGATGGAAACAACCTTTGTTTACAAATTGAAAACTCCAACTCCTCAAAGATTGTGGTCAACTCCTGTATTTGCACCAATGGCGACCCAAAATGCCTTGAAGACACCCAAAGCCAATGGCTCACACTCGTTGCAACCAATACCTTGTAA

Protein sequence

MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL
BLAST of Cla007754 vs. Swiss-Prot
Match: GUNA_XANCP (Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) GN=engXCA PE=1 SV=2)

HSP 1 Score: 68.6 bits (166), Expect = 2.4e-10
Identity = 94/410 (22.93%), Postives = 157/410 (38.29%), Query Frame = 1

Query: 9   LFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVN-WPSHTQSMLAEGLNHRPL 68
           L LA      +  A+S  ++   R IVD  +G  V+L  VN +   T + +  GL  R  
Sbjct: 10  LALATALALAAGPAFSYSINNS-RQIVDD-SGKVVQLKGVNVFGFETGNHVMHGLWARNW 69

Query: 69  KELADEAIKLRFNCVRLTYATHMF---TRYANRTIEENFDLLDLKQAKAGLAQHNPFVLN 128
           K++  +   L FN VRL +        T  A+     N DL  L   +         +L+
Sbjct: 70  KDMIVQMQGLGFNAVRLPFCPATLRSDTMPASIDYSRNADLQGLTSLQ---------ILD 129

Query: 129 KTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLV 188
           K I E          A G+ V+ D+H      C  + +    +   ++   +WL  LR V
Sbjct: 130 KVIAE--------FNARGMYVLLDHHTPD---CAGISE---LWYTGSYTEAQWLADLRFV 189

Query: 189 AQRFINKSTVVAMSLRNEIRGMM-----ENANDWNNYVTQGVTTIHNINPEVVVIVSGLN 248
           A R+ N   V+ + L+NE  G         A DWN    +G   +  + P+ ++ V G+ 
Sbjct: 190 ANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWLIAVEGIT 249

Query: 249 ------------YDNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDIC 308
                       +  +L+ L   PLN+    N+L+   H+Y         FV    ND  
Sbjct: 250 DNPVCSTNGGIFWGGNLQPLACTPLNIPA--NRLLLAPHVY-----GPDVFVQSYFND-- 309

Query: 309 ATIMNGFIDHAEFVIEG-----SNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDL 368
               + F ++   + E      +    L + E+G    E D  +  +      +L  K +
Sbjct: 310 ----SNFPNNMPAIWERHFGQFAGTHALLLGEFGGKYGEGDARDKTWQDALVKYLRSKGI 368

Query: 369 DWAL-WTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQTM 392
           +    W+W              +T G+L  +WT ++      +  LL+T+
Sbjct: 370 NQGFYWSW---------NPNSGDTGGILRDDWTSVRQ----DKMTLLRTL 368

BLAST of Cla007754 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 939.1 bits (2426), Expect = 2.4e-270
Identity = 454/538 (84.39%), Postives = 487/538 (90.52%), Query Frame = 1

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           M R +Q IL LA+V  F SFSAYSLPLST GRWI+DS++G RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF NKSTVV MSLRNE+RGMMENANDWNNYVTQGVTTIH INP V+VIVSGLNY
Sbjct: 181 LSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLK+KPLNV+TLDNKL FEVHLYSFSGDSESKFV QPLN+ICA IM+ FIDHAEF
Sbjct: 241 DNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           VIEG NPFPLFVSEYGYDQREVD+AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E AETFGVL+SNWTQIKNPNF ++FQLLQTMLQDP SNAS SYV+YH QSGQC +VSND 
Sbjct: 361 ELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IFL NCSTSSRWSH+ D TPI+++STGLCLK++GEGL ASLS DC+ +QS W AISNS
Sbjct: 421 KEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSAISNS 480

Query: 481 KLHLATFTQDGNNLCLQ-IENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
            LHL T T+DG +LCLQ IE+SNSSKIV NSCICT  DP CL+DTQSQW  LVATNTL
Sbjct: 481 NLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATNTL 538

BLAST of Cla007754 vs. TrEMBL
Match: A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 6.6e-172
Identity = 296/526 (56.27%), Postives = 379/526 (72.05%), Query Frame = 1

Query: 11  LAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKEL 70
           L  VF  L+F AYSLPLST GRWIVD+ TG RVKL+CVNWP H Q MLAEGL+ RPL ++
Sbjct: 12  LVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRRPLDDI 71

Query: 71  ADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEA 130
                KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+AQ+NP ++N T+VEA
Sbjct: 72  ISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNLTLVEA 131

Query: 131 YEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFIN 190
           Y AVVD L A G+MV++DNHISQPRWCC+ DDGNGFFG+R FDP+EWLQG+ L AQ   +
Sbjct: 132 YGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAAQSLKS 191

Query: 191 KSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEK 250
           K+ VVAMS+RNE RG  +N   W  Y++QG   IH INP  +V+VSGL+YD DL  LK +
Sbjct: 192 KAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLKNR 251

Query: 251 PLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPL 310
            +  N LDNKLVFE HLYSF+ +    ++++PLN  CA++  GF D A F++ G NP PL
Sbjct: 252 SMGFN-LDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQNPMPL 311

Query: 311 FVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLN 370
           FVSE+G DQR V+E +NRF+SCF ++L + D DW LW  QGSYYYREG     E FGVL+
Sbjct: 312 FVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEENFGVLD 371

Query: 371 SNWTQIKNPN-FPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCS 430
           S + + KN   F +RFQL+QT LQDP+SN + S +MYHP SG C ++ N K  + +++C 
Sbjct: 372 STFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRM-NKKYQLGISSCK 431

Query: 431 TSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQ 490
           TS+RW H  D +PI+L  + LCLK+ G GL   LS DC SQQS W+  S++KL LAT  +
Sbjct: 432 TSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSSAKLQLATVDE 491

Query: 491 DGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
            G  LCLQ   S+S +IV N C+C+N D +C ED QSQW TLV +N
Sbjct: 492 QGQALCLQRAASHSHQIVTNKCLCSN-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla007754 vs. TrEMBL
Match: A0A0A0KNB6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 3.1e-169
Identity = 294/526 (55.89%), Postives = 377/526 (71.67%), Query Frame = 1

Query: 11  LAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKEL 70
           L  VF FL   A SLPLST GRWIVD+ TG+RVKL+CVNW  H Q MLAEGL+ RPL ++
Sbjct: 12  LVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLHLRPLDDI 71

Query: 71  ADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEA 130
           A   +K RFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+AQ+NP +LN T+V+A
Sbjct: 72  AALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSILNMTVVQA 131

Query: 131 YEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFIN 190
           Y AV+D L A  +MV++DNHISQPRWCC+ DDGNGFFG+R FDPQEWLQG+ L AQ   +
Sbjct: 132 YGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISLAAQNLKS 191

Query: 191 KSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEK 250
           KS VVAMSLRNE RG  +N   W  Y++QG   IH INP  +V+VSGL+YD DL  LK +
Sbjct: 192 KSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLKNR 251

Query: 251 PLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPL 310
            +  N LDNKLVFE HLYSF+ +    ++++PLN  CA++  GF D A F++ G NP PL
Sbjct: 252 SMGFN-LDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQNPIPL 311

Query: 311 FVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLN 370
           FVSE+G DQR V+E +NRF+SCF ++L + D DW LW  QGSYYYREG     E FGVL+
Sbjct: 312 FVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEENFGVLD 371

Query: 371 SNWTQIKNPN-FPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCS 430
           S + + KN   F +RFQL+QT LQDP+SN + S +MYHP SG C ++ N K  + +++C 
Sbjct: 372 STFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRM-NKKYQLGISSCK 431

Query: 431 TSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQ 490
           TS+RW H  D +PI+L  + LCLK+ G GL   LS DC SQQS W+  SN+KL LAT  +
Sbjct: 432 TSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSNAKLQLATIDE 491

Query: 491 DGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
            G  LCLQ   S+S ++V N C+C++ D +C ED QSQW TLV +N
Sbjct: 492 QGQALCLQRAASHSHQLVTNKCLCSS-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla007754 vs. TrEMBL
Match: B9RCJ5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1689380 PE=3 SV=1)

HSP 1 Score: 592.8 bits (1527), Expect = 4.2e-166
Identity = 286/535 (53.46%), Postives = 374/535 (69.91%), Query Frame = 1

Query: 2   GRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEG 61
           G+  +TILF +     LS S YSLPLS   RWI+D+K+G RVKL CVNW SH Q MLAEG
Sbjct: 3   GKASKTILFFSFFLLVLSLS-YSLPLSINKRWIIDAKSGERVKLACVNWASHLQPMLAEG 62

Query: 62  LNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPF 121
           L+ +PL  LA +  +  FNCVR T ATHMFTRY   T+ ++FD L+L +AKAG+A+HN F
Sbjct: 63  LDKKPLSYLASKLARYHFNCVRFTCATHMFTRYGKLTVAQSFDSLNLTKAKAGIARHNSF 122

Query: 122 VLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGL 181
           +LN T+V+AYEAVV+ LGA GLMV+ DNH+SQP+WCC  DD NGFFG+ +F P+EWL+GL
Sbjct: 123 LLNLTVVQAYEAVVNELGAHGLMVLLDNHVSQPKWCCPQDDENGFFGDIHFHPKEWLRGL 182

Query: 182 RLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYD 241
            +VA+ F  KS VVAMS+RNE+RG  +N +DW  Y+ +G   +H +NPEV+V+VSGL + 
Sbjct: 183 AIVAKIFQGKSQVVAMSMRNELRGPYQNEHDWYKYIQEGARMVHKLNPEVLVLVSGLVWG 242

Query: 242 NDLRCLKEKPLNVN-TLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 301
            DL  LK+KPL++   LDNKLV+E H YSFSGD +  +  QPLN IC       +D + F
Sbjct: 243 TDLSFLKKKPLHLGLNLDNKLVYEAHWYSFSGDPK-VWEVQPLNRICDLKTQIQVDLSGF 302

Query: 302 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 361
           VI G NP PLF+ E G DQR V+ A+NRF +CF A++ + DLDW LW +QGSYY++EG A
Sbjct: 303 VITGENPVPLFLGEVGIDQRGVNRADNRFFTCFLAYVAENDLDWGLWAFQGSYYFKEGIA 362

Query: 362 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 421
            P E +G++N +W  +++P F  R  L++ M+QDP+S  S SY+MYHP SG C   S +K
Sbjct: 363 GPDENYGLMNFDWNYLRSPEFDDRIWLIKRMIQDPDSILSTSYLMYHPLSGNCVHAS-EK 422

Query: 422 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 481
             I+ +     SRWSH+GDG PIRL  + LCLK+ G+GL   LSNDC SQQS+W+ +S+S
Sbjct: 423 NEIYASRFQQHSRWSHDGDGAPIRLMGSALCLKAIGDGLEPVLSNDCFSQQSSWKLLSSS 482

Query: 482 KLHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
           KLHL    + G  LCL+ E+ NSSK+    CIC   D  C E+ QSQW  L+ TN
Sbjct: 483 KLHLGVKDEHGEYLCLEKESFNSSKVFTRKCICIEDDSDCQENPQSQWFKLIKTN 534

BLAST of Cla007754 vs. TrEMBL
Match: A0A059CGH5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01800 PE=3 SV=1)

HSP 1 Score: 570.5 bits (1469), Expect = 2.2e-159
Identity = 279/535 (52.15%), Postives = 372/535 (69.53%), Query Frame = 1

Query: 4   NLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLN 63
           +L   LFL +V   ++F++  LPLST GRWIVDS TG R+KL CVNW SH + MLAEGL+
Sbjct: 9   SLVLFLFLLLVLS-INFTS-PLPLSTNGRWIVDSATGRRMKLACVNWASHLEPMLAEGLD 68

Query: 64  HRPLKELADEAIKLRFNCVRLTYATHMFTR--YANRTIEENFDLLDLKQAKAGLAQHNPF 123
            +PL  +  E  +LRFNCVRLT+AT+MFT+  + ++ +EE  D L L +AK G+A++NP 
Sbjct: 69  KKPLGVIVAEIRRLRFNCVRLTWATYMFTQPGHGDQPVEETLDSLGLAEAKGGVARNNPL 128

Query: 124 VLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGL 183
           VLN T VEAY AVVD LG  G+MV+ DNH+S+P+WCC+ DDGNGFFG+  FDP+EWL+GL
Sbjct: 129 VLNMTHVEAYAAVVDELGKQGVMVVLDNHVSKPKWCCAYDDGNGFFGDEYFDPEEWLRGL 188

Query: 184 RLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYD 243
             VA+ F  KS VV MS+RNE+RG  +N  DW  Y+    T +H  NP V+VI+SGLN+ 
Sbjct: 189 VAVAEHFNGKSQVVGMSVRNELRGPRQNDYDWYQYIRTAATKVHQANPNVLVILSGLNWA 248

Query: 244 NDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFV 303
           +DL  L+++P+ + +L  KLV+E H YSFSGD +  +  QP++ +CA  +    D A F+
Sbjct: 249 SDLSFLRKRPVGL-SLGRKLVYEAHWYSFSGDRKI-WEVQPVDRVCANAVQRMEDQAGFL 308

Query: 304 IEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAE 363
             G    PLF+ E+G+DQ    +A++RF+SCF  +   KDLDWALW  QGSYYYR+G   
Sbjct: 309 SSGPGAVPLFLGEFGFDQTGKSQADDRFLSCFMGYAAGKDLDWALWALQGSYYYRQGVVG 368

Query: 364 PAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKT 423
           P ETFGVL+ NW  ++NP F +RFQL+QTM+QDP+SN+  SY+MYHPQSG C + +N+  
Sbjct: 369 PEETFGVLDFNWDGLRNPKFKERFQLVQTMVQDPSSNSPMSYIMYHPQSGLCIRANNNHE 428

Query: 424 GIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSK 483
            I    C   SRW H  DG+PIRL  T LCLK+ G+GL   LSNDC +++SAWR+ISNSK
Sbjct: 429 -IGTAECQHWSRWIHYRDGSPIRLMGTPLCLKALGDGLPPVLSNDCSNRRSAWRSISNSK 488

Query: 484 LHLATFTQDGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNT 537
           LH+A   + GN LCL+ +++ SS I+   CIC + D  C E+ Q QW   V TNT
Sbjct: 489 LHVAATDEHGNRLCLEKKSNESSVILTRKCICVDDDSGCTENPQGQWFKFVPTNT 538

BLAST of Cla007754 vs. NCBI nr
Match: gi|778721997|ref|XP_011658389.1| (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus])

HSP 1 Score: 939.1 bits (2426), Expect = 3.4e-270
Identity = 454/538 (84.39%), Postives = 487/538 (90.52%), Query Frame = 1

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           M R +Q IL LA+V  F SFSAYSLPLST GRWI+DS++G RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLAQ+NP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           L LVAQRF NKSTVV MSLRNE+RGMMENANDWNNYVTQGVTTIH INP V+VIVSGLNY
Sbjct: 181 LSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNY 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRCLK+KPLNV+TLDNKL FEVHLYSFSGDSESKFV QPLN+ICA IM+ FIDHAEF
Sbjct: 241 DNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           VIEG NPFPLFVSEYGYDQREVD+AENRFMSCFTAHL QKDLDWALWTWQGSYYYREGQA
Sbjct: 301 VIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQA 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E AETFGVL+SNWTQIKNPNF ++FQLLQTMLQDP SNAS SYV+YH QSGQC +VSND 
Sbjct: 361 ELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEVSNDN 420

Query: 421 TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNS 480
             IFL NCSTSSRWSH+ D TPI+++STGLCLK++GEGL ASLS DC+ +QS W AISNS
Sbjct: 421 KEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSAISNS 480

Query: 481 KLHLATFTQDGNNLCLQ-IENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
            LHL T T+DG +LCLQ IE+SNSSKIV NSCICT  DP CL+DTQSQW  LVATNTL
Sbjct: 481 NLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATNTL 538

BLAST of Cla007754 vs. NCBI nr
Match: gi|659090006|ref|XP_008445780.1| (PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo])

HSP 1 Score: 802.7 bits (2072), Expect = 3.8e-229
Identity = 384/448 (85.71%), Postives = 410/448 (91.52%), Query Frame = 1

Query: 90  MFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEAYEAVVDVLGASGLMVIADN 149
           MFTRYANRT+EENFDLLDL QAKAGL Q+NPFVLNKTI EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 150 HISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFINKSTVVAMSLRNEIRGMMEN 209
           H+SQPRWCCSLDDGNGFFGNR FDPQEWLQGL LVAQRF NKSTVV MSLRNEIRGMMEN
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 210 ANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEKPLNVNTLDNKLVFEVHLYS 269
           ANDWN+YVTQGVTTIHNINPEV+VIV GLNYDNDLRCLKEKPLNV+TLDNKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 270 FSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVDEAENRF 329
           FSG SESKFV QPLN+ICA I+N FIDHAEFVIEGSNPFPLFVSEYGYDQREVD+AENRF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 330 MSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLNSNWTQIKNPNFPKRFQLLQ 389
           MSCFTAHL QKDLDWALWTWQGSYYYREGQAE  ETFGVL SNWTQIKNPNF ++FQLLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 390 TMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCSTSSRWSHNGDGTPIRLTSTG 449
           TMLQDPNSNAS SYV+YHPQSGQC +VSND   IFL NCSTSSRWSH+ D TPI++++TG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTG 360

Query: 450 LCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQDGNNLCLQIENSNSSKIVVN 509
           LCLK++GEGL ASLSNDCL +QS W AISNSKLHLAT T++G +LCLQIE+SNSSKIV N
Sbjct: 361 LCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTN 420

Query: 510 SCICTNGDPKCLEDTQSQWLTLVATNTL 538
           SCICT  DP CL+DTQSQW  LV TNTL
Sbjct: 421 SCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of Cla007754 vs. NCBI nr
Match: gi|659073199|ref|XP_008467306.1| (PREDICTED: uncharacterized protein LOC103504686 [Cucumis melo])

HSP 1 Score: 797.0 bits (2057), Expect = 2.1e-227
Identity = 375/539 (69.57%), Postives = 440/539 (81.63%), Query Frame = 1

Query: 1   MGRNLQTILFLAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAE 60
           MG   Q    +     F S  +YSLPLST GRWIVDS TGHRVKLVCVNWPSHTQSML E
Sbjct: 1   MGITTQFSFVVLAFICFFSSLSYSLPLSTNGRWIVDSATGHRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNP 120
           GL+ RPLK+LA+E ++L+FNCVRLTYATHMFTRYANRT+EENFDLLDL+ +K GLA HNP
Sbjct: 61  GLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLALHNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLN TI EAYEAVVDVLG SGLMVIADNHISQPRWCCSL+DGNGFFG+R FD +EWL+G
Sbjct: 121 FVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEEWLEG 180

Query: 181 LRLVAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNY 240
           LRLVA+RF NKS VVAMSLRNE+RG    + DWN YVTQG TTIHNINP ++VI+SGLN+
Sbjct: 181 LRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIISGLNF 240

Query: 241 DNDLRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEF 300
           DNDLRC ++ PL +N L NKLVFEVHLYSFSG+S+SKF+  PLN IC+ I+NGF+  AEF
Sbjct: 241 DNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQRAEF 300

Query: 301 VIEGSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQA 360
           V+EG+   PLFVSE+G DQ  V+EA++RF+SCF+AHLV+KDLDWALW WQGSYYYR+G+ 
Sbjct: 301 VMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKV 360

Query: 361 EPAETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK 420
           E  E FGVLN NW+ ++NP F + FQLLQTMLQDPNSN+SN+Y+MYHPQSGQC QV + K
Sbjct: 361 ELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQVHDMK 420

Query: 421 -TGIFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISN 480
              IFLNNCS +S WS+ GDGTPI L ST  CLK+NG GL  SLS DC  +QS W AIS+
Sbjct: 421 QKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWTAISD 480

Query: 481 SKLHLATFTQDGNN-LCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           SKLHLAT T+ GNN +CL+ E+SNSS+I++ SC+C   D  CL+DTQ+QW  LV TNTL
Sbjct: 481 SKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVTNTL 539

BLAST of Cla007754 vs. NCBI nr
Match: gi|449451950|ref|XP_004143723.1| (PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus])

HSP 1 Score: 796.2 bits (2055), Expect = 3.5e-227
Identity = 373/536 (69.59%), Postives = 445/536 (83.02%), Query Frame = 1

Query: 7   TILFLAIVFQFLSFSA---YSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLN 66
           T  F  +V  F+SF +   YSLPLST GRWIVDS TG RVKLVCVNWPSHTQSML EGL+
Sbjct: 4   TTQFGFVVLAFISFFSSLSYSLPLSTNGRWIVDSATGRRVKLVCVNWPSHTQSMLIEGLD 63

Query: 67  HRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVL 126
            RPLK+LA+E ++LRFNCVRLTYATHMFTRYANRT+EENFDLLDL+ AK GLA HNPFVL
Sbjct: 64  RRPLKDLANEVVRLRFNCVRLTYATHMFTRYANRTVEENFDLLDLRAAKVGLAFHNPFVL 123

Query: 127 NKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRL 186
           N TI EAYEAVVDVLG SGLMVIADNHISQPRWCCSL+DGNGFFG+R FD +EWL+GLRL
Sbjct: 124 NMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDTEEWLEGLRL 183

Query: 187 VAQRFINKSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDND 246
           VA+RF NKS VVAMSLRNE+RG    + DWN Y+TQG TTIHNINP+++VI+SGLN+DND
Sbjct: 184 VARRFYNKSAVVAMSLRNELRGASSKSKDWNKYITQGATTIHNINPKILVIISGLNFDND 243

Query: 247 LRCLKEKPLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIE 306
           LRC ++ PL +N L NKLVFEVHLYSFSG+S+SKF+  PLN IC+ ++NGF++ AEFV+E
Sbjct: 244 LRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKVINGFVERAEFVME 303

Query: 307 GSNPFPLFVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPA 366
           G+   PLFVSE+G DQR V+EA++RF+SCF+AHLV+KDLDWALW WQGSYYYR+G+  P 
Sbjct: 304 GAEAVPLFVSEFGLDQRGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKVGPE 363

Query: 367 ETFGVLNSNWTQIKNPNFPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDK-TG 426
           E FGVLN NW+ ++NP+F + FQLLQTMLQDPNSN+SN+YVMYHPQSGQC  V + K   
Sbjct: 364 EVFGVLNYNWSDVRNPHFSQMFQLLQTMLQDPNSNSSNTYVMYHPQSGQCVLVQDMKHMQ 423

Query: 427 IFLNNCSTSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKL 486
           I+LN+CS +S WS+ GDGTPI L ST  CLK++G+GL  SLS DC  +QS W AIS+SKL
Sbjct: 424 IYLNDCSNASHWSYEGDGTPIMLASTNFCLKASGDGLPPSLSRDCFGEQSVWTAISDSKL 483

Query: 487 HLATFTQDGNN-LCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATNTL 538
           HLAT T+ GNN +CL+ E+SNSS+I++ SC+C   D  CL+DTQ+QW  LV TNTL
Sbjct: 484 HLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGNDSNCLQDTQAQWFQLVVTNTL 539

BLAST of Cla007754 vs. NCBI nr
Match: gi|700195218|gb|KGN50395.1| (hypothetical protein Csa_5G171770 [Cucumis sativus])

HSP 1 Score: 612.1 bits (1577), Expect = 9.5e-172
Identity = 296/526 (56.27%), Postives = 379/526 (72.05%), Query Frame = 1

Query: 11  LAIVFQFLSFSAYSLPLSTKGRWIVDSKTGHRVKLVCVNWPSHTQSMLAEGLNHRPLKEL 70
           L  VF  L+F AYSLPLST GRWIVD+ TG RVKL+CVNWP H Q MLAEGL+ RPL ++
Sbjct: 12  LVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRRPLDDI 71

Query: 71  ADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQHNPFVLNKTIVEA 130
                KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+AQ+NP ++N T+VEA
Sbjct: 72  ISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNLTLVEA 131

Query: 131 YEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLRLVAQRFIN 190
           Y AVVD L A G+MV++DNHISQPRWCC+ DDGNGFFG+R FDP+EWLQG+ L AQ   +
Sbjct: 132 YGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAAQSLKS 191

Query: 191 KSTVVAMSLRNEIRGMMENANDWNNYVTQGVTTIHNINPEVVVIVSGLNYDNDLRCLKEK 250
           K+ VVAMS+RNE RG  +N   W  Y++QG   IH INP  +V+VSGL+YD DL  LK +
Sbjct: 192 KAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVSGLSYDTDLSFLKNR 251

Query: 251 PLNVNTLDNKLVFEVHLYSFSGDSESKFVTQPLNDICATIMNGFIDHAEFVIEGSNPFPL 310
            +  N LDNKLVFE HLYSF+ +    ++++PLN  CA++  GF D A F++ G NP PL
Sbjct: 252 SMGFN-LDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQNPMPL 311

Query: 311 FVSEYGYDQREVDEAENRFMSCFTAHLVQKDLDWALWTWQGSYYYREGQAEPAETFGVLN 370
           FVSE+G DQR V+E +NRF+SCF ++L + D DW LW  QGSYYYREG     E FGVL+
Sbjct: 312 FVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYREGVKNAEENFGVLD 371

Query: 371 SNWTQIKNPN-FPKRFQLLQTMLQDPNSNASNSYVMYHPQSGQCAQVSNDKTGIFLNNCS 430
           S + + KN   F +RFQL+QT LQDP+SN + S +MYHP SG C ++ N K  + +++C 
Sbjct: 372 STFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVRM-NKKYQLGISSCK 431

Query: 431 TSSRWSHNGDGTPIRLTSTGLCLKSNGEGLGASLSNDCLSQQSAWRAISNSKLHLATFTQ 490
           TS+RW H  D +PI+L  + LCLK+ G GL   LS DC SQQS W+  S++KL LAT  +
Sbjct: 432 TSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWKYGSSAKLQLATVDE 491

Query: 491 DGNNLCLQIENSNSSKIVVNSCICTNGDPKCLEDTQSQWLTLVATN 536
            G  LCLQ   S+S +IV N C+C+N D +C ED QSQW TLV +N
Sbjct: 492 QGQALCLQRAASHSHQIVTNKCLCSN-DSQCQEDPQSQWFTLVPSN 534

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUNA_XANCP2.4e-1022.93Major extracellular endoglucanase OS=Xanthomonas campestris pv. campestris (stra... [more]
Match NameE-valueIdentityDescription
A0A0A0K853_CUCSA2.4e-27084.39Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1[more]
A0A0A0KL32_CUCSA6.6e-17256.27Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1[more]
A0A0A0KNB6_CUCSA3.1e-16955.89Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1[more]
B9RCJ5_RICCO4.2e-16653.46Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO... [more]
A0A059CGH5_EUCGR2.2e-15952.15Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01800 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
gi|778721997|ref|XP_011658389.1|3.4e-27084.39PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus][more]
gi|659090006|ref|XP_008445780.1|3.8e-22985.71PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo][more]
gi|659073199|ref|XP_008467306.1|2.1e-22769.57PREDICTED: uncharacterized protein LOC103504686 [Cucumis melo][more]
gi|449451950|ref|XP_004143723.1|3.5e-22769.59PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus][more]
gi|700195218|gb|KGN50395.1|9.5e-17256.27hypothetical protein Csa_5G171770 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
IPR013781Glycoside hydrolase, catalytic domain
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla007754Cla007754.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000772Ricin B, lectin domainPROFILEPS50231RICIN_B_LECTINcoord: 400..531
score: 10
IPR000772Ricin B, lectin domainunknownSSF50370Ricin B-like lectinscoord: 402..512
score: 1.4
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 63..350
score: 1.8
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 25..390
score: 1.1
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 26..380
score: 1.22
NoneNo IPR availableGENE3DG3DSA:2.80.10.50coord: 409..511
score: 1.
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 1..406
score: 4.4E-228coord: 452..537
score: 4.4E
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 1..406
score: 4.4E-228coord: 452..537
score: 4.4E