Cla011399 (gene) Watermelon (97103) v1

Name: Cla011399
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Cellulase (Glycosyl hydrolase family 5) protein (AHRD V1 **-- Q9LTM8_ARATH); contains InterPro domain(s) IPR013781 Glycoside hydrolase, subgroup, catalytic core
Location: Chr1 : 1979420 .. 1980807 (+)
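
The location field packs chromosome, start, end, and strand into one string. A minimal sketch of pulling those parts out and deriving the gene span, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); `parse_location` is a hypothetical helper name, not part of any database API:

```python
import re

# Pattern for strings such as "Chr1 : 1979420 .. 1980807 (+)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand, and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }


loc = parse_location("Chr1 : 1979420 .. 1980807 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1388 bp on the plus strand of Chr1.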
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTATGTTTGAAATTATATAAAATAGGGTTATTACCTATGAGGGCAGTTCGACAATAGTTTCTAAACATTGATTTTATCGTTATTTGCTCGTTATTTGCTGGCTAAATTACATCAAATATTCTTCAACAATGTTTTCTTTTAAAATTTTATTTTTTAAGAAAATCTAAAAGTTTCTTAGAAATACATAATTGAAACCAATCAGTTGTTTTATATAATTTAAATTGTTTTTTATATGATATTTTGACCACTTCCTTGATATATATATATACATGCAAAAGGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

mRNA sequence

ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

Coding sequence (CDS)

ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

Protein sequence

MTCKMTYKHIALACKMTYKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKPVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLWVCKVAIIIGKT
BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 1.1e-160
Identity = 276/342 (80.70%), Positives = 307/342 (89.77%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +K+IAL CVFV+LT KAYSLPLSTNGRWI+D+TT Q VKL+ VNWP HMQ MLAEGLH R
Sbjct: 7   WKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI ++VAKL FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDAM GIAQNN  ++N+
Sbjct: 67  PLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNL 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           TLVEAYG VVDSL  +G+MV+SDNHISQPRWCC++DDGNGF GDRYFDP+EWL GISLAA
Sbjct: 127 TLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           QSLK K  VV MS+RNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +PM LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347
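
The percentages in each HSP summary line follow from the match counts divided by the alignment length. A small sketch of that arithmetic, using the counts reported for the first hit (276 identical and 307 positive-scoring positions over 342 aligned columns); `percent_of_alignment` is a hypothetical helper name:

```python
def percent_of_alignment(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"


# Counts taken from the HSP summary of the first TrEMBL hit.
print(percent_of_alignment(276, 342))  # identity
print(percent_of_alignment(307, 342))  # positives
```

These reproduce the 80.70% and 89.77% figures in the summary line; the same calculation applies to the other hits.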

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0KNB6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 4.3e-154
Identity = 266/342 (77.78%), Positives = 299/342 (87.43%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +++IAL CVFV L  KA SLPLSTNGRWI+D+TT   VKL+ VNW  HMQ MLAEGLH R
Sbjct: 7   WRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLHLR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI  +V K  FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDA+ GIAQNN  +LNM
Sbjct: 67  PLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSILNM 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           T+V+AYG V+DSL  + +MV+SDNHISQPRWCC++DDGNGF GDRYFDPQEWL GISLAA
Sbjct: 127 TVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           Q+LK K  VV MSLRNEPRGPNQNVE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0L6S7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G185670 PE=3 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 3.8e-118
Identity = 207/258 (80.23%), Positives = 232/258 (89.92%), Query Frame = 1

Query: 102 MFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDN 161
           MFTRYANLTV+QS ENFDMK+A+ GIAQNN  +LNM +VEAY  VVDSL  +G+MV+SDN
Sbjct: 1   MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 60

Query: 162 HISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKP-VVVMSLRNEPRGPNQN 221
           HISQPRWCCS+DDGNGF GDRYF+ QEWL G+SLA QSLK KP VV MSLRNEPRGPNQN
Sbjct: 61  HISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPNQN 120

Query: 222 VEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYS 281
           VE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFLKN+SMGF LDNKLVFEAHLYS
Sbjct: 121 VEMWFQYMSQGAKLVHQINPN-ALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYS 180

Query: 282 FTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRF 341
           FTNNM D WTSKPLNTFCA+VNQGF++ AGFL+RGQ+P+ LFVS+FGI+Q G NEGQNRF
Sbjct: 181 FTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRF 240

Query: 342 LSCFFTYLTENDFDWGLW 359
           LSCFFTYLT+NDFDWGLW
Sbjct: 241 LSCFFTYLTKNDFDWGLW 257

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 1.2e-100
Identity = 186/340 (54.71%), Positives = 235/340 (69.12%), Query Frame = 1

Query: 21  IALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLN 80
           + LA V V  +  AYSLPLST+GRWIIDS + + VKLV VNWPSH Q+ML EGL+HRPL 
Sbjct: 9   LLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIEGLNHRPLK 68

Query: 81  DITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLV 140
           ++     KL FN V+LTY+ HMFTRYAN TV+++ +  D++ A  G+AQ N FVLN T+ 
Sbjct: 69  ELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNPFVLNKTIA 128

Query: 141 EAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSL 200
           EAY  VVD L  +G+MVI+DNH+SQPRWCCS DDGNGF G+RYFDPQEWL G+SL AQ  
Sbjct: 129 EAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRF 188

Query: 201 KGKPVVV-MSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFL 260
             K  VV MSLRNE RG  +N   W  Y+ QG   +H+INP  VLV+VSGL+ D DL  L
Sbjct: 189 NNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINP-AVLVIVSGLNYDNDLRCL 248

Query: 261 KNKSMGF-ILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSP 320
           K+K +    LDNKL FE HLYSF+ +    +  +PLN  CA +   F + A F+I G +P
Sbjct: 249 KDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEFVIEGPNP 308

Query: 321 MLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
             LFVS++G DQ   ++ +NRF+SCF  +L + D DW LW
Sbjct: 309 FPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALW 347

BLAST of Cla011399 vs. TrEMBL
Match: B9RCJ5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1689380 PE=3 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 5.5e-93
Identity = 172/343 (50.15%), Positives = 236/343 (68.80%), Query Frame = 1

Query: 19  KHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRP 78
           K I     F+++   +YSLPLS N RWIID+ + + VKL  VNW SH+Q MLAEGL  +P
Sbjct: 7   KTILFFSFFLLVLSLSYSLPLSINKRWIIDAKSGERVKLACVNWASHLQPMLAEGLDKKP 66

Query: 79  LNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMT 138
           L+ + + +A+  FN V+ T + HMFTRY  LTV QS ++ ++  A  GIA++N F+LN+T
Sbjct: 67  LSYLASKLARYHFNCVRFTCATHMFTRYGKLTVAQSFDSLNLTKAKAGIARHNSFLLNLT 126

Query: 139 LVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQ 198
           +V+AY  VV+ L  +G+MV+ DNH+SQP+WCC  DD NGF GD +F P+EWL G+++ A+
Sbjct: 127 VVQAYEAVVNELGAHGLMVLLDNHVSQPKWCCPQDDENGFFGDIHFHPKEWLRGLAIVAK 186

Query: 199 SLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLS 258
             +GK  VV MS+RNE RGP QN   W++YI +GA+++H++NP  VLV+VSGL   TDLS
Sbjct: 187 IFQGKSQVVAMSMRNELRGPYQNEHDWYKYIQEGARMVHKLNP-EVLVLVSGLVWGTDLS 246

Query: 259 FLKNK--SMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRG 318
           FLK K   +G  LDNKLV+EAH YSF+ +    W  +PLN  C    Q   + +GF+I G
Sbjct: 247 FLKKKPLHLGLNLDNKLVYEAHWYSFSGD-PKVWEVQPLNRICDLKTQIQVDLSGFVITG 306

Query: 319 QSPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           ++P+ LF+ + GIDQ G N   NRF +CF  Y+ END DWGLW
Sbjct: 307 ENPVPLFLGEVGIDQRGVNRADNRFFTCFLAYVAENDLDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|700195218|gb|KGN50395.1| (hypothetical protein Csa_5G171770 [Cucumis sativus])

HSP 1 Score: 574.3 bits (1479), Expect = 1.5e-160
Identity = 276/342 (80.70%), Positives = 307/342 (89.77%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +K+IAL CVFV+LT KAYSLPLSTNGRWI+D+TT Q VKL+ VNWP HMQ MLAEGLH R
Sbjct: 7   WKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI ++VAKL FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDAM GIAQNN  ++N+
Sbjct: 67  PLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNL 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           TLVEAYG VVDSL  +G+MV+SDNHISQPRWCC++DDGNGF GDRYFDP+EWL GISLAA
Sbjct: 127 TLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           QSLK K  VV MS+RNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +PM LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|659073106|ref|XP_008467257.1| (PREDICTED: uncharacterized protein LOC103504654 [Cucumis melo])

HSP 1 Score: 557.4 bits (1435), Expect = 1.9e-155
Identity = 273/342 (79.82%), Positives = 302/342 (88.30%), Query Frame = 1

Query: 19  KHIALACVFVML-TIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 78
           K+IAL CVFV+L T KA+SLPLSTNGRWIID+TT + VKL+ VNW  HMQ ML EGLH R
Sbjct: 8   KNIALVCVFVLLLTFKAFSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRR 67

Query: 79  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 138
           PL+DI  +VAKL FN V+LTYSIHMFTR+ANLTV+QS ENFDMKDAM GIAQNN  +LN+
Sbjct: 68  PLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIAQNNPSILNL 127

Query: 139 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 198
           TLVEAYG VVDSL  +GIMV+SDNHISQPRWCC ++DGNGF GDRYFDPQEWL GISLAA
Sbjct: 128 TLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQEWLQGISLAA 187

Query: 199 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 258
           QSLK K  VV MSLRNE RGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 188 QSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 247

Query: 259 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 318
           SFLKN+SMGF LDNKLVFEAHLYSFTNNM D W SKPLNTFCAS+NQGF++ AGFL+RGQ
Sbjct: 248 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQ 307

Query: 319 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQTGTNEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 308 NPIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLW 348

BLAST of Cla011399 vs. NCBI nr
Match: gi|778708310|ref|XP_011656162.1| (PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus])

HSP 1 Score: 552.4 bits (1422), Expect = 6.1e-154
Identity = 266/342 (77.78%), Positives = 299/342 (87.43%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +++IAL CVFV L  KA SLPLSTNGRWI+D+TT   VKL+ VNW  HMQ MLAEGLH R
Sbjct: 7   WRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLHLR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI  +V K  FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDA+ GIAQNN  +LNM
Sbjct: 67  PLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSILNM 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           T+V+AYG V+DSL  + +MV+SDNHISQPRWCC++DDGNGF GDRYFDPQEWL GISLAA
Sbjct: 127 TVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           Q+LK K  VV MSLRNEPRGPNQNVE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|659118742|ref|XP_008459280.1| (PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo])

HSP 1 Score: 543.9 bits (1400), Expect = 2.2e-151
Identity = 262/333 (78.68%), Positives = 293/333 (87.99%), Query Frame = 1

Query: 27  FVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLNDITTMV 86
           ++ L  +AYSLPLSTNGRWIID+TT + VKL+ VNW  HMQ ML EGLH RPL+DI  +V
Sbjct: 188 YIDLEPEAYSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRRPLDDIAALV 247

Query: 87  AKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVV 146
           AKL FN V+LTYSIHMFTR+AN+TV+QS ENFDMKDA+ GIAQNN  +LN+TLVEAYG V
Sbjct: 248 AKLRFNCVRLTYSIHMFTRHANVTVKQSFENFDMKDALVGIAQNNPSILNLTLVEAYGAV 307

Query: 147 VDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGK-PV 206
           VDSL  +GIMV+SDNHISQPRWCC +DDGNGF GDRYFDPQEW  GISLAAQSLK K  V
Sbjct: 308 VDSLAAHGIMVVSDNHISQPRWCCDNDDGNGFFGDRYFDPQEWFQGISLAAQSLKSKAQV 367

Query: 207 VVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMG 266
           V MSLRNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFL+N+SMG
Sbjct: 368 VAMSLRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDLSFLRNRSMG 427

Query: 267 FILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSK 326
           F LDNKLVFEAHLYSFTNNM D W SKPLNTFCAS+NQGF++ AGFL+RGQ+P+ LFVS+
Sbjct: 428 FNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQNPIPLFVSE 487

Query: 327 FGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           FGIDQTGTNEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 488 FGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLW 519

BLAST of Cla011399 vs. NCBI nr
Match: gi|700202305|gb|KGN57438.1| (hypothetical protein Csa_3G185670 [Cucumis sativus])

HSP 1 Score: 433.0 bits (1112), Expect = 5.4e-118
Identity = 207/258 (80.23%), Positives = 232/258 (89.92%), Query Frame = 1

Query: 102 MFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDN 161
           MFTRYANLTV+QS ENFDMK+A+ GIAQNN  +LNM +VEAY  VVDSL  +G+MV+SDN
Sbjct: 1   MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 60

Query: 162 HISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKP-VVVMSLRNEPRGPNQN 221
           HISQPRWCCS+DDGNGF GDRYF+ QEWL G+SLA QSLK KP VV MSLRNEPRGPNQN
Sbjct: 61  HISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPNQN 120

Query: 222 VEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYS 281
           VE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFLKN+SMGF LDNKLVFEAHLYS
Sbjct: 121 VEMWFQYMSQGAKLVHQINPN-ALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYS 180

Query: 282 FTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRF 341
           FTNNM D WTSKPLNTFCA+VNQGF++ AGFL+RGQ+P+ LFVS+FGI+Q G NEGQNRF
Sbjct: 181 FTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRF 240

Query: 342 LSCFFTYLTENDFDWGLW 359
           LSCFFTYLT+NDFDWGLW
Sbjct: 241 LSCFFTYLTKNDFDWGLW 257

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KL32_CUCSA  1.1e-160  80.70     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1
A0A0A0KNB6_CUCSA  4.3e-154  77.78     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1
A0A0A0L6S7_CUCSA  3.8e-118  80.23     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G185670 PE=3 SV=1
A0A0A0K853_CUCSA  1.2e-100  54.71     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1
B9RCJ5_RICCO      5.5e-93   50.15     Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO...

Match Name                        E-value   Identity  Description
gi|700195218|gb|KGN50395.1|       1.5e-160  80.70     hypothetical protein Csa_5G171770 [Cucumis sativus]
gi|659073106|ref|XP_008467257.1|  1.9e-155  79.82     PREDICTED: uncharacterized protein LOC103504654 [Cucumis melo]
gi|778708310|ref|XP_011656162.1|  6.1e-154  77.78     PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus]
gi|659118742|ref|XP_008459280.1|  2.2e-151  78.68     PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo]
gi|700202305|gb|KGN57438.1|       5.4e-118  80.23     hypothetical protein Csa_3G185670 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001547   Glyco_hydro_5
IPR013781   Glycoside hydrolase, catalytic domain
IPR017853   Glycoside_hydrolase_SF
Vocabulary: Molecular Function
Term        Definition
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term        Definition
GO:0005975  carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0005975      carbohydrate metabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU65260      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla011399     Cla011399.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU65260      WMU65260     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source   Source Term       Source Description   Alignment
IPR001547  Glycoside hydrolase, family 5          PFAM     PF00150           Cellulase            coord: 129..359, score: 4.3
IPR013781  Glycoside hydrolase, catalytic domain  GENE3D   G3DSA:3.20.20.80                       coord: 37..360, score: 1.9
IPR017853  Glycoside hydrolase superfamily        unknown  SSF51445          (Trans)glycosidases  coord: 38..359, score: 1.18
None       No IPR available                       PANTHER  PTHR31263         FAMILY NOT NAMED     coord: 10..358, score: 6.3E
None       No IPR available                       PANTHER  PTHR31263:SF10    SUBFAMILY NOT NAMED  coord: 10..358, score: 6.3E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene       Organism                      Block
Cla011399  Cucurbita moschata (Rifu)     cmowmB267
Cla011399  Cucurbita moschata (Rifu)     cmowmB313
Cla011399  Cucurbita moschata (Rifu)     cmowmB563
Cla011399  Melon (DHL92) v3.5.1          mewmB029
Cla011399  Melon (DHL92) v3.5.1          mewmB101
Cla011399  Melon (DHL92) v3.5.1          mewmB480
Cla011399  Melon (DHL92) v3.5.1          mewmB530
Cla011399  Watermelon (Charleston Gray)  wcgwmB142
Cla011399  Watermelon (Charleston Gray)  wcgwmB298
Cla011399  Watermelon (Charleston Gray)  wcgwmB337
Cla011399  Cucumber (Chinese Long) v2    cuwmB260
Cla011399  Cucumber (Chinese Long) v2    cuwmB330
Cla011399  Cucumber (Chinese Long) v2    cuwmB408
Cla011399  Cucumber (Chinese Long) v2    cuwmB504
Cla011399  Cucurbita pepo (Zucchini)     cpewmB183
Cla011399  Cucurbita pepo (Zucchini)     cpewmB717
Cla011399  Bottle gourd (USVL1VR-Ls)     lsiwmB027
Cla011399  Bottle gourd (USVL1VR-Ls)     lsiwmB032
Cla011399  Bottle gourd (USVL1VR-Ls)     lsiwmB378
Cla011399  Cucumber (Gy14) v2            cgybwmB304
Cla011399  Cucumber (Gy14) v2            cgybwmB381
Cla011399  Cucumber (Gy14) v2            cgybwmB471
Cla011399  Melon (DHL92) v3.6.1          medwmB027
Cla011399  Melon (DHL92) v3.6.1          medwmB099
Cla011399  Melon (DHL92) v3.6.1          medwmB515
Cla011399  Silver-seed gourd             carwmB0925
Cla011399  Cucumber (Chinese Long) v3    cucwmB270
Cla011399  Cucumber (Chinese Long) v3    cucwmB343
Cla011399  Cucumber (Chinese Long) v3    cucwmB433
Cla011399  Cucumber (Chinese Long) v3    cucwmB529
Cla011399  Watermelon (97103) v2         wmwmbB235
Cla011399  Watermelon (97103) v2         wmwmbB266
Cla011399  Watermelon (97103) v2         wmwmbB273
Cla011399  Wax gourd                     wgowmB076
Cla011399  Watermelon (97103) v1         wmwmB122
Cla011399  Watermelon (97103) v1         wmwmB142
Cla011399  Watermelon (97103) v1         wmwmB151
Cla011399  Cucumber (Gy14) v1            cgywmB150
Cla011399  Cucumber (Gy14) v1            cgywmB232
Cla011399  Cucumber (Gy14) v1            cgywmB552
Cla011399  Cucurbita maxima (Rimu)       cmawmB277
Cla011399  Cucurbita maxima (Rimu)       cmawmB319
Cla011399  Cucurbita maxima (Rimu)       cmawmB568
Cla011399  Cucurbita maxima (Rimu)       cmawmB712