Cla011399.1 (mRNA) Watermelon (97103) v1

NameCla011399
TypemRNA
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCellulase (Glycosyl hydrolase family 5) protein (AHRD V1 **-- Q9LTM8_ARATH); contains Interpro domain(s) IPR013781 Glycoside hydrolase, subgroup, catalytic core
LocationChr1 : 1979420 .. 1980807 (+)
Sequence length1110
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTATGTTTGAAATTATATAAAATAGGGTTATTACCTATGAGGGCAGTTCGACAATAGTTTCTAAACATTGATTTTATCGTTATTTGCTCGTTATTTGCTGGCTAAATTACATCAAATATTCTTCAACAATGTTTTCTTTTAAAATTTTATTTTTTAAGAAAATCTAAAAGTTTCTTAGAAATACATAATTGAAACCAATCAGTTGTTTTATATAATTTAAATTGTTTTTTATATGATATTTTGACCACTTCCTTGATATATATATATACATGCAAAAGGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

mRNA sequence

ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

Coding sequence (CDS)

ATGACTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTAAAATGACATACAAACACATTGCCTTAGCTTGTGTTTTTGTTATGTTAACCATTAAAGCTTATTCGTTGCCTCTTTCAACCAATGGAAGATGGATTATTGACTCTACAACAAACCAATGCGTGAAGTTAGTATTTGTGAATTGGCCTTCACACATGCAGGCAATGCTAGCAGAGGGTCTTCATCATCGTCCTCTAAATGACATCACCACTATGGTGGCAAAATTGTGGTTTAATCGTGTGCAGTTGACGTACTCGATCCACATGTTTACACGCTATGCCAATTTGACCGTTCAACAATCCTTGGAGAATTTTGATATGAAAGATGCAATGACAGGTATAGCCCAAAACAATCATTTTGTGTTAAATATGACATTAGTTGAAGCTTATGGAGTGGTGGTAGATTCACTTACAACAAATGGAATCATGGTGATTTCTGATAACCATATAAGCCAGCCAAGATGGTGTTGTAGTGATGATGATGGTAATGGCTTTCTTGGAGATCGGTATTTTGATCCTCAAGAATGGCTTCATGGAATTAGTTTGGCAGCTCAAAGCTTAAAGGGTAAACCCGTTGTGGTAATGAGTCTAAGAAATGAACCGCGAGGACCAAATCAAAATGTGGAGGTGTGGTTTCAATACATCAACCAAGGAGCTAAGCTTATGCACCAAATTAACCCAAATCATGTTTTAGTAGTGGTTTCTGGACTAAGTTGTGATACCGATCTAAGCTTCTTGAAGAATAAGTCGATGGGCTTCATCTTGGACAACAAGCTTGTATTTGAGGCTCACTTGTACTCCTTTACAAACAACATGGGAGATTCTTGGACATCGAAGCCATTAAACACATTTTGTGCTAGCGTGAACCAAGGATTTAAAAACTGGGCTGGGTTTCTTATTAGAGGACAAAGCCCAATGCTTCTCTTTGTGAGTAAGTTTGGGATTGACCAAACAGGCACCAATGAGGGTCAAAATCGATTCTTGAGTTGCTTTTTTACCTATCTTACCGAGAATGATTTTGATTGGGGCTTGTGGGTTTGCAAGGTAGCTATTATTATAGGGAAGACGTGA

Protein sequence

MTCKMTYKHIALACKMTYKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKPVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLWVCKVAIIIGKT
BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 1.1e-160
Identity = 276/342 (80.70%), Postives = 307/342 (89.77%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +K+IAL CVFV+LT KAYSLPLSTNGRWI+D+TT Q VKL+ VNWP HMQ MLAEGLH R
Sbjct: 7   WKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI ++VAKL FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDAM GIAQNN  ++N+
Sbjct: 67  PLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNL 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           TLVEAYG VVDSL  +G+MV+SDNHISQPRWCC++DDGNGF GDRYFDP+EWL GISLAA
Sbjct: 127 TLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           QSLK K  VV MS+RNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +PM LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0KNB6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 4.3e-154
Identity = 266/342 (77.78%), Postives = 299/342 (87.43%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +++IAL CVFV L  KA SLPLSTNGRWI+D+TT   VKL+ VNW  HMQ MLAEGLH R
Sbjct: 7   WRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLHLR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI  +V K  FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDA+ GIAQNN  +LNM
Sbjct: 67  PLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSILNM 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           T+V+AYG V+DSL  + +MV+SDNHISQPRWCC++DDGNGF GDRYFDPQEWL GISLAA
Sbjct: 127 TVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           Q+LK K  VV MSLRNEPRGPNQNVE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0L6S7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G185670 PE=3 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 3.8e-118
Identity = 207/258 (80.23%), Postives = 232/258 (89.92%), Query Frame = 1

Query: 102 MFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDN 161
           MFTRYANLTV+QS ENFDMK+A+ GIAQNN  +LNM +VEAY  VVDSL  +G+MV+SDN
Sbjct: 1   MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 60

Query: 162 HISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKP-VVVMSLRNEPRGPNQN 221
           HISQPRWCCS+DDGNGF GDRYF+ QEWL G+SLA QSLK KP VV MSLRNEPRGPNQN
Sbjct: 61  HISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPNQN 120

Query: 222 VEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYS 281
           VE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFLKN+SMGF LDNKLVFEAHLYS
Sbjct: 121 VEMWFQYMSQGAKLVHQINPN-ALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYS 180

Query: 282 FTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRF 341
           FTNNM D WTSKPLNTFCA+VNQGF++ AGFL+RGQ+P+ LFVS+FGI+Q G NEGQNRF
Sbjct: 181 FTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRF 240

Query: 342 LSCFFTYLTENDFDWGLW 359
           LSCFFTYLT+NDFDWGLW
Sbjct: 241 LSCFFTYLTKNDFDWGLW 257

BLAST of Cla011399 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 1.2e-100
Identity = 186/340 (54.71%), Postives = 235/340 (69.12%), Query Frame = 1

Query: 21  IALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLN 80
           + LA V V  +  AYSLPLST+GRWIIDS + + VKLV VNWPSH Q+ML EGL+HRPL 
Sbjct: 9   LLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIEGLNHRPLK 68

Query: 81  DITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLV 140
           ++     KL FN V+LTY+ HMFTRYAN TV+++ +  D++ A  G+AQ N FVLN T+ 
Sbjct: 69  ELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNPFVLNKTIA 128

Query: 141 EAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSL 200
           EAY  VVD L  +G+MVI+DNH+SQPRWCCS DDGNGF G+RYFDPQEWL G+SL AQ  
Sbjct: 129 EAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRF 188

Query: 201 KGKPVVV-MSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFL 260
             K  VV MSLRNE RG  +N   W  Y+ QG   +H+INP  VLV+VSGL+ D DL  L
Sbjct: 189 NNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINP-AVLVIVSGLNYDNDLRCL 248

Query: 261 KNKSMGF-ILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSP 320
           K+K +    LDNKL FE HLYSF+ +    +  +PLN  CA +   F + A F+I G +P
Sbjct: 249 KDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEFVIEGPNP 308

Query: 321 MLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
             LFVS++G DQ   ++ +NRF+SCF  +L + D DW LW
Sbjct: 309 FPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALW 347

BLAST of Cla011399 vs. TrEMBL
Match: B9RCJ5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1689380 PE=3 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 5.5e-93
Identity = 172/343 (50.15%), Postives = 236/343 (68.80%), Query Frame = 1

Query: 19  KHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRP 78
           K I     F+++   +YSLPLS N RWIID+ + + VKL  VNW SH+Q MLAEGL  +P
Sbjct: 7   KTILFFSFFLLVLSLSYSLPLSINKRWIIDAKSGERVKLACVNWASHLQPMLAEGLDKKP 66

Query: 79  LNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMT 138
           L+ + + +A+  FN V+ T + HMFTRY  LTV QS ++ ++  A  GIA++N F+LN+T
Sbjct: 67  LSYLASKLARYHFNCVRFTCATHMFTRYGKLTVAQSFDSLNLTKAKAGIARHNSFLLNLT 126

Query: 139 LVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQ 198
           +V+AY  VV+ L  +G+MV+ DNH+SQP+WCC  DD NGF GD +F P+EWL G+++ A+
Sbjct: 127 VVQAYEAVVNELGAHGLMVLLDNHVSQPKWCCPQDDENGFFGDIHFHPKEWLRGLAIVAK 186

Query: 199 SLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLS 258
             +GK  VV MS+RNE RGP QN   W++YI +GA+++H++NP  VLV+VSGL   TDLS
Sbjct: 187 IFQGKSQVVAMSMRNELRGPYQNEHDWYKYIQEGARMVHKLNP-EVLVLVSGLVWGTDLS 246

Query: 259 FLKNK--SMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRG 318
           FLK K   +G  LDNKLV+EAH YSF+ +    W  +PLN  C    Q   + +GF+I G
Sbjct: 247 FLKKKPLHLGLNLDNKLVYEAHWYSFSGD-PKVWEVQPLNRICDLKTQIQVDLSGFVITG 306

Query: 319 QSPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           ++P+ LF+ + GIDQ G N   NRF +CF  Y+ END DWGLW
Sbjct: 307 ENPVPLFLGEVGIDQRGVNRADNRFFTCFLAYVAENDLDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|700195218|gb|KGN50395.1| (hypothetical protein Csa_5G171770 [Cucumis sativus])

HSP 1 Score: 574.3 bits (1479), Expect = 1.5e-160
Identity = 276/342 (80.70%), Postives = 307/342 (89.77%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +K+IAL CVFV+LT KAYSLPLSTNGRWI+D+TT Q VKL+ VNWP HMQ MLAEGLH R
Sbjct: 7   WKNIALVCVFVLLTFKAYSLPLSTNGRWIVDATTGQRVKLMCVNWPGHMQGMLAEGLHRR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI ++VAKL FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDAM GIAQNN  ++N+
Sbjct: 67  PLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIAQNNPSLVNL 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           TLVEAYG VVDSL  +G+MV+SDNHISQPRWCC++DDGNGF GDRYFDP+EWL GISLAA
Sbjct: 127 TLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           QSLK K  VV MS+RNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +PM LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|659073106|ref|XP_008467257.1| (PREDICTED: uncharacterized protein LOC103504654 [Cucumis melo])

HSP 1 Score: 557.4 bits (1435), Expect = 1.9e-155
Identity = 273/342 (79.82%), Postives = 302/342 (88.30%), Query Frame = 1

Query: 19  KHIALACVFVML-TIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 78
           K+IAL CVFV+L T KA+SLPLSTNGRWIID+TT + VKL+ VNW  HMQ ML EGLH R
Sbjct: 8   KNIALVCVFVLLLTFKAFSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRR 67

Query: 79  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 138
           PL+DI  +VAKL FN V+LTYSIHMFTR+ANLTV+QS ENFDMKDAM GIAQNN  +LN+
Sbjct: 68  PLDDIAALVAKLRFNCVRLTYSIHMFTRHANLTVKQSFENFDMKDAMAGIAQNNPSILNL 127

Query: 139 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 198
           TLVEAYG VVDSL  +GIMV+SDNHISQPRWCC ++DGNGF GDRYFDPQEWL GISLAA
Sbjct: 128 TLVEAYGAVVDSLVAHGIMVVSDNHISQPRWCCDNNDGNGFFGDRYFDPQEWLQGISLAA 187

Query: 199 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 258
           QSLK K  VV MSLRNE RGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 188 QSLKSKAQVVAMSLRNELRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 247

Query: 259 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 318
           SFLKN+SMGF LDNKLVFEAHLYSFTNNM D W SKPLNTFCAS+NQGF++ AGFL+RGQ
Sbjct: 248 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQ 307

Query: 319 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQTGTNEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 308 NPIPLFVSEFGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLW 348

BLAST of Cla011399 vs. NCBI nr
Match: gi|778708310|ref|XP_011656162.1| (PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus])

HSP 1 Score: 552.4 bits (1422), Expect = 6.1e-154
Identity = 266/342 (77.78%), Postives = 299/342 (87.43%), Query Frame = 1

Query: 18  YKHIALACVFVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHR 77
           +++IAL CVFV L  KA SLPLSTNGRWI+D+TT   VKL+ VNW  HMQ MLAEGLH R
Sbjct: 7   WRNIALVCVFVFLISKACSLPLSTNGRWIVDATTGNRVKLMCVNWAGHMQGMLAEGLHLR 66

Query: 78  PLNDITTMVAKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNM 137
           PL+DI  +V K  FN V+LTYSIHMFTR+ANLTVQQS ENFDMKDA+ GIAQNN  +LNM
Sbjct: 67  PLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIAQNNPSILNM 126

Query: 138 TLVEAYGVVVDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAA 197
           T+V+AYG V+DSL  + +MV+SDNHISQPRWCC++DDGNGF GDRYFDPQEWL GISLAA
Sbjct: 127 TVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQEWLQGISLAA 186

Query: 198 QSLKGK-PVVVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDL 257
           Q+LK K  VV MSLRNEPRGPNQNVE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDL
Sbjct: 187 QNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDL 246

Query: 258 SFLKNKSMGFILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQ 317
           SFLKN+SMGF LDNKLVFEAHLYSFTNNMGD W SKPLNTFCASVNQGF++ AGFL+RGQ
Sbjct: 247 SFLKNRSMGFNLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFEDRAGFLVRGQ 306

Query: 318 SPMLLFVSKFGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           +P+ LFVS+FGIDQ G NEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 307 NPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLW 347

BLAST of Cla011399 vs. NCBI nr
Match: gi|659118742|ref|XP_008459280.1| (PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo])

HSP 1 Score: 543.9 bits (1400), Expect = 2.2e-151
Identity = 262/333 (78.68%), Postives = 293/333 (87.99%), Query Frame = 1

Query: 27  FVMLTIKAYSLPLSTNGRWIIDSTTNQCVKLVFVNWPSHMQAMLAEGLHHRPLNDITTMV 86
           ++ L  +AYSLPLSTNGRWIID+TT + VKL+ VNW  HMQ ML EGLH RPL+DI  +V
Sbjct: 188 YIDLEPEAYSLPLSTNGRWIIDATTGRRVKLMCVNWAGHMQGMLVEGLHRRPLDDIAALV 247

Query: 87  AKLWFNRVQLTYSIHMFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVV 146
           AKL FN V+LTYSIHMFTR+AN+TV+QS ENFDMKDA+ GIAQNN  +LN+TLVEAYG V
Sbjct: 248 AKLRFNCVRLTYSIHMFTRHANVTVKQSFENFDMKDALVGIAQNNPSILNLTLVEAYGAV 307

Query: 147 VDSLTTNGIMVISDNHISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGK-PV 206
           VDSL  +GIMV+SDNHISQPRWCC +DDGNGF GDRYFDPQEW  GISLAAQSLK K  V
Sbjct: 308 VDSLAAHGIMVVSDNHISQPRWCCDNDDGNGFFGDRYFDPQEWFQGISLAAQSLKSKAQV 367

Query: 207 VVMSLRNEPRGPNQNVEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMG 266
           V MSLRNEPRGPNQNVE WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFL+N+SMG
Sbjct: 368 VAMSLRNEPRGPNQNVEKWFQYMSQGAKLIHQINPN-ALVVVSGLSYDTDLSFLRNRSMG 427

Query: 267 FILDNKLVFEAHLYSFTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSK 326
           F LDNKLVFEAHLYSFTNNM D W SKPLNTFCAS+NQGF++ AGFL+RGQ+P+ LFVS+
Sbjct: 428 FNLDNKLVFEAHLYSFTNNMRDFWMSKPLNTFCASINQGFEDRAGFLVRGQNPIPLFVSE 487

Query: 327 FGIDQTGTNEGQNRFLSCFFTYLTENDFDWGLW 359
           FGIDQTGTNEGQNRFLSCFF+YLTENDFDWGLW
Sbjct: 488 FGIDQTGTNEGQNRFLSCFFSYLTENDFDWGLW 519

BLAST of Cla011399 vs. NCBI nr
Match: gi|700202305|gb|KGN57438.1| (hypothetical protein Csa_3G185670 [Cucumis sativus])

HSP 1 Score: 433.0 bits (1112), Expect = 5.4e-118
Identity = 207/258 (80.23%), Postives = 232/258 (89.92%), Query Frame = 1

Query: 102 MFTRYANLTVQQSLENFDMKDAMTGIAQNNHFVLNMTLVEAYGVVVDSLTTNGIMVISDN 161
           MFTRYANLTV+QS ENFDMK+A+ GIAQNN  +LNM +VEAY  VVDSL  +G+MV+SDN
Sbjct: 1   MFTRYANLTVKQSFENFDMKEAIAGIAQNNPTILNMKVVEAYEAVVDSLGAHGVMVVSDN 60

Query: 162 HISQPRWCCSDDDGNGFLGDRYFDPQEWLHGISLAAQSLKGKP-VVVMSLRNEPRGPNQN 221
           HISQPRWCCS+DDGNGF GDRYF+ QEWL G+SLA QSLK KP VV MSLRNEPRGPNQN
Sbjct: 61  HISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEPRGPNQN 120

Query: 222 VEVWFQYINQGAKLMHQINPNHVLVVVSGLSCDTDLSFLKNKSMGFILDNKLVFEAHLYS 281
           VE+WFQY++QGAKL+HQINPN  LVVVSGLS DTDLSFLKN+SMGF LDNKLVFEAHLYS
Sbjct: 121 VEMWFQYMSQGAKLVHQINPN-ALVVVSGLSYDTDLSFLKNRSMGFNLDNKLVFEAHLYS 180

Query: 282 FTNNMGDSWTSKPLNTFCASVNQGFKNWAGFLIRGQSPMLLFVSKFGIDQTGTNEGQNRF 341
           FTNNM D WTSKPLNTFCA+VNQGF++ AGFL+RGQ+P+ LFVS+FGI+Q G NEGQNRF
Sbjct: 181 FTNNMSDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGANEGQNRF 240

Query: 342 LSCFFTYLTENDFDWGLW 359
           LSCFFTYLT+NDFDWGLW
Sbjct: 241 LSCFFTYLTKNDFDWGLW 257

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0KL32_CUCSA1.1e-16080.70Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1[more]
A0A0A0KNB6_CUCSA4.3e-15477.78Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1[more]
A0A0A0L6S7_CUCSA3.8e-11880.23Uncharacterized protein OS=Cucumis sativus GN=Csa_3G185670 PE=3 SV=1[more]
A0A0A0K853_CUCSA1.2e-10054.71Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1[more]
B9RCJ5_RICCO5.5e-9350.15Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
gi|700195218|gb|KGN50395.1|1.5e-16080.70hypothetical protein Csa_5G171770 [Cucumis sativus][more]
gi|659073106|ref|XP_008467257.1|1.9e-15579.82PREDICTED: uncharacterized protein LOC103504654 [Cucumis melo][more]
gi|778708310|ref|XP_011656162.1|6.1e-15477.78PREDICTED: uncharacterized protein LOC105435646 [Cucumis sativus][more]
gi|659118742|ref|XP_008459280.1|2.2e-15178.68PREDICTED: uncharacterized protein LOC103498458 [Cucumis melo][more]
gi|700202305|gb|KGN57438.1|5.4e-11880.23hypothetical protein Csa_3G185670 [Cucumis sativus][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR001547Glyco_hydro_5
IPR013781Glycoside hydrolase, catalytic domain
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla011399Cla011399gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla011399Cla011399.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla011399.1.cds1Cla011399.1.cds1CDS
Cla011399.1.cds2Cla011399.1.cds2CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 129..359
score: 4.3
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 37..360
score: 1.9
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 38..359
score: 1.18
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 10..358
score: 6.3E
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 10..358
score: 6.3E