Cla011400 (gene) Watermelon (97103) v1

NameCla011400
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCellulase (Glycosyl hydrolase family 5) protein (AHRD V1 *--- F4JBE4_ARATH); contains Interpro domain(s) IPR013781 Glycoside hydrolase, subgroup, catalytic core
LocationChr1 : 1983334 .. 1985194 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAGAACATTGCAAGCCCTTGTATGTTTAGCTTTGGTCTTTATGATTTCATCCCTTTCAACTTATTCCTTACCATTATCAACTCGGGGAAGATGGATCATTGATTCTAAAACAGGACGTCGAGTGAAGCTTGTGTGTGTGAATTGGCCTTCTCATACCCAAAGCATGTTGGCAGAAGGCTTAAATCACAGACCACTAAAAGAACTTGCTGACGAGGCAATCAAATTGAAGTTCAATTGTGTGCGTCTCACATATGCAACCCATATGTTCACTCGATATGCTAATAGGACAATTGCAGAGAATTTCGACCTTTTAGATTTGAAACAAGCCAAAGCAAGATTGGCTCAGTATAACCCTTTTGTGTTGAATAAGACCATTGTGGAAGCCTATGAAGCAGTTGTTGATGTGCTTGGGGCAAGTGGTTTAATGGTGATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGTTCTCTTGATGATGGCAATGGCTTCTTTGGAAATCGAAATTTTGACCCTCAAGAATGGCTACAAGGTCTTAGCATAGTCGCTCAACGTTTTATAAACAAATCAATGGTATGTAACTTTAAAAGACTTTCTCTTTATTTTAGTCTAACTCCAAGAATTTGAAGACTACTAAAAGTGGTAGTAATTCATTAATATTAAAACAAGATCCTATGGCATGTGTAGGTGGTAGCAATGAGCTTACGAAATGAGATACGAGGAACAATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACAACAATCCACCAGATAAACCCGACTGTTTTAGTGATTGTTTCAGGCCTAAATTATGATAATGATCTTCGATGCTTAAAGGAAAATCCCTTGACCGTAAATACCTTAGACAATAAATTGGTTTTTGAGGTGCATTTGTATTCTTTTAGTGGAGATGAAAGTAGGTATGTACAACAACCATTGAATGACATTTGTGCAAATGTCATCAACGATTTTGTAGATCATGCAGGGTTTGTAATGGAAGGACCAAATCCATTTCCTTTATTTGTTAGCGAATATGGGTATGATTAAAGAGGGACCAATGATGCTGAAAATCGGTACATGAGTTGCTTCACTGCTCATCTTGCTAAGAAAGACATGGATTGGGCACTATGGACTTGGCAAGGCAGCTATTATTATAGAGAAGGTGAAGCAGAGTCTACAGAAGTATTTGGAGTTCTCAACCCCAATTGGACTCAAATTCAAAATCCTAACTTTACTAAGAAGTTTCAGCTATTACAGACGATGTTGCAAGGTAATTTTATTAATTATTTAAATAATAAATAATGATTCATAAAACAAAAGAATTATTCTAAAATATACAAAAACCATAAATTATTCAACTAATTACTGGAGTTATAGATCAAATTTGTTTTACATTTTTTGGCATGTAGATCCAAATTCCAATACATCTTCTTATGTTATGTACCATCCACAAAGTGGCCAATGTGTCCAAGCTTCAAATGACAATAGTGAAATTTTTTTGAGCAATTGCTCCATCTCAAGTCGGTGGAGTCACGGTGATGATGGCTCTCCAATCAAGATGACAACAAATGGTTTGTGTTTAAAGGCTAATGGAGAAGGCCTTCGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAGAGTGTTTGGAGAGCAATTTCTAACAGTAAGCTTCATTTGGCTACTATCACTCAAGATGGGAAGAGCCTTTGCTTGCAAGTTGAGAGCTCCAAATCTTCAAAAATTATGACCAACTCTTGTATTTGCACCAATGGCGAACCAAACTGCCTTCAAGACACCCGAAGTCAATGGTTTGAACTTGTTGCAACCAACACATTGTAA

mRNA sequence

ATGGGAAGAACATTGCAAGCCCTTGTATGTTTAGCTTTGGTCTTTATGATTTCATCCCTTTCAACTTATTCCTTACCATTATCAACTCGGGGAAGATGGATCATTGATTCTAAAACAGGACGTCGAGTGAAGCTTGTGTGTGTGAATTGGCCTTCTCATACCCAAAGCATGTTGGCAGAAGGCTTAAATCACAGACCACTAAAAGAACTTGCTGACGAGGCAATCAAATTGAAGTTCAATTGTGTGCGTCTCACATATGCAACCCATATGTTCACTCGATATGCTAATAGGACAATTGCAGAGAATTTCGACCTTTTAGATTTGAAACAAGCCAAAGCAAGATTGGCTCAGTATAACCCTTTTGTGTTGAATAAGACCATTGTGGAAGCCTATGAAGCAGTTGTTGATGTGCTTGGGGCAAGTGGTTTAATGGTGATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGTTCTCTTGATGATGGCAATGGCTTCTTTGGAAATCGAAATTTTGACCCTCAAGAATGGCTACAAGGTCTTAGCATAGTCGCTCAACGTTTTATAAACAAATCAATGAAAGACATGGATTGGGCACTATGGACTTGGCAAGGCAGCTATTATTATAGAGAAGGTGAAGCAGAGTCTACAGAAGTATTTGGAGTTCTCAACCCCAATTGGACTCAAATTCAAAATCCTAACTTTACTAAGAAGTTTCAGCTATTACAGACGATGTTGCAAGATCCAAATTCCAATACATCTTCTTATGTTATGTACCATCCACAAAGTGGCCAATGTGTCCAAGCTTCAAATGACAATAGTGAAATTTTTTTGAGCAATTGCTCCATCTCAAGTCGGTGGAGTCACGGTGATGATGGCTCTCCAATCAAGATGACAACAAATGGTTTGTGTTTAAAGGCTAATGGAGAAGGCCTTCGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAGAGTGTTTGGAGAGCAATTTCTAACAGTAAGCTTCATTTGGCTACTATCACTCAAGATGGGAAGAGCCTTTGCTTGCAAGTTGAGAGCTCCAAATCTTCAAAAATTATGACCAACTCTTGTATTTGCACCAATGGCGAACCAAACTGCCTTCAAGACACCCGAAGTCAATGGTTTGAACTTGTTGCAACCAACACATTGTAA

Coding sequence (CDS)

ATGGGAAGAACATTGCAAGCCCTTGTATGTTTAGCTTTGGTCTTTATGATTTCATCCCTTTCAACTTATTCCTTACCATTATCAACTCGGGGAAGATGGATCATTGATTCTAAAACAGGACGTCGAGTGAAGCTTGTGTGTGTGAATTGGCCTTCTCATACCCAAAGCATGTTGGCAGAAGGCTTAAATCACAGACCACTAAAAGAACTTGCTGACGAGGCAATCAAATTGAAGTTCAATTGTGTGCGTCTCACATATGCAACCCATATGTTCACTCGATATGCTAATAGGACAATTGCAGAGAATTTCGACCTTTTAGATTTGAAACAAGCCAAAGCAAGATTGGCTCAGTATAACCCTTTTGTGTTGAATAAGACCATTGTGGAAGCCTATGAAGCAGTTGTTGATGTGCTTGGGGCAAGTGGTTTAATGGTGATTGCTGACAACCACATTAGTCAACCAAGATGGTGTTGTTCTCTTGATGATGGCAATGGCTTCTTTGGAAATCGAAATTTTGACCCTCAAGAATGGCTACAAGGTCTTAGCATAGTCGCTCAACGTTTTATAAACAAATCAATGAAAGACATGGATTGGGCACTATGGACTTGGCAAGGCAGCTATTATTATAGAGAAGGTGAAGCAGAGTCTACAGAAGTATTTGGAGTTCTCAACCCCAATTGGACTCAAATTCAAAATCCTAACTTTACTAAGAAGTTTCAGCTATTACAGACGATGTTGCAAGATCCAAATTCCAATACATCTTCTTATGTTATGTACCATCCACAAAGTGGCCAATGTGTCCAAGCTTCAAATGACAATAGTGAAATTTTTTTGAGCAATTGCTCCATCTCAAGTCGGTGGAGTCACGGTGATGATGGCTCTCCAATCAAGATGACAACAAATGGTTTGTGTTTAAAGGCTAATGGAGAAGGCCTTCGAGCATCCCTCTCAAGTGATTGTTTGGGTCAACAGAGTGTTTGGAGAGCAATTTCTAACAGTAAGCTTCATTTGGCTACTATCACTCAAGATGGGAAGAGCCTTTGCTTGCAAGTTGAGAGCTCCAAATCTTCAAAAATTATGACCAACTCTTGTATTTGCACCAATGGCGAACCAAACTGCCTTCAAGACACCCGAAGTCAATGGTTTGAACTTGTTGCAACCAACACATTGTAA

Protein sequence

MGRTLQALVCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAEGLNHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSIVAQRFINKSMKDMDWALWTWQGSYYYREGEAESTEVFGVLNPNWTQIQNPNFTKKFQLLQTMLQDPNSNTSSYVMYHPQSGQCVQASNDNSEIFLSNCSISSRWSHGDDGSPIKMTTNGLCLKANGEGLRASLSSDCLGQQSVWRAISNSKLHLATITQDGKSLCLQVESSKSSKIMTNSCICTNGEPNCLQDTRSQWFELVATNTL
BLAST of Cla011400 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 1.4e-91
Identity = 166/192 (86.46%), Postives = 178/192 (92.71%), Query Frame = 1

Query: 1   MGRTLQALVCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAE 60
           M RT+Q ++ LALV + SS S YSLPLST GRWIIDS++G+RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNP 120
           GLNHRPLKELADEAIKL+FNCVRLTYATHMFTRYANRT+ ENFDLLDL+QAKA LAQYNP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LSIVAQRFINKS 193
           LS+VAQRF NKS
Sbjct: 181 LSLVAQRFNNKS 192

BLAST of Cla011400 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 3.1e-86
Identity = 151/201 (75.12%), Postives = 176/201 (87.56%), Query Frame = 1

Query: 192 SMKDMDWALWTWQGSYYYREGEAESTEVFGVLNPNWTQIQNPNFTKKFQLLQTMLQDPNS 251
           + KD+DWALWTWQGSYYYREG+AE  E FGVL+ NWTQI+NPNF +KFQLLQTMLQDP S
Sbjct: 338 AQKDLDWALWTWQGSYYYREGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYS 397

Query: 252 NTS-SYVMYHPQSGQCVQASNDNSEIFLSNCSISSRWSHGDDGSPIKMTTNGLCLKANGE 311
           N S SYV+YH QSGQC++ SNDN EIFL+NCS SSRWSH +D +PIKM++ GLCLKA+GE
Sbjct: 398 NASFSYVIYHVQSGQCIEVSNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGE 457

Query: 312 GLRASLSSDCLGQQSVWRAISNSKLHLATITQDGKSLCLQ-VESSKSSKIMTNSCICTNG 371
           GL ASLS+DC+G+QS+W AISNS LHL T+T+DGKSLCLQ +ESS SSKI+TNSCICT  
Sbjct: 458 GLEASLSTDCIGKQSLWSAISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTN 517

Query: 372 EPNCLQDTRSQWFELVATNTL 391
           +P CLQDT+SQWFELVATNTL
Sbjct: 518 DPTCLQDTQSQWFELVATNTL 538


HSP 2 Score: 329.7 bits (844), Expect = 4.8e-87
Identity = 157/228 (68.86%), Postives = 185/228 (81.14%), Query Frame = 1

Query: 164 NGFFGNRNFDPQEWLQGLSIVAQRFINKSMKDMDWALWTWQGSYYYREGEAESTEVFGVL 223
           NGF  +  F     +QG +       + + +D+DWALW WQGSYY+REG+AE  E FGVL
Sbjct: 125 NGFIDHAGFV----MQGPNPFPLFVTHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVL 184

Query: 224 NPNWTQIQNPNFTKKFQLLQTMLQDPNSNTS-SYVMYHPQSGQCVQASNDNSEIFLSNCS 283
           + NWTQI+NPNF +KFQLLQTMLQDPNSN S SYV+YHPQS QC+Q SNDN EIFL+NCS
Sbjct: 185 DSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCS 244

Query: 284 ISSRWSHGDDGSPIKMTTNGLCLKANGEGLRASLSSDCLGQQSVWRAISNSKLHLATITQ 343
             +RWSH +DG+PI+M++ GL LKA+G+GL ASLSSD L QQSVW AISNSKLHLAT TQ
Sbjct: 245 TPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQ 304

Query: 344 DGKSLCLQVESSKSSKIMTNSCICTNGEPNCLQDTRSQWFELVATNTL 391
            GKSLCLQ++SS SSK++TNSCICTNG+PNCLQDTRSQWFELV TNTL
Sbjct: 305 GGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNTL 348

BLAST of Cla011400 vs. TrEMBL
Match: A0A0A0L644_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G186670 PE=4 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.8e-47
Identity = 94/103 (91.26%), Postives = 97/103 (94.17%), Query Frame = 1

Query: 90  MFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIVEAYEAVVDVLGASGLMVIADN 149
           M TRYANRTI ENFDLLDLKQAKA LAQYNPFVLNKT+ EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 60

Query: 150 HISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSIVAQRFINKS 193
           H+SQPRWCCSLDDGNGFFGN NFDPQEWLQGLS+VAQRF NKS
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKS 103


HSP 2 Score: 264.6 bits (675), Expect = 1.9e-67
Identity = 121/200 (60.50%), Postives = 159/200 (79.50%), Query Frame = 1

Query: 194 KDMDWALWTWQGSYYYREGEAESTEVFGVLNPNWTQIQNPNFTKKFQLLQTMLQDPNSNT 253
           KD+DWALW WQGSYYYR+G+    EVFGVLN NW+ ++NP+F++ FQLLQTMLQDPNSN+
Sbjct: 144 KDLDWALWGWQGSYYYRQGKVGPEEVFGVLNYNWSDVRNPHFSQMFQLLQTMLQDPNSNS 203

Query: 254 S-SYVMYHPQSGQCVQASN-DNSEIFLSNCSISSRWSHGDDGSPIKMTTNGLCLKANGEG 313
           S +YVMYHPQSGQCV   +  + +I+L++CS +S WS+  DG+PI + +   CLKA+G+G
Sbjct: 204 SNTYVMYHPQSGQCVLVQDMKHMQIYLNDCSNASHWSYEGDGTPIMLASTNFCLKASGDG 263

Query: 314 LRASLSSDCLGQQSVWRAISNSKLHLATITQDGKS-LCLQVESSKSSKIMTNSCICTNGE 373
           L  SLS DC G+QSVW AIS+SKLHLAT+T+ G + +CL+ ESS SS+I+  SC+C   +
Sbjct: 264 LPPSLSRDCFGEQSVWTAISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGND 323

Query: 374 PNCLQDTRSQWFELVATNTL 391
            NCLQDT++QWF+LV TNTL
Sbjct: 324 SNCLQDTQAQWFQLVVTNTL 343

BLAST of Cla011400 vs. TrEMBL
Match: A0A0A0KN72_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G168955 PE=4 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 4.6e-66
Identity = 126/157 (80.25%), Postives = 138/157 (87.90%), Query Frame = 1

Query: 9   VCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAEGLNHRPLK 68
           V LA +   SSLS YSLPLST GRWI+DS TGRRVKLVCVNWPSHTQSML EGL+ RPLK
Sbjct: 10  VVLAFISFFSSLS-YSLPLSTNGRWIVDSATGRRVKLVCVNWPSHTQSMLIEGLDRRPLK 69

Query: 69  ELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIV 128
           +LA+E ++L+FNCVRLTYATHMFTRYANRT+ ENFDLLDL+ AK  LA +NPFVLN TI 
Sbjct: 70  DLANEVVRLRFNCVRLTYATHMFTRYANRTVEENFDLLDLRAAKVGLAFHNPFVLNMTIF 129

Query: 129 EAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNG 166
           EAYEAVVDVLG SGLMVIADNHISQPRWCCSL+DGNG
Sbjct: 130 EAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNG 165

BLAST of Cla011400 vs. TrEMBL
Match: A0A0A0L8X0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G186170 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 3.7e-63
Identity = 114/183 (62.30%), Postives = 145/183 (79.23%), Query Frame = 1

Query: 9   VCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAEGLNHRPLK 68
           V LA VF++ +   YSLPLST GRWI+++ TG+RVKL+CVNWP H Q+M+AEGL+ +PL 
Sbjct: 9   VSLACVFVLLTFEAYSLPLSTNGRWIVEATTGQRVKLICVNWPGHMQAMVAEGLHLKPLD 68

Query: 69  ELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIV 128
           ++A   +KL+FNCVRLTY+ HMFTRYAN T+ ++F+  DLK+A   +AQ NP +LN  +V
Sbjct: 69  DIAAMVVKLRFNCVRLTYSIHMFTRYANLTVKQSFENFDLKEAIVGIAQNNPTILNMKVV 128

Query: 129 EAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSIVAQRF 188
           EAYEAVVD LGA G+MV++DNHISQPRWCCS DDGNGFFG+R F+ QEWLQGLS+  Q  
Sbjct: 129 EAYEAVVDSLGAHGVMVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSL 188

Query: 189 INK 192
             K
Sbjct: 189 KTK 191

BLAST of Cla011400 vs. NCBI nr
Match: gi|659074661|ref|XP_008437725.1| (PREDICTED: uncharacterized protein LOC103483072 [Cucumis melo])

HSP 1 Score: 346.3 bits (887), Expect = 7.1e-92
Identity = 168/190 (88.42%), Postives = 177/190 (93.16%), Query Frame = 1

Query: 3   RTLQALVCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAEGL 62
           +T  A++ LALVF+ S LS  SLPLSTRGRWIIDS+TGRRVKLVC+NWPSHTQSML EGL
Sbjct: 2   KTFLAILLLALVFVFSPLSANSLPLSTRGRWIIDSQTGRRVKLVCMNWPSHTQSMLIEGL 61

Query: 63  NHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNPFV 122
           NHRPLK+LADEAIKL+FNCVRLTYATHMFTRYANRTI ENFDLLDLKQAKA LAQYNPFV
Sbjct: 62  NHRPLKDLADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQYNPFV 121

Query: 123 LNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLS 182
           LNKTI EAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGN NFDPQEWLQGLS
Sbjct: 122 LNKTIAEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNNNFDPQEWLQGLS 181

Query: 183 IVAQRFINKS 193
           +VAQRF NKS
Sbjct: 182 LVAQRFRNKS 191

BLAST of Cla011400 vs. NCBI nr
Match: gi|778721997|ref|XP_011658389.1| (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus])

HSP 1 Score: 344.7 bits (883), Expect = 2.1e-91
Identity = 166/192 (86.46%), Postives = 178/192 (92.71%), Query Frame = 1

Query: 1   MGRTLQALVCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLAE 60
           M RT+Q ++ LALV + SS S YSLPLST GRWIIDS++G+RVKLVCVNWPSHTQSML E
Sbjct: 1   MERTIQVILLLALVSVFSSFSAYSLPLSTHGRWIIDSQSGKRVKLVCVNWPSHTQSMLIE 60

Query: 61  GLNHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYNP 120
           GLNHRPLKELADEAIKL+FNCVRLTYATHMFTRYANRT+ ENFDLLDL+QAKA LAQYNP
Sbjct: 61  GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLAQYNP 120

Query: 121 FVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQG 180
           FVLNKTI EAYEAVVDVLGASGLMVIADNH+SQPRWCCSLDDGNGFFGNR FDPQEWLQG
Sbjct: 121 FVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQG 180

Query: 181 LSIVAQRFINKS 193
           LS+VAQRF NKS
Sbjct: 181 LSLVAQRFNNKS 192

BLAST of Cla011400 vs. NCBI nr
Match: gi|778721997|ref|XP_011658389.1| (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus])

HSP 1 Score: 327.0 bits (837), Expect = 4.4e-86
Identity = 151/201 (75.12%), Postives = 176/201 (87.56%), Query Frame = 1

Query: 192 SMKDMDWALWTWQGSYYYREGEAESTEVFGVLNPNWTQIQNPNFTKKFQLLQTMLQDPNS 251
           + KD+DWALWTWQGSYYYREG+AE  E FGVL+ NWTQI+NPNF +KFQLLQTMLQDP S
Sbjct: 338 AQKDLDWALWTWQGSYYYREGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYS 397

Query: 252 NTS-SYVMYHPQSGQCVQASNDNSEIFLSNCSISSRWSHGDDGSPIKMTTNGLCLKANGE 311
           N S SYV+YH QSGQC++ SNDN EIFL+NCS SSRWSH +D +PIKM++ GLCLKA+GE
Sbjct: 398 NASFSYVIYHVQSGQCIEVSNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGE 457

Query: 312 GLRASLSSDCLGQQSVWRAISNSKLHLATITQDGKSLCLQ-VESSKSSKIMTNSCICTNG 371
           GL ASLS+DC+G+QS+W AISNS LHL T+T+DGKSLCLQ +ESS SSKI+TNSCICT  
Sbjct: 458 GLEASLSTDCIGKQSLWSAISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTN 517

Query: 372 EPNCLQDTRSQWFELVATNTL 391
           +P CLQDT+SQWFELVATNTL
Sbjct: 518 DPTCLQDTQSQWFELVATNTL 538


HSP 2 Score: 336.7 bits (862), Expect = 5.6e-89
Identity = 154/200 (77.00%), Postives = 177/200 (88.50%), Query Frame = 1

Query: 192 SMKDMDWALWTWQGSYYYREGEAESTEVFGVLNPNWTQIQNPNFTKKFQLLQTMLQDPNS 251
           + KD+DWALWTWQGSYYYREG+AE  E FGVL  NWTQI+NPNF +KFQLLQTMLQDPNS
Sbjct: 249 AQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQTMLQDPNS 308

Query: 252 NTS-SYVMYHPQSGQCVQASNDNSEIFLSNCSISSRWSHGDDGSPIKMTTNGLCLKANGE 311
           N S SYV+YHPQSGQC++ SNDN +IFL+NCS SSRWSH +D +PIKM+  GLCLKA+GE
Sbjct: 309 NASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTGLCLKASGE 368

Query: 312 GLRASLSSDCLGQQSVWRAISNSKLHLATITQDGKSLCLQVESSKSSKIMTNSCICTNGE 371
           GL ASLS+DCLG+QSVW AISNSKLHLAT+T++GKSLCLQ+ESS SSKI+TNSCICT  +
Sbjct: 369 GLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTNSCICTTDD 428

Query: 372 PNCLQDTRSQWFELVATNTL 391
           P CLQDT+SQWFELV TNTL
Sbjct: 429 PTCLQDTQSQWFELVETNTL 448

BLAST of Cla011400 vs. NCBI nr
Match: gi|659090006|ref|XP_008445780.1| (PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo])

HSP 1 Score: 194.5 bits (493), Expect = 3.4e-46
Identity = 93/103 (90.29%), Postives = 96/103 (93.20%), Query Frame = 1

Query: 90  MFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIVEAYEAVVDVLGASGLMVIADN 149
           MFTRYANRT+ ENFDLLDL QAKA L QYNPFVLNKTI EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 150 HISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSIVAQRFINKS 193
           H+SQPRWCCSLDDGNGFFGNR FDPQEWLQGLS+VAQRF NKS
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKS 103


HSP 2 Score: 329.7 bits (844), Expect = 6.8e-87
Identity = 157/228 (68.86%), Postives = 185/228 (81.14%), Query Frame = 1

Query: 164 NGFFGNRNFDPQEWLQGLSIVAQRFINKSMKDMDWALWTWQGSYYYREGEAESTEVFGVL 223
           NGF  +  F     +QG +       + + +D+DWALW WQGSYY+REG+AE  E FGVL
Sbjct: 125 NGFIDHAGFV----MQGPNPFPLFVTHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVL 184

Query: 224 NPNWTQIQNPNFTKKFQLLQTMLQDPNSNTS-SYVMYHPQSGQCVQASNDNSEIFLSNCS 283
           + NWTQI+NPNF +KFQLLQTMLQDPNSN S SYV+YHPQS QC+Q SNDN EIFL+NCS
Sbjct: 185 DSNWTQIKNPNFVRKFQLLQTMLQDPNSNASFSYVIYHPQSSQCIQVSNDNKEIFLTNCS 244

Query: 284 ISSRWSHGDDGSPIKMTTNGLCLKANGEGLRASLSSDCLGQQSVWRAISNSKLHLATITQ 343
             +RWSH +DG+PI+M++ GL LKA+G+GL ASLSSD L QQSVW AISNSKLHLAT TQ
Sbjct: 245 TPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSAISNSKLHLATFTQ 304

Query: 344 DGKSLCLQVESSKSSKIMTNSCICTNGEPNCLQDTRSQWFELVATNTL 391
            GKSLCLQ++SS SSK++TNSCICTNG+PNCLQDTRSQWFELV TNTL
Sbjct: 305 GGKSLCLQIDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVGTNTL 348

BLAST of Cla011400 vs. NCBI nr
Match: gi|700202307|gb|KGN57440.1| (hypothetical protein Csa_3G186670 [Cucumis sativus])

HSP 1 Score: 197.6 bits (501), Expect = 4.1e-47
Identity = 94/103 (91.26%), Postives = 97/103 (94.17%), Query Frame = 1

Query: 90  MFTRYANRTIAENFDLLDLKQAKARLAQYNPFVLNKTIVEAYEAVVDVLGASGLMVIADN 149
           M TRYANRTI ENFDLLDLKQAKA LAQYNPFVLNKT+ EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MLTRYANRTIEENFDLLDLKQAKAGLAQYNPFVLNKTVAEAYEAVVDVLGASGLMVIADN 60

Query: 150 HISQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSIVAQRFINKS 193
           H+SQPRWCCSLDDGNGFFGN NFDPQEWLQGLS+VAQRF NKS
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKS 103


HSP 2 Score: 300.1 bits (767), Expect = 5.8e-78
Identity = 152/222 (68.47%), Postives = 172/222 (77.48%), Query Frame = 1

Query: 1   MGRTLQ-ALVCLALVFMISSLSTYSLPLSTRGRWIIDSKTGRRVKLVCVNWPSHTQSMLA 60
           MG T Q + V LA +   SSLS YSLPLST GRWI+DS TG RVKLVCVNWPSHTQSML 
Sbjct: 1   MGITTQFSFVVLAFICFFSSLS-YSLPLSTNGRWIVDSATGHRVKLVCVNWPSHTQSMLI 60

Query: 61  EGLNHRPLKELADEAIKLKFNCVRLTYATHMFTRYANRTIAENFDLLDLKQAKARLAQYN 120
           EGL+ RPLK+LA+E ++LKFNCVRLTYATHMFTRYANRT+ ENFDLLDL+ +K  LA +N
Sbjct: 61  EGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLALHN 120

Query: 121 PFVLNKTIVEAYEAVVDVLGASGLMVIADNHISQPRWCCSLDDGNGFFGNRNFDPQEWLQ 180
           PFVLN TI EAYEAVVDVLG SGLMVIADNHISQPRWCCSL+DGNGFFG+R FD +EWL+
Sbjct: 121 PFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEEWLE 180

Query: 181 GLSIVAQRFINK---------------SMKDMDWALWTWQGS 207
           GL +VA+RF NK               S K  DW  +  QG+
Sbjct: 181 GLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGA 221

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0K853_CUCSA1.4e-9186.46Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1[more]
A0A0A0K853_CUCSA3.1e-8675.12Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1[more]
A0A0A0L644_CUCSA2.8e-4791.26Uncharacterized protein OS=Cucumis sativus GN=Csa_3G186670 PE=4 SV=1[more]
A0A0A0KN72_CUCSA4.6e-6680.25Uncharacterized protein OS=Cucumis sativus GN=Csa_5G168955 PE=4 SV=1[more]
A0A0A0L8X0_CUCSA3.7e-6362.30Uncharacterized protein OS=Cucumis sativus GN=Csa_3G186170 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659074661|ref|XP_008437725.1|7.1e-9288.42PREDICTED: uncharacterized protein LOC103483072 [Cucumis melo][more]
gi|778721997|ref|XP_011658389.1|2.1e-9186.46PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus][more]
gi|778721997|ref|XP_011658389.1|4.4e-8675.12PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus][more]
gi|659090006|ref|XP_008445780.1|3.4e-4690.29PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo][more]
gi|700202307|gb|KGN57440.1|4.1e-4791.26hypothetical protein Csa_3G186670 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000772Ricin_B_lectin
IPR013781Glycoside hydrolase, catalytic domain
IPR017853Glycoside_hydrolase_SF
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU65260watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla011400Cla011400.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU65260WMU65260transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000772Ricin B, lectin domainPROFILEPS50231RICIN_B_LECTINcoord: 253..384
score: 9
IPR000772Ricin B, lectin domainunknownSSF50370Ricin B-like lectinscoord: 255..364
score: 3.5
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 25..191
score: 4.4
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 26..232
score: 2.11
NoneNo IPR availableGENE3DG3DSA:2.80.10.50coord: 262..364
score: 8.
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 1..259
score: 1.4E-113coord: 305..390
score: 1.4E
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 305..390
score: 1.4E-113coord: 1..259
score: 1.4E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla011400Watermelon (97103) v2wmwmbB266
Cla011400Watermelon (97103) v2wmwmbB273
Cla011400Wax gourdwgowmB076
Cla011400Watermelon (97103) v1wmwmB122
Cla011400Watermelon (97103) v1wmwmB142
Cla011400Watermelon (97103) v1wmwmB151
Cla011400Cucumber (Gy14) v1cgywmB150
Cla011400Cucumber (Gy14) v1cgywmB232
Cla011400Cucumber (Gy14) v1cgywmB552
Cla011400Cucurbita maxima (Rimu)cmawmB277
Cla011400Cucurbita maxima (Rimu)cmawmB319
Cla011400Cucurbita maxima (Rimu)cmawmB568
Cla011400Cucurbita maxima (Rimu)cmawmB712
Cla011400Cucurbita moschata (Rifu)cmowmB267
Cla011400Cucurbita moschata (Rifu)cmowmB313
Cla011400Cucurbita moschata (Rifu)cmowmB563
Cla011400Melon (DHL92) v3.5.1mewmB029
Cla011400Melon (DHL92) v3.5.1mewmB101
Cla011400Melon (DHL92) v3.5.1mewmB480
Cla011400Melon (DHL92) v3.5.1mewmB530
Cla011400Watermelon (Charleston Gray)wcgwmB298
Cla011400Watermelon (Charleston Gray)wcgwmB337
Cla011400Cucumber (Chinese Long) v2cuwmB260
Cla011400Cucumber (Chinese Long) v2cuwmB330
Cla011400Cucumber (Chinese Long) v2cuwmB408
Cla011400Cucumber (Chinese Long) v2cuwmB504
Cla011400Cucurbita pepo (Zucchini)cpewmB183
Cla011400Cucurbita pepo (Zucchini)cpewmB717
Cla011400Bottle gourd (USVL1VR-Ls)lsiwmB032
Cla011400Bottle gourd (USVL1VR-Ls)lsiwmB378
Cla011400Cucumber (Gy14) v2cgybwmB304
Cla011400Cucumber (Gy14) v2cgybwmB381
Cla011400Cucumber (Gy14) v2cgybwmB471
Cla011400Melon (DHL92) v3.6.1medwmB099
Cla011400Melon (DHL92) v3.6.1medwmB515
Cla011400Silver-seed gourdcarwmB0925
Cla011400Cucumber (Chinese Long) v3cucwmB270
Cla011400Cucumber (Chinese Long) v3cucwmB343
Cla011400Cucumber (Chinese Long) v3cucwmB433
Cla011400Cucumber (Chinese Long) v3cucwmB529