Cla007962 (gene) Watermelon (97103) v1

NameCla007962
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionCellulase (Glycosyl hydrolase family 5) protein (AHRD V1 **-- F4JBE4_ARATH); contains Interpro domain(s) IPR001547 Glycoside hydrolase, family 5
LocationChr8 : 5024732 .. 5026431 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGGCAGAAGGCCTCAACCATAGGCCGTTAAAAGAACTAGCAGACGAGGCAATCAAGTTAAGATTCAATTGTGTACGACTCACATATGCAACCCACATGTTCACTCGCTATGCAAATAGGACGATTGAAGAAAACTTCGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCATTTCTATTGAACAAGACTATTGTTGAAGCTTATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTGATTGCTGATAATCACATGAGCCAGCCAAGATGGTGTTGCTCTCTTGATGACGGCAATGGCTTCTTTGGAAATCGTAATTTTGATCCTCAGGAATGGCTACAAGGTCTTAGCTTAGTTGCTCAACGATTTAACAACAAATCCACGGTATCTAACATCAAATGATTTTCTCAATTTCTTGAAACTGAAGTCAAGCATTTAAAAACTTCTATAATATTTCATTAAATAAAGAAAGTCTTTTGGTGGTTGCTTAGGTGGTAGGAATGAGCTTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACCACGATCCATAGCATAAATCCGGAAGTTTTAGTGATTGTTTCAGGCTTAAATTATGACAATGATCTCCGATGCTTAAAGGAGAAGCCTTTGACTGTAAAGACCTTAAACAATAAGTTGGTTTTCGAGGTACACTTGTATTCTTTTAGTGGAGATTCTGAGAGCAAGTTTGTGCAACAACCATTGAACAATATTTGTGCAAATATTATGAATGGCTTTATAGACCATGCTGAGTTTGTAATTGAAGGATCAAACCCATTTCCTTTATTTGTTAGTGAATATGGGTATGATCAAAGAGAGGTTAACGATGCAGAAAATCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAGAAAGATTTGGATTGGGCATTGTGGACATGGCAAGGTAGTTATTATTATAGAGAAGGTCAAGCAGAACCTGCAGAAACTTTTGGAGTGCTTGATTCTAATTGGACTCAAATTAAGAACCCCAACTTTGTCAAGAAGTTTCAACTATTGCAGACCATGTTGCAAGGTAACTATATTGATGATTTTGAATATATGCAAGGATTTATAACTAGTAACCTACTATGATTTGTGAAAATAAATGTCAAAAAGTTCCACTAGAGTTTTGTAAAACTAGCTAAGTAACCATTTGGATAACATTTTTTATGGTACACAGATCCAAATTCCAATGCATCTTTCTCATATGTTATATACCATCCACAAAGTGGCCAGTGTGTCCAAGTCTCTAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACTTCAAGTCGATGGAGTCATGACAATGATAACACTCCAATTAGGATGTCAGACACTGGTTTGTGCTTGAAAGCTAGCGGAGAAGGCCTTGAAGCATCACTTTCAACTGACTGCTTAGGCAAACTAAGTGTTTGGAGTGCCATTTCAAACTCTAAGCTTCATTTGGCAACTTTCACTGAAGATGGAAAGAGTCTTTGTCTGCAGGTTGAAAGCTCAAATTCTTCAAAAATTGTGACCAACTCTTGTATTTGCACGAATGGTGATCCAACTTGCCTCCAAGACACCCAAAGCCAATGGTTTGAACTTGTTGGAACCAACACATTGTGA

mRNA sequence

ATGCTGGCAGAAGGCCTCAACCATAGGCCGTTAAAAGAACTAGCAGACGAGGCAATCAAGTTAAGATTCAATTGTGTACGACTCACATATGCAACCCACATGTTCACTCGCTATGCAAATAGGACGATTGAAGAAAACTTCGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCATTTCTATTGAACAAGACTATTGTTGAAGCTTATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTGATTGCTGATAATCACATGAGCCAGCCAAGATGGTGTTGCTCTCTTGATGACGGCAATGGCTTCTTTGGAAATCGTAATTTTGATCCTCAGGAATGGCTACAAGGTCTTAGCTTAGTTGCTCAACGATTTAACAACAAATCCACGGTGGTAGGAATGAGCTTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACCACGATCCATAGCATAAATCCGGAAGTTTTAGTGATTGTTTCAGGCTTAAATTATGACAATGATCTCCGATGCTTAAAGGAGAAGCCTTTGACTGTAAAGACCTTAAACAATAAGTTGGTTTTCGAGGTACACTTGTATTCTTTTAGTGGAGATTCTGAGAGCAAGTTTGTGCAACAACCATTGAACAATATTTGTGCAAATATTATGAATGGCTTTATAGACCATGCTGAGTTTGTAATTGAAGGATCAAACCCATTTCCTTTATTTGTTAGTGAATATGGGTATGATCAAAGAGAGGTTAACGATGCAGAAAATCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAGAAAGATTTGGATTGGGCATTGTGGACATGGCAAGGTAGTTATTATTATAGAGAAGGTCAAGCAGAACCTGCAGAAACTTTTGGAGTGCTTGATTCTAATTGGACTCAAATTAAGAACCCCAACTTTGTCAAGAAGTTTCAACTATTGCAGACCATGTTGCAAGATCCAAATTCCAATGCATCTTTCTCATATGTTATATACCATCCACAAAGTGGCCAGTGTGTCCAAGTCTCTAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACTTCAAGTCGATGGAGTCATGACAATGATAACACTCCAATTAGGATGTCAGACACTGGTTTGTGCTTGAAAGCTAGCGGAGAAGGCCTTGAAGCATCACTTTCAACTGACTGCTTAGGCAAACTAAGTGTTTGGAGTGCCATTTCAAACTCTAAGCTTCATTTGGCAACTTTCACTGAAGATGGAAAGAGTCTTTGTCTGCAGGTTGAAAGCTCAAATTCTTCAAAAATTGTGACCAACTCTTGTATTTGCACGAATGGTGATCCAACTTGCCTCCAAGACACCCAAAGCCAATGGTTTGAACTTGTTGGAACCAACACATTGTGA

Coding sequence (CDS)

ATGCTGGCAGAAGGCCTCAACCATAGGCCGTTAAAAGAACTAGCAGACGAGGCAATCAAGTTAAGATTCAATTGTGTACGACTCACATATGCAACCCACATGTTCACTCGCTATGCAAATAGGACGATTGAAGAAAACTTCGATCTTCTTGATTTAAAACAAGCAAAAGCTGGATTGGCTCAATATAATCCATTTCTATTGAACAAGACTATTGTTGAAGCTTATGAAGCTGTTGTTGATGTGTTGGGGGCAAGTGGTCTAATGGTGATTGCTGATAATCACATGAGCCAGCCAAGATGGTGTTGCTCTCTTGATGACGGCAATGGCTTCTTTGGAAATCGTAATTTTGATCCTCAGGAATGGCTACAAGGTCTTAGCTTAGTTGCTCAACGATTTAACAACAAATCCACGGTGGTAGGAATGAGCTTACGAAATGAGATACGAGGAATGATGGAAAATGCAAATGATTGGAACAACTATGTAACTCAAGGAGTAACCACGATCCATAGCATAAATCCGGAAGTTTTAGTGATTGTTTCAGGCTTAAATTATGACAATGATCTCCGATGCTTAAAGGAGAAGCCTTTGACTGTAAAGACCTTAAACAATAAGTTGGTTTTCGAGGTACACTTGTATTCTTTTAGTGGAGATTCTGAGAGCAAGTTTGTGCAACAACCATTGAACAATATTTGTGCAAATATTATGAATGGCTTTATAGACCATGCTGAGTTTGTAATTGAAGGATCAAACCCATTTCCTTTATTTGTTAGTGAATATGGGTATGATCAAAGAGAGGTTAACGATGCAGAAAATCGATTCATGAGTTGCTTCACAGCCCATCTTGCACAGAAAGATTTGGATTGGGCATTGTGGACATGGCAAGGTAGTTATTATTATAGAGAAGGTCAAGCAGAACCTGCAGAAACTTTTGGAGTGCTTGATTCTAATTGGACTCAAATTAAGAACCCCAACTTTGTCAAGAAGTTTCAACTATTGCAGACCATGTTGCAAGATCCAAATTCCAATGCATCTTTCTCATATGTTATATACCATCCACAAAGTGGCCAGTGTGTCCAAGTCTCTAATGACAACAAAGAAATTTTCCTCACCAATTGCTCCACTTCAAGTCGATGGAGTCATGACAATGATAACACTCCAATTAGGATGTCAGACACTGGTTTGTGCTTGAAAGCTAGCGGAGAAGGCCTTGAAGCATCACTTTCAACTGACTGCTTAGGCAAACTAAGTGTTTGGAGTGCCATTTCAAACTCTAAGCTTCATTTGGCAACTTTCACTGAAGATGGAAAGAGTCTTTGTCTGCAGGTTGAAAGCTCAAATTCTTCAAAAATTGTGACCAACTCTTGTATTTGCACGAATGGTGATCCAACTTGCCTCCAAGACACCCAAAGCCAATGGTTTGAACTTGTTGGAACCAACACATTGTGA

Protein sequence

MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVSGLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQVSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWSAISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTNTL
BLAST of Cla007962 vs. Swiss-Prot
Match: GUN_PAEPO (Endoglucanase OS=Paenibacillus polymyxa PE=3 SV=2)

HSP 1 Score: 70.1 bits (170), Expect = 7.5e-11
Identity = 70/339 (20.65%), Postives = 132/339 (38.94%), Query Frame = 1

Query: 5   GLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLAQYNP 64
           GL  R + ++ D+  K  +N +RL Y+  +F   +        D +D  +        NP
Sbjct: 73  GLWSRSMDDMLDQVKKEGYNLIRLPYSNQLFDSSSRP------DSIDYHK--------NP 132

Query: 65  FLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNG----FFGNRNFDPQE 124
            L+    ++  + +++  G  G+ +I D H            G+G     +    +    
Sbjct: 133 DLVGLNPIQIMDKLIEKAGQRGIQIILDRHRP----------GSGGQSELWYTSQYPESR 192

Query: 125 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMME----NAN-DWNNYVTQGVTTIHSINPEV 184
           W+    ++A R+ N  TV+G  L NE  G       NA+ DW     +    I S+NP  
Sbjct: 193 WISDWKMLADRYKNNPTVIGADLHNEPHGQASWGTGNASTDWRLAAQRAGNAILSVNPNW 252

Query: 185 LVIVSGLNYD-----------NDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQ 244
           L++V G++++            +L  +   P+ V  + N++V+  H Y   G S   +  
Sbjct: 253 LILVEGVDHNVQGNNSQYWWGGNLTGVANYPV-VLDVPNRVVYSPHDYG-PGVSSQPWFN 312

Query: 245 QPLNNICANIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQK 304
            P      + +    D     I   N  P+ V E+G    +++  E ++ +    ++   
Sbjct: 313 DP---AFPSNLPAIWDQTWGYISKQNIAPVLVGEFGGRNVDLSCPEGKWQNALVHYIGAN 372

Query: 305 DLDWALWTWQGSYYYREGQAEPAETFGVLDSNWTQIKNP 324
           +L +  W+               +T G+L  +WT    P
Sbjct: 373 NLYFTYWSL---------NPNSGDTGGLLLDDWTTWNRP 373

BLAST of Cla007962 vs. TrEMBL
Match: A0A0A0K853_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1)

HSP 1 Score: 917.1 bits (2369), Expect = 8.6e-264
Identity = 443/482 (91.91%), Postives = 459/482 (95.23%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           QYNPF+LNKTI EAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNR FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WLQGLSLVAQRFNNKSTVVGMSLRNE+RGMMENANDWNNYVTQGVTTIH INP VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPL V TL+NKL FEVHLYSFSGDSESKFVQQPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
           HAEFVIEG NPFPLFVSEYGYDQREV+DAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQV 360
           EGQAE AETFGVLDSNWTQIKNPNFV+KFQLLQTMLQDP SNASFSYVIYH QSGQC++V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWSA 420
           SNDNKEIFLTNCSTSSRWSHDND+TPI+MS TGLCLKASGEGLEASLSTDC+GK S+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTEDGKSLCLQ-VESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
           ISNS LHL T TEDGKSLCLQ +ESSNSSKIVTNSCICT  DPTCLQDTQSQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of Cla007962 vs. TrEMBL
Match: A0A0A0KL32_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 2.0e-151
Identity = 260/480 (54.17%), Postives = 347/480 (72.29%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           MLAEGL+ RPL ++     KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+A
Sbjct: 58  MLAEGLHRRPLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIA 117

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           Q NP L+N T+VEAY AVVD L A G+MV++DNH+SQPRWCC+ DDGNGFFG+R FDP+E
Sbjct: 118 QNNPSLVNLTLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEE 177

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WLQG+SL AQ   +K+ VV MS+RNE RG  +N   W  Y++QG   IH INP  LV+VS
Sbjct: 178 WLQGISLAAQSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVS 237

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GL+YD DL  LK + +    L+NKLVFE HLYSF+ +    ++ +PLN  CA++  GF D
Sbjct: 238 GLSYDTDLSFLKNRSMGF-NLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFED 297

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
            A F++ G NP PLFVSE+G DQR VN+ +NRF+SCF ++L + D DW LW  QGSYYYR
Sbjct: 298 RAGFLVRGQNPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYR 357

Query: 301 EGQAEPAETFGVLDSNWTQIKNPN-FVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQ 360
           EG     E FGVLDS + + KN   F+++FQL+QT LQDP+SN + S ++YHP SG CV+
Sbjct: 358 EGVKNAEENFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVR 417

Query: 361 VSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
           + N   ++ +++C TS+RW H+ D++PI+++ + LCLKA G GL   LS DC  + S+W 
Sbjct: 418 M-NKKYQLGISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWK 477

Query: 421 AISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
             S++KL LAT  E G++LCLQ  +S+S +IVTN C+C+N D  C +D QSQWF LV +N
Sbjct: 478 YGSSAKLQLATVDEQGQALCLQRAASHSHQIVTNKCLCSN-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla007962 vs. TrEMBL
Match: A0A0A0KNB6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 9.8e-151
Identity = 259/480 (53.96%), Postives = 347/480 (72.29%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           MLAEGL+ RPL ++A   +K RFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+A
Sbjct: 58  MLAEGLHLRPLDDIAALVVKSRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDALAGIA 117

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           Q NP +LN T+V+AY AV+D L A  +MV++DNH+SQPRWCC+ DDGNGFFG+R FDPQE
Sbjct: 118 QNNPSILNMTVVQAYGAVIDSLAAHRVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPQE 177

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WLQG+SL AQ   +KS VV MSLRNE RG  +N   W  Y++QG   IH INP  LV+VS
Sbjct: 178 WLQGISLAAQNLKSKSQVVAMSLRNEPRGPNQNVEMWFQYMSQGAKLIHQINPNALVVVS 237

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GL+YD DL  LK + +    L+NKLVFE HLYSF+ +    ++ +PLN  CA++  GF D
Sbjct: 238 GLSYDTDLSFLKNRSMGF-NLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFED 297

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
            A F++ G NP PLFVSE+G DQR VN+ +NRF+SCF ++L + D DW LW  QGSYYYR
Sbjct: 298 RAGFLVRGQNPIPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYR 357

Query: 301 EGQAEPAETFGVLDSNWTQIKNPN-FVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQ 360
           EG     E FGVLDS + + KN   F+++FQL+QT LQDP+SN + S ++YHP SG CV+
Sbjct: 358 EGVKNAEENFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVR 417

Query: 361 VSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
           + N   ++ +++C TS+RW H+ D++PI+++ + LCLKA G GL   LS DC  + S+W 
Sbjct: 418 M-NKKYQLGISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWK 477

Query: 421 AISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
             SN+KL LAT  E G++LCLQ  +S+S ++VTN C+C++ D  C +D QSQWF LV +N
Sbjct: 478 YGSNAKLQLATIDEQGQALCLQRAASHSHQLVTNKCLCSS-DSQCQEDPQSQWFTLVPSN 534

BLAST of Cla007962 vs. TrEMBL
Match: A0A059CGH5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01800 PE=3 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 2.5e-146
Identity = 254/482 (52.70%), Postives = 337/482 (69.92%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTR--YANRTIEENFDLLDLKQAKAG 60
           MLAEGL+ +PL  +  E  +LRFNCVRLT+AT+MFT+  + ++ +EE  D L L +AK G
Sbjct: 60  MLAEGLDKKPLGVIVAEIRRLRFNCVRLTWATYMFTQPGHGDQPVEETLDSLGLAEAKGG 119

Query: 61  LAQYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDP 120
           +A+ NP +LN T VEAY AVVD LG  G+MV+ DNH+S+P+WCC+ DDGNGFFG+  FDP
Sbjct: 120 VARNNPLVLNMTHVEAYAAVVDELGKQGVMVVLDNHVSKPKWCCAYDDGNGFFGDEYFDP 179

Query: 121 QEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVI 180
           +EWL+GL  VA+ FN KS VVGMS+RNE+RG  +N  DW  Y+    T +H  NP VLVI
Sbjct: 180 EEWLRGLVAVAEHFNGKSQVVGMSVRNELRGPRQNDYDWYQYIRTAATKVHQANPNVLVI 239

Query: 181 VSGLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGF 240
           +SGLN+ +DL  L+++P+ + +L  KLV+E H YSFSGD +   V QP++ +CAN +   
Sbjct: 240 LSGLNWASDLSFLRKRPVGL-SLGRKLVYEAHWYSFSGDRKIWEV-QPVDRVCANAVQRM 299

Query: 241 IDHAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYY 300
            D A F+  G    PLF+ E+G+DQ   + A++RF+SCF  + A KDLDWALW  QGSYY
Sbjct: 300 EDQAGFLSSGPGAVPLFLGEFGFDQTGKSQADDRFLSCFMGYAAGKDLDWALWALQGSYY 359

Query: 301 YREGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCV 360
           YR+G   P ETFGVLD NW  ++NP F ++FQL+QTM+QDP+SN+  SY++YHPQSG C+
Sbjct: 360 YRQGVVGPEETFGVLDFNWDGLRNPKFKERFQLVQTMVQDPSSNSPMSYIMYHPQSGLCI 419

Query: 361 QVSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVW 420
           + +N+N EI    C   SRW H  D +PIR+  T LCLKA G+GL   LS DC  + S W
Sbjct: 420 R-ANNNHEIGTAECQHWSRWIHYRDGSPIRLMGTPLCLKALGDGLPPVLSNDCSNRRSAW 479

Query: 421 SAISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGT 480
            +ISNSKLH+A   E G  LCL+ +S+ SS I+T  CIC + D  C ++ Q QWF+ V T
Sbjct: 480 RSISNSKLHVAATDEHGNRLCLEKKSNESSVILTRKCICVDDDSGCTENPQGQWFKFVPT 538

BLAST of Cla007962 vs. TrEMBL
Match: B9RCJ5_RICCO (Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCOM_1689380 PE=3 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 3.3e-146
Identity = 255/480 (53.12%), Postives = 336/480 (70.00%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           MLAEGL+ +PL  LA +  +  FNCVR T ATHMFTRY   T+ ++FD L+L +AKAG+A
Sbjct: 57  MLAEGLDKKPLSYLASKLARYHFNCVRFTCATHMFTRYGKLTVAQSFDSLNLTKAKAGIA 116

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           ++N FLLN T+V+AYEAVV+ LGA GLMV+ DNH+SQP+WCC  DD NGFFG+ +F P+E
Sbjct: 117 RHNSFLLNLTVVQAYEAVVNELGAHGLMVLLDNHVSQPKWCCPQDDENGFFGDIHFHPKE 176

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WL+GL++VA+ F  KS VV MS+RNE+RG  +N +DW  Y+ +G   +H +NPEVLV+VS
Sbjct: 177 WLRGLAIVAKIFQGKSQVVAMSMRNELRGPYQNEHDWYKYIQEGARMVHKLNPEVLVLVS 236

Query: 181 GLNYDNDLRCLKEKPLTV-KTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFI 240
           GL +  DL  LK+KPL +   L+NKLV+E H YSFSGD +   V QPLN IC       +
Sbjct: 237 GLVWGTDLSFLKKKPLHLGLNLDNKLVYEAHWYSFSGDPKVWEV-QPLNRICDLKTQIQV 296

Query: 241 DHAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYY 300
           D + FVI G NP PLF+ E G DQR VN A+NRF +CF A++A+ DLDW LW +QGSYY+
Sbjct: 297 DLSGFVITGENPVPLFLGEVGIDQRGVNRADNRFFTCFLAYVAENDLDWGLWAFQGSYYF 356

Query: 301 REGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQ 360
           +EG A P E +G+++ +W  +++P F  +  L++ M+QDP+S  S SY++YHP SG CV 
Sbjct: 357 KEGIAGPDENYGLMNFDWNYLRSPEFDDRIWLIKRMIQDPDSILSTSYLMYHPLSGNCVH 416

Query: 361 VSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
            S  N EI+ +     SRWSHD D  PIR+  + LCLKA G+GLE  LS DC  + S W 
Sbjct: 417 ASEKN-EIYASRFQQHSRWSHDGDGAPIRLMGSALCLKAIGDGLEPVLSNDCFSQQSSWK 476

Query: 421 AISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
            +S+SKLHL    E G+ LCL+ ES NSSK+ T  CIC   D  C ++ QSQWF+L+ TN
Sbjct: 477 LLSSSKLHLGVKDEHGEYLCLEKESFNSSKVFTRKCICIEDDSDCQENPQSQWFKLIKTN 534

BLAST of Cla007962 vs. NCBI nr
Match: gi|778721997|ref|XP_011658389.1| (PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus])

HSP 1 Score: 917.1 bits (2369), Expect = 1.2e-263
Identity = 443/482 (91.91%), Postives = 459/482 (95.23%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRT+EENFDLLDL+QAKAGLA
Sbjct: 57  MLIEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTVEENFDLLDLEQAKAGLA 116

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           QYNPF+LNKTI EAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNR FDPQE
Sbjct: 117 QYNPFVLNKTIAEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQE 176

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WLQGLSLVAQRFNNKSTVVGMSLRNE+RGMMENANDWNNYVTQGVTTIH INP VLVIVS
Sbjct: 177 WLQGLSLVAQRFNNKSTVVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVS 236

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GLNYDNDLRCLK+KPL V TL+NKL FEVHLYSFSGDSESKFVQQPLNNICA IM+ FID
Sbjct: 237 GLNYDNDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFID 296

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
           HAEFVIEG NPFPLFVSEYGYDQREV+DAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR
Sbjct: 297 HAEFVIEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 356

Query: 301 EGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQV 360
           EGQAE AETFGVLDSNWTQIKNPNFV+KFQLLQTMLQDP SNASFSYVIYH QSGQC++V
Sbjct: 357 EGQAELAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDPYSNASFSYVIYHVQSGQCIEV 416

Query: 361 SNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWSA 420
           SNDNKEIFLTNCSTSSRWSHDND+TPI+MS TGLCLKASGEGLEASLSTDC+GK S+WSA
Sbjct: 417 SNDNKEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSA 476

Query: 421 ISNSKLHLATFTEDGKSLCLQ-VESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
           ISNS LHL T TEDGKSLCLQ +ESSNSSKIVTNSCICT  DPTCLQDTQSQWFELV TN
Sbjct: 477 ISNSNLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELVATN 536

Query: 481 TL 482
           TL
Sbjct: 537 TL 538

BLAST of Cla007962 vs. NCBI nr
Match: gi|659090006|ref|XP_008445780.1| (PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo])

HSP 1 Score: 857.1 bits (2213), Expect = 1.5e-245
Identity = 412/448 (91.96%), Postives = 429/448 (95.76%), Query Frame = 1

Query: 34  MFTRYANRTIEENFDLLDLKQAKAGLAQYNPFLLNKTIVEAYEAVVDVLGASGLMVIADN 93
           MFTRYANRT+EENFDLLDL QAKAGL QYNPF+LNKTI EAYEAVVDVLGASGLMVIADN
Sbjct: 1   MFTRYANRTVEENFDLLDLGQAKAGLTQYNPFVLNKTIAEAYEAVVDVLGASGLMVIADN 60

Query: 94  HMSQPRWCCSLDDGNGFFGNRNFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 153
           HMSQPRWCCSLDDGNGFFGNR FDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN
Sbjct: 61  HMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMEN 120

Query: 154 ANDWNNYVTQGVTTIHSINPEVLVIVSGLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYS 213
           ANDWN+YVTQGVTTIH+INPEVLVIV GLNYDNDLRCLKEKPL V TL+NKLVFEVHLYS
Sbjct: 121 ANDWNHYVTQGVTTIHNINPEVLVIVGGLNYDNDLRCLKEKPLNVSTLDNKLVFEVHLYS 180

Query: 214 FSGDSESKFVQQPLNNICANIMNGFIDHAEFVIEGSNPFPLFVSEYGYDQREVNDAENRF 273
           FSG SESKFVQQPLNNICA I+N FIDHAEFVIEGSNPFPLFVSEYGYDQREV+DAENRF
Sbjct: 181 FSGASESKFVQQPLNNICAKIINEFIDHAEFVIEGSNPFPLFVSEYGYDQREVDDAENRF 240

Query: 274 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQ 333
           MSCFTAHLAQKDLDWALWTWQGSYYYREGQAE  ETFGVL+SNWTQIKNPNFV+KFQLLQ
Sbjct: 241 MSCFTAHLAQKDLDWALWTWQGSYYYREGQAELPETFGVLESNWTQIKNPNFVQKFQLLQ 300

Query: 334 TMLQDPNSNASFSYVIYHPQSGQCVQVSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTG 393
           TMLQDPNSNASFSYVIYHPQSGQC++VSNDNK+IFLTNCSTSSRWSHDND+TPI+MS+TG
Sbjct: 301 TMLQDPNSNASFSYVIYHPQSGQCIEVSNDNKDIFLTNCSTSSRWSHDNDSTPIKMSNTG 360

Query: 394 LCLKASGEGLEASLSTDCLGKLSVWSAISNSKLHLATFTEDGKSLCLQVESSNSSKIVTN 453
           LCLKASGEGL ASLS DCLGK SVWSAISNSKLHLAT TE+GKSLCLQ+ESSNSSKIVTN
Sbjct: 361 LCLKASGEGLAASLSNDCLGKQSVWSAISNSKLHLATVTENGKSLCLQIESSNSSKIVTN 420

Query: 454 SCICTNGDPTCLQDTQSQWFELVGTNTL 482
           SCICT  DPTCLQDTQSQWFELV TNTL
Sbjct: 421 SCICTTDDPTCLQDTQSQWFELVETNTL 448

BLAST of Cla007962 vs. NCBI nr
Match: gi|449451950|ref|XP_004143723.1| (PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus])

HSP 1 Score: 706.4 bits (1822), Expect = 3.3e-200
Identity = 325/483 (67.29%), Postives = 403/483 (83.44%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGL+ RPLK+LA+E ++LRFNCVRLTYATHMFTRYANRT+EENFDLLDL+ AK GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVVRLRFNCVRLTYATHMFTRYANRTVEENFDLLDLRAAKVGLA 116

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
            +NPF+LN TI EAYEAVVDVLG SGLMVIADNH+SQPRWCCSL+DGNGFFG+R FD +E
Sbjct: 117 FHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDTEE 176

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WL+GL LVA+RF NKS VV MSLRNE+RG    + DWN Y+TQG TTIH+INP++LVI+S
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYITQGATTIHNINPKILVIIS 236

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GLN+DNDLRC ++ PL +  L+NKLVFEVHLYSFSG+S+SKF+  PLN IC+ ++NGF++
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKVINGFVE 296

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
            AEFV+EG+   PLFVSE+G DQR VN+A++RF+SCF+AHL +KDLDWALW WQGSYYYR
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQRGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 EGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQV 360
           +G+  P E FGVL+ NW+ ++NP+F + FQLLQTMLQDPNSN+S +YV+YHPQSGQCV V
Sbjct: 357 QGKVGPEEVFGVLNYNWSDVRNPHFSQMFQLLQTMLQDPNSNSSNTYVMYHPQSGQCVLV 416

Query: 361 SN-DNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
            +  + +I+L +CS +S WS++ D TPI ++ T  CLKASG+GL  SLS DC G+ SVW+
Sbjct: 417 QDMKHMQIYLNDCSNASHWSYEGDGTPIMLASTNFCLKASGDGLPPSLSRDCFGEQSVWT 476

Query: 421 AISNSKLHLATFTEDGKS-LCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGT 480
           AIS+SKLHLAT T+ G + +CL+ ESSNSS+I+  SC+C   D  CLQDTQ+QWF+LV T
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGNDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 482
           NTL
Sbjct: 537 NTL 539

BLAST of Cla007962 vs. NCBI nr
Match: gi|659073199|ref|XP_008467306.1| (PREDICTED: uncharacterized protein LOC103504686 [Cucumis melo])

HSP 1 Score: 704.9 bits (1818), Expect = 9.6e-200
Identity = 327/483 (67.70%), Postives = 399/483 (82.61%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           ML EGL+ RPLK+LA+E ++L+FNCVRLTYATHMFTRYANRT+EENFDLLDL+ +K GLA
Sbjct: 57  MLIEGLDRRPLKDLANEVMRLKFNCVRLTYATHMFTRYANRTVEENFDLLDLRASKVGLA 116

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
            +NPF+LN TI EAYEAVVDVLG SGLMVIADNH+SQPRWCCSL+DGNGFFG+R FD +E
Sbjct: 117 LHNPFVLNMTIFEAYEAVVDVLGTSGLMVIADNHISQPRWCCSLEDGNGFFGDRYFDSEE 176

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WL+GL LVA+RF NKS VV MSLRNE+RG    + DWN YVTQG TTIH+INP +LVI+S
Sbjct: 177 WLEGLRLVARRFYNKSAVVAMSLRNELRGASSKSKDWNKYVTQGATTIHNINPNILVIIS 236

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GLN+DNDLRC ++ PL +  L+NKLVFEVHLYSFSG+S+SKF+  PLN IC+ I+NGF+ 
Sbjct: 237 GLNFDNDLRCQRQYPLQLNNLHNKLVFEVHLYSFSGESQSKFIHNPLNKICSKIINGFVQ 296

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
            AEFV+EG+   PLFVSE+G DQ  VN+A++RF+SCF+AHL +KDLDWALW WQGSYYYR
Sbjct: 297 RAEFVMEGAEAVPLFVSEFGLDQTGVNEADDRFLSCFSAHLVEKDLDWALWGWQGSYYYR 356

Query: 301 EGQAEPAETFGVLDSNWTQIKNPNFVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQV 360
           +G+ E  E FGVL+ NW+ ++NP F + FQLLQTMLQDPNSN+S +Y++YHPQSGQCVQV
Sbjct: 357 QGKVELEEVFGVLNYNWSDVRNPRFSQMFQLLQTMLQDPNSNSSNTYLMYHPQSGQCVQV 416

Query: 361 SN-DNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
            +   KEIFL NCS +S WS++ D TPI ++ T  CLKA+G GL  SLS DC G+ SVW+
Sbjct: 417 HDMKQKEIFLNNCSNASHWSYEGDGTPIMLASTNFCLKANGNGLPPSLSRDCFGEQSVWT 476

Query: 421 AISNSKLHLATFTEDGKS-LCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGT 480
           AIS+SKLHLAT T+ G + +CL+ ESSNSS+I+  SC+C   D  CLQDTQ+QWF+LV T
Sbjct: 477 AISDSKLHLATLTKQGNNGMCLEKESSNSSRILMRSCVCVGSDSNCLQDTQAQWFQLVVT 536

Query: 481 NTL 482
           NTL
Sbjct: 537 NTL 539

BLAST of Cla007962 vs. NCBI nr
Match: gi|700195218|gb|KGN50395.1| (hypothetical protein Csa_5G171770 [Cucumis sativus])

HSP 1 Score: 543.9 bits (1400), Expect = 2.8e-151
Identity = 260/480 (54.17%), Postives = 347/480 (72.29%), Query Frame = 1

Query: 1   MLAEGLNHRPLKELADEAIKLRFNCVRLTYATHMFTRYANRTIEENFDLLDLKQAKAGLA 60
           MLAEGL+ RPL ++     KLRFNCVRLTY+ HMFTR+AN T++++F+  D+K A AG+A
Sbjct: 58  MLAEGLHRRPLDDIISLVAKLRFNCVRLTYSIHMFTRHANLTVQQSFENFDMKDAMAGIA 117

Query: 61  QYNPFLLNKTIVEAYEAVVDVLGASGLMVIADNHMSQPRWCCSLDDGNGFFGNRNFDPQE 120
           Q NP L+N T+VEAY AVVD L A G+MV++DNH+SQPRWCC+ DDGNGFFG+R FDP+E
Sbjct: 118 QNNPSLVNLTLVEAYGAVVDSLAAHGVMVVSDNHISQPRWCCNNDDGNGFFGDRYFDPEE 177

Query: 121 WLQGLSLVAQRFNNKSTVVGMSLRNEIRGMMENANDWNNYVTQGVTTIHSINPEVLVIVS 180
           WLQG+SL AQ   +K+ VV MS+RNE RG  +N   W  Y++QG   IH INP  LV+VS
Sbjct: 178 WLQGISLAAQSLKSKAEVVAMSMRNEPRGPNQNVEKWFQYMSQGAKLIHQINPNALVVVS 237

Query: 181 GLNYDNDLRCLKEKPLTVKTLNNKLVFEVHLYSFSGDSESKFVQQPLNNICANIMNGFID 240
           GL+YD DL  LK + +    L+NKLVFE HLYSF+ +    ++ +PLN  CA++  GF D
Sbjct: 238 GLSYDTDLSFLKNRSMGF-NLDNKLVFEAHLYSFTNNMGDFWMSKPLNTFCASVNQGFED 297

Query: 241 HAEFVIEGSNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYR 300
            A F++ G NP PLFVSE+G DQR VN+ +NRF+SCF ++L + D DW LW  QGSYYYR
Sbjct: 298 RAGFLVRGQNPMPLFVSEFGIDQRGVNEGQNRFLSCFFSYLTENDFDWGLWALQGSYYYR 357

Query: 301 EGQAEPAETFGVLDSNWTQIKNPN-FVKKFQLLQTMLQDPNSNASFSYVIYHPQSGQCVQ 360
           EG     E FGVLDS + + KN   F+++FQL+QT LQDP+SN + S ++YHP SG CV+
Sbjct: 358 EGVKNAEENFGVLDSTFAKAKNSKLFLQRFQLMQTKLQDPSSNFTTSLIMYHPLSGGCVR 417

Query: 361 VSNDNKEIFLTNCSTSSRWSHDNDNTPIRMSDTGLCLKASGEGLEASLSTDCLGKLSVWS 420
           + N   ++ +++C TS+RW H+ D++PI+++ + LCLKA G GL   LS DC  + S+W 
Sbjct: 418 M-NKKYQLGISSCKTSNRWIHEQDSSPIKLAGSVLCLKAIGVGLPPILSQDCSSQQSIWK 477

Query: 421 AISNSKLHLATFTEDGKSLCLQVESSNSSKIVTNSCICTNGDPTCLQDTQSQWFELVGTN 480
             S++KL LAT  E G++LCLQ  +S+S +IVTN C+C+N D  C +D QSQWF LV +N
Sbjct: 478 YGSSAKLQLATVDEQGQALCLQRAASHSHQIVTNKCLCSN-DSQCQEDPQSQWFTLVPSN 534

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUN_PAEPO7.5e-1120.65Endoglucanase OS=Paenibacillus polymyxa PE=3 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0K853_CUCSA8.6e-26491.91Uncharacterized protein OS=Cucumis sativus GN=Csa_6G028440 PE=3 SV=1[more]
A0A0A0KL32_CUCSA2.0e-15154.17Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171770 PE=3 SV=1[more]
A0A0A0KNB6_CUCSA9.8e-15153.96Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171760 PE=3 SV=1[more]
A0A059CGH5_EUCGR2.5e-14652.70Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_D01800 PE=3 SV=1[more]
B9RCJ5_RICCO3.3e-14653.13Hydrolase, hydrolyzing O-glycosyl compounds, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
gi|778721997|ref|XP_011658389.1|1.2e-26391.91PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus][more]
gi|659090006|ref|XP_008445780.1|1.5e-24591.96PREDICTED: uncharacterized protein LOC103488703 [Cucumis melo][more]
gi|449451950|ref|XP_004143723.1|3.3e-20067.29PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus][more]
gi|659073199|ref|XP_008467306.1|9.6e-20067.70PREDICTED: uncharacterized protein LOC103504686 [Cucumis melo][more]
gi|700195218|gb|KGN50395.1|2.8e-15154.17hypothetical protein Csa_5G171770 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000772Ricin_B_lectin
IPR001547Glyco_hydro_5
IPR013781Glycoside hydrolase, catalytic domain
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla007962Cla007962.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000772Ricin B, lectin domainunknownSSF50370Ricin B-like lectinscoord: 344..456
score: 7.
IPR001547Glycoside hydrolase, family 5PFAMPF00150Cellulasecoord: 10..294
score: 7.0
IPR013781Glycoside hydrolase, catalytic domainGENE3DG3DSA:3.20.20.80coord: 3..334
score: 3.8
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 4..329
score: 1.33
NoneNo IPR availableGENE3DG3DSA:2.80.10.50coord: 348..456
score: 2.
NoneNo IPR availablePANTHERPTHR31263FAMILY NOT NAMEDcoord: 396..481
score: 2.3E-209coord: 1..350
score: 2.3E
NoneNo IPR availablePANTHERPTHR31263:SF10SUBFAMILY NOT NAMEDcoord: 396..481
score: 2.3E-209coord: 1..350
score: 2.3E