IT-Analysis.com
IT-Analysis.com Logo
Technology Applications
Enterprise SME Business Issues Technology Services Channels
Module Header
Philip HowardBloor IM Blog
Philip Howard
8th February - Bribery
Nigel StanleyBloor Security Blog
Nigel Stanley
8th February - Conficker grounds police checks
David NorfolkThe Norfolk Punt
David Norfolk
3rd February - What's wrong with "security"
Laurie McCabeLaurie McCabe
Laurie McCabe
2nd February - What is Total Cost of Ownership, and Why Should You Care?
Philip HowardBloor IM Blog
Philip Howard
2nd February - Calpont finally comes to market
Module Header
Q. What features do you want to see on this site?
 
Analysis
Oracle data mining, not only good but affordable too!
David Norris By: David Norris, Associate Analyst - Marketing and Analytics, Bloor Research
Published: 30th November 2006
Copyright Bloor Research © 2006
Logo for Bloor Research

Having been very impressed by what Microsoft has to offer I wanted to learn more about what Oracle has to offer, and I am equally impressed. I, like many others I suspect, tend to rather overlook that Oracle does data mining; because Oracle has so much on offer, some elements do tend to get neglected. In my case, having been a Tru64 user for many years, Oracle was not an option because they did not support the platform. But now that the mining is integral to the kernel of the database it is universally available, but old habits die hard and it has rather been off my radar.

An aspect of the tool that I have to comment on is its pricing. I hear a lot from Oracle users about how expensive everything is getting, it therefore comes as a very pleasant surprise to find that the Oracle data mining option is one of the great bargains available today because it is affordable and when you look at what you are getting it is a real Rolls Royce of capability and features.

That Oracle data mining should be as sophisticated as it is should not really be a surprise, because when Oracle went out to acquire data mining capability they went after one of the top offerings. The origins of the expertise comes from Thinking Machines whose Darwin product was one of the best that I had seen; but at the time I examined it for a leading Telco it was deemed by my managers to be too expensive so we went for Clementine, but I have always held in high esteem the experts behind Darwin. Having been acquired by Oracle, those experts obviously not only had the huge customer base to go after, but also the highly competent base technology of the Oracle database itself. So they redesigned from scratch and put the algorithms into the database to ensure that, not only is the execution of the algorithms efficient, but the vast amounts of data handling that typifies traditional data mining is minimised. This cannot be emphasised enough because data mining analysts traditionally only spend a tiny proportion of their time actually modelling; something like 80% of effort is expended on getting data to the tool to be used. So Oracle are leaving the database in situ and mining it there, which saves a lot of effort and will greatly increase productivity.

Whilst Microsoft may have impressed me by bringing data mining to the masses, Oracle is pitching at a different segment. This is a fully featured, highly sophisticated data mining capability to enable professionals to operate against Oracle data sets with productivity and precision. Oracle data mining has a broad range of available algorithms, which enable it to undertake virtually every kind of business and scientific analysis that one can think of. So whether you want to run supervised learning to classify data and make predictions, or you want to run unsupervised sessions to discover anomalies and clusters, Oracle has the right tool for the job.

When Oracle Data Mining in the database was first released I was again discouraged from using it because the algorithms were there but we had no commercially available means of easily accessing them. SPSS have been making their front end tools available to enable the benefits of the in-database offerings of Microsoft, IBM and Oracle to be readily exploited, and now Oracle are developing GUI based interfaces of their own, so again reasons not to examine this product are becoming increasingly hard to find.

One of the features that Microsoft are offering next year is the inclusion of data mining add-ins to Excel, but Oracle users need not feel obliged to migrate to Microsoft in order to gain such useful capabilities because Oracle themselves offer Excel users a one click access to mine data whether it is held in Oracle or Excel.

Oracle is where an awful lot of data exists today, and of the general purpose databases (so I am excluding the likes of Alterian from this) in my experience it is by far the most performant. With the acquisition of Seibel and their BI suite Oracle created a formidable BI offering, and now that I realise what a hidden gem their data mining suite is, it really is a very hard act for others to compete with. Whilst Microsoft is about bringing data mining to the masses Oracle are giving the data mining professional a real alternative to SAS and SPSS with an offering that is equally as well featured, but which promises to outperform any standalone offering.

Reader Comments

Sorry, we are no longer accepting comments on this item. We suggest trying to contact the author directly.

30th November 2006: 'James' said:

oh, have u used Oracle Data Mining? Oracle Data Mining is crap, holy s**t. I have worked on Oracle Data Mining for 2.5 years, from ODM to JDM. Only one word can describe it : RABBISH.

Reply to James?

1st December 2006: 'Marcos' said:

I beg to differ from James. James' criticisms are so vague and lacking merit that I will refrain to reply to them directly. I have worked with Oracle Data Mining (ODM) for 7 years now and I can confidently say that James does not know what he is talking about. I will state my bias upfront, I am one of the developers of the product. But knowing what I know about the product and the competing alternatives, I can say with ease that I would use this product even if I was not developing it. James indicates that he has worked on ODM for 2.5 years. It so happens that I know ALL the developers that have worked and currently work on the product. His opinions do not reflect the views of the development team and show a complete lack of data mining knowledge. If he had any knowledge of the field he would understand the value of mining in the database, the scalability of the product, the ability to integrate data mining functionality with ease in arbitrary SQL queries, and the automation provided by the product to perform most steps required to be successful during model building and scoring. I did not want to make this sound like a product sheet, however, I wanted to provide some substance to my comments instead of just making baseless comments like James.

Reply to Marcos?

5th February 2007: 'James' said:

Marcos, are you working in Boston Oracle Data Mining team? I don't know you, but I happend to know your manager (do u know Jacek , if you are really an Oracle Developer?).

Marcos, I know what I am talking about. I have indeed worked on that for 2.5 years, for commecial use.

Maybe Oracle Data Mining is not totally rabbish (sorry if it's offensive), but Oracle Data Mining is not something I would recommend to anyone. I hate it a lot personally anyway.

Reply to James?

1st December 2006: 'Shibar Letap' said:

Hi Rabbish,
Which database release are you using? And
what problems are you facing?
Shibar

Reply to Shibar Letap?

2nd December 2006: 'steve shy' said:

when you pay nothing you get nothing. all those in database data mining tools are like flying a plane with auto pilot. while you are air born all is great. but how do you get air born and how do you land saftely ?

Reply to steve shy?

5th December 2006: 'Karl' said:

Steve, I don't think I understand your comparison to airplane take-off and landing. It's my experience that in-database mining eases any potential problems with precisely those activities: 1) Take-off is more straight-forward and error-free because the data is right there. The mining staff does not need to extract it from it's original source, move it around, import it into a data mining tool, and re-assess formats, etc. 2) Landing is also more straight-forward and error-free using in-database mining. Since you're right in the database, models can be scored directly there without needing to be translated. Additionally, if sensitive data is being used, the database's security is another factor that eases implementation (the airplane landing).

Reply to Karl?

28th January 2007: 'Will Dwinnell' said:

This review makes only vague claims about the product in question. Things which might have been interesting would include: What modeling algorithms are included with this product? What test procedures are available? How much data (variables and cases) can this tool accomodate? In what forms are finished models delivered? What facilities does this tool provide for missing values or outliers? What about variable transformations?

Other data mining tools (only three of them) are mentioned, but only in passing. See the Software section at KDnuggets for a much more comprehensive list of data mining tools.

Reply to Will Dwinnell?

Advertisement



Published by: IT Analysis Communications Ltd.
T: +44 (0)190 888 0760 | F: +44 (0)190 888 0761
Email: