IT-Analysis.com
IT-Analysis.com Logo
Enterprise SME Business Issues Technology Services Channels
Module Header
Fern HalperFern Halper
Dr Fern Halper
9th October - MarketSight 7.0 - Survey Analysis Made Simple
Michael WarrilowIMHO
Michael Warrilow
9th October - This is not your grandparent's Symantec
Bob TarzeyQuocirca
Bob Tarzey
8th October - What is Symantec's vision?
Peter AbrahamsAbrahams Accessibility
Peter Abrahams
7th October - Using scripting to improve accessibility
Module Header
Q. Which database do you use most?
 
  • addtomyyahoo4
  • Subscribe in NewsGator Online
  • Add to My AOL
  • Subscribe with Bloglines
  • Add to netvibes
  • Add to Google
Blogs > Fern Halper
Syndicating Text Analytics
Fern Halper By: Dr Fern Halper, Partner, Hurwitz & Associates
Published: 9th July 2008
Copyright Hurwitz & Associates © 2008
Logo for Hurwitz & Associates

Over the past several weeks, I've been briefed by a number of text analytics vendors and companies in partnership with text analytics vendors about syndicated services that make use of text analytics. Of course, syndicated services such as brand monitoring and news services that make use of this technology to some degree have been around for a while. But, how about some of the newer services?

An interesting example of this is illumin8, which is being offered by Elsevier, in partnership with Netbase. The service is targeted at R&D knowledge workers looking to solve technical and business problems. According to Elsevier, knowledge workers spend more time per week trying to discover relevant content relating to a particular problem area than analyzing that information (5.5 hours/week accessing vs. 4.7 hours/week analyzing). These workers are usually using a google-like search engine. I think everyone can agree that the google-like search engine is not ideal for research purposes, so I won't belabor the point here. In the case of the R&D knowledge worker, often one goal is to gather information relating to a particular problem, finding products that solve that problem, as well as understanding the approach used to solve the problem.

Elsevier has aggregated 5 billion business sources, 3 million full text articles, 33 million scientific records, and 21 million patents as the source of information for this service. Using the Netbase semantic index, Elsevier crawls through the information and extracts solutions that solve a problem and the approaches used to address a specific issue. In this way, R&D can help answer the following questions:

  • Solutions that exist to solve a problem
  • New applications and processes that might exist to help solve a problem
  • Information about what competitors are doing in the particular problem space
  • What the experts are saying about a particular problem area

Below is a screen shot of what an end-user might see using this service. In this example, the user is interested in solving the problem of fuel efficiency in boats. He or she wants to see what products and approaches are out on the market to address this problem and what companies are providing these solutions.The user enters the topic (boats) and the benefit (fuel efficiency) in the search box and gets back information that is organized in a logical way. In this example, you can see that query returns information about products that address the problem as well as the companies that make the products, organizations that deal with energy, as well approaches to solving the problem (drag, stroke, etc). These are ranked. Users can then drill down on any of these areas to get snippets (and full text) associated with areas that he/she is interested in analyzing.

During the demo, I asked to see what would happen if we input "text analytics" as the problem space in the search box. I was actually impressed that what was returned was a good set of information about the players, organizations dealing with text analytics and other information about it. The service is not inexpensive, but it does cull a lot of information.

Syndicated Services
I believe that the number of syndicated services using text analytics will continue to grow. We're certainly seeing action in the brand monitoring space on this front. Vendors are also getting into the act. Expert System, for example, has its own service that is targeted at the auto industry. I believe that other vendors may get into the act if they determine that the financial benefits of offering syndicated services (as opposed to SaaS offerings) makes sense.

Reader Comments

We are no longer accepting comments against this item. We suggest contacting the author directly.

9th July 2008: 'T. Benson' said:

Thank you, Dr. Halper, for an informative article. We at Cognition Technologies (www.cognition.com) agree that keyword/pattern-matching search engines are inadequate and the next wave of Semantic Natural Language Processing technologies needs to go mainstream.

All Natural Language Processing companies are trying to solve the “relevancy issue” – meaning, how can the relevancy of the text they are processing (either in-bound -> reading, or out-bound -> delivering text out) be improved for the end-user? This issue is addressed both by technological adjustments to existing solutions (e.g. more sophisticated mathematical and statistical algorithms) and/or by market segmentation, such as through dataset specialization (e.g. automotive, music, videos, medical, etc.). Cognition, on the other hand, addresses the relevancy challenge by changing the NLP paradigm through its Semantic Map, a unique and complete combination of linguistic elements to optimize semantic understanding:

Morphology
• The various forms of word, e.g. singular, plural, tense

Syntax
• The grammatical structure, e.g. verbs, nouns

Semantics
• Word and sentence meaning

Spelling
• The various ways words are spelled (or misspelled)

You can try Cognition on three datasets: Wikipedia, Medline, and a US caselaw dataset. They can all be accessed from www.cognition.com.

Thanks again for a great article.

Reply to T. Benson?

Advertisement



Published by: IT Analysis Communications Ltd.
T: +44 (0)203 051 5760 | F: +44 (0)870 345 9922
Email: