Entity extraction everywhere

I haven’t had a chance to test-drive Twine, which is Radar Networks’ still-unreleased “Revolutionary Semantic Web Application,” but I’ve read Tim O’Reilly’s writeup based on a demo he saw, and I’ve been meaning to amplify something that appeared in a comment there. Jeffrey Carr wrote:

I really don’t see anything unique in what Twine has released so far. ClearForest, for example, has offered a Firefox add-on that does the same entity extraction for any web page that your Twine screenshot illustrates, and they had that available several months ago.

Like Tim O’Reilly I’ll reserve judgment on Twine until I’ve tried it myself, and seen it operate at scale. I did, however, recently try the Firefox extension that Jeffrey Carr mentions. It’s called Gnosis, from ClearForest, a company recently acquired by Reuters.

Here’s a picture of Gnosis summarizing Tim’s posting:

Gnosis finds and highlights entities — that is, companies, people, products, and industry terms. Here’s an expanded view of the industry terms, products, and technologies it extracted:

I’d love to see this kind of entity extraction turn into a commodity service that we can wire into our existing email, blogging, social networking, and social bookmarking systems. Being able to easily express, in all those contexts, that twine refers to the company, or the product, not the strong kind of string, would be a huge win.
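To make the idea concrete, here is a minimal sketch of what such a service might return, assuming a tiny hand-built lookup table rather than anything Gnosis or Twine actually does. The `GAZETTEER` table, the `Entity` record, and the `extract_entities` function are all hypothetical names invented for illustration. Matching is case-sensitive, which is a crude way to tell Twine the product from twine the string; a real extractor would rely on context instead.

```python
import re
from dataclasses import dataclass


@dataclass
class Entity:
    text: str   # the matched surface form
    kind: str   # entity type, e.g. "Company" or "Product"
    start: int  # character offset where the match begins
    end: int    # character offset just past the match


# Hypothetical lookup table; a real service would use a far larger
# knowledge base plus statistical disambiguation.
GAZETTEER = {
    "Twine": "Product",
    "Radar Networks": "Company",
    "ClearForest": "Company",
    "Reuters": "Company",
}


def extract_entities(text, gazetteer=GAZETTEER):
    """Return an Entity for every gazetteer name found in text."""
    # Try longer names first so a multi-word name is not shadowed
    # by a shorter name that happens to be its prefix.
    names = sorted(gazetteer, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(n) for n in names) + r")\b"
    )
    return [
        Entity(m.group(0), gazetteer[m.group(0)], m.start(), m.end())
        for m in pattern.finditer(text)
    ]


if __name__ == "__main__":
    sample = "Twine is a product from Radar Networks; I tied the box with twine."
    for entity in extract_entities(sample):
        print(entity)
```

The character offsets are the part that matters for wiring this into email or blogging tools: they let the host application wrap each mention in a link or annotation without re-parsing the text.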

4 Comments

  1. > I’d love to see this kind of entity extraction turn into a commodity
    > service that we can wire into our existing email, blogging, social
    > networking, and social bookmarking systems.

    You will see that a lot of this is already covered by Jiglu.com, which plugs straight into blogs.

  2. At Orchestr8, we’ve been bringing Entity Extraction and other text mining capabilities “into the cloud”. Developers can use our REST API or SDKs to integrate natural language processing capabilities into their apps.

    AlchemyAPI supports 6+ spoken languages (English, French, ..), extraction of dozens of entity types, disambiguation support, text classification, etc.

    Other worthwhile Entity Extraction solutions include those from BasisTech and Teragram.
