? What’s Calais? <ul><li>Is a semantic metadata generation service provided by Thomson Reuters </li></ul><ul><li>Extracts entities, facts and events from unstructured text </li></ul><ul><li>Is a web service supported by a growing toolkit of developer frameworks, plugins and applications </li></ul><ul><li>Is available for commercial and non-commercial use for free </li></ul><ul><li>Available at www.opencalais.com </li></ul>
? What’s Calais? Unstructured Documents (Text / HTML / XML) Calais Named Entities Facts Events People, Companies, Organization, Geographies, Books, Albums, Authors, etc. Position, Alliance, Person-Education, Person-Political, etc. Sporting, Management Change, IPO, Labor Action, etc.
1 250,000 articles. Three Editors. Do the Math. Use Calais to automatically tag your historical archive assets Improved search, navigation and reader participation
1 250,000 articles. Three Editors. Do the Math. Extract your articles from their current content store Generate article metadata using one of the Calais tools or your own code. Drop us a note to speed things up. Load the generated metadata back to your content store Expose the generated tags to users linked to search functionality Consider implementing a user feedback mechanism Use Calais to automatically tag your historical archive assets
1 Example Page. Use Calais to automatically tag your historical archive assets Visit http://bit.ly/CalaisPM
2 Feed the Monkey. Support Yahoo’s Semantic Search. Use Calais to create microformat metadata for Yahoo! SearchMonkey Better searchability and user experience
2 Feed the Monkey. Support Yahoo’s Semantic Search. Get a Calais API key Download Marmoset – a few lines of PHP code Paste Marmoset into your site template Wait for the monkey to visit Use Calais to create microformat metadata for Yahoo’s SearchMonkey Visit http://bit.ly/marmoset
3 Alert on Events – Not Just Keywords Use Calais to drive alerts or feeds based on events – not just keywords Highly targeted notification of key events your users care about
3 Alert on Events – Not Just Keywords. As you ingest new content send it to Calais Parse the Calais results to look for events. ManagementChange, Investment, Merger, etc Store the Calais results in your content repository Allow your users to specify keywords + events as subscription or feed criteria: ManagementChange + Enron Use Calais to detect and alert on significant events in your content
4 Do Things Search Can’t. Use Calais to enable semantic knowledge discovery Content Insight
4 Semantic Analysis. Move beyond the document to the corpus Calais your content to generate semantic metadata (note the new simple tags format) Create a flat file with one row per document Use SAS, Excel or other tool to start asking questions: Who are the top 5 CEO’s mentioned in the news in the context of a merger or acquisition? Use Calais to drive statistical analysis of document semantics
5 Semantic Navigation. Use semantic metadata to improve your site navigation and end user experience Happier users, more pageviews
5 Semantic Navigation. Use semantic data to improve site navigation Visit http://bit.ly/mailguardian
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.