Intro To The Calais Web Service @


Published on

This is a publisher-centric look at version 4.0 of the free Calais Web Service and open API at

  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • Intro To The Calais Web Service @

    1. 2. Introducing the Calais Web service (4.0) <ul><li>A Thomson Reuters initiative designed to power next generation publishing solutions </li></ul><ul><li>A free API that anyone can use at </li></ul><ul><li>The fastest way to categorize & metatag the people, places, companies, facts and events in your content </li></ul><ul><li>An easy way to connect to the open data sources in the Linked Data Cloud , including Wikipedia, DBPedia,, the Internet Movie Database (IMDB) and more </li></ul>
    2. 3. Why? Tagging text is costly and time-consuming <ul><li>We help in areas where: </li></ul><ul><ul><li>The economics don’t support metadata creation </li></ul></ul><ul><ul><li>The value of metadata is potentially high </li></ul></ul><ul><ul><li>The value of aggregated metadata is potentially extremely high </li></ul></ul>Seconds Years Seconds Years Tweets Blogs News Scient. Pubs Great Novels Latency Shelf Life
    3. 4. Why? What Calais can help you do… <ul><li>Automate: Automatically tag the people, places, companies, facts and events in your content to increase its value and interoperability. </li></ul><ul><li>Enhance: Enrich your content with open data from Wikipedia, the Internet Movie Database (IMDB), and more. </li></ul><ul><li>Engage: Optimize your user experience, increase engagement and drive repeat visits with topic pages, personalized filtering and real-time alerts.    </li></ul><ul><li>Extend: Increase your syndication to next generation search engines, news aggregators, ‘related stories’ applications and others. </li></ul><ul><li>Connect: Enter the emerging Linked Content Economy. Compete in a rapidly evolving ecosystem of enriched and interconnected content. </li></ul>
    4. 5. How it works: <ul><li>A semantic metadata generation service that extracts entities, facts and events from unstructured text </li></ul><ul><li>Creates linkages from extracted entities to linked data ecosystem </li></ul><ul><li>Provides a transportation layer for rich semantic metadata from producers to consumers </li></ul>
    5. 6. <Acquisition offset=&quot;494&quot; length=&quot;130&quot;>   <Company_Acquirer>Reuters</Company_Acquirer>   <Company_Acquired>ClearForest Ltd.</Company_Acquired>   <Status>Planned</Status> </Acquisition> <Company>Reuters</Company> <Company>ClearForest Ltd.</Company> Reuters Announced the Acquisition of ClearForest New York - April 30, 2007 Reuters, the global information company, has entered into an agreement to acquire all of the outstanding shares of ClearForest Ltd., a privately held provider of Text Analytics solutions, whose tagging platform and analytical products allow clients to derive precise business information from huge amounts of textual content. ClearForest has received sufficient shareholder approval to complete the transaction, which is expected to close in approximately 30 days, subject to customary closing conditions. The financial terms were not disclosed. Reuters plans to retain and continue to work with the existing management team and their highly skilled workforces in the US and Israel. It also plans to continue to support existing products and customers. Reuters believes that search will be a pivotal element to the future of how financial information is sourced and consumed. As part of its drive into this space, Reuters has created a new strategic group and appointed Gerry Campbell, who will oversee the integration of ClearForest and drive this innovation. <Product>Text Analytic Solution </Product> <Company>ClearForest Ltd.</Company> <Company>Reuters</Company> <Country>United States</Country> <Country>Israel</Country> <Company>Reuters</Company> <Person>Gerry Campbell</Person> <ManagementChange offset=&quot;2789&quot; length=&quot;92&quot;> <Person>Gerry Campbell</Person> <Company>Reuters</Company> <Action>Enters</Position> </ManagementChange> Text markup by Calais <Topic>M&A</Topic>
    6. 7. NEW! NEW ! The Linked Data Cloud with new OpenCalais and Thomson Reuters information assets
    7. 8. Unstructured Text Calais extracts entities, facts and events Metadata returned to the user with keys Keys provide access to the Calais Linked Data cloud Which provides information and other Linked Data pointers To a range of open and partner Linked data assets, including Thomson Reuters 1 2 3 4 5 6 The Process
    8. 9. Quick online demo <ul><ul><li>Copy and paste the text of a business news article into the viewer here: and press submit. The article is sent to the Calais engine which tags the content and returns it, marked-up. </li></ul></ul><ul><ul><li>The tags appear on the left hand rail, and you can click on the plus (+) sign to see the tags expand. (Note that the Calais Viewer is not the Calais service. It is merely a demonstration of how the service works.) </li></ul></ul><ul><ul><li>Since we are now on Calais 4.0, you can also use the viewer to see the Linked Data assets related to the tags Calais returns. </li></ul></ul><ul><ul><li>For example, here is the Calais summary page for IBM: </li></ul></ul><ul><ul><li>And here is the summary page for IBM in DBPedia (the Wikipedia translated into computer language): </li></ul></ul>
    9. 10. Calais progress to date <ul><li>Launched in late January, 2008 </li></ul><ul><li>Already, 9,500 developers have joined </li></ul><ul><li>1-3 million content ‘transactions’ per day </li></ul><ul><li>Delivered four major update releases </li></ul><ul><li>Lots of interesting apps & integrations </li></ul><ul><ul><li>Drupal </li></ul></ul><ul><ul><li>WordPress </li></ul></ul><ul><ul><li>Afresco </li></ul></ul><ul><ul><li>Many others </li></ul></ul>
    10. 11. What’s coming <ul><li>French language support – DONE! </li></ul><ul><li>Linked Data Integration – DONE! </li></ul><ul><li>Spanish language support – in process… </li></ul><ul><li>Social tags – simple topical tags for “aboutness” </li></ul><ul><li>? Tell us what you’d like to see @ </li></ul>
    11. 12. Sample Calais Applications
    12. 13. Example: The Mail & Guardian Online, South African Newspaper <ul><li>Using Calais to metatag new and historical articles, and: </li></ul><ul><ul><li>Build an index or topics A-Z </li></ul></ul><ul><ul><li>Pull out automatic related articles or pictures </li></ul></ul><ul><ul><li>Create news alerts on companies or people </li></ul></ul><ul><ul><li>Pull up maps for the countries named in articles </li></ul></ul><ul><ul><li>Predict readers’ interests based on browsing habits </li></ul></ul><ul><ul><li>Create tag clouds to show popular subjects, people, etc. </li></ul></ul>Using Calais to optimize search and navigation; drive consumer engagement
    13. 14. Example: Gist - today’s news filtered by people, places & events GIST uses Calais to prioritize stories, rank newsmakers & reveal trends / reader demand. It automatically aggregates multiple news sources and slots them into topic.
    14. 15. Example: The Powerhouse Museum in Sydney Using Calais to tag historical archives & using tags as search terms