News the New Way: Semantics in the Driver's Seat

563
-1

Published on

Slides from presentation by Philip Dudchuk (RIA Novosti) and Daniel Hladky (Ontos/W3C) at SemTechBiz 2012 in San-Francisco

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
563
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
10
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

News the New Way: Semantics in the Driver's Seat

  1. 1. News the New Way Semantics in the Driver’s SeatPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  2. 2. Philip Dudchuk Head of Semantic Platform, RIA Novosti Daniel Hladky Deputy Director, W3C Russia Member of the Board, Ontos AGPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  3. 3. 1941 Founded in the beginning of the WW2, RIA Novosti was initially a news agency reporting on the situation at the war frontPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  4. 4. First news websites looked like simple feedsPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  5. 5. Boom of platforms in late 2000sPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  6. 6. Metadata rules the world of news • News metadata gets right content to right departments of the customer (big media) • Metadata locates reported events (local newspapers) • Metadata enables vertical products focused on selected areas (banking, automotive, government)Philip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  7. 7. Distinct metadata setsPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  8. 8. 2011: Need in a common Semantic Publishing Platform • Build and manage a common news ontology and vocabularies for all products and news websites • Generate metadata for both news items and articles on websites • Aggregate content and metadata for further use in end- user applications (websites and mobile apps)Philip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  9. 9. Evolution of the Publishing ProcessPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  10. 10. News OntologyPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  11. 11. Managing the Triple Store Triple Store updates • Editorial meetings • Statistics about ‘heuristic’ entities • Adding an entity directly from CMS Linguistic Information in the Triple Store • Morphology • Disambiguation rules & attributesPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  12. 12. ImpactsPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  13. 13. Impact 1: Broadcasting News with Semantic Metadata Filtering news content by triple queries at the customer’s end (via API): • content about any oil & gas company • content about any employee of any public body in a given region of Russia • content about any event going to happen in my city Common metadata for newswire and web content allow to blend free and paid content into new products (news archive)Philip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  14. 14. Impact 2: Adaptive Content of Websites My ria.ru • Locating the user and filtering the content by region • Gathering user interests and filtering content by entities and topicsPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  15. 15. Impact 3: Non-traditional Aggregations and Analytics Putting together news metadata with external content • summer forest fires • juvenile delinquency in towns and regions • election fraud casesPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  16. 16. 3 21 4 10 10 2 11 3 1 12 16 3 9 1 14 11 2 1 2 12 17 1 5 Combination of crowd-sourced geo data about forest fires and local reports by RIA NovostiPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  17. 17. A case study: country image analysisPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  18. 18. Country image analysis • Searching news content related to Russia across more than 3,000 foreign sources • Processing search results, tagging and aggregating content with its metadata • Producing statistics about reaction on subjects connected to Russia (events, people, organizations)Philip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  19. 19. Negativity Index Tymoshenko’s case in Ukraine, threat to boycott Euro 2012 ‘Pussy riots’ punks arrested Top sources with biggest number of negative publications on involvement of Russian politicians and businessmen in Yulia Tymoshenko’s casePhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  20. 20. US media on Russia’s reaction on the events in Syria The New York Times The Financial Times The Washington Post Syria’s media on the same topicPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  21. 21. Further Challenge • Processing the content from social media to create adaptive social applications • Semantic metadata for pictures and video (image & voice recognition) • Making RIA content & metadata API public • Creating a LOD cloud bubble out of RIA ontology and vocabulariesPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  22. 22. Thank you! @philip_dudchuk @daniel_hladkyPhilip Dudchuk & Daniel HladkySemTechBiz, San Francisco, June 5, 2012
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×