Your SlideShare is downloading. ×
Observing Linked Data Dynamics
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Observing Linked Data Dynamics

324

Published on

Presentation held at ESWC 2013, Montpellier, France

Presentation held at ESWC 2013, Montpellier, France

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
324
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association !) INSTITUTE AIFB, KARLSRUHE INSTITUTE OF TECHNOLOGY, GERMANY; 2) DERI, NATIONAL UNIVERSITY OF IRELAND, GALWAY http://swse.deri.org/dyldo/ Observing Linked Data Dynamics Tobias Käfer1, Ahmed Abdelrahman2, Patrick O’Byrne2, Jürgen Umbrich2, Aidan Hogan2 May 30, 2013 Extended Semantic Web Conference (ESWC 2013), Montpellier, France
  • 2. 2 http://swse.deri.org/dyldo/ Linked Data Dynamics … more than the growth of the LOD-Cloud Why you might care: As a publisher: Versioning Link Maintenance As a consumer: Reasoning Hybrid Linked Data Warehouses Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013
  • 3. 3 http://swse.deri.org/dyldo/ The Dynamic Linked Data Observatory – Part of a Bigger Movement (Web Observatories) “[…] in order to study the Web, you need to observe what happens on the Web. To do this, one has to study it every day to understand the dynamics of the Web and the interaction with technology, and what people do with it.” “[…] to create a distributed archive of data on the Web and its activity, and […] mechanisms and tools that will be able to explore its development in the past, to examine its present condition and to establish potential developments in the future.” Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Prof. Dame Wendy Hall, 2013 http://www.thehindu.com/sci-tech/internet/web-observatory-for- cybergazing/article4386613.ece WebScience Trust: definition of a Web Observatory A definition of the Web Observatory
  • 4. 4 http://swse.deri.org/dyldo/ Mission: To capture the dynamics of Linked Data Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 The Dynamic Linked Data Observatory May 30, 2013 Billion Triple Challenge Dataset of 2010 + LOD cloud Fixed URI list The Linked Data Web
  • 5. 5 http://swse.deri.org/dyldo/ Mission: To capture the dynamics of Linked Data Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 The Dynamic Linked Data Observatory May 30, 2013 Billion Triple Challenge Dataset of 2010 + LOD cloud Fixed URI list The Linked Data Web Core part: Combination of LOD/CKAN and BTC 220 example URIs from the data sets in the LOD cloud 220 top PageRanked URIs from the BTC 2010 dataset Crawled from there to get approx. 100k URIs (Union of 10 crawls)
  • 6. 6 http://swse.deri.org/dyldo/ Mission: To capture the dynamics of Linked Data  Weekly snapshots of a URI list derived from the LOD cloud and 2010‘s Billion triple challenge dataset, chosen for coverage and variety. Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 The Dynamic Linked Data Observatory May 30, 2013 Billion Triple Challenge Dataset of 2010 + LOD cloud Fixed URI list The Linked Data Web May 6, 2012 today 1 week
  • 7. 7 http://swse.deri.org/dyldo/ Nominal size of a snapshot: 95,737 (Kernel) / 191,474 URIs (Extended) May to November 2012: 6 months, 29 (weekly) snapshots Statistics on the data basis: This presentation: Findings from the first half year of observation Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Statistic Kernel Extended Mean pay-level domains 573.6 ± 16.6 1,738.6 ± 218 Mean documents 68,996.9 ± 5,555.2 152,355.7 ± 2,356.3 Mean quadruples 16,001,671 ± 988,820 94,725,595 ± 10,279,806 Sum quadruples 464,048,460 2,747,042,282 May 6, 2012 today 1 week
  • 8. 8 http://swse.deri.org/dyldo/ Secret questions of a Linked Data geek  Call for observations on different levels of abstraction: Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 granularity RDF Graphs Documents Hosts (PLD)
  • 9. 9 http://swse.deri.org/dyldo/ Document-level dynamics: Life (Availability)… Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 snapshots 10 0 20 30 % documents of 87k *) 0 5 10 15 20 25 Mean = 23.1 (~80%) 26% URIs available in all snapshots *)86,696RDFdocumentseverappearedin≥1kernelsnapshot
  • 10. 10 http://swse.deri.org/dyldo/ Document-level dynamics: … and Death Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Last Heart-Beat: Overestimates death… … and death certificate filled: underestimates death HTTP-500etc.
  • 11. 11 http://swse.deri.org/dyldo/ Document-level dynamics: Changes Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013
  • 12. 12 http://swse.deri.org/dyldo/ Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 avg.#Snapshotswithchanges indocumentswithchanges Share of documents with changes on the host (PLD) Document-level changes clustered by host (PLD)
  • 13. 13 http://swse.deri.org/dyldo/ Document-level changes per topic and party Grouping domains by metadata from the LOD cloud and the DataHub Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 The LOD cloud colour-coded by topic LOD-cloudtopicParty
  • 14. 14 http://swse.deri.org/dyldo/ RDF-level dynamics: triples Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Only 27,6% of the documents updated values for terms (i.e. one per triple) 24% monotonic additions * * given there are changes at all *
  • 15. 15 http://swse.deri.org/dyldo/ RDF-level dynamics: terms Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013
  • 16. 16 http://swse.deri.org/dyldo/ RDF-level dynamics: The most dynamic predicates Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Indicating a timestamp *) provenance time updated, and provenance time added respectively
  • 17. 17 http://swse.deri.org/dyldo/ Dynamics of the RDF link structure Outward links from the kernel to other documents Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Low-volume but constant stream of fresh outward links : sec.gov, identi.ca, zitgist.com, dbtropes.org, ontologycentral.com, freebase.com New links in batches: bbc.co.uk, bnf.fr, dbpedia.org, linkedct.org, bio2rdf.org Cf. Ntoulas et al. (2004): 25% new links each week (in a growing HTML data set)
  • 18. 18 http://swse.deri.org/dyldo/ Summary and Q&A Analyses from first half year Data collection is continuing Future work: More sources & analyses, results as RDF We appreciate your feed- back and speculations What would you look for in the data? Thanks for your attention Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 10 0 20 30 % documents of the 87k 0 5 10 15 20 25 snapshots http://swse.deri.org/dyldo/
  • 19. 19 http://swse.deri.org/dyldo/ This presentation is CC BY SA – picture credits Picture on title slide based on a picture by A. Sparrow http://www.flickr.com/photos/49937157@N03/ CC BY 2.0 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/ CC BY SA Evolution http://commons.wikimedia.org/wiki/File:Human_evolution_scheme.svg CC BY SA Death http://commons.wikimedia.org/wiki/File:Death.jpg CC BY SA 3.0 Seismogram http://www.flickr.com/photos/brettneilson/2281403809/ CC BY Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013

×