KIT – University of the State of Baden-Wuerttemberg and
National Research Center of the Helmholtz Association
1) INSTITUTE AIFB, KARLSRUHE INSTITUTE OF TECHNOLOGY, GERMANY; 2) DERI, NATIONAL UNIVERSITY OF IRELAND, GALWAY
http://swse.deri.org/dyldo/
Observing Linked Data Dynamics
Tobias Käfer1, Ahmed Abdelrahman2, Patrick O’Byrne2, Jürgen Umbrich2, Aidan Hogan2
May 30, 2013
Extended Semantic Web Conference (ESWC 2013), Montpellier, France
Linked Data Dynamics
… more than the growth of the LOD-Cloud
Why you might care:
As a publisher:
Versioning
Link Maintenance
As a consumer:
Reasoning
Hybrid Linked Data Warehouses
Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed
Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013
May 30, 2013
The Dynamic Linked Data Observatory – Part of a
Bigger Movement (Web Observatories)
“[…] in order to study the Web, you need to observe what happens on the Web. To do this, one has to study it every day to understand the dynamics of the Web and the interaction with technology, and what people do with it.”
“[…] to create a distributed archive of data on the Web and its activity, and […] mechanisms and tools that will be able to explore its development in the past, to examine its present condition and to establish potential developments in the future.”
Prof. Dame Wendy Hall, 2013
http://www.thehindu.com/sci-tech/internet/web-observatory-for-cybergazing/article4386613.ece
WebScience Trust: definition of a Web Observatory
A definition of the Web Observatory
Mission: To capture the dynamics of Linked Data
The Dynamic Linked Data Observatory
[Figure: Billion Triple Challenge Dataset of 2010 + LOD cloud; fixed URI list; the Linked Data Web]
Core part: combination of LOD/CKAN and BTC
220 example URIs from the data sets in the LOD cloud
220 top PageRanked URIs from the BTC 2010 dataset
Crawled from there to get approx. 100k URIs (union of 10 crawls)
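The construction of the fixed URI list can be sketched as a set union. This is only an illustration under stated assumptions: `build_uri_list`, the example URIs, and the toy crawl results are hypothetical, and the real crawler is not described on the slide. Only the idea of combining the two seed sets with the union of several crawls comes from the text above.

```python
from typing import Iterable, List, Set


def build_uri_list(lod_seeds: Iterable[str],
                   btc_seeds: Iterable[str],
                   crawls: Iterable[Iterable[str]]) -> List[str]:
    """Combine both seed sets with the union of all crawl results.

    Returns a sorted, de-duplicated list so that the result is stable
    across runs and can serve as a fixed URI list (hypothetical helper).
    """
    uris: Set[str] = set(lod_seeds) | set(btc_seeds)
    for crawl in crawls:
        uris |= set(crawl)
    return sorted(uris)


# Toy stand-ins for the 220 + 220 seeds and the 10 crawls.
lod_seeds = ["http://example.org/lod/a", "http://example.org/lod/b"]
btc_seeds = ["http://example.org/btc/x", "http://example.org/lod/a"]
crawls = [
    ["http://example.org/lod/a", "http://example.org/doc/1"],
    ["http://example.org/doc/1", "http://example.org/doc/2"],
]

uri_list = build_uri_list(lod_seeds, btc_seeds, crawls)
print(len(uri_list))  # 5 distinct URIs
```

Taking the union rather than a single crawl makes the list less sensitive to hosts that happen to be unavailable during one particular crawl.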
Weekly snapshots of a URI list derived from the LOD cloud and the 2010 Billion Triple Challenge dataset, chosen for coverage and variety.
[Timeline: snapshots taken 1 week apart, from May 6, 2012 until today]
This presentation: findings from the first half year of observation
Nominal size of a snapshot: 95,737 URIs (Kernel) / 191,474 URIs (Extended)
May to November 2012: 6 months, 29 (weekly) snapshots
Statistics on the underlying data:
Statistic                 Kernel                    Extended
Mean pay-level domains    573.6 ± 16.6              1,738.6 ± 218
Mean documents            68,996.9 ± 5,555.2        152,355.7 ± 2,356.3
Mean quadruples           16,001,671 ± 988,820      94,725,595 ± 10,279,806
Sum quadruples            464,048,460               2,747,042,282
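The figures in the table are consistent with each other, which a short calculation makes visible. The sketch below assumes that the means are taken over all 29 snapshots; the variable names are illustrative and not from the presentation.

```python
SNAPSHOTS = 29  # weekly snapshots, May to November 2012

# Values copied from the table above.
sum_quads = {"kernel": 464_048_460, "extended": 2_747_042_282}
mean_quads = {"kernel": 16_001_671, "extended": 94_725_595}

for name in sum_quads:
    derived = sum_quads[name] / SNAPSHOTS
    # Allow a difference of less than one quadruple for rounding.
    assert abs(derived - mean_quads[name]) < 1, name
    print(f"{name}: {derived:,.1f} quadruples per snapshot")

# The nominal extended snapshot is exactly twice the kernel size.
kernel_uris, extended_uris = 95_737, 191_474
print(extended_uris / kernel_uris)  # 2.0
```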
Secret questions of a Linked Data geek
Call for observations on different levels of abstraction:
Granularity: RDF graphs, documents, hosts (PLD)
Document-level dynamics: Life (Availability)…
[Histogram: x-axis: snapshots; y-axis: % documents of 87k *)]
Mean = 23.1 snapshots (~80%)
26% of URIs available in all snapshots
*) 86,696 RDF documents ever appeared in ≥ 1 kernel snapshot
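Availability figures of this kind can be derived from a per-document record of which snapshots contained the document. The following is a minimal sketch with toy data; `availability_stats` and the input layout (one boolean per snapshot) are assumptions for illustration, not the observatory's actual tooling.

```python
from collections import Counter
from typing import Dict, List, Tuple


def availability_stats(seen: Dict[str, List[bool]]) -> Tuple[Counter, float, float]:
    """Summarise availability; `seen` maps a URI to one flag per snapshot."""
    n_snapshots = len(next(iter(seen.values())))
    # Number of snapshots in which each document was available.
    counts = [sum(flags) for flags in seen.values()]
    histogram = Counter(counts)
    mean = sum(counts) / len(counts)
    # Share of documents that were available in every snapshot.
    always = sum(1 for c in counts if c == n_snapshots) / len(counts)
    return histogram, mean, always


seen = {
    "http://example.org/a": [True, True, True, True],
    "http://example.org/b": [True, False, True, True],
    "http://example.org/c": [True, True, False, False],
    "http://example.org/d": [False, False, True, False],
}
histogram, mean, always = availability_stats(seen)
print(mean, always)  # 2.5 0.25

# The slide's own figures: a mean of 23.1 out of 29 snapshots.
print(round(23.1 / 29, 3))  # 0.797
```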
Document-level dynamics: … and Death
Last heart-beat: overestimates death …
… and death certificate filed (HTTP 500 etc.): underestimates death
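The contrast between the two estimates can be illustrated with a small sketch. The exact criteria used in the study are not given on the slide, so the definitions below are assumptions: a document counts as dead by "last heart-beat" if it answered with HTTP 200 at some point but not in the final snapshot, and as dead by "death certificate" only if the final snapshot returned an explicit 4xx/5xx status. `None` stands for no response at all.

```python
from typing import Dict, List, Optional

Statuses = List[Optional[int]]  # one HTTP status (or None) per snapshot


def last_heartbeat(statuses: Statuses) -> Optional[int]:
    """Index of the last snapshot with a 200 response, or None."""
    for i in range(len(statuses) - 1, -1, -1):
        if statuses[i] == 200:
            return i
    return None


def dead_by_heartbeat(statuses: Statuses) -> bool:
    """Dead if it was alive once but not in the final snapshot."""
    hb = last_heartbeat(statuses)
    return hb is not None and hb < len(statuses) - 1


def dead_by_certificate(statuses: Statuses) -> bool:
    """Dead only if the final snapshot returned an explicit error code."""
    final = statuses[-1]
    return final is not None and 400 <= final < 600


docs: Dict[str, Statuses] = {
    "a": [200, 200, 200],    # always available
    "b": [200, 404, 404],    # explicit error at the end
    "c": [200, None, None],  # silent failure, e.g. a timeout
    "d": [200, 500, 200],    # temporary outage, then back
}
by_heartbeat = sorted(u for u, s in docs.items() if dead_by_heartbeat(s))
by_certificate = sorted(u for u, s in docs.items() if dead_by_certificate(s))
print(by_heartbeat)    # ['b', 'c']
print(by_certificate)  # ['b']
```

Under these assumed definitions, the heart-beat estimate also counts documents that may come back after the observation window, while the certificate estimate misses documents whose hosts simply stop answering.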
Document-level dynamics: Changes
Document-level changes clustered by host (PLD)
[Plot axes: avg. # snapshots with changes in documents with changes; share of documents with changes on the host (PLD)]
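The two per-host quantities named on the plot axes can be computed from a simple per-document change count. The sketch below assumes each document is already labelled with its pay-level domain; `pld_change_profile` and the sample data are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def pld_change_profile(
    docs: Iterable[Tuple[str, int]]
) -> Dict[str, Tuple[float, float]]:
    """Map each PLD to (share of changed docs, avg. snapshots with changes).

    `docs` yields (pld, number_of_snapshots_with_changes) per document.
    The average is taken over documents with at least one change.
    """
    per_pld: Dict[str, List[int]] = defaultdict(list)
    for pld, n_changes in docs:
        per_pld[pld].append(n_changes)

    profile = {}
    for pld, changes in per_pld.items():
        changed = [c for c in changes if c > 0]
        share = len(changed) / len(changes)
        avg = sum(changed) / len(changed) if changed else 0.0
        profile[pld] = (share, avg)
    return profile


docs = [
    ("example.org", 0), ("example.org", 4), ("example.org", 2),
    ("example.com", 0), ("example.com", 0),
]
profile = pld_change_profile(docs)
print(profile["example.org"])  # two of three documents changed, avg. 3.0
print(profile["example.com"])  # (0.0, 0.0)
```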
Document-level changes per topic and party
Grouping domains by metadata from the LOD cloud and the DataHub
[Figure: the LOD cloud colour-coded by topic; chart labels: LOD-cloud topic, party]
RDF-level dynamics: triples
Only 27.6% of the documents updated values for terms (i.e. one per triple) *
24% monotonic additions *
* given there are changes at all
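The change categories named on this slide can be told apart by comparing the triple sets of two consecutive versions of a document. The sketch below is one possible reading, not the study's method: "monotonic additions" means nothing was removed, and "term updates" is approximated as every added triple differing from some removed triple in exactly one position. `classify_change` and the sample triples are hypothetical.

```python
from typing import Set, Tuple

Triple = Tuple[str, str, str]


def classify_change(old: Set[Triple], new: Set[Triple]) -> str:
    """Classify the difference between two versions of a document."""
    if old == new:
        return "unchanged"
    added, removed = new - old, old - new
    if not removed:
        return "monotonic additions"
    if not added:
        return "deletions only"
    # Term update: same number of triples in and out, and each new triple
    # matches some removed triple in all but one position.
    if len(added) == len(removed) and all(
        any(sum(a != b for a, b in zip(t_new, t_old)) == 1 for t_old in removed)
        for t_new in added
    ):
        return "term updates"
    return "mixed"


old = {("s", "p", "1"), ("s", "q", "x")}
print(classify_change(old, old | {("s", "r", "y")}))              # monotonic additions
print(classify_change(old, {("s", "p", "2"), ("s", "q", "x")}))   # term updates
```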
RDF-level dynamics: terms
RDF-level dynamics: The most dynamic predicates
Indicating a timestamp
*) provenance time updated, and provenance time added respectively
Dynamics of the RDF link structure
Outward links from the kernel to other documents
Low-volume but constant stream of fresh outward links: sec.gov, identi.ca, zitgist.com, dbtropes.org, ontologycentral.com, freebase.com
New links in batches: bbc.co.uk, bnf.fr, dbpedia.org, linkedct.org, bio2rdf.org
Cf. Ntoulas et al. (2004): 25% new links each week (in a growing HTML data set)
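Whether a host produces a steady stream or batches of new links can be seen by counting, per week, the links that have never appeared before. This is a minimal sketch under the assumption that each snapshot is available as a set of (source, target) pairs; `fresh_links_per_week` and the toy data are hypothetical.

```python
from typing import List, Set, Tuple

Link = Tuple[str, str]  # (source document, target document)


def fresh_links_per_week(snapshots: List[Set[Link]]) -> List[int]:
    """Count links per snapshot that were not seen in any earlier one.

    The first snapshot is the baseline, so the result has one entry
    for every snapshot after it.
    """
    seen: Set[Link] = set(snapshots[0])
    fresh: List[int] = []
    for links in snapshots[1:]:
        new = links - seen
        fresh.append(len(new))
        seen |= new
    return fresh


snapshots = [
    {("a", "x")},
    {("a", "x"), ("a", "y")},
    {("a", "y"), ("b", "z"), ("b", "w")},
]
print(fresh_links_per_week(snapshots))  # [1, 2]
```

A constant stream shows up as small non-zero counts in most weeks, while batch publication shows up as mostly zeros with occasional large counts.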
Summary and Q&A
Analyses from first half year
Data collection is continuing
Future work:
More sources & analyses, results as RDF
We appreciate your feedback and speculations
What would you look for in the data?
Thanks for your attention
This presentation is CC BY SA. Picture credits:
Picture on title slide based on a picture by A. Sparrow, http://www.flickr.com/photos/49937157@N03/ (CC BY 2.0)
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch, http://lod-cloud.net/ (CC BY SA)
Evolution: http://commons.wikimedia.org/wiki/File:Human_evolution_scheme.svg (CC BY SA)
Death: http://commons.wikimedia.org/wiki/File:Death.jpg (CC BY SA 3.0)
Seismogram: http://www.flickr.com/photos/brettneilson/2281403809/ (CC BY)

Observing Linked Data Dynamics

  • 1. KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association !) INSTITUTE AIFB, KARLSRUHE INSTITUTE OF TECHNOLOGY, GERMANY; 2) DERI, NATIONAL UNIVERSITY OF IRELAND, GALWAY http://swse.deri.org/dyldo/ Observing Linked Data Dynamics Tobias Käfer1, Ahmed Abdelrahman2, Patrick O’Byrne2, Jürgen Umbrich2, Aidan Hogan2 May 30, 2013 Extended Semantic Web Conference (ESWC 2013), Montpellier, France
  • 2. 2 http://swse.deri.org/dyldo/ Linked Data Dynamics … more than the growth of the LOD-Cloud Why you might care: As a publisher: Versioning Link Maintenance As a consumer: Reasoning Hybrid Linked Data Warehouses Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013
  • 3. 3 http://swse.deri.org/dyldo/ The Dynamic Linked Data Observatory – Part of a Bigger Movement (Web Observatories) “[…] in order to study the Web, you need to observe what happens on the Web. To do this, one has to study it every day to understand the dynamics of the Web and the interaction with technology, and what people do with it.” “[…] to create a distributed archive of data on the Web and its activity, and […] mechanisms and tools that will be able to explore its development in the past, to examine its present condition and to establish potential developments in the future.” Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013 Prof. Dame Wendy Hall, 2013 http://www.thehindu.com/sci-tech/internet/web-observatory-for- cybergazing/article4386613.ece WebScience Trust: definition of a Web Observatory A definition of the Web Observatory
  • 4. The Dynamic Linked Data Observatory. Mission: To capture the dynamics of Linked Data. Diagram labels: Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Linked Data Web.
  • 5. The Dynamic Linked Data Observatory. Mission: To capture the dynamics of Linked Data. Diagram labels: Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Linked Data Web. Core part: combination of LOD/CKAN and BTC: 220 example URIs from the data sets in the LOD cloud; 220 top PageRanked URIs from the BTC 2010 dataset; crawled from there to get approx. 100k URIs (union of 10 crawls).
  • 6. The Dynamic Linked Data Observatory. Mission: To capture the dynamics of Linked Data. Weekly snapshots of a URI list derived from the LOD cloud and 2010's Billion Triple Challenge dataset, chosen for coverage and variety. Diagram labels: Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Linked Data Web. Timeline labels: May 6, 2012; today; 1 week.
  • 7. This presentation: findings from the first half year of observation. Nominal size of a snapshot: 95,737 URIs (Kernel) / 191,474 URIs (Extended). May to November 2012: 6 months, 29 (weekly) snapshots. Timeline labels: May 6, 2012; today; 1 week. Statistics on the data basis:
      Statistic               | Kernel                 | Extended
      Mean pay-level domains  | 573.6 ± 16.6           | 1,738.6 ± 218
      Mean documents          | 68,996.9 ± 5,555.2     | 152,355.7 ± 2,356.3
      Mean quadruples         | 16,001,671 ± 988,820   | 94,725,595 ± 10,279,806
      Sum quadruples          | 464,048,460            | 2,747,042,282
  • 8. Secret questions of a Linked Data geek. Call for observations on different levels of abstraction. Diagram labels along the granularity axis: RDF Graphs Documents Hosts (PLD)
  • 9. Document-level dynamics: Life (Availability)… [Histogram: % documents of 87k *) against number of snapshots.] Mean = 23.1 (~80%). 26% URIs available in all snapshots. *) 86,696 RDF documents ever appeared in ≥1 kernel snapshot.
  • 10. Document-level dynamics: … and Death. Last heart-beat: overestimates death… … and death certificate filled: underestimates death. Chart label: HTTP-500 etc.
  • 11. Document-level dynamics: Changes
  • 12. Document-level changes clustered by host (PLD). Axis labels: share of documents with changes on the host (PLD); avg. # snapshots with changes in documents with changes.
  • 13. Document-level changes per topic and party. Grouping domains by metadata from the LOD cloud and the DataHub. The LOD cloud colour-coded by topic. Chart labels: LOD-cloud topic; Party.
  • 14. RDF-level dynamics: triples. Only 27.6% of the documents updated values for terms (i.e. one per triple). 24% monotonic additions *. *) given there are changes at all.
  • 15. RDF-level dynamics: terms
  • 16. RDF-level dynamics: The most dynamic predicates. Indicating a timestamp *). *) provenance time updated, and provenance time added respectively.
  • 17. Dynamics of the RDF link structure. Outward links from the kernel to other documents. Low-volume but constant stream of fresh outward links: sec.gov, identi.ca, zitgist.com, dbtropes.org, ontologycentral.com, freebase.com. New links in batches: bbc.co.uk, bnf.fr, dbpedia.org, linkedct.org, bio2rdf.org. Cf. Ntoulas et al. (2004): 25% new links each week (in a growing HTML data set).
  • 18. Summary and Q&A. Analyses from first half year. Data collection is continuing. Future work: more sources & analyses, results as RDF. We appreciate your feedback and speculations: What would you look for in the data? Thanks for your attention. http://swse.deri.org/dyldo/
  • 19. This presentation is CC BY SA – picture credits. Picture on title slide based on a picture by A. Sparrow, http://www.flickr.com/photos/49937157@N03/, CC BY 2.0. Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch, http://lod-cloud.net/, CC BY SA. Evolution, http://commons.wikimedia.org/wiki/File:Human_evolution_scheme.svg, CC BY SA. Death, http://commons.wikimedia.org/wiki/File:Death.jpg, CC BY SA 3.0. Seismogram, http://www.flickr.com/photos/brettneilson/2281403809/, CC BY.
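
The document-level measures on slides 9 and 10 (availability, last heart-beat, death certificate) can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration rather than the authors' implementation: the per-document history is modelled as one HTTP status per weekly snapshot, `200` counts as "alive", `None` stands for "no response at all", and the function names and the `min_silent_weeks` parameter are hypothetical. The exact definitions used in the study may differ.

```python
from typing import Optional, Sequence

# One entry per weekly snapshot: an HTTP status code, or None if the
# server gave no response at all (assumed encoding, not from the slides).
History = Sequence[Optional[int]]


def availability(history: History) -> int:
    """Count the snapshots in which the document was alive (HTTP 200)."""
    return sum(1 for status in history if status == 200)


def last_heartbeat(history: History) -> Optional[int]:
    """Index of the last snapshot with HTTP 200, or None if never alive."""
    alive = [i for i, status in enumerate(history) if status == 200]
    return alive[-1] if alive else None


def dead_by_last_heartbeat(history: History, min_silent_weeks: int = 1) -> bool:
    """Dead if silent for at least min_silent_weeks after the last heart-beat.

    A silent document may come back later, which is one plausible reason
    this estimate tends to overestimate death.
    """
    hb = last_heartbeat(history)
    if hb is None:
        return False  # never observed alive, so nothing to declare dead
    return len(history) - 1 - hb >= min_silent_weeks


def dead_by_certificate(history: History, min_silent_weeks: int = 1) -> bool:
    """Dead only if every snapshot after the last heart-beat is an explicit
    HTTP error (status >= 400).

    Hosts that simply stop answering never issue such a "certificate",
    which is one plausible reason this estimate tends to underestimate death.
    """
    hb = last_heartbeat(history)
    if hb is None:
        return False
    tail = history[hb + 1:]
    return len(tail) >= min_silent_weeks and all(
        status is not None and status >= 400 for status in tail
    )


if __name__ == "__main__":
    explicit_errors = [200, 200, None, 200, 500, 500]
    silent_host = [200, 200, None, None]
    for name, history in [("explicit_errors", explicit_errors),
                          ("silent_host", silent_host)]:
        print(name, availability(history), last_heartbeat(history),
              dead_by_last_heartbeat(history), dead_by_certificate(history))
```

In this sketch a document whose host goes silent counts as dead under the last heart-beat rule but not under the death-certificate rule, so the two rules bracket the true figure from above and below.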