This document provides an overview of Linked Open Data and examples of applications that use LOD. It defines key concepts like Linked Data and the 5 star system. It discusses where to find LOD datasets and how to access and work with remote LOD through HTTP requests, SPARQL queries, or downloading datasets. Examples are given of LOD browsers, mashups, and intelligent web applications that have been built using LOD. The presenter's own projects involving cultural heritage data, history texts, and international aid data are described. Overall the document aims to illustrate the benefits and potential of LOD for developers and provide inspiration for new applications.
1. Linked Open Data and
Applications
Victor de Boer
Intelligent Web Applications
24/2/2014
With slides from Rinke Hoekstra
2. Victor de Boer
Assistant professor at VU University Amsterdam
Domain-driven Semantic Technologies, Linked Data
Cultural Heritage
Digital History
Linked Data for Development
3. Today
• What is LOD?
– What’s in it for me (the developer)?
• What can I do with LOD
– Examples
• My LOD projects and applications
• Pointers, examples, inspiration
5. Linked Data
is about technology for interoperability
Open Data
is about licenses to allow reuse
6. Linked Data five star system
★
Available on the web (whatever
format), but with an open license
★★
Available as machine-readable
structured data (e.g. excel instead
of image scan of a table)
★★★
as (2) plus non-proprietary format
(e.g. CSV instead of excel)
★★★★
All the above plus, Use open
standards from W3C (RDF and
SPARQL) to identify things, so that
people can point at your stuff
★★★★★
All the above, plus: Link your data
to other people’s data to provide
context
www.w3.org/designissues/linkeddata.html
7. “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
8. Where can I get it?
•
•
•
•
•
•
•
•
•
•
•
•
http://datahub.io/
http://datadryad.org/
https://easy.dans.knaw.nl
http://datacatalogs.org/
http://www.openstreetmap.org/
http://www.w3.org/wiki/DataSetRDFDumps
http://www.freebase.com/
https://data.overheid.nl/
http://data.gov.uk/
http://www.data.gov/
http://publicdata.eu/
http://doc.metalex.eu
• http://linkeddata.few.vu.nl/
Rinke Hoekstra
15. Three main ways of accessing remote
Linked Data
1. Through HTTP request on the resource URI
2. Through SPARQL queries
3. Get a copy of a dataset
(Linked Data Platform, Linked Data API)
16. 1. Through HTTP request on the
resource URI
• HTTP GET on resource, parse, follow links
– Simple HTTP requests and RDF parsing
– One request per resource: may require many
requests
• Local caching can be done
• Crawling
17. 2. Through SPARQL queries
• Full-blown query language
• Needs SPARQL endpoint
$query =
"SELECT distinct ?title ?description WHERE {
?x <http://data.open.ac.uk/podcast/ontology/relatesToCourse>
<http://data.open.ac.uk/course/t209>.
?x <http://purl.org/dc/terms/title> ?title.
?x <http://www.w3.org/TR/2010/WD-mediaont-10-20100608/description> ?description }
LIMIT 10
";
$requestURL = 'http://data.open.ac.uk/query?query='.urlencode($query);
$response = request($requestURL);
SPARQL in PHP example
http://www.greenhughes.com/content/approach-consuming-linked-data-php
18. 3. Get a local copy of a dataset
•
•
•
through SPARQL CONSTRUCT,
crawling or
direct file download
•
Save in triple store
–
or convert to something else
21. Examples: Linked Data Browsers
Allow you to browse remote linked data,
clicking through linked resources and
visualising images etc.
TimBL's Tabulator
http://dig.csail.mit.edu/2005/ajar/ajaw/tab.html
Disco
Marbles
OpenLink data browser
Explore more!
• http://sig.ma
• http://mspace.fm/
• http://www.google.com/publicdata
• http://datamarket.com/
• http://www.wolframalpha.com/
27. MultimediaN E-Culture project
• Museums have increasingly nice websites
• But: most of them are driven by stand-alone collection
databases
• Data is isolated, both syntactically and semantically
• If users can do cross-collection search, the individual
collections become more valuable!
• Semantic Search
28. MultimediaN E-Culture
2003-2008: UvA, VU, CWI, DEN, ICN
“Use Semantic Web techniques to
facilitate interoperability between
cultural heritage institutes”
30. Semantic search
• Search for via
annotations with
concepts (semantic links)
• Use the type of semantic
link to provide
meaningful presentation
of the search results
Query
“Paris”
Paris
PartOf
Montmartre
30
32. Search, Show and Annotate
Multiple search/browse/result interfaces
Google-like
Local view
Map
Graph display
Facet browser
Split up data based on attribute-values
Multiple criteria
Annotation Interface
Use of semantic autocompletion
Experts
Laypeople?
32
35. Het Koninkrijk der Nederlanden in de Tweede Wereldoorlog
History of German occupied Dutch society
(1940-1945)
Published between 1969 and 1991 in 14
volumes, 30 parts, 18.000 pages
1. Digitization,
2. Open Data,
3. Enriched access with Linked Open Data
50. Linked Data for International
Aid Transparency Initiative
Msc. Thesis by Kasper Brandt
Victor de Boer
51. Introduction - IATI
“IATI is a voluntary, multi-stakeholder initiative that seeks
to improve the transparency of aid in order to increase
its effectiveness in tackling poverty.”
-- http://www.aidtransparency.net/
53. Model and links based on requirements
elicitated from experts
Iterative Requirements Engineering Process Model by Loucopoulos and Karakostas
54. Linking datasets and Applications
User questions
1. In total, how much does a given country receive in
aid?
2. A comparative index of aid versus the Human
Development Index.
3. What is the geographic location of a project? How
much aid went to a given province, constituency or
village?
o Is the aid spent in places where the need is
highest? Is it well distributed across the
country?
o Can we attribute sub-national breakdowns for
aid so we can see how much goes to different
parts of recipient countries?
4. How does violent conflict in recipient countries
affect aid activities?
5. How does aid spending as registered in the IATI
standard compare to World Bank indicators?
58. http://iati2lod.appspot.com/
4. How does violent conflict in recipient countries affect aid activities?
5. How does aid spending as registered in the IATI standard compare to World
Bank indicators?
62. Linked Data for Development (LD4D)
Agrovoc
GeoNames
DBpedia
‘Allo, Linked
Data?
RadioMarché
Linked market data
*
<VoiceXML> to SPARQL
Web applications
Voice browser
Tel: +31208080855
Skype: +990009369996162208
63. Linked Open Data…
…should make it easier for developers to develop IWAs
… from distributed datasources
… with heterogeneous data
Go build stuff!
Lauradoet: - Sem tech / search - Patronen Cases modeleren en publiceren van Linked DataModeleren van Events - Polimedia( - E-culture)Victor doet: - Am.museum - Tools – Carmen? - Historische use cases - VK, Bioned, DSS
Interactie momenthier!
General purpose
Multiple medical and biological datasets
Mashups mix data from different sourcesLinks to Dbpedia, MusicBrainz
“Published between 1969 and 1991, the 30 volumes still combine the qualities of an authoritative work for a general audience, and an inevitable point of reference for scholars”Digitized version online in 2011, crashing the server