-
1.
Digital Enterprise Research Institute www.deri.ie
Linked Open Data
Deirdre Lee
Digital Enterprise Research Institute (DERI), NUI Galway
22nd November 2012
© Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
Enabling Networked Knowledge
-
2.
DERI, NUI Galway
Centre for Science, Engineering and Technology (CSET) established in
2003 with funding from the Science Foundation Ireland (SFI)
~130 researchers
Research Areas: Semantic Web, Web Science, Social Networks, Data
Mining, Information Systems
Application Areas: eGovernment, Bioinformatics, Security, eBusiness and
financial services, eHealth, and Green & Sustainable IT.
-
3.
DERI, NUI Galway
National funding from SFI, EI, IRCSET and industrial collaborations
EC Funding: FP6, FP7, etc.
DERI technology driving 100,000s of Websites (i.e. in Drupal)
DERI technology installed on countless desktops (i.e. in Linux)
~100 industry and public partners
Avaya, Alcatel-Lucent, Celtrak, Cisco, Ericsson, FBK, OpenLink, Storm Technology, etc.
> 1,000 peer-reviewed papers
Actively participate in 17 standardisation activities (W3C, OASIS)
-
4.
What is Linked Open Data?
Open Data?
Linked Data Standards & Tools
Linked Open Data in Practice
-
5.
Public Data / OPEN Data
Difficult to find
Difficult to reuse
Difficult to integrate
-
6.
What is Linked Open Data?
?
-
7.
What is Linked Open Data?
IRELAND
hasRugbyTeam
hasCapital
hasGovernment
hasMusicGroup
hasUniversity
hasUnemployment
-
8.
What is Linked Open Data?
Facilitating data integration through:
Common data model
Building relations
-
9.
Two Key Ingredients
1. RDF – Resource Description Framework
(Graph based Data)
Identifies objects (URIs)
Interlink information (Relationships)
2. Vocabularies (Ontologies)
Provide shared understanding of a domain
Organise knowledge in a machine-comprehensible way
Give an exploitable meaning to the data
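As a rough illustration of the two ingredients, the sketch below models RDF statements as plain Python tuples of URIs and serialises them as N-Triples. The `http://example.org/` namespace and the property names (borrowed from the IRELAND example, plus a made-up `locatedIn`) are placeholders, not a published vocabulary.

```python
# Illustrative only: example.org is a placeholder namespace, not a real vocabulary.
EX = "http://example.org/"

# Each RDF statement is a (subject, predicate, object) triple of URIs.
triples = [
    (EX + "Ireland", EX + "hasCapital", EX + "Dublin"),
    (EX + "Ireland", EX + "hasUniversity", EX + "NUI_Galway"),
    (EX + "NUI_Galway", EX + "locatedIn", EX + "Galway"),
]


def objects(subject, predicate):
    """Return every object linked from `subject` via `predicate`."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def to_ntriples(rows):
    """Serialise URI-only triples in N-Triples syntax, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in rows)
```

Because every node is a URI, anyone else can publish a triple that points at `http://example.org/Ireland` and extend the graph without coordinating with the original publisher.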
-
10.
LOD Cloud
http://lod-cloud.net
-
11.
TimBL’s 5 ★ Open Data
-
12.
★ On the Web, Open License
On the Web
Wide access
Google can index it
People can find it themselves
Open License
Regulates reuse of data
Helps maintain provenance
Strengthens business reuse
– http://opendefinition.org/licenses/
-
13.
★ ★ Structured Data
Machine-readable
-
14.
Screenscraping
People use tools like ScraperWiki to get at data that isn't machine-readable
https://scraperwiki.com/tags/ireland
Scraping is problematic because:
It is expensive
It is brittle
It puts a strain on computing resources
-
15.
Formats
Good:
MS Excel, CSV, XML, JSON, Microdata
Not so good:
Pure websites, MS Word
Bad:
PDF
Really bad:
Only charts/maps without numbers, images
-
16.
★ ★ ★ Non-Proprietary Formats
Freedom to choose how to process, analyse and visualise data
Proprietary:
Word, Excel, PDF
Non-proprietary:
CSV, XML, JSON, Microdata, RDF
-
17.
★ ★ ★ ★ Use URIs
Unique identifiers enable others to point to the data.
<http://www.deri.ie/about/team/member/Deirdre_Lee>
<http://www.deri.ie/publications#uid_339>
-
18.
★ ★ ★ ★ ★ Linking Data
Link your data to other data to provide context
http://lod-cloud.net
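The five-star scheme is cumulative: a dataset earns a star only if it also meets every earlier level. A minimal sketch of that rule follows, assuming a hypothetical `Dataset` record and format sets taken from the format lists in this deck (the lowercase keys such as `"xls"` are illustrative labels, not an official registry).

```python
from dataclasses import dataclass

# Formats named on the slides; these sets are illustrative, not exhaustive.
STRUCTURED = {"xls", "csv", "xml", "json", "microdata", "rdf"}
NON_PROPRIETARY = {"csv", "xml", "json", "microdata", "rdf"}


@dataclass
class Dataset:
    """Hypothetical description of a published dataset."""
    on_web: bool
    open_license: bool
    fmt: str                 # lowercase format label, e.g. "csv" or "pdf"
    uses_uris: bool = False
    links_out: bool = False  # links to other people's data


def stars(d: Dataset) -> int:
    """Count stars cumulatively: stop at the first level that is not met."""
    checks = [
        d.on_web and d.open_license,  # 1 star: on the Web, open license
        d.fmt in STRUCTURED,          # 2 stars: structured, machine-readable
        d.fmt in NON_PROPRIETARY,     # 3 stars: non-proprietary format
        d.uses_uris,                  # 4 stars: URIs identify things
        d.links_out,                  # 5 stars: linked to other data
    ]
    count = 0
    for ok in checks:
        if not ok:
            break
        count += 1
    return count
```

Under these assumptions an openly licensed PDF on the Web stops at one star, an Excel file at two, and a CSV file at three until its records are given URIs.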
-
19.
What is Linked Data
Linked Data Standards & Tools
Linked Open Data in Practice
-
20.
Linked Data Standards
Government Linked Data (GLD) WG www.w3.org/2011/gld/
-
21.
Linked Open Metadata
Data Catalog Vocabulary (DCAT)
http://www.w3.org/TR/vocab-dca
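To give a feel for what catalogue metadata in DCAT might look like, the sketch below builds a description of one dataset as a JSON-LD-style dictionary. The `dcat:` and `dct:` terms are from the DCAT and Dublin Core Terms namespaces; the dataset, file and licence URIs under `example.org` are invented for illustration, and the choice of JSON-LD as the serialisation is an assumption, not something the slide prescribes.

```python
import json


def describe_dataset(uri, title, csv_url, license_url):
    """Build a DCAT description of one dataset with a single distribution."""
    return {
        "@context": {
            "dcat": "http://www.w3.org/ns/dcat#",
            "dct": "http://purl.org/dc/terms/",
        },
        "@id": uri,
        "@type": "dcat:Dataset",
        "dct:title": title,
        "dct:license": {"@id": license_url},
        "dcat:distribution": {
            "@type": "dcat:Distribution",
            "dcat:accessURL": {"@id": csv_url},
        },
    }


# Hypothetical URIs, used only to show the shape of a record.
record = describe_dataset(
    uri="http://example.org/dataset/piers",
    title="Piers",
    csv_url="http://example.org/files/piers.csv",
    license_url="http://example.org/licence/open",
)
serialised = json.dumps(record, indent=2)
```

Because every catalogue uses the same terms, records harvested from different portals can be merged and queried together.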
-
22.
Integrating Linked Metadata Repositories
Shukair, G., et al., Integrating Linked Metadata Repositories in the Web of Data, in Third
International Workshop on Consuming Linked Data (COLD 2012) at ISWC 2012: Boston, US.
-
23.
Domain-Specific Vocabularies
JOINUP European Commission ISA Semantic Assets
Core Person Vocabulary
Core Location Vocabulary
Core Business Vocabulary
Core Public Service Vocabulary
– http://joinup.ec.europa.eu/
Data Cube Vocabulary
http://www.w3.org/2011/gld/wiki/Data_Cube_Vocabulary
Vocab Lists
http://vocab.deri.ie/
http://vocab.data.gov/
-
24.
Linked Data Tools
-
25.
What is Linked Data
Linked Data Standards & Tools
Linked Open Data in Practice
-
26.
Let's Do It Galway 2012
-
27.
Galway Open Data Portal
-
28.
Galway Compass - Piers
-
29.
County Rank
http://county-rank.data-gov.ie/
-
30.
Fingal Fact Finder
http://vmsgov03.deri.ie:8080/data-cube-searcher/about.html
-
31.
World Bank Linked Data
World Bank Indicators http://worldbank.270a.info
World Bank Finances
World Bank Projects and Operations
World Bank Climate Change
Sarven Capadisli
-
32.
Europeana LOD Pilot
http://data.europeana.eu http://srvgal85.deri.ie/ab-app/
Fully open metadata
2.4 M objects
200 individual providers
15 countries
-
33.
Linked Sensor Middleware (LSM)
Live data
http://lsm.deri.ie
-
34.
Over 110,000 Live Data Sources
…and growing!!!
-
35.
[Figure: live data sources (railway station, traffic camera, flight information) published as RDF data]
Flight information update
CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. Altitude: 34000.0 (feet). Speed: 392 (kts). Departure: ARN. Destination: LHR
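One way to picture the step from a raw sensor message to RDF data is to parse the flight update into key/value pairs and emit one triple per field. This is only a sketch: the `http://example.org/flight/` namespace and the lower-cased property names are assumptions, since the vocabulary LSM actually uses is not shown here.

```python
import re

UPDATE = (
    "CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. "
    "Altitude: 34000.0 (feet). Speed: 392 (kts). "
    "Departure: ARN. Destination: LHR"
)

# Placeholder namespace; the real LSM vocabulary is not given on the slide.
EX = "http://example.org/flight/"


def parse_update(text):
    """Extract `Key: value` pairs, ignoring unit annotations such as (feet)."""
    pairs = re.findall(r"(\w+): ([^\s.]+(?:\.\d+)?)", text)
    return dict(pairs)


def to_triples(fields):
    """Use the call sign as the subject and emit one triple per other field."""
    subject = EX + fields["CallSign"]
    return [
        (subject, EX + key[0].lower() + key[1:], value)
        for key, value in fields.items()
        if key != "CallSign"
    ]


fields = parse_update(UPDATE)
flight_triples = to_triples(fields)
```

Values are kept as strings here; a fuller version would attach datatypes and units to the literals.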
-
36.
Super Stream Collider
LSM Sensors
SPARQL Endpoint
http://superstreamcollider.org
-
37.
Linked Data in Systems Biology
~20 000 genes
High-throughput technologies
~100 interesting genes/proteins
Computational statistics
~10 interesting pathways
Browse databases
Literature
~5 proteins testable in the lab
Linked Data Hypothesis Generation
“I like to call it low-input, high-throughput, no-output biology.”
-
38.
Data.gov.uk Linked Open Data
-
39.
Data.gov Linked Open Data
Clinical Quality Linked Data on Health.data.gov
EPA's Facility Registry and Substance Registry
-
40.
Norwegian National Master Data as LOD
Norwegian master data:
Business (Legal Entities)
Property (inc. map data)
Citizen
The Central Coordinating Register for Legal Entities (RLE)
~1 million companies, 40 attributes
Norwegian Semantic Repository of Electronic Services (SERES)
Metadata repository
Register of Company Accounts
Myrseth, P., et al., National Master Data as 5 Star Linked Open Data, in Electronic
Government (eGov2012). 2012, Trauner-Verlag: Kristiansand, Norway.
-
41.
Fire Department Amsterdam-Amstelland
Bart van Leeuwen – Fire fighter & netage.nl
http://blog.resc.info/
Problem:
Masses of data
Navigation system didn’t work
Operational risks due to communication failure
Need for:
Structured incident information
Used by >15 Fire Stations in the greater Amsterdam area
All Linked Data published on Web
-
42.
New York Times
http://data.nytimes.com/
-
43.
How the BBC makes Websites
Develop a domain model
Populate your data model
Design URIs
Build pages
Apply layout and decor CSS
Test and iterate
Mike Atherton, ‘Beyond the Polar Bear’
http://www.slideshare.net/reduxd/beyond-the-polar-bear
-
44.
Proof Points
Massive Industry Adoption
-
45.
Open Data Publishing Pipeline (ODPP)
Difficulty with Publishing Open Data:
Remains quite a manual process
Modular Data Management System for publishing standard Open Data, based on Open Source components.
http://publishing-pipeline.com/
-
46.
Open Data Publishing Pipeline (ODPP)
-
47.
European Data Forum 2013
April 9th/10th, Dublin
Interoperability solutions for European public administrations
All of the datasets that contained statistics at the time of writing (about 60)
Linked Data workshop at DRI’s Realising the Opportunities of Digital Humanities
LSM (Linked Sensor Middleware): a platform that brings together live, real-world sensor data and the Semantic Web. An LSM deployment is available at http://lsm.deri.ie/. It provides functionality such as: i) wrappers for real-time data collection and publishing; ii) a web interface for data annotation and visualisation; and iii) a SPARQL endpoint for querying unified Linked Stream Data and Linked Data.
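As a hedged sketch of how a client might address such a SPARQL endpoint, the snippet below builds the request URL for a query using the standard SPARQL protocol convention of a `query` parameter on a GET request. The endpoint address `http://example.org/sparql` is a placeholder, not the actual LSM endpoint path, and the query is deliberately generic because the LSM vocabulary is not described here. Nothing is sent over the network.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Placeholder endpoint; the real LSM endpoint path is not given in these notes.
ENDPOINT = "http://example.org/sparql"

# Generic query: list any ten statements held by the store.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10"


def build_request_url(endpoint, query):
    """Encode the query as the `query` parameter of a GET request URL."""
    return endpoint + "?" + urlencode({"query": query})


url = build_request_url(ENDPOINT, QUERY)
```

The resulting URL could then be fetched with any HTTP client, with an `Accept` header selecting the result format the endpoint supports.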
SSC (Super Stream Collider): Developed on top of LSM, SSC is a platform that provides a web-based interface and tools for building sophisticated mashups, combining semantically annotated Linked Stream and Linked Data sources into easy-to-use resources for applications.
Messages:
Finding the mathematics of biology: patterns and interrelatedness of biological entities.
Biological data in computational formats: automating data analysis and annotation is a dream that has not yet been achieved.
Technologies that could help make such a dream reality: transform the WWW into a computational platform where read and write operations are supported and boundaries between knowledge systems are erased.
For years, RLE has offered online access via a web interface and web services. The main groups of users were government bodies and legal entities. The most common usage patterns are verifying the existence of a legal entity and listing the CEO, board, etc., but there are also increasing requests for interoperability.
Understanding
– No acknowledgement of information shared
Interpretation
– Terms not always used in the right context
– Non-aligned vocabularies between disciplines
Massive uptake in industry adoption
~400 suppliers
Enterprise Software: HP, IBM, Microsoft, Oracle, SAP, and Software AG
Search: Bing & Google; Freebase, Refine, Squared & Rich snippets
Social: LinkedIn & Facebook
eCommerce: Best Buy & Overstock
Publishing: Thomson Reuters
Standards: OMG, ISO, W3C and OASIS
Linked Open Data: interdisciplinary data set of 50B facts
Exponential growth