This document discusses opportunities and challenges of Linked Data. It begins with an overview of Linked Data principles like using URIs to identify things and linking related things. It then discusses enabling technologies like HTTP URIs and SPARQL queries. Opportunities mentioned include using the LOD cloud as a test bed and benefiting from linked context in applications. Challenges include large-scale processing of Linked Data and quality of links. The document concludes by emphasizing the potential of Linked Data to make data more valuable.
Linked Data: opportunities and challenges
1. Digital Enterprise Research Institute www.deri.ie
Linked Data:
opportunities and challenges
Dr. Michael Hausenblas, DERI, NUI Galway
Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012
Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
Enabling Networked Knowledge
3. Linked Data principles
① Use URIs to identify the “things” in your data
② Use HTTP URIs so people & machines can look them up
③ When a URI is looked up, return a description of the thing in a structured format (RDF)
④ Link to related things to provide context
http://www.w3.org/DesignIssues/LinkedData.html
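Principles ② and ③ can be illustrated with a minimal Python sketch. It assumes a server that supports content negotiation; the URI and the helper name build_lookup_request are hypothetical and not from the talk. The request is only constructed here, not sent.

```python
from urllib.request import Request


def build_lookup_request(thing_uri: str) -> Request:
    """Build an HTTP GET request asking for an RDF (Turtle) description."""
    # The Accept header asks for a structured format instead of an HTML page.
    return Request(thing_uri, headers={"Accept": "text/turtle"})


# Hypothetical HTTP URI identifying a "thing" (principles 1 and 2).
req = build_lookup_request("http://example.org/id/galway")
print(req.get_method(), req.full_url, req.get_header("Accept"))
```

Passing the request to urllib.request.urlopen would perform the actual lookup; whether RDF comes back depends on the server.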
10. Linked Open Data cloud
[Figure: snapshots of the Linked Open Data cloud diagram, 2007 to 2010]
11. Linked Open Data cloud
http://lod-cloud.net/
Over 300 open data sets with 40 billion facts, interlinked by 500 million typed links.
12. Linked Open Data cloud stats
triples distribution
links distribution
http://lod-cloud.net/state/
19. Schema.org – Linked Data
20. Publishing
① Data awareness: LOD cloud, 5stardata.info
② Modeling: Neologism, Schema.org
③ Publishing: Google Refine, D2RQ
④ Discovery: FYN, LATC DSI
⑤ Integration: LATC 24/7
⑥ Use cases: data-gov.ie
21. Google Refine extension
http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
22. RDB2RDF – D2RQ
http://d2rq.org/
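The general idea behind RDB2RDF mapping can be sketched in a few lines of Python: each row becomes a subject URI and each column becomes a predicate. This is not D2RQ's mapping language or API; the base URI, the table, and the function name rows_to_triples are assumptions made up for illustration.

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical base URI for minted identifiers


def rows_to_triples(conn, table, key_column):
    """Map every row of a table to (subject, predicate, object) triples."""
    # The table name is interpolated directly, so pass trusted names only.
    cur = conn.execute(f"SELECT * FROM {table}")
    columns = [d[0] for d in cur.description]
    triples = []
    for row in cur:
        record = dict(zip(columns, row))
        # One subject URI per row, built from the primary key.
        subject = f"{BASE}{table}/{record[key_column]}"
        for column, value in record.items():
            if column == key_column or value is None:
                continue
            # One predicate URI per column.
            triples.append((subject, f"{BASE}vocab/{table}#{column}", value))
    return triples


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE city (id INTEGER PRIMARY KEY, name TEXT, country TEXT)")
conn.execute("INSERT INTO city VALUES (1, 'Galway', 'Ireland')")
triples = rows_to_triples(conn, "city", "id")
for triple in triples:
    print(triple)
```

A real mapping would also turn foreign keys into links between subject URIs, which this sketch leaves out.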
23. Discovery
26. Integration
27. Why linking?
http://webofdata.wordpress.com/2011/05/22/why-we-link/
Central Contractor Registration (CCR)
Geonames
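A minimal sketch of what a link buys a consumer, in Python: following a typed link from one dataset into another adds context to a record. The two in-memory datasets, all URIs and values, the property name basedNear, and the helper with_context are made up for illustration and are not real CCR or Geonames data.

```python
# Two tiny in-memory "datasets", keyed by URI. All URIs and values are made up.
contractors = {
    "http://example.org/ccr/contractor/42": {
        "name": "Example Corp",
        # Typed link pointing into the other dataset.
        "basedNear": "http://example.org/geo/1001",
    },
}
places = {
    "http://example.org/geo/1001": {"name": "Springfield", "countryCode": "US"},
}


def with_context(uri, source, target, link_property):
    """Return a copy of a record, enriched by following one link."""
    record = dict(source[uri])
    linked = target.get(record.get(link_property))
    if linked is not None:
        # Replace the bare URI with the description found at the link target.
        record[link_property] = linked
    return record


enriched = with_context(
    "http://example.org/ccr/contractor/42", contractors, places, "basedNear"
)
print(enriched["name"], "is based near", enriched["basedNear"]["name"])
```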
28. Effort distribution
[Figure: fixed overall data integration effort, divided among publisher's effort, third-party effort, and consumer's effort]
29. LATC – Interlinking Platform
http://latc-project.eu/platform
31. Conclusion
Opportunities
Use the LOD cloud as test-bed (experiments)
Benefit from LOD cloud in apps (context)
Contribute to make your data more valuable
Challenges
Large-scale processing of Linked Data
Distributed/federated SPARQL queries
Quality of links and the data
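One narrow aspect of the link quality challenge can be sketched in Python: finding links whose target is not described in the dataset they point to. The sample links, the set of known targets, and the function name dangling_links are hypothetical; real link validation would also have to judge whether a link is semantically correct.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def dangling_links(links, known_targets):
    """Return the links whose target URI is missing from the target dataset."""
    return [(s, p, o) for (s, p, o) in links if o not in known_targets]


# Made-up links from dataset A into dataset B.
links = [
    ("http://example.org/a/1", OWL_SAME_AS, "http://example.org/b/1"),
    ("http://example.org/a/2", OWL_SAME_AS, "http://example.org/b/9"),
]
# URIs that dataset B actually describes (also made up).
known_targets = {"http://example.org/b/1", "http://example.org/b/2"}

broken = dangling_links(links, known_targets)
print(len(broken), "of", len(links), "links are dangling")
```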
32. Resources
Tutorials, technologies, specifications:
http://linkeddatabook.com
http://lod-cloud.net
http://linkeddata.org
http://linkeddata-specs.info
http://schema.rdfs.org
Videos:
http://ted.com/talks/tim_berners_lee_on_the_next_web.html - Tim Berners-Lee’s TED talk
http://www.youtube.com/watch?v=GKfJ5onP5SQ - Linked Data (and the Web of Data)
http://www.youtube.com/watch?v=4x_xzT5eF5Q - What is Linked Data?
http://vimeo.com/36752317 - Linked Open Data (by Europeana)
Editor's Notes
In the figure, each node represents a distinct dataset, and arcs indicate the existence of links between data elements in the two datasets.
Some 300 datasets, 35 billion facts, over 500 million links