Linked Open Data

Presentation at 2nd Open Data Ireland Meetup, Dublin, Ireland
https://groups.google.com/forum/?fromgroups=#!forum/open-data-ireland

  1. Linked Open Data
     Digital Enterprise Research Institute, www.deri.ie
     Deirdre Lee, Digital Enterprise Research Institute (DERI), NUI Galway
     22nd November 2012
     © Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
     Enabling Networked Knowledge
  2. DERI, NUI Galway
     • Centre for Science, Engineering and Technology (CSET) established in 2003 with funding from the Science Foundation Ireland (SFI)
     • ~130 researchers
     • Research Areas: Semantic Web, Web Science, Social Networks, Data Mining, Information Systems
     • Application Areas: eGovernment, Bioinformatics, Security, eBusiness and financial services, eHealth, and Green & Sustainable IT
  3. DERI, NUI Galway
     • National funding from SFI, EI, IRCSET and industrial collaborations
     • EC Funding: FP6, FP7, etc.
     • DERI technology driving 100,000s of Websites (i.e. in Drupal)
     • DERI technology installed on countless desktops (i.e. in Linux)
     • ~100 industry and public partners
       • Avaya, Alcatel-Lucent, Celtrak, Cisco, Ericsson, FBK, OpenLink, Storm Technology, etc.
     • > 1,000 peer-reviewed papers
     • Actively participate in 17 standardisation activities (W3C, OASIS)
  4. • What is Linked Open Data? Open Data?
     • Linked Data Standards & Tools
     • Linked Open Data in Practice
  5. Public Open Data
     • Difficult to find
     • Difficult to reuse
     • Difficult to integrate
  6. What is Linked Open Data?
  7. What is Linked Open Data?
     [Diagram: IRELAND with the relations hasRugbyTeam, hasCapital, hasGovernment, hasMusicGroup, hasUniversity, hasUnemployment]
  8. What is Linked Open Data?
     Facilitating data integration through:
     • Common data model
     • Building relations
  9. Two Key Ingredients
     1. RDF – Resource Description Framework (graph-based data)
        • Identifies objects (URIs)
        • Interlinks information (relationships)
     2. Vocabularies (ontologies)
        • Provide shared understanding of a domain
        • Organise knowledge in a machine-comprehensible way
        • Give an exploitable meaning to the data
  10. LOD Cloud: http://lod-cloud.net
  11. TimBL’s 5 ★ Open Data
  12. ★ On the Web, Open License
      • On the Web
        • Wide access
        • Google can index it
        • People can find it themselves
      • Open License
        • Regulates reuse of data
        • Helps maintain provenance
        • Strengthens business reuse
        • http://opendefinition.org/licenses/
  13. ★ ★ Structured Data
      • Machine-readable
  14. Screenscraping
      • People use tools like ScraperWiki to get at data that isn't machine-readable
        • https://scraperwiki.com/tags/ireland
      • Scraping is problematic because:
        • It is expensive
        • It is brittle
        • It puts a strain on computing resources
  15. Formats
      • Good: MS Excel, CSV, XML, JSON, Microdata
      • Not so good: pure websites, MS Word
      • Bad: PDF
      • Really bad: only charts/maps without numbers, images
  16. ★ ★ ★ Non-Proprietary Formats
      Freedom of how to process, analyse and visualise data
      • Proprietary: Word, Excel, PDF
      • Non-proprietary: CSV, XML, JSON, Microdata, RDF
  17. ★ ★ ★ ★ Use URIs
      Unique identifiers enable others to point to the data.
      • <http://www.deri.ie/about/team/member/Deirdre_Lee>
      • <http://www.deri.ie/publications#uid_339>
  18. ★ ★ ★ ★ ★ Linking Data
      Link your data to other data to provide context: http://lod-cloud.net
  19. • What is Linked Data
      • Linked Data Standards & Tools
      • Linked Open Data in Practice
  20. Linked Data Standards
      Government Linked Data (GLD) WG: www.w3.org/2011/gld/
  21. Linked Open Metadata
      Data Catalog Vocabulary (DCAT): http://www.w3.org/TR/vocab-dca
  22. Integrating Linked Metadata Repositories
      Shukair, G., et al., Integrating Linked Metadata Repositories in the Web of Data, in Third International Workshop on Consuming Linked Data (COLD 2012) at ISWC 2012: Boston, US.
  23. Domain-Specific Vocabularies
      • JOINUP European Commission ISA Semantic Assets
        • Core Person Vocabulary
        • Core Location Vocabulary
        • Core Business Vocabulary
        • Core Public Service Vocabulary
        • http://joinup.ec.europa.eu/
      • Data Cube Vocabulary
        • http://www.w3.org/2011/gld/wiki/Data_Cube_Vocabulary
      • Vocab Lists
        • http://vocab.deri.ie/
        • http://vocab.data.gov/
  24. Linked Data Tools
  25. • What is Linked Data
      • Linked Data Standards & Tools
      • Linked Open Data in Practice
  26. Let's Do It Galway 2012
  27. Galway Open Data Portal
  28. Galway Compass - Piers
  29. County Rank: http://county-rank.data-gov.ie/
  30. Fingal Fact Finder: http://vmsgov03.deri.ie:8080/data-cube-searcher/about.html
  31. World Bank Linked Data
      http://worldbank.270a.info (Sarven Capadisli)
      • World Bank Indicators
      • World Bank Finances
      • World Bank Projects and Operations
      • World Bank Climate Change
  32. Europeana LOD Pilot
      http://data.europeana.eu
      http://srvgal85.deri.ie/ab-app/
      • Fully open metadata
      • 2.4 M objects
      • 200 individual providers
      • 15 countries
  33. Linked Sensor Middleware (LSM)
      Live data: http://lsm.deri.ie
  34. Over 110,000 Live Data Sources
      …and growing!!!
  35. [Diagram labels: Railway Station, Traffic camera, RDF data]
      Flight information update: CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. Altitude: 34000.0 (feet). Speed: 392 (kts). Departure: ARN. Destination: LHR
  36. Super Stream Collider
      [Diagram labels: LSM, Sensors, SPARQL Endpoint]
      http://superstreamcollider.org
  37. Linked Data in Systems Biology
      • ~20 000 genes
      • ~100 interesting genes/proteins
      • ~10 interesting pathways
      • ~5 proteins testable in the lab
      [Diagram labels: High-throughput technologies, Computational statistics, Browse databases, Literature, Linked Data, Hypothesis Generation]
      “I like to call it low-input, high-throughput, no-output biology.”
  38. Data.gov.uk Linked Open Data
  39. Data.gov Linked Open Data
      • Clinical Quality Linked Data on Health.data.gov
      • EPA's Facility Registry and Substance Registry
  40. Norwegian National Master Data as LOD
      Norwegian master data: Business (Legal Entities), Property (inc. map data), Citizen
      • The Central Coordinating Register for Legal Entities (RLE)
        • ~1 million companies, 40 attributes
      • Norwegian Semantic Repository of Electronic Services (SERES)
        • Metadata repository
      • Register of Company Accounts
      Myrseth, P., et al., National Master Data as 5 Star Linked Open Data, in Electronic Government (eGov2012). 2012, Trauner-Verlag: Kristiansand, Norway.
  41. Fire Department Amsterdam-Amstelland
      Bart van Leeuwen – Fire fighter & netage.nl, http://blog.resc.info/
      • Problem:
        • Masses of data
        • Navigation system didn’t work
        • Operational risks due to communication failure
      • Need for:
        • Structured incident information
      • Used by >15 Fire Stations in the greater Amsterdam area
      • All Linked Data published on Web
  42. New York Times: http://data.nytimes.com/
  43. How the BBC makes Websites
      • Develop a domain model
      • Populate your data model
      • Design URIs
      • Build pages
      • Apply layout and decor CSS
      • Test and iterate
      Mike Atherton, ‘Beyond the Polar Bear’, http://www.slideshare.net/reduxd/beyond-the-polar-bear
  44. Proof Points
      • Massive Industry Adoption
  45. Open Data Publishing Pipeline (ODPP)
      • Difficulty with Publishing Open Data:
        • Remains quite a manual process
      • Modular Data Management System for publishing standard Open Data, based on Open Source components
      http://publishing-pipeline.com/
  46. Open Data Publishing Pipeline (ODPP)
  47. European Data Forum 2013
      April 9th/10th, Dublin
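
The RDF idea from slides 7 to 9 (resources identified by URIs, connected by named relationships) can be sketched in a few lines of plain Python. This is only an illustration under assumptions, not DERI's tooling: the `http://example.org/` namespace, the object names, the `hasInstitute` predicate and the helper functions are hypothetical; only `hasCapital` and `hasUniversity` appear on slide 7.

```python
# Minimal sketch of RDF-style triples using plain Python tuples.
# The example.org namespace and all object values are assumptions;
# only hasCapital and hasUniversity are taken from slide 7.

EX = "http://example.org/"  # assumed namespace, not from the slides


def uri(local_name: str) -> str:
    """Build a full URI from a local name in the assumed namespace."""
    return EX + local_name


# Each triple is (subject, predicate, object); every part is a URI.
triples = [
    (uri("Ireland"), uri("hasCapital"), uri("Dublin")),
    (uri("Ireland"), uri("hasUniversity"), uri("NUI_Galway")),
    # hasInstitute is a hypothetical predicate added to show chaining.
    (uri("NUI_Galway"), uri("hasInstitute"), uri("DERI")),
]


def objects(graph, subject, predicate):
    """Return every object linked from `subject` via `predicate`."""
    return [o for s, p, o in graph if s == subject and p == predicate]


def to_ntriples(graph):
    """Serialise URI-only triples in N-Triples syntax: <s> <p> <o> ."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in graph)


if __name__ == "__main__":
    print(objects(triples, uri("Ireland"), uri("hasCapital")))
    print(to_ntriples(triples))
```

Because every subject, predicate and object is a URI, a second dataset that reuses the same URIs can be merged by concatenating the two lists, which is the "common data model" point made on slide 8.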

Editor's Notes

  • Interoperability solutions for European public administrations
  • All of the datasets that contain statistics at the time of writing (they were about 60)
  • Linked Data workshop at DRI’s Realising the Opportunities of Digital Humanities
  • LSM (Linked Sensor Middleware): a platform that brings together live, real-world sensed data and the Semantic Web. An LSM deployment is available at http://lsm.deri.ie/. It provides functionality such as: i) wrappers for real-time data collection and publishing; ii) a web interface for data annotation and visualisation; and iii) a SPARQL endpoint for querying unified Linked Stream Data and Linked Data.
  • SSC (Super Stream Collider): developed on top of LSM, SSC is a platform that provides a web-based interface and tools for building sophisticated mashups, combining semantically annotated Linked Stream and Linked Data sources into easy-to-use resources for applications.
  • Messages: Finding the mathematics of biology: patterns and interrelatedness of biological entities. Biological data in computational formats: automated data analysis and annotation is a dream that has not yet been achieved. Technologies that could help make such a dream reality: transform the WWW into a computational platform where read and write operations are supported and boundaries between knowledge systems are erased.
  • For years, RLE has offered online access via a web interface & web services. The main groups of users were government bodies and legal entities. The most common usage patterns are verifying the existence of a legal entity and listing the CEO, board, etc. There are also increasing requests for interoperability.
  • Understanding: no acknowledgement of information shared. Interpretation: terms not always used in the right context; non-aligned vocabularies between disciplines.
  • Massive update in industry adoption: ~400 suppliers. Enterprise Software: HP, IBM, Microsoft, Oracle, SAP, and Software AG. Search: Bing & Google; Freebase, Refine, Squared & Rich snippets. Social: LinkedIn & Facebook. eCommerce: Best Buy & Overstock. Publishing: Thomson Reuters. Standards: OMG, ISO, W3C and OASIS. Linked Open Data: interdisciplinary data set of 50B facts; exponential growth.
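
The wrapper step mentioned in the LSM note (collecting a raw reading and publishing it as RDF) might look roughly like this for the flight information update on slide 35. This is a sketch under stated assumptions and not LSM's actual API: the regular expression, the `http://example.org/flight/` namespace and both function names are hypothetical, and units are parsed but not carried into the triples.

```python
import re

# Raw reading copied from slide 35.
READING = (
    "CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. "
    "Altitude: 34000.0 (feet). Speed: 392 (kts). "
    "Departure: ARN. Destination: LHR"
)

EX = "http://example.org/flight/"  # assumed namespace, not from the slides

# "Key: value", an optional " (unit)", then ". " or the end of the text.
FIELD = re.compile(r"(\w+): (\S+?)(?: \((\w+)\))?(?:\. |\.?$)")


def parse_reading(text):
    """Return {key: (value, unit_or_None)} for each field in the reading."""
    return {m.group(1): (m.group(2), m.group(3)) for m in FIELD.finditer(text)}


def to_triples(fields):
    """Turn parsed fields into (subject URI, predicate URI, literal) triples.

    The call sign identifies the subject; units are dropped in this sketch.
    """
    subject = EX + fields["CallSign"][0]
    return [
        (subject, EX + key, value)
        for key, (value, _unit) in fields.items()
        if key != "CallSign"
    ]


if __name__ == "__main__":
    for triple in to_triples(parse_reading(READING)):
        print(triple)
```

Using the call sign to mint the subject URI follows the "Use URIs" step of the 5-star scheme: once the flight has an identifier, other sources can point to it.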