Linked Data: The Jargon-free Primer on Integrating Data on the Web

Dr. David Wood and Ms. Bernadette Hyland delivered this jargon-free presentation at the National Health Datapalooza in Washington DC on how and why integrating data from the Web matters and why a Linked Data approach is relevant.

Transcript

  • 1. David Wood, Bernadette Hyland; {david, bhyland}@3RoundStones.com; @3RoundStones; Washington DC, 4 June 2013; Linked Data: Structured Data on the Web (the jargon-free version); Health Datapalooza
  • 2. 40% annual growth in data produced; 5% annual growth in IT spending. Digital Information Produced: 1.8 ZB (2012), 35 ZB (2020). Daily (2013) chart of online ad impressions, emails and tweets, axis from 1 trillion to 5 trillion; values shown: 294B, 230M, 4.8T.
  • 3. The United States in 2012: 314 million total population; 90 million software end users; 55 million users of spreadsheets/databases; 13 million “end user programmers”; 3 million professional programmers
  • 4. “Most programs today are written not by professional software developers, but by people with expertise in other domains working towards goals for which they need computational support.”
  • 5. Readable by people; Data in the Physical World
  • 6. Machine readable; Readable by motivated people
  • 7. Someone else (we don’t know); Schemas/Vocabularies
  • 8. johnson@example.com; Which Copy?
  • 9. Today’s Data on the Web
  • 10. Lack of Context
  • 11. Required Context
  • 12. [Diagram labels] my data; collector; collected by; measurement; Michael; first name; Hausenblas; last name; Person; a; a measurement; 2011-01-01; date; 0; value; units of measure; degrees Centigrade; ...; Galway Airport; collected at
  • 13. Linked Data on the Web. [Diagram labels] my data; collector; collected by; measurement; Michael; first name; Hausenblas; last name; Person; a; a measurement; 2011-01-01; date; 0; value; units of measure; degrees Centigrade; ...; Galway Airport; collected at; or
  • 14. Summary of Problems
    • How can we archive our data in an open manner?
    • How can we record data context?
    • How can we record data provenance?
    • How can we know whether our data is up to date?
    • How can we share our data with others?
  • 15. Linked Data is a way to answer these questions
  • 16. Linked Data
    • Provides an international standard mechanism to put reusable data on the World Wide Web
    • Provides a single data model with multiple formats
    • Provides context, provenance and access
    • Allows for both human and machine reuse
  • 17. Linked Data Principles
    • Name data files and elements with URIs
    • Use HTTP URIs so people can resolve them on the Web
    • Provide useful information at those URIs, using the standards (RDF, SPARQL)
    • Include links to other URIs so people can discover more information.
  • 18. NOAA; EPA AirNow; EPA Sunwise; Wikipedia; NLM
  • 19. Credits
    Population density image (public domain): http://en.wikipedia.org/wiki/File:USA-2000-population-density.gif
    2012 population estimate (CC-BY-SA): http://en.wikipedia.org/wiki/Demographics_of_the_United_States
    Programmer estimates: Scaffidi, C.; Shaw, M.; Myers, Brad, "Estimating the numbers of end users and end user programmers," Visual Languages and Human-Centric Computing, 2005 IEEE Symposium on, pp. 207-214, 20-24 Sept. 2005. doi: 10.1109/VLHCC.2005.34. Retrieved 27 May 2013 from http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1509505&isnumber=32326
    End user programmer quote: Andrew J. Ko, Robin Abraham, Laura Beckwith, Alan Blackwell, Margaret Burnett, Martin Erwig, Chris Scaffidi, Joseph Lawrance, Henry Lieberman, Brad Myers, Mary Beth Rosson, Gregg Rothermel, Mary Shaw, and Susan Wiedenbeck. 2011. The state of the art in end-user software engineering. ACM Comput. Surv. 43, 3, Article 21 (April 2011), 44 pages. DOI=10.1145/1922649.1922658 http://doi.acm.org/10.1145/1922649.1922658. Retrieved 27 May 2013 from http://faculty.washington.edu/ajko/papers/Ko2011EndUserSoftwareEngineering.pdf.pdf
    Bag of chips idea: Open, Linked Data for a Global Community, Tim Berners-Lee, W3C, Gov2.0 Expo, Washington DC, May 25-27 2010. https://www.youtube.com/watch?v=1E7lV5_0M38
    Social media icons: courtesy of http://designreviver.com/freebies/6-free-new-social-icons-digg-twitter-stumble-rss-delicious-reddit/
    Corporate and product logos, CAMC credit card image and book covers © their respective owners and used under Fair Use for educational purposes
  • 20. This work is Copyright © 2011 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License. Full details at: http://creativecommons.org/licenses/by/3.0/
    You are free:
    to Share: to copy, distribute and transmit the work
    to Remix: to adapt the work
    Under the following conditions:
    Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
    Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.
  • 21. David Wood, Bernadette Hyland; {david, bhyland}@3RoundStones.com; @3RoundStones; Washington DC, 4 June 2013; Linked Data: Structured Data on the Web (the jargon-free version); Health Datapalooza
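
The Linked Data principles on slide 17 can be made concrete with the measurement example from slides 12 and 13. The following is a minimal Python sketch, not the presenters' implementation: it names each thing with an HTTP URI, states facts as subject/predicate/object triples, links out to another dataset, and prints the result in the N-Triples format. Everything under http://example.org/ is a hypothetical namespace, the DBpedia URI for Galway Airport and the integer datatype for the value are assumptions, and the way the labels are wired together is only one plausible reading of the diagram.

```python
"""Sketch of slides 12, 13 and 17: one measurement as URI-named, linked statements."""

# Published vocabulary namespaces (RDF, FOAF, XML Schema datatypes).
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
FOAF = "http://xmlns.com/foaf/0.1/"
XSD = "http://www.w3.org/2001/XMLSchema#"

# Hypothetical namespace and external link, chosen only for illustration.
EX = "http://example.org/"
GALWAY_AIRPORT = "http://dbpedia.org/resource/Galway_Airport"


class Literal:
    """A plain data value, optionally tagged with a datatype URI."""

    def __init__(self, value, datatype=None):
        self.value = value
        self.datatype = datatype


measurement = EX + "measurement/1"
collector = EX + "people/michael"

# Each tuple is (subject, predicate, object); plain strings are URIs.
TRIPLES = [
    (measurement, RDF_TYPE, EX + "vocab/Measurement"),
    (measurement, EX + "vocab/date", Literal("2011-01-01", XSD + "date")),
    (measurement, EX + "vocab/value", Literal("0", XSD + "integer")),
    (measurement, EX + "vocab/unitsOfMeasure", Literal("degrees Centigrade")),
    (measurement, EX + "vocab/collectedBy", collector),
    (measurement, EX + "vocab/collectedAt", GALWAY_AIRPORT),
    (collector, RDF_TYPE, FOAF + "Person"),
    (collector, FOAF + "givenName", Literal("Michael")),
    (collector, FOAF + "familyName", Literal("Hausenblas")),
]


def term(t):
    """Render one URI or Literal in N-Triples syntax."""
    if isinstance(t, Literal):
        escaped = t.value.replace("\\", "\\\\").replace('"', '\\"')
        text = f'"{escaped}"'
        return f"{text}^^<{t.datatype}>" if t.datatype else text
    return f"<{t}>"


def to_ntriples(triples):
    """Serialize triples, one statement per line."""
    return "\n".join(f"{term(s)} {term(p)} {term(o)} ." for s, p, o in triples)


def outbound_links(triples, base):
    """URIs in object position (ignoring type statements) outside our namespace."""
    return sorted(
        {
            o
            for _, p, o in triples
            if isinstance(o, str) and p != RDF_TYPE and not o.startswith(base)
        }
    )


if __name__ == "__main__":
    print(to_ntriples(TRIPLES))
    print("Links to follow:", outbound_links(TRIPLES, EX))
```

The same list of triples could be written out in other RDF formats without changing the data itself, which is the "single data model with multiple formats" point on slide 16. The outbound link is what lets a reader, or a program, discover more information about the airport from another source.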