Sharing data on the web (2013)

494 views

Published on

Toronto Semantic Web Meetup

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
494
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
17
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Sharing data on the web (2013)

  1. 1. Sharing Data on The Web A Semantic Web Tutorial James Leigh @jamesrdf james@3roundstones.com February 2013
  2. 2. Towards Linked Open Data● On the Web with an open license ★ “Its great to have the data accessible on the Web under an open license, however, the data is locked-up in a document. Other than writing a custom scraper, its hard to get the data out of the document.”
  3. 3. Towards Linked Open Data On the Web with an open license● ★ Machine-readable data ★● “Splendid! The data is accessible on the Web in a structured way (that is, machine-readable), however, the data is still locked-up in a document. To get the data out of the document you depend on proprietary software.”
  4. 4. Towards Linked Open Data On the Web with an open license● ★ Machine-readable data ★● Non-proprietary format ★● “Excellent! The data is not only available via the Web but now everyone can use the data easily. On the other hand, its still data on the Web and not data in the Web.”
  5. 5. Towards Linked Open Data● On the Web with an open license ★ Machine-readable data ★● Non-proprietary format ★●● RDF standards “Wonderful! Now its data in the Web. The★ (most important) data items have a URI and can be shared on the Web. A native way to represent the data is using RDF, however other formats such as Atom can be converted/mapped, if required.”
  6. 6. Towards Linked Open Data On the Web with an open license● ★ Machine-readable data ★● Non-proprietary format ★●● RDF standards● Linked RDF ★ ★ “Brilliant! Now its data, in the Web linked to other data. Both the consumer and the publisher benefit from the network effect. ”
  7. 7. Path to Success● Start easy – Well curated – Small dataset● Reach out to developers● Get others involved early● Ensure internal benefit● Maximize productivity● Integrate related dataset
  8. 8. Choosing a License● Allow modification?● Require attribution?● Permit commercial uses?● Must others share alike?● What jurisdiction?
  9. 9. Address Data Quality● Completeness● Conformity● Consistency● Accuracy● Integrity● Eliminate duplication
  10. 10. Put It On The Web● Upload it and share ★● Document what is available● Document how to use it – Solve a consumer need● Encourage feedback – Continues improvement
  11. 11. Make It Machine-readable Simplify data structure● ★ Dont include unstable identifiers ★●● Automate publishing
  12. 12. Use a Non-proprietary Format Standardized formats ★● – CSV instead of excel – XML instead of MDB ★● Benefits: ★ – Accessibility – Interoperability – Reduce risk of ● Confidential info ● Software viruses
  13. 13. API Service Data access layer● ★ Atom Publishing Protocol ★● Google Data Protocol ★●● Open Data Protocol● Facebook Graph API● Linked Data
  14. 14. The Web Universal unidirectional links using URLs● ★ Graph of nodes identified by URL ★ –● Cooperation without coordination● Its simple nodes and Links ★
  15. 15. The Semantic Web A universal scalable interconnection bus● ★ Share and reuse data across ★● – Community boundaries – Application ★ – Enterprise
  16. 16. (re)Model the Data Identify existing model● ★ Reuse common vocabularies ★● Model data relationships ★● – Put aside particular needs of any application● Diagram relationships● Test assumptions with subject matter experts
  17. 17. Universal Identifiers Others can reference things ★●● The Web uses URL for ★ downloadable things● Uses URI for real world things and● abstract concepts Two references with the same URI ★ are the same thing● Quick, easy and scalable look-up
  18. 18. Publish RDF● RDF connects better than any other model ★ Cooperation without coordination ★● Data can be combined safely ★●● Mix of vocabularies ★
  19. 19. VoID Description● De facto standard for describing Linked Data ★ Used in linked data catalogues ★● Include provenance and attributions ★●● Associate consumers license ★
  20. 20. RDF Schema● Understand structure ★ of the graph – Self-describing ★● Define terms and ★ relationships ★● Classes of things
  21. 21. Simple Knowledge Organization● Controlled vocabularies ★ Taxonomies ★● Thesauri ★● ★
  22. 22. Controlled Vocabularies Strongly agree Mr. AB ★● ● ● ● BC● Agree Ms. MB ★ ● ●● Somewhat agree ● NB Neither agree nor Mrs. NL ★● ● ● disagree ● NS ● Miss. ON Somewhat disagree ★ ●● ● PE● Disagree ● Dr. ● QC● Strongly disagree ● SK ● NT ● NU ● YT
  23. 23. Taxonomies ★ ★ ★ ★
  24. 24. Thesauri ★ ★ ★ ★
  25. 25. Web Ontology Language● Property restrictions ★ Equivalences across ★● ontologies● Constraints ★● Automatic reasoners ★● Annotations
  26. 26. Linked RDF Linked your data to others● ★ Adds more related data ★● Add value to the data ★● – By joining it to other data ★ ★
  27. 27. Generating Data Websites Give ordinary users direct access the data● ★ Templates to generate HTML ★● – Generate static HTML offline – Produce HTML on-demand ★ ★ ★
  28. 28. Social Responsibility● Responsibility to maintain published data● Publish frequency of data updates● Have a persistence strategy● Ensure data is accurate as possible● Respond to reports of problematic data
  29. 29. Comments or Questions?

×