Linked Data to Improve the OER Experience


Published on

Nelson Piedra , Janneth Chicaiza
and Jorge López, Universidad Técnica Particular de Loja, Edmundo
Tovar, Universidad Politécnica de Madrid,
and Oscar Martínez, Universitas
Miguel Hernández
Explore the advantages of using linked data with OERs.

Published in: Education
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Linked Data to Improve the OER Experience

  1. 1. Linked Data to enhance users experience with Open Education Resources<br />Edmundo Tovar (UPM ) - Nelson Piedra (UTPL,<br />Jorge López, JannethChicaiza, Oscar Martinez<br />OCWC Global 2011: Celebrating 10 years of OpenCourseWare<br />Cambridge, Massachusetts, USA<br />May 4, 2011 – May 6, 2011<br />this work is licensed under a Creative Commons Attribution 3.0 License<br />
  2. 2. some motivations<br />
  3. 3. Status of the Current Web<br />Web Content:<br />Diversity of Platforms and systems (CMS, APIs); Web silos of content.<br />A lot of Data contained in millions of web-documents (embedded within Content Management Systems).<br />Discoverability and reusability:<br />Not possible to reuse data outside the CMS/WebApps<br />Not data available as unified machine readable format<br />Diversity of APIs <br />
  4. 4. Silos of data/relations/informations: Walled Social Networks Cartoon<br />
  5. 5. The World Wide Web is defined as the universal, all-encompassing space containing all Internet - and other - resources referenced by Uniform Resource Identifiers (URIs, sometimes commonly called "URLs").<br />
  6. 6. The Web was designed as an information space, with the goal that it should be useful not only for human-human communication, but also that machines would be able to participate and help. [Tim Berners-Lee] <br />
  7. 7. One of the major obstacles to this has been the fact that most information on the Web is designed for human consumption, and even if it was derived from a database with well defined meanings (in at least some terms) for its columns, that the structure of the data is not evident to a robot browsing the web. [TBL]<br />
  8. 8. Leaving aside the artificial intelligence problem of training machines to behave like people, the Semantic Web approach instead develops languages for expressing information in a machine processable form. [TBL]<br />
  9. 9. Web of Documentwhere links are relationships anchors in hypertext documents written in HTML<br />from humans for humans<br />and machines? no really welcome....<br />
  10. 10. Web of Data<br />Discovery, Access, and Usages of Resources in the Web<br />
  11. 11. Web of Data,Link DATA rather than DOCUMENTS <br />Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. <br />Linked Data as a best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF. Links  between arbitrary things described by RDF<br />
  12. 12. Semantic WEB<br />The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data.  With linked data, when you have some of it, you can find other, related, data.<br />
  13. 13. Linked-Data Principles<br />Like the web of hypertext, the web of data is constructed with documents on the web.<br />Use URIs to name things on the Web (for naming everything real or abstract you could think of)<br />Use HTTP URIs so that people can look up those names. <br />When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) <br />Include links to other URIs. so that they can discover more things. <br />
  14. 14. Linked Data Key Concepts<br />resource -- an thing of interest (OCW, syllabus, test, lecture, presentation, document, OER, ... )<br />URI -- global identifier for a resource. The URIs identify any kind of object or  concept. <br />representation -- data corresponding to the state of a resource <br />information resource -- a “document” containing information <br />non-information resource -- anything else <br />associated description -- representation describing a Semantic Web resource<br />
  15. 15. The Resource Description Framework (RDF)<br />RDF: The Resource Description Framework (RDF) is a family of World Wide Web Consortium (W3C) specifications originally designed as a metadatadata model. It has come to be used as a general method for conceptual description or modeling of information that is implemented in web resources, using a variety of syntax formats.<br />Resource - everything with a URI<br />Description - properties of these resources<br />Framework - a system to build on<br />
  16. 16. The RDF Data Model<br />RDF Triples<br />Resource Description Framework"<br />subject<br /><uri><br />traits or aspects of the resource<br />predicate<br />typed-link<br />traits or aspects of the resource<br />object<br /><uri> or “literal”<br />traits or aspects of the resource<br />
  17. 17. Linked Data core stack<br />SPARQL<br />RDF<br />HTTP<br />URI<br />
  18. 18. Linked Data Core Stack, Specifications<br />RFC 2616 Hypertext Transfer Protocol -- HTTP/1.1: Defines HTTP, a generic and stateless application-level protocol for distributed, collaborative, hypermedia information systems.<br />RFC 3986 Uniform Resource Identifier (URI): Generic Syntax: Defines a generic URI syntax and a process for resolving URI references that might be in relative form, along with guidelines and security considerations for the use of URIs on the Internet.<br />RDF Concepts and Abstract Syntax: Defines the RDF graph data model and key concepts.<br />SPARQL Query Language for RDF: Defines defines the syntax and semantics of the SPARQL query language for RDF.<br />
  19. 19. The Linking Open Data cloud diagram, evolution<br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.”<br />Next images sequence shows datasets that have been published in Linked Data format, by contributors to the Linking Open Data community project and other individuals and organisations. It is based on metadata collected and curated by contributors to the CKAN directory. Last updated: 2010-09-22. <br />
  20. 20. 25 datasets<br />20071008<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  21. 21. 28 datasets<br />20071107<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  22. 22. 28 datasets<br />20071110<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  23. 23. 32 datasets<br />20080228<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  24. 24. TED2009 Tim Berners-Lee on the next Web<br />Linked Data is Real<br />
  25. 25. 34 datasets<br />20080331<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  26. 26. 45 datasets<br />20080919<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  27. 27. 89 datasets<br />20090305<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  28. 28. 93 datasets<br />20090327<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  29. 29. 95 datasets<br />20090714<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  30. 30. 203 datasets<br />20100922<br /><br />“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.” This work is available under a CC-BY-SA license. <br />
  31. 31. LinkedOCWData Project<br />
  32. 32. GOAL LinkedOCWData Project<br />Our Goal: One of the main objectives of our team is:<br />to connect Linked Data with Open Educational Resources, aiming to improve users’ experience of OERs and OCWs.<br />To explore how the principles of Linked Data can be used to help teachers, students and self-learners find open content.<br />Motivation: There is an enormous amount of information and digital resources that our OCWs Officess produce in their respective universities. We must find a way to exchange quick and easy access; we must put all information in one place and under the same principles of design.<br />
  33. 33. ABOUT LinkedOCWData Project<br />About Project: LinkedOCWData project is to make the OpenCourseWare (OCW) and OpenEducationalResources(OER) datasets easily available as RDF. Additionally, we are providing interlinking with DBpedia, GeoNames and Univerisities Linked Datasets.<br />Expected outcome: The result is a rich, open and integrated dataset which we hope to be useful for research and application development. The datasets can be publicly accessed via downloads, Linked Data, and SPARQL-endpoints. We have also launched an experimental “Live-SPARQL-endpoint” that is synchronized with the minutely updates from OCW/OER initiatives whereas the changes to our store are republished as RDF.<br />
  34. 34. Solutions:<br />The key to achieving these objectives is the disambiguation of open educational materials with unique identifiers through a consensual vocabulary. That is, permanent links which contain structured data that can be read automatically by computers.<br />In resolving a URI, data may be obtained on the course/open resource -author, content, licensing, data on the principles of design and content localization-. <br />Development Cycle: LinkedOCWData Project<br />
  35. 35. Life Cycle: LinkedOCWData Project<br />Linked Data requires you to assign URLs to identify things: people, places, events, whatever.<br />
  36. 36. Consensual vocabulary to assess the advantages of using Linked Data in Open Educational Materials.<br />1. Deciding what content to look for. Faced with a wide range of open content, it is difficult to decide between materials of the same subject. Thus, it will be possible to anticipate some of the interests of OCW users. <br />2. We can help people get more information by using the URL of an open educational resource, and connecting it automatically to related information found in the internet cloud, conveniently and unobtrusively. It consequently adds context to knowledge and open educational materials<br />A challenging task: LinkedOCWData Project<br />
  37. 37.
  38. 38. A Best Practice: Linked Open Data star scheme by example<br />Tim Berners-Lee suggested a 5-star deployment scheme for Linked Open Data and Ed Summers provided a nice rendering of it. <br />★ make your stuff available on the Web (whatever format) under an open license<br />★★ make it available as structured data (e.g., Excel instead of image scan of a table)<br />★★★ use non-proprietary formats (e.g., CSV instead of Excel)<br />★★★★ use URIs to identify things, so that people can point at your stuff<br />★★★★★ link your data to other data to provide context<br />from: <br />
  39. 39. Example: Linked Data applied to OpenCourseWare<br />Opencourseware description:<br />Ontologies and Semantic Web, 2008<br />The general objective is to provide students with a sound grounding of scientific, methodological and technological fundamentals in Ontological Engineering and the Semantic Web areas. This knowledge will be later used to build applications that can integrate, combine and infer heterogeneous and distributed information.<br />Guadalupe Aguado;  Oscar Corcho; Asunción Gómez<br />Departamento de Inteligencia Artificial.Facultad de Informática.<br />A four-monthly subject in the following official master programs: Master in Information Technologies<br />European Master in Logic Computation (ERASMUS Mundus Program)<br />Master in Research in Artificial Intelligence<br />Ontology Network<br /> <br />Last revision: April 2008.<br /> <br />It is credited 6 ECTS.<br />Presential classes: 2 hours/week.<br />Total learning time: 32 hours.<br /><br />
  40. 40. raw data within UPM opencourseware web page<br />OCW University<br />knowledge area<br />Title<br />Author(s)<br />Department<br />syllabus<br />bibliography<br />year<br />ects credits<br />time autoself<br />description<br />
  41. 41. ‘Ontologies and Semantic Web’<br />subject<br />has a<br />author-creator<br />predicate<br />whose name is<br />Asunción Gómez Pérez<br />object<br />
  42. 42.<br />subject<br />has a<br /><br />predicate<br />whose name is<br />Asunción Gómez Pérez<br />object<br />
  43. 43. RDF graph of OpenCourseWare<br />
  44. 44. Next Applications<br />“investigate and prototype the use of linked data technologies and approaches to linking and exposing data for students and researchers”. Plan for development of specific applications targeting: <br />Students: in finding resources related to courses, topics, and helping selecting courses to enroll to – <br />Researchers: Identify interesting connections/ research questions from research data linked to OU/external sources• Already a number of (more generic) applications emerging…<br />
  45. 45. Resource Types<br />geo:<br />sioc:<br />tag:<br />rdf:<br />rdfs:<br />owl:<br />Dublin Core, dc:<br />dcterms:<br />Person, foaf:<br />fti:<br />Concept, skos:<br />Bibliographic Ontology, bibo:<br />
  46. 46. example for locwd dataset<br />
  47. 47. OCW-dataset guidelines<br />There must be resolvable http:// (or https://) URIs.<br />They must resolve, with or without content negotiation, to RDF data in one of the popular RDF formats (RDFa, RDF/XML, Turtle, N-Triples).<br />The dataset must contain at least 1000 triples. (Hence, your FOAF file most likely does not qualify.)<br />The dataset must be connected via RDF links to a dataset that is already in the diagram. This means, either your dataset must use URIs from the other dataset, or vice versam. We arbitrarily require at least 50 links.<br />Access of the entire dataset must be possible via RDF crawling, via an RDF dump, or via a SPARQL endpoint.<br />
  48. 48. So a lot of work has gone into use and reuse resources integration: we wanted to explore how to automate some of the process. To publish OCW and OER on our websites, we needed to capture the metadata naturally produced in the production process and expose it online. CMS:<br />Educommons<br />Moodle<br />Drupal<br />DSpace<br />integration of OCW Content Management Systems <br />
  49. 49. to finish<br />Linked Data as a best practice for exposing, sharing, connecting, and interoperate from opencourseware resources, pieces of data, information, and knowledge on the Semantic Web using URIs and RDF. <br />
  50. 50. TED2009 Tim Berners-Lee on the next Web<br />Linked Data is Data Interoperability<br />The need for communication and interoperation between autonomous and distributed information systems is increasing with the increasing usage of the Web.<br />
  51. 51. linkeddata in the internet of future<br />The OSI/OSI 7-Layer architecture is a conceptual view on networking architectures. One possible view is a look at Linked Data as an independent layer in the Internet architecture, on top of the networking layer, but below the application layers, since it provides a common data model for all applications as shown in the figure below. <br />Figure OSI/OSI Model and envisioned Linked Data Interoperability Layer<br /><br />
  52. 52. Linked Data achievements and challenges<br />achievements:<br />The Linked Data initiative has given rise to an increasing number of RDF datasets, many of which are freely accessible online. Extension of the Web using standards technologies (HTTP, URI, RDF, SPARQL)<br />Enterprise and Industrial uptake begins (e.g. BBC, Thomson Reuters)<br />Emerging governmental adoption in sight (e.g. UK, USA)<br />Establishing Linked Data as a deployment path for the Semantic Web<br />Enthusiastic global community <br />challenges:<br />To scale beyond the initial Linked Data community: academia, “skilled artisans”. <br />Enconding knowledge in the form of tools, vocabularies, best practices and design patterns <br />Changing the nature of the linked Data: largely static and slow moving environment to one that is much more lively and real-time.<br />Increasing number of primary sources with high quality data.<br />
  53. 53. Conclusion<br />Linked data is essential to actually connect the semantic web.  It is quite easy to do with a little thought, and becomes second nature.   Various common sense considerations determine when to make a link and when not to.<br />Linked Data, and RDF specifically, it is so much more portable than any other format. This means that teorically any organization/person can easily replicate data, to share and providing access. With Linked Data the OCW Initiatives have the option of federating or sharing data across the web.<br />Linked data benefits: linking the pieces of data, information, knowledge on the Web from disparate resources. Machine processable information ready to use by web software-agents. Improved retrieval and discovery services<br />
  54. 54. References<br />The Semantic Web, Scientific American, May 2001, Berners-Lee, Hendler and Lassila W3C Semantic Web Activity, February 2001<br />Dave Beckett's RDF Resource Guide about RDF.<br />Semantic Web History: Nodes and Arcs 1989-1999 - The WWW Proposal and RDF, Dan Brickley, 1999-11-12<br />BERNERS-LEE, Tim. Linked data [online]. W3C, Last change 2009/06/18. Available from WWW:<br />