Linked Data and the OpenART project

1,087
-1

Published on

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,087
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
14
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • Has been described as a ‘data commons’, or more usually a Web of Data.
  • Linked Data and the OpenART project

    1. 1. Linked Data and the OpenART Project<br />Tate, 7th October 2011<br />Julie Allinson, University of York<br />Richard Stephens<br />Steve Baylissand <br />Martin Dow, Acuity Unlimited<br />
    2. 2. “The term Linked Data refers to a set of best practices for publishing and connecting structured data on the Web.”<br />“the Semantic Web is the goal or end result… Linked Data provides the means to reach that goal”<br />From ‘Linked Data: The Story So Far’ - Heath, Bizer and Berners-Lee 2009<br />
    3. 3. The goal of Linked Data is to enable people to share structured data on the Web as easily as they can share documents today.<br />Bizer/Cyganiak/Heath Linked Data Tutorial, linkeddata.org<br />
    4. 4. Haven’t we been putting linked data on the web for years?<br /><ul><li>The web is full of data, yes, but structure in traditional web pages is largely ‘formatting’
    5. 5. And the ‘links’ are pretty unintelligent
    6. 6. Databases, search interfaces, XML etc are not really integrated into the web, just on it
    7. 7. Web 2.0 services tend to work against a fixed set of sources</li></li></ul><li>links to galleries<br />links to exhibitions<br />
    8. 8. Linked data<br />Linked data is about identifying the ‘real-world’ things we want to describe<br />and exposing rich information about these ‘things’<br />in the form of links<br />linked data applications operate on top of this unbound structured web of information<br />the www as one big database<br />
    9. 9. RDF<br />RDF (Resource Description Framework) is a way of modelling data, conceptually (as graphs)<br />it links things, not just documents<br />and creates ‘typed’ links<br />allows us to say meaningful things about entities<br />and the relationships between them<br />
    10. 10. Triples<br />Triples are the basis of RDF<br />Triples statements<br />Subject – Predicate - Object<br />Rather than<br />Displays<br />Painted<br />Links to<br />The <br />Snail<br />The <br />Snail<br />Gallery<br />Matisse<br />Web <br />Page<br />Web <br />Page<br />
    11. 11. One more thing about triples …<br />If we want to do ‘linked’ data.<br />Each piece of the triple should be a link.<br />In web speak, that means a HTTP URI<br /><ul><li>A unique identifier
    12. 12. MAY be de-referenced to a location (a URL)</li></ul>HTTP URIs are used to identify “real world” things<br />http://dbpedia.org/resource/Snail<br />http://viaf.org/viaf/42630086/#Matisse,_Henri,_1869-1954<br />
    13. 13. Linked<br />The job of OpenART is to expose data<br />But linking is where it gets interesting<br />And where the (potential) value for linked data starts to show<br />
    14. 14. Some Examples …<br />
    15. 15. data.gov.uk<br />Officially launched 21st January 2010<br />
    16. 16. Visualisation Prototype<br />Using Timemap – <br />Googlemaps and Simile<br />http://code.google.com/p/timemap/<br />Early stages with this<br />Will give location and ‘extent’ of archive.<br />Will link through to Archives Hub <br />
    17. 17.
    18. 18. SIG.MA<br />
    19. 19. Visualising multiple data sources<br />
    20. 20.
    21. 21. For discussion?<br />Quality and Trust<br />Licensing<br />Control and Loss of Control<br />Hype Curve<br />Other approaches<br />
    22. 22. OpenART : Open metadata for Art Research at the Tate<br />JISC funded<br /> part of their Infrastructure for Resource Discovery Strand<br />6 months (March to July) – short!<br />£30k – cheap!<br />Building on the (much bigger) York’s History of Art Department and Tate’s Court Country City project funded by the AHRC.<br />The Digital Library are leading OpenART and providing technical development to Court Country City<br />Acuity Unlimited are working with us on OpenART.<br />
    23. 23. Doing what?<br />designing and exposing linked open [meta]data for an important research dataset entitled "The London Art World 1660-1735", created as part of the AHRC funded Court, Country, City: British Art 1660 – 1735 project<br />
    24. 24. And again, doing what?<br />‘The London Art World 1660-1735’ is set of spreadsheets, the result of several years research into Sources (sale catalogues, newspaper adverts) of information about Sales, People, and Places (and the things being sold)from the period<br />The Digital Library is already helping put this information on the web.<br />OpenART is extending this to create ‘linked open data’.<br />
    25. 25. Snapshot from the Court Country City Web site<br />
    26. 26. OpenART - in brief<br />Take the spreadsheets<br />Anaylse them and model into an ontology<br />Decide how to create the linked data and how to expose it to the web<br />Experiment a bit<br />Test and validate the ontology<br />Create the data and expose<br />
    27. 27. Describing things with ontologies<br />an ontology is a standardized representation of knowledge as a set of concepts within a domain, and the relationships between those concepts. It can be used to reason about the entities within that domain, and may be used to describe the domain (wikipedia)<br />or, put another way “a way of describing things in a way other people/machines can understand” (me)<br />
    28. 28. There are many ontologies out there<br />Dublin Core is a simple ontology for describing web resources<br />Geonames describes places<br />For OpenART we have used two core event-based ontologies<br />Dolce Ultra-lite <br />Linked events (LODE)<br />and have extended these for the domain<br />
    29. 29. Example!<br />Much of the OpenART data set is about art sales, which are ‘events’<br />Linked Events Ontology has a class for ‘Event’<br />Which we have specialized to create an OpenART class for ‘Sale’<br />DUL has a class for ‘Social Agent’ (or person)<br />Which is a superclass for our term ‘Organiser’<br />
    30. 30. Lost yet?<br />What this means is that we can describe our dataset for a very narrow domain (people researching the art market)<br />But it can be generalised and understood much more widely<br />Because our narrow classes are members of broader, more general classes used in standard ontologies<br />
    31. 31. Example (again)!<br />Linked Events Ontology has a class for ‘Event’<br />Which we have specialized to create an OpenART class for ‘Sale’<br />Someone only looking for ‘art sales’ in 1670 can answer their research question<br />Someone looking for ‘events’ in 1670 could infer from our ‘Sale’ class that these sales are also events<br />
    32. 32.
    33. 33. OpenART output<br />Unique identifiers for the primary entities (Sale, Source, Person, Place, and Artwork)<br />RDF/XML (and probably ttl) documents representing each primary entity<br />RDF/A embedded in the CCC web site<br />Owl:sameas statements for personal names and VIAF<br />
    34. 34. Lessons learnt<br />Ontology development is hard work and needs sustaining and a community<br />Linked data is simple in principle, but in application needs expertise and commitment<br />The original data is semi-structured and this needs clean-up to fit the precision of the ontology<br />
    35. 35. Final thoughts<br />I believe there is a lot that could be done with open and linked data, by institutions<br />… not least efficiencies, adding-value to local information, information sharing and resource discovery … <br />Contact Julie Allinson julie.allinson@york.ac.uk<br />http://tinyurl.com/dlib-openart -- http://yorkdl.wordpress.com/<br />(thanks to Adrian Stevenson for some of the slides; <br />cover image: http://www.flickr.com/photos/bostonjal/5280851979/)<br />
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×