SlideShare a Scribd company logo
Adventures in Linked Data Land:
bringing RDF to the Wordsworth
Trust


Richard Light




CT Linked Data Meeting, 22 February 2010
Discovering Linked Data
Four principles of Linked Data (Tim B-L):

●   Use URIs to identify resources

●   Use HTTP URIs so that people can look them up

●   Provide useful information about the resource

●   Include links to other URIs in your data
Discovering dbPedia
●   Extraction of Linked Data from Wikipedia
●   Statements in info boxes (mainly) become RDF
    triples:
<rdf:Description
  rdf:about="http://dbpedia.org/resource/Ber
  lin_Marathon">
<dbpprop:location
  rdf:resource="http://dbpedia.org/resource/
  Berlin"/>
</rdf:Description>

Note the URLs
Browsing Linked Data

●   View RDF as a web page:
    http://dbpedia.org/page/Berlin

●   Navigate from one data source to another

●   Specialist Linked Data browsers/plugins:
    –   DISCO
    –   Marbles
    –   Openlink Data Explorer
    –   Tabulator
dbPedia page for Berlin
OpenLink Data Explorer: What
OpenLink Data Explorer: Where
Querying Linked Data

●   SPARQL query language:
    http://www.w3.org/TR/2008/REC-rdf-sparql-query-
      20080115/

●   And SPARQL XML results format:
    http://www.w3.org/TR/rdf-sparql-XMLres/

●   “SPARQL end-points”:
    http://dbpedia.org/sparql
    http://dbtune.org/bbc/peel/sparql
    http://data.linkedmdb.org/sparql
dbPedia SPARQL endpoint page
Asking interesting questions

●   German musicians born in Berlin:
●
So what do we have here?

●   An initiative to generate lots of Linked Data

●   A Linked Data Cloud, containing a growing
    number of RDF datasets

●   A hard-to-use query language capable of very
    precise and powerful querying

Where do museums come into this picture?
The Wordsworth Trust

●   Typical museum collection: about 60,000 objects

●   Major collection of manuscripts (notebooks,
    letters, etc.)

●   Objects published to the Web from a ModesXML
    database

●   Unwise enough to allow me Remote Desktop
    access ...
Typical collections object

GRMDC.C104.2
Same object represented as RDF
Same object represented as XTM
One identifier; three “views”
●   This object has a single persistent identifier:
    http://collections.wordsworth.org.uk/object/GRMDC.C104.2

●   This maps to different views depending on the
    “Accept” header in the HTTP request:

    –   application/rdf+xml >> RDF
    –   application/xtm+xml >> XTM Topic Map
    –   Otherwise >> HTML (human-readable)

●   Achieved through a custom 404 “page not found”
    handler
“Page not found” handler (1)

●   All URLs are fictitious, so they generate a 404

●   Modified a generic smart 404 handler from:
    http://evolvedcode.net/content/code_smart404/

●   Added support for “303 See other” redirects

●   added wild card matching to re-format URLs
“Page not found” handler (2)

●   Generic URL, plus requested Accept format,
    determine initial “303 See other” mapping, e.g.:
    http://collections.wordsworth.org.uk/object/GRMDC.C104.2
    +
    Accept: application/rdf+xml
    =
    http://collections.wordsworth.org.uk/object/rdf/GRMDC.C104.2

●   When this is passed back in, the 404 handler has to
    generate the required RDF directly

●   Can't just keep redirecting requests!
“Page not found” handler (3)

●   Redirect rules declare mappings:
“Page not found” handler (4)

●   Generic URL plus a supported Accept type
    generates a “303 See other” redirect

●   If it comes back as a page request, it is further
    redirected with a “301 Moved permanently” to the
    object's web page

●   If it comes back as an RDF or XTM request, the
    record is fetched as XML and subjected to an
    XSLT transform by the handler
Does it work? - yes, sort of
Data Explorer place view
Implementation details

●   HTML needed a “back link” to RDF to keep
    OpenLink Explorer happy:
    <link rel="alternate" type="application/rdf+xml"
      href="http://collections.wordsworth.org.uk/object/data/GRMDC
      .C104.2" title="RDF" />

●   Result is totally unfindable: need a search or
    harvesting mechanism:
     – OAI support (possible)
     – SPARQL end-point (harder)
What has been learnt? (1)
●   The Linked Data paradigm encourages simple
    RDF triples: no “blank nodes”

●   For an object, this becomes a simple metadata set,
    very analogous to the PNDS DCAP format

●   The properties involved need to encapsulate the
    whole relation between object and data, e.g.
    <p:title>Ulswater from Pooley Bridge</p:title>
    <p:technique>drawn</p:technique>
    <p:maker>Farington, Joseph (1747-1821)</p:maker>
    <p:technique>engraved</p:technique>
    <p:maker>Middiman, Samuel (1750-1831)</p:maker>
What has been learnt? (2)

●   Data in linked resources can “add value” to your
    own recording efforts (e.g. place data)
Properties: which framework?

●   I have used dbPedia properties (for compatibility
    with other Linked Data resources … ?):
    http://dbpedia.org/property/title
    http://dbpedia.org/property/maker

●   A viable alternative would be PNDS DCAP:
    http://purl.org/dc/elements/1.1/title
    http://purl.org/dc/elements/1.1/creator

●   One framework which doesn't fit is the CIDOC
    CRM:
    E21 Physical Thing – E12 Production – E39 Actor = “creator”
Do we need “museum” properties?

●   DbPedia properties are not coherent

●   Need something richer than simple metadata

●   Could use CIDOC CRM as basis

●   Existing interchange formats such as LIDO could
    be re-expressed in RDF

●   Could broaden scope: “history” property set?
The problem of URIs

●   Good Linked Data requires URIs everywhere

●   Most of my museum RDF resolves to strings

●   One exception is Geonames lookup:
        Ullswater
    becomes
        http://www.geonames.org/2635191/

●   In the absence of a central “people” registry,
    should be minting URIs myself for people, etc.
Conclusions
●   Implementing an RDF Linked Data front-end to a
    museum database is feasible if:
    –   You can generate multiple outputs from your database
        (XML is sufficient)
    –   You can implement a suitable URL rewriter or 404
        handler

●   It's easy (and a good idea) to mint and publish
    URIs for your collection objects

●   It's less clear where all the other URIs we'll need
    will come from
Challenges for museum linked data

●   Agreeing an ontology to enable cross-collection
    [SPARQL] queries

●   Shared URLs for in-common concepts: people,
    places, events

●   Mechanisms for getting URLs into museum data

●   Getting existing authorities, e.g. AAT, to be
    available as Linked Data
Ask Multimap where Lancaster is
Thank you!




Richard Light
richard@light.demon.co.uk

More Related Content

What's hot

Enabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MMEnabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MM
Thomas Kurz
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertier
Flagis VZW
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Sergio Fernández
 
JSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki contentJSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki content
Michele Mostarda
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
Ravindra Ranwala
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis Platform
Leigh Dodds
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
Jesse Wang
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
Thomas Kurz
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
david-read
 
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
Amazon Web Services
 
Drupal and Apache Stanbol
Drupal and Apache StanbolDrupal and Apache Stanbol
Drupal and Apache Stanbol
Alkuvoima
 
RDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an updateRDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an update
Andy Powell
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Fabrizio Orlandi
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
Ivan Ermilov
 
Linked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache StanbolLinked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache Stanbol
Gabriel Dragomir
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
giurca
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
National Institute of Informatics
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021
Fabrizio Orlandi
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Sergio Fernández
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platform
Maurizio Napolitano
 

What's hot (20)

Enabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MMEnabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MM
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertier
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
JSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki contentJSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki content
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis Platform
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
 
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
 
Drupal and Apache Stanbol
Drupal and Apache StanbolDrupal and Apache Stanbol
Drupal and Apache Stanbol
 
RDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an updateRDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an update
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Linked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache StanbolLinked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache Stanbol
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platform
 

Viewers also liked

Reducing Unauthorized Deductions
Reducing Unauthorized DeductionsReducing Unauthorized Deductions
Reducing Unauthorized Deductions
Credit Management Association
 
Jugandoalhuevopodrido
JugandoalhuevopodridoJugandoalhuevopodrido
Jugandoalhuevopodrido
mirna_zoff
 
Opening The Ivory Tower
Opening The Ivory TowerOpening The Ivory Tower
Opening The Ivory Tower
Antonio Vantaggiato
 
Nuevas tecnologías en las escuels
Nuevas tecnologías en las escuelsNuevas tecnologías en las escuels
Nuevas tecnologías en las escuels
Antonio Vantaggiato
 
es fascinante
es fascinantees fascinante
es fascinante
gazulla
 
DistanceEducation
DistanceEducationDistanceEducation
DistanceEducation
Antonio Vantaggiato
 
9th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 59th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 5
MRS.KDUNCAN
 
9th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 49th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 4
MRS.KDUNCAN
 
Non-invasive Glucose Monitor
Non-invasive Glucose MonitorNon-invasive Glucose Monitor
Non-invasive Glucose Monitor
Charlie Aylward
 
Moodle.2009
Moodle.2009Moodle.2009
Moodle.2009
Antonio Vantaggiato
 
Laser applications to medicine and biology
Laser applications to medicine and biologyLaser applications to medicine and biology
Laser applications to medicine and biology
Viorica Tonu
 

Viewers also liked (11)

Reducing Unauthorized Deductions
Reducing Unauthorized DeductionsReducing Unauthorized Deductions
Reducing Unauthorized Deductions
 
Jugandoalhuevopodrido
JugandoalhuevopodridoJugandoalhuevopodrido
Jugandoalhuevopodrido
 
Opening The Ivory Tower
Opening The Ivory TowerOpening The Ivory Tower
Opening The Ivory Tower
 
Nuevas tecnologías en las escuels
Nuevas tecnologías en las escuelsNuevas tecnologías en las escuels
Nuevas tecnologías en las escuels
 
es fascinante
es fascinantees fascinante
es fascinante
 
DistanceEducation
DistanceEducationDistanceEducation
DistanceEducation
 
9th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 59th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 5
 
9th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 49th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 4
 
Non-invasive Glucose Monitor
Non-invasive Glucose MonitorNon-invasive Glucose Monitor
Non-invasive Glucose Monitor
 
Moodle.2009
Moodle.2009Moodle.2009
Moodle.2009
 
Laser applications to medicine and biology
Laser applications to medicine and biologyLaser applications to medicine and biology
Laser applications to medicine and biology
 

Similar to Adventures in Linked Data Land (presentation by Richard Light)

Webofdata
WebofdataWebofdata
Webofdata
Bill Roberts
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
Dan Brickley
 
Linked Data
Linked DataLinked Data
Linked Data
cyriacsmail
 
Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)
Rensselaer Polytechnic Institute
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
Drupalcon Paris
 
RDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFaRDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFa
Platypus
 
How RDFa works
How RDFa worksHow RDFa works
How RDFa works
JISC Netskills
 
Madrid SPARQL handson
Madrid SPARQL handsonMadrid SPARQL handson
Madrid SPARQL handson
Victor de Boer
 
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Rensselaer Polytechnic Institute
 
SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
Rinke Hoekstra
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
Graphity
 
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataIFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
Lars G. Svensson
 
Lifting the Lid on Linked Data
Lifting the Lid on Linked DataLifting the Lid on Linked Data
Lifting the Lid on Linked Data
Jane Stevenson
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
webhostingguy
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
Laura Po
 
Madrid Building blocks of Linked Data
Madrid Building blocks of Linked DataMadrid Building blocks of Linked Data
Madrid Building blocks of Linked Data
Victor de Boer
 
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
Rensselaer Polytechnic Institute
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
Dimitris Kontokostas
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015
Cason Snow
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
Sören Auer
 

Similar to Adventures in Linked Data Land (presentation by Richard Light) (20)

Webofdata
WebofdataWebofdata
Webofdata
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
Linked Data
Linked DataLinked Data
Linked Data
 
Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
 
RDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFaRDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFa
 
How RDFa works
How RDFa worksHow RDFa works
How RDFa works
 
Madrid SPARQL handson
Madrid SPARQL handsonMadrid SPARQL handson
Madrid SPARQL handson
 
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
 
SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
 
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataIFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
 
Lifting the Lid on Linked Data
Lifting the Lid on Linked DataLifting the Lid on Linked Data
Lifting the Lid on Linked Data
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Madrid Building blocks of Linked Data
Madrid Building blocks of Linked DataMadrid Building blocks of Linked Data
Madrid Building blocks of Linked Data
 
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 

Recently uploaded

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Tatiana Kojar
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
flufftailshop
 

Recently uploaded (20)

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
 

Adventures in Linked Data Land (presentation by Richard Light)

  • 1. Adventures in Linked Data Land: bringing RDF to the Wordsworth Trust Richard Light CT Linked Data Meeting, 22 February 2010
  • 2. Discovering Linked Data Four principles of Linked Data (Tim B-L): ● Use URIs to identify resources ● Use HTTP URIs so that people can look them up ● Provide useful information about the resource ● Include links to other URIs in your data
  • 3. Discovering dbPedia ● Extraction of Linked Data from Wikipedia ● Statements in info boxes (mainly) become RDF triples: <rdf:Description rdf:about="http://dbpedia.org/resource/Ber lin_Marathon"> <dbpprop:location rdf:resource="http://dbpedia.org/resource/ Berlin"/> </rdf:Description> Note the URLs
  • 4. Browsing Linked Data ● View RDF as a web page: http://dbpedia.org/page/Berlin ● Navigate from one data source to another ● Specialist Linked Data browsers/plugins: – DISCO – Marbles – Openlink Data Explorer – Tabulator
  • 8. Querying Linked Data ● SPARQL query language: http://www.w3.org/TR/2008/REC-rdf-sparql-query- 20080115/ ● And SPARQL XML results format: http://www.w3.org/TR/rdf-sparql-XMLres/ ● “SPARQL end-points”: http://dbpedia.org/sparql http://dbtune.org/bbc/peel/sparql http://data.linkedmdb.org/sparql
  • 10. Asking interesting questions ● German musicians born in Berlin: ●
  • 11. So what do we have here? ● An initiative to generate lots of Linked Data ● A Linked Data Cloud, containing a growing number of RDF datasets ● A hard-to-use query language capable of very precise and powerful querying Where do museums come into this picture?
  • 12. The Wordsworth Trust ● Typical museum collection: about 60,000 objects ● Major collection of manuscripts (notebooks, letters, etc.) ● Objects published to the Web from a ModesXML database ● Unwise enough to allow me Remote Desktop access ...
  • 16. One identifier; three “views” ● This object has a single persistent identifier: http://collections.wordsworth.org.uk/object/GRMDC.C104.2 ● This maps to different views depending on the “Accept” header in the HTTP request: – application/rdf+xml >> RDF – application/xtm+xml >> XTM Topic Map – Otherwise >> HTML (human-readable) ● Achieved through a custom 404 “page not found” handler
  • 17. “Page not found” handler (1) ● All URLs are fictitious, so they generate a 404 ● Modified a generic smart 404 handler from: http://evolvedcode.net/content/code_smart404/ ● Added support for “303 See other” redirects ● added wild card matching to re-format URLs
  • 18. “Page not found” handler (2) ● Generic URL, plus requested Accept format, determine initial “303 See other” mapping, e.g.: http://collections.wordsworth.org.uk/object/GRMDC.C104.2 + Accept: application/rdf+xml = http://collections.wordsworth.org.uk/object/rdf/GRMDC.C104.2 ● When this is passed back in, the 404 handler has to generate the required RDF directly ● Can't just keep redirecting requests!
  • 19. “Page not found” handler (3) ● Redirect rules declare mappings:
  • 20. “Page not found” handler (4) ● Generic URL plus a supported Accept type generates a “303 See other” redirect ● If it comes back as a page request, it is further redirected with a “301 Moved permanently” to the object's web page ● If it comes back as an RDF or XTM request, the record is fetched as XML and subjected to an XSLT transform by the handler
  • 21. Does it work? - yes, sort of
  • 23. Implementation details ● HTML needed a “back link” to RDF to keep OpenLink Explorer happy: <link rel="alternate" type="application/rdf+xml" href="http://collections.wordsworth.org.uk/object/data/GRMDC .C104.2" title="RDF" /> ● Result is totally unfindable: need a search or harvesting mechanism: – OAI support (possible) – SPARQL end-point (harder)
  • 24. What has been learnt? (1) ● The Linked Data paradigm encourages simple RDF triples: no “blank nodes” ● For an object, this becomes a simple metadata set, very analogous to the PNDS DCAP format ● The properties involved need to encapsulate the whole relation between object and data, e.g. <p:title>Ulswater from Pooley Bridge</p:title> <p:technique>drawn</p:technique> <p:maker>Farington, Joseph (1747-1821)</p:maker> <p:technique>engraved</p:technique> <p:maker>Middiman, Samuel (1750-1831)</p:maker>
  • 25. What has been learnt? (2) ● Data in linked resources can “add value” to your own recording efforts (e.g. place data)
  • 26. Properties: which framework? ● I have used dbPedia properties (for compatibility with other Linked Data resources … ?): http://dbpedia.org/property/title http://dbpedia.org/property/maker ● A viable alternative would be PNDS DCAP: http://purl.org/dc/elements/1.1/title http://purl.org/dc/elements/1.1/creator ● One framework which doesn't fit is the CIDOC CRM: E21 Physical Thing – E12 Production – E39 Actor = “creator”
  • 27. Do we need “museum” properties? ● DbPedia properties are not coherent ● Need something richer than simple metadata ● Could use CIDOC CRM as basis ● Existing interchange formats such as LIDO could be re-expressed in RDF ● Could broaden scope: “history” property set?
  • 28. The problem of URIs ● Good Linked Data requires URIs everywhere ● Most of my museum RDF resolves to strings ● One exception is Geonames lookup: Ullswater becomes http://www.geonames.org/2635191/ ● In the absence of a central “people” registry, should be minting URIs myself for people, etc.
  • 29. Conclusions ● Implementing an RDF Linked Data front-end to a museum database is feasible if: – You can generate multiple outputs from your database (XML is sufficient) – You can implement a suitable URL rewriter or 404 handler ● It's easy (and a good idea) to mint and publish URIs for your collection objects ● It's less clear where all the other URIs we'll need will come from
  • 30. Challenges for museum linked data ● Agreeing an ontology to enable cross-collection [SPARQL] queries ● Shared URLs for in-common concepts: people, places, events ● Mechanisms for getting URLs into museum data ● Getting existing authorities, e.g. AAT, to be available as Linked Data
  • 31. Ask Multimap where Lancaster is