Presentation for the OCLC Linked Data Roundtable event for IFLA Helsinki 2012. Covers the reasoning behind the BL's linked open data version of the British National Bibliography, the processes needed to create the service and challenges to be addressed.
1. Making The Connection
Linked Open Data at the British Library
Neil Wilson
Head, Metadata Services
neil.wilson@bl.uk
2. British Library
Metadata Services
The BL has a long history of
supplying metadata to:
Increase visibility of BL
holdings & connect users to content
Contribute to collaborative
cataloguing initiatives
Support free and priced
bibliographic services
2
3. Changing Environment
The Web - fostered expectation
information should be open to all
Data has new value - from
open technologies enabling rapid
connection & reuse
‘Open + Data’ drives strategies
for :
Transparency
Economic growth
Research & discovery
4. Changing Environment
Public Sector Open Data Rating
★ Available on Web + open
licence
★★ + machine-readable
★★★ + non-proprietary format
★★★★ + W3C open standards
★★★★★ + link to others data
5. Linked Data
Potential Benefits for Libraries
Improved web
integration of resources -
increasing visibility & reaching
new users
A global pool of
reusable data - for libraries
to add unique value
New library leadership
opportunities - due to
persistence, stability & authority
5
6. Changing Environment
Library Data
Users were once lone
scholars but now include:
Collaborative Learners
Data miners
Machines
Libraries must respond to:
Stimulate innovation
Enable knowledge creation
Increase relevance & utility
7. British Library Open Metadata Strategy
Aims
Migrate - from library
formats to cross domain
standards
Collaborate - with
community on new services
advancing debate from theory
to practice
7
8. British Library Open Metadata Strategy
Aims
Remove barriers - enabling
innovation without unnecessary
restrictions
Achieve recognition -
while using licensing appropriate
for widest re-use
8
9. Open Service Objectives
Target services for:
Traditional libraries
(MARC21/Z39.50)
Researchers - using new
metadata processing
techniques (RDF/XML)
Linked data developers
(RDF Triples/SPARQL)
9
10. Linked Open Data
Preliminaries
To concentrate effort on our
data we identified:
The best licensing
option for our
objectives (CC0)
A proven hosting
platform
External sources of
expert comment
10
11. Linked Open Data
Objectives
A staff & organisational
development opportunity using:
In-house staff
Existing tools & technologies
Library MARC21 data
Established linked data
resources
11
12. Linked Open Data
Why BNB?
General bibliography - not a
unique institutional catalogue
Consistent format - over 60
years
Size & range of content - 3
million records on all subjects in
many languages
12
13. Linked Open Data
Why BNB?
Underlined commitment
by offering:
Large, popular dataset -
not niche sample
Reusable metadata - not
unique resource
Foundational service &
model - for others to build on
13
14. Linked Open Data
MARC Migration
Started simply - improving with
developing staff expertise
Used expert training in:
Linked data concepts
RDF modelling
Presentation options
Used the opportunity to re-
evaluate & innovate - blending
the best of old & new
14
15. Linked Open Data
Creating Links
Techniques:
Automatic generation from
record data
Auto text match with linked
data dumps
Crosswalk matching for
coded data
15
16. Linked Open Data
Linked Site Selection
To embed data in a wider
context we linked:
General resources:
GeoNames
Lexvo
RDF Book
Mashup
Plus library resources:
LCSH
VIAF
Dewey.info
16
17. MARC21 to RDF XML Conversion Workflow
• • Selection
• Selection
Selection
• • Pre-processing
• Pre-processing
Pre-processing
• • Character conversion
• Character setset conversion
Character set conversion
• • URI generation
• URI generation
URI generation
• Datatransformation
Data transformation
• Create & load triples
MARC to RDF conversion
Consists of multiple automated steps
17
18. Access Options
• thedatahub.org/dataset/bluk-bnb-basic
• thedatahub.org/dataset/bluk-bnb
BNB Books 1950-2012
2.8 Million Records
85 Million Unique Triples
.
• bnb.data.bl.uk/sparql
• bnb.data.bl.uk/describe
• bnb.data.bl.uk/search
18
19. Linked Open Data
A Change in Perspective
From self contained records for resources
To linking triple statements about resources
With ‘records’ assembled from statements
19
20. Linked Open Data
Achievements
Presence & visibility
New library data model -
being utilised by wider groups
New opportunities for
collaboration - with public &
private sector organisations
Confirmation that valuable
data will be used – e.g. up to 8
million monthly transactions
20
21. Linked Open Data
Challenges
How to:
Capture service value & gain
attribution for effort
Ensure expectations are Free
met - both Government & wider Offering
community
License derived 3rd party
data - to support open re-supply
Ensure effective service
maintenance - & persistence
21
22. Library Linked Data Wish List?
We Need More…
Tools to link library data to
other resources
LMS integration of linked
data options
Navigation & visualisation
applications
Feedback on usage
Collaboration on shared
approaches
22
23. Linked Open BNB
http://www.bl.uk/bibliographic/datafree.html
Next Steps:
Regular updates
More documentation &
refinement of model
Extend to cover further
material & linked
resources
23
Images from