Exists for everyone who wants to do research – for academic, personal, and commercial purposes.
Covers all subject areas – sciences, technology, medicine, arts, humanities, social sciences…
Receives a copy of every item published in the UK.
Holds over 150 million items, with 3 million items added each year.
Used by over 16,000 people each day (on site and online).
The British Library: some facts and figures
Helping people advance knowledge to enrich lives
British Library Act 1972: National centre for reference, study, bibliographical and other information services, in relation both to scientific and technological matters, and to the humanities.
Science and Innovation Investment Framework 2004-2014, H.M. Treasury (2004): UK research base must have ready and efficient access to information of all kinds – such as experimental data sets, journals, theses, conference proceedings and patents. This is the life blood of research and innovation.
The largest document supply service in the world. Secure e-delivery and ‘just in time’ digitisation enable desktop delivery within 2 hours.
GIA Funding 08/09:
£94.8m operational, £12m capital
Other funding secured 07/08: c.£33m
National library of the UK. Serves researchers, business, libraries, education & the general public
Collection includes over 2m sound recordings, 5m reports, theses and conference papers, the world’s largest patents collection (c.50m)
3 main sites in London and Yorkshire. Circa 2,000 staff
Business and IP Centre: Providing inspiration, and enabling protection of creative capital and business development
Generates value to the UK economy each year of 4.4 times public funding
Collection fills over 600km of shelving and grows at 11km per year
70 TB of digital material through voluntary deposit
Projects and activities
www.bl.uk/datasets
Follow us on Twitter @datasetsBL
A Key Component for Many Goals
Persistent identification: make visible, find, access, track impact, verify, reuse, cite?
Citation using Digital Object Identifiers (DOIs)
G. Yancheva, N. R. Nowaczyk et al (2007)
Rock magnetism and X-ray fluorescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA
G. Yancheva, N. R. Nowaczyk et al (2007)
Influence of the intertropical convergence zone on the East Asian monsoon
Nature 445, 74-77
How to reference
Published Article (Abstract or full text)
The DOI system offers an easy, internet-actionable way to connect the article with the underlying publication.
But a complete scholarly record would also link to the evidential datasets and their location, e.g. PANGAEA.
doi:10.1038/nature05431
doi:10.1038/nature05431 leads to a landing page
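What makes a DOI "internet actionable" is that it can be turned into a web address through a resolver, which then redirects to the landing page. The following is a minimal sketch of that step, assuming the public resolver at https://doi.org/; the function name doi_to_url is hypothetical and chosen only for illustration.

```python
from urllib.parse import quote

# Assumption: the public DOI resolver; it redirects to the landing page.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn a DOI such as 'doi:10.1038/nature05431' into a resolver URL."""
    cleaned = doi.strip()
    # Drop the optional 'doi:' label used when DOIs are written in citations.
    if cleaned.lower().startswith("doi:"):
        cleaned = cleaned[4:].strip()
    # Every DOI begins with the directory indicator '10.'.
    if not cleaned.startswith("10."):
        raise ValueError(f"not a DOI: {doi!r}")
    # Percent-encode characters that are unsafe in a URL path, keeping '/'.
    return DOI_RESOLVER + quote(cleaned, safe="/")


if __name__ == "__main__":
    print(doi_to_url("doi:10.1038/nature05431"))
```

The same function works for any DOI, whether it identifies an article or a dataset, which is the property the slides rely on.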
Connecting an Article with the Underlying Data
Digital Object Identifiers (DOIs) offer a solution
Most widely used identifier for scientific articles
Researchers, authors, publishers know how to use them
Put datasets on the same playing field as articles
Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA.
URIs are commonly used but can decay
(e.g. Wren JD: URL decay in MEDLINE - a 4-year follow-up study. Bioinformatics. 2008 Jun 1;24(11):1381-5).
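To illustrate what citing a dataset like an article could look like in practice, here is a minimal sketch that builds a citation string from a handful of metadata fields and ends it with a DOI rather than a URL that may decay. The DatasetRecord type, the format_citation function and the layout of the string are assumptions made for illustration, and the DOI shown is a hypothetical placeholder, not the identifier of the PANGAEA dataset.

```python
from dataclasses import dataclass


@dataclass
class DatasetRecord:
    """Hypothetical minimal metadata needed to cite a dataset."""
    creators: str
    year: int
    title: str
    publisher: str
    doi: str


def format_citation(rec: DatasetRecord) -> str:
    """Build an article-style citation that ends with the dataset's DOI."""
    return (
        f"{rec.creators} ({rec.year}). {rec.title}. "
        f"{rec.publisher}. doi:{rec.doi}"
    )


if __name__ == "__main__":
    example = DatasetRecord(
        creators="Yancheva et al",
        year=2007,
        title="Analyses on sediment of Lake Maar",
        publisher="PANGAEA",
        doi="10.5555/example-dataset",  # placeholder, not a real dataset DOI
    )
    print(format_citation(example))
```

Keeping the identifier as a separate field means the same record can feed both a printed citation and a resolvable link.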
SageCite: Data citation in bioinformatics workflow
Sage Bionetworks data capture and analysis workflow (Taverna: myExperiment)
Data Citation service integration points and citation targets (e.g. data-models)
SageCite: Integration of data citation services into multi-contributor bioinformatics workflow. Establishing data attribution and credit mechanisms.
► INCENTIVE
Sage Bionetworks: Aggregating datasets from contributors to create massive coherent datasets that can be used for systems-level analysis of disease
Sustainability models for datasets supplementary to publication
Dryad UK: Define a business case and pilot service integrating DataCite DOIs and dataset archiving into publisher workflows
► SUSTAINABILITY
Leveraging the Dryad Consortium, which is addressing the acquisition and storage of long tail supplementary data
For more information on the BL Datasets Programme