This document provides an overview of social science data and digital resources available through the British Library. It summarizes the British Library's datasets strategy, key resources like the Economic and Social Data Service (ESDS) and census data, spatial data sources, open data initiatives, the UK Web Archive, and tools for working with data. The goal is to help researchers discover, access, reuse, reference, and track the impact of datasets.
Mobile access to educational resources in humanities and social sciences - Jasmin Klindžić, Nadja Soldatic, Kemal Kacapor, Maja Perkovic
Presented at Moodlemoot Edinburgh 2014
www.moodlemoot.ie
9 July 2013; Presented by Joan Starr and Carly Strasser. Description: EZID makes it simple for researchers and others to obtain and manage long-term identifiers (DOIs and ARKs) for their digital content. EZID is a great tool for data management, and researchers can build EZID and identifiers into their data management plans. In this free summer webinar, we cover: The advantages of EZID and identifiers for data management; How to configure the DMPTool to point to your library's EZID services; How to use the DMPTool as a ready source of contact information for your outreach
This presentation was part of a workshop of IEDA (http://www.iedadata.org) at the AGU (American Geophysical Union) Fall Meeting 2013 in San Francisco that was intended as an introduction to the topic of data publication.
Now we are six: Integrating Edinburgh DataShare into local and internet in...Robin Rice
#iassist40 presentation, Toronto, 6/6/2014.
Abstract:
Edinburgh DataShare, an institutional data repository, is six years old. It was built as a demonstrator in DSpace by EDINA and Data Library and has been given new life by the University of Edinburgh’s Research Data Management initiative. Following testing by pilot users in various departments last year, DataShare is confirmed as a key RDM service. Since 2008 much external infrastructure has grown around data sharing, and software developers, publishers and librarians are creating new innovations around the sharing and re-use of data daily. How can DataShare be shaped to fit in to this ever-more-sophisticated environment? A number of ongoing developments are helping us integrate the repository in the global context. DataShare is being indexed in Thomson-Reuter’s Data Citation Index. We aspire to attain the Data Seal of Approval for DataShare, a badge that confers trustworthiness through peer review. It is listed in re3data.org and databib registries of data repositories. We offer via extension, peer review of datasets to our depositors by listing journals that publish ‘data papers’ such as F1000 Research. Locally, as Information Services builds new data services such as the Data Store, [private data] Vault and the [metadata-only] Register, we can focus DataShare on its named purpose.
Mobile access to educational resources in humanities and social sciences - Jasmin Klindžić, Nadja Soldatic, Kemal Kacapor, Maja Perkovic
Presented at Moodlemoot Edinburgh 2014
www.moodlemoot.ie
9 July 2013; Presented by Joan Starr and Carly Strasser. Description: EZID makes it simple for researchers and others to obtain and manage long-term identifiers (DOIs and ARKs) for their digital content. EZID is a great tool for data management, and researchers can build EZID and identifiers into their data management plans. In this free summer webinar, we cover: The advantages of EZID and identifiers for data management; How to configure the DMPTool to point to your library's EZID services; How to use the DMPTool as a ready source of contact information for your outreach
This presentation was part of a workshop of IEDA (http://www.iedadata.org) at the AGU (American Geophysical Union) Fall Meeting 2013 in San Francisco that was intended as an introduction to the topic of data publication.
Now we are six: Integrating Edinburgh DataShare into local and internet in...Robin Rice
#iassist40 presentation, Toronto, 6/6/2014.
Abstract:
Edinburgh DataShare, an institutional data repository, is six years old. It was built as a demonstrator in DSpace by EDINA and Data Library and has been given new life by the University of Edinburgh’s Research Data Management initiative. Following testing by pilot users in various departments last year, DataShare is confirmed as a key RDM service. Since 2008 much external infrastructure has grown around data sharing, and software developers, publishers and librarians are creating new innovations around the sharing and re-use of data daily. How can DataShare be shaped to fit in to this ever-more-sophisticated environment? A number of ongoing developments are helping us integrate the repository in the global context. DataShare is being indexed in Thomson-Reuter’s Data Citation Index. We aspire to attain the Data Seal of Approval for DataShare, a badge that confers trustworthiness through peer review. It is listed in re3data.org and databib registries of data repositories. We offer via extension, peer review of datasets to our depositors by listing journals that publish ‘data papers’ such as F1000 Research. Locally, as Information Services builds new data services such as the Data Store, [private data] Vault and the [metadata-only] Register, we can focus DataShare on its named purpose.
How best to explore recent and historic censuses of Canada. Presentation given at the 2014 Data Liberation Initiative National Training.
Authors: Alex Guindon and Susan Mowers
Towards Dispersal or Congregation? The Spatial Distribution of Smaller Cultur...UKDSCensus
Census output confirms that England and Wales is the home to a large number of ethnic and religion-based groups; these groups might collectively be referred to as cultural groups. Many of the larger of these groups have been the subject of much research; however, this presentation focuses on five smaller groups – Arabs, Buddhists, Chinese, Jews, and Sikhs – each of which constitutes only about half of one per cent of the England and Wales population. Most of these groups have individually received limited attention and this is the first study to consider them in parallel, making use of data from the 2011 census. The primary focus of the paper is the spatial distribution of the groups, set in the context of a brief examination of socio-economic differences. In addition to presenting information in terms of traditional distribution indices, a graphic representation of the degree of concentration of the groups is presented. The work also specifically considers whether Jews, as the longest established of these groups in Britain, act as pathfinders for the future spatial distribution of other cultural groups. The results are used to challenge traditional perspectives of dispersal and spatial assimilation of migrant groups; positing that the benefits of congregation will prevent ultimate dispersion.
Digitization in Support of Services @ Smithsonian Libraries (May)Martin Kalfatovic
Digitization in Support of Services @ Smithsonian Libraries. Martin R. Kalfatovic. Smithsonian Libraries Advisory Board Orientation. Washington, DC. 5 May 2014.
Since modern science began, data have been a critical part of the scientific enterprise, not only for conducting science but also for communicating and validating scientific results. From the beginning, it was clear that for the scientific community to continually verify scientific results, the underlying data had to be made accessible. But that has not been, and is still not, always the case. In recent years however, public data repositories have grown significantly, making many research data sets easily accessible to others. The Dataverse project, an open-source software for building repositories to share research data (such as the Harvard Dataverse), has played an important role in making this happen, by giving incentives to researchers to share their own data. In this talk, I will discuss how we got here, and introduce current projects that extend Dataverse to address the next challenges in sharing research data. In particular, I'll present a project that, through integrating Dataverse with remote computing sites, makes large-scale structural biology data widely accessible and helps validate previous results.
Freckle Report for May 2021 from the Freckle ProjectEveryLibrary
Please visit https://www.everylibraryinstitute.org/freckle_project_surveys_reports for more information about the Freckle Project and to find links to other presentations and datasets.
How best to explore recent and historic censuses of Canada. Presentation given at the 2014 Data Liberation Initiative National Training.
Authors: Alex Guindon and Susan Mowers
Towards Dispersal or Congregation? The Spatial Distribution of Smaller Cultur...UKDSCensus
Census output confirms that England and Wales is the home to a large number of ethnic and religion-based groups; these groups might collectively be referred to as cultural groups. Many of the larger of these groups have been the subject of much research; however, this presentation focuses on five smaller groups – Arabs, Buddhists, Chinese, Jews, and Sikhs – each of which constitutes only about half of one per cent of the England and Wales population. Most of these groups have individually received limited attention and this is the first study to consider them in parallel, making use of data from the 2011 census. The primary focus of the paper is the spatial distribution of the groups, set in the context of a brief examination of socio-economic differences. In addition to presenting information in terms of traditional distribution indices, a graphic representation of the degree of concentration of the groups is presented. The work also specifically considers whether Jews, as the longest established of these groups in Britain, act as pathfinders for the future spatial distribution of other cultural groups. The results are used to challenge traditional perspectives of dispersal and spatial assimilation of migrant groups; positing that the benefits of congregation will prevent ultimate dispersion.
Digitization in Support of Services @ Smithsonian Libraries (May)Martin Kalfatovic
Digitization in Support of Services @ Smithsonian Libraries. Martin R. Kalfatovic. Smithsonian Libraries Advisory Board Orientation. Washington, DC. 5 May 2014.
Since modern science began, data have been a critical part of the scientific enterprise, not only for conducting science but also for communicating and validating scientific results. From the beginning, it was clear that for the scientific community to continually verify scientific results, the underlying data had to be made accessible. But that has not been, and is still not, always the case. In recent years however, public data repositories have grown significantly, making many research data sets easily accessible to others. The Dataverse project, an open-source software for building repositories to share research data (such as the Harvard Dataverse), has played an important role in making this happen, by giving incentives to researchers to share their own data. In this talk, I will discuss how we got here, and introduce current projects that extend Dataverse to address the next challenges in sharing research data. In particular, I'll present a project that, through integrating Dataverse with remote computing sites, makes large-scale structural biology data widely accessible and helps validate previous results.
Freckle Report for May 2021 from the Freckle ProjectEveryLibrary
Please visit https://www.everylibraryinstitute.org/freckle_project_surveys_reports for more information about the Freckle Project and to find links to other presentations and datasets.
British Library Social Science National Postgraduate Training Day - Datasets ...
BL Doctoral Open Days Feb 2012 - Social Science Data and Digital Resources
1. Social Science Data and Digital
Resources
February 2012
John Kaye – Lead Curator Digital Social Science
http://www.slideshare.net/johnkayebl
2. Overview
• British Library Datasets Strategy
• ESDS
• Census Resources
• Spatial Data
• Open Data
• UK Web Archive
• Other Data and Resources
• Tools, Software and Visualisation
• Identifying, Citing and Sharing Data
2
3. What is a dataset?
Seismic measurements taken by a
geologist.
Genetic data collected by a medical
researcher.
A survey of public opinions collected
by a sociologist.
3
4. The Foundation for Research
Data is a crucial component of the scholarly record.
Re-acquisition may be impossible
Datasets are essential to the British Library’s mission
to advance the World’s knowledge.
4
5. The British Library Datasets Strategy
We envision a future where researchers can:
Discover, access, reuse, and reference
datasets.
Track the impact of the data that they
generate and receive appropriate credit.
Our approach is to:
Provide a focus for the community to
establish needs, requirements and
agreement.
Explore novel technology and creative
solutions.
5
9. Economic and Social Data Service - (ESDS) www.esds.ac.uk
Data search and download
Research method guides
Thematic guides
Online analysis
Secure Data Service http://securedata.data-archive.ac.uk/
UK Data Service
9
10. Economic and Social Data Service - (ESDS) www.esds.ac.uk
ESDS Government
large-scale government surveys, such as the Labour Force Survey and
the General Household Survey
ESDS International
multi-nation databanks, such as World Bank's World Development
Indicators, and survey data including Eurobarometer
ESDS Longitudinal
major UK surveys following individuals over time, such as the British
Household Panel Survey
ESDS Qualidata
a range of multimedia qualitative data sources
10
11. 2011 Census
Data available on www.ons.gov.uk -
latest release is output area key statistics
Academic releases will be made
available via Census Dissemination Unit
via their InFuse tool
http://cdu.mimas.ac.uk/
Lots of information and support at
http://census.ac.uk/
Experian Geodemographic Data
http://cdu.mimas.ac.uk/experian/index.htm
11
12. Previous Censuses
Data available for 1981-2011 on http://www.nomisweb.co.uk/
Academic data release from 1971 to 2001 on casweb (also
contains geographic boundary data) http://casweb.mimas.ac.uk/
Histpop – The Online Historical Reports Collection (OHPR)
provides online access to population reports for Britain and
Ireland from 1801 to 1937 http://www.histpop.org/
Look at changes between census questions, structures and
geographies
12
15. BL Official Publications Collection – Census Reports
1851 Population Pyramid
UK Census Reports 85+
80-84
BL holds statistical reports relating 75-79
70-74
65-69
to each census. 60-64
55-59
Age Group
50-54
45-49 Male
Reports for 1921-1991 in the 40-44
35-39
30-34
Female
reading room on open shelves
25-29
20-24
15-19
10-14
National and county aggregate
5-9
0-4
1,500,000 1,000,000 500,000 0 500,000 1,000,000 1,500,000
reports for England and Wales, Num ber of People
Scotland, Northern Ireland and
Great Britain Number of Households Lacking or sharing Amenities (England and
Wales)
Aggregate statistical information 3,500,000
at each level for all census 3,000,000
2,500,000
questions 2,000,000
Compliments Histpop which has 1,500,000
1,000,000
digitised reports between 1801 – 500,000
1937 and Casweb: 1971 – 2001 0
1951 (Lack or 1961 (Lack or 1971 (Lack or 1981 (Lack or 1991 (Lack or 2001 (Without
Share Flushing Share Flushing Share Inside Share Inside Share Inside Sole use of
Some older reports can be found Toilet) Toilet) Toilet) Toilet) Toilet and/or Toilet and/or
Bath or Shower) Bath or Shower)
in parliamentary papers
15
16. Maps
The library holds a number of maps generated with census and
population data from UK and all over the world
Ireland map for railways
Augustus Petermann, Map of the British Isles, elucidating the
distribution of the population based on the 1841 census.
London,1861. 16
17. Spatial Data
Edina Digimap and UK
Borders
http://edina.ac.uk/digimap/
http://edina.ac.uk/ukborders/
Go Geo! Search
http://www.gogeo.ac.uk/cgi-bin/index.cgi
17
18. Spatial Data
Landmap
http://landmap.mimas.ac.uk/
Ordanance Survey Open Data
http://www.ordnancesurvey.co.uk/oswebsite/products/os-opendata.htm
18
19. UK Government Open Data
http://data.gov.uk/
Admin and Statistical data portal
Office for National Statistics
http://www.statistics.gov.uk/default.asp
http://www.neighbourhood.statistics.gov.uk/dissemination/
https://www.nomisweb.co.uk/Default.asp
National Digital Archive of Datasets
http://www.ndad.nationalarchives.gov.uk/
Regional
http://data.london.gov.uk/
http://datagm.org.uk/
19
20. International open data
United Nations
http://data.un.org/
European Union
http://epp.eurostat.ec.europa.eu/portal/page/portal/eurostat/home/
OECD
http://www.oecd.org/statsportal/
World Bank
http://data.worldbank.org/
IMF
http://www.imf.org/external/data.htm
Public Data EU
http://publicdata.eu/
20
21. UK Web Archive http://www.webarchive.org.uk
Selective Web Archive
over 11,000 websites collected since
2004
over 50,000 instances
Over 16TB of compressed data
British Library, National Library
of Wales, JISC
Also National Library of Scotland,
the National Archives, Wellcome
Library
Many collaborators
eg Women’s Library, Live Arts
Development Agency, Quakers in
Britain
21
22. A typical event-based special collection
Collect, preserve, and
make accessible
eb sites of
cultural and scholarly
importance
from the UK domain
23. A comprehensive special collection
Collect, preserve, and
make accessible
eb sites of
cultural and scholarly
importance
from the UK domain
24. JISC UK Web Domain Dataset (1996-2010)
Funded by JISC to create a research collection of UK
websites
Collaboration between the Internet Archive, JISC and the
British Library
Copy of subset of the Internet Archive’s web collection that
relates to the UK
470466 files, mostly arc.gz, with 4494 warc.gz.
Total size: 32TB
No local access – possible through the Internet Archive
Can be used to generate secondary datasets and make
these available
Analytical access the main route
24
25. Other Data and Resources
Arts and Humanities data Service (AHDS)
http://ahds.ac.uk
Guardian Data Store
http://www.guardian.co.uk/data-store
Financial Times
http://www.ft.com/home/uk
Economist Intelligence Unit
http://www.eiu.com/Default.aspx
UK Government Web Archive
http://www.nationalarchives.gov.uk/webarchive/
25
26. Other Data and Resources
The Mass Observation Archive
Specialises in material about everyday life
in Britain. It contains papers generated by
the original Mass Observation social
research organisation (1937 to early
1950s), and newer material collected
continuously since 1981
http://www.massobs.org.uk/index.htm
A Vision of Britain through Time
Contains historical Maps, Census Reports,
Election reports and other historical
material, searchable by local area.
http://www.visionofbritain.org.uk/
Charles Booth Online Archive
Gives access to archive material from the
Booth collections ofthe London School of
Economics and Political Science and the
Senate House Library
http://booth.lse.ac.uk/
Images from The Mass Observation Archive
26
28. Examples of Online Analysis Tools
ESDS NESSTAR
http://nesstar.esds.ac.uk
ESDS Spatial Tools
http://www.ccsr.ac.uk/esds/gis/
Economists Online Dataverse
http://dvn.iq.harvard.edu/dvn/dv/NEEO
United Nations
http://data.un.org/Explorer.aspx
London Profiler
http://www.londonprofiler.org/
London Heat Map
http://www.londonheatmap.org.uk/Mapping/
28
29. Online Mapping Tools using Google Maps
MapTube
http://www.maptube.org/
Google Drive
https://drive.google.com
Gmap Creator
http://www.casa.ucl.ac.uk/software/gmapcreator.asp
Other, more advanced online mapping
(requires coding):
Open Layers http://openlayers.org/
OS Openspace
http://www.ordnancesurvey.co.uk/oswebsite/web-services/os-openspace/index.html
29
30. Data Visualization
Presenting data in a useful
and interesting manner
Allowing concepts to be
easily understood
Lots of examples online
e.g:
http://flowingdata.com/
http://datavisualization.ch/
http://www.guardian.co.uk/n
ews/datablog
30
32. DataCite
DataCite is an international consortium which
aims to:
Establish easier access to research data on
the Internet
Increase acceptance of research data as
legitimate, citable contributions to the scholarly
record
Support data archiving that will permit results
to be verified and re-purposed for future study
http://datacite.org/
32
33. Connecting an Article with the Underlying Data
URLs are not persistent
(e.g. Wren JD: URL decay in
MEDLINE- a 4-year follow-up
study. Bioinformatics. 2008, Jun
1;24(11):1381-5).
Digital Object Identifiers (DOIs)
offer a solution
Mostly widely used identifier for
scientific articles Dataset
Researchers, authors, publishers Yancheva et al (2007). Analyses
know how to use them on sediment of Lake Maar.
Put datasets on the same playing PANGAEA.
field as articles doi:10.1594/PANGAEA.587840
33
35. Open Researcher and Contributor ID (ORCID)
http://about.orcid.org/
•Infrastructure is being created for researchers to build up an open
portfolio of research objects
36. Open Researcher and Contributor ID (ORCID)
•Register an ORCID ID www.orcid.org and link published papers
using ORCID’s tools
37. Sharing Data - Figshare
•Non published outputs (working papers, datasets) can be deposited
in figshare http://figshare.com/ given a DataCite DOI and linked back
and added to ORCID profile
38. Impact of Data
•View the impact of your work using traditional citation metrics and
social citations http://www.impactstory.org/
39. Depositing and Archiving Data
Why Archive?
Institutional Repositories
UK Data Archive/ESDS
Metadata and Code!
39
40. BL Social Science Research Blog
http://britishlibrary.typepad.co.uk/socialscience/
40
41. Contact Details
John Kaye
Lead Curator – Digital Social Science
Socials Sciences The British Library
96 Euston Road London
NW1 2DB
Telephone: 020 7412 7450
Email: john.kaye@bl.uk
Twitter: @johnkayebl
http://britishlibrary.typepad.co.uk/socialscience/
Slides - http://www.slideshare.net/johnkayebl
41
Editor's Notes
I am here to tell you about the datasets programme across the BL and in the Social Sciences team, There have been rapid changes in the digital landscape which has led to people generating and sharing ever increasing volumes of data. We refer to collections of data as datasets. While the nature of datasets varies across disciplines, researchers within each discipline typically agree on what constitutes a dataset for them. Examples of datasets include (1) example of volcanic data (2) cluster of chromosomes inside a breast cancer cell (3) uk poll of voting intention (blue cons, red labour, yellow liberal) Within the Dataset Programme, we consider a dataset to be an organised collection of digital objects that is produced or consumed during research. We emphasise the role that the dataset plays in the research activity, its importance to researchers, its impact, and its potential for reuse. Despite the differing nature of datasets, many of the services required by researchers are shared, such as methods of citation, discovery, and preservation.
So why focus on datasets? Data is the foundation for research It is an essential component of the scientific record. Time-consuming, costly to produce. Re-acquisition may be impossible. Therefore essential that it is preserved and shared.
As a result of these challenges In Dec 2009 the BL produced a Datasets Strategy. This strategy has been transformed into a programme of work in a number of departments in the Library and a number of significant projects. The datasets programme has been established to explore how the Library can help… Not only do we want to ensure data is preserved, we envision a future where… Our approach is to foster collaboration and…
Example Project 3 – Resource Discovery The BL is developing improved discovery services by deploying the Primo system from Ex Libris. We are investigating ways of including datasets alongside of other catalogue material such as articles and monographs. Now you can see how this works together with DataCite. There is a link next to the dataset that shows that you can get it as an on-line resource. This link uses the DataCite DOI. If we follow it, the DOI system takes us directly to the dataset. The same mechanism is also being used to link to articles and datasets in Elsevier's ScienceDirect and Thompson-Reuters Web of Science.
In social sciences we haven ’t assigned any datacite DOIs yet, hopefully that is coming soon, but we are using the Primo system in our new projects. Dataset resources will be included in the release of the management and business studies portal. As you can see from this search for flexible working datset results from UKDA are displayed alongside articles We have tested the search functionality out with users and have had some good feedback that we are currently incorporating before the launch. A resource guide for the MBS datsets will also be published on the portal at launch.
The types of data linked to include Data from ESDS/UKDA, UK government data, regional and local government data, international organisations etc.
Page
Search by URL, title, full-text Browse by
Footer text here...
Footer text here...
Page
Example Project 1 – DataCite Our long term vision is to support researchers by providing methods for them to locate, identify, and cite research datasets with confidence.
How does datacite work? The approach that DataCite is taking – using DOIs - has some important social benefits. Researchers, authors, publishers are comfortable, understand, and know how to use them. They put datasets on a level playing field with articles.
We have recently launched our social sciences team blog focusing on research methods and resources, it has posts from our curators about projects we are working on, but we are also keen to hear what members of the research community are doing so we gladly accept guest posts and contributions, so if any of you would like a place to talk about your work then please get in touch.