BL Social Sciences Post Graduate Training Day - Datasets


Published on

Published in: Education, Technology
  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide
  • I am here to tell you about the datasets programme across the BL and in the Social Sciences team, There have been rapid changes in the digital landscape which has led to people generating and sharing ever increasing volumes of data. We refer to collections of data as datasets. While the nature of datasets varies across disciplines, researchers within each discipline typically agree on what constitutes a dataset for them. Examples of datasets include (1) example of volcanic data (2) cluster of chromosomes inside a breast cancer cell (3) uk poll of voting intention (blue cons, red labour, yellow liberal) Within the Dataset Programme, we consider a dataset to be an organised collection of digital objects that is produced or consumed during research. We emphasise the role that the dataset plays in the research activity, its importance to researchers, its impact, and its potential for reuse. Despite the differing nature of datasets, many of the services required by researchers are shared, such as methods of citation, discovery, and preservation.
  • So why focus on datasets? Data is the foundation for research It is an essential component of the scientific record. Time-consuming, costly to produce. Re-acquisition may be impossible. Therefore essential that it is preserved and shared.
  • As a result of these challenges In Dec 2009 the BL produced a Datasets Strategy. This strategy has been transformed into a programme of work in a number of departments in the Library and a number of significant projects. The datasets programme has been established to explore how the Library can help… Not only do we want to ensure data is preserved, we envision a future where… Our approach is to foster collaboration and…
  • Example Project 3 – Resource Discovery The BL is developing improved discovery services by deploying the Primo system from Ex Libris. We are investigating ways of including datasets alongside of other catalogue material such as articles and monographs. Now you can see how this works together with DataCite. There is a link next to the dataset that shows that you can get it as an on-line resource. This link uses the DataCite DOI. If we follow it, the DOI system takes us directly to the dataset. The same mechanism is also being used to link to articles and datasets in Elsevier's ScienceDirect and Thompson-Reuters Web of Science.
  • In social sciences we haven’t assigned any datacite DOIs yet, hopefully that is coming soon, but we are using the Primo system in our new projects. Dataset resources will be included in the release of the management and business studies portal. As you can see from this search for flexible working datset results from UKDA are displayed alongside articles We have tested the search functionality out with users and have had some good feedback that we are currently incorporating before the launch. A resource guide for the MBS datsets will also be published on the portal at launch.
  • The types of data linked to include Data from ESDS/UKDA, UK government data, regional and local government data, international organisations etc.
  • Example Project 1 – DataCite Our long term vision is to support researchers by providing methods for them to locate, identify, and cite research datasets with confidence.
  • How does datacite work? The approach that DataCite is taking – using DOIs - has some important social benefits. Researchers, authors, publishers are comfortable, understand, and know how to use them. They put datasets on a level playing field with articles.
  • BL Social Sciences Post Graduate Training Day - Datasets

    1. 1. Social Science Datasets November 2011 John Kaye – Social Sciences Dataset Lead http://
    2. 2. What is a dataset? <ul><li>Seismic measurements taken by a geologist. </li></ul><ul><li>Genetic data collected by a medical researcher. </li></ul><ul><li>A survey of public opinions collected by a sociologist. </li></ul>
    3. 3. The Foundation for Research <ul><li>Data is a crucial component of the scholarly record. </li></ul><ul><li>Re-acquisition may be impossible </li></ul><ul><li>Datasets are essential to the British Library’s mission to advance the World’s knowledge. </li></ul>
    4. 4. The British Library Datasets Programme <ul><li>We envision a future where researchers can: </li></ul><ul><li>Discover, access, reuse, and reference datasets. </li></ul><ul><li>Track the impact of the data that they generate and receive appropriate credit. </li></ul><ul><li>Our approach is to: </li></ul><ul><li>Provide a focus for the community to establish needs, requirements and agreement. </li></ul><ul><li>Explore novel technology and creative solutions. </li></ul>
    5. 5. Datasets in Explore The British Library
    6. 6. Explore The British Library (Portals)
    7. 7. Explore The British Library (Portals)
    8. 8. British Library Resource Guides <ul><li>Topical Bibliographies and Dataset Resource Guides are currently in production </li></ul><ul><li> </li></ul><ul><li>Quantitative methods in social research </li></ul><ul><li>Management and Business Studies </li></ul><ul><li>Coming soon </li></ul><ul><ul><ul><li>Free GIS resource guide </li></ul></ul></ul><ul><ul><ul><li>Local Area Statistics </li></ul></ul></ul><ul><ul><ul><li>UK government data </li></ul></ul></ul><ul><ul><ul><li>Sport and Society </li></ul></ul></ul>
    9. 9. Economic and Social Data Service - (ESDS) <ul><li>Data search and download </li></ul><ul><li>Research method guides </li></ul><ul><li>Thematic guides </li></ul><ul><li>Online analysis </li></ul>
    10. 10. Economic and Social Data Service - (ESDS) <ul><li>http:// / </li></ul><ul><li>ESDS Government </li></ul><ul><ul><ul><li>large-scale government surveys, such as the Labour Force Survey and the General Household Survey </li></ul></ul></ul><ul><ul><li>ESDS International </li></ul></ul><ul><ul><ul><li>multi-nation databanks, such as World Bank's World Development Indicators, and survey data including Eurobarometer </li></ul></ul></ul><ul><ul><li>ESDS Longitudinal </li></ul></ul><ul><ul><ul><li>major UK surveys following individuals over time, such as the British Household Panel Survey </li></ul></ul></ul><ul><ul><li>ESDS Qualidata </li></ul></ul><ul><ul><ul><li>a range of multimedia qualitative data sources </li></ul></ul></ul>
    11. 11. Other Sources of Data – EDiNA - Spatial Data <ul><li>Go Geo! Search </li></ul><ul><li> </li></ul><ul><li>Edina Digimap and UK Borders </li></ul><ul><ul><li> </li></ul></ul><ul><ul><li> </li></ul></ul>
    12. 12. Other Sources of Data – Other Spatial Data <ul><li>Ordanance Survey Open Data </li></ul><ul><li> </li></ul><ul><ul><li>Landmap </li></ul></ul><ul><ul><li>http:// / </li></ul></ul>
    13. 13. Census Dissemination Unit <ul><li> </li></ul><ul><li>1971-2001 Census statistics - </li></ul><ul><li>Experian Geodemographic Data </li></ul><ul><li>Infuse 2001 Census analysis tool </li></ul><ul><li>Geoconvert – Postcode Data </li></ul>
    14. 14. UK Government Open Data <ul><li> </li></ul><ul><li>Admin and Statistical data portal </li></ul><ul><li>Office for National Statistics </li></ul><ul><li>http:// </li></ul><ul><li>http:// /dissemination/ </li></ul><ul><li>https:// </li></ul><ul><li>National Digital Archive of Datasets </li></ul><ul><li>http:// / </li></ul><ul><li>Regional Government </li></ul><ul><li>http:// / </li></ul>
    15. 15. Other Sources of Data – International Organisations <ul><li>United Nations </li></ul><ul><li> </li></ul><ul><li>European Union </li></ul><ul><li> </li></ul><ul><li>OECD </li></ul><ul><li> </li></ul><ul><li>World Bank </li></ul><ul><li> </li></ul><ul><li>IMF </li></ul><ul><li>http:// </li></ul>
    16. 16. Examples of Other Sources of Data <ul><li>Arts and Humanities data Service (AHDS) </li></ul><ul><li> </li></ul><ul><li>Guardian Data Store </li></ul><ul><li> </li></ul><ul><li>Financial Times </li></ul><ul><li> </li></ul><ul><li>Economist Intelligence Unit </li></ul><ul><li> </li></ul><ul><li>Web Archive </li></ul><ul><li> </li></ul>
    17. 17. Analysis Tools and Software <ul><li>Statistical - SPSS, SATA, R (open source) </li></ul><ul><li>GIS - ArcGIS, MapInfo, Quantum GIS (open source) </li></ul><ul><li>Excel </li></ul><ul><li>Online Tools </li></ul>
    18. 18. Examples of Online Analysis Tools <ul><li>ESDS NESSTAR </li></ul><ul><li> </li></ul><ul><li>ESDS Spatial Tools </li></ul><ul><li>http:// / </li></ul><ul><li>Economists Online Dataverse </li></ul><ul><li> </li></ul><ul><li>United Nations </li></ul><ul><li> </li></ul><ul><li>London Profiler </li></ul><ul><li> </li></ul><ul><li>London Heat Map </li></ul><ul><li> </li></ul>
    19. 19. Online Mapping Tools using Google Maps <ul><li>MapTube </li></ul><ul><li> </li></ul><ul><li>Google Fusion Tables </li></ul><ul><li> </li></ul><ul><li>Gmap Creator </li></ul><ul><li>http:// </li></ul>
    20. 20. Data Visualization <ul><li>Presenting data in a useful and interesting manner </li></ul><ul><li>Allowing concepts to be easily understood </li></ul><ul><li>Lots of examples online e.g: </li></ul><ul><li> </li></ul><ul><li> </li></ul><ul><li> </li></ul>
    21. 21. Citing Data
    22. 22. DataCite <ul><li>DataCite is an international consortium which aims to: </li></ul><ul><li>Establish easier access to research data on the Internet </li></ul><ul><li>Increase acceptance of research data as legitimate, citable contributions to the scholarly record </li></ul><ul><li>Support data archiving that will permit results to be verified and re-purposed for future study </li></ul><ul><li> </li></ul>
    23. 23. Connecting an Article with the Underlying Data <ul><li>Digital Object Identifiers (DOIs) offer a solution </li></ul><ul><li>Mostly widely used identifier for scientific articles </li></ul><ul><li>Researchers, authors, publishers know how to use them </li></ul><ul><li>Put datasets on the same playing field as articles </li></ul><ul><ul><li>Dataset </li></ul></ul><ul><ul><li>Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA. </li></ul></ul><ul><ul><li>doi:10.1594/PANGAEA.587840 </li></ul></ul><ul><li>URLs are not persistent </li></ul><ul><li>(e.g. Wren JD: URL decay in MEDLINE- a 4-year follow-up study . Bioinformatics. 2008, Jun 1;24(11):1381-5). </li></ul> 
    24. 24. Depositing and Archiving Data <ul><li>Why Archive? </li></ul><ul><li>Institutional Repositories </li></ul><ul><li>UK Data Archive/ESDS </li></ul><ul><li>Metadata and Code! </li></ul>
    25. 25. John Kaye Lead Curator – Datasets Socials Sciences The British Library 96 Euston Road London NW1 2DB   Telephone: 020 7412 7450 Email: [email_address] Twitter: @johnkayebl [email_address] Slides -