Keynote presentation given by Mahendra Mahey at the Research Data Management in Digital Humanities International Conference, 17-18 April, 2018, Doha, UCL Qatar, room 1D02. Entitled: Working with the British Library’s Digital Collections & DataInsights from British Library Labs and an emerging role for Libraries (Keynote speech)
Building Better GLAM Labs - Opening talk at Museum Big Data Conference - UCL ...labsbl
Talk given on 30 April 2019, between 1500 - 1520 at the UCL Qatar Museum Big Data 1st Conference, UCL Qatar, given at the Auditorium, Qatar National Library.
BL Labs Presentation at Open Science Infrastructures for Big Cultural Datalabsbl
Presentation given in Plovdiv, 13 December 2018 by Mahendra Mahey from British LIbrary Labs.
Fostering Excellence in Scholarship with Big Cultural Heritage CollectionsInsights from British Library Labs
Mahendra Mahey, Manager of BL Labs
1630 - 1715, Thursday, 13th December 2018,
Fostering Excellence in Scholarship with Big CH Collections (in Humanities data and their research use session), Open Science Infrastructures for Big Cultural Data, International Advanced Masterclass,Fifth Floor Conference Room, Hotel Trimontium, Plovdiv, Bulgaria.
Presentation to the National Science Library of the Chinese Academy of Scienceslabsbl
1100 - 1300, Thursday, 26th April 2018,
British Library Labs and Digital Scholarship at the British Library, Harley Room, British Library, St Pancras, London.
Presentation to the National Science Library of the Chinese Academy of Sciences
by Mahendra Mahey Manager of BL Labs
The Work of British Library Labs and Digital ScholarshipInsights from British Library Labs and an emerging role for Libraries
Building Better GLAM Labs - Opening talk at Museum Big Data Conference - UCL ...labsbl
Talk given on 30 April 2019, between 1500 - 1520 at the UCL Qatar Museum Big Data 1st Conference, UCL Qatar, given at the Auditorium, Qatar National Library.
BL Labs Presentation at Open Science Infrastructures for Big Cultural Datalabsbl
Presentation given in Plovdiv, 13 December 2018 by Mahendra Mahey from British LIbrary Labs.
Fostering Excellence in Scholarship with Big Cultural Heritage CollectionsInsights from British Library Labs
Mahendra Mahey, Manager of BL Labs
1630 - 1715, Thursday, 13th December 2018,
Fostering Excellence in Scholarship with Big CH Collections (in Humanities data and their research use session), Open Science Infrastructures for Big Cultural Data, International Advanced Masterclass,Fifth Floor Conference Room, Hotel Trimontium, Plovdiv, Bulgaria.
Presentation to the National Science Library of the Chinese Academy of Scienceslabsbl
1100 - 1300, Thursday, 26th April 2018,
British Library Labs and Digital Scholarship at the British Library, Harley Room, British Library, St Pancras, London.
Presentation to the National Science Library of the Chinese Academy of Sciences
by Mahendra Mahey Manager of BL Labs
The Work of British Library Labs and Digital ScholarshipInsights from British Library Labs and an emerging role for Libraries
Building Better GLAM Labs - Keynote Presentation at Simon Fraser Universitylabsbl
Presentation given by Mahendra Mahey, Manager of BL Labs at Simon Fraser University between 1030 - 1200, Monday 25 February, 2019.
See: https://www.lib.sfu.ca/help/publish/dh/dhil/bl-labs
For more details.
A hands-on data exploration & challenge to become a derived data-set author o...labsbl
Mahendra Mahey, manager of British Library Labs (BL Labs) will examine some of the BL’s digital collections/data & discuss challenges he has had in making the BL's cultural heritage data available openly or onsite at the British Library.
Mahendra will invite delegates to explore data-sets at their leisure, setting a challenge for those who are interested, skilled in exploring, finding patterns and grouping data. They could become data-set authors/creators of derived data-sets, based on pre-existing digital collections/data provided on the day or already available on https://data.bl.uk.
The workshop will conclude with reflections from the delegates and possibly highlighting a number derived data-sets that were generated by participants on the day that could now potentially exist on https://data.bl.uk. If selected, these new derived data-sets will be attributed with the creators' / authors' details and each will have its own cite-able Digital Object Identifier (D.O.I). These new data-sets would then be available for reuse by any researcher in the world.
GUIDANCE FOR THIS WORKSHOP
We strongly recommend you come to this workshop with an appropriate device such as a laptop pre-installed with appropriate tools to analayse different kinds of data-sets, e.g. Microsoft Excel may work with smaller data-sets such as metadata (see other data exploration tools below). If you don't have one, and would still like to attend, please request to 'pair up' with someone who is willing to share and has already signed up.
Other data exploration tools include: Notepad++ (e.g. for viewing text and XML); Open Refine (e.g. for cleaning data); Tableau Public (e.g. for visualising data); Google Fusion Tables (e.g for visualising geo-spatial data); Spacy (e.g. for text and data mining), RStudio (an open source Statistical package), MATLAB (data analysis tool) & NLTK (Natural Language processing).
Please note that this workshop is NOT about training you in using any of these tools, just tools you may be already familiar with to explore and find patterns in our data.
Datatypes you may be examining in this workshop could include: .ZIP, .PDF, .TXT, .CSV, .TSV. .XLS, .XLSX, RDF, .nt, XML (TEI, ALTO and bespoke), .JSON, .JPG, .JPEG, .TIFF and .WARC
Please ensure you are able to read these files on your device before the workshop if you are interested in exploring them during our session.
Slides for session: http://goo.gl/
URL for specific data: http://
Mahendra Mahey tweets at @BL_Labs & @mahendra_mahey
Presentation given by Mahendra Mahey, Manager of BL Labs, 1400 - 1430, 2 July 2018
London Psychology Librarians Group Meeting
Dickins Room, Conference Centre,
British Library
Rethink research, illuminate history with the British LibraryMia
Join Dr Mia Ridge, Digital Curator for Western Heritage Collections at the British Library, to discover how research and technology can create a richer picture of our past. Living with Machines is a collaborative project between the Alan Turing Institute, universities and the British Library – home to the world’s most comprehensive research collection. Together, they are using data science and digital history methods to analyse millions of historical documents and understand the impact of mechanisation in the 19th century. Their initial approach has focused on specific regions like Yorkshire that will help tell us the story of industrialisation in Britain.
Since Wikipedia launched in 2001, librarians have maintained a cautious and, at times, hostile relationship with the online, crowd-sourced encyclopedia. Librarians have largely ignored Wikipedia, citing it as an unreliable and non-authoritative resource, and steering information seekers toward traditional reference materials. While librarians waged this quiet war, Wikipedia has gained increasing dominance as an information resource, and is now the indisputable starting point for most quick research. In this presentation, attendees will learn how to wield the power of Wikipedia in their libraries and embrace Wikipedia as an information resource. Presenters will discuss how to use Wikipedia for reference and instruction, linking online resources, increasing search engine optimization, and creating linked data for the semantic web. Presenters will also discuss the great need for librarians to delve into the world of Wikipedia as researchers and contributors; including the ethics of contributing to Wikipedia. Presenters: Dustin Fife, Rebekah Cummings, Jessica Breiman
Cross-sector collaboration for digital museum and library projectsMia
I provide some examples of cross-sector collaboration from the UK, and include some examples of different models for international collaboration. Invited presentation for the Chinese Association of Museums, Taipei, Taiwan, August 2017
Introduction to information visualisation for humanities PhDsMia
Training workshop for the CHASE Arts and Humanities in the Digital Age programme. (
This session will give you an overview of a variety of techniques and tools available for data visualisation and analysis in the humanities. You will learn about common types of visualisations and the role of exploratory and explanatory visualisations, explore examples of scholarly visualisations, try some visualisation tools, and know where to find further information about analysing and building data visualisations.
Envisioning Social Applications of Library Linked DataUldis Bojars
This talk discusses two streams of innovation on the Web--the Social Web and Linked Data--and explains how bringing them together can move library services to the 21st century.
The core of the presentation will look at a few of the envisioned social use cases for library linked data: Social Annotation, Peer-to-Peer Bookswapping and Social Recommendations.
The goal is to create interest in combining new technologies and to start a discussion about how to bring these and similar use cases to fruition.
Presented at the ELAG-2012 conference: http://www.elag2012.com/
Building Better GLAM Labs - Keynote Presentation at Simon Fraser Universitylabsbl
Presentation given by Mahendra Mahey, Manager of BL Labs at Simon Fraser University between 1030 - 1200, Monday 25 February, 2019.
See: https://www.lib.sfu.ca/help/publish/dh/dhil/bl-labs
For more details.
A hands-on data exploration & challenge to become a derived data-set author o...labsbl
Mahendra Mahey, manager of British Library Labs (BL Labs) will examine some of the BL’s digital collections/data & discuss challenges he has had in making the BL's cultural heritage data available openly or onsite at the British Library.
Mahendra will invite delegates to explore data-sets at their leisure, setting a challenge for those who are interested, skilled in exploring, finding patterns and grouping data. They could become data-set authors/creators of derived data-sets, based on pre-existing digital collections/data provided on the day or already available on https://data.bl.uk.
The workshop will conclude with reflections from the delegates and possibly highlighting a number derived data-sets that were generated by participants on the day that could now potentially exist on https://data.bl.uk. If selected, these new derived data-sets will be attributed with the creators' / authors' details and each will have its own cite-able Digital Object Identifier (D.O.I). These new data-sets would then be available for reuse by any researcher in the world.
GUIDANCE FOR THIS WORKSHOP
We strongly recommend you come to this workshop with an appropriate device such as a laptop pre-installed with appropriate tools to analayse different kinds of data-sets, e.g. Microsoft Excel may work with smaller data-sets such as metadata (see other data exploration tools below). If you don't have one, and would still like to attend, please request to 'pair up' with someone who is willing to share and has already signed up.
Other data exploration tools include: Notepad++ (e.g. for viewing text and XML); Open Refine (e.g. for cleaning data); Tableau Public (e.g. for visualising data); Google Fusion Tables (e.g for visualising geo-spatial data); Spacy (e.g. for text and data mining), RStudio (an open source Statistical package), MATLAB (data analysis tool) & NLTK (Natural Language processing).
Please note that this workshop is NOT about training you in using any of these tools, just tools you may be already familiar with to explore and find patterns in our data.
Datatypes you may be examining in this workshop could include: .ZIP, .PDF, .TXT, .CSV, .TSV. .XLS, .XLSX, RDF, .nt, XML (TEI, ALTO and bespoke), .JSON, .JPG, .JPEG, .TIFF and .WARC
Please ensure you are able to read these files on your device before the workshop if you are interested in exploring them during our session.
Slides for session: http://goo.gl/
URL for specific data: http://
Mahendra Mahey tweets at @BL_Labs & @mahendra_mahey
Presentation given by Mahendra Mahey, Manager of BL Labs, 1400 - 1430, 2 July 2018
London Psychology Librarians Group Meeting
Dickins Room, Conference Centre,
British Library
Rethink research, illuminate history with the British LibraryMia
Join Dr Mia Ridge, Digital Curator for Western Heritage Collections at the British Library, to discover how research and technology can create a richer picture of our past. Living with Machines is a collaborative project between the Alan Turing Institute, universities and the British Library – home to the world’s most comprehensive research collection. Together, they are using data science and digital history methods to analyse millions of historical documents and understand the impact of mechanisation in the 19th century. Their initial approach has focused on specific regions like Yorkshire that will help tell us the story of industrialisation in Britain.
Since Wikipedia launched in 2001, librarians have maintained a cautious and, at times, hostile relationship with the online, crowd-sourced encyclopedia. Librarians have largely ignored Wikipedia, citing it as an unreliable and non-authoritative resource, and steering information seekers toward traditional reference materials. While librarians waged this quiet war, Wikipedia has gained increasing dominance as an information resource, and is now the indisputable starting point for most quick research. In this presentation, attendees will learn how to wield the power of Wikipedia in their libraries and embrace Wikipedia as an information resource. Presenters will discuss how to use Wikipedia for reference and instruction, linking online resources, increasing search engine optimization, and creating linked data for the semantic web. Presenters will also discuss the great need for librarians to delve into the world of Wikipedia as researchers and contributors; including the ethics of contributing to Wikipedia. Presenters: Dustin Fife, Rebekah Cummings, Jessica Breiman
Cross-sector collaboration for digital museum and library projectsMia
I provide some examples of cross-sector collaboration from the UK, and include some examples of different models for international collaboration. Invited presentation for the Chinese Association of Museums, Taipei, Taiwan, August 2017
Introduction to information visualisation for humanities PhDsMia
Training workshop for the CHASE Arts and Humanities in the Digital Age programme. (
This session will give you an overview of a variety of techniques and tools available for data visualisation and analysis in the humanities. You will learn about common types of visualisations and the role of exploratory and explanatory visualisations, explore examples of scholarly visualisations, try some visualisation tools, and know where to find further information about analysing and building data visualisations.
Envisioning Social Applications of Library Linked DataUldis Bojars
This talk discusses two streams of innovation on the Web--the Social Web and Linked Data--and explains how bringing them together can move library services to the 21st century.
The core of the presentation will look at a few of the envisioned social use cases for library linked data: Social Annotation, Peer-to-Peer Bookswapping and Social Recommendations.
The goal is to create interest in combining new technologies and to start a discussion about how to bring these and similar use cases to fruition.
Presented at the ELAG-2012 conference: http://www.elag2012.com/
Digital Research Support by Stella Wisdom, 20th & 21st Century Collections, D...
Similar to Working with the British Library’s Digital Collections & Data - Insights from British Library Labs and an emerging role for Libraries (Keynote speech)
Building Better GLAM Labs - Keynote at University of Victoria, Victoria, BC, ...labsbl
Presentation given by Mahendra Mahey, Manager of BL Labs entitled 'Building Better GLAM Labs'.
Experiences and lessons learned from the British Library and around the world with Galleries, Libraries , Archives and Museums engaging with researchers, artists, educators and entrepreneurs who want to use digitised and born digital cultural heritage collections and data for innovative projects.
Mahendra Mahey, Manager of British Library, British Library, London, UK.
Wednesday 27 February 2019, 1330 – 1500 (Keynote)
Talk given on behalf of the British Columbia Research Libraries Group, in the McPherson Library/Mearns Centre for Learning, Digital Scholarship Commons, Room A308, University of Victoria, Victoria, British Columbia, Canada
British Library Labs Roadshow - Sussex Humanities Lablabsbl
Presentation given by Mahendra Mahey, Manager of British Library Labs on Friday 5th of May, at Sussex Humanities Lab, 2017 as part of the BL Labs Roadshow 2017
British Library Labs Roadshow 2017 at the University of Birminghamlabsbl
Presentation given by Mahendra Mahey, Manager of British Library Labs at the College of Arts and Law, the University of Birmingham on Wednesday 10th of May, 2017.
British Library Labs Presentation at Elpub 2014, June 20, 2014labsbl
Key note presentation given at ElPub2014, June 20 about the Digital Scholarship department and the work of the Digital Research Team and British Library Labs.
Experiences and lessons learned through British Library Labs How have we eng...labsbl
Presentation by Mahendra Mahey, Manager of BL Labs.
1100 - 1130, Thursday, 17th May 2018,Part of Plenary Session ‘Cultural Innovation: experiences from the field’,
CAMP iC4: A Breeding Ground for Useful Innovation,
BASE Milano, Via Bergognone, 34, Milan, Italy
Digital Magical Mystery Tour - British Librarylabsbl
Presentation given by Mahendra Mahey, BL Labs Manager about the British Library and it's digital collections and how they have been used by the public.
British Library Labs Presentation at the Accelerating Human Imagination Workshoplabsbl
Presentation given by Mahendra Mahey at the Accelerating Human Imagination Workshop at the University of Liverpool in London, 24-25 November 2016. Presentation given on Day 1, 24 November, Second Session Part II: Imagination and Speculative Cultures, 1445 - 1500
The Digital Research Suite at the British Library - The next step for BL Labslabsbl
Fifth British Library Labs (BL Labs) Symposium, Monday October 30, 2017.
12:05 – 12:15 The Digital Research Suite at the British Library - The next step for BL Labs
Dr Adam Farquhar
Adam will give a brief overview of the next stage of the Labs project.
Presentation given to visitors from the University of Sunderland on the 10th of February, 2014 about BL Labs at the British Library in the Panizzi Room.
Library labs as experimental incubators for digital humanities researchSally Chambers
This presentation was delivered as one of the keynotes at the 23rd International Conference on Theory and Practice of Digital Libraries (TPDL 2019) on 9-12 September 2019 at OsloMet - Oslo Metropolitan University, Oslo, Norway. http://www.tpdl.eu/tpdl2019/keynotes/
Can we consider libraries as the laboratories of the humanities? If so, would they be good places to observe and better understand the everyday practices of the humanist at work? Similarly, can the notion of the laboratory as a place of scientific experimentation be applied to libraries as a place to experiment with digital cultural heritage collections? Could “library labs” enable humanities researchers, cultural heritage professionals and computer scientists to work more closely together to push the boundaries of contemporary humanistic enquiry? Using Bruno Latour’s anthropological observations of the scientific practices of biologists in their laboratory as a starting point, we will consider the concept of libraries as the laboratories of the humanities. Extending this concept further, we will consider, “what is a library lab?” by examining the activities of library labs internationally. Finally, we will introduce the emerging Digital Research Lab at the Royal Library of Belgium (KBR) as part of a long-term collaboration with the Ghent Centre for Digital Humanities (GhentCDH). Using “KBR Labs” as a case study, we will consider the role that library labs could play as experimental incubators for digital humanities research.
Similar to Working with the British Library’s Digital Collections & Data - Insights from British Library Labs and an emerging role for Libraries (Keynote speech) (20)
7th BL Labs Symposium (2019): 12_Digital Research team projects updatelabsbl
Neil Fitzgerald, Head of Digital Research, British Library
--
Highlights of some innovative recent and current projects in the Digital Research team at the British Library.
Mahendra Mahey, BL Labs Manager, British Library
--
This Award recognises an artistic or creative endeavour that has used the Library’s digital content to inspire, amaze and provoke.
Maja Maricevic, Head of Higher Education and Science, British Library
--
This Award recognises a current member of staff, or team, who has played a key role in an innovative project using the Library’s digital content or data.
7th BL Labs Symposium (2019): 08_An update on the ‘Living with machines’ projectlabsbl
Mia Ridge, Digital Curator and Co-Investigator for Living with machines, British Library
The 'Living with machines' project is a collaboration between the British Library and the Alan Turing Institute for Data Science and Artificial Intelligence.
7th BL Labs Symposium (2019): 06_An overview of digital preservation at the B...labsbl
Maureen Pennock, Head of Digital Preservation, British Library
An overview of the challenges of preserving an ever-growing and complex set of digital collections and a presentation of the work of the Flashback project.
7th BL Labs Symposium (2019): 05_The Research Awardlabsbl
James Perkins, Research & Postgraduate Development Manager, British Library
This Award recognises a project or activity which demonstrates the development of new knowledge, research methods or tools, using the Library’s digital content.
7th BL Labs Symposium (2019): 04_The story of the GLAM Labs community and how...labsbl
Sophie-Carolin Wagner, Project Manager, Austrian National Library Labs, Austrian National Library
A report on the work to develop a global community of Galleries, Libraries, Archives and Museums (GLAM) Labs and the creation of a handbook for professionals wanting to set up, maintain and ensure digital innovation Labs thrive in their organisations.
Mahendra Mahey, BL Labs Manager, British Library
This Award celebrates quality learning experiences created for learners of any age and ability that use the Library's digital content.
Introduction to BL Labs and Reading 35,000 Books: The UCD Contagion Project ...labsbl
Presentation given by Mahendra Mahey at the Reading 35,000 Books: The UCD Contagion
Project and the British Library Digital Corpus event on 20 February 2019
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
The Art Pastor's Guide to Sabbath | Steve ThomasonSteve Thomason
What is the purpose of the Sabbath Law in the Torah. It is interesting to compare how the context of the law shifts from Exodus to Deuteronomy. Who gets to rest, and why?
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Basic phrases for greeting and assisting costumers
Working with the British Library’s Digital Collections & Data - Insights from British Library Labs and an emerging role for Libraries (Keynote speech)
1. 1@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
1000 - 1100, Tuesday, 17th
April 2018,
Research Data Management in Digital Humanities (RDM in DH) International Conference,
UCL Qatar, Room 1 D02, Georgetown University Building,
Education City, Qatar Foundation, Doha, Qatar.
Working with the British Library’s Digital Collections & Data
Insights from British Library Labs and an emerging role for Libraries
(Keynote speech)
mahendra.mahey@bl.uk
Mahendra Mahey, Manager of British Library Labs (BL Labs)
2. 2@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
http://www.bl.uk/projects/british-library-labs
Funded by the Andrew W. Mellon Foundation & British Library
Running since March 2013
Core Team
•Adam Farquhar (Principal Investigator) (0.1)
•Mahendra Mahey (Manager) (Full Time)
•Ben O’Steen (Technical Lead) (Full Time)
•Eleanor Cooper (Project Officer) (0.5)
3. 3@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
http://www.bl.uk/projects/british-library-labs
Funded by the Andrew W. Mellon Foundation & British Library
Running since March 2013
Core Team
•Adam Farquhar (Principal Investigator) (0.1)
•Mahendra Mahey (Manager) (Full Time)
•Ben O’Steen (Technical Lead) (Full Time)
•Eleanor Cooper (Project Officer) (0.5)
4. 4@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Wider engagement…not just
Digital Humanities / Scholarship Researchers
Researchers
https://goo.gl/WutNyi Artists
http://goo.gl/nNKhQ2
Librarians
Curators
https://goo.gl/9NWZUW
Software Developers
https://goo.gl/7QQ5Tf
Archivists
https://goo.gl/x7b4tg
Educators
https://goo.gl/qh01Mi
Working and Communicating
Inspirational
examples
Experiences
Challenges
Lessons Learned
Entrepreneurs
https://goo.gl/Fx8RG7
5. 5@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
The British Library
Inside the British Library
Space for 1200 readers, around 500,000 visitors per year
Building 37 uses low oxygen and robots
Reading room and delivery to London
Many items stored at Document Supply and Storage centre 48 hours away
Stockton-on-Tees
Author right to payment each time their books
are borrowed from public libraries.
St Pancras, London, UK
Many books are stored 4 stories below the building
UK Legal Deposit Library – Reference only
Founded in 1973 though origins stem back to British Museum Library 1753
Boston-Spa
6. 6@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Living Knowledge Vision (2015 – 2023)
Custodianship Research Business
Culture Learning International
To make our intellectual heritage accessible to everyone,
for research, inspiration and enjoyment and be the most open, creative
and innovative institution of its kind by 2023 (50 year anniversary).
Document:http://goo.gl/h41wW7 Speech:https://goo.gl/Py9uHK
Roly Keating (Chief Executive Officer of the British Library)
To make our intellectual heritage accessible to everyone,
for research, inspiration and enjoyment and be the most open, creative
and innovative institution of its kind by 2023 (50 year anniversary).
7. 7@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Physical collections – not just books!
> 180*million items
> 0.8* m serial titles
> 8* m stamps
> 14* m books
> 6* m sound recordings
> 4* m maps
> 1.6* m musical scores
> 0.3* m manuscripts
> 60* m patents
King’s Library *Estimates
9. 9@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
/
Knowledge Quarter London
80 knowledge organisations (as of 14/04/18) within 1 mile radius of
Kings Cross, http://www.knowledgequarter.london
Born digital
10. 10@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
#bldigital
1-2 %* digitised
* estimate
Digitisation
Partnerships
Commercial & Other Organisations
Amount
increasing rapidly
e.g. Heritage Made Digital
Bias in digitisation
http://goo.gl/bR9UJL
Sample Generator
11. 11@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
The Story of the Digital Collection…
Digital
Collection
Curator
Who paid for the digitisation?
Who did the digitisation?
Technology used
Born digital?
Published
Unpublished
Where is it?
Can it still be accessed?
Generates income
Reputational risk in using?
Legalities /
Ethics / Morality
Politics when digitised
Personalities involved
Surprises (e.g. gaps)
Descriptive information
Old format not supported
What media was the
digitisation done from?
Is there any background documentation?
No Descriptive information
Inconsistent descriptive information
Still there?
Good to know the background ‘story’ of a Digital Collection
if you want to use it for research and make conclusions…
13. 13@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Openly Licensed Digital Content?
15% Openly
Licensed
Around 80%*
available online
Working through to make more open through
Access and Re-use committee which meets once a month…
Though some collections will always only be available onsite due to
various reasons including legal, ethical etc.
Breakdown by collection*
Manuscripts 59%
Books 9%
Maps and Views 7%
Newspapers 3%
Archives and Records 3%
Paintings, Prints and Drawings 2%
*Based on number of digitisation projects (702 as of 17/04/18)
Largest proportion of funding
Public / Private Partnership
15 %* Openly Licensed – most online
85 %* Available onsite only at the moment
*Estimates
18. 18@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Why are doing this? (1)
We support research it’s our job!
We want to work closely with and
listening to those who want use
our digital collections and data
for their work!
https://goo.gl/esqpRb
19. 19@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
We can learn how we are and should be supporting you and
this therefore shapes the problems we work on, such as:
https://goo.gl/esqpRb
Why are doing this? (2)
• Access to digital collections / data?
• Advice, guidance, technical support,
training
• Services, Tools and Processes?
• Many more reasons…
20. 20@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Where are the gaps between what you want & what we can
give?
How do we build the bridges to overcome the gaps?
Why are doing this? (3)
https://goo.gl/6CwCeE
21. 21@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
How do we help you ‘navigate’ their way through the
‘maze’ (sometimes) of the
Library to what they want to do?
Sometimes requires understanding the culture of the organisation
https://goo.gl/62JnQT
Why are doing this? (4)
22. 22@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Have you got X?
https://upload.wikimedia.org/wikipedia/commons/5/50/Real_wuerzburg.jpg
Looking for Physical Content in the British Library
23. 23@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Have you got X digitised / in digital form?
http://www.yorkmix.com/wp-content/uploads/2014/04/mr-simms-sweet-shoppe-york.jpg
Looking for Digitised / Digital Content in the BL
24. 24@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
• The Library has to go out to meet researchers, regularly and
cyclically to tell them what we have and learn what they
want to do
• Debunk ‘myths’ about the Library
• Show / tell researchers about the reality of our data
• Researcher’s ideas always change once they explore the
data!
https://goo.gl/esqpRb
Lots of two-way communication!
BL Labs runs annual ‘
Roadshows’ around the UK and the World
25. 25@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
https://goo.gl/qpCLlk
https://goo.gl/wMTS3Z
• Dialogue typically:
– you are ‘lucky’ & we have the digital content
/ data relevant to your research
– we don’t have exactly what your looking for,
but is there anything of interest? Let’s talk…
– engagement can be hard work and it’s
constantly required to maintain interest in our
digital collections!
• We also tend to attract researchers with ‘fuzzier’
research boundaries and possibly open to more
interdisciplinary / collaborative research
• Artists find this dialogue easier…
What engagement does the BL have with
researchers wanting use our digital content?
26. 26@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Our Audience and You
Audience
research &
Digital
interests
Digital
collections
you have
This is where Labs works
It starts with a conversation!
Only a small amount of content is digitised!
Might not be the treasure expected at the end of a digital journey!
28. 28@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Phase 1: Exploration
Allows a researcher to:
– Understand the data in open-ended fashion.
– Discover potential tools to work with the data.
– Gain awareness of their capabilities and limitations.
– Develop a firmer research query.
– Gauge the costs, resources, risks and time needed.
•Outputs of the exploration are not intended to be shareable,
beyond personal experience and key features (data size, formats, tool
successes, etc.).
29. 29@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Phase 2: Query-Focussed
• A firmer and more informed query by the researcher where:
– Suitable datasets already lined up
– There is a good idea of the initial toolset and capabilities (human
and computer) required
– The project output is outlined, and relevant reuse applications are
begun.
– Clear agreements on what happens at the end of the project – data
deletion, virtual machine deletion/archiving/etc.
– Project may iterate on initial ideas,depending on researcher’s
cost/risk appetite
Submit idea
for support
30. 30@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Phase 3: Wrap-up
• Wrap-up
– Work (code, notes) exported and given to researcher
– All derivative data is licenced or retained based on reuse
agreements (Access & Reuse board, etc.)
– Provisions made for the project are wound-down, as agreed
(derivative data deleted after a grace period, etc.)
31. 31@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Playbills, Books, Newspapers
(includes Optical Character Recognition (OCR))
Digital collections and Datasets
British National
Bibliography
http://bnb.data.bl.uk
http://sounds.bl.ukhttp://dml.city.ac.uk/
Music (Recordings & Sheet) & Sounds
http://goo.gl/frSMJt
Broadcast News (TV and Radio)
http://goo.gl/cwThHw
http://goo.gl/pBkisZhttp://goo.gl/E8aRyQ
Usage data
EtHOS
Web ArchiveImages, Manuscripts & Maps
http://www.qdl.qa/
Qatar Digital Library
http://idp.bl.uk/
International
Dunhuang
Project
Maps
http://www.bl.uk/maps/
Hebrew Manuscripts
http://goo.gl/4sbCp9
Flickr &
Wikimedia Commons
https://goo.gl/LZRmaZ
32. 32@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Finding Open Cultural Heritage Datasets
http://abs.bl.uk/Digital+Collections
Collection Guides (199 as of 17/04/2018)
https://www.bl.uk/collection-guides/
Datasets about our collections
Bibliographic datasets relating to our published and archival
holdings
Datasets for content mining
Content suitable for use in text and data mining research
Datasets for image analysis
Image collections suitable for large-scale image-analysis-
based research
Datasets from UK Web Archive
Data and API services available for accessing UK Web
Archive
Digital mapping
Geospatial data, cartographic applications, digital aerial
photography and scanned historic map materials
https://data.bl.uk
Download collections as zips, no API
Each dataset has a Digital Object Identifier (DOI)
can be referenced for research
Not all discoverable via
search engines!
33. 33@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Messiness in historical data
• 'Begun in Kiryu, Japan, finished in France'
• 'Bali? Java? Mexico?'
• Variations on USA:
– U.S.
– U.S.A
– U.S.A.
– USA
– United States of America
– USA ?
– United States (case)
• Inconsistency in uncertainty
– U.S.A. or England
– U.S.A./England ?
– England & U.S.A.
34. 34@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Open Refine
http://openrefine.org/
http://freeyourmetadata.org/cleanup/
offers useful advice to cleaning up data
35. 35@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Characterising your data
http://blogs.bl.uk/digital-scholarship/2013/09/data-exploration-through-visualisation.html
36. 36@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Data / Digital Curation / Data Librarian
Digitisation
Collecting
Born Digital
Data
Management
Data
Curation
Data
Characterisatio
n
37. 37@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Many researchers have the domain knowledge but lack
technical / digital skills to use Digital Research
methods.
Should they be teamed up with those that want to solve
problems or get trained?
Digital skills training needed for Humanities
researchers/ Librarians…
https://goo.gl/i5GVfI
https://goo.gl/kwcK8Jhttps://software-carpentry.org/
https://librarycarpentry.github.io/
http://www.datacarpentry.org/
38. 38@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
British Library Data Projects
http://dx.doi.org/10.15123/PUB.4307
https://goo.gl/xCM9A7
https://www.datacite.org/
https://odin-project.eu/
https://project-thor.eu/
39. 39@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Data Strategy (2017)
• Data Management
• Data Creation
• Data Archiving and Preservation
• Data Access, Discovery & Reuse
http://blogs.bl.uk/files/britishlibrarydatastrategyoutline.pdf
datasets@bl.uk
https://data.bl.uk
http://bl.uk/datasets
https://goo.gl/X129Yp
41. 41@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Digital research methods
Digital Scholarship
Visualisations
Application Programming Interfaces (APIs)
for datasets e.g. Metadata, Images, etc Annotation
Location based searching & Geo-tagging Crowdsourcing
Human Computation
In 20 years time?
42. 42@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Competition
Awards
Projects
Tell us your ideas (2013-16/17)
Show us what you have already done
in Research, Artistic, Commercial,
Educational & BL Staff categories
Talk to us about working on
collaborative projects
Tell us your ideas (2018
onwards) <=5 days support
• Roadshows
• Events
• Online
• F2F & Virtual
Conversations
New!
Digital Research Support
11 Oct 2018
Engaging with our Digital Collections / Data
More details at:
http://labs.bl.uk
43. 43@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
What did people
actually do?
Examples from Text and Images
Over 200 examples (including sound, video) from
Competition and Awards:
http://labs.bl.uk/Ideas+for+Labs
http://labs.bl.uk/Other+Uses+of+Collections
44. 44@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Example Pattern of Research
1, 2, 3
1. Find / identify new things in messy stuff
2. Unlock hidden history / data
3. Celebrate new discoveries
46. 46@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
https://goo.gl/oUNj5N
https://goo.gl/ImAUv4
Finding things in ‘messy’
Optical Character Recognised (OCR) text
Mrs Folly • Clean up some manually
• Get human ‘ground truth’
• Write computer code (sometimes
it’s machine learning) to find
things reliably in it ‘automatically’
• Try code on messy content
• Tweak if necessary
• Digital ‘lasso’ around content
• Human sift through
Mrs Folly
An example pattern of research
47. 47@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
http://victorianhumour.tubmblr.com
Victorian Meme Machine (2014)
https://goo.gl/HMqDt3
Bob Nicholson
http://victorianhumour.tumblr.com/
Bob Nicholson interviewed on
BBC Radio 4 Making History Programme:
http://goo.gl/fmV9ep
And telling jokes to the public:
http://goo.gl/xIDRhz
Bob obtained further funding from his university
Looking for more collaborations
https://www.youtube.com/watch?v=-GRgj7Q5OM0
Rob Walker, Victorian Mother-in-law Jokes
Victorian Comedy Night, 7 Nov 2016
Learnt about access paths
to digital collections
48. 48@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Katrina Navickas (2015)
Political Meetings Mapper
http://politicalmeetingsmapper.co.uk
https://goo.gl/Qq78Oa
Labs Symposium 2015
https://goo.gl/BSA3be
Interview 2015
The Chartist Newspaper
http://goo.gl/vOLSnH
Chartist Monster Meeting
Chartists Walking Tour and
Re-enactment London
Learnt that domain knowledge
reduces noise
49. 49@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Black Abolitionist Performances & their
Presence in Britain (2016) – Hannah-Rose Murray
Frederick
Douglass
Ellen
Craft
Josiah
Henson
Ida B
Wells
A Performance by
Joe Williams &
Martelle Edinborough
http://frederickdouglassinbritain.com/
Started to implement
Machine Learning Techniques
50. 50@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Data-mining verse in 18th
Century newspapers
BL Labs Project 16-17, Jennifer Batt
https://goo.gl/5Akthd
Slides courtesy Jennifer Batt
51. 51@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Transcribing historical Arabic Scientific
Manuscripts for OCR research
https://fromthepage.com/bldigital/arabic-scientific-manuscripts
http://blogs.bl.uk/digital-scholarship/2018/03/arabic-handwrittten-ocr.html
52. 52@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Virtual Infrastructure for OCR text
OCR text ‘scraped’ from
digitised newspapers
and put in internal cloud
Jupyter notebook
Write python code and results
in web browser
http://jupyter.org
Access available for researchers ‘in residence’
54. 54@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Worked better for female faces than men’s
Press
http://mechanicalcurator.tumblr.com
Posts image every 30 minutes
http://www.flickr.com/photos/britishlibrary/
1,020,418 images
need tagging!
Creative uses of images
Face recognition
Algorithms based on photos
Mechanical Curator
with an algorithmic brain
(Circles, Squares and Slanty etc)
http://goo.gl/qPPgxX
Snipping out images
from 65,000 Digitised Books*
>800,000,000* views
>17,000,000* tags
https://goo.gl/FgZ4HM
Work @ BL by Ben O’Steen, Labs
and Digital Research Team
*Matt Prior - http://goo.gl/j29Tnx
Since Dec 2013
Tumblr
*Estimates
>More demand to see
physical items
56. 56@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Tagging a million images
Iterative Crowdsourcing
http://goo.gl/j6fxac
Cardiff University’s
Lost Visions Project
http://www.metadatagames.org/
Metadata Games
James Heald
Mario Klingemann
Chico 45
Use computational methods
Human Tagger
Top British Library Flickr Commons Taggers
18 hard core taggers
How to reward and keep motivated this ‘small group?
Average for ‘crowd’ is 1 tag per person
What kind of ‘task’ can this ‘crowd’ do?
Mobile games for ‘Ships’, ‘Covers’ and ‘Portraits’ Interface for tagging
57. 57@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Adam Crymble (2015)
Crowdsource Arcade
What if crowd sourcing
looked like this?
http://goo.gl/LBfJ4W
http://goo.gl/OH9pOZ
https://goo.gl/7z0j8p
30 mins talk
Labs Symposium (2015)
https://goo.gl/SSRsdd
5 min interview (2015)
http://goo.gl/0APpE8
Game Jam
Using Arcade Games
to help Tag images
‘Art Treachery’ and ‘Tag Attack’
58. 58@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Special Jury’s Prize (2015)
James Heald – Wikimedia and Map work
https://goo.gl/WYZCB2
http://goo.gl/HNQq5e
https://goo.gl/VPgffL
https://commons.wikimedia.org/
https://goo.gl/djtm1b
Labs Symposium (2015)Geotagging maps
50,000 Maps
Found in Flickr 1 million
Human & Computational Tagging
& Community engagement
Geo-referencing work
https://www.bl.uk/georeferencer
59. 59@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
SherlockNet: Competition Winner 2016
Karen Wang, Luda Zhao and Brian Do
Using Convolutional Neural Networks to Automatically Tag and Caption
the British Library Flickr Commons 1 million Image Collection
12 categories
>15.5 million tags added
>100,000 captions
bit.ly/sherlocknet
Pooled surrounding
OCR text on page
from similar images
Used Microsoft COCO (photographs) &
British Museum Prints and Drawings
collections as training sets.
Tags Captions
60. 60@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
http://goo.gl/dM8ieA
Mario Klingeman (2015)
Code Artist / Curator
http://goo.gl/bNxGZZ
Kris Hoffman (2016)
Animation for Fashion Week 2016
https://goo.gl/QilqqT
Jiayi Chong 2016 - Animation tool
https://www.facebook.com/RealmlandStory/
Paul Rand Pierce 2016
Graphic Novel on Facebook
Tragic Looking Women
44 Men who Look 44
(Notice the direction faces)
A Hat on the Ground
Spells trouble
Artistic / Creative Works
https://www.youtube.com/watch?v=Q3SBxO34Zlc
David Normal 2014 and 2015
Collages/Paintings & Lightboxes
61. 61@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Imaginary Cities – BL Labs Project /
Exhibition 16-18 (Michael Takeo Magruder)
An artistic exploration seeking to create provocative fictional cityscapes for the Information Age
from the British Library’s digital collection of historic urban maps
63. 63@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Careful of making conclusions based on
‘black box’ software & techniques (e.g.
sentiment analysis), learn the assumptions
behind them first!
Lessons Learned & Challenges…
Beware of ‘Black Box’ software…
65. 65@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Huge appetite to use digital content & data
for anyone’s ideas!
(e.g. Flickr Commons stats).
Lessons Learned & Challenges…
Huge demand for open digital content…
https://goo.gl/yQ5s4U
66. 66@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Labs mindset…
1. Start a conversation, generate positive energy,
be nice, have fun and try to support ideas .
2. Start with small experiments, but think big!
3. Fail faster (don’t be afraid) and persevere.
4. Reject perfectionism! Good enough is
sometimes…good enough!
5. Celebrate the uses of digital collections, tell
the world!
https://goo.gl/noASfl
67. 67@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Library Labs around the world
Meeting in London to share experiences 13-14 September 2018
BL Labs
Royal Danish Library
Austrian National Library
Library of Congress
BnF
KB
DXLab
Swedish National Library
Norwegian National Library
Berlin State Library
68. 68@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Hey there Young Sailor!
Ling Low 2016 – Hey there Young Sailor
https://www.youtube.com/watch?v=bcOP1E5bRE0VIMEO.COM/SWEETANDLOWFIL
MS
@SWEETNLOWFILMS ON
INSTAGRAM
@SWEETNLOWLING ON TWITTER
The Impatient Sisters
Play to fade!
70. 70@BL_Labs @Britishlibrary @QNLib @UCL_Qatar
Questions?
Prompt Question
I didn’t understand…. Can you tell me more about…
Why did you… I am not sure about…
What if… Why didn’t you…
What’s the best thing about… What was the worst thing…
If you could have your time again,
…
How did you…
I am not sure I agree about… What was the biggest challenge…
What was the most successful
thing about…
Who did…
Editor's Notes
90 seconds (270 words)
Salaam malekham, sabah alkhyr (sabah hul-klearee). My name’s Mahendra Mahey and it’s my great honour to be the keynote speaker for this conference, I really hope you have a wonderful and productive time here and in Qatar. So let’s begin…&lt;CLICK&gt;
I manage a project at the British Library called British Library Labs or ‘BL Labs’ for short. It’s made up of a team of 4 people and we also work occasionally with our Digital Research and Digital Scholarship colleagues. The project’s been running for over 4 years and is kindly supported by the Andrew W. Mellon Foundation and the BL.
&lt;CLICK&gt;
I am going to take you on a journey so that you learn about our experiences of working with the BL‘s digital collections. I will identify issues, challenges, problems and solutions we have encountered and look at the impact our work is having on Research Data Management, Digital Humanities, research infrastructure and data literacy amongst researchers and librarians in particular. I will show you how and why we have engaged with a range of people using our data, highlighting their work and findings, and present some of the lessons we have learned and examine the wider impact of the project on the Library and other organisations.&lt;CLICK&gt;
A link to download my presentation appears on the bottom of each slide and for those of you using social media I have also included some relevant tags if you would like to tweet. Hopefully, there will be 5 minutes at the end for questions and my email address is below in case you are shy or think of something afterwards.
18 seconds (55 words)
to ‘experiment’ with our digital collections. We’re particularly interested in people who have questions which focus on finding and creating NEW things using our large collections data which use digital research methods, especially when manual methods aren’t possible such as looking for patterns across thousands of digitised books or millions of newspaper pages.
23 seconds (71 words)
Though the project focusses on working and communicating with Digital Humanities and Digital Scholarship researchers, we have also engaged with amazing Artists, Librarians, Curators, Educators, Entrepreneurs, Archivists, Software Developers and other innovators. Hopefully, I will show you&lt;CLICK&gt;
some inspirational examples of work they have done which have used our digital collections.&lt;CLICK&gt;
I will also reflect on our experiences, challenges and lessons we have learned working with some amazing and pioneering people.
73 seconds (220 words)
BL Labs is based at the British Library in London, which was founded in 1973, though it’s origins stem back to the British Museum in 1753. It’s probably the largest research library in the world. The St Pancras site which you can see, opened in 1997 and stores many of the frequently requested items around the building including 4 stories below the ground floor. Much of the collection has been built up through legal deposit, where a copy of every UK and Ireland publication must be given to us.&lt;CLICK&gt;
The St Pancras building can seat 1,200 researchers across 5 reading rooms where our readers can access collection items. We get around half a million visitors per year.&lt;CLICK&gt;
Medium and long term requested items are held at Boston Spa 300km away from London, in huge ‘factory like’ storage facilities. For example, building 37 as pictured, is a low oxygen warehouse, using robots to retrieve items . Boston Spa also has a reading room and it takes 48 hours for requested items to get to London or vice versa. In total, the library has over 700 km of shelving across both sites, growing by 12 km every year. &lt;CLICK&gt;
Finally, we also manage the public lending right at Stockton-on-Tees around 400km from London, which is the Author’s right to payment each time an in-copyright book is borrowed from a public library.
42 seconds (128 words)
The Library focuses most of its work and collaborations through it’s 8 year Living Knowledge vision. Initiated in 2015, to coincide with the 50th anniversary of the creation of the Library, our vision is to make our intellectual heritage accessible to everyone, for research, inspiration and enjoyment and be the most open, creative and innovative institution of its kind by 2023. The Library’s two core purposes are to build, curate and preserve the UK national collection of published, written and digital content and to support and stimulate research of all kinds.&lt;CLICK&gt;
We also support businesses helping them to innovate and grow, engaging everyone with memorable cultural experiences, inspiring young people and learners of all ages and working with international partners around the world to advance knowledge and mutual understanding.
41 seconds (123 words)
We have huge collections of physical items. Here you can see inside the main building in London, it’s the King’s Library – King George the Third and Fourth’s personal library! We currently estimate our total number of physical items exceed &lt;CLICK&gt;
180 million items, representing every age of written civilisation and every known language. Our archives now contain the earliest surviving printed book in the world, the Diamond Sutra, dating from 868 C.E…., we have around 14 million books, only 7% of our collections. We also have around…&lt;CLICK&gt;
60 million patents, 8 million stamps, 4 million maps, 6 million sound recordings, 1.6 million music scores, over 300,000 manuscripts and 800,000 serial titles (which are of course made up of many volumes/editions/issues and series). We employ around 1600 people across all sites.
6 seconds (20 words)
BL Labs focuses on getting people to experiment with its digital collections, things that are already &lt;CLICK&gt;
born digital&lt;CLICK&gt;
or digitised.
36 seconds (110 Words)
&lt;CLICK&gt;
In 2013, legal deposit was extended to cover non-print material, consequently we have been collecting UK websites through the UK Web Archive, e-books, e-journals, CDs, DVDs etc. As a result terabytes and billions of items are being archived at the BL every year. &lt;CLICK&gt;
We are the headquarters of the Alan Turing Institute for Data Science and BL Labs is active research partner.&lt;CLICK&gt;
We are also part of the Knowledge Quarter London Hub, comprising of 80 world class knowledge based organisations situated within a 1km radius of the BL, sharing ideas, best practice, meetings and events e.g.&lt;CLICK&gt;
Companies such as Google and our sister organisation the British Museum to name a few.
24 seconds (72 words)
The BL are world renowned experts in digitising materials from our physical holdings. One common misconception that many people have is that much if not all of our collections are digitised. So, the actual proportion of our collections that are digitised surprises many&lt;CLICK&gt;
The figure is around 1-2% of our physical collections.&lt;CLICK&gt;
Much of our digitisation activity happens through partnerships with commercial, philanthropic, charitable and foundation partners such as the Qatar Foundation. &lt;CLICK&gt;
What is for certain, is the amount we are digitising is increasing rapidly. Our new programme called Heritage Made Digital for example prioritises those collections for digitisation where there is a clear researcher demand.&lt;CLICK&gt;
One important thing we have learned is that researchers need to take heed when doing research based on our digital collections, as they are rarely complete, having gaps and not necessarily being representative of our physical collections.
41 words (125 seconds)
Our work in Labs has taught us that it always pays for researchers to know the back ‘story’ of a digital collection especially if they want to use it for research and analysis.&lt;CLICK&gt;
There are too many things to consider right now, but a few highlights are such as, ‘are there gaps in the collection?’, ‘can they still be accessed?’, but perhaps most important of all is whether the curator or a human being who knows about the collection is still around who could be asked about it. Our experience has told us that so much will probably be in their head that isn’t written down, information that could be vital, important and useful for knowing about before carrying out research or re-use.
26 seconds (66 words)
So why is so ‘little’ digitised? Simply put, it costs money, time and resources to digitise physical materials to a professional standard. However, even though our digitised collections are a small fraction of our physical, combined with our born digital collections they still represent an impressive and sometimes un-imaginable amount of data.&lt;CLICK&gt;
Currently we have 702 collections, ranging from the UK Web which includes billions of websites, to a collection of 130 digitised Chinese scroll maps. Some items on this list are confidential and require due diligence/risk management before we can tell the world about them.
35 seconds (106 words)
Further analysis of our digital collections reveals that only 15% (that’s 105 collections) are openly licensed of which four fifths are available online. &lt;CLICK&gt;
85% of our digital collections are only available onsite. Each month, more collections are being made available under an open access license, through our ‘Access and Re-use’ committee, but this takes time, especially for collections that were digitised before 2012, when we didn’t have such a group.&lt;CLICK&gt;
Here’s a breakdown of our digital collections by type, &lt;CLICK&gt;
Our digitised collections include born digital, e-acquistions, and of course the results of many digitisation projects funded by public/private partnerships, some of which are still in progress.
11 seconds (24 words)
Giving access to our openly licensed digitised materials is obviously much easier than&lt;CLICK&gt;
Digital collections that are only available onsite such as those that are still within copyright to name one of many reasons.&lt;CLICK&gt;
9 seconds (28 words)
So, how do we give access to onsite-only Digital Collections at the British Library? (that’s the 85% of our data).Well there are further challenges in doing this.
55 seconds (167 words)
&lt;CLICK&gt;Sometimes digital content is only available onsite due to license restrictions, or even only on a specific computer in a reading room! Technically of course, there are actually very few reasons why digital content can’t be online, though it might be too big or it hasn’t been transferred from the original digital media device it was stored on, such as CD, minidisc, Vinyl for example.&lt;CLICK&gt;
Sometimes, access is provided through a paywall. Finally, &lt;CLICK&gt;
some content is in the happy sunny place, online, open and freely available to all of humanity.
The real reasons why there are challenges to accessing digital content are of course human. They require different approaches from the Library and may often involve an honest, open dialogue and negotiation with the publishers who gave us the content in the first place.
The Labs project has tried to address this problem by creating a ‘residency model’ where they are security cleared using hot desks in staff areas or trailing areas in the reading rooms &lt;CLICK&gt;
for researchers to work intensively with a digital collection on-site, so as to not infringe access conditions.
2 seconds (5 words)
Why are we doing this?
9 seconds (28 words)
We support research it’s our job!We want to work closely with and listening to those who want use our digital collections and data for their work!
13 seconds (39 words)
We can learn how we are and should be supporting you and this therefore shapes the problems we work on, such as:
Access to digital collections / data?
Advice, guidance, technical support, training
Services, Tools and Processes?
Many more reasons…
7 seconds (22 words)
Where are the gaps between what you want & what we can give?
How do we build the bridges to overcome the gaps?
10 seconds (30 words)
How do we help you ‘navigate’ their way through the ‘maze’ (sometimes) of theLibrary to what they want to do?
It sometimes requires understanding the culture of the organisation
28 seconds (85 words)
This what I imagine it feels like for a researcher looking for our physical collections. &lt;CLICK&gt;
Everything is on an industrial scale and it can feel overwhelming. Sometimes it isn’t always straightforward to find our items, as there are many that are not on our digital library catalogue, e.g. still on card catalogues and some items are in the secret and very secure parts of the Library where you would need very special permission because the items are extremely valuable and fragile for example.
36 seconds (109 words)
Our digital offering is perhaps like this.&lt;CLICK&gt;
Imagine entering a boutique sweet shop. We have some lovely things to tempt you, but it’s much smaller than the hypermarket you just visited. The shop keeper tells you there are some things behind the back door in a giant warehouse. However, you will need special access to enter that space. She also states that there are rooms in that warehouse, even she isn’t allowed to look. She isn’t even allowed to share the full list of stock because there are items on there she may never be able to be see because they were meant to be secret.
33 seconds (99 words)
Given these challenges, the Library has to do lots of external engagement, to tell people what we have. Every year we have a roadshow around the UK and sometimes we get to go to other places in the world, such as Qatar, thank you Milena.&lt;CLICK&gt;
We do this to partly ‘de-bunk’ the myths about the Library.&lt;CLICK&gt;
And to show / tell researchers about the reality of our data.&lt;CLICK&gt;
What we have learned is that researcher’s project ideas of what they want to do with our digital collections always change once they explore and see the reality of our data.
49 seconds (148 words)
So what kind of conversations do we have with researchers who may want to use our digital collections and data?&lt;CLICK&gt;
The dialogue typically can be: ‘Ah, you are ‘lucky’ & we have the exact digital content / data relevant to your research’, informally we call these our ‘lucky dip researchers’.&lt;CLICK&gt;
Or the conversation might go like this…’Ah, we don’t exactly have what you are looking for, but here is what we do have, is there anything of interest that you like? Let’s talk…&lt;CLICK&gt;
We have learned that engagement can be hard work. But it’s constantly required to maintain interest in our digital collections because they aren’t all instantly discoverable on search engines.&lt;CLICK&gt;
We also tend to attract researchers with ‘fuzzier’ and ‘flexible’ research boundaries and those who are possibly open to more interdisciplinary / collaborative research.&lt;CLICK&gt;
Finally, we have found that artists find this dialogue easier.
12 seconds (37 words).
In another way, we are trying to match our audiences research needs and digital interests &lt;CLICK&gt;
With the digital collections we have&lt;CLICK&gt;
It is at this intersection where Labs works best and it usually starts with a conversation.
24 seconds (72 words)
Let’s look a little further at the types of interactions we have with our researchers. We have summarised these phases as ‘Exploration’ where people often ‘rethink’ their ideas of what they want to do with the data, ‘Query-Focused’ where they often have to iterate to come up with a realistic proposal of what they want to do and a ‘Wrap-up’ phase to end their project with us, if it is relevant.
26 seconds (78 words)
The ‘exploration’ phase allows the researcher to understand the data in an open-ended fashion, discover potential tools to work with the data, gain awareness of their own capabilities and limitations and develop a firmer research query, gauging costs, resources, risks and the realistic time needed to complete the project.&lt;CLICK&gt;
The outputs of this exploration are not necessarily intended to be shareable, beyond personal experience and identifying key features of their enquiry (data size, formats, tool successes, etc.).
43 seconds (129 words)
The ‘query-focussed’ phase allows the researcher to develop a firmer and more informed query where: Suitable datasets are already lined up, there is a good idea of the initial toolset and capabilities required, that is human and technical requirements. The project outputs are outlined, and relevant reuse applications are begun. There are clear agreements on what happens at the end of the project – data deletion, virtual machine deletion/archiving/etc. The project may iterate on initial ideas, depending on researcher’s cost and their’s and the BL’s appetite for risk.&lt;CLICK&gt;
This phase may typically be supported by the Library through our new Digital Research Support phase where researchers can get up to 5 days of support for them to further develop their project ideas. More about this later.
33 seconds (99 words)
Finally, when working on projects it’s important that there is a wrap-up phase. Here, the Library may give back the researcher’s work (such as code and notes) through an export from BL hosted tools (especially for those that are onsite). Also, all derivative data is licenced or retained based on reuse agreements (such as our Access & Reuse board, etc.). Provisions are made for the project to be wound-down, as agreed (for example, derivative data is deleted after a grace period, or hosted by the Library if requested by the researcher and appropriate for further re-use by others).
76 seconds (228 words)
So let’s have a very brief overview of our digital collections, datasets and derived data. &lt;CLICK&gt;
We have thousands of playbills from theatres, cuttings from magazines, books and millions of newspaper pages digitised, including their Optically Character Recognised text.&lt;CLICK&gt;
We have been using external platforms to host our digital collections because this is often a more effective way to make them more visible on the internet, such as Flickr and Wikimedia Commons. We have of course been helping develop the Qatar Digital Library, making digitised manuscripts available from the middle east to all. The International Dunhuang Project makes digitised manuscripts from China available. The Polonsky foundation is helping us make Hebrew Manuscripts accessible and we have thousands of geo-referenced historic maps as well as an online crowdsourcing geo-referencer tool.&lt;CLICK&gt;
We are making millions of Library data available from UK and Irish National Library catalogues through our British National Bibliography service&lt;CLICK&gt;
We can provide usage data from our readers. EtHOS holds all UK PhDs, either born digital or some digitised, and as previously mentioned the UK Web Archive.&lt;CLICK&gt;
We have been recording English language TV news broadcasts since 2010 and archiving historic and current UK radio programmes.&lt;CLICK&gt;
We have derived data from the Digital Music Lab project which analysed world and traditional music to look for similarities across countries, digitised sheet music and digitised environmental sounds, music and oral history.
56 seconds (169 words)
Despite our digital collections being a small fraction of our physical holdings and over 85% only being available onsite, here are some ways you can find out about our openly licensed cultural heritage collections. &lt;CLICK&gt;
First, on the Labs website we have created a guide pointing to over 100 digital collections. Then as of today, curators have created nearly 200 collections guides by subject, each one having a section on what is available digitally onsite and online if relevant.&lt;CLICK&gt;
As part of the Labs project and overall data strategy for the Library we have created a data service, ‘data.bl.uk’ where users can download over 100 datasets. Importantly, it provides the ability to download entire collections instead of single items. Each collection is treated as a dataset with it’s own citeable Digital Object Identifier (D.O.I) for replicable research purposes. The site also includes derived data from experiments that have been carried out on our digital collections.&lt;CLICK&gt;
Please note that not all of these datasets are discoverable on all search engines.
40 seconds (124 words)
Because of time and resources in Labs, we didn’t spend much of it cleaning the data before putting it on data.bl.uk. We embraced ‘dirty data’ somewhat. Our data therefore comes with a health warning especially those of you who would like to carry out computational research on cultural heritage data, as it tends to pretty messy for computers to make sense of.
The problem is that computers think U.S., U. S. , U.S.A., U. S. A. , United States, United States of America are six different places.
Fields also contain things like internal notes about potential duplicates, unexpected extra information - notes on what type of location, etc. Lots of inconsistencies - uncertainty and date ranges expressed in different ways.
35 seconds (107 words)
Open Refine is an amazing tool which we have been using to clean up data. It will suggest ways to make the data more consistent for example. You can then export the data and keep working on it in with other tools, or put it into Open Refine. Because it runs locally it can be used for sensitive data you mightn&apos;t put online.
One issue is that Libraries tend to use question marks to record uncertainty in attribution, but Refine strips out all punctuation, so you have to be careful about preserving things like that (if that&apos;s what you want). It also takes in various data formats.
39 Seconds (117 words)
We have been learning that characterising our data is a really valuable way for researchers to begin to understand what we have. Though this is pretty resource intensive, we have carried out some simple experiments. &lt;CLICK&gt;
Here, you can see that an analysis of our catalogue data reveals the use of different versions of the Dewey Decimal System across the years.&lt;CLICK&gt;
Secondly, in the left column you can see what looks like random data/noise. However, when grouped, we can see the dark blue visualisation indicates there is some similarity in the data, in this case it was subtitles from digitised TV broadcasts.&lt;CLICK&gt;
We know this is something we should do more of, if we had more resources.
25 seconds (77 words)
We are also learning that our digital collection will need significant curation to make them more accessible and re-usuable to researchers. At the moment, a definition of a collection really comes from the efforts of digitising it as a digitisation project. This can be meaningless to a researcher. We believe a new role is emerging for researchers, perhaps libraries to develop roles which enable the ability to characterise, manage and curate data for meaningful research by scholars.
22 seconds (67 words)
We are learning that only a small group of researchers that Labs is working with posses the digital skills to use digital research methods. Many lack them, including library staff&lt;CLICK&gt;
Should they be teamed up with those that have those skills such as computer scientists or should there be a focus on training such as Software / Library and Data carpentry courses for Librarians and budding Digital Humanists.
43 seconds (129 words)
The BL has been active in providing research services for data for many years. &lt;CLICK&gt;
Opportunities for Data Exchange (ODE) looked at the ways in which data centres, publishers, libraries and researchers encourage better data citation.&lt;CLICK&gt;
The DataCite service enables researchers to obtain credit and recognition for sharing their research data, built on digital object identifiers (DOIs)&lt;CLICK&gt;
ODIN is built on the Open Researcher & Contributor ID Initiative (ORCID) and DataCite to uniquely identify scientists and data sets.&lt;CLICK&gt;
The Unlocking Thesis Data project promoted the use of persistent identifiers for theses, their underlying data and their authors.&lt;CLICK&gt;
THOR (Technical and Human Infrastructure for Open Research) established integration between articles, data, and researchers across the research lifecycle.
&lt;CLICK&gt;
A new project called FREYA will simplify the links between people, research outputs and funding.
40 seconds (121 words)
Our updated data strategy sees research data as integral to our collections, research and services as text is today. The strategy is structured around 4 central themes.&lt;CLICK&gt;
Data Management involves the creation of a data management plans and processes to meet our obligations under funding council requirements.&lt;CLICK&gt;
Data creation of datasets derived from our collections, and supporting those who want to so the same.&lt;CLICK&gt;
Datasets collected and created by the Library will be archived and preserved in line with its other collection policies.&lt;CLICK&gt;
The Library ensures that there is appropriate discovery, access and reuse of the datasets it holds, as well as those available from third parties.&lt;CLICK&gt;
A useful email address and websites are displayed should you want to make further investigations.
6 Seconds (19 Words)
‘how’ do we try and engage those who might be interested in the BL’s digital collections and data?
75 seconds (225 words)
Here are the kinds digital research methods our digital scholars are using.&lt;CLICK&gt;
For example, searching for items based on and time and location can reveal very interesting patterns, e.g. when and where works were published. Geotagging digitised objects, putting them in space can add new dimensions to the kinds of research questions we might want to ask.
&lt;CLICK&gt;
Corpus analysis of text in language and Text mining are methods which can find patterns in text through computational analysis.&lt;CLICK&gt;
Tasks that require humans to use technology to complete a task that computers would hard fall under the area of Crowdsourcing and Human Computation&lt;CLICK&gt;
Annotation involves augmenting an item with additional information, usually text.&lt;CLICK&gt;
Similarly transcribing can be the conversion of speech into text through human or computing power to then be used for further analysis.
&lt;CLICK&gt;
Providing Application Programming Interfaces or APIs to data can be very powerful ways for computational access to datasets, used by software developers to build software applications for example.
&lt;CLICK&gt;
Many researchers want to see the patterns that are emerging in large amounts of data and are now using a number of very powerful tools to visualise them to see patterns.
&lt;CLICK&gt;
What is clear is that digital methods are much more that searching for an individual item in a catalogue and Libraries, publishers, service and content providers have to change to support that.
63 seconds (191 words)
So how do we engage people to use our digital collections and data?&lt;CLICK&gt;
Between 2013-2016 we ran an international competition, where we asked people to come up with project ideas. We chose two and then worked with them for 4-6 months and showed the results to the world at our annual symposium in November.&lt;CLICK&gt;
This has now been replaced by a new service where we will provide up to 5 days Digital Research Support to develop researcher’s project ideas of what they want to do with the Library’s digital collections.&lt;CLICK&gt;
Our international annual Awards, recognise work already done with our digital collections in Research, Artistic, Commercial and Educational categories. We also try to recognise and celebrate the superb work our own staff do with our digital collections through our annual BL Labs Staff Awards.&lt;CLICK&gt;
Finally, talk to us about working on collaborative projects, which tend to be aligned to our overall Library vision wherever possible.&lt;CLICK&gt;
We also do a lot of external engagement, running annual roadshows, events, providing online materials, video interviews, case studies, arranging face to face meetings in London or virtually.&lt;CLICK&gt;
More information is available via our website.
21 Seconds (65 Words)
Katrina Navickas was particularly interested in the &lt;Click&gt;Chartist Movement who were a group who were campaigning for the vote for working people. &lt;Click&gt;They were the biggest popular movement for democracy in 19th century British history, just as this is early picture shows a huge monster meeting at Kennington Common&lt;Click&gt;She wanted to use a combination of manual and computational methods to explore our Digitised Newspapers to find out when and where they met and plot them on map. &lt;Click&gt;and hopefully unearthing new history.
X Seconds (X Words)
Posts small illustrations taken almost at random from the digitised book corpus to a Tumblr blog.
This experiment with undirected engagement was a by-product of work to uncover the hidden wealth of illustrations within the digitised pages.
27 Seconds (82 Words)
Adam Crymble &lt;Click&gt;wanted to harness the power of playing fun games on arcade machines to help with crowdsourcing the tagging of un-described images. He particularly wanted to engage a younger audience into crowdsourcing .&lt;Click&gt;On the right you can see a replica 1980’s arcade machine we built and &lt;Click&gt;and on the bottom left some tagging games that were developed through a ‘Games Jam’ for the machine. &lt;Click&gt;. Let’s take a closer look at two of the games…&lt;Click&gt;
18 Seconds (56 Words)
Indexing BL the 1 million & Mapping the Maps – was led by James Heald and collaboration with others &lt;Click&gt;They produced an index of 1 million &apos;Mechanical Curator collection&apos; images on &lt;Click&gt;Wikimedia Commons from a collection of largely un-described images. &lt;Click&gt;This gave rise to finding 50,000 maps within the collection partially through a map-tag-a-thon &lt;Click&gt;These are now being geo-referenced. &lt;Click&gt;
15 seconds (47 Words)
Start a conversation, generate positive energy, be nice, have fun and try to support ideas.&lt;CLICK&gt;
Start with small experiments, but think big! &lt;CLICK&gt;
Fail faster (don’t be afraid) and persevere. &lt;CLICK&gt;
Reject perfectionism! Good enough is sometimes…good enough! &lt;CLICK&gt;
Celebrate the uses of digital collections, tell the world!
23 Seconds (70 Words)
Many national Libraries are now developing Labs around the world, we hope some of their work has been inspired by us. We are holding a meeting in London to share experiences and learn from each other in September. It would be wonderful if the QNL could come along, of course you are invited as well as others who might be interested. &lt;CLICK&gt;
Here are a snapshot of some of Libraries attending.
19 seconds (58 Words)
shukraan lakum (thank you). I would like to thank the organisers of the conference for getting me here and particularly my lovely colleague Milena Dobreva for thinking about me giving the keynote, I’m here until Saturday so please feel free to speak to or email me, I am very approachable, I believe we have 5 minutes for questions.