(1) The British Library's Digital Scholarship team aims to enable the use of the library's digital collections for research, inspiration, creativity, and enjoyment.
(2) The team is cross-disciplinary and supports the creation and innovative use of the library's digital collections.
(3) Recent projects include making Arabic manuscripts searchable through handwriting recognition software, digitizing South Asian printed books from 1713-1914, and exploring optical character recognition for languages like Bengali.
7th BL Labs Symposium (2019): 12_Digital Research team projects update
1. Neil Fitzgerald, Head of Digital Research
BL Labs Symposium 2019
@N_Fitzgerald
Digital Scholarship Update
2. www.bl.uk
The British Library's Digital Scholarship team
Our mission is to enable the use of the British Library’s digital
collections for research, inspiration, creativity, and enjoyment.
• Connect and share
• Support digital scholars
• Agents for change
• Invest in our staff
• Innovate and collaborate
3. Neil Fitzgerald
Head
Digital Research Team
Mahendra Mahey
Manager
BL Labs
Rossitza Atanassova
Digital Curator
Digitisation
Adi Keinan-Schoonbaert
Digital Curator
Asian & African
Stella Wisdom
Digital Curator
Contemporary British
Mia Ridge
Digital Curator
Western Heritage
Tom Derrick
Digital Curator
Two Centuries Indian Print
Nora McGregor
Digital Curator
European & American
The Digital Scholarship Team is
a cross-disciplinary mix of
curators, researchers,
librarians and programmers
supporting the creation and
innovative use of British
Library's digital collections.
Filipe Bento
Technical Lead
BL Labs
BL Labs Team
Deirdre Sullivan
Digital Research and
Coordinator Apprentice
Maja Maricevic
Head of Higher Education
and Science
4. DH Award 2018: Best Blog
The Digital Scholarship Department is delighted to have won
the 2019 DH Award for ‘Best interesting Digital Humanities
Blog Post or Series of Posts’.
The Digital Humanities Awards are a set of annual awards
where the public is able to nominate resources for the
recognition of talent and expertise in the digital humanities
community.
The awards are intended as an awareness-raising activity to
help put interesting Digital Humanities resources in the
spotlight and engage Digital Humanities users (and the
general public) in the work of the community.
https://blogs.bl.uk/digital-scholarship
http://dhawards.org/
5. Automatic Transcription of Historical Handwritten Arabic Texts
Our aim: to make Arabic texts fully searchable and available for large-scale analysis
Main objective: to train Handwritten Text Recognition (HTR) software to read historical Arabic manuscripts
Collection: Scientific Manuscripts available on QDL (https://www.qdl.qa/en)
Method:
• Running competitions to find an optimal solution for Arabic HTR
• Participants used our ground truth set to train their recognition software and then evaluate how accurately the software automatically transcribed the text
• Ground Truth: a complete and accurate record of every character and word in the scanned images
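The evaluation step above, comparing an automatic transcription against ground truth, can be sketched in a few lines. This is only an illustration: the slides do not say which metric the competition used, so the character error rate (edit distance divided by ground-truth length) is an assumption, and the function names `levenshtein` and `character_error_rate` are hypothetical.

```python
def levenshtein(reference: str, hypothesis: str) -> int:
    """Minimum number of single-character edits turning reference into hypothesis."""
    # previous[j] holds the distance between the reference prefix seen so far
    # and the first j characters of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # delete a reference character
                current[j - 1] + 1,      # insert a hypothesis character
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def character_error_rate(ground_truth: str, transcription: str) -> float:
    """Edit distance normalised by ground-truth length (assumed metric)."""
    if not ground_truth:
        raise ValueError("ground truth must not be empty")
    return levenshtein(ground_truth, transcription) / len(ground_truth)
```

A lower value means the software output is closer to the ground truth. Note that Python compares Unicode code points, so Arabic diacritics encoded as separate combining characters would each count as a character in this sketch.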
6. Automatic Transcription of Historical Handwritten Arabic Texts
All ground truth resources will be hosted by the British Library and made freely available for anyone wishing to advance the state-of-the-art in text recognition technology
Resources:
• https://www.bl.uk/projects/arabic-htr
• https://www.primaresearch.org/RASM2019/
• https://blogs.bl.uk/digital-scholarship/2019/02/automatic-transcription-of-historical-arabic-scientific-manuscripts-round-2.html
7. Two Centuries of Indian Print
• Digitising and cataloguing rare and unique printed books from the British Library's South Asian printed books collection, 1713 to 1914, mostly Bengali
• Digital Curator Tom Derrick is exploring OCR technologies for Bengali print, digital research approaches to Book History and more
• To support computationally driven research, such as text mining, we’re providing the digitisation outputs on data.bl.uk under a public domain licence
Right: Pleasing Tales designed to improve the understanding, and direct the conduct of young persons, 1825
https://www.bl.uk/projects/two-centuries-of-indian-print
8. Two Centuries of Indian Print: OCR
• The project is exploring OCR solutions for Bengali text and Quarterly Lists (challenging table layouts)
• Benefit: this enables search and research at scale across many items
• Currently running an OCR competition in collaboration with PRImA (Pattern Recognition and Image Analysis) Research Lab at Salford University
• Aim: to find the best automated text recognition solution for Bengali and other Indian languages
• Resources:
• https://www.primaresearch.org/REID2019/
• https://blogs.bl.uk/digital-scholarship/2019/02/competition-to-automate-text-recognition-for-printed-bangla-books.html
9. Two Centuries of Indian Print: OCR
Quarterly Lists: descriptive catalogue records of books published quarterly and by province of British India between 1867 and 1947. The Quarterly Lists are available to download as searchable PDFs and as OCR XML via the British Library's datasets portal, data.bl.uk.
12. Living with Machines: data science, digital history
A five-year, £9.2 million research project combining expertise from the nation's research library with data-driven analysis
The British Library has digitised millions of pages from its collections and established a Digital Scholarship team to enable the use of its digital collections for research, inspiration, creativity, and enjoyment
+
The national institute for data science and artificial intelligence, The Alan Turing Institute, offers the expertise to harness this data to answer research questions at scale.
https://www.bl.uk/projects/living-with-machines
13. Training library staff in digital scholarship
Digital Curators dedicate 20% of time to training staff throughout the Library in
the opportunities for and practices of digital scholarship, which is primarily
delivered via the Digital Scholarship Training Programme (DSTP).
Our mission:
Provide colleagues with the space and opportunity to delve into and explore all
that digital content and new technologies have to offer in the research domain
today.
Create a variety of opportunities for staff to develop necessary skills and
knowledge to support emerging areas of modern scholarship.
14. Training library staff in digital scholarship
Now in its 7th year, the DSTP includes a wide range of training opportunities:
https://www.bl.uk/projects/digital-scholarship-training-programme
• Formal training courses
• Hands-on workshops
• Monthly Hack & Yacks
• 21st Century Curatorship talks
• Monthly Digital Scholarship Reading Group
In 2018/2019 we delivered 40 training events,
amounting to 224 training days! 848 attendees!
15. Computing for Cultural Heritage PGCert
The British Library and partners Birkbeck University and The National
Archives have been awarded £222,420 in funding by the Institute of
Coding (IoC) to co-develop a one-year part-time postgraduate
Certificate (PGCert), Computing for Cultural Heritage, as part of a £4.8
million University skills drive.
Throughout 2019-20, Nora McGregor, Digital Curator, will work closely
with a newly appointed Lecturer at Birkbeck to develop a new part-time
PGCert, covering topics such as:
• Module 1: Demystifying computing for heritage professionals
• Module 2: Analytic tools for cultural heritage professionals
• Module 3: Work-based digital project design and development
Trial
Autumn term 2019: Module 1 (15 credits) 6 hrs week/2 nights for 5 weeks
Spring term 2020: Module 3 (30 credits)
There are fully funded places on the trial for 20 staff from within the
British Library and the National Archives to attend in order to evaluate the
framework and programme content before it is fully launched in Autumn
2020.
Project page: https://www.bl.uk/projects/computingculturalheritage
Contact: nora.mcgregor@bl.uk
16. Digital Scholarship Training Seasons
A 'season' is a new, flexible format for learning and maintaining skills in the Library, with
training delivered through shorter modules that combine to build your knowledge of a
particular topic over time.
We know that it's hard to find the time to attend a whole day workshop, and that sometimes
you're only interested in specific aspects of a digital method or tool.
Running shorter sessions over a longer time-frame also allows us to respond to the rapid pace
of change for a subject like text and data mining, and gives you time to try out methods
between sessions.
Each season will have an introductory module outlining key concepts and terms, then you can
attend as many or as few sessions as you like, depending on the skills you want to learn,
maintain or put into practice.
17. Season of Text & Data Mining
Led by Mia Ridge
Text and data mining (TDM) uses automated analytical techniques to analyse text and data for
patterns, trends and other useful information. TDM methods have been applied to digitised and digital
historic, cultural and scientific collections to help scholars answer new research questions, or
investigate questions at scale, analysing hundreds or hundreds of thousands of items.
In addition to supporting new forms of digital scholarship that apply TDM methods, institutions like the
British Library may also be able to use TDM to enhance records to make collection items more
discoverable. TDM in cultural heritage draws on data science, 'distant reading' and other techniques to
categorise items; identify concepts and entities such as people, places and events; apply sentiment
analysis and analyse items at scale.
Course 120 Content Mining in Digital Scholarship
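As a rough illustration of what "analysing text for patterns" can mean at its simplest, the sketch below counts how often each term occurs across a set of texts and in how many texts it appears. It is not a method described in the slides; the function names and the letter-only tokenisation rule are assumptions made for the example, using only the Python standard library.

```python
import re
from collections import Counter

# Assumed tokenisation rule: a token is a run of Unicode letters.
TOKEN_PATTERN = re.compile(r"[^\W\d_]+")


def tokenize(text):
    """Split text into lower-cased letter-only tokens."""
    return [token.lower() for token in TOKEN_PATTERN.findall(text)]


def term_frequencies(documents):
    """Total number of occurrences of each term across all documents."""
    counts = Counter()
    for text in documents:
        counts.update(tokenize(text))
    return counts


def document_frequencies(documents):
    """Number of documents in which each term appears at least once."""
    counts = Counter()
    for text in documents:
        counts.update(set(tokenize(text)))
    return counts
```

Calling `term_frequencies(texts).most_common(20)` on a list of OCR text strings gives a first, crude view of a collection's vocabulary. Entity recognition or sentiment analysis, both mentioned above, would need dedicated tools beyond this sketch.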
18. Season of Place
Led by Adi Keinan-Schoonbaert
Recent season of talks and workshops on Digital Mapping for Cultural Heritage Collections. In
recent years digital mapping technologies have transformed the way we interact with the
world through GPS, mobile apps and spatial data. British Library collection items are replete
with geographic information, for example, place of publication, place-names within content
and many others we might not have considered.
Digital mapping provides a different perspective on Library collections, creating possibilities for
discovery and analysis and supporting new forms of digital scholarship and research. Anyone
with an interest in the collections could search, visualise, and analyse via geospatial web tools
or desktop Geographical Information System (GIS) applications. The digital scholarship ‘Season
of Place’ aims to open up these technologies for use on the library’s collections.
Course 108 Digital Mapping
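To make the idea of mapping "place of publication" concrete, here is a minimal sketch that turns catalogue-style records into GeoJSON, a format that many web mapping tools and desktop GIS applications can open. Everything in it is illustrative: the record fields, the tiny `GAZETTEER` lookup with approximate coordinates, and the function name are assumptions, not part of any British Library dataset or service.

```python
import json

# Hypothetical gazetteer: place name -> approximate (longitude, latitude).
GAZETTEER = {
    "London": (-0.1276, 51.5072),
    "Calcutta": (88.3639, 22.5726),
}


def records_to_geojson(records, gazetteer=GAZETTEER):
    """Build a GeoJSON FeatureCollection with one point per locatable record."""
    features = []
    for record in records:
        coordinates = gazetteer.get(record["place"])
        if coordinates is None:
            continue  # skip records whose place is not in the gazetteer
        features.append({
            "type": "Feature",
            # GeoJSON positions are ordered longitude first, then latitude.
            "geometry": {"type": "Point", "coordinates": list(coordinates)},
            "properties": {"title": record["title"], "place": record["place"]},
        })
    return {"type": "FeatureCollection", "features": features}
```

Writing the result out with `json.dumps(records_to_geojson(records))` produces a file that can be loaded into a mapping tool. In practice the hard part is matching historical place names to coordinates, which this sketch sidesteps with a hand-written lookup.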
19. Season of Emerging Formats
Led by Stella Wisdom
The Digital Scholarship and Contemporary British Collections teams are excited to announce a
season of talks and workshops about 'emerging formats'. These are types of digital publications
that are in scope for collection under the UK’s Non-Print Legal Deposit Regulations, but whose
content and structure are more challenging than those currently collected.
Working with the UK legal deposit libraries, the British Library is building its knowledge and
capability before it can collect these publications and make them available onsite to readers. The
British Library's Emerging Formats project focused on three format types:
• eBook mobile apps
• web-based interactive narratives
• structured data
Course 122 Introduction to Emerging Formats
Get in touch!
Web: http://www.bl.uk/subjects/digital-scholarship
Blog: http://britishlibrary.typepad.co.uk/digital-scholarship/
Email: digitalresearch@bl.uk
Twitter: @BL_DigiSchol
Editor's Notes
We support Digital Scholars
We promote the use of British Library’s digital collections and data and offer support for anyone wishing to use them in exciting and innovative ways. We work closely with scholars to understand their needs, enable access to content, and provide guidance and technical assistance to fulfil their digital and data-intensive project goals.
o Examples: BL Labs, external training, collaborative PhDs.
We Connect & Share
Through our connection to a global ecosystem of scholars, labs and institutions operating in the digital scholarship domain we maintain awareness of developing trends in this changing research landscape. We share knowledge, expertise and experience across this vibrant community and can leverage the network to connect Library users to the resources they seek.
o Examples: LIBER DH and RLUK working groups
We are Agents for Change
We ensure the Library’s systems, services and policies will meet the needs of anyone wishing to undertake computational and data-driven research based on our digital collections and data. We develop and pilot new digital scholarship services that can be transitioned into production.
o Examples: Data.bl.uk and plans in development for a more ‘Digital Reading Room’
We invest in our Staff
We are building the Library’s capacity to understand and support the emerging needs of digital scholars by investing in our staff skill development. We provide colleagues with the space and opportunity to delve into and explore all that digital content and new technologies have to offer in the research domain today. Through our bespoke programme of workshops, hands-on training, lectures and reading groups we raise awareness of the opportunities new digital methods bring to our users and our profession.
o Examples: Training programme/Digital Curator Matrix
We Innovate & Collaborate
We undertake innovative research, projects and collaborations, applying and experimenting with digital methods on our own collections to find solutions to address barriers to access for users.
o Examples: Bengali/Arabic OCR, Mechanical Curator, Libcrowds/Playbills, IIIF.
We work in the open as much as possible across a range of channels, e.g. our Digital Scholarship blog. Over the last year we’ve written on the blog about the following projects as work progressed.
Earlier this year, the British Library in collaboration with PRImA Research Lab and the Alan Turing Institute launched a competition on the Recognition of Historical Arabic Scientific Manuscripts. This competition was held in the context of the 15th International Conference on Document Analysis and Recognition (ICDAR2019). It was the second competition of this type, following the first one which took place in 2018.
The Library has an extensive collection of Arabic manuscripts, comprising almost 15,000 works. We have been digitising several hundred manuscripts as part of the British Library/Qatar Foundation Partnership, making them available on Qatar Digital Library. A natural next step would be the creation of machine-readable content from scanned images, for enhanced search and whole new avenues of research.
Running a competition helps us identify software providers and tool developers, as well as introduce us to the specific challenges that pattern recognition systems face when dealing with historic, handwritten materials. For this year’s competition we provided a ground truth set of 120 images and associated XML files: 20 pages to be used to train text recognition systems to automatically identify Arabic script, and 100 pages to evaluate the training.
Aside from providing larger training and evaluation sets, for this year’s competition we’ve added an extra challenge – marginalia. Notes written in the margins are often less consistent and less coherent than main blocks of text, and can go in different directions. The competition set out three different challenges: page segmentation, text line detection and Optical Character Recognition (OCR). Tackling marginalia was a bonus challenge!
When evaluating the results, PRImA compared established systems used in industry and academia – Tesseract 4.0, ABBYY FineReader Engine 12 (FRE12), and Google Cloud Vision API. The evaluation approach was the same as last year’s in order to gain an insight into the algorithms.
At the end of 2015, an international partnership led by the British Library received funding from the Newton Fund to digitise rare material from its South Asian printed books collection. The Two Centuries of Indian Print project has digitised more than 1,000 early printed Bengali books, which are now available online, and is currently digitising material in a range of the other 22 South Asian languages in our collections to drive digital scholarship opportunities for non-Western materials.
The project is exploring how digital research methods and tools can be applied to this digitised collection. This is especially important as many DH tools are optimised for working with Western language materials.
For the first time the project has made freely available in digital format the library's collection of bound Quarterly Lists. These are descriptive catalogue records of books published quarterly and by province of British India between 1867 and 1947. The Quarterly Lists are available to download as searchable PDFs and as OCR XML via the British Library's datasets portal, data.bl.uk.
The map shows the locations of the printers that were active in Kolkata. Clicking on one of the place markers shows information about that printer: how many books were printed there, the average number of copies printed, and the average number of pages and price of a book across all the books they printed. It uses all the data from July-December 1867 from one of our Quarterly Lists.
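The per-printer figures described above can be sketched as a simple group-and-average step. This is a minimal illustration only, not the project's actual processing: the record fields (`printer`, `copies`, `pages`, `price`) and the sample values are hypothetical and are not taken from the Quarterly Lists.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample records; the field names and values are illustrative
# assumptions and do not come from the Quarterly Lists data.
records = [
    {"printer": "Press A", "copies": 500, "pages": 120, "price": 1.0},
    {"printer": "Press A", "copies": 1000, "pages": 80, "price": 0.5},
    {"printer": "Press B", "copies": 250, "pages": 200, "price": 2.0},
]


def summarise_by_printer(rows):
    """Group book records by printer and compute the per-printer statistics."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["printer"]].append(row)

    summary = {}
    for printer, books in grouped.items():
        summary[printer] = {
            "books": len(books),  # number of books printed there
            "avg_copies": mean(b["copies"] for b in books),
            "avg_pages": mean(b["pages"] for b in books),
            "avg_price": mean(b["price"] for b in books),
        }
    return summary


summary = summarise_by_printer(records)
```

Each entry in `summary` corresponds to the information a single place marker would display for one printer.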
We also want to explore different methods of presenting and providing access to our data. Here the Quarterly Lists data is visualised with Tableau Public, one of the tools we experimented with during one of our Hack & Yack sessions. These are casual, hands-on sessions arranged by the Digital Research Team every third Tuesday of the month, in which we work through an online tutorial at our own pace but with the support of colleagues. We use them as an opportunity to explore new tools, techniques and applications relevant to digital research and to keep our own skills up to speed. These sessions supplement our larger digital scholarship training programme.
Grew out of a desire for The Alan Turing Institute and British Library to partner with each other. BL and other humanities scholars had been working for some time to interest Turing in the interesting problems that historical data presents to data science and AI; very much in line with our wider programmes of work in Digital Scholarship
Running since 2012, this innovative digital skills training initiative has provided the time and space for colleagues to develop digital skills and new ways of thinking. We aim to have something for everyone, from introductory courses aimed at novices to more advanced opportunities. It is very important to us that learning is inclusive and accessible, but also challenging.
In 2018/2019 alone the team held 40 different staff training events! Within that, 147 individuals (60% women) attended 15 of our formal courses.
Background
A recent job advertisement for a curatorial role at the British Library reflects the changing nature of, and digital competency requirements for, professional roles in the cultural heritage sector:
-contribute to and undertake work on digitisation and digital projects
-assist in implementing new technologies to make the collections more accessible through online presence or through digital tools
-have experience or familiarity with a variety of information technology skills underpinning digital research methods and practices (e.g. geo-referencing, text mining)
This is no less the case for professionals already in post, who often came to their role many years ago with deep domain expertise in a particular subject, yet now find themselves with increased responsibility for assisting with the design and delivery of complex digital projects, without a foundation in computing to truly empower them. Additionally, due to the scale and diversity of the digital collections held by the BL, and changing Library services and researcher demands, it is of great importance that all staff are aware of the issues, opportunities and strategies involved in working with large-scale digital collections and developing innovative digital projects. This requires having an understanding of approaches used in programming, data science, big data, machine learning, text mining, data analytics, cloud computing, and visualisation.
My colleague Nora McGregor has said: “Over the last seven years, the Digital Curator Team have delivered a ground-breaking Digital Scholarship training programme for staff at British Library. In this time we’ve experienced first-hand the incredible transformations that arise when time, space and opportunity is created for colleagues eager to keep apace of the technological innovations that underpin their work. This is an exciting opportunity to consolidate all that we’ve learned about the skills and knowledge they seek and encode it in a course uniquely designed to meet our needs in the cultural heritage sector."
Over time the delivery of our internal training programme has evolved to reflect the changing needs of staff working in operational and curatorial roles. Following changes implemented over the last year, the programme is now structured in a more flexible way to accommodate the differing needs of all staff.
An example of a related project is our work with Transkribus. Transkribus is software designed to improve automatic handwritten text recognition. It works by training algorithms to understand handwritten text by comparing images of digitised pages with 'ground truth' transcriptions of those pages.
Following a pilot with records from the East India Office (we have 9 miles of holdings just for this one collection), the British Library signed a memorandum of understanding with the READ Project in 2017 and became a founding member of the newly established READ-COOP over the summer of 2019. A European Cooperative Society with limited liability will serve as the basis for sustaining and further developing the Transkribus platform and related services and tools.
Handwritten text recognition (HTR) will be as transformative for handwritten documents as optical character recognition was for printed materials. Our work with this project should help integrate HTR into the BL’s digitisation and digital library workflows.
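The role of 'ground truth' transcriptions mentioned above can be illustrated with a character error rate (CER), a measure commonly used to compare recognised text against a reference transcription. This is a minimal sketch under stated assumptions: it is not a description of how Transkribus works internally or of how the READ project evaluates models, and the function names are our own.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        curr = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution (or match)
            ))
        prev = curr
    return prev[-1]


def character_error_rate(recognised: str, ground_truth: str) -> float:
    """Edit distance divided by the length of the ground truth transcription."""
    if not ground_truth:
        raise ValueError("ground truth transcription must not be empty")
    return edit_distance(recognised, ground_truth) / len(ground_truth)
```

A lower value means the recognised text is closer to the reference. Note that for scripts such as Arabic, results depend on how characters are normalised before comparison, which this sketch does not address.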
A detailed overview of the work done for this season will soon be available in a co-authored article in the Journal of Map and Geography Libraries: Special Issue on Information Literacy Instruction.
We have also supported a two-day hack event to produce a JavaScript web map with a time slider component (Web Maps-T) and specifications for timeline visualisation. The main aim is to enhance the ability to visualise Linked Open Data (LOD) on web maps.
Outcomes:
Web Maps-T: A GitHub repository containing a Minimum Viable Product (MVP) web map with a time slider (a component for use within broader systems)
Timeline Visualisations: GitHub repository containing specifications, design outlines and user-stories for visualising temporal data
White Paper: summarising the hack event, position papers, Web Maps-T MVP and timeline, plans for their integration and next steps for the component
The nature of storytelling and publishing is changing through the possibilities offered by digital technologies and the definition of a digital story includes dynamic publications created for mobile devices and the internet.
In the United Kingdom, Legal Deposit Libraries have the right to collect material published digitally such as websites, blogs, eBooks and e-journals. However, what happens when an eBook/app behaves in an unexpected way and needs to turn to external sources of information to explain a story? What tools and methods do libraries need to store these eBooks/apps? What challenges are posed by software and hardware? How is the relationship between creators, libraries, technology companies and user communities changing? What do researchers need to access emerging formats in a library?
Working with colleagues across the Library over the last year this programme of activity enabled us to start to explore these issues and will feed into our plans for the forthcoming year.
If you’re interested to find out more about the range of activities we’re involved in please see the case studies on our webpages.
We have good experience of working with external research partners to attract joint funding from research councils and trusts. We welcome proposals that promise to produce research that leads to mutually beneficial outcomes.
If you’d like to know more then please get in touch or follow developments via the channels on screen.