The document discusses the British Library's Emerging Formats Project which aims to collect and preserve complex digital publications that are within the scope of the UK's Non-Print Legal Deposit regulations but present new challenges. These "emerging formats" include web-based interactive narratives, book apps, and structured data. The project seeks to identify such publications, understand how to collect and manage them, and ensure long-term access despite dependencies on specific software or hardware. Key challenges include a lack of standards, technical complexity, and continuous changes in digital publishing. The project has so far collected some interactive narratives and a mobile book app.
4. www.bl.uk/projects/emerging-formats
The Emerging Formats Project
4
“The term 'emerging formats' refers to types of publication that are in
scope to collect under the UK’s Non-Print Legal Deposit Regulations,
but whose content and structure are more challenging compared to
those currently collected.”
(https://www.bl.uk/projects/emerging-formats)
Two year project (April 2017 – March 2019) to:
• Identify publications that are in scope to collect
• Identify and explore the collection management needs of these more-
complex digital publications
1. WHAT
5. www.bl.uk/projects/emerging-formats
What do we mean by ‘emerging formats’?
• Born-digital with no print
counterpart
• Often consist of more than
one media type
• Device dependent
5
• Comprised of non-standard
format and metadata types
• Not typically part of existing
collections
• At risk of obsolescence
(Source: Smith, C., & Cooke, I. (2017). Emerging Formats: Complex digital media and its impact on the UK Legal Deposit
Libraries. Alexandria, 27(3), 175–187)
1. WHAT
9. www.bl.uk/projects/emerging-formats
UK Non-Print Legal Deposit (NPLD)
9
• In April 2013 the Regulations
come into force
• UK Legal Deposit
Libraries have been
collecting various types of
born-digital publications
since 2013:
eBooks
eJournals
archived websites
geospatial
datasets
digital sheet music
http://www.legislation.gov.uk/uksi/2013/777/contents/made
2. WHY
10. www.bl.uk/projects/emerging-formats
Benefits for libraries:
• NPLD helps mitigate the possibility of a digital black hole
• Ensure we represent the changing nature and diversity of
the UK digital culture
• Develop capability that can be applied to more
‘mainstream’ collections
• Share knowledge and experience with other cultural
institutions to address related challenges
102. WHY
12. www.bl.uk/projects/emerging-formats
Benefits for publishers and creators:
122. WHY
• Ensure their works can be collected and made
accessible over the longer term
• Provide evidence of past uses of technologies
• Provide examples of creative and technical
innovation
14. www.bl.uk/projects/emerging-formats
How to collect complex digital objects in a way that
ensures preservation and access over the long-term?
14
At the acquisition stage we have to consider:
users expectations and needs (to determine what)
+
technical complexities and dependencies (to determine how)
GOAL: maximise long-term reusability & provide a
meaningful access experience
3. HOW
15. www.bl.uk/projects/emerging-formats
We collect:
• Selectively:
• High cost
• Low level of standardisation
• Under Non-Print Legal Deposit regulations
• Following prioritisation criteria agreed upon by all LDLs:
• Risk of loss identified
• Engagement from creators and publishers
• Existing capability that we can build on
153. HOW
17. www.bl.uk/projects/emerging-formats
Challenges…
17
Identify:
• Horizon scanning
• Geographical scope
• Authorship and copyright definition
• Digital publishing environment is in continual change
Collect:
• DRM
• Multiple versions
• Partial capture
• Lack of standards for how content is produced
Preserve:
• Technically and structurally complex publications
• Strong hardware and software dependencies
3. HOW
18. www.bl.uk/projects/emerging-formats
…and more challenges
18
The challenges presented by these publications are not unique to
Legal Deposit Libraries, but cultural organisations worldwide
Describe:
• Lack of standards for how metadata is produced
• Publications consist of multiple media types
Access:
• Access to be provided within Legal Deposit Regulations
• Strong hardware and software dependencies for access
• Hardware collection and maintenance
• Re-use
3. HOW
20. www.bl.uk/projects/emerging-formats
Next steps
• Create new collections on the UK Web Archive + keep adding
to existing collections
• Further research into discovery and description and developing
a standard for catalogue records
• Providing access to offline emerging formats
• Identify new formats to consider for collection and related
challenges (social media, chatbots, XR, AI, fanfictions, etc.)
20
Collecting Approach to Complex Digital Publications
UK Legal Deposit Libraries are looking at publications whose formats are more complex than what we currently have in our collection and could have an impact on our collecting practices
These publications exist solely in digital form and could not be recreated in print without impacting the delivery of content and user experience; they are defined by their native digital format and intended access environment. These formats are complex by nature, and components often consist of more than one media type (e.g. film, audio, and text).
publications are dependent on software (proprietary and bespoke) and hardware in order to run.
In some cases, a format may standardise over time; in other cases, it will remain non-standard or disappear from current use without ever standardising.
In some cases, Emerging Formats refers to capability; ie where a type of publication has existed for some time, but without the means available to collect such content.
These publications are highly ephemeral and at risk of loss
Knowledge that can be applied to more ‘mainstream’ collection management challenges (eg academic journals with mixed-media content; changes in use of EPUB and HTML 5 formats for publishing)
Rapid technological change means complex digital publications are at risk of obsolescence
How to decide what and how we capture (in what format)
How should we collect (what format) to ensure long-term preservation
high cost involved in collecting most types of Emerging Formats
low likelihood of standardisation in most cases (Where activities can be scaled up, this will be done e.g. web harvesting tools to collect online interactive narratives)
The objective in collection and preservation is to reflect, represent, and enable research on innovation and change in UK digital publishing.
NPLD to support the role of Legal Deposit Libraries in collecting as comprehensively as possible born-digital works
Prioritisation of specific publications within a format type will be the responsibilities of curators and librarians at the legal deposit libraries
Web crawler (ACT and Webrecorder)
No comprehensive directory/Rapid technological change
Multiple versions may be collected with the agreement of the publisher (Updates are considered as new editions of a work). Mixed approach to collecting
The collection of some examples of hardware may be necessary to support collection, preservation and access
Documenting any loss of functionality or restriction to access
A mixed approach to collecting (in terms of file formats and collection methods) might be our best option