SlideShare a Scribd company logo
1 of 54
Download to read offline
Peter Brantley
Internet Archive
A.   Re: preservation

B.   National policies
Preservation for digital materials –

A. Heritage is obligatory for social continuity.
B. Maintenance of assets is business requisite.
• Myriads of ebook formats
• Digital rights management
• No national policy in place


    No one is in charge of the preservation of our
    growing cultural heritage in digital books.
Digital storage means it is easy to preserve in
multiple locations, in different formats. More
redundancy than paper could ever afford.

Storage costs trending toward insignificant
on a per-byte basis.
High density storage:
Each rack stores between
0.5 and 0.75 petabytes.
A book is a complex assembly of content –
Text, video, foreword, illustrations, photos.
Each item needs careful management.
Digital production workflow requires:

 - content management system
 - sophisticated rights / use tracking
 - skilled personnel esp. engineering
Possibility of loss should motivate preservation.
Loss can arise for oneself – or one’s partners –

 via –

 1. complex, tightly coupled systems
 2. struggle of memory vs. forgetting
From an engineering perspective, complexity
in work flow systems raises the chance of
catastrophic loss. As system efficiency
increases, interactions become “tightly-
coupled”.

Tightly coupled systems are prone to routine
accidents with unforeseen cascading effects.
- Charles Perrow
    Sociology Dept, Yale University

Ex.: Apollo 13

  “[The] accident was not the result of a chance
  malfunction in a statistical sense, but rather
  resulted from an unusual combination of
  mistakes, coupled with a somewhat deficient
  and unforgiving design.” source: Wikipedia.
Complexity of production systems and
distribution platforms growing alongside
online inventory and commerce.

Potential trigger events are ubiquitous.
“We're working quickly to recover from a major
 issue in one of our database clusters. We're
 incredibly sorry for the inconvenience.”
(2 2011):
‘Unfortunately, I have mixed up the accounts and
accidentally deleted yours. I am terribly sorry for
this grave error and hope that this mistake can be
reconciled. ’ …
‘Our teams are currently working hard to try to
restore the contents of this user's account. We are
working on a process that would allow us to easily
restore deleted accounts and we plan on rolling
this functionality out soon.’ (em. Added)
Many archives are lost by simply forgetting
where or how they were stored. Canisters of
film, shelves of books, or backup tapes.

“Those books are in a warehouse in Jersey.”
“The servers are in one of our datacenters.”
Commonly, companies get acquired or
shutter business operations; records are
abandoned.

Servers are redeployed without an audit.
Databases run with no backup routines.
Preserving without metadata is not helpful.
Metadata provides necessary context.

Need to point backwards (provenance) and
forward (to find superseded content)

Without necessary metadata infrastructure,
preservation architectures are rickety.
If you are a publisher, do you safeguard your
digital assets? Has that process been audited
against security threats? Technical accidents?
Are your workflows well understood? Have
you conducted trial asset recovery exercises?

Does your insurance company know?
Do you mind if I give them a call?
For 100s of years, libraries preserved books.
Sort of anyway (witness Alexandria).

By-product of the inefficiency of distribution:
Lots of libraries wound up with same books.

Lots of copies keep stuff safe.
Books were once bespoke – monks with pens.

 Over around 500 years, books became
 industrialized – mass produced. Although
 ebooks are the ultimate industrial product,
 we come full circle.

 Digital books stand on threshold of
 tremendous mutability.
It is easy to store objects now, arguably, but …
   It is also easy to save objects that cannot be
   easily recovered.

  E.g., PDF can be a stew of non standard
  formats and arbitrary data. EPUB with DRM
  locks data with risk that key will be lost.
We may be able to preserve digital editions.
Can we preserve the book of the future?

• Scripted and Interactive
• Networked and Distributed
• Personalized and Mobile
In U.S., beyond public and academic library
collections, the Library of Congress plays a
unique role:

§407. The Required copies or phonorecords shall
be deposited in the Copyright Office for the use
or disposition of the Library of Congress.
Current U.S. code privileges print as best
edition for preservation. Slowly being
changed.

(Congress approved e-journal demand
deposit for LoC in the 2010 legislative
session).
The Copyright Office historically held that
transmission to the LoC of deposited digital
books would be an infringing faithful copy.
despite –

§ 704. In the case of published works, all copies,
phonorecords, and identifying material
deposited are available to the Library of
Congress for its collections ….
I led an initiative to confront this issue
in NYC in the summer of 2008.

Convened meeting via Digital Library
Federation with LoC, Mellon, Portico, NISO,
IDPF, BISG, and publisher consultants and
representatives to discuss digital archives
that would also serve as escrow.
The group decided that we should attempt a
trial project with a small set of publishers who
would deposit sample books into Portico’s
repository; the pilot would inform against
business, legal, and policy issues.

With that in hand, LoC would be in stronger
position to solicit rule changes by Congress.
Portico is a not for profit jointly representing
publishers and libraries preserving a growing
volume of digital content with high reliability in a
rights respecting, secure archive. Serving as an
escrow, access to the archive by its members is
triggered only under carefully-defined and
contractually-specified circumstances.

Initial funding came from Mellon and LoC.
The AAP evidenced support.

Ed McCoyd, Director of Digital Policy:

 “Thank you for speaking at the AAP Digital Issues
 Working Group meeting on [11 June 2008],
 regarding your interest in bringing parties together
 to develop digital book archives for preservation.
“Your points about preserving cultural patrimony,
and assuring permanent access by libraries and other
digital content customers as well as by the publishers
themselves, certainly resonated with the group.

“I look forward to talking further about your initiative
in the coming months, and will keep the publishers
apprised of the additional details as they develop.”
It fizzled due to the inability of the LoC to
determine whether it had the capacity to
pursue a deposit initiative, and if so, what
department of the Library should proceed.
Portico decided to continue the pursuit of
ebooks deposits on behalf of its members.

Members are primarily research libraries –
‘cuz, Who else has a mandate to care about
preservation?

With inevitable focus on academic ebook
publishers, e.g. Elsevier, starting in 2008.
Portico has been unable to penetrate trade
publishing sector through private initiative.

No national policy requiring digital deposit.
Google Book Search (GBS) has emerged in
the absence of international rulemaking as
the default archive for some institutions.

GBS does not do preservation quality
imaging, and there is requirement for
comprehensive publisher participation.

This is NOT a good solution.
Europeana digital library seeking to build a
collection preserving Europe’s vast cultural
heritage.

In September 2010,Ghent Univ. Library
became first in Europe to deposit public
domain books scanned by Google into
Europeana.
"Work begins this week to add over 5 million digital
objects, ranging from Spanish civil war photographs
and handwritten letters from philosopher Immanuel
Kant, to Europeana from 19 of Europe’s leading
research and university libraries.

“… It will also add extensive collections from Google
Books, theses, dissertations and open-access journal
articles to the 15 million items amassed in Europeana
to date. Providers include some of Europe’s most
prestigious universities and research institutes … ."
                                             (2 2011)
GBS partners’ collection management of
older public domain books is a pale shadow of
the comprehensive international policy
framework needed to mandate preservation
of our cultural heritage in digital books.

Preservation must mandate participation for
libraries and publishers in a legal framework.
Widespread recognition of need for new
copyright regime supporting digital use.

Internationally interoperable network of
rights assertions in rights registries capable of
automated status query in geographic and
national domains is a very useful support.
Mandating the deposit of digital works for
preservation purposes to ensure adequate
representation of copyright assertions for a
content manifest would be one approach.

Arguably avoids Berne / TRIPS.
Most publishers are demonstrably willing to
engage in discussion of efforts supporting the
development of persistent heritage archives.

Requires hard political work conjoining some
subset of copyright policies, the imperatives
of cultural heritage, technical architecture.
If nothing is done, and we cannot solve this
problem for the simple digital books of the
20th Century, what are we going to do with
the books of the 21st Century?
peter brantley

            director, bookserver project
                  internet archive
                  san francisco, ca

(twitter)        @naypinya        (slideshare)

                peter at archive.org

More Related Content

What's hot

Advantages and disadvantages of digital library
Advantages and disadvantages of digital libraryAdvantages and disadvantages of digital library
Advantages and disadvantages of digital library
yhen06
 
Letthe trumpet sound 2003 version
Letthe trumpet sound 2003 versionLetthe trumpet sound 2003 version
Letthe trumpet sound 2003 version
guest3e969a
 
User Focused Digital Library: A Practical Guide
User Focused Digital Library: A Practical GuideUser Focused Digital Library: A Practical Guide
User Focused Digital Library: A Practical Guide
Sophia Guevara
 
Digital libraries
Digital librariesDigital libraries
Digital libraries
ssmith7027
 

What's hot (20)

Digital Library
Digital LibraryDigital Library
Digital Library
 
Drc Chapter 3
Drc Chapter 3Drc Chapter 3
Drc Chapter 3
 
Advantages and disadvantages of digital library
Advantages and disadvantages of digital libraryAdvantages and disadvantages of digital library
Advantages and disadvantages of digital library
 
Letthe trumpet sound 2003 version
Letthe trumpet sound 2003 versionLetthe trumpet sound 2003 version
Letthe trumpet sound 2003 version
 
12997 article text-48831-1-10-20160701
12997 article text-48831-1-10-2016070112997 article text-48831-1-10-20160701
12997 article text-48831-1-10-20160701
 
Building the 'digital' library - Open Repositories 2014
Building the 'digital' library - Open Repositories 2014Building the 'digital' library - Open Repositories 2014
Building the 'digital' library - Open Repositories 2014
 
Library and It's Uses
Library and It's UsesLibrary and It's Uses
Library and It's Uses
 
Gujranwala medical collge digital library access
Gujranwala medical collge digital library accessGujranwala medical collge digital library access
Gujranwala medical collge digital library access
 
Digitization of Library Resources in Academic Libraries: Challenges and Impli...
Digitization of Library Resources in Academic Libraries: Challenges and Impli...Digitization of Library Resources in Academic Libraries: Challenges and Impli...
Digitization of Library Resources in Academic Libraries: Challenges and Impli...
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Week 8 Presentation Angela Wade
Week 8 Presentation Angela WadeWeek 8 Presentation Angela Wade
Week 8 Presentation Angela Wade
 
National Digital Library
National Digital LibraryNational Digital Library
National Digital Library
 
Basic Concepts of Digital Library
Basic Concepts of Digital LibraryBasic Concepts of Digital Library
Basic Concepts of Digital Library
 
User Focused Digital Library: A Practical Guide
User Focused Digital Library: A Practical GuideUser Focused Digital Library: A Practical Guide
User Focused Digital Library: A Practical Guide
 
Digital libraries
Digital librariesDigital libraries
Digital libraries
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Digital Library UNIT-3
Digital Library UNIT-3Digital Library UNIT-3
Digital Library UNIT-3
 
Connected Learning and FryskLab at Nationaal Bibliotheekcongres 2014
Connected Learning and FryskLab at Nationaal Bibliotheekcongres 2014Connected Learning and FryskLab at Nationaal Bibliotheekcongres 2014
Connected Learning and FryskLab at Nationaal Bibliotheekcongres 2014
 
Presentation #SocialMediaCaster @ #InternetLibrarian 2012
Presentation #SocialMediaCaster @ #InternetLibrarian 2012Presentation #SocialMediaCaster @ #InternetLibrarian 2012
Presentation #SocialMediaCaster @ #InternetLibrarian 2012
 
Bookless Libraries
Bookless LibrariesBookless Libraries
Bookless Libraries
 

Viewers also liked

Relatiedag260607
Relatiedag260607Relatiedag260607
Relatiedag260607
NSO
 
HistòRia Duna Pastanaga
HistòRia Duna PastanagaHistòRia Duna Pastanaga
HistòRia Duna Pastanaga
virgi
 
Happy Birthday Graham
Happy Birthday GrahamHappy Birthday Graham
Happy Birthday Graham
stuian
 

Viewers also liked (20)

OBA @ EC Google Book Hearing
OBA @ EC Google Book HearingOBA @ EC Google Book Hearing
OBA @ EC Google Book Hearing
 
Reading the Next Book
Reading the Next BookReading the Next Book
Reading the Next Book
 
Re Experiencing The Book
Re Experiencing The BookRe Experiencing The Book
Re Experiencing The Book
 
Abelles Micos I Dofinsdsffff
Abelles Micos I DofinsdsffffAbelles Micos I Dofinsdsffff
Abelles Micos I Dofinsdsffff
 
Relatiedag260607
Relatiedag260607Relatiedag260607
Relatiedag260607
 
What Rupert would tell the DLF
What Rupert would tell the DLFWhat Rupert would tell the DLF
What Rupert would tell the DLF
 
BookServer: A Web of Books
BookServer: A Web of BooksBookServer: A Web of Books
BookServer: A Web of Books
 
Extending the Digital Self
Extending the Digital SelfExtending the Digital Self
Extending the Digital Self
 
The Inline Interface
The Inline InterfaceThe Inline Interface
The Inline Interface
 
Biblioteca l'illa dels llibres, emocions
Biblioteca l'illa dels llibres, emocionsBiblioteca l'illa dels llibres, emocions
Biblioteca l'illa dels llibres, emocions
 
Railties
RailtiesRailties
Railties
 
Digital book markets: Building markets for access
Digital book markets: Building markets for accessDigital book markets: Building markets for access
Digital book markets: Building markets for access
 
Forever Together
Forever TogetherForever Together
Forever Together
 
HistòRia Duna Pastanaga
HistòRia Duna PastanagaHistòRia Duna Pastanaga
HistòRia Duna Pastanaga
 
Reflections on the Google Book Search Settlement by Pamela Samuelson
Reflections on the Google Book Search Settlement by Pamela SamuelsonReflections on the Google Book Search Settlement by Pamela Samuelson
Reflections on the Google Book Search Settlement by Pamela Samuelson
 
Happy Birthday Graham
Happy Birthday GrahamHappy Birthday Graham
Happy Birthday Graham
 
Literature as a (web) Service
Literature as a (web) ServiceLiterature as a (web) Service
Literature as a (web) Service
 
Cloud Libraries
Cloud LibrariesCloud Libraries
Cloud Libraries
 
Digital Books and Flying Cars: The Library edition
Digital Books and Flying Cars: The Library editionDigital Books and Flying Cars: The Library edition
Digital Books and Flying Cars: The Library edition
 
Mexico
MexicoMexico
Mexico
 

Similar to Save This Book

Digital preservation and curation of information.presentation
Digital preservation and curation of information.presentationDigital preservation and curation of information.presentation
Digital preservation and curation of information.presentation
Prince Sterling
 
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global MonitorArchiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
EDINA, University of Edinburgh
 

Similar to Save This Book (20)

Digitallibrary
DigitallibraryDigitallibrary
Digitallibrary
 
StacyKfoury.pptx
StacyKfoury.pptxStacyKfoury.pptx
StacyKfoury.pptx
 
Why Open Access to Bibliographic Metadata Matters
Why Open Access to Bibliographic Metadata MattersWhy Open Access to Bibliographic Metadata Matters
Why Open Access to Bibliographic Metadata Matters
 
An Introduction to digital preservation at the Library of Congress
An Introduction to digital preservation at the Library of CongressAn Introduction to digital preservation at the Library of Congress
An Introduction to digital preservation at the Library of Congress
 
Digital preservation and curation of information.presentation
Digital preservation and curation of information.presentationDigital preservation and curation of information.presentation
Digital preservation and curation of information.presentation
 
New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0
 
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global MonitorArchiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
 
Who is looking after your e-journals?
Who is looking after your e-journals?Who is looking after your e-journals?
Who is looking after your e-journals?
 
Digital library-overview
Digital library-overviewDigital library-overview
Digital library-overview
 
Need of Digital Libraries in Education.
Need of Digital Libraries in Education.Need of Digital Libraries in Education.
Need of Digital Libraries in Education.
 
08 chapter 03
08 chapter 0308 chapter 03
08 chapter 03
 
Preservation for all: the future of government documents and the “digital FDL...
Preservation for all: the future of government documents and the “digital FDL...Preservation for all: the future of government documents and the “digital FDL...
Preservation for all: the future of government documents and the “digital FDL...
 
Digital Libraries, Digital Repositories, Digital Copyright: Overview, Challen...
Digital Libraries, Digital Repositories, Digital Copyright: Overview, Challen...Digital Libraries, Digital Repositories, Digital Copyright: Overview, Challen...
Digital Libraries, Digital Repositories, Digital Copyright: Overview, Challen...
 
An overview of digitization project in university libraries in nigeria a pers...
An overview of digitization project in university libraries in nigeria a pers...An overview of digitization project in university libraries in nigeria a pers...
An overview of digitization project in university libraries in nigeria a pers...
 
Digital library
Digital libraryDigital library
Digital library
 
Evolution and Development of E Library An Historical Study
Evolution and Development of E Library An Historical StudyEvolution and Development of E Library An Historical Study
Evolution and Development of E Library An Historical Study
 
Drc Chapter 1 And 2
Drc Chapter 1 And 2Drc Chapter 1 And 2
Drc Chapter 1 And 2
 
Drc Chapter 1 And 2
Drc Chapter 1 And 2Drc Chapter 1 And 2
Drc Chapter 1 And 2
 
Digital library
Digital libraryDigital library
Digital library
 
Future of Academic Libraries
Future of Academic LibrariesFuture of Academic Libraries
Future of Academic Libraries
 

More from Peter Brantley

What ebooks mean for Libraries
What ebooks mean for LibrariesWhat ebooks mean for Libraries
What ebooks mean for Libraries
Peter Brantley
 
OPDS and the Future of Digital Books
OPDS and the Future of Digital BooksOPDS and the Future of Digital Books
OPDS and the Future of Digital Books
Peter Brantley
 

More from Peter Brantley (16)

Publishing and the Future of STM
Publishing and the Future of STMPublishing and the Future of STM
Publishing and the Future of STM
 
Digital Books and Flying Cars: Disruption in Publishing
Digital Books and Flying Cars: Disruption in PublishingDigital Books and Flying Cars: Disruption in Publishing
Digital Books and Flying Cars: Disruption in Publishing
 
Breaking the catalog
Breaking the catalogBreaking the catalog
Breaking the catalog
 
What if the future (of libraries)
What if the future (of libraries)What if the future (of libraries)
What if the future (of libraries)
 
What ebooks mean for Libraries
What ebooks mean for LibrariesWhat ebooks mean for Libraries
What ebooks mean for Libraries
 
Reading on a Holodeck
Reading on a HolodeckReading on a Holodeck
Reading on a Holodeck
 
OPDS and the Future of Digital Books
OPDS and the Future of Digital BooksOPDS and the Future of Digital Books
OPDS and the Future of Digital Books
 
Organizational Fields and the Book Industry
Organizational Fields and the Book IndustryOrganizational Fields and the Book Industry
Organizational Fields and the Book Industry
 
Digital book markets v7-日本語版
Digital book markets v7-日本語版Digital book markets v7-日本語版
Digital book markets v7-日本語版
 
Finding Vorpal Blades: Questing for Content
Finding Vorpal Blades: Questing for ContentFinding Vorpal Blades: Questing for Content
Finding Vorpal Blades: Questing for Content
 
GBS Amended Settlement: A status update
GBS Amended Settlement: A status updateGBS Amended Settlement: A status update
GBS Amended Settlement: A status update
 
Samuelson: GBS as Copyright Reform
Samuelson: GBS as Copyright ReformSamuelson: GBS as Copyright Reform
Samuelson: GBS as Copyright Reform
 
Books and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down RowsBooks and Webs: Pulling the Down Rows
Books and Webs: Pulling the Down Rows
 
Web Of Books
Web Of BooksWeb Of Books
Web Of Books
 
Redefining Libraries
Redefining LibrariesRedefining Libraries
Redefining Libraries
 
NASA' Use of Immersive Environments
NASA' Use of Immersive EnvironmentsNASA' Use of Immersive Environments
NASA' Use of Immersive Environments
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Save This Book

  • 2. A. Re: preservation B. National policies
  • 3.
  • 4. Preservation for digital materials – A. Heritage is obligatory for social continuity. B. Maintenance of assets is business requisite.
  • 5.
  • 6. • Myriads of ebook formats • Digital rights management • No national policy in place No one is in charge of the preservation of our growing cultural heritage in digital books.
  • 7. Digital storage means it is easy to preserve in multiple locations, in different formats. More redundancy than paper could ever afford. Storage costs trending toward insignificant on a per-byte basis.
  • 8. High density storage: Each rack stores between 0.5 and 0.75 petabytes.
  • 9. A book is a complex assembly of content – Text, video, foreword, illustrations, photos. Each item needs careful management.
  • 10. Digital production workflow requires: - content management system - sophisticated rights / use tracking - skilled personnel esp. engineering
  • 11. Possibility of loss should motivate preservation. Loss can arise for oneself – or one’s partners – via – 1. complex, tightly coupled systems 2. struggle of memory vs. forgetting
  • 12.
  • 13. From an engineering perspective, complexity in work flow systems raises the chance of catastrophic loss. As system efficiency increases, interactions become “tightly- coupled”. Tightly coupled systems are prone to routine accidents with unforeseen cascading effects.
  • 14. - Charles Perrow Sociology Dept, Yale University Ex.: Apollo 13 “[The] accident was not the result of a chance malfunction in a statistical sense, but rather resulted from an unusual combination of mistakes, coupled with a somewhat deficient and unforgiving design.” source: Wikipedia.
  • 15. Complexity of production systems and distribution platforms growing alongside online inventory and commerce. Potential trigger events are ubiquitous.
  • 16. “We're working quickly to recover from a major issue in one of our database clusters. We're incredibly sorry for the inconvenience.”
  • 17. (2 2011): ‘Unfortunately, I have mixed up the accounts and accidentally deleted yours. I am terribly sorry for this grave error and hope that this mistake can be reconciled. ’ … ‘Our teams are currently working hard to try to restore the contents of this user's account. We are working on a process that would allow us to easily restore deleted accounts and we plan on rolling this functionality out soon.’ (em. Added)
  • 18.
  • 19. Many archives are lost by simply forgetting where or how they were stored. Canisters of film, shelves of books, or backup tapes. “Those books are in a warehouse in Jersey.” “The servers are in one of our datacenters.”
  • 20. Commonly, companies get acquired or shutter business operations; records are abandoned. Servers are redeployed without an audit. Databases run with no backup routines.
  • 21. Preserving without metadata is not helpful. Metadata provides necessary context. Need to point backwards (provenance) and forward (to find superseded content) Without necessary metadata infrastructure, preservation architectures are rickety.
  • 22. If you are a publisher, do you safeguard your digital assets? Has that process been audited against security threats? Technical accidents? Are your workflows well understood? Have you conducted trial asset recovery exercises? Does your insurance company know? Do you mind if I give them a call?
  • 23.
  • 24. For 100s of years, libraries preserved books. Sort of anyway (witness Alexandria). By-product of the inefficiency of distribution: Lots of libraries wound up with same books. Lots of copies keep stuff safe.
  • 25. Books were once bespoke – monks with pens. Over around 500 years, books became industrialized – mass produced. Although ebooks are the ultimate industrial product, we come full circle. Digital books stand on threshold of tremendous mutability.
  • 26. It is easy to store objects now, arguably, but … It is also easy to save objects that cannot be easily recovered. E.g., PDF can be a stew of non standard formats and arbitrary data. EPUB with DRM locks data with risk that key will be lost.
  • 27. We may be able to preserve digital editions. Can we preserve the book of the future? • Scripted and Interactive • Networked and Distributed • Personalized and Mobile
  • 28.
  • 29. In U.S., beyond public and academic library collections, the Library of Congress plays a unique role: §407. The Required copies or phonorecords shall be deposited in the Copyright Office for the use or disposition of the Library of Congress.
  • 30. Current U.S. code privileges print as best edition for preservation. Slowly being changed. (Congress approved e-journal demand deposit for LoC in the 2010 legislative session).
  • 31. The Copyright Office historically held that transmission to the LoC of deposited digital books would be an infringing faithful copy. despite – § 704. In the case of published works, all copies, phonorecords, and identifying material deposited are available to the Library of Congress for its collections ….
  • 32.
  • 33. I led an initiative to confront this issue in NYC in the summer of 2008. Convened meeting via Digital Library Federation with LoC, Mellon, Portico, NISO, IDPF, BISG, and publisher consultants and representatives to discuss digital archives that would also serve as escrow.
  • 34. The group decided that we should attempt a trial project with a small set of publishers who would deposit sample books into Portico’s repository; the pilot would inform against business, legal, and policy issues. With that in hand, LoC would be in stronger position to solicit rule changes by Congress.
  • 35. Portico is a not for profit jointly representing publishers and libraries preserving a growing volume of digital content with high reliability in a rights respecting, secure archive. Serving as an escrow, access to the archive by its members is triggered only under carefully-defined and contractually-specified circumstances. Initial funding came from Mellon and LoC.
  • 36.
  • 37. The AAP evidenced support. Ed McCoyd, Director of Digital Policy: “Thank you for speaking at the AAP Digital Issues Working Group meeting on [11 June 2008], regarding your interest in bringing parties together to develop digital book archives for preservation.
  • 38. “Your points about preserving cultural patrimony, and assuring permanent access by libraries and other digital content customers as well as by the publishers themselves, certainly resonated with the group. “I look forward to talking further about your initiative in the coming months, and will keep the publishers apprised of the additional details as they develop.”
  • 39. It fizzled due to the inability of the LoC to determine whether it had the capacity to pursue a deposit initiative, and if so, what department of the Library should proceed.
  • 40.
  • 41. Portico decided to continue the pursuit of ebooks deposits on behalf of its members. Members are primarily research libraries – ‘cuz, Who else has a mandate to care about preservation? With inevitable focus on academic ebook publishers, e.g. Elsevier, starting in 2008.
  • 42. Portico has been unable to penetrate trade publishing sector through private initiative. No national policy requiring digital deposit.
  • 43.
  • 44. Google Book Search (GBS) has emerged in the absence of international rulemaking as the default archive for some institutions. GBS does not do preservation quality imaging, and there is requirement for comprehensive publisher participation. This is NOT a good solution.
  • 45. Europeana digital library seeking to build a collection preserving Europe’s vast cultural heritage. In September 2010,Ghent Univ. Library became first in Europe to deposit public domain books scanned by Google into Europeana.
  • 46. "Work begins this week to add over 5 million digital objects, ranging from Spanish civil war photographs and handwritten letters from philosopher Immanuel Kant, to Europeana from 19 of Europe’s leading research and university libraries. “… It will also add extensive collections from Google Books, theses, dissertations and open-access journal articles to the 15 million items amassed in Europeana to date. Providers include some of Europe’s most prestigious universities and research institutes … ." (2 2011)
  • 47. GBS partners’ collection management of older public domain books is a pale shadow of the comprehensive international policy framework needed to mandate preservation of our cultural heritage in digital books. Preservation must mandate participation for libraries and publishers in a legal framework.
  • 48. Widespread recognition of need for new copyright regime supporting digital use. Internationally interoperable network of rights assertions in rights registries capable of automated status query in geographic and national domains is a very useful support.
  • 49.
  • 50. Mandating the deposit of digital works for preservation purposes to ensure adequate representation of copyright assertions for a content manifest would be one approach. Arguably avoids Berne / TRIPS.
  • 51. Most publishers are demonstrably willing to engage in discussion of efforts supporting the development of persistent heritage archives. Requires hard political work conjoining some subset of copyright policies, the imperatives of cultural heritage, technical architecture.
  • 52. If nothing is done, and we cannot solve this problem for the simple digital books of the 20th Century, what are we going to do with the books of the 21st Century?
  • 53.
  • 54. peter brantley director, bookserver project internet archive san francisco, ca (twitter) @naypinya (slideshare) peter at archive.org