SlideShare a Scribd company logo
Digital preservation and the
web: challenges for libraries
Corey Davis, Council of Prairie and Pacific University Libraries (COPPUL)
Digital Preservation Coordinator
The big challenge for all of us
“Much of our global
cultural heritage, and
our own individual and
social imprint, is at
serious risk of
disappearing.”
Richard S. Whitt, Corporate Director for
Strategic Initiatives at Google
Keepers “…represents
only about 20% of the
‘continuing resources’
and ‘integrated
resources’ having an
ISSN.”
http://library.ifla.org/121/1/
098-burnhill-en.pdf
Traditional library collections…
…and the early web
The web now…
1. With AJAX and HTML5, the web is transitioning from a document-
centric information space, to an applications-based information
space
2. Content is tailored to people, locations, and devices. There is often
no “canonical version” of a webpage anymore
Amnesiac civilization
• “HTML5, in effect, changes the language
of the Web from HTML to Javascript,
from a static document description
language to a programming language.”
• “I've been warning for some time that
one of the fundamental problems facing
digital preservation is the evolution of
content from static to dynamic.”
• http://blog.dshr.org/2011/08/moonalice-
plays-palo-alto.html
Current preservation services…
• Tend to focus on discrete objects or packages (PDFs, images, XML)
• And the creation of Archival Information Packages (AIPs)
• “I have always thought of the ‘autonomous AIP’ zipped up and held on a
storage device as an residue of paper-thinking.” Jon Tilbury, Preservica (Pasig-
discuss listserv)
Some examples of the challenges
of preserving dynamic web
content
The short tail and long tail
1. CNN http://cnn.com
2. Colonial Despatches https://bcgenesis.uvic.ca/
The short tail: CNN
• “CNN.com has been unarchivable since
2016-11-01T15:01:31”
• http://ws-dl.blogspot.ca/2017/01/2017-01-
20-cnncom-has-been-unarchivable.html
January 20th, 2017, Inauguration Day
• “In short, the archival failure is caused
by changes CNN made to their CDN
(content delivery network); these
changes are reflected in the JavaScript
used to render the homepage.”
• John Berlin http://ws-
dl.blogspot.ca/2017/01/2017-01-20-
cnncom-has-been-unarchivable.html
The long tail:
Colonial Despatches
• “This digital archive contains the
original correspondence
between the British Colonial
Office and the colonies of
Vancouver Island and British
Columbia.”
• https://bcgenesis.uvic.ca/
How can we address these
challenges together?
Working with the long-tail
• Major project at University of Victoria to explore the archiving of
dynamic, interactive websites in the digital humanities
• Working with information producers and developers to create
preservation-friendly applications
Selecting technologies for long-term survival
• “We have settled on building web applications which have
virtually no server-side requirements beyond response to
HTTP requests, but instead are based on client-side HTML5,
JavaScript and Cascading Style Sheets.”
• “Using these core standards, we are building completely
‘static’ websites which can actually function locally in any
current web browser, with no server at all, but which still
preserve virtually all of the appearance and functionality of
the original web applications they replace ”
• Martin Holmes, Programmer/Consultant, University of Victoria
Humanities Computing and Media Centre
Best practices for content creators: Distill.pub
• “A Distill article (at least
in its ideal, aspirational
form) isn’t just a paper.
It’s an interactive
medium that lets users
– ‘readers’ is no longer
sufficient – work
directly with machine
learning models.”
• http://distill.pub/about/
Distill.pub
Interactivity and preservation
• “Distill does an excellent job of publishing articles that use
interactivity to provide high-quality explanations … without sacrificing
preservability.”
• David Rosenthal http://blog.dshr.org/2017/05/distill-is-this-what-journals-
should.html
Capturing the dynamic
web: Webrecorder.io
• Developed by Rhizome for
preservation of interactive
online art
• Focus on dynamic web content
Academic publications and CLOCKSS
• Digital preservation
collaboration
between research
libraries and
publishers
• Working to develop
functionality to
harvest dynamic
content from
publishers’ websites
To sum up…
Significant issues
• Costs
• Dynamic content
• Presents significant technical and policy issues for preservation
• Scale
• A technical and financial issue
• Incentives
• Public policy could address some of this
• Proprietary information and DRM
• Copyright legislation for preservation not likely forthcoming
Collaboration is key
• Libraries need to work together
• Libraries and publishers and other content creators need to work
together
• Publishers can practice “preservation in place”
Thanks
• corey@coppul.ca
• @coreyleedavis

More Related Content

What's hot

Qatar Digital Library Project Workshop
Qatar Digital Library Project WorkshopQatar Digital Library Project Workshop
Qatar Digital Library Project Workshop
Asad Nafees
 
Digital fabrication as a library integrated service
Digital fabrication as a library integrated serviceDigital fabrication as a library integrated service
Digital fabrication as a library integrated service
Matt Bernhardt
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...
Derek Keats
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
ac2182
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
Enno Meijers
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
Trevor Owens
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
Enno Meijers
 
Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?
Mia
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA project
PRELIDA Project
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
Jon Voss
 
Ingrid Dillo - Digital humanities challenges and the Research Data Alliance
Ingrid Dillo - Digital humanities challenges and the Research Data AllianceIngrid Dillo - Digital humanities challenges and the Research Data Alliance
Ingrid Dillo - Digital humanities challenges and the Research Data Alliance
dri_ireland
 
ICPC SOS workshop
ICPC SOS workshop ICPC SOS workshop
ICPC SOS workshop
Bethany Davis
 
Cross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projectsCross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projects
Mia
 
Julian D. Richards - Open Data in European Archaeology
Julian D. Richards -  Open Data in European ArchaeologyJulian D. Richards -  Open Data in European Archaeology
Julian D. Richards - Open Data in European Archaeology
OpenPompei
 
The role of a Socio-informatrician
The role of a Socio-informatricianThe role of a Socio-informatrician
The role of a Socio-informatrician
Greg D'Arcy
 
Input friendly intranets
Input friendly intranetsInput friendly intranets
Input friendly intranets
Hazel Hall
 
Global Networked Digital Environment: How Libraries Shape the Future
Global Networked Digital Environment: How Libraries Shape the FutureGlobal Networked Digital Environment: How Libraries Shape the Future
Global Networked Digital Environment: How Libraries Shape the Future
Ingrid Parent
 
Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13 Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13
PaolaMarchionni
 
Bl labs roadshow aab_open_university.2016
Bl labs roadshow aab_open_university.2016Bl labs roadshow aab_open_university.2016
Bl labs roadshow aab_open_university.2016
Aquiles Alencar Brayner
 
BL Labs Roadshow 2016 - Digital Research Team
BL Labs Roadshow 2016 - Digital Research TeamBL Labs Roadshow 2016 - Digital Research Team
BL Labs Roadshow 2016 - Digital Research Team
labsbl
 

What's hot (20)

Qatar Digital Library Project Workshop
Qatar Digital Library Project WorkshopQatar Digital Library Project Workshop
Qatar Digital Library Project Workshop
 
Digital fabrication as a library integrated service
Digital fabrication as a library integrated serviceDigital fabrication as a library integrated service
Digital fabrication as a library integrated service
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
 
Next Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital PlatformNext Steps for IMLS's National Digital Platform
Next Steps for IMLS's National Digital Platform
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?Connected heritage: How should Cultural Institutions Open and Connect Data?
Connected heritage: How should Cultural Institutions Open and Connect Data?
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA project
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
Ingrid Dillo - Digital humanities challenges and the Research Data Alliance
Ingrid Dillo - Digital humanities challenges and the Research Data AllianceIngrid Dillo - Digital humanities challenges and the Research Data Alliance
Ingrid Dillo - Digital humanities challenges and the Research Data Alliance
 
ICPC SOS workshop
ICPC SOS workshop ICPC SOS workshop
ICPC SOS workshop
 
Cross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projectsCross-sector collaboration for digital museum and library projects
Cross-sector collaboration for digital museum and library projects
 
Julian D. Richards - Open Data in European Archaeology
Julian D. Richards -  Open Data in European ArchaeologyJulian D. Richards -  Open Data in European Archaeology
Julian D. Richards - Open Data in European Archaeology
 
The role of a Socio-informatrician
The role of a Socio-informatricianThe role of a Socio-informatrician
The role of a Socio-informatrician
 
Input friendly intranets
Input friendly intranetsInput friendly intranets
Input friendly intranets
 
Global Networked Digital Environment: How Libraries Shape the Future
Global Networked Digital Environment: How Libraries Shape the FutureGlobal Networked Digital Environment: How Libraries Shape the Future
Global Networked Digital Environment: How Libraries Shape the Future
 
Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13 Endings and new beginnings: update on the Jisc Content programme 2011-13
Endings and new beginnings: update on the Jisc Content programme 2011-13
 
Bl labs roadshow aab_open_university.2016
Bl labs roadshow aab_open_university.2016Bl labs roadshow aab_open_university.2016
Bl labs roadshow aab_open_university.2016
 
BL Labs Roadshow 2016 - Digital Research Team
BL Labs Roadshow 2016 - Digital Research TeamBL Labs Roadshow 2016 - Digital Research Team
BL Labs Roadshow 2016 - Digital Research Team
 

Similar to Davis Digital Preservation and the Web: Challenges for Libraries

Website designing company_in_delhi_digitization practices
Website designing company_in_delhi_digitization practicesWebsite designing company_in_delhi_digitization practices
Website designing company_in_delhi_digitization practices
Css Founder
 
Save This Book
Save This BookSave This Book
Save This Book
Peter Brantley
 
intro to library 2.0
intro to library 2.0intro to library 2.0
intro to library 2.0
Lifelong Learning
 
Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...
Olaf Janssen
 
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
nullhandle
 
Doing DH in Theological Libraries
Doing DH in Theological LibrariesDoing DH in Theological Libraries
Doing DH in Theological Libraries
Clifford Anderson
 
Library 2[1].0 Kimms
Library 2[1].0 KimmsLibrary 2[1].0 Kimms
Library 2[1].0 Kimms
guestfa5009
 
Anchorage public focus group web version
Anchorage public focus group   web versionAnchorage public focus group   web version
Anchorage public focus group web versionCarson Block
 
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
The Frick Collection
 
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators ViewLinked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Richard Urban
 
Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?
Don Boozer
 
12997 article text-48831-1-10-20160701
12997 article text-48831-1-10-2016070112997 article text-48831-1-10-20160701
12997 article text-48831-1-10-20160701
Ankit Dubey
 
Collection management in a digital age ola2011 revised
Collection management in a digital age ola2011 revisedCollection management in a digital age ola2011 revised
Collection management in a digital age ola2011 revised
Tony Horava
 
Collection management in a digital age ola2011
Collection management in a digital age ola2011Collection management in a digital age ola2011
Collection management in a digital age ola2011Tony Horava
 
Web 2.0, library 2.0, librarian 2.0, innovative services for sustainable car...
Web 2.0, library 2.0, librarian 2.0,  innovative services for sustainable car...Web 2.0, library 2.0, librarian 2.0,  innovative services for sustainable car...
Web 2.0, library 2.0, librarian 2.0, innovative services for sustainable car...
Cheryl Peltier-Davis
 
Slideshare1 phpapp01
Slideshare1  phpapp01Slideshare1  phpapp01
Slideshare1 phpapp01
Valentina Rovacchi
 
The Dynamic Web
The Dynamic WebThe Dynamic Web
The Dynamic Web
Dave Wallace
 
Dynamic Web
Dynamic WebDynamic Web
Dynamic Web
Dave Wallace
 
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS ProgramLots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
nullhandle
 
201399627 kovacs-collection-cyberspace
201399627 kovacs-collection-cyberspace201399627 kovacs-collection-cyberspace
201399627 kovacs-collection-cyberspace
homeworkping4
 

Similar to Davis Digital Preservation and the Web: Challenges for Libraries (20)

Website designing company_in_delhi_digitization practices
Website designing company_in_delhi_digitization practicesWebsite designing company_in_delhi_digitization practices
Website designing company_in_delhi_digitization practices
 
Save This Book
Save This BookSave This Book
Save This Book
 
intro to library 2.0
intro to library 2.0intro to library 2.0
intro to library 2.0
 
Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...Introduction to digital libraries - definitions, examples, concepts and trend...
Introduction to digital libraries - definitions, examples, concepts and trend...
 
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
Boiling the Ocean, Together: Web Archive Collection Development in a Global C...
 
Doing DH in Theological Libraries
Doing DH in Theological LibrariesDoing DH in Theological Libraries
Doing DH in Theological Libraries
 
Library 2[1].0 Kimms
Library 2[1].0 KimmsLibrary 2[1].0 Kimms
Library 2[1].0 Kimms
 
Anchorage public focus group web version
Anchorage public focus group   web versionAnchorage public focus group   web version
Anchorage public focus group web version
 
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
Making the Black Hole Gray: Implementing the Web Archiving of Specialist Art ...
 
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators ViewLinked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
 
Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?Wikipedia: Why? Who? and How?
Wikipedia: Why? Who? and How?
 
12997 article text-48831-1-10-20160701
12997 article text-48831-1-10-2016070112997 article text-48831-1-10-20160701
12997 article text-48831-1-10-20160701
 
Collection management in a digital age ola2011 revised
Collection management in a digital age ola2011 revisedCollection management in a digital age ola2011 revised
Collection management in a digital age ola2011 revised
 
Collection management in a digital age ola2011
Collection management in a digital age ola2011Collection management in a digital age ola2011
Collection management in a digital age ola2011
 
Web 2.0, library 2.0, librarian 2.0, innovative services for sustainable car...
Web 2.0, library 2.0, librarian 2.0,  innovative services for sustainable car...Web 2.0, library 2.0, librarian 2.0,  innovative services for sustainable car...
Web 2.0, library 2.0, librarian 2.0, innovative services for sustainable car...
 
Slideshare1 phpapp01
Slideshare1  phpapp01Slideshare1  phpapp01
Slideshare1 phpapp01
 
The Dynamic Web
The Dynamic WebThe Dynamic Web
The Dynamic Web
 
Dynamic Web
Dynamic WebDynamic Web
Dynamic Web
 
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS ProgramLots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
 
201399627 kovacs-collection-cyberspace
201399627 kovacs-collection-cyberspace201399627 kovacs-collection-cyberspace
201399627 kovacs-collection-cyberspace
 

More from National Information Standards Organization (NISO)

Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
National Information Standards Organization (NISO)
 
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
National Information Standards Organization (NISO)
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
National Information Standards Organization (NISO)
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
National Information Standards Organization (NISO)
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
National Information Standards Organization (NISO)
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
National Information Standards Organization (NISO)
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
National Information Standards Organization (NISO)
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
National Information Standards Organization (NISO)
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
National Information Standards Organization (NISO)
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
National Information Standards Organization (NISO)
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
National Information Standards Organization (NISO)
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
National Information Standards Organization (NISO)
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
National Information Standards Organization (NISO)
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
National Information Standards Organization (NISO)
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
National Information Standards Organization (NISO)
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
National Information Standards Organization (NISO)
 

More from National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
Mattingly "AI & Prompt Design: Limitations and Solutions with LLMs"
 
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 

Recently uploaded

TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 

Recently uploaded (20)

TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 

Davis Digital Preservation and the Web: Challenges for Libraries

  • 1. Digital preservation and the web: challenges for libraries Corey Davis, Council of Prairie and Pacific University Libraries (COPPUL) Digital Preservation Coordinator
  • 2. The big challenge for all of us
  • 3. “Much of our global cultural heritage, and our own individual and social imprint, is at serious risk of disappearing.” Richard S. Whitt, Corporate Director for Strategic Initiatives at Google
  • 4. Keepers “…represents only about 20% of the ‘continuing resources’ and ‘integrated resources’ having an ISSN.” http://library.ifla.org/121/1/ 098-burnhill-en.pdf
  • 7. The web now… 1. With AJAX and HTML5, the web is transitioning from a document- centric information space, to an applications-based information space 2. Content is tailored to people, locations, and devices. There is often no “canonical version” of a webpage anymore
  • 8.
  • 9.
  • 10. Amnesiac civilization • “HTML5, in effect, changes the language of the Web from HTML to Javascript, from a static document description language to a programming language.” • “I've been warning for some time that one of the fundamental problems facing digital preservation is the evolution of content from static to dynamic.” • http://blog.dshr.org/2011/08/moonalice- plays-palo-alto.html
  • 11. Current preservation services… • Tend to focus on discrete objects or packages (PDFs, images, XML) • And the creation of Archival Information Packages (AIPs) • “I have always thought of the ‘autonomous AIP’ zipped up and held on a storage device as an residue of paper-thinking.” Jon Tilbury, Preservica (Pasig- discuss listserv)
  • 12. Some examples of the challenges of preserving dynamic web content
  • 13. The short tail and long tail 1. CNN http://cnn.com 2. Colonial Despatches https://bcgenesis.uvic.ca/
  • 14. The short tail: CNN • “CNN.com has been unarchivable since 2016-11-01T15:01:31” • http://ws-dl.blogspot.ca/2017/01/2017-01- 20-cnncom-has-been-unarchivable.html
  • 15.
  • 16. January 20th, 2017, Inauguration Day
  • 17. • “In short, the archival failure is caused by changes CNN made to their CDN (content delivery network); these changes are reflected in the JavaScript used to render the homepage.” • John Berlin http://ws- dl.blogspot.ca/2017/01/2017-01-20- cnncom-has-been-unarchivable.html
  • 18. The long tail: Colonial Despatches • “This digital archive contains the original correspondence between the British Colonial Office and the colonies of Vancouver Island and British Columbia.” • https://bcgenesis.uvic.ca/
  • 19.
  • 20.
  • 21.
  • 22. How can we address these challenges together?
  • 23. Working with the long-tail • Major project at University of Victoria to explore the archiving of dynamic, interactive websites in the digital humanities • Working with information producers and developers to create preservation-friendly applications
  • 24. Selecting technologies for long-term survival • “We have settled on building web applications which have virtually no server-side requirements beyond response to HTTP requests, but instead are based on client-side HTML5, JavaScript and Cascading Style Sheets.” • “Using these core standards, we are building completely ‘static’ websites which can actually function locally in any current web browser, with no server at all, but which still preserve virtually all of the appearance and functionality of the original web applications they replace ” • Martin Holmes, Programmer/Consultant, University of Victoria Humanities Computing and Media Centre
  • 25. Best practices for content creators: Distill.pub • “A Distill article (at least in its ideal, aspirational form) isn’t just a paper. It’s an interactive medium that lets users – ‘readers’ is no longer sufficient – work directly with machine learning models.” • http://distill.pub/about/
  • 27. Interactivity and preservation • “Distill does an excellent job of publishing articles that use interactivity to provide high-quality explanations … without sacrificing preservability.” • David Rosenthal http://blog.dshr.org/2017/05/distill-is-this-what-journals- should.html
  • 28. Capturing the dynamic web: Webrecorder.io • Developed by Rhizome for preservation of interactive online art • Focus on dynamic web content
  • 29. Academic publications and CLOCKSS • Digital preservation collaboration between research libraries and publishers • Working to develop functionality to harvest dynamic content from publishers’ websites
  • 31. Significant issues • Costs • Dynamic content • Presents significant technical and policy issues for preservation • Scale • A technical and financial issue • Incentives • Public policy could address some of this • Proprietary information and DRM • Copyright legislation for preservation not likely forthcoming
  • 32. Collaboration is key • Libraries need to work together • Libraries and publishers and other content creators need to work together • Publishers can practice “preservation in place”