SlideShare a Scribd company logo
1 of 17
Download to read offline
Digital preservation
                       for ongoing access

Presentation for Council July 2008

            David Pearson
  Manager, Digital Preservation Section
Overview

1.   We have lots of “digital stuff” in our collections
     and it is growing
2.   We will lose access to it unless we take action
3.   We need to manage the process of keeping it
     accessible and usable
4.   Solutions have to be scalable, reliable and
     automated
1.   “Digital stuff”- many collections




             Pictures        Oral History




                        Manuscripts         Historical    Web sites
        Sheet music                         Newspapers

Maps

                         Ephemera                 Books               Serial
How does it grow?

1.   We collect it
     –   Physical carriers
     –   Online
         •   PANDORA web archive
         •   Australian web domain harvests

2.   We create it
     –   Oral history interviews
     –   Photographs
     –   Publications

3.   We convert it
     –   Digitise our collections
Web Archives

• Web sites are collected selectively
   – Individually for access via PANDORA, or
   – On a large scale via annual domain snapshots
• No control over content creation
• Lots of
   – File formats
   – Individual files (Pandora ≈ 51 million, Domain
     harvest ≈ 1.3 billion files)
   – Links
   – Software (browser, plug-ins, readers)
• Internet content changes over time
Digitisation

  • Around 135,000 items
    digitised

  • Newspaper project = 4
    million pages by 2010

  • Internally created so we
    can control
     – Standards
     – File formats (e.g. TIFF,
       JPEG, PDF )
     – Metadata
     – Workflows

  • Issues
     – Growing volume
Physical carriers

 •   Approx. 12,000 items – grows by
     1,000 a year

 Issues
 • No control over creation
 •   Time lag before acquisition
 •   Variety of carriers (fragile) and file
     formats
 •   Require various hardware, software,
     operating systems, drivers to
     access
 •   Labour intensive to process and
     transfer to safe storage (growing
     backlog)
Growth : digital collection storage

                           350



                           300



                           250
Storage size (terabytes)




                           200
                                                                                            Newspapers


                           150

                                                                     Australian Web
                           100                                       Harvests


                           50



                            0
                            Jan-03   Jul-03   Jan-04   Jul-04   Jan-05   Jul-05   Jan-06   Jul-06   Jan-07   Jul-07   Jan-08   Jul-08
Type of Digital Collections
                  Pandora
                               3%
      2008                     Maps
                                2%
                                    Sheet Music
                                        4%
                                      Manuscripts
                                         2%
                                          Pictures
Australian Web                               7%
   Harvest
      40%


                                           Oral History
                                               18%



                                      Other
                                       3%
                  Historical
                 Newspapers
                     21%
Growth: compared to books

                                               Comparison of books collection &
                                              digital collection "book equivalents"
                                6.00
"Book Equivalents" (millions)




                                5.00

                                4.00
                                                                                      Digital Collection
                                                                                      20 mb "book
                                3.00                                                  equivalents"
                                                                                      Books Collection
                                2.00

                                1.00

                                0.00
                                       2005       2006         2007       2008
                                                    Year end June
2. Act or risk losing it

• “Digital stuff” is dependent on technology at all
  stages
   – Creation/capture
   – Storage
   – Access
• Technology changes rapidly thus software,
  hardware, media, file formats, operating systems
  become obsolete
• Unless managed deterioration can occur rapidly
  e.g. data can be corrupted or lost in storage or
  transfer process
Computer Museum
3. Managing to keep it
• “Not managing it” is not an option
• We need to
   – Understand our “digital stuff” & associated risks
   – Provide safe storage & ensure integrity
   – Ensure access over time as technology
     changes
   – Develop & implement preservation workflows,
     skills, standards, & strategies for ongoing
     access
   – Enable content to be shared and used in
     different ways in the future
4. Solutions and implications
• Large scale automated processes
• Original research & time to deliver the solutions
• Reasonably long lead times
• Audit processes and quality control monitoring
  are critical
• Significant resources are required
Conclusions

• We are responsible for a lot of “digital stuff”
• If we simply collect and store it, it will become
  unusable in a relatively short time as
  technologies change
• Maintaining the ability to access it requires a
  lot of good management, planning, &
  dedicated resources
• We have to find and use solutions that can be
  applied automatically and reliably to billions of
  digital files

More Related Content

Similar to Digital presevation

Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaNational Library of Australia
 
Digital librarie
Digital librarieDigital librarie
Digital librarieDlgltsbm
 
The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian LibrariesThe Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian LibrariesMartin Kalfatovic
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital LibraryEd Fay
 
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office VictoriaJust digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office VictoriaNational Library of Australia
 
Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Vince Smith
 
Open Source Web Content Management Technologies for Libraries
Open Source Web Content Management Technologies for LibrariesOpen Source Web Content Management Technologies for Libraries
Open Source Web Content Management Technologies for LibrariesAnil Mishra
 
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaNational Library of Australia
 
SI Libraries HAC Update for CIMC
SI Libraries HAC Update for CIMCSI Libraries HAC Update for CIMC
SI Libraries HAC Update for CIMCSCPilsk
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Vince Smith
 
Nelson-Atkins Technology Roundtable
Nelson-Atkins Technology RoundtableNelson-Atkins Technology Roundtable
Nelson-Atkins Technology RoundtableRobert J. Stein
 
We've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryWe've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryElectronic Resources & Libraries
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
Managing your library's online presence
Managing your library's online presenceManaging your library's online presence
Managing your library's online presenceSuhui Ho
 

Similar to Digital presevation (20)

Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
 
Digital librarie
Digital librarieDigital librarie
Digital librarie
 
The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian LibrariesThe Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
The Wonderful Technicolor World Digital Goodness @ Smithsonian Libraries
 
Just Digitise It! - Daniel Wilksch
Just Digitise It! - Daniel WilkschJust Digitise It! - Daniel Wilksch
Just Digitise It! - Daniel Wilksch
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital Library
 
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office VictoriaJust digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
 
Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.
 
Open Source Web Content Management Technologies for Libraries
Open Source Web Content Management Technologies for LibrariesOpen Source Web Content Management Technologies for Libraries
Open Source Web Content Management Technologies for Libraries
 
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
 
Greatest Hits from Pew Internet’s Library Research
Greatest Hits from Pew Internet’s Library ResearchGreatest Hits from Pew Internet’s Library Research
Greatest Hits from Pew Internet’s Library Research
 
Cua2010
Cua2010Cua2010
Cua2010
 
SI Libraries HAC Update for CIMC
SI Libraries HAC Update for CIMCSI Libraries HAC Update for CIMC
SI Libraries HAC Update for CIMC
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
 
Nelson-Atkins Technology Roundtable
Nelson-Atkins Technology RoundtableNelson-Atkins Technology Roundtable
Nelson-Atkins Technology Roundtable
 
Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015Just Digitise It - Daniel Wilksch - 2015
Just Digitise It - Daniel Wilksch - 2015
 
We've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital LibraryWe've Got Issues: Issue Tracking and Workflow in the Digital Library
We've Got Issues: Issue Tracking and Workflow in the Digital Library
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
Aba adams
Aba adamsAba adams
Aba adams
 
Managing your library's online presence
Managing your library's online presenceManaging your library's online presence
Managing your library's online presence
 
Digitization
DigitizationDigitization
Digitization
 

More from National Library of Australia

Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...
Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...
Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...National Library of Australia
 
CHG recipient case study - Julia Mant of the National Institute of Dramatic Art
CHG recipient case study - Julia Mant of the National Institute of Dramatic ArtCHG recipient case study - Julia Mant of the National Institute of Dramatic Art
CHG recipient case study - Julia Mant of the National Institute of Dramatic ArtNational Library of Australia
 
Trove - a window to our community heritage - Hilary Berthon of Trove, NLA
Trove - a window to our community heritage - Hilary Berthon of Trove, NLATrove - a window to our community heritage - Hilary Berthon of Trove, NLA
Trove - a window to our community heritage - Hilary Berthon of Trove, NLANational Library of Australia
 
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...National Library of Australia
 
Assessing Significance and Significance 2.0: an introduction - Margaret Birt...
 Assessing Significance and Significance 2.0: an introduction - Margaret Birt... Assessing Significance and Significance 2.0: an introduction - Margaret Birt...
Assessing Significance and Significance 2.0: an introduction - Margaret Birt...National Library of Australia
 
Assessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyAssessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyNational Library of Australia
 
Publicity, Media & Completing your CHG project - 2017 - Fran D'Castro
Publicity, Media & Completing your CHG project - 2017 - Fran D'CastroPublicity, Media & Completing your CHG project - 2017 - Fran D'Castro
Publicity, Media & Completing your CHG project - 2017 - Fran D'CastroNational Library of Australia
 
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLA
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLATROVE - a window to our community heritage - Hilary Berthon of Trove, NLA
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLANational Library of Australia
 
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...National Library of Australia
 
CHG recipient case study - Donna Bailey of the Catholic Diocese of Sandhurst
CHG recipient case study - Donna Bailey of the Catholic Diocese of SandhurstCHG recipient case study - Donna Bailey of the Catholic Diocese of Sandhurst
CHG recipient case study - Donna Bailey of the Catholic Diocese of SandhurstNational Library of Australia
 
Assessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyAssessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyNational Library of Australia
 
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...Significance Assessment and Significance 2.0: an introduction - Veronica Bull...
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...National Library of Australia
 
Disaster preparedness - Kim Morris of Art and Archival Pty Ltd
Disaster preparedness - Kim Morris of Art and Archival Pty LtdDisaster preparedness - Kim Morris of Art and Archival Pty Ltd
Disaster preparedness - Kim Morris of Art and Archival Pty LtdNational Library of Australia
 
TROVE - Discovering community heritage - Cathie Oats (NLA)
TROVE - Discovering community heritage - Cathie Oats (NLA)TROVE - Discovering community heritage - Cathie Oats (NLA)
TROVE - Discovering community heritage - Cathie Oats (NLA)National Library of Australia
 

More from National Library of Australia (20)

Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...
Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...
Publicity and media - Anna Gressier & Sarah Kleven (Communications and Market...
 
CHG recipient case study - Julia Mant of the National Institute of Dramatic Art
CHG recipient case study - Julia Mant of the National Institute of Dramatic ArtCHG recipient case study - Julia Mant of the National Institute of Dramatic Art
CHG recipient case study - Julia Mant of the National Institute of Dramatic Art
 
Completing your CHG project - Fran D'Castro
Completing your CHG project - Fran D'CastroCompleting your CHG project - Fran D'Castro
Completing your CHG project - Fran D'Castro
 
Trove - a window to our community heritage - Hilary Berthon of Trove, NLA
Trove - a window to our community heritage - Hilary Berthon of Trove, NLATrove - a window to our community heritage - Hilary Berthon of Trove, NLA
Trove - a window to our community heritage - Hilary Berthon of Trove, NLA
 
National Archives of Australia
National Archives of AustraliaNational Archives of Australia
National Archives of Australia
 
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
 
Assessing Significance and Significance 2.0: an introduction - Margaret Birt...
 Assessing Significance and Significance 2.0: an introduction - Margaret Birt... Assessing Significance and Significance 2.0: an introduction - Margaret Birt...
Assessing Significance and Significance 2.0: an introduction - Margaret Birt...
 
Preservation Needs Assessment - Tamara Lavrencic
Preservation Needs Assessment  - Tamara LavrencicPreservation Needs Assessment  - Tamara Lavrencic
Preservation Needs Assessment - Tamara Lavrencic
 
Assessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyAssessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania Cleary
 
Publicity, Media & Completing your CHG project - 2017 - Fran D'Castro
Publicity, Media & Completing your CHG project - 2017 - Fran D'CastroPublicity, Media & Completing your CHG project - 2017 - Fran D'Castro
Publicity, Media & Completing your CHG project - 2017 - Fran D'Castro
 
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLA
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLATROVE - a window to our community heritage - Hilary Berthon of Trove, NLA
TROVE - a window to our community heritage - Hilary Berthon of Trove, NLA
 
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
Disaster Prevention, Preparedness, Response and Recovery for Collections - Ki...
 
CHG recipient case study - Donna Bailey of the Catholic Diocese of Sandhurst
CHG recipient case study - Donna Bailey of the Catholic Diocese of SandhurstCHG recipient case study - Donna Bailey of the Catholic Diocese of Sandhurst
CHG recipient case study - Donna Bailey of the Catholic Diocese of Sandhurst
 
Preservation Needs Assessment - Tamara Lavrencic
Preservation Needs Assessment - Tamara LavrencicPreservation Needs Assessment - Tamara Lavrencic
Preservation Needs Assessment - Tamara Lavrencic
 
Assessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania ClearyAssessing the significance of cultural heritage - Tania Cleary
Assessing the significance of cultural heritage - Tania Cleary
 
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...Significance Assessment and Significance 2.0: an introduction - Veronica Bull...
Significance Assessment and Significance 2.0: an introduction - Veronica Bull...
 
Preservation assessment - Tamara Lavrencic
Preservation assessment - Tamara LavrencicPreservation assessment - Tamara Lavrencic
Preservation assessment - Tamara Lavrencic
 
Disaster preparedness - Kim Morris of Art and Archival Pty Ltd
Disaster preparedness - Kim Morris of Art and Archival Pty LtdDisaster preparedness - Kim Morris of Art and Archival Pty Ltd
Disaster preparedness - Kim Morris of Art and Archival Pty Ltd
 
TROVE - Discovering community heritage - Cathie Oats (NLA)
TROVE - Discovering community heritage - Cathie Oats (NLA)TROVE - Discovering community heritage - Cathie Oats (NLA)
TROVE - Discovering community heritage - Cathie Oats (NLA)
 
Significance assessment process - Tania Cleary
Significance assessment process - Tania ClearySignificance assessment process - Tania Cleary
Significance assessment process - Tania Cleary
 

Recently uploaded

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Recently uploaded (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Digital presevation

  • 1. Digital preservation for ongoing access Presentation for Council July 2008 David Pearson Manager, Digital Preservation Section
  • 2. Overview 1. We have lots of “digital stuff” in our collections and it is growing 2. We will lose access to it unless we take action 3. We need to manage the process of keeping it accessible and usable 4. Solutions have to be scalable, reliable and automated
  • 3. 1. “Digital stuff”- many collections Pictures Oral History Manuscripts Historical Web sites Sheet music Newspapers Maps Ephemera Books Serial
  • 4. How does it grow? 1. We collect it – Physical carriers – Online • PANDORA web archive • Australian web domain harvests 2. We create it – Oral history interviews – Photographs – Publications 3. We convert it – Digitise our collections
  • 5. Web Archives • Web sites are collected selectively – Individually for access via PANDORA, or – On a large scale via annual domain snapshots • No control over content creation • Lots of – File formats – Individual files (Pandora ≈ 51 million, Domain harvest ≈ 1.3 billion files) – Links – Software (browser, plug-ins, readers) • Internet content changes over time
  • 6.
  • 7.
  • 8. Digitisation • Around 135,000 items digitised • Newspaper project = 4 million pages by 2010 • Internally created so we can control – Standards – File formats (e.g. TIFF, JPEG, PDF ) – Metadata – Workflows • Issues – Growing volume
  • 9. Physical carriers • Approx. 12,000 items – grows by 1,000 a year Issues • No control over creation • Time lag before acquisition • Variety of carriers (fragile) and file formats • Require various hardware, software, operating systems, drivers to access • Labour intensive to process and transfer to safe storage (growing backlog)
  • 10. Growth : digital collection storage 350 300 250 Storage size (terabytes) 200 Newspapers 150 Australian Web 100 Harvests 50 0 Jan-03 Jul-03 Jan-04 Jul-04 Jan-05 Jul-05 Jan-06 Jul-06 Jan-07 Jul-07 Jan-08 Jul-08
  • 11. Type of Digital Collections Pandora 3% 2008 Maps 2% Sheet Music 4% Manuscripts 2% Pictures Australian Web 7% Harvest 40% Oral History 18% Other 3% Historical Newspapers 21%
  • 12. Growth: compared to books Comparison of books collection & digital collection "book equivalents" 6.00 "Book Equivalents" (millions) 5.00 4.00 Digital Collection 20 mb "book 3.00 equivalents" Books Collection 2.00 1.00 0.00 2005 2006 2007 2008 Year end June
  • 13. 2. Act or risk losing it • “Digital stuff” is dependent on technology at all stages – Creation/capture – Storage – Access • Technology changes rapidly thus software, hardware, media, file formats, operating systems become obsolete • Unless managed deterioration can occur rapidly e.g. data can be corrupted or lost in storage or transfer process
  • 15. 3. Managing to keep it • “Not managing it” is not an option • We need to – Understand our “digital stuff” & associated risks – Provide safe storage & ensure integrity – Ensure access over time as technology changes – Develop & implement preservation workflows, skills, standards, & strategies for ongoing access – Enable content to be shared and used in different ways in the future
  • 16. 4. Solutions and implications • Large scale automated processes • Original research & time to deliver the solutions • Reasonably long lead times • Audit processes and quality control monitoring are critical • Significant resources are required
  • 17. Conclusions • We are responsible for a lot of “digital stuff” • If we simply collect and store it, it will become unusable in a relatively short time as technologies change • Maintaining the ability to access it requires a lot of good management, planning, & dedicated resources • We have to find and use solutions that can be applied automatically and reliably to billions of digital files