The Digital Archive: Challenges & Best Practice
Axiell Conference 05/05/2017
Why digitize?
• Long-Term Digital Preservation
• Access
So why are some people afraid of digitization?
• Fear of change
• Fear of irrelevancy of original material
• Fear of lack of need
The OPPOSITE holds true:
Digitization:
INCREASES awareness
INCREASES need
INCREASES relevancy
Who moved my cheese?
What we do…
• We’re a national library…
• Many types of material
• Many sizes of material
• Many different projects
• Thousands of scans a day
The 6 Considerations…
• What is your objective?
• What is the material scope?
• Condition of the source material
• How will the material be used?
• Who is your audience?
• Do you require LTDP?
What exactly are we digitizing?
What exactly are we preserving?
DATA
To accomplish this…
• Standards
• Methodology
• Workflow
PRINT MATERIAL
Source Material to Digital File / Microfilm to Digital File
Standards for Digital Image Capture
Material Type (in column order): Newspapers, Manuscripts, Books / Documents, Jacket, Maps, Photographs
Master / Preservation Images: Format TIFF, Compression None in every column
Thumbnail Images: Format JPEG, Size 150x150 or as needed, in every column
Presentation / Access Images: Size not given

Col  Target File  Master DPI  Bit Depth  Color Mode  Production Format  Production DPI  Access Format
1    Bi-Tonal     300/400     1-bit      -           TIFF/JPEG2K/JPEG   300             JPEG/PDF
2    Greyscale    300/400     8-bit      -           TIFF/JPEG2K/JPEG   300             JPEG/PDF
3    Color        400-600     24-bit     RGB         TIFF/JPEG2K        400             JPEG/PDF
4    Greyscale    300         8-bit      -           TIFF/JPEG2K        300             JPEG
5    Color        400-600     24-bit     RGB         TIFF/JPEG2K        400-600         JPEG
6    Bi-Tonal     300         1-bit      -           TIFF/JPEG2K/JPEG   300             JPEG/PDF
7    Greyscale    300         8-bit      -           TIFF/JPEG2K/JPEG   300             JPEG/PDF
8    Color        400         24-bit     RGB         TIFF/JPEG2K        400             JPEG/PDF
9    Color        300         24-bit     RGB         TIFF/JPEG2K        300             JPEG/PDF
10   Greyscale    400         8-bit      -           TIFF/JPEG2K        400             JPEG
11   Color        600         24-bit     RGB         TIFF/JPEG2K        400             JPEG
12   Color        600         24-bit     RGB         TIFF/JPEG2K        600             JPEG
13   Color        600         48-bit     RGB         TIFF/JPEG2K        600             JPEG

Criteria (as listed): standard; Tabloid, magazine; standard; standard; If w images; standard; standard; When high quality required
Production Compression (as listed): LZW/ CCITT-4 LZW/ LZW/ LZW/ CCITT-4 LZW/
Production Image Processing (as listed): De-skew, De-speckle, Crop to edge; De-Skew, De-Speckle, Crop; De-Skew, De-Speckle, Crop; De-Skew, De-Speckle, Crop; De-Skew, Crop
What we do…
• Preservation Master: TIFF, single page, no compression, 300+ dpi
• Derivative Master: TIFF, single page, no compression, 300+ dpi, cropped
• Derivative Access: JPEG, as required
• DAMS
• LTDP
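Because the master TIFFs are stored without compression, their size follows almost directly from resolution and bit depth: pixels across × pixels down × bits per pixel / 8. The Python sketch below illustrates that arithmetic. The A4 page size (8.27 × 11.69 in) and the function name are assumptions made for illustration, not figures from the slides, and TIFF header overhead is ignored.

```python
# Rough size of an uncompressed scan; ignores TIFF header and tag overhead.
A4_INCHES = (8.27, 11.69)  # assumed page size, not taken from the slides


def uncompressed_bytes(dpi: int, bit_depth: int, page_inches=A4_INCHES) -> int:
    """Pixels across x pixels down x bits per pixel, converted to bytes."""
    width_px = round(page_inches[0] * dpi)
    height_px = round(page_inches[1] * dpi)
    return width_px * height_px * bit_depth // 8


# A few (target file, DPI, bit depth) combinations of the kind used for masters.
for label, dpi, bits in [
    ("Bi-Tonal", 300, 1),
    ("Greyscale", 300, 8),
    ("Color", 400, 24),
    ("Color", 600, 48),
]:
    size_mb = uncompressed_bytes(dpi, bits) / 1_000_000
    print(f"{label:<10} {dpi} dpi {bits:>2}-bit: {size_mb:8.1f} MB")
```

Doubling the resolution quadruples the pixel count, so the choice between 300 and 600 dpi matters more for storage than the choice between 8-bit and 24-bit.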
Digitization… What does it mean?
Analog
• = 1 book (250 pages)
• = generally 1 metadata (catalog) record
• = 2 items
Digitization… What does it mean?
Digital
• = 250 preservation master files
• = (250 derivative master files)
• = 250 JPEG access files
• = 1 metadata (catalog) record
• = 1 DAMS record
• = at least 502 items (752 with derivative masters)
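The item count above is simple addition, and a short Python sketch makes the rule explicit for any page count. The function name and parameters are hypothetical. It assumes one catalog record and one DAMS record per volume, as in the 250-page example.

```python
def items_to_manage(pages: int, derivative_masters: bool = True) -> int:
    """Count the files and records produced by digitizing one volume."""
    preservation_masters = pages
    derivative_master_files = pages if derivative_masters else 0
    access_files = pages
    catalog_records = 1  # one metadata (catalog) record per volume
    dams_records = 1     # one DAMS record per volume
    return (
        preservation_masters
        + derivative_master_files
        + access_files
        + catalog_records
        + dams_records
    )


print(items_to_manage(250, derivative_masters=False))  # 502
print(items_to_manage(250))                            # 752
```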
Digitization… What does it mean?
=DAMS!
Digitization… What does it mean?
Storage - Costs of 1 GB
1980: $438,000
1985: $105,000
1990: $11,000
2000: $11
2005: $1
2012: $0.10
2014: $0.03
2016: FREE – Google 15 GB
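To put the price drop in perspective, the Python sketch below applies the slide's per-GB figures (1980: $438,000 down to 2014: $0.03) to a collection of 1,000 GB. The collection size is an arbitrary assumption chosen for illustration, and the function name is hypothetical.

```python
# Cost of 1 GB in US dollars by year, as quoted on the slide.
COST_PER_GB = {
    1980: 438_000,
    1985: 105_000,
    1990: 11_000,
    2000: 11,
    2005: 1,
    2012: 0.10,
    2014: 0.03,
}


def storage_cost(gigabytes: float, year: int) -> float:
    """Cost of storing `gigabytes` at the per-GB price quoted for `year`."""
    return gigabytes * COST_PER_GB[year]


COLLECTION_GB = 1_000  # assumed collection size, for illustration only
for year in sorted(COST_PER_GB):
    print(f"{year}: ${storage_cost(COLLECTION_GB, year):,.2f}")
```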
Project # 1: Digitize 100,000 pages, 6 months, $80,000
Project # 2: Digitize 100,000 pages, 14 months, $120,000
What’s in a name?
001.Tiff
002.Tiff
003.Tiff
004.Tiff
005.Tiff
GF_0015-01_001_0001.Tiff
GF_0015-10_001_0002.Tiff
GF_0015-10_002_0001.Tiff
GF_0015-10_025_0543.Tiff
GF_0034-00_001_0002.Tiff
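One practical difference between the two naming styles is that a structured name can be parsed back into its parts by software, whereas 001.Tiff carries only a sequence number. The Python sketch below illustrates this. The slides do not say what each segment of the GF_… names means, so the field names (prefix, item, part, unit, sequence) are guesses made purely for illustration.

```python
import re
from typing import NamedTuple, Optional


class ScanName(NamedTuple):
    # Field names are hypothetical; the slides do not define the segments.
    prefix: str
    item: int
    part: int
    unit: int
    sequence: int


# Matches names shaped like GF_0015-10_025_0543.Tiff
PATTERN = re.compile(r"^([A-Za-z]+)_(\d+)-(\d+)_(\d+)_(\d+)\.tiff?$", re.IGNORECASE)


def parse_scan_name(filename: str) -> Optional[ScanName]:
    """Split a structured file name into its parts, or return None."""
    match = PATTERN.match(filename)
    if match is None:
        return None
    prefix, *numbers = match.groups()
    return ScanName(prefix, *(int(n) for n in numbers))


print(parse_scan_name("GF_0015-10_025_0543.Tiff"))
print(parse_scan_name("001.Tiff"))  # None: nothing to recover beyond a number
```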
What’s in a name?
GF_0015-01_001_0001.Tiff
GF_0015-10_001_0002.Tiff
001.Tiff
002.Tiff
189.Tiff
Project # 1: Digitize 100,000 pages, 6 months, $80,000
Project # 2: Digitize 100,000 pages, 14 months, $120,000
GF_0015-01_001_0001.Tiff
GF_0015-01_001_0002.Tiff
GF_165-02_005_0189.Tiff
What exactly IS digital preservation?
Long-term Digital Preservation (LTDP)
Backup LTDP
May 05 2017
2035
May 04 2017
Digital Preservation = THREATS:
- Technological dependence
- Proprietary software dependence
- Technological obsolescence (= questionable longevity)
  - hardware
  - software
  - media
  - formats
“Long-term” Digital Preservation = an Oxymoron?
Approach to LTDP
- Adopt internationally accepted strategy
- Framework (OAIS, PREMIS)
- Methodology
- Best practice and Standards (of digitization, metadata)
- CONTINUITY
- Assess Threats
- Implement an LTDP System (Risk assessment and consequent actions)
LTDP System
- NLI implemented the Ex Libris Rosetta system
- Deep storage of digital material
- Based on OAIS / PREMIS
- Periodic migration of file formats – CONTINUITY
  - On-demand
  - Early
  - Late
- Ensures bit-level integrity
- Constant risk assessment, timely risk-resolving actions
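Bit-level integrity is commonly verified with a fixity check: record a checksum when a file is ingested, then recompute and compare it later. The Python sketch below shows that general technique using the standard library. It is not a description of how Rosetta implements it, and the function names are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large master files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path: Path, recorded_checksum: str) -> bool:
    """True if the file still matches the checksum recorded at ingest."""
    return sha256_of(path) == recorded_checksum.lower()
```

A single flipped bit changes the digest, so a mismatch signals that the stored copy should be replaced from another replica.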
Food for Thought:
Define “Long-Term”?
At some point down the road… will someone be “digitizing” it all over again?!
Answer: YES!!!!!! But WHY???
Chezkie Kasnett
+972-54-307-5321
chezkiek@nli.org.il

Towards the Digital Archive – Challenges and Best Practice: A Look at Digitization Practices, Standards, and Methodology at the National Library of Israel

  • 1. The Digital Archive Challenges & Best Practice Axiell Conference 05/05/2017
  • 3. So why are some people afraid of digitization? • Fear of change • Fear of irrelevancy of original material • Fear of lack of need The OPPOSITE holds true: Digitization: INCREASES awareness INCREASES need INCREASES relevancy Who moved my cheese?
  • 4. What we do….. • We’re a national library…. • Many types of material • Many sizes of material • Many different projects • Thousands of scans a day
  • 5. The 6 Considerations ……. • What is your objective? • What is the material scope? • Condition of the source material • How will the material be used? • Who is your audience? • Do you require LTDP?
  • 6. What exactly are we digitizing?
  • 7. What exactly are we preserving? DATA
  • 8. To accomplish this…. • Standards • Methodology • Workflow
    PRINT MATERIAL: Source Material to Digital File / Microfilm to Digital File
    Standards for Digital Image Capture
    Material Type: Newspapers | Manuscripts | Books / Documents | Jacket | Maps | Photographs
    Master / Preservation Images
      Target File: Bi-Tonal | Greyscale | Color | Greyscale | Color | Bi-Tonal | Greyscale | Color | Color | Greyscale | Color | Color | Color
      Format: TIFF (all 13 columns)
      Resolution/DPI: 300/400 | 300/400 | 400-600 | 300 | 400-600 | 300 | 300 | 400 | 300 | 400 | 600 | 600 | 600
      Bit Depth: 1-bit | 8-bit | 24-bit | 8-bit | 24-bit | 1-bit | 8-bit | 24-bit | 24-bit | 8-bit | 24-bit | 24-bit | 48-bit
      Compression: None (all 13 columns)
      Color Mode: - | - | RGB | - | RGB | - | - | RGB | RGB | - | RGB | RGB | RGB
      Criteria: standard | Tabloid, magazine | standard | standard | If w images | standard | standard | When high quality required
    Secondary / Production Images
      Format: TIFF/JPEG2K/JPEG | TIFF/JPEG2K/JPEG | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K/JPEG | TIFF/JPEG2K/JPEG | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K | TIFF/JPEG2K
      Resolution: 300 DPI | 300 DPI | 400 DPI | 300 DPI | 400-600 DPI | 300 DPI | 300 DPI | 400 DPI | 300 DPI | 400 DPI | 400 DPI | 600 DPI | 600 DPI
      Compression: LZW/CCITT-4 | LZW/ | LZW/ | LZW/CCITT-4 | LZW/
      Image Processing: De-skew, De-speckle, Crop to edge | De-skew, De-speckle, Crop | De-skew, De-speckle, Crop | De-skew, De-speckle, Crop | De-skew, Crop
    Presentation / Access Images
      Format: JPEG/PDF | JPEG/PDF | JPEG/PDF | JPEG | JPEG | JPEG/PDF | JPEG/PDF | JPEG/PDF | JPEG/PDF | JPEG | JPEG | JPEG | JPEG
      Size:
    Thumbnail Images
      Format: JPEG (all 13 columns)
      Size: 150x150 or as needed (all 13 columns)
  • 9. What we do….. Preservation Master: TIFF, single page, no compression, 300+ dpi. Derivative Master: TIFF, single page, no compression, 300+ dpi, CROPPED. Derivative Access: JPEG, as required. Systems: DAMS, LTDP
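
Because the masters are stored as uncompressed TIFF at 300 dpi and above, their size follows almost directly from page dimensions, resolution, and bit depth. The sketch below is only a rough estimate under stated assumptions: the A4 page size and the function name `uncompressed_bytes` are illustrative and do not come from the slides, and TIFF header and row-padding overhead are ignored.

```python
def uncompressed_bytes(width_in, height_in, dpi, bit_depth):
    """Approximate raw pixel-data size of an uncompressed scan, in bytes."""
    width_px = round(width_in * dpi)    # pixels across
    height_px = round(height_in * dpi)  # pixels down
    total_bits = width_px * height_px * bit_depth
    return (total_bits + 7) // 8        # round up to whole bytes


# Assumption: an A4 page (8.27 x 11.69 inches); not stated in the slides.
A4 = (8.27, 11.69)

# Resolution / bit-depth pairs taken from the capture standards table.
for label, dpi, bits in [("bi-tonal", 300, 1), ("greyscale", 300, 8), ("color", 400, 24)]:
    size = uncompressed_bytes(*A4, dpi, bits)
    print(f"{label:9s} {dpi} dpi, {bits:2d}-bit: {size / 1e6:.1f} MB")
```

Multiplying the per-page figure by thousands of scans a day gives a first-order feel for why storage planning belongs in the workflow from the start.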
  • 10. Digitization… What does it mean? Analog • = 1 book (250 pages) • = generally 1 metadata (catalog) record • = 2 items
  • 11. Digitization… What does it mean? Digital • = 250 preservation master files • = (250 derivative master files) • = 250 JPEG access files • = 1 metadata (catalog) record • = 1 DAMS record • = at least 502 items (752 with derivative master files)
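
The arithmetic behind the analog versus digital item counts can be written out as a small sketch. The function name and its parameters are illustrative assumptions; the counts themselves come from the two slides above.

```python
def digital_item_count(pages, with_derivative_masters=False):
    """Count the items a digitized book produces, per the slide's breakdown."""
    preservation_masters = pages
    derivative_masters = pages if with_derivative_masters else 0
    access_files = pages
    metadata_records = 1  # one catalog record
    dams_records = 1      # one DAMS record
    return (preservation_masters + derivative_masters + access_files
            + metadata_records + dams_records)


analog_items = 1 + 1  # one physical book + one catalog record
print(analog_items)                     # 2
print(digital_item_count(250))          # 502
print(digital_item_count(250, True))    # 752
```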
  • 12. Digitization… What does it mean? =DAMS!
  • 13. Digitization… What does it mean? Storage - Costs of 1 GB • 1980: $438,000 • 1985: $105,000 • 1990: $11,000 • 2000: $11 • 2005: $1 • 2012: $0.1 • 2014: $0.03 • 2016: FREE – Google 15 GB
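
To make the price drop concrete, the sketch below reads the slide's figures as year and price-per-GB pairs and applies them to a collection. The 1,000 GB collection size and the names `COST_PER_GB_USD` and `storage_cost` are assumptions for illustration only.

```python
# Cost of 1 GB in USD, as read from the slide.
COST_PER_GB_USD = {
    1980: 438_000,
    1985: 105_000,
    1990: 11_000,
    2000: 11,
    2005: 1,
    2012: 0.10,
    2014: 0.03,
}


def storage_cost(gigabytes, year):
    """Cost in USD of storing `gigabytes` at the given year's price per GB."""
    return gigabytes * COST_PER_GB_USD[year]


# Assumption: a hypothetical 1,000 GB collection.
for year in sorted(COST_PER_GB_USD):
    print(f"{year}: ${storage_cost(1000, year):,.2f}")
```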
  • 14. Project #1: Digitize 100,000 pages, 6 months, $80,000. Project #2: Digitize 100,000 pages, 14 months, $120,000. What’s in a name?
  • 17. File names: 001.Tiff, 002.Tiff, 189.Tiff. Project #1: Digitize 100,000 pages, 6 months, $80,000. Project #2: Digitize 100,000 pages, 14 months, $120,000. File names: GF_0015-01_001_0001.Tiff, GF_0015-01_001_0002.Tiff, GF_165-02_005_0189.Tiff
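
The longer file names on this slide appear to follow a fixed pattern, which makes them machine-checkable. The sketch below validates and splits such names with a regular expression. The slide does not say what each segment means, so the group names `prefix`, `group`, `part`, `unit`, and `page` are hypothetical labels, and the pattern is inferred only from the three examples shown.

```python
import re

# Pattern inferred from names such as GF_0015-01_001_0001.Tiff.
# The group names are hypothetical; the slide does not define the segments.
NAME_PATTERN = re.compile(
    r"^(?P<prefix>[A-Z]+)_(?P<group>\d+)-(?P<part>\d+)"
    r"_(?P<unit>\d+)_(?P<page>\d+)\.Tiff$"
)


def parse_name(filename):
    """Return the name's segments as a dict, or None if it does not match."""
    match = NAME_PATTERN.match(filename)
    return match.groupdict() if match else None


for name in ["GF_0015-01_001_0001.Tiff", "GF_165-02_005_0189.Tiff", "001.Tiff"]:
    print(name, "->", parse_name(name))
```

A bare sequence number such as 001.Tiff fails the check, since it carries no context about where the page belongs.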
  • 18. What exactly IS digital preservation? Long-term Digital Preservation (LTDP)
  • 19. Backup vs. LTDP: May 04 2017 | May 05 2017 | 2035
  • 20. Digital Preservation: - Technological dependence - Proprietary software dependence - Technological obsolescence (= questionable longevity) - hardware - software - media - formats “Long-term” Digital Preservation = an Oxymoron? = THREATS
  • 21. Approach to LTDP - Adopt internationally accepted strategy - Framework (OAIS, PREMIS) - Methodology - Best practice and Standards (of digitization, metadata) - CONTINUITY - Assess Threats - Implement an LTDP System (Risk assessment and consequent actions)
  • 22. LTDP System - NLI implemented the ExLibris Rosetta System - Deep storage of digital material - Based on OAIS / PREMIS - Periodic migration of file formats – CONTINUITY - On-demand - Early - Late - Ensures bit level integrity - Constant risk assessment, timely risk resolving actions
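
Bit-level integrity is commonly verified by recording a checksum when a file is ingested and recomputing it later. The sketch below shows that general idea with SHA-256 from the Python standard library. It is not a description of how Rosetta works internally, and the function names `sha256_of` and `is_intact` are assumptions for illustration.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large TIFF masters need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, recorded_digest):
    """True if the file's current digest matches the one recorded at ingest."""
    return sha256_of(path) == recorded_digest
```

In use, the digest returned by `sha256_of` would be stored alongside the file's preservation metadata at ingest, and `is_intact` would be run periodically; a mismatch signals that the stored bits have changed.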
  • 23. Food for Thought: Define “Long-Term”? At some point down the road… will someone be “digitizing” it all over again?! Answer: YES!!!!!! But WHY???

Editor's Notes

  1. Towards the Digital Archive – Challenges and Best Practice   A look at digitization practices, standards, and methodology at the National Library of Israel
  2. Material types. PREMIS: PREservation Metadata: Implementation Strategies. OAIS: Open Archival Information System.
  14. Once text is data, the text becomes the object. (Problematic, because of OCR inaccuracy?) Digital projects create a new culture of data. The data can be translated into other forms for different uses and different purposes. Technology is not the point. Today it is digitization. Tomorrow it will be something else. Technology is only a point along the way.
  18. LTDP is NOT an oxymoron when implemented properly. A successful digital preservation strategy must account for and mitigate the impact of various threats to the accessibility and usability of digital materials over time. So although adopting the above does not necessarily GUARANTEE that our material today will be available in the future, we can be fairly certain that, no matter where the future takes us, the material we have today will remain accessible to future generations.
  19. OPEN ARCHIVAL INFORMATION SYSTEM (OAIS): A REFERENCE MODEL FOR LTDP. PREservation Metadata: Implementation Strategies (PREMIS). In summary: digital content, with all of its challenges and disadvantages, still outweighs analog preservation in the advantages it offers.